BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011045
(495 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 317/460 (68%), Positives = 380/460 (82%), Gaps = 1/460 (0%)
Query: 37 VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHN 96
+LDV+S+LQQ +ILSF+ +T + + T + NSS SFSL LH RE ++K H
Sbjct: 39 ILDVASSLQQAHNILSFDLQTQKSSTHTTITTSTPSFSNSSLSFSLELHPRETIYKIHHK 98
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
DY+SLVLSRL RD+ R N+L +LQLA+ ++ + +LKP E +I PED STPV SG SQGS
Sbjct: 99 DYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLSTPVTSGTSQGS 158
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEYF+R+GVG P RQF MVLDTGSDINWLQC+PCT+CYQQ+DPIFDP SS+Y+P+ C +
Sbjct: 159 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQS 218
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
QC SL++S+CR+ +CLYQV YGDGS+T GD TE+VSFGNSGSVK +ALGCGHDNEGLF
Sbjct: 219 QQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGCGHDNEGLF 278
Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR-GGDAVTAPLIRNK 335
VG+AGLLGLGGG LSLT Q+KATS +YCLV+RDS S L+FNSA+ G D+VTAPL++N+
Sbjct: 279 VGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSVTAPLMKNR 338
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
K+DTFYYVGL+G SVGGQ V IP S F +DE+G+GGIIVDCGTAITRLQTQAYN LRD+F
Sbjct: 339 KIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAF 398
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
VR+ NLK TS VALFDTCYD SG SVRVPTVS HF GK+ +LPA NYLIPVDSAGT+
Sbjct: 399 VRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTY 458
Query: 456 CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
CFAFAPT+S+LSIIGNVQQQGTRV+FDLANNR+GF+PNKC
Sbjct: 459 CFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 325/500 (65%), Positives = 394/500 (78%), Gaps = 11/500 (2%)
Query: 6 PFVLFTITTILFSFCLFTS-ASSRGLSET-ATTVLDVSSALQQTEHILSFEPETLEPFAE 63
P L ++ + S CL T+ ASSR LS + TTVLDV S+LQQT+HILS +P A
Sbjct: 4 PRFLSLLSVVTLSICLTTTDASSRSLSTSHKTTVLDVVSSLQQTQHILSVDPTRSSLTAR 63
Query: 64 ESETAAESFP--LNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQ 121
E ES P LNSSS SL LHSR+ L ++H DY+SLVLSRLERDS+RV + K++
Sbjct: 64 IPEFKPESDPVFLNSSSPLSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIR 123
Query: 122 LAIYNVDRHELKPA---EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
A+ +DR +LKP E + PED +TPVVSG SQGSGEYFSRIGVGTP ++ +VLDT
Sbjct: 124 FAVEGIDRSDLKPVDIDETRFQPEDLTTPVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDT 183
Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAY 238
GSD+NW+QC PC+ECYQQSDPIFDP +SS++ L C+ P+C SLDVSACR+N+CLYQV+Y
Sbjct: 184 GSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCASLDVSACRSNKCLYQVSY 243
Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
GDGSFTVG+ T+TV+FG SG V +ALGCGHDNEGLF G+AGLLGLGGG LS+T QIKA
Sbjct: 244 GDGSFTVGNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKA 303
Query: 299 TSLAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
S +YCLVDRDS S L+FNS + GDA TAPL+RN K+DTFYYVGL+GFSVGGQ V
Sbjct: 304 KSFSYCLVDRDSAKSSSLDFNSVQIGAGDA-TAPLLRNSKMDTFYYVGLSGFSVGGQQVS 362
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCY 415
IP SLFE+D +G GG+I+DCGTA+TRLQTQAYNSLRD+FV+L + K TS ++LFDTCY
Sbjct: 363 IPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCY 422
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
DFS L +V+VPTV+ HF GK+L+LPAKNYLIP+D AGTFCFAFAPTSS+LSIIGNVQQQ
Sbjct: 423 DFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQ 482
Query: 476 GTRVSFDLANNRVGFTPNKC 495
GTR+++DLANN +G + NKC
Sbjct: 483 GTRITYDLANNLIGLSANKC 502
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 316/498 (63%), Positives = 393/498 (78%), Gaps = 9/498 (1%)
Query: 6 PFVLFTITTILFS-FCLFTSASSRGLS-ETATTVLDVSSALQQTEHILSFEPETLEPFAE 63
P L +TT+ S F T ASSR LS T TTVLDV S+LQQT+ ILS +P A
Sbjct: 4 PRFLSLLTTVTLSLFLTATDASSRSLSTSTKTTVLDVVSSLQQTQTILSLDPTRSSLTAT 63
Query: 64 ESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA 123
+ E+ ++ NSSS SL LHSR+ L ++H DY+SLVLSRLERDS+RV + K++ A
Sbjct: 64 KPESISDPVFFNSSSPLSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIRFA 123
Query: 124 IYNVDRHELKPA---EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
+ +DR +LKP + + PE +TPVVSG SQGSGEYFSRIGVGTP ++ +VLDTGS
Sbjct: 124 VEGIDRSDLKPVNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGS 183
Query: 181 DINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGD 240
D+NW+QC PC++CYQQSDP+F+P +SS+Y L C+APQC L+ SACR+N+CLYQV+YGD
Sbjct: 184 DVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGD 243
Query: 241 GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS 300
GSFTVG+L T+TV+FGNSG + +ALGCGHDNEGLF G+AGLLGLGGG LS+T Q+KATS
Sbjct: 244 GSFTVGELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATS 303
Query: 301 LAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP 358
+YCLVDRDS S L+FNS + GDA TAPL+RN+K+DTFYYVGL+GFSVGGQ V +P
Sbjct: 304 FSYCLVDRDSGKSSSLDFNSVQLGSGDA-TAPLLRNQKIDTFYYVGLSGFSVGGQKVMMP 362
Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDF 417
++F++D +G GG+I+DCGTA+TRLQTQAYNSLRD+F++L NLK TS ++LFDTCYDF
Sbjct: 363 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDF 422
Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGT 477
S L SV+VPTV+ HF GK+LDLPAKNYLIPVD GTFCFAFAPTSS+LSIIGNVQQQGT
Sbjct: 423 SSLSSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGT 482
Query: 478 RVSFDLANNRVGFTPNKC 495
R+++DLAN +G + NKC
Sbjct: 483 RITYDLANKIIGLSGNKC 500
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 318/491 (64%), Positives = 390/491 (79%), Gaps = 3/491 (0%)
Query: 5 KPFVLFTITTILFSFCLFTSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEE 64
KPF F + TI+FS L S T TT+LDVSS+LQQ +ILSF P+ +++
Sbjct: 8 KPF--FFLFTIIFSLTLALSRDLLPPHATKTTILDVSSSLQQALNILSFNPQQQTALSQQ 65
Query: 65 SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
+ + P +SSFSL L+ R+ +HKT H DY++LVLSRL RDS+RV + T+LQL +
Sbjct: 66 QQQTI-AIPSFLNSSFSLSLNPRDTIHKTPHKDYKALVLSRLHRDSSRVQAITTRLQLIL 124
Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINW 184
V + +LKP + +I P+D STPV SG SQGSGEYF+R+GVG P + + MVLDTGSDINW
Sbjct: 125 NGVSKSDLKPLQTEIQPQDLSTPVSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINW 184
Query: 185 LQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFT 244
+QC+PC++CYQQSDPIF P SSSYSPL C + QC SL +S+CR +C YQV YGDGSFT
Sbjct: 185 IQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQCNSLQMSSCRNGQCRYQVNYGDGSFT 244
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
GD VTET+SFG SG+V IALGCGHDNEGLFVG+AGLLGLGGG LSLT Q+KATS +YC
Sbjct: 245 FGDFVTETMSFGGSGTVNSIALGCGHDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYC 304
Query: 305 LVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
LV+RDS AS L+FNSA GD+V APL+++ K+DTFYYVGL+G SVGG+ ++IP +F++
Sbjct: 305 LVNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKL 364
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
D++GDGG+IVDCGTAITRLQ++AYNSLRDSFV ++ +L+ TSGVALFDTCYD SG SV+
Sbjct: 365 DDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVK 424
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
VPTVS HF GK+ DLPA NYLIPVDSAGT+CFAFAPT+S+LSIIGNVQQQGTRVSFDLA
Sbjct: 425 VPTVSFHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLA 484
Query: 485 NNRVGFTPNKC 495
NNRVGF+ NKC
Sbjct: 485 NNRVGFSTNKC 495
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 306/484 (63%), Positives = 385/484 (79%), Gaps = 8/484 (1%)
Query: 19 FCLFTSASSRGLSET-ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSS 77
F T ASSR LS T VLDV S+LQQT+ ILS +P + E+ ++ NSS
Sbjct: 18 FLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPVFFNSS 77
Query: 78 SSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA-- 135
S SL LHSR+ ++H DY+SL LSRLERDS+RV ++ K++ A+ VDR +LKP
Sbjct: 78 SPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYN 137
Query: 136 -EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
+ + ED +TPVVSGASQGSGEYFSRIGVGTP ++ +VLDTGSD+NW+QC PC +CY
Sbjct: 138 EDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCY 197
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
QQSDP+F+P +SS+Y L C+APQC L+ SACR+N+CLYQV+YGDGSFTVG+L T+TV+
Sbjct: 198 QQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVT 257
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
FGNSG + +ALGCGHDNEGLF G+AGLLGLGGG+LS+T Q+KATS +YCLVDRDS S
Sbjct: 258 FGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSS 317
Query: 315 VLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
L+FNS + GGDA TAPL+RNKK+DTFYYVGL+GFSVGG+ V +P ++F++D +G GG+
Sbjct: 318 SLDFNSVQLGGGDA-TAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLH 431
I+DCGTA+TRLQTQAYNSLRD+F++L NLK +S ++LFDTCYDFS L +V+VPTV+ H
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFH 436
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
F GK+LDLPAKNYLIPVD +GTFCFAFAPTSS+LSIIGNVQQQGTR+++DL+ N +G +
Sbjct: 437 FTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLS 496
Query: 492 PNKC 495
NKC
Sbjct: 497 GNKC 500
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 614 bits (1583), Expect = e-173, Method: Compositional matrix adjust.
Identities = 306/484 (63%), Positives = 384/484 (79%), Gaps = 8/484 (1%)
Query: 19 FCLFTSASSRGLSET-ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSS 77
F T ASSR LS T VLDV S+LQQT+ ILS +P + E+ ++ NSS
Sbjct: 18 FLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPVFFNSS 77
Query: 78 SSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA-- 135
S SL LHSR+ ++H DY+SL LSRLERDS+RV ++ K++ A+ VDR +LKP
Sbjct: 78 SPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYN 137
Query: 136 -EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
+ + ED +TPVVSGASQGSGEYFSRIGVGTP + +VLDTGSD+NW+QC PC +CY
Sbjct: 138 EDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCY 197
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
QQSDP+F+P +SS+Y L C+APQC L+ SACR+N+CLYQV+YGDGSFTVG+L T+TV+
Sbjct: 198 QQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVT 257
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
FGNSG + +ALGCGHDNEGLF G+AGLLGLGGG+LS+T Q+KATS +YCLVDRDS S
Sbjct: 258 FGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSS 317
Query: 315 VLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
L+FNS + GGDA TAPL+RNKK+DTFYYVGL+GFSVGG+ V +P ++F++D +G GG+
Sbjct: 318 SLDFNSVQLGGGDA-TAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLH 431
I+DCGTA+TRLQTQAYNSLRD+F++L NLK +S ++LFDTCYDFS L +V+VPTV+ H
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFH 436
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
F GK+LDLPAKNYLIPVD +GTFCFAFAPTSS+LSIIGNVQQQGTR+++DL+ N +G +
Sbjct: 437 FTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLS 496
Query: 492 PNKC 495
NKC
Sbjct: 497 GNKC 500
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 318/462 (68%), Positives = 377/462 (81%), Gaps = 4/462 (0%)
Query: 35 TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
T VLDVSS+L Q ILSF P+ LE + SET + P +SSSSFSL LH RE L +
Sbjct: 34 TNVLDVSSSLHQAHQILSFNPQLLE--EQSSETETPTSPSSSSSSFSLQLHPRETLLNEQ 91
Query: 95 HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL-PEDFSTPVVSGAS 153
H +Y++LVLSRL RD+ARVN+L TKLQLA+ +++R +L P E ++L PED STPV SG +
Sbjct: 92 HPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPEDLSTPVSSGTA 151
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
QGSGEYFSR+GVG P + F MVLDTGSD+NWLQC+PC++CYQQSDPIFDP SSSY+PL
Sbjct: 152 QGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLT 211
Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C A QC+ L++SACR +CLYQV+YGDGSFTVG+ VTETVSFG +GSV +A+GCGHDNE
Sbjct: 212 CDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSFG-AGSVNRVAIGCGHDNE 270
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR 333
GLFVGSAGLLGLGGG LSLT QIKATS +YCLVDRDS S LEFNS R GD+V APL++
Sbjct: 271 GLFVGSAGLLGLGGGPLSLTSQIKATSFSYCLVDRDSGKSSTLEFNSPRPGDSVVAPLLK 330
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
N+KV+TFYYV LTG SVGG+ V +PP F +D++G GG+IVD GTAITRL+TQAYNS+RD
Sbjct: 331 NQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNSVRD 390
Query: 394 SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
+F R NL+P GVALFDTCYD S L+SVRVPTVS HF +A LPAKNYLIPVD AG
Sbjct: 391 AFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKNYLIPVDGAG 450
Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T+CFAFAPT+S++SIIGNVQQQGTRVSFDLAN+ VGF+PNKC
Sbjct: 451 TYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 302/473 (63%), Positives = 373/473 (78%), Gaps = 4/473 (0%)
Query: 27 SRGLS---ETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLP 83
SR LS ++ ++VLDVS ++++T +LS + +P + E P + +SSFSL
Sbjct: 24 SRELSLDTDSHSSVLDVSGSIRKTLDVLSHKSSVSKPSDQRDEKTTSFSPTSLASSFSLE 83
Query: 84 LHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL-PE 142
LH RE+LH H DYR+L+LSRL RDSARV + TKLQLA+ D+ +L P + +IL P+
Sbjct: 84 LHPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPMDTEILHPQ 143
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
DFSTPV SG SQGSGEYF R+G+G P + F MV+DTGSD+NWLQC+PC +CYQQ DPIFD
Sbjct: 144 DFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFD 203
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
P +SSS+S L C PQC++LDV ACR + CLYQV+YGDGS+TVGD TETVSFGNSGSV
Sbjct: 204 PASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVD 263
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
+A+GCGHDNEGLFVG+AGL+GLGGG LSLT QIKA+S +YCLV+RDS S LEFNSA+
Sbjct: 264 KVAIGCGHDNEGLFVGAAGLIGLGGGPLSLTSQIKASSFSYCLVNRDSVDSSTLEFNSAK 323
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
D+VTAP+ +N KVDTFYYVG+TG SVGG+ + IPPS+FE+D +G GGIIVDCGTA+TR
Sbjct: 324 PSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTR 383
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
LQTQAYN+LRD+FV+L +L TSG ALFDTCY+ S SVRVPTV+ F GK+L LP
Sbjct: 384 LQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPP 443
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NYLIPVDSAGTFC AFAPT+++LSIIGNVQQQGTRV++DLAN++V F+ KC
Sbjct: 444 SNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 300/458 (65%), Positives = 367/458 (80%), Gaps = 4/458 (0%)
Query: 38 LDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHND 97
LDVS++LQQ +L F+P F ++ P NSS SFSL LH R+ LH H D
Sbjct: 38 LDVSASLQQANQVLKFDPTASISFQQQVHLV----PSNSSFSFSLQLHPRDSLHNAGHKD 93
Query: 98 YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
Y+SLVLSRL RDS+RV ++ +L+ A+ + R +L+P + +ILPED STP++SG SQGSG
Sbjct: 94 YKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSG 153
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EYFSR+GVG P + F MVLDTGSDINWLQC+PCT+CYQQ+DPIFDP++SSS++ LPC +
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
QC++L+ S CRA++CLYQV+YGDGSFTVG+ VTET++FGNSG + +A+GCGHDNEGLFV
Sbjct: 214 QCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSGMINDVAVGCGHDNEGLFV 273
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
GSAGLLGLGGG LSLT Q+KA+S +YCLVDRDS +S LEFNSA D+V APL+++ KV
Sbjct: 274 GSAGLLGLGGGPLSLTSQMKASSFSYCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKV 333
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
DTFYYVGLTG SVGGQ + IPP+LF+MD++G GGIIVD GTAITRLQTQAYN+LRD+FV
Sbjct: 334 DTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVS 393
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
LK T+G ALFDTCYD S V +PTVS F GK+L LP KNYLIPVDS GTFCF
Sbjct: 394 RTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCF 453
Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAPT+S+LSIIGNVQQQGTRV +DLAN+ VGF+P+KC
Sbjct: 454 AFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 299/458 (65%), Positives = 366/458 (79%), Gaps = 4/458 (0%)
Query: 38 LDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHND 97
LDVS++LQQ +L F+P F ++ P NSS SFSL LH R+ LH H D
Sbjct: 38 LDVSASLQQANQVLKFDPTASISFQQQVHLV----PSNSSFSFSLQLHPRDSLHNAGHKD 93
Query: 98 YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
Y+SLVLSRL RDS+RV ++ +L+ A+ + R +L+P + +ILPED STP++SG SQGSG
Sbjct: 94 YKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSG 153
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EYFSR+GVG P + F MVLDTGSDINWLQC+PCT+CYQQ+DPIFDP++SSS++ LPC +
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
QC++L+ S CRA++CLYQV+YGDGSFTVG+ V ET++FGNSG + +A+GCGHDNEGLFV
Sbjct: 214 QCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSGMINNVAVGCGHDNEGLFV 273
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
GSAGLLGLGGG LSLT Q+KA+S +YCLVDRDS +S LEFNSA D+V APL+++ KV
Sbjct: 274 GSAGLLGLGGGSLSLTSQMKASSFSYCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKV 333
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
DTFYYVGLTG SVGGQ + IPP+LF+MD++G GGIIVD GTAITRLQTQAYN+LRD+FV
Sbjct: 334 DTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVS 393
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
LK T+G ALFDTCYD S V +PTVS F GK+L LP KNYLIPVDS GTFCF
Sbjct: 394 RTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCF 453
Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAPT+S+LSIIGNVQQQGTRV +DLAN+ VGF+P+KC
Sbjct: 454 AFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 567 bits (1460), Expect = e-159, Method: Compositional matrix adjust.
Identities = 284/463 (61%), Positives = 352/463 (76%), Gaps = 10/463 (2%)
Query: 35 TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
TT+LDV +++Q+ E I + + PF ++ + SSS ++ LHSR + KT+
Sbjct: 25 TTLLDVEASIQKAEAIFTSSATKMTPFNQQE-------IVTSSSQLTMELHSRTSVQKTK 77
Query: 95 HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP--AEAQILPEDFSTPVVSGA 152
H DYRSL LSRLERDSARV ++ T+L LAI+ + +LKP ++Q ED P++SG
Sbjct: 78 HPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGT 137
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
SQGSGEYFSR+G+G P MVLDTGSD+NW+QC PC +CY Q+DPIF+P +S+SYSPL
Sbjct: 138 SQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPL 197
Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
C QC+SLDVS CR N CLY+V+YGDGS+TVGD VTET++ G S SV +A+GCGH+N
Sbjct: 198 SCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLG-SASVDNVAIGCGHNN 256
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
EGLF+G+AGLLGLGGG LS QI A+S +YCLVDRDS ++ LEFNSA A+TAPL+
Sbjct: 257 EGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNSALLPHAITAPLL 316
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
RN+++DTFYYVG+TG SVGG+ + IP S+FEMDE+G+GGII+D GTA+TRLQT AYN+LR
Sbjct: 317 RNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALR 376
Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
D+FV+ +L TS VALFDTCYD S SV VPTV+ H GK L LPA NYLIPVDS
Sbjct: 377 DAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSD 436
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
GTFCFAFAPTSSALSIIGNVQQQGTRV FDLAN+ VGF P +C
Sbjct: 437 GTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 556 bits (1433), Expect = e-156, Method: Compositional matrix adjust.
Identities = 283/466 (60%), Positives = 359/466 (77%), Gaps = 13/466 (2%)
Query: 33 TATTVLDVSSALQQTEHILSFEPETLEPF-AEESETAAESFPLNSSSSFSLPLHSREILH 91
+ TTVLDV++++Q+T++I S P+ + PF +E ET +SS ++ L SR +
Sbjct: 29 SETTVLDVAASIQRTKNIFSSGPK-MSPFNQQEKET--------TSSELTVELLSRTSIQ 79
Query: 92 KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE--AQILPEDFSTPVV 149
KT H Y+SL LSRL+RDSARV +L+T+L LAI ++ +LKP E ++ PED +P++
Sbjct: 80 KTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPII 139
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SG SQGSGEYFSR+G+G PP Q ++LDTGSD+NW+QC PC +CYQQ+DPIF+P +S+S+
Sbjct: 140 SGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASF 199
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
S L C QC+SLDVS CR + CLY+V+YGDGS+TVGD VTET++ G S V +A+GCG
Sbjct: 200 STLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLG-SAPVDNVAIGCG 258
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
H+NEGLFVG+AGLLGLGGG LS QI ATS +YCLVDRDS ++ LEFNS +AV+A
Sbjct: 259 HNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSA 318
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+RN +DTFYYVGLTG SVGG+ V IP S F++DE+G+GG+IVD GTAITRLQT YN
Sbjct: 319 PLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYN 378
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
SLRD+FV+ +L T+G+ALFDTCYD S +V VPTVS HF GK L LPAKNYL+P+
Sbjct: 379 SLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPL 438
Query: 450 DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
DS GTFCFAFAPT+S+LSIIGNVQQQGTRV +DL N+ VGF PNKC
Sbjct: 439 DSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 556 bits (1432), Expect = e-155, Method: Compositional matrix adjust.
Identities = 293/489 (59%), Positives = 368/489 (75%), Gaps = 6/489 (1%)
Query: 10 FTITTILFSFCLF--TSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEESET 67
F + + FC + + ++R LS TTVLDVS +++++ ++LS P+ + E +
Sbjct: 6 FLLCVLFAFFCTWGVSLVNARRLSLPRTTVLDVSGSIRESLNVLSLNPQYEQ---MEFQH 62
Query: 68 AAESFPLNSSSSFSLPLHS-REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYN 126
SFP +SSSS R +HK+ H DY+SLVL+RLERDS RV +L T++ LAI
Sbjct: 63 QERSFPSSSSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAG 122
Query: 127 VDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQ 186
+ + +LKP E ++ E TP+VSGASQGSGEYFSR+G+G+PP+ MV+DTGSD+NW+Q
Sbjct: 123 ITKSDLKPVEKELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQ 182
Query: 187 CRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVG 246
C PC +CYQQ+DPIF+P SSSY+PL C QCKSLDVS CR + CLY+V+YGDGS+TVG
Sbjct: 183 CAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVG 242
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV 306
D TET++ S S+ +A+GCGHDNEGLFVG+AGLLGLGGG LS QI A+S +YCLV
Sbjct: 243 DFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLV 302
Query: 307 DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
+RD+ ++ LEFNS +VTAPL+RN ++DTFYY+G+TG VGGQ + IP S FE+DE
Sbjct: 303 NRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDE 362
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
+G+GGIIVD GTA+TRLQ+ YNSLRDSFVR +L TSGVALFDTCYD S SV VP
Sbjct: 363 SGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVP 422
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
TVS HF GK L LPAKNYLIPVDSAGTFCFAFAPT+SALSIIGNVQQQGTRVS+DL+N+
Sbjct: 423 TVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNS 482
Query: 487 RVGFTPNKC 495
VGF+PN C
Sbjct: 483 LVGFSPNGC 491
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 268/356 (75%), Positives = 310/356 (87%), Gaps = 1/356 (0%)
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
PED STPV SG SQGSGEYF+R+GVG P RQF MVLDTGSDINWLQC+PCT+CYQQ+DPI
Sbjct: 2 PEDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPI 61
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
FDP SS+Y+P+ C + QC SL++S+CR+ +CLYQV YGDGS+T GD TE+VSFGNSGS
Sbjct: 62 FDPTASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGS 121
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
VK +ALGCGHDNEGLFVG+AGLLGLGGG LSLT Q+KATS +YCLV+RDS S L+FNS
Sbjct: 122 VKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNS 181
Query: 321 AR-GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
A+ G D+VTAPL++N+K+DTFYYVGL+G SVGGQ V IP S F +DE+G+GGIIVDCGTA
Sbjct: 182 AQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTA 241
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
ITRLQTQAYN LRD+FVR+ NLK TS VALFDTCYD SG SVRVPTVS HF GK+ +
Sbjct: 242 ITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWN 301
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LPA NYLIPVDSAGT+CFAFAPT+S+LSIIGNVQQQGTRV+FDLANNR+GF+PNKC
Sbjct: 302 LPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 276/478 (57%), Positives = 359/478 (75%), Gaps = 17/478 (3%)
Query: 23 TSASSRGLSETATT---VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSS 79
+S SR L ET+TT +L+V+ ++ +T++ SF L E++ +A SSS
Sbjct: 18 SSVFSRILPETSTTTTSILNVADSIHRTKYTSSFR---LNQQEEQTHSA--------SSS 66
Query: 80 FSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
FSL LHSR + T H+DY+SL L+RL RD+ARV +LIT+L LAI N+ + +LKP
Sbjct: 67 FSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMY 126
Query: 140 LPE--DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
E D P++SG +QGSGEYF+R+G+G P R+ MVLDTGSD+NWLQC PC +CY Q+
Sbjct: 127 TTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQT 186
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
+PIF+P +SSSY PL C PQC +L+VS CR CLY+V+YGDGS+TVGD TET++ G
Sbjct: 187 EPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIG- 245
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE 317
S V+ +A+GCGH NEGLFVG+AGLLGLGGG+L+L Q+ TS +YCLVDRDS ++ ++
Sbjct: 246 STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVD 305
Query: 318 FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
F ++ DAV APL+RN ++DTFYY+GLTG SVGG+ +QIP S FEMDE+G GGII+D G
Sbjct: 306 FGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSG 365
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
TA+TRLQT+ YNSLRDSFV+ +L+ +GVA+FDTCY+ S +V VPTV+ HF GK
Sbjct: 366 TAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKM 425
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L LPAKNY+IPVDS GTFC AFAPT+S+L+IIGNVQQQGTRV+FDLAN+ +GF+ NKC
Sbjct: 426 LALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 280/475 (58%), Positives = 348/475 (73%), Gaps = 17/475 (3%)
Query: 28 RGLSETATT-VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHS 86
R L T TT VLDV++++Q+T+ + + EP++ P ET ++ SS SL L+S
Sbjct: 22 RTLHPTPTTSVLDVAASIQRTQQVFAVEPKSSTP----DETT-----VSDPSSLSLQLNS 72
Query: 87 REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP------AEAQIL 140
R + K H+DY+SL LSRL+RDSARV +L ++ LAI + +L+P +Q
Sbjct: 73 RISVMKASHSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFG 132
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
EDF +P+VSGASQGSGEYFSR+G+G PP MVLDTGSD++W+QC PC ECY+Q+DPI
Sbjct: 133 TEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPI 192
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
F+P +S+S++ L C QCKSLDVS CR CLY+V+YGDGS+TVGD VTETV+ G S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLG-STS 251
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
+ IA+GCGH+NEGLF+G+AGLLGLGGG LS Q+ A+S +YCLVDRDS ++ L+FNS
Sbjct: 252 LGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNS 311
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
DAVTAPL RN +DTF+Y+GLTG SVGG + IP + F+M E G+GGIIVD GTA+
Sbjct: 312 PITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAV 371
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TRLQT YN LRD+FV+ +L+ GVALFDTCYD S V VPTVS HF G L L
Sbjct: 372 TRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PAKNYLIPVDS GTFCFAFAPT S LSI+GN QQQGTRV FDLAN+ VGF+PNKC
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 279/475 (58%), Positives = 347/475 (73%), Gaps = 17/475 (3%)
Query: 28 RGLSETATT-VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHS 86
R L T TT VLDV++++Q+T+ + + EP++ P ET ++ SS SL L+S
Sbjct: 22 RTLHPTPTTSVLDVAASIQRTQQVFAVEPKSSTP----DETT-----VSDPSSLSLQLNS 72
Query: 87 REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP------AEAQIL 140
R + K H+DY+SL LSRL+RDSARV +L ++ LAI + +L+P +Q
Sbjct: 73 RISVMKASHSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFG 132
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
EDF +P+VSGASQGSGEYFSR+G+G PP MVLDTGSD++W+QC PC ECY+Q+DP
Sbjct: 133 TEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPX 192
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
F+P +S+S++ L C QCKSLDVS CR CLY+V+YGDGS+TVGD VTETV+ G S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLG-STS 251
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
+ IA+GCGH+NEGLF+G+AGLLGLGGG LS Q+ A+S +YCLVDRDS ++ L+FNS
Sbjct: 252 LGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNS 311
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
DAVTAPL RN +DTF+Y+GLTG SVGG + IP + F+M E G+GGIIVD GTA+
Sbjct: 312 PITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAV 371
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TRLQT YN LRD+FV+ +L+ GVALFDTCYD S V VPTVS HF G L L
Sbjct: 372 TRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PAKNYLIPVDS GTFCFAFAPT S LSI+GN QQQGTRV FDLAN+ VGF+PNKC
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 268/468 (57%), Positives = 350/468 (74%), Gaps = 15/468 (3%)
Query: 31 SETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL 90
S T T++L+V+ ++ +T++ SF +E +T + S SSFSL LHSR +
Sbjct: 31 SVTTTSILNVADSIHRTKYTSSFRLN-----QQEEQTHSRS------SSFSLQLHSRVSV 79
Query: 91 HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE---DFSTP 147
T H+DY+SL L+RL RD+ARV +LIT+L LAI N+ + +LKP D P
Sbjct: 80 RGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTTMYTTTEEEDIEAP 139
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
++SG +QGSGEYF+R+G+G P R+ MVLDTGSD+NWLQC PC +CY Q++PIF+P +SS
Sbjct: 140 LISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSS 199
Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
SY PL C PQC +L+VS CR CLY+V+YGDGS+TVGD TET++ G S V+ +A+G
Sbjct: 200 SYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIG-STLVQNVAVG 258
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
CGH NEGLFVG+AGLLGLGGG+L+L Q+ TS +YCLVDRDS ++ +EF ++ DAV
Sbjct: 259 CGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVEFGTSLPPDAV 318
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
APL+RN ++DTFYY+GLTG SVGG+ +QIP S FEMDE+G GGII+D GTA+TRLQT
Sbjct: 319 VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGI 378
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
YNSLRDSF++ +L+ +GVA+FDTCY+ S ++ VPTV+ HF GK L LPAKNY+I
Sbjct: 379 YNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKNYMI 438
Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PVDS GTFC AFAPT+S+L+IIGNVQQQGTRV+FDLAN+ +GF+ NKC
Sbjct: 439 PVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 273/465 (58%), Positives = 344/465 (73%), Gaps = 9/465 (1%)
Query: 33 TATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHK 92
+ TT+LDV S+LQ + ++F P L + E L SSSF + L SR + K
Sbjct: 27 SKTTLLDVVSSLQNAHNAVAFTPHHLNQHQRQQEA------LLLSSSFGIHLRSRASIQK 80
Query: 93 TRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE--AQILPEDFSTPVVS 150
H DY+SL LSRL RDSARV +L T+L L + V +L PAE A+ PVVS
Sbjct: 81 PSHRDYKSLTLSRLARDSARVKSLQTRLDLVLKRVSNSDLHPAESNAEFEANALQGPVVS 140
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
G SQGSGEYF R+G+G PP Q +VLDTGSD++W+QC PC+ECYQQSDPIFDP +S+SYS
Sbjct: 141 GTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYS 200
Query: 211 PLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
P+ C APQCKSLD+S CR CLY+V+YGDGS+TVG+ TETV+ G + +V+ +A+GCGH
Sbjct: 201 PIRCDAPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLG-TAAVENVAIGCGH 259
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
+NEGLFVG+AGLLGLGGG LS Q+ ATS +YCLV+RDS A LEFNS + VTAP
Sbjct: 260 NNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRDSDAVSTLEFNSPLPRNVVTAP 319
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
L RN ++DTFYY+GL G SVGG+A+ IP S+FE+D G GGII+D GTA+TRL+++ Y++
Sbjct: 320 LRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDA 379
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
LRD+FV+ A + +GV+LFDTCYD S SV+VPTVS HF G+ L LPA+NYLIPVD
Sbjct: 380 LRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVD 439
Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S GTFCFAFAPT+S+LSI+GNVQQQGTRV FD+AN+ VGF+ + C
Sbjct: 440 SVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 269/463 (58%), Positives = 342/463 (73%), Gaps = 9/463 (1%)
Query: 35 TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
TT+LDV S+LQ ++++F + E++ + SSF + LHSR + K+
Sbjct: 29 TTLLDVVSSLQNAHNVVAFTHHHPNKHQRQQESSLLT------SSFGIQLHSRASIQKSS 82
Query: 95 HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE--AQILPEDFSTPVVSGA 152
H+DY+SL LSRL RDSARV L T+L L + V +L PAE A+ PVVSG
Sbjct: 83 HSDYKSLTLSRLARDSARVKALQTRLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGT 142
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
SQGSGEYF R+G+G PP Q +VLDTGSD++W+QC PC+ECYQQSDPIFDP +S+SYSP+
Sbjct: 143 SQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPI 202
Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
C PQCKSLD+S CR CLY+V+YGDGS+TVG+ TETV+ G S +V+ +A+GCGH+N
Sbjct: 203 RCDEPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLG-SAAVENVAIGCGHNN 261
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
EGLFVG+AGLLGLGGG LS Q+ ATS +YCLV+RDS A LEFNS +A TAPL+
Sbjct: 262 EGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRDSDAVSTLEFNSPLPRNAATAPLM 321
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
RN ++DTFYY+GL G SVGG+A+ IP S FE+D G GGII+D GTA+TRL+++ Y++LR
Sbjct: 322 RNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALR 381
Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
D+FV+ A + +GV+LFDTCYD S SV +PTVS F G+ L LPA+NYLIPVDS
Sbjct: 382 DAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSV 441
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
GTFCFAFAPT+S+LSIIGNVQQQGTRV FD+AN+ VGF+ + C
Sbjct: 442 GTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 266/471 (56%), Positives = 325/471 (69%), Gaps = 10/471 (2%)
Query: 34 ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL--- 90
AT LDV+++L + +S E L A + + + +L LHSR+ L
Sbjct: 35 ATETLDVAASLSRARAAVSAEAVPLHQSAAAAVSTEVVGEEHEEGRLALRLHSRDFLPEE 94
Query: 91 -HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA---QILPEDFST 146
+ RH YRSLVL+RL RDSAR + + +A V R +L PA + +
Sbjct: 95 QGRQRHASYRSLVLARLRRDSARAAAVSARAAMAADGVSRFDLVPANVTAFEASAAEIQG 154
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PVVSG GSGEYFSR+GVG+P RQ MVLDTGSD+ W+QC+PC +CYQQSDP+FDP S
Sbjct: 155 PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 214
Query: 207 SSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
+SY+ + C P+C LD +ACR CLY+VAYGDGS+TVGD TET++ G+S V +
Sbjct: 215 TSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSSV 274
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
A+GCGHDNEGLFVG+AGLL LGGG LS QI AT+ +YCLVDRDSP+S L+F A
Sbjct: 275 AIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDAADA 334
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
+ VTAPLIR+ + TFYYVGL+G SVGGQ + IPPS F MD G GG+IVD GTA+TRLQ
Sbjct: 335 E-VTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQ 393
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
+ AY +LRD+FVR +L TSGV+LFDTCYD S SV VP VSL F G L LPAKN
Sbjct: 394 SSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKN 453
Query: 445 YLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
YLIPVD AGT+C AFAPT++A+SIIGNVQQQGTRVSFD A + VGFT NKC
Sbjct: 454 YLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 483 bits (1243), Expect = e-134, Method: Compositional matrix adjust.
Identities = 274/507 (54%), Positives = 343/507 (67%), Gaps = 16/507 (3%)
Query: 4 IKPFVLFTITTILFSFCLFTSASS------RGLSETATTVLDVSSALQQTEHILSFEPET 57
++P L + ++ + L +A S R S T LDV+++L + LS + +
Sbjct: 1 MQPPTLLPLGAVVVAILLLATAPSPAVSRHRHSSAADTETLDVAASLSRARAALSTDAVS 60
Query: 58 LEPFAEESETAAESFPLNSSSSFSLPLHSREIL--HKTRHNDYRSLVLSRLERDSARVNT 115
L A + A S P +L LHSR+ L + RH YRSLVLSRL RDSAR
Sbjct: 61 LHQSAAAAAGAKRS-PRAREGGLTLRLHSRDFLPEEQGRHETYRSLVLSRLRRDSARAAA 119
Query: 116 LITKLQLAIYNVDRHELKPAEAQIL---PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQF 172
+ + LA V R +L+PA + PVVSG QGSGEYFSR+G+G+P RQ
Sbjct: 120 VSARATLAADGVTRLDLRPANGSAVFAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQL 179
Query: 173 SMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--AN 230
MVLDTGSD+ W+QC+PC +CYQQSDP+FDP S+SY+ + C + +C+ LD +ACR
Sbjct: 180 YMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATG 239
Query: 231 RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGML 290
CLY+VAYGDGS+TVGD TET++ G+S V +A+GCGHDNEGLFVG+AGLL LGGG L
Sbjct: 240 ACLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPL 299
Query: 291 SLTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFS 349
S QI A++ +YCLVDRDSPA+ L+F + A VTAPL+R+ + TFYYV L+G S
Sbjct: 300 SFPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGIS 359
Query: 350 VGGQAVQIPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV 408
VGGQ + IP S F MD +G GG+IVD GTA+TRLQ+ AY +LRD+FV+ A +L TSGV
Sbjct: 360 VGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGV 419
Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSI 468
+LFDTCYD S SV VP VSL F G AL LPAKNYLIPVD AGT+C AFAPT++A+SI
Sbjct: 420 SLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSI 479
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
IGNVQQQGTRVSFD A VGFTPNKC
Sbjct: 480 IGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 256/425 (60%), Positives = 307/425 (72%), Gaps = 10/425 (2%)
Query: 80 FSLPLHSREIL----HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
+L LHSR+ L + RH YRSLVL+RL RDSAR + + +A V R +L PA
Sbjct: 77 LALRLHSRDFLPEEQGRQRHASYRSLVLARLRRDSARAAAVSARAAMAADGVSRFDLVPA 136
Query: 136 EA---QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
+ + PVVSG GSGEYFSR+GVG+P RQ MVLDTGSD+ W+QC+PC +
Sbjct: 137 NVTAFEASAAEIQGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCAD 196
Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVT 250
CYQQSDP+FDP S+SY+ + C P+C LD +ACR CLY+VAYGDGS+TVGD T
Sbjct: 197 CYQQSDPVFDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFAT 256
Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS 310
ET++ G+S V +A+GCGHDNEGLFVG+AGLL LGGG LS QI AT+ +YCLVDRDS
Sbjct: 257 ETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDS 316
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
P+S L+F A + VTAPLIR+ + TFYYVGL+G SVGGQ + IPPS F MD G G
Sbjct: 317 PSSSTLQFGDAADAE-VTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAG 375
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G+IVD GTA+TRLQ+ AY +LRD+FVR +L TSGV+LFDTCYD S SV VP VSL
Sbjct: 376 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 435
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
F G L LPAKNYLIPVD AGT+C AFAPT++A+SIIGNVQQQGTRVSFD A + VGF
Sbjct: 436 RFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGF 495
Query: 491 TPNKC 495
T NKC
Sbjct: 496 TSNKC 500
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 257/427 (60%), Positives = 310/427 (72%), Gaps = 11/427 (2%)
Query: 80 FSLPLHSREIL--HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
+L LHSR+ L + RH YRSLV SRL RDSAR L + LA V R +L+PA
Sbjct: 83 LTLRLHSRDFLPEAQQRHATYRSLVQSRLRRDSARAAALSARATLAADGVTRQDLRPANE 142
Query: 138 QI-----LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
L PVVSG QGSGEYFSR+G+G+P R+ MVLDTGSD+ W+QC+PC +
Sbjct: 143 SAVFGASLAAAIQGPVVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCAD 202
Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVT 250
CYQQSDP+FDP S+SY+ + C +P+C+ LD +ACR CLY+VAYGDGS+TVGD T
Sbjct: 203 CYQQSDPVFDPSLSASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFAT 262
Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS 310
ET++ G+S V +A+GCGHDNEGLFVG+AGLL LGGG LS QI A++ +YCLVDRDS
Sbjct: 263 ETLTLGDSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDS 322
Query: 311 PASGVLEFNS-ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE-AG 368
PA+ L+F + D VTAPL+R+ + TFYYV L+G SVGGQA+ IP S F MD +G
Sbjct: 323 PAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSG 382
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
GG+IVD GTA+TRLQ+ AY +LRD+FVR +L TSGV+LFDTCYD S SV VP V
Sbjct: 383 SGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 442
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
SL F G AL LPAKNYLIPVD AGT+C AFAPT++A+SIIGNVQQQGTRVSFD A V
Sbjct: 443 SLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVV 502
Query: 489 GFTPNKC 495
GFTPNKC
Sbjct: 503 GFTPNKC 509
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 262/471 (55%), Positives = 327/471 (69%), Gaps = 11/471 (2%)
Query: 35 TTVLDVSSALQQTEHILSFEPETL--EPFAEESETAAESFPLNSSSSFSLPLHSREIL-- 90
T LDVS++L + +S + L + A A S +L LHSR+ L
Sbjct: 31 TETLDVSASLSRARAAVSTDARPLLHQSLASTDTDALVKEEQRSGGKLALRLHSRDFLPE 90
Query: 91 HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE----DFST 146
+ RH Y SLVL+RL RDSAR L + LA + R +L+PA A + E +
Sbjct: 91 EQGRHESYSSLVLARLRRDSARAAALSARASLAADGISRADLRPANATPVFEASAAEIQG 150
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PVVSG QGSGEYFSR+GVG P RQ MVLDTGSD+ WLQC+PC +CY QSDP++DP S
Sbjct: 151 PVVSGVGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVS 210
Query: 207 SSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
+SY+ + C +P+C+ LD +ACR CLY+VAYGDGS+TVGD TET++ G+S V +
Sbjct: 211 TSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAPVSNV 270
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
A+GCGHDNEGLFVG+AGLL LGGG LS QI AT+ +YCLVDRDSP+S L+F +
Sbjct: 271 AIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDSEQ- 329
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
AVTAPLIR+ + +TFYYV L+G SVGG+A+ IP S F MD+AG GG+IVD GTA+TRLQ
Sbjct: 330 PAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTRLQ 389
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
+ AY +LR++FV+ +L SGV+LFDTCYD +G SV+VP V+L F G L LPAKN
Sbjct: 390 SGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGELKLPAKN 449
Query: 445 YLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
YLIPVD+AGT+C AFA TS +SIIGNVQQQG RVSFD A N VGFT +KC
Sbjct: 450 YLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 250/473 (52%), Positives = 322/473 (68%), Gaps = 22/473 (4%)
Query: 35 TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL---- 90
+T D+ S L + E ++P EE+ E P +S+PL R+ +
Sbjct: 23 STQKDIYSTLDVQATLRVARGEVVQPAKEET---LEIKP------WSIPLVHRDAMKGNS 73
Query: 91 HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPEDFSTP 147
+K Y + RL+RD+ARV + ++L+LA+ + R LKP + + DF +P
Sbjct: 74 NKNNELSYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSSFTMAESDFQSP 133
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
VVSG QGSGEYFSRIGVG P R MVLDTGSD+ W+QC PC++CYQQSDPI++P SS
Sbjct: 134 VVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSS 193
Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
SY + C A C+ LDVS C R CLYQV+YGDGS+T G+ TET++ G + ++ +A+
Sbjct: 194 SYKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGA-PLQNVAI 252
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF-NSAR 322
GCGHDNEGLFVG+AGLLGLGGG LS Q+ + +YCLVDRDS +S L+F +A
Sbjct: 253 GCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDRDSESSSTLQFGRAAV 312
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
AV AP+++N ++DTFYYV L+G SVGG+ + I S+F +D +G+GG+IVD GTA+TR
Sbjct: 313 PNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTR 372
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
LQT AY+SLRD+F NL T GV+LFDTCYD S SV VPTV HF G ++ LPA
Sbjct: 373 LQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPA 432
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
KNYL+PVDS GTFCFAFAPTSS+LSI+GN+QQQG RVSFD ANN+VGF NKC
Sbjct: 433 KNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 226/358 (63%), Positives = 268/358 (74%), Gaps = 10/358 (2%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PVVSG QGSGEYFSRIG+G+P RQ MVLDTGSD+ WLQC PC +CY QSDP+FDP S
Sbjct: 184 PVVSGVGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALS 243
Query: 207 SSYSPLPCAAPQCKSLDVSACRAN------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
SSY+ +PC +P C++LD SAC N C+Y+VAYGDGS+TVGD TET++ G GS
Sbjct: 244 SSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGS 303
Query: 261 --VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF 318
V +A+GCGHDNEGLFVG+AGLL LGGG LS QI AT +YCLVDRDSP++ L+F
Sbjct: 304 AAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDRDSPSASTLQF 363
Query: 319 NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCG 377
A VTAPL+R+ + +TFYYV L G SVGG+ + IPP+ F MDE G GG+IVD G
Sbjct: 364 G-ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDSG 422
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
TA+TRLQ+ AY++LRD+FVR L SGV+LFDTCYD +G SV+VP VSL F G
Sbjct: 423 TAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFEGGGE 482
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L LPAKNYLIPVD AGT+C AFA T A+SI+GNVQQQG RVSFD A N VGF+PNKC
Sbjct: 483 LKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 248/495 (50%), Positives = 336/495 (67%), Gaps = 22/495 (4%)
Query: 7 FVLFTITTILFSFCLFTSASSRGL--SETATTVLDVSSALQQTEHILSFEPETLEPFAEE 64
F+ TI T L F S SR L S +T++ DVS++ Q LS +P+ L+ +
Sbjct: 9 FLFLTIFTSL----QFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSH- 63
Query: 65 SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
P +S FSLPL+ R LH + DY +LV +RL RD+ARV L L+ ++
Sbjct: 64 -------LP---NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSL 113
Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSG-EYFSRIGVGTPPRQFSMVLDTGSDIN 183
N H + ++ + + PVVSG S+GSG EY ++IGVG P + F +V DTGSD+
Sbjct: 114 -NGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVT 172
Query: 184 WLQCRPCTE---CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGD 240
WLQC+PC CY+Q DPIFDPK+SSSYSPL C + QCK LD + C ++ C+YQV YGD
Sbjct: 173 WLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGD 232
Query: 241 GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS 300
GSFT G+L TET+SFGNS S+ + +GCGHDNEGLF G AGL+GLGGG +SL+ Q+KA+S
Sbjct: 233 GSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS 292
Query: 301 LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
+YCLV+ DS +S LEFNS D++T+PL++N + ++ YV + G SVGG+ + I P+
Sbjct: 293 FSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 352
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL 420
FE+DE+G GGIIVD GT I+RL + Y SLR++FV+L +L P G+++FDTCY+FSG
Sbjct: 353 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 412
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
+V VPT++ G +L LPA+NYLI +D+AGT+C AF T S+LSIIG+ QQQG RVS
Sbjct: 413 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 472
Query: 481 FDLANNRVGFTPNKC 495
+DL N+ VGF+ NKC
Sbjct: 473 YDLTNSLVGFSTNKC 487
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 248/495 (50%), Positives = 336/495 (67%), Gaps = 22/495 (4%)
Query: 7 FVLFTITTILFSFCLFTSASSRGL--SETATTVLDVSSALQQTEHILSFEPETLEPFAEE 64
F+ TI T L F S SR L S +T++ DVS++ Q LS +P+ L+ +
Sbjct: 9 FLFLTIFTSL----QFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSH- 63
Query: 65 SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
P +S FSLPL+ R LH + DY +LV +RL RD+ARV L L+ ++
Sbjct: 64 -------LP---NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSL 113
Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSG-EYFSRIGVGTPPRQFSMVLDTGSDIN 183
N H + ++ + + PVVSG S+GSG EY ++IGVG P + F +V DTGSD+
Sbjct: 114 -NGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVT 172
Query: 184 WLQCRPCTE---CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGD 240
WLQC+PC CY+Q DPIFDPK+SSSYSPL C + QCK LD + C ++ C+YQV YGD
Sbjct: 173 WLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGD 232
Query: 241 GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS 300
GSFT G+L TET+SFGNS S+ + +GCGHDNEGLF G AGL+GLGGG +SL+ Q+KA+S
Sbjct: 233 GSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS 292
Query: 301 LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
+YCLV+ DS +S LEFNS D++T+PL++N + ++ YV + G SVGG+ + I P+
Sbjct: 293 FSYCLVNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 352
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL 420
FE+DE+G GGIIVD GT I+RL + Y SLR++FV+L +L P G+++FDTCY+FSG
Sbjct: 353 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 412
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
+V VPT++ G +L LPA+NYLI +D+AGT+C AF T S+LSIIG+ QQQG RVS
Sbjct: 413 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 472
Query: 481 FDLANNRVGFTPNKC 495
+DL N+ VGF+ NKC
Sbjct: 473 YDLTNSIVGFSTNKC 487
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 221/409 (54%), Positives = 287/409 (70%), Gaps = 13/409 (3%)
Query: 99 RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP-----AEAQILPEDFSTPVVSGAS 153
+ ++ RL+RD+ARV+++ ++QLA V + E+KP +A+ +DFS+ ++SG +
Sbjct: 88 KEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGSSIDARFDAKDFSSSIISGLA 147
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
QGSGEYF+R+GVGTPPR MVLDTGSDI W+QC PC +CY Q+DP+F+P SS+Y +P
Sbjct: 148 QGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVP 207
Query: 214 CAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
CA P CK LD+S CR R C YQV+YGDGSFTVGD TET++F ++ +ALGCGHDN
Sbjct: 208 CATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTF-RGQVIRRVALGCGHDN 266
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSP--ASGVLEFNSARGGDAV 327
EGLF+G+AGLLGLG G LS Q A +YCLVDR + AS ++ +A A+
Sbjct: 267 EGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTASSLIFGKAAIPKSAI 326
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
PL+ N K+DTFYYV L G SVGG+ + IP S+F MD G+GG+I+D GT++TRL
Sbjct: 327 FTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDS 386
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
AY+++RD+F GNLK G +LFDTCYD SGL++V+VPT+ HF G + LPA NYL
Sbjct: 387 AYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYL 446
Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IPVDS+ TFCFAFA + LSIIGN+QQQG RV FD NRVGF C
Sbjct: 447 IPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/326 (63%), Positives = 251/326 (76%), Gaps = 4/326 (1%)
Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--ANR 231
MVLDTGSD+ W+QC+PC +CYQQSDP+FDP S+SY+ + C + +C+ LD +ACR
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
CLY+VAYGDGS+TVGD TET++ G+S V +A+GCGHDNEGLFVG+AGLL LGGG LS
Sbjct: 61 CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 120
Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSV 350
QI A++ +YCLVDRDSPA+ L+F + A VTAPL+R+ + TFYYV L+G SV
Sbjct: 121 FPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISV 180
Query: 351 GGQAVQIPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
GGQ + IP S F MD +G GG+IVD GTA+TRLQ+ AY +LRD+FV+ A +L TSGV+
Sbjct: 181 GGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVS 240
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSII 469
LFDTCYD S SV VP VSL F G AL LPAKNYLIPVD AGT+C AFAPT++A+SII
Sbjct: 241 LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSII 300
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
GNVQQQGTRVSFD A VGFTPNKC
Sbjct: 301 GNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 240/468 (51%), Positives = 317/468 (67%), Gaps = 13/468 (2%)
Query: 31 SETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL 90
S +T DVS+++ Q + LS +P+ PF +T ++ +S S SL H R +
Sbjct: 66 SPYSTNTFDVSASINQALNALSIKPK---PF----QTTHSNYHSSSPLSLSL--HPRLTV 116
Query: 91 HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVS 150
H + DY SLV +RL R +AR +L KL+L++ + + + PV S
Sbjct: 117 HNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR-INGSDSTNSLTAPVTS 175
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSS 207
GASQG+GEYF+RIGVG P + + V DTGSD++WLQC+PC CY+Q PIFDPK+SS
Sbjct: 176 GASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSS 235
Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
SYSPL C + QC LD +AC AN C+Y+V YGDGSFTVG+L TET SF +S S+ + +G
Sbjct: 236 SYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIG 295
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
CGHDNEGLFVG+ GL+GLGGG +SL+ Q++ATS +YCLVD DS +S L+FN+ + D++
Sbjct: 296 CGHDNEGLFVGADGLIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSL 355
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
T+PL++N + TF YV + G SVGG+ + I S FE+DE+G GGIIVD GT IT + +
Sbjct: 356 TSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDV 415
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
Y+ LRD+FV L NL P GV+ FDTCYD S +V VPT++ +L LPAKN LI
Sbjct: 416 YDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLI 475
Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
VDSAGTFC AF P++ LSIIGNVQQQG RVS+DLAN+ VGF+ +KC
Sbjct: 476 QVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 240/468 (51%), Positives = 317/468 (67%), Gaps = 13/468 (2%)
Query: 31 SETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL 90
S +T DVS+++ Q + LS +P+ PF +T ++ +S S SL H R +
Sbjct: 66 SPYSTNTFDVSASINQALNALSIKPK---PF----QTTHSNYHSSSPLSLSL--HPRLTV 116
Query: 91 HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVS 150
H + DY SLV +RL R +AR +L KL+L++ + + + PV S
Sbjct: 117 HNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR-INGSDSTNSLTAPVTS 175
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSS 207
GASQG+GEYF+RIGVG P + + V DTGSD++WLQC+PC CY+Q PIFDPK+SS
Sbjct: 176 GASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSS 235
Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
SYSPL C + QC LD +AC AN C+Y+V YGDGSFTVG+L TET SF +S S+ + +G
Sbjct: 236 SYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIG 295
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
CGHDNEGLFVG+AGL+GLGGG +SL+ Q++ATS +YCLVD DS +S L+FN+ + D++
Sbjct: 296 CGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSL 355
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
T+PL++N + TF YV + G SVGG+ + I S FE+DE+G GGIIVD GT IT + +
Sbjct: 356 TSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDV 415
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
Y+ LRD+FV L NL P GV+ FDTCYD S +V VPT++ +L LPAKN L
Sbjct: 416 YDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLF 475
Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
VDSAGTFC AF P++ LSIIGNVQQQG RVS+DLAN+ VGF+ +KC
Sbjct: 476 QVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/423 (49%), Positives = 280/423 (66%), Gaps = 22/423 (5%)
Query: 80 FSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA---IYNVDRHELKPAE 136
+ + + R+ L +D+R + RL+RD+ RV +LI +L Y VD
Sbjct: 72 WMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVD-------- 123
Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
DF T V+SG QGSGEYF RIGVG+PPR MV+D+GSDI W+QC+PCT+CY Q
Sbjct: 124 ------DFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQ 177
Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
SDP+FDP S+S++ + C++ C L+ + C A RC Y+V+YGDGS+T G L ET++FG
Sbjct: 178 SDPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFG 237
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ V+ +A+GCGH N G+FVG+AGLLGLGGG +S Q+ + +YCLV R + +S
Sbjct: 238 RT-MVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSS 296
Query: 314 GVLEFN-SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G L F A A PL+RN + +FYY+GL G VGG V I +F + E GDGG+
Sbjct: 297 GSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGV 356
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
++D GTA+TRL T AY + RD+F+ NL +GVA+FDTCYD G SVRVPTVS +F
Sbjct: 357 VMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYF 416
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G L LPA+N+LIP+D AGTFCFAFAP++S LSI+GN+QQ+G ++SFD AN VGF P
Sbjct: 417 SGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGP 476
Query: 493 NKC 495
N C
Sbjct: 477 NIC 479
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 209/427 (48%), Positives = 278/427 (65%), Gaps = 20/427 (4%)
Query: 75 NSSSSFSLPLHSREIL--HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHEL 132
+S + + L L R+ + T H D+R+ +R++RD+ RV L R L
Sbjct: 61 SSPAKYKLKLVHRDKVPTFNTSH-DHRTRFNARMQRDTKRVAAL------------RRHL 107
Query: 133 KPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
+ E F + VVSG QGSGEYF RIGVG+PPR +V+D+GSDI W+QC PCT+
Sbjct: 108 AAGKPTYAEEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQ 167
Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTET 252
CY QSDP+F+P SSSY+ + CA+ C +D + C RC Y+V+YGDGS+T G L ET
Sbjct: 168 CYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALET 227
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRD 309
++FG + ++ +A+GCGH N+G+FVG+AGLLGLG G +S Q+ + +YCLV R
Sbjct: 228 LTFGRT-LIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRG 286
Query: 310 SPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
+SG+L+F A A PLI N + +FYYVGL+G VGG V I +F++ E G
Sbjct: 287 IQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELG 346
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
DGG+++D GTA+TRL T AY + RD+F+ NL SGV++FDTCYD G SVRVPTV
Sbjct: 347 DGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTV 406
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
S +F G L LPA+N+LIPVD G+FCFAFAP+SS LSIIGN+QQ+G +S D AN V
Sbjct: 407 SFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFV 466
Query: 489 GFTPNKC 495
GF PN C
Sbjct: 467 GFGPNVC 473
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 236/454 (51%), Positives = 293/454 (64%), Gaps = 22/454 (4%)
Query: 51 LSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDS 110
+SF+PE+ EP +E F S S S+ L+ I + + + L SRL+RDS
Sbjct: 44 ISFQPES-EPDSES--LLGSEFESGSDSESSITLNLDHIDALSSNKTPQELFSSRLQRDS 100
Query: 111 ARVNTLIT-KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPP 169
RV ++ T Q+ NV H + FS+ VVSG SQGSGEYF+R+GVGTP
Sbjct: 101 RRVKSIATLAAQIPGRNVT-HAPRTG-------GFSSSVVSGLSQGSGEYFTRLGVGTPA 152
Query: 170 RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA 229
R MVLDTGSDI WLQC PC CY QSDPIFDP+ S +Y+ +PC++P C+ LD + C
Sbjct: 153 RYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNT 212
Query: 230 NR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
R CLYQV+YGDGSFTVGD TET++F VKG+ALGCGHDNEGLFVG+AGLLGLG
Sbjct: 213 RRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGCGHDNEGLFVGAAGLLGLGK 271
Query: 288 GMLSLTKQIKA---TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYY 342
G LS Q +YCLVDR S S V+ N+A A PL+ N K+DTFYY
Sbjct: 272 GKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYY 331
Query: 343 VGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
V L G SVGG V + SLF++D+ G+GG+I+D GT++TRL AY ++RD+F A
Sbjct: 332 VELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKA 391
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
LK +LFDTC+D S + V+VPTV LHF G + LPA NYLIPVD+ G FCFAFA
Sbjct: 392 LKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAG 450
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T LSIIGN+QQQG RV +DLA++RVGF P C
Sbjct: 451 TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 225/404 (55%), Positives = 273/404 (67%), Gaps = 19/404 (4%)
Query: 101 LVLSRLERDSARVNTLIT-KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
L SRL+RDS RV ++ T Q+ NV H +P FS+ VVSG SQGSGEY
Sbjct: 91 LFSSRLQRDSRRVKSIATLAAQIPGRNVT-HAPRPG-------GFSSSVVSGLSQGSGEY 142
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
F+R+GVGTP R MVLDTGSDI WLQC PC CY QSDPIFDP+ S +Y+ +PC++P C
Sbjct: 143 FTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHC 202
Query: 220 KSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+ LD + C R CLYQV+YGDGSFTVGD TET++F VKG+ALGCGHDNEGLFV
Sbjct: 203 RRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGCGHDNEGLFV 261
Query: 278 GSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLI 332
G+AGLLGLG G LS Q +YCLVDR S S V+ N+A A PL+
Sbjct: 262 GAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLL 321
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
N K+DTFYYVGL G SVGG V + SLF++D+ G+GG+I+D GT++TRL AY ++
Sbjct: 322 SNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAM 381
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
RD+F A LK +LFDTC+D S + V+VPTV LHF G + LPA NYLIPVD+
Sbjct: 382 RDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDT 440
Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
G FCFAFA T LSIIGN+QQQG RV +DLA++RVGF P C
Sbjct: 441 NGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 224/406 (55%), Positives = 273/406 (67%), Gaps = 19/406 (4%)
Query: 99 RSLVLSRLERDSARVNTLIT-KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
+ L SRL+RDS RV ++ T Q+ NV H +P FS+ VVSG SQGSG
Sbjct: 89 QELFSSRLQRDSRRVRSIATLAAQIPGRNVT-HAPRPG-------GFSSSVVSGLSQGSG 140
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EYF+R+GVGTP R MVLDTGSDI WLQC PC CY QSDPIFDP+ S +Y+ +PC++P
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200
Query: 218 QCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
C+ LD + C R CLYQV+YGDGSFTVGD TET++F VKG+ALGCGHDNEGL
Sbjct: 201 HCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGCGHDNEGL 259
Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAP 330
FVG+AGLLGLG G LS Q +YCLVDR S S V+ N+A A P
Sbjct: 260 FVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTP 319
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
L+ N K+DTFYYVGL G SVGG V + SLF++D+ G+GG+I+D GT++TRL AY
Sbjct: 320 LLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYI 379
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
++RD+F A LK +LFDTC+D S + V+VPTV LHF + LPA NYLIPV
Sbjct: 380 AMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRRAD-VSLPATNYLIPV 438
Query: 450 DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D+ G FCFAFA T LSIIGN+QQQG RV +DLA++RVGF P C
Sbjct: 439 DTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 215/468 (45%), Positives = 304/468 (64%), Gaps = 35/468 (7%)
Query: 37 VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL---HKT 93
+L+V A+ +T+ + E F +++T E + L L R+ + +K+
Sbjct: 40 LLNVKEAITETK-----ASQYQELFDNQNDTLTEG-------KWKLKLVHRDKITAFNKS 87
Query: 94 RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
+ D+ +R++RD RV TLI +L + A + E+F VVSG +
Sbjct: 88 SY-DHSHNFHARIQRDKKRVATLIRRL----------SPRDATSSYSVEEFGAEVVSGMN 136
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
QGSGEYF RIGVG+PPR+ +V+D+GSDI W+QC+PCT+CY Q+DP+FDP S+S+ +P
Sbjct: 137 QGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVP 196
Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C++ C+ ++ + C A C Y+V YGDGS+T G L ET++FG + V+ +A+GCGH N
Sbjct: 197 CSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRT-VVRNVAIGCGHRNR 255
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTA- 329
G+FVG+AGLLGLGGG +SL Q+ + +YCLV R + ++G LEF RG V A
Sbjct: 256 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEF--GRGAMPVGAA 313
Query: 330 --PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
PLIRN + +FYY+ L+G VGG V I +F+++E G+GG+++D GTA+TR+ T A
Sbjct: 314 WIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVA 373
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
Y + RD+F+ GNL SGV++FDTCY+ +G SVRVPTVS +F G L LPA+N+LI
Sbjct: 374 YVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLI 433
Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PVD GTFCFAFA + S LSIIGN+QQ+G ++SFD AN VGF PN C
Sbjct: 434 PVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 221/401 (55%), Positives = 270/401 (67%), Gaps = 35/401 (8%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
RL RD+ RV+ L ++ FS+ VVSG SQGSGEYF+R+G
Sbjct: 77 RLHRDTLRVHALNSR---------------------AAGFSSSVVSGLSQGSGEYFTRLG 115
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
VGTPPR MVLDTGSD+ WLQC PC +CY QSDPIF+P S S++ +PC++P C+ LD
Sbjct: 116 VGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRLDS 175
Query: 225 SAC--RANRCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVGSAG 281
S C R + CLYQV+YGDGSFT GD TET++F GN + +ALGCGH NEGLFVG+AG
Sbjct: 176 SGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGN--KIAKVALGCGHHNEGLFVGAAG 233
Query: 282 LLGLGGGMLSLTKQIKAT---SLAYCLVDRDS---PASGVLEFNSARGGDAVTAPLIRNK 335
LLGLG G LS Q +YCLVDR + P+S V ++A A PLIRN
Sbjct: 234 LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFG-DAAISRLARFTPLIRNP 292
Query: 336 KVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
K+DTFYYVGL G SVGG V+ + PSLF++D AG+GG+I+D GT++TRL AY +LRD+
Sbjct: 293 KLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDA 352
Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
F A +LK +LFDTCYD SG SV+VPTV LHF G + LPA NYLIPVD G+
Sbjct: 353 FRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHF-RGADMALPATNYLIPVDENGS 411
Query: 455 FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
FCFAFA T S LSIIGN+QQQG RV +DLA +R+GF P C
Sbjct: 412 FCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 235/510 (46%), Positives = 312/510 (61%), Gaps = 38/510 (7%)
Query: 3 PIKPFVLFTITTILFSFCLFTSASSRGLSET---ATTVLDVSSALQQTEHILSFEPETLE 59
P+ PF +L SA SR +S A LDV+S+L++T+
Sbjct: 10 PLLPFTFLLCVGMLL---FLQSAQSRPISVPEVPAYHALDVASSLRETDTA--------- 57
Query: 60 PFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHN---DYRSLVLSRLERDSARVNTL 116
A +E E+ P S S + +H +L K N Y + +L R++ RV L
Sbjct: 58 --AGGAEYKRETKPRRSPWSVEV-VHRDALLLKNAANATASYERRLKEKLRREAVRVRGL 114
Query: 117 ITKLQLAIY----NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQF 172
+++ + V+R+E AE DF VVSG QGSGEYF+RIGVGTP R+
Sbjct: 115 ERQIERTLTLNKDPVNRYE-NVAEVD---ADFGGEVVSGMEQGSGEYFTRIGVGTPTREQ 170
Query: 173 SMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRC 232
MVLDTGSD+ W+QC PC ECY Q+DPIF+P S+S+S + C + C LD C + C
Sbjct: 171 YMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGC 230
Query: 233 LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL 292
LY+ +YGDGS++ G TET++FG + SV +A+GCGH N GLF+G+AGLLGLG G LS
Sbjct: 231 LYEASYGDGSYSTGSFATETLTFGTT-SVANVAIGCGHKNVGLFIGAAGLLGLGAGALSF 289
Query: 293 TKQI---KATSLAYCLVDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTG 347
QI + +YCLVDR+S +SG L+F S G T PL +N + TFYY+ +T
Sbjct: 290 PNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSVPVGSIFT-PLEKNPHLPTFYYLSVTA 348
Query: 348 FSVGGQAVQ-IPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
SVGG + IPP +F +DE +G GG I+D GT +TRL T AY+++RD+FV G L T
Sbjct: 349 ISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRT 408
Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA 465
V++FDTCYD SGL+ V VPTV HF G +L LPAKNYLIP+D+ GTFCFAFAP +S+
Sbjct: 409 DAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASS 468
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+SI+GN QQQ RVSFD AN+ VGF ++C
Sbjct: 469 VSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 215/442 (48%), Positives = 291/442 (65%), Gaps = 21/442 (4%)
Query: 59 EPFAEESETAAESFPLNSSSSFSLPL-HSREILHKTRHNDYRSLVLSRLERDSARVNTLI 117
P ++ +A E+ +SS+ + L L H ++ ++D+R+ +R++RD+ R +L+
Sbjct: 50 HPHNKKLNSATEA---SSSAKYKLKLVHRDKVPTFNTYHDHRTRFNARMQRDTKRAASLL 106
Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
+L KP A E F + VVSG QGSGEYF RIGVG+PPR +V+D
Sbjct: 107 RRLAAG---------KPTYAA---EAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVMD 154
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVA 237
+GSDI W+QC PCT+CY QSDP+F+P SSS+S + CA+ C +D +AC RC Y+V+
Sbjct: 155 SGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTVCSHVDNAACHEGRCRYEVS 214
Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK 297
YGDGS+T G L ET++FG + ++ +A+GCGH N+G+FVG+AGLLGLGGG +S Q+
Sbjct: 215 YGDGSYTKGTLALETITFGRT-LIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLG 273
Query: 298 ATS---LAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
+ +YCLV R +SG+LEF A A PLI N + +FYY+GL+G VGG
Sbjct: 274 GQTGGAFSYCLVSRGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGL 333
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT 413
V I +F++ E GDGG+++D GTA+TRL T AY + RD F+ NL SGV++FDT
Sbjct: 334 RVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDT 393
Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQ 473
CYD G SVRVPTVS +F G L LPA+N+LIPVD GTFCFAFAP+SS LSIIGN+Q
Sbjct: 394 CYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQ 453
Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
Q+G ++S D AN VGF PN C
Sbjct: 454 QEGIQISVDGANGFVGFGPNVC 475
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 204/422 (48%), Positives = 272/422 (64%), Gaps = 39/422 (9%)
Query: 80 FSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA---IYNVDRHELKPAE 136
+ + + R+ L +D+R + RL+RD+ RV +LI +L Y VD
Sbjct: 133 WMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVD-------- 184
Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
DF T V+SG QGSGEYF RIGVG+PPR MV+D+GSDI W+QC+PCT+CY Q
Sbjct: 185 ------DFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQ 238
Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
SDP+FDP S+S++ + C++ C L+ + C A RC Y+V+YGDGS+T G L ET++FG
Sbjct: 239 SDPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFG 298
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ V+ +A+GCGH N G+FVG+AGLLGLGGG +S Q+ + +YCLV
Sbjct: 299 RT-MVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS------ 351
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
A PL+RN + +FYY+GL G VGG V I +F + E GDGG++
Sbjct: 352 ------------AAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVV 399
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GTA+TRL T AY + RD+F+ NL +GVA+FDTCYD G SVRVPTVS +F
Sbjct: 400 MDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFS 459
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G L LPA+N+LIP+D AGTFCFAFAP++S LSI+GN+QQ+G ++SFD AN VGF PN
Sbjct: 460 GGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPN 519
Query: 494 KC 495
C
Sbjct: 520 IC 521
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 210/402 (52%), Positives = 265/402 (65%), Gaps = 13/402 (3%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
+ RD+ RV ++ ++ + + R + + ++ +DF PVVSG S GSGEYF RI V
Sbjct: 5 ISRDNLRVASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFIRISV 64
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTPPR+ +V+DTGSDI WLQC PC CY QSD IFDP SS+YS L C+ QC +LD+
Sbjct: 65 GTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIG 124
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV-----KGIALGCGHDNEGLFVGSA 280
C+AN+CLYQV YGDGSFT G+ T+ VS ++ V I LGCGHDNEG FVG+A
Sbjct: 125 TCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAA 184
Query: 281 GLLGLGGGMLSLTKQIKATS---LAYCLVDR--DSPASGVLEFNSAR--GGDAVTAPLIR 333
GLLGLG G LS Q+ + +YCL DR DS L F A A P
Sbjct: 185 GLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFTPQDS 244
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
N +V TFYY+ +TG SVGG + IP S F++D G+GG+I+D GT++TRLQ AY SLRD
Sbjct: 245 NMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLRD 304
Query: 394 SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
+F +L PT+G +LFDTCYD SGL SV VPTV+LHF G L LPA NYLIPVD++
Sbjct: 305 AFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSN 364
Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
TFC AFA T+ SIIGN+QQQG RV +D +N+VGF P++C
Sbjct: 365 TFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 211/426 (49%), Positives = 278/426 (65%), Gaps = 20/426 (4%)
Query: 84 LHSREILHKTRHN---DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA----E 136
+H +L K N Y + +L R++ARV L +++ + + + PA
Sbjct: 76 VHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKL----KLKKDPAGSYEN 131
Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
+ +F + VVSG QGSGEYF+RIG+GTP R+ MVLDTGSD+ W+QC PC ECY Q
Sbjct: 132 VAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQ 191
Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
+DPIF+P +S S+S + C + C LD + C CLY+V+YGDGS+TVG TET++FG
Sbjct: 192 ADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFG 251
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPAS 313
+ S++ +A+GCGHDN GLFVG+AGLLGLG G LS Q+ + +YCLVDRDS +S
Sbjct: 252 TT-SIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESS 310
Query: 314 GVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GD 369
G LEF S G T PL+ N + TFYY+ + SVGG + +P F +DE G
Sbjct: 311 GTLEFGPESVPIGSIFT-PLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGR 369
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
GGII+D GTA+TRLQT AY++LRD+F+ +L G+++FDTCYD S L+SV +P V
Sbjct: 370 GGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVG 429
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
HF G LPAKN LIP+DS GTFCFAFAP S LSI+GN+QQQG RVSFD AN+ VG
Sbjct: 430 FHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVG 489
Query: 490 FTPNKC 495
F ++C
Sbjct: 490 FAIDQC 495
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 227/446 (50%), Positives = 292/446 (65%), Gaps = 21/446 (4%)
Query: 59 EPFAEESETAAESFPLNSSSSF-SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLI 117
EP + S P +S+++F S+ LH + L + + + L SRL RD+ARV +LI
Sbjct: 54 EPGTQTFTDQTTSEPSSSATTFLSVQLHHIDALSSDKSS--QDLFNSRLVRDAARVKSLI 111
Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
+ LA V L A FS+ V+SG +QGSGEYF+R+GVGTP R MVLD
Sbjct: 112 S---LAA-TVGGTNLTRARG----PGFSSSVISGLAQGSGEYFTRLGVGTPARYVYMVLD 163
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQ 235
TGSDI W+QC PC +CY Q+DP+FDP S S++ +PC +P C+ LD C + CLYQ
Sbjct: 164 TGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQICLYQ 223
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
V+YGDGSFTVG+ TET++F + V + LGCGHDNEGLFVG+AGLLGLG G LS Q
Sbjct: 224 VSYGDGSFTVGEFSTETLTFRGT-RVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQ 282
Query: 296 IKA---TSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSV 350
I + +YCL DR + + S ++ +SA PL+ N K+DTFYYV L G SV
Sbjct: 283 IGRRFNSKFSYCLGDRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLGISV 342
Query: 351 GGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
GG V I SLF++D G+GG+I+D GT++TRL AY +LRD+F+ A NLK +
Sbjct: 343 GGTRVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFS 402
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSII 469
LFDTC+D SG V+VPTV LHF G + LPA NYLIPVD++G+FCFAFA T+S LSII
Sbjct: 403 LFDTCFDLSGKTEVKVPTVVLHF-RGADVPLPASNYLIPVDNSGSFCFAFAGTASGLSII 461
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
GN+QQQG RV +DLA +RVGF P C
Sbjct: 462 GNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 221/451 (49%), Positives = 287/451 (63%), Gaps = 25/451 (5%)
Query: 54 EPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARV 113
P +P +++ + + SS++FS+ LH + L + ++ +L +RL+RD+ARV
Sbjct: 34 NPLRSQPTLSWTDSESPTDTAESSATFSVQLHHVDAL--SFNSTPETLFTTRLQRDAARV 91
Query: 114 NTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFS 173
+ + A + + FS+ V+SG +QGSGEYF+RIGVGTPPR
Sbjct: 92 EAISYLAETA-----------GTGKRVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVY 140
Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-- 231
MVLDTGSDI W+QC PC CY QSDP+FDP+ S S++ + C +P C LD C +
Sbjct: 141 MVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQT 200
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
C+YQV+YGDGSFT GD TET++F + V +ALGCGHDNEGLFVG+AGLLGLG G LS
Sbjct: 201 CMYQVSYGDGSFTFGDFSTETLTFRRT-RVARVALGCGHDNEGLFVGAAGLLGLGRGRLS 259
Query: 292 LTKQIKAT---SLAYCLVDRDS---PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
Q +YCLVDR + P+S V +SA A PL+ N K+DTFYYV L
Sbjct: 260 FPSQTGRRFNHKFSYCLVDRSASSKPSSMVFG-DSAVSRTARFTPLVSNPKLDTFYYVEL 318
Query: 346 TGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP 404
G SVGG V I SLF++D+ G+GG+I+D GT++TRL AY + RD+F A NLK
Sbjct: 319 LGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKR 378
Query: 405 TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS 464
+LFDTC+D SG V+VPTV LHF G + LPA NYLIPVD++G FC AFA T
Sbjct: 379 APQFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPASNYLIPVDTSGNFCLAFAGTMG 437
Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LSIIGN+QQQG RV +DLA +RVGF P+ C
Sbjct: 438 GLSIIGNIQQQGFRVVYDLAGSRVGFAPHGC 468
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 236/504 (46%), Positives = 307/504 (60%), Gaps = 26/504 (5%)
Query: 6 PFVLFTITTILFSFCLFTSASSRGLS--ETAT-TVLDVSSALQQTEHILSFEPETLEPFA 62
P V ++ + L SA SR +S E A LD+++ L +T+ P
Sbjct: 47 PLVPYSFLLCIQLLLLLQSAHSRPISAPEPANYHTLDIAAWLIETKT----APAPGRDEY 102
Query: 63 EESETAAESFPLNSSSSFSLPLHSREILHKTRHN---DYRSLVLSRLERDSARVNTLITK 119
E+ ET P + +H +L K N Y + L RD+ RV L +
Sbjct: 103 EKRETKPRQTPWSVQV-----VHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQR 157
Query: 120 LQLAI-YNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
++ + N D A++ E F VVSG +QGSGEYF+RIGVGTP R+ MVLDT
Sbjct: 158 IEKRLRLNKDPAGSHENVAEVAAE-FGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDT 216
Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAY 238
GSD+ W+QC PC++CY Q DPIF+P S+S+S L C + C LD C CLY+V+Y
Sbjct: 217 GSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSY 276
Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI-- 296
GDGS+T+G TE ++FG + SV+ +A+GCGHDN GLFVG+AGLLGLG G+LS Q+
Sbjct: 277 GDGSYTIGSFATEMLTFGTT-SVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGT 335
Query: 297 -KATSLAYCLVDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
+ +YCLVDR S +SG LEF S G +T PL+ N + TFYYV L SVGG
Sbjct: 336 QTGRAFSYCLVDRFSESSGTLEFGPESVPLGSILT-PLLTNPSLPTFYYVPLISISVGGA 394
Query: 354 AVQ-IPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
+ +PP +F +DE +G GG IVD GTA+TRLQT Y+++RD+FV L GV++F
Sbjct: 395 LLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIF 454
Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
DTCYD SGL V VPTV HF G +L LPAKNY+IP+D GTFCFAFAP +S LSI+GN
Sbjct: 455 DTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGN 514
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
+QQQG RVSFD AN+ VGF +C
Sbjct: 515 IQQQGIRVSFDTANSLVGFALRQC 538
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 212/458 (46%), Positives = 285/458 (62%), Gaps = 31/458 (6%)
Query: 46 QTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSR 105
Q H L+F + +S+ +F LN LH ++ H H R R
Sbjct: 48 QILHALNFSDGHRQVSGYKSDN--NTFKLNL-------LHRDKLSHVHGH---RRGFNDR 95
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPA--EAQILPEDFSTPVVSGASQGSGEYFSRI 163
++RD+ RV TL+ +L H A +++ +F+T V+SG GSGEYF RI
Sbjct: 96 MKRDAIRVATLVRRLS--------HGAPAAVKDSRYKVANFATDVISGMEAGSGEYFVRI 147
Query: 164 GVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
GVG+PPR MV+D+GSDI W+QC+PC+ CYQQSDP+FDP SSS++ + C + C L+
Sbjct: 148 GVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDVCDRLE 207
Query: 224 VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
+ C A RC Y+V+YGDGS+T G L ET++ G ++ +A+GCGH N+G+F+G+AGLL
Sbjct: 208 NTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQV-MIRDVAIGCGHTNQGMFIGAAGLL 266
Query: 284 GLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTA---PLIRNKKV 337
GLGGG +S Q+ + +YCLV R + ++G LEF RG V A LIRN +
Sbjct: 267 GLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEF--GRGALPVGATWISLIRNPRA 324
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
+FYY+GL G VGG V +P F++ E G G+++D GTA+TR T AY + RDSF
Sbjct: 325 PSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTA 384
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
NL GV++FDTCYD +G SVRVPTVS +F G L LPA+N+LIPVD GTFC
Sbjct: 385 QTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFLIPVDGGGTFCL 444
Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAP+ S LSIIGN+QQ+G ++SFD AN VGF PN C
Sbjct: 445 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 214/398 (53%), Positives = 268/398 (67%), Gaps = 18/398 (4%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
RL+RD+ RV L ++ R+ KP FS+ V+SG +QGSGEYF+RIG
Sbjct: 84 RLQRDAIRVKKLS-----SLGATSRNLSKPGGT----TGFSSSVISGLAQGSGEYFTRIG 134
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
VGTPP+ MVLDTGSDI WLQC PC CY Q+DP+F+P S S++ + C P C+ L+
Sbjct: 135 VGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLES 194
Query: 225 SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
C + CLYQV+YGDGS+T G+ VTET++F + V+ +ALGCGHDNEGLFVG+AGLL
Sbjct: 195 PGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVEQVALGCGHDNEGLFVGAAGLL 253
Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
GLG G LS Q T +YCLVDR S S V+ NSA A PL+ N ++D
Sbjct: 254 GLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLD 313
Query: 339 TFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
TFYYV L G SVGG V I S F++D G+GG+I+DCGT++TRL AY +LRD+F
Sbjct: 314 TFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRA 373
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
A +LK +LFDTCYD SG +V+VPTV LHF G + LPA NYLIPVD +G FCF
Sbjct: 374 GASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCF 432
Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFA T+S LSIIGN+QQQG RV +DLA++RVGF+P C
Sbjct: 433 AFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 215/399 (53%), Positives = 266/399 (66%), Gaps = 19/399 (4%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
RLERD+ARV TL T L A ++ +PA +S QGSGEYF+R+G
Sbjct: 85 RLERDAARVKTL-THLAAAT-----NKTRPANPGSGFSSSVVSGLS---QGSGEYFTRLG 135
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
VGTPP+ MVLDTGSD+ WLQC+PCT+CY Q+D IFDP S S++ +PC +P C+ LD
Sbjct: 136 VGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCRRLDS 195
Query: 225 SAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
C + N C YQV+YGDGSFT GD TET++F +V +A+GCGHDNEGLFVG+AGL
Sbjct: 196 PGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-RRAAVPRVAIGCGHDNEGLFVGAAGL 254
Query: 283 LGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKV 337
LGLG G LS Q +YCL DR + A S ++ +SA A PL++N K+
Sbjct: 255 LGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFGDSAVSRTARFTPLVKNPKL 314
Query: 338 DTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
DTFYYV L G SVGG V+ I S F +D G+GG+I+D GT++TRL AY SLRD+F
Sbjct: 315 DTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFR 374
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
A +LK +LFDTCYD SGL V+VPTV LHF G + LPA NYL+PVD++G+FC
Sbjct: 375 VGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHF-RGADVSLPAANYLVPVDNSGSFC 433
Query: 457 FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
FAFA T S LSIIGN+QQQG RV FDLA +RVGF P C
Sbjct: 434 FAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 222/440 (50%), Positives = 282/440 (64%), Gaps = 16/440 (3%)
Query: 64 ESETAAESFPLNSSS-SFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQL 122
E+ET + P++ + + ++ L R++L + +L RL+RD+ RV L
Sbjct: 57 ETETQISTLPVSETDPTMTMHLEHRDVLAFNATPE--ALFNLRLQRDAFRVEALSKMAAA 114
Query: 123 AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDI 182
A A+ FS+ V SG +QGSGEYF+R+GVGTPP+ MVLDTGSD+
Sbjct: 115 AGGRRAGRNGTHAQGG----GFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDV 170
Query: 183 NWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDG 241
W+QC PC +CY Q+DP+FDPK S S+S + C +P C LD C + + CLYQVAYGDG
Sbjct: 171 VWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDG 230
Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---A 298
SFT G+ TET++F + V +ALGCGHDNEGLFVG+AGLLGLG G LS Q
Sbjct: 231 SFTFGEFSTETLTFRGT-RVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFG 289
Query: 299 TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
+YCLVDR S S V+ SA AV PLI N K+DTFYY+ LTG SVGG V
Sbjct: 290 RKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVA 349
Query: 357 -IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
I SLF++D AG+GG+I+D GT++TRL +AY SLRD+F A +LK +LFDTC+
Sbjct: 350 GITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCF 409
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
D SG V+VPTV +HF G + LPA NYLIPVD+ G FCFAFA T S LSIIGN+QQQ
Sbjct: 410 DLSGKTEVKVPTVVMHF-RGADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQ 468
Query: 476 GTRVSFDLANNRVGFTPNKC 495
G RV FD+A +R+GF C
Sbjct: 469 GFRVVFDVAASRIGFAARGC 488
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 198/394 (50%), Positives = 265/394 (67%), Gaps = 16/394 (4%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
++RD RV +LI ++ + A EDF + VVSG QGSGEYF RIGV
Sbjct: 1 MQRDVKRVVSLIRRVS-----------SGSTASYGVEDFGSEVVSGMDQGSGEYFVRIGV 49
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
G+PPR MV+D+GSDI W+QC+PCT+CY Q+DP+FDP S+S+ + C++ C +D +
Sbjct: 50 GSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNA 109
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
C + RC Y+V+YGDGS T G L ET++ G + V+ +A+GCGH N+G+FVG+AGLLGL
Sbjct: 110 GCNSGRCRYEVSYGDGSSTKGTLALETLTLGRT-VVQNVAIGCGHMNQGMFVGAAGLLGL 168
Query: 286 GGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNS-ARGGDAVTAPLIRNKKVDTFY 341
GGG +S Q+ + + +YCLV R + ++G LEF S A A PLIRN ++Y
Sbjct: 169 GGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYY 228
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
Y+GL+G VG V I +FE+ E G+GG+++D GTA+TR T AY + RD+F+ GN
Sbjct: 229 YIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGN 288
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
L SGV++FDTCY+ G SVRVPTVS +F G L LPA N+LIPVD AGTFCFAFAP
Sbjct: 289 LPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAP 348
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ S LSI+GN+QQ+G ++S D AN VGF PN C
Sbjct: 349 SPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 213/405 (52%), Positives = 272/405 (67%), Gaps = 18/405 (4%)
Query: 99 RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
+ L SRL RD++RV +L T L A+ + +R + FS+ V SG +QGSGE
Sbjct: 95 QDLFNSRLARDASRVKSL-TSLAAAVGSTNRTRARG-------PGFSSSVTSGLAQGSGE 146
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
YF+R+GVGTP R MVLDTGSD+ W+QC PC +CY Q+DP+F+P S S++ +PC +P
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPL 206
Query: 219 CKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C+ LD C + CLYQV+YGDGSFT G+ TET++F + V +ALGCGHDNEGLF
Sbjct: 207 CRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGT-RVGRVALGCGHDNEGLF 265
Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPL 331
+G+AGLLGLG G LS QI + +YCLVDR S S ++ +SA A PL
Sbjct: 266 IGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFTPL 325
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
+ N K+DTFYYV L G SVGG V I SLF++D G+GG+I+D GT++TRL AY +
Sbjct: 326 VSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVA 385
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
LRD+F A NLK +LFDTC+D SG V+VPTV LHF G + LPA NYLIPVD
Sbjct: 386 LRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPASNYLIPVD 444
Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++G+FCFAFA T S LSI+GN+QQQG RV +DLA +RVGF P C
Sbjct: 445 NSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 221/423 (52%), Positives = 279/423 (65%), Gaps = 20/423 (4%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
+L LH I + + L RL+RD+ RV ++ LA N + A+
Sbjct: 61 ALSLHLHHIDALSSNKTPEQLFQLRLQRDAKRVEGVVA---LAALN-------QSHARRS 110
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
FS+ ++SG +QGSGEYF+RIGVGTP R MVLDTGSD+ WLQC PC +CY Q+DP+
Sbjct: 111 GSSFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPV 170
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
FDP S +Y+ +PC AP C+ LD C + C YQV+YGDGSFT GD TET++F +
Sbjct: 171 FDPTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRT 230
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA--S 313
V +ALGCGHDNEGLF+G+AGLLGLG G LS Q +YCLVDR + A S
Sbjct: 231 -RVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPS 289
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGI 372
V+ +SA A PLI+N K+DTFYY+ L G SVGG V+ + SLF +D AG+GG+
Sbjct: 290 SVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGV 349
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
I+D GT++TRL AY +LRD+F A +LK + +LFDTC+D SGL V+VPTV LHF
Sbjct: 350 IIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHF 409
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G + LPA NYLIPVD++G+FCFAFA T S LSIIGN+QQQG RVSFDLA +RVGF P
Sbjct: 410 -RGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAP 468
Query: 493 NKC 495
C
Sbjct: 469 RGC 471
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 370 bits (950), Expect = e-99, Method: Compositional matrix adjust.
Identities = 204/359 (56%), Positives = 253/359 (70%), Gaps = 9/359 (2%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
FS+ V+SG +QGSGEYF+RIGVGTPP+ MVLDTGSDI WLQC PC CY Q+DP+F+P
Sbjct: 27 FSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNP 86
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
S S++ + C P C+ L+ C + CLYQV+YGDGS+T G+ VTET++F + V+
Sbjct: 87 VKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVE 145
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLE 317
+ALGCGHDNEGLFVG+AGLLGLG G LS Q T +YCLVDR S S V+
Sbjct: 146 QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVF 205
Query: 318 FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDC 376
NSA A PL+ N ++DTFYYV L G SVGG V I S F++D G+GG+I+DC
Sbjct: 206 GNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDC 265
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT++TRL AY +LRD+F A +LK +LFDTCYD SG +V+VPTV LHF G
Sbjct: 266 GTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGA 324
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ LPA NYLIPVD +G FCFAFA T+S LSIIGN+QQQG RV +DLA++RVGF+P C
Sbjct: 325 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 192/394 (48%), Positives = 263/394 (66%), Gaps = 16/394 (4%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
+ RD RV +LI +L + A+ EDF + VVSG +QGSGEYF RIG+
Sbjct: 1 MHRDVKRVASLIHRLS-----------SGSAAKYEVEDFGSDVVSGMNQGSGEYFVRIGL 49
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
G+PPR MV+D+GSDI W+QC+PCT+CY Q+DP+FDP S+S+ + C++ C ++ +
Sbjct: 50 GSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENA 109
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
C + RC Y+V+YGDGS+T G L ET++FG + V+ +A+GCGH N G+FVG+AGLLGL
Sbjct: 110 GCNSGRCRYEVSYGDGSYTKGTLALETLTFGRT-VVRNVAIGCGHSNRGMFVGAAGLLGL 168
Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNS-ARGGDAVTAPLIRNKKVDTFY 341
GGG +S Q+ + +YCLV R + +G LEF S A A PL+RN + +FY
Sbjct: 169 GGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFY 228
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
Y+ L G VG V + +F+++E G GG+++D GTA+TR T AY + R++F+ N
Sbjct: 229 YIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQN 288
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
L SGV++FDTCY+ G SVRVPTVS +F G L +PA N+LIPVD AGTFCFAFAP
Sbjct: 289 LPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAP 348
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ S LSI+GN+QQ+G ++S D AN VGF PN C
Sbjct: 349 SPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 369 bits (947), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 200/404 (49%), Positives = 268/404 (66%), Gaps = 10/404 (2%)
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHEL-KPAEAQ--ILPEDFSTPVVSGASQGSG 157
LV +RL RD R+ ++ +++ L + + + L P + L +DF TP+ SG S GSG
Sbjct: 20 LVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSG 79
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EYF +GVGTPPR +MV DTGSD+ WLQC PC CY Q+DP+F+P SS++ + C +
Sbjct: 80 EYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSS 139
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
C+ L + CR N+CLYQV+YGDGSFTVG+ TET+SFG S +V +A+GCGH+N+GLF
Sbjct: 140 LCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG-SNAVNSVAIGCGHNNQGLFT 198
Query: 278 GSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIR 333
G+AGLLGLG G+LS Q+ + +YCL R+S S L F N A +A L+
Sbjct: 199 GAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAVASNAQFTTLLT 258
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLR 392
N K+DTFYYV + G VGG +V IP +D + G+GG+I+D GTA+TRL T AYN +R
Sbjct: 259 NPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMR 318
Query: 393 DSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
D+F + + K TSG +LFDTCYD SG S+ +P VS F G + LPA+N ++PVD+
Sbjct: 319 DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDN 378
Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GT+C AFAP S SIIGN+QQQ R+SFD NRVG N+C
Sbjct: 379 SGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 200/404 (49%), Positives = 268/404 (66%), Gaps = 10/404 (2%)
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHEL-KPAEAQ--ILPEDFSTPVVSGASQGSG 157
LV +RL RD R+ ++ +++ L + + + L P + L +DF TP+ SG S GSG
Sbjct: 20 LVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSG 79
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EYF +GVGTPPR +MV DTGSD+ WLQC PC CY Q+DP+F+P SS++ + C +
Sbjct: 80 EYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSS 139
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
C+ L + CR N+CLYQV+YGDGSFTVG+ TET+SFG S +V +A+GCGH+N+GLF
Sbjct: 140 LCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG-SNAVNSVAIGCGHNNQGLFT 198
Query: 278 GSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIR 333
G+AGLLGLG G+LS Q+ + +YCL R+S S L F N A +A L+
Sbjct: 199 GAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAVASNAQFTTLLT 258
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLR 392
N K+DTFYYV + G VGG +V IP +D + G+GG+I+D GTA+TRL T AYN +R
Sbjct: 259 NPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMR 318
Query: 393 DSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
D+F + + K TSG +LFDTCYD SG S+ +P VS F G + LPA+N ++PVD+
Sbjct: 319 DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDN 378
Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GT+C AFAP S SIIGN+QQQ R+SFD NRVG N+C
Sbjct: 379 SGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 368 bits (945), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 192/422 (45%), Positives = 268/422 (63%), Gaps = 24/422 (5%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
S L R+ + + + R VL + RD+AR L ++L A Y
Sbjct: 60 SFALVRRDAVTGSTYPSRRHAVLDLVARDNARAEYLASRLSPAAYQ-------------- 105
Query: 141 PEDFS---TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
P FS + VVSG +GSGEYF R+G+G+PP + +V+D+GSD+ W+QC+PC ECY Q+
Sbjct: 106 PTGFSGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQA 165
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG 256
DP+FDP TS+++S +PC + C++L S C + C Y+V+YGDGS+T G L ET++ G
Sbjct: 166 DPLFDPATSATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLG 225
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ +V+G+A+GCGH N GLFVG+AGLLGLG G +SL Q+ + +YCL R + S
Sbjct: 226 GT-AVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGA-GS 283
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
VL + A AV PL+RN + +FYYVGL+G VG + + + LF++ E G GG++
Sbjct: 284 LVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVV 343
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GTA+TRL +AY +LRD+FV G L GV+L DTCYD SG SVRVPTVS +F
Sbjct: 344 MDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFD 403
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
L LPA+N L+ VD G +C AFAP+SS SI+GN+QQ+G +++ D AN +GF P
Sbjct: 404 GAATLTLPARNLLLEVD-GGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPT 462
Query: 494 KC 495
C
Sbjct: 463 TC 464
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 367 bits (942), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 194/424 (45%), Positives = 264/424 (62%), Gaps = 23/424 (5%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
SL L R+ + + R V+ + RD+ARV L L + + L
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
PED + VV G GSGEYF R+GVG+PP +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
FDP SSS+S + C + C++L + C +C Y V YGDGS+T G+L ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ +V+G+A+GCGH N GLFVG+AGLLGLG G +SL Q+ + +YCL R + +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGA 290
Query: 314 G--VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
G VL A AV PL+RN + +FYYVGLTG VGG+ + + SLF++ E G GG
Sbjct: 291 GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGG 350
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
+++D GTA+TRL +AY +LR +F G L + V+L DTCYD SG SVRVPTVS +
Sbjct: 351 VVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFY 410
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
F G L LPA+N L+ V A FC AFAP+SS +SI+GN+QQ+G +++ D AN VGF
Sbjct: 411 FDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469
Query: 492 PNKC 495
PN C
Sbjct: 470 PNTC 473
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 193/424 (45%), Positives = 263/424 (62%), Gaps = 23/424 (5%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
SL L R+ + + R V+ + RD+ARV L L + + L
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
PED + VV G GSGEYF R+GVG+PP +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
FDP SSS+S + C + C++L + C +C Y V YGDGS+T G+L ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ +V+G+A+GCGH N GLFVG+AGLLGLG G +SL Q+ + +YCL R + +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGA 290
Query: 314 G--VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
G VL A AV PL+RN + +FYYVGLTG VGG+ + + LF++ E G GG
Sbjct: 291 GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGG 350
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
+++D GTA+TRL +AY +LR +F G L + V+L DTCYD SG SVRVPTVS +
Sbjct: 351 VVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFY 410
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
F G L LPA+N L+ V A FC AFAP+SS +SI+GN+QQ+G +++ D AN VGF
Sbjct: 411 FDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469
Query: 492 PNKC 495
PN C
Sbjct: 470 PNTC 473
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 363 bits (933), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 202/365 (55%), Positives = 240/365 (65%), Gaps = 16/365 (4%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PVVSG +QGSGEYF++IGVGTP MVLDTGSD+ WLQC PC CY QS +FDP+ S
Sbjct: 130 PVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRS 189
Query: 207 SSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
SY + C+AP C+ LD C R CLYQVAYGDGS T GD TET++F V I
Sbjct: 190 RSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGARVARI 249
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA-----SGVL 316
ALGCGHDNEGLFV +AGLLGLG G LS QI S +YCLVDR S A S +
Sbjct: 250 ALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTV 309
Query: 317 EFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EAGDGG 371
F S G V A P+++N +++TFYYV L G SVGG V + S +D +G GG
Sbjct: 310 TFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGG 369
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSL 430
+IVD GT++TRL AY++LRD+F A L+ + G +LFDTCYD SG + V+VPTVS+
Sbjct: 370 VIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSM 429
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
HF G LP +NYLIPVDS GTFCFAFA T +SIIGN+QQQG RV FD RVGF
Sbjct: 430 HFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 489
Query: 491 TPNKC 495
P C
Sbjct: 490 VPKGC 494
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 363 bits (933), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 193/349 (55%), Positives = 245/349 (70%), Gaps = 9/349 (2%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
QGSGEYF+RIG+GTP R+ MVLDTGSD+ W+QC PC ECY Q+DPIF+P +S S+S +
Sbjct: 3 QGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVG 62
Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C + C LD + C CLY+V+YGDGS+TVG TET++FG + S++ +A+GCGHDN
Sbjct: 63 CDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTT-SIQNVAIGCGHDNV 121
Query: 274 GLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF--NSARGGDAVT 328
GLFVG+AGLLGLG G LS Q+ + +YCLVDRDS +SG LEF S G T
Sbjct: 122 GLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPIGSIFT 181
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GDGGIIVDCGTAITRLQTQ 386
PL+ N + TFYY+ + SVGG + +P F +DE G GGII+D GTA+TRLQT
Sbjct: 182 -PLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTS 240
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
AY++LRD+F+ +L G+++FDTCYD S L+SV +P V HF G LPAKN L
Sbjct: 241 AYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCL 300
Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IP+DS GTFCFAFAP S LSI+GN+QQQG RVSFD AN+ VGF ++C
Sbjct: 301 IPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 362 bits (928), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 217/474 (45%), Positives = 298/474 (62%), Gaps = 36/474 (7%)
Query: 34 ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPL------NSSSSFSLPLHSR 87
AT +L+V +++ E S P+ LE E++P+ +S S + L L R
Sbjct: 27 ATQLLNVKDTIKEAETAPSRLPQDLE--------LHENYPIFELDNNSSQSQWKLKLFHR 78
Query: 88 EILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTP 147
+ L D+ R+ RDS RV++L+ L ++ Q+ DF +
Sbjct: 79 DKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSG-----------SDEQV--TDFGSD 125
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
VVSG QGSGEYF RIGVG+PPR +V+D+GSDI W+QC+PC+ECYQQSDP+FDP S+
Sbjct: 126 VVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSA 185
Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
+Y+ + C + C LD + C RC Y+V+YGDGS+T G L ET++FG ++ IA+G
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV-LIRNIAIG 244
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGG 324
CGH N G+F+G+AGLLGLGGG +S Q+ + +YCLV R + ++G LEF RG
Sbjct: 245 CGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEF--GRGA 302
Query: 325 DAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
V A PLIRN + +FYYVGL+G VGG V IP +FE+ + G GG+++D GTA+T
Sbjct: 303 MPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVT 362
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
RL AY + RD+F+ NL + V++FDTCY+ +G SVRVPTVS +F G L LP
Sbjct: 363 RLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLP 422
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A+N+LIPVD GTFCFAFA ++S LSIIGN+QQ+G ++S D +N VGF P C
Sbjct: 423 ARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 361 bits (926), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 190/430 (44%), Positives = 266/430 (61%), Gaps = 32/430 (7%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
S L R+ + + R VL + RD+AR L ++L A
Sbjct: 59 SFALVRRDAVTGATYPSPRHAVLDLVSRDNARAEYLASRLSPAYQ--------------- 103
Query: 141 PEDF---STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
P DF + VVSG +GSGEYF R+G+G+PP + +V+D+GSD+ W+QC+PC ECY Q+
Sbjct: 104 PTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQA 163
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG 256
DP+FDP +S+++S + C + C++L S C + C Y+V+YGDGS+T G L ET++ G
Sbjct: 164 DPLFDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTLG 223
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ +V+G+A+GCGH N GLFVG+AGLLGLG G +SL Q+ + +YCL R S
Sbjct: 224 GT-AVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGS 282
Query: 314 G--------VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
G VL + A AV PL+RN + +FYYVG++G VG + + + LF++
Sbjct: 283 GAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLT 342
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
E G GG+++D GTA+TRL +AY +LRD+FV G L GV+L DTCYD SG SVRV
Sbjct: 343 EDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRV 402
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
PTVS +F L LPA+N L+ VD G +C AFAP+SS LSI+GN+QQ+G +++ D AN
Sbjct: 403 PTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGLSILGNIQQEGIQITVDSAN 461
Query: 486 NRVGFTPNKC 495
+GF P C
Sbjct: 462 GYIGFGPATC 471
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 360 bits (925), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 201/448 (44%), Positives = 287/448 (64%), Gaps = 15/448 (3%)
Query: 56 ETLEPFAEESETAAE----SFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSA 111
+ L+P +ET + F +S+S ++L L R+ + ++ + +R+ RD+
Sbjct: 31 DVLQPPLTVTETLPDFNNTHFSDDSNSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTD 90
Query: 112 RVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
RV+ ++ ++ + + ++++ DF + VVSG QGSGEYF RIGVG+PPR
Sbjct: 91 RVSAILRRISGKVV------VASSDSRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRD 144
Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
MV+D+GSD+ W+QC+PC CY+QSDP+FDP S SY+ + C + C ++ S C +
Sbjct: 145 QYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGG 204
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
C Y+V YGDGS+T G L ET++F + V+ +A+GCGH N G+F+G+AGLLG+GGG +S
Sbjct: 205 CRYEVMYGDGSYTKGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMS 263
Query: 292 LTKQIKATS---LAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTG 347
Q+ + YCLV R + ++G L F A A PL+RN + +FYYVGL G
Sbjct: 264 FVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKG 323
Query: 348 FSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
VGG + +P +F++ E GDGG+++D GTA+TRL T AY + RD F NL SG
Sbjct: 324 LGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASG 383
Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS 467
V++FDTCYD SG SVRVPTVS +F G L LPA+N+L+PVD +GT+CFAFA + + LS
Sbjct: 384 VSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLS 443
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGN+QQ+G +VSFD AN VGF PN C
Sbjct: 444 IIGNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 360 bits (925), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 197/428 (46%), Positives = 278/428 (64%), Gaps = 12/428 (2%)
Query: 72 FPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHE 131
F SSS ++L L R+ + ++ + +R+ RD+ RV+ ++ ++ +
Sbjct: 51 FSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKV------- 103
Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
+ ++++ DF + +VSG QGSGEYF RIGVG+PPR MV+D+GSD+ W+QC+PC
Sbjct: 104 IPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCK 163
Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTE 251
CY+QSDP+FDP S SY+ + C + C ++ S C + C Y+V YGDGS+T G L E
Sbjct: 164 LCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALE 223
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDR 308
T++F + V+ +A+GCGH N G+F+G+AGLLG+GGG +S Q+ + YCLV R
Sbjct: 224 TLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSR 282
Query: 309 DSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
+ ++G L F A A PL+RN + +FYYVGL G VGG + +P +F++ E
Sbjct: 283 GTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTET 342
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
GDGG+++D GTA+TRL T AY + RD F NL SGV++FDTCYD SG SVRVPT
Sbjct: 343 GDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPT 402
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
VS +F G L LPA+N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +VSFD AN
Sbjct: 403 VSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGF 462
Query: 488 VGFTPNKC 495
VGF PN C
Sbjct: 463 VGFGPNVC 470
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 360 bits (924), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 189/356 (53%), Positives = 249/356 (69%), Gaps = 9/356 (2%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
V SG + GSGEYF R+G+G+P + +V+DTGSD+ W+QC PC CY+Q+D +FDP+ SS
Sbjct: 3 VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62
Query: 208 SYSPLPCAAPQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+ L C+ PQCK LDV AC + NRCLYQV+YGDGSFTVGDL +++ S + G +
Sbjct: 63 SFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSV-SRGRTSPVV 121
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS---PASGVLEFNSAR 322
GCGHDNEGLFVG+AGLLGLG G LS Q+ + +YCLV RD+ +S +L +SA
Sbjct: 122 FGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSAL 181
Query: 323 GGDAVTA--PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTA 379
A A L++N K+DTFYY GL+G S+GG + IP + F++ + G GG+I+D GT+
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
+TRL T AY +RD+F L + +LFDTCYDFS L SV +PTVS HF G ++
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQ 301
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP NYL+PVD++GTFCFAF+ TS LSIIGN+QQQ RV+ DL ++RVGF P +C
Sbjct: 302 LPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 209/399 (52%), Positives = 263/399 (65%), Gaps = 24/399 (6%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
RL+RD+ RV L+ ++ + FS+ ++SG +QGSGEYF+RIG
Sbjct: 78 RLQRDAKRVEALLNQIH--------------ARRSAGSSFSSSIISGLAQGSGEYFTRIG 123
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
VGTP R MVLDTGSD+ WLQC PC +CY Q+D +FDP S +Y+ +PC AP C+ LD
Sbjct: 124 VGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDS 183
Query: 225 SAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
C + C YQV+YGDGSFT GD TET++F V +ALGCGHDNEGLF G+AGL
Sbjct: 184 PGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTF-RRNRVTRVALGCGHDNEGLFTGAAGL 242
Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKV 337
LGLG G LS Q +YCLVDR + A S V+ +SA A PLI+N K+
Sbjct: 243 LGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFGDSAVSRTAHFTPLIKNPKL 302
Query: 338 DTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
DTFYY+ L G SVGG V+ + SLF +D AG+GG+I+D GT++TRL AY +LRD+F
Sbjct: 303 DTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 362
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
A +LK +LFDTC+D SGL V+VPTV LHF G + LPA NYLIPVD++G+FC
Sbjct: 363 IGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHF-RGADVSLPATNYLIPVDNSGSFC 421
Query: 457 FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
FAFA T S LSIIGN+QQQG R+S+DL +RVGF P C
Sbjct: 422 FAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/448 (48%), Positives = 282/448 (62%), Gaps = 25/448 (5%)
Query: 64 ESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA 123
ES++ ++ S++S S+ L + L L RL+RDS RV ++ + LA
Sbjct: 48 ESKSFSDESVSESTTSLSVHLSHVDALSSFSDASPVDLFKLRLQRDSLRVKSITS---LA 104
Query: 124 IYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSD 181
+ R+ K P A FS V+SG SQGSGEYF R+GVGTP MVLDTGSD
Sbjct: 105 AVSTGRNATKRTPRSAG----GFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSD 160
Query: 182 INWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA-C---RANRCLYQVA 237
+ WLQC PC CY QSD IFDPK S +++ +PC + C+ LD S+ C R+ CLYQV+
Sbjct: 161 VVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVS 220
Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK 297
YGDGSFT GD TET++F + V + LGCGHDNEGLFVG+AGLLGLG G LS Q K
Sbjct: 221 YGDGSFTEGDFSTETLTF-HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTK 279
Query: 298 AT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGF 348
+ +YCLVDR S S ++ N A +V PL+ N K+DTFYY+ L G
Sbjct: 280 SRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGI 339
Query: 349 SVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
SVGG V + S F++D G+GG+I+D GT++TRL AY +LRD+F A LK
Sbjct: 340 SVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPS 399
Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS 467
+LFDTC+D SG+ +V+VPTV HFG G+ + LPA NYLIPV++ G FCFAFA T +LS
Sbjct: 400 YSLFDTCFDLSGMTTVKVPTVVFHFGGGE-VSLPASNYLIPVNTEGRFCFAFAGTMGSLS 458
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGN+QQQG RV++DL +RVGF C
Sbjct: 459 IIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 209/418 (50%), Positives = 254/418 (60%), Gaps = 39/418 (9%)
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
L+ RL+RD R + E A + + PVVSG +QGSGEYF
Sbjct: 84 LLKHRLQRDKRRAARI-------------SEAAGAGGGNGRKGVAAPVVSGLAQGSGEYF 130
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
++IGVGTP Q MVLDTGSD+ W+QC PC CY+QS P+FDP+ SSSY + C A C+
Sbjct: 131 TKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCR 190
Query: 221 SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
LD C R C+YQVAYGDGS T GD VTET++F V +ALGCGHDNEGLFV
Sbjct: 191 RLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVA 250
Query: 279 SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGV---------LEFNSARGG-- 324
+AGLLGLG G LS QI S +YCLVDR S +G + F + G
Sbjct: 251 AAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGAS 310
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDCGTA 379
A P++RN +++TFYYV L G SVGG V P + E D G GG+IVD GT+
Sbjct: 311 SASFTPMVRNPRMETFYYVQLVGISVGGARV---PGVAESDLRLDPSTGRGGVIVDSGTS 367
Query: 380 ITRLQTQAYNSLRDSF-VRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
+TRL +Y++LRD+F AG L+ + G +LFDTCYD G R V+VPTVS+HF G
Sbjct: 368 VTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAE 427
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NYLIPVDS GTFCFAFA T +SIIGN+QQQG RV FD RVGF P C
Sbjct: 428 AALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 209/418 (50%), Positives = 250/418 (59%), Gaps = 35/418 (8%)
Query: 101 LVLSRLERD---SARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
L+ RL+RD +AR++ N R A PVVSG +QGSG
Sbjct: 88 LLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVAA---------PVVSGLAQGSG 138
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EYF++IGVGTP MVLDTGSD+ WLQC PC CY QS P+FDP+ SSSY + CAAP
Sbjct: 139 EYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAP 198
Query: 218 QCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
C+ LD C R CLYQVAYGDGS T GD TET++F V +ALGCGHDNEGL
Sbjct: 199 LCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNEGL 258
Query: 276 FVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR---------DSPASGVLEFNSARG 323
FV +AGLLGLG G LS QI S +YCLVDR S + F
Sbjct: 259 FVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVTFGPPSA 318
Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDCGT 378
A P++RN +++TFYYV L G SVGG V P + E D G GG+IVD GT
Sbjct: 319 SAASFTPMVRNPRMETFYYVQLVGISVGGARV---PGVAESDLRLDPSTGRGGVIVDSGT 375
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
++TRL +Y++LRD+F A L+ + G +LFDTCYD G + V+VPTVS+HF G
Sbjct: 376 SVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTVSMHFAGGAE 435
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NYLIPVDS GTFCFAFA T +SIIGN+QQQG RV FD RVGF P C
Sbjct: 436 AALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 206/463 (44%), Positives = 280/463 (60%), Gaps = 43/463 (9%)
Query: 38 LDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREI-LHKTRHN 96
L+V +A+ +T+ L+P +++ + P + F H I L KT H
Sbjct: 32 LNVENAISETK---------LKPLKQQNHNTQQ--PQWKTKLF----HRDNINLKKTTH- 75
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
++ +SR+ RD RV L+ +L +++ + F + VVSG +GS
Sbjct: 76 --KTRFISRINRDIKRVTFLLNRL-------NKNTQEQQTTTATEASFGSDVVSGTEEGS 126
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEYF RIG+G+P MV+D+GSDI W+QC PC +CY Q+DPIF+P TS+S+ + C++
Sbjct: 127 GEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSS 186
Query: 217 PQCKSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
C LD ACR RC YQVAYGDGS+T G L ET++ G + ++ A+GCGH NEG+
Sbjct: 187 NVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT-VIQDTAIGCGHWNEGM 245
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
FVG+AGLLGLGGG +S Q+ A + YCLV R P A+ PLI
Sbjct: 246 FVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPVG------------AMWVPLI 293
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
N +FYYV L+G +VGG V I +F++ + G GG+++D GTAITRL T AYN+ R
Sbjct: 294 HNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNAFR 353
Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
D+F+ NL GV++FDTCYD +G +VRVPTVS +F G+ L PA+N+LIP D
Sbjct: 354 DAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPADDV 413
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
GTFCFAFAP+ S LSIIGN+QQ+G +VS D N VGF PN C
Sbjct: 414 GTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 188/356 (52%), Positives = 248/356 (69%), Gaps = 9/356 (2%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
V SG + GSGEYF R+G+G+P + +V+DTGSD+ W+QC PC CY+Q+D +FDP+ SS
Sbjct: 3 VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62
Query: 208 SYSPLPCAAPQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+ L C+ PQCK LDV AC + NRCLYQV+YGDGSFTVGDL +++ + G +
Sbjct: 63 SFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSF-LVSRGRTSPVV 121
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS---PASGVLEFNSAR 322
GCGHDNEGLFVG+AGLLGLG G LS Q+ + +YCLV RD+ +S +L +SA
Sbjct: 122 FGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSAL 181
Query: 323 GGDAVTA--PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTA 379
A A L++N K+DTFYY GL+G S+GG + IP + F++ + G GG+I+D GT+
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
+TRL T AY +RD+F L + +LFDTCYDFS L SV +PTVS HF G ++
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQ 301
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP NYL+PVD++GTFCFAF+ TS LSIIGN+QQQ RV+ DL ++RVGF P +C
Sbjct: 302 LPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 357 bits (917), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 212/407 (52%), Positives = 266/407 (65%), Gaps = 25/407 (6%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSR 162
RL+RDS RV +L + LA + R+ K P A FS V+SG SQGSGEYF R
Sbjct: 87 RLQRDSLRVESLTS---LAAVSAGRNVTKRPPRSA----GGFSGVVISGLSQGSGEYFMR 139
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+GVGTP MVLDTGSD+ WLQC PC CY QSDP+F+P S +++ +PC + C+ L
Sbjct: 140 LGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRL 199
Query: 223 DVSA-C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
D S+ C R+ CLYQV+YGDGSFTVGD TET++F + V +ALGCGHDNEGLFVG
Sbjct: 200 DDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF-HGARVDHVALGCGHDNEGLFVG 258
Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTA 329
+AGLLGLG G LS Q K +YCLVDR S S ++ N A AV
Sbjct: 259 AAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFT 318
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
PL+ N K+DTFYY+ L G SVGG V + S F++D G+GG+I+D GT++TRL AY
Sbjct: 319 PLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAY 378
Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
+LRD+F A LK +LFDTC+D SG+ +V+VPTV HF G+ + LPA NYLIP
Sbjct: 379 VALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGE-VSLPASNYLIP 437
Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
V++ G FCFAFA T +LSIIGN+QQQG RV++DL +RVGF C
Sbjct: 438 VNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 357 bits (917), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 199/431 (46%), Positives = 263/431 (61%), Gaps = 28/431 (6%)
Query: 84 LHSREILH--KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILP 141
+H +L K + + + L+L L+RD RV + +K QLA D
Sbjct: 61 IHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEAS---------S 111
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
D + PV SG GSGEYF R+GVGTP R MV+DTGSD+ WLQC+PC CY+Q+DPIF
Sbjct: 112 TDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIF 171
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFG 256
DP+ SSS+ +PC +P CK+L++ +C +R C YQVAYGDGSF+VGD ++ + G
Sbjct: 172 DPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG 231
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI--------KATSLAYCLVDR 308
+A GCG DNEGLF G+AGLLGLG G LS QI A S +YCLVDR
Sbjct: 232 TGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 291
Query: 309 DSP---ASGVLEFNSAR-GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
+P +S L F +A A +PL++N K+DTFYY + G SVGG + I ++
Sbjct: 292 SNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQL 351
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
++G GG+I+D GT++TR T Y ++RD+F NL +LFDTCY+FSG SV
Sbjct: 352 SQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNFSGKASVD 411
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
VP + LHF G L LP NYLIP+++AG+FC AFAPTS L IIGN+QQQ R+ FDL
Sbjct: 412 VPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQ 471
Query: 485 NNRVGFTPNKC 495
+ + F P +C
Sbjct: 472 KSHLAFAPQQC 482
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 357 bits (916), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 188/422 (44%), Positives = 259/422 (61%), Gaps = 28/422 (6%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
SL L R+ + + R V+ + RD+ARV L L + + L
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
PED + VV G GSGEYF R+GVG+PP +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
FDP SSS+S + C + C++L + C +C Y V YGDGS+T G+L ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ +V+G+A+GCGH N GLFVG+AGLLGLG G +SL Q+ + +YCL R + +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGA 290
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
G L T + R ++ +FYYVGLTG VGG+ + + SLF++ E G GG++
Sbjct: 291 GSLVLGR-------TEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 343
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GTA+TRL +AY +LR +F G L + V+L DTCYD SG SVRVPTVS +F
Sbjct: 344 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 403
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G L LPA+N L+ V A FC AFAP+SS +SI+GN+QQ+G +++ D AN VGF PN
Sbjct: 404 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 462
Query: 494 KC 495
C
Sbjct: 463 TC 464
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 357 bits (915), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 210/407 (51%), Positives = 266/407 (65%), Gaps = 25/407 (6%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSR 162
RL+RDS RV ++ + LA + R+ K P A FS V+SG SQGSGEYF R
Sbjct: 86 RLQRDSLRVKSITS---LAAVSTGRNATKRTPRTAG----GFSGAVISGLSQGSGEYFMR 138
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+GVGTP MVLDTGSD+ WLQC PC CY Q+D IFDPK S +++ +PC + C+ L
Sbjct: 139 LGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRL 198
Query: 223 DVSA-C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
D S+ C R+ CLYQV+YGDGSFT GD TET++F + V + LGCGHDNEGLFVG
Sbjct: 199 DDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-HGARVDHVPLGCGHDNEGLFVG 257
Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTA 329
+AGLLGLG G LS Q K +YCLVDR S S ++ N+A +V
Sbjct: 258 AAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFT 317
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
PL+ N K+DTFYY+ L G SVGG V + S F++D G+GG+I+D GT++TRL AY
Sbjct: 318 PLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAY 377
Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
+LRD+F A LK +LFDTC+D SG+ +V+VPTV HFG G+ + LPA NYLIP
Sbjct: 378 VALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGE-VSLPASNYLIP 436
Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
V++ G FCFAFA T +LSIIGN+QQQG RV++DL +RVGF C
Sbjct: 437 VNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 355 bits (910), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 196/359 (54%), Positives = 238/359 (66%), Gaps = 16/359 (4%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
+QGSGEYF++IGVGTP MVLDTGSD+ WLQC PC CY+QS +FDP+ S SY+ +
Sbjct: 134 AQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAV 193
Query: 213 PCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
CAAP C+ LD C R + CLYQVAYGDGS T GD TET++F V +ALGCGH
Sbjct: 194 GCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGH 253
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA-----SGVLEFNSAR 322
DNEGLFV +AGLLGLG G LS QI S +YCLVDR S A S + F S
Sbjct: 254 DNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSGA 313
Query: 323 GGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EAGDGGIIVDCG 377
G V + P+++N +++TFYYV L G SVGG V + S +D +G GG+IVD G
Sbjct: 314 VGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSG 373
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
T++TRL AY++LRD+F A L+ + G +LFDTCYD SG + V+VPTVS+HF G
Sbjct: 374 TSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGA 433
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NYLIPVDS GTFCFAFA T +SIIGN+QQQG RV FD RV FTP C
Sbjct: 434 EAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 200/369 (54%), Positives = 242/369 (65%), Gaps = 17/369 (4%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F PVVSG +QGSGEYF++IGVGTP MVLDTGSD+ WLQC PC CY QS +FDP
Sbjct: 132 FVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191
Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
+ S SY + CAAP C+ LD C R CLYQVAYGDGS T GD TET++F + V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGV--- 315
+ALGCGHDNEGLFV +AGLLGLG G LS QI S +YCLVDR S ++
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311
Query: 316 ---LEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA- 367
+ F S G + A P+++N +++TFYYV L G SVGG V + S +D +
Sbjct: 312 SSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST 371
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
G GG+IVD GT++TRL AY +LRD+F A L+ + G +LFDTCYD SGL+ V+VP
Sbjct: 372 GRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVP 431
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
TVS+HF G LP +NYLIPVDS GTFCFAFA T +SIIGN+QQQG RV FD
Sbjct: 432 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQ 491
Query: 487 RVGFTPNKC 495
R+GF P C
Sbjct: 492 RLGFVPKGC 500
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 197/376 (52%), Positives = 249/376 (66%), Gaps = 21/376 (5%)
Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
+ ++ +DF PV+SG S GSGEYF R+ VGTPPR +V+DTGSDI WLQC PC CY
Sbjct: 14 QTKVPSQDFQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYH 73
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
Q D +FDP SS+YS L C + QC +LDV C N+CLYQV YGDGSF+ G+ T+ VS
Sbjct: 74 QCDEVFDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSL 133
Query: 256 GNSGSVKG------IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLV 306
NS S G I LGCGHDNEG FVG+AGLLGLG G LS QI + + +YCL
Sbjct: 134 -NSTSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLT 192
Query: 307 DRDSPASGVLEFNSARGGDAVT-------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
RD+ ++ E +S GDA P N +V TFYY+ +TG SVGG + IP
Sbjct: 193 GRDTDST---ERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPT 249
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSG 419
S F++D G+GG+I+D GT++TRLQ AY SLR++F +L T+ +LFDTCY+ S
Sbjct: 250 SAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSD 309
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRV 479
L SV VPTV+LHF G L LPA NYL+PVD++ TFC AFA T+ SIIGN+QQQG RV
Sbjct: 310 LSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRV 368
Query: 480 SFDLANNRVGFTPNKC 495
+D +N+VGF P++C
Sbjct: 369 IYDNLHNQVGFVPSQC 384
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 179/334 (53%), Positives = 236/334 (70%), Gaps = 3/334 (0%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
VG P + VLDTGSD+ WLQC PC CY+Q PIFDP+ SSSY+P+ C + QC+
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62
Query: 222 LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAG 281
LD + C N C+Y+V YGDGSFT+G+L TET++F +S S+ I++GCGHDNEGLFVG+ G
Sbjct: 63 LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122
Query: 282 LLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
L+GLGGG +S++ Q+KA+S +YCLVD DSP+ L+FN+ D++ +PL++N + +F
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
YV + G SVGG+ + I S FE+DE+G GGIIVD GT IT+L + Y LR++F+ L N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
L P ++ FDTCYD S +V VPT++ +L LPAKN LI VDSAGTFC AF
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFVS 302
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ LSIIGN QQQG RVS+DL N+ VGF+ NKC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 195/415 (46%), Positives = 256/415 (61%), Gaps = 26/415 (6%)
Query: 98 YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
+ L+L L+RD RV + +K +LA D D + PV SG GSG
Sbjct: 2 HEQLLLETLQRDERRVRWIESKAKLAGKKKDEAS---------STDLNGPVTSGLLYGSG 52
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EYF R+G+GTP R MV+DTGSD+ WLQC+PC CY+Q+DPIFDP+ SSS+ +PC +P
Sbjct: 53 EYFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSP 112
Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
CK+L+V +C +R C YQVAYGDGSF+VGD ++ + G +A GCG DN
Sbjct: 113 LCKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDN 172
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQI--------KATSLAYCLVDRDSP---ASGVLEFN-S 320
EGLF G+AGLLGLG G LS QI A S +YCLVDR +P +S L F +
Sbjct: 173 EGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVA 232
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
A A +PL++N K+DTFYY + G SVGG + I ++ ++G GG+I+D GT++
Sbjct: 233 AIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSV 292
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TR T Y ++RD+F NL +LFDTCY+FSG SV VP + LHF G L L
Sbjct: 293 TRFPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQL 352
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P NYLIP+++AG+FC AFAPTS L IIGN+QQQ R+ FDL + + F P +C
Sbjct: 353 PPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 348 bits (893), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 186/422 (44%), Positives = 254/422 (60%), Gaps = 41/422 (9%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
SL L R+ + + R V+ + RD+ARV L L + + L
Sbjct: 64 SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
PED + VV G GSGEYF R+GVG+PP +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
FDP SSS+S + C + C++L + C +C Y V YGDGS+T G+L ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
+ +V+G+A+GCGH N GLFVG+AGLLGLG G +SL Q+ + +YCL R + +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGA 290
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
G L +FYYVGLTG VGG+ + + SLF++ E G GG++
Sbjct: 291 GSL--------------------ASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 330
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GTA+TRL +AY +LR +F G L + V+L DTCYD SG SVRVPTVS +F
Sbjct: 331 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 390
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G L LPA+N L+ V A FC AFAP+SS +SI+GN+QQ+G +++ D AN VGF PN
Sbjct: 391 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 449
Query: 494 KC 495
C
Sbjct: 450 TC 451
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 347 bits (890), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 199/426 (46%), Positives = 273/426 (64%), Gaps = 27/426 (6%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
SL L R+ + + R +L RD ARV L + L P +
Sbjct: 70 SLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYL------------QRRLSPTT---M 114
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
+ + VVSG S+GSGEYF R+GVG+PP + +V+D+GSD+ W+QCRPC ECYQQ+DP+
Sbjct: 115 TTEVGSEVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPL 174
Query: 201 FDPKTSSSYSPLPCAAPQCKSL--DVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
FDP S+S++ +PC + C++L S C + C YQV+YGDGS+T G L ET++FG+
Sbjct: 175 FDPAASASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGD 234
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPA-S 313
S V+G+A+GCGH N GLFVG+AGLLGLG G +SL Q+ + +YCL R + A +
Sbjct: 235 STPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGA 294
Query: 314 GVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
G L F + A AV PL+RN + +FYYVGLTG VGG+ + + LF++ E G GG
Sbjct: 295 GSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGG 354
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
+++D GTA+TRL AY +LRD+F + G+L GV+L DTCYD SG SVRVPTV+L
Sbjct: 355 VVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVAL 414
Query: 431 HFGA-GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
+FG G AL LPA+N L+ + G +C AFA ++S LSI+GN+QQQG +++ D AN VG
Sbjct: 415 YFGRDGAALTLPARNLLVEM-GGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVG 473
Query: 490 FTPNKC 495
F P+ C
Sbjct: 474 FGPSTC 479
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 188/369 (50%), Positives = 241/369 (65%), Gaps = 17/369 (4%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F+ P++SG QGSGEYF+++GVGTP MVLDTGSD+ WLQC PC CY QS +FDP
Sbjct: 113 FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDP 172
Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
+ S SY+ + C AP C+ LD + C R N CLYQVAYGDGS T GD +ET++F V
Sbjct: 173 RRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV 232
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA------ 312
+ +A+GCGHDNEGLF+ ++GLLGLG G LS QI + S +YCLVDR S
Sbjct: 233 QRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTR 292
Query: 313 SGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EA 367
S + F + A A P+ RN ++ TFYYV L GFSVGG V+ + S ++
Sbjct: 293 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 352
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
G GG+I+D GT++TRL Y ++RD+F A L+ + G +LFDTCY+ SG R V+VP
Sbjct: 353 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVP 412
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
TVS+H G ++ LP +NYLIPVD++GTFCFA A T +SIIGN+QQQG RV FD
Sbjct: 413 TVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQ 472
Query: 487 RVGFTPNKC 495
RVGF P C
Sbjct: 473 RVGFVPKSC 481
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 188/369 (50%), Positives = 241/369 (65%), Gaps = 17/369 (4%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F+ P++SG QGSGEYF+++GVGTP MVLDTGSD+ WLQC PC CY QS +FDP
Sbjct: 107 FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDP 166
Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
+ S SY+ + C AP C+ LD + C R N CLYQVAYGDGS T GD +ET++F V
Sbjct: 167 RRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV 226
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA------ 312
+ +A+GCGHDNEGLF+ ++GLLGLG G LS QI + S +YCLVDR S
Sbjct: 227 QRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTR 286
Query: 313 SGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EA 367
S + F + A A P+ RN ++ TFYYV L GFSVGG V+ + S ++
Sbjct: 287 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 346
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
G GG+I+D GT++TRL Y ++RD+F A L+ + G +LFDTCY+ SG R V+VP
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVP 406
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
TVS+H G ++ LP +NYLIPVD++GTFCFA A T +SIIGN+QQQG RV FD
Sbjct: 407 TVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQ 466
Query: 487 RVGFTPNKC 495
RVGF P C
Sbjct: 467 RVGFVPKSC 475
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 188/369 (50%), Positives = 241/369 (65%), Gaps = 17/369 (4%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F+ P++SG QGSGEYF+++GVGTP MVLDTGSD+ WLQC PC CY QS +FDP
Sbjct: 107 FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDP 166
Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
+ S SY+ + C AP C+ LD + C R N CLYQVAYGDGS T GD +ET++F V
Sbjct: 167 RRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV 226
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA------ 312
+ +A+GCGHDNEGLF+ ++GLLGLG G LS QI + S +YCLVDR S
Sbjct: 227 QRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTR 286
Query: 313 SGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EA 367
S + F + A A P+ RN ++ TFYYV L GFSVGG V+ + S ++
Sbjct: 287 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 346
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
G GG+I+D GT++TRL Y ++RD+F A L+ + G +LFDTCY+ SG R V+VP
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVP 406
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
TVS+H G ++ LP +NYLIPVD++GTFCFA A T +SIIGN+QQQG RV FD
Sbjct: 407 TVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQ 466
Query: 487 RVGFTPNKC 495
RVGF P C
Sbjct: 467 RVGFVPKSC 475
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 338 bits (868), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 193/422 (45%), Positives = 258/422 (61%), Gaps = 27/422 (6%)
Query: 99 RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELK---PAEAQILP-------------- 141
+ L+L+RL +D R + + LA + +L+ P +++ L
Sbjct: 1 KQLLLARLRKDELRSKAIAATIALATNGWRKSDLRHPLPGQSESLAVAGLASGRGGRGHG 60
Query: 142 ---EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
F++P++SG + GSG+YF+RIGVGTP R MV DTGSD++WLQC PC +CY+Q D
Sbjct: 61 GARRGFASPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQD 120
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
PIF+P SSS+ PL CA+ C L + C R N C+YQV+YGDGSFTVGD TET+SFG
Sbjct: 121 PIFNPSLSSSFKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGE 180
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG 314
+V+ +A+GCG +N+GLF G+AGLLGLG G LS Q A+ +YCL R+S +
Sbjct: 181 H-AVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA 239
Query: 315 VLEFN-SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
L F SA A L+ N+++DT+YYVGL V G V IPP F M G GG+I
Sbjct: 240 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 299
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
VD GTAI+RL T AY +LRD+F L G++LFDTCYD S +++ +P V L F
Sbjct: 300 VDSGTAISRLTTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFD 358
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G ++ LPA L+ VD GT+C AFAP A SIIGNVQQQ R+S D ++G P+
Sbjct: 359 GGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPD 418
Query: 494 KC 495
+C
Sbjct: 419 QC 420
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 181/355 (50%), Positives = 234/355 (65%), Gaps = 7/355 (1%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
+P++SG + GSG+YF+RIGVGTP R MV DTGSD++WLQC PC +CY+Q DPIF+P
Sbjct: 1 SPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSL 60
Query: 206 SSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
SSS+ PL CA+ C L + C R N+C+YQV+YGDGSFTVGD TET+SFG +V+ +
Sbjct: 61 SSSFKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEH-AVRSV 119
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFN-S 320
A+GCG +N+GLF G+AGLLGLG G LS Q A+ +YCL R+S + L F S
Sbjct: 120 AMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPS 179
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
A A L+ N+++DT+YYVGL V G V IPP F M G GG+IVD GTAI
Sbjct: 180 AVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAI 239
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
+RL T AY +LRD+F L G++LFDTCYD S +++ +P V L F G ++ L
Sbjct: 240 SRLTTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPL 298
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PA L+ VD GT+C AFAP A SIIGNVQQQ R+S D ++G P++C
Sbjct: 299 PADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 174/368 (47%), Positives = 235/368 (63%), Gaps = 17/368 (4%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F P+ SG + G+GEYF+ +GVGTP R +V+DTGSDI WLQC PCT CY+Q D +F+P
Sbjct: 1 FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETV----SFGNSG 259
+SSS+ L C++ C +LDV C +N+CLYQ YGDGSFT+G+LVT+ V +FG
Sbjct: 61 SSSSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQ 120
Query: 260 SV-KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS-- 313
V I LGCGHDNEG F +AG+LGLG G LS + A++ +YCL DR+S +
Sbjct: 121 VVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHK 180
Query: 314 GVLEFNSA-----RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEA 367
L F A G P +RN +V T+YYV +TG SVGG + IP S+F++D
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
G+GG I D GT ITRL+ +AY ++RD+F +L + +FDTCYDF+G+ S+ VPT
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPT 300
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
V+ HF + LP NY++PV + FCFAFA S S+IGNVQQQ RV +D + +
Sbjct: 301 VTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFA-ASMGPSVIGNVQQQSFRVIYDNVHKQ 359
Query: 488 VGFTPNKC 495
+G P++C
Sbjct: 360 IGLLPDQC 367
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 188/436 (43%), Positives = 259/436 (59%), Gaps = 38/436 (8%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
SL L R+ + + + R VL + RD+AR L T+L A
Sbjct: 105 SLALVRRDEVTGSTYPSLRHAVLDLVARDNARAEYLATRLSPAYQ--------------- 149
Query: 141 PEDFS---TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
P FS + VVSG +GSGEY R+ VG+PP + +V+D+GSD+ W+QC+PC ECY Q+
Sbjct: 150 PPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQA 209
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVS 254
DP+FDP TS+++S + C + C+ L SAC C Y+V+Y DGS+T G L ET++
Sbjct: 210 DPLFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLT 269
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP 311
G + +V+G+ +GCGH N GLFVG+AGL+GLG G +SL Q+ + +YCL R
Sbjct: 270 LGGT-AVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGY 328
Query: 312 ASG---------VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
SG VL + A AV PL+RN + +FYYVGL+G VG + + + LF
Sbjct: 329 GSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLF 388
Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGV--ALFDTCYDFSG 419
++ E G G +++D GT +TRL +AY +LRD+FV LAG + GV ++ DTCYD SG
Sbjct: 389 QLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSG 448
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRV 479
SVRVPTVS F L L A+N L+ VD G +C AFAP+SS LSI+GN QQ G ++
Sbjct: 449 YASVRVPTVSFCFDGDARLILAARNVLLEVD-MGIYCLAFAPSSSGLSIMGNTQQAGIQI 507
Query: 480 SFDLANNRVGFTPNKC 495
+ D AN +GF P C
Sbjct: 508 TVDSANGYIGFGPANC 523
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 183/399 (45%), Positives = 250/399 (62%), Gaps = 18/399 (4%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
+ERD AR+ + ++Q + + R AQ V SG S GSGEYF+R+G+
Sbjct: 1 MERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQ---------VSSGLSLGSGEYFARMGI 51
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
G+P R + + LDTGSD+ W+QC PC+ CY Q DPI+DP SSSY + C + C++LD S
Sbjct: 52 GSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYS 111
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVKGIALGCGHDNEGLFVGSAGLL 283
AC+ C Y+V YGD S + GDL E+ G +S +++ IA GCGH N GLF G AGLL
Sbjct: 112 ACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLL 171
Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDR----DSPASGVLEFNSARGGDAVTAPLIRNKK 336
G+GGG LS QI A+ + +YCLVDR S +S ++ +A A PL++N +
Sbjct: 172 GMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPR 231
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+DTFYY LTG SVGG A+ IPP+ F + G GG I+D GT++TR+ AY LRD++
Sbjct: 232 IDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYR 291
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
+ NL P GV L DTC++F GL +V++P++ LHF + LP N LIPVD +GTFC
Sbjct: 292 AASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFC 351
Query: 457 FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAP+S +S+IGNVQQQ R+ FDL + + P +C
Sbjct: 352 LAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 182/345 (52%), Positives = 219/345 (63%), Gaps = 26/345 (7%)
Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC--RANR 231
MVLDTGSD+ W+QC PC CY+QS P+FDP+ SSSY + C A C+ LD C R
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
C+YQVAYGDGS T GD VTET++F V +ALGCGHDNEGLFV +AGLLGLG G LS
Sbjct: 61 CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLS 120
Query: 292 LTKQIK---ATSLAYCLVDRDSPASGV---------LEFNSARGG--DAVTAPLIRNKKV 337
QI S +YCLVDR S +G + F + G A P++RN ++
Sbjct: 121 FPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRM 180
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDCGTAITRLQTQAYNSLR 392
+TFYYV L G SVGG V P + E D G GG+IVD GT++TRL +Y++LR
Sbjct: 181 ETFYYVQLVGISVGGARV---PGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALR 237
Query: 393 DSF-VRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
D+F AG L+ + G +LFDTCYD G R V+VPTVS+HF G LP +NYLIPVD
Sbjct: 238 DAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVD 297
Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S GTFCFAFA T +SIIGN+QQQG RV FD RVGF P C
Sbjct: 298 SRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 176/468 (37%), Positives = 249/468 (53%), Gaps = 49/468 (10%)
Query: 23 TSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSL 82
TS ++ + T++ +V++A +T ILSF+P F + E +S
Sbjct: 98 TSKANSSSEYSITSIFNVTAANHKTSQILSFKP-----FHNQEEFPQTFSSSSSFKLKLY 152
Query: 83 PLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE 142
P S H +H +Y SL
Sbjct: 153 PAASLYNTHH-QHKNYYSL----------------------------------------- 170
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
D + + G + G+ + +IGVG PP++F M+ D +D WLQC+PC +CY Q D IFD
Sbjct: 171 DLNASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFD 230
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
P SSSY+ L C C L S+C + C Y + Y DG+ T G L+ ETVSF +SG V
Sbjct: 231 PSQSSSYTLLSCETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWV 290
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
++LGC + N+G FVGS G GLG G LS +I A+S++YCLV+ +D +S LEFNS
Sbjct: 291 DRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESKDGYSSSTLEFNS 350
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+V A L++N K + YYVGL G VGG+ + +P S F +D G+GG+IV + I
Sbjct: 351 PPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLI 410
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
T L+ YN +RD+FV +L+ FDTCY+ S +V +P + GK+ L
Sbjct: 411 TMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVNDGKSWLL 470
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
P ++YL VD GTFCFAFAP+ + SI+G +QQ GTRV+FDL N+ V
Sbjct: 471 PKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 172/357 (48%), Positives = 233/357 (65%), Gaps = 9/357 (2%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
+ SG S GSGEYF+R+G+G P R + + LDTGSD+ W+QC PC+ CY Q DPI+DP SS
Sbjct: 1 ISSGLSLGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSS 60
Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVKGIA 265
SY + C + C++LD SAC+ C Y+V YGD S + GDL E+ G +S +++ IA
Sbjct: 61 SYRRVYCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIA 120
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR----DSPASGVLEF 318
GCGH N GLF G AGLLG+GGG LS QI A+ + +YCLVDR S +S ++
Sbjct: 121 FGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFG 180
Query: 319 NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
+A A PL++N +++TFYY LTG SVGG + IPP+ F + G GG I+D GT
Sbjct: 181 RTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGT 240
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
++TR+ AY LRD++ + NL P GV L DTC++F GL +V++P++ LHF G +
Sbjct: 241 SVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDM 300
Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP N LIPVD +GTFC AFAP+S +S+IGNVQQQ R+ FDL + + P +C
Sbjct: 301 VLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 183/451 (40%), Positives = 257/451 (56%), Gaps = 33/451 (7%)
Query: 65 SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
+ AA S P +++ SL L R+ + T+H R VL+ RD+ARV L +L +
Sbjct: 42 TAAAAPSVPSSTTRRPSLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYLQRRLSPSP 101
Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINW 184
++ + S GSGEY R+G+G+PP + +V DTGSD+ W
Sbjct: 102 SPSSTSSVESGGTIV-------------SHGSGEYLVRVGIGSPPLEQHLVADTGSDVIW 148
Query: 185 LQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS-----LDVSACRANRCLYQVAYG 239
+QC PC++CY Q DP+FDP S+S+SP+PC + C++ C Y+V+YG
Sbjct: 149 VQCSPCSDCYAQGDPLFDPANSASFSPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYG 208
Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI--- 296
D S+T G L ET++ V+G+A+GCGH+N GLF +AGLLGLG G +SL Q+
Sbjct: 209 DKSYTNGVLALETLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGA 268
Query: 297 KATSLAYCLVDRDSPASG-----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
+ +YCL S VL A AV PL+RN +FYYVG+ G V
Sbjct: 269 AGGAFSYCLAGYYSGEGSGSGSLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVA 328
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS-GVAL 410
G+ +Q+ LF++ + G GG+++D GTA+TRL +AY +LR +F P + GV+L
Sbjct: 329 GERLQLQDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSL 388
Query: 411 FDTCYDFSGLRSVRVPTVSLHFGA------GKALDLPAKNYLIPVDSAGTFCFAFAPTSS 464
FDTCYD SG SVRVPTV+L+FG +L LPA+N L+PVD GT+C AFA +S
Sbjct: 389 FDTCYDLSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVAS 448
Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
SI+GN+QQQG ++ D A+ VGF P C
Sbjct: 449 GPSILGNIQQQGIEITVDSASGYVGFGPATC 479
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 300 bits (769), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 193/483 (39%), Positives = 260/483 (53%), Gaps = 37/483 (7%)
Query: 37 VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR-- 94
V+ +++ ++ H + P + ++ + ++ ++ S LH R +LH+ R
Sbjct: 21 VVGLATPVEYEYHSYAVTPLSPHAYSAPAAADDDAQAQEDVAASSSTLHIR-LLHRDRFA 79
Query: 95 -HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
+ L+ RL+RD R +I+K P F PVVS A
Sbjct: 80 ANATPAQLLARRLQRDVLRAAWIISK------AAANGTPPPVAGLSSARGFVAPVVSRAP 133
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
SGEY ++I VGTP + + LDT SD+ WLQC+PC CY QS P+FDP+ S+SY +
Sbjct: 134 T-SGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMS 192
Query: 214 CAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
A C++L S + C+Y V YGDGS TVGD + ET++F + I++GCGH
Sbjct: 193 FNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGGVRLPRISIGCGH 252
Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-SLAYCLVDRDSPASGVLEFNSARGGDAVT 328
DN+GLF +AG+LGLG G++S QI + +YCLVD S G L G AV
Sbjct: 253 DNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLS-GPGSLSSTLTFGAGAVD 311
Query: 329 -------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDC 376
P + N + TFYYV LTG SVGG V P + E D G GG+IVD
Sbjct: 312 TSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRV---PGVTERDLQLDPYTGRGGVIVDS 368
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPTVSLHFG 433
GTA+TRL AY + RD+F +A +L S FDTCY G +VPTVS+HF
Sbjct: 369 GTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKKVPTVSMHFA 428
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTP 492
+ L KNYLIPVDS GT CFAFA T ++SIIGN+QQQG R+ +D+ RVGF P
Sbjct: 429 GSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDI-GGRVGFAP 487
Query: 493 NKC 495
N C
Sbjct: 488 NSC 490
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 161/367 (43%), Positives = 211/367 (57%), Gaps = 13/367 (3%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
+ +PV+SG SGEYF+ +GVGTPP +V+DTGSD+ WLQC+PC CY+Q P++
Sbjct: 82 DHLHSPVISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLY 141
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
DP+ SS+Y+ PC+ PQC++ C Y++ YGD S T G+L T+ + F N SV
Sbjct: 142 DPRGSSTYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDTSV 201
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG---- 314
+ LGCGHDNEGLF +AGLLG+ G S Q+ AYCL DR S
Sbjct: 202 GNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYL 261
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GDGGI 372
V + +V PL N + + YYV + GFSVGG+ V + +D A G GG+
Sbjct: 262 VFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGV 321
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLA---GNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
+VD GT+ITR AY +LRD+F A G K G+++FD CYD G+ P V
Sbjct: 322 VVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVV 381
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANNRV 488
LHF G + LP +NYL+P +S CFA A LS+IGNV QQ RV FD+ N RV
Sbjct: 382 LHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERV 441
Query: 489 GFTPNKC 495
GF PN C
Sbjct: 442 GFEPNGC 448
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 197/489 (40%), Positives = 265/489 (54%), Gaps = 42/489 (8%)
Query: 35 TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
T V+ +++ ++ H P + P++ + A ++F ++SSS+ + L R+
Sbjct: 20 TAVVGLATPVEYEYHSYVVTPLSPHPYSAPA-AADDNFSVSSSSALHIHLLHRDSF--AV 76
Query: 95 HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ 154
+ L+ RL+RD R +I+K P PVVS A
Sbjct: 77 NATAAELLARRLQRDELRAAWIISKAAA------NGTPPPVVGLSTGRGLVAPVVSRAPT 130
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
SGEY ++I VGTP Q + LDT SD+ WLQC+PC CY QS P+FDP+ S+SY +
Sbjct: 131 -SGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNY 189
Query: 215 AAPQCKSLDVSA---CRANRCLYQVAYGDG----SFTVGDLVTETVSFGNSGSVKGIALG 267
AP C++L S + C+Y V YGDG S +VGDLV ET++F +++G
Sbjct: 190 DAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIG 249
Query: 268 CGHDNEGLF-VGSAGLLGLGGGMLSLTKQIK----ATSLAYCLVD----RDSPASGVLEF 318
CGHDN+GLF +AG+LGLG G +S+ QI S +YCLVD SP+S L F
Sbjct: 250 CGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSS-TLTF 308
Query: 319 NSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDG 370
+ + A P + N+ + TFYYV L G SVGG V P + E D G G
Sbjct: 309 GAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERDLQLDPYTGRG 365
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG---VALFDTCYDFSGLRSVRVPT 427
G+I+D GT +TRL AY + RD+F A +L S LFDTCY G V+VP
Sbjct: 366 GVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPA 425
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
VS+HF G + L KNYLIPVDS GT CFAFA T ++S+IGN+ QQG RV +DLA
Sbjct: 426 VSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQ 485
Query: 487 RVGFTPNKC 495
RVGF PN C
Sbjct: 486 RVGFAPNNC 494
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 185/375 (49%), Positives = 214/375 (57%), Gaps = 28/375 (7%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F PVVSG +QGSGEYF++IGVGTP MVLDTGSD+ WLQC PC CY QS +FDP
Sbjct: 132 FVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191
Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
+ S SY + CAAP C+ LD C R CLYQVAYGDGS T GD TET++F + V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGV--- 315
+ALGCGHDNEGLFV +AGLLGLG G LS QI S +YCLVDR S ++
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311
Query: 316 ---LEFNS-ARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI------PPSLFE 363
+ F S ARG G V P + G +A PP
Sbjct: 312 SSTVTFGSGARGALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPD--- 368
Query: 364 MDEAGDGGIIVDCGT---AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL 420
G GG+IVD G A R + R L P G +LFDTCYD SGL
Sbjct: 369 -PSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSP-GGFSLFDTCYDLSGL 426
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
+ V+VPTVS+HF G LP +NYLIPVDS GTFCFAFA T +SIIGN+QQQG RV
Sbjct: 427 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVV 486
Query: 481 FDLANNRVGFTPNKC 495
FD R+GF P C
Sbjct: 487 FDGDGQRLGFVPKGC 501
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 180/404 (44%), Positives = 227/404 (56%), Gaps = 48/404 (11%)
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
L+ RL RD+AR + ++ NV R FS PVVSG +QGSGEYF
Sbjct: 98 LLAHRLARDAARAEAI----SVSARNVTRAG----------GGFSAPVVSGLAQGSGEYF 143
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+ +GVGTPP +VLDTGSD+ WLQC PC +CY QS +FDP+ S SY+ + C AP C+
Sbjct: 144 ASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPPCR 203
Query: 221 SLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
LD CLYQVAYGDGS T GDL TET+ F V +A+GCGHDNEGL
Sbjct: 204 GLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARGARVPRVAVGCGHDNEGL 263
Query: 276 FVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
FV +AGLLGLG G LSL Q +YC +G D +I
Sbjct: 264 FVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCF----------------QGSDLDHRTII 307
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
R + G VG +++++ PS G GG+I+D GT++TRL Y ++R
Sbjct: 308 RTVHQ---HVGGARVRGVGERSLRLDPS------TGRGGVILDSGTSVTRLARPVYVAVR 358
Query: 393 DSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
++F AG L+ G +LFDTCYD G R V+VPTVS+H G + LP +NYLIPVD+
Sbjct: 359 EAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDT 418
Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
GTFC A A T +SI+GN+QQQG RV FD RV P C
Sbjct: 419 RGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 288 bits (737), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 166/416 (39%), Positives = 225/416 (54%), Gaps = 33/416 (7%)
Query: 90 LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV 149
L + + +V +RD+ R+NT+ +K + L+P
Sbjct: 85 LRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSNLPLQP--------------- 129
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
G+ G+G Y G GTP + +++DTGSD+ W+QC+PC++CY Q DPIF+P+ SSSY
Sbjct: 130 -GSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSY 188
Query: 210 SPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
L C + C L ++ CR C+Y++ YGDGS + GD ET++ G S S A GC
Sbjct: 189 KHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLG-SDSFPSFAFGC 247
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGD 325
GH N GLF GSAGLLGLG LS Q K+ +YCL D S S F+ +G
Sbjct: 248 GHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTS-TGSFSVGQGSI 306
Query: 326 AVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
TA PL+ N +FY+VGL G SVGG+ + IPP++ G GG IVD GT ITR
Sbjct: 307 PATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVL-----GRGGTIVDSGTVITR 361
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
L QAY++L+ SF NL ++ DTCYD S VR+PT++ HF + + A
Sbjct: 362 LVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADVAVSA 421
Query: 443 KNYLIPVDSAGT-FCFAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L + S G+ C AFA S ++S IIGN QQQ RV+FD R+GF P C
Sbjct: 422 VGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 173/366 (47%), Positives = 225/366 (61%), Gaps = 30/366 (8%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR---PCTECYQQSDPI 200
F+ P++SG QG+GEYF+++GVGTP MVLDTGSD+ W R P +Q
Sbjct: 107 FAAPLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGS-- 164
Query: 201 FDPKTSSSYSPLP---CAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF 255
T ++ +P P C AP C+ LD + C R N CLYQVAYGDGS T GD +ET++F
Sbjct: 165 ---STGAAPAPTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF 221
Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA 312
V+ +A+GCGHDNEGLF+ ++GLLGLG G LS QI + S +YCLVDR S
Sbjct: 222 ARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSR 281
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EAGDG 370
+ GG ++ TFYYV L GFSVGG V+ + S ++ G G
Sbjct: 282 R--ARPSRRWGG---------TPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 330
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVS 429
G+I+D GT++TRL Y ++RD+F A L+ + G +LFDTCY+ SG R V+VPTVS
Sbjct: 331 GVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVS 390
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
+H G ++ LP +NYLIPVD++GTFCFA A T +SIIGN+QQQG RV FD RVG
Sbjct: 391 MHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVG 450
Query: 490 FTPNKC 495
F P C
Sbjct: 451 FVPKSC 456
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 164/373 (43%), Positives = 211/373 (56%), Gaps = 21/373 (5%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
+PV+SG SGEYF+ IGVG PP +V+DTGSD+ WLQC PC CY+Q P++DP
Sbjct: 77 LRSPVMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDP 136
Query: 204 KTSSSYSPLPCAAPQCKS-LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
+ S ++ +PCA+PQC+ L C R C+Y V YGDGS + GDL T+T+ +
Sbjct: 137 RNSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTR 196
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA---SG 314
V + LGCGHDNEGL +AGLLG G G LS Q+ +YCL DR S A S
Sbjct: 197 VHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSS 256
Query: 315 VLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ--IPPSLFEMDEAGDGG 371
L F + PL N + + YYV + GFSVGG+ V SL G GG
Sbjct: 257 YLVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGG 316
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVR---LAGNLKPTSGVALFDTCYDFSGL---RSVRV 425
++VD GTAI+R AY ++RD+FV AG + + ++FDTCYD G VRV
Sbjct: 317 VVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRV 376
Query: 426 PTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
P++ LHF A + LP NYLIPV D FC L+++GNVQQQG V FD
Sbjct: 377 PSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFD 436
Query: 483 LANNRVGFTPNKC 495
+ R+GFTPN C
Sbjct: 437 VERGRIGFTPNGC 449
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 170/449 (37%), Positives = 241/449 (53%), Gaps = 40/449 (8%)
Query: 71 SFPLNSSSSFS---------LPLHSREILHKTRHNDY-RSLV-LSRLERDSARVNTLITK 119
+FP +SS S LP H + + +H D+ ++L RL R AR + +
Sbjct: 281 TFPSTPNSSLSRRALQKPNKLPSHGFRV--RLKHVDHVKNLTRFERLRRGVARGKNRLHR 338
Query: 120 LQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
L + L A A + + PVV+G +GE+ ++ +G+PPR FS ++DTG
Sbjct: 339 LNAMV-------LAAANATV-GDQVKAPVVAG----NGEFLMKLAIGSPPRSFSAIMDTG 386
Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYG 239
SD+ W QC+PC +C+ QS PIFDPK SSS+ + C++ C +L S C ++ C Y YG
Sbjct: 387 SDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYG 446
Query: 240 DGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTK 294
D S T G L ET +FG+S S+ G+ GCG+DN G F AGL+GLG G LSL
Sbjct: 447 DSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVS 506
Query: 295 QIKATSLAYCL--VDRDSPASGVL----EFNSARGGDAV-TAPLIRNKKVDTFYYVGLTG 347
Q+K AYCL +D P+S +L D + T PLI+N +FYY+ L G
Sbjct: 507 QLKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQG 566
Query: 348 FSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
SVGG + IP S FE+ + G GG+I+D GT IT ++ A+ SL++ F+ SG
Sbjct: 567 ISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSG 626
Query: 408 VALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
D C++ +G V VP ++ HF G L+LP +NY+I AG C A +S +
Sbjct: 627 TGGLDLCFNLPAGTNQVEVPKLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGM 684
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
SI GN+QQQ V DL + F P +C
Sbjct: 685 SIFGNLQQQNFMVVHDLQEETLSFLPTQC 713
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 161/377 (42%), Positives = 214/377 (56%), Gaps = 23/377 (6%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
+ +PV+SG SGEYF+ I VG PP + +V+DTGSD+ WLQC PC CY+Q P++
Sbjct: 71 DRLRSPVMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLY 130
Query: 202 DPKTSSSYSPLPCAAPQCKS-LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
DP++SS++ +PCA+P+C+ L C R C+Y V YGDGS + GDL T+ + F +
Sbjct: 131 DPRSSSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDD 190
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA--- 312
V + LGCGHDN GL +AGLLG+G G LS Q+ +YCL DR S A
Sbjct: 191 THVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNG 250
Query: 313 SGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ--IPPSLFEMDEAGD 369
S L F + PL N + + YYV + GFSVGG+ V SL G
Sbjct: 251 SSYLVFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGR 310
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNL-KPTSGVALFDTCYDFSG----LR 421
GGI+VD GTAI+R AY ++RD+F AG + K + ++FD CYD G
Sbjct: 311 GGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAA 370
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
+VRVP++ LHF G + LP NYLIPV D FC L+++GNVQQQG
Sbjct: 371 AVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFG 430
Query: 479 VSFDLANNRVGFTPNKC 495
+ FD+ R+GFTPN C
Sbjct: 431 LVFDVERGRIGFTPNGC 447
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 165/429 (38%), Positives = 234/429 (54%), Gaps = 31/429 (7%)
Query: 82 LPLHSREILHKTRHNDY-RSLV-LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
LP H + + +H D+ ++L RL R AR + +L + L A A +
Sbjct: 46 LPSHGFRV--RLKHVDHVKNLTRFERLRRGVARGKNRLHRLNAMV-------LAAANATV 96
Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP 199
+ PVV+G +GE+ ++ +G+PPR FS ++DTGSD+ W QC+PC +C+ QS P
Sbjct: 97 -GDQVKAPVVAG----NGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTP 151
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
IFDPK SSS+ + C++ C +L S C ++ C Y YGD S T G L ET +FG+S
Sbjct: 152 IFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDST 211
Query: 260 ----SVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPA 312
S+ G+ GCG+DN G F AGL+GLG G LSL Q+K AYCL +D P+
Sbjct: 212 EDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPS 271
Query: 313 SGVL----EFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
S +L D + T PLI+N +FYY+ L G SVGG + IP S FE+ +
Sbjct: 272 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 331
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVP 426
G GG+I+D GT IT ++ A+ SL++ F+ SG D C++ +G V VP
Sbjct: 332 GSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVP 391
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
++ HF G L+LP +NY+I AG C A +S +SI GN+QQQ V DL
Sbjct: 392 KLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGMSIFGNLQQQNFMVVHDLQEE 449
Query: 487 RVGFTPNKC 495
+ F P +C
Sbjct: 450 TLSFLPTQC 458
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 190/437 (43%), Positives = 244/437 (55%), Gaps = 44/437 (10%)
Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELK-----------------PAEAQILPE 142
+L + L RDS VN T QL + R EL+ P
Sbjct: 60 ALHVRLLHRDSFAVNA--TPAQLLARRLQRDELRAAWIIKAAAPAAAANDTPVVGLSSGG 117
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
F PVVS A SGEY ++I VGTP + + +DTGSDI WLQC+PC CY QS P+FD
Sbjct: 118 AFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFD 177
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSA---CRANRCLYQVAYG-DGSFTVGDLVTETVSFGNS 258
P+ S+SY + AP C++L S + C+Y V YG DGS TVGD + ET++F
Sbjct: 178 PRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGG 237
Query: 259 GSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDSPA 312
V +++GCGHDN+GLF +AG+LGLG G +S QI A TS +YCL D +
Sbjct: 238 VQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSS 297
Query: 313 SGVLEFNSARGGDAVTA--------PLIRNKKVDTFYYVGLTGFSVGGQAVQIP-PSLFE 363
G ++ GD A P ++N + TFYYV L G SVGG V +
Sbjct: 298 PGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLK 357
Query: 364 MDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSG 419
+D G GG+I+D GTA+TRL +AY + RD+F A +L S FDTCY G
Sbjct: 358 LDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGG 417
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTR 478
R+++VPTVS+HF G L LP KNYLIPVDS GT CFAFA T ++SIIGN+QQQG R
Sbjct: 418 -RAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFR 476
Query: 479 VSFDLANNRVGFTPNKC 495
V +++ RVGF PN C
Sbjct: 477 VVYNIGGGRVGFAPNSC 493
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 166/419 (39%), Positives = 218/419 (52%), Gaps = 35/419 (8%)
Query: 90 LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV 149
L + + LV ERD+AR+NT+ +K + + P+
Sbjct: 84 LRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMS----------------NLPLQ 127
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SG + G+G Y G GTP + +++DTGSD+ W+QC+PC +CY Q D IF+PK SSSY
Sbjct: 128 SGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSY 187
Query: 210 SPLPCAAPQCKSLDVSA-----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
LPC + C L S C C+Y++ YGDGS + GD ET++ G S S +
Sbjct: 188 KTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLG-SDSFQNF 246
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEF--N 319
A GCGH N GLF GS+GLLGLG LS Q K+ AYCL D S S
Sbjct: 247 AFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVGK 306
Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ AV PL+ N TFY+VGL G SVGG + IPP++ G G IVD GT
Sbjct: 307 GSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL-----GRGSTIVDSGTV 361
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
ITRL QAYN+L+ SF +L ++ DTCYD S VR+PT++ HF +
Sbjct: 362 ITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHFQNNADVA 421
Query: 440 LPAKNYLIPVDSAGT-FCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L+PV + G+ C AFA S +IIGN QQQ RV+FD R+GF C
Sbjct: 422 VSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFASGSC 480
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 139/252 (55%), Positives = 184/252 (73%), Gaps = 1/252 (0%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
++RD R+ ++ +I H + + + E TP+VSGASQGSGEYFSR+G+
Sbjct: 1 MDRD-LRLTLMVFHCCKSILATYFHVILLFSIKTIAEALETPLVSGASQGSGEYFSRVGI 59
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
G+PP+ MV+DTGSD+NW+QC PC +CYQQ+DPIF+P SSSY+PL C QCKSLDVS
Sbjct: 60 GSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVS 119
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
CR + CLY+V+YGDGS+TVGD TET++ S S+ +A+GCGHDNEGLFVG+AGLLGL
Sbjct: 120 ECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGL 179
Query: 286 GGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
GGG LS QI A+S +YCLV+RD+ ++ LEFNS +VTAPL+RN ++DTFYY+G+
Sbjct: 180 GGGSLSFPSQINASSFSYCLVNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGM 239
Query: 346 TGFSVGGQAVQI 357
TG + +QI
Sbjct: 240 TGIGESYKILQI 251
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 172/441 (39%), Positives = 233/441 (52%), Gaps = 46/441 (10%)
Query: 79 SFSLPLHSREILHKTRHNDYR-SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
+ +P+ R+ L R SL+ RL D+AR +L+
Sbjct: 26 TLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATG---------------- 69
Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
+PV SG SGEYF+ +GVGTP + +V+DTGSD+ WLQC PC CY Q
Sbjct: 70 -----RLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQR 124
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTET 252
+FDP+ SS+Y +PC++PQC++L C + C Y VAYGDGS + GDL T+
Sbjct: 125 GQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDK 184
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRD 309
++F N V + LGCG DNEGLF +AGLLG+G G +S++ Q+ + YCL DR
Sbjct: 185 LAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRT 244
Query: 310 SPA--SGVLEFNSARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEM 364
S + S L F A TA L+ N + + YYV + GFSVGG+ V + +
Sbjct: 245 SRSTRSSYLVFGRTPEPPSTAFTA-LLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLAL 303
Query: 365 DEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV---ALFDTCYDFSGL 420
D A G GG++VD GTAI+R AY +LRD+F A ++FD CYD G
Sbjct: 304 DTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGR 363
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDS----AGTF--CFAFAPTSSALSIIGNVQQ 474
+ P + LHF G + LP +NY +PVD A ++ C F LS+IGNVQQ
Sbjct: 364 PAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
QG RV FD+ R+GF P C
Sbjct: 424 QGFRVVFDVEKERIGFAPKGC 444
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 195/487 (40%), Positives = 259/487 (53%), Gaps = 56/487 (11%)
Query: 52 SFEPETLEPFAEESETAAE----SFPLNSSSSFSLPLHSREILHKTR---HNDYRSLVLS 104
S+ L P A S AAE + + ++S S +H R +LH+ + L+
Sbjct: 34 SYAVTPLSPHAHSSPEAAEDGAHAHQEDMAASSSSAMHVR-LLHRDSFAVNATGAELLAR 92
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILP--EDFSTPVVSGASQGSGEYFSR 162
RL+RD R +I+ + P + L PVVS A SG+Y ++
Sbjct: 93 RLQRDELRAAWIIS-------TAAANGTPPPDVVGLSTGRGLVAPVVSRAPT-SGDYIAK 144
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
I VGTP + + LDT SD+ WLQC+PC CY QS P+FDP+ S+SY + AP C++L
Sbjct: 145 IAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQAL 204
Query: 223 DVSA---CRANRCLYQVAYGDG------SFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
S + C+Y V YGDG S +VGDLV ET++F +++GCGHDN+
Sbjct: 205 GRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNK 264
Query: 274 GLF-VGSAGLLGLGGGMLSLTKQIK----ATSLAYCLVD----RDSPASGVLEFNSARGG 324
GLF +AG+LGL G +S+ QI S +YCLVD SP+S L F +
Sbjct: 265 GLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSS-TLTFGAGAVD 323
Query: 325 DAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDC 376
+ A P + N+ + TFYYV L G SVGG V P + E D G GG+I+D
Sbjct: 324 TSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERDLQLDPYTGHGGVILDS 380
Query: 377 GTAITRLQTQAYNSLRDSFVRLA---GNLKPTSGVALFDTCYDF---SGLRS-VRVPTVS 429
GT +TRL AY + RD+F A G + LFDTCY +GLR V+VP VS
Sbjct: 381 GTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVS 440
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRV 488
+HF G L L KNYLI VDS GT CFAFA T ++S+IGN+ QQG RV +D+ RV
Sbjct: 441 MHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDIGGQRV 500
Query: 489 GFTPNKC 495
GF PN C
Sbjct: 501 GFAPNSC 507
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 156/286 (54%), Positives = 195/286 (68%), Gaps = 27/286 (9%)
Query: 25 ASSRGLSETA-TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSS-FSL 82
A SR + A TT+LDV S++Q+T +L+F + ++S P SS+S SL
Sbjct: 20 AHSRNIPHNAKTTILDVVSSIQKTYQVLNFNQNLKQQQQQKS-------PFTSSTSTLSL 72
Query: 83 PLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE 142
LHSR L + H DY+SL LSRL+RDSARV + TKL +N D+
Sbjct: 73 QLHSRASL--SSHADYKSLTLSRLDRDSARVKYITTKLNQN-FNTDK------------- 116
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
S P++SG SQGSGEYFSRIG+G PP Q MVLDTGSDI+W+QC PC +CY+Q+DPIF+
Sbjct: 117 -LSGPIISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFE 175
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
P S+SY+PL C A QC+ LD S CR CLYQV+YGDGS+TVGD VTETV+ G VK
Sbjct: 176 PTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIG-VNKVK 234
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR 308
+ALGCGH+NEGLFVG+AGL+GLGGG LS Q+ +TS +YCLVDR
Sbjct: 235 NVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNSTSFSYCLVDR 280
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 151/383 (39%), Positives = 211/383 (55%), Gaps = 21/383 (5%)
Query: 128 DRHELKPAEAQ-----ILPEDFST-PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSD 181
DRH + A+ + E +T PV SGAS GSG+Y +G+GTP ++F+++ DTGSD
Sbjct: 96 DRHRVDSIHARLSSHGVFQEKQATLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSD 155
Query: 182 INWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS---ACRANRCLYQVA 237
+ W QC PC + CY+Q +P DP S+SY + C++ CK LD +C + CLYQV
Sbjct: 156 LTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQ 215
Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TK 294
YGDGS+++G TET++ +S K GCG N GLF G+AGLLGLG LSL T
Sbjct: 216 YGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTA 275
Query: 295 QIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQA 354
Q +YCL S + G L F PL + K FY + +T SVGG
Sbjct: 276 QKYKKLFSYCL-PASSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNK 334
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
+ I S+F G ++D GT ITRL + AY++L +F +L + T G ++FDTC
Sbjct: 335 LSIDASIFSTS-----GTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTC 389
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL--SIIGNV 472
YDFS ++++P V + F G +D+ L PV+ C AFA + +I GN
Sbjct: 390 YDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNT 449
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQ+ +V +D A RVGF P+ C
Sbjct: 450 QQKTYQVVYDDAKGRVGFAPSGC 472
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 170/441 (38%), Positives = 232/441 (52%), Gaps = 46/441 (10%)
Query: 79 SFSLPLHSREILHKTRHNDYR-SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
+ +P+ R+ L R SL+ RL D+AR +L+
Sbjct: 26 TLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATG---------------- 69
Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
+PV SG SGEYF+ +GVGTP + +V+DTGSD+ WLQC PC CY Q
Sbjct: 70 -----RLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQR 124
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTET 252
+FDP+ SS+Y +PC++PQC++L C + C Y VAYGDGS + G+L T+
Sbjct: 125 GQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDK 184
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRD 309
++F N V + LGCG DNEGLF +AGLLG+ G +S++ Q+ + YCL DR
Sbjct: 185 LAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRT 244
Query: 310 SPA--SGVLEFNSARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEM 364
S + S L F A TA L+ N + + YYV + GFSVGG+ V + +
Sbjct: 245 SRSTRSSYLVFGRTPEPPSTAFTA-LLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLAL 303
Query: 365 DEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV---ALFDTCYDFSGL 420
D A G GG++VD GTAI+R AY +LRD+F A ++FD CYD G
Sbjct: 304 DTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGR 363
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDS----AGTF--CFAFAPTSSALSIIGNVQQ 474
+ P + LHF G + LP +NY +PVD A ++ C F LS+IGNVQQ
Sbjct: 364 PAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
QG RV FD+ R+GF P C
Sbjct: 424 QGFRVVFDVEKERIGFAPKGC 444
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 157/398 (39%), Positives = 225/398 (56%), Gaps = 21/398 (5%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
LERD ARV+++ K+ A + PA A + S P G S G+G Y +G+
Sbjct: 100 LERDQARVDSIHRKVAGA--GGAPSVVDPARAS--EQGVSLPAQRGISLGTGNYVVSVGL 155
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP +Q++++ DTGSD++W+QC+PC +CY+Q DP+FDP SS+Y+ + C AP+C+ LD S
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215
Query: 226 ACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
C ++ RC Y+V YGD S T G+LV +T++ S ++ G GCG N GLF GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275
Query: 285 LGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
LG +SL Q + YCL S G L A +A L + +FY
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSS-GRGYLSLGGAPPANAQFTALA-DGATPSFY 333
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
Y+ L G VGG+A++IP + F ++D GT ITRL +AY LR +F R
Sbjct: 334 YIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQ 389
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAGKALDLPAKNYLIPVDSAGTFCFAF 459
K +++ DTCYDF+G R+ ++PTV L F GA +LD Y+ V A C AF
Sbjct: 390 YKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQA---CLAF 446
Query: 460 APTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AP + S+++I+GN QQ+ V++D+AN R+GF C
Sbjct: 447 APNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 157/398 (39%), Positives = 225/398 (56%), Gaps = 21/398 (5%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
LERD ARV+++ K+ A + PA A + S P G S G+G Y +G+
Sbjct: 100 LERDQARVDSIHRKVAGA--GGAPSVVDPARAS--EQGVSLPAQRGISLGTGNYVVSVGL 155
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP +Q++++ DTGSD++W+QC+PC +CY+Q DP+FDP SS+Y+ + C AP+C+ LD S
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215
Query: 226 ACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
C ++ RC Y+V YGD S T G+LV +T++ S ++ G GCG N GLF GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275
Query: 285 LGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
LG +SL Q + YCL S G L A +A L + +FY
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSS-GRGYLSLGGAPPANAQFTALA-DGATPSFY 333
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
Y+ L G VGG+A++IP + F ++D GT ITRL +AY LR +F R
Sbjct: 334 YIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQ 389
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAGKALDLPAKNYLIPVDSAGTFCFAF 459
K +++ DTCYDF+G R+ ++PTV L F GA +LD Y+ V A C AF
Sbjct: 390 YKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQA---CLAF 446
Query: 460 APTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AP + S+++I+GN QQ+ V++D+AN R+GF C
Sbjct: 447 APNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 204/360 (56%), Gaps = 18/360 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
S P SG + +G Y +G+GTP ++++V DTGSD W+QCRPC +CY+Q +P+FDP
Sbjct: 149 SLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDP 208
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
SS+Y+ + C C LD + C CLY V YGDGS+TVG +T++ + ++KG
Sbjct: 209 AKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-AIKG 267
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN- 319
GCG N GLF +AGL+GLG G SLT Q + AYCL + +G L+F
Sbjct: 268 FRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-GTGYLDFGP 326
Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ G +A P++ +K TFYYVG+TG VGGQ V + S+F G +VD GT
Sbjct: 327 GSAGNNARLTPMLTDKG-QTFYYVGMTGIRVGGQQVPVAESVFST-----AGTLVDSGTV 380
Query: 380 ITRLQTQAYNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
ITRL AY +L +F V LA K G ++ DTCYDF+GL V +PTVSL F G
Sbjct: 381 ITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGAC 440
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LD+ + + A C AFA +++I+GN QQ+ V +DL VGF P C
Sbjct: 441 LDVDVSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 170/433 (39%), Positives = 234/433 (54%), Gaps = 28/433 (6%)
Query: 75 NSSSSFSLPLHS-REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELK 133
+SSS+ LPLH R + S VL+ D+AR+ + +L
Sbjct: 38 HSSSAVHLPLHHPRGPCSPLSADIPFSAVLTH---DAARIASFAARLAKKSSPSSASATT 94
Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TE 192
A L S P+ G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC
Sbjct: 95 QAAGSSLA---SVPLTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVS 151
Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQC-----KSLDVSACR-ANRCLYQVAYGDGSFTVG 246
C++QS P+FDPKTSSSY+ + C++PQC +L+ + C +N C+YQ +YGD SF+VG
Sbjct: 152 CHRQSGPVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVG 211
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAY 303
L +TVSFG + SV GCG DNEGLF SAGL+GL LSL Q+ T S +Y
Sbjct: 212 YLSKDTVSFG-ANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSY 270
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
CL S SG L S G P++ N D+ Y++ L+G +V G+ + + S +
Sbjct: 271 CLPSTSS--SGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYT 328
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRS 422
I+D GT ITRL T Y +L + + G+ K + ++ DTC++ +
Sbjct: 329 SLP-----TIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKL 383
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
VP VS+ F G L L A N L+ VD A T C AFAP SA +IIGN QQQ V +D
Sbjct: 384 RAVPAVSMAFSGGATLKLSAGNLLVDVDGA-TTCLAFAPARSA-AIIGNTQQQTFSVVYD 441
Query: 483 LANNRVGFTPNKC 495
+ +NR+GF C
Sbjct: 442 VKSNRIGFAAAGC 454
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 163/427 (38%), Positives = 234/427 (54%), Gaps = 21/427 (4%)
Query: 80 FSLPLHSREILHKTRHNDYRSLVLS-RLERDSARVNTLITKLQLAIYNVDR--HELKPAE 136
F P HS H++ + LE + N +TK +L V+R L+ E
Sbjct: 18 FVAPTHSTSRTALNHHHEPKVAGFQIMLEHVDSGKN--LTKFELLERAVERGSRRLQRLE 75
Query: 137 AQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
A + P TPV +G GEY + +GTP + FS ++DTGSD+ W QC+PCT+C+
Sbjct: 76 AMLNGPSGVETPVYAG----DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFN 131
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
QS PIF+P+ SSS+S LPC++ C++L C N C Y YGDGS T G + TET++F
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF 191
Query: 256 GNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
G S S+ I GCG +N+G G+ AGL+G+G G LSL Q+ T +YC+ S S
Sbjct: 192 G-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSNSS 250
Query: 315 VLEFNSARGGDAVTAP---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDG 370
L S +P LI++ ++ TFYY+ L G SVG + I PS+F+++ G G
Sbjct: 251 TLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG 310
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDF-SGLRSVRVPTV 428
GII+D GT +T AY ++R +F+ NL +G + FD C+ S ++++PT
Sbjct: 311 GIIIDSGTTLTYFVDNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
+HF G L LP++NY I S G C A +S +SI GN+QQQ V +D N+ V
Sbjct: 370 VMHFDGGD-LVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVV 427
Query: 489 GFTPNKC 495
F +C
Sbjct: 428 SFLSAQC 434
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 203/360 (56%), Gaps = 18/360 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
S P SG + +G Y +G+GTP ++++V DTGSD W+QCRPC +CY+Q P+FDP
Sbjct: 149 SLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDP 208
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
SS+Y+ + C C LD + C CLY V YGDGS+TVG +T++ + ++KG
Sbjct: 209 AKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-AIKG 267
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN- 319
GCG N GLF +AGL+GLG G SLT Q + AYCL + +G L+F
Sbjct: 268 FRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-GTGYLDFGP 326
Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ G +A P++ +K TFYYVG+TG VGGQ V + S+F G +VD GT
Sbjct: 327 GSAGNNARLTPMLTDKG-QTFYYVGMTGIRVGGQQVPVAESVFST-----AGTLVDSGTV 380
Query: 380 ITRLQTQAYNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
ITRL AY +L +F V LA K G ++ DTCYDF+GL V +PTVSL F G
Sbjct: 381 ITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGAC 440
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LD+ + + A C AFA +++I+GN QQ+ V +DL VGF P C
Sbjct: 441 LDVDVSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 163/427 (38%), Positives = 233/427 (54%), Gaps = 21/427 (4%)
Query: 80 FSLPLHSREILHKTRHNDYRSLVLS-RLERDSARVNTLITKLQLAIYNVDR--HELKPAE 136
F P HS H++ + LE + N +TK +L V+R L+ E
Sbjct: 18 FVAPTHSTSRTALNHHHEPKVAGFQIMLEHVDSGKN--LTKFELLERAVERGSRRLQRLE 75
Query: 137 AQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
A + P TPV +G GEY + +GTP + FS ++DTGSD+ W QC+PCT+C+
Sbjct: 76 AMLNGPSGVETPVYAG----DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFN 131
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
QS PIF+P+ SSS+S LPC++ C++L C N C Y YGDGS T G + TET++F
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF 191
Query: 256 GNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
G S S+ I GCG +N+G G+ AGL+G+G G LSL Q+ T +YC+ S S
Sbjct: 192 G-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTSS 250
Query: 315 VLEFNSARGGDAVTAP---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDG 370
L S +P LI + ++ TFYY+ L G SVG + I PS+F+++ G G
Sbjct: 251 TLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG 310
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDF-SGLRSVRVPTV 428
GII+D GT +T AY ++R +F+ NL +G + FD C+ S ++++PT
Sbjct: 311 GIIIDSGTTLTYFADNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
+HF G L LP++NY I S G C A +S +SI GN+QQQ V +D N+ V
Sbjct: 370 VMHFDGGD-LVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVV 427
Query: 489 GFTPNKC 495
F +C
Sbjct: 428 SFLFAQC 434
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 204/353 (57%), Gaps = 16/353 (4%)
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
G + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP +SS+Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
+ + CAAP C LDVS C CLY V YGDGS+++G +T++ + +VKG GCG
Sbjct: 231 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA 326
N+GLF +AGLLGLG G SL Q A+CL R S +G L+F +
Sbjct: 291 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPAR-STGTGYLDFGAGSPPAT 349
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
T P++ TFYYVG+TG VGG+ + I PS+F G IVD GT ITRL
Sbjct: 350 TTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVFAA-----AGTIVDSGTVITRLPPA 403
Query: 387 AYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
AY+SLR +F + + V+L DTCYDF+G+ V +PTVSL F G ALD+ A
Sbjct: 404 AYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 463
Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ V SA C AFA + I+GN Q + V++D+ VGF+P C
Sbjct: 464 IMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 204/353 (57%), Gaps = 16/353 (4%)
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
G + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP +SS+Y
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
+ + CAAP C LDVS C CLY V YGDGS+++G +T++ + +VKG GCG
Sbjct: 232 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 291
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA 326
N+GLF +AGLLGLG G SL Q A+CL R S +G L+F +
Sbjct: 292 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPR-STGTGYLDFGAGSPPAT 350
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
T P++ TFYYVG+TG VGG+ + I PS+F G IVD GT ITRL
Sbjct: 351 TTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVFAA-----AGTIVDSGTVITRLPPA 404
Query: 387 AYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
AY+SLR +F + + V+L DTCYDF+G+ V +PTVSL F G ALD+ A
Sbjct: 405 AYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 464
Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ V SA C AFA + I+GN Q + V++D+ VGF+P C
Sbjct: 465 IMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 204/353 (57%), Gaps = 16/353 (4%)
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
G + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP +SS+Y
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
+ + CAAP C LDVS C CLY V YGDGS+++G +T++ + +VKG GCG
Sbjct: 235 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 294
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA 326
N+GLF +AGLLGLG G SL Q A+CL R S +G L+F +
Sbjct: 295 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPAR-STGTGYLDFGAGSPPAT 353
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
T P++ TFYYVG+TG VGG+ + I PS+F G IVD GT ITRL
Sbjct: 354 TTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVFAA-----AGTIVDSGTVITRLPPA 407
Query: 387 AYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
AY+SLR +F + + V+L DTCYDF+G+ V +PTVSL F G ALD+ A
Sbjct: 408 AYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 467
Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ V SA C AFA + I+GN Q + V++D+ VGF+P C
Sbjct: 468 IMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 265 bits (676), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 152/357 (42%), Positives = 203/357 (56%), Gaps = 19/357 (5%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
SG + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP SS+
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSST 229
Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
Y+ + CAAP C LD C CLY V YGDGS+++G +T++ + +VKG GC
Sbjct: 230 YANVSCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 289
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF---NSAR 322
G NEGLF +AGLLGLG G SL Q A+CL R S +G L+F + A
Sbjct: 290 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS-GTGYLDFGPGSPAA 348
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G +T P++ + TFYYVG+TG VGGQ + IP S+F G IVD GT ITR
Sbjct: 349 AGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 402
Query: 383 LQTQAYNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L AY+SLR +FV A K V+L DTCYDF+G+ V +PTVSL F G LD+
Sbjct: 403 LPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDV 462
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + S C FA + I+GN Q + V++D+ VGF+P C
Sbjct: 463 DASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 264 bits (674), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 151/357 (42%), Positives = 202/357 (56%), Gaps = 19/357 (5%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
SG + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP SS+
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
Y+ + CAAP C LD C CLY V YGDGS+++G +T++ + +VKG GC
Sbjct: 231 YANISCAAPACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF---NSAR 322
G NEGLF +AGLLGLG G SL Q A+CL R S +G L+F + A
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS-GTGYLDFGPGSPAA 349
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G +T P++ + TFYYVG+TG VGGQ + IP S+F G IVD GT ITR
Sbjct: 350 AGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFTT-----AGTIVDSGTVITR 403
Query: 383 LQTQAYNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L AY+SLR +F A K V+L DTCYDF+G+ V +PTVSL F G LD+
Sbjct: 404 LPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + S C FA + I+GN Q + V++D+ VGF+P C
Sbjct: 464 DASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 170/459 (37%), Positives = 237/459 (51%), Gaps = 38/459 (8%)
Query: 66 ETAAESFPLNSSSSFSLPLHSREILH--KTRHNDYRSLVLSRLERDSARVNTLITKLQLA 123
+ A E P + SSS L + R +TR + L E+D+ RV + ++ +
Sbjct: 60 DAADEQKPASPSSSLKLHMTHRRGAEGGRTRKGSFLDLA----EKDAVRVEAMHRRVASS 115
Query: 124 IYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDIN 183
+ R +++ V SG + GS EY + VGTPPR+F M++DTGSD+N
Sbjct: 116 SSSPRRGRALSESERVV-----ATVESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLN 170
Query: 184 WLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL------DVSACR---ANRCLY 234
WLQC PC +C++Q P+FDP SSSY L C P+C + ACR + C Y
Sbjct: 171 WLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPY 230
Query: 235 QVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
YGD S + GDL E+ + G S V G+ GCGH N GLF G+AGLLGLG G
Sbjct: 231 YYWYGDQSNSTGDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGP 290
Query: 290 LSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR-------NKKVD 338
LS Q++A + +YCLVD S + + F P ++ + D
Sbjct: 291 LSFASQLRAVYGGHTFSYCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPAD 350
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-R 397
TFYYV LTG VGG+ + I ++ E G GG I+D GT ++ AY +R +F+ R
Sbjct: 351 TFYYVRLTGVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDR 410
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
++G+ P + CY+ SG+ VP +SL F G D PA+NY I +D G C
Sbjct: 411 MSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCL 470
Query: 458 AFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A T + +SIIGN QQQ V++DL NNR+GF P +C
Sbjct: 471 AVLGTPRTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRC 509
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 172/446 (38%), Positives = 232/446 (52%), Gaps = 29/446 (6%)
Query: 73 PLNSSSSFSLPLHSREILH-KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHE 131
P +SS S L + R +TR + L + E+D+ R+ T+ + A V R
Sbjct: 68 PASSSPSLQLRMKHRSAEGGRTRKESF----LDKAEKDAVRIETM--HRRAARSGVARMP 121
Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
+ + L E V SG + GSGEY + VGTPPR+F M++DTGSD+NWLQC PC
Sbjct: 122 ASSSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCL 181
Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----DVSACR---ANRCLYQVAYGDGSFT 244
+C++Q P+FDP SSSY + C +C + ACR + C Y YGD S T
Sbjct: 182 DCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNT 241
Query: 245 VGDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT 299
GDL E+ + G S V G+ GCGH N GLF G+AGLLGLG G LS Q++A
Sbjct: 242 TGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAV 301
Query: 300 ---SLAYCLVDRDSPASGVLEFNS-----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
+ +YCLV+ S A + F A TA + DTFYYV L G VG
Sbjct: 302 YGHTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVG 361
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVAL 410
G + I +++ + G GG I+D GT ++ AY +R +FV L L P +
Sbjct: 362 GDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPV 421
Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSII 469
+ CY+ SG+ VP +SL F G D PA+NY + +D G C A T + +SII
Sbjct: 422 LNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSII 481
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
GN QQQ V +DL NNR+GF P +C
Sbjct: 482 GNFQQQNFHVVYDLQNNRLGFAPRRC 507
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 167/429 (38%), Positives = 238/429 (55%), Gaps = 25/429 (5%)
Query: 80 FSLPLHS--REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDR--HELKPA 135
F P HS R L+ RH + LE + N +TK QL ++R L+
Sbjct: 18 FVAPTHSTSRTALNH-RHEAKVTGFQIMLEHVDSGKN--LTKFQLLERAIERGSRRLQRL 74
Query: 136 EAQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
EA + P T V +G GEY + +GTP + FS ++DTGSD+ W QC+PCT+C+
Sbjct: 75 EAMLNGPSGVETSVYAG----DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCF 130
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
QS PIF+P+ SSS+S LPC++ C++L C N C Y YGDGS T G + TET++
Sbjct: 131 NQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLT 190
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSP 311
FG S S+ I GCG +N+G G+ AGL+G+G G LSL Q+ T +YC+ + +P
Sbjct: 191 FG-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTP 249
Query: 312 ASGVLE--FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAG 368
++ +L NS G T LI++ ++ TFYY+ L G SVG + I PS F ++ G
Sbjct: 250 SNLLLGSLANSVTAGSPNTT-LIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNG 308
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDF-SGLRSVRVP 426
GGII+D GT +T AY S+R F+ NL +G + FD C+ S ++++P
Sbjct: 309 TGGIIIDSGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSGFDLCFQTPSDPSNLQIP 367
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
T +HF G L+LP++NY I S G C A +S +SI GN+QQQ V +D N+
Sbjct: 368 TFVMHFDGGD-LELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNS 425
Query: 487 RVGFTPNKC 495
V F +C
Sbjct: 426 VVSFASAQC 434
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 178/416 (42%), Positives = 229/416 (55%), Gaps = 46/416 (11%)
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
L+ RL+RD R +ITK PA+ PE+ + VV+GA SGEY
Sbjct: 85 LLARRLQRDMRRAAWIITK-----------AATPAD----PENGT--VVTGAPT-SGEYI 126
Query: 161 SRIGVGTPPRQ---FSMVL--DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
++I VGTP F +L D GSD+ WLQC PC CY Q P+++ SSS S + C
Sbjct: 127 AKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCY 186
Query: 216 APQCKSLDVS-ACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
AP C++L S C N C Y+V YGDGS + GD ET++F V G+A+GCG DN
Sbjct: 187 APACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDN 246
Query: 273 EGLFVG-SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA-SGVLEFNSARGG--- 324
+GLF +AG+LGLG G LS QI S +YCL + + S L F S
Sbjct: 247 QGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATTT 306
Query: 325 ---DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GDGGIIVDCGTA 379
P++ N ++ TFYYVGL G SVGG V+ + S +D + G GG+IVD GTA
Sbjct: 307 TTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIVDSGTA 366
Query: 380 ITRLQTQAYNSLRDSF----VRLAGNLKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGA 434
+TRL AY + RD+F V+ G P A FDTCY G +VP VS+HF
Sbjct: 367 VTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAG 426
Query: 435 GKALDLPAKNYLIPVDS-AGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRV 488
G + LP +NYLIPVDS GT CFAFA + +SIIGN+Q QG RV +D+ RV
Sbjct: 427 GVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFRVVYDVDGQRV 482
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 148/353 (41%), Positives = 197/353 (55%), Gaps = 15/353 (4%)
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSY 209
G + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP SS+Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
+ + CAAP C LD C CLY V YGDGS+++G +T++ + +VKG GCG
Sbjct: 231 ANVSCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGDA 326
NEGLF +AGLLGLG G SL Q A+CL R S +G L+F +
Sbjct: 291 ERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSPAAR 349
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
+T + TFYYVGLTG VGG+ + IP S+F G IVD GT ITRL
Sbjct: 350 LTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFAT-----AGTIVDSGTVITRLPPA 404
Query: 387 AYNSLRDSFVRL--AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
AY+SLR +F A K V+L DTCYDF+G+ V +PTVSL F G LD+ A
Sbjct: 405 AYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASG 464
Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ SA C AFA + I+GN Q + V++D+ V F+P C
Sbjct: 465 IMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 152/400 (38%), Positives = 217/400 (54%), Gaps = 26/400 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D ARV+++ K+ A V L A + + + P G S G+G Y +G+
Sbjct: 100 LNDDQARVDSIHRKIAAAASPV----LDQARGK---KGVTLPAQRGISLGTGNYVVSMGL 152
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP R ++V DTGSD++W+QC PC++CY+Q DP+FDP SS+YS +PCA+P+C+ LD
Sbjct: 153 GTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPECQGLDSR 212
Query: 226 AC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
+C R +C Y+V YGD S T G L +T++ S + G GCG + GLF + GL+G
Sbjct: 213 SCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSDVLPGFVFGCGEQDTGLFGRADGLVG 272
Query: 285 LGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
LG +SL+ Q + +YCL S A+G L +A + +FY
Sbjct: 273 LGREKVSLSSQAASKYGAGFSYCLPSSPS-AAGYLSLGGPAPANARFTAMETRHDSPSFY 331
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
YV L G V G+ V++ P +F G ++D GT ITRL + Y +LR +F R G
Sbjct: 332 YVRLVGVKVAGRTVRVSPIVFSA-----AGTVIDSGTVITRLPPRVYAALRSAFARSMGR 386
Query: 402 --LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCF 457
K +++ DTCYDF+G +VR+P+V+L F G A LD Y+ V A C
Sbjct: 387 YGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVLYVAKVSQA---CL 443
Query: 458 AFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAP IIGN QQ+ V +D+A ++GF N C
Sbjct: 444 AFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGC 483
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 258 bits (660), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 161/435 (37%), Positives = 225/435 (51%), Gaps = 35/435 (8%)
Query: 80 FSLPLHSREILHK----TRHNDYRSLVLSRLE---RDSARVNTLITKLQLAIYNVDRHEL 132
F LP S + H+ +R N+ ++ +E D ARVN++ +KL + E
Sbjct: 27 FFLPESSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSES 86
Query: 133 KPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
K + P G++ GSG Y +G+GTP S++ DTGSD+ W QC+PC
Sbjct: 87 KSTDL---------PAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR 137
Query: 193 -CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVG 246
CY Q +PIF+P S+SY + C++ C SL + +C A+ C+Y + YGD SF+VG
Sbjct: 138 TCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVG 197
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
L E + NS G+ GCG +N+GLF G AGLLGLG LS Q +Y
Sbjct: 198 FLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSY 257
Query: 304 CLVDRDSPASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
CL S +G L F SA +V P+ +FY + + +VGGQ + IP ++F
Sbjct: 258 CLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF 316
Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
G ++D GT ITRL +AY +LR SF TSGV++ DTC+D SG ++
Sbjct: 317 STP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKT 371
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVS 480
V +P V+ F G ++L +K + V C AFA S S +I GNVQQQ V
Sbjct: 372 VTIPKVAFSFSGGAVVELGSKG-IFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVV 430
Query: 481 FDLANNRVGFTPNKC 495
+D A RVGF PN C
Sbjct: 431 YDGAGGRVGFAPNGC 445
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 258 bits (658), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 159/423 (37%), Positives = 229/423 (54%), Gaps = 31/423 (7%)
Query: 88 EILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTP 147
+ H H +Y L L L+R + R + +++L V +A D P
Sbjct: 43 RLTHVDAHGNYSRLQL--LQRAARRSHHRMSRLVARATGV--------KAVAGGGDLQVP 92
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
V G+GE+ + +GTP ++ ++DTGSD+ W QC+PC +C++QS P+FDP +SS
Sbjct: 93 V----HAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSS 148
Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIA 265
+Y+ +PC++ C L S C A++C Y YGD S T G L +ET + G + G+A
Sbjct: 149 TYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGVA 208
Query: 266 LGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV-----DRDSP----ASGV 315
GCG NEG F AGL+GLG G LSL Q+ +YCL D SP S
Sbjct: 209 FGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDGDGKSPLLLGGSAA 268
Query: 316 LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
SA T PL++N +FYYV LTG +VG + +P S F + + G GG+IVD
Sbjct: 269 AISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVD 328
Query: 376 CGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHF 432
GT+IT L+ Q Y +L+ +FV ++A S + L D C+ G+ V+VP + LHF
Sbjct: 329 SGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGL-DLCFQGPAKGVDEVQVPKLVLHF 387
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G LDLPA+NY++ ++G C AP S LSIIGN QQQ + +D+A + + F P
Sbjct: 388 DGGADLDLPAENYMVLDSASGALCLTVAP-SRGLSIIGNFQQQNFQFVYDVAGDTLSFAP 446
Query: 493 NKC 495
+C
Sbjct: 447 VQC 449
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 257 bits (657), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 164/398 (41%), Positives = 220/398 (55%), Gaps = 26/398 (6%)
Query: 117 ITKLQLAIYNVDRHE--LKPAEAQILPEDFSTP-----VVSGASQGSGEYFSRIGVGTPP 169
+TKL+ + + R + L+ A +L STP + + G+GEY + +GTPP
Sbjct: 60 LTKLERVQHGIKRGKSRLQKLNAMVLAAS-STPDSEDQLEAPIHAGNGEYLIELAIGTPP 118
Query: 170 RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA 229
+ VLDTGSD+ W QC+PCT CY+Q PIFDPK SSS+S + C + C +L S C +
Sbjct: 119 VSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTC-S 177
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFG---NSGSVKGIALGCGHDNEGL-FVGSAGLLGL 285
+ C Y +YGD S T G L TET +FG N SV I GCG DNEG F ++GL+GL
Sbjct: 178 DGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGL 237
Query: 286 GGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKKVDTFY 341
G G LSL Q+K +YCL D VL S + VT PL++N +FY
Sbjct: 238 GRGPLSLVSQLKEQRFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFY 297
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV---RL 398
Y+ L SVG + I S FE+ + G+GG+I+D GT IT +Q +AY +L+ F+ +L
Sbjct: 298 YLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKL 357
Query: 399 AGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
A L TS L D C+ SG V +P + HF G L+LPA+NY+I + G C
Sbjct: 358 A--LDKTSSTGL-DLCFSLPSGSTQVEIPKLVFHFKGGD-LELPAENYMIGDSNLGVACL 413
Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A SS +SI GNVQQQ V+ DL + F P C
Sbjct: 414 AMG-ASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 202/357 (56%), Gaps = 19/357 (5%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
SG + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP SS+
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
Y+ + CAAP C L++ C CLY V YGDGS+++G +T++ + +VKG GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGD 325
G NEGLF +AGLLGLG G SL Q A+CL R S +G L+F +
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSLAA 349
Query: 326 A---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
A +T P++ + TFYYVG+TG VGGQ + IP S+F G IVD GT ITR
Sbjct: 350 ARARLTTPML-TENGPTFYYVGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 403
Query: 383 LQTQAYNSLR--DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L AY+SLR + A K V+L DTCYDF+G+ V +PTVSL F G LD+
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + SA C AFA + I+GN Q + V++D+ VGF P C
Sbjct: 464 DASGIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 166/414 (40%), Positives = 219/414 (52%), Gaps = 31/414 (7%)
Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
+D AR++T++ ++ A P A L E V SG + GSGEY + VGT
Sbjct: 103 KDVARIHTMLRRVAGAGGGRAATNSTPRRA--LAERIVATVESGVAVGSGEYLVDLYVGT 160
Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----D 223
PPR+F M++DTGSD+NWLQC PC +C++Q P+FDP TS SY + C P+C +
Sbjct: 161 PPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPRCGLVAPPTA 220
Query: 224 VSACR---ANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGL 275
ACR ++ C Y YGD S T GDL E + G S V + GCGH N GL
Sbjct: 221 PRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGL 280
Query: 276 FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
F G+AGLLGLG G LS Q++A + +YCLVD S + F DA+
Sbjct: 281 FHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFGD---DDALLGHPR 337
Query: 333 RN---------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
N DTFYYV L G VGG+ + I PS +++ + G GG I+D GT ++
Sbjct: 338 LNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYF 397
Query: 384 QTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
AY +R +FV P + + CY+ SG+ V VP SL F G D PA
Sbjct: 398 AEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPA 457
Query: 443 KNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+NY + +D G C A T SA+SIIGN QQQ V +DL NNR+GF P +C
Sbjct: 458 ENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 152/391 (38%), Positives = 211/391 (53%), Gaps = 11/391 (2%)
Query: 109 DSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTP 168
DS T +LQ A+ R +L+ F + V + G+GE+ ++ +GTP
Sbjct: 50 DSGGNYTKFERLQRAM---KRGKLRLQRLSAKTASFESSVEAPVHAGNGEFLMKLAIGTP 106
Query: 169 PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR 228
+S ++DTGSD+ W QC+PC +C+ Q PIFDPK SSS+S LPC++ C +L +S+C
Sbjct: 107 AETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSC- 165
Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGG 287
++ C Y +YGD S T G L TET +FG++ SV I GCG DN+G F AGL+GLG
Sbjct: 166 SDGCEYLYSYGDYSSTQGVLATETFAFGDA-SVSKIGFGCGEDNDGSGFSQGAGLVGLGR 224
Query: 288 GMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
G LSL Q+ +YCL D S +L + A +A+T PLI+N +FYY+ L
Sbjct: 225 GPLSLISQLGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSL 284
Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
G SVG + I S F + G GG+I+D GT IT L+ A+ +L+ F+
Sbjct: 285 EGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDE 344
Query: 406 SGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS 464
SG D C+ +V VP + HF G L LPA+NY+I G C +SS
Sbjct: 345 SGSTGLDLCFTLPPDASTVDVPQLVFHF-EGADLKLPAENYIIADSGLGVICLTMG-SSS 402
Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+SI GN QQQ V DL + F P +C
Sbjct: 403 GMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 178/461 (38%), Positives = 239/461 (51%), Gaps = 36/461 (7%)
Query: 61 FAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL 120
AEE E + S S L + R T + L ++D R+ T+ ++
Sbjct: 57 LAEEEEQK------DRSPSLKLHMSRRSPAEATAGRTRKDSFLESAQKDGVRIATMHRRV 110
Query: 121 QL-AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
L A R + + L E V SG + GSGEY + VGTPPR+F M++DTG
Sbjct: 111 ALQAQAQPGRRSASSSPRRALSERLVATVESGVAVGSGEYLVEVYVGTPPRRFQMIMDTG 170
Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA----CRANR---C 232
SD+NWLQC PC +C+ Q P+FDP S+SY + C +C + A CR++R C
Sbjct: 171 SDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPC 230
Query: 233 LYQVAYGDGSFTVGDLVTE--TVSFGNSGS--VKGIALGCGHDNEGLFVGSAGLLGLGGG 288
Y YGD S T GDL E TV+ S S V G+ LGCGH N GLF G+AGLLGLG G
Sbjct: 231 PYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRG 290
Query: 289 MLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAV--TAPLIR------NKKV 337
LS Q++A + +YCLVD S + F G D V + P + +
Sbjct: 291 PLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVF----GDDNVLLSHPQLNYTAFAPSAAE 346
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+TFYYV L G VGG+ + IP + + + E G GG I+D GT ++ AY ++R +FV
Sbjct: 347 NTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFV 406
Query: 397 RLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
P + + CY+ SG+ V VP SL F G D PA+NY I +D+ G
Sbjct: 407 DRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIM 466
Query: 456 CFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A T SA+SIIGN QQQ V +DL +NR+GF P +C
Sbjct: 467 CLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRC 507
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 157/394 (39%), Positives = 218/394 (55%), Gaps = 19/394 (4%)
Query: 117 ITKLQLAIYNVDRHE--LKPAEAQILPE---DFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
+TKL+ + + R + L+ A +L D + + G+GEY + +GTPP
Sbjct: 61 LTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGEYLMELAIGTPPVS 120
Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
+ VLDTGSD+ W QC+PCT+CY+Q PIFDPK SSS+S + C + C ++ S C ++
Sbjct: 121 YPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPSSTC-SDG 179
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFG---NSGSVKGIALGCGHDNEGL-FVGSAGLLGLGG 287
C Y +YGD S T G L TET +FG N SV I GCG DNEG F ++GL+GLG
Sbjct: 180 CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGR 239
Query: 288 GMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKKVDTFYYV 343
G LSL Q+K +YCL D +L S + VT PL++N +FYY+
Sbjct: 240 GPLSLVSQLKEPRFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYL 299
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNL 402
L G SVG + I S FE+ + G+GG+I+D GT IT ++ +A+ +L+ F+ + L
Sbjct: 300 SLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPL 359
Query: 403 KPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
TS L D C+ SG V +P + HF G L+LPA+NY+I + G C A
Sbjct: 360 DKTSSTGL-DLCFSLPSGSTQVEIPKIVFHFKGGD-LELPAENYMIGDSNLGVACLAMG- 416
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
SS +SI GNVQQQ V+ DL + F P C
Sbjct: 417 ASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 155/388 (39%), Positives = 213/388 (54%), Gaps = 13/388 (3%)
Query: 117 ITKLQLAIYNVDR--HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSM 174
+TK Q + + R H L+ A +L + + S G+GE+ + +GTPP +S
Sbjct: 56 LTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLSGNGEFLMNLAIGTPPETYSA 115
Query: 175 VLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLY 234
++DTGSD+ W QC+PCT+C+ Q PIFDPK SSS+S L C++ CK+L S+C ++ C Y
Sbjct: 116 IMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSC-SDSCEY 174
Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLT 293
YGD S T G + TET +FG S+ + GCG DNEG F +GL+GLG G LSL
Sbjct: 175 LYTYGDYSSTQGTMATETFTFGKV-SIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLV 233
Query: 294 KQIKATSLAYCLVDRDSPASGVL---EFNSARGGDAV--TAPLIRNKKVDTFYYVGLTGF 348
Q+K +YCL D + L S G A T PLI+N +FYY+ L G
Sbjct: 234 SQLKEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGI 293
Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV 408
SVGG + I S F++ + G GG+I+D GT IT L+ A++ ++ F G SG
Sbjct: 294 SVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGA 353
Query: 409 ALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS 467
+ CY+ S + VP + LHF G L+LP +NY+I S G C A +S +S
Sbjct: 354 TGLELCYNLPSDTSELEVPKLVLHF-TGADLELPGENYMIADSSMGVICLAMG-SSGGMS 411
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I GNVQQQ VS DL + F P C
Sbjct: 412 IFGNVQQQNMFVSHDLEKETLSFLPTNC 439
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 151/362 (41%), Positives = 200/362 (55%), Gaps = 19/362 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDP 203
S P SG++ G+G Y IG+GTP ++++V DTGSD W+QC PC CY+Q + +FDP
Sbjct: 147 SLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDP 206
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
SS+Y+ + CAAP C L + C CLY V YGDGS+++G +T++ + ++KG
Sbjct: 207 ARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKG 266
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNS 320
GCG NEGL+ +AGLLGLG G SL Q A+C R S +G L+F
Sbjct: 267 FRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSS-GTGYLDFGP 325
Query: 321 ARGGDAVTAPLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
AV+A L VD TFYYVGLTG VGG+ + IP S+F G IVD G
Sbjct: 326 GS-LPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTS-----GTIVDSG 379
Query: 378 TAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG 435
T ITRL AY+SLR +F K ++L DTCYDF+G+ V +PTVSL F G
Sbjct: 380 TVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGG 439
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+LD+ A +I S C FA + I+GN Q + V +D+ VGF P
Sbjct: 440 ASLDVHASG-IIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPG 498
Query: 494 KC 495
C
Sbjct: 499 AC 500
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 153/362 (42%), Positives = 209/362 (57%), Gaps = 23/362 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
S P+ G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC C++QS P+FDP
Sbjct: 123 SVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDP 182
Query: 204 KTSSSYSPLPCAAPQCK-----SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
KTSSSY+ + C+ PQC +L+ +AC ++ C+YQ +YGD SF+VG L +TVSFG
Sbjct: 183 KTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFG- 241
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCL-VDRDSPAS 313
S SV GCG DNEGLF SAGL+GL LSL Q+ T S +YCL S
Sbjct: 242 SNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYL 301
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
+ +N G P++ + D+ Y++ L+G +V G+ + + S E I
Sbjct: 302 SIGSYNP---GQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSS-----EYSSLPTI 353
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT ITRL T Y++L + K ++ DTC+ S+RVP VS+ F
Sbjct: 354 IDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCF-VGQASSLRVPAVSMAFS 412
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G AL L A+N L+ VDS+ T C AFAP SA +IIGN QQQ V +D+ +NR+GF
Sbjct: 413 GGAALKLSAQNLLVDVDSSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSNRIGFAAG 470
Query: 494 KC 495
C
Sbjct: 471 GC 472
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 165/414 (39%), Positives = 218/414 (52%), Gaps = 31/414 (7%)
Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
+D AR++T++ ++ A P A L E V SG + GSGEY + VGT
Sbjct: 103 KDVARIHTMLRRVAGAGGGRAATNSTPRRA--LAERIVATVESGVAVGSGEYLVDLYVGT 160
Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----D 223
PPR+F M++DTGSD+NWLQC PC +C++Q P+FDP S SY + C P+C +
Sbjct: 161 PPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPRCGLVAPPTA 220
Query: 224 VSACR---ANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGL 275
ACR ++ C Y YGD S T GDL E + G S V + GCGH N GL
Sbjct: 221 PRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGL 280
Query: 276 FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
F G+AGLLGLG G LS Q++A + +YCLVD S + F DA+
Sbjct: 281 FHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFGD---DDALLGHPR 337
Query: 333 RN---------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
N DTFYYV L G VGG+ + I PS +++ + G GG I+D GT ++
Sbjct: 338 LNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYF 397
Query: 384 QTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
AY +R +FV P + + CY+ SG+ V VP SL F G D PA
Sbjct: 398 AEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPA 457
Query: 443 KNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+NY + +D G C A T SA+SIIGN QQQ V +DL NNR+GF P +C
Sbjct: 458 ENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 158/419 (37%), Positives = 226/419 (53%), Gaps = 25/419 (5%)
Query: 89 ILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS--- 145
+LH + L + + DS + +TK +L + R E + + + S
Sbjct: 30 LLHHGQKRPQPGLRVDLEQVDSGKN---LTKYELIKRAIKRGERRMRSINAMLQSSSGIE 86
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
TPV +G GEY + +GTP FS ++DTGSD+ W QC PCT+C+ Q PIF+P+
Sbjct: 87 TPVYAG----DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQD 142
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
SSS+S LPC + C+ L C N C Y YGDGS T G + TET +F S SV IA
Sbjct: 143 SSSFSTLPCESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIA 201
Query: 266 LGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
GCG DN+G G+ AGL+G+G G LSL Q+ +YC+ S + L SA G
Sbjct: 202 FGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASG 261
Query: 325 DAVTAP---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+P LI + T+YY+ L G +VGG + IP S F++ + G GG+I+D GT +T
Sbjct: 262 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 321
Query: 382 RLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKA 437
L AYN++ +F + L + +SG++ TC+ S +V+VP +S+ F G
Sbjct: 322 YLPQDAYNAVAQAFTDQINLPTVDESSSGLS---TCFQQPSDGSTVQVPEISMQFDGG-V 377
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+L +N LI + G C A +S +SI GN+QQQ T+V +DL N V F P +C
Sbjct: 378 LNLGEQNILIS-PAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 169/422 (40%), Positives = 225/422 (53%), Gaps = 36/422 (8%)
Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
S L E+D+ R++T+ + L+ R + P A L E V SG GSGEY
Sbjct: 92 SFFLDSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRA--LSERVVATVESGVPVGSGEY 149
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+ +GTPPR+F M++DTGSD+NWLQC PC +C++QS PIFDP S SY + C +C
Sbjct: 150 LVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRC 209
Query: 220 KSLDVSA------CRANR---CLYQVAYGDGSFTVGDLVTE--TVSFGNSGS--VKGIAL 266
+ + A CR R C Y YGD S T GDL E TV+ SG+ V G+A
Sbjct: 210 RLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAF 269
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSAR 322
GCGH N GLF G+AGLLGLG G LS Q++ + +YCLV+ S A + F
Sbjct: 270 GCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGH-- 327
Query: 323 GGDAVTAPLIRN-------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
DA+ A N DTFYY+ L VGG+AV I D GG I+D
Sbjct: 328 -DDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNI-----SSDTLSAGGTIID 381
Query: 376 CGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
GT ++ AY ++R +F+ R++ + G + CY+ SG V VP +SL F
Sbjct: 382 SGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFAD 441
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G A + PA+NY I ++ G C A T S +SIIGN QQQ V +DL +NR+GF P
Sbjct: 442 GAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPR 501
Query: 494 KC 495
+C
Sbjct: 502 RC 503
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 154/402 (38%), Positives = 211/402 (52%), Gaps = 28/402 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D ARVN++ +KL + E K + P G++ GSG Y +G+
Sbjct: 88 LRLDQARVNSIHSKLSKKLATDHVSESKSTDL---------PAKDGSTLGSGNYIVTVGL 138
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
GTP S++ DTGSD+ W QC+PC CY Q +PIF+P S+SY + C++ C SL
Sbjct: 139 GTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSS 198
Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
+ +C A+ C+Y + YGD SF+VG L E + NS G+ GCG +N+GLF G
Sbjct: 199 ATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGV 258
Query: 280 AGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVT-APLIRNK 335
AGLLGLG LS Q +YCL S +G L F SA +V P+
Sbjct: 259 AGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSAS-YTGHLTFGSAGISRSVKFTPISTIT 317
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
+FY + + +VGGQ + IP ++F G ++D GT ITRL +AY +LR SF
Sbjct: 318 DGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSF 372
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
TSGV++ DTC+D SG ++V +P V+ F G ++L +K + V
Sbjct: 373 KAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKG-IFYVFKISQV 431
Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA S S +I GNVQQQ V +D A RVGF PN C
Sbjct: 432 CLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 202/357 (56%), Gaps = 19/357 (5%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
SG + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP SS+
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
Y+ + CAAP C L++ C CLY V YGDGS+++G +T++ + +VKG GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGD 325
G NEGLF +AGLLGLG G SL Q A+CL R S +G L+F +
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSLAA 349
Query: 326 A---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
A +T P++ + TFYYVG+TG VGGQ + IP S+F G IVD GT ITR
Sbjct: 350 ASARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 403
Query: 383 LQTQAYNSLR--DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L AY+SLR + A K V+L DTCYDF+G+ V +PTVSL F G LD+
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + SA C AFA + I+GN Q + V++D+ VGF P C
Sbjct: 464 DASGIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 158/416 (37%), Positives = 217/416 (52%), Gaps = 27/416 (6%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL------PEDFSTPVVSGASQGS 156
LS DS + T I K+Q I N H L A + P+D + + + GS
Sbjct: 47 LSLRHVDSGKNLTKIQKIQRGI-NRGFHRLNRLGAVAVLAVASKPDD-TNNIKAPTHGGS 104
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GE+ + +G P ++S ++DTGSD+ W QC+PCTEC+ Q PIFDP+ SSSYS + C++
Sbjct: 105 GEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSS 164
Query: 217 PQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
C +L S C ++ C Y YGD S T G L TET +F + S+ GI GCG +NEG
Sbjct: 165 GLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEG 224
Query: 275 L-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDA------ 326
F +GL+GLG G LSL Q+K T +YCL DS AS L S G
Sbjct: 225 DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGAS 284
Query: 327 ------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
T L+RN +FYY+ L G +VG + + + S FE+ E G GG+I+D GT I
Sbjct: 285 LDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTI 344
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALD 439
T L+ A+ L++ F SG D C+ +++ VP + HF G L+
Sbjct: 345 TYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHF-KGADLE 403
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NY++ S G C A +S+ +SI GNVQQQ V DL V F P +C
Sbjct: 404 LPGENYMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 149/357 (41%), Positives = 198/357 (55%), Gaps = 19/357 (5%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
SG + G+G Y IG+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP SS+
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSST 232
Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
Y+ + CAAP C L C CLY V YGDGS+++G +T++ + +VKG GC
Sbjct: 233 YANVSCAAPACSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 292
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF---NSAR 322
G NEGLF +AGLLGLG G SL Q A+CL R S +G L+F + A
Sbjct: 293 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS-GTGYLDFGPGSPAA 351
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G T P++ + TFYYVG+TG VGGQ + IP S+F G IVD GT ITR
Sbjct: 352 VGARQTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFST-----AGTIVDSGTVITR 405
Query: 383 LQTQAYNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L AY+SLR +F A K ++L DTCYDF+G+ V +P VSL F G LD+
Sbjct: 406 LPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDV 465
Query: 441 PAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + S C FA + I+GN Q + V +D+ VGF+P C
Sbjct: 466 NASGIMYAA-SLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 161/451 (35%), Positives = 232/451 (51%), Gaps = 48/451 (10%)
Query: 92 KTRHNDYRSLVLSRLERDSARVNTLITKL-----QLAIYNVDRHELKPAEAQIL------ 140
K R ++ + + RD AR+ TL T++ Q I + + + +P E QI
Sbjct: 3 KDRKSEGKESFVESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKERP-EKQIKTVVATA 61
Query: 141 --PEDFST--------PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC 190
PE + T + SG + GSGEYF + +GTPP+ +S++LDTGSD+NW+QC PC
Sbjct: 62 ASPESYGTGLSGQLMATLESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC 121
Query: 191 TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS----ACRANR--CLYQVAYGDGSFT 244
+C++Q+ P +DPK SSS+ + C P+C + C+A C Y YGD S T
Sbjct: 122 HDCFEQNGPYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNT 181
Query: 245 VGDLVTETVSF------GNS--GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI 296
GD TET + G S V+ + GCGH N GLF G++GLLGLG G LS + Q+
Sbjct: 182 TGDFATETFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQL 241
Query: 297 KA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGL 345
++ S +YCLVDR+S + + D + P + + VDTFYYV +
Sbjct: 242 QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQI 301
Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
VGG+ + IP S + M G GG IVD GT ++ AY ++D+FV+
Sbjct: 302 KSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIV 361
Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SS 464
+ D CY+ SG+ + +P + F G + P +NY I +D C A T S
Sbjct: 362 QDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRS 421
Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
ALSIIGN QQQ V +D +R+G+ P C
Sbjct: 422 ALSIIGNYQQQNFHVLYDTKKSRLGYAPMNC 452
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 156/416 (37%), Positives = 220/416 (52%), Gaps = 27/416 (6%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL------PEDFSTPVVSGASQGS 156
LS DS + T I K+Q I N H L A + P+D + + + GS
Sbjct: 48 LSLRHVDSGKNLTKIQKIQRGI-NRGFHRLNRLGAVAVLAVASNPDD-TNNIKAPTHGGS 105
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GE+ + +G P +++ ++DTGSD+ W QC+PCTEC+ Q PIFDP+ SSSYS + C++
Sbjct: 106 GEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSS 165
Query: 217 PQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
C +L S C ++ C Y YGD S T G L TET +F + S+ GI GCG +NEG
Sbjct: 166 GLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEG 225
Query: 275 L-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----------SPASGVLEFNSAR 322
F +GL+GLG G LSL Q+K T +YCL + S ASG++ A
Sbjct: 226 DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGAN 285
Query: 323 --GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
G T L+RN +FYY+ L G +VG + + + S FE+ E G GG+I+D GT I
Sbjct: 286 LDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTI 345
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALD 439
T L+ A+ L++ F SG D C+ + +++ VP + HF G L+
Sbjct: 346 TYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHF-KGADLE 404
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NY++ S G C A +S+ +SI GNVQQQ V DL V F P +C
Sbjct: 405 LPGENYMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVTFVPTEC 459
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 164/418 (39%), Positives = 228/418 (54%), Gaps = 19/418 (4%)
Query: 87 REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDR--HELKPAEAQILPEDF 144
R + H +R RL+ + N +TKL+ + V R + L+ +A L
Sbjct: 29 RALEHPKMQKGFRV----RLKHVDSGKN--LTKLERIRHGVKRGRNRLQRLQAMALVASS 82
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S+ + + G+GE+ ++ +GTPP +S +LDTGSD+ W QC+PCT+C+ QS PIFDPK
Sbjct: 83 SSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPK 142
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
SSS+S L C++ C++L S+C N C Y +YGD S T G L +ET++FG + SV +
Sbjct: 143 KSSSFSKLSCSSQLCEALPQSSCN-NGCEYLYSYGDYSSTQGILASETLTFGKA-SVPNV 200
Query: 265 ALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG 323
A GCG DNEG F AGL+GLG G LSL Q+K +YCL D + L S
Sbjct: 201 AFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLAS 260
Query: 324 GDA-----VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
+A T PLI + +FYY+ L G SVG + I S F + + G GG+I+D GT
Sbjct: 261 VNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGT 320
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKA 437
IT L+ A+N + F +SG D C+ SG ++ VP + HF G
Sbjct: 321 TITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFD-GAD 379
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+LPA+NY+I S G C A +SS +SI GNVQQQ V DL + F P +C
Sbjct: 380 LELPAENYMIGDSSMGVACLAMG-SSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 149/357 (41%), Positives = 202/357 (56%), Gaps = 19/357 (5%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
SG + G+G Y +G+GTP ++++V DTGSD W+QC+PC CY+Q + +FDP SS+
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSST 228
Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
Y+ + CAAP C L++ C CLY V YGDGS+++G +T++ + +VKG GC
Sbjct: 229 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 288
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGD 325
G NEGLF +AGLLGLG G SL Q A+CL R S +G L+F +
Sbjct: 289 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSPAA 347
Query: 326 A---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
A +T P++ + TFYY+G+TG VGGQ + IP S+F G IVD GT ITR
Sbjct: 348 ASARLTTPMLTDNG-PTFYYIGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 401
Query: 383 LQTQAYNSLR--DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L AY+SLR + A K V+L DTCYDF+G+ V +PTVSL F G LD+
Sbjct: 402 LPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 461
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + SA C AFA + I+GN Q + V++D+ VGF P C
Sbjct: 462 DASGIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 167/505 (33%), Positives = 255/505 (50%), Gaps = 53/505 (10%)
Query: 13 TTILFSFCLFTS--ASSRGLS-ETATTVLDVSSALQQTEHILSFEPETL---EPFAEESE 66
T L F L+++ +S RGL+ + T L S L HI S P ++ P ++
Sbjct: 7 TIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNV-HITSLMPSSVCSPSPKGDDKR 65
Query: 67 TAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSR-LERDSARVNTLITKLQLAIY 125
+ E +H K + RS ++ L++D +RVN++ + +LA
Sbjct: 66 ASLEV------------IHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSI--RSRLAKN 111
Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
D +LK ++ + P SG++ G+G Y +G+GTP R + + DTGSD+ W
Sbjct: 112 PADGGKLKGSKVTL-------PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWT 164
Query: 186 QCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYG 239
QC PC CY Q +PIF+P S+SY+ + C++P C L + +C A+ C+Y + YG
Sbjct: 165 QCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYG 224
Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQI 296
D S++VG + ++ ++ GCG +N GLFVG AGL+GLG LSL T Q
Sbjct: 225 DQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQK 284
Query: 297 KATSLAYCLVDRDSPASGVLEFNSARGGDAVT--APLIRNKKVDTFYYVGLTGFSVGGQA 354
+YCL S ++G L F S G P + N + +FY++ L SVGG+
Sbjct: 285 YGKLFSYCLPSTSS-STGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRK 343
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
+ S+F G I+D GT I+RL AY+ LR SF + + ++ DTC
Sbjct: 344 LSTSASVFST-----AGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTC 398
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSA--LSIIG 470
YDFS +V VP ++L+F G +DL Y++ + C AFA S A ++I+G
Sbjct: 399 YDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQV---CLAFAGNSDATDIAILG 455
Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
NVQQ+ V +D+A R+GF P C
Sbjct: 456 NVQQKTFDVVYDVAGGRIGFAPGGC 480
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 139/360 (38%), Positives = 199/360 (55%), Gaps = 17/360 (4%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKT 205
P +G S + E+ +G GTP + ++++ DTGSD++W+QC PC+ CY+Q DPIFDP
Sbjct: 123 PDSTGTSLDTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTK 182
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S++YS +PC PQC + D S C CLY+V YGDGS + G L ET+S ++ ++ G A
Sbjct: 183 SATYSVVPCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRALPGFA 242
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEF---N 319
GCG N G F GL+GLG G LSL+ Q A+ + +YCL D+ G L
Sbjct: 243 FGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCL-PSDNTTHGYLTIGPTT 301
Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
A D +++ + +FY+V L +GG + +PP+LF D G +D GT
Sbjct: 302 PASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTI 356
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
+T L +AY +LRD F KP FDTCYDF+G ++ +P VS F G D
Sbjct: 357 LTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFD 416
Query: 440 LPAKNYLI-PVDSA---GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L LI P D+A G F P++ +I+GN+QQ+ T V +D+A ++GF C
Sbjct: 417 LSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 251 bits (642), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 147/403 (36%), Positives = 220/403 (54%), Gaps = 26/403 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L+RD RV+++ +LA P+ A + S P G G+ Y +G+
Sbjct: 91 LDRDQDRVDSI---HRLAAARPSSTADDPSSAS---KGVSLPARRGVPLGTANYIVSVGL 144
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP R +V DTGSD++W+QC+PC CYQQ DP+FDP S++YS +PC A +C+ LD
Sbjct: 145 GTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLDSG 204
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFG------NSGSVKGIALGCGHDNEGLFVGS 279
+C + +C Y+V YGD S T G+L +T++ G +S ++ GCG D+ GLF +
Sbjct: 205 SCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKA 264
Query: 280 AGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
GL GLG +SL Q A +YCL S A G L SA +A ++
Sbjct: 265 DGLFGLGRDRVSLASQAAAKYGAGFSYCLPS-SSTAEGYLSLGSAAPPNARFTAMVTRSD 323
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+FYY+ L G V G+ V++ P++F G ++D GT ITRL ++AY +LR SF
Sbjct: 324 TPSFYYLNLVGIKVAGRTVRVSPAVFRTP-----GTVIDSGTVITRLPSRAYAALRSSFA 378
Query: 397 RLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
L + K +++ DTCYDF+G V++P+V+L F G L+L L V +
Sbjct: 379 GLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLY-VANKSQ 437
Query: 455 FCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA ++++I+GN+QQ+ V +D+AN ++GF C
Sbjct: 438 ACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 251 bits (640), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 148/352 (42%), Positives = 194/352 (55%), Gaps = 19/352 (5%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLP 213
G+G Y IG+GTP ++++V DTGSD W+QC PC CY+Q + +FDP SS+ + +
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANIS 241
Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
CAAP C L C CLY V YGDGS+++G +T++ + ++KG GCG NE
Sbjct: 242 CAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNE 301
Query: 274 GLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
GLF +AGLLGLG G SL Q A+C R S +G L+F AV+
Sbjct: 302 GLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSS-GTGYLDFGPGS-SPAVSTK 359
Query: 331 LIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
L VD TFYYVGLTG VGG+ + IPPS+F G IVD GT ITRL A
Sbjct: 360 LTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTT-----AGTIVDSGTVITRLPPAA 414
Query: 388 YNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
Y+SLR +F A K ++L DTCYDF+G+ V +PTVSL F G +LD+ A
Sbjct: 415 YSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASG- 473
Query: 446 LIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+I S C FA + I+GN Q + V +D+ VGF+P C
Sbjct: 474 IIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 157/421 (37%), Positives = 218/421 (51%), Gaps = 30/421 (7%)
Query: 89 ILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPV 148
+ H H +Y L L L R + R + +++L V R +A P D PV
Sbjct: 61 LTHVDAHGNYTKLQL--LRRAARRSHHRMSRL------VARTATGSVKAAAAP-DLQVPV 111
Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
G+GE+ + +GTP ++ ++DTGSD+ W QC+PC EC+ QS P+FDP +SS+
Sbjct: 112 ----HAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSST 167
Query: 209 YSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
YS LPC++ C L S C A C Y YGD S T G L ET + + + G+A
Sbjct: 168 YSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKT-KLPGVAF 226
Query: 267 GCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----- 320
GCG NEG F AGL+GLG G LSL Q+ +YCL D + L S
Sbjct: 227 GCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTSLDDTSKSPLLLGSLAAIS 286
Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
A T PLI+N +FYYV L +VG + +P S F + + G GG+IVD G
Sbjct: 287 TDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSG 346
Query: 378 TAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGA 434
T+IT L+ Q Y L+ +F ++ + S V L D C+ SG+ V VP + LHF
Sbjct: 347 TSITYLELQGYRPLKKAFAAQMKLPVADGSAVGL-DLCFKAPASGVDDVEVPKLVLHFDG 405
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
G LDLPA+NY++ ++G C S LSIIGN QQQ + +D+ + + F P +
Sbjct: 406 GADLDLPAENYMVLDSASGALCLTVM-GSRGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQ 464
Query: 495 C 495
C
Sbjct: 465 C 465
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 142/356 (39%), Positives = 196/356 (55%), Gaps = 22/356 (6%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G+GE+ + +GTP ++ ++DTGSD+ W QC+PC EC+ QS P+FDP +SS+Y+ LPC
Sbjct: 98 GNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPC 157
Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
++ C L S C + +C Y YGD S T G L ET + + + +A GCG NEG
Sbjct: 158 SSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKT-KLPDVAFGCGDTNEG 216
Query: 275 -LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS--------ARGGD 325
F AGL+GLG G LSL Q+ +YCL D + L S A
Sbjct: 217 DGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTSLDDTSKSPLLLGSLATISESAAAASS 276
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
T PLIRN +FYYV L G +VG + +P S F + + G GG+IVD GT+IT L+
Sbjct: 277 VQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYLEL 336
Query: 386 QAYNSLRDSFVRLAGNLK----PTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALD 439
Q Y +L+ +F A +K SG+ L DTC++ SG+ V VP + H G LD
Sbjct: 337 QGYRALKKAF---AAQMKLPAADGSGIGL-DTCFEAPASGVDQVEVPKLVFHLD-GADLD 391
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LPA+NY++ +G C S LSIIGN QQQ + +D+ N + F P +C
Sbjct: 392 LPAENYMVLDSGSGALCLTVM-GSRGLSIIGNFQQQNIQFVYDVGENTLSFAPVQC 446
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 160/419 (38%), Positives = 228/419 (54%), Gaps = 18/419 (4%)
Query: 86 SREIL-HKTRHNDYRSLVLSRLER-DSARVNTLITKLQLAIYNVDRHELKPAEAQILPED 143
SR +L H N +R+ +L+ DS + T ++Q + RH L+ +A L
Sbjct: 27 SRRVLEHPKVQNGFRA----KLKHVDSGKNLTKFERIQHGVKR-GRHRLQRFKAMALVAS 81
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
++ + + G+GE+ ++ +GTPP +S ++DTGSD+ W QC+PCT+C+ Q PIFDP
Sbjct: 82 SNSEIDAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDP 141
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
K SSS+S L C++ C++L S C ++ C Y YGD S T G L +ET++FG SV
Sbjct: 142 KKSSSFSKLSCSSKLCEALPQSTC-SDGCEYLYGYGDYSSTQGMLASETLTFGKV-SVPE 199
Query: 264 IALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS-- 320
+A GCG DNEG F +GL+GLG G LSL Q+K +YCL D + L S
Sbjct: 200 VAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLA 259
Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
A + T PLI+N +FYY+ L G SVG ++ I S F + E G GG+I+D G
Sbjct: 260 SVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSG 319
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGK 436
T IT L+ A++ + F SG + C+ SG + VP + HF G
Sbjct: 320 TTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFD-GA 378
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+LPA+NY+I S G C A +SS +SI GN+QQQ V DL + F P +C
Sbjct: 379 DLELPAENYMIADASMGVACLAMG-SSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 143/366 (39%), Positives = 200/366 (54%), Gaps = 20/366 (5%)
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
D PV G+GE+ + +GTP +S ++DTGSD+ W QC+PC +C++QS P+FD
Sbjct: 93 DLQVPV----HAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD 148
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
P +SS+Y+ +PC++ C L S C A++C Y YGD S T G L TET + S +
Sbjct: 149 PSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-KL 207
Query: 262 KGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
G+ GCG NEG F AGL+GLG G LSL Q+ +YCL D + L S
Sbjct: 208 PGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGS 267
Query: 321 ARG--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G T PLI+N +FYYV L +VG + +P S F + + G GG+
Sbjct: 268 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGV 327
Query: 373 IVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVS 429
IVD GT+IT L+ Q Y +L+ +F ++A SGV L D C+ G+ V VP +
Sbjct: 328 IVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGL-DLCFRAPAKGVDQVEVPRLV 386
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
HF G LDLPA+NY++ +G C S LSIIGN QQQ + +D+ ++ +
Sbjct: 387 FHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQQNFQFVYDVGHDTLS 445
Query: 490 FTPNKC 495
F P +C
Sbjct: 446 FAPVQC 451
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/358 (43%), Positives = 202/358 (56%), Gaps = 22/358 (6%)
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
G S G+ Y IG+GTPP +F++V DTGSD W+QCRPC CY+Q D +FDP SS+Y
Sbjct: 155 GLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTY 214
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
+ + CA P C LD S C A CLY + YGDGS+TVG +T++ ++KG GCG
Sbjct: 215 ANVSCADPACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQD-AIKGFKFGCG 273
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF----NSAR 322
N GLF +AGLLGLG G S+T Q S +YCL S A+G LEF S+
Sbjct: 274 EKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCL-PASSAATGYLEFGPLSPSSS 332
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAIT 381
G +A T P++ +K TFYYVGLTG VGG+ + IP S+F + G +VD GT IT
Sbjct: 333 GSNAKTTPMLTDKG-PTFYYVGLTGIRVGGKQLGAIPESVFS-----NSGTLVDSGTVIT 386
Query: 382 RL--QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
RL A S + A K + ++ DTCYDF+GL V +PTVSL F G LD
Sbjct: 387 RLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGACLD 446
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L A + + S C FA ++ I+GN QQ+ V +D++ VGF P C
Sbjct: 447 LDASGIVYAI-SQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 143/366 (39%), Positives = 200/366 (54%), Gaps = 20/366 (5%)
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
D PV G+GE+ + +GTP +S ++DTGSD+ W QC+PC +C++QS P+FD
Sbjct: 83 DLQVPV----HAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD 138
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
P +SS+Y+ +PC++ C L S C A++C Y YGD S T G L TET + S +
Sbjct: 139 PSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-KL 197
Query: 262 KGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
G+ GCG NEG F AGL+GLG G LSL Q+ +YCL D + L S
Sbjct: 198 PGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGS 257
Query: 321 ARG--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G T PLI+N +FYYV L +VG + +P S F + + G GG+
Sbjct: 258 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGV 317
Query: 373 IVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVS 429
IVD GT+IT L+ Q Y +L+ +F ++A SGV L D C+ G+ V VP +
Sbjct: 318 IVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGL-DLCFRAPAKGVDQVEVPRLV 376
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
HF G LDLPA+NY++ +G C S LSIIGN QQQ + +D+ ++ +
Sbjct: 377 FHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQQNFQFVYDVGHDTLS 435
Query: 490 FTPNKC 495
F P +C
Sbjct: 436 FAPVQC 441
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 168/483 (34%), Positives = 245/483 (50%), Gaps = 56/483 (11%)
Query: 62 AEESETAAESFPLNSSSSFSLPLHSREILHKT--RHNDYRSLVLSRLERDSARVNTL--- 116
EE++ +E+FP S+ LH + H++ + + ++ V+ RD R+ L
Sbjct: 81 GEETDEESEAFPAPKPHKNSVKLHLK---HRSGSKGAEPKNSVIDSTVRDLTRIQNLHRR 137
Query: 117 ---------ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG---------ASQGSGE 158
I++LQ + KP A P ST VSG S GSGE
Sbjct: 138 VIENRNQNTISRLQRLQKEQPKQSFKPVFA---PAASSTSPVSGQLVATLESGVSLGSGE 194
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
YF + VGTPP+ FS++LDTGSD+NW+QC PC C++QS P +DPK SSS+ + C P+
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 254
Query: 219 CKSLDV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSF------GNS--GSVKGI 264
C+ + + C+A C Y YGDGS T GD ET + G S V+ +
Sbjct: 255 CQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENV 314
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSA 321
GCGH N GLF G+AGLLGLG G LS Q+++ S +YCLVDR+S AS +
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 374
Query: 322 RGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
+ ++ P + ++ VDTFYYV + V + ++IP + + G GG I
Sbjct: 375 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTI 434
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT +T AY ++++FVR + G+ CY+ SG+ + +P + F
Sbjct: 435 IDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILFA 494
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G + P +NY I +D C A SALSIIGN QQQ + +D+ +R+G+ P
Sbjct: 495 DGAVWNFPVENYFIQID-PDVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAP 553
Query: 493 NKC 495
KC
Sbjct: 554 MKC 556
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/383 (38%), Positives = 207/383 (54%), Gaps = 18/383 (4%)
Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
N RH+L A+ S V A G+GE+ + +GTP +S ++DTGSD+ W
Sbjct: 43 NYSRHQLLRRAARRSHHRMSRLVPVHA--GNGEFLMDVSIGTPALAYSAIVDTGSDLVWT 100
Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFT 244
QC+PC +C++QS P+FDP +SS+Y+ +PC++ C L S C A++C Y YGD S T
Sbjct: 101 QCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSST 160
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAY 303
G L TET + S + G+ GCG NEG F AGL+GLG G LSL Q+ +Y
Sbjct: 161 QGVLATETFTLAKS-KLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSY 219
Query: 304 CLVDRDSPASGVLEFNSARG--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
CL D + L S G T PLI+N +FYYV L +VG +
Sbjct: 220 CLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRI 279
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTC 414
+P S F + + G GG+IVD GT+IT L+ Q Y +L+ +F ++A SGV L D C
Sbjct: 280 SLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGL-DLC 338
Query: 415 YD--FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
+ G+ V VP + HF G LDLPA+NY++ +G C S LSIIGN
Sbjct: 339 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNF 397
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQQ + +D+ ++ + F P +C
Sbjct: 398 QQQNFQFVYDVGHDTLSFAPVQC 420
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 158/403 (39%), Positives = 223/403 (55%), Gaps = 27/403 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS---TPVVSGASQGSGEYFSR 162
LE+ + +N +TK +L + R E + + + S TPV +G SGEY
Sbjct: 46 LEQVDSGMN--LTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAG----SGEYLMN 99
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+ +GTP S ++DTGSD+ W QC PCT+C+ Q PIF+P+ SSS+S LPC + C+ L
Sbjct: 100 VAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDL 159
Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AG 281
+C N C Y YGDGS T G + TET +F S SV IA GCG DN+G G+ AG
Sbjct: 160 PSESCY-NDCQYTYGYGDGSSTQGYMATETFTFETS-SVPNIAFGCGEDNQGFGQGNGAG 217
Query: 282 LLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP---LIRNKKVD 338
L+G+G G LSL Q+ +YC+ S + L SA G +P LI +
Sbjct: 218 LIGMGWGPLSLPSQLGVGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNP 277
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
T+YY+ L G +VGG + IP S F++ + G GG+I+D GT +T L AYN++ +F
Sbjct: 278 TYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ 337
Query: 399 AGNLKP----TSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
NL P +SG++ TC+ S +V+VP +S+ F G L+L +N LI + G
Sbjct: 338 I-NLSPVDESSSGLS---TCFQLPSDGSTVQVPEISMQFDGG-VLNLGEENVLIS-PAEG 391
Query: 454 TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A +S +SI GN+QQQ T+V +DL N V F P +C
Sbjct: 392 VICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 150/402 (37%), Positives = 213/402 (52%), Gaps = 28/402 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D ARVN++ +KL + + + +++ LP G++ GSG Y +G+
Sbjct: 89 LRLDQARVNSIHSKLS---KKLTTNHVSQSQSTDLPAK------DGSTLGSGNYIVTVGL 139
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
GTP S++ DTGSD+ W QC+PC CY Q +PIF+P S+SY + C++ C SL
Sbjct: 140 GTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSS 199
Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
+ +C A+ C+Y + YGD SF+VG L + + +S G+ GCG +N+GLF G
Sbjct: 200 ATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGV 259
Query: 280 AGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVT-APLIRNK 335
AGLLGLG LS Q +YCL S +G L F SA +V P+
Sbjct: 260 AGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSAS-YTGHLTFGSAGISRSVKFTPISTIT 318
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
+FY + + +VGGQ + IP ++F G ++D GT ITRL +AY +LR SF
Sbjct: 319 DGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSF 373
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
TSGV++ DTC+D SG ++V +P V+ F G ++L +K +
Sbjct: 374 KAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFKIS-QV 432
Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA S S +I GNVQQQ V +D A RVGF PN C
Sbjct: 433 CLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 474
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 152/363 (41%), Positives = 205/363 (56%), Gaps = 25/363 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
S P+ G S G G Y + +G+GTP ++MV+DTGS + WLQC PC C++Q P++DP
Sbjct: 120 SVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDP 179
Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
+ SS+Y+ +PC+A QC +L+ SAC N C+YQ +YGD SF+VG L +TVSFG
Sbjct: 180 RASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFG- 238
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPAS- 313
SGS GCG DNEGLF SAGL+GL LSL Q+ + S +YCL +PAS
Sbjct: 239 SGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL---PTPAST 295
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
G L G P+ + + Y+V L+G SVGG + + P+ E I
Sbjct: 296 GYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPA-----EYSSLPTI 350
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHF 432
+D GT ITRL T Y +L + ++ ++ DTC F G S +RVP V++ F
Sbjct: 351 IDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTC--FQGQASQLRVPAVAMAF 408
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G L L +N LI VD + T C AFAPT S +IIGN QQQ V +D+A +R+GF
Sbjct: 409 AGGATLKLATQNVLIDVDDS-TTCLAFAPTDST-TIIGNTQQQTFSVVYDVAQSRIGFAA 466
Query: 493 NKC 495
C
Sbjct: 467 GGC 469
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 164/465 (35%), Positives = 239/465 (51%), Gaps = 54/465 (11%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL----------QLAIYNVDRH 130
S+ LH ++ T + S+ S + RD AR+ TL T++ +L NV+R
Sbjct: 99 SVKLHLKKRSTNTANKPKESITESAV-RDLARIQTLHTRITERKNQDTTSRLKKSNVERK 157
Query: 131 E-----LKPAEAQILPEDFS--------TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
+ PAE+ PE ++ + SG S GSGEYF + +G+PP+ FS++LD
Sbjct: 158 KPMEEVSSPAES---PESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILD 214
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACR--ANR 231
TGSD+NW+QC PC +C++Q+ P +DPK S S+ + C P+C+ + C+
Sbjct: 215 TGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQS 274
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGS---------VKGIALGCGHDNEGLFVGSAGL 282
C Y YGD S T GD ET + + S V+ + GCGH N GLF G+AGL
Sbjct: 275 CPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGL 334
Query: 283 LGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIR 333
LGLG G LS + Q+++ S +YCLVDRDS S + D +T P LI
Sbjct: 335 LGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIA 394
Query: 334 NKK--VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
K+ VDTFYY+ + VGG+ +QIP + + G GG I+D GT ++ AY +
Sbjct: 395 GKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII 454
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
+++F+R K + CY+ SG + P + F G + P +NY I +
Sbjct: 455 KEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQ 514
Query: 452 AGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A T SALSIIGN QQQ + +D N+R+G+ P +C
Sbjct: 515 LDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 164/465 (35%), Positives = 239/465 (51%), Gaps = 54/465 (11%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL----------QLAIYNVDRH 130
S+ LH ++ T + S+ S + RD AR+ TL T++ +L NV+R
Sbjct: 99 SVKLHLKKRSTNTANKPKESITESAV-RDLARIQTLHTRITERKNQDTTSRLKKSNVERK 157
Query: 131 E-----LKPAEAQILPEDFS--------TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
+ PAE+ PE ++ + SG S GSGEYF + +G+PP+ FS++LD
Sbjct: 158 KPMEEVSSPAES---PESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILD 214
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACR--ANR 231
TGSD+NW+QC PC +C++Q+ P +DPK S S+ + C P+C+ + C+
Sbjct: 215 TGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQS 274
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGS---------VKGIALGCGHDNEGLFVGSAGL 282
C Y YGD S T GD ET + + S V+ + GCGH N GLF G+AGL
Sbjct: 275 CPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGL 334
Query: 283 LGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIR 333
LGLG G LS + Q+++ S +YCLVDRDS S + D +T P LI
Sbjct: 335 LGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIA 394
Query: 334 NKK--VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
K+ VDTFYY+ + VGG+ +QIP + + G GG I+D GT ++ AY +
Sbjct: 395 GKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII 454
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
+++F+R K + CY+ SG + P + F G + P +NY I +
Sbjct: 455 KEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQ 514
Query: 452 AGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A T SALSIIGN QQQ + +D N+R+G+ P +C
Sbjct: 515 LDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 161/462 (34%), Positives = 233/462 (50%), Gaps = 45/462 (9%)
Query: 76 SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL-----QLAIYNVDRH 130
S + L L R I R + ++ ++ RD R+ TL ++ Q A+ +++
Sbjct: 96 SKQTLKLHLKHRWI---NRDSTHKESFVASTTRDLTRIQTLHKRILEKKNQNALSRLNKE 152
Query: 131 ELK-----PAE------AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
E K PA A L + SG S GSGEYF + +GTPPR FS++LDTG
Sbjct: 153 EPKQPVVAPAASPESYPANGLSGQLMATLESGVSLGSGEYFMDVFIGTPPRHFSLILDTG 212
Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR--CL 233
SD+NW+QC PC +C+ Q+ P +DPK SSS+ + C P+C + C+A C
Sbjct: 213 SDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCP 272
Query: 234 YQVAYGDGSFTVGDLVTETVSF------GNS--GSVKGIALGCGHDNEGLFVGSAGLLGL 285
Y YGD S T GD ET + G S V+ + GCGH N GLF G+AGLLGL
Sbjct: 273 YFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGL 332
Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI--------RN 334
G G LS + Q+++ S +YCLVDR+S + + D + P + +
Sbjct: 333 GRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKE 392
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
VDTFYYV + VGG+ ++IP + + G GG IVD GT ++ +Y ++D+
Sbjct: 393 NPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDA 452
Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
FV+ + D CY+ SG+ + +P + F G + P +NY I ++
Sbjct: 453 FVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEI 512
Query: 455 FCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A T SALSIIGN QQQ + +D +R+G+ P KC
Sbjct: 513 VCLAILGTPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKC 554
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 141/368 (38%), Positives = 202/368 (54%), Gaps = 24/368 (6%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F+ PV + GEY + + +GTP R FS+++DTGSD+ W+QC PC +CY Q+D +F P
Sbjct: 2 FTAPVAAA----RGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLP 57
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSG 259
TS+S++ L C + C L C C+Y +YGDGS T GD V +T++ G
Sbjct: 58 NTSTSFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQ 117
Query: 260 SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV- 315
V A GCGHDNEG F G+ G+LGLG G LS Q+K+ +YCLVD +P +
Sbjct: 118 QVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTS 177
Query: 316 -LEFNSARG---GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
L F A D P++ N KV T+YYV L G SVG + I ++F++D G G
Sbjct: 178 PLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAG 237
Query: 372 IIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV-- 428
I D GT +T+L AY + + + ++ D C SG ++PTV
Sbjct: 238 TIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLC--LSGFPKDQLPTVPA 295
Query: 429 -SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
+ HF G + LP NY I ++S+ ++CFA +S ++IIG+VQQQ +V +D A +
Sbjct: 296 MTFHFEGGDMV-LPPSNYFIYLESSQSYCFAMT-SSPDVNIIGSVQQQNFQVYYDTAGRK 353
Query: 488 VGFTPNKC 495
+GF P C
Sbjct: 354 LGFVPKDC 361
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 164/465 (35%), Positives = 236/465 (50%), Gaps = 44/465 (9%)
Query: 73 PLNSSSSFSLPLHS--REILHKTRHNDYRSLVLSRLERDSARV-----NTLITKLQLAIY 125
P N S F L S EI K DY L+R++ RV I++LQ +
Sbjct: 91 PQNQSVKFHLKHISMKNEIEPKKSVIDYSIRDLTRIQTLHTRVIEKKNQNTISRLQKSTK 150
Query: 126 NV--DRHELKPAEAQIL---PEDFSTPVV----SGASQGSGEYFSRIGVGTPPRQFSMVL 176
+ KPA + + PE +S+ +V SG S GSGEYF + +GTPP+ +S++L
Sbjct: 151 KQTNSKQSYKPAVSPVAAASPE-YSSQLVATLESGVSLGSGEYFMDVFIGTPPKHYSLIL 209
Query: 177 DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR- 231
DTGSD+NW+QC PC C++QS P +DPK SSS+ + C P+CK + C+
Sbjct: 210 DTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQ 269
Query: 232 -CLYQVAYGDGSFTVGDLVTETVSFG--------NSGSVKGIALGCGHDNEGLFVGSAGL 282
C Y YGD S T GD ET + V+ + GCGH N GLF G+AGL
Sbjct: 270 TCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGL 329
Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI------- 332
LGLG G LS Q+++ S +YCLVDR+S S + + ++ P +
Sbjct: 330 LGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVG 389
Query: 333 -RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
VDTFYYVG+ V G+ ++IP + + + G GG I+D GT +T AY +
Sbjct: 390 GEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEII 449
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
+++F++ + G CY+ SG+ + +P + F G D P +NY I ++
Sbjct: 450 KEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQIE- 508
Query: 452 AGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A T SALSIIGN QQQ + +D+ +R+G+ P KC
Sbjct: 509 PDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 162/478 (33%), Positives = 238/478 (49%), Gaps = 44/478 (9%)
Query: 62 AEESETAAESFPLNSSSSFSLPLH------SREILHKTRHNDYRSLVLSRLERDSARV-- 113
EE++ +E+FP + H S++ K D+ L+R++ RV
Sbjct: 81 GEETDEESEAFPAQKPHQNLVKFHLKHRSGSKDAEPKQSVVDFTLSDLTRIQNLHRRVIE 140
Query: 114 ---NTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV--------SGASQGSGEYFSR 162
I++LQ + + KP A ++PV SG S GSGEYF
Sbjct: 141 KKNQNTISRLQKSQKEQPKQSYKPVVAAPAASRTTSPVSGQLVATLESGVSLGSGEYFMD 200
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+ VGTPP+ FS++LDTGSD+NW+QC PC C++QS P +DPK SSS+ + C P+C+ +
Sbjct: 201 VFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLV 260
Query: 223 DV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSF------GNS--GSVKGIALGC 268
C+A C Y YGDGS T GD ET + G S V+ + GC
Sbjct: 261 SAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENVMFGC 320
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGD 325
GH N GLF G+AGLLGLG G LS Q+++ S +YCLVDR+S AS + +
Sbjct: 321 GHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFGEDKE 380
Query: 326 AVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
++ P + ++ VDTFYYV + V + ++IP + + G GG I+D G
Sbjct: 381 LLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEGAGGTIIDSG 440
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T +T AY ++++FVR + G+ CY+ SG+ + +P + F
Sbjct: 441 TTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDFGILFADEAV 500
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ P +NY I +D SALSIIGN QQQ + +D+ +R+G+ P KC
Sbjct: 501 WNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 558
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 149/403 (36%), Positives = 214/403 (53%), Gaps = 29/403 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L +D +RV ++ ++L + LK ++A + P S ++ GSG Y +G+
Sbjct: 103 LAQDESRVASIQSRLAKNL--AGGSNLKASKATL-------PSKSASTLGSGNYVVTVGL 153
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
G+P R + + DTGSD+ W QC PC CYQQ + IFDP TS SYS + C +P C+ L+
Sbjct: 154 GSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLES 213
Query: 225 S-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
+ C ++ CLY + YGDGS+++G E +S ++ GCG +N GLF G+
Sbjct: 214 ATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGT 273
Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT--APLIRN 334
AGLLGL LSL T Q +YCL S ++G L F S G P N
Sbjct: 274 AGLLGLARNPLSLVSQTAQKYGKVFSYCL-PSSSSSTGYLSFGSGDGDSKAVKFTPSEVN 332
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
+FY++ + G SVG + + IP S+F G I+D GT I+RL Y+S++
Sbjct: 333 SDYPSFYFLDMVGISVGERKLPIPKSVFST-----AGTIIDSGTVISRLPPTVYSSVQKV 387
Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
F L + GV++ DTCYD S ++V+VP + L+F G +DL A +I V
Sbjct: 388 FRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGAEMDL-APEGIIYVLKVSQ 446
Query: 455 FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA S ++IIGNVQQ+ V +D A RVGF P+ C
Sbjct: 447 VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 153/414 (36%), Positives = 215/414 (51%), Gaps = 14/414 (3%)
Query: 86 SREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS 145
SR + + N +R +S DS T +LQ A V R L+ F
Sbjct: 30 SRSLDRRPEKNGFR---VSLRHVDSGGNYTKFERLQRA---VKRGRLRLQRLSAKTASFE 83
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
V + G+GE+ + +GTP +S ++DTGSD+ W QC+PC C+ Q PIFDP+
Sbjct: 84 PSVEAPVHAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
SSS+S LPC++ C +L +S+C ++ C Y+ +YGD S T G L TET +FG++ SV I
Sbjct: 144 SSSFSKLPCSSDLCVALPISSC-SDGCEYRYSYGDHSSTQGVLATETFTFGDA-SVSKIG 201
Query: 266 LGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNSAR 322
GCG DN G + AGL+GLG G LSL Q+ +YCL +D S +L + A
Sbjct: 202 FGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEAT 261
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
A+ PLI+N +FYY+ L G SVG + I S F + + G GG+I+D GT IT
Sbjct: 262 VKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLP 441
L+ A+ +L+ F+ SG + C+ S V VP + HF G L LP
Sbjct: 322 LKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHF-EGVDLKLP 380
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+NY+I + C +SS +SI GN QQQ V DL + F P +C
Sbjct: 381 KENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/352 (40%), Positives = 199/352 (56%), Gaps = 12/352 (3%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
S GSGEY +I +GTPP+QFS ++DTGSD+ W+QC PC C++Q DP+F P SSSYS
Sbjct: 2 SAGSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNA 61
Query: 213 PCAAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
C C +L C N C Y +YGDGS T GD ETV+ N ++ I GCGH+
Sbjct: 62 SCTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTL-NGSTLARIGFGCGHN 120
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNSARGGDA 326
EG F G+ GL+GLG G LSL Q+ ++ +YCLVD+ + S + N+A A
Sbjct: 121 QEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRA 180
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
PL++N+ ++YYVG+ SVG + V PPS F +D G GG+I+D GT IT +
Sbjct: 181 SFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRLA 240
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGKALDLPAKN 444
A+ + R + + CYD S + S+ +P++++H ++P N
Sbjct: 241 AFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHL-TNVDFEIPVSN 299
Query: 445 YLIPVDSAG-TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ VD+ G T C A + TS SIIGNVQQQ + D+AN+RVGF C
Sbjct: 300 LWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNLIVTDVANSRVGFLATDC 350
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 157/394 (39%), Positives = 212/394 (53%), Gaps = 22/394 (5%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
+ RD ARV ++ +KL N E+ A++ LP SG + GSG Y IG+
Sbjct: 89 IRRDQARVESIYSKLSKNSAN----EVSEAKSTELPAK------SGITLGSGNYIVTIGI 138
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
GTP S+V DTGSD+ W QC PC CY Q +P F+P +SS+Y + C++P C+ D
Sbjct: 139 GTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCE--DA 196
Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
+C A+ C+Y + YGD SFT G L E + NS ++ + GCG +N+GLF G AGLLG
Sbjct: 197 ESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLG 256
Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
LG G LSL Q T +YCL S ++G L F SA ++V I + Y
Sbjct: 257 LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNY 316
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
+ + G SVG + + I P+ F + G I+D GT TRL T+ Y LR F +
Sbjct: 317 GIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSS 371
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
K TSG LFDTCYDF+GL +V PT++ F G ++L +P+ C AFA
Sbjct: 372 YKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPI-KISQVCLAFAG 430
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+I GNVQQ V +D+A RVGF PN C
Sbjct: 431 NDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 143/397 (36%), Positives = 214/397 (53%), Gaps = 25/397 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L+RD RV++ I ++ + + + S P G G+ Y +G+
Sbjct: 144 LDRDQDRVDS-IHRMTAGPWTAGQSSAS--------KGVSLPAHRGLRLGTANYIVSVGL 194
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP R +V DTGSD++W+QC+PC CY+Q DP+FDP S++YS +PC A +C LD
Sbjct: 195 GTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC--LDSG 252
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGIALGCGHDNEGLFVGSAGLLG 284
C + +C Y+V YGD S T G+L +T++ G +S ++G GCG D+ GLF + GL G
Sbjct: 253 TCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRADGLFG 312
Query: 285 LGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARG-GDAVTAPLIRNKKVDTF 340
LG +SL Q A +YCL A G L SA A ++ +F
Sbjct: 313 LGRDRVSLASQAAARYGAGFSYCLPS-SWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSF 371
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
YY+ L G V G+ V++ P++F+ G ++D GT ITRL ++AY++LR SF
Sbjct: 372 YYLDLVGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMR 426
Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
K +++ DTCYDF+G V++P+V+L F G L+L L V + C AFA
Sbjct: 427 RYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLY-VANRSQACLAFA 485
Query: 461 PT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+++ I+GN+QQ+ V +DLAN ++GF C
Sbjct: 486 SNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGC 522
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 158/430 (36%), Positives = 222/430 (51%), Gaps = 28/430 (6%)
Query: 74 LNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELK 133
+NS FS + E++H+ R NT T ++ + V R +
Sbjct: 7 INSFYDFSFQVLRTELIHREH------------PSSPLRSNTSKTTTEIFLAAVKRGAER 54
Query: 134 PAE--AQILPED--FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
A+ IL E FSTPV SG +GEY I G+PP++ S+++DTGSD+ W QC P
Sbjct: 55 RAQLSKHILAEGRLFSTPVASG----NGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLP 110
Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLV 249
C C + IFDP SS+Y + CA+ C SL +C + C Y YGDGS T G L
Sbjct: 111 CETCNAAASVIFDPVKSSTYDTVSCASNFCSSLPFQSCTTS-CKYDYMYGDGSSTSGAL- 168
Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLV 306
+ +G++ +A GCGH N G F G+AG++GLG G LSL Q I + +YCLV
Sbjct: 169 STETVTVGTGTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLV 228
Query: 307 DRDS-PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
S S +L +SA G L+ N TFYY LTG SV G+AV P F +D
Sbjct: 229 PLGSTKTSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSID 288
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
+G GG I+D GT +T L+T A+N+L + + + D C+ +G+ +
Sbjct: 289 ASGQGGFILDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTY 348
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
PT++ HF G +LP +N + +D+ G+ C A A S+ SI+GN+QQQ + DL N
Sbjct: 349 PTMTFHF-KGADYELPPENVFVALDTGGSICLAMA-ASTGFSIMGNIQQQNHLIVHDLVN 406
Query: 486 NRVGFTPNKC 495
RVGF C
Sbjct: 407 QRVGFKEANC 416
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 152/413 (36%), Positives = 214/413 (51%), Gaps = 14/413 (3%)
Query: 87 REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST 146
R + + N +R +S DS T +LQ A V R L+ F
Sbjct: 31 RSLDRRPEKNGFR---VSLRHVDSGGNYTKFERLQRA---VKRGRLRLQRLSAKTASFEP 84
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
V + G+GE+ + +GTP +S ++DTGSD+ W QC+PC C+ Q PIFDP+ S
Sbjct: 85 SVEAPVHAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKS 144
Query: 207 SSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
SS+S LPC++ C +L +S+C ++ C Y+ +YGD S T G L TET +FG++ SV I
Sbjct: 145 SSFSKLPCSSDLCVALPISSC-SDGCEYRYSYGDHSSTQGVLATETFTFGDA-SVSKIGF 202
Query: 267 GCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNSARG 323
GCG DN G + AGL+GLG G LSL Q+ +YCL +D S +L + A
Sbjct: 203 GCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATV 262
Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
A+ PLI+N +FYY+ L G SVG + I S F + + G GG+I+D GT IT L
Sbjct: 263 KSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYL 322
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPA 442
+ A+ +L+ F+ SG + C+ S V VP + HF G L LP
Sbjct: 323 KDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHF-EGVDLKLPK 381
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+NY+I + C +SS +SI GN QQQ V DL + F P +C
Sbjct: 382 ENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 244 bits (623), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 170/470 (36%), Positives = 235/470 (50%), Gaps = 42/470 (8%)
Query: 62 AEESETAAESFPLNSSSSFSLPLHSREILH-KTRHNDYRSLVLSRLERDSARVNTLITKL 120
A E + P + S S L L+ R +TR +L E+D+ R+ T+ +
Sbjct: 59 AAEEALDEQKQPASPSPSLKLRLNHRAAEGGRTREES----LLDLAEKDAVRIETMYRRA 114
Query: 121 QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
A R + + L E V SG + GSGEY + VGTPPR+F M++DTGS
Sbjct: 115 --ARSGGGRMPASSSPRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGS 172
Query: 181 DINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA---------CR--- 228
D+NWLQC PC +C++Q P+FDP SSSY + C +C + CR
Sbjct: 173 DLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPG 232
Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLL 283
+ C Y YGD S T GDL E+ + G S V G+ GCGH N GLF G+AGLL
Sbjct: 233 EDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLL 292
Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA-PLIR------ 333
GLG G LS Q++A + +YCLVD S + F A+ A P ++
Sbjct: 293 GLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAP 352
Query: 334 ----NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
+ DTFYYV L G VGG+ + I +++ + G GG I+D GT ++ AY
Sbjct: 353 ASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQ 412
Query: 390 SLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
+R +F+ R++ + + CY+ SG+ VP +SL F G D PA+NY I
Sbjct: 413 VIRHAFMDRMSRSYPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIR 472
Query: 449 VDSAG--TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+D G C A T + +SIIGN QQQ V +DL NNR+GF P +C
Sbjct: 473 LDPDGGSIMCLAVLGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 522
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 244 bits (622), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 156/394 (39%), Positives = 211/394 (53%), Gaps = 22/394 (5%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
+ RD ARV ++ +KL N E+ A++ LP SG + GSG Y IG+
Sbjct: 89 IRRDQARVESIYSKLSKNSAN----EVSEAKSTELPAK------SGITLGSGNYIVTIGI 138
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
GTP S+V DTGSD+ W QC PC CY Q +P F+P +SS+Y + C++P C+ D
Sbjct: 139 GTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCE--DA 196
Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
+C A+ C+Y + YGD SFT G L E + NS ++ + GCG +N+GLF G AGLLG
Sbjct: 197 ESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLG 256
Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
LG G LSL Q T +YCL S ++G L F SA ++V I + Y
Sbjct: 257 LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNY 316
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
+ + G SVG + + I P+ F + G I+D GT TRL T+ Y LR F +
Sbjct: 317 GIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSS 371
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
K TSG LFDTCYDF+GL +V PT++ F ++L +P+ C AFA
Sbjct: 372 YKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPI-KISQVCLAFAG 430
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+I GNVQQ V +D+A RVGF PN C
Sbjct: 431 NDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 144/403 (35%), Positives = 218/403 (54%), Gaps = 28/403 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L +D +RV+ + +K+ + +VDR L+ ++A +P SGA+ GSG Y +G+
Sbjct: 86 LVKDQSRVDFIHSKIAGELESVDR--LRGSKATKIPAK------SGATIGSGNYIVSVGL 137
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
GTP + S++ DTGSD+ W QC+PC CY Q DP+F P S++YS + C++P C L+
Sbjct: 138 GTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLES 197
Query: 225 S-----ACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
C A R C+Y + YGD SF+VG ET++ ++ ++ GCG +N GLF
Sbjct: 198 GTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTDVIENFLFGCGQNNRGLFGS 257
Query: 279 SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGDAVT-APLIRN 334
+AGL+GLG +S+ KQ +YCL + S ++G L F GG A+ P+ +
Sbjct: 258 AAGLIGLGQDKISIVKQTAQKYGQVFSYCL-PKTSSSTGYLTFGGGGGGGALKYTPITKA 316
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
V FY V + G VGG + I S+F G I+D GT ITRL AY++L+ +
Sbjct: 317 HGVANFYGVDIVGMKVGGTQIPISSSVFSTS-----GAIIDSGTVITRLPPDAYSALKSA 371
Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
F + +++ DTCYD S ++++P V F G+ LDL + S
Sbjct: 372 FEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGA-STSQ 430
Query: 455 FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA S ++IIGNVQQ+ +V +D+ ++GF N C
Sbjct: 431 VCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 146/362 (40%), Positives = 209/362 (57%), Gaps = 22/362 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
S P+ G S G G Y +++G+GTP ++MV+DTGS + WLQC PC C++Q P+FDP
Sbjct: 120 SVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDP 179
Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
+ SS+Y+ + C+A QC +L+ SAC A N C+YQ +YGD SF+VG L T+TVSFG+
Sbjct: 180 RASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGS 239
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
+ S GCG DNEGLF SAGL+GL LSL Q+ + S +YCL S +G
Sbjct: 240 T-SYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAAS--TG 296
Query: 315 VLEFNSARGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
L G + + + +D + Y++ L+G SVGG + + PS E I
Sbjct: 297 YLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-----EYSSLPTI 351
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT ITRL T + +L + + + ++ DTC++ + +RVPTV + F
Sbjct: 352 IDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVVMAFA 410
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G ++ L +N LI VD + T C AFAPT S +IIGN QQQ V +D+A +R+GF+
Sbjct: 411 GGASMKLTTRNVLIDVDDS-TTCLAFAPTDST-AIIGNTQQQTFSVIYDVAQSRIGFSAG 468
Query: 494 KC 495
C
Sbjct: 469 GC 470
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 209/362 (57%), Gaps = 22/362 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
S P+ G S G G Y +++G+GTP ++MV+DTGS + WLQC PC C++Q P+FDP
Sbjct: 120 SVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDP 179
Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
+ SS+Y+ + C+A QC +L+ SAC A N C+YQ +YGD SF+VG L T+TVSFG+
Sbjct: 180 RASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGS 239
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
+ GCG DNEGLF SAGL+GL LSL Q+ + S +YCL S +G
Sbjct: 240 T-RYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAAS--TG 296
Query: 315 VLEFNSARGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
L G + + + +D + Y++ L+G SVGG + + PS E I
Sbjct: 297 YLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-----EYSSLPTI 351
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT ITRL T + +L + + + ++ DTC++ + +RVPTV++ F
Sbjct: 352 IDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVAMAFA 410
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G ++ L +N LI VD + T C AFAPT S +IIGN QQQ V +D+A +R+GF+
Sbjct: 411 GGASMKLTTRNVLIDVDDS-TTCLAFAPTDST-AIIGNTQQQTFSVIYDVAQSRIGFSAG 468
Query: 494 KC 495
C
Sbjct: 469 GC 470
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 190/351 (54%), Gaps = 19/351 (5%)
Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
+ +G P ++S ++DTGSD+ W QC+PCTEC+ Q PIFDP+ SSSYS + C++ C +
Sbjct: 2 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNA 61
Query: 222 LDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVG 278
L S C ++ C Y YGD S T G L TET +F + S+ GI GCG +NEG F
Sbjct: 62 LPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQ 121
Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDA----------- 326
+GL+GLG G LSL Q+K T +YCL DS AS L S G
Sbjct: 122 GSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEV 181
Query: 327 -VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
T L+RN +FYY+ L G +VG + + + S FE+ E G GG+I+D GT IT L+
Sbjct: 182 TKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEE 241
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKN 444
A+ L++ F SG D C+ +++ VP + HF G L+LP +N
Sbjct: 242 TAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHF-KGADLELPGEN 300
Query: 445 YLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
Y++ S G C A +S+ +SI GNVQQQ V DL V F P +C
Sbjct: 301 YMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 350
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 133/358 (37%), Positives = 195/358 (54%), Gaps = 24/358 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY +G+G+PPR FS ++DTGSD+ W QC PC C +Q P F+P S+SY+ LPC++
Sbjct: 86 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 145
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS---VKGIALGCGHDNE 273
C +L C N C+YQ YGD + + G L ET +FG + + V ++ GCG+ N
Sbjct: 146 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 205
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGG 324
G +G++G G G LSL Q+ + +YCL SPA+ L F N++ G
Sbjct: 206 GTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSG 265
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRL 383
+ P I N + T Y++ +TG SV G + I PS+F ++E G GG+I+D GT +T L
Sbjct: 266 PVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFL 325
Query: 384 QTQAYNSLRDSFVRLAG----NLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKA 437
AY ++ +FV G N P+ FDTC+ + R V +P + LHF G
Sbjct: 326 AQPAYAMVQGAFVAWVGLPRANATPSD---TFDTCFKWPPPPRRMVTLPEMVLHFD-GAD 381
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++LP +NY++ G C A P+ SIIG+ Q Q + +DL N+ + F P C
Sbjct: 382 MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAPC 438
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 133/358 (37%), Positives = 195/358 (54%), Gaps = 24/358 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY +G+G+PPR FS ++DTGSD+ W QC PC C +Q P F+P S+SY+ LPC++
Sbjct: 83 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 142
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS---VKGIALGCGHDNE 273
C +L C N C+YQ YGD + + G L ET +FG + + V ++ GCG+ N
Sbjct: 143 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 202
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGG 324
G +G++G G G LSL Q+ + +YCL SPA+ L F N++ G
Sbjct: 203 GTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSG 262
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRL 383
+ P I N + T Y++ +TG SV G + I PS+F ++E G GG+I+D GT +T L
Sbjct: 263 PVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFL 322
Query: 384 QTQAYNSLRDSFVRLAG----NLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKA 437
AY ++ +FV G N P+ FDTC+ + R V +P + LHF G
Sbjct: 323 AQPAYAMVQGAFVAWVGLPRANATPSD---TFDTCFKWPPPPRRMVTLPEMVLHFD-GAD 378
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++LP +NY++ G C A P+ SIIG+ Q Q + +DL N+ + F P C
Sbjct: 379 MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAPC 435
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/379 (37%), Positives = 209/379 (55%), Gaps = 38/379 (10%)
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
D+ +PV SG G+Y + I +GTP + FS++ DTGSD+ W+QC+PC C+ Q DPIFD
Sbjct: 28 DYESPVASGG----GDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFD 83
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
P+ SSSY+ + C C SL +C + C Y YGDGS T G L +ETV+ G
Sbjct: 84 PEGSSSYTTMSCGDTLCDSLPRKSCSPD-CDYSYGYGDGSGTRGTLSSETVTLTSTQGEK 142
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD-RDSPASG 314
+ K IA GCGH N G F ++GL+GLG G LS Q+ +YCLV RD+P+
Sbjct: 143 LAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKT 202
Query: 315 VLEF--------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
F +S + P+I N +++FYYV L S+ G+A++IP F++
Sbjct: 203 SPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKP 262
Query: 367 AGDGGIIVDCGTAITRLQTQAYN----SLRD--SFVRLAGNLKPTSGVALFDTCYDFSGL 420
G GG+I D GT +T L Y +LR SF ++ G+ A D CYD SG
Sbjct: 263 DGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGS------SAGLDLCYDVSGS 316
Query: 421 RS---VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSSALSIIGNVQQQG 476
++ +++P + HF G LP +NY I + AGT C A ++ + I GN+ QQ
Sbjct: 317 KASYKMKIPAMVFHF-EGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQN 375
Query: 477 TRVSFDLANNRVGFTPNKC 495
RV +D+ ++++G+ P++C
Sbjct: 376 FRVMYDIGSSKIGWAPSQC 394
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/376 (38%), Positives = 206/376 (54%), Gaps = 22/376 (5%)
Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
+ PAEA + + P +G S G+ E+ +G GTP + ++++ DTGSD++W+QC PC+
Sbjct: 97 IPPAEAPAV----TIPDSTGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCS 152
Query: 192 -ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVT 250
CY+Q DPIFDP S++YS +PC PQC + CLY+V YGDGS T G L
Sbjct: 153 GHCYKQHDPIFDPTKSATYSAVPCGHPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSH 212
Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD 307
ET+S ++ ++ G A GCG N G F GL+GLG G LSL + +YCL
Sbjct: 213 ETLSLTSARALPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPS 272
Query: 308 RDSPASGVLEFNS---ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
++ + G L + A G D V +I+ + +FY+V L VGG + +PP LF
Sbjct: 273 YNT-SHGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFT 331
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
D G ++D GT +T L +AY +LRD F KP FDTCYDF+G ++
Sbjct: 332 RD-----GTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAI 386
Query: 424 RVPTVSLHFGAGKALDLPAKNYLI-PVDSA-GTFCFAFAPTSSAL--SIIGNVQQQGTRV 479
+P VS F G + DL LI P D+A T C AF P S + +I+GN QQ+ T +
Sbjct: 387 FMPLVSFKFSDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEM 446
Query: 480 SFDLANNRVGFTPNKC 495
+D+A ++GF C
Sbjct: 447 IYDVAAEKIGFVSGSC 462
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 157/400 (39%), Positives = 219/400 (54%), Gaps = 21/400 (5%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D AR+++L +L A+A + S P+ GAS G G Y +R+G+
Sbjct: 69 LTHDDARISSLAARLAKTPSARATSLDADADAGLAGSLASVPLSPGASVGVGNYVTRMGL 128
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
GTP Q+ MV+DTGS + WLQC PC C++QS P+F+PK+SS+Y+ + C+A QC
Sbjct: 129 GTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLPS 188
Query: 221 -SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
+L+ SAC +N C+YQ +YGD SF+VG L +TVSFG++ S+ GCG DNEGLF
Sbjct: 189 ATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGST-SLPNFYYGCGQDNEGLFGR 247
Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
SAGL+GL LSL Q+ + S YCL S +SG L S G P++ +
Sbjct: 248 SAGLIGLARNKLSLLYQLAPSLGYSFTYCL--PSSSSSGYLSLGSYNPGQYSYTPMVSSS 305
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
D+ Y++ L+G +V G P I+D GT ITRL T Y++L +
Sbjct: 306 LDDSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALSKAV 360
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
S ++ DTC+ R V P V++ F G AL L A+N L+ VD + T
Sbjct: 361 AAAMKGTSRASAYSILDTCFKGQASR-VSAPAVTMSFAGGAALKLSAQNLLVDVDDS-TT 418
Query: 456 CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFAP SA +IIGN QQQ V +D+ ++R+GF C
Sbjct: 419 CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 457
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 144/379 (37%), Positives = 207/379 (54%), Gaps = 38/379 (10%)
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
D+ +PV SG G+Y + I +GTP + FS++ DTGSD+ W+QC+PC C+ Q DPIFD
Sbjct: 28 DYESPVASGG----GDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFD 83
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
P+ SSSY+ + C C SL +C N C Y YGDGS T G L +ETV+ G
Sbjct: 84 PEGSSSYTTMSCGDTLCDSLPRKSCSPN-CDYSYGYGDGSGTRGTLSSETVTLTSTQGEK 142
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD-RDSPASG 314
+ K IA GCGH N G F ++GL+GLG G LS Q+ +YCLV RD+P+
Sbjct: 143 LAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKT 202
Query: 315 VLEF--------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
F +S + P+I N +++FYYV L S+ G+A++IP F++
Sbjct: 203 SPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKP 262
Query: 367 AGDGGIIVDCGTAITRLQTQAYN----SLRD--SFVRLAGNLKPTSGVALFDTCYDFSGL 420
G GG+I D GT +T L Y +LR SF + G+ A D CYD SG
Sbjct: 263 DGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGS------SAGLDLCYDVSGS 316
Query: 421 RS---VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSSALSIIGNVQQQG 476
++ ++P + HF G LP +NY I + AGT C A ++ + I GN+ QQ
Sbjct: 317 KASYKKKIPAMVFHF-EGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQN 375
Query: 477 TRVSFDLANNRVGFTPNKC 495
RV +D+ ++++G+ P++C
Sbjct: 376 FRVMYDIGSSKIGWAPSQC 394
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 139/356 (39%), Positives = 193/356 (54%), Gaps = 19/356 (5%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY +G+GTPPR +S +LDTGSD+ W QC PC C Q P FDP S SY+ LPC +
Sbjct: 87 GEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNS 146
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNE 273
P C +L C N C+YQ YGD + T G L ET +FG + +V IA GCG+ N
Sbjct: 147 PMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNA 206
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF--------NSARGGD 325
G +G++G G G LSL Q+ + +YCL SP L F SA G+
Sbjct: 207 GSLFNGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLYFGAYATLNSTSASTGE 266
Query: 326 AV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRL 383
V + P I N + T YY+ +TG SVGG+ + I PS+F +++A G GG+I+D G+ IT L
Sbjct: 267 PVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYL 326
Query: 384 QTQAYNSLRDSFVRLAG--NLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKALD 439
AY+ + +F G TS + DTC+ + + V +P ++ HF G ++
Sbjct: 327 ARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHF-EGANME 385
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NY++ G C A A + SIIG+ Q Q V +D N+ + FTP C
Sbjct: 386 LPLENYMLIDGDTGNLCLAIAASDDG-SIIGSFQHQNFHVLYDNENSLLSFTPATC 440
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 140/372 (37%), Positives = 200/372 (53%), Gaps = 26/372 (6%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SG S GSGEYF + VGTPP+ FS++LDTGSD+NW+QC PC C++Q+ P +DPK SSS+
Sbjct: 186 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSF 245
Query: 210 SPLPCAAPQCKSLDV----SACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--- 260
+ C P+C+ + C+ C Y YGD S T GD ET + +
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305
Query: 261 -----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA 312
V+ + GCGH N GLF G+AGLLGLG G LS Q+++ S +YCLVDR+S +
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNS 365
Query: 313 SGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
S + + ++ P + + VDTFYYV + VGG+ ++IP + +
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHL 425
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
G GG I+D GT +T AY ++++F+R CY+ SG+ +
Sbjct: 426 SAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKME 485
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDL 483
+P ++ F G D P +NY I ++ C A T SALSIIGN QQQ + +DL
Sbjct: 486 LPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDL 545
Query: 484 ANNRVGFTPNKC 495
+R+G+ P KC
Sbjct: 546 KKSRLGYAPMKC 557
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 141/365 (38%), Positives = 197/365 (53%), Gaps = 17/365 (4%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
++F +PV +G +GEY + +G+PP+ F +++DTGSD+NW+QC PC CYQQ P F
Sbjct: 26 QEFQSPVKAG----NGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKF 81
Query: 202 DPKTSSSYSPLPCAAPQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
DP S S+ C C +L + AC AN C YQ YGD S T GDL ET+S N
Sbjct: 82 DPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETISLNNGA 141
Query: 260 ---SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPAS 313
SV A GCG N G F G+AGL+GLG G LSL Q+ A +YCLV +S ++
Sbjct: 142 GTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLSA 201
Query: 314 GVLEFNS-ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGG 371
L F S A + ++ N + T+YYV L VGGQ + + PS+F +D++ G GG
Sbjct: 202 SPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGG 261
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
I+D GT IT L AY+++ ++ + D C++ +G+ + VP +
Sbjct: 262 TIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVFK 321
Query: 432 FGAGKALDLPAKNYLIPVD-SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
F G + +N + VD SA T C A S SIIGN+QQQ V +DL ++GF
Sbjct: 322 F-QGADFQMRGENLFVLVDTSATTLCLAMG-GSQGFSIIGNIQQQNHLVVYDLEAKKIGF 379
Query: 491 TPNKC 495
C
Sbjct: 380 ATADC 384
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 165/434 (38%), Positives = 227/434 (52%), Gaps = 26/434 (5%)
Query: 75 NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
N SS S L S E++H RH +V D+ + + Q VD +
Sbjct: 39 NHSSKVSNSL-SLEVVH--RHGPCIGIVNQEKGADAPSNMEIFLRDQ---NRVDSIHARL 92
Query: 135 AEAQILPEDFST--PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
+ + PE +T PV SGAS G+G+Y +G+GTP ++F+++ DTGSDI W QC PC +
Sbjct: 93 SSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVK 152
Query: 193 -CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV-----SACRANRCLYQVAYGDGSFTVG 246
CY+Q +P +P TS+SY + C++ CK + +C ++ CLYQV YGDGS+++G
Sbjct: 153 TCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIG 212
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
TET++ +S K GCG N GLF G+AGLLGLG L+L Q T +Y
Sbjct: 213 FFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSY 272
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
CL S + G L PL + FY + +TG SVGG+ + I S F
Sbjct: 273 CL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS 331
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
G ++D GT ITRL AY+ L +F L + TSG ++FDTCYDFS +V
Sbjct: 332 ------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTV 385
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSF 481
R+P V + F G +D+ L PV+ C AFA S SI GNVQQ+ +V +
Sbjct: 386 RIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVY 445
Query: 482 DLANNRVGFTPNKC 495
D A RVGF P C
Sbjct: 446 DGAKGRVGFAPGGC 459
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 165/434 (38%), Positives = 227/434 (52%), Gaps = 26/434 (5%)
Query: 75 NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
N SS S L S E++H RH +V D+ + + Q VD +
Sbjct: 51 NHSSKVSNSL-SLEVVH--RHGPCIGIVNQEKGADAPSNMEIFLRDQ---NRVDSIHARL 104
Query: 135 AEAQILPEDFST--PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
+ + PE +T PV SGAS G+G+Y +G+GTP ++F+++ DTGSDI W QC PC +
Sbjct: 105 SSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVK 164
Query: 193 -CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV-----SACRANRCLYQVAYGDGSFTVG 246
CY+Q +P +P TS+SY + C++ CK + +C ++ CLYQV YGDGS+++G
Sbjct: 165 TCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIG 224
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
TET++ +S K GCG N GLF G+AGLLGLG L+L Q T +Y
Sbjct: 225 FFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSY 284
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
CL S + G L PL + FY + +TG SVGG+ + I S F
Sbjct: 285 CL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS 343
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
G ++D GT ITRL AY+ L +F L + TSG ++FDTCYDFS +V
Sbjct: 344 ------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTV 397
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSF 481
R+P V + F G +D+ L PV+ C AFA S SI GNVQQ+ +V +
Sbjct: 398 RIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVY 457
Query: 482 DLANNRVGFTPNKC 495
D A RVGF P C
Sbjct: 458 DGAKGRVGFAPGGC 471
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/344 (39%), Positives = 191/344 (55%), Gaps = 16/344 (4%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTP +S ++DTGSD+ W QC+PC +C++QS P+FDP +SS+Y+ +PC++ C L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 225 SAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG-LFVGSAGL 282
S C A++C Y YGD S T G L TET + S + G+ GCG NEG F AGL
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-KLPGVVFGCGDTNEGDGFSQGAGL 291
Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG--------GDAVTAPLIRN 334
+GLG G LSL Q+ +YCL D + L S G T PLI+N
Sbjct: 292 VGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKN 351
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
+FYYV L +VG + +P S F + + G GG+IVD GT+IT L+ Q Y +L+ +
Sbjct: 352 PSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKA 411
Query: 395 F-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
F ++A SGV L D C+ G+ V VP + HF G LDLPA+NY++
Sbjct: 412 FAAQMALPAADGSGVGL-DLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGG 470
Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+G C S LSIIGN QQQ + +D+ ++ + F P +C
Sbjct: 471 SGALCLTVM-GSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 138/354 (38%), Positives = 190/354 (53%), Gaps = 17/354 (4%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY +G+GTP R +S +LDTGSD+ W QC PC C Q P FDP SS+Y L C+A
Sbjct: 90 GEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSA 149
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNE 273
P C +L C C+YQ YGD + T G L ET +FG + ++ I+ GCG+ N
Sbjct: 150 PACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNA 209
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF------NSARGGDAV 327
G +G++G G G LSL Q+ + +YCL SP L F NS
Sbjct: 210 GSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVRSRLYFGAYATLNSTNASTVQ 269
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQ 386
+ P I N + T Y++ +TG SVGG + I P++ + D G GG I+D GT IT L
Sbjct: 270 STPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEP 329
Query: 387 AYNSLRDSFVRLAGNLKPTSGV---ALFDTCYDF--SGLRSVRVPTVSLHFGAGKALDLP 441
AY ++R++FV + P V ++ DTC+ + +SV +P + LHF G +LP
Sbjct: 330 AYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELP 388
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+NY++ S G C A A TSS SIIG+ Q Q V +DL N+ + F P C
Sbjct: 389 LQNYMLVDPSTGGLCLAMA-TSSDGSIIGSYQHQNFNVLYDLENSLLSFVPAPC 441
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/366 (35%), Positives = 202/366 (55%), Gaps = 21/366 (5%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
++ ASQG EY + +GTPP +++ ++DTGSD+ W QC PC C Q P F P S+
Sbjct: 83 ILVAASQG--EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSA 140
Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VK 262
+Y +PC +P C +L AC + + C+YQ YGD + T G L +ET +FG + S V
Sbjct: 141 TYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS 200
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN--- 319
+A GCG+ N G S+G++GLG G LSL Q+ + +YCL SP L F
Sbjct: 201 DVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFA 260
Query: 320 -------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
S+ G + PL+ N + + Y++ L G S+G + + I P +F +++ G GG+
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRS--VRVPTVS 429
+D GT++T LQ AY+++R V + L PT+ + +TC+ + S V VP +
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDME 380
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
LHF G + +P +NY++ + G C A + A +IIGN QQQ + +D+AN+ +
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-TIIGNYQQQNMHILYDIANSLLS 439
Query: 490 FTPNKC 495
F P C
Sbjct: 440 FVPAPC 445
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 160/404 (39%), Positives = 231/404 (57%), Gaps = 27/404 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDF-STPVVSGASQGSGEYFSRIG 164
L D AR+ +L +L + + + + E S P+ G S G G Y +R+G
Sbjct: 67 LTHDHARIASLAARLAKTPSSRPTKLRRGSSSSPDAESLASVPLGPGTSVGVGNYVTRMG 126
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQC---- 219
+GTP + + MV+DTGS + WLQC PC C++QS P+F+P++SSSY+ + C+APQC
Sbjct: 127 LGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAPQCDALT 186
Query: 220 -KSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+L+ S C +N C+YQ +YGD SF+VG L +TVSFG++ SV GCG DNEGLF
Sbjct: 187 TATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGST-SVPNFYYGCGQDNEGLFG 245
Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRN 334
SAGL+GL LSL Q+ + S +YCL S + + + G + T P+ ++
Sbjct: 246 QSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSIGSYNPGQYSYT-PMAKS 304
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
D+ Y++ +TG +V G+ + + S + I+D GT ITRL T Y++L +
Sbjct: 305 SLDDSLYFIKMTGITVAGKPLSVSASAYSSLP-----TIIDSGTVITRLPTDVYSALSKA 359
Query: 395 FVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
+AG +K T S ++ DTC+ R +RVP VS+ F G AL L A N L+ VDS
Sbjct: 360 ---VAGAMKGTPRASAFSILDTCFQGQASR-LRVPQVSMAFAGGAALKLKATNLLVDVDS 415
Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A T C AFAP SA +IIGN QQQ V +D+ N+++GF C
Sbjct: 416 ATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 457
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/366 (35%), Positives = 202/366 (55%), Gaps = 21/366 (5%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
++ ASQG EY + +GTPP +++ ++DTGSD+ W QC PC C Q P F P S+
Sbjct: 83 ILVAASQG--EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSA 140
Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VK 262
+Y +PC +P C +L AC + + C+YQ YGD + T G L +ET +FG + S V
Sbjct: 141 TYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS 200
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN--- 319
+A GCG+ N G S+G++GLG G LSL Q+ + +YCL SP L F
Sbjct: 201 DVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFA 260
Query: 320 -------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
S+ G + PL+ N + + Y++ L G S+G + + I P +F +++ G GG+
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRS--VRVPTVS 429
+D GT++T LQ AY+++R V + L PT+ + +TC+ + S V VP +
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDME 380
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
LHF G + +P +NY++ + G C A + A +IIGN QQQ + +D+AN+ +
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-TIIGNYQQQNMHILYDIANSLLS 439
Query: 490 FTPNKC 495
F P C
Sbjct: 440 FVPAPC 445
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 152/382 (39%), Positives = 208/382 (54%), Gaps = 20/382 (5%)
Query: 127 VDRHELKPAEAQILPEDFST--PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINW 184
VD + + + PE +T PV SGAS G+G+Y +G+GTP ++F+++ DTGSDI W
Sbjct: 37 VDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITW 96
Query: 185 LQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV-----SACRANRCLYQVAY 238
QC PC + CY+Q +P +P TS+SY + C++ CK + +C ++ CLYQV Y
Sbjct: 97 TQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQY 156
Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
GDGS+++G TET++ +S K GCG N GLF G+AGLLGLG L+L Q
Sbjct: 157 GDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAK 216
Query: 299 TS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
T +YCL S + G L PL + FY + +TG SVGG+ +
Sbjct: 217 TYKKLFSYCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQL 275
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
I S F G ++D GT ITRL AY+ L +F L + TSG ++FDTCY
Sbjct: 276 SIDESAFS------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCY 329
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQ 473
DFS +VR+P V + F G +D+ L PV+ C AFA S SI GNVQ
Sbjct: 330 DFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQ 389
Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
Q+ +V +D A RVGF P C
Sbjct: 390 QRTYQVVYDGAKGRVGFAPGGC 411
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 238 bits (607), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 142/401 (35%), Positives = 212/401 (52%), Gaps = 25/401 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L RD RV+ + K+ + A + P+ V G + YF+ + +
Sbjct: 90 LGRDQDRVDAIRRKVA---------AVTTAASSSKPKGVPLQVGWGKYLDTTNYFTSLRL 140
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP + LDTGSD +W+QC+PC +CY+Q + +FDP SS+YS + C++ +C+ L S
Sbjct: 141 GTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRECQELGSS 200
Query: 226 A---CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAG 281
C ++ +C Y++ Y D S+TVG+L +T++ + +V G GCGH+N G F G
Sbjct: 201 HKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAVPGFVFGCGHNNAGSFGEIDG 260
Query: 282 LLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR--NKK 336
LLGLG G SL+ Q+ A +YCL S A+G L F+ A A +
Sbjct: 261 LLGLGRGKASLSSQVAARYGAGFSYCLPSSPS-ATGYLSFSGAAAAAPTNAQFTEMVAGQ 319
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+FYY+ LTG +V G+A+++PPS+F A G I+D GTA + L AY +LR S
Sbjct: 320 HPSFYYLNLTGITVAGRAIKVPPSVF----ATAAGTIIDSGTAFSCLPPSAYAALRSSVR 375
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
G K +FDTCYD +G +VR+P+V+L F G + L L + C
Sbjct: 376 SAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTC 435
Query: 457 FAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AF P ++L ++GN QQ+ V +D+ N +VGF N C
Sbjct: 436 LAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 476
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 145/361 (40%), Positives = 194/361 (53%), Gaps = 22/361 (6%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKT 205
P SG S +G Y I +GTP +F++V DTGSD W+QC+PC CYQQ +P+F P
Sbjct: 153 PAKSGLSLNTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTK 212
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S++Y+ + C + C LD C CLY V YGDGS+TVG +T++ G +VK
Sbjct: 213 SATYANISCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYD-TVKDFR 271
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSAR 322
GCG N GLF +AGL+GLG G S+ Q + AYC + S +G L+F
Sbjct: 272 FGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYC-IPATSSGTGFLDFGPGA 330
Query: 323 GGDA---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
A +T L+ N TFYYVG+TG VGG + IP ++F D G +VD GT
Sbjct: 331 PAAANARLTPMLVDNGP--TFYYVGMTGIKVGGHLLSIPATVFS-----DAGALVDSGTV 383
Query: 380 ITRLQTQAYNSLRDSFVRLAGNL--KPTSGVALFDTCYDFSGLR-SVRVPTVSLHFGAGK 436
ITRL AY LR +F + L K ++ DTCYD +G + S+ +P VSL F G
Sbjct: 384 ITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGA 443
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
LD+ A L D + C AFA + ++I+GN QQ+ V +DL VGF P
Sbjct: 444 CLDVDASGILYVADVSQA-CLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGA 502
Query: 495 C 495
C
Sbjct: 503 C 503
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 146/362 (40%), Positives = 203/362 (56%), Gaps = 21/362 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
S P+ GAS G Y +R+G+GTP + MV+DTGS + WLQC PC+ C++Q+ P+FDP
Sbjct: 117 SVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDP 176
Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
+ S +Y+ + C++ +C +L+ SAC +N C+YQ +YGD S++VG L +TVSFG
Sbjct: 177 RASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFG- 235
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
SGS G GCG DNEGLF SAGL+GL LSL Q+ + + +YCL S A+G
Sbjct: 236 SGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCL-PTSSAAAG 294
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
L S G P+ + + Y+V L+G SV G + +PPS + I+
Sbjct: 295 YLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLP-----TII 349
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV-ALFDTCYDFSGLRSVRVPTVSLHFG 433
D GT ITRL Y +L + + P + ++ DTC+ S +RVP V + F
Sbjct: 350 DSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSA-AGLRVPRVDMAFA 408
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G L L N LI VD + T C AFAPT +IIGN QQQ V +D+A +R+GF
Sbjct: 409 GGATLALSPGNVLIDVDDS-TTCLAFAPT-GGTAIIGNTQQQTFSVVYDVAQSRIGFAAG 466
Query: 494 KC 495
C
Sbjct: 467 GC 468
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 141/360 (39%), Positives = 198/360 (55%), Gaps = 20/360 (5%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKT 205
P G + G+G Y + +GTP +F++V DTGSD W+QC+PC CY+Q +P+FDP
Sbjct: 84 PASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTK 143
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S++Y+ + C++ C L VS C CLY + YGDGS+T+G +T++ ++K
Sbjct: 144 SATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYD-TIKNFR 202
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN-SA 321
GCG N GLF +AGLLGLG G SL Q AYCL S +G L+ A
Sbjct: 203 FGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSAGTGFLDLGPGA 261
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+A P++ ++ TFYYVG+TG VGG + IP S+F G +VD GT IT
Sbjct: 262 PAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVFST-----AGTLVDSGTVIT 315
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLR--SVRVPTVSLHFGAGKA 437
RL AY LR +F + L ++ A + DTCYD +G + S+ +P VSL F G
Sbjct: 316 RLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGAC 375
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LD+ A L D + C AFAP + + ++I+GN QQ+ V +D+ VGF P C
Sbjct: 376 LDVDASGILYVADVS-QACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 143/427 (33%), Positives = 210/427 (49%), Gaps = 26/427 (6%)
Query: 90 LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPEDFST 146
L R ND L D+ T TK QL + R + + A Q + P +
Sbjct: 17 LPVARCNDNVGFQLKLTHVDAG---TSYTKPQLLSRAIARSKARVAALQSAAVSPAPVAD 73
Query: 147 PVVSG---ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
P+ + + SGEY + +GTPP ++ ++DTGSD+ W QC PC C Q P FD
Sbjct: 74 PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDV 133
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK- 262
K S++Y LPC + +C +L +C C+YQ YGD + T G L ET +FG + S K
Sbjct: 134 KRSATYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKV 193
Query: 263 ---GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF- 318
I+ GCG N G S+G++G G G LSL Q+ + +YCL SP L F
Sbjct: 194 RAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPTPSRLYFG 253
Query: 319 --------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
N++ G + P + N + Y++ + G S+G + + I P +F +++ G G
Sbjct: 254 VFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTG 313
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR--SVRVPTV 428
G+I+D GT+IT LQ AY ++R DTC+ + +V VP
Sbjct: 314 GVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDF 373
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
HF G + LP +NY++ + G C A APTS +IIGN QQQ + +D+AN+ +
Sbjct: 374 VFHFD-GANMTLPPENYMLIASTTGYLCLAMAPTSVG-TIIGNYQQQNLHLLYDIANSFL 431
Query: 489 GFTPNKC 495
F P C
Sbjct: 432 SFVPAPC 438
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 135/359 (37%), Positives = 200/359 (55%), Gaps = 28/359 (7%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY + + +GTP R FS+++DTGSD+ W+QC PC CY Q+D +F P TS+S++ L C
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGT 60
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDN 272
C L C C+Y +YGDGS + GD V +T++ G V A GCGHDN
Sbjct: 61 ELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 120
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAV-- 327
EG F G+ G+LGLG G LS Q+K +YCLVD +P + + + GDA
Sbjct: 121 EGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPT---QTSPLLFGDAAVP 177
Query: 328 TAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
T P L+ N KV T+YYV L G SVGG+ + I + F++D G G I D GT +T
Sbjct: 178 TFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVT 237
Query: 382 RLQTQAYN----SLRDSFVRLAGNLKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGAGK 436
+L + + ++ S + +SG+ D C F+ + VP+++ HF G
Sbjct: 238 QLAGEVHQEVLAAMNASTMDYPRKSDDSSGL---DLCLGGFAEGQLPTVPSMTFHFEGGD 294
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++LP NY I ++S+ ++CF+ +S ++IIG++QQQ +V +D ++GF P C
Sbjct: 295 -MELPPSNYFIFLESSQSYCFSMV-SSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 148/420 (35%), Positives = 215/420 (51%), Gaps = 52/420 (12%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
LE D ARV+++ H + E ++ +D S P G S G+G Y +G+
Sbjct: 45 LEHDQARVDSI-------------HRMIANETAVVGQDVSLPAERGISVGTGNYVVSVGL 91
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
GTP R ++V DTGSD++W+QC PC+ CY Q DP+F P +SS++S + C P+C
Sbjct: 92 GTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEPECPRAR 151
Query: 224 VSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFG----------NSGSVKGIALGCGH 270
S + +RC Y+V YGD S TVG L +T++ G NS + G GCG
Sbjct: 152 QSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGE 211
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAV 327
+N GLF + GL GLG G +SL+ Q +YCL S A G L + A
Sbjct: 212 NNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAH 271
Query: 328 T--APLIRNKKVDTFYYVGLTGFSVGGQAVQIP--PSLFEMDEAGDGGIIVDCGTAITRL 383
P++ +FYYV L G V G+A+++ P+L+ G+IVD GT ITRL
Sbjct: 272 ARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPA------GLIVDSGTVITRL 325
Query: 384 QTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGK--A 437
+AY++LR +F+ G K +++ DTCYDF+ +V +P V+L F G +
Sbjct: 326 APRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATIS 385
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+D Y+ V A C AFAP + S I+GN QQ+ V +D+ ++GF C
Sbjct: 386 VDFSGVLYVAKVAQA---CLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 141/360 (39%), Positives = 198/360 (55%), Gaps = 20/360 (5%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKT 205
P G + G+G Y + +GTP +F++V DTGSD W+QC+PC CY+Q +P+FDP
Sbjct: 149 PASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTK 208
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S++Y+ + C++ C L VS C CLY + YGDGS+T+G +T++ ++K
Sbjct: 209 SATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYD-TIKNFR 267
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN-SA 321
GCG N GLF +AGLLGLG G SL Q AYCL S +G L+ A
Sbjct: 268 FGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSAGTGFLDLGPGA 326
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+A P++ ++ TFYYVG+TG VGG + IP S+F G +VD GT IT
Sbjct: 327 PAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVFST-----AGTLVDSGTVIT 380
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLR--SVRVPTVSLHFGAGKA 437
RL AY LR +F + L ++ A + DTCYD +G + S+ +P VSL F G
Sbjct: 381 RLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGAC 440
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LD+ A L D + C AFAP + + ++I+GN QQ+ V +D+ VGF P C
Sbjct: 441 LDVDASGILYVADVS-QACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 152/438 (34%), Positives = 226/438 (51%), Gaps = 42/438 (9%)
Query: 99 RSLVLSRLERDSARVNTLITKL-----QLAIYNVDRHELKP----------AEAQILPED 143
S+ +S++ +D AR+ TL ++ Q + + + + KP + A +
Sbjct: 107 ESVGVSKM-KDLARIQTLYKRMTEKKNQNTVSRLKKQQSKPQVAPPAAAPESSASVFSGQ 165
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
+ SG S GSGEYF + VGTPP+ FS++LDTGSD+NW+QC PC EC++Q+ P +DP
Sbjct: 166 LIATLESGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDP 225
Query: 204 KTSSSYSPLPCAAPQCKSLDV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSFG- 256
SSSY + C +C + C+A C Y YGD S T GD ET +
Sbjct: 226 GQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNL 285
Query: 257 --NSGS-----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLV 306
+SG V+ + GCGH N GLF G+AGLLGLG G LS + Q+++ S +YCLV
Sbjct: 286 TMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 345
Query: 307 DRDSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIP 358
DR+S A+ + D ++ P + + VDTFYYV + VGG+ V IP
Sbjct: 346 DRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIP 405
Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS 418
+++ G GG I+D GT ++ AY ++++F+ + + CY+ +
Sbjct: 406 EEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVT 465
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGT 477
G+ +P + F G + P +NY I ++ C A T SALSIIGN QQQ
Sbjct: 466 GVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNF 525
Query: 478 RVSFDLANNRVGFTPNKC 495
+ +D +R+GF P KC
Sbjct: 526 HILYDTKKSRLGFAPTKC 543
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 111/165 (67%), Positives = 134/165 (81%)
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
L RN ++DT+YYVGL G SVGG+ + IP + FE+D AG+GGIIVD GTA+TRLQ+ YN
Sbjct: 1 LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
+RD+FV+ +L T+ V+LFDTCYD S SV VPTV+ HFG GK L LPAKNYL+PVD
Sbjct: 61 VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120
Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S GTFCFAFAPT S+LSIIGN+QQQGTRVSFDLAN+ VGF+PN+C
Sbjct: 121 SVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 156/465 (33%), Positives = 237/465 (50%), Gaps = 36/465 (7%)
Query: 42 SALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSRE-ILHKTRHNDYRS 100
+A +T +LS +L+ A SE A P ++S ++PLH R N +
Sbjct: 27 AADHRTHKVLSVG--SLKSAATCSEPKAT--PPSTSGGITVPLHHRHGPCSPVPSNKMPA 82
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
+ RL+RD R + K A +++ ++A +P G S + EY
Sbjct: 83 SLEERLQRDQLRAAYIKRKFSGA----KGGDVEQSDAATVPTTL------GTSLSTLEYV 132
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+G+G+P +M +DTGSD++W+QC+PC++C+ + D +FDP SS+YSP C++ C
Sbjct: 133 ITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAACV 192
Query: 221 SLDVS----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
L S C +++C Y V+Y DGS T G ++T++ G S ++KG GC G F
Sbjct: 193 QLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLG-SNAIKGFQFGCSQSESGGF 251
Query: 277 VGSA-GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
GL+GLGG SL Q T + +YCL +SG L +A V P++
Sbjct: 252 SDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPG-SSGFLTLGAASRSGFVKTPML 310
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
R+ ++ T+Y V L VGGQ + IP S+F G ++D GT ITRL AY++L
Sbjct: 311 RSTQIPTYYGVLLEAIRVGGQQLNIPTSVFS------AGSVMDSGTVITRLPPTAYSALS 364
Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+F P + DTC+DFSG SV +P+V+L F G ++L ++ +D+
Sbjct: 365 SAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNLDFNGIMLELDN- 423
Query: 453 GTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+C AFA S S+L IGNVQQ+ V +D+ VGF C
Sbjct: 424 --WCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 234 bits (598), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 153/431 (35%), Positives = 213/431 (49%), Gaps = 29/431 (6%)
Query: 88 EILHKTRHNDYRSL-VLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST 146
+ H H +Y L +L R R S + + + + A +D
Sbjct: 48 RLTHVDAHGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAGDGSGGKDLQV 107
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PV G+GE+ + VGTP ++ ++DTGSD+ W QC+PC EC+ Q+ P+FDP S
Sbjct: 108 PV----HAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAAS 163
Query: 207 SSYSPLPCAAPQCKSLDVSACRANRCL--------YQVAYGDGSFTVGDLVTETVSFGNS 258
S+Y+ LPC++ C L S C ++ Y YGD S T G L TET +
Sbjct: 164 STYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQ 223
Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA----- 312
V G+A GCG NEG F AGL+GLG G LSL Q+ +YCL D A
Sbjct: 224 -KVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAGRSPL 282
Query: 313 ---SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
S SA A T PL++N +FYYV LTG +VG + +P S F + + G
Sbjct: 283 LLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGT 342
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD-----FSGLRSVR 424
GG+IVD GT+IT L+ +AY +LR +FV + D C+ V+
Sbjct: 343 GGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQDVQVQ 402
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
VP + LHF G LDLPA+NY++ ++G C S LSIIGN QQQ + +D+A
Sbjct: 403 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVM-ASRGLSIIGNFQQQNFQFVYDVA 461
Query: 485 NNRVGFTPNKC 495
+ + F P +C
Sbjct: 462 GDTLSFAPAEC 472
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 150/404 (37%), Positives = 213/404 (52%), Gaps = 36/404 (8%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
+ D+AR+ L ++L D+ + + S P+ SGAS G G Y +R+G+
Sbjct: 68 ITHDAARIAGLASRLA----TKDKDWVAAS---------SVPLASGASVGVGNYITRLGL 114
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
GTP + MV+D+GS + WLQC PC C+ Q+ P++DP+ SS+Y+ +PC+APQC
Sbjct: 115 GTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCAELQA 174
Query: 221 -SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
+L+ S+C + C YQ +YGDGSF+ G L +TVS +SGS G GCG DN GLF
Sbjct: 175 ATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGR 234
Query: 279 SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPL 331
+AGL+GL LSL Q+ S AYCL + ++G L F S G +
Sbjct: 235 AAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSM 294
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
+ + + Y+V L G SV G + +P S E G I+D GT ITRL T Y +L
Sbjct: 295 VSSSLDASLYFVSLAGMSVAGSPLAVPSS-----EYGSLPTIIDSGTVITRLPTPVYTAL 349
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
+ V A ++ TC+ + + VP V++ F G L L N L+ V+
Sbjct: 350 SKA-VGAALAAPSAPAYSILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVLVDVNE 407
Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T C AFAPT S +IIGN QQQ V +D+ +R+GF C
Sbjct: 408 T-TTCLAFAPTDST-AIIGNTQQQTFSVVYDVKGSRIGFAAGGC 449
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 138/412 (33%), Positives = 207/412 (50%), Gaps = 30/412 (7%)
Query: 106 LERDSARVNTL---ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
L D ARV+++ +T ++ + + + + P SG G+G Y
Sbjct: 98 LAHDQARVDSIQARVTDQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVN 157
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
+G+GTP + S++ DTGSD+ W QC+PC + CY Q PIFDP S +YS + C + C
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSG 217
Query: 222 LDVS-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
L + C ++ C+Y + YGD SFTVG +T++ + G GCG +N GLF
Sbjct: 218 LKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLF 277
Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARG-------GD 325
+AGL+GLG LS+ +Q +YCL R S +G L F + G +
Sbjct: 278 GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKTSKAVKN 335
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
+T + + TFY++ + G SVGG+A+ I P LF+ + G I+D GT ITRL +
Sbjct: 336 GITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPS 390
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
Y SL+ +F + ++L DTCYD S S+ +P +S +F +DL
Sbjct: 391 TVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGI 450
Query: 446 LIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LI + A C AFA + I GN+QQQ V +D+A ++GF C
Sbjct: 451 LI-TNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGC 501
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 153/408 (37%), Positives = 213/408 (52%), Gaps = 39/408 (9%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
L L D R + ++ A +L ++A +P + G S G+ +Y
Sbjct: 92 LDTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANL------GFSIGTLQYVVT 145
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+ +GTP ++ +DTGSD++W+QC+PC CY Q DP+FDP SSSYS +PCAA C
Sbjct: 146 VSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCS 205
Query: 221 SLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
L + + C +C Y V+YGDGS T G ++T++ S ++KG GCGH +GLF G
Sbjct: 206 QLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAG 265
Query: 279 SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAV----TAPL 331
GLLGLG SL Q +T +YCL P + + S G + T PL
Sbjct: 266 VDGLLGLGRQGQSLVSQASSTYGGVFSYCL----PPTQNSVGYISLGGPSSTAGFSTTPL 321
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
+ T+Y V L G SVGGQ + I S+F G +VD GT +TRL AY++L
Sbjct: 322 LTASNDPTYYIVMLAGISVGGQPLSIDASVFAS------GAVVDTGTVVTRLPPTAYSAL 375
Query: 392 RDSF-VRLAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
R +F +A P++ + DTCYDF+ +V +PT+S+ FG G A+DL L
Sbjct: 376 RSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT-- 433
Query: 450 DSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ C AFAPT S SI+GNVQQ+ V FD + VGF P C
Sbjct: 434 ----SGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 137/412 (33%), Positives = 210/412 (50%), Gaps = 30/412 (7%)
Query: 106 LERDSARVNTL---ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
L D ARV+++ IT ++ + + + + P SG G+G Y
Sbjct: 98 LAHDQARVDSIQARITDQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVN 157
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
+G+GTP + S++ DTGSD+ W QC+PC + CY Q PIFDP TS +YS + C + C S
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSAACSS 217
Query: 222 LDVS-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
L + C ++ C+Y + YGD SFT+G + ++ + G GCG +N+GLF
Sbjct: 218 LKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDVFDGFMFGCGQNNKGLF 277
Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARG-------GD 325
+AGL+GLG LS+ +Q +YCL R S +G L F + G +
Sbjct: 278 GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKASKAVKN 335
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
+T + + +Y++ + G SVGG+A+ I P LF+ + G I+D GT ITRL +
Sbjct: 336 GITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPS 390
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
AY SL+ +F + ++L DTCYD S S+ +P +S +F ++L
Sbjct: 391 TAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVELDPNGI 450
Query: 446 LIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LI + A C AFA ++ I GN+QQQ V +D+A ++GF C
Sbjct: 451 LI-TNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGFGYKGC 501
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 167/468 (35%), Positives = 240/468 (51%), Gaps = 42/468 (8%)
Query: 66 ETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQ---- 121
+T + L S SL + + H + RSL+L L+RD R+ + ++
Sbjct: 67 QTPSRRVLLEESMKTSLKMELKHRDHGQPTRNRRSLLLESLKRDITRLQSFQKRVSEKLT 126
Query: 122 --------LAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFS 173
L + N + P+ + E ST V SGA G+GEYF + VG PPR F
Sbjct: 127 ASANPEAYLEMTNSSSTKSPPSPSSSWEEVDST-VESGAELGAGEYFMDVFVGNPPRHFL 185
Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-- 231
+++DTGSD+ WLQC+PC C+ QS P+FDP S+S+ +PC A C + CR N
Sbjct: 186 LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSK 245
Query: 232 -----CLYQVAYGDGSFTVGDLVTETVSFG-----NSGSVKGIALGCGHDNEGLFVGSAG 281
C Y YGD S T GDL E++S +S ++ + +GCGH N+GLF G+ G
Sbjct: 246 TSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGG 305
Query: 282 LLGLGGGMLSLTKQIKAT----SLAYCLVDRDS--PASGVLEFNS----ARGGDAVT-AP 330
LLGLG G LS Q++++ S +YCLVDR + S + F + +R D + P
Sbjct: 306 LLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTP 365
Query: 331 LIR-NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
+R N V+TFYY+G+ G + + + IP F + G GG I+D GT +T L AY
Sbjct: 366 FVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYR 425
Query: 390 SLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI- 447
++ +F LA P + + CY+ +G +V PT+S+ F G LDLP +NY I
Sbjct: 426 AVESAF--LARISYPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQ 483
Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P C A PT +SIIGN QQQ +D+ + R+GF C
Sbjct: 484 PDPQEAKHCLAILPT-DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 530
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 137/359 (38%), Positives = 192/359 (53%), Gaps = 17/359 (4%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFD 202
S P G G+ Y +G GTP + +++ DTGS++NW+QC+PC CY Q +P+FD
Sbjct: 1 ISIPARIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFD 60
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
P SS+Y + C + C L C + C+Y V YGDGS TVG L TET +
Sbjct: 61 PTLSSTYRNISCTSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVFN 120
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVDRDSPASGVLEF 318
GCG +N+GLF G+AGL+GLG SL Q+ ATSL +YCL S A+G L
Sbjct: 121 NFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQL-ATSLGNIFSYCL-PSTSSATGYLNI 178
Query: 319 NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
+ TA ++ N + T Y++ L G SVGG + + ++F+ G I+D GT
Sbjct: 179 GNPLRTPGYTA-MLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGT 232
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
ITRL AY +LR +F + ++ DTCYDFS +V PT+ LH+ G +
Sbjct: 233 VITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHY-TGLDV 291
Query: 439 DLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+P + V S+ C AFA S+ + IIGNVQQ+ V++D A R+GF C
Sbjct: 292 TIPGAG-VFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/375 (36%), Positives = 211/375 (56%), Gaps = 35/375 (9%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
+TP+ SG S GSG Y+ +IG+GTP + FSM++DTGS ++WLQC+PC C+ Q DPIF P
Sbjct: 99 TTPLKSGLSIGSGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTP 158
Query: 204 KTSSSYSPLPC-----AAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG 256
TS +Y LPC ++ + +L+ C C+Y+ +YGD SF++G L + ++
Sbjct: 159 STSKTYKALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLT 218
Query: 257 NSGS-VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-----VD 307
S + G GCG DN+GLF S+G++GL +S+ Q+ + +YCL
Sbjct: 219 PSEAPSSGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAP 278
Query: 308 RDSPASGVLEFNSARGGDAVTA------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
S SG L G ++T+ PL++N+K+ + Y++ LT +V G+ + + S
Sbjct: 279 NSSSLSGFLSI----GASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASS 334
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGL 420
+ + I+D GT ITRL YN+L+ SFV ++ G ++ DTC+ S
Sbjct: 335 YNVPT------IIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVK 388
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
VP + + F G L+L A N L+ ++ GT C A A +S+ +SIIGN QQQ +V+
Sbjct: 389 EMSTVPEIQIIFRGGAGLELKAHNSLVEIEK-GTTCLAIAASSNPISIIGNYQQQTFKVA 447
Query: 481 FDLANNRVGFTPNKC 495
+D+AN ++GF P C
Sbjct: 448 YDVANFKIGFAPGGC 462
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 153/408 (37%), Positives = 213/408 (52%), Gaps = 39/408 (9%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
L L D R + ++ A +L ++A +P + G S G+ +Y
Sbjct: 81 LDTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANL------GFSIGTLQYVVT 134
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+ +GTP ++ +DTGSD++W+QC+PC CY Q DP+FDP SSSYS +PCAA C
Sbjct: 135 VSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCS 194
Query: 221 SLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
L + + C +C Y V+YGDGS T G ++T++ S ++KG GCGH +GLF G
Sbjct: 195 QLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAG 254
Query: 279 SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAV----TAPL 331
GLLGLG SL Q +T +YCL P + + S G + T PL
Sbjct: 255 VDGLLGLGRQGQSLVSQASSTYGGVFSYCL----PPTQNSVGYISLGGPSSTAGFSTTPL 310
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
+ T+Y V L G SVGGQ + I S+F G +VD GT +TRL AY++L
Sbjct: 311 LTASNDPTYYIVMLAGISVGGQPLSIDASVFAS------GAVVDTGTVVTRLPPTAYSAL 364
Query: 392 RDSF-VRLAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
R +F +A P++ + DTCYDF+ +V +PT+S+ FG G A+DL L
Sbjct: 365 RSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT-- 422
Query: 450 DSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ C AFAPT S SI+GNVQQ+ V FD + VGF P C
Sbjct: 423 ----SGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 464
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 171/450 (38%), Positives = 227/450 (50%), Gaps = 34/450 (7%)
Query: 76 SSSSFSLPLH-SREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
+S S SL LH +R R + VL ++D+ R+ T+ + A DR P
Sbjct: 69 ASLSPSLKLHMNRRAAEGGRTR--KESVLDLADKDAVRIETM--HRRAARSGGDRTPASP 124
Query: 135 AEA--QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
+ + + L E V SG + GSGEY + VGTPPR+F M++DTGSD+NWLQC PC +
Sbjct: 125 SSSPRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLD 184
Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----DVSACR---ANRCLYQVAYGDGSFTV 245
C+ Q P+FDP SSSY + C +C + ACR + C Y YGD S T
Sbjct: 185 CFDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTT 244
Query: 246 GDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT- 299
GDL E+ + G S V + GCGH N GLF G+AGLLGLG G LS Q++A
Sbjct: 245 GDLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVY 304
Query: 300 --SLAYCLVDRDSPASGVLEFNSARGGDA--------VTAPLIRNKKVDTFYYVGLTGFS 349
+ +YCLVD S + + F TA + DTFYYV L G
Sbjct: 305 GHTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVL 364
Query: 350 VGGQAVQIPPSLF--EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TS 406
VGG+ + I + E G GG I+D GT ++ AY +R +F+ G P
Sbjct: 365 VGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIP 424
Query: 407 GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSA 465
+ CY+ SG+ VP +SL F G D PA+NY I +D G C A T +
Sbjct: 425 DFPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 484
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+SIIGN QQQ V +DL NNR+GF P +C
Sbjct: 485 MSIIGNFQQQNFHVVYDLKNNRLGFAPRRC 514
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 139/365 (38%), Positives = 198/365 (54%), Gaps = 20/365 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
+ P +G S + E+ +G G+P + +++ +DTGSD++W+QC PC+ CY+Q DP+FDP
Sbjct: 147 TIPDSTGTSLDTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDP 206
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
S++YS +PC PQC + + CLY+V YGDGS T G L ET+S ++ + G
Sbjct: 207 TKSATYSAVPCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPG 266
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS 320
A GCG N G F G GL+GLG G LSL Q AT + +YCL D+ G L S
Sbjct: 267 FAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDT-THGYLTMGS 325
Query: 321 ARGG------DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
D +I+ + + Y+V + +GG + +PP++F D G +
Sbjct: 326 TTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRD-----GTLF 380
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT +T L +AY SLRD F KP FDTCYDF+G ++ +P V+ F
Sbjct: 381 DSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSD 440
Query: 435 GKALDL-PAKNYLIPVDSA-GTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGF 490
G DL P + P D+A T C AF P S + +IIGN QQ+GT V +D+A ++GF
Sbjct: 441 GAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGF 500
Query: 491 TPNKC 495
C
Sbjct: 501 GQFTC 505
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 139/358 (38%), Positives = 189/358 (52%), Gaps = 16/358 (4%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
S P G GSG Y +G GTP R ++V DTGSD+NWLQC+PC CY Q +P+FDP
Sbjct: 2 SIPARIGLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDP 61
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
SS+Y + C P C L C ++ CLY V YGDGS T+G L +T + K
Sbjct: 62 SLSSTYRNVSCTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKN 121
Query: 264 IALGCGHDNEGLFVGSAGLLGLG-GGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFN 319
GCG +N GLF G+AGL+GLG SL Q+ +YCL S A+G L
Sbjct: 122 FIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCL-PSTSSATGYLNIG 180
Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ + TA ++ + +V T Y++ L G SVGG + + ++F+ G I+D GT
Sbjct: 181 NPQNTPGYTA-MLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTV 234
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
ITRL AY++L+ + V + DTCYDFS SV P + LHF AG +
Sbjct: 235 ITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHF-AGLDVR 293
Query: 440 LPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+PA +S+ C AFA S+ + IIGNVQQ V++D R+GF+ C
Sbjct: 294 IPATGVFFVFNSS-QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 161/443 (36%), Positives = 232/443 (52%), Gaps = 42/443 (9%)
Query: 91 HKTRHNDYRSLVLSRLERDSARVNTLITKLQ------------LAIYNVDRHELKPAEAQ 138
H+ ++ RSL+L L+RD R+ + ++ L + N + P+ +
Sbjct: 8 HRQPTSNRRSLLLESLKRDITRLQSFQKRVSEKLTASANPEAYLEMTNSSSTKSPPSPSS 67
Query: 139 ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
E ST V SGA G+GEYF + VG PPR F +++DTGSD+ WLQC+PC C+ QS
Sbjct: 68 SWEEVDST-VESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSG 126
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-------CLYQVAYGDGSFTVGDLVTE 251
P+FDP S+S+ +PC A C + CR N C Y YGD S T GDL E
Sbjct: 127 PVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALE 186
Query: 252 TVSFG-----NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT----SLA 302
++S +S ++ + +GCGH N+GLF G+ GLLGLG G LS Q++++ S +
Sbjct: 187 SLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFS 246
Query: 303 YCLVDRDS--PASGVLEFNS----ARGGDAVT-APLIR-NKKVDTFYYVGLTGFSVGGQA 354
YCLVDR + S + F + +R D + P +R N V+TFYY+G+ G + +
Sbjct: 247 YCLVDRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQEL 306
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDT 413
+ IP F + G GG I+D GT +T L AY ++ +F LA P + +
Sbjct: 307 LPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAF--LARISYPRADPFDILGI 364
Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNV 472
CY+ +G +V P +S+ F G LDLP +NY I P C A PT +SIIGN
Sbjct: 365 CYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPT-DGMSIIGNF 423
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQQ +D+ + R+GF C
Sbjct: 424 QQQNIHFLYDVQHARLGFANTDC 446
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 143/412 (34%), Positives = 213/412 (51%), Gaps = 29/412 (7%)
Query: 104 SRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST--PVVSGASQGSGEYFS 161
S ER V + L + ++ H K + + + T P+ SG + Y
Sbjct: 65 SESERKGDWVEKQLVLDGLHVRSIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYIV 124
Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
+G+G+ + S+++DTGSD+ W+QC PC CY Q+ P+F P TS SY P+ C + C+S
Sbjct: 125 TMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQS 182
Query: 222 LDVSACRAN-----RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
L++ AC ++ C Y V YGDGS+T G+L E + FG SV GCG +N+GLF
Sbjct: 183 LELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGGI-SVSNFVFGCGRNNKGLF 241
Query: 277 VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP-ASGVLEFNSARGGDAVTAP-- 330
G++GL+GLG LS+ Q AT +YCL D ASG L + G P
Sbjct: 242 GGASGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIA 301
Query: 331 ---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
++ N ++ FY + LTG VGG ++ + S F G+GG+I+D GT I+RL
Sbjct: 302 YTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF-----GNGGVILDSGTVISRLAPSV 356
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN--Y 445
Y +L+ F+ G ++ DTC++ +G V +PT+S++F L++ A Y
Sbjct: 357 YKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAELNVDATGIFY 416
Query: 446 LIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+ D A C A A S + IIGN QQ+ RV +D ++VGF C
Sbjct: 417 LVKED-ASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEPC 467
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 231 bits (588), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 134/371 (36%), Positives = 195/371 (52%), Gaps = 28/371 (7%)
Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
D P+ SG + Y + +G R+ ++++DTGSD++W+QC+PC CY Q DP+F+
Sbjct: 119 DAPIPLTSGIRLQTLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFN 176
Query: 203 PKTSSSYSPLPCAAPQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSF 255
P TS SY + C++P C+SL ++ C +N C Y V YGDGS+T G+L TE +
Sbjct: 177 PSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDL 236
Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA 312
GNS +V GCG +N+GLF G++GL+GLG LSL Q A +YCL ++ A
Sbjct: 237 GNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETEA 296
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDT----FYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
SG L T P+ + + FY++ LTG +VG AVQ P G
Sbjct: 297 SGSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAP-------SFG 349
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G+++D GT ITRL Y +L+D FV+ + DTC++ SG + V +P +
Sbjct: 350 KDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNI 409
Query: 429 SLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLA 484
+HF L D+ Y + D A C A A S + + IIGN QQ+ RV +D
Sbjct: 410 KMHFEGNAELNVDVTGVFYFVKTD-ASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTK 468
Query: 485 NNRVGFTPNKC 495
+ +GF C
Sbjct: 469 GSMLGFAAEAC 479
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 136/375 (36%), Positives = 207/375 (55%), Gaps = 27/375 (7%)
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDP 199
P STP+ SG S GSG Y+ +IGVGTP + FSM++DTGS ++WLQC+PC C+ Q DP
Sbjct: 89 PSLVSTPLKSGLSIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDP 148
Query: 200 IFDPKTSSSYSP-----LPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTET 252
IF P S +Y C++ + +L+ C C+Y+ +YGD SF++G L +
Sbjct: 149 IFTPSVSKTYKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDV 208
Query: 253 VSFGNSGS-VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL--- 305
++ S + G GCG DN+GLF SAG++GL LS+ Q+ + +YCL
Sbjct: 209 LTLTPSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSS 268
Query: 306 --VDRDSPASGVLEFNSARGGDAVT--APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
+S SG L ++ + PL++N K+ + Y++GLT +V G+ + + S
Sbjct: 269 FSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASS 328
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGL 420
+ + I+D GT ITRL YN+L+ SFV ++ G ++ DTC+ S
Sbjct: 329 YNVPT------IIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVK 382
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
VP + + F G L+L N L+ ++ GT C A A +S+ +SIIGN QQQ V+
Sbjct: 383 EMSTVPEIRIIFRGGAGLELKVHNSLVEIEK-GTTCLAIAASSNPISIIGNYQQQTFTVA 441
Query: 481 FDLANNRVGFTPNKC 495
+D+AN+++GF P C
Sbjct: 442 YDVANSKIGFAPGGC 456
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 141/412 (34%), Positives = 206/412 (50%), Gaps = 24/412 (5%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPE--DFSTPVVSGASQGSGEY 159
+L+ T TKLQL + R + + A Q +LP D T + SGEY
Sbjct: 30 QLKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEY 89
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+ +GTPP ++ ++DTGSD+ W QC PC C Q P FD K S++Y LPC + +C
Sbjct: 90 LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149
Query: 220 KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK----GIALGCGHDNEGL 275
SL +C C+YQ YGD + T G L ET +FG + S K IA GCG N G
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGGDA 326
S+G++G G G LSL Q+ + +YCL S L F N++ G
Sbjct: 210 LANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPV 269
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
+ P + N + Y++ L S+G + + I P +F +++ G GG+I+D GT+IT LQ
Sbjct: 270 QSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329
Query: 387 AYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAK 443
AY ++R V A L + + DTC+ + +V VP + HF + LP +
Sbjct: 330 AYEAVRRGLVS-AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-E 387
Query: 444 NYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NY++ + G C APT +IIGN QQQ + +D+ N+ + F P C
Sbjct: 388 NYMLIASTTGYLCLVMAPTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 158/439 (35%), Positives = 217/439 (49%), Gaps = 66/439 (15%)
Query: 95 HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ 154
H+ Y + L RD RV ++ +L A+ + P G +
Sbjct: 76 HHHYTGI----LRRDRHRVRSIYRRL--------------TAAETTTTTTTIPARLGLAF 117
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPL 212
S EY IG+GTPPR F+++ DTGSD+ W+QC PC + CY Q +P+FDP SS+Y +
Sbjct: 118 QSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDV 177
Query: 213 PCAAPQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN----SGSVKGIAL 266
PC+AP+C + + C A C Y V YGD S T G L ET + + + G+
Sbjct: 178 PCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237
Query: 267 GCGHDNEGLF----VGSAGLLGLGGGMLSLTKQIKAT------SLAYCLVDRDSPASGVL 316
GC H+ +F +G AGLLGLG G S+ Q + + +YCL R S ++G L
Sbjct: 238 GCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGS-STGYL 296
Query: 317 EFNSARGGDAVT---------APLIRN-KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
GG A PLI ++ + Y V L G SV G AV IP S F +
Sbjct: 297 TIG---GGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL-- 351
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK--PTSGVALFDTCYDFSGLRSVR 424
G ++D GT +T + AY LRD F G+ K P + L DTCYD +G V
Sbjct: 352 ----GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVT 407
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPV---DSAGT----FCFAFAPTSSA-LSIIGNVQQQG 476
P V+L FG G +D+ A L+ + D +G C AF PT+SA L I+GN+QQ+
Sbjct: 408 APRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRA 467
Query: 477 TRVSFDLANNRVGFTPNKC 495
V FD+ R+GF PN C
Sbjct: 468 YNVVFDVDGGRIGFGPNGC 486
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 156/435 (35%), Positives = 216/435 (49%), Gaps = 41/435 (9%)
Query: 86 SREILHK------TRHNDYRSLVLSR---LERDSARVNTLITKLQLAIYNVDR-HELKPA 135
S E++HK HN +S + D+ RV + ++L N+ R + +K
Sbjct: 62 SLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLS---KNLGRENSVKEL 118
Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECY 194
++ LP SG+ GS YF +G+GTP R S+V DTGSD+ W QC PC CY
Sbjct: 119 DSTTLPAK------SGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCY 172
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA------NRCLYQVAYGDGSFTVGDL 248
+Q D IFDP SSSY + C + C L + ++ C+Y + YGD S +VG L
Sbjct: 173 KQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFL 232
Query: 249 VTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCL 305
E ++ + V GCG DNEGLF GSAGL+GLG +S +Q I +YCL
Sbjct: 233 SQERLTITATDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL 292
Query: 306 VDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLF 362
S + G L F ++A + PL +TFY + + G SVGG + + S F
Sbjct: 293 -PSTSSSLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTF 351
Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
GG I+D GT ITRL AY +LR +F + + LFDTCYDFSG +
Sbjct: 352 SA-----GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKE 406
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSALSIIGNVQQQGTRVS 480
+ VP + F G ++LP LI SA C AFA + ++I GNVQQ+ V
Sbjct: 407 ISVPKIDFEFAGGVTVELPLVGILIG-RSAQQVCLAFAANGNDNDITIFGNVQQKTLEVV 465
Query: 481 FDLANNRVGFTPNKC 495
+D+ R+GF C
Sbjct: 466 YDVEGGRIGFGAAGC 480
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 133/409 (32%), Positives = 220/409 (53%), Gaps = 30/409 (7%)
Query: 102 VLSRLERDSARVNTLITKLQLAIYNVDRHE----LKPAEAQILPEDFSTPVVSGASQGSG 157
+LSR E +++ + K + + RH+ L+P A I P+ G S GSG
Sbjct: 66 ILSRDEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANI-------PLNPGLSIGSG 118
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAA 216
Y+ ++G+G+PP+ ++M+LDTGS ++WLQC+PC C+ Q DP+F+P S++Y PL C++
Sbjct: 119 NYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSS 178
Query: 217 PQCKSLDVSA-----CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
+C L + C A+ C+Y +YGD S+++G L + ++ S ++ GCG
Sbjct: 179 SECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQ 238
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
DNEGLF +AG++GL LS+ Q+ + +YCL S G L
Sbjct: 239 DNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSIGKISPSSYK 298
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
P+IRN + + Y++ L +V G+ V + + +++ I+D GT +TRL
Sbjct: 299 FTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------IIDSGTVVTRLPISI 352
Query: 388 YNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
Y +LR++FV+ ++ + ++ DTC+ S P + + F G L L A N L
Sbjct: 353 YAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNIL 412
Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I D G C AFA +S+ ++IIGN QQQ +++D++ +++GF P C
Sbjct: 413 IEADK-GIACLAFA-SSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 145/417 (34%), Positives = 217/417 (52%), Gaps = 38/417 (9%)
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
D+ + +L D RV ++ +++ V H ++ ++ QI P+ SG + +
Sbjct: 13 DWNRRLQKQLISDDLRVRSMQNRIRRV---VSSHNVEASQTQI-------PLSSGINLQT 62
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
Y +G+G+ ++++DTGSD+ W+QC PC CY Q PIF P TSSSY + C +
Sbjct: 63 LNYIVTMGLGST--NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 217 PQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
C+SL + AC +N C Y V YGDGS+T G+L E +SFG SV GCG
Sbjct: 121 STCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGV-SVSDFVFGCG 179
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA 326
+N+GLF G +GL+GLG LSL Q AT +YCL +S ASG L +
Sbjct: 180 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSVFK 239
Query: 327 VTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
P ++ N ++ FY + LTG V G A+Q+P G+GG+++D GT IT
Sbjct: 240 NVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVP-------SFGNGGVLIDSGTVIT 292
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
RL + Y +L+ F++ G ++ DTC++ +G V +PT+S+HF L +
Sbjct: 293 RLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAELKVD 352
Query: 442 AK-NYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + + + A C A A S A +IIGN QQ+ RV +D ++VGF C
Sbjct: 353 ATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESC 409
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 153/412 (37%), Positives = 215/412 (52%), Gaps = 38/412 (9%)
Query: 102 VLSR-LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
+LSR L R SARV TL + LA + A +L D GEY
Sbjct: 49 LLSRALRRSSARVATLQSLAALA----PGDAITAARILVLASD-------------GEYL 91
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+G+GTP R +S +LDTGSD+ W QC PC C Q P FDP S++Y L CA+P C
Sbjct: 92 MEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACN 151
Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNEGLFV 277
+L C C+YQ YGD + T G L ET +FG + S+ GI+ GCG+ N GL
Sbjct: 152 ALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLA 211
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF--------NSARGGDAVTA 329
+G++G G G LSL Q+ + +YCL SP L F +A +
Sbjct: 212 NGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQST 271
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQAY 388
P + N + T Y++ +TG SVGG + I P++F + D G GG I+D GT IT L AY
Sbjct: 272 PFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAY 331
Query: 389 NSLRDSFV-RLAGNLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKALDLPAKNY 445
+++R +F ++ L + ++ DTC+ + +SV +P + LHF G +LP +NY
Sbjct: 332 DAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY 390
Query: 446 LIPVD--SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++ VD + G C A A +SS SIIG+ Q Q V +DL N+ + F P C
Sbjct: 391 ML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 148/424 (34%), Positives = 211/424 (49%), Gaps = 36/424 (8%)
Query: 85 HSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDF 144
HS + HND +L D+ RV + ++L + +R +K ++ LP
Sbjct: 81 HSGKAEATISHNDIMNL-------DNERVKYIQSRLSKNLGGENR--VKELDSTTLPAK- 130
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
SG GS +Y+ +G+GTP R S++ DTGS + W QC PC CY+Q DPIFDP
Sbjct: 131 -----SGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDP 185
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
SSSY+ + C + C + C ++ C+Y V YGD S + G L E ++ +
Sbjct: 186 SKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATDI 245
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPAS-GVL 316
V GCG DNEGLF G+AGL+GL +S +Q I +YCL +P+S G L
Sbjct: 246 VHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCL--PSTPSSLGHL 303
Query: 317 EF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGII 373
F ++A + P ++FY + + G SVGG + + S F GG I
Sbjct: 304 TFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSI 358
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT ITRL AY +LR +F + G L DTCYDFSG + + VP + F
Sbjct: 359 IDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFA 418
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFT 491
G ++LP L +SA C AFA + ++I GNVQQ+ V +D+ R+GF
Sbjct: 419 GGVKVELPLVGILYG-ESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFG 477
Query: 492 PNKC 495
C
Sbjct: 478 AAGC 481
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 144/385 (37%), Positives = 199/385 (51%), Gaps = 33/385 (8%)
Query: 124 IYNVDRHELKPAEAQI---LPEDFST--------PVVSGASQGSGEYFSRIGVGTPPRQF 172
I N D+ +K ++I L +D S P SG+ GSG YF +G+GTP R
Sbjct: 99 ILNQDKERVKYINSRISKNLGQDSSVSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDL 158
Query: 173 SMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS-----A 226
S++ DTGSD+ W QC PC CY+Q D IFDP S+SYS + C + C L +
Sbjct: 159 SLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPG 218
Query: 227 CRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
C A+ C+Y + YGD SF+VG E +S + V GCG +N+GLF GSAGL+G
Sbjct: 219 CSASTKACIYGIQYGDSSFSVGYFSRERLSVTATDIVDNFLFGCGQNNQGLFGGSAGLIG 278
Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
LG +S +Q A +YCL S ++G L F + P + +FY
Sbjct: 279 LGRHPISFVQQTAAVYRKIFSYCL-PATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFY 337
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
+ +TG SVGG + + S F GG I+D GT ITRL AY +LR +F R +
Sbjct: 338 GLDITGISVGGAKLPVSSSTFST-----GGAIIDSGTVITRLPPTAYTALRSAF-RQGMS 391
Query: 402 LKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
P++G +++ DTCYD SG +P + F G + LP + L V SA C AFA
Sbjct: 392 KYPSAGELSILDTCYDLSGYEVFSIPKIDFSFAGGVTVQLPPQGILY-VASAKQVCLAFA 450
Query: 461 PT--SSALSIIGNVQQQGTRVSFDL 483
S ++I GNVQQ+ V +D+
Sbjct: 451 ANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 145/429 (33%), Positives = 219/429 (51%), Gaps = 28/429 (6%)
Query: 75 NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
+SS + ++PLH R + RL RD R + K ++ D +
Sbjct: 52 SSSGATTVPLHHRHGPCSPLPTKKMPSLEDRLHRDQLRAAYIKRK-----FSGDVKKDGQ 106
Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
+ + P G S + EY + +G+P + ++++D+GSD++W+QC+PC +C+
Sbjct: 107 GAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCH 166
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQVAYGDGSFTVGDLVTE 251
Q DP+FDP SS+YSP C++ C L D + C +++C Y V Y DGS T G ++
Sbjct: 167 SQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSD 226
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
T++ G S ++ GC H G + GL+GLGGG SL Q T+ +YCL
Sbjct: 227 TLALG-SNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPT 285
Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
S +SG L + G V P++R+ V TFY V L VGG + IP S+F
Sbjct: 286 PS-SSGFLTLGAGTSG-FVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS----- 338
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G+++D GT ITRL AY++L +F +P ++ DTC+DFSG SVR+P+V
Sbjct: 339 -AGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSV 397
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
+L F G ++L A ++ C AFA S S+ I+GNVQQ+ V +D+
Sbjct: 398 ALVFSGGAVVNLDANGIIL------GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGG 451
Query: 487 RVGFTPNKC 495
VGF C
Sbjct: 452 AVGFKAGAC 460
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/334 (37%), Positives = 182/334 (54%), Gaps = 24/334 (7%)
Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSACRANR 231
+++DTGSDI W+QC PC +CY+Q D +F P S++Y PLPC + C+ L +C +
Sbjct: 3 LLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSS 62
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVGSAGLLGLGG 287
C Y V+YGD S T GD ET++ + SV A GCGH N+GLF G+AGL+GLG
Sbjct: 63 CNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGLGK 122
Query: 288 GMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSAR--GGDAVTAPLIRNKKVDTFY 341
+ Q +YCL S SG+L F A D PL+ + + Y
Sbjct: 123 SSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPSQY 182
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
+V +TG +VG + + I + ++VD GT I+R + AY LRD+F ++
Sbjct: 183 FVSMTGINVGDELLPISAT-----------VMVDSGTVISRFEQSAYERLRDAFTQILPG 231
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
L+ VA FDTC+ S + + +P ++LHF L L + L PVD G CFAFAP
Sbjct: 232 LQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDD-GVMCFAFAP 290
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+SS S++GN QQQ R +D+ +R+G + +C
Sbjct: 291 SSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 153/471 (32%), Positives = 238/471 (50%), Gaps = 44/471 (9%)
Query: 62 AEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQ 121
++E + A E ++ S L L REI +T+ + V+ +D R+ TL + +
Sbjct: 63 SKEHDPAKE----HTRESVKLHLRRREIKQETKRTTHS--VVDLQIQDLTRIQTLHARFK 116
Query: 122 LAIYNVDRHELKPAEA--------QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFS 173
+ + K + ++ P + SG + GSGEYF + VGTPP+ FS
Sbjct: 117 KSKKQRNEKVKKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFS 176
Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA----CRA 229
++LDTGSD+NWLQC PC +C+ Q++ +DPKTS+S+ + C P+C + C++
Sbjct: 177 LILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLISSPEPPVQCKS 236
Query: 230 NR--CLYQVAYGDGSFTVGDLVTETVSF------GNSG--SVKGIALGCGHDNEGLFVGS 279
+ C Y YGD S T GD ET + G S V+ + GCGH N GLF G+
Sbjct: 237 DNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHWNRGLFSGA 296
Query: 280 AGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI---- 332
+GLLGLG G LS + Q+++ S +YCLVDR+S + + D + +
Sbjct: 297 SGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHTNLNFTS 356
Query: 333 ----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
+ V+TFYY+ + VGG+A+ IP + + G GG I+D GT ++ AY
Sbjct: 357 FVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAY 416
Query: 389 NSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAKNY 445
+++ F ++ N + D C++ SG+ ++ +P + + F G + PA+N
Sbjct: 417 EIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENS 476
Query: 446 LIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I + S C A T S SIIGN QQQ + +D +R+GFTP KC
Sbjct: 477 FIWL-SEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 145/402 (36%), Positives = 214/402 (53%), Gaps = 34/402 (8%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
R +R R + KLQ+++ E+K EA PV +G +GE+ ++
Sbjct: 79 RFKRAIKRSQDRLEKLQMSV-----DEVKAVEA---------PVYAG----NGEFLMKMA 120
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTP FS +LDTGSD+ W QC+PCT+CY Q PI+DP SS+YS +PC++ C++L +
Sbjct: 121 IGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQALPM 180
Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
+C C Y +YGD S T G L E+ + S S+ IA GCG +NEG G L
Sbjct: 181 YSCSGANCEYLYSYGDQSSTQGILSYESFTL-TSQSLPHIAFGCGQENEGGGFSQGGGLV 239
Query: 285 LGGGM-LSLTKQIKAT---SLAYCLVD-RDSPASGVLEF----NSARGGDAVTAPLIRNK 335
G LSL Q+ + +YCLV DSP+ F S + PL++++
Sbjct: 240 GFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSR 299
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
TFYY+ L G SVGGQ + I F++ G GG+I+D GT +T L+ Y+ ++ +
Sbjct: 300 SRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAV 359
Query: 396 VRLAGNLKPTSGVAL-FDTCYD-FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
+ + NL G + D C++ SG + PT++ HF G +LP +NY I DS+G
Sbjct: 360 IS-SINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHF-EGADFNLPKENY-IYTDSSG 416
Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A P S+ +SI GN+QQQ ++ +D N + F P C
Sbjct: 417 IACLAMLP-SNGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 144/418 (34%), Positives = 222/418 (53%), Gaps = 38/418 (9%)
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
D+ + +L D RV ++ +++ H ++ ++ QI P+ SG + +
Sbjct: 13 DWNRRLQKQLILDDLRVRSMQNRIRRV---ASTHNVEASQTQI-------PLSSGINLQT 62
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
Y +G+G+ + ++++DTGSD+ W+QC PC CY Q PIF P TSSSY + C +
Sbjct: 63 LNYIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 217 PQCKSL-----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
C+SL + AC ++ C Y V YGDGS+T G+L E +SFG SV GC
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGV-SVSDFVFGC 179
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVL----EFNSA 321
G +N+GLF G +GL+GLG LSL Q AT +YCL ++ +SG L E +
Sbjct: 180 GRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNESSVF 239
Query: 322 RGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+ + +T ++ N ++ FY + LTG VGG A++ P S G+GGI++D GT I
Sbjct: 240 KNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSF------GNGGILIDSGTVI 293
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TRL + Y +L+ F++ G ++ DTC++ +G V +PT+SL F L++
Sbjct: 294 TRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNV 353
Query: 441 PAK-NYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + + + A C A A S A +IIGN QQ+ RV +D ++VGF C
Sbjct: 354 DATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPC 411
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 150/436 (34%), Positives = 224/436 (51%), Gaps = 46/436 (10%)
Query: 102 VLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVS----------- 150
VL RD R+ TL ++ LA N ++ + + + E +TPV S
Sbjct: 86 VLELQIRDLTRIQTLHKRV-LAKKN--QNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVA 142
Query: 151 ----GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
G + GSGEYF + VG+PP+ FS++LDTGSD+NW+QC PC +C+QQ+ +DPK S
Sbjct: 143 TLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKAS 202
Query: 207 SSYSPLPCAAPQCKSLD----VSACRANR--CLYQVAYGDGSFTVGDLVTE--TVSFGNS 258
+SY + C P+C + C+++ C Y YGD S T GD E TV+ S
Sbjct: 203 ASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTS 262
Query: 259 G------SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRD 309
G +V+ + GCGH N GLF G+AGLLGLG G LS + Q+++ S +YCLVDR+
Sbjct: 263 GGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN 322
Query: 310 SPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
S + + D ++ P + + VDTFYYV + V G+ + IP
Sbjct: 323 SDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEET 382
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGL 420
+ + G GG I+D GT ++ AY +++ A P + D C++ SG+
Sbjct: 383 WNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGI 442
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRV 479
S+++P + + F G + P +N I ++ C A T SA SIIGN QQQ +
Sbjct: 443 DSIQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTPKSAFSIIGNYQQQNFHI 501
Query: 480 SFDLANNRVGFTPNKC 495
+D +R+G+ P KC
Sbjct: 502 LYDTKRSRLGYAPTKC 517
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 159/470 (33%), Positives = 224/470 (47%), Gaps = 64/470 (13%)
Query: 61 FAEESETAAESFPLNSSSSFSLPLHSREI----LHKTRH---------NDYRSLVLSRLE 107
F TA+E P+ S+S +L S + +H RH +D S RL
Sbjct: 28 FVAVPTTASEPEPVCSTSGVTLDPGSNTVSVPLVH--RHGPCAPTQLSSDKPSSFTDRLR 85
Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
R+ AR +++++ + D D S P G S S EY +G+GT
Sbjct: 86 RNRARSKYIMSRVSKGMMGDD-------------ADVSIPTHLGGSVDSLEYVVTVGLGT 132
Query: 168 PPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD-- 223
P +++DTGSD++W+QC+PC T CY Q DP+FDP SS+Y+P+PC C+ L
Sbjct: 133 PSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNTDACRDLTDD 192
Query: 224 ------VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
S A +C + + YGDGS T G ET++ +VK GCGHD +G
Sbjct: 193 GYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDFRFGCGHDQDGAND 252
Query: 278 GSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPASGVLEFNSARGGDAVT------ 328
GLLGLGG SL Q + + +YCL ++ + V
Sbjct: 253 KYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSGGVVNTSGFV 312
Query: 329 -APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
P+IR ++ TFY V +TG +VGG+ + +PPS F GG+I+D GT +T LQ A
Sbjct: 313 FTPMIREEE--TFYVVNMTGITVGGEPIDVPPSAFS------GGMIIDSGTVVTELQHTA 364
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
YN+L+ +F R A P DTCYDFSG +V +P V+L F G +DL N ++
Sbjct: 365 YNALQAAF-RKAMAAYPLVRNGELDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGIL 423
Query: 448 PVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D C AF + I+GNV Q+ V +D RVGF C
Sbjct: 424 LDD-----CLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 152/412 (36%), Positives = 214/412 (51%), Gaps = 38/412 (9%)
Query: 102 VLSR-LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
+LSR L R SARV TL + LA + A +L D GEY
Sbjct: 49 LLSRALRRSSARVATLQSLAALA----PGDAITAARILVLASD-------------GEYL 91
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+G+GTP R +S +LDTGSD+ W QC PC C Q P FDP S++Y L CA+P C
Sbjct: 92 MEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACN 151
Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNEGLFV 277
+L C C+YQ YGD + T G L ET +FG + S+ GI+ GCG+ N G
Sbjct: 152 ALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLA 211
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF--------NSARGGDAVTA 329
+G++G G G LSL Q+ + +YCL SP L F +A +
Sbjct: 212 NGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQST 271
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQAY 388
P + N + T Y++ +TG SVGG + I P++F + D G GG I+D GT IT L AY
Sbjct: 272 PFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAY 331
Query: 389 NSLRDSFV-RLAGNLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKALDLPAKNY 445
+++R +F ++ L + ++ DTC+ + +SV +P + LHF G +LP +NY
Sbjct: 332 DAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY 390
Query: 446 LIPVD--SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++ VD + G C A A +SS SIIG+ Q Q V +DL N+ + F P C
Sbjct: 391 ML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 148/444 (33%), Positives = 228/444 (51%), Gaps = 31/444 (6%)
Query: 59 EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
E A +S T A + PL+ PL ++++ + RL RD R
Sbjct: 47 ESKAVKSSTGAATVPLHHRHGPCSPLPTKKM----------PTLEERLHRDQLRA----A 92
Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
+Q + + + + P G S + EY + +G+P + +M++DT
Sbjct: 93 YIQRKFSGGGVNGSRGGAGDVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDT 152
Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCLYQV 236
GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP C++ C L + + C +++C Y V
Sbjct: 153 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTV 212
Query: 237 AYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI 296
YGDGS T G ++T++ G S +V+ GC + G + GL+GLGGG SL Q
Sbjct: 213 TYGDGSSTTGTYSSDTLALG-SNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQT 271
Query: 297 KAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
T + +YCL S +SG L + G V P++R+ +V TFY V + VGG+
Sbjct: 272 AGTFGAAFSYCL-PATSSSSGFLTLGAGTSG-FVKTPMLRSSQVPTFYGVRIQAIRVGGR 329
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT 413
+ IP S+F G I+D GT +TRL AY++L +F + DT
Sbjct: 330 QLSIPTSVFS------AGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDT 383
Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGN 471
C+DFSG SV +PTV+L F G +D+ + ++ ++ C AFA S S+L IIGN
Sbjct: 384 CFDFSGQSSVSIPTVALVFSGGAVVDIASDGIMLQTSNS-ILCLAFAANSDDSSLGIIGN 442
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
VQQ+ V +D+ VGF C
Sbjct: 443 VQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 142/365 (38%), Positives = 190/365 (52%), Gaps = 35/365 (9%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAA 216
EY IG+GTP R F+++ DTGSD+ W+QC+PCT+ CYQQ +P+FDP SS+Y +PC
Sbjct: 125 EYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184
Query: 217 PQCK---SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG-SVKGIALGCGHDN 272
PQCK D++ C C Y V YGD S T G+L E + S G+ GC H+
Sbjct: 185 PQCKIGGGQDLT-CGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHEY 243
Query: 273 EGLFVGS------AGLLGLGGGMLSLTKQIKATS----LAYCLVDRDSPASGVLEFNSA- 321
G+ AGLLGLG G S+ Q + + +YCL R S A G L +A
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGSSA-GYLTIGAAA 302
Query: 322 --RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ + T + N ++ + Y V L G SV G A+ I S F + G ++D GT
Sbjct: 303 PPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI------GTVIDSGTV 356
Query: 380 ITRLQTQAYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
IT + AY LRD F R G + P V DTCYD +G V P V+L FG G
Sbjct: 357 ITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGAR 416
Query: 438 LDLPAKNYLI--PVDSAGT----FCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGF 490
+D+ A L+ VD++G C AF PT+ IIGN+QQ+ V FD+ R+GF
Sbjct: 417 IDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGF 476
Query: 491 TPNKC 495
N C
Sbjct: 477 GANGC 481
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 133/425 (31%), Positives = 229/425 (53%), Gaps = 36/425 (8%)
Query: 92 KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG 151
K+ N L +D R+ ++L + + ++ P+ P+ SG
Sbjct: 42 KSPPNSTSLLFAYMFAKDEERIRYFHSRL------AKNSDANASSKKVGPKLAGIPLKSG 95
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYS 210
S GSG Y+ ++G+G+P + ++M++DTGS +WLQC+PCT C+ Q DP+F+P S +Y
Sbjct: 96 LSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYK 155
Query: 211 PLPCAAPQCK-----SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
+PC++ QC +L+ C ++N C+Y+ +YGD SF++G L + ++ S ++
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSS 215
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR----DSPASGVL 316
GCG DN+GLF + G++GL LS+ Q+ + +YCL +SP G L
Sbjct: 216 FVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFL 275
Query: 317 EFNSARGGDAVT---APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
++ + + PL++N + Y++ L +V G+ + + S +++ I
Sbjct: 276 SIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT------I 329
Query: 374 VDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSL 430
+D GT ITRL T Y +L++++V L+ + G++L DTC+ +G+ V P + +
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEV-APDIRI 388
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
F G L L N L+ +++ G C A A SS+++IIGN QQQ +V++D+ N+RVGF
Sbjct: 389 IFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGNSRVGF 446
Query: 491 TPNKC 495
P C
Sbjct: 447 APGGC 451
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 194/343 (56%), Gaps = 21/343 (6%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCK- 220
+G+GTP Q+ MV+DTGS + WLQC PC C++QS P+F+PK+SS+Y+ + C+A QC
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 221 ----SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+L+ SAC +N C+YQ +YGD SF+VG L +TVSFG++ S+ GCG DNEGL
Sbjct: 61 LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGST-SLPNFYYGCGQDNEGL 119
Query: 276 FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
F SAGL+GL LSL Q+ + S YCL S +SG L S G P++
Sbjct: 120 FGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL--PSSSSSGYLSLGSYNPGQYSYTPMV 177
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+ D+ Y++ L+G +V G P I+D GT ITRL T Y++L
Sbjct: 178 SSSLDDSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALS 232
Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+ S ++ DTC+ R V P V++ F G AL L A+N L+ VD +
Sbjct: 233 KAVAAAMKGTSRASAYSILDTCFKGQASR-VSAPAVTMSFAGGAALKLSAQNLLVDVDDS 291
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T C AFAP SA +IIGN QQQ V +D+ ++R+GF C
Sbjct: 292 TT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 332
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 164/476 (34%), Positives = 238/476 (50%), Gaps = 64/476 (13%)
Query: 45 QQTEHILSFEPETLEPFAEESETAAES---FPLNSSSSFSLPLHSRE--ILHKTRHNDYR 99
+ EH+L P + ++E + T + S + S++ S+PL R TR +D
Sbjct: 23 NEEEHVLVAVPTSR--YSEPAATCSTSRVRWLDEGSNTVSVPLVHRHGPCAPSTRSSDEP 80
Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
SL RL R AR ++++ + + S P G S S EY
Sbjct: 81 SLS-ERLRRSRARSKYIMSRASKS-------------------NVSIPTHLGGSVDSLEY 120
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAP 217
+G+GTP +++DTGSD++W+QC PC T CY Q DP+FDP SS+Y+P+PC
Sbjct: 121 VVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTD 180
Query: 218 QCKSLDV----SACRAN-----RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
C+ L S C + +C Y + YGDGS T G ET++ +VK GC
Sbjct: 181 ACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKDFHFGC 240
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEF----NSA 321
GHD +G GLLGLGG SL T + + +YCL + A G L N A
Sbjct: 241 GHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQA-GFLALGAPVNDA 299
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
G V P++R ++ TFY V +TG +VGG+ + +PPS F GG+I+D GT +T
Sbjct: 300 SG--FVFTPMVREQQ--TFYVVNMTGITVGGEPIDVPPSAFS------GGMIIDSGTVVT 349
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
LQ AY +L+ +F R A P DTCY+F+G +V VP V+L F G +DL
Sbjct: 350 ELQHTAYAALQAAF-RKAMAAYPLLPNGELDTCYNFTGHSNVTVPRVALTFSGGATVDLD 408
Query: 442 AKNYLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ +D+ C AF A + I+GNV Q+ V +D+ + RVGF + C
Sbjct: 409 VPDGIL-LDN----CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 154/441 (34%), Positives = 218/441 (49%), Gaps = 29/441 (6%)
Query: 74 LNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVD--RHE 131
+N+ S +L L S + H + + RDS + + VD R
Sbjct: 1 MNTLSFLTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRS 60
Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
+ A D STP S G Y VGTPP + + DTGSDI WLQC PC
Sbjct: 61 INRANHFFKDSDTSTPE-STVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCE 119
Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLVT 250
+CY Q+ PIF+P SSSY +PC++ C S+ D S N C Y+++YGD S + GDL
Sbjct: 120 QCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSV 179
Query: 251 ETVSF----GNSGSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLA 302
+T+S G+ S I +GCG DN G F G S+G++GLGGG +SL Q+ ++ +
Sbjct: 180 DTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFS 239
Query: 303 YCLV---DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
YCLV +++S AS +L F A G V+ PLI+ V FY++ L FSVG + V+
Sbjct: 240 YCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVGNKRVE 297
Query: 357 IPPSLFEMDEAGD--GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
S E GD G II+D GT +T + + Y +L + V L + F C
Sbjct: 298 FGGS----SEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLC 353
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQ 474
Y P +++HF G ++L + + +P+ + G CFAF P+ SI GN+ Q
Sbjct: 354 YSLKS-NEYDFPIITVHF-KGADVELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGNLAQ 410
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
Q V +DL V F P C
Sbjct: 411 QNLLVGYDLQQKTVSFKPTDC 431
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 148/437 (33%), Positives = 221/437 (50%), Gaps = 47/437 (10%)
Query: 102 VLSRLERDSARVNTLITK-LQLAIYNVDRHELKPAEAQILPEDFSTPVVS---------- 150
VL RD R+ TL + L+ N + K + +++ +TPV S
Sbjct: 100 VLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT---TTPVASSVEEQAGQLV 156
Query: 151 -----GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
G + GSGEYF + VG+PP+ FS++LDTGSD+NW+QC PC +C+QQ+ +DPK
Sbjct: 157 ATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKA 216
Query: 206 SSSYSPLPCAAPQCKSLDV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSFG--- 256
S+SY + C +C + C+++ C Y YGD S T GD ET +
Sbjct: 217 SASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 276
Query: 257 NSGS-----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDR 308
N GS V+ + GCGH N GLF G+AGLLGLG G LS + Q+++ S +YCLVDR
Sbjct: 277 NGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR 336
Query: 309 DSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
+S + + D ++ P + + VDTFYYV + V G+ + IP
Sbjct: 337 NSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEE 396
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSG 419
+ + G GG I+D GT ++ AY +++ A P + D C++ SG
Sbjct: 397 TWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSG 456
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTR 478
+ +V++P + + F G + P +N I ++ C A T SA SIIGN QQQ
Sbjct: 457 IHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQNFH 515
Query: 479 VSFDLANNRVGFTPNKC 495
+ +D +R+G+ P KC
Sbjct: 516 ILYDTKRSRLGYAPTKC 532
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 144/366 (39%), Positives = 198/366 (54%), Gaps = 32/366 (8%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFD 202
+ P G + G+ Y + +GTP ++ +DTGSD++W+QC PC CY Q DP+FD
Sbjct: 126 TVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFD 185
Query: 203 PKTSSSYSPLPCAAPQCKSLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
P SSSY+ +PC P C L + S+C A +C Y V+YGDGS T G ++T++ + +
Sbjct: 186 PAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDA 245
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLE 317
V+G GCGH G F G+ GLLGLG SL +Q T +YCL R S +G L
Sbjct: 246 VRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLPTRPS-TTGYLT 303
Query: 318 FNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
G T L+ + T+Y V LTG SVGGQ + +P S+F GG +V
Sbjct: 304 LGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFA------GGTVV 357
Query: 375 DCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHF 432
D GT ITRL AY +LR +F +A P++ + DTCY+FSG +V +P V+L F
Sbjct: 358 DTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTF 417
Query: 433 GAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVG 489
G + L A L +F C AFAP+ S ++I+GNVQQ+ V D VG
Sbjct: 418 SGGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVG 468
Query: 490 FTPNKC 495
F P+ C
Sbjct: 469 FKPSSC 474
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 221 bits (562), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 133/425 (31%), Positives = 229/425 (53%), Gaps = 36/425 (8%)
Query: 92 KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG 151
K+ N L +D R+ ++L + + ++ P+ P+ SG
Sbjct: 42 KSPPNSTSLLFAYMFAKDEERIRYFHSRL------AKNSDANASFKKVGPKLAGIPLKSG 95
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYS 210
S GSG Y+ ++G+G+P + ++M++DTGS +WLQC+PCT C+ Q DP+F+P S +Y
Sbjct: 96 LSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYK 155
Query: 211 PLPCAAPQCK-----SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
+PC++ QC +L+ C ++N C+Y+ +YGD SF++G L + ++ S ++
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSS 215
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR----DSPASGVL 316
GCG DN+GLF + G++GL LS+ Q+ + +YCL +SP G L
Sbjct: 216 FVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFL 275
Query: 317 EFNSARGGDAVT---APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
++ + + PL++N + Y++ L +V G+ + + S +++ I
Sbjct: 276 SIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT------I 329
Query: 374 VDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSL 430
+D GT ITRL T Y +L++++V L+ + G++L DTC+ +G+ V P + +
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEV-APDIRI 388
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
F G L L N L+ +++ G C A A SS+++IIGN QQQ +V++D+ N+RVGF
Sbjct: 389 IFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGNSRVGF 446
Query: 491 TPNKC 495
P C
Sbjct: 447 APGGC 451
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 221 bits (562), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 150/424 (35%), Positives = 215/424 (50%), Gaps = 43/424 (10%)
Query: 86 SREILHK----TRHNDYRSLVLSR------LERDSARVNTLITKLQLAI-YNVDRHELKP 134
S E++HK ++ ND+ S L +D RV + ++L + + EL
Sbjct: 71 SLEVVHKHGPCSQLNDHDGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQDSSVEELDS 130
Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-C 193
A + P SG+ GSG YF +G+GTP R S++ DTGSD+ W QC PC C
Sbjct: 131 A---------TLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSC 181
Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS-----ACRANR--CLYQVAYGDGSFTVG 246
Y+Q D IFDP S+SYS + C + C L + C A+ C+Y + YGD SF+VG
Sbjct: 182 YKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVG 241
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
E ++ + V GCG +N+GLF GSAGL+GLG +S +Q A +Y
Sbjct: 242 YFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSY 301
Query: 304 CLVDRDSPASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
CL S ++G L F A G + P + +FY + +T +VGG + + S F
Sbjct: 302 CLPSTSS-STGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTF 360
Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLR 421
GG I+D GT ITRL AY +LR +F R + P++G +++ DTCYD SG +
Sbjct: 361 ST-----GGAIIDSGTVITRLPPTAYGALRSAF-RQGMSKYPSAGELSILDTCYDLSGYK 414
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRV 479
+PT+ F G + LP + L V S C AFA S ++I GNVQQ+ V
Sbjct: 415 VFSIPTIEFSFAGGVTVKLPPQGILF-VASTKQVCLAFAANGDDSDVTIYGNVQQRTIEV 473
Query: 480 SFDL 483
+D+
Sbjct: 474 VYDV 477
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 146/435 (33%), Positives = 221/435 (50%), Gaps = 63/435 (14%)
Query: 86 SREILHK-------TRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ 138
S +++HK + N ++ L D +RV+++ KL D +K +A
Sbjct: 66 SLKVVHKHGPCSQLNQQNGNAPNLVEILLEDQSRVDSIHAKLS------DHSGVKETDAA 119
Query: 139 ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
LP SG S G+G Y IG+G+P + ++ DTGSD+ W +C +
Sbjct: 120 KLPTK------SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARC--------SAA 165
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVGDLVTETV 253
FDP S+SY+ + C+ P C S+ + S C A+ C+Y + YGDGS+++G L E +
Sbjct: 166 ETFDPTKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERL 225
Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI--KATSL-AYCLVDRDS 310
+ G++ GCG D +GLF +AGLLGLG LS+ Q K L +YCL S
Sbjct: 226 TIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL--PSS 283
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
++G L F S++ A PL + +FY + LTG +VGGQ + IP S+F
Sbjct: 284 SSTGFLSFGSSQSKSAKFTPL--SSGPSSFYNLDLTGITVGGQKLAIPLSVFST-----A 336
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G I+D GT +TRL AY++LR +F + + +++ DTCYDFS ++++VP + +
Sbjct: 337 GTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVI 396
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTF--------CFAFAPTSSA--LSIIGNVQQQGTRVS 480
F G +D VD AG F C AFA + A +I GN QQ+ V
Sbjct: 397 SFSGGVDVD---------VDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVV 447
Query: 481 FDLANNRVGFTPNKC 495
+D++ +VGF P C
Sbjct: 448 YDVSGGKVGFAPASC 462
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 151/407 (37%), Positives = 213/407 (52%), Gaps = 17/407 (4%)
Query: 98 YRSLVLSRLERDSA-RVNTLITKLQLAIYNVDR-HELKPAEAQ-ILPED--FSTPVVSGA 152
+R+ ++ R + S R TL T ++ I V R HE + A+ +L D F TPV SG
Sbjct: 28 FRAELIYREHQSSPLRSETLKTPSEIFIAAVKRGHERRARLAKHVLAGDQLFETPVASG- 86
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
+GEY I G PP++ + ++DTGSD+NW+QC PC CY+ FDP S+SY L
Sbjct: 87 ---NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTL 143
Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
C + C+ L +C A+ C Y YGDGS T G L T+ V+ G +G + +A GCG+ N
Sbjct: 144 GCGSNFCQDLPFQSCAAS-CQYDYMYGDGSSTSGALSTDDVTIG-TGKIPNVAFGCGNSN 201
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF-NSARGGDAVT 328
G F G+ GL+GLG G LSL Q+ T+ +YCLV S + L +S G
Sbjct: 202 LGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTLAGGVAY 261
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
P++ N TFYY L G SV G+AV P + F++ G GG+I+D GT +T L A+
Sbjct: 262 TPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAF 321
Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
N + + + + C+ +G+ + PTV HF G + L N I
Sbjct: 322 NPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFN-GADVALAPDNTFIA 380
Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+D GT C A A +S+ SI GN+QQ + DL N R+GF C
Sbjct: 381 LDFEGTTCLAMA-SSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 154/364 (42%), Positives = 212/364 (58%), Gaps = 25/364 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
S P+ G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC C++QS P+F+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174
Query: 204 KTSSSYSPLPCAAPQCK-----SLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
K SSSY+ + C+A QC +L+ ++C +N C+YQ +YGD SF+VG L +TVSFG+
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS 234
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
+ SV GCG DNEGLF SAGL+GL LSL Q+ + S +YCL S +SG
Sbjct: 235 T-SVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSG 293
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
L S G P+ + D+ Y++ +TG V G+ P I+
Sbjct: 294 YLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTII 348
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLH 431
D GT ITRL T Y++L + +AG +K T S ++ DTC+ R +RVP V++
Sbjct: 349 DSGTVITRLPTGVYSALSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMA 404
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
F G AL L A+N L+ VDSA T C AFAP SA +IIGN QQQ V +D+ N+++GF
Sbjct: 405 FAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFA 462
Query: 492 PNKC 495
C
Sbjct: 463 AGGC 466
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 154/364 (42%), Positives = 208/364 (57%), Gaps = 25/364 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
S P+ G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC C++QS P+F+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174
Query: 204 KTSSSYSPLPCAAPQCKSLD------VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
K SSSY+ + C+A QC L S +N C+YQ +YGD SF+VG L +TVSFG+
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS 234
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
+ SV GCG DNEGLF SAGL+GL LSL Q+ + S +YCL S +SG
Sbjct: 235 T-SVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSG 293
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
L S G P+ + D+ Y++ +TG V G+ P I+
Sbjct: 294 YLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTII 348
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLH 431
D GT ITRL T Y++L + +AG +K T S ++ DTC+ R +RVP V++
Sbjct: 349 DSGTVITRLPTGVYSALSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMA 404
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
F G AL L A+N L+ VDSA T C AFAP SA +IIGN QQQ V +D+ N+++GF
Sbjct: 405 FAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFA 462
Query: 492 PNKC 495
C
Sbjct: 463 AGGC 466
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 152/448 (33%), Positives = 228/448 (50%), Gaps = 34/448 (7%)
Query: 59 EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
P + + +++ P +S+ + ++PLH R + L RD R +
Sbjct: 37 SPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQR 96
Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
K A + D + P G S + EY +G+G+P +M++DT
Sbjct: 97 KFSGGGG---------AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDT 147
Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQ 235
GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP C + C L + + C +++C Y
Sbjct: 148 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYI 207
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
V YGDGS T G ++T++ G+S +VK GC + G + GL+GLGGG SL Q
Sbjct: 208 VTYGDGSSTTGTYSSDTLALGSS-AVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQ 266
Query: 296 IKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFS 349
T + +YCL S +SG L +A G V P++R+ +V TFY V L
Sbjct: 267 TAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIR 325
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
VGG+ + IP S+F G ++D GT ITRL AY++L +F P
Sbjct: 326 VGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSG 379
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALS 467
+ DTC+DFSG SV +P+V+L F G + L A ++ + C AFA S S+L
Sbjct: 380 ILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAANSDDSSLG 433
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGNVQQ+ V +D+ VGF C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 142/421 (33%), Positives = 220/421 (52%), Gaps = 38/421 (9%)
Query: 94 RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
R ++ + +L D RV ++ +++ + + E + +E QI P+ SG +
Sbjct: 76 RKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSE-QSSEIQI-------PLASGIN 127
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
+ Y IG+G + ++++DTGSD+ W+QC PC CY Q P+F+P SSSY+ L
Sbjct: 128 LETLNYIVTIGLGN--QNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLL 185
Query: 214 CAAPQCKSL-----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
C + C++L + AC +N C + V+YGDGSFT G+L E +SFG SV
Sbjct: 186 CNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGI-SVSNFV 244
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSAR 322
GCG +N+GLF G +G++GLG LS+ Q T +YCL DS ASG L +
Sbjct: 245 FGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGASGSLVIGNES 304
Query: 323 GGDAVTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDC 376
P ++ N ++ FY + LTG VGG A+Q D + G+GGI++D
Sbjct: 305 SLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQ--------DTSFGNGGILIDS 356
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT ITRL YN+L+ F++ +++ DTC++ +G+ V +PT+S+HF
Sbjct: 357 GTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENNV 416
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
L++ A L C A A S + ++IIGN QQ+ RV +D +++GF
Sbjct: 417 DLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFARED 476
Query: 495 C 495
C
Sbjct: 477 C 477
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 164/408 (40%), Positives = 227/408 (55%), Gaps = 30/408 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDR--HELKPAEAQILPED---FSTPVVSGASQGSGEYF 160
L D AR+ +L +L + E + + P+D S P+ G S G G Y
Sbjct: 69 LAHDGARIASLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYV 128
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+R+G+GTP + + MV+DTGS + WLQC PC C++QS P+F+PK SSSY+ + C+A QC
Sbjct: 129 TRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQC 188
Query: 220 K-----SLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
+L+ ++C +N C+YQ +YGD SF+VG L +TVSFG S SV GCG DNE
Sbjct: 189 SDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFG-STSVPNFYYGCGQDNE 247
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
GLF SAGL+GL LSL Q+ + S +YCL S +SG L S G P
Sbjct: 248 GLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNPGQYSYTP 307
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
+ + D+ Y++ +TG V G+ P I+D GT ITRL T Y++
Sbjct: 308 MASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVITRLPTGVYSA 362
Query: 391 LRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
L + +AG +K T S ++ DTC+ R +RVP V++ F G AL L A+N L+
Sbjct: 363 LSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLV 418
Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
VDSA T C AFAP SA +IIGN QQQ V +D+ N+++GF C
Sbjct: 419 DVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 464
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 160/440 (36%), Positives = 213/440 (48%), Gaps = 41/440 (9%)
Query: 81 SLPL-HSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
S+PL H + + + + RL RD AR N ++TK A +
Sbjct: 18 SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKA------TGGRTAATALSDA 71
Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQS 197
S P G S S EY +G+GTP Q ++++DTGSD++W+QC+PC ECY Q
Sbjct: 72 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 131
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSL----------DVSACRANRCLYQVAYGDGSFTVGD 247
DP+FDP +SSSY+ +PC + C+ L VS A C Y + YG+ + T G
Sbjct: 132 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 191
Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYC 304
TET++ V GCG G + GLLGLGG SL Q + +YC
Sbjct: 192 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 251
Query: 305 LVDRDSPASGVLEFNSARGGDAVTA-------PLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
L S +G L + + TA P+ R V TFY V LTG SVGG + I
Sbjct: 252 L-PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAI 310
Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCY 415
PPS F G+++D GT IT L AY +LR +F L P S + DTCY
Sbjct: 311 PPSAFSS------GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY 364
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
DF+G +V VPT+SL F G +DL A ++ VD G FA A T +A+ IIGNV Q+
Sbjct: 365 DFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIGNVNQR 421
Query: 476 GTRVSFDLANNRVGFTPNKC 495
V +D VGF C
Sbjct: 422 TFEVLYDSGKGTVGFRAGAC 441
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 138/387 (35%), Positives = 204/387 (52%), Gaps = 30/387 (7%)
Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
++ P + SG + GSGEYF + VGTPP+ FS++LDTGSD+NWLQC PC +C+ Q+
Sbjct: 139 EVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQN 198
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVS----ACRANR--CLYQVAYGDGSFTVGDLVTE 251
+DPKTS+S+ + C P+C + C ++ C Y YGD S T GD E
Sbjct: 199 GMFYDPKTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVE 258
Query: 252 TVSF------GNSGSVK--GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TS 300
T + G S K + GCGH N GLF G++GLLGLG G LS + Q+++ S
Sbjct: 259 TFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHS 318
Query: 301 LAYCLVDRDSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGG 352
+YCLVDR+S + + D + + + V+TFYY+ + VGG
Sbjct: 319 FSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGG 378
Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALF 411
+A+ IP + + GDGG I+D GT ++ AY +++ F ++ N +
Sbjct: 379 KALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVL 438
Query: 412 DTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSI 468
D C++ SG+ ++ +P + + F G + PA+N I + S C A T S SI
Sbjct: 439 DPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWL-SEDLVCLAILGTPKSTFSI 497
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
IGN QQQ + +D +R+GFTP KC
Sbjct: 498 IGNYQQQNFHILYDTKRSRLGFTPTKC 524
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 119/227 (52%), Positives = 155/227 (68%), Gaps = 8/227 (3%)
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSAR---GGDAVT 328
+FVG+AGLLGLG G +S Q+ + +YCLV R + +SG LEF G V+
Sbjct: 1 MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESSGSLEFGRESVPVGASWVS 60
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
LI N + +FYY+GL+G VGG V I +F ++E G+GG+++D GTA+TRL AY
Sbjct: 61 --LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAY 118
Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
N+ RD+FV NL TSGV++FDTCYD +G +VRVPT+S +F G L LPA+N+LIP
Sbjct: 119 NAFRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIP 178
Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
VDS GTFCFAFAP+SS LSIIGN+QQ+G +S D AN +GF PN C
Sbjct: 179 VDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 164/408 (40%), Positives = 227/408 (55%), Gaps = 30/408 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDR--HELKPAEAQILPED---FSTPVVSGASQGSGEYF 160
L D AR+ +L +L + E + + P+D S P+ G S G G Y
Sbjct: 69 LAHDGARIASLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYV 128
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+R+G+GTP + + MV+DTGS + WLQC PC C++QS P+F+PK SSSY+ + C+A QC
Sbjct: 129 TRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQC 188
Query: 220 K-----SLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
+L+ ++C +N C+YQ +YGD SF+VG L +TVSFG S SV GCG DNE
Sbjct: 189 SDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFG-STSVPNFYYGCGQDNE 247
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
GLF SAGL+GL LSL Q+ + S +YCL S +SG L S G P
Sbjct: 248 GLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNPGQYSYTP 307
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
+ + D+ Y++ +TG V G+ P I+D GT ITRL T Y++
Sbjct: 308 MASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVITRLPTGVYSA 362
Query: 391 LRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
L + +AG +K T S ++ DTC+ R +RVP V++ F G AL L A+N L+
Sbjct: 363 LSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLV 418
Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
VDSA T C AFAP SA +IIGN QQQ V +D+ N+++GF C
Sbjct: 419 DVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAAGC 464
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 160/440 (36%), Positives = 213/440 (48%), Gaps = 41/440 (9%)
Query: 81 SLPL-HSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
S+PL H + + + + RL RD AR N ++TK A +
Sbjct: 98 SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKA------TGGRTAATALSDA 151
Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQS 197
S P G S S EY +G+GTP Q ++++DTGSD++W+QC+PC ECY Q
Sbjct: 152 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 211
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSL----------DVSACRANRCLYQVAYGDGSFTVGD 247
DP+FDP +SSSY+ +PC + C+ L VS A C Y + YG+ + T G
Sbjct: 212 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 271
Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYC 304
TET++ V GCG G + GLLGLGG SL Q + +YC
Sbjct: 272 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 331
Query: 305 LVDRDSPASGVLEFNSARGGDAVTA-------PLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
L S +G L + + TA P+ R V TFY V LTG SVGG + I
Sbjct: 332 L-PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAI 390
Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCY 415
PPS F G+++D GT IT L AY +LR +F L P S + DTCY
Sbjct: 391 PPSAFSS------GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY 444
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
DF+G +V VPT+SL F G +DL A ++ VD G FA A T +A+ IIGNV Q+
Sbjct: 445 DFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIGNVNQR 501
Query: 476 GTRVSFDLANNRVGFTPNKC 495
V +D VGF C
Sbjct: 502 TFEVLYDSGKGTVGFRAGAC 521
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/380 (33%), Positives = 209/380 (55%), Gaps = 26/380 (6%)
Query: 134 PAEAQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-T 191
P +L P S P+ G S GSG Y+ ++G+GTPP+ ++M+LDTGS ++WLQC+PC
Sbjct: 99 PKSGHLLEPNSASIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAV 158
Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR-------ANRCLYQVAYGDGSFT 244
C+ Q+DP++DP S +Y L CA+ +C L + +N CLY +YGD SF+
Sbjct: 159 YCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFS 218
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SL 301
+G L + ++ +S ++ GCG DN+GLF +AG++GL LS+ Q+ +
Sbjct: 219 IGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAF 278
Query: 302 AYCL--VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
+YCL + S G L S P++ + K + Y++ LT +V G+ + +
Sbjct: 279 SYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAA 338
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFS 418
+++ + ++D GT ITRL Y +LR +FV+ ++ ++ DTC+ S
Sbjct: 339 AMYRVPT------LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGS 392
Query: 419 GLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQ 475
L+S+ VP + + F G L L A + LI D G C AFA +S + ++IIGN QQQ
Sbjct: 393 -LKSISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLAFAGSSGTNQIAIIGNRQQQ 450
Query: 476 GTRVSFDLANNRVGFTPNKC 495
+++D++ +R+GF P C
Sbjct: 451 TYNIAYDVSTSRIGFAPGSC 470
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 151/450 (33%), Positives = 209/450 (46%), Gaps = 63/450 (14%)
Query: 79 SFSLPLHSREILHKTRHNDYR-SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
+ +P+ R+ L R SL+ RL D+AR +L+
Sbjct: 26 TLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGR--------------- 70
Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
+PV SG SGEYF+ +GVGTP + +V+DTGSD+ WLQC PC CY Q
Sbjct: 71 ------LHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQR 124
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTET 252
+FDP+ SS+Y +PC++PQC++L C + C Y VAYGDGS + GDL T+
Sbjct: 125 GQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDK 184
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
++F N V + LGCG DNEGLF +AGLLG +++ R +P+
Sbjct: 185 LAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGRRAAARYPSRRR--------WPRRTAPS 236
Query: 313 SGVLEFNSARGGDAVTAPLIRN------------------KKVDTFYYVGLTGFSVGGQA 354
S R A + T+ + G + G
Sbjct: 237 SSTASATGRRAQRAARTSCSAARRSRRPRRSPPCCRTRGARACTTWTWPGSASAARGSPG 296
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV---ALF 411
+ P S + GG++VD GTAI+R AY +LRD+F A ++F
Sbjct: 297 SRTPASRWTRRRG-RGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVF 355
Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS----AGTF--CFAFAPTSSA 465
D CYD G + P + LHF G + LP +NY +PVD A ++ C F
Sbjct: 356 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 415
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LS+IGNVQQQG RV FD+ R+GF P C
Sbjct: 416 LSVIGNVQQQGFRVVFDVEKERIGFAPKGC 445
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 142/359 (39%), Positives = 198/359 (55%), Gaps = 21/359 (5%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G EY + +GTPP F + DTGSD+ W QC+PC C+ Q PI+D SSS+SP+PC
Sbjct: 89 GQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPC 148
Query: 215 AAPQCKSLDVSA-CRANR--CLYQVAYGDGSFTVGDLVTETVSF-GNSG-SVKGIALGCG 269
A+ C + S C A+ C Y+ AYGDG+++ G L TET++F G G SV GIA GCG
Sbjct: 149 ASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAFGCG 208
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVL-----EFNSAR 322
DN GL S G +GLG G LSL Q+ +YCL D S S VL E +
Sbjct: 209 VDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPS 268
Query: 323 GGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
G AV + PL+++ V T+YYV L G S+G + IP F++ + G GG+IVD GT T
Sbjct: 269 TGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFT 328
Query: 382 RLQTQAYNSLRDSFVRLAGNLK-PTSGVALFDT-CYD-FSGLRSV-RVPTVSLHFGAGKA 437
L A+ + D +AG L+ P + D+ C+ +G + + +P + LHF G
Sbjct: 329 FLVESAFRVVVD---HVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHFAGGAD 385
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L NY+ +FC A + SA +SI+GN QQQ ++ FD+ ++ F P C
Sbjct: 386 MRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQLSFMPTDC 444
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 158/455 (34%), Positives = 222/455 (48%), Gaps = 52/455 (11%)
Query: 63 EESETAAESFPLNSSSSFSLPLHSRE-----ILHKTRHNDYRSLVLSRLERDSARVNTLI 117
E SE + +S + +LPL R ++ K + + +L RD R +
Sbjct: 42 EPSEVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEETL-----GRDQLRAANIH 96
Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
KL + N EL+ + I P SG S G+ EY + +GTP M +D
Sbjct: 97 AKLS-SPRNSSAKELQQSGVTI-------PTSSGYSLGTPEYVITVSLGTPAVTQVMSID 148
Query: 178 TGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCL 233
TGSD++W+QC PC C Q D +FDP S++YS C++ QC L + + C + C
Sbjct: 149 TGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQ 208
Query: 234 YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSL 292
Y V Y D S T G ++T+ S +VK GC H G FVG GL+GLGG SL
Sbjct: 209 YIVKYVDHSNTTGTYGSDTLGLTTSDAVKNFQFGCSHRANG-FVGQLDGLMGLGGDTESL 267
Query: 293 TKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGL 345
Q AT + +YCL S A G L +A GG + + PL+R V TFY V L
Sbjct: 268 VSQTAATYGKAFSYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFN-VPTFYGVFL 326
Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
+V G + +P S+F G +VD GT IT+L AY +LR +F +
Sbjct: 327 QAITVAGTKLNVPASVFS------GASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSA 380
Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF---CFAFAPT 462
+ V + DTC+DFSG+++VRVP V+L F G +DL D +G F C AF T
Sbjct: 381 APVGILDTCFDFSGIKTVRVPVVTLTFSRGAVMDL---------DVSGIFYAGCLAFTAT 431
Query: 463 SS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ I+GNVQQ+ + FD+ + +GF P C
Sbjct: 432 AQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 146/401 (36%), Positives = 198/401 (49%), Gaps = 31/401 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L RD R + K+ NV + EL+ + I P SG S G+ EY + +
Sbjct: 84 LRRDQLRAAYIQAKVSSRYNNVAK-ELQQSAVTI-------PTSSGYSLGTTEYVITVTI 135
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL- 222
GTP M +DTGSD++W+QC PC C Q D +FDP S++YS C + QC L
Sbjct: 136 GTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLG 195
Query: 223 -DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA- 280
+ + C ++C Y V YGDGS T G ++T+S +S +VK GC H G FVG
Sbjct: 196 DEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAVKSFQFGCSHRAAG-FVGELD 254
Query: 281 GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVT---APLIRN 334
GL+GLGG SL Q AT + +YCL S G L +A G + P++R
Sbjct: 255 GLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVRF 314
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
V TFY V L G +V G + +P S+F G +VD GT IT+L AY +LR +
Sbjct: 315 S-VPTFYGVFLQGITVAGTMLNVPASVFS------GASVVDSGTVITQLPPTAYQALRTA 367
Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
F + + V DTC+DFSG ++ VPTV+L F G A+DL L AG
Sbjct: 368 FKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAMDLDISGILY----AGC 423
Query: 455 FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
F I+GNVQQ+ + FD+ +GF C
Sbjct: 424 LAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 188/379 (49%), Gaps = 37/379 (9%)
Query: 148 VVSGASQGSG----EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ-SDPIFD 202
V +G G G EY + VGTPPR ++ LDTGSD+ W QC PC +C++Q + P+ D
Sbjct: 75 VRAGLGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLD 134
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
P SS+++ LPC AP C++L ++C C+Y YGD S TVG L T++ +FG
Sbjct: 135 PAASSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGG 194
Query: 258 SGSVKGIA-----LGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR-DS 310
+ G+A GCGH N+G+F G+ G G G SL Q+ TS +YC D+
Sbjct: 195 DDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDT 254
Query: 311 PASGVLEF-----------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
+S V+ ++A GD T LI+N + Y+V L G SVGG V +P
Sbjct: 255 KSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE 314
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-- 417
S I+D G +IT L Y +++ FV G +G A D C+
Sbjct: 315 SRLR------SSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPV 368
Query: 418 -SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQG 476
+ R VP ++LH G +LP NY+ +A C + +IGN QQQ
Sbjct: 369 AALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQN 428
Query: 477 TRVSFDLANNRVGFTPNKC 495
T V +DL N+ + F P +C
Sbjct: 429 THVVYDLENDVLSFAPARC 447
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 151/448 (33%), Positives = 228/448 (50%), Gaps = 34/448 (7%)
Query: 59 EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
P + + +++ P +S+ + ++PLH R + L RD R +
Sbjct: 37 SPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQR 96
Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
K A + D + P G S + EY +G+G+P +M++DT
Sbjct: 97 KFSGGGG---------AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDT 147
Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQ 235
GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP C + C L + + C +++C Y
Sbjct: 148 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYI 207
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
V YGDGS T G ++T++ G+S +V+ GC + G + GL+GLGGG SL Q
Sbjct: 208 VTYGDGSSTTGTYSSDTLALGSS-AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQ 266
Query: 296 IKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFS 349
T + +YCL S +SG L +A G V P++R+ +V TFY V L
Sbjct: 267 TAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIR 325
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
VGG+ + IP S+F G ++D GT ITRL AY++L +F P
Sbjct: 326 VGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSG 379
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALS 467
+ DTC+DFSG SV +P+V+L F G + L A ++ + C AFA S S+L
Sbjct: 380 ILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAGNSDDSSLG 433
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGNVQQ+ V +D+ VGF C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 141/407 (34%), Positives = 204/407 (50%), Gaps = 33/407 (8%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHE-LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
+ D+ RV + ++L N+ R +K ++ LP + SG+ GS Y +G
Sbjct: 1 MNLDNERVKYIQSRLS---KNLGRENTVKDLDSTTLPAE------SGSLIGSANYVVVVG 51
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL- 222
+GTP R S+V DTGSD+ W QC PC CY+Q D IFDP SSSY+ + C + C L
Sbjct: 52 LGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLT 111
Query: 223 ------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
+ S+ C+Y YGD S +VG L E ++ + V GCG DNEGLF
Sbjct: 112 SDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLF 171
Query: 277 VGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA--VTAPL 331
GSAGL+GLG +S+ +Q + +YCL S + G L F ++ +A + PL
Sbjct: 172 NGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL-PATSSSLGHLTFGASAATNASLIYTPL 230
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
++FY + + SVGG + + S F GG I+D GT ITRL Y +
Sbjct: 231 STISGDNSFYGLDIVSISVGGTKLPAVSSSTFSA-----GGSIIDSGTVITRLAPTVYAA 285
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
LR +F R + L DTCYD SG + + VP + F G ++L + ++ V+
Sbjct: 286 LRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRG-ILXVE 344
Query: 451 SAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S C AFA S +++ GNVQQ+ V +D+ R+GF C
Sbjct: 345 SEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 185/366 (50%), Gaps = 36/366 (9%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+ EY R+ VGTP R ++ LDTGSD+ W QC PC +C+ Q P+ DP SS+Y+ LPC
Sbjct: 81 TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140
Query: 216 APQCKSLDVSAC------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSG------SVKG 263
A +C++L ++C C+Y YGD S TVG++ T+ +FG+SG +
Sbjct: 141 AARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRR 200
Query: 264 IALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE----- 317
+ GCGH N+G+F + G+ G G G SL Q+ TS +YC S ++
Sbjct: 201 LTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTLGGSP 260
Query: 318 ---FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++ A G+ T P+++N + Y++ L G SVG + +P + F I+
Sbjct: 261 AALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR-------STII 313
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDF---SGLRSVRVPTVS 429
D G +IT L + Y +++ F G P SGV + D C+ + R VP+++
Sbjct: 314 DSGASITTLPEEVYEAVKAEFAAQVG--LPPSGVEGSALDLCFALPVTALWRRPAVPSLT 371
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
LH G +LP NY+ A C ++IGN QQQ T V +DL N+R+
Sbjct: 372 LHL-EGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRLS 430
Query: 490 FTPNKC 495
F P +C
Sbjct: 431 FAPARC 436
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 151/448 (33%), Positives = 228/448 (50%), Gaps = 34/448 (7%)
Query: 59 EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
P + + +++ P +S+ + ++PLH R + L RD R +
Sbjct: 107 SPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQR 166
Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
K A + D + P G S + EY +G+G+P +M++DT
Sbjct: 167 KFSGGGG---------AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDT 217
Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQ 235
GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP C + C L + + C +++C Y
Sbjct: 218 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYI 277
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
V YGDGS T G ++T++ G+S +V+ GC + G + GL+GLGGG SL Q
Sbjct: 278 VTYGDGSSTTGTYSSDTLALGSS-AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQ 336
Query: 296 IKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFS 349
T + +YCL S +SG L +A G V P++R+ +V TFY V L
Sbjct: 337 TAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIR 395
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
VGG+ + IP S+F G ++D GT ITRL AY++L +F P
Sbjct: 396 VGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSG 449
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALS 467
+ DTC+DFSG SV +P+V+L F G + L A ++ + C AFA S S+L
Sbjct: 450 ILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAGNSDDSSLG 503
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGNVQQ+ V +D+ VGF C
Sbjct: 504 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 149/415 (35%), Positives = 211/415 (50%), Gaps = 49/415 (11%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
RL D AR + ++ K R + S P G S EY +G
Sbjct: 83 RLRSDRARADHILRKAS------GRRMMSEGGGA------SIPTYLGGFVDSLEYVVTLG 130
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+GTP Q ++++DTGSD++W+QC+PC ++CY Q DP+FDP SS+++ +PCA+ CK L
Sbjct: 131 IGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCASDACKQL 190
Query: 223 DV----SACRAN------RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
V + C N +C Y + YG+G+ T G TET++ G+S VK GCG D
Sbjct: 191 PVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALGSSAVVKSFRFGCGSDQ 250
Query: 273 EGLFVGSAGLLGLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEF------NSARG 323
G + GLLGLGG ++S T + + +YCL +S +G L N++
Sbjct: 251 HGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNS-GAGFLTLGAPNSTNNSNS 309
Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
G T + K+ TFY V LTG SVGG+A+ IPP++F G IVD GT IT +
Sbjct: 310 GFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAK------GNIVDSGTVITGI 363
Query: 384 QTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
T AY +LR +F L P + AL DTCY+F+G +V VP V+L F G +DL
Sbjct: 364 PTTAYKALRTAFRSAMAEYPLLPPADSAL-DTCYNFTGHGTVTVPKVALTFVGGATVDLD 422
Query: 442 AKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ D C AFA + IIGNV + V +D +GF C
Sbjct: 423 VPSGVLVED-----CLAFADAGDGSFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 142/384 (36%), Positives = 196/384 (51%), Gaps = 27/384 (7%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
R + A D STP S G Y VGTPP + + DTGSDI WLQC
Sbjct: 58 RRSINRANHFFKDSDTSTPE-STVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE 116
Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGD 247
PC +CY Q+ PIF+P SSSY +PC + C S+ D S N C Y+++YGD S + GD
Sbjct: 117 PCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGD 176
Query: 248 LVTETVSF----GNSGSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT--- 299
L +T+S G+ S +GCG DN G F G S+G++GLGGG +SL Q+ ++
Sbjct: 177 LSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGG 236
Query: 300 SLAYCLV---DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
+YCLV +++S AS +L F A G V+ PLI+ V FY++ L FSVG +
Sbjct: 237 KFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVGNK 294
Query: 354 AVQIPPSLFEMDEAGD--GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
V+ S E GD G II+D GT +T + + Y +L + V L + F
Sbjct: 295 RVEFGGS----SEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQF 350
Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
CY P ++ HF G ++L + + +P+ + G CFAF P+ SI GN
Sbjct: 351 SLCYSLKS-NEYDFPIITAHF-KGADIELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGN 407
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
+ QQ V +DL V F P C
Sbjct: 408 LAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 162/469 (34%), Positives = 228/469 (48%), Gaps = 44/469 (9%)
Query: 46 QTEHILSFEPE-TLEPFAEESETAAESFPLNSSSSFSLPL-HSREILHKTRHNDYRSLVL 103
EH P + EP A S ++ P SS++ S+PL H ++++D +
Sbjct: 22 DNEHGFVVVPRRSYEPKAVCSASSVNLEP--SSATLSVPLVHRYGPCAASQYSDMPTPSF 79
Query: 104 SRLERDS-ARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
S R S AR N + ++ + + P +A + + P G S EY
Sbjct: 80 SETLRHSRARTNYIKSRASTGMAST------PDDAAV-----TVPTRLGGFVDSLEYMVT 128
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+G GTP +++DTGSD++W+QC PC TECY Q DP+FDP SS+Y+P+ C A C
Sbjct: 129 LGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACN 188
Query: 221 SLD---VSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
L + C + +C Y+V YGDGS T G ET++F +VK GCGHD G
Sbjct: 189 KLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAPGITVKDFHFGCGHDQRGP 248
Query: 276 FVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPAS----GVLEFNSARGGDAVT 328
GLLGLGG SL Q + + +YCL +S A GV + V
Sbjct: 249 SDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAATNTSAFVF 308
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
P+ T Y V +TG SVGG+ + IP S F GG+++D GT +T L AY
Sbjct: 309 TPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFR------GGMLIDSGTIVTELPETAY 362
Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
N+L ++ +R A P FDTCY+F+G +V VP V+L F G +DL N ++
Sbjct: 363 NAL-NAALRKAFAAYPMVASEDFDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILV 421
Query: 449 VDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D C AF + L IIGNV Q+ V +D + +VGF C
Sbjct: 422 KD-----CLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 198/371 (53%), Gaps = 38/371 (10%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
+ P G G+ Y +GTP +M +DTGSD++W+QC+PC+ CY Q DP+F
Sbjct: 126 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLF 185
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNS 258
DP SSSY+ +PC P C L + A A Y V+YGDGS T G ++T++ S
Sbjct: 186 DPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSAS 245
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGV 315
+V+G GCGH GLF G GLLGLG SL +Q T +YCL + S A G
Sbjct: 246 SAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GY 304
Query: 316 LEFNSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
L GG + AP L+ + T+Y V LTG SVGGQ + +P S F
Sbjct: 305 LTLG--LGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA------ 356
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPT 427
GG +VD GT ITRL AY +LR +F +A PT+ + DTCY+F+G +V +P
Sbjct: 357 GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 416
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLA 484
V+L FG+G + L A L +F C AFAP+ S ++I+GNVQQ+ V D
Sbjct: 417 VALTFGSGATVMLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID-- 467
Query: 485 NNRVGFTPNKC 495
VGF P+ C
Sbjct: 468 GTSVGFKPSSC 478
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 167/466 (35%), Positives = 223/466 (47%), Gaps = 46/466 (9%)
Query: 52 SFEPETLEPFAEESETAAESFPLNSSSSFSLPL-HSREILHKTRHNDYRSLVLSRLERDS 110
SFEPE A S ++A S P +S +PL H + + + + RL RD
Sbjct: 24 SFEPE-----AACSTSSANSDPNRAS----VPLVHRHGPCAPSAASGGKPSLAERLRRDR 74
Query: 111 ARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPR 170
AR N ++TK R + S P G S S EY +G+GTP
Sbjct: 75 ARANYIVTKAAGG-----RTAATAVSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPAV 129
Query: 171 QFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA-- 226
Q +++DTGSD++W+QC+PC ECY Q DP+FDP +SSSY+ +PC + C+ L A
Sbjct: 130 QQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYG 189
Query: 227 --CR---ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAG 281
C A C Y + YG+ + T G TET++ V GCG G + G
Sbjct: 190 HGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDG 249
Query: 282 LLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA-------PL 331
LLGLGG SL Q + +YCL S +G L + + TA P+
Sbjct: 250 LLGLGGAPESLVSQTSSQFGGPFSYCL-PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPM 308
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
R V TFY V LTG SVGG + +PPS F G+++D GT IT L AY +L
Sbjct: 309 RRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSS------GMVIDSGTVITGLPATAYAAL 362
Query: 392 RDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
R +F L P S A+ DTCYDF+G +V VPT++L F G +DL ++ V
Sbjct: 363 RSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVL-V 421
Query: 450 DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D G FA A T + IIGNV Q+ V +D VGF C
Sbjct: 422 D--GCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 189/362 (52%), Gaps = 28/362 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP+ ++LDTGSD+ W QCRPC C+ ++ DP SS++ LPC++P
Sbjct: 414 EYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSP 473
Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS-----GSVKGIALG 267
C +L S+C + C+Y AY DGS T G L ET +F + +V +A G
Sbjct: 474 VCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFG 533
Query: 268 CGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVL----EFNS 320
CG N G+F + G+ G G G LSL Q+K + ++C + P+S +L S
Sbjct: 534 CGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAITGSEPSSVLLGLPANLYS 593
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
G + PL++N YY+ L G +VG + IP S F + + G GG I+D GT +
Sbjct: 594 DADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGM 653
Query: 381 TRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVR--VPTVSLHFGAG 435
T L AY + D+F VRL + +S ++ C+ FS R + VP + LHF G
Sbjct: 654 TTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRL--CFSFSVPRRAKPDVPKLVLHF-EG 710
Query: 436 KALDLPAKNYLIPVDSAG--TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
LDLP +NY+ + AG C A L+IIGN QQQ V +DL N + F P
Sbjct: 711 ATLDLPRENYMFEFEDAGGSVTCLAIN-AGDDLTIIGNYQQQNLHVLYDLVRNMLSFVPA 769
Query: 494 KC 495
+C
Sbjct: 770 QC 771
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 148/401 (36%), Positives = 213/401 (53%), Gaps = 27/401 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L +D +RV+++ +KL + ++K A LP G+ GSG YF +G+
Sbjct: 109 LLQDQSRVDSIHSKLS---KDSGLSDVKATAATTLPAK------DGSIIGSGNYFVTVGL 159
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
GTP + FS++ DTGSD+ W QC PC + CY Q + IF+P S+SY+ + C + C SL
Sbjct: 160 GTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLAS 219
Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
++ C ++ C+Y + YGD SF++G E +S + GCG +N+GLF G+
Sbjct: 220 ATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGA 279
Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
AGLLGLG LSL T Q +YCL S ++G L F + A PL
Sbjct: 280 AGLLGLGRDKLSLVSQTAQRYNKIFSYCL-PSSSSSTGFLTFGGSTSKSASFTPLATISG 338
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+FY + LTG SVGG+ + I PS+F G I+D GT ITRL AY++L +F
Sbjct: 339 GSSFYGLDLTGISVGGRKLAISPSVFST-----AGTIIDSGTVITRLPPAAYSALSSTFR 393
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
+L +++ DTC+DFS ++ VP + L F G +D+ K + V+ C
Sbjct: 394 KLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDID-KTGIFYVNDLTQVC 452
Query: 457 FAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFA S A ++I GNVQQ+ V +D A RVGF P C
Sbjct: 453 LAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGC 493
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/363 (35%), Positives = 198/363 (54%), Gaps = 24/363 (6%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKT 205
P+ GAS GSG Y+ ++G+G+P R +SM++DTGS ++WLQC+PC C+ Q+DP+FDP
Sbjct: 1 PLNPGASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSA 60
Query: 206 SSSYSPLPCAAPQCKSL-------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
S +Y L C + QC SL + +N C+Y +YGD S+++G L + ++ S
Sbjct: 61 SKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPS 120
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV 315
++ G GCG D+EGLF +AG+LGLG LS+ Q+ + + +YCL R G
Sbjct: 121 QTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRG--GGGF 178
Query: 316 LEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
L A G P+ + + Y++ LT +VGG+A+ + + + + I
Sbjct: 179 LSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------I 232
Query: 374 VDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
+D GT ITRL Y + +FV+ ++ G ++ DTC+ + VP V L F
Sbjct: 233 IDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIF 292
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G L+L N L+ VD G C AFA ++ ++IIGN QQQ +V+ D++ R+GF
Sbjct: 293 QGGADLNLRPVNVLLQVDE-GLTCLAFA-GNNGVAIIGNHQQQTFKVAHDISTARIGFAT 350
Query: 493 NKC 495
C
Sbjct: 351 GGC 353
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/355 (36%), Positives = 190/355 (53%), Gaps = 16/355 (4%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
S G GE+ I +GTPP++ +++DTGSD+ W+Q PC C++Q+DPIFDP SS+Y+ +
Sbjct: 19 SAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKI 78
Query: 213 PCAAPQCKS-LDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
C++ C L C A C+Y YGDGS T G ET++ ++ + + G
Sbjct: 79 ACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEE-VKFGASV 137
Query: 271 DNEGLF--VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPAS--GVLEFNSAR- 322
N G F G G+LGLG G +S+ Q+ + +YCLVD S S + F A
Sbjct: 138 YNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAV 197
Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
G+ P++ N T+YY+ + G SVGG + I S++E+D G GG I+D GT IT
Sbjct: 198 PSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTIT 257
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
LQ + +N+L ++ TS L D C++ G S P +++H G L+LP
Sbjct: 258 YLQQEVFNALVAAYTSQVRYPTTTSATGL-DLCFNTRGTGSPVFPAMTIHLD-GVHLELP 315
Query: 442 AKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N I +++ C AFA ++I GN+QQQ + +DL N R+GF P C
Sbjct: 316 TANTFISLET-NIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADC 369
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 152/368 (41%), Positives = 208/368 (56%), Gaps = 38/368 (10%)
Query: 3 PIKPFV-LFTITTILFSFCLFTSASSRGLSET---ATTVLDVSSALQQTEHILSFEPETL 58
P+ PF L + +LF SA SR +S A LDV+S+L++T+
Sbjct: 10 PLLPFTFLLCVGMLLF----LQSAQSRPISVPEVPAYHALDVASSLRETDTA-------- 57
Query: 59 EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHN---DYRSLVLSRLERDSARVNT 115
A +E E+ P S S + +H +L K N Y + +L R++ RV
Sbjct: 58 ---AGGAEYKRETKPRRSPWSVEV-VHRDALLLKNAANATASYERRLKEKLRREAVRVRG 113
Query: 116 LITKLQLAIY----NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
L +++ + V+R+E AE DF VVSG QGSGEYF+RIGVGTP R+
Sbjct: 114 LERQIERTLTLNKDPVNRYE-NVAEVDA---DFGGEVVSGMEQGSGEYFTRIGVGTPTRE 169
Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
MVLDTGSD+ W+QC PC ECY Q+DPIF+P S+S+S + C + C LD C +
Sbjct: 170 QYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGG 229
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
CLY+ +YGDGS++ G TET++FG + SV +A+GCGH N GLF+G+AGLLGLG G LS
Sbjct: 230 CLYEASYGDGSYSTGSFATETLTFGTT-SVANVAIGCGHKNVGLFIGAAGLLGLGAGALS 288
Query: 292 LTKQI---KATSLAYCLVDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLT 346
QI + +YCLVDR+S +SG L+F S G T PL +N + TFYY+ +T
Sbjct: 289 FPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSVPVGSIFT-PLEKNPHLPTFYYLSVT 347
Query: 347 GFSVGGQA 354
S+ A
Sbjct: 348 AISISAIA 355
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 144/401 (35%), Positives = 215/401 (53%), Gaps = 32/401 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
++R R + KLQ+ V+ H++K E + P+ GSGEY ++ +
Sbjct: 1 MKRAIQRSQERLEKLQIT-SAVNTHQMKDIETPVTPD-----------IGSGEYLIQMAI 48
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP S ++DTGSD+ W +C PCT+C + I+DP +SS+YS + C + C+ +
Sbjct: 49 GTPALSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLCQSSLCQPPSIF 106
Query: 226 ACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
+C + C Y YGD S T G L ET S +S S+ I GCGHDN+G F GL+G
Sbjct: 107 SCNNDGDCEYVYPYGDRSSTSGILSDETFSI-SSQSLPNITFGCGHDNQG-FDKVGGLVG 164
Query: 285 LGGGMLSLTKQI---KATSLAYCLVDR-DSPASGVLEFNSARGGDAVTA---PLIRNKKV 337
G G LSL Q+ +YCLV R DS + L + +A T PL+++
Sbjct: 165 FGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSST 224
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
+ YY+ L G SVGGQ++ IP F++ G GG+I+D GT +T LQ AY++++++ V
Sbjct: 225 N-HYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVS 283
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
+ NL G D C++ G + P+++ HF G D+P +NYL P ++ C
Sbjct: 284 -SINLPQADGQ--LDLCFNQQGSSNPGFPSMTFHF-KGADYDVPKENYLFPDSTSDIVCL 339
Query: 458 AFAPTSSAL---SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A PT+S L +I GNVQQQ ++ +D NN + F P C
Sbjct: 340 AMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 144/372 (38%), Positives = 190/372 (51%), Gaps = 26/372 (6%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
ED P+ SG + S Y ++G GTPP+ F VLDTGS+I W+ C PC+ C + P F
Sbjct: 107 EDADIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-F 165
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
+P SS+Y+ L CA+ QC+ L V N C YGD S L +ET+S G S
Sbjct: 166 EPSKSSTYNYLTCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVG-SQ 224
Query: 260 SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPA-SGV 315
V+ GC + GL + L+G G LS Q + ++ +YCL S A +G
Sbjct: 225 QVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGS 284
Query: 316 LEFNSARGGDAVTA------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
L G +A++A PL+ N + +FYYVGL G SVG + V IP +DE+
Sbjct: 285 LLL----GKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTG 340
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTV 428
G I+D GT ITRL AYN++RDSF NL S LFDTCY+ SG V P +
Sbjct: 341 RGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSG--DVEFPLI 398
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGT-FCFAF----APTSSALSIIGNVQQQGTRVSFDL 483
+LHF L LP N L P + G+ C AF LS GN QQQ R+ D+
Sbjct: 399 TLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDV 458
Query: 484 ANNRVGFTPNKC 495
A +R+G C
Sbjct: 459 AESRLGIASENC 470
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 133/422 (31%), Positives = 221/422 (52%), Gaps = 42/422 (9%)
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
D+ + +R+ D+ VN+L + + AI+ H+L +++QI P+ SGA +
Sbjct: 92 DWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQL--SDSQI-------PISSGARLQT 142
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
Y +G+G + ++++DTGSD+ W+QC PC CY Q +P+F+P SSS+ LPC +
Sbjct: 143 LNYIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNS 200
Query: 217 PQCKSLDVSA-----C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
P C +L +A C + C YQ+ YGDGS++ G+L E ++ G + + GC
Sbjct: 201 PTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-EIDNFIFGC 259
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL-----EFNS 320
G +N+GLF G++GL+GL LSL Q + + +YCL +SG L +F++
Sbjct: 260 GRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSN 319
Query: 321 ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI--IVDCG 377
+ ++ +I+N ++ FY++ LTG S+GG + +P + + G+ ++D G
Sbjct: 320 FKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP------RLSSNEGVLSLLDSG 373
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAG 435
T ITRL Y + + F + + T G ++ +TC++ +G V +PTV F A
Sbjct: 374 TVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAE 433
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+D+ Y + D A C AFA IIGN QQ+ RV ++ ++VGF
Sbjct: 434 MIVDVEGVFYFVKSD-ASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGE 492
Query: 494 KC 495
C
Sbjct: 493 PC 494
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 148/410 (36%), Positives = 207/410 (50%), Gaps = 41/410 (10%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L RD RV+ + K+ + KP L ++ G S + Y + + +
Sbjct: 99 LRRDQDRVDAIRRKVTAS-------SNKPKGGVSLLANW------GKSLSTTNYVASLRL 145
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--- 222
GTP + + LDTGSD +W+QC+PC +CY+Q DP+FDP SS+YS +PC A +C+ L
Sbjct: 146 GTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARECQELASS 205
Query: 223 ----DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGSVKGIALGCGHDN 272
+ S+ C Y+V+Y D S TVGDL +T++ + +V G GCGH N
Sbjct: 206 SSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSN 265
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
G F GLLGLG G SL Q+ A + +YCL S A+G L F A
Sbjct: 266 AGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS-AAGYLSFGGAAARANAQF 324
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
+ + T YY+ LTG V G+A+++P S F A G I+D GTA +RL AY
Sbjct: 325 TEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAF----ATAAGTIIDSGTAFSRLPPSAYA 380
Query: 390 SLRDSFVRLAGNLK----PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
+LR SF G + P+S +FDTCYDF+G +VR+P V L F G + L
Sbjct: 381 ALRSSFRSAMGRYRYKRAPSS--PIFDTCYDFTGHETVRIPAVELVFADGATVHLHPSGV 438
Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L + C AF P L I+GN QQ+ V +D+ + R+GF C
Sbjct: 439 LYTWNDVAQTCLAFVPNHD-LGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 156/455 (34%), Positives = 227/455 (49%), Gaps = 50/455 (10%)
Query: 68 AAESFPLNSSSSFSLPLHSRE-----ILHKT-RHNDYRSLVLSRLERDSARVNTLITKLQ 121
+A SF +S+ S S P+ ++ +L T RH L S L S +TL +
Sbjct: 39 SAASFAPSSTCSASDPVAPQQNDTFTVLRLTHRHGPCAPLRASSLAAPSV-ADTLRADQR 97
Query: 122 LAIYNVDRHELKPAEAQILPEDF-------STPVVSGASQGSGEYFSRIGVGTPPRQFSM 174
A H L+ + P+ + + P G G+ Y +GTP ++
Sbjct: 98 RA-----EHILRRVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPGMAQTL 152
Query: 175 VLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRAN 230
+DTGSD++W+QC+PC CY+Q DP+FDP SSSY+ +PC C L + SAC A
Sbjct: 153 EVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGIYASACSAA 212
Query: 231 RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH-DNEGLFVGSAGLLGLGGGM 289
+C Y V+YGDGS T G ++T++ + +V+G GCGH + GLF G GLLG G
Sbjct: 213 QCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQ 272
Query: 290 LSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIRNKKVDTF 340
SL +Q +YCL + S +G L GG + AP L+ + T+
Sbjct: 273 PSLVQQTAGAYGGVFSYCLPTKSS-TTGYLTL----GGPSGVAPGFSTTQLLPSPNAPTY 327
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
Y V LTG SVGGQ + +P S F G +VD GT ITRL AY +LR +F
Sbjct: 328 YVVMLTGISVGGQPLSVPASAFAA------GTVVDTGTVITRLPPAAYAALRSAFRSGMA 381
Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
+ + + DTCY F+G +V + +V+L F +G + L A + S G FA +
Sbjct: 382 SYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM----SFGCLAFASS 437
Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +++I+GNVQQ+ V D + VGF P+ C
Sbjct: 438 GSDGSMAILGNVQQRSFEVRID--GSSVGFRPSSC 470
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 133/422 (31%), Positives = 221/422 (52%), Gaps = 42/422 (9%)
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
D+ + +R+ D+ VN+L + + AI+ H+L +++QI P+ SGA +
Sbjct: 13 DWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQL--SDSQI-------PISSGARLQT 63
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
Y +G+G + ++++DTGSD+ W+QC PC CY Q +P+F+P SSS+ LPC +
Sbjct: 64 LNYIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNS 121
Query: 217 PQCKSLDVSA-----C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
P C +L +A C + C YQ+ YGDGS++ G+L E ++ G + + GC
Sbjct: 122 PTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-EIDNFIFGC 180
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL-----EFNS 320
G +N+GLF G++GL+GL LSL Q + + +YCL +SG L +F++
Sbjct: 181 GRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSN 240
Query: 321 ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI--IVDCG 377
+ ++ +I+N ++ FY++ LTG S+GG + +P + + G+ ++D G
Sbjct: 241 FKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP------RLSSNEGVLSLLDSG 294
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAG 435
T ITRL Y + + F + + T G ++ +TC++ +G V +PTV F A
Sbjct: 295 TVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAE 354
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+D+ Y + D A C AFA IIGN QQ+ RV ++ ++VGF
Sbjct: 355 MIVDVEGVFYFVKSD-ASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGE 413
Query: 494 KC 495
C
Sbjct: 414 PC 415
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/371 (34%), Positives = 191/371 (51%), Gaps = 20/371 (5%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
DF +PVVSG++ GSG+YF +GTPP++FS+++D+GSD+ W+QC PC +CY Q P++
Sbjct: 48 HDFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLY 107
Query: 202 DPKTSSSYSPLPCAAPQC------KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
P SS+++P+PC +P+C + C Y+ Y D S + G E+ +
Sbjct: 108 APSNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV 167
Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSP- 311
+ + +A GCG DN+G F + G+LGLG G LS Q+ AYCLV+ P
Sbjct: 168 DDV-RIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPT 226
Query: 312 -ASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
S L F + D P++ N + T YYV + VGG+++ I S + +D
Sbjct: 227 SVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFL 286
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
G+GG I D GT +T AY ++ +F + + S V D C D +G+ P+
Sbjct: 287 GNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAAS-VQGLDLCVDVTGVDQPSFPS 345
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA--PTS-SALSIIGNVQQQGTRVSFDLA 484
++ G G NY + V + C A A P+S + IGN+ QQ V +D
Sbjct: 346 FTIVLGGGAVFQPQQGNYFVDV-APNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDRE 404
Query: 485 NNRVGFTPNKC 495
NR+GF P KC
Sbjct: 405 ENRIGFAPAKC 415
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 138/365 (37%), Positives = 202/365 (55%), Gaps = 25/365 (6%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
D + P G S + EY +G+G+P +M++DTGSD++W+QC+PC++C+ Q+DP+F
Sbjct: 35 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLF 94
Query: 202 DPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
DP +SS+YSP C + C L + + C +++C Y V YGDGS T G ++T++ G+S
Sbjct: 95 DPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS 154
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV 315
+V+ GC + G + GL+GLGGG SL Q T + +YCL S +SG
Sbjct: 155 -AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGF 212
Query: 316 LEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
L +A G V P++R+ +V TFY V L VGG+ + IP S+F G
Sbjct: 213 LTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGT 266
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
++D GT ITRL AY++L +F P + DTC+DFSG SV +P+V+L F
Sbjct: 267 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 326
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGF 490
G + L A ++ + C AFA S S+L IIGNVQQ+ V +D+ VGF
Sbjct: 327 SGGAVVSLDASGIIL------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGF 380
Query: 491 TPNKC 495
C
Sbjct: 381 RAGAC 385
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 141/419 (33%), Positives = 208/419 (49%), Gaps = 53/419 (12%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L++D ARV++++ + E + S P G S G+G Y +G+
Sbjct: 114 LDQDQARVDSILGMIT-------------NETSAVGPGVSLPAERGISVGTGNYVVSVGL 160
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
GTP R ++V DTGSD++W+QC PC+ CY+Q DP+F P SS++S + C A +C++
Sbjct: 161 GTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGARECRARQ 220
Query: 224 VSACRA----NRCLYQVAYGDGSFTVGDLVTETVSFG----------NSGSVKGIALGCG 269
+C +RC Y+V YGD S T G L +T++ G N + G GCG
Sbjct: 221 --SCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCG 278
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS--ARGG 324
+N GLF + GL GLG G +SL+ Q +YCL S A G L +
Sbjct: 279 ENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYLSLGTPVPAPA 338
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
A P++ +FYYV L G V G+A+++ + +IVD GT ITRL
Sbjct: 339 HAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP------LIVDSGTVITRLA 392
Query: 385 TQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGK--AL 438
+AY +LR +F+ G K +++ DTCYDF+ +V +P V+L F G ++
Sbjct: 393 PRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISV 452
Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D Y+ V A C AFAP S I+GN QQ+ V +D+A ++GF C
Sbjct: 453 DFSGVLYVAKVAQA---CLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAAKGC 508
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 136/403 (33%), Positives = 204/403 (50%), Gaps = 30/403 (7%)
Query: 107 ERDSARVNTLITKLQLAIYNVDRHELKP-AEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
E D R+N L+ +I V H P A A + P+ + V S GEY + +
Sbjct: 51 ETDLQRINN---ALRRSISRV--HHFDPIAAASVSPKAAESDVTSN----RGEYLMSLSL 101
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTPP + + DTGSD+ W QC+PC CY+Q DP+FDPK+S +Y C A QC LD S
Sbjct: 102 GTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQS 161
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEGLFVGS-A 280
C N C YQ +YGD S+T+G++ ++T++ G+ S +GCGH+N+G F +
Sbjct: 162 TCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGS 221
Query: 281 GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNS---ARGGDAVTAPLI 332
G++GLG G LSL Q+ ++ +YCLV S A S L F S G + PL+
Sbjct: 222 GIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLL 281
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
++ + +FY++ L SVG + ++ S G+G II+D GT +T + +++L
Sbjct: 282 SSETMSSFYFLTLEAMSVGNERIKFGDSSL---GTGEGNIIIDSGTTLTIVPDDFFSNLS 338
Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+ + CY S ++VP ++ HF G + L N + V S
Sbjct: 339 TAVGNQVEGRRAEDPSGFLSVCY--SATSDLKVPAITAHF-TGADVKLKPINTFVQV-SD 394
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA T+S +SI GNV Q V +++ + F P C
Sbjct: 395 DVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDC 437
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 135/364 (37%), Positives = 184/364 (50%), Gaps = 28/364 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP+ ++LDTGSD+ W QC PC C++QS P F+P S ++S LPC
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS------GSVKGIAL 266
C+ L S+C C+Y AY D S T G L ++T SF ++ SV +
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 267 GCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA-SGV---LE 317
GCG N G+FV + G+ G G LS+ Q+K + +YC SP GV L
Sbjct: 230 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLY 289
Query: 318 FNSARGGDAV--TAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++A GG V + LIR YY+ L G +VG + IP S+F + E G GG IV
Sbjct: 290 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 349
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT +T L YN + D+FV S +L C+ VP + LHF
Sbjct: 350 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHF-E 408
Query: 435 GKALDLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
G LDLP +NY+ ++ AG C A LS+IGN QQQ V +DLAN+ + F
Sbjct: 409 GATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 467
Query: 492 PNKC 495
P +C
Sbjct: 468 PARC 471
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 211 bits (536), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 135/364 (37%), Positives = 184/364 (50%), Gaps = 28/364 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP+ ++LDTGSD+ W QC PC C++QS P F+P S ++S LPC
Sbjct: 84 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 143
Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS------GSVKGIAL 266
C+ L S+C C+Y AY D S T G L ++T SF ++ SV +
Sbjct: 144 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 203
Query: 267 GCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA-SGV---LE 317
GCG N G+FV + G+ G G LS+ Q+K + +YC SP GV L
Sbjct: 204 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLY 263
Query: 318 FNSARGGDAV--TAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++A GG V + LIR YY+ L G +VG + IP S+F + E G GG IV
Sbjct: 264 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 323
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT +T L YN + D+FV S +L C+ VP + LHF
Sbjct: 324 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHF-E 382
Query: 435 GKALDLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
G LDLP +NY+ ++ AG C A LS+IGN QQQ V +DLAN+ + F
Sbjct: 383 GATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 441
Query: 492 PNKC 495
P +C
Sbjct: 442 PARC 445
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 135/364 (37%), Positives = 184/364 (50%), Gaps = 28/364 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP+ ++LDTGSD+ W QC PC C++QS P F+P S ++S LPC
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS------GSVKGIAL 266
C+ L S+C C+Y AY D S T G L ++T SF ++ SV +
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 267 GCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA-SGV---LE 317
GCG N G+FV + G+ G G LS+ Q+K + +YC SP GV L
Sbjct: 230 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLY 289
Query: 318 FNSARGGDAV--TAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++A GG V + LIR YY+ L G +VG + IP S+F + E G GG IV
Sbjct: 290 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 349
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT +T L YN + D+FV S +L C+ VP + LHF
Sbjct: 350 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHF-E 408
Query: 435 GKALDLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
G LDLP +NY+ ++ AG C A LS+IGN QQQ V +DLAN+ + F
Sbjct: 409 GATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 467
Query: 492 PNKC 495
P +C
Sbjct: 468 PARC 471
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 137/421 (32%), Positives = 211/421 (50%), Gaps = 40/421 (9%)
Query: 94 RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
+ D+ + L D RV +L ++++ +I++ + + ++QI P+ SG
Sbjct: 12 KSTDWNKKLQKSLILDDFRVRSLQSRIK-SIFS--GNNIDALDSQI-------PLSSGVR 61
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
+ Y + +G R ++++DTGSD+ W+QC+PC CY Q DP+F+P S SY +
Sbjct: 62 LQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTIL 119
Query: 214 CAAPQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
C + C+SL ++ C +N C Y V YGDGS+T GDL E ++ G + V
Sbjct: 120 CNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTT-HVSNFIF 178
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARG 323
GCG +N+GLF G++GL+GLG LSL Q A +YCL + ASG L
Sbjct: 179 GCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLILGGNSS 238
Query: 324 GDAVTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
T P +I N ++ TFY++ LTG S+GG A+Q P GI++D GT
Sbjct: 239 VYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAP-------NYRQSGILIDSGT 291
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
ITRL Y L+ F++ ++ DTC++ +G V +PT+ + F L
Sbjct: 292 VITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAEL 351
Query: 439 --DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
D+ Y + D A C A A S + IIGN QQ+ RV ++ +++GF
Sbjct: 352 TVDVTGIFYFVKTD-ASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEA 410
Query: 495 C 495
C
Sbjct: 411 C 411
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 137/419 (32%), Positives = 210/419 (50%), Gaps = 30/419 (7%)
Query: 95 HNDYRSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG 151
H+ S + RDS++ K Q + N R + A ++ + S S
Sbjct: 22 HSLRNSFSFELIHRDSSKSPLYKPAQNKFQHVV-NAARRSINRAN-RLFKDSLSNTPEST 79
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
GEY VGTPP V+DTGSDI WLQC+PC +CY+Q+ PIF+P SSSY
Sbjct: 80 VYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKN 139
Query: 212 LPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL 266
+PC++ C+S+ ++C + N C Y + + D S++ G+L ET++ G+S S +
Sbjct: 140 IPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVI 199
Query: 267 GCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLEFNS 320
GCGH+N G+F G ++G++GLG G +SLT Q+K++ +YCL+ DS + L F
Sbjct: 200 GCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGD 259
Query: 321 A---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE-MDEAGDGGIIVDC 376
A G V+ P ++ K FYY+ L FSVG + ++ FE +D++ +G II+D
Sbjct: 260 AAVVSGDGVVSTPFVK-KDPQAFYYLTLEAFSVGNKRIE-----FEVLDDSEEGNIILDS 313
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT +T L + Y +L + +L + L + CY + P ++ HF
Sbjct: 314 GTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITS-DQYDFPIITAHFKGAD 372
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P + D G C AF + + I GN+ Q V +DL N V F P+ C
Sbjct: 373 IKLNPISTFAHVAD--GVVCLAFTSSQTG-PIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 142/357 (39%), Positives = 187/357 (52%), Gaps = 35/357 (9%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPL 212
G+ +Y + +GTP ++ +DTGSD++W+QC+PC+ C Q D +FDP SS+YS +
Sbjct: 139 GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAV 198
Query: 213 PCAAPQCKSLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
PC A C L + + C ++C Y V+YGDGS T G ++T++ +V GCGH
Sbjct: 199 PCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGH 258
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFN--SARGGD 325
G+F G GLL LG +SL Q +YCL + S A+G L S+ G
Sbjct: 259 AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS-AAGYLTLGGPSSASGF 317
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
A T L+ TFY V LTG SVGGQ V +P S F GG +VD GT ITRL
Sbjct: 318 ATTG-LLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPP 370
Query: 386 QAYNSLRDSFVRLAGNLKPTS-----GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
AY +LR +F G + P + DTCYDFS V +PTV+L F G L L
Sbjct: 371 TAYAALRSAF---RGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLAL 427
Query: 441 PAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A L S+G C AFAP +I+GNVQQ+ V FD + VGF P C
Sbjct: 428 EAPGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/373 (33%), Positives = 191/373 (51%), Gaps = 27/373 (7%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SGAS G+GEYF + VGTPP+ ++LDTGSD++W+QC PC +C++Q+ P ++P SSSY
Sbjct: 161 SGASLGTGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSY 220
Query: 210 SPLPCAAPQCKSLD----VSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG---- 259
+ C P+C+ + + C+ C Y Y DGS T GD ET + +
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGK 280
Query: 260 ----SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA 312
V + GCGH N+G F G+ GLLGLG G LS Q+++ S +YCL D S
Sbjct: 281 EKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNT 340
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKV--------DTFYYVGLTGFSVGGQAVQIPPSLFEM 364
S + + + + K+ DTFYY+ + VGG+ + IP +
Sbjct: 341 SVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHW 400
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
G GG I+D G+ +T AY+ ++++F + + + + CY+ SG V
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVE 460
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFD 482
+P +HF G + PA+NY + C A P S L+IIGN+ QQ + +D
Sbjct: 461 LPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYD 520
Query: 483 LANNRVGFTPNKC 495
+ +R+G++P +C
Sbjct: 521 VKRSRLGYSPRRC 533
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 132/418 (31%), Positives = 209/418 (50%), Gaps = 40/418 (9%)
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIY--NVDRHELKPAEAQILPEDFSTPVVSGASQ 154
D+ + RL D+ ++ +L ++++ I N+D + QI P+ SG
Sbjct: 13 DWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNID----DSVDTQI-------PLTSGIRL 61
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
S Y + +G R+ ++++DTGSD++W+QC+PC CY Q DP+F+P S SY + C
Sbjct: 62 QSLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLC 119
Query: 215 AAPQCKSLDVS-----ACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
+ C+SL ++ C +N C Y V YGDGS+T G++ E ++ GN+ +V G
Sbjct: 120 NSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNT-TVNNFIFG 178
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGG 324
CG N+GLF G++GL+GLG LSL QI +YCL ++ ASG L
Sbjct: 179 CGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASGSLVMGGNSSV 238
Query: 325 DAVTAPLIRNKKVDT----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
T P+ + + FY++ LTG +VGG VQ P G +I+D GT I
Sbjct: 239 YKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAP-------SFGKDRMIIDSGTVI 291
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
+RL Y +L+ FV+ + D+C++ SG + V++P + ++F L++
Sbjct: 292 SRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNV 351
Query: 441 PAKNYLIPVDS-AGTFCFAFA--PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
V + A C A A P + IIGN QQ+ R+ +D + +GF C
Sbjct: 352 DVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 142/400 (35%), Positives = 197/400 (49%), Gaps = 25/400 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L +D RV ++ + + N H K +A I PV SG G+G Y ++ +
Sbjct: 2 LLQDQLRVKSM--HARFSNKNAGSH-FKEMQADI-------PVQSGIPLGAGNYLVKMAL 51
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
GTP S+ LDTGSDI W QC PC CY+Q+ FDP+ SSSY + C++ C+ +
Sbjct: 52 GTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITD 111
Query: 225 SA----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VG 278
S C ++ C+Y+V YGDGS++VG TE ++ S + GCG N G F +
Sbjct: 112 SGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIA 171
Query: 279 SAGLLGLGGGMLSLTKQIKATSL-AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
LG G L+L K +L YCL S ++G L PL K
Sbjct: 172 GLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKN 231
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
FY + + G SVGG + I S+F + G I+D GT ITRLQ Y++L F +
Sbjct: 232 TPFYGIDIKGLSVGGHVLPIDASVFS-----NAGAIIDSGTVITRLQPTVYSALSSKFQQ 286
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
L + T G ++ DTCYDFSG S+ VP +S F G +D+ L +++ C
Sbjct: 287 LMKDYPKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCL 346
Query: 458 AFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAP + GN QQQ V DLA R+GF P+ C
Sbjct: 347 AFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 137/364 (37%), Positives = 187/364 (51%), Gaps = 23/364 (6%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYS 210
A G+G Y + VGTPP F ++DTGSD+ W QC PCT C+ Q P++DP SS++S
Sbjct: 89 AENGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFS 148
Query: 211 PLPCAAPQCKSLDVS--ACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-------SGSV 261
LPCA+P C++L + AC A C+Y Y G FT G L +T++ G+ S S
Sbjct: 149 KLPCASPLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSF 207
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL-VDRDSPASGVL--EF 318
G+A GC N G G++G++GLG LSL QI +YCL D D+ AS +L
Sbjct: 208 AGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPILFGAL 267
Query: 319 NSARGGDAVTAPLIRN----KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+ G + L+RN ++ +YYV LTG +VG + + S F AG GG+IV
Sbjct: 268 ANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIV 327
Query: 375 DCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHF 432
D GT T L Y LR +F+ + AG L SG FD C++ +G VP + F
Sbjct: 328 DSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE-AGAADTPVPRLVFRF 386
Query: 433 GAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
G +P ++Y VD G C PT +S+IGNV Q V +DL F
Sbjct: 387 AGGAEYAVPRQSYFDAVDEGGRVACLLVLPT-RGVSVIGNVMQMDLHVLYDLDGATFSFA 445
Query: 492 PNKC 495
P C
Sbjct: 446 PADC 449
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 139/356 (39%), Positives = 183/356 (51%), Gaps = 33/356 (9%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPL 212
G+ +Y + +GTP ++ +DTGSD++W+QC+PC+ C Q D +FDP SS+YS +
Sbjct: 139 GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAV 198
Query: 213 PCAAPQCKSLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
PC A C L + + C ++C Y V+YGDGS T G ++T++ +V GCGH
Sbjct: 199 PCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGH 258
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA- 326
G+F G GLL LG +SL Q +YCL + S A+G L
Sbjct: 259 AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS-AAGYLTLGGPTSASGF 317
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
T L+ TFY V LTG SVGGQ V +P S F GG +VD GT ITRL
Sbjct: 318 ATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPPT 371
Query: 387 AYNSLRDSFVRLAGNLKP-----TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
AY +LR +F G + P + DTCYDFS V +PTV+L F G L L
Sbjct: 372 AYAALRSAF---RGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALE 428
Query: 442 AKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A L S+G C AFAP +I+GNVQQ+ V FD + VGF P C
Sbjct: 429 APGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 163/468 (34%), Positives = 224/468 (47%), Gaps = 41/468 (8%)
Query: 43 ALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHK-----TRHND 97
A++ EHI + TLE S A++S + SS S ++LHK ND
Sbjct: 32 AVEANEHIKKY-VHTLEV---NSLLASDS--CDQSSKVIDKASSLQVLHKYGPCMQVLND 85
Query: 98 YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
RS V L +D RV+++ +L + H + LP SG + G+G
Sbjct: 86 -RSHV-EFLLQDQLRVDSIQARLS----KISGHGIFEEMVTKLPAQ------SGIAIGTG 133
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAA 216
Y +G+GTP F++V DTGS I W QC+PC CY Q + FDP S+SY+ + C++
Sbjct: 134 NYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSS 193
Query: 217 PQCKSLDVS--ACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
C L S C A+ CLYQ+ YGD S++ G TET++ +S GCG N
Sbjct: 194 ASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSDVFTNFLFGCGQSN 253
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
GLF +AGLLGL +SL Q +YCL S ++G L F A
Sbjct: 254 NGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPS-STGYLNFGGKVSQTAGFT 312
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
P+ + +FY + + G SV G + I PS+F G I+D GT ITRL AY
Sbjct: 313 PI--SPAFSSFYGIDIVGISVAGSQLPIDPSIFTTS-----GAIIDSGTVITRLPPTAYK 365
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
+L+++F N T+G L DTCYDFS +V P VS+ F G +D+ A L V
Sbjct: 366 ALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLV 425
Query: 450 DSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ C AFA S I GN QQ+ V +D A +GF C
Sbjct: 426 NGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 138/442 (31%), Positives = 217/442 (49%), Gaps = 49/442 (11%)
Query: 79 SFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ 138
S +L + RE L + D+ + L D+ RV +L +L++ E +E Q
Sbjct: 68 STTLEMKHRE-LCSGKTIDWGKKMRRALLLDNIRVQSL--QLRIKAMTSSTTEQSVSETQ 124
Query: 139 ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
I P+ SG + Y + +G + S+++DTGSD+ W+QC+PC CY Q
Sbjct: 125 I-------PLTSGIKLETLNYIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQG 175
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-------------CLYQVAYGDGSFTV 245
P++DP SSSY + C + C+ D+ A N C Y V+YGDGS+T
Sbjct: 176 PLYDPSVSSSYKTVFCNSSTCQ--DLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTR 233
Query: 246 GDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLA 302
GDL +E++ G++ ++ + GCG +N+GLF G++GL+GLG +SL Q T +
Sbjct: 234 GDLASESIVLGDT-KLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFS 292
Query: 303 YCLVDRDSPASGVLEFNS-----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
YCL + ASG L F + PL++N ++ +FY + LTG S+GG V++
Sbjct: 293 YCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGG--VEL 350
Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF 417
F GI++D GT ITRL Y +++ F++ G ++ DTC++
Sbjct: 351 KTLSFGR------GILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNL 404
Query: 418 SGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQ 473
+ + +PT+ + F L D+ Y + D A C A A S + + IIGN Q
Sbjct: 405 TSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQ 463
Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
Q+ RV +D R+G C
Sbjct: 464 QKNQRVIYDTTQERLGIAGENC 485
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/369 (34%), Positives = 192/369 (52%), Gaps = 20/369 (5%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F +PVVSG++ GSG+YF +GTPP++FS+++D+GSD+ W+QC PC +CY Q P++ P
Sbjct: 49 FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVP 108
Query: 204 KTSSSYSPLPCAAPQCKSLDVSA---C---RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
SS++SP+PC + C + + C C Y+ Y D S + G E+ + +
Sbjct: 109 SNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV-D 167
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSP--A 312
+ +A GCG DN+G F + G+LGLG G LS Q+ AYCLV+ P
Sbjct: 168 GVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSV 227
Query: 313 SGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
S L F + D P++ N K T YYV + +VGG+++ I S +E+D G+
Sbjct: 228 SSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGN 287
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
GG I D GT +T AY+ + +F + S V D C + +G+ P+ +
Sbjct: 288 GGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAES-VQGLDLCVELTGVDQPSFPSFT 346
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL---SIIGNVQQQGTRVSFDLANN 486
+ F G A+NY + V + C A A +S L + IGN+ QQ V +D N
Sbjct: 347 IEFDDGAVFQPEAENYFVDV-APNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREEN 405
Query: 487 RVGFTPNKC 495
+GF P KC
Sbjct: 406 LIGFAPAKC 414
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/386 (32%), Positives = 194/386 (50%), Gaps = 47/386 (12%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SGAS G+GEYF + VGTPP+ ++LDTGSD++W+QC PC +C++Q+ + PK SS+Y
Sbjct: 162 SGASLGTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTY 221
Query: 210 SPLPCAAPQCKSLDVS----ACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG---- 259
+ C P+C+ + S C+A C Y Y DGS T GD +ET + +
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGK 281
Query: 260 ----SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD--RDS 310
V + GCGH N+G F G++GLLGLG G +S QI++ S +YCL D ++
Sbjct: 282 EKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNT 341
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKV-------------DTFYYVGLTGFSVGGQAVQI 357
S L F + L+ N + +TFYY+ + VGG+ + I
Sbjct: 342 SVSSKLIFGEDK-------ELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDI 394
Query: 358 PPSLFEMDEA-----GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
+ GG I+D G+ +T AY+ ++++F + + + +
Sbjct: 395 SEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMS 454
Query: 413 TCYDFSG-LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF--APTSSALSII 469
CY+ SG + V +P +HF G + PA+NY + C A P S L+II
Sbjct: 455 PCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTII 514
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
GN+ QQ + +D+ +R+G++P +C
Sbjct: 515 GNLLQQNFHILYDVKRSRLGYSPRRC 540
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 125/375 (33%), Positives = 186/375 (49%), Gaps = 44/375 (11%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+ EY + VGTPPR ++ LDTGSD+ W QC PC +C+ Q P+ DP SS+Y+ LPC
Sbjct: 89 TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148
Query: 216 APQCKSLDVSAC---------RANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGS----- 260
AP+C++L ++C NR C Y YGD S TVG++ T+ +FG
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208
Query: 261 --VKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDR--------- 308
+ + GCGH N+G+F + G+ G G G SL Q+ T+ +YC
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFESKSSLVT 268
Query: 309 --DSPASGVLEFNSAR-GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+PA+ +L ++A G+ T PL++N + Y++ L G SVG + +P
Sbjct: 269 LGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVP------- 321
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDF---SGL 420
EA I+D G +IT L Y +++ F G L PT V + D C+ +
Sbjct: 322 EAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVG-LPPTGVVEGSALDLCFALPVTALW 380
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
R VP+++LH G +LP NY+ +A C ++IGN QQQ T V
Sbjct: 381 RRPPVPSLTLHLD-GADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQNTHVV 439
Query: 481 FDLANNRVGFTPNKC 495
+DL N+ + F P +C
Sbjct: 440 YDLENDWLSFAPARC 454
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 192/350 (54%), Gaps = 16/350 (4%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y I +GTPP + VLDTGSD+ W QC PC C+ Q P++ P S++Y+ + C +P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 218 QCKSLDVSACRANR----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C++L R + C Y +YGDG+ T G L TET + G+ +V+G+A GCG +N
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAV-TAPL 331
G S+GL+G+G G LSL Q+ T +YC ++ A+ L +SAR A T P
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLSSAAKTTPF 271
Query: 332 IRN-----KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
+ + ++ ++YY+ L G +VG + I P++F + GDGG+I+D GT T L+ +
Sbjct: 272 VPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEER 331
Query: 387 AYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
A+ +L + L SG L C+ + +V VP + LHF G ++L ++Y
Sbjct: 332 AFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY 389
Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++ SAG C ++ +S++G++QQQ T + +DL + F P KC
Sbjct: 390 VVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 136/366 (37%), Positives = 183/366 (50%), Gaps = 31/366 (8%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G EY + VGTPP+ S +LDTGSD+ W QC PC C Q DPIF P SSSY P+ C
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRC 159
Query: 215 AAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG-------IAL 266
A C + +C R + C Y+ +YGDG+ T G TE +F +S S +
Sbjct: 160 AGELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGF 219
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG-- 324
GCG N+G +G++G G LSL Q+ +YCL S L F S RGG
Sbjct: 220 GCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYASGRKSTLLFGSLRGGVY 279
Query: 325 DAVTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
DA TA L+R+++ TFYYV TG +VG + ++IP S F + G GG IVD GTA
Sbjct: 280 DAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTA 339
Query: 380 ITRLQTQAYNSLRDSF---VRLA----GNLKPTSGVALFDTCYDFSGLRSVR---VPTVS 429
+T + +F +RL G+ P GV C+ + R R VP +
Sbjct: 340 LTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGV-----CFAAAASRVPRPAVVPRMV 394
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
H G LDLP +NY++ G C A + + + IGN QQ RV +DL + +
Sbjct: 395 FHL-QGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLS 453
Query: 490 FTPNKC 495
F P +C
Sbjct: 454 FAPAQC 459
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 122/349 (34%), Positives = 181/349 (51%), Gaps = 12/349 (3%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +G PP F + DTGSD+ W QC+PC C+ Q P++DP SS++SPLPC++
Sbjct: 70 EYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSA 129
Query: 218 QCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNE 273
C + C ++ C Y+ AYGDG+++ G L TET++ G S SV G+A GCG DN
Sbjct: 130 TCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNG 189
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----RDSP-ASGVLEFNSARGGDAV 327
G + S G +GLG G LSL Q+ +YCL D DSP G L +
Sbjct: 190 GDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSALDSPFLLGTLAELAPGPSTVQ 249
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
+ PL+++ + + Y+V L G S+G + IP F++ G GG+IVD GT T L
Sbjct: 250 STPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTILAESG 309
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
+ + R+ G P + +L C+ +P + LHF G + L NY+
Sbjct: 310 FREVVGRVARVLGQ-PPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMS 368
Query: 448 PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +FC A T+ + S++GN QQQ ++ FD ++ F P C
Sbjct: 369 YNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDC 417
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 191/350 (54%), Gaps = 16/350 (4%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y I +GTPP + VLDTGSD+ W QC PC C+ Q P++ P S++Y+ + C +P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 218 QCKSLDVSACRANR----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C++L R + C Y +YGDG+ T G L TET + G+ +V+G+A GCG +N
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAV-TAPL 331
G S+GL+G+G G LSL Q+ T +YC ++ A+ L +SAR A T P
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLSSAAKTTPF 271
Query: 332 IRN-----KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
+ + ++ ++YY+ L G +VG + I P++F + GDGG+I+D GT T L+
Sbjct: 272 VPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEES 331
Query: 387 AYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
A+ +L + L SG L C+ + +V VP + LHF G ++L ++Y
Sbjct: 332 AFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY 389
Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++ SAG C ++ +S++G++QQQ T + +DL + F P KC
Sbjct: 390 VVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 150/421 (35%), Positives = 207/421 (49%), Gaps = 52/421 (12%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D +R N+ QL I N DR A A P+ SG + Y + I +
Sbjct: 141 LAADESRANSF----QLRIRN-DR----AAAASTQSGSAEVPLTSGIRFQTLNYVTTIAL 191
Query: 166 G-----TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC- 219
G +P ++++DTGSD+ W+QC+PC+ CY Q DP+FDP S++Y+ + C A C
Sbjct: 192 GGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACA 251
Query: 220 KSLDVS-----ACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
SL + +C RC Y +AYGDGSF+ G L T+TV+ G + S+ G GCG N
Sbjct: 252 ASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGA-SLDGFVFGCGLSN 310
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARGGDAV- 327
GLF G+AGL+GLG LSL Q +YCL ASG L GGDA
Sbjct: 311 RGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSL----GGDASS 366
Query: 328 ---TAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
T P+ + + FY++ +TG +VGG A+ G +++D GT
Sbjct: 367 YRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL-------AAQGLGASNVLIDSGTV 419
Query: 380 ITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
ITRL Y +R F R A PT+ G ++ DTCYD +G V+VP ++L G
Sbjct: 420 ITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAE 479
Query: 438 LDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
+ + A L V G+ C A A S IIGN QQ+ RV +D +R+GF
Sbjct: 480 VTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADED 539
Query: 495 C 495
C
Sbjct: 540 C 540
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 145/416 (34%), Positives = 197/416 (47%), Gaps = 33/416 (7%)
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
L+ + R AR L A ++ + PA +LP S G EY
Sbjct: 49 LIRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAG--VLPVRPS---------GDLEYV 97
Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
+ +GTPP+ S +LDTGSD+ W QC PC C Q DP+F P S+SY P+ CA C
Sbjct: 98 VDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCS 157
Query: 221 SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG------IALGCGHDNE 273
+ +C R + C Y+ YGDG+ TVG TE +F +SG + GCG N
Sbjct: 158 DILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNV 217
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAV--- 327
G +G++G G LSL Q+ +YCL S L F S GDA
Sbjct: 218 GSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRV 277
Query: 328 -TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
T PL+++ + TFYYV TG +VG + ++IP S F + G GG+IVD GTA+T L
Sbjct: 278 QTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAA 337
Query: 387 AYNSLRDSF---VRL--AGNLKPTSGVA-LFDTCYDFSGLRS-VRVPTVSLHFGAGKALD 439
+ +F +RL A P GV L + S S + VP + LHF G LD
Sbjct: 338 VLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLD 396
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NY++ G C A + S IGN+ QQ RV +DL + P +C
Sbjct: 397 LPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 136/358 (37%), Positives = 196/358 (54%), Gaps = 27/358 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP+ + LDTGSD+ W QC+PC C+ Q+ P FDP TSS+ S C +
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 218 QCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGH 270
C+ L V++C + + C+Y +YGD S T G L + +F G SV G+A GCG
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200
Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE-----FNSAR 322
N G+F + G+ G G G LSL Q+K + ++C V+ P++ +L+ + S R
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYKSGR 260
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G T PLI+N TFYY+ L G +VG + +P S F + G GG I+D GTA+T
Sbjct: 261 GAVQST-PLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKN-GTGGTIIDSGTAMTS 318
Query: 383 LQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSLHFGAGKALD 439
L T+ Y +RD+F A +K P D + S LR+ VP + LHF G +D
Sbjct: 319 LPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMD 374
Query: 440 LPAKNYLIPVDSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NY+ V+ AG+ C A ++ IGN QQQ V +DL N+++ F P +C
Sbjct: 375 LPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 142/433 (32%), Positives = 213/433 (49%), Gaps = 39/433 (9%)
Query: 89 ILHKTRHNDYRSLVLSRLER-------DSARVNTLITKLQLAIYNVDRHELKPAEAQILP 141
+L H + S SR E D+ARV++L + ++ Y + R A A L
Sbjct: 42 VLELRHHASFSSGGKSRAEEAHAVLASDAARVSSL--QRRIGSYGLIRSS-DAASASKLA 98
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
+ PV SGA + Y + +G+G + ++++DT S++ W+QC PC C+ Q +P+F
Sbjct: 99 Q---VPVTSGARLRTLNYVATVGIGG--GEATVIVDTASELTWVQCEPCDACHDQQEPLF 153
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVS------AC--RANRCLYQVAYGDGSFTVGDLVTETV 253
DP +S SY+ +PC + C +L V+ AC + C Y ++Y DGS++ G L + +
Sbjct: 154 DPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRL 213
Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDS 310
S ++G GCG N+G F G++GL+GLG LSL Q +YCL ++S
Sbjct: 214 SLAGE-DIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKES 272
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMD 365
+SG L + P++ V FY LTG +VGG+ VQ P
Sbjct: 273 GSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSP----GFS 328
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
G G IVD GT IT L Y ++R FV + ++ DTC+D +GLR V+V
Sbjct: 329 AGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQV 388
Query: 426 PTVSLHFGAGKALDLPAKNYLIPV-DSAGTFCFAFAPTSSALS--IIGNVQQQGTRVSFD 482
P++ L F G +++ +K L V A C A A S IIGN QQ+ RV FD
Sbjct: 389 PSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFD 448
Query: 483 LANNRVGFTPNKC 495
+++GF C
Sbjct: 449 TVGSQIGFAQETC 461
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 136/358 (37%), Positives = 196/358 (54%), Gaps = 27/358 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP+ + LDTGSD+ W QC+PC C+ Q+ P FDP TSS+ S C +
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 218 QCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGH 270
C+ L V++C + + C+Y +YGD S T G L + +F G SV G+A GCG
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200
Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE-----FNSAR 322
N G+F + G+ G G G LSL Q+K + ++C V+ P++ +L+ + S R
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYKSGR 260
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G T PLI+N TFYY+ L G +VG + +P S F + G GG I+D GTA+T
Sbjct: 261 GAVQST-PLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKN-GTGGTIIDSGTAMTS 318
Query: 383 LQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSLHFGAGKALD 439
L T+ Y +RD+F A +K P D + S LR+ VP + LHF G +D
Sbjct: 319 LPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMD 374
Query: 440 LPAKNYLIPVDSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP +NY+ V+ AG+ C A ++ IGN QQQ V +DL N+++ F P +C
Sbjct: 375 LPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 146/427 (34%), Positives = 218/427 (51%), Gaps = 30/427 (7%)
Query: 76 SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
SS ++PLH R T + + L RD R + K + +
Sbjct: 53 SSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYS---------GVNGS 103
Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
+ D + P G S + EY +G+G+P +M++DTGSD++W+QC+PC++C+
Sbjct: 104 AGDVEGSDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHS 163
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
Q+D +FDP +SS+YS C + C L C +++C Y V YGDGS G ++T++
Sbjct: 164 QADSLFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL 223
Query: 256 GNSGSVKGIALGCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDS 310
G+S +V+ GC G + +AGL+GLGGG SL Q T + +YCL
Sbjct: 224 GSS-TVENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG 282
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
+SG L ++ G V P++R+ +V ++Y V L VGG+ + IP S F
Sbjct: 283 -SSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFS------A 335
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G I+D GT ITRL AY++L +F P + +FDTC+DFSG SV +PTV+L
Sbjct: 336 GSIMDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVAL 395
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRV 488
F G +DL + ++ C AFA S ++L IIGNVQQ+ V +D+ V
Sbjct: 396 VFSGGAVVDLASDGIIL------GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAV 449
Query: 489 GFTPNKC 495
GF C
Sbjct: 450 GFKAGAC 456
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 130/368 (35%), Positives = 195/368 (52%), Gaps = 29/368 (7%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
SG Y I +G+PP++F+ ++DTGSD+ W+QC+PC++CY QSDPI+DP SS+++ C+
Sbjct: 1 SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60
Query: 216 APQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
C+SL S C A C+Y YGD S T GD ET++ G+S + GCG
Sbjct: 61 TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASG----VLEFNSAR 322
N G F G+AG++GLG G +SL+ Q+ + +YCLVD D +S + +++
Sbjct: 121 RLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-------------DEAGD 369
G A++ P+I N T+Y+VGL G SVGG+ + + + E
Sbjct: 181 GSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNS 240
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
GG I D GT +T L Y+ ++ +F + + FD CYD S ++ + P ++
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKFPALT 300
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTF-CFAF-APTSSALSIIGNVQQQGTRVSFDLANNR 487
L F G P KNY + VD+A T C A S L IIGN+ QQ V +D +
Sbjct: 301 LAF-KGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRGTST 359
Query: 488 VGFTPNKC 495
+ +P +C
Sbjct: 360 ISMSPAQC 367
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/375 (34%), Positives = 189/375 (50%), Gaps = 28/375 (7%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F TP+VSG + GSG+YF +GTP ++F +++DTGSD+ ++QC PC CY+Q P++ P
Sbjct: 19 FRTPLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQP 78
Query: 204 KTSSSYSPLPCAAPQCKSLDV---SACRANR--------CLYQVAYGDGSFTVGDLVTET 252
SS+++P+PC + +C + + C ++ C Y+ YGD S TVG ET
Sbjct: 79 SNSSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYET 138
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRD 309
+ G V +A GCG+ N+G FV + G+LGLG G LS T Q AYCL
Sbjct: 139 ATVGGI-RVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYL 197
Query: 310 SPASGVLEFNSARGGDAVTA--------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
SP S F+S GD + + PL+ N + YYV + GG+ + IP S
Sbjct: 198 SPTS---VFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSA 254
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
+++D G+GG I D GT +T QAY + +F + + C + SG+
Sbjct: 255 WKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGID 314
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVS 480
P+ ++ F G NY I V S C A +SS ++IGN+ QQ V
Sbjct: 315 HPIYPSFTIEFDQGATYRPNQGNYFIEV-SPNIDCLAMLESSSDGFNVIGNIIQQNYLVQ 373
Query: 481 FDLANNRVGFTPNKC 495
+D +R+GF C
Sbjct: 374 YDREEHRIGFAHANC 388
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 133/366 (36%), Positives = 178/366 (48%), Gaps = 20/366 (5%)
Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
VS G EY + +GTPP+ S +LDTGSD+ W QC PC C Q DP+F P S+S
Sbjct: 92 VSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESAS 151
Query: 209 YSPLPCAAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK----G 263
Y P+ CA C + C + C Y+ YGDG+ T+G TE +F +SG +
Sbjct: 152 YEPMRCAGQLCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP 211
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG 323
+ GCG N G +G++G G LSL Q+ +YCL S L F S G
Sbjct: 212 LGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYGSGRKSTLLFGSLSG 271
Query: 324 ---GDAV----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
GDA T PL+++ + TFYYV L G +VG + ++IP S F + G GG+IVD
Sbjct: 272 GVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDS 331
Query: 377 GTAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALF--DTCYDFSGLRSVRVPTVS 429
GTA+T L + +F +RL A P GV S V VP +
Sbjct: 332 GTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMV 391
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
HF LDLP +NY++ G C A + S IGN+ QQ RV +DL +
Sbjct: 392 FHFQDAD-LDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLS 450
Query: 490 FTPNKC 495
F P +C
Sbjct: 451 FAPAQC 456
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/361 (34%), Positives = 181/361 (50%), Gaps = 20/361 (5%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKT 205
P +G + + E+ +G GTP + +++LDTGSD++W+QC+PC+ CY+Q DP FDP
Sbjct: 125 PDHTGTNLDTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAK 184
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
SSSY+ +PC P C + C CLY V YGDGS T G L +T++F +S G
Sbjct: 185 SSSYAAVPCGTPVCAAAG-GMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTGFT 243
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDRDSPASGVLEFNSA 321
GCG N G F G L G A S +YCL ++ G L +
Sbjct: 244 FGCGEKNIGDF-GEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNT-TPGYLNIGAT 301
Query: 322 RGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
+ V +I+ + +FY++ L ++GG + +PPS+F G ++D GT
Sbjct: 302 KPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT-----GTLLDSGT 356
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
+T L AY SLRD F KP DTCYDF+G ++ +P VS +F G
Sbjct: 357 ILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVF 416
Query: 439 DLPAKNYLIPVDSAGTF--CFAFA--PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
DL +I D A C AF P + SI+GN QQ+ V +D+ + ++GF P
Sbjct: 417 DLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPIS 476
Query: 495 C 495
C
Sbjct: 477 C 477
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 136/402 (33%), Positives = 202/402 (50%), Gaps = 34/402 (8%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L RD RV ++ K H + + + E T V + + G Y +G+
Sbjct: 92 LRRDQLRVKSIRAK----------HSMNSSTTGVFNE-MKTRVPT--THFGGGYAVTVGL 138
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
GTP + FS++ DTGSD+ W QC PC+ C+ Q+D FDP S+SY L C++ CKS+
Sbjct: 139 GTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGK 198
Query: 225 SACR----ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA 280
+ + +N CLY V YG G +TVG L TET++ S + +GCG N G F G+A
Sbjct: 199 ESAQGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTA 257
Query: 281 GLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
GLLGLG ++L Q +T +YCL S ++G L F A P+ K+
Sbjct: 258 GLLGLGRSPVALPSQTSSTYKNLFSYCL-PASSSSTGHLSFGGGVSQAAKFTPI--TSKI 314
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
Y + ++G SVGG+ + I PS+F G I+D GT +T L + A+++L +F
Sbjct: 315 PELYGLDVSGISVGGRKLPIDPSVFRT-----AGTIIDSGTTLTYLPSTAHSALSSAFQE 369
Query: 398 LAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
+ N T G + CYDFS ++ +P +S+ F G +D+ I +
Sbjct: 370 MMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEV 429
Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AF + ++I GNVQQ+ V +D+A VGF P C
Sbjct: 430 CLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/369 (34%), Positives = 184/369 (49%), Gaps = 21/369 (5%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
P ++ + G EY + VGTPP+ + +LDTGSD+ W QC CT C +Q DP+F P+ S
Sbjct: 86 PGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMS 145
Query: 207 SSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGI 264
SSY P+ CA C + +C R + C Y+ +YGDG+ T+G TE +F +SG + +
Sbjct: 146 SSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSV 205
Query: 265 AL--GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
L GCG N G ++G++G G LSL Q+ +YCL S L+F S
Sbjct: 206 PLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYASSRKSTLQFGSLA 265
Query: 323 G--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
G T P++++ + TFYYV TG +VG + ++IP S F + G GG+I+
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325
Query: 375 DCGTAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALFDTCYDFSGLRSVR---VP 426
D GTA+T + +F +RL A P GV G R R VP
Sbjct: 326 DSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
+ HF G LDLP +NY++ G C + + IGN QQ RV +DL
Sbjct: 386 RMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERE 444
Query: 487 RVGFTPNKC 495
+ F P +C
Sbjct: 445 TLSFAPVEC 453
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 129/369 (34%), Positives = 184/369 (49%), Gaps = 21/369 (5%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
P ++ + G EY + VGTPP+ + +LDTGSD+ W QC CT C +Q DP+F P+ S
Sbjct: 86 PGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMS 145
Query: 207 SSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGI 264
SSY P+ CA C + +C R + C Y+ +YGDG+ T+G TE +F +SG + +
Sbjct: 146 SSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSV 205
Query: 265 AL--GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
L GCG N G ++G++G G LSL Q+ +YCL S L+F S
Sbjct: 206 PLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYASSRKSTLQFGSLA 265
Query: 323 G--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
G T P++++ + TFYYV TG +VG + ++IP S F + G GG+I+
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325
Query: 375 DCGTAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALFDTCYDFSGLRSVR---VP 426
D GTA+T + +F +RL A P GV G R R VP
Sbjct: 326 DSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
+ HF G LDLP +NY++ G C + + IGN QQ RV +DL
Sbjct: 386 RMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERE 444
Query: 487 RVGFTPNKC 495
+ F P +C
Sbjct: 445 TLSFAPVEC 453
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 131/329 (39%), Positives = 179/329 (54%), Gaps = 62/329 (18%)
Query: 64 ESETAAESFPLNSSS-SFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQL 122
E+ET + P++ + + ++ L R++L + +L RL+RD+ RV L
Sbjct: 84 ETETQISTLPVSETDPTMTMHLEHRDVLAFNATPE--ALFNLRLQRDAFRVEALSKMAAA 141
Query: 123 AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDI 182
A A+ FS+ V SG +QGSGEYF+R+GVGTPP+ MVLDTGSD+
Sbjct: 142 AGGRRAGRNGTHAQGG----GFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDV 197
Query: 183 NWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDG 241
W+QC PC +CY Q+DP+FDPK S S+S + C +P C LD C + + CLYQVAYGDG
Sbjct: 198 VWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDG 257
Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL 301
SFT G+ TET++F + V +ALGCGHDNEGLFVG+AGLLGLG +Q +
Sbjct: 258 SFTFGEFSTETLTFRGT-RVPKVALGCGHDNEGLFVGAAGLLGLG-------RQPRL--- 306
Query: 302 AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
+ P G AR +TA L K+DT
Sbjct: 307 -------NRPPVG-----GARVA-GITASLF---KLDT---------------------- 328
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
AG+GG+I+D GT++TRL +AY +
Sbjct: 329 -----AGNGGVIIDSGTSVTRLTRRAYGT 352
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 135/402 (33%), Positives = 199/402 (49%), Gaps = 29/402 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L+RD R + K + +L+ ++ S P G+S + EY +G+
Sbjct: 79 LKRDQLRAEHIQRKFAMNAAVDGAGDLQQSKV-----SSSVPTKLGSSLDTLEYVISVGL 133
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
GTP ++ +DTGSD++W+QC PC C+ Q+ +FDP SS+Y + CAA +C L+
Sbjct: 134 GTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLE 193
Query: 224 V--SACRAN--RCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVG 278
+ C A C Y V YGDGS T G +T++ G S +VKG GC H G
Sbjct: 194 QQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQ 253
Query: 279 SAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
+ GL+GLGGG SL Q A S +YCL + + VT ++R+K
Sbjct: 254 TDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSK 313
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
++ TFY L +VGG+ + + PS+F G +VD GT ITRL AY++L +F
Sbjct: 314 QIPTFYGARLQDIAVGGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAF 367
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
+ ++ DTC+DF+G + +PTV+L F G A+DL +
Sbjct: 368 KAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GN 421
Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA T IIGNVQQ+ V +D+ ++ +GF C
Sbjct: 422 CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 193/383 (50%), Gaps = 27/383 (7%)
Query: 133 KPAEAQILPEDFSTPVVSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
K ++L + PV GA EY + +GTPP+ + LDTGSD+ W QC+P
Sbjct: 62 KARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSDLVWTQCQP 121
Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC---RANRCLYQVAYGDGSFT 244
C C+ QS P +D SS+++ C + QCK LD V+ C C + +YGD S T
Sbjct: 122 CAVCFNQSLPYYDASRSSTFALPSCDSTQCK-LDPSVTMCVNQTVQTCAFSYSYGDKSAT 180
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAY 303
+G L ETVSF SV G+ GCG +N G+F + G+ G G G LSL Q+K + ++
Sbjct: 181 IGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSH 240
Query: 304 CL--VDRDSPASGVLE-----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
C V P++ + + + + R G T PLI+N TFYY+ L G +VG +
Sbjct: 241 CFTAVSGRKPSTVLFDLPADLYKNGR-GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLP 299
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
+P S F + G GG I+D GTA T L + Y + D F V+L +G L
Sbjct: 300 VPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL--- 355
Query: 414 CYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
C+ L ++ VP + LHF G + LP +NY+ G A ++IIGN
Sbjct: 356 CFSAPPLGKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNF 414
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQQ V +DL N+++ F KC
Sbjct: 415 QQQNMHVLYDLKNSKLSFVRAKC 437
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 192/369 (52%), Gaps = 31/369 (8%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G EY + +GTPP F + DTGSD+ W QC+PC C+ Q PI+D S+S+SP+PC
Sbjct: 91 GQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPC 150
Query: 215 AAPQC-----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--------SV 261
A+ C S + +A + C Y+ AY DG+++ G L TET++F S SV
Sbjct: 151 ASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSV 210
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV------ 315
G+A GCG DN GL S G +GLG G LSL Q+ +YCL D + + G
Sbjct: 211 GGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGS 270
Query: 316 ---LEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
L S GG AV + PL++ + YYV L G S+G + IP F++ + G GG
Sbjct: 271 LAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGG 330
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNL-KPTSGVALFDT-CYDFS-GLRSV-RVPT 427
+IVD GT T L A+ + + +AG L +P + D+ C+ + G + + +P
Sbjct: 331 MIVDSGTIFTVLVESAFRVVVN---HVAGVLNQPVVNASSLDSPCFPATAGEQQLPDMPD 387
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANN 486
+ LHF G + L NY+ + +FC A SA SI+GN QQQ ++ FD+
Sbjct: 388 MLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDITVG 447
Query: 487 RVGFTPNKC 495
++ F P C
Sbjct: 448 QLSFVPTDC 456
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 147/401 (36%), Positives = 216/401 (53%), Gaps = 25/401 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L +D +RV ++ ++L + + + ++K ++ +P G++ GSG Y +G+
Sbjct: 103 LLQDQSRVKSIHSRLSNSKTSGGK-DVKVTDSTTIPAK------DGSTVGSGNYIVTVGL 155
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
GTP + S++ DTGSDI W QC+PC CY+Q + IFDP S+SY+ + C++ C SL
Sbjct: 156 GTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTS 215
Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
+ C ++ C+Y + YGD SF+VG TE ++ ++ + I GCG +N+GLF GS
Sbjct: 216 ATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGS 275
Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
AGLLGLG LS+ T Q +YCL S ++G L F + +A PL
Sbjct: 276 AGLLGLGRDKLSVVSQTAQKYNKIFSYCL-PSSSSSTGFLTFGGSASKNAKFTPLSTISA 334
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+FY + TG SVGG+ + I S+F G I+D GT ITRL AY++LR SF
Sbjct: 335 GPSFYGLDFTGISVGGKKLAISASVFST-----AGAIIDSGTVITRLPPAAYSALRASFR 389
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
L T +++ DTCYDFS ++ VP + F +G +D+ A L S C
Sbjct: 390 NLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGIEVDIDATGILY-ASSLSQVC 448
Query: 457 FAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFA S A I GNVQQ+ V +D + +VGF P C
Sbjct: 449 LAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGC 489
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 186/356 (52%), Gaps = 24/356 (6%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+GEY + +GTPP ++DTGSD+ W QCRPCT CY+Q P+FDPK SS+Y C
Sbjct: 89 AGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCG 148
Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
C +L D S + +C ++ +Y DGSFT G+L +ET++ G S G A GCG
Sbjct: 149 TSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCG 208
Query: 270 HDNEGLF-VGSAGLLGLGGGMLSLTKQIKATS---LAYCL--VDRDSPASGVLEFNSA-- 321
H + G+F S+G++GLGGG LSL Q+K+T +YCL V DS S + F ++
Sbjct: 209 HSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGR 268
Query: 322 -RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF-EMDEAGDGGIIVDCGTA 379
G V+ PL++ K DTFYY+ L G SVG + ++P + + E +G IIVD GT
Sbjct: 269 VSGYGTVSTPLVQ-KSPDTFYYLTLEGISVGKK--RLPYKGYSKKTEVEEGNIIVDSGTT 325
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
T L + Y+ L S + +F CY+ + + P ++ HF
Sbjct: 326 YTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAPIITAHFKDANVEL 383
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P ++ + CF APTS + ++GN+ Q V FDL RV F C
Sbjct: 384 QPLNTFMRMQEDL--VCFTVAPTSD-IGVLGNLAQVNFLVGFDLRKKRVSFKAADC 436
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 137/375 (36%), Positives = 195/375 (52%), Gaps = 29/375 (7%)
Query: 145 STPVVSGASQG---SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
S PV GA + EY + +GTPP+ + LDTGSD+ W QC+PC C+ Q P F
Sbjct: 18 SAPVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYF 77
Query: 202 DPKTSSSYSPLPCAAPQCKSLD--VSAC-RANR----CLYQVAYGDGSFTVGDLVTETVS 254
D SS+ + LPC + QCK LD V+ C + N+ C Y +YGD S T+G L + +
Sbjct: 78 DTSRSSTNALLPCESTQCK-LDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFT 136
Query: 255 FGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSP 311
F S+ G+ GCG +N G+F G+ G G G LSL Q+K + ++C + P
Sbjct: 137 FVAGTSLPGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIP 196
Query: 312 ASGVLEFNS---ARGGDAV-TAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
++ +L+ + + G AV T PLI +N+ T YY+ L G +VG + +P S F +
Sbjct: 197 STVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL 256
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
G GG I+D GT+IT L Q Y +RD F ++ + P + + TC+
Sbjct: 257 TN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKP 314
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGTRVS 480
VP + LHF G +DLP +NY+ V D AG C A +IIGN QQQ V
Sbjct: 315 DVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVL 372
Query: 481 FDLANNRVGFTPNKC 495
+DL NN + F +C
Sbjct: 373 YDLQNNMLSFVAAQC 387
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 133/388 (34%), Positives = 205/388 (52%), Gaps = 40/388 (10%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F++PVV+ Q EY+ + VGTP + +++DTGSD++W+QC PC +C P F+P
Sbjct: 125 FTSPVVT-LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNP 183
Query: 204 KTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVGDLVTETVS---- 254
+ SSS+ LPCA+ C ++ + CL+ + YGDGS + G L ET++
Sbjct: 184 RHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTP 243
Query: 255 -FGNSGSVK--GIALGCGH-DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD 307
FG+ VK I LGC D EGL G++GLLG+ +S Q+ A ++C D
Sbjct: 244 NFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPD 303
Query: 308 RDSP--ASGVLEFNSARGGDAVT-----APLIRNKKVDT----FYYVGLTGFSVGGQAVQ 356
+ + +SG++ F + D ++ PL++N V + +YYVGL G SV +
Sbjct: 304 KIAHLNSSGLVFFGES---DIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLP 360
Query: 357 IPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
+ F++D+ G GG I+D GTA T L+ A+ ++R F+ +L + F CY
Sbjct: 361 LSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 420
Query: 416 DFS----GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFAPTSS-ALS 467
+ + L S +P+++LHF G + LP + LIPV S+ T C AF + +
Sbjct: 421 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFN 480
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGN QQQ V +DL R+G P +C
Sbjct: 481 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 508
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 138/389 (35%), Positives = 196/389 (50%), Gaps = 38/389 (9%)
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-P 199
P+ +PVVSGAS GSG+YF + +GTPP++ +V DTGSD+ W++C C C + +
Sbjct: 71 PQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGS 130
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSA---CRANR----CLYQVAYGDGSFTVGDLVTET 252
F + S+++SP C C+ + + C R C Y+ +YGDGS T G ET
Sbjct: 131 AFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKET 190
Query: 253 VSF----GNSGSVKGIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---AT 299
+ G +KGIA GC G F G+ G++GLG G +SL+ Q+
Sbjct: 191 TTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGN 250
Query: 300 SLAYCLVDRD---SPASGVL----EFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVG 351
+YCL+D D SP S +L + + A G + PL N TFYY+G+ SV
Sbjct: 251 KFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVD 310
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGV 408
G + I PS++ +DE G+GG IVD GT +T L AY + VRL +PT G
Sbjct: 311 GIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPG- 369
Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSAL 466
FD C + S + R+P +S G P +NY + D C A T S
Sbjct: 370 --FDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDE-DVKCLALQAVMTPSGF 426
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S+IGN+ QQG + FD R+GF+ + C
Sbjct: 427 SVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 192/395 (48%), Gaps = 26/395 (6%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L RD RV+++I + +K S P + + +Y +G+
Sbjct: 89 LRRDKLRVDSIIQARRSMNLTSSVEHMKS----------SVPFYGLSKITASDYIVNVGI 138
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
GTP ++ ++ DTGS + W QC+PC CY + P+FDP S+S+ LPC++ C+S+
Sbjct: 139 GTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSSKLCQSIR-Q 196
Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGCGHDNEGLFVGSAGLLG 284
C + +C Y AY D S + G L TET+SF + K I +GC G +G +G++G
Sbjct: 197 GCSSPKCTYLTAYVDNSSSTGTLATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMG 256
Query: 285 LGGGMLSLTKQ---IKATSLAYCLVDRDSPAS-GVLEFNSARGGDAVTAPLIRNKKVDTF 340
L +SL Q I +YC+ +P S G L F D +P+ + +
Sbjct: 257 LNRSPISLASQTANIYDKLFSYCI--PSTPGSTGHLTFGGKVPNDVRFSPVSKTAP-SSD 313
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
Y + +TG SVGG+ + I S F++ +D G +TRL +AY++LR F +
Sbjct: 314 YDIKMTGISVGGRKLLIDASAFKIAST------IDSGAVLTRLPPKAYSALRSVFREMMK 367
Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
DTCYDFS +V +P++S+ F G +D+ + V + +C AFA
Sbjct: 368 GYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFA 427
Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+SI GN QQ+ V FD A R+GF P C
Sbjct: 428 ELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 139/420 (33%), Positives = 198/420 (47%), Gaps = 31/420 (7%)
Query: 99 RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
R L+ ++R AR L +A R K A+ E P V G E
Sbjct: 50 RELIRRAMQRSKARA----AALSVARSGSGRVPGKSAQQG---EQHQQPGVPVRPSGDLE 102
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y + +GTPP+ S +LDTGSD+ W QC PC C Q DP+F P SSSY P+ C+
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQL 162
Query: 219 CKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK---GIALGCGHDNEG 274
C + +C R + C Y+ YGDG+ T+G TE +F +S K + GCG N G
Sbjct: 163 CNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMNVG 222
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP----------ASGVLEFNSARGG 324
+G++G G LSL Q+ +YCL S + GV E + A G
Sbjct: 223 SLNNGSGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTRKSTLMFGSLSDGVFEGDDAATG 282
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
T L+++++ TFYYV TG +VG + ++IP S F + G GG+IVD GTA+T
Sbjct: 283 QVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFP 342
Query: 385 TQAYNSLRDSF---VRL--AGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAG 435
+ +F +RL + P GV + S V VP ++ HF G
Sbjct: 343 AAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAFHF-QG 401
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+LP +NY++ G+ C A + + + IGN QQ RV +DL + F P +C
Sbjct: 402 ADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 131/388 (33%), Positives = 193/388 (49%), Gaps = 38/388 (9%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-PI 200
F +PV+SGAS GSG+YF + +GTPP+ +V DTGSD+ W++C PC C +S
Sbjct: 69 NSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSA 128
Query: 201 FDPKTSSSYSPLPCAAPQCKSL---DVSACRANR----CLYQVAYGDGSFTVGDLVTETV 253
F + S++YS + C +PQC+ + + C R C YQ Y D S T G E +
Sbjct: 129 FFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEAL 188
Query: 254 SFGNS-GSVK---GIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---ATS 300
+ S G VK G++ GCG G F G+ G++GLG +S + Q+ +
Sbjct: 189 TLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSK 248
Query: 301 LAYCLVDR--DSPASGVLEFNSA------RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
+YCL+D P + L A + G PL+ N TFYY+ + G V G
Sbjct: 249 FSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNG 308
Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVA 409
+ I PS++ +D+ G+GG I+D GT +T + AY + +F V+L +PT G
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPG-- 366
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALS 467
FD C + SG+ +P +S + G P +NY I C A P S S
Sbjct: 367 -FDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQ-IKCLAVQPVSQDGGFS 424
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++GN+ QQG + FD +R+GFT C
Sbjct: 425 VLGNLMQQGFLLEFDRDKSRLGFTRRGC 452
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 155/439 (35%), Positives = 224/439 (51%), Gaps = 32/439 (7%)
Query: 81 SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
++PLH R N + RL RD R + KL Q
Sbjct: 63 TVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDVVVQ-Q 121
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPP-RQFSMVLDTGSDINWLQCRPC-TECYQQSD 198
+ P G S + EY + +G+PP + +M++DTGSDI+W++C+PC +C Q D
Sbjct: 122 SHAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVD 181
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSL----DVSACRAN-RCLYQVAYGDGSF-TVGDLVTET 252
P+FDP SS+YSP C++ C L + + C ++ +C Y YGDGS T G ++T
Sbjct: 182 PLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDT 241
Query: 253 VSFG---NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA----TSLAYCL 305
++ G N+ V GC H G+ +AGL+GLGGG SL Q T+ +YCL
Sbjct: 242 LALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCL 301
Query: 306 VDRDSPASGVLEFNSARGGDA--VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
S +SG L +A A V P++R+ +V FY V L VGG+ + IP ++F
Sbjct: 302 PPTPS-SSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFS 360
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP---TSGVALFDTCYDFSGL 420
G+I+D GT +TRL AY+SL +F P ++G DTC+D SG
Sbjct: 361 ------AGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQ 414
Query: 421 RSVRVPTVSLHF-GAGKA-LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQG 476
SV +PTV+L F GAG A ++L A L+ ++++ FC AF TS + IIGNVQQ+
Sbjct: 415 SSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRT 474
Query: 477 TRVSFDLANNRVGFTPNKC 495
+V +D+A VGF C
Sbjct: 475 FQVLYDVAGGAVGFKAGAC 493
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 192/383 (50%), Gaps = 27/383 (7%)
Query: 133 KPAEAQILPEDFSTPVVSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
K ++L + PV GA EY + +GTPP+ + LDTGS + W QC+P
Sbjct: 6 KARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQP 65
Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC---RANRCLYQVAYGDGSFT 244
C C+ QS P +D SS+++ C + QCK LD V+ C C Y +YGD S T
Sbjct: 66 CAVCFNQSLPYYDASRSSTFALPSCDSTQCK-LDPSVTMCVNQTVQTCAYSYSYGDKSAT 124
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAY 303
+G L ETVSF SV G+ GCG +N G+F + G+ G G G LSL Q+K + ++
Sbjct: 125 IGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSH 184
Query: 304 CL--VDRDSPASGVLE-----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
C V P++ + + + + R G T PLI+N TFYY+ L G +VG +
Sbjct: 185 CFTAVSGRKPSTVLFDLPADLYKNGR-GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLP 243
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
+P S F + G GG I+D GTA T L + Y + D F V+L +G L
Sbjct: 244 VPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL--- 299
Query: 414 CYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
C+ L ++ VP + LHF G + LP +NY+ G A ++IIGN
Sbjct: 300 CFSAPPLGKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNF 358
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQQ V +DL N+++ F KC
Sbjct: 359 QQQNMHVLYDLKNSKLSFVRAKC 381
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 192/383 (50%), Gaps = 27/383 (7%)
Query: 133 KPAEAQILPEDFSTPVVSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
K ++L + PV GA EY + +GTPP+ + LDTGS + W QC+P
Sbjct: 62 KARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQP 121
Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC---RANRCLYQVAYGDGSFT 244
C C+ QS P +D SS+++ C + QCK LD V+ C C Y +YGD S T
Sbjct: 122 CAVCFNQSLPYYDASRSSTFALPSCDSTQCK-LDPSVTMCVNQTVQTCAYSYSYGDKSAT 180
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAY 303
+G L ETVSF SV G+ GCG +N G+F + G+ G G G LSL Q+K + ++
Sbjct: 181 IGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSH 240
Query: 304 CL--VDRDSPASGVLE-----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
C V P++ + + + + R G T PLI+N TFYY+ L G +VG +
Sbjct: 241 CFTAVSGRKPSTVLFDLPADLYKNGR-GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLP 299
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
+P S F + G GG I+D GTA T L + Y + D F V+L +G L
Sbjct: 300 VPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL--- 355
Query: 414 CYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
C+ L ++ VP + LHF G + LP +NY+ G A ++IIGN
Sbjct: 356 CFSAPPLGKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNF 414
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQQ V +DL N+++ F KC
Sbjct: 415 QQQNMHVLYDLKNSKLSFVRAKC 437
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/336 (35%), Positives = 172/336 (51%), Gaps = 19/336 (5%)
Query: 176 LDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQ 235
+DTGSD+ W QC PC C Q P FD K S++Y LPC + +C SL +C C+YQ
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVK----GIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
YGD + T G L ET +FG + S K IA GCG N G S+G++G G G LS
Sbjct: 61 YYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLS 120
Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGGDAVTAPLIRNKKVDTFYY 342
L Q+ + +YCL S L F N++ G + P + N + Y+
Sbjct: 121 LVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYF 180
Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL 402
+ L S+G + + I P +F +++ G GG+I+D GT+IT LQ AY ++R V A L
Sbjct: 181 LSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPL 239
Query: 403 KPTSGVAL-FDTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
+ + DTC+ + +V VP + HF + LP +NY++ + G C
Sbjct: 240 PAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGYLCLVM 298
Query: 460 APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
APT +IIGN QQQ + +D+ N+ + F P C
Sbjct: 299 APTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 145/429 (33%), Positives = 220/429 (51%), Gaps = 32/429 (7%)
Query: 77 SSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE 136
S+ ++PLH R + + RL RD R + K A +++ ++
Sbjct: 52 STGVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGA------GDIEQSD 105
Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
A +P G S + EY +G+G+P +M +DTGSD++W+QC+PC++C+ +
Sbjct: 106 AATVPTTL------GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSE 159
Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVS----ACRANRCLYQVAYGDGSFTVGDLVTET 252
D +FDP +SS+YSP C++ C L S C +++C Y V YGD S T G ++T
Sbjct: 160 VDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDT 219
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
++ G+S ++ GC G F GL+GLGGG SL Q T+ +YCL
Sbjct: 220 LTLGSS-AMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCL-PP 277
Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
S +SG L + G V P++R+ ++ T+Y V L VG Q + +P S+F
Sbjct: 278 TSGSSGFLTLGTGSSG-FVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFS----- 331
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G ++D GT ITRL AY++L +F P + + DTC+DFSG S+ +PTV
Sbjct: 332 -AGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTV 390
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
+L F G A+DL ++ + S+ C AF P S+L IIGNVQQ+ V +D+
Sbjct: 391 TLVFSGGAAVDLAFDGIMLEISSS-IRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGG 449
Query: 487 RVGFTPNKC 495
VGF C
Sbjct: 450 AVGFKAGAC 458
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 144/375 (38%), Positives = 199/375 (53%), Gaps = 33/375 (8%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDP 199
D S P GA+ S EY +G+GTP Q ++++DTGSD++W+QC+PC + CY Q DP
Sbjct: 110 SDVSIPTSLGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDP 169
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSA----CR----ANRCLYQVAYGDGSFTVGDLVTE 251
++DP SS+Y+P+PC + CK L A C + C Y + YG+ TVG TE
Sbjct: 170 LYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTE 229
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
T++ SVK GCG +G F GLLGLGG SL Q T + +YCL
Sbjct: 230 TLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPG 289
Query: 309 DSPASGVLEFNSARGGDAVTA----PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
+S +G L + + PL + TFY V LTG SVGG+ + IPP++
Sbjct: 290 NS-TTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLS- 347
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRS 422
GG+I+D GT IT L AY++LR +F A L P + + DTCY+F+G+ +
Sbjct: 348 -----GGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIAN 402
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVS 480
V VPTV+L F G +DL + ++ D C AFA +S + IIGNV Q+ V
Sbjct: 403 VTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFAGGASDGDVGIIGNVNQRTFEVL 457
Query: 481 FDLANNRVGFTPNKC 495
+D VGF P C
Sbjct: 458 YDSGRGHVGFRPGAC 472
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 132/388 (34%), Positives = 205/388 (52%), Gaps = 40/388 (10%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
F++PVV+ Q EY+ + +GTP + +++DTGSD++W+QC PC +C P F+P
Sbjct: 124 FTSPVVT-LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNP 182
Query: 204 KTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVGDLVTETVS---- 254
+ SSS+ LPCA+ C ++ + CL+ + YGDGS + G L ET++
Sbjct: 183 RHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTP 242
Query: 255 -FGNSGSVK--GIALGCGH-DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD 307
FG+ VK I LGC D EGL G++GLLG+ +S Q+ A ++C D
Sbjct: 243 NFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPD 302
Query: 308 RDSP--ASGVLEFNSARGGDAVT-----APLIRNKKVDT----FYYVGLTGFSVGGQAVQ 356
+ + +SG++ F + D ++ PL++N V + +YYVGL G SV +
Sbjct: 303 KIAHLNSSGLVFFGES---DIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLP 359
Query: 357 IPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
+ F++D+ G GG I+D GTA T L+ A+ ++R F+ +L + F CY
Sbjct: 360 LSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 419
Query: 416 DFS----GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFAPTSS-ALS 467
+ + L S +P+++LHF G + LP + LIPV S+ T C AF + +
Sbjct: 420 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFN 479
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGN QQQ V +DL R+G P +C
Sbjct: 480 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 507
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 135/402 (33%), Positives = 199/402 (49%), Gaps = 29/402 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L+RD R + K + +L+ ++ S P G+S + EY +G+
Sbjct: 79 LKRDQLRAEHIQRKFAMNAAVDGAGDLQQSKV-----SSSVPTKLGSSLDTLEYVISVGL 133
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
GTP ++ +DTGSD++W+QC PC CY Q+ +FDP SS+Y + CAA +C L+
Sbjct: 134 GTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLE 193
Query: 224 V--SACRAN--RCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVG 278
+ C A C Y V YGDGS T G +T++ G S +VKG GC H G
Sbjct: 194 QQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQ 253
Query: 279 SAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
+ GL+GLGGG SL Q A S +YCL + + VT ++R++
Sbjct: 254 TDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSR 313
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
++ TFY L +VGG+ + + PS+F G +VD GT ITRL AY++L +F
Sbjct: 314 QIPTFYGARLQDIAVGGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAF 367
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
+ ++ DTC+DF+G + +PTV+L F G A+DL +
Sbjct: 368 KAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GN 421
Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA T IIGNVQQ+ V +D+ ++ +GF C
Sbjct: 422 CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 133/384 (34%), Positives = 191/384 (49%), Gaps = 46/384 (11%)
Query: 147 PVVSGASQGSGEYFSRIGVG----TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
P+ SG + Y + I +G +P ++++DTGSD+ W+QC+PC+ CY Q DP+FD
Sbjct: 132 PLTSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFD 191
Query: 203 PKTSSSYSPLPCAAPQCK-----------SLDVSACRANRCLYQVAYGDGSFTVGDLVTE 251
P S++Y+ + C A C S + + +C Y +AYGDGSF+ G L T+
Sbjct: 192 PAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATD 251
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDR 308
TV+ G + S+ G GCG N GLF G+AGL+GLG LSL Q + +YCL
Sbjct: 252 TVALGGA-SLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAA 310
Query: 309 DS-PASGVLEFNSARGGDAV------TAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQ 356
S ASG L GGD T P+ + + FY++ +TG +VGG A+
Sbjct: 311 TSGDASGSLSLG---GGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA 367
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTC 414
G +++D GT ITRL Y ++R F+R A G ++ DTC
Sbjct: 368 -------AQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTC 420
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGN 471
YD +G V+VP ++L G + + A L V G+ C A A S IIGN
Sbjct: 421 YDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGN 480
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
QQ+ RV +D +R+GF C
Sbjct: 481 YQQKNKRVVYDTLGSRLGFADEDC 504
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 180/353 (50%), Gaps = 23/353 (6%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
GSGEY + +GTPP + + DTGSD+ W QC PC +CYQQ PIF+P S+S+S +PC
Sbjct: 88 GSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPC 147
Query: 215 AAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C ++D C C Y YGD +++ GDL E ++ G+S SVK + +GCGH +
Sbjct: 148 NTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSS-SVKSV-IGCGHASS 205
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS---ARGGD 325
G F ++G++GLGGG LSL Q+ TS +YCL S A+G + F G
Sbjct: 206 GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPG 265
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
V+ PLI V T+YY+ L S+G + M A G +I+D GT +T L
Sbjct: 266 VVSTPLISKNTV-TYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTTLTILPK 316
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALD-LPA 442
+ Y+ + S +++ + D C+D + S+ +P ++ HF G ++ LP
Sbjct: 317 ELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNLLPI 376
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ D+ A ++ IIGN+ Q + +DL R+ F P C
Sbjct: 377 NTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 429
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 201 bits (510), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 134/401 (33%), Positives = 192/401 (47%), Gaps = 71/401 (17%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L +D +RV ++ ++L + LK ++A + P S ++ GSG Y +G+
Sbjct: 45 LAQDESRVASIQSRLAKNL--AGGSNLKASKATL-------PSKSASTLGSGNYVVTVGL 95
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
G+P R + + DTGSD+ W QC PC CYQQ + IFDP TS SYS + C +P C+ L+
Sbjct: 96 GSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLES 155
Query: 225 S-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
+ C ++ CLY + YGDGS+++G E +S ++ GCG +N GLF G+
Sbjct: 156 ATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGT 215
Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
AGLLGL LSL T Q +YCL S ++G L F S G
Sbjct: 216 AGLLGLARNPLSLVSQTAQKYGKVFSYCL-PSSSSSTGYLSFGSGDGDS----------- 263
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+AV+ P RL Y+S++ F
Sbjct: 264 ----------------KAVKFTP----------------------RLPPTVYSSVQKVFR 285
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
L + GV++ DTCYD S ++V+VP + L+F G +DL A +I V C
Sbjct: 286 ELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGAEMDL-APEGIIYVLKVSQVC 344
Query: 457 FAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFA S ++IIGNVQQ+ V +D A RVGF P+ C
Sbjct: 345 LAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 201 bits (510), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 195/375 (52%), Gaps = 26/375 (6%)
Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-- 191
PA A +P+ SG + E+ +G+GTP + +++ DTGSD++W+QC+PC
Sbjct: 125 PAPAVTIPDR------SGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSS 178
Query: 192 -ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLV 249
C+ Q DP+FDP SS+Y+ + C PQC + D+ + CLY V YGDGS T G L
Sbjct: 179 GHCHPQQDPLFDPSKSSTYAAVHCGEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLS 238
Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV 306
+T++ +S ++ G GCG N G F GLLGLG G LSL Q A+ +YCL
Sbjct: 239 RDTLALTSSRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLP 298
Query: 307 DRDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
+S +G L + D A ++R + +FY+V L +GG + +PP++F
Sbjct: 299 SSNS-TTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT 357
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
GG ++D GT +T L QAY LRD F P + D CYDF+G V
Sbjct: 358 R-----GGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEV 412
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRVS 480
VP VS FG G +L +I +D C AFA + LSIIGN QQ+ V
Sbjct: 413 VVPAVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAMDTGGLPLSIIGNTQQRSAEVI 471
Query: 481 FDLANNRVGFTPNKC 495
+D+A ++GF P C
Sbjct: 472 YDVAAEKIGFVPASC 486
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 201 bits (510), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 135/373 (36%), Positives = 191/373 (51%), Gaps = 36/373 (9%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIF 201
S P G+S + EY +G+G+P +V+DTGSD++W+QC PC + C+ + +F
Sbjct: 121 SVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 180
Query: 202 DPKTSSSYSPLPCAAPQCKSL----DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
DP SS+Y+ C+A C L + + C A +RC Y V YGDGS T G ++ ++
Sbjct: 181 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLS 240
Query: 257 NSGSVKGIALGCGHD--NEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSP 311
S V+G GC H G+ + GL+GLGG SL Q A S +YCL +P
Sbjct: 241 GSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCL--PATP 298
Query: 312 A-SGVLEFNSARGGDA------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
A SG L + G T P++R+KKV T+Y+ L +VGG+ + + PS+F
Sbjct: 299 ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA 358
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
G +VD GT ITRL AY +L +F + + DTC++F+GL V
Sbjct: 359 ------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVS 412
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFD 482
+PTV+L F G +DL A + S G C AFAPT A IGNVQQ+ V +D
Sbjct: 413 IPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 466
Query: 483 LANNRVGFTPNKC 495
+ GF C
Sbjct: 467 VGGGVFGFRAGAC 479
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 172/343 (50%), Gaps = 38/343 (11%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS---AC 227
++V+DT SDI W+QC PC +C+ Q DP++DP SS+++P+PC +P CK L S C
Sbjct: 170 TVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGC 229
Query: 228 R--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG-SAGLLG 284
+ C Y V YGDG T G VT+T++ + VK GC H G F +AG+L
Sbjct: 230 SPTTDECKYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGILA 289
Query: 285 LGGGMLSLTKQIK---ATSLAYCLVDRDS--------PASGVLEFNSARGGDAVTAPLIR 333
LGGG SL +Q + +YC+ S P L+F+ PLI+
Sbjct: 290 LGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFS--------YTPLIK 341
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
NK TFY V L V G+ + +PP+ F G ++D G +T+L Q Y +LR
Sbjct: 342 NKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRA 395
Query: 394 SFVRLAGNLKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+F P + V DTCYDF+ V+VP VSL F G LDL + ++
Sbjct: 396 AFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL----D 451
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
G FA P ++ IGNVQQQ V +D+ +VGF C
Sbjct: 452 GCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 150/452 (33%), Positives = 220/452 (48%), Gaps = 38/452 (8%)
Query: 64 ESETAAESFPLN---SSSSFSLPL-HSREILHKTRHNDYRSLVLSR-LERDSARVNTLIT 118
+SET + +N SS++ S+ L H +++++ + +S L R AR N +++
Sbjct: 36 DSETVCSASKVNLEPSSATVSMSLVHRYGPCAPSQYSNVPTPSISETLRRSRARTNYIMS 95
Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
+ ++ +A + + P G S EY +G GTP +++DT
Sbjct: 96 QASKSMGMGMASTPDDDDAAV-----TIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDT 150
Query: 179 GSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD---VSACRA--NR 231
GSD++W+QC PC T+CY Q DP+FDP SS+Y+P+ C C+ L + C + +
Sbjct: 151 GSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQ 210
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
C Y V Y DGS + G ET++ +V+ GCG D G GLLGLGG +S
Sbjct: 211 CGYSVEYADGSHSRGVYSNETLTLAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVS 270
Query: 292 L---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGL 345
L T + + +YCL +S A G L S G+ V P+ TFY V +
Sbjct: 271 LVVQTSSVYGGAFSYCLPALNSEA-GFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTM 329
Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
TG SVGG+ + IP S F GG+I+D GT T L AYN+L ++ +R A P
Sbjct: 330 TGISVGGKPLHIPQSAFR------GGMIIDSGTVDTELPETAYNAL-EAALRKALKAYPL 382
Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--S 463
FDTCY+F+G ++ VP V+ F G +DL N ++ D C AF +
Sbjct: 383 VPSDDFDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILVND-----CLAFQESGPD 437
Query: 464 SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L IIGNV Q+ V +D VGF C
Sbjct: 438 DGLGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 184/353 (52%), Gaps = 18/353 (5%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP F + DTGSD+ W QC+PC C+ Q P++DP SS++SP+PC++
Sbjct: 65 EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 218 QC----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS-----GSVKGIALGC 268
C +S + S ++ C Y +Y DG+++VG L TET++ G+S SV +A GC
Sbjct: 125 TCLPTWRSRNCSN-PSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGC 183
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----RDSP-ASGVLEFNSAR 322
G DN G + S G +GLG G LSL Q+ +YCL D DSP G L +
Sbjct: 184 GTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSTMDSPFFLGTLAELAPG 243
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G + PL+++ + Y+V L G S+G + IP F++ G+GG++VD GT T
Sbjct: 244 PGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFTI 303
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
L + + D +L G P + +L C+ S +P + LHF G + L
Sbjct: 304 LAKSGFREVVDRVAQLLGQ-PPVNASSLDSPCFP-SPDGEPFMPDLVLHFAGGADMRLHR 361
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NY+ + +FC + S S +GN QQQ ++ FD+ ++ F P C
Sbjct: 362 DNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDC 414
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/367 (34%), Positives = 193/367 (52%), Gaps = 38/367 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCA 215
GEY + +GTPP ++ V DTGSD+ W QC PC T+C++Q P+++P +S+++S LPC
Sbjct: 110 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCN 169
Query: 216 APQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
+ +S C C+Y YG G +T G +ET +FG+S + V
Sbjct: 170 S------SLSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARV 222
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEF 318
G+A GC + + + GSAGL+GLG G LSL Q+ A +YCL D +S ++ +L
Sbjct: 223 PGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNSTSTLLLGP 282
Query: 319 NSARGGDAV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++A G V + P + + + T+YY+ LTG S+G +A+ I P F + G GG+I+
Sbjct: 283 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLII 342
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVR---VPTVS 429
D GT IT L AY +R + L L G D C+ S +P+++
Sbjct: 343 DSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMT 402
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRV 488
LHF G + LPA +Y+I +G +C A T A+S GN QQQ + +D+ +
Sbjct: 403 LHFD-GADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETL 459
Query: 489 GFTPNKC 495
F P KC
Sbjct: 460 SFAPAKC 466
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 159/253 (62%), Gaps = 23/253 (9%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSR 162
RL+RDS RV ++ + LA + R+ K P A FS V+SG SQGSGEYF R
Sbjct: 86 RLQRDSLRVKSITS---LAAVSTGRNATKRTPRTAG----GFSGAVISGLSQGSGEYFMR 138
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+GVGTP MVLDTGSD+ WLQC PC CY Q+D IFDPK S +++ +PC + C+ L
Sbjct: 139 LGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRL 198
Query: 223 DVSA-C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
D S+ C R+ CLYQV+YGDGSFT GD TET++F + V + LGCGHDNEGLFVG
Sbjct: 199 DDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-HGARVDHVPLGCGHDNEGLFVG 257
Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTA 329
+AGLLGLG G LS Q K +YCLVDR S S ++ N+A +V
Sbjct: 258 AAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFT 317
Query: 330 PLIRNKKVDTFYY 342
PL+ N K+DTFYY
Sbjct: 318 PLLTNPKLDTFYY 330
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 193/375 (51%), Gaps = 26/375 (6%)
Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-- 191
PA A +P+ SG + E+ +G+GTP + +++ DTGSD++W+QC+PC
Sbjct: 130 PAPAVTIPDR------SGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSS 183
Query: 192 -ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLV 249
C+ Q DP+FDP SS+Y+ + C PQC + N CLY V YGDGS T G L
Sbjct: 184 GHCHPQQDPLFDPSKSSTYAAVHCGEPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLS 243
Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV 306
+T++ +S ++ G GCG N G F GLLGLG G LSL Q A+ +YCL
Sbjct: 244 RDTLALTSSRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLP 303
Query: 307 DRDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
+S +G L + D A ++R + +FY+V L +GG + +PP++F
Sbjct: 304 SSNS-TTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT 362
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
GG ++D GT +T L QAY LRD F P + D CYDF+G V
Sbjct: 363 R-----GGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEV 417
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRVS 480
VP VS FG G +L +I +D C AFA + LSIIGN QQ+ V
Sbjct: 418 IVPAVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAMDAGGLPLSIIGNTQQRSAEVI 476
Query: 481 FDLANNRVGFTPNKC 495
+D+A ++GF P C
Sbjct: 477 YDVAAEKIGFVPASC 491
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 121/381 (31%), Positives = 186/381 (48%), Gaps = 33/381 (8%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
+ PV + A SGEY +GTP P++ ++ +DTGSD+ W QC PC C+ Q P+FD
Sbjct: 72 YGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFD 131
Query: 203 PKTSSSYSPLPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
P SS++ + C P C+ L VSAC + RC Y +YGD S T G + +T +F +
Sbjct: 132 PSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMS 191
Query: 258 SG-------SVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRD 309
+V G+A GCG N G+F + +G+ G G G LSL Q++ +YCL D
Sbjct: 192 PNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHD 251
Query: 310 -------------SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
+P +G+ +S G + P+I + TFYY+ L G +VG +
Sbjct: 252 ETESNKTSAVFLGTPPNGLRAHSS---GPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP 308
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTC 414
+ S+F + + G GG ++D GT +T + L++ FV TS V
Sbjct: 309 VDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNLLCF 368
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQ 474
G + V VP + H A +DLP +NY+ +G C + +IGN QQ
Sbjct: 369 QRPKGGKQVPVPKLIFHL-ASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQ 427
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
Q + +D+ N+++ F +C
Sbjct: 428 QNMHIVYDVENSKLLFASAQC 448
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 134/362 (37%), Positives = 195/362 (53%), Gaps = 31/362 (8%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
+ SGEY I +GTPP + DTGSD+ W QC+PC +CY Q DP+FDPK SS+Y +
Sbjct: 88 TSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDV 147
Query: 213 PCAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIA 265
C++ QC +L+ A N C Y +YGD S+T G++ +T++ G++ + +K I
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207
Query: 266 LGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFN-- 319
+GCGH+N G F +G++GLGGG +SL Q+ + +YCLV S + N
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267
Query: 320 ---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
G V+ PLI K +TFYY+ L SVG + VQ P S +G+G II+D
Sbjct: 268 TNAVVSGTGVVSTPLIA-KSQETFYYLTLKSISVGSKEVQYPGS---DSGSGEGNIIIDS 323
Query: 377 GTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
GT +T L T+ Y+ L D S + P +G++L CY +G ++VP +++HF
Sbjct: 324 GTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSL---CYSATG--DLKVPAITMHFD 378
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G ++L N + + S CFAF S + SI GNV Q V +D + V F P
Sbjct: 379 -GADVNLKPSNCFVQI-SEDLVCFAFR-GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPT 435
Query: 494 KC 495
C
Sbjct: 436 DC 437
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 30/362 (8%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
+ SGEY + +GTPP + DTGSD+ W QC PC +CY Q DP+FDPKTSS+Y +
Sbjct: 84 TSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDV 143
Query: 213 PCAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIA 265
C++ QC +L+ A N C Y ++YGD S+T G++ +T++ G+S + +K I
Sbjct: 144 SCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPASGVLEFN-- 319
+GCGH+N G F + GG +SL KQ+ + +YCLV S + N
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFG 263
Query: 320 ---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
G V+ PLI +TFYY+ L SVG + +Q E+ +G II+D
Sbjct: 264 TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDS 320
Query: 377 GTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
GT +T L T+ Y+ L D S + P SG++L CY +G ++VP +++HF
Sbjct: 321 GTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL---CYSATG--DLKVPVITMHFD 375
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G + L + N + V S CFAF S + SI GNV Q V +D + V F P
Sbjct: 376 -GADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPT 432
Query: 494 KC 495
C
Sbjct: 433 DC 434
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 137/375 (36%), Positives = 196/375 (52%), Gaps = 28/375 (7%)
Query: 145 STPVVSGASQG---SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
S PV GA + EY + +GTPP+ + LDTGSD+ W QC+PC C+ Q+ P F
Sbjct: 18 SAPVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYF 77
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF 255
DP TSS+ S C + C+ L V++C + + C+Y +YGD S T G L + +F
Sbjct: 78 DPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF 137
Query: 256 -GNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSP 311
G SV G+A GCG N G+F + G+ G G G LSL Q+K + ++C + P
Sbjct: 138 VGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIP 197
Query: 312 ASGVLEFNS---ARGGDAV-TAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
++ +L+ + + G AV T PLI +N+ T YY+ L G +VG + +P S F +
Sbjct: 198 STVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL 257
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
G GG I+D GT+IT L Q Y +RD F ++ + P + + TC+
Sbjct: 258 TN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKP 315
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGTRVS 480
VP + LHF G +DLP +NY+ V D AG C A +IIGN QQQ V
Sbjct: 316 DVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVL 373
Query: 481 FDLANNRVGFTPNKC 495
+DL NN + F +C
Sbjct: 374 YDLQNNMLSFVAAQC 388
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 130/363 (35%), Positives = 197/363 (54%), Gaps = 31/363 (8%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPC 214
GEY + +GTPP + + DTGSD+ W QC PC+ +C+ Q P+++P +S+++ LPC
Sbjct: 90 GEYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPC 149
Query: 215 AAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIAL 266
+ V A +A C+Y YG G +T G +ET +FG++ + V GIA
Sbjct: 150 NSSLSMCAGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPGIAF 208
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEFNSARG 323
GC + + + GSAGL+GLG G LSL Q+ A +YCL D +S ++ +L ++A
Sbjct: 209 GCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNSTSTLLLGPSAALN 268
Query: 324 GDAV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
G V + P + + + T+YY+ LTG S+G +A+ I P F + G GG+I+D GT
Sbjct: 269 GTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTT 328
Query: 380 ITRLQTQAYNSLR---DSFVRL-AGNLKPTSGVALFDTCYDFSGLRSV--RVPTVSLHFG 433
IT L AY +R S V L A + ++G+ D CY S +P+++LHF
Sbjct: 329 ITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGL---DLCYALPTPTSAPPAMPSMTLHFD 385
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G + LPA +Y+I +G +C A T A+S GN QQQ + +D+ N + F P
Sbjct: 386 -GADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAP 442
Query: 493 NKC 495
KC
Sbjct: 443 AKC 445
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 140/431 (32%), Positives = 205/431 (47%), Gaps = 71/431 (16%)
Query: 102 VLSRLERDSARVNTLITK-LQLAIYNVDRHELKPAEAQILPEDFSTPVVS---------- 150
VL RD R+ TL + L+ N + K + +++ +TPV S
Sbjct: 100 VLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT---TTPVASSVEEQAGQLV 156
Query: 151 -----GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
G + GSGEYF + VG+PP+ FS++LDTGSD+NW+QC PC +C+QQ+D
Sbjct: 157 ATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND------- 209
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG---NSGS-- 260
C Y YGD S T GD ET + N GS
Sbjct: 210 -----------------------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSE 246
Query: 261 ---VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASG 314
V+ + GCGH N GLF G+AGLLGLG G LS + Q+++ S +YCLVDR+S +
Sbjct: 247 LYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNV 306
Query: 315 VLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
+ D ++ P + + VDTFYYV + V G+ + IP + +
Sbjct: 307 SSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISS 366
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRV 425
G GG I+D GT ++ AY +++ A P + D C++ SG+ +V++
Sbjct: 367 DGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQL 426
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
P + + F G + P +N I ++ C A T SA SIIGN QQQ + +D
Sbjct: 427 PELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTK 485
Query: 485 NNRVGFTPNKC 495
+R+G+ P KC
Sbjct: 486 RSRLGYAPTKC 496
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 30/362 (8%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
+ SGEY + +GTPP + DTGSD+ W QC PC +CY Q DP+FDPKTSS+Y +
Sbjct: 84 TSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDV 143
Query: 213 PCAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIA 265
C++ QC +L+ A N C Y ++YGD S+T G++ +T++ G+S + +K I
Sbjct: 144 SCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPASGVLEFN-- 319
+GCGH+N G F + GG +SL KQ+ + +YCLV S + N
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFG 263
Query: 320 ---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
G V+ PLI +TFYY+ L SVG + +Q E+ +G II+D
Sbjct: 264 TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDS 320
Query: 377 GTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
GT +T L T+ Y+ L D S + P SG++L CY +G ++VP +++HF
Sbjct: 321 GTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL---CYSATG--DLKVPVITMHFD 375
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G + L + N + V S CFAF S + SI GNV Q V +D + V F P
Sbjct: 376 -GADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPT 432
Query: 494 KC 495
C
Sbjct: 433 DC 434
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 181/359 (50%), Gaps = 23/359 (6%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
+Q GEY VG PP Q ++DTGSD+ WLQC+PC +CY Q+ IFDP S++Y L
Sbjct: 80 TQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKIL 139
Query: 213 PCAAPQCKSLDVSACRA-NR--CLYQVAYGDGSFTVGDLVTETVSFG--NSGSVK--GIA 265
P ++ C+S++ ++C + NR C Y + YGDGS++ GDL ET++ G N SVK
Sbjct: 140 PFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTV 199
Query: 266 LGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKATS------LAYCLVDRDSPASGVLEF 318
+GCG +N F G S+G++GLG G +SL Q++ S +YCL S S L F
Sbjct: 200 IGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASM-SNISSKLNF 258
Query: 319 NSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
A GD + I FYY+ L FSVG ++ S F E G+ II+D
Sbjct: 259 GDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGN--IIIDS 316
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT +T L Y+ L + L + + CY S + P + HF +G
Sbjct: 317 GTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYR-STFDELNAPVIMAHF-SGA 374
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L A N I V+ G C AF +S I GN+ QQ V +DL V F P C
Sbjct: 375 DVKLNAVNTFIEVEQ-GVTCLAFI-SSKIGPIFGNMAQQNFLVGYDLQKKIVSFKPTDC 431
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/355 (33%), Positives = 184/355 (51%), Gaps = 19/355 (5%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP F + DTGSD+ W QC+PC C+ Q P++DP SS++SP+PC++
Sbjct: 76 EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 135
Query: 218 QC----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS-----GSVKGIALGC 268
C +S + S ++ C Y +Y DG+++ G L TET++ G+S SV +A GC
Sbjct: 136 TCLPVLRSRNCST-PSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGC 194
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----RDSP-ASGVLEFNSAR 322
G DN G + S G +GLG G LSL Q+ +YCL D DSP G L +
Sbjct: 195 GTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSTLDSPFLLGTLAELAPG 254
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G + PL+++ + Y V L G ++G + IP F++ GG++VD GT +
Sbjct: 255 PGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSI 314
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVR-VPTVSLHFGAGKALDL 440
L + + D ++ G P + +L C+ +G R + +P + LHF G + L
Sbjct: 315 LPESGFRVVVDHVAQVLGQ-PPVNASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRL 373
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NY+ +FC T+S S++GN QQQ ++ FD+ ++ F P C
Sbjct: 374 HRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDC 428
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 192/392 (48%), Gaps = 48/392 (12%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TEC-YQQSDPIFDP 203
+P++SGAS GSG+YF I +G+PP+ +V DTGSD+ W++C C T C F
Sbjct: 70 SPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLA 129
Query: 204 KTSSSYSPLPCAAPQCKSL---DVSACRANR----CLYQVAYGDGSFTVGDLVTETVSF- 255
+ S+++SP C + C+ + + + C R C Y+ Y DGS T G ET +
Sbjct: 130 RHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLN 189
Query: 256 ---GNSGSVKGIALGCGHDNEG------LFVGSAGLLGLGGGMLSLTKQIK---ATSLAY 303
G +K IA GCG G F G++G++GLG G +S Q+ S +Y
Sbjct: 190 TSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSY 249
Query: 304 CLVDR--DSPASGVLEFNSARGGDAVTA-----------PLIRNKKVDTFYYVGLTGFSV 350
CL+D P + L GD V+ PL+ N + TFYY+ + G V
Sbjct: 250 CLLDYTLSPPPTSYLMI-----GDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFV 304
Query: 351 GGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL 410
G + I PS++ +DE G+GG ++D GT +T L AY + +F R PT G A
Sbjct: 305 DGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGAS 364
Query: 411 ----FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT---S 463
FD C + +G+ R P +SL G P +NY I + S G C A P S
Sbjct: 365 TRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDI-SEGIKCLAIQPVEAES 423
Query: 464 SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S+IGN+ QQG + FD +R+GF+ C
Sbjct: 424 GRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 145/371 (39%), Positives = 195/371 (52%), Gaps = 38/371 (10%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
+ P G G+ Y +GTP +M +DTGSD++W+QC+PC+ CY Q DP+F
Sbjct: 126 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLF 185
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNS 258
DP SSSY+ +PC P C L + A A Y V+YGDGS T G ++T++ S
Sbjct: 186 DPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSAS 245
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGV 315
+V+G GCGH GLF G GLLGLG SL +Q T +YCL + S A G
Sbjct: 246 SAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GY 304
Query: 316 LEFNSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
L GG + AP L+ + T+Y V LTG SVGGQ + +P S F
Sbjct: 305 LTLG--VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV-- 360
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPT 427
VD GT +TRL AY +LR +F +A PT+ + DTCY+F+G +V +P
Sbjct: 361 ----VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 416
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLA 484
V+L FG+G + L A L +F C AFAP+ S ++I+GNVQQ+ V D
Sbjct: 417 VALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID-- 467
Query: 485 NNRVGFTPNKC 495
VGF P+ C
Sbjct: 468 GTSVGFKPSSC 478
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 128/368 (34%), Positives = 193/368 (52%), Gaps = 39/368 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCA 215
GEY + +GTPP ++ V DTGSD+ W QC PC T+C++Q P+++P +S+++S LPC
Sbjct: 112 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCN 171
Query: 216 APQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
+ +S C C+Y YG G +T G +ET +FG+S + V
Sbjct: 172 S------SLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSETFTFGSSAADQARV 224
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEF 318
G+A GC + + + GSAGL+GLG G LSL Q+ A +YCL D +S ++ +L
Sbjct: 225 PGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNSTSTLLLGP 284
Query: 319 NSARGGDAV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++A G V + P + + + T+YY+ LTG S+G +A+ I P F + G GG+I+
Sbjct: 285 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLII 344
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVR---VPTV 428
D GT IT L AY +R + PT S D C+ S +P++
Sbjct: 345 DSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSM 404
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNR 487
+LHF G + LPA +Y+I +G +C A T A+S GN QQQ + +D+
Sbjct: 405 TLHFD-GADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREET 461
Query: 488 VGFTPNKC 495
+ F P KC
Sbjct: 462 LSFAPAKC 469
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 128/367 (34%), Positives = 190/367 (51%), Gaps = 28/367 (7%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFD 202
S P G+S S EY + +G+GTP +++LDTGS + W+QC+PC ++CY Q P+FD
Sbjct: 115 SVPTQLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFD 174
Query: 203 PKTSSSYSPLPCAAPQCKSL----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF 255
P TSSSYSP+PC + +C++L D C ++ C Y++ YG G+ G+ T+ ++
Sbjct: 175 PNTSSSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTL 234
Query: 256 GNSGSVKGIALGCGHDNE-GLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDRDS 310
G VK GCGH + G F + G+LGLG SL Q A ++CL
Sbjct: 235 GPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV 294
Query: 311 PASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
++G L + A V PL+ FY + T SV GQ + IPP++F
Sbjct: 295 -STGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFRE----- 348
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G+I D GT ++ LQ AY +LR +F V DTC++F+G +V VPTVS
Sbjct: 349 -GVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVS 407
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS-IIGNVQQQGTRVSFDLANNRV 488
L F G + L A + ++ +D C AF + + +IG+V Q+ V +D+ +V
Sbjct: 408 LTFRGGATVHLDASSGVL-MDG----CLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKV 462
Query: 489 GFTPNKC 495
GF C
Sbjct: 463 GFRTGAC 469
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 123/377 (32%), Positives = 190/377 (50%), Gaps = 35/377 (9%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
P+ SGA+ + Y + +G+G + ++V+DT S++ W+QC+PC C+ Q DP+FDP
Sbjct: 105 LQVPITSGANLRTLNYVATVGLGAA--EATVVVDTASELTWVQCQPCESCHDQQDPLFDP 162
Query: 204 KTSSSYSPLPCAAPQCKSLDV------SACRANR-----CLYQVAYGDGSFTVGDLVTET 252
+S SY+ +PC + C +L V S C + C Y ++Y DGS++ G L +
Sbjct: 163 SSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDK 222
Query: 253 VSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
+ ++G GCG N+G F G++GL+GLG +SL Q +YCL R
Sbjct: 223 LRLAGQ-DIEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMR 281
Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDT-------FYYVGLTGFSVGGQAVQIPPSL 361
+S +SG L + P++ V FY++ LTG +VGGQ V+ P
Sbjct: 282 ESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESP--W 339
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
F G +I+D GT IT L YN++R F+ ++ DTC++ +GL+
Sbjct: 340 FSA-----GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCFNLTGLK 394
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDS-AGTFCFAFAPTSSAL--SIIGNVQQQGTR 478
V+VP++ F +++ +K L V S A C A A S SIIGN QQ+ R
Sbjct: 395 EVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLR 454
Query: 479 VSFDLANNRVGFTPNKC 495
V FD +++GF C
Sbjct: 455 VIFDTLGSQIGFAQETC 471
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 145/371 (39%), Positives = 194/371 (52%), Gaps = 38/371 (10%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
+ P G G+ Y +GTP +M +DTGSD++W+QC+PC CY Q DP+F
Sbjct: 34 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLF 93
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNS 258
DP SSSY+ +PC P C L + A A Y V+YGDGS T G ++T++ S
Sbjct: 94 DPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSAS 153
Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGV 315
+V+G GCGH GLF G GLLGLG SL +Q T +YCL + S A G
Sbjct: 154 SAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GY 212
Query: 316 LEFNSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
L GG + AP L+ + T+Y V LTG SVGGQ + +P S F
Sbjct: 213 LTLG--VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV-- 268
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPT 427
VD GT +TRL AY +LR +F +A PT+ + DTCY+F+G +V +P
Sbjct: 269 ----VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 324
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLA 484
V+L FG+G + L A L +F C AFAP+ S ++I+GNVQQ+ V D
Sbjct: 325 VALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID-- 375
Query: 485 NNRVGFTPNKC 495
VGF P+ C
Sbjct: 376 GTSVGFKPSSC 386
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 143/361 (39%), Positives = 191/361 (52%), Gaps = 38/361 (10%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIFDPKTSSSYSP 211
G+ Y +GTP +M +DTGSD++W+QC+PC CY Q DP+FDP SSSY+
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAA 195
Query: 212 LPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
+PC P C L + A A Y V+YGDGS T G ++T++ S +V+G GC
Sbjct: 196 VPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGC 255
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGD 325
GH GLF G GLLGLG SL +Q T +YCL + S A G L GG
Sbjct: 256 GHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTLG--VGGP 312
Query: 326 AVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ AP L+ + T+Y V LTG SVGGQ + +P S F VD GT
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTV 366
Query: 380 ITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
+TRL AY +LR +F +A PT+ + DTCY+F+G +V +P V+L FG+G
Sbjct: 367 VTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGAT 426
Query: 438 LDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
+ L A L +F C AFAP+ S ++I+GNVQQ+ V D VGF P+
Sbjct: 427 VTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSS 477
Query: 495 C 495
C
Sbjct: 478 C 478
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 134/381 (35%), Positives = 201/381 (52%), Gaps = 44/381 (11%)
Query: 149 VSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPK 204
VS +Q S GEY + +GTPP + + DTGSD+ W QC PCT +C++Q P+++P
Sbjct: 77 VSAPTQNSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPS 136
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR------------CLYQVAYGDGSFTVGDLVTET 252
+S++++ LPC + +S C A C Y V YG G +V +ET
Sbjct: 137 SSTTFAVLPCNS------SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSET 189
Query: 253 VSFGNSGS----VKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV- 306
+FG++ + V GIA GC + G SA GL+GLG G LSL Q+ +YCL
Sbjct: 190 FTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTP 249
Query: 307 --DRDSPASGVLEFNSARGGDA--VTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPP 359
D +S ++ +L +++ G A + P + + ++TFYY+ LTG S+G A+ IPP
Sbjct: 250 YQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPP 309
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDF 417
F ++ G GG+I+D GT IT L AY +R + V L L T G A D C+
Sbjct: 310 DAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSAATGLDLCFML 368
Query: 418 SGLRSV--RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQ 474
S +P+++LHF G + LPA +Y++ DS G +C A T ++I+GN QQ
Sbjct: 369 PSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDS-GLWCLAMQNQTDGEVNILGNYQQ 426
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
Q + +D+ + F P KC
Sbjct: 427 QNMHILYDIGQETLSFAPAKC 447
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 130/371 (35%), Positives = 196/371 (52%), Gaps = 41/371 (11%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPC 214
+GEY + +GTPP + + DTGSD+ W QC PCT +C++Q P+++P +S++++ LPC
Sbjct: 89 AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 148
Query: 215 AAPQCKSLDVSACRANR------------CLYQVAYGDGSFTVGDLVTETVSFGNS---- 258
+ +S C A C Y V YG G +V +ET +FG++
Sbjct: 149 NS------SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSETFTFGSTPAGH 201
Query: 259 GSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASG 314
V GIA GC + G SA GL+GLG G LSL Q+ +YCL D +S ++
Sbjct: 202 ARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNSTSTL 261
Query: 315 VLEFNSARGGDA--VTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+L +++ G A + P + + ++TFYY+ LTG S+G A+ IPP F ++ G
Sbjct: 262 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 321
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RV 425
GG+I+D GT IT L AY +R + V L L T G A D C+ S +
Sbjct: 322 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAM 380
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLA 484
P+++LHF G + LPA +Y++ DS G +C A T ++I+GN QQQ + +D+
Sbjct: 381 PSMTLHFN-GADMVLPADSYMMSDDS-GLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 438
Query: 485 NNRVGFTPNKC 495
+ F P KC
Sbjct: 439 QETLSFAPAKC 449
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 183/354 (51%), Gaps = 25/354 (7%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
GSGEY + +GTPP + + DTGSD+ W QC PC +CY+QS PIFDP S+S+S +PC
Sbjct: 88 GSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPC 147
Query: 215 AAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
+ CK++D S C A C Y YGD ++T GDL E ++ G+S SVK + +GCGH++
Sbjct: 148 NSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSS-SVKSV-IGCGHESG 205
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS---ARGGD 325
G F ++G++GLGGG LSL Q+ TS +YCL S A+G + F G
Sbjct: 206 GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPG 265
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
V+ PLI V T+YYV L S+G + M A G +I+D GT ++ L
Sbjct: 266 VVSTPLISKNPV-TYYYVTLEAISIGNER--------HMASAKQGNVIIDSGTTLSFLPK 316
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALDLPAK 443
+ Y+ + S +++ + +D C+D + S +P ++ F G ++L
Sbjct: 317 ELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPV 376
Query: 444 NYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N V + C P S IIGN+ + +DL R+ F P C
Sbjct: 377 NTFQKV-ANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 139/430 (32%), Positives = 213/430 (49%), Gaps = 48/430 (11%)
Query: 109 DSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS-------TPVVSGASQGSGEYFS 161
D R+ LQ A + P E+ +DF + +VSG+S GSG+YF
Sbjct: 2 DRGRIAAFGRVLQEAAQKNSTNSTLPRESLATIQDFQGEDPALFSRLVSGSSIGSGQYFV 61
Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PIFDPKTSSSYSPLPCAAPQ 218
+ VGTP ++F +++DTGSD+ W+QC P S P +D +SSSY +PC +
Sbjct: 62 ELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCTDDE 121
Query: 219 CKSLDV---SACRANR---CLYQVAYGDGSFTVGDLVTETVSF----------GNSGS-- 260
C+ L S+C C Y Y D S T G L ET+S GN +
Sbjct: 122 CQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHKTRR 181
Query: 261 --VKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVD--RDSP 311
+K +ALGC ++ G F+G++G+LGLG G +SL Q + T+L +YCLVD R S
Sbjct: 182 IRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYLRGSN 241
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDG 370
AS L P++RN +FYYV +TG +V G+ V I S + +D G+
Sbjct: 242 ASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNK 301
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
G I D GT ++ L+ AY+ + + + L + G F+ CY+ + + +P
Sbjct: 302 GTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FELCYNVTRMEK-GMPK 357
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSALSIIGNVQQQGTRVSFDLAN 485
+ + F G ++LP NY++ V + C A T++ +I+GN+ QQ + +DLA
Sbjct: 358 LGVEFQGGAVMELPWNNYMVLV-AENVQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAK 416
Query: 486 NRVGFTPNKC 495
R+GF + C
Sbjct: 417 ARIGFKWSPC 426
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 139/417 (33%), Positives = 214/417 (51%), Gaps = 36/417 (8%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
L+R+ D + + + L ++ RH + A S PV + GE+
Sbjct: 32 LTRVHADPSVTASQFVRAALH-RDMHRHNARKLAASSSDGTVSAPV--SPTTVPGEFLMT 88
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
+ +GTPP F + DTGSD+ W QC PC+ +C+QQ P+++P +S+++S LPC + S
Sbjct: 89 LAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNS----S 144
Query: 222 LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-----VKGIALGCGHDNEGLF 276
L + A A C+Y + YG G +T TET +FG+S V GIA GC + + G
Sbjct: 145 LGLCA-PACACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFN 202
Query: 277 VGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEFNSARGGDAVTA--P 330
SA GL+GLG G LSL Q+ A +YCL D +S ++ +L +++ V + P
Sbjct: 203 ASSASGLVGLGRGSLSLVSQLGAPKFSYCLTPYQDTNSTSTLLLGPSASLNDTGVVSSTP 262
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
+ + +YY+ LTG S+G A+ IPP+ F + G GG+I+D GT IT L AY
Sbjct: 263 FVASPS-SIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQ 321
Query: 391 LRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSV--RVPTVSLHFGAGKALDLPAKNYL 446
+R + + L L T G A D C++ S +P+++LHF G + LPA NY+
Sbjct: 322 VRAAVLSLV-TLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFD-GADMVLPADNYM 379
Query: 447 I----PVDSAGTFCFAFAPTSS----ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ P + +C A + +SI+GN QQQ + +D+ + F P KC
Sbjct: 380 MSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKC 436
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 147/449 (32%), Positives = 212/449 (47%), Gaps = 58/449 (12%)
Query: 71 SFPLNSSSSFSLPLHSREIL----HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYN 126
SF + FS+ L R+ L +K N Y+ V D+AR + N
Sbjct: 19 SFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFV------DAARRSI----------N 62
Query: 127 VDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQ 186
H K + A I P+ P + GEY VGTPP + ++DTGSDI WLQ
Sbjct: 63 RANHFYKYSLANI-PQSTVIPDI-------GEYLMTYSVGTPPFKLYGIVDTGSDIVWLQ 114
Query: 187 CRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR-ANRCLYQVAYGDGSFTV 245
C PC ECY Q+ P+F+P SSSY +PC + C+S++ ++C N C Y YGD S +
Sbjct: 115 CEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSG 174
Query: 246 GDLVTETVSF----GNSGSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKATS 300
GDL +T++ G + S I +GCG +N + G S+G++G G G S Q+ +++
Sbjct: 175 GDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSST 234
Query: 301 ---LAYCL------VDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFS 349
+YCL + S A+ L F A GD V I K +TFYY+ L FS
Sbjct: 235 GGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFS 294
Query: 350 VGGQAVQI---PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS 406
VG + V+I P + +G II+D GT +T L Y+ L + V L +
Sbjct: 295 VGNRRVEIGGVP------NGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDD 348
Query: 407 GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
+ CY P +++HF G +DL + + V + G FC AF +S
Sbjct: 349 PTQTLNLCYSVKA-EGYDFPIITMHF-KGADVDLHPISTFVSV-ADGVFCLAFE-SSQDH 404
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+I GN+ QQ V +DL V F P+ C
Sbjct: 405 AIFGNLAQQNLMVGYDLQQKIVSFKPSDC 433
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 144/440 (32%), Positives = 204/440 (46%), Gaps = 44/440 (10%)
Query: 84 LHSREILH-KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL-- 140
+H E H + + + + + +SR S N TK Q R L+ + +
Sbjct: 19 IHFSEHSHAEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRA 78
Query: 141 -PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP 199
P D + V+SG G Y I +GTPP + DTGSD+ W QC PC CY+Q +P
Sbjct: 79 SPNDIQSDVISGG----GAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEP 134
Query: 200 IFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
+FDPK S +Y L C C+ L S N C Y +YGD S+T GDL ++T++ G+
Sbjct: 135 LFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGS 194
Query: 258 S----GSVKGIALGCGHDNEGLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV-- 306
+ S GIA GCGHDN G F G ++ L+ ++ +YCLV
Sbjct: 195 TEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGG-QFSYCLVPL 253
Query: 307 DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP----- 358
DS S + F + G V+ PLI+ DTFYY+ L G SVG + V
Sbjct: 254 SSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTP-DTFYYLTLEGLSVGSETVAFKGFSEN 312
Query: 359 ---PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
P+ E +G II+D GT +T L Y + + G T +F CY
Sbjct: 313 KSSPAAVE-----EGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY 367
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
S + ++ +PT++ HF G + LP N + V CF+ P SS L+I GN+ Q
Sbjct: 368 --SSVNNLEIPTITAHF-TGADVQLPPLNTFVQV-QEDLVCFSMIP-SSNLAIFGNLAQI 422
Query: 476 GTRVSFDLANNRVGFTPNKC 495
V +DL NN+V F C
Sbjct: 423 NFLVGYDLKNNKVSFKQTDC 442
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 185/371 (49%), Gaps = 34/371 (9%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
+PV+S +GEY I +GTPP + DTGSD+ W QC+PC CY+Q +PIFDP
Sbjct: 86 SPVISN----NGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAK 141
Query: 206 SSSYSPLPCAAPQCKSL-DVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---- 259
S +Y L C C +L C N C+Y +YGDGS T GDL +T++ G++
Sbjct: 142 SKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPV 201
Query: 260 SVKGIALGCGHDNEGLF----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV--DRDSPAS 313
SV + GCGH+N G F G GL G M+S + + +YCLV D S
Sbjct: 202 SVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVS 261
Query: 314 GVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV------QIPPSLFEM 364
+ F S G AV+ PL +++ DTFYY+ L SVG + + ++ L +
Sbjct: 262 SKMHFGSRGIVSGAGAVSTPL-ASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADA 320
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
DE G II+D GT +T L Y +L + V G +F CY S L +R
Sbjct: 321 DE---GNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCY--SNLSGLR 375
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
+PT++ HF G L+L N + V FCFA P S L+I GN+ Q V +DL
Sbjct: 376 IPTITAHF-VGADLELKPLNTFVQVQED-LFCFAMIPVSD-LAIFGNLAQMNFLVGYDLK 432
Query: 485 NNRVGFTPNKC 495
+ V F P C
Sbjct: 433 SRTVSFKPTDC 443
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 137/421 (32%), Positives = 207/421 (49%), Gaps = 47/421 (11%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D+ARV++L +++ Y + AE + PV SGA + Y + +G+
Sbjct: 93 LSTDAARVSSLQGRIEH--YRLTTTS-SSAEVAVTASKAQVPVSSGARLRTLNYVATVGL 149
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD-- 223
G + ++++DT S++ W+QC PC C+ Q P+FDP +S SY+ +PC +P C +L
Sbjct: 150 GGG--EATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQ 207
Query: 224 --------VSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
C A R C Y ++Y DGS++ G L + +S + G GCG N
Sbjct: 208 LATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE-VIDGFVFGCGTSN 266
Query: 273 EG-LFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARGGDAV 327
+G F G++GL+GLG LSL Q +YCL + R+S ASG L
Sbjct: 267 QGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGDDPSAYRN 326
Query: 328 TAPLIRNKKVDT--------FYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGT 378
+ P++ V FY V LTG +VGGQ E++ G IVD GT
Sbjct: 327 STPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ---------EVESTGFSARAIVDSGT 377
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
IT L YN++R F+ G ++ DTC++ +GL+ V+VP+++L F G +
Sbjct: 378 VITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEV 437
Query: 439 DLPAKN--YLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
++ + Y + DS+ C A A S SIIGN QQ+ RV FD + ++VGF
Sbjct: 438 EVDSGGVLYFVSSDSS-QVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQET 496
Query: 495 C 495
C
Sbjct: 497 C 497
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 196/371 (52%), Gaps = 41/371 (11%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPC 214
+GEY + +GTPP + + DTGSD+ W QC PCT +C++Q P+++P +S++++ LPC
Sbjct: 29 AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 88
Query: 215 AAPQCKSLDVSACRANR------------CLYQVAYGDGSFTVGDLVTETVSFGNS---- 258
+ +S C A C Y V YG G +V +ET +FG++
Sbjct: 89 NS------SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSETFTFGSTPAGH 141
Query: 259 GSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASG 314
V GIA GC + G SA GL+GLG G LSL Q+ +YCL D +S ++
Sbjct: 142 ARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNSTSTL 201
Query: 315 VLEFNSARGGDA--VTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+L +++ G A + P + + ++TFYY+ LTG S+G A+ IPP F ++ G
Sbjct: 202 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 261
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RV 425
GG+I+D GT IT L AY +R + V L L T G A D C+ S +
Sbjct: 262 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAM 320
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLA 484
P+++LHF G + LPA +Y++ D +G +C A T ++I+GN QQQ + +D+
Sbjct: 321 PSMTLHFN-GADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 378
Query: 485 NNRVGFTPNKC 495
+ F P KC
Sbjct: 379 QETLSFAPAKC 389
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 132/400 (33%), Positives = 199/400 (49%), Gaps = 38/400 (9%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
R +L P+ A P G GEY + +GTPP + + DTGSD+ W QC
Sbjct: 58 REQLAPSSAAAAGLTVGAPTQKDLRNG-GEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116
Query: 189 PC--------TECYQQSDPIFDPKTSSSYSPLPCAAP--QCKSL-DVSACRANRCLYQVA 237
PC +C++QS +++P +S+++ LPC +P C ++ S C+Y
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQT 176
Query: 238 YGDGSFTVGDLVTETVSFGNSGS-----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL 292
YG G +T G ET +FG+S + V IA GC + + + GSAGL+GLG G +SL
Sbjct: 177 YGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSL 235
Query: 293 TKQIKATSLAYCLV---DRDSPASGVLEFNSARG----GDAVTAPLI---RNKKVDTFYY 342
Q+ A + +YCL D +S ++ +L ++A G + P + + T+YY
Sbjct: 236 VSQLGAGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYY 295
Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS-----FVR 397
+ LTG SVG A+ IPP F + G GG+I+D GT IT L AY +R + R
Sbjct: 296 LNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTR 355
Query: 398 LAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
L P L D C+ + +P+++LHF G + LP +NY+I +G +C
Sbjct: 356 LPLAHGPDHSTGL-DLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI--LGSGVWC 412
Query: 457 FAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A T A+S++GN QQQ V +D+ + F P C
Sbjct: 413 LAMRNQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVC 452
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/352 (34%), Positives = 182/352 (51%), Gaps = 25/352 (7%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCA 215
G Y +G+GTP + F++ DTGSD+ W QC PC C+ Q+ P FDP TS+SY + C+
Sbjct: 138 GAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCS 197
Query: 216 APQCK-----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
+ CK + C +N CLY + YG G +T+G L TET++ +S K GC
Sbjct: 198 SEFCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATETLAIASSDVFKNFLFGCSE 256
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQI--KATSL-AYCLVDRDSPASGVLEFNSARGGDAV 327
++ G F G+ GLLGLG ++L Q K +L +YCL S ++G L F A
Sbjct: 257 ESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPS-STGHLSFGVEVSQAAK 315
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
+ P+ + K+ Y + G SV G+ + I S+ I+D GT T L +
Sbjct: 316 STPI--SPKLKQLYGLNTVGISVRGRELPINGSISRT--------IIDSGTTFTFLPSPT 365
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKNY 445
Y++L +F + N T+G + F CYDFS G ++ +P +S+ F G +++
Sbjct: 366 YSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGI 425
Query: 446 LIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+IPV+ C AFA T S +I GN QQ+ V +D+A VGF P C
Sbjct: 426 MIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/435 (31%), Positives = 198/435 (45%), Gaps = 51/435 (11%)
Query: 96 NDYRSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGA 152
N + L DS R N L+ ++ L +L P+ + P + PV SG+
Sbjct: 26 NHHHGLRADLTHIDSGRGFTRNELLRRMVLRSRARAAKQLCPSRSGT-PVRVTAPVASGS 84
Query: 153 SQ-GSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
G EY G+GTP P+Q ++ +DTGSD+ W QCRPC +C+ Q P FD S +
Sbjct: 85 HVVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVH 144
Query: 211 PLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIAL 266
+ C P C++L AC C YQV YGD S T+G L ++ +F G +V +
Sbjct: 145 GVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVF 204
Query: 267 GCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----------RDSPASG 314
GCG N G F G+ G G G LSL +Q+ +S +YC +PA G
Sbjct: 205 GCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESKSTPVFLGGAPADG 264
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+ + G ++ P + N +YY+ L G +VG + +P S F + G GG I+
Sbjct: 265 LRAHAT---GPILSTPFLPNHP--EYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTII 319
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSG--------------L 420
D GTAIT + SL ++FV + V L T Y+ +G
Sbjct: 320 DSGTAITAFPRAVFRSLWEAFV---------AQVPLPHTSYNDTGEPTLQCFSTESVPDA 370
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
V VP ++LH G +LP +NY+ + C ++IGN QQQ +
Sbjct: 371 SKVPVPKMTLHL-EGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIV 429
Query: 481 FDLANNRVGFTPNKC 495
DLA N++ P +C
Sbjct: 430 HDLAGNKLVIEPAQC 444
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 146/433 (33%), Positives = 205/433 (47%), Gaps = 48/433 (11%)
Query: 76 SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
SSS ++PL+ R + +L LE D R + KL
Sbjct: 59 SSSGTTVPLNHRYGPCSPAPSAKVPTILELLEHDQLRAKYIQRKLS-------------G 105
Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
+ P D + P G++ + EY +G+G+P +M++DTGSD++W++C
Sbjct: 106 TDGLQPLDLTVPTTLGSALDTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS-----T 160
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETV 253
+FDP S++Y+P C++ C L + C + C Y+V YGDGS T G ++T+
Sbjct: 161 DGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTL 220
Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSA--GLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
+ S +V GC H E F G GL+GLGG SL Q AT S +YCL
Sbjct: 221 ALSASDTVTDFHFGCSHHEED-FDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPT 279
Query: 309 DSPASGVLEFNSARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
+ SG L F + G G VT P++R K T Y V L SVGG + I PS+
Sbjct: 280 NR-TSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLS--- 335
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSL----RDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
G ++D GT IT L +AY++L R S RL + + + + DTCYDF+GL +
Sbjct: 336 ---NGSVMDSGTVITWLPRRAYSALSSAFRSSMTRL--RHQRAAPLGILDTCYDFTGLVN 390
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
V +P VSL G +DL +I C AFA TS SIIGNVQQ+ V D
Sbjct: 391 VSIPAVSLVLDGGAVVDLDGNGIMI------QDCLAFAATSGD-SIIGNVQQRTFEVLHD 443
Query: 483 LANNRVGFTPNKC 495
+ GF C
Sbjct: 444 VGQGVFGFRSGAC 456
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/353 (34%), Positives = 187/353 (52%), Gaps = 22/353 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY I +GTPP + DTGSD+ W QC PC +CYQQ+ P+FDPK SS+Y + C++
Sbjct: 84 GEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSS 143
Query: 217 PQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGH 270
QC++L+ ++C N C Y + YGD S+T GD+ +TV+ G+SG S++ + +GCGH
Sbjct: 144 SQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGH 203
Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV--DRDSPASGVLEF--NSAR 322
+N G F +G++GLGGG SL Q++ + +YCLV ++ + + F N
Sbjct: 204 ENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIV 263
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
GD V + + K T+Y++ L SVG + +Q ++F G+G I++D GT +T
Sbjct: 264 SGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIF---GTGEGNIVIDSGTTLTL 320
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
L + Y L + + CY S S +VP +++HF G + L
Sbjct: 321 LPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSS--SFKVPDITVHFKGGD-VKLGN 377
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N + V S CFAFA + L+I GN+ Q V +D + V F C
Sbjct: 378 LNTFVAV-SEDVSCFAFA-ANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDC 428
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 199/384 (51%), Gaps = 41/384 (10%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PIFDPK 204
+VSG+S GSG+YF + VGTP ++F +++DTGSD+ W+QC P S P +D
Sbjct: 16 LVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKS 75
Query: 205 TSSSYSPLPCAAPQCKSLDV---SACRANR---CLYQVAYGDGSFTVGDLVTETVSF--- 255
+SSSY +PC +C L S+C C Y Y D S T G L ET+S
Sbjct: 76 SSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSR 135
Query: 256 -------GNSGS----VKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL-- 301
GN + +K +ALGC ++ G F+G++G+LGLG G +SL Q + T+L
Sbjct: 136 KRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGG 195
Query: 302 --AYCLVD--RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ- 356
+YCLVD R S AS L R P++RN +FYYV +TG +V G+ V
Sbjct: 196 IFSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDG 255
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
I S + +D G+ G I D GT ++ L+ AY+ + + + L + G F+
Sbjct: 256 IASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FEL 312
Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSALSIIGN 471
CY+ + + +P + + F G ++LP NY++ V + C A T++ +I+GN
Sbjct: 313 CYNVTRMEK-GMPKLGVEFQGGAVMELPWNNYMVLV-AENVQCVALQKVTTTNGSNILGN 370
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
+ QQ + +DLA R+GF + C
Sbjct: 371 LLQQDHHIEYDLAKARIGFKWSPC 394
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 184/369 (49%), Gaps = 33/369 (8%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PV SGA + Y + +G+G + ++++DT S++ W+QC PC C+ Q P+FDP +S
Sbjct: 115 PVTSGARLRTLNYVATVGLGGG--EATVIVDTASELTWVQCAPCASCHDQQGPLFDPASS 172
Query: 207 SSYSPLPCAAPQCKSLDVSACRAN---------RCLYQVAYGDGSFTVGDLVTETVSFGN 257
SY+ LPC + C +L V+ A C Y ++Y DGS++ G L + +S
Sbjct: 173 PSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAG 232
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG 314
+ G GCG N+G F G++GL+GLG LSL Q +YCL ++S +SG
Sbjct: 233 E-VIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSG 291
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
L + P++ V FY+V LTG ++GGQ V E+
Sbjct: 292 SLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV----------ESSA 341
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G +IVD GT IT L YN+++ F+ G ++ DTC++ +G R V++P++
Sbjct: 342 GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLK 401
Query: 430 LHFGAGKALDLPAKNYLIPVDS-AGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANN 486
F +++ + L V S + C A A S SIIGN QQ+ RV FD +
Sbjct: 402 FVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGS 461
Query: 487 RVGFTPNKC 495
++GF C
Sbjct: 462 QIGFAQETC 470
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 137/420 (32%), Positives = 209/420 (49%), Gaps = 49/420 (11%)
Query: 99 RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
R+LVL D+ RV +L +L++ E +E QI P+ SG S
Sbjct: 89 RALVL-----DNIRVQSL--QLKIKAMTSSTTEQSVSETQI-------PLTSGIKLESLN 134
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y + +G + S+++DTGSD+ W+QC+PC CY Q P++DP SSSY + C +
Sbjct: 135 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 192
Query: 219 CKSL-----DVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
C+ L + C N C Y V+YGDGS+T GDL +E++ G++ ++ G
Sbjct: 193 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENFVFG 251
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS---- 320
CG +N+GLF GS+GL+GLG +SL Q T +YCL + ASG L F +
Sbjct: 252 CGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSV 311
Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
PL++N ++ +FY + LTG S+GG V++ S F GI++D GT
Sbjct: 312 YTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGR------GILIDSGTV 363
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL- 438
ITRL Y +++ F++ G ++ DTC++ + + +P + + F L
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423
Query: 439 -DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D+ Y + D A C A A S + + IIGN QQ+ RV +D R+G C
Sbjct: 424 VDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 129/392 (32%), Positives = 189/392 (48%), Gaps = 48/392 (12%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-YQQSDPIFD 202
+P++SGAS GSG+YF I +GTPP+ +V DTGSD+ W++C C C + F
Sbjct: 73 LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL 132
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSA---CRANR----CLYQVAYGDGSFTVGDLVTETVSF 255
P+ SSS+SP C P C+ L + C R C + +Y DGS + G ET +
Sbjct: 133 PRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTL 192
Query: 256 ----GNSGSVKGIALGCGHDNEG------LFVGSAGLLGLGGGMLSLTKQIK---ATSLA 302
G+ +KG++ GCG G F G+ G++GLG G +S + Q+ +
Sbjct: 193 KSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFS 252
Query: 303 YCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVD-----------TFYYVGLTGFS 349
YCL+D P + L GG + PL K+ TFYY+ + +
Sbjct: 253 YCLMDYTLSPPPTSFLMI----GGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSIT 308
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTS 406
+ G + I P+++E+DE G+GG +VD GT +T L AY + S V+L + T
Sbjct: 309 IDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTP 368
Query: 407 GVALFDTCYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS- 464
G FD C + SG R +P + G G P +NY + + G C A S
Sbjct: 369 G---FDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE-GVMCLAIRAVESG 424
Query: 465 -ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S+IGN+ QQG + FD +R+GFT C
Sbjct: 425 NGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 185/370 (50%), Gaps = 35/370 (9%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PV SGA + Y + +G+G + ++++DT S++ W+QC PC C+ Q P+FDP +S
Sbjct: 114 PVTSGARLRTLNYVATVGLGGG--EATVIVDTASELTWVQCAPCASCHDQQGPLFDPASS 171
Query: 207 SSYSPLPCAAPQCKSLDVSACRAN---------RCLYQVAYGDGSFTVGDLVTETVSFGN 257
SY+ LPC + C +L V+ A C Y ++Y DGS++ G L + +S
Sbjct: 172 PSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAG 231
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG 314
+ G GCG N+G F G++GL+GLG LSL Q +YCL ++S +SG
Sbjct: 232 E-VIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSG 290
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
L + P++ V FY+V LTG ++GGQ V E+
Sbjct: 291 SLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV----------ESSA 340
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G +IVD GT IT L YN+++ F+ G ++ DTC++ +G R V++P++
Sbjct: 341 GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLK 400
Query: 430 LHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLAN 485
F +++ + Y + DS+ C A A S SIIGN QQ+ RV FD
Sbjct: 401 FVFEGNVEVEVDSSGVLYFVSSDSS-QVCLALASLKSEYETSIIGNYQQKNLRVIFDTLG 459
Query: 486 NRVGFTPNKC 495
+++GF C
Sbjct: 460 SQIGFAQETC 469
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 191/376 (50%), Gaps = 34/376 (9%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PV SGA + Y + +G+G + ++++DT S++ W+QC PC C+ Q DP+FDP +S
Sbjct: 141 PVTSGAKLRTLNYVATVGLGGG--EATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSS 198
Query: 207 SSYSPLPCAAPQCKSLDV---------SACR-----ANRCLYQVAYGDGSFTVGDLVTET 252
SY+ +PC + C +L + +AC+ A C Y ++Y DGS++ G L +
Sbjct: 199 PSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDR 258
Query: 253 VSFGNSGSVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
+S + G GCG N+G F G++GL+GLG LSL Q +YCL +
Sbjct: 259 LSLAGE-VIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLK 317
Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFE 363
+S +SG L + P++ V FY+V LTG +VGGQ V+
Sbjct: 318 ESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSG 377
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
I+D GT IT L YN+++ F+ G ++ DTC++ +GLR V
Sbjct: 378 GGGG---KAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREV 434
Query: 424 RVPTVSLHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRV 479
+VP++ L F G +++ + Y + DS+ C A AP S +IIGN QQ+ RV
Sbjct: 435 QVPSLKLVFDGGVEVEVDSGGVLYFVSSDSS-QVCLAMAPLKSEYETNIIGNYQQKNLRV 493
Query: 480 SFDLANNRVGFTPNKC 495
FD + ++VGF C
Sbjct: 494 IFDTSGSQVGFAQETC 509
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 137/420 (32%), Positives = 209/420 (49%), Gaps = 49/420 (11%)
Query: 99 RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
R+LVL D+ RV +L +L++ E +E QI P+ SG S
Sbjct: 41 RALVL-----DNIRVQSL--QLKIKAMTSSTTEQSVSETQI-------PLTSGIKLESLN 86
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y + +G + S+++DTGSD+ W+QC+PC CY Q P++DP SSSY + C +
Sbjct: 87 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 144
Query: 219 CKSL-----DVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
C+ L + C N C Y V+YGDGS+T GDL +E++ G++ ++ G
Sbjct: 145 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENFVFG 203
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS---- 320
CG +N+GLF GS+GL+GLG +SL Q T +YCL + ASG L F +
Sbjct: 204 CGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSV 263
Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
PL++N ++ +FY + LTG S+GG V++ S F GI++D GT
Sbjct: 264 YTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGR------GILIDSGTV 315
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL- 438
ITRL Y +++ F++ G ++ DTC++ + + +P + + F L
Sbjct: 316 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 375
Query: 439 -DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D+ Y + D A C A A S + + IIGN QQ+ RV +D R+G C
Sbjct: 376 VDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 137/420 (32%), Positives = 209/420 (49%), Gaps = 49/420 (11%)
Query: 99 RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
R+LVL D+ RV +L +L++ E +E QI P+ SG S
Sbjct: 89 RALVL-----DNIRVQSL--QLKIKAMTSSTTEQSVSETQI-------PLTSGIKLESLN 134
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y + +G + S+++DTGSD+ W+QC+PC CY Q P++DP SSSY + C +
Sbjct: 135 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 192
Query: 219 CKSL-----DVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
C+ L + C N C Y V+YGDGS+T GDL +E++ G++ ++ G
Sbjct: 193 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENFVFG 251
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS---- 320
CG +N+GLF GS+GL+GLG +SL Q T +YCL + ASG L F +
Sbjct: 252 CGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSV 311
Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
PL++N ++ +FY + LTG S+GG V++ S F GI++D GT
Sbjct: 312 YTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGR------GILIDSGTV 363
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL- 438
ITRL Y +++ F++ G ++ DTC++ + + +P + + F L
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423
Query: 439 -DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D+ Y + D A C A A S + + IIGN QQ+ RV +D R+G C
Sbjct: 424 VDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 148/410 (36%), Positives = 205/410 (50%), Gaps = 44/410 (10%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L RD AR N H L+ A + + S P GA S +Y +G
Sbjct: 84 LRRDRARRN---------------HILRKASGRRITLGVSIPTSLGAFVDSLQYVVTLGF 128
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
GTP +++DTGSD++W+QC+PC + CY Q DP+FDP SS+Y+P+PC + C+ LD
Sbjct: 129 GTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLD 188
Query: 224 V---------SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--VKGIALGCGHDN 272
S+ A+ C Y + YG+G TVG TET++ + V + GCG
Sbjct: 189 PDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVNNFSFGCGLVQ 248
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
+G+F GLLGLGG SL Q T + +YCL +S A + A GG+
Sbjct: 249 KGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAG 308
Query: 330 PLIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
+V TFY V LTG SVGG+ + I P++F GG+I+D GT +T L A
Sbjct: 309 FQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFA------GGMIIDSGTIVTGLPETA 362
Query: 388 YNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
Y++LR +F A L P + DTCYDF+G +V VPTV+L F G +DL +
Sbjct: 363 YSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSG 422
Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++ +D G F + IIGNV Q+ V +D A VGF C
Sbjct: 423 VL-LD--GCLAFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 126/360 (35%), Positives = 192/360 (53%), Gaps = 27/360 (7%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPC- 214
GEY + +GTPP + + DTGSD+ W QC PC ++C++Q+ ++P +S+++ LPC
Sbjct: 86 GEYIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCN 145
Query: 215 -AAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGC 268
+ C +L S C+Y YG G +T G ET +FG++ + V GIA GC
Sbjct: 146 SSVSMCAALAGPSPPPGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGC 204
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEFNSARGGD 325
+ + + GSAGL+GLG G +SL Q+ A +YCL D +S ++ +L ++A G
Sbjct: 205 SNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCLTPFQDANSTSTLLLGPSAALNGT 264
Query: 326 AV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
V T P + + + T+YY+ LTG S+G A+ IPP+ F + G GG+I+D GT IT
Sbjct: 265 GVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTIT 324
Query: 382 RLQTQAYNSLR---DSFVRLAGNLKPTSGVALFDTCYDFSGLRSV--RVPTVSLHFGAGK 436
L AY +R +S V L + S D C+ + S +P+++ HF G
Sbjct: 325 SLVDAAYQQVRAAIESLVTLP--VADGSDSTGLDLCFALTSETSTPPSMPSMTFHFD-GA 381
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ LP NY+I +G +C A T A+S GN QQQ + +D+ + F P KC
Sbjct: 382 DMVLPVDNYMI--LGSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKC 439
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 186/388 (47%), Gaps = 47/388 (12%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP-IFDPK 204
+PVVSGA+ GSG+YF + +G PP+ ++ DTGSD+ W++C C C S +F P+
Sbjct: 71 SPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPR 130
Query: 205 TSSSYSPLPCAAPQCKSL----DVSACRANR----CLYQVAYGDGSFTVGDLVTETVSF- 255
SS++SP C P C+ + C R C Y+ Y DGS T G ET S
Sbjct: 131 HSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK 190
Query: 256 ---GNSGSVKGIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---ATSLAY 303
G +K +A GCG G F G+ G++GLG G +S Q+ +Y
Sbjct: 191 TSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSY 250
Query: 304 CLVD---RDSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
CL+D P S ++ N GGD ++ PL+ N TFYYV L V G ++
Sbjct: 251 CLMDYTLSPPPTSYLIIGN---GGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLR 307
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL-----RDSFVRLAGNLKPTSGVALF 411
I PS++E+D++G+GG +VD GT + L AY S+ R + +A L P F
Sbjct: 308 IDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPG-----F 362
Query: 412 DTCYDFSGLRSVR--VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALS 467
D C + SG+ +P + F G P +NY I + C A S
Sbjct: 363 DLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKVGFS 421
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+IGN+ QQG FD +R+GF+ C
Sbjct: 422 VIGNLMQQGFLFEFDRDRSRLGFSRRGC 449
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 139/417 (33%), Positives = 196/417 (47%), Gaps = 40/417 (9%)
Query: 106 LERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQIL---PEDFSTPVVSGASQGSGEY 159
+ RDS R N TK Q R L+ + + P D + V+SG G Y
Sbjct: 39 ISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISGG----GSY 94
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
I +GTPP + DTGSD+ W QC PC +CY+Q +P+FDPK S +Y L C C
Sbjct: 95 LMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDFC 154
Query: 220 KSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS----GSVKGIALGCGHDNE 273
+ L S N C +YGD S+T DL +ET + G++ S G+A GCGH N
Sbjct: 155 QDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSNG 214
Query: 274 GLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV--DRDSPASGVLEFNSA---RG 323
G F G ++ L+ ++ +YCLV DS AS + F + G
Sbjct: 215 GTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDSTASSKINFGKSAVVSG 273
Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE-----AGDGGIIVDCGT 378
V+ PLI+ DTFYY+ L G S+G + V F ++ A + II+D GT
Sbjct: 274 SGTVSTPLIKGTP-DTFYYLTLEGMSLGSEKVAFKG--FSKNKSSPAAAEESNIIIDSGT 330
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
+T L Y + + ++ G T F CY SG++ + +PT++ HF G +
Sbjct: 331 TLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHF-IGADV 387
Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP N + CF+ P SS L+I GN+ Q V +DL NN+V F P C
Sbjct: 388 QLPPLNTFVQAQE-DLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 442
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 125/353 (35%), Positives = 186/353 (52%), Gaps = 27/353 (7%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
GSGEY ++ GTP + ++DTGSD+ W+ C+ C C+ + PIFDP SSSY P C
Sbjct: 111 GSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFAC 169
Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD--- 271
+ C+ + + ++C ++V+YGDG+ G L ++ ++ G S + + GC
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLG-SQYLPNFSFGCAESLSE 228
Query: 272 --NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
+ + G L + T ++ + +YCL P+S + G +A +
Sbjct: 229 DTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL-----PSSSTSSGSLVLGKEAAVS 283
Query: 330 P-------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
LI++ + TFY+V L SVG + +P + + A GG I+D GT IT
Sbjct: 284 SSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGT----NIASGGGTIIDSGTTITH 339
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
L AY +LRD+F + +L+PT V DTCYD S SV VPT++LH L LP
Sbjct: 340 LVPSAYTALRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPK 397
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+N LI +S G C AF+ T S SIIGNVQQQ R+ FD+ N++VGF +C
Sbjct: 398 ENILITQES-GLACLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 185/360 (51%), Gaps = 36/360 (10%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIF 201
S P G+S + EY +G+G+P +V+DTGSD++W+QC PC + C+ + +F
Sbjct: 94 SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 153
Query: 202 DPKTSSSYSPLPCAAPQCKSL----DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
DP SS+Y+ C+A C L + + C A +RC Y V YGDGS T G ++ ++
Sbjct: 154 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLS 213
Query: 257 NSGSVKGIALGCGHDN--EGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSP 311
S V+G GC H G+ + GL+GLGG S Q A S YCL +P
Sbjct: 214 GSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCL--PATP 271
Query: 312 A-SGVLEFNSARGGDA------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
A SG L + G T P++R+KKV T+Y+ L +VGG+ + + PS+F
Sbjct: 272 ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA 331
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
G +VD GT ITRL AY +L +F + + DTC++F+GL V
Sbjct: 332 ------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVS 385
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFD 482
+PTV+L F G +DL A + S G C AFAPT A IGNVQQ+ V +D
Sbjct: 386 IPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 134/412 (32%), Positives = 202/412 (49%), Gaps = 31/412 (7%)
Query: 103 LSRLERDSAR------VNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
+ + RDS+R T ++ A++ ++ + P T V+S
Sbjct: 31 VEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNSPETTVISAL---- 86
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY VGTP Q +LDTGSDI WLQC+PC +CY+Q+ PIFD S +Y LPC +
Sbjct: 87 GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPS 146
Query: 217 PQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHD 271
C+S+ + C + + CLY + Y DGS ++GDL ET++ G++ G +GCG
Sbjct: 147 NTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCGRY 206
Query: 272 NE-GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSA---RGG 324
N G+ ++G++GLG G +SL Q+ ++ +YCLV S AS L F +A G
Sbjct: 207 NAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNAAVVSGR 266
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
V+ PL +K FY++ L FSVG ++ G G II+D GT +T L
Sbjct: 267 GTVSTPLF-SKNGLVFYFLTLEAFSVGRNRIEFG----SPGSGGKGNIIIDSGTTLTALP 321
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-SVRVPTVSLHFGAGKALDLPAK 443
Y+ L + + + + CY + + VP ++ HF +G + L A
Sbjct: 322 NGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDASVPVITAHF-SGADVTLNAI 380
Query: 444 NYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N + V + CFAF PT + ++ GN+ QQ V +DL N V F C
Sbjct: 381 NTFVQV-ADDVVCFAFQPTETG-AVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 177/355 (49%), Gaps = 16/355 (4%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
G G Y I VGTP F +V DTGSD+ W QC PCT+C+QQ P F P +SS++S LP
Sbjct: 81 NGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLP 140
Query: 214 CAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
C + C+ L + C A C+Y YG G +T G L TET+ G++ S +A GC +
Sbjct: 141 CTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDA-SFPSVAFGCSTE 198
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVT 328
N G+ ++G+ GLG G LSL Q+ +YCL + + + F S G+ +
Sbjct: 199 N-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGSAAGASPILFGSLANLTDGNVQS 257
Query: 329 APLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGTAITRLQTQ 386
P + N V ++YYV LTG +VG + + S F + G GG IVD GT +T L
Sbjct: 258 TPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNY 445
Y ++ +F+ N+ +G D C+ + G + VP++ L F G +P
Sbjct: 318 GYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFA 377
Query: 446 LIPVDSAGTF---CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ DS G+ C P +S+IGNV Q + +DL F+P C
Sbjct: 378 GVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADC 432
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 124/391 (31%), Positives = 180/391 (46%), Gaps = 50/391 (12%)
Query: 150 SGASQG--SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ-SDPIFDPKTS 206
+GA G + EY + VGTPPR ++ LDTGSD+ W QC PC C+ Q + P+ DP S
Sbjct: 83 AGAGGGIVTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAAS 142
Query: 207 SSYSPLPCAAPQCKSLDVSAC-------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
S+++ + C AP C++L ++C C+Y YGD S TVG L ++ +FG
Sbjct: 143 STHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGD 202
Query: 260 SVKG-------IALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
+ G + GCGH N+G+F G+ G G G SL Q+ TS +YC
Sbjct: 203 NADGGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFES 262
Query: 312 ASGVLEFNSARG-----GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
S ++ A G + PL+R+ + Y++ L +VG + IP + E
Sbjct: 263 TSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLRE 322
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFS------ 418
A I+D G +IT L Y +++ FV G P S V + D C+
Sbjct: 323 A---SAIIDSGASITTLPEDVYEAVKAEFVAQVG--LPVSAVEGSALDLCFALPSAAAPK 377
Query: 419 ---GLR--------SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-- 465
G R VRVP + H G G +LP +NY+ A C +
Sbjct: 378 SAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGD 437
Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+IGN QQQ T V +DL N+ + F P +C
Sbjct: 438 QTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 158/310 (50%), Gaps = 18/310 (5%)
Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPE--DFSTPVVSGASQGSGEY 159
+L+ T TKLQL + R + + A Q +LP D T + SGEY
Sbjct: 30 QLKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEY 89
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+ +GTPP ++ ++DTGSD+ W QC PC C Q P FD K S++Y LPC + +C
Sbjct: 90 LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149
Query: 220 KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK----GIALGCGHDNEGL 275
SL +C C+YQ YGD + T G L ET +FG + S K IA GCG N G
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGGDA 326
S+G++G G G LSL Q+ + +YCL S L F N++ G
Sbjct: 210 LANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPV 269
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
+ P + N + Y++ L S+G + + I P +F +++ G GG+I+D GT+IT LQ
Sbjct: 270 QSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329
Query: 387 AYNSLRDSFV 396
AY ++R V
Sbjct: 330 AYEAVRRGLV 339
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 140/432 (32%), Positives = 209/432 (48%), Gaps = 34/432 (7%)
Query: 76 SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
SSS ++PL R + + L RD R + KL + ++ +
Sbjct: 49 SSSGTTVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVN-SGSGTDGVQQS 107
Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
A LP G++ + Y + +GTP ++++DTGSD++W+ C
Sbjct: 108 AAITLPTTL------GSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGA 159
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRANR-CLYQVAYGDGSFTVGDLVTET 252
S FDP SS+Y+P C++ C L+ + C N C Y V YGDGS T G ++T
Sbjct: 160 GSSLFFDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDT 219
Query: 253 VSFGNSGSVKGIALGCGHDN---EGLFVGSA-GLLGLGGGMLSLTKQIKAT---SLAYCL 305
++ ++ V+ GC + EGL GL+GLGGG SL Q AT + +YCL
Sbjct: 220 LALNSTEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCL 279
Query: 306 VDRDSPASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
+ +SG L ++ G VT P+ R+++ TFY+V L G +VGG V I P++F
Sbjct: 280 -PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFA- 337
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
G I+D GT ITRL +AY++L +F ++ DTC+DF+G +V
Sbjct: 338 -----AGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVS 392
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDL 483
+P V L F G +DL A + C AFAP + + SIIGNVQQ+ V D+
Sbjct: 393 IPAVELVFSGGAVVDLDADGIMY------GSCLAFAPATGGIGSIIGNVQQRTFEVLHDV 446
Query: 484 ANNRVGFTPNKC 495
+ +GF P C
Sbjct: 447 GQSVLGFRPGAC 458
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 128/353 (36%), Positives = 186/353 (52%), Gaps = 27/353 (7%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
GSGEY ++ GTP + ++DTGSD+ W+ C+ C C+ + PIFDP SSSY P C
Sbjct: 111 GSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFAC 169
Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD-NE 273
+ C+ + + ++C ++V YGDG+ G L ++ ++ G S + + GC +E
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLG-SQYLPNFSFGCAESLSE 228
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
+ + GG + LT+ A + +YCL P+S + G +A +
Sbjct: 229 DTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL-----PSSSTSSGSLVLGKEAAVS 283
Query: 330 P-------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
LI++ TFY+V L SVG + +P + + A GG I+D GT IT
Sbjct: 284 SSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPAT----NIASGGGTIIDSGTTITY 339
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
L AY LRD+F + +L+PT V DTCYD S SV VPT++LH L LP
Sbjct: 340 LVPSAYKDLRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPK 397
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+N LI +S G C AF+ T S SIIGNVQQQ R+ FD+ N++VGF +C
Sbjct: 398 ENILITQES-GLSCLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 183/341 (53%), Gaps = 36/341 (10%)
Query: 174 MVLDTGSDINWLQCRPCT---ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN 230
M +DTGSD++W+QC+PC CY Q DP+FDP SSSY+ +PC P C L + A A
Sbjct: 1 MEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASAC 60
Query: 231 RCL---YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
Y V+YGDGS T G ++T++ S +V+G GCGH GLF G GLLGLG
Sbjct: 61 SAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGR 120
Query: 288 GMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIRNKKVD 338
SL +Q T +YCL + S A G L GG + AP L+ +
Sbjct: 121 EQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTLG--VGGPSGAAPGFSTTQLLPSPNAP 177
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR- 397
T+Y V LTG SVGGQ + +P S F VD GT +TRL AY +LR +F
Sbjct: 178 TYYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSG 231
Query: 398 LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
+A PT+ + DTCY+F+G +V +P V+L FG+G + L A L S G C
Sbjct: 232 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG--C 285
Query: 457 FAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAP+ S ++I+GNVQQ+ V D VGF P+ C
Sbjct: 286 LAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 324
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/356 (34%), Positives = 177/356 (49%), Gaps = 17/356 (4%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
G G Y I VGTP FS+V DTGSD+ W QC PCT+C+QQ P F P +SS++S LP
Sbjct: 81 NGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLP 140
Query: 214 CAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
C + C+ L + C A C+Y YG G +T G L TET+ G++ S +A GC +
Sbjct: 141 CTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDA-SFPSVAFGCSTE 198
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVT 328
N G+ ++G+ GLG G LSL Q+ +YCL + + + F S G+ +
Sbjct: 199 N-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGSAAGASPILFGSLANLTDGNVQS 257
Query: 329 APLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGTAITRLQTQ 386
P + N V ++YYV LTG +VG + + S F + G GG IVD GT +T L
Sbjct: 258 TPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKN 444
Y ++ +F+ ++ +G D C+ + G + VP++ L F G +P
Sbjct: 318 GYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYF 377
Query: 445 YLIPVDSAGTF---CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ DS G+ C P +S+IGNV Q + +DL F P C
Sbjct: 378 AGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 127/383 (33%), Positives = 182/383 (47%), Gaps = 37/383 (9%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP-IFDPK 204
+PVVSGAS GSG+YF + +G PP+ ++ DTGSD+ W++C C C S +F P+
Sbjct: 70 SPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPR 129
Query: 205 TSSSYSPLPCAAPQCKSL----DVSACRANR----CLYQVAYGDGSFTVGDLVTETVSF- 255
SS++SP C P C+ + C R C Y+ Y DGS T G ET S
Sbjct: 130 HSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLK 189
Query: 256 ---GNSGSVKGIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---ATSLAY 303
G +K +A GCG G F G+ G++GLG G +S Q+ +Y
Sbjct: 190 TSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSY 249
Query: 304 CLVDRDSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
CL+D GGDAV+ PL+ N TFYYV L V G ++I P
Sbjct: 250 CLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDP 309
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYD 416
S++E+D++G+GG ++D GT + L AY + + ++L + T G FD C +
Sbjct: 310 SIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPG---FDLCVN 366
Query: 417 FSGLRSVR--VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNV 472
SG+ +P + F G P +NY I + C A S+IGN+
Sbjct: 367 VSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKVGFSVIGNL 425
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQG FD +R+GF+ C
Sbjct: 426 MQQGFLFEFDRDRSRLGFSRRGC 448
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 127/404 (31%), Positives = 200/404 (49%), Gaps = 58/404 (14%)
Query: 117 ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVL 176
+++LQ + +D ++L E+ ++P+ GEY R +G+PP + ++
Sbjct: 62 MSRLQRVSHFLDENKL--PESLLIPDK-------------GEYLMRFYIGSPPVERLAMV 106
Query: 177 DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA--C-RANRCL 233
DTGS + WLQC PC C+ Q P+F+P SS+Y C + C L S C + +C+
Sbjct: 107 DTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCI 166
Query: 234 YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA-----LGCGHDNEGLFVGS---AGLLGL 285
Y + YGD SF+VG L TET+SFG++G + ++ GCG DN S G+ GL
Sbjct: 167 YGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGL 226
Query: 286 GGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDT 339
G G LSL Q+ A +YCL+ DS ++ L+F S V+ PLI + T
Sbjct: 227 GAGPLSLVSQLGAQIGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPT 286
Query: 340 FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA 399
+Y++ L ++G + V + DG I++D GT +T L+ YN+
Sbjct: 287 YYFLNLEAVTIGQKVVSTGQT--------DGNIVIDSGTPLTYLENTFYNN-------FV 331
Query: 400 GNLKPTSGVALFD-------TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+L+ T GV L TC F ++ +P ++ F G ++ L KN LIP+ +
Sbjct: 332 ASLQETLGVKLLQDLPSPLKTC--FPNRANLAIPDIAFQF-TGASVALRPKNVLIPLTDS 388
Query: 453 GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A P+S +S+ G++ Q +V +DL +V F P C
Sbjct: 389 NILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 136/390 (34%), Positives = 199/390 (51%), Gaps = 43/390 (11%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQS---D 198
+P+ SGA G G+Y + GTPP++ ++ DTGSD+ WLQC P C +++
Sbjct: 41 SPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR 100
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDL 248
P F S++ S +PC+A QC L V A R + C Y Y DGS T G L
Sbjct: 101 PAFVASKSATLSVVPCSAAQC--LLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFL 158
Query: 249 V--TETVSFGNSG--SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQ---IKATS 300
T T+S G SG +V+G+A GCG N+G F G+ G++GLG G LS Q + A +
Sbjct: 159 ARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQT 218
Query: 301 LAYCLVD-----RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
+YCL+D R +S + R PL+ N TFYYVG+ VG + +
Sbjct: 219 FSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVL 278
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---D 412
+P S + +D G+GG ++D G+ +T L+ AY L +F + S F +
Sbjct: 279 PVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLE 338
Query: 413 TCYDFSGLRSVR-----VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
CY+ S S+ P +++ F G +L+LP NYL+ V + C A PT S A
Sbjct: 339 LCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFA 397
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+++GN+ QQG V FD A+ R+GF +C
Sbjct: 398 FNVLGNLMQQGYHVEFDRASARIGFARTEC 427
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 132/365 (36%), Positives = 186/365 (50%), Gaps = 28/365 (7%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y + +GTPP FS++ DTGS + W QC PCTEC + P F P +SS++S LPCA
Sbjct: 87 AGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCA 146
Query: 216 APQCKSLDVS--ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
+ C+ L C A C+Y YG G FT G L TET+ G + S G+A GC +N
Sbjct: 147 SSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVGGA-SFPGVAFGCSTEN- 203
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL-VDRDSPASGVLEFNSAR--GGDAVTAP 330
G+ S+G++GLG LSL Q+ +YCL D D+ S +L + A+ GG+ + P
Sbjct: 204 GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRSDADAGDSPILFGSLAKVTGGNVQSTP 263
Query: 331 LIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD----GGIIVDCGTAITRLQ 384
L+ N ++ ++YYV LTG +VG + + + F GG IVD GT +T L
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLV 323
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVA----LFDTCYDFS---GLRSVRVPTVSLHFGAGKA 437
+ Y ++ +F+ T+ V FD C+D + G V VPT+ L F G
Sbjct: 324 KEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVLRFAGGAE 383
Query: 438 LDLPAKNY--LIPVDSAG---TFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGF 490
+ ++Y ++ VDS G C P S L SIIGNV Q V +DL F
Sbjct: 384 YAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSF 443
Query: 491 TPNKC 495
P C
Sbjct: 444 APADC 448
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 139/437 (31%), Positives = 216/437 (49%), Gaps = 61/437 (13%)
Query: 93 TRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGA 152
TR + S+ S+ RD+ R ++ RH + Q+ + VS
Sbjct: 33 TRIHADPSVTASQFVRDALR------------RDMHRHNAR----QLAASSSNGTTVSAP 76
Query: 153 SQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSS 208
+Q S GEY + +GTPP + + DTGSD+ W QC PC+ +C+QQ P+++P +S++
Sbjct: 77 TQISPTAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTT 136
Query: 209 YSPLPCAAPQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFG-- 256
++ LPC + +S C A C+Y + YG G +V +ET +FG
Sbjct: 137 FAVLPCNS------SLSMCAAALAGTTPPPGCTCMYNMTYGSGWTSVYQ-GSETFTFGSS 189
Query: 257 ---NSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRD 309
N V GIA GC + + G SA GL+GLG G LSL Q+ +YCL D +
Sbjct: 190 TPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSYCLTPYQDTN 249
Query: 310 SPASGVLEFNSARG--GDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
S ++ +L +++ G + P + + + T+YY+ LTG S+G A+ IP + +
Sbjct: 250 STSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSL 309
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYDFSGLR 421
G GG I+D GT IT L AY +R + V L L T G + D C++
Sbjct: 310 KADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGGSAATGLDLCFELPSST 368
Query: 422 SV--RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTR 478
S +P+++LHF G + LPA +Y++ +DS +C A T +SI+GN QQQ
Sbjct: 369 SAPPTMPSMTLHFD-GADMVLPADSYMM-LDS-NLWCLAMQNQTDGGVSILGNYQQQNMH 425
Query: 479 VSFDLANNRVGFTPNKC 495
+ +D+ + F P KC
Sbjct: 426 ILYDVGQETLTFAPAKC 442
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 114/339 (33%), Positives = 185/339 (54%), Gaps = 25/339 (7%)
Query: 174 MVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA-----C 227
M+LDTGS ++WLQC+PC C+ Q+DP++DP S +Y L CA+ +C L + C
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 228 R--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
+N CLY +YGD SF++G L + ++ +S ++ GCG DN+GLF +AG++GL
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGL 120
Query: 286 GGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTF 340
LS+ Q+ + +YCL + S G L S P++ + K +
Sbjct: 121 ARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL 180
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LA 399
Y++ LT +V G+ + + +++ + ++D GT ITRL Y +LR +FV+ ++
Sbjct: 181 YFLRLTAITVSGRPLDLAAAMYRVPT------LIDSGTVITRLPMSMYAALRQAFVKIMS 234
Query: 400 GNLKPTSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
++ DTC+ S L+S+ VP + + F G L L A + LI D G C A
Sbjct: 235 TKYAKAPAYSILDTCFKGS-LKSISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLA 292
Query: 459 FAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
FA +S + ++IIGN QQQ +++D++ +R+GF P C
Sbjct: 293 FAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 125/360 (34%), Positives = 181/360 (50%), Gaps = 30/360 (8%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G EY + +GTPP F + DTGSD+ W QC+PC C+ Q PI+D TSSS+SPLPC
Sbjct: 79 GQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPC 138
Query: 215 AAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
++ C + S C + C Y+ AY DG+++ E SV GIA GCG DN
Sbjct: 139 SSATCLPIWSSRCSTPSATCRYRYAYDDGAYS-----PECAGI----SVGGIAFGCGVDN 189
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDAV---- 327
GL S G +GLG G LSL Q+ +YCL D ++ S + F S A
Sbjct: 190 GGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSSPVFFGSLAELAASSASA 249
Query: 328 ------TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAI 380
+ PL+++ + YYV L G S+G + IP F++ D+ G GG+IVD GT
Sbjct: 250 DAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIF 309
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT-CY--DFSGLRSV-RVPTVSLHFGAGK 436
T L + + D + G +P + D C+ +G++ + +P + LHF G
Sbjct: 310 TILVETGFRVVVDHVAGVLG--QPVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGA 367
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L NY+ + +FC T SA S++GN QQQ ++ FD+ ++ F P C
Sbjct: 368 DMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSFMPTDC 427
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 150/422 (35%), Positives = 202/422 (47%), Gaps = 52/422 (12%)
Query: 102 VLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL--PEDFS--TPVVSGASQGSG 157
+L L D R + K +V L PA+ ++L DF+ +P G+ GS
Sbjct: 78 LLEMLRWDQVRTEYVRRKASGGAEDV----LNPAKPRVLMSQTDFAVRSPFGVGSGSGSS 133
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCA 215
+ G T Q +M +DT D+ W+QC PC +CY Q DP+FDP TSS+ + + C
Sbjct: 134 AWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCR 193
Query: 216 APQCKSLDV--SACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
+P C+SL + C +NR C Y + Y D T G +T+T++ + +V+ GC
Sbjct: 194 SPACRSLGPYGNGC-SNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGC 252
Query: 269 GHDNEGLFVG-SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGG 324
H G F +AG + LGGG SL Q + +YC+ + ASG L GG
Sbjct: 253 SHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAFSYCV--PQASASGFLSI----GG 306
Query: 325 DA--------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
A T PL+R+ + Y V L G V G+ + IPP F G ++D
Sbjct: 307 PATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFS------AGAVMDS 360
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV-ALFDTCYDFSGLRSVRVPTVSLHFGAG 435
IT+L AY +LR +F R A P SG DTCYDF GL +VRVP VSL FG G
Sbjct: 361 SAVITQLPPTAYRALRRAF-RNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGG 419
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ L +I C AF TSS AL IGNVQQQ V +D+A VGF
Sbjct: 420 AVVVLDPPAVMI------GGCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRG 473
Query: 494 KC 495
C
Sbjct: 474 AC 475
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 185/362 (51%), Gaps = 32/362 (8%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC-------RPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G+GTPP+ ++++DTGSD+ W QC R +Q +P+++P+ SSS++ LPC+
Sbjct: 88 VGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCS 147
Query: 216 APQCKSLDVS---ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHD 271
C+ S R NRC+Y YG G L +ET +FG + V + GCG
Sbjct: 148 DRLCQEGQFSYKNCARNNRCMYDELYGSAE-AGGVLASETFTFGVNAKVSLPLGFGCGAL 206
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG-------G 324
+ G VG++GL+GL G++SL Q+ +YCL + L F + G
Sbjct: 207 SAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCLTPFAERKTSPLLFGAMADLRRYRTTG 266
Query: 325 DAVTAPLIRNKKVDT-FYYVGLTGFSVGGQAVQIPP-SLFEMDEAGDGGIIVDCGTAITR 382
T ++RN ++T +YYV L G S+G + + +P SL + G GG IVD G+ ++
Sbjct: 267 TVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSY 326
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFS---GLRSVRVPTVSLHFGAG 435
L+ A+ +++ + V A L +G ++ C+ + +V+ P + LHF G
Sbjct: 327 LEETAFRAVKKAVVE-AVRLPVANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGG 385
Query: 436 KALDLPAKNYLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
A+ LP NY AG C A +P +SIIGNVQQQ V FD+ N + F P
Sbjct: 386 AAMTLPRDNYFQE-PRAGLMCLAVGTSPDGFGVSIIGNVQQQNMHVLFDVRNQKFSFAPT 444
Query: 494 KC 495
KC
Sbjct: 445 KC 446
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 126/373 (33%), Positives = 182/373 (48%), Gaps = 32/373 (8%)
Query: 147 PVVSGASQGSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDP 203
P+ SG + Y + I +G + ++++DTGSD+ W+QC PC + CY Q DP+FDP
Sbjct: 168 PLGSGIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDP 227
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRA------------NRCLYQVAYGDGSFTVGDLVTE 251
S +++ +PC +P C + A A RC Y ++YGDGSF+ G L +
Sbjct: 228 AASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQD 287
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
T+ G + + G GCG N GLF G+AGL+GLG LSL Q A +YCL
Sbjct: 288 TLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCL-PA 346
Query: 309 DSPASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+ ++G L S+ + +I + FY++ +TG +VGG A P
Sbjct: 347 TTTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGF---- 402
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
G G ++VD GT ITRL Y ++R F R G ++ D CYD +G V V
Sbjct: 403 --GAGNVLVDSGTVITRLAPSVYKAVRAEFARRF-EYPAAPGFSILDACYDLTGRDEVNV 459
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGT-FCFAFA--PTSSALSIIGNVQQQGTRVSFD 482
P ++L G + + A L V G+ C A A P IIGN QQ+ RV +D
Sbjct: 460 PLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYD 519
Query: 483 LANNRVGFTPNKC 495
+R+GF C
Sbjct: 520 TVGSRLGFADEDC 532
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 136/390 (34%), Positives = 198/390 (50%), Gaps = 43/390 (11%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQS---D 198
+P+ SGA G G+Y + GTPP++ ++ DTGSD+ WLQC P C +++
Sbjct: 40 SPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR 99
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDL 248
P F S++ S +PC+A QC L V A R + C Y Y DGS T G L
Sbjct: 100 PAFVASKSATLSVVPCSAAQC--LLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFL 157
Query: 249 V--TETVSFGNSG--SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQ---IKATS 300
T T+S G SG +V+G+A GCG N+G F G+ G++GLG G LS Q + A +
Sbjct: 158 ARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQT 217
Query: 301 LAYCLVD-----RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
+YCL+D R +S + R PL+ N TFYYVG+ VG + +
Sbjct: 218 FSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVL 277
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---D 412
+P S + +D G+GG ++D G+ +T L+ AY L +F + S F +
Sbjct: 278 PVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLE 337
Query: 413 TCYDFSGLRSVR-----VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
CY+ S S P +++ F G +L+LP NYL+ V + C A PT S A
Sbjct: 338 LCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFA 396
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+++GN+ QQG V FD A+ R+GF +C
Sbjct: 397 FNVLGNLMQQGYHVEFDRASARIGFARTEC 426
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 132/370 (35%), Positives = 190/370 (51%), Gaps = 36/370 (9%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDP 199
+ S P G S S EY R+ GTP +V+DTGSD++WLQC+PC+ +C+ Q DP
Sbjct: 62 KKVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDP 121
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
++DP SS+YS +PCA+ CK L S C + + C + ++Y DG+ TVG + ++
Sbjct: 122 LYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLT 181
Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
V+ GCGH GLF G+LGLG SL + +YCL S
Sbjct: 182 LAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGGV-FSYCLPSVSS- 236
Query: 312 ASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G L + + V P+ TF V L G +VGG+ + + PS F G
Sbjct: 237 KPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------G 290
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G+IVD GT IT LQ+ AY +LR +F + A L P + DTCY+ +G ++V VP +
Sbjct: 291 GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGYKNVVVPKI 347
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLAN 485
+L F G ++L N ++ V+ C AFA P SA ++GNV Q+ V FD +
Sbjct: 348 ALTFTGGATINLDVPNGIL-VNG----CLAFAESGPDGSA-GVLGNVNQRAFEVLFDTST 401
Query: 486 NRVGFTPNKC 495
++ GF C
Sbjct: 402 SKFGFRAKAC 411
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 132/370 (35%), Positives = 190/370 (51%), Gaps = 36/370 (9%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDP 199
+ S P G S S EY R+ GTP +V+DTGSD++WLQC+PC+ +C+ Q DP
Sbjct: 96 KKVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDP 155
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
++DP SS+YS +PCA+ CK L S C + + C + ++Y DG+ TVG + ++
Sbjct: 156 LYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLT 215
Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
V+ GCGH GLF G+LGLG SL + +YCL S
Sbjct: 216 LAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGGV-FSYCLPSVSS- 270
Query: 312 ASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G L + + V P+ TF V L G +VGG+ + + PS F G
Sbjct: 271 KPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------G 324
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G+IVD GT IT LQ+ AY +LR +F + A L P + DTCY+ +G ++V VP +
Sbjct: 325 GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGYKNVVVPKI 381
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLAN 485
+L F G ++L N ++ V+ C AFA P SA ++GNV Q+ V FD +
Sbjct: 382 ALTFTGGATINLDVPNGIL-VNG----CLAFAESGPDGSA-GVLGNVNQRAFEVLFDTST 435
Query: 486 NRVGFTPNKC 495
++ GF C
Sbjct: 436 SKFGFRAKAC 445
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 134/385 (34%), Positives = 182/385 (47%), Gaps = 31/385 (8%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
R + +EA I P PV S +GEY +I +GTPP + DTGSD+ W QC
Sbjct: 65 RRFMSFSEASISPNTPEPPV----SSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120
Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVG 246
PC CY+Q +P+FDP S+S+ + C + QC+ LD +C + C + YGDGS G
Sbjct: 121 PCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQG 180
Query: 247 DLVTETVSFG-NSG---SVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-- 299
+ TET++ NSG S+ I GCGH+N G F GL G GG LSLT QI +T
Sbjct: 181 VIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLG 240
Query: 300 ---SLAYCLVD-RDSPA--SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
+ CLV R P+ S ++ A G D V+ PL+ K T+Y+V L G SVG
Sbjct: 241 SGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLV-TKDDPTYYFVTLDGISVG 299
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
+ P A G + +D GT T L YN L V+ A ++P L
Sbjct: 300 DKLF---PFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQG-VKEAIPMEPVQDPDLQ 355
Query: 412 -DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIG 470
CY + L + P ++ HF P ++ P + G +CFA P I G
Sbjct: 356 PQLCYRSATL--IDGPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFG 411
Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
N Q + FDL +V F C
Sbjct: 412 NFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 135/367 (36%), Positives = 182/367 (49%), Gaps = 21/367 (5%)
Query: 140 LPEDFSTPVVSG-ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
L + S P+ SG A S Y R +GTP + + LDT +D W+ C C C S
Sbjct: 71 LAKKPSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGC--ASS 128
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN 257
+FDP SSS L C APQCK C A + C + + YG GS L +T++ N
Sbjct: 129 VLFDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLAN 187
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD-RDSPAS 313
+K GC G + + GL+GLG G LSL T+ + ++ +YCL + + S S
Sbjct: 188 D-VIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFS 246
Query: 314 GVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G L + T PL++N + + YYV L G VG + V IP S D + G
Sbjct: 247 GSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGT 306
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
I D GT TRL AY ++R+ F R N TS + FDTCY SG SV P+V+ F
Sbjct: 307 IFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATS-LGGFDTCY--SG--SVVYPSVTFMF 361
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRV 488
AG + LP N LI S T C A A +S L++I ++QQQ RV DL N+R+
Sbjct: 362 -AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRL 420
Query: 489 GFTPNKC 495
G + C
Sbjct: 421 GISRETC 427
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 125/356 (35%), Positives = 174/356 (48%), Gaps = 22/356 (6%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+GEY + +GTPP ++DTGSD+ W QCRPCT CY+Q P FDPK SS+Y C
Sbjct: 89 AGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCG 148
Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
C +L D S +C + +Y DGSFT G+L ET++ G S G A GC
Sbjct: 149 TSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCV 208
Query: 270 HDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNSA-- 321
H + G+F S+G++GLG LS+ Q+K+T +YCL V DS S + F +
Sbjct: 209 HRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGI 268
Query: 322 -RGGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
G V+ PL+ K DT+YY + L GFSVG + + + E +G IIVD GT
Sbjct: 269 VSGAGTVSTPLVM-KGPDTYYYLITLEGFSVGKKRLSY-KGFSKKAEVEEGNIIVDSGTT 326
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
T L + Y L +S + + CY+ + + + P ++ HF
Sbjct: 327 YTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVDQIDAPIITAHFKDANVEL 385
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P +L + CF PTS + I+GN+ Q V FDL RV F C
Sbjct: 386 QPWNTFLRMQEDL--VCFTVLPTSD-IGILGNLAQVNFLVGFDLRKKRVSFKAADC 438
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/301 (33%), Positives = 151/301 (50%), Gaps = 29/301 (9%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
V + + EY + VGTPPR ++ LDTGSD+ W QC PC +C+ Q P+ DP SS
Sbjct: 75 VAAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASS 134
Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG---- 263
+Y+ LPC AP+C++L ++C C+Y YGD S TVG + T+ +FG++G G
Sbjct: 135 TYAALPCGAPRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSL 194
Query: 264 -----IALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE 317
+ GCGH N+G+F + G+ G G G SL Q+ ATS +YC S ++
Sbjct: 195 PATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKSSIVT 254
Query: 318 --------FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
++ A G+ T PL +N + Y++ L G SVG + +P + F
Sbjct: 255 LGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR------ 308
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVRVPT 427
I+D G +IT L + Y +++ F G P SGV + D C+ R P
Sbjct: 309 -STIIDSGASITTLPEEVYEAVKAEFAAQVG--LPPSGVEGSALDVCFALPVSALWRRPA 365
Query: 428 V 428
V
Sbjct: 366 V 366
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 171/350 (48%), Gaps = 21/350 (6%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
+GVGTPP+ ++LD GSD+ W QC +Q +P+FD SSS+S LPC + C+
Sbjct: 111 VGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAG 170
Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGIALGCGHDNEGLFVGS 279
+ C +C Y+ YG + T G L TET +FG + G + GCG G +
Sbjct: 171 TFTNKTCTDRKCAYENDYGIMTAT-GVLATETFTFGAHHGVSANLTFGCGKLANGTIAEA 229
Query: 280 AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG-------GDAVTAPLI 332
+G+LGL G LS+ KQ+ T +YCL + + F + G T PL+
Sbjct: 230 SGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQTIPLL 289
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+N D +YYV + G SVG + + +P + G GG ++D T + L A+ L+
Sbjct: 290 KNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELK 349
Query: 393 DSFVRLAGNLKPTSGVALFD--TCYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
+ + G P + ++ D C++ + V+VP + LHF + LP NY
Sbjct: 350 KAV--MEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQ 407
Query: 448 PVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S G C A AP A ++IGNVQQQ V +D+ N + + P KC
Sbjct: 408 E-PSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 127/378 (33%), Positives = 176/378 (46%), Gaps = 34/378 (8%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG Q Y R G+G+P +Q + LDT +D W C PC C S +F P
Sbjct: 67 SAPVASG--QAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPA 122
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------------CLYQVAYGDGSFTVGDLVT 250
SSSY+ LPC++ C AC A + C + + D SF L +
Sbjct: 123 NSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAA-LAS 181
Query: 251 ETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCL 305
+T+ G ++ GC G + GLLGLG G ++L Q + +YCL
Sbjct: 182 DTLRLGKD-AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL 240
Query: 306 VD-RDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
R SG L A GG + P++RN + YYV +TG SVG V++P
Sbjct: 241 PSYRSYYFSGSLRLG-AGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGS 299
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
F D A G +VD GT ITR Y +LR+ F R + + FDTC++ +
Sbjct: 300 FAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVA 359
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGT 477
+ P V++H G L LP +N LI + C A A +S +++I N+QQQ
Sbjct: 360 AGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNI 419
Query: 478 RVSFDLANNRVGFTPNKC 495
RV FD+AN+RVGF C
Sbjct: 420 RVVFDVANSRVGFAKESC 437
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 120/354 (33%), Positives = 171/354 (48%), Gaps = 21/354 (5%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y + +GTPP + + DTGSD+ W C PC +CY+Q +PIFDP+ S+SY + C +
Sbjct: 23 GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDS 82
Query: 217 PQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
C LD C + C Y AY + T G L ET++ G S +KGI GCGH+
Sbjct: 83 KLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHN 142
Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGD- 325
N G F G++GLGGG +S QI ++ + CLV + S + + +G +
Sbjct: 143 NTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEV 202
Query: 326 ----AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
V+ PL+ K+ T Y+V L G SVG + S + E G+ + +D GT T
Sbjct: 203 SGKGVVSTPLVA-KQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGN--VFLDSGTPPT 259
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
L TQ Y+ L VR +KP + + ++R P ++ HF G LP
Sbjct: 260 ILPTQLYDRLVAQ-VRSEVAMKPVTNDLDLGPQLCYRTKNNLRGPVLTAHFEGGDVKLLP 318
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ P D G FC F TSS + GN Q + FDL V F P C
Sbjct: 319 TQTFVSPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDC 370
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 128/359 (35%), Positives = 182/359 (50%), Gaps = 23/359 (6%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
ASQG EY R VG+PP Q ++DTGSDI WLQC PC +CY+Q+ PIFDP S +Y
Sbjct: 86 ASQG--EYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKT 143
Query: 212 LPCAAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL 266
LPC++ C+SL +AC + N C Y + YGDGS + GDL ET++ G+S +
Sbjct: 144 LPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVI 203
Query: 267 GCGHDNEGLFVGSAGLLGLGG----GMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNS 320
GCGH+N G F + G ++S +YCL + +S +S L F
Sbjct: 204 GCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGD 263
Query: 321 A---RGGDAVTAPLI-RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
A G V+ PL N +V FY++ L FSVG ++ S +GDG II+D
Sbjct: 264 AAVVSGRGTVSTPLDPLNGQV--FYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDS 321
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT +T L + Y +L + + + L CY + + +P ++ HF G
Sbjct: 322 GTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTS-DELDLPVITAHF-KGA 379
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++L + +PV+ G CFAF +S +I GN+ QQ V +DL V F P C
Sbjct: 380 DVELNPISTFVPVEK-GVVCFAFI-SSKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDC 436
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 126/378 (33%), Positives = 176/378 (46%), Gaps = 34/378 (8%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG Q Y R G+G+P +Q + LDT +D W C PC C S +F P
Sbjct: 69 SAPVASG--QAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPA 124
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------------CLYQVAYGDGSFTVGDLVT 250
SSSY+ LPC++ C AC A + C + + D SF L +
Sbjct: 125 NSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAA-LAS 183
Query: 251 ETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCL 305
+T+ G ++ GC G + GLLGLG G ++L Q + +YCL
Sbjct: 184 DTLRLGKD-AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL 242
Query: 306 VD-RDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
R SG L A GG + P++RN + YYV +TG SVG V++P
Sbjct: 243 PSYRSYYFSGSLRLG-AGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGS 301
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
F D A G +VD GT ITR Y +LR+ F R + + FDTC++ +
Sbjct: 302 FAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVA 361
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGT 477
+ P V++H G L LP +N LI + C A A +S +++I N+QQQ
Sbjct: 362 AGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNI 421
Query: 478 RVSFDLANNRVGFTPNKC 495
RV FD+AN+R+GF C
Sbjct: 422 RVVFDVANSRIGFAKESC 439
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 133/385 (34%), Positives = 181/385 (47%), Gaps = 31/385 (8%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
R + +EA I P PV S +GEY +I +GTPP + DTGSD+ W QC
Sbjct: 65 RRFMSFSEASISPNTPEPPV----SSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120
Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVG 246
PC CY+Q +P+FDP S+S+ + C + QC+ LD +C + C + YGDGS G
Sbjct: 121 PCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQG 180
Query: 247 DLVTETVSFG-NSG---SVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-- 299
+ TET++ NSG S+ I GCGH+N G F GL G GG LSLT QI +T
Sbjct: 181 VIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLG 240
Query: 300 ---SLAYCLVD-RDSPA--SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
+ CLV R P+ S ++ A G V+ PL+ K T+Y+V L G SVG
Sbjct: 241 SGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLV-TKDDPTYYFVTLDGISVG 299
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
+ P A G + +D GT T L YN L V+ A ++P L
Sbjct: 300 DKLF---PFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQG-VKEAIPMEPVQDPDLQ 355
Query: 412 -DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIG 470
CY + L + P ++ HF P ++ P + G +CFA P I G
Sbjct: 356 PQLCYRSATL--IDGPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFG 411
Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
N Q + FDL +V F C
Sbjct: 412 NFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 124/356 (34%), Positives = 177/356 (49%), Gaps = 28/356 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP + DTGSD+ W QC PCT+CY+Q +P+FDP++SSSY+ + C
Sbjct: 59 EYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118
Query: 218 QCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
C LD S C ++ C Y +Y D S T G L ET++ G + +GI GCGH+
Sbjct: 119 SCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHN 178
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKAT------SLAYCLVDRDSPASGVLEFNSARGGD 325
N G GL+GLG G LSL QI ++ + CLV ++ S + N +G +
Sbjct: 179 NSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSE 238
Query: 326 A-----VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL-FEMDEAGDGGIIVDCGTA 379
V+ PLI K T Y+ L G SV + + +P S + G I++D GT
Sbjct: 239 VLGNGTVSTPLI--SKDGTGYFATLLGISV--EDINLPFSNGSSLGTITKGNILIDSGTT 294
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
IT L + Y+ L + VR L+P + ++ CY ++ PT+++HF G L
Sbjct: 295 ITYLPEEFYHRLIEQ-VRNKVALEPFR-IDGYELCYQTP--TNLNGPTLTIHFEGGDVLL 350
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PA+ + IPV FCFA T+ GN Q + FDL V F C
Sbjct: 351 TPAQMF-IPVQDD-NFCFAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDC 404
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 174/343 (50%), Gaps = 31/343 (9%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y I +GTPP + VLDTGSD+ W QC PC C+ Q P++ P S++Y+ + C +P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 218 QCKSLDVSACRANR----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C++L R + C Y +YGDG+ T G L TET + G+ +V+G+A GCG +N
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR 333
G S+GL+G+G G LSL Q+ T R + G T+P
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLGVT--------RPRRSCRARAAARGGGAPTTTSP--- 260
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
L G +VG + I P++F + GDGG+I+D GT T L+ +A+ +L
Sbjct: 261 -----------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALAR 309
Query: 394 SFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+ L SG L C+ + +V VP + LHF G ++L ++Y++ SA
Sbjct: 310 ALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFD-GADMELRRESYVVEDRSA 367
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
G C ++ +S++G++QQQ T + +DL + F P KC
Sbjct: 368 GVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 409
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 138/419 (32%), Positives = 204/419 (48%), Gaps = 40/419 (9%)
Query: 90 LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV 149
LH + Y SL+ R +R TL+T L R + I+P+
Sbjct: 42 LHNPSLSRYDSLI-DAFRRSFSRSATLLTHLTSVSTACIR-------SPIIPD------- 86
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SGE+ I +GTPP + DTGSD+ W QC PC EC+ QS PIF+P+ SSSY
Sbjct: 87 ------SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSY 140
Query: 210 SPLPCAAPQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
+ CA+ C+SL+ C + C Y +YGD SFT GDL ++ ++ G S + +G
Sbjct: 141 RKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIG-SFKLPKTVIG 199
Query: 268 CGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPA--SGVLEFN 319
CGH N G F G ++G++GLGGG LSL Q++ + +YCL S A +G + F
Sbjct: 200 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFG 259
Query: 320 S---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
G V+ PL+ + DTFY++ L SVG + + + M G+ II+D
Sbjct: 260 RKAVVSGRQVVSTPLV-PRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGN--IIIDS 316
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT +T L Y + + R+ + + + CY + + +P ++ HF G
Sbjct: 317 GTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGA 376
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L N PV T C FAP ++ ++I GN+ Q V +DL N R+ F P C
Sbjct: 377 DVKLLPVNTFAPVADNVT-CLTFAP-ATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLC 433
>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
Length = 110
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 85/110 (77%), Positives = 94/110 (85%)
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
QAY S+RD+F RL NL+ GVA+FDTCYD S LRSVRVPTVS HFG + DLPAKNY
Sbjct: 1 QAYESVRDAFKRLTQNLRSAEGVAIFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNY 60
Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LIPVDS GTFCFAFAPTSS+LSIIGNVQQQGTRVSFD+AN+ VGF+PNKC
Sbjct: 61 LIPVDSDGTFCFAFAPTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 121/376 (32%), Positives = 178/376 (47%), Gaps = 28/376 (7%)
Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-E 192
PAEA + P +G + + E+ +G G+P + + + DTGSD++W+QC+PC+
Sbjct: 91 PAEA----PSATIPDHTGTNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGH 146
Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTET 252
CY+Q DP+FDP SSSY+ +PC +C + C C+Y V YGDGS T G L ET
Sbjct: 147 CYKQHDPVFDPAKSSSYAVVPCGTTECAAAG-GECNGTTCVYGVEYGDGSSTTGVLARET 205
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDR 308
++F +S G GCG N G F G L G A + +YCL
Sbjct: 206 LTFSSSSEFTGFIFGCGETNLGDF-GEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSY 264
Query: 309 D-SPASGVLEFNSARGGDAVTAPLIRNK-KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
+ +P + G V + NK +FY++ L ++GG + +PPS F
Sbjct: 265 NTTPGYLSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT- 323
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
G ++D GT +T L AY +LRD F KP DTCYDF+G + +P
Sbjct: 324 ----GTLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIP 379
Query: 427 TVSLHFGAGKALDLPAKNYL----IPVDS---AGTFCFAFAPTSSALSIIGNVQQQGTRV 479
VS +F G +L N+ P D+ G F P S++G+ Q+ V
Sbjct: 380 GVSFNFSDGAVFNL---NFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEV 436
Query: 480 SFDLANNRVGFTPNKC 495
+D+ ++GF P C
Sbjct: 437 IYDVPAQKIGFIPASC 452
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 123/372 (33%), Positives = 174/372 (46%), Gaps = 26/372 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG S S Y R G+G+P + + LDT +D W C PC C S +F P
Sbjct: 65 SAPVASGQSPPS--YVVRAGLGSPAQPILLALDTSADATWAHCSPCGTC-PSSGSLFAPA 121
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVS 254
S+SY+PLPC++ C L C A C + + D SF L ++ +
Sbjct: 122 NSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQ-ASLASDWLH 180
Query: 255 FGNSGSVKGIALGCGHDNEGLFVG--SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRD 309
G ++ A GC G GLLGLG G ++L Q+ +YCL
Sbjct: 181 LGKD-AIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYK 239
Query: 310 SPA-SGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
S SG L +A V P+++N + YYV +TG SVG V++P F D A
Sbjct: 240 SYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPA 299
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
G +VD GT ITR Y +LR+ F R + + FDTC++ + + P
Sbjct: 300 TGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPA 359
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDL 483
V++H G L LP +N LI + C A A ++ ++++ N+QQQ RV FD+
Sbjct: 360 VTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDV 419
Query: 484 ANNRVGFTPNKC 495
AN+RVGF C
Sbjct: 420 ANSRVGFARESC 431
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 179/356 (50%), Gaps = 23/356 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY ++ VGTPP V DTGSDI W QC PCT CYQQ P+F+P S++Y + C++
Sbjct: 83 GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSS 142
Query: 217 PQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGI---ALGCGH 270
P C D S C Y ++YGD S + GD +T++ G+ SG V A+GCGH
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGH 202
Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNS---A 321
DN G F +G++GLG G SL KQ+ + +YCL + D S L F S
Sbjct: 203 DNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANV 262
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
G AV+ P+ + K +FY + L SVG S G II+D GT +T
Sbjct: 263 SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFY--STANSILGGKANIIIDSGTTLT 320
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L Y++ + + + NL+ T F + C++ + +VP +++HF G L L
Sbjct: 321 LLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFE-TTTDDYKVPFIAMHF-EGANLRL 377
Query: 441 PAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+N LI V S C AFA + +SI GN+ Q V +D+ N + F P C
Sbjct: 378 QRENVLIRV-SDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 130/371 (35%), Positives = 184/371 (49%), Gaps = 33/371 (8%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD--PIFDPKTSSSYSP 211
G+G Y I +GTPP F +++DTGS++ W QC PCT C+ + P+ P SS++S
Sbjct: 86 NGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSR 145
Query: 212 LPCAAPQCKSLDVSA----CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
LPC C+ L S+ C A C Y YG G +T G L TET++ G+ G+ +A
Sbjct: 146 LPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGD-GTFPKVAF 203
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSAR-- 322
GC +N S+G++GLG G LSL Q+ +YCL D AS +L + A+
Sbjct: 204 GCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGSLAKLT 261
Query: 323 -GGDAVTAPLIRNKKVD--TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGT 378
G + PL++N + T YYV LTG +V + + S F + G GG IVD GT
Sbjct: 262 EGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGT 321
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLK---PTSGVAL-FDTCYDFS---GLRSVRVPTVSLH 431
+T L Y ++ +F NL P SG D CY S G ++VRVP ++L
Sbjct: 322 TLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALR 381
Query: 432 FGAGKALDLPAKNYL--IPVDSAGTF---CFAFAPTSSAL--SIIGNVQQQGTRVSFDLA 484
F G ++P +NY + DS G C P + L SIIGN+ Q + +D+
Sbjct: 382 FAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDID 441
Query: 485 NNRVGFTPNKC 495
F P C
Sbjct: 442 GGMFSFAPADC 452
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 136/427 (31%), Positives = 202/427 (47%), Gaps = 45/427 (10%)
Query: 99 RSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEA-----QILPEDFSTPVVS 150
R + + DS+R N T+LQ I NV H +K A + D P +
Sbjct: 25 RGFSVELIHPDSSRSPFYNIRETQLQ-RISNVVTHSIKRAHYLNHVFSLSHNDLPKPTI- 82
Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
Y +GTPP Q V+DTGSD W QC+PC C Q+ PIF+P SS+Y
Sbjct: 83 -IPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYK 141
Query: 211 PLPCAAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKG 263
+ C++P CK + + C +NR C Y++ Y D S + GD+ +T++ G+ S
Sbjct: 142 NIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPK 201
Query: 264 IALGCGHDN----EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SG 314
I +GCGH N EGL ++G++G G G S+ Q+ ++ +YCL S A S
Sbjct: 202 IVIGCGHKNSLTTEGL---ASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISS 258
Query: 315 VLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
L F G V+ PLI++ V Y+ L FSVG +++ S D G+
Sbjct: 259 KLYFGDMAVVSGHGVVSTPLIQSFYVGN-YFTNLEAFSVGDHIIKLKDSSLIPDNEGNA- 316
Query: 372 IIVDCGTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
++D G+ IT+L Y+ L S V+L PT ++L CY + L+ VP +
Sbjct: 317 -VIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSL---CYK-TTLKKYEVPII 371
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
+ HF G + L A N I ++ CFAF ++ + GN+ QQ V +D N +
Sbjct: 372 TAHF-RGADVKLNAFNTFIQMNHE-VMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNII 429
Query: 489 GFTPNKC 495
F P C
Sbjct: 430 SFKPTNC 436
>gi|147866052|emb|CAN80962.1| hypothetical protein VITISV_022007 [Vitis vinifera]
Length = 150
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 85/146 (58%), Positives = 107/146 (73%)
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
VGG V I +F + E GDGG+++D GTA+TRL T AY + RD+F+ NL +GVA
Sbjct: 5 VGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA 64
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSII 469
+FDTCYD G SVRVPTVS +F G L LPA+N+LIP+D AGTFCFAFAP++S LSI+
Sbjct: 65 IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSIL 124
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
GN+QQ+G ++SFD AN VGF PN C
Sbjct: 125 GNIQQEGIQISFDGANGYVGFGPNIC 150
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 131/432 (30%), Positives = 201/432 (46%), Gaps = 35/432 (8%)
Query: 87 REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST 146
R + KT++N + ++ + S NT + Q N+ +H L FS
Sbjct: 15 RVSVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNM-KHSTN--RVHYLNHVFSF 71
Query: 147 P------VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
P +V G G Y +GTPP Q V+DT +D W QC PC C+ + P+
Sbjct: 72 PPNKVPNIVVSPFMGDG-YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPM 130
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGN 257
FDP SS+Y +PC++P+CK+++ + C ++ C Y YG +++ GDL +T++ +
Sbjct: 131 FDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNS 190
Query: 258 SG----SVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKAT---SLAYCLVD-- 307
+ S K I +GCGH N+G G +G +GLG G LS Q+ ++ +YCLV
Sbjct: 191 NNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLF 250
Query: 308 RDSPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
+ SG L F G V+ P+ + Y L SVG ++ S +
Sbjct: 251 SNEGISGKLHFGDKSVVSGVGTVSTPITAG---EIGYSTTLNALSVGDHIIKFENSTSKN 307
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
D G+ I+D GT +T L Y+ L + + S F CY + L+++
Sbjct: 308 DNLGN--TIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYK-ATLKNLD 364
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDL 483
VP ++ HF G + L + N P+D CFAF + +IIGN+ QQ V FDL
Sbjct: 365 VPIITAHFN-GADVHLNSLNTFYPIDHE-VVCFAFVSVGNFPGTIIGNIAQQNFLVGFDL 422
Query: 484 ANNRVGFTPNKC 495
N + F P C
Sbjct: 423 QKNIISFKPTDC 434
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 130/367 (35%), Positives = 190/367 (51%), Gaps = 42/367 (11%)
Query: 145 STPVVSGASQG---SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
S PV GA + EY + +GTPP+ + LDTGSD+ W QC+PC C+ Q+ P F
Sbjct: 72 SAPVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYF 131
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
DP TSS+ S C + C+ L V++ R+++ + G S
Sbjct: 132 DPSTSSTLSLTSCDSTLCQGLPVASLPRSDKFTF--------------------VGAGAS 171
Query: 261 VKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE 317
V G+A GCG N G+F + G+ G G G LSL Q+K + ++C + P++ +L+
Sbjct: 172 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLD 231
Query: 318 FNS---ARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
+ + G AV T PLI+N TFYY+ L G +VG + +P S F + G GG I
Sbjct: 232 LPADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKN-GTGGTI 290
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSL 430
+D GTA+T L T+ Y +RD+F A +K P D + S LR+ VP + L
Sbjct: 291 IDSGTAMTSLPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVL 347
Query: 431 HFGAGKALDLPAKNYLIPVDSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
HF G +DLP +NY+ V+ AG+ C A ++ IGN QQQ V +DL N+++
Sbjct: 348 HF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSKL 405
Query: 489 GFTPNKC 495
F P +C
Sbjct: 406 SFVPAQC 412
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/426 (31%), Positives = 199/426 (46%), Gaps = 47/426 (11%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D AR N+L + + A + A A E P+ SG + Y + I +
Sbjct: 107 LAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAE---VPLTSGIRFQTLNYVTTIAL 163
Query: 166 GTPPR------QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
G ++++DTGSD+ W+QC+PC+ CY Q DP+FDP S+SY+ +PC A C
Sbjct: 164 GGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASAC 223
Query: 220 KSLDVSAC----------------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
++ +A ++ RC Y +AYGDGSF+ G L T+TV+ G + SV G
Sbjct: 224 EASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDG 282
Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDS-PASGVLEF- 318
GCG N GLF G+AGL+GLG LSL Q +YCL S A+G L
Sbjct: 283 FVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLG 342
Query: 319 ---NSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+S R V+ +I + FY++ +T + + G +++
Sbjct: 343 GDTSSYRNATPVSYTRMIADPAQPPFYFMNVT-------GASVGGAAVAAAGLGAANVLL 395
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVSLHF 432
D GT ITRL Y ++R F R G + + +L D CY+ +G V+VP ++L
Sbjct: 396 DSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRL 455
Query: 433 GAGKALDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVG 489
G + + A L G+ C A A S IIGN QQ+ RV +D +R+G
Sbjct: 456 EGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLG 515
Query: 490 FTPNKC 495
F C
Sbjct: 516 FADEDC 521
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 175/355 (49%), Gaps = 38/355 (10%)
Query: 171 QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC--- 227
++++DTGSD+ W+QC+PC+ CY Q DP+FDP S+SY+ +PC A C++ +A
Sbjct: 176 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 235
Query: 228 -------------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
++ RC Y +AYGDGSF+ G L T+TV+ G + SV G GCG N G
Sbjct: 236 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDGFVFGCGLSNRG 294
Query: 275 LFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDS-PASGVLEF----NSARGGDA 326
LF G+AGL+GLG LSL Q +YCL S A+G L +S R
Sbjct: 295 LFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATP 354
Query: 327 VT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
V+ +I + FY++ +T + + G +++D GT ITRL
Sbjct: 355 VSYTRMIADPAQPPFYFMNVT-------GASVGGAAVAAAGLGAANVLLDSGTVITRLAP 407
Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
Y ++R F R G + + +L D CY+ +G V+VP ++L G + + A
Sbjct: 408 SVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAA 467
Query: 444 NYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L G+ C A A S IIGN QQ+ RV +D +R+GF C
Sbjct: 468 GMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 522
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 179/356 (50%), Gaps = 23/356 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY ++ VGTPP V DTGSDI W QC PCT CYQQ P+F+P S++Y + C++
Sbjct: 83 GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSS 142
Query: 217 PQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGI---ALGCGH 270
P C D S C Y ++YGD S + GD +T++ G+ SG V A+GCGH
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGH 202
Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNS---A 321
DN G F +G++GLG G SL KQ+ + +YCL + D S L F S
Sbjct: 203 DNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANV 262
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
G AV+ P+ + K +FY + L SVG S G II+D GT +T
Sbjct: 263 SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFY--STANSILGGKANIIIDSGTTLT 320
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L Y++ + + + NL+ T F + C++ + +VP +++HF G L L
Sbjct: 321 LLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFE-TTTDDYKVPFIAMHF-EGANLRL 377
Query: 441 PAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+N LI V S C AFA + +SI GN+ Q V +D+ N + F P C
Sbjct: 378 QRENVLIRV-SDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 143/458 (31%), Positives = 210/458 (45%), Gaps = 61/458 (13%)
Query: 59 EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
EP +S ++ + PLN P+ S K + + L L RD R N +
Sbjct: 47 EPKVRDSSSSGATVPLNHRHGPCSPVPS----GKKKQPTFTEL----LRRDQLRANYIQR 98
Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
+ D H P + + + P+ G+ + EY + +G+P +M +DT
Sbjct: 99 QFS------DEH--YPRTGGLQQSEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDT 150
Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD---VSACRANRCLYQ 235
GSD++WL+C+ ++DP TSS+Y+P C+AP C L + C+Y
Sbjct: 151 GSDVSWLRCK---------SRLYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYS 201
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGS--VKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSL 292
V YGDGS T G ++T++ + + G GC G + GL+GLGG S
Sbjct: 202 VKYGDGSNTTGTYGSDTLTLAGTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSF 261
Query: 293 TKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLT 346
Q AT + +YCL + +SG L + + P++R+K+ TFY + L
Sbjct: 262 VSQTAATYGSAFSYCLPPTWN-SSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLR 320
Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL----RDSFVRLAGNL 402
G SVGG+ ++IP S+F G IVD GT ITRL AY +L RD R
Sbjct: 321 GISVGGKTLEIPSSVFS------AGSIVDSGTVITRLPPTAYGALSAAFRDGMARY--QY 372
Query: 403 KPTSGVALFDTCYDFSGL---RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
+P + L DTC+DF+G + VP+V+L G +DL I D C AF
Sbjct: 373 QPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDGGAVVDLHPNG--IVQDG----CLAF 426
Query: 460 APT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A T IIGNVQQ+ V +D+ + GF P C
Sbjct: 427 AATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 124/338 (36%), Positives = 163/338 (48%), Gaps = 30/338 (8%)
Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
M +DT D+ W+QC PC ECY Q + +FDP+ S + + +PC + C L + C
Sbjct: 164 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 223
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGG 288
N+C Y V YGDG T G + + ++ S V GC H G F S +G + LGGG
Sbjct: 224 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGG 283
Query: 289 MLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA----VTAPLIRNKK-VDTF 340
SL Q AT + +YC+ D S SG L G PL+RN + T
Sbjct: 284 RQSLLSQTAATFGNAFSYCVPDPSS--SGFLSLGGPADGGGAGRFARTPLVRNPSIIPTL 341
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LA 399
Y V L G VGG+ + +PP +F GG ++D IT+L AY +LR +F +A
Sbjct: 342 YLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMA 395
Query: 400 GNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
+ G A DTCYDF SV VP VSL F G + L A ++ C AF
Sbjct: 396 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAF 449
Query: 460 APTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PT AL IGNVQQQ V +D+ VGF C
Sbjct: 450 VPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 124/338 (36%), Positives = 163/338 (48%), Gaps = 30/338 (8%)
Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
M +DT D+ W+QC PC ECY Q + +FDP+ S + + +PC + C L + C
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 207
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGG 288
N+C Y V YGDG T G + + ++ S V GC H G F S +G + LGGG
Sbjct: 208 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGG 267
Query: 289 MLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA----VTAPLIRNKK-VDTF 340
SL Q AT + +YC+ D S SG L G PL+RN + T
Sbjct: 268 RQSLLSQTAATFGNAFSYCVPDPSS--SGFLSLGGPADGGGAGRFARTPLVRNPSIIPTL 325
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LA 399
Y V L G VGG+ + +PP +F GG ++D IT+L AY +LR +F +A
Sbjct: 326 YLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMA 379
Query: 400 GNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
+ G A DTCYDF SV VP VSL F G + L A ++ C AF
Sbjct: 380 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAF 433
Query: 460 APTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
PT AL IGNVQQQ V +D+ VGF C
Sbjct: 434 VPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 114/344 (33%), Positives = 171/344 (49%), Gaps = 25/344 (7%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP + + DTGSD+ W QC PC +CYQQ PIF+P S+S+S +PC C ++D
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD 145
Query: 225 SACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
C C Y YGD +++ GDL E ++ G+S SVK + +GCGH + G F ++G++
Sbjct: 146 GHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSS-SVKSV-IGCGHASSGGFGFASGVI 203
Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS---ARGGDAVTAPLIRNK 335
GLGGG LSL Q+ TS +YCL S A+G + F G V+ PLI
Sbjct: 204 GLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKN 263
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
V T+YY+ L S+G + M A G +I+D GT ++ L + Y+ + S
Sbjct: 264 TV-TYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSL 314
Query: 396 VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
+++ + +D C+D + S +P ++ F G ++L N V +
Sbjct: 315 LKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKV-ANN 373
Query: 454 TFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C P S IIGN+ + +DL R+ F P C
Sbjct: 374 VNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 132/369 (35%), Positives = 192/369 (52%), Gaps = 38/369 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCA 215
GEY + +GTPP+ + + DTGSD+ W QC PC E C++Q P+++P +S ++ LPC+
Sbjct: 90 GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 149
Query: 216 APQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
+ +L++ A A C Y YG G +T G +ET +FG+S + V
Sbjct: 150 S----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRV 204
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
GIA GC + + + GSAGL+GLG G LSL Q+ A +YCL +D+ + L
Sbjct: 205 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGP 264
Query: 321 A------RGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
A G + P + + T+YY+ LTG SVG A+ IPP F + G GG
Sbjct: 265 AAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGG 324
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDF--SGLRSVRVPT 427
+I+D GT IT L AY +R + VR L T G D C+ S +P+
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 383
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANN 486
++LHFG G + LP +NY+I +D G +C A + T LS +GN QQQ + +D+
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKE 441
Query: 487 RVGFTPNKC 495
+ F P KC
Sbjct: 442 TLSFAPAKC 450
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 112/339 (33%), Positives = 173/339 (51%), Gaps = 28/339 (8%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
++++D+GSD++W+QC+PC C++Q DP+FDP S++Y+ +PC + C L C
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCS 228
Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
AN +C + + YGDGS G + ++ G ++G GC H + G AG L L
Sbjct: 229 ANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLAL 288
Query: 286 GGGMLSLTKQIK---ATSLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
GGG SL +Q +YCL S GV + V+ PL+ +
Sbjct: 289 GGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAP 348
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
TFY V L V G+ + +PP++F ++D T I+RL AY +LR +F
Sbjct: 349 TFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSA 402
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
+ V++ DTCYDF+G+RS+ +P+++L F G ++L A L+ G+ C A
Sbjct: 403 MTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLA 456
Query: 459 FAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
FAPT+S IGNVQQ+ V +D+ + F C
Sbjct: 457 FAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 173/352 (49%), Gaps = 23/352 (6%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y +R G+GTP + + +D +D W+ C C C S P F P SS+Y +PC +P
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159
Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
QC + +C A + C + + Y +F L ++++ N+ V GC G
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQ-AVLGQDSLALENN-VVVSYTFGCLRVVSG 217
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD-RDSPASGVLEFNSARGGDAV-TA 329
V GL+G G G LS Q K T +YCL + R S SG L+ + T
Sbjct: 218 NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTT 277
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV + G VG + VQ+P S + G I+D GT TRL Y
Sbjct: 278 PLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYA 337
Query: 390 SLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
++RD+F R+ + P G FDTCY+ +V VPTV+ F A+ LP +N +I
Sbjct: 338 AVRDAFRGRVRTPVAPPLGG--FDTCYNV----TVSVPTVTFMFAGAVAVTLPEENVMIH 391
Query: 449 VDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S G C A A ++AL+++ ++QQQ RV FD+AN RVGF+ C
Sbjct: 392 SSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 184/371 (49%), Gaps = 33/371 (8%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD--PIFDPKTSSSYSP 211
G+G Y I +GTPP F +++DTGS++ W QC PCT C+ + P+ P SS++S
Sbjct: 86 NGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSR 145
Query: 212 LPCAAPQCKSLDVSA----CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
LPC C+ L S+ C A C Y YG G +T G L TET++ G+ G+ +A
Sbjct: 146 LPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGD-GTFPKVAF 203
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSARGG 324
GC +N S+G++GLG G LSL Q+ +YCL D AS +L + A+
Sbjct: 204 GCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGSLAKLT 261
Query: 325 D---AVTAPLIRNKKVD--TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGT 378
+ + PL++N + T YYV LTG +V + + S F + G GG IVD GT
Sbjct: 262 ERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGT 321
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLK---PTSGVAL-FDTCYDFS---GLRSVRVPTVSLH 431
+T L Y ++ +F NL P SG D CY S G ++VRVP ++L
Sbjct: 322 TLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALR 381
Query: 432 FGAGKALDLPAKNYL--IPVDSAGTF---CFAFAPTSSAL--SIIGNVQQQGTRVSFDLA 484
F G ++P +NY + DS G C P + L SIIGN+ Q + +D+
Sbjct: 382 FAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDID 441
Query: 485 NNRVGFTPNKC 495
F P C
Sbjct: 442 GGMFSFAPADC 452
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 179/362 (49%), Gaps = 21/362 (5%)
Query: 145 STPVVSG-ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
S P+ SG A S Y R +GTP + + LDT +D W+ C C C S +FDP
Sbjct: 73 SVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDP 130
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
SSS L C APQCK +C ++ C + + YG GS L +T++ S +
Sbjct: 131 SKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLA-SDVIP 188
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGVLEF 318
GC + G + + GL+GLG G LSL Q + ++ +YCL + + S SG L
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248
Query: 319 NSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
+ T PL++N + + YYV L G VG + V IP S D A G I D G
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T TRL AY ++R+ F R N TS + FDTCY SG SV P+V+ F AG
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCY--SG--SVVFPSVTFMF-AGMN 362
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ LP N LI + C A A +S L++I ++QQQ RV D+ N+R+G +
Sbjct: 363 VTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRE 422
Query: 494 KC 495
C
Sbjct: 423 TC 424
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 132/369 (35%), Positives = 192/369 (52%), Gaps = 38/369 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCA 215
GEY + +GTPP+ + + DTGSD+ W QC PC E C++Q P+++P +S ++ LPC+
Sbjct: 95 GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 154
Query: 216 APQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
+ +L++ A A C Y YG G +T G +ET +FG+S + V
Sbjct: 155 S----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRV 209
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
GIA GC + + + GSAGL+GLG G LSL Q+ A +YCL +D+ + L
Sbjct: 210 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGP 269
Query: 321 A------RGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
A G + P + + T+YY+ LTG SVG A+ IPP F + G GG
Sbjct: 270 AAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGG 329
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDF--SGLRSVRVPT 427
+I+D GT IT L AY +R + VR L T G D C+ S +P+
Sbjct: 330 LIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 388
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANN 486
++LHFG G + LP +NY+I +D G +C A + T LS +GN QQQ + +D+
Sbjct: 389 MTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKE 446
Query: 487 RVGFTPNKC 495
+ F P KC
Sbjct: 447 TLSFAPAKC 455
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 179/362 (49%), Gaps = 21/362 (5%)
Query: 145 STPVVSG-ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
S P+ SG A S Y R +GTP + + LDT +D W+ C C C S +FDP
Sbjct: 73 SVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDP 130
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
SSS L C APQCK +C ++ C + + YG GS L +T++ S +
Sbjct: 131 SKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLA-SDVIP 188
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGVLEF 318
GC + G + + GL+GLG G LSL Q + ++ +YCL + + S SG L
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248
Query: 319 NSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
+ T PL++N + + YYV L G VG + V IP S D A G I D G
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T TRL AY ++R+ F R N TS + FDTCY SG SV P+V+ F AG
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCY--SG--SVVFPSVTFMF-AGMN 362
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ LP N LI + C A A +S L++I ++QQQ RV D+ N+R+G +
Sbjct: 363 VTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRE 422
Query: 494 KC 495
C
Sbjct: 423 TC 424
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 132/414 (31%), Positives = 200/414 (48%), Gaps = 57/414 (13%)
Query: 121 QLAIYNVDRH-ELKPAEAQILP---EDFST--------------PVVSGASQGSGEYFSR 162
QL + ++R P +++++P EDF T P+ S A Q S
Sbjct: 86 QLRVDGIERRLSDNPHDSKLVPAGGEDFQTNGNLLQVNYGNSGQPMSSEAQQSGVVNASA 145
Query: 163 IGVGT----PPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAA 216
G G+ P ++VLD+ SD+ W+QC PC C+ Q D +DP S S +P C++
Sbjct: 146 AGGGSRSKLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSS 205
Query: 217 PQCKSLD--VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
P C +L + C N+C Y V Y DGS T G + + ++ +V G GC H +G
Sbjct: 206 PTCTALGPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQG 265
Query: 275 LF-VGSAGLLGLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA--VT 328
F +AG++ LGGG +LS T + +YC+ S SG R + V
Sbjct: 266 SFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLGVPRRASSRYVV 324
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
P++R ++ TFY V L +VGGQ + + P++F G ++D TAITRL AY
Sbjct: 325 TPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFA------AGSVLDSRTAITRLPPTAY 378
Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
+LR +F + DTCYDF+G+ ++R+P +SL F +N ++P
Sbjct: 379 QALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLP 429
Query: 449 VDSAGTF---CFAFAPTSSA----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+D +G C AF TS+A ++G+VQQQ V +D+ VGF C
Sbjct: 430 LDPSGILFNDCLAF--TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 173/352 (49%), Gaps = 23/352 (6%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y +R G+GTP + + +D +D W+ C C C S P F P SS+Y +PC +P
Sbjct: 82 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 140
Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
QC + +C A + C + + Y +F L ++++ N+ V GC G
Sbjct: 141 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQ-AVLGQDSLALENN-VVVSYTFGCLRVVSG 198
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD-RDSPASGVLEFNSARGGDAV-TA 329
V GL+G G G LS Q K T +YCL + R S SG L+ + T
Sbjct: 199 NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTT 258
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV + G VG + VQ+P S + G I+D GT TRL Y
Sbjct: 259 PLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYA 318
Query: 390 SLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
++RD+F R+ + P G FDTCY+ +V VPTV+ F A+ LP +N +I
Sbjct: 319 AVRDAFRGRVRTPVAPPLGG--FDTCYNV----TVSVPTVTFMFAGAVAVTLPEENVMIH 372
Query: 449 VDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S G C A A ++AL+++ ++QQQ RV FD+AN RVGF+ C
Sbjct: 373 SSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 122/354 (34%), Positives = 168/354 (47%), Gaps = 40/354 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY VGTPP + + DTGSDI WLQC PC ECY Q+ P F P SS+Y +PC++
Sbjct: 85 GEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSS 144
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
CKS G+ +V L E+ S G+ S +GCG DN F
Sbjct: 145 DLCKSGQ----------------QGNLSVDTLTLES-STGHPISFPKTVIGCGTDNTVSF 187
Query: 277 VG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLEF--NSARGGDAVT 328
G S+G++GLGGG SL Q+ ++ +YCL+ +S + L F + GD V
Sbjct: 188 EGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVV 247
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG--DGGIIVDCGTAITRLQTQ 386
+ I K FYY+ L FSVG + ++ FE G +G II+D GT +T + T
Sbjct: 248 STPIVKKDPIVFYYLTLEAFSVGNKRIE-----FEGSSNGGHEGNIIIDSGTTLTVIPTD 302
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
YN+L + + L + LF+ CY + P ++ HF P ++
Sbjct: 303 VYNNLESAVLELVKLKRVNDPTRLFNLCYSVTS-DGYDFPIITTHFKGADVKLHPISTFV 361
Query: 447 IPVDSAGTFCFAFAPTSS-----ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D G C AFA TS+ +SI GN+ QQ V +DL V F P C
Sbjct: 362 DVAD--GIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDC 413
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 172/346 (49%), Gaps = 37/346 (10%)
Query: 173 SMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN 230
+M +DT D+ W+QC PC +CY Q + FDP+ SS+ +P+ C + C++L A +
Sbjct: 160 TMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCS 219
Query: 231 R------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLL 283
+ CLY++ Y D T+G +T+T++ S + GC H G F A G +
Sbjct: 220 KPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQASGTM 279
Query: 284 GLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA-------VTAPLIR 333
LGGG +LS T + + +YC+ A+G L GD T PL+R
Sbjct: 280 SLGGGPQSLLSQTARAYGNAFSYCVPGPS--AAGFLSIGGPVNGDDGGGSGAFATTPLVR 337
Query: 334 NKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
+ V T Y V L G V G+ + +PP +F GG ++D IT+L AY +L
Sbjct: 338 SANVINPTIYVVRLQGIEVAGRRLNVPPVVFS------GGTVMDSSAVITQLPPTAYRAL 391
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
R +F K + DTC+DF G+ V VPTVSL F G ++L + L+ DS
Sbjct: 392 RLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLL--DS 449
Query: 452 AGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFAP ++ AL IGNVQQQ V +D+A VGF C
Sbjct: 450 ----CLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 132/369 (35%), Positives = 193/369 (52%), Gaps = 38/369 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCA 215
GEY + +GTPP+ + + DTGSD+ W QC PC E C++Q P+++P +S ++ LPC+
Sbjct: 90 GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 149
Query: 216 APQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
+ +L++ A A C Y YG G +T G +ET +FG+S + V
Sbjct: 150 S----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRV 204
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
GIA GC + + + GSAGL+GLG G LSL Q+ A +YCL +D+ + L
Sbjct: 205 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGP 264
Query: 321 ARGGDAVTAPLIRN---------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
A A+ +R+ + T+YY+ LTG SVG A+ IPP F + G GG
Sbjct: 265 AAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGG 324
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDF--SGLRSVRVPT 427
+I+D GT IT L AY +R + VR L T G D C+ S +P+
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 383
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANN 486
++LHFG G + LP +NY+I +D G +C A + T LS +GN QQQ + +D+
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKE 441
Query: 487 RVGFTPNKC 495
+ F P KC
Sbjct: 442 TLSFAPAKC 450
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 131/365 (35%), Positives = 183/365 (50%), Gaps = 25/365 (6%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDP 199
+ S P G S S EY + + GTP +V+DTGSD+ WLQC+PC+ +C Q DP
Sbjct: 95 KKVSVPAHLGTSVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDP 154
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
+FDP SS+YS +PCA+ +CK L S C + C + ++Y DG+ TVG + ++
Sbjct: 155 LFDPSHSSTYSAVPCASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLT 214
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI-KATSLAYCLVDRDSPAS 313
VK GCGH L GLLGLG SL Q +YCL +S
Sbjct: 215 LAPGAIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNS-KP 273
Query: 314 GVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G L F + R V P+ R TF V L G +VGG+ + + PS F GG+
Sbjct: 274 GFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFS------GGM 327
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
IVD GT +T LQ+ Y +LR +F + G DTCYD +G ++V VP ++L F
Sbjct: 328 IVDSGTVVTVLQSTVYRALRAAFREAMKAYRLVHGD--LDTCYDLTGYKNVVVPKIALTF 385
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGF 490
G ++L N ++ V+ C AFA T ++GNV Q+ V FD + ++ GF
Sbjct: 386 SGGATINLDVPNGIL-VNG----CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGF 440
Query: 491 TPNKC 495
C
Sbjct: 441 RAKAC 445
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 180/356 (50%), Gaps = 25/356 (7%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
+G+GTPP+ +++DTGSD+ W QC+ S P++DP SS+++ LPC+
Sbjct: 95 VGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRL 154
Query: 219 CKSLDVS---ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEG 274
C+ S NRC+Y+ YG + VG L +ET +FG +V + GCG + G
Sbjct: 155 CQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGARRAVSLRLGFGCGALSAG 213
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA--SGVLEFNSARGGDAV- 327
+G+ G+LGL LSL Q+K +YCL + SP + + + + +
Sbjct: 214 SLIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQ 273
Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
T ++ N +YYV L G S+G + + +P + M G GG IVD G+ + L A
Sbjct: 274 TTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 333
Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLP 441
+ +++++ + + V ++ C+ + + +V+VP + LHF G A+ LP
Sbjct: 334 FEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLP 393
Query: 442 AKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NY AG C A T+ S +SIIGNVQQQ V FD+ +++ F P +C
Sbjct: 394 RDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 448
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/347 (36%), Positives = 176/347 (50%), Gaps = 39/347 (11%)
Query: 173 SMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD------V 224
SMV+DT SD+ W+QC PC + CY QSD ++DP S +P PC++PQC+SL
Sbjct: 175 SMVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCT 234
Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTE--TVSFGNSGSVKGIALGCGHD--NEGLFVG-S 279
A C Y+V Y DGS T G V++ T++ G+V GC H G F +
Sbjct: 235 GAGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKT 294
Query: 280 AGLLGLGGGMLSLTKQIKAT-----SLAYCLVDRDSPAS----GVLEFNSARGGDAVTAP 330
AG + LG G SL+ Q K T +YCL S GV + ++R AVT P
Sbjct: 295 AGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASR--YAVT-P 351
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
++++K Y V L G V GQ + +PP++F + A +D T ITRL AY +
Sbjct: 352 MLKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAA------MDSRTIITRLPPTAYMA 405
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
LR +F + + DTCYDF+G+ VR+P V+L F A++L ++ D
Sbjct: 406 LRAAFRAQMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML--D 463
Query: 451 SAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S C AFAP ++ IIGNVQQQ V +++ VGF C
Sbjct: 464 S----CLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 180/362 (49%), Gaps = 21/362 (5%)
Query: 145 STPVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
S P+ SG S Y R +GTP + + LDT +D W+ C C C S +FDP
Sbjct: 73 SVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGC--SSSVLFDP 130
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
SSS L C APQCK +C ++ C + + YG GS L +T++ + +
Sbjct: 131 SKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLA-TDVIP 188
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGVLEF 318
GC + G + + GL+GLG G LSL Q + ++ +YCL + + S SG L
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248
Query: 319 NSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
+ T PL++N + + YYV L G VG + V IP S D A G I D G
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T TRL AY ++R+ F R N TS + FDTCY SG SV P+V+ F AG
Sbjct: 309 TVYTRLVEPAYVAMRNEFRRRVKNANATS-LGGFDTCY--SG--SVVFPSVTFMF-AGMN 362
Query: 438 LDLPAKNYLIPVDSAGTFCFAF--APT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ LP N LI + C A APT +S L++I ++QQQ RV D+ N+R+G +
Sbjct: 363 VTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRE 422
Query: 494 KC 495
C
Sbjct: 423 TC 424
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 110/340 (32%), Positives = 175/340 (51%), Gaps = 29/340 (8%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
++++D+GSD+ W+QC+PC C+ Q DP+FDP TS++Y+ +PC++ C L C
Sbjct: 82 TVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCL 141
Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
AN +C + + Y +G+ G ++ ++ G V+G GC H ++G AG L L
Sbjct: 142 ANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQGSTFSYDVAGTLAL 201
Query: 286 GGGMLSLTKQIKAT---SLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
GGG S +Q + +YC+ S GV +A V+ PL+ + +
Sbjct: 202 GGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFVSTPLLSSSTMS 261
Query: 339 -TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
TFY V L V G+ + +PP++F ++D T I+R+ AY +LR +F
Sbjct: 262 PTFYRVLLRSIIVAGRPLPVPPTVFSASS------VIDSATVISRIPPTAYQALRAAFRS 315
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
+P V++ DTCYDFSG+RS+ +P+++L F G ++L A L+ C
Sbjct: 316 AMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL------QGCL 369
Query: 458 AFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFAPT+S IGNVQQ+ V +D+ + F C
Sbjct: 370 AFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 127/359 (35%), Positives = 189/359 (52%), Gaps = 25/359 (6%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G GEYF RI +GTPP + ++ DTGSD+ W+QC+PC ECY+Q PIF+PK SS+Y + C
Sbjct: 90 GGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLC 149
Query: 215 AAPQCKSL--DVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFGNS-GSVKGIALG 267
C +L D+ AC A+ C Y +YGD SFT+G L TE G++ S++ +A G
Sbjct: 150 ETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSIQELAFG 209
Query: 268 CGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLV---DRDSPASGVLEF-- 318
CG+ N G F +G++GLGGG LSL Q+ +YCLV ++ + + G + F
Sbjct: 210 CGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGD 269
Query: 319 NSARGGD--AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
NS G V+ PL+ +K+ +TFYY+ L SVG + + S + + G II+D
Sbjct: 270 NSFISGSDTYVSTPLV-SKEPETFYYLTLEAISVGNERLAYENSRNDGN-VEKGNIIIDS 327
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT +T L ++ YN L + + + +F C F + +P +++HF
Sbjct: 328 GTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSIC--FRDKIGIELPIITVHFTDAD 385
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P + + CF P S+ ++I GN+ Q V +DL N V F P C
Sbjct: 386 VELKPINTFAKAEEDL--LCFTMIP-SNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTDC 441
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 174/356 (48%), Gaps = 22/356 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY I VGTPP V DTGSD+ W QC+PC+ CYQQ+ P+FDP S++Y + C++
Sbjct: 81 GEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSS 140
Query: 217 PQCK-SLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGH 270
P C S D S+C + CLY +AYGD S + G+L +TV+ G + +GCGH
Sbjct: 141 PVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGH 200
Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA---SGVLEFNS--- 320
DN G F + +G++GLG G SL Q+ + +YCL+ + + S L F S
Sbjct: 201 DNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNAN 260
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
G V+ P+ + + TFY + L SVG P ++ G+ II+D GT +
Sbjct: 261 VSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKL--GGESNIIIDSGTTL 318
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
T L + NS + + D C+ + +P V++HF G + L
Sbjct: 319 TYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFA-TTTDDYEMPPVTMHF-EGADVPL 376
Query: 441 PAKNYLIPVDSAGTFCFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+N + + S T C AF + I GN+ Q V +D+ N V F P C
Sbjct: 377 QRENLFVRL-SDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 134/414 (32%), Positives = 202/414 (48%), Gaps = 36/414 (8%)
Query: 106 LERDSAR---VNTLITKLQLAIYNVDR-----HELKPAEAQILPEDFSTPVVSGASQGSG 157
+ RDS + N+ T LQ + R H + A + P++ + +++ G
Sbjct: 36 VHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANG----G 91
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP + + DTGSD+ W QC PC +CY+Q P+FDPK+S +Y L C
Sbjct: 92 EYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTR 151
Query: 218 QCKSL-DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF--GNSGSV--KGIALGCGHD 271
QC++L + S+C + + C Y YGD SFT G+L +TV+ N G V +GCG
Sbjct: 152 QCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRR 211
Query: 272 NEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG---VLEF--NSAR 322
N G F +G++GLGGG +SL Q+ ++ +YCLV S ++G L F N+
Sbjct: 212 NNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNAVV 271
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G V + + +K DTFYY+ L SVG + ++ S F E II+D GT++T
Sbjct: 272 SGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEG---NIIIDSGTSLTL 328
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVA-LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
+ + N + T + L CY + ++VP ++ HF G + L
Sbjct: 329 FPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPT--PDLKVPVITAHFN-GADVVLQ 385
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N I + S C AF T S +I GNV Q + +D+ V F P C
Sbjct: 386 TLNTFILI-SDDVLCLAFNSTQSG-AIFGNVAQMNFLIGYDIQGKSVSFKPTDC 437
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/307 (38%), Positives = 169/307 (55%), Gaps = 24/307 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTPP+ + LDTGSD+ W QC+PC C+ Q+ P FDP TSS+ S C +
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 218 QCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGH 270
C+ L V++C + + C+Y +YGD S T G L + +F G SV G+A GCG
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200
Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE-----FNSAR 322
N G+F + G+ G G G LSL Q+K + ++C V+ P++ +L+ + S R
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYKSGR 260
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G T PLI+N TFYY+ L G +VG + +P S F + G GG I+D GTA+T
Sbjct: 261 GAVQST-PLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKN-GTGGTIIDSGTAMTS 318
Query: 383 LQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSLHFGAGKALD 439
L T+ Y +RD+F A +K P D + S LR+ VP + LHF G +D
Sbjct: 319 LPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMD 374
Query: 440 LPAKNYL 446
LP +NY+
Sbjct: 375 LPRENYV 381
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 144/370 (38%), Positives = 192/370 (51%), Gaps = 32/370 (8%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--QSDP 199
+ + P G S G+ +Y + +GTP ++ +DTGSD++W+QC PC Q D
Sbjct: 483 KSVTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQ 542
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV--SACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
+FDP SSSYS +PCAA C L C A ++C Y V+YGDGS T G ++T++
Sbjct: 543 LFDPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLT 602
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDRDSPA 312
++ +V G GCGH GLF G GLL LG +SLT Q +YCL S +
Sbjct: 603 DADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS-S 661
Query: 313 SGVLEFN--SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGD 369
+G L S+ G A T L+ V TFY V LTG VGGQ + +P S F
Sbjct: 662 TGFLTLGGPSSASGFATTG-LLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFA------ 714
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFDTCYDFSGLRSVRV 425
GG +VD GT ITRL AY +LR +F P +G+ DTCY+F+ +V +
Sbjct: 715 GGTVVDTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGI--LDTCYNFTDYGTVTL 772
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
PTVSL F G L L A +L S+G FA +I+GNVQQ+ V FD
Sbjct: 773 PTVSLTFSGGATLKLDAPGFL----SSGCLAFATNSGDGDPAILGNVQQRSFAVRFD--G 826
Query: 486 NRVGFTPNKC 495
+ VGF P+ C
Sbjct: 827 SSVGFMPHSC 836
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 123/357 (34%), Positives = 169/357 (47%), Gaps = 27/357 (7%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G++ I +GTPP + + ++DTGSD+ W+QC PC CY+Q P+FDP SS+Y+ + C +
Sbjct: 66 GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDS 125
Query: 217 PQCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
P C LD C RC Y YGD S T G L +T +F G S+ GCGH+
Sbjct: 126 PLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHN 185
Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQI----KATSLAYCLVD--RDSPASGVLEFNSAR-- 322
N G F GL+GLGGG SL QI + CLV D S + F
Sbjct: 186 NTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQV 245
Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAI 380
G VT PL+ +K DT Y+V L G SV + F M+ G ++VD GT
Sbjct: 246 LGNGVVTTPLVPREK-DTSYFVTLLGISVED-------TYFPMNSTIGKANMLVDSGTPP 297
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L Q Y+ + VR LKP + T + +++ PT++ HF L
Sbjct: 298 ILLPQQLYDKVFAE-VRNKVALKPITDDPSLGTQLCYRTQTNLKGPTLTFHFVGANVLLT 356
Query: 441 PAKNYLIPV-DSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P + ++ P + G FC A + T+S + GN Q + FDL V F P C
Sbjct: 357 PIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 140/408 (34%), Positives = 189/408 (46%), Gaps = 43/408 (10%)
Query: 113 VNTLITKLQLAIYNVDRHELKPAEA--QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPR 170
V TL K + RH P A QIL TP Y +R +GTPP+
Sbjct: 66 VATLAAKPKPKPKGHSRHTFVPIAAGRQIL----RTP----------SYVARARLGTPPQ 111
Query: 171 QFSMVLDTGSDINWLQCRPCTECYQ-QSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC 227
+ +D +D W+ C C C S P FDP SS+Y P+ C APQC + +C
Sbjct: 112 TLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQCAQVPPATPSC 171
Query: 228 RAN---RCLYQVAYGDGSFTV---GDLVTETVSFGNSGSVKGIALGCGH--DNEGLFVGS 279
A C + ++Y + D ++ + S G + GC G V
Sbjct: 172 PAGPGASCAFNLSYASSTLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTGSGGSVPP 231
Query: 280 AGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDSPASGVLEFNSARGGDAV-TAPLIRN 334
GL+G G G LS Q KAT +YCL + S SG L A + T PL+ N
Sbjct: 232 QGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSN 291
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRD 393
+ YYV + G V G+AV IP S +D A G GG IVD GT TRL AY +LR+
Sbjct: 292 PHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRN 351
Query: 394 SFVRLAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+F R G P + + FDTCY +G +S VP V+ F G + LP +N +I S
Sbjct: 352 AFRR--GVSAPAAPALGGFDTCYYVNGTKS--VPAVAFVFAGGARVTLPEENVVISSTSG 407
Query: 453 GTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
G C A A ++ L+++ ++QQQ RV FD+ N RVGF+ C
Sbjct: 408 GVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELC 455
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 174/359 (48%), Gaps = 21/359 (5%)
Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y R +GTP + + +DT +D W+ C C C S +F+
Sbjct: 83 PIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVK 139
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ + C APQCK + S C + C + + YG S +L + V+ + S+
Sbjct: 140 STTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSI-AANLSQDVVTLA-TDSIPSYT 197
Query: 266 LGCGHDNEGLFVGSAGLLGLGGG---MLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
GC + G + GLLGLG G +LS T+ + ++ +YCL R SG L
Sbjct: 198 FGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPV 257
Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+ T PL++N + + YYV L VG + V IPPS + G I D GT
Sbjct: 258 GQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVF 317
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TRL AY ++RD+F + GN TS + FDTCY + PT++ F +G + L
Sbjct: 318 TRLVAPAYTAVRDAFRKRVGNATVTS-LGGFDTCYT----SPIVAPTITFMF-SGMNVTL 371
Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P N LI ++ C A A +S L++I N+QQQ R+ FD+ N+R+G C
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPC 430
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/354 (34%), Positives = 171/354 (48%), Gaps = 26/354 (7%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G+Y +GTPP ++DT SDI W+QC+ C CY + P+FDP S +Y LPC++
Sbjct: 86 GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSS 145
Query: 217 PQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCG 269
CKS+ ++C ++ C + V Y DGS + GDL+ ETV+ G+ +GC
Sbjct: 146 TTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCI 205
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV---DRDSPASGVLEFNSAR- 322
N + S G++GLGGG +SL Q+ ++ +YCL DR S L+F A
Sbjct: 206 R-NTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSK----LKFGDAAM 260
Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
GD + I K FYY+ L FSVG ++ S +G G II+D GT T
Sbjct: 261 VSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSS--SSRSSGKGNIIIDSGTTFT 318
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
L Y+ L + + + + F CY S V VP ++ HF +G + L
Sbjct: 319 VLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYK-STYDKVDVPVITAHF-SGADVKLN 376
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A N I V S C AF + S +I GN+ QQ V +DL V F P C
Sbjct: 377 ALNTFI-VASHRVVCLAFLSSQSG-AIFGNLAQQNFLVGYDLQRKIVSFKPTDC 428
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 183/364 (50%), Gaps = 25/364 (6%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
+GSG+Y G+GTP S DTGSD+ W +C C C + P + P +SSS + +
Sbjct: 87 KGSGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVA 146
Query: 214 CAAPQCKSLDVSACR--------ANRCLYQVAYGDG----SFTVGDLVTETVSFG-NSGS 260
C C L C + C Y AYG+ +T G L+TET +FG ++ +
Sbjct: 147 CGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAA 206
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL---VDRDSPAS-GVL 316
GIA GC +EG F +GL+GLG G LSL Q+ + Y L + SP S G L
Sbjct: 207 FPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSL 266
Query: 317 EFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGI 372
+ GD+ ++ PL+ N V FYYVGLTG SVGG+ VQIP F D + G GG+
Sbjct: 267 ADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGV 326
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
I D GT +T L AY +RD + G KP D G + P++ LHF
Sbjct: 327 IFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHF 386
Query: 433 GAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN-RV 488
G +DL +NYL + + C++ +S AL+IIGN+ Q V FDL+ N R+
Sbjct: 387 DGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARM 446
Query: 489 GFTP 492
F P
Sbjct: 447 LFQP 450
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 160/302 (52%), Gaps = 22/302 (7%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D ARV TL ++L + L + + P+ S P+ GAS GSG Y+ ++G
Sbjct: 66 LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIR-FPKSVSVPLNPGASIGSGNYYVKVGF 124
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
G+P R +SM++DTGS ++WLQC+PC C+ Q+DP+FDP S +Y L C + QC SL
Sbjct: 125 GSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVD 184
Query: 223 -----DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+ +N C+Y +YGD S+++G L + ++ S ++ G GCG D++GLF
Sbjct: 185 ATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244
Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLI 332
+AG+LGLG LS+ Q+ + + +YCL R G L A G P+
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRG--GGGFLSIGKASLAGSAYKFTPMT 302
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+ + Y++ LT +VGG+A+ + + + + I+D GT ITRL Y +
Sbjct: 303 TDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------IIDSGTVITRLPMSVYTPFQ 356
Query: 393 DS 394
+
Sbjct: 357 QA 358
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 183/364 (50%), Gaps = 25/364 (6%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
+GSG+Y G+GTP S DTGSD+ W +C C C + P + P +SSS + +
Sbjct: 87 KGSGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVA 146
Query: 214 CAAPQCKSLDVSACR--------ANRCLYQVAYGDG----SFTVGDLVTETVSFG-NSGS 260
C C L C + C Y AYG+ +T G L+TET +FG ++ +
Sbjct: 147 CGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAA 206
Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL---VDRDSPAS-GVL 316
GIA GC +EG F +GL+GLG G LSL Q+ + Y L + SP S G L
Sbjct: 207 FPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSL 266
Query: 317 EFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGI 372
+ GD+ ++ PL+ N V FYYVGLTG SVGG+ VQIP F D + G GG+
Sbjct: 267 ADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGV 326
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
I D GT +T L AY +RD + G KP D G + P++ LHF
Sbjct: 327 IFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHF 386
Query: 433 GAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN-RV 488
G +DL +NYL + + C++ +S AL+IIGN+ Q V FDL+ N R+
Sbjct: 387 DGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARM 446
Query: 489 GFTP 492
F P
Sbjct: 447 LFQP 450
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 132/401 (32%), Positives = 183/401 (45%), Gaps = 27/401 (6%)
Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ-GSGEYFSRIGVG 166
+ + VNT+IT + + D LK + + P+ G Y R+ +G
Sbjct: 51 KQESWVNTVIT-----MASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLG 105
Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
TP +Q MVLDT +D W+ C CT C S F P S++ L C+ QC + +
Sbjct: 106 TPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSGAQCSQVRGFS 162
Query: 227 CRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
C A + CL+ +YG S LV + ++ N + G GC + G + GLL
Sbjct: 163 CPATGSSACLFNQSYGGDSSLTATLVQDAITLAND-VIPGFTFGCINAVSGGSIPPQGLL 221
Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TAPLIRNKKVD 338
GLG G +SL Q A +YCL S SG L+ ++ T PL+RN
Sbjct: 222 GLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRP 281
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
+ YYV LTG SVG V IP D G I+D GT ITR Y ++RD F +
Sbjct: 282 SLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQ 341
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
P S + FDTC F+ P ++LHF G L LP +N LI S C +
Sbjct: 342 VNG--PISSLGAFDTC--FAATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLS 396
Query: 459 FAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A +S L++I N+QQQ R+ FD N+R+G C
Sbjct: 397 MAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 196/415 (47%), Gaps = 46/415 (11%)
Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
++ +LA + R E A ++ E TP++ GEY ++G+GTPP +F+ +D
Sbjct: 55 SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++ C LDV C + C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
Y + T G L + + G + +G+A GC + G ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226
Query: 293 TKQIKATSLAYCLVDRDSPASGVL----EFNSARGG-DAVTAPLIRNKKVDTFYYVGLTG 347
Q+ AYCL S G L + ++AR + + P+ R+ + ++YY+ L G
Sbjct: 227 VSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDG 286
Query: 348 FSVGGQAVQIP-----------------------PSLFEMDEAGDGGIIVDCGTAITRLQ 384
+G +A+ +P + + +A G+I+D + IT L+
Sbjct: 287 LLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLE 346
Query: 385 TQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
Y+ L + +RL + G+ L D V VP V+L F G+ L L
Sbjct: 347 ASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLD 405
Query: 442 AKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+G C + ++SI+GN QQQ +V ++L RV F + C
Sbjct: 406 KARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 123/361 (34%), Positives = 182/361 (50%), Gaps = 22/361 (6%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
ASQG EY VGTPP + V+DTGS I W+QC+ C +CY+Q+ PIFDP S +Y
Sbjct: 92 ASQG--EYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKT 149
Query: 212 LPCAAPQCKS-LDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFG--NSGSVK--GI 264
LPC++ C+S + +C +++ C Y + YGDGS + GDL ET++ G N SV+
Sbjct: 150 LPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNT 209
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT----SLAYCLVD--RDSPASGVLEF 318
+GCGH+N+G F G + GG ++ +YCL S +S L F
Sbjct: 210 VIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNF 269
Query: 319 NSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIV 374
A G AV+ PL+ + FYY+ L FSVG + ++ + S G+G II+
Sbjct: 270 GDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIII 329
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT +T L + Y++L + + + CY + + VP ++ HF
Sbjct: 330 DSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAHF-K 388
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
G ++L + + V + G CFAF +S +SI GN+ Q V +DL V F P
Sbjct: 389 GADVELNPISTFVQV-AEGVVCFAFH-SSEVVSIFGNLAQLNLLVGYDLMEQTVSFKPTD 446
Query: 495 C 495
C
Sbjct: 447 C 447
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 171/353 (48%), Gaps = 21/353 (5%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY R +GTPP + + DT SD+ W+QC PC C+ Q P+F+P SS+++ L C +
Sbjct: 88 GEYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDS 147
Query: 217 PQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGCGHDNE 273
C S ++ C N CLY YGDGS T G L TE++ FG+ + + GCG +N+
Sbjct: 148 QPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGSNND 207
Query: 274 GLFVGS---AGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEF---NSARGG 324
+ S G++GLG G LSL Q+ +YCL+ S ++ L+F + G
Sbjct: 208 FMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTTITGN 267
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
V+ PLI + ++Y++ L G ++G + +Q+ + +G II+D GT +T L+
Sbjct: 268 GVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV-----RTTDHTNGNIIIDLGTVLTYLE 322
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
Y++ + +R A + T + + F ++ P + F K L KN
Sbjct: 323 VNFYHNFV-TLLREALGISETKDDIPYPFDFCFPNQANITFPKIVFQFTGAKVF-LSPKN 380
Query: 445 YLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
D C A P + S+ GN+ Q +V +D +V F P C
Sbjct: 381 LFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADC 433
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 131/449 (29%), Positives = 201/449 (44%), Gaps = 40/449 (8%)
Query: 79 SFSLPLH---SREILHKTRHNDYRS------LVLSRLERDSARVNTLITKLQLAIYNVDR 129
SF++P H SR I + H D R V +R RVN L+ + R
Sbjct: 17 SFAVPGHGQPSRGIRLELTHVDARGDFTGSDRVRRAADRSHRRVNGLLAAAPPPAASTLR 76
Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC-R 188
+ A S + Y +GTPP S VLDTGSD+ W QC
Sbjct: 77 SDGGGGGACAATAAASV------HASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDA 130
Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-------------DVSACRANRCLYQ 235
PC C+ Q P++ P S +Y+ + C + C +L A C Y
Sbjct: 131 PCRRCFPQPAPLYAPARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYY 190
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
+YGDGS T G L TET +FG +V +A GCG DN G S+GL+G+G G LSL Q
Sbjct: 191 YSYGDGSSTDGVLATETFTFGAGTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQ 250
Query: 296 IKATSLAYCLV---DRDSPASGVLEFNSARGGDAVTAPLI---RNKKVDTFYYVGLTGFS 349
+ T +YC D + + L +++ A + P + + ++YY+ L G +
Sbjct: 251 LGVTKFSYCFTPFNDTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGIT 310
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
VG + I P++F + +G GG+I+D GT T L+ +A+ L + +
Sbjct: 311 VGDTLLPIDPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHL 370
Query: 410 LFDTCY---DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
C+ G +V VP + LHF G ++LP + ++ AG C ++ +
Sbjct: 371 GLSVCFAAPQGRGPEAVDVPRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIV-SARGM 428
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S++G++QQQ V +D+ + + F P C
Sbjct: 429 SVLGSMQQQNMHVRYDVGRDVLSFEPANC 457
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 171/351 (48%), Gaps = 44/351 (12%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+GEY + +GTPP ++DTGSD+ W QCRPCT CY+Q P+FDPK SS+Y C
Sbjct: 89 AGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCG 148
Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
C +L D S + +C ++ +Y DGSFT G+L +ET++ G S G A GCG
Sbjct: 149 TSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCG 208
Query: 270 HDNEGLF-VGSAGLLGLGGGMLSLTKQIKATS---LAYCL--VDRDSPASGVLEFNSA-- 321
H + G+F S+G++GLGGG LSL Q+K+T +YCL V DS S + F ++
Sbjct: 209 HSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGR 268
Query: 322 -RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
G V+ PL K G+S + E +G IIVD GT
Sbjct: 269 VSGYGTVSTPLRLPYK----------GYS-------------KKTEVEEGNIIVDSGTTY 305
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
T L + Y+ L S + +F CY+ + + P ++ HF
Sbjct: 306 TFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAPIITAHFKDANVELQ 363
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
P ++ + CF APTS + ++GN+ Q V FDL R GF+
Sbjct: 364 PLNTFMRMQEDL--VCFTVAPTSD-IGVLGNLAQVNFLVGFDLRKKR-GFS 410
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 53/130 (40%), Gaps = 4/130 (3%)
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
E +G IIVD GT T L + Y L +S + + CY+ + + +
Sbjct: 414 EVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVDQIDA 472
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
P ++ HF P +L + CF PTS + I+GN+ Q V FDL
Sbjct: 473 PIITAHFKDANVELQPWNTFLRMQEDL--VCFTVLPTSD-IGILGNLAQVNFLVGFDLRK 529
Query: 486 NRVGFTPNKC 495
RV F C
Sbjct: 530 KRVSFKAADC 539
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 167/354 (47%), Gaps = 22/354 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y + +GTPP + + DTGSD+ W C PC CY+Q +P+FDP+ S++Y + C +
Sbjct: 70 GHYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDS 129
Query: 217 PQCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
C LD C RC Y AY + T G L ET++ G S +KGI GCGH+
Sbjct: 130 KLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHN 189
Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQIKAT----SLAYCLV--DRDSPASGVLEF---NSA 321
N G F G++GLGGG +SL Q+ ++ + CLV D S + F +
Sbjct: 190 NTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKV 249
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
G V+ PL+ K+ T Y+V L G SV + S +++ G + +D GT T
Sbjct: 250 SGKGVVSTPLVA-KQDKTPYFVTLLGISVENTYLHFNGSSQNVEK---GNMFLDSGTPPT 305
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
L TQ Y+ + VR +KP + + ++R P ++ HF P
Sbjct: 306 ILPTQLYDQVVAQ-VRSEVAMKPVTDDPDLGPQLCYRTKNNLRGPVLTAHFEGADVKLSP 364
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ P D G FC F TSS + GN Q + FDL V F P C
Sbjct: 365 TQTFISPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDC 416
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 132/415 (31%), Positives = 191/415 (46%), Gaps = 37/415 (8%)
Query: 106 LERDSAR---VNTLITKLQLAIYNVDR-----HELKPAEAQILPEDFSTPVVSGASQGSG 157
+ RDS + N T Q + V R H P + + F+ S G
Sbjct: 34 INRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDI---FTDTAQSEMISNQG 90
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY + +GTP + DTGSD+ W QC+PC +CY+Q P+FDPK+SS+Y + C+
Sbjct: 91 EYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTK 150
Query: 218 QCKSLDVSAC---RANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCG 269
QC L A N+ C Y +YGD SFT G++ +T++ G++ + +GCG
Sbjct: 151 QCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCG 210
Query: 270 HDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNS--- 320
H+N G F + GG +SL Q+ +T +YCLV S A S L F S
Sbjct: 211 HNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGI 270
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
GG + PLI +K DTFY++ L SVG + ++ P S F E G II+D GT +
Sbjct: 271 VSGGGVQSTPLI-SKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSE---GNIIIDSGTTL 326
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
T ++ L + + CY ++ P+++ HF G + L
Sbjct: 327 TLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDA--DLKFPSITAHFD-GADVKL 383
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N + V S CFAF P +S +I GN+ Q V +DL V F P C
Sbjct: 384 NPLNTFVQV-SDTVLCFAFNPINSG-AIFGNLAQMNFLVGYDLEGKTVSFKPTDC 436
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 157/335 (46%), Gaps = 22/335 (6%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA---C 227
+MVLDT SD+ W+QC PC CY Q D ++DP SSS C +P C L A
Sbjct: 145 TMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCT 204
Query: 228 RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV---GSAGLLG 284
N+C Y+V Y DG+ T G +++ ++ + +V+ GC H +G F +AG++
Sbjct: 205 NNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMA 264
Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD-TF 340
LGGG SL Q AT ++C L V P+++N + TF
Sbjct: 265 LGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTF 324
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
Y V L +V GQ + +PP++F G +D TAITRL AY +LR +F
Sbjct: 325 YMVRLEAIAVAGQRIAVPPTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMA 378
Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
+P DTCYD +G+RS +P ++L F A++L L G F
Sbjct: 379 MYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAG 434
Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P IIGN+Q Q V +++ VGF C
Sbjct: 435 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 157/335 (46%), Gaps = 22/335 (6%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA---C 227
+MVLDT SD+ W+QC PC CY Q D ++DP SSS C +P C L A
Sbjct: 170 TMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCT 229
Query: 228 RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV---GSAGLLG 284
N+C Y+V Y DG+ T G +++ ++ + +V+ GC H +G F +AG++
Sbjct: 230 NNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMA 289
Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD-TF 340
LGGG SL Q AT ++C L V P+++N + TF
Sbjct: 290 LGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTF 349
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
Y V L +V GQ + +PP++F G +D TAITRL AY +LR +F
Sbjct: 350 YMVRLEAIAVAGQRIAVPPTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMA 403
Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
+P DTCYD +G+RS +P ++L F A++L L G F
Sbjct: 404 MYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAG 459
Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P IIGN+Q Q V +++ VGF C
Sbjct: 460 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 129/390 (33%), Positives = 185/390 (47%), Gaps = 36/390 (9%)
Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
NV+R + A A I E + V Q + VG PP + +DTGSD+ W+
Sbjct: 62 NVERRRTRRA-AFITDEIQANMVADDRGQA---FLVNFSVGRPPVPQLVGIDTGSDLLWV 117
Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRANRCLYQVAYGDGSFT 244
QCRPC +C++QS PIFDP SS+Y L +P C S N+C+Y +Y DGS +
Sbjct: 118 QCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTS 177
Query: 245 VGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT 299
G+L TE + F S +V + GCGH N G F G +G+LGL G S+ ++ +
Sbjct: 178 SGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GS 236
Query: 300 SLAYCLVDRDSPASGVLEFNSARGGDAV-----TAPLIRNKKVDTFYYVGLTGFSVGGQA 354
+YC+ D P N GD V + P + FYYV L G SVG
Sbjct: 237 RFSYCIGDLFDPH---YTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETR 290
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA-GNLKPTSGVALFDT 413
+ I P +F+ E+G GG+++D GT T L ++ L + RL G+ + ++ T
Sbjct: 291 LDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ----VIYRT 346
Query: 414 -----CYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SA 465
CY +R P ++ HF G L L A N L + FC A ++ +
Sbjct: 347 IPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNI 405
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S+IG + QQ V++DL RV F C
Sbjct: 406 GSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 121/415 (29%), Positives = 195/415 (46%), Gaps = 46/415 (11%)
Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
++ +LA + R E A ++ E TP++ GEY ++G+GTPP +F+ +D
Sbjct: 55 SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++ C LDV C + C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
Y + T G L + + G + +G+A GC + G ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226
Query: 293 TKQIKATSLAYCLVDRDSPASGVL----EFNSARGG-DAVTAPLIRNKKVDTFYYVGLTG 347
Q+ AYCL S G L + ++AR + + P+ R+ + ++YY+ L G
Sbjct: 227 VSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDG 286
Query: 348 FSVGGQAVQIP-----------------------PSLFEMDEAGDGGIIVDCGTAITRLQ 384
+G + + +P + + +A G+I+D + IT L+
Sbjct: 287 LLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLE 346
Query: 385 TQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
Y+ L + +RL + G+ L D V VP V+L F G+ L L
Sbjct: 347 ASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLD 405
Query: 442 AKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+G C + ++SI+GN QQQ +V ++L RV F + C
Sbjct: 406 KARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 166/319 (52%), Gaps = 28/319 (8%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
++++D+GSD++W+QC+PC C++Q DP+FDP S++Y+ +PC + C L C
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCS 228
Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
AN +C + + YGDGS G + ++ G ++G GC H + G AG L L
Sbjct: 229 ANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLAL 288
Query: 286 GGGMLSLTKQIK---ATSLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
GGG SL +Q +YCL S GV + V+ PL+ +
Sbjct: 289 GGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAP 348
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
TFY V L V G+ + +PP++F ++D T I+RL AY +LR +F
Sbjct: 349 TFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSA 402
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
+ V++ DTCYDF+G+RS+ +P+++L F G ++L A L+ G+ C A
Sbjct: 403 MTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLA 456
Query: 459 FAPTSS--ALSIIGNVQQQ 475
FAPT+S IGNVQQ+
Sbjct: 457 FAPTASDRMPGFIGNVQQK 475
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 38/282 (13%)
Query: 218 QCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
Q K+L+ C AN +C + + YGDGS G + ++ G D +GL
Sbjct: 473 QQKTLE--GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----------DRQGLP 520
Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
+ +A G + S +SL + + GV +A V+ PL+ +
Sbjct: 521 LRTATQYGR---VFSYCIPPSPSSLGFITL-------GVPPQRAALVPTFVSTPLLSSSS 570
Query: 337 VD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
+ TFY V L V G+ + +PP++F ++ T I+RL AY +LR +F
Sbjct: 571 MPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAF 624
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
R + V++ DTCYDF+G+RS+ +P+++L F G ++L A L+
Sbjct: 625 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------QG 678
Query: 456 CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFAPT++ IGNVQQ+ V +D+ + F C
Sbjct: 679 CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 135/368 (36%), Positives = 179/368 (48%), Gaps = 58/368 (15%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
+ P G G+ Y +GTP +M +DTGSD++W+QC+PC+ CY Q DP+F
Sbjct: 126 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLF 185
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
DP SSSY+ +PC P C L G S G+V
Sbjct: 186 DPAQSSSYAAVPCGGPVCAGL-----------------------GIYAASACSAAQCGAV 222
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF 318
+G GCGH GLF G GLLGLG SL +Q T +YCL + S A G L
Sbjct: 223 QGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTL 281
Query: 319 NSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
GG + AP L+ + T+Y V LTG SVGGQ + +P S F
Sbjct: 282 G--VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV----- 334
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSL 430
VD GT +TRL AY +LR +F +A PT+ + DTCY+F+G +V +P V+L
Sbjct: 335 -VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVAL 393
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNR 487
FG+G + L A L +F C AFAP+ S ++I+GNVQQ+ V D
Sbjct: 394 TFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTS 444
Query: 488 VGFTPNKC 495
VGF P+ C
Sbjct: 445 VGFKPSSC 452
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 127/370 (34%), Positives = 177/370 (47%), Gaps = 25/370 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG Q Y R G+GTP +Q + LDT +D W C PC C S F P
Sbjct: 67 SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------CLYQVAYGDGSFTVGDLVTETVSFG 256
+SSSY+ LPCA+ C + C AN+ C + + D SF L ++T+ G
Sbjct: 123 SSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-SLGSDTLRLG 181
Query: 257 NSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDS 310
++ G A GC G + GLLGLG G +SL Q +T +YCL R
Sbjct: 182 KD-AIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSY 240
Query: 311 PASGVLEFNSA-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
SG L +A + + PL+ N + YYV +TG SVG V++P F D A
Sbjct: 241 YFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATG 300
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G ++D GT ITR Y +LR+ F R + + FDTC++ + + P V+
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVT 360
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLAN 485
LH G L LP +N LI + C A A ++ ++++ N+QQQ RV D+A
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420
Query: 486 NRVGFTPNKC 495
+RVGF C
Sbjct: 421 SRVGFAREPC 430
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 171/368 (46%), Gaps = 22/368 (5%)
Query: 141 PEDFSTPVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP 199
P+ S P+ SG G Y R+ +GTP + MVLDT D W+ PC +C S P
Sbjct: 80 PKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWV---PCADCAGCSSP 136
Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFTVGDLVTETVSFG 256
F P TSS+Y+ L C+ PQC + +C C + YG S L +++
Sbjct: 137 TFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLA 196
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPA- 312
++ + GC + G + GLLGLG G +SL Q + + +YC S
Sbjct: 197 -VDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYF 255
Query: 313 SGVLEFNS-ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
SG L + + T PL+RN T YYV LTG SVG V + P L D G
Sbjct: 256 SGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAG 315
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
I+D GT ITR Y ++RD F + P + + FDTC F+ P V+ H
Sbjct: 316 TIIDSGTVITRFVEPVYAAIRDEFRKQVKG--PFATIGAFDTC--FAATNEDIAPPVTFH 371
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNR 487
F G L LP +N LI + C A A +S L++I N+QQQ R+ FD+ N+R
Sbjct: 372 F-TGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSR 430
Query: 488 VGFTPNKC 495
+G C
Sbjct: 431 LGIARELC 438
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/390 (33%), Positives = 185/390 (47%), Gaps = 36/390 (9%)
Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
NV+R + A A I E + V Q + VG PP + +DTGSD+ W+
Sbjct: 30 NVERRRTRRA-AFITDEIQANMVADDRGQA---FLVNFSVGRPPVPQLVGIDTGSDLLWV 85
Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRANRCLYQVAYGDGSFT 244
QCRPC +C++QS PIFDP SS+Y L +P C S N+C+Y +Y DGS +
Sbjct: 86 QCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTS 145
Query: 245 VGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT 299
G+L TE + F S +V + GCGH N G F G +G+LGL G S+ ++ +
Sbjct: 146 SGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GS 204
Query: 300 SLAYCLVDRDSPASGVLEFNSARGGDAV-----TAPLIRNKKVDTFYYVGLTGFSVGGQA 354
+YC+ D P N GD V + P + FYYV L G SVG
Sbjct: 205 RFSYCIGDLFDPH---YTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETR 258
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA-GNLKPTSGVALFDT 413
+ I P +F+ E+G GG+++D GT T L ++ L + RL G+ + ++ T
Sbjct: 259 LDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ----VIYRT 314
Query: 414 -----CYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SA 465
CY +R P ++ HF G L L A N L + FC A ++ +
Sbjct: 315 IPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNI 373
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S+IG + QQ V++DL RV F C
Sbjct: 374 GSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 166/319 (52%), Gaps = 28/319 (8%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
++++D+GSD++W+QC+PC C++Q DP+FDP S++Y+ +PC + C L C
Sbjct: 78 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCS 137
Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
AN +C + + YGDGS G + ++ G ++G GC H + G AG L L
Sbjct: 138 ANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLAL 197
Query: 286 GGGMLSLTKQIK---ATSLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
GGG SL +Q +YCL S GV + V+ PL+ +
Sbjct: 198 GGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAP 257
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
TFY V L V G+ + +PP++F ++D T I+RL AY +LR +F
Sbjct: 258 TFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSA 311
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
+ V++ DTCYDF+G+RS+ +P+++L F G ++L A L+ G+ C A
Sbjct: 312 MTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLA 365
Query: 459 FAPTSS--ALSIIGNVQQQ 475
FAPT+S IGNVQQ+
Sbjct: 366 FAPTASDRMPGFIGNVQQK 384
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 38/282 (13%)
Query: 218 QCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
Q K+L+ C AN +C + + YGDGS G + ++ G D +GL
Sbjct: 382 QQKTLE--GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----------DRQGLP 429
Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
+ +A G + S +SL + + GV +A V+ PL+ +
Sbjct: 430 LRTATQYGR---VFSYCIPPSPSSLGFITL-------GVPPQRAALVPTFVSTPLLSSSS 479
Query: 337 VD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
+ TFY V L V G+ + +PP++F ++ T I+RL AY +LR +F
Sbjct: 480 MPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAF 533
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
R + V++ DTCYDF+G+RS+ +P+++L F G ++L A L+
Sbjct: 534 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------QG 587
Query: 456 CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFAPT++ IGNVQQ+ V +D+ + F C
Sbjct: 588 CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/429 (30%), Positives = 207/429 (48%), Gaps = 49/429 (11%)
Query: 95 HNDYRSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQILPEDFS---TPV 148
H + L + + RD ++ + +TK Q A YNV + ++FS
Sbjct: 22 HASKKGLSIEMIHRDFSKSPLYHPTVTKFQRA-YNVVHRSIN--RVNYFTKEFSLNKNQP 78
Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
VS + GEY VGTPP + +DTGS+I WLQC+PC C+ Q+ PIF+P SSS
Sbjct: 79 VSTLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSS 138
Query: 209 YSPLPCAAPQCKSLDVS--ACR--ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGS 260
Y +PC + CK + + +C + C Y + YG + + GDL ++++ G+S
Sbjct: 139 YKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVL 198
Query: 261 VKGIALGCGH-----DNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLV--DRD 309
I +GCGH DN S+G++G+G G +SL KQ+ ++S+ +YCL+ + D
Sbjct: 199 FPNIVIGCGHINVLQDNS----QSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSD 254
Query: 310 SPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
S +S L F G V+ P+++ + +Y++ L FSVG ++ E
Sbjct: 255 SNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYG----ERSN 310
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
A I++D GT +T L + L V+L P ++L CY+ +G + +
Sbjct: 311 ASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSL---CYNTTG-KQL 366
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
VP ++ HF G + L + P + G CF F +S+ L I GN+ Q + +DL
Sbjct: 367 NVPDITAHFN-GADVKLNSNGTFFPFED-GIMCFGFI-SSNGLEIFGNIAQNNLLIDYDL 423
Query: 484 ANNRVGFTP 492
+ F P
Sbjct: 424 EKEIISFKP 432
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/390 (33%), Positives = 185/390 (47%), Gaps = 36/390 (9%)
Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
NV+R + A A I E + V Q + VG PP + +DTGSD+ W+
Sbjct: 30 NVERRRTRRA-AFIXDEIQANMVADDRGQA---FLVNFSVGRPPVPQLVGIDTGSDLLWV 85
Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRANRCLYQVAYGDGSFT 244
QCRPC +C++QS PIFDP SS+Y L +P C S N+C+Y +Y DGS +
Sbjct: 86 QCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTS 145
Query: 245 VGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT 299
G+L TE + F S +V + GCGH N G F G +G+LGL G S+ ++ +
Sbjct: 146 SGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GS 204
Query: 300 SLAYCLVDRDSPASGVLEFNSARGGDAV-----TAPLIRNKKVDTFYYVGLTGFSVGGQA 354
+YC+ D P N GD V + P + FYYV L G SVG
Sbjct: 205 RFSYCIGDLFDPH---YTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETR 258
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA-GNLKPTSGVALFDT 413
+ I P +F+ E+G GG+++D GT T L ++ L + RL G+ + ++ T
Sbjct: 259 LDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ----VIYRT 314
Query: 414 -----CYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SA 465
CY +R P ++ HF G L L A N L + FC A ++ +
Sbjct: 315 IPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNI 373
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S+IG + QQ V++DL RV F C
Sbjct: 374 GSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 114/280 (40%), Positives = 150/280 (53%), Gaps = 19/280 (6%)
Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLG 286
C CLY V YGDGS+T+G +T++ + ++KG GCG NEGLF +AGLLGLG
Sbjct: 16 CSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLG 75
Query: 287 GGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF----NSARGGDAVTAPLIRNKKVDT 339
G SL Q A+C R S +G LEF + A T P++ + T
Sbjct: 76 RGKTSLPVQTYDKYGGVFAHCFPARSS-GTGYLEFGPGSSPAVSAKLSTTPMLIDTG-PT 133
Query: 340 FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV--R 397
FYYVG+TG VGG+ + IP S+F G IVD GT ITRL AY+SLR +F
Sbjct: 134 FYYVGMTGIRVGGKLLPIPQSVFAA-----AGTIVDSGTVITRLPPAAYSSLRSAFAASM 188
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
A K ++L DTCYD +G V +PTVSL F G +LD+ A +I S C
Sbjct: 189 AARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASG-IIYAASVSQACL 247
Query: 458 AFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
FA +A ++I+GN Q + V +D+A+ VGF P C
Sbjct: 248 GFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 131/401 (32%), Positives = 183/401 (45%), Gaps = 27/401 (6%)
Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ-GSGEYFSRIGVG 166
+ + VNT+IT + + D LK + + P+ G Y R+ +G
Sbjct: 51 KQESWVNTVIT-----MASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLG 105
Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
TP +Q MVLDT +D W+ PC+ C S F P S++ L C+ QC + +
Sbjct: 106 TPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFS 162
Query: 227 CRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
C A + CL+ +YG S LV + ++ N + G GC + G + GLL
Sbjct: 163 CPATGSSACLFNQSYGGDSSLTATLVQDAITLAND-VIPGFTFGCINAVSGGSIPPQGLL 221
Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TAPLIRNKKVD 338
GLG G +SL Q A +YCL S SG L+ ++ T PL+RN
Sbjct: 222 GLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRP 281
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
+ YYV LTG SVG V IP D G I+D GT ITR Y ++RD F +
Sbjct: 282 SLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQ 341
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
P S + FDTC F+ P ++LHF G L LP +N LI S C +
Sbjct: 342 VNG--PISSLGAFDTC--FAATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLS 396
Query: 459 FAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A +S L++I N+QQQ R+ FD N+R+G C
Sbjct: 397 MAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 172/340 (50%), Gaps = 35/340 (10%)
Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSACR 228
++VLD+ SD+ W+QC PC C+ Q D +DP S + + C++P C +L + C
Sbjct: 30 TVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCA 89
Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGG 287
N+C Y V Y DGS T G + + ++ +V G GC H +G F +AG++ LGG
Sbjct: 90 NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALGG 149
Query: 288 G---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA--VTAPLIRNKKVDTFYY 342
G +LS T + +YC+ S SG R + V P++R ++ TFY
Sbjct: 150 GPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLGVPRRASSRYVVTPMVRFRQAATFYG 208
Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL 402
V L +VGGQ + + P++F G ++D TAITRL AY +LR +F
Sbjct: 209 VLLRTITVGGQRLGVAPAVFA------AGSVLDSRTAITRLPPTAYQALRAAFRSSMTMY 262
Query: 403 KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF---CFAF 459
+ DTCYDF+G+ ++R+P +SL F +N ++P+D +G C AF
Sbjct: 263 RSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLPLDPSGILFNDCLAF 313
Query: 460 APTSSA----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
TS+A ++G+VQQQ V +D+ VGF C
Sbjct: 314 --TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 177/368 (48%), Gaps = 37/368 (10%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTPP++ + +DT +D W+ C C C + P F+P +S+++ P+PC AP
Sbjct: 94 YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPCGAPP 152
Query: 219 CKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
C +C + N C + ++YGD S N G +KG GC +
Sbjct: 153 CSQAPNPSCTSLAKSKNSCGFSLSYGDSSLDATLSQDNLAVTANGGVIKGYTFGCLTKSN 212
Query: 274 GLFVGS---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA---SGVLEFNSARGGDAV 327
G + GL G ++ TK I + +YCL A SG L R G
Sbjct: 213 GSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLG--RKGQPA 270
Query: 328 -----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
T PL+ + + YYV +TG +G ++V IPPS D A G ++D GT R
Sbjct: 271 PEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMFAR 330
Query: 383 LQTQAYNSLRDSF-VRLAGNLK---------PTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
L AY ++RD R+AG+L+ S + FDTCY+ S +V P V+L F
Sbjct: 331 LAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVS---TVAWPAVTLVF 387
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPT-----SSALSIIGNVQQQGTRVSFDLANNR 487
G G + LP +N +I T C A A + ++AL++IG++QQQ RV FD+ N R
Sbjct: 388 GGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNAR 447
Query: 488 VGFTPNKC 495
VGF +C
Sbjct: 448 VGFARERC 455
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 133/358 (37%), Positives = 175/358 (48%), Gaps = 58/358 (16%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIFDPKTSSSYSP 211
G+ Y +GTP +M +DTGSD++W+QC+PC CY Q DP+FDP SSSY+
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAA 195
Query: 212 LPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
+PC P C L G S G+V+G GCGH
Sbjct: 196 VPCGGPVCAGL-----------------------GIYAASACSAAQCGAVQGFFFGCGHA 232
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVT 328
GLF G GLLGLG SL +Q T +YCL + S A G L GG +
Sbjct: 233 QSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTLGV--GGPSGA 289
Query: 329 AP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
AP L+ + T+Y V LTG SVGGQ + +P S F VD GT +TR
Sbjct: 290 APGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTR 343
Query: 383 LQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L AY +LR +F +A PT+ + DTCY+F+G +V +P V+L FG+G + L
Sbjct: 344 LPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTL 403
Query: 441 PAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A L +F C AFAP+ S ++I+GNVQQ+ V D VGF P+ C
Sbjct: 404 GADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 120/356 (33%), Positives = 169/356 (47%), Gaps = 33/356 (9%)
Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
G G Y I VGTP FS+V DTGSD+ W QC PCT+C+QQ P F P +SS++S LP
Sbjct: 81 NGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLP 140
Query: 214 CAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
C + C+ L + C A C+Y YG G +T G L TET+ G++ S +A GC +
Sbjct: 141 CTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDA-SFPSVAFGCSTE 198
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVT 328
N GL G L LG G S YCL + + + F S G+ +
Sbjct: 199 N-GL-----GQLDLGVGRFS-----------YCLRSGSAAGASPILFGSLANLTDGNVQS 241
Query: 329 APLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGTAITRLQTQ 386
P + N V ++YYV LTG +VG + + S F + G GG IVD GT +T L
Sbjct: 242 TPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 301
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKN 444
Y ++ +F+ ++ +G D C+ + G + VP++ L F G +P
Sbjct: 302 GYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYF 361
Query: 445 YLIPVDSAGTF---CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ DS G+ C P +S+IGNV Q + +DL F P C
Sbjct: 362 AGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 121/374 (32%), Positives = 179/374 (47%), Gaps = 40/374 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY + +GTPP + DTGSD+ WLQ +PC +CY Q PIFDP S+++ LPC
Sbjct: 78 GEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTT 137
Query: 217 PQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGCGHDN 272
C +LD SA C Y +YGD S+T G L ++TV+ GN S ++ +A GCG N
Sbjct: 138 APCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRNVAFGCGTRN 197
Query: 273 EGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLV---------DRDSPASGVLEF- 318
G F + GG LS Q+ T +YCL+ DSPA+ + F
Sbjct: 198 GGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVFG 257
Query: 319 -------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM------- 364
+S G T PL+ NK+ T+YY+ + +VG + + S +
Sbjct: 258 DNPVFSSSSTNGVVFATTPLV-NKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGS 316
Query: 365 -DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLR 421
+G II+D GT +T L+ + Y +L + V ++ + V ++F C+ SG
Sbjct: 317 KSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEI-KMERVNDVKNSMFSLCFK-SGKE 374
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSF 481
V +P + +HF G ++L N + + G CF PT+ + I GN+ Q V +
Sbjct: 375 EVELPLMKVHFRGGADVELKPVNTFVRAEE-GLVCFTMLPTND-VGIYGNLAQMNFVVGY 432
Query: 482 DLANNRVGFTPNKC 495
DL V F P C
Sbjct: 433 DLGKRTVSFLPADC 446
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 120/358 (33%), Positives = 175/358 (48%), Gaps = 36/358 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G + + GTPP++F ++LDTGS I W QC+ C C + S FD SS+YS C
Sbjct: 125 GNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSC-- 182
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
+ + N Y + YGD S +VG+ +T++ S + GCG +NEG F
Sbjct: 183 -------IPSTVGN--TYNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDF 233
Query: 277 -VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL------EFNSARGGDA 326
G+ G+LGLG G LS Q + +YCL + +S S + + +S +
Sbjct: 234 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKATSQSSSLKFTSL 293
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
V P + +Y+V L SVG + + IP S+F G I+D GT ITRL +
Sbjct: 294 VNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQR 348
Query: 387 AYNSLRDSFVRLAGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
AY++L+ +F + ++G + DTCY+ SG + V +P LHFG G + L
Sbjct: 349 AYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNG 408
Query: 443 KNYLIPVDSAGTFCFAFAPTSSA-----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
K + D A C AFA S + L+IIGN QQ V +D+ R+GF N C
Sbjct: 409 KRVVWGND-ASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 117/374 (31%), Positives = 180/374 (48%), Gaps = 35/374 (9%)
Query: 146 TPVVSGASQGSG--EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
TP+V+ S G EY G G P ++F + DT ++ L+C+PC DP F+P
Sbjct: 73 TPMVAPISVAPGALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEP 131
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
SSS++ +PC +P+C C C + + +G+ + G LV +T++ S + G
Sbjct: 132 SRSSSFAAIPCGSPECAV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAG 187
Query: 264 IALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPAS 313
GC G D + F G+ GL+ L SL ++ A + +YCL + +S
Sbjct: 188 FTFGCIEVGADAD-TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS 246
Query: 314 -GVLEFNSAR----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
G L ++R GGD AP+ N Y+V L G SVGG+ + +PP++F
Sbjct: 247 RGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAH--- 303
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G +++ T T L AY +LRD+F R + DTCY+ +GL S+ VPTV
Sbjct: 304 --GTLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTV 361
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFC-------FAFAPTSSALSIIGNVQQQGTRVSF 481
+L F G L+L + + D + F A + +S+IG + Q+ T V +
Sbjct: 362 ALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVY 421
Query: 482 DLANNRVGFTPNKC 495
DL RVGF P +C
Sbjct: 422 DLRGGRVGFIPGRC 435
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 126/370 (34%), Positives = 176/370 (47%), Gaps = 25/370 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG Q Y R G+GTP +Q + LDT +D W C PC C S F P
Sbjct: 67 SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------CLYQVAYGDGSFTVGDLVTETVSFG 256
+SSSY+ LPCA+ C + C AN+ C + + D SF L ++T+ G
Sbjct: 123 SSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-SLGSDTLRLG 181
Query: 257 NSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDS 310
++ G A GC G + GLLGLG G +SL Q + +YCL R
Sbjct: 182 KD-AIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSY 240
Query: 311 PASGVLEFNSA-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
SG L +A + + PL+ N + YYV +TG SVG V++P F D A
Sbjct: 241 YFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATG 300
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G ++D GT ITR Y +LR+ F R + + FDTC++ + + P V+
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVT 360
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLAN 485
LH G L LP +N LI + C A A ++ ++++ N+QQQ RV D+A
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420
Query: 486 NRVGFTPNKC 495
+RVGF C
Sbjct: 421 SRVGFAREPC 430
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 126/370 (34%), Positives = 176/370 (47%), Gaps = 25/370 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG Q Y R G+GTP +Q + LDT +D W C PC C S F P
Sbjct: 67 SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------CLYQVAYGDGSFTVGDLVTETVSFG 256
+SSSY+ LPCA+ C + C AN+ C + + D SF L ++T+ G
Sbjct: 123 SSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-SLGSDTLRLG 181
Query: 257 NSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDS 310
++ G A GC G + GLLGLG G +SL Q + +YCL R
Sbjct: 182 KD-AIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSY 240
Query: 311 PASGVLEFNSA-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
SG L +A + + PL+ N + YYV +TG SVG V++P F D A
Sbjct: 241 YFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATG 300
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G ++D GT ITR Y +LR+ F R + + FDTC++ + + P V+
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVT 360
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLAN 485
LH G L LP +N LI + C A A ++ ++++ N+QQQ RV D+A
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420
Query: 486 NRVGFTPNKC 495
+RVGF C
Sbjct: 421 SRVGFAREPC 430
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 136/437 (31%), Positives = 206/437 (47%), Gaps = 51/437 (11%)
Query: 80 FSLPLHSREI----LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
FSL L R+ L+ H D+ L + R +RVN TK ++
Sbjct: 34 FSLNLIHRDSPLSPLYNPNHTDFDRL-RNAFSRSISRVNVFKTKAV---------DINSF 83
Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
+ ++P GEYF ++ +GTP + ++ DTGSD+ W+QC PC CY+
Sbjct: 84 QNDLVPN-------------GGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYR 130
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVS--ACR--ANRCLYQVAYGDGSFTVGDLVTE 251
Q P+FDP SSSY + C + C +LDVS AC N C Y +YGD S+T G+L TE
Sbjct: 131 QKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATE 190
Query: 252 TVSFGNSGS----VKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAY 303
+ G++ S + I GCG N G F +G++GLGGG LSL Q+ + +Y
Sbjct: 191 KFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSY 250
Query: 304 CLV--DRDSPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP 358
CLV S + ++F + G V+ PL+ +K+ DT+YYV L SVG + +
Sbjct: 251 CLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLV-SKQPDTYYYVTLEAISVGNKRLPYT 309
Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS 418
L + G +I+D GT +T L ++ + L + + LF C+ +
Sbjct: 310 NGLLNGN-VEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSA 368
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
G + +P +++HF + L N + D CF +S+ + I GN+ Q
Sbjct: 369 G--DIDLPVIAVHFNDAD-VKLQPLNTFVKADE-DLLCFTMI-SSNQIGIFGNLAQMDFL 423
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL V F P C
Sbjct: 424 VGYDLEKRTVSFKPTDC 440
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 110/369 (29%), Positives = 171/369 (46%), Gaps = 25/369 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQ-FSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
+ PV + + EY + +G P Q + LDTGSD+ W QC PC EC+ Q P FD
Sbjct: 78 TAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDT 137
Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF-----GNS 258
S++ + C+ P C + C + C Y YGDGS + G + ++ +F G
Sbjct: 138 AASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGK 197
Query: 259 GSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDR----DSPA- 312
+V I GCG N G F+ + G+ G G G LSL Q+K +YC R SP
Sbjct: 198 VTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAKSSPVF 257
Query: 313 -SGVLEFNSARGGDAVTAPLIRN---KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
G + + G ++ P +R+ ++ Y + G +VG + +P E+ G
Sbjct: 258 LGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVP----EIKADG 313
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G +D GT IT + L+ +F+ A L D C+ + G ++ +P +
Sbjct: 314 SGATFIDSGTDITTFPDAVFRQLKSAFIAQAA-LPVNKTADEDDICFSWDGKKTAAMPKL 372
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANN 486
H G DLP +NY+ +G C A + TS + ++IGN QQQ T + +DLA
Sbjct: 373 VFHL-EGADWDLPRENYVTEDRESGQVCVAVS-TSGQMDRTLIGNFQQQNTHIVYDLAAG 430
Query: 487 RVGFTPNKC 495
++ P +C
Sbjct: 431 KLLLVPAQC 439
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/352 (33%), Positives = 179/352 (50%), Gaps = 24/352 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY + +GTPP V DTGS++ W QC+PC +CY Q DP+FDPK SS+Y + C++
Sbjct: 92 GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSS 151
Query: 217 PQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCG 269
QC +L+ A C Y V+Y DGS+T+G +T++ G++ + +K I +GCG
Sbjct: 152 SQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCG 211
Query: 270 HDNEGLFVGS-AGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSA--RG 323
+N F +G++GLGGG +SL KQ+ + +YCLV + S + +A G
Sbjct: 212 QNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVSG 271
Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
V+ PL+ K DTFYY+ L SVG + +Q P D G +++D GT +T L
Sbjct: 272 PGTVSTPLVV-KSRDTFYYLTLKSISVGSKNMQTP------DSNIKGNMVIDSGTTLTLL 324
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
+ Y + ++ L K CY+ + + +P +++HF G + L
Sbjct: 325 PVKYYIEIENAVASLINADKSKDERIGSSLCYNATA--DLNIPVITMHF-EGADVKLYPY 381
Query: 444 NYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N V + C AF + I GNV Q+ V +D A+ + F P C
Sbjct: 382 NSFFKV-TEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDC 432
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 128/391 (32%), Positives = 191/391 (48%), Gaps = 49/391 (12%)
Query: 108 RDSARVNTLITKL-QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVG 166
RD +RV+ + +K Q A N+ H ++ ED G + + G
Sbjct: 126 RDESRVSFINSKFNQYAPENLKDHT---PNNKLFDED-------------GNFLVDVAFG 169
Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
TPP++F+++LDTGS I W QC+PC C + S FDP S +YS C + +
Sbjct: 170 TPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSC---------IPS 220
Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGL 285
N Y + YGD S +VG+ +T++ +S GCG +NEG F G+ G+LGL
Sbjct: 221 TVGN--TYNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFGCGRNNEGDFGSGADGMLGL 278
Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL------EFNSARGGDAVTAPLIRNKK 336
G G LS Q + +YCL + DS S + + +S + V P +
Sbjct: 279 GQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNGPGTSGLE 338
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
+Y+V L SVG + + IP S+F G I+D GT ITRL +AY++L+ +F
Sbjct: 339 ESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAYSALKAAFK 393
Query: 397 RLAGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+ ++G + DTCY+ SG + V +P + LHFG G + L K +I + A
Sbjct: 394 KAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKR-VIWGNDA 452
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
C AFA +S L+IIGN QQ V +D+
Sbjct: 453 SRLCLAFA-GNSELTIIGNRQQVSLTVLYDI 482
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 177/367 (48%), Gaps = 39/367 (10%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC--YQQSDPIFDPKTSSSYSPL 212
G GEY + +GTPP+ ++DTGSD+ WL+C C C + IF SSSY L
Sbjct: 1 GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60
Query: 213 PCAAPQCKSLDVSA----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------V 261
PC + C + + C C Y+ YGDGS T GD+ ++ +SF + G+
Sbjct: 61 PCNSTHCSGMSSAGIGPRCEET-CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSP--ASGVL 316
G GCG +G + + GL+GLG SL +Q+ +YCLV DSP A L
Sbjct: 120 DGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179
Query: 317 EFNSA---RGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
S+ RG D V+ P++ +D T YYV L +VGG +P +++ + + +
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGG----VPVVVYDKESGHNTSV 235
Query: 373 --------IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLRSV 423
++D GT T L Y ++R S + PT G A D C++ SG S
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV--ILPTLGNSAGLDLCFNSSGDTSY 293
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
P+V+ +F L LP +N + V S C + + LSIIGN+QQQ + +DL
Sbjct: 294 GFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352
Query: 484 ANNRVGF 490
+++ F
Sbjct: 353 VASQISF 359
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 117/360 (32%), Positives = 176/360 (48%), Gaps = 31/360 (8%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
+Y + +GTPP + +DTGSD+ WLQC PCT CY+Q +P+FDP++SS+YS + +
Sbjct: 58 DYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSE 117
Query: 218 QCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
C L ++C N C Y +Y D S T G L ET++ G ++KG+ GCGH+
Sbjct: 118 SCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHN 177
Query: 272 NEGLFVGS-AGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGD- 325
N G+F G++GLG G LSL QI ++ + CLV + S + +G +
Sbjct: 178 NNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEV 237
Query: 326 ----AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE----MDEAGDGGIIVDCG 377
V+ PL+ FY+V L G SV + + +P F ++ G +++D G
Sbjct: 238 LGNGVVSTPLVSKNTHQAFYFVTLLGISV--EDINLP---FNDGSSLEPITKGNMVIDSG 292
Query: 378 TAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
T T L Y+ L + ++A + P + CY +++ T++ HF
Sbjct: 293 TPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP--TNLKGTTLTAHFEGAD 350
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L P + + IPV G FCFAF T S+ I GN Q + FDL V F C
Sbjct: 351 VLLTPTQIF-IPVQD-GIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDC 408
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 126/425 (29%), Positives = 202/425 (47%), Gaps = 43/425 (10%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
LS RD + I + QLA R A++ F+ P+ SGA G+G+YF R
Sbjct: 51 LSDRARDDLHRHAYI-RSQLASSRRGRRA-----AEVGASAFAMPLSSGAYTGTGQYFVR 104
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IFDPKTSSSYSPLPCAAPQ 218
VGTP + F +V DTGSD+ W++CR +F S S++P+ C++
Sbjct: 105 FRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIACSSDT 164
Query: 219 CKS---LDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG---------------NS 258
C S ++ C A+ C Y Y DGS G + T++ +
Sbjct: 165 CTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRR 224
Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--A 312
++G+ LGC +G F S G+L LG +S + A +YCLVD +P A
Sbjct: 225 AKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 284
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
+ L F A PL+ ++++ FY V + V G+A+ IP ++++D +GG
Sbjct: 285 TSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDR--NGGA 342
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
I+D GT++T L T AY ++ + + L P + F+ CY+++ ++ +P + +HF
Sbjct: 343 ILDSGTSLTILATPAYRAVVTALSKHLAGL-PRVTMDPFEYCYNWTDAGALEIPKMEVHF 401
Query: 433 GAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGF 490
L+ PAK+Y+I D+A G C S +S+IGN+ QQ FDL + + F
Sbjct: 402 AGSARLEPPAKSYVI--DAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWEFDLRDRWLRF 459
Query: 491 TPNKC 495
+C
Sbjct: 460 KHTRC 464
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 132/378 (34%), Positives = 180/378 (47%), Gaps = 49/378 (12%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDP 199
E+ S+ A+ GS R GV RQ M+LDT SD+ W+QC PC ++CY Q+D
Sbjct: 157 EELSSAADPAATGGSRRSRLRPGV----RQL-MLLDTASDVAWVQCFPCPASQCYAQTDV 211
Query: 200 IFDPKTSSSYSPLPCAAPQCKSL-------DVSACRANRCLYQVAYGDGSFTVGDLVTET 252
++DP S S C++P C+ L S+ A +C Y+V Y DGS T G LV +
Sbjct: 212 LYDPSKSRSSESFACSSPTCRQLGPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQ 271
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGS--AGLLGLGGGMLSLTKQIK---ATSLAYCLVD 307
+S + V GC H G F S AG++ LG G+ SL Q +YC
Sbjct: 272 LSLSPTSQVPKFEFGCSHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPP 331
Query: 308 RDSPAS----GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
S GV +S+R AVT P++ K Y V L +V GQ + +PP++F
Sbjct: 332 TASHKGFFVLGVPRRSSSR--YAVT-PML---KTPMLYQVRLEAIAVAGQRLDVPPTVFA 385
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
G +D T ITRL AY +LR +F +P + DTCYDF+G+ S+
Sbjct: 386 ------AGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSI 439
Query: 424 RVPTVSLHF---GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS---ALSIIGNVQQQGT 477
+PT+SL F GAG LD P C AFA T+ A IIG +Q Q
Sbjct: 440 MLPTISLVFDRTGAGVQLD--------PSGVLFGSCLAFASTAGDDRATGIIGFLQLQTI 491
Query: 478 RVSFDLANNRVGFTPNKC 495
V +++A VGF C
Sbjct: 492 EVLYNVAGGSVGFRRGAC 509
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 109/353 (30%), Positives = 175/353 (49%), Gaps = 23/353 (6%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
+ +GTPP+ +++LDTGSD+ W QC+ + P++DP SSS++ PC C+
Sbjct: 93 VSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCETG 152
Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEGLFVGS 279
S + C N+C+Y YG + T G+L +ET +FG V + GCG G G+
Sbjct: 153 SFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSLDFGCGKLTSGSLPGA 211
Query: 280 AGLLGLGGGMLSLTKQIKATSLAYCL---VDRDSPAS----GVLEFNSAR-GGDAVTAPL 331
+G+LG+ LSL Q++ +YCL +DR++ + + + + R G T L
Sbjct: 212 SGILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSL 271
Query: 332 IRNKK-VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
+ N + +YYV L G SVG + + +P S F + G GG VD G L + +
Sbjct: 272 VTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEA 331
Query: 391 LRDSFVRLAG--NLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLPA 442
L+++ V + T ++ C+ + +V+VP + HF G A+ L
Sbjct: 332 LKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRR 391
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+Y++ V SAG C + + +IIGN QQQ V FD+ N+ F P +C
Sbjct: 392 DSYMVEV-SAGRMCLVISSGARG-AIIGNYQQQNMHVLFDVENHEFSFAPTQC 442
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 171/359 (47%), Gaps = 21/359 (5%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y + VGTPP+ M LD D W+ C+ C C S +F+
Sbjct: 22 PIASGRGVIQSPSYIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVK 78
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ L C APQCK + C + C + YG + + +L +T++ + V A
Sbjct: 79 STTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSSTI-LSNLTRDTIAL-SMDPVPYYA 136
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
GC G V GLLG G G LS T+ + ++ +YCL R SG L
Sbjct: 137 FGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPV 196
Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+ T PL++N + + YYV L G VG + V IP S + G I D GT
Sbjct: 197 GQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVF 256
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TRL AY ++R+ F + GN S + FDTCY + PT++ F +G + +
Sbjct: 257 TRLVAPAYIAVRNEFRKRVGNAT-VSSLGGFDTCYSV----PIVPPTITFMF-SGMNVTM 310
Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P +N LI + T C A A +S L++I ++QQQ R+ FD+ N+R+G +C
Sbjct: 311 PPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQC 369
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 118/356 (33%), Positives = 168/356 (47%), Gaps = 24/356 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G+Y + +GTPP + S +DTGSD+ W+QC PC CY Q +P+FDP SS+Y+ + C +
Sbjct: 62 GQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDS 121
Query: 217 PQCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
P C + C RC Y Y D S T G L ETV+ G S++GI GCGH+
Sbjct: 122 PLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHN 181
Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQI----KATSLAYCLVD--RDSPASGVLEFNSAR-- 322
N G F GL+GLGGG SL QI + CLV D S + F
Sbjct: 182 NTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEV 241
Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
G VT PL++ ++ T YYV L G SV + + ++ + G ++VD GT
Sbjct: 242 LGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEK------GNMLVDSGTPPN 295
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
L Q Y+ + V+ L+P + + +++ PT++ HF L P
Sbjct: 296 ILPQQLYDRVYVE-VKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLLTP 354
Query: 442 AKNYLIPV-DSAGTFCFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ P ++ G FC A +S I GN Q + FDL V F P C
Sbjct: 355 IQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDC 410
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 121/377 (32%), Positives = 186/377 (49%), Gaps = 32/377 (8%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
T + SG GE+F I +GTPP + + DTGSD+ W+QC+PC +CY+++ PIFD K
Sbjct: 72 TDLQSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKK 131
Query: 206 SSSYSPLPCAAPQCKSLDVS--AC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GN 257
SS+Y PC + C +L S C N C Y+ +YGD SF+ GD+ TET+S G+
Sbjct: 132 SSTYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGS 191
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
S G GCG++N G F + + GG LSL Q+ ++ +YCL + + +
Sbjct: 192 PVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTN 251
Query: 314 GV----LEFNS-----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
G L NS ++ ++ PL+ +K+ T+YY+ L SVG + + S +
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVISTPLV-DKEPRTYYYLTLEAISVGKKKIPYTGSSYNP 310
Query: 365 DEAG-----DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFS 418
++ G G II+D GT +T L + ++ + L K S L C+ S
Sbjct: 311 NDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFK-S 369
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
G + +P +++HF G + L N + V S C + PT+ ++I GN Q
Sbjct: 370 GSAEIGLPEITVHF-TGADVRLSPINAFVKV-SEDMVCLSMVPTTE-VAIYGNFAQMDFL 426
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL V F C
Sbjct: 427 VGYDLETRTVSFQRMDC 443
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/347 (34%), Positives = 174/347 (50%), Gaps = 27/347 (7%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP-CAAPQCKSLD 223
+GTPP + L+ G+++ W P EC++Q+ P F+P T S P C +P+
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKF---- 56
Query: 224 VSACRANRCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVGS-AG 281
C+Y +YGD S T G L + +F G SV G+A GCG N G+F + G
Sbjct: 57 ---WPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETG 113
Query: 282 LLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNS---ARGGDAV-TAPLI--- 332
+ G G G LSL Q+K + ++C + P++ +L+ + + G AV T PLI
Sbjct: 114 IAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYA 173
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+N+ T YY+ L G +VG + +P S F + G GG I+D GT+IT L Q Y +R
Sbjct: 174 KNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVR 232
Query: 393 DSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV-D 450
D F ++ + P + + TC+ VP + LHF G +DLP +NY+ V D
Sbjct: 233 DEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDVPKLVLHF-EGATMDLPRENYVFEVPD 290
Query: 451 SAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AG C A +IIGN QQQ V +DL NN + F +C
Sbjct: 291 DAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 117/367 (31%), Positives = 176/367 (47%), Gaps = 39/367 (10%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC--YQQSDPIFDPKTSSSYSPL 212
G GEY + +GTPP+ ++DTGSD+ WL+C C C + IF SSSY L
Sbjct: 1 GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60
Query: 213 PCAAPQCKSLDVSA----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------V 261
PC + C + + C C Y+ YGDGS T GD+ ++ +SF + G+
Sbjct: 61 PCNSTHCSGMSSAGIGPRCEET-CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSP--ASGVL 316
G GC +G + + GL+GLG SL +Q+ +YCLV DSP A L
Sbjct: 120 DGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179
Query: 317 EFNSA---RGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
S+ RG D V+ P++ +D T YYV L ++GG +P +++ + + +
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGG----VPVVVYDKESGHNTSV 235
Query: 373 --------IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLRSV 423
++D GT T L Y ++R S + PT G A D C++ SG S
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV--ILPTLGNSAGLDLCFNSSGDTSY 293
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
P+V+ +F L LP +N + V S C + + LSIIGN+QQQ + +DL
Sbjct: 294 GFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352
Query: 484 ANNRVGF 490
+++ F
Sbjct: 353 VASQISF 359
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 163/350 (46%), Gaps = 73/350 (20%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIF 201
S P G+S + EY +G+G+P +V+DTGSD++W+QC PC + C+ + +F
Sbjct: 92 SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 151
Query: 202 DPKTSSSYSPLPCAAPQCKSL----DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
DP SS+Y+ C+A C L + + C A +RC Y V YGDGS T G
Sbjct: 152 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTG---------- 201
Query: 257 NSGSVKGIALGCGHDN--EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
G GC H G+ + GL+GLGG SL Q A
Sbjct: 202 -----TGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAA---------------- 240
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
R+KKV T+Y+ L +VGG+ + + PS+F G +V
Sbjct: 241 ------------------RSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA------GSLV 276
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT ITRL AY +L +F + + DTC++F+GL V +PTV+L F
Sbjct: 277 DSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAG 336
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFD 482
G +DL A + S G C AFAPT A IGNVQQ+ V +D
Sbjct: 337 GAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 176/350 (50%), Gaps = 27/350 (7%)
Query: 169 PRQFSMVLDTGSDINWLQCR----PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
PR+ +++DTGSD+ W QC+ S P++DP SS+++ LPC+ C+
Sbjct: 25 PRK--LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQF 82
Query: 225 S---ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEGLFVGSA 280
S NRC+Y+ YG + VG L +ET +FG +V + GCG + G +G+
Sbjct: 83 SFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGAT 141
Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA--SGVLEFNSARGGDAVTAPLIRN 334
G+LGL LSL Q+K +YCL + SP + + + + + I +
Sbjct: 142 GILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVS 201
Query: 335 KKVDT-FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
V+T +YYV L G S+G + + +P + M G GG IVD G+ + L A+ ++++
Sbjct: 202 NPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKE 261
Query: 394 SFVRLAGNLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
+ + + V ++ C+ + + +V+VP + LHF G A+ LP NY
Sbjct: 262 AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYF- 320
Query: 448 PVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AG C A T+ S +SIIGNVQQQ V FD+ +++ F P +C
Sbjct: 321 QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 175/365 (47%), Gaps = 33/365 (9%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
+ G+ EY G G P ++F + DT ++ L+C+PC DP F+P SSS++ +
Sbjct: 82 APGALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEPSRSSSFAAI 140
Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---G 269
PC +P+C C C + + +G+ + G LV +T++ S + G GC G
Sbjct: 141 PCGSPECAV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVG 196
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPAS-GVLEFNSA 321
D + F G+ GL+ L SL ++ A + +YCL + +S G L ++
Sbjct: 197 ADAD-TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGAS 255
Query: 322 R----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
R GGD AP+ N Y+V L G SVGG+ + +PP++F G +++
Sbjct: 256 RPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAA 310
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T T L AY +LRD+F + + DTCY+ +GL S+ VP V+L F G
Sbjct: 311 TEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTE 370
Query: 438 LDLPAKNYLIPVDSAGTFC-------FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
L+L + + D + F A + +S+IG + Q+ T V +DL RVGF
Sbjct: 371 LELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGF 430
Query: 491 TPNKC 495
P +C
Sbjct: 431 IPGRC 435
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 172/361 (47%), Gaps = 34/361 (9%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
STP S + GEY +GTPP + +DTGSD+ WLQC PC +CY Q PIFDP
Sbjct: 75 STPQ-STVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPS 133
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
SSSY +PC + C S+ ++C G +V L ++ + G S S
Sbjct: 134 LSSSYQNIPCLSDTCHSMRTTSCDVR----------GYLSVETLTLDSTT-GYSVSFPKT 182
Query: 265 ALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS 320
+GCG+ N G F G S+G++GLG G +SL Q+ + +YCL ++ L F
Sbjct: 183 MIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNSTSKLNFGD 242
Query: 321 A---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
A G A+T P+++ K + YY+ L FSVG + ++ + +E G I++D G
Sbjct: 243 AAIVYGDGAMTTPIVK-KDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNE---GNILIDSG 298
Query: 378 TAITRLQTQAYNSLRDS---FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
T T L Y + ++ L P F CY+ + P ++ HF
Sbjct: 299 TTFTFLPYDVYYRFESAVAEYINLEHVEDPN---GTFKLCYNVA-YHGFEAPLITAHF-K 353
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
G + L + I V S G C AF P+ +A I GNV QQ V ++L N V F P
Sbjct: 354 GADIKLYYISTFIKV-SDGIACLAFIPSQTA--IFGNVAQQNLLVGYNLVQNTVTFKPVD 410
Query: 495 C 495
C
Sbjct: 411 C 411
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 170/359 (47%), Gaps = 21/359 (5%)
Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y + VGTP + F M LDT +D W+ C C C S +F+ T
Sbjct: 77 PIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVT 133
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ L C APQCK + C + C + YG GS + +L +T++ ++ V G
Sbjct: 134 STTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIAL-STDIVPGYT 191
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
GC G V G GL LS T+ + ++ +YCL R SG L A
Sbjct: 192 FGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPA 251
Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+ T PL++N + + YYV L G VG + V IP S + G I D GT
Sbjct: 252 GQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVF 311
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TRL Y ++RD F + GN S + FDTCY + PT++ F +G + L
Sbjct: 312 TRLVAPVYTAVRDEFRKRVGNAI-VSSLGGFDTCYT----GPIVAPTMTFMF-SGMNVTL 365
Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P N LI + T C A A +S L++I N+QQQ R+ FD+ N+R+G C
Sbjct: 366 PTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 127/362 (35%), Positives = 170/362 (46%), Gaps = 37/362 (10%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IFDPKTSSSYSPLP 213
EY + VGTPP Q + DTGSD+ W+ C +D +F P SS+YS L
Sbjct: 102 EYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLS 161
Query: 214 CAAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF---GNSGSVK--GIALG 267
C + C++L ++C A+ C YQ +YGDGS T+G L TET SF G G V+ + G
Sbjct: 162 CQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFG 221
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLV-DRDSPASGVLEFNS- 320
C + G F S GL+GLG G SL Q+ AT+ L+YCL+ D+ +S L F S
Sbjct: 222 CSTASAGTFR-SDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLNFGSR 280
Query: 321 --ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
A + PL+ + VD++Y V L +VGGQ V D IIVD GT
Sbjct: 281 AVVSEPGAASTPLVPS-DVDSYYTVALESVAVGGQEVATH----------DSRIIVDSGT 329
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR---VPTVSLHFGAG 435
+T L L R + L CYD G +P V+L FG G
Sbjct: 330 TLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIPDVTLRFGGG 389
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPN 493
A+ L +N + GT C P S + +SI+GN+ QQ V +DL V F
Sbjct: 390 AAVTLRPENTFSLLQE-GTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVTFAAA 448
Query: 494 KC 495
C
Sbjct: 449 DC 450
>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
Length = 144
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 78/144 (54%), Positives = 102/144 (70%)
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
G V I +F ++E G+GG+++D GTA+TRL T AY++ RD+F+ NL +S V++F
Sbjct: 1 GVRVPISEDVFRLNELGEGGVVMDTGTAVTRLPTAAYDAFRDAFIGQTTNLPRSSDVSIF 60
Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
DTCYD G SVRVPT+S +F G L LPA+N+LIPV+ GTFCFAFAP+ S LSIIGN
Sbjct: 61 DTCYDLYGFVSVRVPTISFYFLGGPILTLPARNFLIPVNDVGTFCFAFAPSPSGLSIIGN 120
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
+QQ+G +S D N VGF PN C
Sbjct: 121 IQQEGIEISVDGVNGFVGFGPNIC 144
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 180/366 (49%), Gaps = 33/366 (9%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G GEY RI +G P + + DTGSD+ W+QC+PC CY+Q+ PIFDP+ SSSY + C
Sbjct: 89 GGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLC 148
Query: 215 AAPQCKSLDVSA--CRA----NRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------- 260
C LD A C A C Y +YGD SF+ G L E G++ S
Sbjct: 149 GNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAY 208
Query: 261 VKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVL 316
+ +A GCG N G F +G++GLGGG +SL Q+ + +YCLV ++
Sbjct: 209 FQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTS 268
Query: 317 EFN-------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+ N S + V+ PL+ KK +T+YY+ L SV + ++P + E
Sbjct: 269 KINFGNDINISGSNYNVVSTPLLP-KKPETYYYLTLEAISVENK--RLPYTNLWNGEVEK 325
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G II+D GT +T L ++ +N+L + + + LF+ C F +++ +P ++
Sbjct: 326 GNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNIC--FKDEKAIELPIIT 383
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
HF G ++L N V+ CF P S+ ++I GN+ Q V +DL V
Sbjct: 384 AHF-TGADVELQPVNTFAKVEE-DLLCFTMIP-SNDIAIFGNLAQMNFLVGYDLEKKAVS 440
Query: 490 FTPNKC 495
F P C
Sbjct: 441 FLPTDC 446
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/363 (30%), Positives = 174/363 (47%), Gaps = 33/363 (9%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G+ EY G G P ++F + DT ++ L+C+PC DP F+P SSS++ +PC
Sbjct: 172 GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEPSRSSSFAAIPC 230
Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHD 271
+P+C C C + + +G+ + G LV +T++ S + G GC G D
Sbjct: 231 GSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGAD 286
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPAS-GVLEFNSAR- 322
+ F G+ GL+ L SL ++ A + +YCL + +S G L ++R
Sbjct: 287 AD-TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRP 345
Query: 323 ---GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
GGD AP+ N Y+V L G SVGG+ + +PP++F G +++ T
Sbjct: 346 EYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAATE 400
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
T L AY +LRD+F + + DTCY+ +GL S+ VP V+L F G L+
Sbjct: 401 FTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELE 460
Query: 440 LPAKNYLIPVDSAGTFC-------FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
L + + D + F A + +S+IG + Q+ T V +DL RVGF P
Sbjct: 461 LDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIP 520
Query: 493 NKC 495
+C
Sbjct: 521 GRC 523
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 193/388 (49%), Gaps = 37/388 (9%)
Query: 140 LPED--FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
+PE F+ P+ SGA G+G+YF + VGTP + F +V DTGSD+ W++CR +
Sbjct: 89 MPEASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDA 148
Query: 198 DP-----IFDPKTSSSYSPLPCAAPQCKS---LDVSACRANR-----CLYQVAYGDGSFT 244
P +F P S S++P+PC++ CKS ++ C A C Y Y D S
Sbjct: 149 SPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSA 208
Query: 245 VGDLVTETVSFGNSGS-------VKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQI 296
G + T+ + SGS ++ + LGC +G F S G+L LG +S +
Sbjct: 209 RGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRA 268
Query: 297 KAT---SLAYCLVDRDSP--ASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSV 350
A +YCLVD +P A+ L F + + PL+ + +V FY V + SV
Sbjct: 269 AARFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSV 328
Query: 351 GGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL 410
G+A+ IP ++++ + +GG I+D GT++T L T AY ++ + + + P +
Sbjct: 329 AGKALNIPAEVWDVKK--NGGAILDSGTSLTILATPAYKAVVAALSKQLARV-PRVTMDP 385
Query: 411 FDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAP-TSSALS 467
F+ CY+++ R VP + + F L P K+Y+I D+A G C +S
Sbjct: 386 FEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVI--DAAPGVKCIGLQEGVWPGVS 443
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+IGN+ QQ FDLAN + F ++C
Sbjct: 444 VIGNILQQEHLWEFDLANRWLRFQESRC 471
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 170/359 (47%), Gaps = 21/359 (5%)
Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y + VGTP + F M LDT +D W+ C C C S +F+ T
Sbjct: 77 PIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVT 133
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ L C APQCK + C + C + YG GS + +L +T++ ++ V G
Sbjct: 134 STTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIAL-STDIVPGYT 191
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
GC G V G GL LS T+ + ++ +YCL R SG L A
Sbjct: 192 FGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPA 251
Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+ T PL++N + + YYV L G VG + V IP S + G I D GT
Sbjct: 252 GQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVF 311
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
TRL Y ++RD F + GN S + FDTCY + PT++ F +G + L
Sbjct: 312 TRLVAPVYTAVRDEFRKRVGNAI-VSSLGGFDTCYT----GPIVAPTMTFMF-SGMNVTL 365
Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P N LI + T C A A +S L++I N+QQQ R+ FD+ N+R+G C
Sbjct: 366 PPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 122/354 (34%), Positives = 172/354 (48%), Gaps = 28/354 (7%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
S + R +GTP + + LDT +D W+ C C C S +F SSS+ PLPC
Sbjct: 100 SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 157
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+PQC + +C + C + + YG S DLV + ++ + SV GC G
Sbjct: 158 SPQCNQVPNPSCSGSACGFNLTYG-SSTVAADLVQDNLTLA-TDSVPSYTFGCIRKATGS 215
Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFN-SARGGDAVT--- 328
V GLLGLG G LSL Q ++ ++ +YCL P+ + F+ S R G
Sbjct: 216 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCL-----PSFKSVNFSGSLRLGPVAQPIR 270
Query: 329 ---APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
PL+RN + + YYV L VG + V IPPS + A G ++D GT TRL
Sbjct: 271 IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVA 330
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
AY ++RD F R G S + FDTCY + PT++ F AG + LP N+
Sbjct: 331 PAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMF-AGMNVTLPPDNF 385
Query: 446 LIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LI + T C A A +S L++I ++QQQ R+ FD+ N+RVG C
Sbjct: 386 LIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 439
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 123/349 (35%), Positives = 170/349 (48%), Gaps = 18/349 (5%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
S + R +GTP + + LDT +D W+ C C C S +F SSS+ PLPC
Sbjct: 23 SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 80
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+PQC + +C + C + + YG S DLV + ++ + SV GC G
Sbjct: 81 SPQCNQVPNPSCSGSACGFNLTYGS-STVAADLVQDNLTLA-TDSVPSYTFGCIRKATGS 138
Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA-SGVLEFNS-ARGGDAVTAP 330
V GLLGLG G LSL Q ++ ++ +YCL S SG L A+ P
Sbjct: 139 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTP 198
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
L+RN + + YYV L VG + V IPPS + A G ++D GT TRL AY +
Sbjct: 199 LLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTA 258
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
+RD F R G S + FDTCY + PT++ F AG + LP N+LI
Sbjct: 259 VRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMF-AGMNVTLPPDNFLIHST 313
Query: 451 SAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S T C A A +S L++I ++QQQ R+ FD+ N+RVG C
Sbjct: 314 SGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 362
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 129/387 (33%), Positives = 180/387 (46%), Gaps = 38/387 (9%)
Query: 144 FSTPVVSGASQ-GSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
+ PV G S GS EY +G+GTP P++ + LDTGSD+ W QC CT C+ Q P+F
Sbjct: 78 LTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVF 136
Query: 202 DPKTSSSYSPLPCAAPQCKS---LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFG 256
S ++S +PC+ P C L +S C R C Y Y D S T G + +T +F
Sbjct: 137 RASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFK 196
Query: 257 ------NSGSVKGIALGCGHDNEGLFV-GSAGLLGLGGGMLSLTKQIKATSLAYCLV--- 306
+ +V I GCG N GLF +G+ G G G LSL Q+K +YC
Sbjct: 197 APDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAME 256
Query: 307 -DRDSPA--SGVLEFNSARGGDAVT----APLIRNKKVDT--FYYVGLTGFSVGGQAVQI 357
R SP G E A + AP V + FY++ L G +VG +
Sbjct: 257 ESRVSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPF 316
Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF 417
S F + G GG +D GTAIT + SLR++FV L G D F
Sbjct: 317 NASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQV-PLPVAKGYTDPDNLLCF 375
Query: 418 S---GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT-----FCFA-FAPTSSALSI 468
S ++ VP + LH G +LP +NY++ D G+ C + +S +I
Sbjct: 376 SVPAKKKAPAVPKLILHL-EGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTI 434
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
IGN QQQ + +DL +N++ F P +C
Sbjct: 435 IGNFQQQNMHIVYDLESNKMVFAPARC 461
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 186/377 (49%), Gaps = 32/377 (8%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
T + SG GE+F I +GTPP + + DTGSD+ W+QC+PC +CY+++ PIFD K
Sbjct: 72 TDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKK 131
Query: 206 SSSYSPLPCAAPQCKSLDVS--ACRA--NRCLYQVAYGDGSFTVGDLVTETVSF----GN 257
SS+Y PC + C++L + C N C Y+ +YGD SF+ GD+ TETVS G+
Sbjct: 132 SSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGS 191
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
S G GCG++N G F + + GG LSL Q+ ++ +YCL + + +
Sbjct: 192 PVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTN 251
Query: 314 GVLEFN---------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
G N ++ V+ PL+ +K+ T+YY+ L SVG + + S +
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVVSTPLV-DKEPLTYYYLTLEAISVGKKKIPYTGSSYNP 310
Query: 365 DEAG-----DGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFS 418
++ G G II+D GT +T L+ ++ + + G + + L C+ S
Sbjct: 311 NDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFK-S 369
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
G + +P +++HF G + L N + + S C + PT+ ++I GN Q
Sbjct: 370 GSAEIGLPEITVHF-TGADVRLSPINAFVKL-SEDMVCLSMVPTTE-VAIYGNFAQMDFL 426
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL V F C
Sbjct: 427 VGYDLETRTVSFQHMDC 443
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 129/391 (32%), Positives = 186/391 (47%), Gaps = 24/391 (6%)
Query: 122 LAIYNVDRHELKPAEAQIL--PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
L + + D H L + + P+ S PV SG G Y R +GTPP+ MVLDT
Sbjct: 65 LHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTS 124
Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-----RANRCLY 234
+D WL C C+ C + F+ +SS+YS + C+ QC C + + C +
Sbjct: 125 NDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 183
Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTK 294
+YG S LV +T++ + + GC + G + GL+GLG G +SL
Sbjct: 184 NQSYGGDSSFSASLVQDTLTLA-PDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVS 242
Query: 295 Q---IKATSLAYCLVD-RDSPASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFS 349
Q + + +YCL R SG L+ ++ PL+RN + + YYV LTG S
Sbjct: 243 QTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVS 302
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
VG V + P D G I+D GT ITR Y ++RD F R N+ S +
Sbjct: 303 VGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLG 361
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFA----PTSS 464
FDTC FS P ++LH + L LP +N LI SAGT C + A ++
Sbjct: 362 AFDTC--FSADNENVAPKITLHMTSLD-LKLPMENTLI-HSSAGTLTCLSMAGIRQNANA 417
Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L++I N+QQQ R+ FD+ N+R+G P C
Sbjct: 418 VLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 123/376 (32%), Positives = 186/376 (49%), Gaps = 38/376 (10%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S P+ SGA G+G+YF ++ VGTP ++F++V DTGSD+ W++C + + +F PK
Sbjct: 102 SLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR----VFRPK 157
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN------RCLYQVAYGDGSFTVGDLV-TETVSF-- 255
TS S++P+PC++ CK LDV AN C Y Y +GS +V TE+ +
Sbjct: 158 TSRSWAPIPCSSDTCK-LDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIAL 216
Query: 256 --GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRD 309
G +K + LGC ++G F + G+L LG +S Q A S +YCLVD
Sbjct: 217 PGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHL 276
Query: 310 SP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
+P A+G L F G P + K FY V + V G+A+ IP E
Sbjct: 277 APRNATGYLAFGP---GQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPA---E 330
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
+ +A GG+I+D G +T L AY ++ + + + P F+ CY+++ R
Sbjct: 331 VWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGV-PKVSFPPFEHCYNWTARRPG 389
Query: 424 R---VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRV 479
+P +++ F L+ PAK+Y+I V G C LS+IGN+ QQ
Sbjct: 390 APEIIPKLAVQFAGSARLEPPAKSYVIDVKP-GVKCIGVQEGEWPGLSVIGNIMQQEHLW 448
Query: 480 SFDLANNRVGFTPNKC 495
FDL N +V F + C
Sbjct: 449 EFDLKNMQVRFKQSNC 464
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 176/372 (47%), Gaps = 41/372 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
GEY ++G+GTP FS +DT SD+ WLQC+PC CY+Q DPIF+P+ SSSY+ +PC++
Sbjct: 86 GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSS 145
Query: 217 PQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSV-KGIALGCGHDN 272
C LD C + C Y Y + T G L + ++ G G+V + LGC +
Sbjct: 146 DTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVG--GNVFHAVVLGCSDSS 203
Query: 273 EGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA----- 326
G A GL+GL G LSL Q+ YCL S G L + G DA
Sbjct: 204 VGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSRTPGKLVLGAGAGADAVRNVS 263
Query: 327 --VTAPLIRNKKVDTFYYVGLTGFSVGGQ---AVQIPPS------------LFEMDEAGD 369
VT + + + ++YY+ G +VG Q ++ P S A
Sbjct: 264 DRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGANA 323
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCY---DFSGLRSV 423
G+IVD + I+ L+ Y+ L D +RL P++ + L D C+ + G+ V
Sbjct: 324 YGMIVDVASTISFLEASLYDELADDLEEEIRLP-RATPSTRLGL-DLCFILPEGVGIDRV 381
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
VPTVS+ F G+ L+L + + C T S +SI+GN QQQ V ++L
Sbjct: 382 YVPTVSMSFD-GRWLELERDRLFL--EDGRMMCLMIGRT-SGVSILGNYQQQNMHVLYNL 437
Query: 484 ANNRVGFTPNKC 495
++ F C
Sbjct: 438 RRGKITFAKASC 449
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 122/373 (32%), Positives = 167/373 (44%), Gaps = 37/373 (9%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G GEY ++G GTP FS +DT SD+ W+QC+PC CY+Q DP+F+PK SSSY+ +PC
Sbjct: 88 GGGEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPC 147
Query: 215 AAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
+ C LD C + C Y Y T G L + ++ G + GC
Sbjct: 148 TSDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGD-VFHAVVFGCSDS 206
Query: 272 NEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG-----D 325
+ G A GL+GLG G LSL Q+ YCL S SG L + D
Sbjct: 207 SVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPPMSRTSGKLVLGAGADAVRNMSD 266
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQA------VQIPPSLFEMDEAGDG--------- 370
VT + + + ++YY+ L G +VG Q PPS G G
Sbjct: 267 RVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGG 326
Query: 371 ----GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCY---DFSGLRS 422
G+IVD + I+ L+T Y+ L D + T + L D C+ + G+
Sbjct: 327 ANAYGMIVDVASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDR 386
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
V VPTVSL F G+ L+L V C T S +SI+GN Q Q RV F+
Sbjct: 387 VYVPTVSLSFD-GRWLELDRDRLF--VTDGRMMCLMIGRT-SGVSILGNFQLQNMRVLFN 442
Query: 483 LANNRVGFTPNKC 495
L ++ F C
Sbjct: 443 LRRGKITFAKASC 455
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 111/359 (30%), Positives = 171/359 (47%), Gaps = 32/359 (8%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+GEY + +GTPP + + DTGSD+ W+QC PC C+ Q P+F+P SS++ C
Sbjct: 89 NGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCD 148
Query: 216 APQCKSLDVS--AC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA-----LG 267
+ C S+ S C + +C+Y +YGD SFTVG + TET+SFG++G + ++ G
Sbjct: 149 SQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFG 208
Query: 268 CG-------HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
CG H ++ + G + L QI +YCL+ S ++ L+F S
Sbjct: 209 CGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQI-GYKFSYCLLPFSSNSTSKLKFGS 267
Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
V+ PLI +FY++ L ++G + V DG II+D G
Sbjct: 268 EAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPT--------GRTDGNIIIDSG 319
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T +T L+ YN+ S + F C+ + R + +P ++ F G +
Sbjct: 320 TVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPY---RDMTIPVIAFQF-TGAS 375
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L KN LI + C A P+S S +SI GNV Q +V +DL +V F P C
Sbjct: 376 VALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAPTDC 434
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 128/401 (31%), Positives = 187/401 (46%), Gaps = 47/401 (11%)
Query: 108 RDSARVNTLITKL-QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVG 166
RD +RV+ + +K Q N+ H + ED G + + G
Sbjct: 92 RDESRVSFINSKCNQYTSGNLKNHA---HNNNLFDED-------------GNFLVDVAFG 135
Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
TP + ++LDTGS I W QC+ C C Q S+ FD SS+YS C S
Sbjct: 136 TPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSCIP--------ST 187
Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGL 285
N Y + YGD S +VG+ +T++ S + GCG +N+G F G G+LGL
Sbjct: 188 VENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVDGMLGL 244
Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK----KVD 338
G G LS Q + +YCL + DS S + + ++ + N +
Sbjct: 245 GQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQES 304
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
+Y+V L+ SVG + + IP S+F G I+D T ITRL +AY++L+ +F +
Sbjct: 305 GYYFVNLSDISVGNERLNIPSSVFASP-----GTIIDSRTVITRLPQRAYSALKAAFKKA 359
Query: 399 AGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
++G + DTCY+ SG + V +P + LHFG G + L N + D A
Sbjct: 360 MAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSD-ASR 418
Query: 455 FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA TS L+IIGN QQ V +D+ R+GF N C
Sbjct: 419 LCLAFAGTSE-LTIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 179/363 (49%), Gaps = 29/363 (7%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
ASQG EY VGTPP Q ++DTGSDI WLQC+PC +CY Q+ PIFDP S +Y
Sbjct: 89 ASQG--EYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKT 146
Query: 212 LPCAAPQCKSLDVSA-CRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVK--GI 264
LPC++ C+S+ +A C +N C Y + YGD S + GDL ET++ G++ SV+
Sbjct: 147 LPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKT 206
Query: 265 ALGCGHDNEGLFVGSAGLLGLGG----GMLSLTKQIKATSLAYCLVD--RDSPASGVLEF 318
+GCGH+N+G F + G ++S +YCL S +S L F
Sbjct: 207 VIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNF 266
Query: 319 NS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
G V+ P++ + FY++ L FSVG ++ S G+G II+D
Sbjct: 267 GDEAVVSGRGTVSTPIVPKNGLG-FYFLTLEAFSVGDNRIEF-GSSSFESSGGEGNIIID 324
Query: 376 CGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
GT +T L Y +L + + L P+ + L CY + + VP ++ HF
Sbjct: 325 SGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRL---CYRTTSSDELNVPVITAHF 381
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G ++L + I VD G CFAF +S I GN+ QQ V +DL V F P
Sbjct: 382 -KGADVELNPISTFIEVDE-GVVCFAFR-SSKIGPIFGNLAQQNLLVGYDLVKQTVSFKP 438
Query: 493 NKC 495
C
Sbjct: 439 TDC 441
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/397 (31%), Positives = 197/397 (49%), Gaps = 42/397 (10%)
Query: 131 ELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC--- 187
E PAE+ F+ P+ SGA G+G+YF R+ VGTP + F +V DTGSD+ W++C
Sbjct: 80 ETSPAESSA----FAMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSP 135
Query: 188 --RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS---LDVSACRA--NRCLYQVAYGD 240
+ +F P S S+SPLPC + CKS ++ C + + C Y Y D
Sbjct: 136 SSSSSSPAASPPQRVFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKD 195
Query: 241 GSFTVG--DLVTETVSF-GNSGSVKG----IALGCGHDNEGL-FVGSAGLLGLGGGMLSL 292
S G L + TVS GN G+ K + LGC +G F S G+L LG +S
Sbjct: 196 NSSARGVVGLDSATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISF 255
Query: 293 TKQIKAT---SLAYCLVDRDSP--ASGVLEFNSARGGDAVTAP-------LIRNKKVDTF 340
+ + +YCLVD +P A+ L F + + L+ + + F
Sbjct: 256 ASRAASRFGGRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPF 315
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
Y+V + +V G+ ++I P +++ + +GG I+D GT++T L T AY+++ + +
Sbjct: 316 YFVSVDAVTVAGERLEILPDVWDFRK--NGGAILDSGTSLTILATPAYDAVVKAISKQFA 373
Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAF 459
+ P + F+ CY+++G+ S +P + L F L P K+Y+I D+A G C
Sbjct: 374 GV-PRVNMDPFEYCYNWTGV-SAEIPRMELRFAGAATLAPPGKSYVI--DTAPGVKCIGV 429
Query: 460 APTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +S+IGN+ QQ FDLAN + F ++C
Sbjct: 430 VEGAWPGVSVIGNILQQEHLWEFDLANRWLRFKQSRC 466
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 184/377 (48%), Gaps = 34/377 (9%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
T + SG GEYF I +GTPP +F + DTGSD+ W+QC+PC +CY+Q+ P+FD K
Sbjct: 72 TDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKK 131
Query: 206 SSSYSPLPCAAPQCKSLD--VSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG-- 259
SS+Y C + C +L C +R C Y+ +YGD SFT G++ TET+S +S
Sbjct: 132 SSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGS 191
Query: 260 --SVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
S G A GCG++N G F + + GG LSL Q+ ++ +YCL + +
Sbjct: 192 PVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSATTN 251
Query: 314 GVLEFN---------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS---- 360
G N ++ +T PLI+ K +T+Y++ L +VG ++P +
Sbjct: 252 GTSVINLGTNSMTSKPSKDSAILTTPLIQ-KDPETYYFLTLEAITVG--KTKLPYTGGGG 308
Query: 361 -LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFS 418
G II+D GT +T L + Y+ + G + + + C+ S
Sbjct: 309 YSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGILTHCFK-S 367
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
G + + +PT+++HF G + L N + + S C + PT+ ++I GN+ Q
Sbjct: 368 GDKEIGLPTITMHF-TGADVKLSPINSFVKL-SEDIVCLSMIPTTE-VAIYGNMVQMDFL 424
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL V F C
Sbjct: 425 VGYDLETKTVSFQRMDC 441
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 165/356 (46%), Gaps = 31/356 (8%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
+GTPP+ M+LDTGS ++W+QC +FDP SSS+S LPC P CK
Sbjct: 88 IGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCKPRIP 147
Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+L S C NR C Y Y DG+ G+LV E ++F S S + LGC ++
Sbjct: 148 DFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESS---- 202
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----SP-ASGVLEFNSARGGDAVTAPL 331
+ G+LG+ G LS Q K T +YC+ R +P S L N GG L
Sbjct: 203 DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGENPNSGGFRYINLL 262
Query: 332 I-----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
R +D Y V + G +G Q + IP S F D +G G ++D G+ T L
Sbjct: 263 TFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYLVD 322
Query: 386 QAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDLPA 442
+AYN +R+ VRL G V + D C++ + + R + + F G + +
Sbjct: 323 EAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKGVEIVVEK 382
Query: 443 KNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L V G C + +A +IIGN QQ V FDLAN RVGF C
Sbjct: 383 ERVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVGFGKADC 437
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 125/370 (33%), Positives = 177/370 (47%), Gaps = 22/370 (5%)
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
P+ S PV SG G Y R +GTPP+ MVLDT +D WL C C+ C +
Sbjct: 12 PKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTS 70
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSF 255
F+ +SS+YS + C+ QC C + + C + +YG S LV +T++
Sbjct: 71 FNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL 130
Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSP 311
+ + GC + G + GL+GLG G +SL Q + + +YCL R
Sbjct: 131 A-PDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY 189
Query: 312 ASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
SG L+ ++ PL+RN + + YYV LTG SVG V + P D
Sbjct: 190 FSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGA 249
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G I+D GT ITR Y ++RD F R N+ S + FDTC FS P ++L
Sbjct: 250 GTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLGAFDTC--FSADNENVAPKITL 306
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTF-CFAFA----PTSSALSIIGNVQQQGTRVSFDLAN 485
H L LP +N LI SAGT C + A ++ L++I N+QQQ R+ FD+ N
Sbjct: 307 HM-TSLDLKLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPN 364
Query: 486 NRVGFTPNKC 495
+R+G P C
Sbjct: 365 SRIGIAPEPC 374
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/377 (32%), Positives = 186/377 (49%), Gaps = 40/377 (10%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP--IFD 202
S P+ SGA G+G+YF ++ VGTP ++F++V DTGS++ W++C S P +F
Sbjct: 77 SLPMSSGAYAGTGQYFVKVLVGTPAQEFTLVADTGSELTWVKC-----AGGASPPGLVFR 131
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRAN------RCLYQVAYGDGSF----TVG-DLVTE 251
P+ S S++P+PC++ CK LDV AN C Y Y +GS VG D T
Sbjct: 132 PEASKSWAPVPCSSDTCK-LDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATI 190
Query: 252 TVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD 307
+ G ++ + LGC ++G F G+L LG +S + A S +YCLVD
Sbjct: 191 ALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVD 250
Query: 308 RDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQAVQIPPSL 361
+P A+G L F G P + K FY V + V GQA+ IP +
Sbjct: 251 HLAPRNATGYLAFGP---GQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEV 307
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
++ GG+I+D GT +T L T AY ++ + +L + P F+ CY+++ R
Sbjct: 308 WDPKS---GGVILDSGTTLTVLATPAYKAVVAALTKLLAGV-PKVDFPPFEHCYNWTAPR 363
Query: 422 --SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTR 478
+ +P +++ F L+ PAK+Y+I V G C +S+IGN+ QQ
Sbjct: 364 PGAPEIPKLAVQFTGCARLEPPAKSYVIDVKP-GVKCIGLQEGEWPGVSVIGNIMQQEHL 422
Query: 479 VSFDLANNRVGFTPNKC 495
FDL N V F P+ C
Sbjct: 423 WEFDLKNMEVRFMPSTC 439
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 120/419 (28%), Positives = 190/419 (45%), Gaps = 57/419 (13%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
++RD R + + + + N D K E P + P+ SG GEYF+ + V
Sbjct: 62 VKRDKLRRQRMNQRWGV-VSNYDSRR-KGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKV 119
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK----- 220
G+P ++F +V+DTGS+ WL C S S+ + CA+ +CK
Sbjct: 120 GSPGQRFWLVVDTGSEFTWLNC------------------SKSFEAVTCASRKCKVDLSE 161
Query: 221 --SLDVSACRANRCLYQVAYGDGSFTVG----DLVTETVSFGNSGSVKGIALGCGHDNEG 274
SL V ++ CLY ++Y DGS G D +T ++ G G + + +GC +
Sbjct: 162 LFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGC---TKS 218
Query: 275 LFVG------SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGD 325
+ G + G+LGLG S + +YCLVD S S + N GG
Sbjct: 219 MLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRS--VSSNLTIGGH 276
Query: 326 AVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
L ++ + FY V + G S+GGQ ++IPP +++ + +GG ++D GT +
Sbjct: 277 HNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFN--AEGGTLIDSGTTL 334
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
T L AY ++ ++ + +K +G + C+D G VP + HF G
Sbjct: 335 TSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARF 394
Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ P K+Y+I V + C P S+IGN+ QQ FDL+ N VGF P+ C
Sbjct: 395 EPPVKSYIIDV-APLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTC 452
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 178/359 (49%), Gaps = 25/359 (6%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCA 215
+Y + +G PP++ ++DTGSD+ W QC C C +Q+ P ++ SS+++P+PCA
Sbjct: 89 QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148
Query: 216 APQCKSLD--VSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---G 269
A C + D + C A C YG G G L TE +F SG+ + +A GC
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAG-VVAGTLGTEAFAF-QSGTAE-LAFGCVTFT 205
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSARG---- 323
+G G++GL+GLG G LSL Q AT +YCL ++ A+G L ++
Sbjct: 206 RIVQGALHGASGLIGLGRGRLSLVSQTGATKFSYCLTPYFHNNGATGHLFVGASASLGGH 265
Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG----DGGIIVDCGTA 379
GD +T ++ K FYY+ L G +VG + IP ++F++ E GG+I+D G+
Sbjct: 266 GDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSP 325
Query: 380 ITRLQTQAYNSLRDSF-VRLAGNL-KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T L AY++L RL G+L P C + V VP V HF G
Sbjct: 326 FTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRV-VPAVVFHFRGGAD 384
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +PA++Y PVD A + S+IGN QQQ RV +DLAN F P C
Sbjct: 385 MAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 125/420 (29%), Positives = 200/420 (47%), Gaps = 77/420 (18%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR--------------- 188
F+ P+ SGA G+G+YF R VGTP + F +V DTGSD+ W++C
Sbjct: 72 FAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASS 131
Query: 189 ---PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-----SLDVSACRANRCLYQVAYGD 240
P +++ F P S +++P+PC++ C+ SL A AN C Y Y D
Sbjct: 132 LPAPAPASPRRT---FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKD 188
Query: 241 GSFTVGDLVTETVSFGNSG------SVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLT 293
GS G + ++ + SG ++G+ LGC G F+ S G+L LG +S
Sbjct: 189 GSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFA 248
Query: 294 KQIKAT---SLAYCLVDRDSP--ASGVL------EFNSARGGDAVTA------------- 329
+ + +YCLVD +P A+ L F+S R + + +
Sbjct: 249 SRAASRFGGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAG 308
Query: 330 -------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
PL+ + + FY V + G SV G+ ++IP +++++++ GG I+D GT++T
Sbjct: 309 APGARQTPLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQG--GGAILDSGTSLTM 366
Query: 383 LQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRV----PTVSLHFGAGKA 437
L AY ++ + RLAG P + FD CY+++ V P +++HF
Sbjct: 367 LAKPAYRAVVAALSKRLAG--LPRVTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSAR 424
Query: 438 LDLPAKNYLIPVDSA-GTFCFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+ PAK+Y+I D+A G C LS+IGN+ QQ +DL N R+ F ++C
Sbjct: 425 LEPPAKSYVI--DAAPGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 108/354 (30%), Positives = 163/354 (46%), Gaps = 36/354 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP+ S ++D ++ W QC C+ C++Q P+F P SS++ P PC CKS+
Sbjct: 73 IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPT 132
Query: 225 SACRANRCLYQVAYGD--GSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVG 278
S C +N C Y+ G T+G + T+T + G + + GC G D G G
Sbjct: 133 SNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGT--ATASLGFGCVVASGIDTMG---G 187
Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRN 334
+GL+GLG SL Q+ T +YCL DS + L S A GG++ T P ++
Sbjct: 188 PSGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKT 247
Query: 335 KKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
D +Y + L G G A+ +PPS ++V ++ L AY +L
Sbjct: 248 SPGDDMSQYYPIQLDGIKAGDAAIALPPS--------GNTVLVQTLAPMSFLVDSAYQAL 299
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG-KALDLPAKNYLIPV- 449
+ + G + + FD C+ +GL + P + F G AL +P YLI V
Sbjct: 300 KKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVG 359
Query: 450 DSAGTFCFAFAPTS--------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ GT C A TS L+I+G++QQ+ T DL + F P C
Sbjct: 360 EEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADC 413
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 170/358 (47%), Gaps = 22/358 (6%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
+ +G+Y ++ +G+PP ++DTGSD+ W QC PC CY+Q P+F+P S +YSP
Sbjct: 75 VTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSP 134
Query: 212 LPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALG 267
+PC + QC S C Y +Y D S T G L E ++F G+ V I G
Sbjct: 135 IPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFG 194
Query: 268 CGHDNEGLF-VGSAGLLGLGGGMLSLTKQI----KATSLAYCLV--DRDSPASGVLEF-- 318
CGH N G F G++G+GGG LSL QI + + CLV D+ SG + F
Sbjct: 195 CGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGE 254
Query: 319 -NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
+ G VT PL +++ T Y V L G SVG V+ S + G I++D G
Sbjct: 255 ESDVSGEGVVTTPLA-SEEGQTSYLVTLEGISVGDTFVRFNSS----ETLSKGNIMIDSG 309
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T T + + Y L + +++ +L P T + ++ P ++ HF
Sbjct: 310 TPATYIPQEFYERLVEE-LKVQSSLLPIEDDPDLGTQLCYRSETNLEGPILTAHFEGADV 368
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP + ++ P D G FCFA A ++ I GN Q + FDL + F P C
Sbjct: 369 QLLPIQTFIPPKD--GVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDC 424
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 123/358 (34%), Positives = 166/358 (46%), Gaps = 35/358 (9%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
+GTPP+ M+LDTGS ++W+QC +FDP SSS+S LPC P CK
Sbjct: 83 IGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCKPRIP 142
Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+L S C NR C Y Y DG+ G+LV E ++F S S + LGC D
Sbjct: 143 DFTLPTS-CDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDAS---- 197
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----SP-ASGVLEFNSARGGDAVTAPL 331
G+LG+ G LS Q K T +YC+ R +P S L N G + L
Sbjct: 198 DDKGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGENPNSAGFQYISLL 257
Query: 332 I-----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
R +D + V L G +G + + IP S F D +G G ++D G+ T L
Sbjct: 258 TFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYLVD 317
Query: 386 QAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDL 440
AYN +R+ VRLAG LK SGV+ D C+D + + R + + F G + +
Sbjct: 318 VAYNKVREEVVRLAGPRLKKGYVYSGVS--DMCFDGNAMEIGRLIGNMVFEFDKGVEIVI 375
Query: 441 PAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L V G C + +A +IIGN QQ V FD+AN RVGF C
Sbjct: 376 EKGRVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDIANRRVGFGKADC 432
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 164/355 (46%), Gaps = 28/355 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y +R +GTP + + +D +D W+ C P FDP SS+Y P+ C AP
Sbjct: 106 SYVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPVRCGAP 163
Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
QC +C + C + ++Y +F + +V GC H G
Sbjct: 164 QCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHDDVDAVAAYTFGCLHVVTG 223
Query: 275 LFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDAV-TA 329
V GL+G G G LS TK + + +YCL + S SG L A + T
Sbjct: 224 GSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKTT 283
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV + G VGG+ V +P S D G IVD GT TRL Y
Sbjct: 284 PLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYA 343
Query: 390 SLRDSF---VRLAGNLKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
++RD F VR P +G + FDTCY+ ++ VPTV+ F ++ LP +N
Sbjct: 344 AVRDVFRSRVR-----APVAGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENV 394
Query: 446 LIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+I S G C A A +AL+++ ++QQQ RV FD+AN RVGF+ C
Sbjct: 395 VIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 449
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 123/365 (33%), Positives = 173/365 (47%), Gaps = 21/365 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S V + + +G+Y ++ +GTPP ++DTGSD+ W QC PC CY+Q P+F+P
Sbjct: 36 SNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPL 95
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSG 259
S++Y+P+PC + +C SL +C + C Y AY D S T G L ETV+F G
Sbjct: 96 RSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPV 155
Query: 260 SVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQI----KATSLAYCLV--DRDSPA 312
V I GCGH N G F G++GLGGG LSL Q + + CLV D
Sbjct: 156 VVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHT 215
Query: 313 SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G + F A G+ V A + +++ T Y V L G SVG V S EM G
Sbjct: 216 LGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSS--EM--LSKG 271
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
I++D GT T L + Y+ L +++ N+ P T + ++ P +
Sbjct: 272 NIMIDSGTPATYLPQEFYDRLVKE-LKVQSNMLPIDDDPDLGTQLCYRSETNLEGPILIA 330
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
HF +P + ++ P D G FCFA A T+ I GN Q + FDL V F
Sbjct: 331 HFEGADVQLMPIQTFIPPKD--GVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSF 388
Query: 491 TPNKC 495
C
Sbjct: 389 KATDC 393
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 184/389 (47%), Gaps = 26/389 (6%)
Query: 122 LAIYNVDRHELKPAEAQI------LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMV 175
L ++++ R + ++A++ L D S P+ + +G Y IG+GTPP+ +++
Sbjct: 51 LPVHDMWRRSARASKARVARLEARLTGDMSVPLARISDEG---YTVTIGIGTPPQLHTLI 107
Query: 176 LDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSACRANRCL 233
DT SD+ W QC + +Q +P+FDP SSS++ + C++ C + C C
Sbjct: 108 ADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCTEDNPGTKRCSNKTCR 167
Query: 234 YQVAYGDGSFTVGDLVTE--TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
Y Y G L E T+S N GCG +G +G++G+LG+ +LS
Sbjct: 168 YVYPYVSVE-AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILS 226
Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFS 349
+ Q+ +YCL S L F + G T P+ K + +YYV L G S
Sbjct: 227 MVSQLAIPKFSYCLTPYTDRKSSPLFFGAWADLGRYKTTGPI--QKSLTFYYYVPLVGLS 284
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
+G + + +P + F + + GG +VD G + +L A+ +L+++ + V
Sbjct: 285 LGTRRLDVPAATFALKQ---GGTVVDLGCTVGQLAEPAFTALKEAVLHTLNLPLTNRTVK 341
Query: 410 LFDTCYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
+ C+ + +V+ P + L+F G + LP NY +AG C A P +
Sbjct: 342 DYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYF-QEPTAGLMCLALVP-GGGM 399
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
SIIGNVQQQ + FD+ +++ F P C
Sbjct: 400 SIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 119/400 (29%), Positives = 173/400 (43%), Gaps = 48/400 (12%)
Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC---RPCT 191
A A L +TPV S G Y + GTPP+ S V+DTGS W C C
Sbjct: 56 ARAHHLKNPQTTPVFS---HSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCN 112
Query: 192 EC-YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVT 250
C + F PK SSS + C P+C + + R C + S +
Sbjct: 113 NCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDC------DNNSRNCSQICP 166
Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGS-------------AGLLGLGGGMLSLTKQIK 297
+ SG+ G+AL GL V + AG+ G G G SL Q+
Sbjct: 167 PYLILYGSGTTGGVALSETLHLHGLIVPNFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLG 226
Query: 298 ATSLAYCLVDR---DSPASGVLEFNSARGGDAVTA-----PLIRNKKVD------TFYYV 343
T +YCL+ D+ S L +S D TA PL++N KV +YYV
Sbjct: 227 LTKFSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYV 286
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
L S+GG++V+IP D+ G+GG I+D GT T + T+A+ L + F+ N +
Sbjct: 287 SLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYE 346
Query: 404 P---TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
++ C++ SG + + +P + LHF G ++LP +NY + S CF
Sbjct: 347 RALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVV 406
Query: 461 PTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ + I+GN Q Q V +DL N R+GF C
Sbjct: 407 TDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 165/352 (46%), Gaps = 45/352 (12%)
Query: 15 ILFSFCLFTSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPL 74
+L S CL + E L + QQ + PE+ +
Sbjct: 28 LLVSLCLIIANGVSSFEEKKVFNLQILQRKQQLGSLGCLHPESRQ--------------- 72
Query: 75 NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
+ L + R K + N +R L ++L D V ++ +L+ V H ++
Sbjct: 73 -EKGAIMLEMKDRSYCSKKKVNWHRKL-HNQLTLDDLHVRSMQNRLRKM---VSSHSVEV 127
Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
++ QI P+ SG + + Y + +G + ++++DTGSD+ W+QC PC CY
Sbjct: 128 SQIQI-------PLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSDLTWVQCEPCMSCY 178
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS-----ACRAN--RCLYQVAYGDGSFTVGD 247
Q P+F P TSSSY +PC + C+SL ++ AC +N C Y V YGDGS+T G+
Sbjct: 179 NQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGE 238
Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYC 304
L E +SFG SV GCG +N+GLF G +GL+GLG LSL Q +T +YC
Sbjct: 239 LGAEHLSFGGI-SVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYC 297
Query: 305 LVDRDSPASGVLEFNSARGGDAVTAP-----LIRNKKVDTFYYVGLTGFSVG 351
L D+ ASG L + P ++ N ++ FY + LTG VG
Sbjct: 298 LPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/399 (30%), Positives = 190/399 (47%), Gaps = 55/399 (13%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP------- 199
P+ S A G G+YF R VGTP + F +V DTGSD+ W++CRP ++
Sbjct: 83 PLTSAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASAS 142
Query: 200 ----IFDPKTSSSYSPLPCAAPQC-KSL--DVSAC--RANRCLYQVAYGDGSFTVGDLVT 250
F P+ S +++P+PCA+ C KSL +S C + C Y Y DGS G + T
Sbjct: 143 SPRRAFRPEKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGT 202
Query: 251 ETVSFG------------NSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIK 297
E+ + ++G+ LGC G F S G+L LG +S
Sbjct: 203 ESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAA 262
Query: 298 AT---SLAYCLVDRDSP--ASGVLEFN----------SARGGDAVTAPLIRNKKVDTFYY 342
+ +YCLVD SP A+ L F +A G A PL+ + ++ FY
Sbjct: 263 SRFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYD 322
Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL 402
V + SV G+ ++IP ++E+D G GG+IVD GT++T L AY ++ + +
Sbjct: 323 VSIKAISVDGELLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARF 380
Query: 403 KPTSGVALFDTCYDFSGL----RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCF 457
P + F+ CY+++ +P +++HF L+ P+K+Y+I D+A G C
Sbjct: 381 -PRVAMDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVI--DAAPGVKCI 437
Query: 458 AFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+S+IGN+ QQ FDL N R+ F ++C
Sbjct: 438 GVQEGPWPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476
>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
Length = 343
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 124/211 (58%), Gaps = 9/211 (4%)
Query: 34 ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILH-- 91
AT LDV+++L + +S E L A + + + +L LHSR+ L
Sbjct: 35 ATETLDVAASLSRARAAVSAEAVPLHQSAAAAVSTEVVGEEHEEGRLALRLHSRDFLPEE 94
Query: 92 --KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA---QILPEDFST 146
+ RH YRSLVL+RL RDSAR + + +A V R +L PA + +
Sbjct: 95 QGRQRHASYRSLVLARLRRDSARAAAVSARAAMAADGVSRFDLVPANVTAFEASAAEIQG 154
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
PVVSG GSGEYFSR+GVG+P RQ MVLDTGSD+ W+QC+PC +CYQQSDP+FDP S
Sbjct: 155 PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 214
Query: 207 SSYSPLPCAAPQCKSLDVSACR--ANRCLYQ 235
+SY+ + C P+C LD +ACR CLY+
Sbjct: 215 TSYASVACDNPRCHDLDAAACRNSTGACLYE 245
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/388 (31%), Positives = 178/388 (45%), Gaps = 28/388 (7%)
Query: 127 VDRHELKPAEAQILP-----EDFSTPVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGS 180
+D PA + L + + P+ SG G Y R+ +GTP + MVLDT +
Sbjct: 57 IDMASKDPARIRYLSSLTAQKTVAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSN 116
Query: 181 DINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVA 237
D W C C C S F + SS+++ L C+ P+C +C CL+
Sbjct: 117 DAAWAPCSGCIGC--SSTTTFSAQNSSTFATLDCSKPECTQARGLSCPTTGNVDCLFNQT 174
Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ-- 295
YG S LV +++ G + + GC G + GL+GLG G LSL Q
Sbjct: 175 YGGDSTFSATLVQDSLHLG-PNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSG 233
Query: 296 -IKATSLAYCLVDRDSPA-SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGG 352
+ + +YCL S SG L+ A+ T PL+ N + YYV LTG SVG
Sbjct: 234 SLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGR 293
Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALF 411
V I P L D G I+D GT ITR Y ++RD F + + G+ P + F
Sbjct: 294 VLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSP---LGAF 350
Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALS 467
DTC F+ V P ++LH +G L LP +N LI + C A A +S ++
Sbjct: 351 DTC--FATNNEVSAPAITLHL-SGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVN 407
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+I N+QQQ R+ FD+ N+++G C
Sbjct: 408 VIANLQQQNHRILFDINNSKLGIARELC 435
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 137/447 (30%), Positives = 201/447 (44%), Gaps = 53/447 (11%)
Query: 71 SFPLNSSSSFSLPLHSREI----LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYN 126
SF S+ FS+ L R+ +K N Y+ +V + R RVN
Sbjct: 19 SFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVV-DAVHRSINRVN------------ 65
Query: 127 VDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQ 186
H K + A STP + S G+Y VGTPP + ++DTGSDI WLQ
Sbjct: 66 ---HSNKNSLA-------STPESTVISY-EGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQ 114
Query: 187 CRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTV 245
C PC +CY Q+ P F+P SSSY + C++ C+S+ ++C + C Y + YG+ S +
Sbjct: 115 CEPCEQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQ 174
Query: 246 GDLVTETVSF----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT- 299
GDL ET++ G S +GCG +N G F + + GG SL Q+ +
Sbjct: 175 GDLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSI 234
Query: 300 --SLAYCLVDRD------SPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGF 348
+YCLV S S L F G + ++ P+++ K FYY+ + F
Sbjct: 235 GGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVK-KDHSFFYYLTIEAF 293
Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV 408
SVG + V+ S ++E G II+D T +T + + Y L + V L +
Sbjct: 294 SVGDKRVEFAGSSKGVEE---GNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPN 350
Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSI 468
F CY+ S P ++ HF L L A N + V + CFAFAP++ +I
Sbjct: 351 QQFSLCYNVSSDEEYDFPYMTAHFKGADIL-LYATNTFVEV-ARDVLCFAFAPSNGG-AI 407
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
G+ QQ V +DL V F C
Sbjct: 408 FGSFSQQDFMVGYDLQQKTVSFKSVDC 434
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 117/350 (33%), Positives = 163/350 (46%), Gaps = 23/350 (6%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTPP+Q + +DT +D W+ C C C S P FDP S+SY +PC +P
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPL 169
Query: 219 CKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C +AC C + + Y D S L ++++ +VK GC G
Sbjct: 170 CAQAPNAACPPGGKACGFSLTYADSSLQAA-LSQDSLAVAGD-AVKTYTFGCLQKATGTA 227
Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
G GL LS T+ + + +YCL S SG L R G T
Sbjct: 228 APPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLG--RNGQPPRIKTT 285
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV +TG VG + V IPP D A G ++D GT TRL AY
Sbjct: 286 PLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYV 345
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
++RD R G P S + FDTC++ + +V P V+L F G + LP +N +I
Sbjct: 346 AVRDEVRRRVG--APVSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQVTLPEENVVIHS 399
Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A A ++ L++I ++QQQ RV FD+ N RVGF +C
Sbjct: 400 TYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 449
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/396 (32%), Positives = 192/396 (48%), Gaps = 52/396 (13%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP------- 199
P+ SGA G G+YF R VGTP + F +V DTGSD+ W++CR P
Sbjct: 85 PLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGP 144
Query: 200 --IFDPKTSSSYSPLPCAAPQC-KSLDVSACR----ANRCLYQVAYGDGSFTVGDLVTET 252
F P+ S +++P+ CA+ C KSL S + C Y Y DGS G + TE+
Sbjct: 145 GRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTES 204
Query: 253 VSFGNSG------SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKAT---SLA 302
+ SG +KG+ LGC G F S G+L LG +S + +
Sbjct: 205 ATIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFS 264
Query: 303 YCLVDRDSP--ASGVLEF------NSAR---------GGDAVTAPLIRNKKVDTFYYVGL 345
YCLVD SP A+ L F +S R A PL+ ++++ FY V L
Sbjct: 265 YCLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSL 324
Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
SV G+ ++IP +++++ EAG GG+I+D GT++T L AY ++ + + L P
Sbjct: 325 KAISVAGEFLKIPRAVWDV-EAG-GGVILDSGTSLTVLAKPAYRAVVAALSKGLAGL-PR 381
Query: 406 SGVALFDTCYDFSGLRS----VRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFA 460
+ F+ CY+++ V VP +++HF L+ P K+Y+I D+A G C
Sbjct: 382 VTMDPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVI--DAAPGVKCIGLQ 439
Query: 461 P-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+S+IGN+ QQ FD+ N R+ F ++C
Sbjct: 440 EGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 475
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 176/387 (45%), Gaps = 45/387 (11%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC------RPCTECYQ---Q 196
P+ A G G+YF VGTP ++F +V DTGSD+ W+ C R C+ +
Sbjct: 70 VPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIR 129
Query: 197 SDPIFDPKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLV 249
+F SSS+ +PC CK SL C Y Y DGS +G
Sbjct: 130 HKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFA 189
Query: 250 TETVSF----GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT----- 299
ETV+ G + + +GC +G F + G++GLG S IKA
Sbjct: 190 NETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFA--IKAAEKFGG 247
Query: 300 SLAYCLVDRDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQ 353
+YCLVD S S L F S+R +A+ + + V++FY V + G S+GG
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGA 307
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN----SLRDSFVRLAGNLKPTSGVA 409
++IP ++ D G GG I+D G+++T L AY +LR S ++ K +
Sbjct: 308 MLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFR---KVEMDIG 362
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSI 468
+ C++ +G VP + HF G + P K+Y+I + G C F + S+
Sbjct: 363 PLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSV 421
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GN+ QQ FDL ++GF P+ C
Sbjct: 422 VGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/364 (31%), Positives = 171/364 (46%), Gaps = 30/364 (8%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y R +GTPP+ + +DT +D W+ PCT C + +F P+
Sbjct: 65 PIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWI---PCTACDGCASTLFAPEK 121
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ + CAAP+CK + C + C + + YG S +LV +T++ + V
Sbjct: 122 STTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSI-AANLVQDTITLA-TDPVPSYT 179
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
GC G G GL +LS T+ + ++ +YCL P+ L F+ S
Sbjct: 180 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 234
Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
R G PL++N + + YYV L VG + V IPP+ + G I D
Sbjct: 235 RLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFD 294
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG 435
GT TRL Y ++RD F R G + + FDTCY+ + VPT++ F G
Sbjct: 295 SGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIF-TG 349
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
+ LP N LI + T C A A +S L++I N+QQQ RV +D+ N+RVG
Sbjct: 350 MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVA 409
Query: 492 PNKC 495
C
Sbjct: 410 RELC 413
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 167/358 (46%), Gaps = 38/358 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKSL 222
+GTPP+ MVLDTGS ++W+QC + ++ P FDP SSS+ LPC P CK
Sbjct: 94 IGTPPQPQQMVLDTGSQLSWIQC------HNKTPPTASFDPSLSSSFYVLPCTHPLCKPR 147
Query: 223 DV-----SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
+ C NR C Y Y DG++ G+LV E ++F S + + LGC ++
Sbjct: 148 VPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGCSSESRD-- 205
Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
+ G+LG+ G LS Q K T +YC+ R + S G+ + R
Sbjct: 206 --ARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNNPNSARFRYVS 263
Query: 337 VDTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
+ TF Y V + G +GG+ + IPPS+F + G G +VD G+ T L
Sbjct: 264 MLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVDSGSEFTFL 323
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDL 440
AY+ +R+ +R+ G V + D C+D + + R + V+ F G + +
Sbjct: 324 VDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVEIVV 383
Query: 441 PAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P + L V G C + +A +IIGN QQ V FDLAN R+GF C
Sbjct: 384 PKERVLADV-GGGVHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADC 440
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/415 (30%), Positives = 193/415 (46%), Gaps = 38/415 (9%)
Query: 102 VLSRLERDSARVNTLITKLQL----AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
++ R S N+ +T+ +L A+ ++ R + QI P +P+++ G
Sbjct: 30 LIPRHSPISPLYNSQMTQTELVKSAALRSITRSKRVNFIGQISPP--LSPIITPIPD-HG 86
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY R +GTP + + DTGSD++WLQC PC CY Q P+FDP SS+Y +PC +
Sbjct: 87 EYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQ 146
Query: 218 QCKSL--DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA------LGC 268
C + C +++ C+Y YG SFT+G L +T+SF ++G +G A GC
Sbjct: 147 PCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGC 206
Query: 269 GHDNEGLF---VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNS-A 321
+ F + G +GLG G LSL Q+ +YC+V S ++G L+F S A
Sbjct: 207 AFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKFGSMA 266
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+ V+ P + N ++Y + L G +VG + V + G II+D +T
Sbjct: 267 PTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNIIIDSVPILT 318
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
L+ Y S V+ A N++ F+ C ++ P HF G + L
Sbjct: 319 HLEQGIYTDFISS-VKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHF-TGADVVL 374
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
KN I +D+ C P S +SI GN Q +V +DL +V F P C
Sbjct: 375 GPKNMFIALDN-NLVCMTVVP-SKGISIFGNWAQVNFQVEYDLGEKKVSFAPTNC 427
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 123/366 (33%), Positives = 175/366 (47%), Gaps = 23/366 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG G Y R +GTPP+ MVLDT +D WL C C+ C + F+
Sbjct: 91 SVPVASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTN 149
Query: 205 TSSSYSPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
+SS+YS + C+ QC C + + C + +YG S +LV +T++ +
Sbjct: 150 SSSTYSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTL-SPD 208
Query: 260 SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGV 315
+ + GC + G + GL+GLG G +SL Q + + +YCL R SG
Sbjct: 209 VIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGS 268
Query: 316 LEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
L+ ++ PL+RN + + YYV LTG SVG V + P D G I+
Sbjct: 269 LKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTII 328
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT ITR Y ++RD F + T G FDTC FS P ++LH
Sbjct: 329 DSGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGA--FDTC--FSADNENVTPKITLHM-T 383
Query: 435 GKALDLPAKNYLIPVDSAGTF-CFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVG 489
L LP +N LI SAGT C + A ++ L++I N+QQQ R+ FD+ N+R+G
Sbjct: 384 SLDLKLPMENTLIH-SSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIG 442
Query: 490 FTPNKC 495
P C
Sbjct: 443 IAPEPC 448
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 176/365 (48%), Gaps = 36/365 (9%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCA 215
+Y + VG PP++ ++DTGS + W QC C C +Q P F+ +S S++P+PC
Sbjct: 85 QYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQ 144
Query: 216 APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
C + C + C ++V YG G +G L T+ +F + G+ +A GC
Sbjct: 145 DKACAGNYLHFCALDGTCTFRVTYGAGGI-IGFLGTDAFTFQSGGAT--LAFGCVSFTRF 201
Query: 274 ---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSAR-----G 323
+ G++GL+GLG G LSL Q A +YCL ++ AS L +A G
Sbjct: 202 AAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGG 261
Query: 324 GDAVTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA----GDGGIIVDC 376
G ++ + + K TFYY+ L G +VG + IP + F++ E +GG+I+D
Sbjct: 262 GAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDS 321
Query: 377 GTAITRLQTQAYNSLRDSFVR-LAGNLKP-----TSGVALFDTCYDFSGLRSVRVPTVSL 430
G+ T L AY L R L G+L P G+AL C L V VPT+ L
Sbjct: 322 GSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMAL---CVARGDLDRV-VPTLVL 377
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
HF G + LP +NY P++ + T C A SIIGN QQQ + FD+ R+ F
Sbjct: 378 HFSGGADMALPPENYWAPLEKS-TACMAIV-RGYLQSIIGNFQQQNMHILFDVGGGRLSF 435
Query: 491 TPNKC 495
C
Sbjct: 436 QNADC 440
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 119/364 (32%), Positives = 169/364 (46%), Gaps = 43/364 (11%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY R +GTPP + + DTGSD+ W+QC PC +C Q+ P+FDP+ SS++ +PC +
Sbjct: 91 EYLMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQ 150
Query: 218 QCKSLDVS--AC--RANRCLYQVAYGDGSFTVGDLVTETVSFG---NSGSVKGIALGCGH 270
C L S AC ++ +C YQ YGD + G L E+++FG N+ + GC
Sbjct: 151 PCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTF 210
Query: 271 DNEGLFVGSA---GLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGG 324
N S GL+GLG G LSL Q+ +YC S ++ + F G
Sbjct: 211 SNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRF----GN 266
Query: 325 DA--------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
DA V+ PLI ++YY+ L G S+G + V+ S DG I++D
Sbjct: 267 DAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSES------QTDGNILIDS 320
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHF 432
GT+ T L+ YN FV L + V + ++ C++ G R R P V F
Sbjct: 321 GTSFTILKQSFYN----KFVALVKEVYGVEAVKIPPLVYNFCFENKGKRK-RFPDVVFLF 375
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGFT 491
G + + A N L + C PTS SI GN Q G +V +DL V F
Sbjct: 376 -TGAKVRVDASN-LFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFA 433
Query: 492 PNKC 495
P C
Sbjct: 434 PADC 437
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 173/361 (47%), Gaps = 55/361 (15%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCR--PCTECYQQSDPIFDPKTSSSYSPLPCA 215
EY + GTPP++ + LDTGSDI W QC+ P + C+ Q+ P+FDP SSS++ LPC+
Sbjct: 87 EYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCS 146
Query: 216 APQCKSLDVSA----CRANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGSVKGIA 265
+P C++ + C Y ++YGDGS + G++ E +F G+S +V G+
Sbjct: 147 SPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLV 206
Query: 266 LGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
GCGH N G+F + G+ G G G LSL Q+K + ++C + +
Sbjct: 207 FGCGHANRGVFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKTSAVLLGLPGVA 266
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
+PL R + G + P + GT+IT L
Sbjct: 267 PPSASPLGRRR---------------GSYRCRSTPR------------SSNSGTSITSLP 299
Query: 385 TQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVR--VPTVSLHFGAGKALDLP 441
+ Y ++R+ F ++ + P + F TC+ + LR + VPT++LHF G + LP
Sbjct: 300 PRTYRAVREEFAAQVKLPVVPGNATDPF-TCFS-APLRGPKPDVPTMALHF-EGATMRLP 356
Query: 442 AKNYLIPV---DSAGT----FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
+NY+ V D AG C A I+GN+QQQ V +DL N+++ F P +
Sbjct: 357 QENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQ 414
Query: 495 C 495
C
Sbjct: 415 C 415
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 171/364 (46%), Gaps = 40/364 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+GEY R +GTPP + DTGSD+ W+QC PC C+ QS P+F P SS++ P C
Sbjct: 87 NGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCR 146
Query: 216 APQCKSL--DVSAC-RANRCLYQVAYGDG-SFTVGDLVTETVSFGNSGSVKGIA-----L 266
+ C L + C ++ C+Y YGD SF+ G L TET+ F + G V+ +A
Sbjct: 147 SQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFF 206
Query: 267 GCG-HDNEGLF--VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNS 320
GCG ++N +F G++GLG G LSL QI +YCL+ S ++ L+F +
Sbjct: 207 GCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTSTSKLKFGN 266
Query: 321 AR---GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
G V+ P+I + T+Y++ L +V + V + DG +I+D G
Sbjct: 267 ESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVP--------TGSTDGNVIIDSG 318
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC-----YDFSGLRSVRVPTVSLHF 432
T +T L Y A +L+ + V L + F + P ++ F
Sbjct: 319 TLLTYLGESFY-------YNFAASLQESLAVELVQDVLSPLPFCFPYRDNFVFPEIAFQF 371
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFT 491
G + L N + + T C AP+S S +SI G+ Q +V +DL +V F
Sbjct: 372 -TGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQ 430
Query: 492 PNKC 495
P C
Sbjct: 431 PTDC 434
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/349 (34%), Positives = 173/349 (49%), Gaps = 27/349 (7%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G G Y +GTPP++ S + DTGSD+ W +C CT C Q P + P SSS+S LPC
Sbjct: 78 GGGAYDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPC 137
Query: 215 AAPQCKSLDVSACRAN--RCLYQVAYGDGS----FTVGDLVTETVSFGNSGSVKGIALGC 268
+ C L S C A C Y+ +YG S +T G L +ET + G S +V GI GC
Sbjct: 138 SGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLG-SDAVPGIGFGC 196
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV- 327
+EG + +GL+GLG G LSL Q+ + +YCL + S +L + A G V
Sbjct: 197 TTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTSDAAKTSPLLFGSGALTGAGVQ 256
Query: 328 TAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
+ PL+R T+YY V L S+G G GII D GT + L
Sbjct: 257 STPLLRTS---TYYYTVNLESISIGAATTA---------GTGSSGIIFDSGTTVAFLAEP 304
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
AY +++ + NL SG ++ C+ SG P++ LHF G +DLP +NY
Sbjct: 305 AYTLAKEAVLSQTTNLTMASGRDGYEVCFQTSG---AVFPSMVLHFDGGD-MDLPTENYF 360
Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
VD + + C+ S +LSI+GN+ Q + +D+ + + F P C
Sbjct: 361 GAVDDSVS-CW-IVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 122/356 (34%), Positives = 180/356 (50%), Gaps = 35/356 (9%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y I VGTP ++F + DTGSD+ W+Q PCT C IFDP+ SS++ + C++
Sbjct: 53 GGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSS 110
Query: 217 PQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVK--GIALGCGH 270
C L S C ++ C Y YG G T G+ +T+S G + GS K A+GCG
Sbjct: 111 QLCAELPGS-CEPGSSTCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGM 168
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA-SGVLEF--NSARGG 324
N G F G GL+GLG G +SLT Q+ A + +YCLVD +S + S L F ++A G
Sbjct: 169 VNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHG 227
Query: 325 DAVTAPLIR--NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
+ + I + T+Y + + G +V GQ + P G I+D GT +T
Sbjct: 228 TGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTY 276
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
+ + Y + + L G ++ D CYD S R+ + P +++ AG + P
Sbjct: 277 VPSGVYGRVLSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL-AGATMTPP 334
Query: 442 AKNYLIPVDSAG-TFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ NY + VD +G T C A S +SIIGNV QQG + +D ++ + F KC
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/359 (33%), Positives = 175/359 (48%), Gaps = 34/359 (9%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK-- 220
+GTP + +VLDTGS ++W+QC P P FDP SSS+S LPC+ P CK
Sbjct: 87 IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 146
Query: 221 ----SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+L S C +NR C Y Y DG+F G+LV E +F NS + + LGC ++ +
Sbjct: 147 IPDFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKESTDV 205
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD------SPASGVLEFN-SARGGDAVT 328
G+LG+ G LS Q K + +YC+ R S S L N ++RG V+
Sbjct: 206 ----KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGENPNSRGFKYVS 261
Query: 329 APLI----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
R +D Y V L G +G + + IP S+F D G G +VD G+ T L
Sbjct: 262 LLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSGSEFTHL 321
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSV--RVPTVSLHFGAGKALD 439
AY+ +++ VRL G+ V + D C+D + + + + FG G +
Sbjct: 322 VDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFGRGVEIL 381
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ + L+ V G C +S +A +IIGNV QQ V FD+AN RVGF+ +C
Sbjct: 382 VEKQRLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGFSKAEC 439
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 133/429 (31%), Positives = 200/429 (46%), Gaps = 53/429 (12%)
Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
S V L D R + KL+ + + R + Q+ + P V QG+G
Sbjct: 83 SSVAETLRWDQHRAGYIQRKLEDQV-PITRSVIT----QVSHQGVVQPKVGTQGQGTGVQ 137
Query: 160 FSRIGVGTPPRQFS------MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSP 211
+ VG P S MV+DT SD+ W+QC PC C+ Q+D ++DP SSS +
Sbjct: 138 PAGEPVGDAPTGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAA 197
Query: 212 LPCAAPQCKSLD--VSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA-- 265
PC++P C++L + C ++C Y+V Y DGS + G +++ ++ + I+
Sbjct: 198 FPCSSPACRNLGPYANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEF 257
Query: 266 -LGCGHD--NEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPAS----G 314
GC H G F ++G++ LG G SL Q KAT +YCL + G
Sbjct: 258 RFGCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILG 317
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
V ++R AVT P++R+K Y V L V G+ + +PP++F G ++
Sbjct: 318 VPRVAASR--YAVT-PMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFA------AGAVM 368
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS-----GLRSVRVPTVS 429
D T +TRL AY +LR +FV + + DTCYDFS G V++P ++
Sbjct: 369 DSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKIT 428
Query: 430 LHF-GAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
L F G A++L L+ C AFAP + IIGNVQQQ V +++
Sbjct: 429 LVFDGPNGAVELDPSGVLL------DGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGA 482
Query: 487 RVGFTPNKC 495
VGF C
Sbjct: 483 TVGFRRGAC 491
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 174/365 (47%), Gaps = 35/365 (9%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+ VGTPP+ +MVLDTGS+++WL C P + S F P+ SS+++ +PCA+ QC+S
Sbjct: 89 LAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSR 148
Query: 223 DV---SAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDNEG 274
D+ AC ++RC ++Y DGS + G L T+ + G+ ++ A GC D+
Sbjct: 149 DLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRA-AFGCMSSAFDSSP 207
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR-------GGDAV 327
V SAGLLG+ G LS Q +YC+ DRD +GVL + +
Sbjct: 208 DGVASAGLLGMNRGALSFVSQASTRRFSYCISDRDD--AGVLLLGHSDLPTFLPLNYTPM 265
Query: 328 TAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
P + D Y V L G VGG+ + IP S+ D G G +VD GT T L
Sbjct: 266 YQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGD 325
Query: 387 AYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLRS---VRVPTVSLHF-GAGK 436
AY++L+ F R A L P + FDTC+ RS R+P V+L F GA
Sbjct: 326 AYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFNGAEM 385
Query: 437 ALDLPAKNYLIPVDSA---GTFCFAFAPTSSA---LSIIGNVQQQGTRVSFDLANNRVGF 490
A+ Y +P + G +C F +IG+ Q V +DL RVG
Sbjct: 386 AVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIGHHHQMNVWVEYDLERGRVGL 445
Query: 491 TPNKC 495
P +C
Sbjct: 446 APVRC 450
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 117/350 (33%), Positives = 163/350 (46%), Gaps = 25/350 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTPP+Q + +DT +D W+ C C C + F+P S SY +PC +P
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPA 165
Query: 219 CKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C +C N C + + Y D S L ++++ N VK GC G
Sbjct: 166 CSRAPNPSCSLNTKSCGFSLTYADSSLEAA-LSQDSLAVAND-VVKSYTFGCLQKATGTA 223
Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAV---TA 329
G GL LS TK + + +YCL S SG L R G + T
Sbjct: 224 TPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLG--RKGQPLRIKTT 281
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV +TG VG + V IPP+ D A G ++D GT TRL AY
Sbjct: 282 PLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYV 341
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
++RD VR P S + FDTCY+ +V+ P V+ F G + LPA N +I
Sbjct: 342 AVRDE-VRRRIRGAPLSSLGGFDTCYN----TTVKWPPVTFMF-TGMQVTLPADNLVIHS 395
Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T C A A ++ L++I ++QQQ R+ FD+ N RVGF +C
Sbjct: 396 TYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQC 445
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 173/362 (47%), Gaps = 40/362 (11%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK-- 220
+GTP + +VLDTGS ++W+QC P P FDP SSS+S LPC+ P CK
Sbjct: 86 IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 145
Query: 221 ----SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+L S C +NR C Y Y DG+F G+LV E +F NS + + LGC ++
Sbjct: 146 IPDFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES--- 201
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
G+LG+ G LS Q K + +YC+ R S G+ S GD + +
Sbjct: 202 -TDEKGILGMNLGRLSFISQAKISKFSYCIPTR-SNRPGLASTGSFYLGDNPNSRGFKYV 259
Query: 336 KVDTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
+ TF Y V L G +G + + IP S+F D G G +VD G+ T
Sbjct: 260 SLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTH 319
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVRVPT----VSLHFGAGK 436
L AY+ +++ VRL G+ V + D C+D G S+ + + FG G
Sbjct: 320 LVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFD--GNHSMEIGRLIGDLVFEFGRGV 377
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ + ++ L+ V G C +S +A +IIGNV QQ V FD+ N RVGF+
Sbjct: 378 EILVEKQSLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKA 436
Query: 494 KC 495
+C
Sbjct: 437 EC 438
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 126/427 (29%), Positives = 198/427 (46%), Gaps = 78/427 (18%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-------- 193
E F+ P+ SGA G+G+YF R VGTP R F +V DTGSD+ W++CR
Sbjct: 38 EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAP 97
Query: 194 ---YQQSDP-----------------IFDPKTSSSYSPLPCAAPQCKS---LDVSACR-- 228
Y P +F P S +++P+PC++ C + ++AC
Sbjct: 98 GYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP 157
Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----------SVKGIALGCGHDNEGL-FV 277
+ C Y+ Y DGS G + T++ + SG ++G+ LGC G F+
Sbjct: 158 GSPCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL 217
Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEFN------------- 319
S G+L LG +S + A +YCLVD +P A+ L F
Sbjct: 218 ASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRT 277
Query: 320 ----SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
SA A PL+ + ++ FY V + G SV G+ ++IP ++++ + GG I+D
Sbjct: 278 ACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKG--GGAILD 335
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL-----RSVRVPTVSL 430
GT++T L + AY ++ + + L P + FD CY+++ +V VP +++
Sbjct: 336 SGTSLTVLVSPAYRAVVAALGKKLVGL-PRVAMDPFDYCYNWTSPLTGEDLAVAVPALAV 394
Query: 431 HFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRV 488
HF L P K+Y+I D+A G C +S+IGN+ QQ FDL N R+
Sbjct: 395 HFAGSARLQPPPKSYVI--DAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRL 452
Query: 489 GFTPNKC 495
F ++C
Sbjct: 453 RFKRSRC 459
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 123/363 (33%), Positives = 170/363 (46%), Gaps = 37/363 (10%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP--IFDPKTSSSYSPLPCA 215
EY + VGTPP Q + DTGSD+ W+ C SD +F P S++YS L C
Sbjct: 99 EYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQ 158
Query: 216 APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------VKGIALG 267
+ C++L ++C A+ C YQ AYGDGS T+G L TET SF +G V ++ G
Sbjct: 159 SAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVSFG 218
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPA--SGVLEFNS 320
C + G F S GL+GLG G LSL Q+ A + +YCLV + A S L F +
Sbjct: 219 CSTGSAGSFR-SDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSSSTLSFGA 277
Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
A + PL+ + +VD++Y V L +V GQ ++ A IIVD G
Sbjct: 278 RAVVSDPGAASTPLVPS-EVDSYYTVALESVAVAGQ---------DVASANSSRIIVDSG 327
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR---VPTVSLHFGA 434
T +T L L R + L CYD G +P V+L FG
Sbjct: 328 TTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVTLRFGG 387
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTP 492
G ++ L +N ++ GT C P S + +SI+GN+ QQ V +DL V F
Sbjct: 388 GASVTLRPENTFSLLEE-GTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVTFAA 446
Query: 493 NKC 495
C
Sbjct: 447 VDC 449
>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
thaliana]
Length = 142
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 77/139 (55%), Positives = 99/139 (71%), Gaps = 1/139 (0%)
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD 416
+ SLF++D+ G+GG+I+D GT++TRL AY ++RD+F A LK +LFDTC+D
Sbjct: 4 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 63
Query: 417 FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQG 476
S + V+VPTV LHF G + LPA NYLIPVD+ G FCFAFA T LSIIGN+QQQG
Sbjct: 64 LSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 122
Query: 477 TRVSFDLANNRVGFTPNKC 495
RV +DLA++RVGF P C
Sbjct: 123 FRVVYDLASSRVGFAPGGC 141
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 165/359 (45%), Gaps = 37/359 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
+GTPP+ MVLDTGS ++W+QC FDP SS++S LPC P CK
Sbjct: 103 IGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCKPRIP 162
Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+L S C NR C Y Y DG++ G+LV E +F S + LGC ++
Sbjct: 163 DFTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATES----T 217
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
G+LG+ G LS Q K T +YC+ R + G S G + R ++
Sbjct: 218 DPRGILGMNRGRLSFASQSKITKFSYCVPTRVT-RPGYTPTGSFYLGHNPNSNTFRYIEM 276
Query: 338 DTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
TF Y V L G +GG+ + I P++F D G G ++D G+ T L
Sbjct: 277 LTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLV 336
Query: 385 TQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALD 439
+AY+ +R VR G +K GVA D C+D + + R + + F G +
Sbjct: 337 NEAYDKVRAEVVRAVGPRMKKGYVYGGVA--DMCFDGNAIEIGRLIGDMVFEFEKGVQIV 394
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+P + L V+ G C A + +A +IIGN QQ V FDL N R+GF C
Sbjct: 395 VPKERVLATVE-GGVHCIGIANSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADC 452
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 174/370 (47%), Gaps = 43/370 (11%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC----K 220
+GTPPR+ +++DT S++ W+Q CT C P F+P SSS+ PC + C K
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64
Query: 221 SLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
SAC C +QVAY DGS G + E S G + ++ + GC +
Sbjct: 65 LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124
Query: 275 LFVG-SAGLLGLGGGMLSLTKQIKATS-------LAYCLVDRDSP--ASGVLEFNSARGG 324
V S+G LGL G S QI + S +YC +R +SGV+ F G
Sbjct: 125 RPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIF----GD 180
Query: 325 DAVTAPLIRNKKVDT---------FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
+ A + ++ FYYVGL G SVGG+ + IP S F++D G+GG D
Sbjct: 181 SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFD 240
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSG--LRSVRVPTVSLHF 432
GT ++ L A+ +L ++F R +L TSG + CYD + R P V+LHF
Sbjct: 241 SGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLHF 300
Query: 433 GAGKALDLPAKNYLIPV---DSAGTFCFAF----APTSSALSIIGNVQQQGTRVSFDLAN 485
++L + +P+ T C AF A +++IGN QQQ + DL
Sbjct: 301 KNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLER 360
Query: 486 NRVGFTPNKC 495
+R+GF P C
Sbjct: 361 SRIGFAPANC 370
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/356 (33%), Positives = 179/356 (50%), Gaps = 35/356 (9%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y I VGTP ++F + DTGSD+ W+Q PCT C IFDP+ SS++ + C++
Sbjct: 53 GGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSS 110
Query: 217 PQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVK--GIALGCGH 270
C L S C ++ C Y YG G T G+ +T+S G + GS K A+GCG
Sbjct: 111 QLCTELPGS-CEPGSSACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGM 168
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA-SGVLEF--NSARGG 324
N G F G GL+GLG G +SLT Q+ A + +YCLVD +S + S L F ++A G
Sbjct: 169 VNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHG 227
Query: 325 DAVTAPLIR--NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
+ + I + T+Y + + G +V GQ + P G I+D GT +T
Sbjct: 228 TGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTY 276
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
+ + Y + + L G ++ D CYD S R+ + P +++ AG + P
Sbjct: 277 VPSGVYGRVLSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL-AGATMTPP 334
Query: 442 AKNYLIPVDSAG-TFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ NY + VD +G T C A +SIIGNV QQG + +D ++ + F KC
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/375 (29%), Positives = 183/375 (48%), Gaps = 30/375 (8%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
T + SG GEYF I +GTPP + + DTGSD+ W+QC+PC +CY+Q+ P+FD K
Sbjct: 72 TDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKK 131
Query: 206 SSSYSPLPCAAPQCKSL--DVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
SS+Y C + C++L C ++ C Y+ +YGD SFT GD+ TET+S +S
Sbjct: 132 SSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGS 191
Query: 262 K----GIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
G GCG++N G F + + GG LSL Q+ ++ +YCL + +
Sbjct: 192 SVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTN 251
Query: 314 GV---------LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
G + N ++ +T PLI+ K +T+Y++ L +VG + + +
Sbjct: 252 GTSVINLGTNSIPSNPSKDSATLTTPLIQ-KDPETYYFLTLEAVTVGKTKLPYTGGGYGL 310
Query: 365 DEAGD---GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGL 420
+ G II+D GT +T L + Y+ + + G + + L C+ SG
Sbjct: 311 NGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFK-SGD 369
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
+ + +P +++HF + L N + ++ T C + PT+ ++I GN+ Q V
Sbjct: 370 KEIGLPAITMHF-TNADVKLSPINAFVKLNE-DTVCLSMIPTTE-VAIYGNMVQMDFLVG 426
Query: 481 FDLANNRVGFTPNKC 495
+DL V F C
Sbjct: 427 YDLETKTVSFQRMDC 441
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 122/382 (31%), Positives = 177/382 (46%), Gaps = 36/382 (9%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFD 202
S PV SQ EY +G PP+Q ++DTGS++ W QC C C+ Q+ +D
Sbjct: 61 SAPVHWAESQYIAEYL----IGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYD 116
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-RANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
P S + P+ C C + C R N+ C AYG G G L TE +F
Sbjct: 117 PSRSRTARPVACNDTACALGSETRCARDNKACAVLTAYGAGVIG-GVLGTEAFTFQPQSE 175
Query: 261 VKGIALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS---- 313
+A GC G G++G++GLG G LSL Q+ +YCL S ++
Sbjct: 176 NVSLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTSR 235
Query: 314 ---GVLEFNSARGGDAVTAPLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
G S+ G A + P ++N VD TFYY+ LTG +VG + +P + F++ +
Sbjct: 236 LFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQV 295
Query: 368 GDG---GIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFS-GLR 421
G G ++D G+ T L AY +LRD V+ G + P +G D C + G
Sbjct: 296 ATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDV 355
Query: 422 SVRVPTVSLHFGAGKA-LDLPAKNYLIPVDSAGTFCFAFA---PTSS----ALSIIGNVQ 473
VP + LHFG+G + +P +NY PVD + F+ P S+ +IIGN
Sbjct: 356 GKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYM 415
Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
QQ + +DL + F P C
Sbjct: 416 QQDMHLLYDLEKGMLSFQPADC 437
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 175/387 (45%), Gaps = 45/387 (11%)
Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC------RPCTECYQ---Q 196
P+ A G G+Y VGTP ++F +V DTGSD+ W+ C R C+ +
Sbjct: 70 VPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIR 129
Query: 197 SDPIFDPKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLV 249
+F SSS+ +PC CK SL C Y Y DGS +G
Sbjct: 130 HKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFA 189
Query: 250 TETVSF----GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT----- 299
ETV+ G + + +GC +G F + G++GLG S IKA
Sbjct: 190 NETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFA--IKAAEKFGG 247
Query: 300 SLAYCLVDRDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQ 353
+YCLVD S S L F S+R +A+ + + V++FY V + G S+GG
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGA 307
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN----SLRDSFVRLAGNLKPTSGVA 409
++IP ++ D G GG I+D G+++T L AY +LR S ++ K +
Sbjct: 308 MLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFR---KVEMDIG 362
Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSI 468
+ C++ +G VP + HF G + P K+Y+I + G C F + S+
Sbjct: 363 PLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSV 421
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GN+ QQ FDL ++GF P+ C
Sbjct: 422 VGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 100/276 (36%), Positives = 141/276 (51%), Gaps = 21/276 (7%)
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
C Y + YGDGSFT G+L E + FG VK GCG +N+GLF G +GL+GLG LS
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGTI-LVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 191
Query: 292 LTKQ---IKATSLAYCL--VDRDSPASGVLEFNSA--RGGDAVT-APLIRNKKVDTFYYV 343
L Q I +YCL +R S +L NS+ R ++ A +I N ++ FY++
Sbjct: 192 LISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFI 251
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
LTG S+GG A+Q P G I+VD GT ITRL Y +L+ F++
Sbjct: 252 NLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 304
Query: 404 PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAP 461
P ++ DTC++ S + V +PT+ +HF L D+ Y + D A C A A
Sbjct: 305 PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD-ASQVCLALAS 363
Query: 462 TS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++I+GN QQ+ RV +D +VGF C
Sbjct: 364 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETC 399
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 153/312 (49%), Gaps = 22/312 (7%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
SQ G+Y + +G PP +DTGSD+ W++C PC C P++DP S S L
Sbjct: 81 SQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKL 140
Query: 213 PCAAPQCKSLDVSACRANRCL-------YQVAYGDGS--FTVGDLVTETVSFGNSGSVKG 263
PC++ C++L +++C Y AYG T G L TET +FG+
Sbjct: 141 PCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANN 200
Query: 264 IALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS-- 320
++ G +G F G+AGL+GLG G LSL Q+ A AYCL + S +L F S
Sbjct: 201 VSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTIL-FGSLA 259
Query: 321 ---ARGGDAVTAPLIRNKK--VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
GD + PL+ N K DT YYV L G SVGG + I F ++ G GG+ D
Sbjct: 260 ALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFD 319
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVSLHFGA 434
G T L+ AY +R + L +G DTC+ + ++V ++P + LHF
Sbjct: 320 SGAIDTSLKDAAYQVVRQAITSEIQRLGYDAG---DDTCFVAANQQAVAQMPPLVLHFDD 376
Query: 435 GKALDLPAKNYL 446
G + L +NYL
Sbjct: 377 GADMSLNGRNYL 388
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 160/351 (45%), Gaps = 25/351 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTP +Q + +DT +D W+ C C C S F+P S+SY P+PC +PQ
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 164
Query: 219 CKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C +C N C + ++Y D S L +T++ VK GC G
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSYADSSLQAA-LSQDTLAVAGD-VVKAYTFGCLQRATGTA 222
Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
G GL LS TK + + +YCL S SG L R G T
Sbjct: 223 APPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLG--RNGQPRRIKTT 280
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV +TG VG + V IP S D A G ++D GT TRL Y
Sbjct: 281 PLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYL 340
Query: 390 SLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
+LRD R G S + FDTCY+ +V P V+L F G + LP +N +I
Sbjct: 341 ALRDEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIH 395
Query: 449 VDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T C A A ++ L++I ++QQQ RV FD+ N RVGF C
Sbjct: 396 TTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 175/373 (46%), Gaps = 45/373 (12%)
Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
++G+G+ + S ++DTGS+ +QC +S P+FDP S SY +PC + C +
Sbjct: 103 QLGIGSLQKNLSAIIDTGSEAVLVQCGS------RSRPVFDPAASQSYRQVPCISQLCLA 156
Query: 222 LDVSACRANR---------CLYQVAYGDGSFTVGDL------VTETVSFGNSGSVKGIAL 266
+ + C Y ++YGD + GD + T S G + + +A
Sbjct: 157 VQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAF 216
Query: 267 GCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIK----ATSLAYCLVDR--DSPASGVLEF 318
GC H +G V GS G++G G LSL Q+K + +YC + A+GV+
Sbjct: 217 GCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFL 276
Query: 319 NSARGGDAVTA--PLIRNKKV---DTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDGGI 372
+ + PL+ N YYVGLT SV G+ + IP S F++D GDGG
Sbjct: 277 GDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 336
Query: 373 IVDCGTAITRLQTQAYNSLRDSFV--RLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVS 429
++D GT TR+ AY + R++F +G K A FD CY+ S S+ VP V
Sbjct: 337 VLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVR 396
Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSA----LSIIGNVQQQGTRVSFD 482
L L+L ++ +PV +AG T C A + + ++++GN QQ V +D
Sbjct: 397 LSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYD 456
Query: 483 LANNRVGFTPNKC 495
+RVGF C
Sbjct: 457 NERSRVGFERADC 469
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 120/404 (29%), Positives = 189/404 (46%), Gaps = 58/404 (14%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP---- 199
F+ P+ SGA G+G+YF R VGTP + F ++ DTGSD+ W++CR +
Sbjct: 95 FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPA 154
Query: 200 -----------IFDPKTSSSYSPLPCAAPQCKS---LDVSACRANR--CLYQVAYGDGSF 243
+F P S ++SP+PC++ CKS ++ C ++ C Y Y D S
Sbjct: 155 AAPSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSA 214
Query: 244 TVGDLVTETVSFGNS------------GSVKGIALGC--GHDNEGLFVGSAGLLGLGGGM 289
G + T++ + S ++G+ LGC H +G F S G+L LG
Sbjct: 215 ARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQG-FEASDGVLSLGYSN 273
Query: 290 LSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA---------PLIRNKKV 337
+S + + +YCLVD +P + G DA ++ PL+ + +V
Sbjct: 274 ISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARV 333
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
FY V + SV G A+ IP ++ D +GG I+D GT++T L T AY ++ +
Sbjct: 334 RPFYAVAVDSVSVDGVALDIPAEVW--DVGSNGGTIIDSGTSLTVLATPAYKAVVAALSE 391
Query: 398 LAGNLKPTSGVALFDTCYDFS----GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA- 452
L P + FD CY+++ G + VP +++ F L+ PAK+Y+I D+A
Sbjct: 392 QLAGL-PRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVI--DAAP 448
Query: 453 GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
G C + +S+IGN+ QQ FDL N + F C
Sbjct: 449 GVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 117/375 (31%), Positives = 172/375 (45%), Gaps = 38/375 (10%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLP 213
G +Y + +G PP++ ++DTGS++ W QC C C++Q+ P +DP S + +
Sbjct: 67 GQSQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVG 126
Query: 214 CAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
C C + C ++ C YG G+ G L TE ++F S +V + GC
Sbjct: 127 CNDAACALGSETQCLSDNKTCAVVTGYGAGNI-AGTLATENLTF-QSETVS-LVFGCIVV 183
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSP------ASGVLEF 318
+ G G++G++GLG G LSL Q+ T +YCL D P AS L
Sbjct: 184 TKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRFSYCLTPYFEDTIEPSHMVVGASAGLIN 243
Query: 319 NSARGGDAVTAPLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG---GI 372
SA T P +R+ D TFYY+ LTG + G + +P + F++ + G G
Sbjct: 244 GSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWTGT 303
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
+D G +T L AY +LR R G ++P +G FD C + VP + L
Sbjct: 304 FIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALKDAERL-VPPLVL 362
Query: 431 HFGAGKA----LDLPAKNYLIPVDSAGTFCFAFAPTS------SALSIIGNVQQQGTRVS 480
HFG G L +P NY PVDSA F+ + ++IGN QQ V
Sbjct: 363 HFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVL 422
Query: 481 FDLANNRVGFTPNKC 495
+DLA + F P C
Sbjct: 423 YDLAGGVLSFQPADC 437
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/349 (31%), Positives = 161/349 (46%), Gaps = 18/349 (5%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
S Y + GTPP+ + LDT SD W+ C C C S P F P S+S+ + C
Sbjct: 94 SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAPIKSTSFRNVSCG 151
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+P CK + C + C + YG S +V +T++ + + G GC + G
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSI-AASVVQDTLTLA-TDPIPGYTFGCVNKTTGS 209
Query: 276 FV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-AP 330
G GL +LS ++ + ++ +YCL S SG L + P
Sbjct: 210 SAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPVYQPKRIKYTP 269
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
L+RN + + YYV L VG + V IPP+ + G I D GT TRL Y +
Sbjct: 270 LLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTA 329
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
+R+ F R G P + + FDTCY+ + VPT++ F +G + LP N +I
Sbjct: 330 VRNEFRRRVGPKLPVTTLGGFDTCYNV----PIVVPTITFLF-SGMNVTLPPDNIVIHST 384
Query: 451 SAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ T C A A +S L++I N+QQQ RV FD+ N+R+G C
Sbjct: 385 AGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 160/351 (45%), Gaps = 25/351 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTP +Q + +DT +D W+ C C C S F+P S+SY P+PC +PQ
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 111
Query: 219 CKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C +C N C + ++Y D S L +T++ VK GC G
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYADSSLQAA-LSQDTLAVAGD-VVKAYTFGCLQRATGTA 169
Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
G GL LS TK + + +YCL S SG L R G T
Sbjct: 170 APPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLG--RNGQPRRIKTT 227
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV +TG VG + V IP S D A G ++D GT TRL Y
Sbjct: 228 PLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYL 287
Query: 390 SLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
+LRD R G S + FDTCY+ +V P V+L F G + LP +N +I
Sbjct: 288 ALRDEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIH 342
Query: 449 VDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T C A A ++ L++I ++QQQ RV FD+ N RVGF C
Sbjct: 343 TTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 115/416 (27%), Positives = 179/416 (43%), Gaps = 56/416 (13%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGAS---QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
R L ++LP VV + GEY ++G+GTP F+ +DT SD+ W
Sbjct: 55 RDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCFTAAIDTASDLIWT 114
Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-------RANRCLYQVAY 238
QC+PC +CY+Q DP+F+P S+SY+ +PC + C LD C + C Y +Y
Sbjct: 115 QCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSY 174
Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTK 294
G + T G L + ++ G+ +G+ GC + G G +G++GLG G LSL
Sbjct: 175 GGNATTRGILAVDRLAIGDD-VFRGVVFGCSSSSVG---GPPPQVSGVVGLGRGALSLVS 230
Query: 295 QIKATSLAYCLVDRDSPASGVLEFNS------ARGGDAVTAPLIRNKKVDTFYYVGLTGF 348
Q+ YCL S ++G L + + V P+ + ++YY+ L G
Sbjct: 231 QLSVRRFMYCLPPPVSRSAGRLVLGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGI 290
Query: 349 SVGGQAVQIPPSLFEMDEAGDG--------------------------GIIVDCGTAITR 382
S+G +A+ S M+ G G+I+D + IT
Sbjct: 291 SIGDRAMSF-RSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITF 349
Query: 383 LQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
L+ Y + D +RL G+ L + + V P VSL F G L
Sbjct: 350 LEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAF-EGVWLR 408
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L + + ++G C T +SI+GN QQQ +V ++L R+ F C
Sbjct: 409 LDKEQMFVEDRASGMMCLMVGKT-DGVSILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/350 (30%), Positives = 167/350 (47%), Gaps = 21/350 (6%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC-AAP 217
+ + I +G PP +++DTGSD+ W+QC PC +CY Q+ P F P SS+Y C +AP
Sbjct: 88 FLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTYRNASCESAP 146
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNE 273
+ C Y + Y D S T G L E ++F S S I GCG DN
Sbjct: 147 HAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNS 206
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYC---LVDRDSPASGVLEFNSAR-GGDAVTA 329
G F +G+LGLG G S+ + + +YC L+D P + ++ N AR GD
Sbjct: 207 G-FTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLIDPTYPHNFLILGNGARIEGDPTPL 265
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
+ +++ YY+ L S+G + + I P +F+ + GG ++D G + T L +AY
Sbjct: 266 QIFQDR-----YYLDLQAISLGEKLLDIEPGIFQRYRS-KGGTVIDTGCSPTILAREAYE 319
Query: 390 SLRDSFVRLAGNL--KPTSGVALFDTCYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNYL 446
+L + L G + + + CY+ + L P V+ HF G L L ++
Sbjct: 320 TLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLF 379
Query: 447 IPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +S +FC A T +S+IG + QQ V ++L +V F C
Sbjct: 380 VSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 109/349 (31%), Positives = 161/349 (46%), Gaps = 18/349 (5%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
S Y + GTPP+ + LDT SD W+ C C C S P F P S+S+ + C
Sbjct: 94 SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAPIKSTSFRNVSCG 151
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+P CK + C + C + YG S +V +T++ + + G GC + G
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSI-AASVVQDTLTLA-ADPIPGYTFGCVNKTTGS 209
Query: 276 FV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-AP 330
G GL +LS ++ + ++ +YCL S SG L + P
Sbjct: 210 SAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPVYQPKRIKYTP 269
Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
L+RN + + YYV L VG + V IPP+ + G I D GT TRL Y +
Sbjct: 270 LLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTA 329
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
+R+ F R G P + + FDTCY+ + VPT++ F +G + LP N +I
Sbjct: 330 VRNEFRRRVGPKLPVTTLGGFDTCYNV----PIVVPTITFLF-SGMNVALPPDNIVIHST 384
Query: 451 SAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ T C A A +S L++I N+QQQ RV FD+ N+R+G C
Sbjct: 385 AGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 144/330 (43%), Gaps = 61/330 (18%)
Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
M +DT D+ W+QC PC ECY Q + +FDP+ S + + +PC + C L + C
Sbjct: 166 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 225
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
N+C Y V YGDG T G + + ++ S V GC H G F S G M
Sbjct: 226 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTS-----GTM 280
Query: 290 LSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV-DTFYYVGLTGF 348
+ T PL+RN + T Y V L G
Sbjct: 281 FART------------------------------------PLVRNPSIIPTLYLVRLRGI 304
Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSG 407
VGG+ + +PP +F GG ++D IT+L AY +LR +F +A + G
Sbjct: 305 EVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGG 358
Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
A DTCYDF SV VP VSL F G + L A ++ C AF PT A
Sbjct: 359 RAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFA 412
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L IGNVQQQ V +D+ VGF C
Sbjct: 413 LGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 117/350 (33%), Positives = 160/350 (45%), Gaps = 27/350 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTPP+Q + +DT +D +W+ C C C S FDP +S+SY +PC +P
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPL 171
Query: 219 CKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C +AC C + + Y D S GN +VK GC G
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGN--AVKAYTFGCLQRATGTA 229
Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
G GL LS TK + + +YCL S SG L R G T
Sbjct: 230 APPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLG--RNGQPQRIKTT 287
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV +TG VG + V IP D A G ++D GT TRL AY
Sbjct: 288 PLLANPHRSSLYYVNMTGIRVGRKVVPIP----AFDPATGAGTVLDSGTMFTRLVAPAYV 343
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
++RD R G P S + FDTC++ + +V P V+L F G + LP +N +I
Sbjct: 344 AVRDEVRRRVG--APVSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQVTLPEENVVIHS 397
Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A A ++ L++I ++QQQ RV FD+ N RVGF +C
Sbjct: 398 TYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 120/398 (30%), Positives = 188/398 (47%), Gaps = 50/398 (12%)
Query: 128 DRHELKPAEAQILPEDFSTPVVSGASQG--SGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
D+ L+ +ILPE + P+ SG +G Y++RI +GTPP+QF + +DTGSD+ W+
Sbjct: 20 DQRRLR----RILPEVVAFPI-SGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWV 74
Query: 186 QCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN--RCLYQVAY 238
C PCT C + S+ IFDP+ S+S + + C +C S C N C Y Y
Sbjct: 75 NCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLY 134
Query: 239 GDGSFTVGDLVTETVSF-----GNSGSVKGIA---LGCGHDNEGLFVGSAGLLGLGGGML 290
GDGS T G L+ + +SF GNS + G A GCG + G ++ + GL+G G +
Sbjct: 135 GDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWL-TDGLVGFGQAEV 193
Query: 291 SLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
SL Q+ ++ A+CL D+ SG L R V P++ + + Y V L
Sbjct: 194 SLPSQLSKQNVSVNIFAHCL-QGDNKGSGTLVIGHIREPGLVYTPIVPKQ---SHYNVEL 249
Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
V G V P + D + GG+I+D GT +T L AY+ + ++
Sbjct: 250 LNIGVSGTNVTTPTAF---DLSNSGGVIMDSGTTLTYLVQPAYDQFQ-------AKVRDC 299
Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL---IPVDSAGTFCFAFAPT 462
+ + F P V+L+F G A+ L +YL + +CF++ +
Sbjct: 300 MRSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLES 359
Query: 463 SS-----ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+S + +I G+ + V +D NNR+G+ C
Sbjct: 360 TSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDC 397
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 124/401 (30%), Positives = 188/401 (46%), Gaps = 54/401 (13%)
Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDP--- 199
F P+ SGA G G+YF R VGTP + F +V DTGSD+ W++C RP +
Sbjct: 79 FEMPLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGR 138
Query: 200 IFDPKTSSSYSPLPCAAPQC-KSLDVSACR----ANRCLYQVAYGDGSFTVGDLVTETVS 254
F P+ S +++P+ CA+ C KSL S + C Y Y DGS G + TE+ +
Sbjct: 139 AFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESAT 198
Query: 255 FGNSG--------SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIK---ATSLA 302
SG +KG+ LGC G F S G+L LG +S A +
Sbjct: 199 IALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFS 258
Query: 303 YCLVDRDSP--ASGVLEFN-----------------------SARGGDAVTAPLIRNKKV 337
YCLVD SP A+ L F A PL+ ++++
Sbjct: 259 YCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRM 318
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
FY V + SV GQ ++IP +++++D GG+I+D GT++T L AY ++ +
Sbjct: 319 RPFYDVAVKAVSVAGQFLKIPRAVWDVDAG--GGVILDSGTSLTVLAKPAYRAVVAALSE 376
Query: 398 LAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTF 455
L P + F+ CY+++ V +P +++HF L+ P K+Y+I D+A G
Sbjct: 377 GLAGL-PRVTMDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVI--DAAPGVK 433
Query: 456 CFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C +S+IGN+ QQ FD+ N R+ F ++C
Sbjct: 434 CIGLQEGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 144/330 (43%), Gaps = 61/330 (18%)
Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
M +DT D+ W+QC PC ECY Q + +FDP+ S + + +PC + C L + C
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 207
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
N+C Y V YGDG T G + + ++ S V GC H G F S G M
Sbjct: 208 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTS-----GTM 262
Query: 290 LSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV-DTFYYVGLTGF 348
+ T PL+RN + T Y V L G
Sbjct: 263 FART------------------------------------PLVRNPSIIPTLYLVRLRGI 286
Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSG 407
VGG+ + +PP +F GG ++D IT+L AY +LR +F +A + G
Sbjct: 287 EVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGG 340
Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
A DTCYDF SV VP VSL F G + L A ++ C AF PT A
Sbjct: 341 RAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFA 394
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L IGNVQQQ V +D+ VGF C
Sbjct: 395 LGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 144/330 (43%), Gaps = 61/330 (18%)
Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
M +DT D+ W+QC PC ECY Q + +FDP+ S + + +PC + C L + C
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 207
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
N+C Y V YGDG T G + + ++ S V GC H G F S G M
Sbjct: 208 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTS-----GTM 262
Query: 290 LSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV-DTFYYVGLTGF 348
+ T PL+RN + T Y V L G
Sbjct: 263 FART------------------------------------PLVRNPSIIPTLYLVRLRGI 286
Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSG 407
VGG+ + +PP +F GG ++D IT+L AY +LR +F +A + G
Sbjct: 287 EVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGG 340
Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
A DTCYDF SV VP VSL F G + L A ++ C AF PT A
Sbjct: 341 RAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFA 394
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L IGNVQQQ V +D+ VGF C
Sbjct: 395 LGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/362 (32%), Positives = 171/362 (47%), Gaps = 47/362 (12%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK-- 220
+GTPP+ MVLDTGS ++W+QC +++ P FDP SS++S LPC P CK
Sbjct: 81 IGTPPQTQPMVLDTGSQLSWIQC------HKKQPPTASFDPSLSSTFSILPCTHPLCKPR 134
Query: 221 ----SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+L S C NR C Y Y DG++ G+LV E +F S S + LGC ++
Sbjct: 135 IPDFTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLILGCATES--- 190
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS-----PASGVLEFN--SARGGDAVT 328
G+LG+ G LS KQ K T +YC+ R + P N S++G V
Sbjct: 191 -TDPRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVG 249
Query: 329 APLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
+++ F Y + + G + G+ + I P++F D G G ++D G+ T L
Sbjct: 250 MMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSEFTYLV 309
Query: 385 TQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR----VPTVSLHFGAGK 436
++AY+ +R VR G LK GVA D C+D +++V + + F G
Sbjct: 310 SEAYDKVRAQVVRAVGPRLKKGYVYGGVA--DMCFD--SVKAVEIGRLIGEMVFEFERGV 365
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ +P + L V G C + +A +IIGN QQ V FDL RVGF
Sbjct: 366 EVVIPKERVLADV-GGGVHCVGIGSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKA 424
Query: 494 KC 495
C
Sbjct: 425 DC 426
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 118/369 (31%), Positives = 171/369 (46%), Gaps = 46/369 (12%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
+ VG+PP+ +MVLDTGS+++WL C+ + +FDP SSSYSP+PC +P C+
Sbjct: 67 LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTR 122
Query: 221 ----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
S+ VS + C ++Y D S G+L ++T GNS ++ GC N
Sbjct: 123 TRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-AIPATIFGCMDSGFSSN 181
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
+ GL+G+ G LS Q+ +YC+ +DS SG+L F +
Sbjct: 182 SDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQDS--SGILLFGESSFSWLKALKYT 239
Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
++ PL +V Y V L G V +Q+P S++ D G G +VD GT T
Sbjct: 240 PLVQISTPLPYFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 297
Query: 383 LQTQAYNSLRDSFVR-LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--VPTVSLHF-G 433
L Y +L++ FVR +LK D CY R +PTV+L F G
Sbjct: 298 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 357
Query: 434 AGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
A ++ Y +P S +CF F S L IIG+ QQ + FDLA +
Sbjct: 358 AEMSVSAERLMYRVPGVIRGSDSVYCFTFG-NSELLGVESYIIGHHHQQNVWMEFDLAKS 416
Query: 487 RVGFTPNKC 495
RVGF +C
Sbjct: 417 RVGFAEVRC 425
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 81/214 (37%), Positives = 124/214 (57%), Gaps = 12/214 (5%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D ARV TL ++L + L + + P+ S P+ GAS GSG Y+ ++G
Sbjct: 66 LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIR-FPKSVSVPLNPGASIGSGNYYVKVGF 124
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
G+P R +SM++DTGS ++WLQC+PC C+ Q+DP+FDP S +Y L C + QC SL
Sbjct: 125 GSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVD 184
Query: 223 -----DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+ +N C+Y +YGD S+++G L + ++ S ++ G GCG D++GLF
Sbjct: 185 ATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244
Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
+AG+LGLG LS+ Q+ + + +YCL R
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR 278
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/360 (31%), Positives = 165/360 (45%), Gaps = 35/360 (9%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
+ I +G+PP + +DT SD+ WLQCRPC CY QS PIFDP S ++ C Q
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144
Query: 219 --CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG------NSGSVKGIALGCGH 270
SL +A + C Y + Y DG+ + G L E + F +S ++ + GCGH
Sbjct: 145 YSMPSLRFNA-KTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGH 203
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAV- 327
DN G + G+LGLG G SL + T +YC D P+ VL G D
Sbjct: 204 DNYGEPLVGTGILGLGYGEFSLVHRF-GTKFSYCFGSLDDPSYPHNVL----VLGDDGAN 258
Query: 328 ----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDGGIIVDCGTAITR 382
T PL + + FYYV + SV G + I P +F + + G GG I+D G ++T
Sbjct: 259 ILGDTTPL---EIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTS 315
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDT----CYDFSGLRSVR---VPTVSLHFGAG 435
L +AY L++ + V D CY+ + R + P V+ HF G
Sbjct: 316 LVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDG 375
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L L K+ + + S FC A P + ++ IG QQ + +DL ++ F C
Sbjct: 376 AELSLDVKSVFMKL-SPNVFCLAVTPGN--MNSIGATAQQSYNIGYDLEAKKISFERIDC 432
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 118/369 (31%), Positives = 171/369 (46%), Gaps = 46/369 (12%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
+ VG+PP+ +MVLDTGS+++WL C+ + +FDP SSSYSP+PC +P C+
Sbjct: 60 LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTR 115
Query: 221 ----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
S+ VS + C ++Y D S G+L ++T GNS ++ GC N
Sbjct: 116 TRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-AIPATIFGCMDSGFSSN 174
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
+ GL+G+ G LS Q+ +YC+ +DS SG+L F +
Sbjct: 175 SDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQDS--SGILLFGESSFSWLKALKYT 232
Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
++ PL +V Y V L G V +Q+P S++ D G G +VD GT T
Sbjct: 233 PLVQISTPLPYFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 290
Query: 383 LQTQAYNSLRDSFVR-LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--VPTVSLHF-G 433
L Y +L++ FVR +LK D CY R +PTV+L F G
Sbjct: 291 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 350
Query: 434 AGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
A ++ Y +P S +CF F S L IIG+ QQ + FDLA +
Sbjct: 351 AEMSVSAERLMYRVPGVIRGSDSVYCFTFG-NSELLGVESYIIGHHHQQNVWMEFDLAKS 409
Query: 487 RVGFTPNKC 495
RVGF +C
Sbjct: 410 RVGFAEVRC 418
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 178/373 (47%), Gaps = 45/373 (12%)
Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
++G+G+ + S ++DTGS+ +QC +S P+FDP S SY +PC + C +
Sbjct: 2 QLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLA 55
Query: 222 LDVSACRANR---------CLYQVAYGDGSFTVGDLVTETVSFGNSGS------VKGIAL 266
+ + C Y ++YGD + GD + + ++ S + +A
Sbjct: 56 VQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAF 115
Query: 267 GCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIK----ATSLAYCLVDR--DSPASGVLEF 318
GC H +G V GS G++G G LSL Q+K + +YC + A+GV+
Sbjct: 116 GCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFL 175
Query: 319 -NSARGGDAVT-APLIRNKKV---DTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGI 372
+S V+ PL+ N YYVGLT SV G+ + IP S F++D + GDGG
Sbjct: 176 GDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 235
Query: 373 IVDCGTAITRLQTQAYNSLRDSFV--RLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVS 429
++D GT TR+ AY + R++F +G K A FD CY+ S S+ VP V
Sbjct: 236 VLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVR 295
Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSA----LSIIGNVQQQGTRVSFD 482
L L+L ++ +PV +AG T C A + + ++++GN QQ V +D
Sbjct: 296 LSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYD 355
Query: 483 LANNRVGFTPNKC 495
+RVGF C
Sbjct: 356 NERSRVGFERADC 368
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/404 (29%), Positives = 183/404 (45%), Gaps = 53/404 (13%)
Query: 104 SRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRI 163
++++R S+ +N I +++ Y P + Q +P + G+G Y
Sbjct: 47 TQIQRISSILNYSINRVR---YLNHVFSFSPNKIQDVPLS--------SFMGAG-YVMSY 94
Query: 164 GVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
+GTPP Q ++DTG+D W QC+PC C Q+ P+F P SS+Y +PC +P CK+
Sbjct: 95 SIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPICKN-- 152
Query: 224 VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGL 282
DG + D +T + G S K I +GCGH N+G G +G
Sbjct: 153 ---------------ADGHYLGVDTLTLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGN 197
Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEF---NSARGGDAVTAPLIRN 334
+GL G LS Q+ ++ +YCLV S S L F ++ G V+ P+
Sbjct: 198 IGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPI--- 254
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
K + Y+V L FSVG +++ S D G+ I+D GT +T L Y+ L
Sbjct: 255 -KEENGYFVSLEAFSVGDHIIKLENS----DNRGNS--IIDSGTTMTILPKDVYSRLESV 307
Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
+ + + F+ CY + + +V ++ HF +G + L A N P+
Sbjct: 308 VLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHF-SGSEVHLNALNTFYPITDE- 365
Query: 454 TFCFAFAP--TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
CFAF S+L+I GNV QQ V FDL + F P C
Sbjct: 366 VICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDC 409
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 129/413 (31%), Positives = 198/413 (47%), Gaps = 43/413 (10%)
Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
+ R +++ +LA Y + +L+ + D S PV Q EY +G
Sbjct: 44 EERVRRAVAVSRERLA-YTQQQQQLRASG------DVSAPVHLATRQYIAEYL----IGD 92
Query: 168 PPRQFSMVLDTGSDINWLQC-RPC--TECYQQSDPIFDPKTSSSYSPLPCA--APQCKSL 222
PP++ + ++DTGS++ W QC C C +Q P ++ SS+++ +PCA A C +
Sbjct: 93 PPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAAN 152
Query: 223 DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDNEGLFVG 278
V C + C + +YG GS G L TE +F SG+ K + GC +G G
Sbjct: 153 GVHLCGLDGSCTFAASYGAGS-VFGSLGTEAFTF-QSGAAK-LGFGCVSLTRITKGALNG 209
Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSAR----GGDAVTA-PL 331
++GL+GLG G LSL Q AT +YCL R+ AS L ++ GG AVT+ P
Sbjct: 210 ASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPF 269
Query: 332 IRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG----DGGIIVDCGTAITRLQ 384
+++ + TFYY+ L G SVG + IP + FE+ GG+I+D G+ +T L
Sbjct: 270 VKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLA 329
Query: 385 TQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
AY++L D R ++P + L D C + V VP + HFG G + + A
Sbjct: 330 EAAYSALSDEVARQLNRSLVQPPADTGL-DLCVARQDVDKV-VPVLVFHFGGGADMAVSA 387
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+Y PVD + T C ++IGN QQQ + +D+ + F C
Sbjct: 388 GSYWGPVDKS-TACM-LIEEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADC 438
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/365 (30%), Positives = 171/365 (46%), Gaps = 28/365 (7%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y ++ +GTP + + +DT SD+ W+ C C C S+ F P
Sbjct: 86 PIASGRQMLQSTTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAK 143
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+S+ + C+APQCK + AC A C + + YG S +L +T+ + +K
Sbjct: 144 STSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSSI-AANLSQDTIRLA-ADPIKAFT 201
Query: 266 LGCGHDNEGLFV-----GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFN 319
GC + G G GL ++S + + ++ +YCL R SG L
Sbjct: 202 FGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLG 261
Query: 320 SARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
V L+RN + + YYV L VG + V +PP+ + + G I D GT
Sbjct: 262 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 321
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHFGA 434
TRL Y ++R+ F + +KP + V FDTCY SG V+VPT++ F
Sbjct: 322 VYTRLAKPVYEAVRNEFRK---RVKPPTAVVTSLGGFDTCY--SG--QVKVPTITFMF-K 373
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGF 490
G + +PA N ++ + T C A A +S +++I ++QQQ RV D+ N R+G
Sbjct: 374 GVNMTMPADNLMLHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGL 433
Query: 491 TPNKC 495
+C
Sbjct: 434 ARERC 438
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 170/365 (46%), Gaps = 28/365 (7%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y + +GTP + + +DT SD+ W+ C C C S+ F P
Sbjct: 86 PIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAK 143
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+S+ + C+APQCK + C A C + + YG S +L +T+ + +K
Sbjct: 144 STSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSI-AANLSQDTIRLA-ADPIKAFT 201
Query: 266 LGCGHDNEGLFV-----GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFN 319
GC + G G GL ++S + I ++ +YCL R SG L
Sbjct: 202 FGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 261
Query: 320 SARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
V L+RN + + YYV L VG + V +PP+ + + G I D GT
Sbjct: 262 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 321
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHFGA 434
TRL Y ++R+ F + +KPT+ V FDTCY SG V+VPT++ F
Sbjct: 322 VYTRLAKPVYEAVRNEFRK---RVKPTTAVVTSLGGFDTCY--SG--QVKVPTITFMF-K 373
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGF 490
G + +PA N ++ + T C A A +S +++I ++QQQ RV D+ N R+G
Sbjct: 374 GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGL 433
Query: 491 TPNKC 495
+C
Sbjct: 434 ARERC 438
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 173/381 (45%), Gaps = 45/381 (11%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC------RPCTECYQ---QSDPIFD 202
A G G+Y VGTP ++F +V DTGSD+ W+ C R C+ + +F
Sbjct: 5 ADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 64
Query: 203 PKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
SSS+ +PC CK SL C Y Y DGS +G ETV+
Sbjct: 65 ANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTV 124
Query: 256 ----GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT-----SLAYCL 305
G + + +GC +G F + G++GLG S IKA +YCL
Sbjct: 125 ELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFA--IKAAEKFGGKFSYCL 182
Query: 306 VDRDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQAVQIPP 359
VD S S L F S+R +A+ + + V++FY V + G S+GG ++IP
Sbjct: 183 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 242
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYN----SLRDSFVRLAGNLKPTSGVALFDTCY 415
++ D G GG I+D G+++T L AY +LR S ++ K + + C+
Sbjct: 243 EVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFR---KVEMDIGPLEYCF 297
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQ 474
+ +G VP + HF G + P K+Y+I + G C F + S++GN+ Q
Sbjct: 298 NSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSVVGNIMQ 356
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
Q FDL ++GF P+ C
Sbjct: 357 QNHLWEFDLGLKKLGFAPSSC 377
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 166/357 (46%), Gaps = 35/357 (9%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y + +GTPP+ S V+D ++ W QC+ C+ C++Q P+FDP S++Y PC P
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 218 QCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHD 271
C+S+ D C N C YQ + G T G + T+T + G + + +A GC D
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGTAKA--SLAFGCVVASDID 166
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAV 327
G G +G++GLG SL Q + +YCL D+ + L S A GG A
Sbjct: 167 TMG---GPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGRNSALFLGSSAKLAGGGKAA 223
Query: 328 TAPLI----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
+ P + + +Y V L G G + +PPS +++D + I+ L
Sbjct: 224 STPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFL 275
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
AY +++ + G + V FD C+ SG S P + F G A+ +PA
Sbjct: 276 VDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPAT 334
Query: 444 NYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NYL+ + GT C A +++ LS++G++QQ+ FDL + F P C
Sbjct: 335 NYLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 130/456 (28%), Positives = 205/456 (44%), Gaps = 57/456 (12%)
Query: 60 PFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITK 119
PF E S+T SSF++ L + +N S+ S+L R++A +
Sbjct: 19 PFTEPSKTP---------SSFTIDLIHHDSPPSPFYN--SSMTRSQLIRNAAMRSISRAN 67
Query: 120 LQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
+ ++LK + PE P +G Y RI +GTP + + DTG
Sbjct: 68 QLSLSLSHSLNQLKESS----PEPIIIP-------NNGNYLMRIYIGTPSVERLAIADTG 116
Query: 180 SDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS--ACR-ANRCLY 234
SD+ W+QC PC T+C+ Q+ P++DP SS+++ LPC + C L S C C+Y
Sbjct: 117 SDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGDCIY 176
Query: 235 QVAYGDGSFTVGDLVTETVSFG--NSGSVKGIALGCGHDNEGLFVG-----SAGLLGLGG 287
YGD S++ G L ++++ I GCG N+ F + G++GLG
Sbjct: 177 AYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFGCGFQNK--FTADKSGKTTGIVGLGA 234
Query: 288 GMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFY 341
G LSL Q+ +YCL+ S ++ L+F A +G V+ PLI + FY
Sbjct: 235 GPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFGEAAIVQGNGVVSTPLIIKPDL-PFY 293
Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
Y+ L G +VG + V+ + DG II+D G+ +T L+ YN S V+
Sbjct: 294 YLNLEGITVGAKTVKTGQT--------DGNIIIDSGSTLTYLEESFYNEFV-SLVKETVA 344
Query: 402 LKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
++ + FD C+ + S P V HF G + P ++ D+ C
Sbjct: 345 VEEDQYIPYPFDFCFTYKEGMSTP-PDVVFHFTGGDVVLKPMNTLVLIEDNL--ICSTVV 401
Query: 461 PTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P+ ++I GN+ Q V +D+ +V F P C
Sbjct: 402 PSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDC 437
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 170/365 (46%), Gaps = 28/365 (7%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y + +GTP + + +DT SD+ W+ C C C S+ F P
Sbjct: 102 PIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAK 159
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+S+ + C+APQCK + C A C + + YG S +L +T+ + +K
Sbjct: 160 STSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSI-AANLSQDTIRLA-ADPIKAFT 217
Query: 266 LGCGHDNEGLFV-----GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFN 319
GC + G G GL ++S + I ++ +YCL R SG L
Sbjct: 218 FGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 277
Query: 320 SARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
V L+RN + + YYV L VG + V +PP+ + + G I D GT
Sbjct: 278 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 337
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHFGA 434
TRL Y ++R+ F + +KPT+ V FDTCY SG V+VPT++ F
Sbjct: 338 VYTRLAKPVYEAVRNEFRK---RVKPTTAVVTSLGGFDTCY--SG--QVKVPTITFMF-K 389
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGF 490
G + +PA N ++ + T C A A +S +++I ++QQQ RV D+ N R+G
Sbjct: 390 GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGL 449
Query: 491 TPNKC 495
+C
Sbjct: 450 ARERC 454
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 166/357 (46%), Gaps = 35/357 (9%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y + +GTPP+ S V+D ++ W QC+ C C++Q P+FDP S++Y PC P
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTP 109
Query: 218 QCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHD 271
C+S+ DV C N C Y+ + G T G + T+T + G + + +A GC D
Sbjct: 110 LCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVGTAKA--SLAFGCVVASDID 166
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAV 327
G G +G++GLG SL Q + +YCL D+ + L S A GG A
Sbjct: 167 TMG---GPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAA 223
Query: 328 TAPLI----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
+ P + + +Y V L G G + +PPS +++D + I+ L
Sbjct: 224 STPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFL 275
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
AY +++ + G + V FD C+ SG S P + F G A+ +PA
Sbjct: 276 VDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPAT 334
Query: 444 NYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NYL+ + GT C A +++ LS++G++QQ+ FDL + F P C
Sbjct: 335 NYLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/360 (33%), Positives = 170/360 (47%), Gaps = 68/360 (18%)
Query: 171 QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-SLDVS---- 225
++++DTGSD+ W+QC+PC+ CY Q DP+FDP S+SY+ +PC A C+ SL +
Sbjct: 121 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 180
Query: 226 -AC----------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
+C ++ RC Y +AYGDGSF+ G L T+TV+ G + SV G GCG N G
Sbjct: 181 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDGFVFGCGLSNRG 239
Query: 275 LFV-GSA---------GLLGLGGGMLSL----TKQIKATSLAYCLVDRDSPASGVLEFNS 320
L GSA G G G LSL + AT ++Y
Sbjct: 240 LRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNATPVSY----------------- 282
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+I + FY++ +TG SVGG AV G +++D GT I
Sbjct: 283 --------TRMIADPAQPPFYFMNVTGASVGGAAVA-------AAGLGAANVLLDSGTVI 327
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
TRL Y ++R F R G + + +L D CY+ +G V+VP ++L AG +
Sbjct: 328 TRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADM 387
Query: 439 DLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ A L G+ C A A S IIGN QQ+ RV +D +R+GF C
Sbjct: 388 TVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 447
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 116/350 (33%), Positives = 159/350 (45%), Gaps = 27/350 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTPP+Q + +DT +D +W+ C C C S FDP S+SY +PC +P
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPL 171
Query: 219 CKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C +AC C + + Y D S GN +VK GC G
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGN--AVKAYTFGCLQRATGTA 229
Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
G GL LS TK + + +YCL S SG L R G T
Sbjct: 230 APPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLG--RNGQPQRIKTT 287
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ N + YYV +TG VG + V IP D A G ++D GT TRL AY
Sbjct: 288 PLLANPHRSSLYYVNMTGVRVGRKVVPIP----AFDPATGAGTVLDSGTMFTRLVAPAYV 343
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
++RD R G P S + FDTC++ + +V P ++L F G + LP +N +I
Sbjct: 344 AVRDEVRRRVG--APVSSLGGFDTCFNTT---AVAWPPMTLLFD-GMQVTLPEENVVIHS 397
Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A A ++ L++I ++QQQ RV FD+ N RVGF +C
Sbjct: 398 TYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 159/319 (49%), Gaps = 25/319 (7%)
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACR--------ANRCLYQVAYGDG----SFTVG 246
P+ P +SSS + + C C L C + C Y AYG+ +T G
Sbjct: 13 PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72
Query: 247 DLVTETVSFGN-SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL 305
L+TET +FG+ + + GIA GC +EG F +GL+GLG G LSL Q+ + Y L
Sbjct: 73 ILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRL 132
Query: 306 ---VDRDSPAS-GVLEFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIP 358
+ SP S G L + GD+ ++ PL+ N V FYYVGLTG SVGG+ VQIP
Sbjct: 133 SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIP 192
Query: 359 PSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF 417
F D + G GG+I D GT +T L AY +RD + G KP D
Sbjct: 193 SGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFT 252
Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQ 474
G + P++ LHF G +DL +NYL + + C++ +S AL+IIGN+ Q
Sbjct: 253 GGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQ 312
Query: 475 QGTRVSFDLANN-RVGFTP 492
V FDL+ N R+ F P
Sbjct: 313 MDFHVVFDLSGNARMLFQP 331
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 117/378 (30%), Positives = 180/378 (47%), Gaps = 50/378 (13%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G Y++RI +GTPPR F + +DTGSDI W+ C+PC C S FDP+ SS+ S
Sbjct: 38 AGLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTAS 97
Query: 211 PLPCAAPQCKS---LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFG--------NS 258
PL C +C S + S C +R C Y YGDGS T+G V++ + N+
Sbjct: 98 PLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNN 157
Query: 259 GSVKGIALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
S K I GC ++ G G+ G G LS+ Q+ + LA +CL D
Sbjct: 158 ASAK-ITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGAD 216
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
P G+L V P++ ++ Y + L G +V GQ + I P +F
Sbjct: 217 -PGGGILVLGEITEPGMVYTPIVPSQP---HYNLNLQGIAVNGQQLSIDPQVFATTNT-- 270
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKP--TSGVALFDTCYDFSGLRSVRVP 426
G I+DCGT + L +AY ++ + ++ + +P G F T + + P
Sbjct: 271 RGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSIDEI----FP 326
Query: 427 TVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAF------APTSSALSIIGNVQQQGT 477
+V+L+F G +DL K+YLI DS+ +C + A SS ++I+G++ +
Sbjct: 327 SVTLYF-EGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDK 385
Query: 478 RVSFDLANNRVGFTPNKC 495
+DL N R+G+T C
Sbjct: 386 VFVYDLENQRIGWTSFDC 403
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 173/363 (47%), Gaps = 21/363 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
+ P+ SG + G Y R+ +GTP + MVLDT +D ++ C CT C SD F PK
Sbjct: 86 TAPIASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPK 142
Query: 205 TSSSYSPLPCAAPQC---KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
S+SY PL C+ PQC + L A C + +Y SF+ LV +++ + +
Sbjct: 143 ASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDSLRLA-TDVI 200
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA-SGVLE 317
+ GC + G V + GLLGLG G LSL Q + +YCL S SG L+
Sbjct: 201 PNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLK 260
Query: 318 FNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
++ T PL+R+ + YYV TG SVG V P + G I+D
Sbjct: 261 LGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDS 320
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT ITR YN++R+ F + G TS + FDTC F P ++LHF G
Sbjct: 321 GTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTC--FVKTYETLAPPITLHF-EGL 376
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
L LP +N LI + C A A +S L++I N QQQ R+ FD NN+VG
Sbjct: 377 DLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAR 436
Query: 493 NKC 495
C
Sbjct: 437 EVC 439
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 119/373 (31%), Positives = 160/373 (42%), Gaps = 58/373 (15%)
Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
+EA I P PV S +GEY +I +GTPP + DTGSD+ W QC PC CY
Sbjct: 4 SEASISPNTPEPPV----SSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCY 59
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
+Q +P+FDP S+S+ + C + QC+ LD
Sbjct: 60 KQKNPMFDPSKSTSFKEVSCESQQCRLLDTPT---------------------------- 91
Query: 255 FGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-----SLAYCLVD- 307
S+ I GCGH+N G F GL G GG LSLT QI +T + CLV
Sbjct: 92 -----SILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPF 146
Query: 308 RDSPA--SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
R P+ S ++ A G D V+ PL+ K T+Y+V L G SVG + P
Sbjct: 147 RTDPSITSKIIFGPEAEVSGSDVVSTPLV-TKDDPTYYFVTLDGISVGDKLF---PFSSS 202
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSGLRS 422
A G + +D GT T L YN L V+ A ++P L CY + L
Sbjct: 203 SPMATKGNVFIDAGTPPTLLPRDFYNRLVQG-VKEAIPMEPVQDPDLQPQLCYRSATL-- 259
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
+ P ++ HF P ++ P + G +CFA P I GN Q + FD
Sbjct: 260 IDGPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFLIGFD 317
Query: 483 LANNRVGFTPNKC 495
L +V F C
Sbjct: 318 LDGKKVSFKAVDC 330
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 137/427 (32%), Positives = 191/427 (44%), Gaps = 58/427 (13%)
Query: 115 TLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTP------ 168
T IT L +Y V R E AQI PVV G + G FS P
Sbjct: 71 TSITSLPPQVYQVVRDEFA---AQI-----KLPVVPGNATGPYTCFSAPSQAKPDVPKLV 122
Query: 169 ----------PRQ---FSMVLDTGSDINWLQCRPCTEC-----YQQSD----PIFDPKTS 206
PR+ F + D G+ I L E +QQ + P FD TS
Sbjct: 123 LHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHALPYFDRSTS 182
Query: 207 SSYSPLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
S+ C + C+ L V++C + C+Y Y D S T G L + +FG S
Sbjct: 183 STLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDKFTFGAGAS 242
Query: 261 VKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE 317
V G+A GCG N G+F + G+ G G G LSL Q+K + ++C V+ ++ +L+
Sbjct: 243 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVLLD 302
Query: 318 -----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
+ + RG T PLI+N T YY+ L G +VG + +P S F + G GG
Sbjct: 303 LLADLYKNGRGAVQST-PLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGT 360
Query: 373 IVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
I+D GT+IT L Q Y +RD F ++ + P + + TC+ VP + LH
Sbjct: 361 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQAKPDVPKLVLH 419
Query: 432 FGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
F G +DLP +NY+ V D AG C A + IGN QQQ V +DL NN +
Sbjct: 420 F-EGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNML 478
Query: 489 GFTPNKC 495
F +C
Sbjct: 479 SFVAAQC 485
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNL 402
G G +VG + +P S F + G GG I+D GT+IT L Q Y +RD F ++ +
Sbjct: 38 GRPGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPV 96
Query: 403 KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAF 459
P + + TC+ VP + LHF G +DLP +NY+ V D AG C A
Sbjct: 97 VPGNATGPY-TCFSAPSQAKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAI 154
Query: 460 APTSSALSIIGNVQQQ 475
+IIGN QQQ
Sbjct: 155 N-KGDETTIIGNFQQQ 169
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 168/366 (45%), Gaps = 31/366 (8%)
Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y + +GTP + + +DT +D +W+ C C C + F P
Sbjct: 85 PIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTP--FAPAK 142
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ + C A QCK + C + C + YG S LV +TV+ + V A
Sbjct: 143 STTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAASLVQDTVTLA-TDPVPAYA 200
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
GC G V G GL +L+ T+++ ++ +YCL P+ L F+ S
Sbjct: 201 FGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCL-----PSFKTLNFSGSL 255
Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
R G PL++N + + YYV L VG + V IPP + G + D
Sbjct: 256 RLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFD 315
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSLHFG 433
GT TRL AYN++R+ F R K + +L FDTCY + PT++ F
Sbjct: 316 SGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYT----APIVAPTITFMF- 370
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVG 489
+G + LP N LI + C A AP +S L++I N+QQQ RV FD+ N+R+G
Sbjct: 371 SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLG 430
Query: 490 FTPNKC 495
C
Sbjct: 431 VARELC 436
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 175/353 (49%), Gaps = 26/353 (7%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G G Y +GTPP+ S + DTGSD+ W +C C C + + P SSS+S LPC
Sbjct: 77 GGGAYDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPC 136
Query: 215 AAPQCKSLD---VSACRANR-----CLYQVAYGDGS----FTVGDLVTETVSFGNSGSVK 262
++ C++L+ ++ C R C Y+ +YG S +T G + +ET + G S +V+
Sbjct: 137 SSALCRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLG-SDAVQ 195
Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
GI GC +EG + +GL+GLG G LSL +Q+K + +YCL S +S +L A
Sbjct: 196 GIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTSDPSTSSPLLFGAGAL 255
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G V + + N K TFY V L S+G A + P + G GII D GT +T
Sbjct: 256 TGPGVQSTPLVNLKTSTFYTVNLDSISIG--AAKTPGT-------GRHGIIFDSGTTLTF 306
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
L AY + NL G ++ C+ SG P++ LHF G + L
Sbjct: 307 LAEPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSG--GAVFPSMVLHFDGGD-MALKT 363
Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+NY V+ + + C+ + S +SI+GN+ Q + +DL + + F P C
Sbjct: 364 ENYFGAVNDSVS-CWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 101/296 (34%), Positives = 148/296 (50%), Gaps = 22/296 (7%)
Query: 222 LDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGS--------VKGIALGCGHD 271
L + C+A C Y YGD S T GD ET + + S V+ + GCGH
Sbjct: 62 LVTNPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHW 121
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVT 328
N GLF G+AGLLGLG G LS + Q+++ S +YCLVDR+S A+ + D ++
Sbjct: 122 NRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLS 181
Query: 329 APLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
P + + VDTFYYV + VGG+ V IP +++ G GG I+D GT +
Sbjct: 182 HPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTL 241
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
+ AY ++++F+ + + CY+ +G+ +P + F G +
Sbjct: 242 SYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNF 301
Query: 441 PAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
P +NY I ++ C A T SALSIIGN QQQ + +D +R+GF P KC
Sbjct: 302 PVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 174/363 (47%), Gaps = 21/363 (5%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
+ P+ SG + G Y R+ +GTP + MVLDT +D ++ C CT C SD F PK
Sbjct: 85 TAPIASGQAFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPK 141
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
S+SY PL C+ PQC + +C A C + +Y SF+ LV + + + +
Sbjct: 142 ASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDALRLA-TDVI 199
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA-SGVLE 317
+ GC + G V + GLLGLG G LSL Q + +YCL S SG L+
Sbjct: 200 PYYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLK 259
Query: 318 FNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
++ T PL+R+ + YYV TG SVG V P + G I+D
Sbjct: 260 LGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDS 319
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
GT ITR YN++R+ F + G TS + FDTC F P ++LHF G
Sbjct: 320 GTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTC--FVKTYETLAPPITLHF-EGL 375
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
L LP +N LI + C A A +S L++I N QQQ R+ FD+ NN+VG
Sbjct: 376 DLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAR 435
Query: 493 NKC 495
C
Sbjct: 436 EVC 438
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 174/368 (47%), Gaps = 41/368 (11%)
Query: 151 GASQGSGEYFSRI-------GVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
G SQ S E S I G +PP ++VLDT D+ W++C PCT Q +D +DP
Sbjct: 137 GTSQTSSEPSSGIHPAAATDGSSSPP--VTVVLDTAGDVPWMRCVPCTFA-QCAD--YDP 191
Query: 204 KTSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
SS+YS PC + CK L + C AN +C Y V SFT + V NSG
Sbjct: 192 TRSSTYSAFPCNSSACKQLGRYANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGD 251
Query: 261 -VKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV 315
V+G GC + +G F A G++ LG G+ SL Q +T + +YCL ++ G
Sbjct: 252 RVEGFRFGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTET-TKGF 310
Query: 316 LEFNSARGGDA--VTAPLIRNK-----KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
+ G VT P+++ + T Y L +V G+ + +P +F
Sbjct: 311 FQIGVPIGASYRFVTTPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVFA----- 365
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
G ++D T ITRL AY +LR +F R+ + P DTCYD +G+R R+P
Sbjct: 366 -AGTVMDSRTIITRLPVTAYGALRAAFRNRMRYRVAPPQ--EELDTCYDLTGVRYPRLPR 422
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
++L F +++ L+ G FA S+ SI+GNVQQQ +V D+ R
Sbjct: 423 IALVFDGNAVVEMDRSGILL----NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGR 478
Query: 488 VGFTPNKC 495
+GF C
Sbjct: 479 IGFRSAAC 486
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 172/369 (46%), Gaps = 36/369 (9%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y R +G+PP+ + +DT +D W+ PCT C + +F P+
Sbjct: 85 PIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWI---PCTACDGCTSTLFAPEK 141
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ + C +PQC + +C + C + + YG S ++V +TV+ + +
Sbjct: 142 STTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSI-AANVVQDTVTLA-TDPIPDYT 199
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
GC G G GL +LS T+ + ++ +YCL P+ L F+ S
Sbjct: 200 FGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 254
Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
R G PL++N + + YYV L VG + V IPP + A G + D
Sbjct: 255 RLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFD 314
Query: 376 CGTAITRLQTQAYNSLRDSFVR-----LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
GT TRL AY ++RD F R NL TS + FDTCY + PT++
Sbjct: 315 SGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTS-LGGFDTCYTV----PIVAPTITF 369
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
F +G + LP N LI + T C A A +S L++I N+QQQ RV +D+ N+
Sbjct: 370 MF-SGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNS 428
Query: 487 RVGFTPNKC 495
R+G C
Sbjct: 429 RLGVARELC 437
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 126/433 (29%), Positives = 197/433 (45%), Gaps = 84/433 (19%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-----CYQQ 196
E F+ P+ SGA G+G+YF R VGTP R F +V DTGSD+ W++C Y
Sbjct: 90 EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGY 149
Query: 197 SDP----------------------IFDPKTSSSYSPLPCAAPQCKS---LDVSAC--RA 229
+ P +F P S +++P+PC++ C + ++AC
Sbjct: 150 AAPASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPG 209
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSG----------SVKGIALGCGHDNEG-LFVG 278
+ C Y Y DGS G + T++ + SG ++G+ LGC G F+
Sbjct: 210 SPCAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLA 269
Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEF--NSARGGDAVT--- 328
S G+L LG +S + A +YCLVD +P A+ L F N A +
Sbjct: 270 SDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTA 329
Query: 329 -------------------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
PL+ + ++ FY V + G SV G+ ++IP ++ D A
Sbjct: 330 CAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVW--DVAKG 387
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-----SVR 424
GG I+D GT++T L + AY ++ + + L P + FD CY+++ +V
Sbjct: 388 GGAILDSGTSLTVLVSPAYRAVVAALNKKLAGL-PRVTMDPFDYCYNWTSPSTGEDLTVA 446
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
+P +++HF L PAK+Y+I D+A G C +S+IGN+ QQ FD
Sbjct: 447 MPELAVHFAGSARLQPPAKSYVI--DAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFD 504
Query: 483 LANNRVGFTPNKC 495
L N R+ F ++C
Sbjct: 505 LKNRRLRFKRSRC 517
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 119/368 (32%), Positives = 178/368 (48%), Gaps = 43/368 (11%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+ VGTPP+ +MVLDTGS+++WL C +D F P+ S++++ +PC + +C S
Sbjct: 65 LAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCSSR 123
Query: 223 DVSA---CRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDNEG 274
D+ A C A RC ++Y DGS + G L T+ + G++ ++ A GC +D+
Sbjct: 124 DLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRS-AFGCMSAAYDSSP 182
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---------RGGD 325
V +AGLLG+ G LS Q +YC+ DRD +GVL +
Sbjct: 183 DAVATAGLLGMNRGALSFVTQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPLNYTPLY 240
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
T PL +V Y V L G VGG+ + IPPS+ D G G +VD GT T L
Sbjct: 241 QPTPPLPYFDRVA--YSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLG 298
Query: 386 QAYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLR---SVRVPTVSLHF-GAG 435
AY++++ F++ L P + FDTC+ R S R+P V+L F GA
Sbjct: 299 DAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFNGAQ 358
Query: 436 KALDLPAKNYLIPVD---SAGTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDLANNR 487
++ Y +P + + G +C F P ++ +IG+ Q V +DL R
Sbjct: 359 MSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTA--YVIGHHHQMNLWVEYDLERGR 416
Query: 488 VGFTPNKC 495
VG P KC
Sbjct: 417 VGLAPVKC 424
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 173/369 (46%), Gaps = 36/369 (9%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y R +GTPP+ + +DT +D W+ PCT C + +F P+
Sbjct: 84 PIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWI---PCTACDGCTSTLFAPEK 140
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ + C +P+C + +C + C + + YG S ++V +TV+ + + G
Sbjct: 141 STTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSI-AANVVQDTVTLA-TDPIPGYT 198
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
GC G G GL +LS T+ + ++ +YCL P+ L F+ S
Sbjct: 199 FGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 253
Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
R G PL++N + + YYV L VG + V IPP+ + A G + D
Sbjct: 254 RLGPVAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFD 313
Query: 376 CGTAITRLQTQAYNSLRDSFVR-----LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
GT TRL Y ++RD F R NL TS + FDTCY + PT++
Sbjct: 314 SGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTS-LGGFDTCYTV----PIVAPTITF 368
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
F +G + LP N LI + T C A A +S L++I N+QQQ RV +D+ N+
Sbjct: 369 MF-SGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNS 427
Query: 487 RVGFTPNKC 495
R+G C
Sbjct: 428 RLGVARELC 436
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 151/296 (51%), Gaps = 35/296 (11%)
Query: 13 TTILFSFCLFTS--ASSRGLS-ETATTVLDVSSALQQTEHILSFEPETL---EPFAEESE 66
T L F L+++ +S RGL+ + T L S L HI S P ++ P ++
Sbjct: 7 TIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNV-HITSLMPSSVCSPSPKGDDKR 65
Query: 67 TAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSR-LERDSARVNTLITKLQLAIY 125
+ E +H K + RS ++ L++D +RVN++ + +LA
Sbjct: 66 ASLEV------------IHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSI--RSRLAKN 111
Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
D +LK ++ + P SG++ G+G Y +G+GTP R + + DTGSD+ W
Sbjct: 112 PADGGKLKGSKVTL-------PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWT 164
Query: 186 QCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYG 239
QC PC CY Q +PIF+P S+SY+ + C++P C L + +C A+ C+Y + YG
Sbjct: 165 QCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYG 224
Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
D S++VG + ++ ++ GCG +N GLFVG AGL+GLG LSL +
Sbjct: 225 DQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 53/91 (58%), Gaps = 7/91 (7%)
Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSA- 465
++ DTCYDFS +V VP ++L+F G +DL Y++ + C AFA S A
Sbjct: 288 SILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQV---CLAFAGNSDAT 344
Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++I+GNVQQ+ V +D+A R+GF P C
Sbjct: 345 DIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 170/379 (44%), Gaps = 48/379 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----------PIFDPKT 205
G Y +GTPP++ S+VLDTGS + W C T Y + PI+
Sbjct: 72 GGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNK 131
Query: 206 SSSYSPLPCAAPQCKSL---DVSACRANRC-LYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
SS+ LPC +P+C + D++ RC Y + YG GS T G LV++ + +
Sbjct: 132 SSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLNRI 190
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASGVLEF 318
GC + G+ G G G+ S+ Q+ T +YCLV D+P SG L
Sbjct: 191 PDFLFGCSLVSN---RQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVL 247
Query: 319 NSAR-GGDAVT-----APLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+ R DA AP ++ + +YY+ L+ VGG+ V IPP + GD
Sbjct: 248 HRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGD 307
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSF------VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
GG+IVD G+ T ++ ++ + + A ++ +SG+ CY+ +G V
Sbjct: 308 GGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLG---PCYNITGQSEV 364
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-------APTSSALSIIGNVQQQG 476
VP ++ F G +DLP +Y V + G C T+ I+GN QQQ
Sbjct: 365 DVPKLTFSFKGGANMDLPLTDYFSLV-TDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQN 423
Query: 477 TRVSFDLANNRVGFTPNKC 495
+ +DL R GF P +C
Sbjct: 424 FYIEYDLKKQRFGFKPQQC 442
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 162/353 (45%), Gaps = 42/353 (11%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y ++ VGTPP + ++DTGS+I W QC PC CY+Q+ PIFDP SS++
Sbjct: 65 YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKE------- 117
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
C + C Y+V Y D ++T+G L TET++ G + +GCGH+N
Sbjct: 118 ------KRCDGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSW 171
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVTA 329
+G++GL G SL Q+ ++YC + + + F N+ GD V +
Sbjct: 172 FKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQ---GTSKINFGANAIVAGDGVVS 228
Query: 330 -PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
+ FYY+ L SVG ++ + F A +G I++D GT +T
Sbjct: 229 TTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTF---HALEGNIVIDSGTTLTYFPVSYC 285
Query: 389 NSLRDSFVRLAGNLK---PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
N +R + + ++ PT L CY+ + P +++HF G L L N
Sbjct: 286 NLVRQAVEHVVTAVRAADPTGNDML---CYNSDTID--IFPVITMHFSGGVDLVLDKYNM 340
Query: 446 LIPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ G FC A +PT A I GN Q V +D ++ V F+P C
Sbjct: 341 YMESNNGGVFCLAIICNSPTQEA--IFGNRAQNNFLVGYDSSSLLVSFSPTNC 391
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 163/354 (46%), Gaps = 29/354 (8%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G G Y +GTPP++ + + DTGSD+ W +C + P SS+++ LPC
Sbjct: 96 GGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPC 155
Query: 215 AAPQC---KSLDVSACRAN--RCLYQVAYG---DGSFTVGDLVTETVSFGNSGSVKGIAL 266
+ C +S ++ C A C Y+ AYG D FT G L +ET + G +V G+
Sbjct: 156 SDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGD-AVPGVGF 214
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS----GVLEFNSAR 322
GC EG + AGL+GLG G LSL Q+ A + YCL S AS G L +
Sbjct: 215 GCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTADASKASPLLFGALATMTGA 274
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G + L+ + TFY V L ++G + + D GT +T
Sbjct: 275 GAGVQSTGLLAST---TFYAVNLRSITIGSATTAGVGGPGGV--------VFDSGTTLTY 323
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDLP 441
L AY + +F+ +L P G F+ CY+ S R +P + LHF G + LP
Sbjct: 324 LAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKP--DSARLIPAMVLHFDGGADMALP 381
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NY++ VD G C+ S +LSIIGN+ Q V D+ + + F P C
Sbjct: 382 VANYVVEVDD-GVVCWV-VQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANC 433
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 159/362 (43%), Gaps = 65/362 (17%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y +R G+GTP + + +D +D W+ C C C S P F P SS+Y +PC +P
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159
Query: 218 QCKSLDVSACRA---NRCLYQVAY---------GDGSFTVGDLVTETVSFGNSGSVKGIA 265
QC + +C A + C + + Y G S + + V + +FG V G +
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTFGCLRVVNGNS 219
Query: 266 LGCGHDNEG------LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN 319
+ L V G LG G K+IK T
Sbjct: 220 RAAAGAHRLRPRAALLLVADQGHLGPIG----QPKRIKTT-------------------- 255
Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
PL+ N + YYV + G VG + VQ+P S + G I+D GT
Sbjct: 256 ----------PLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 305
Query: 380 ITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
TRL Y ++RD+F R+ + P G FDTCY+ +V VPTV+ F A+
Sbjct: 306 FTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCYNV----TVSVPTVTFMFAGAVAV 359
Query: 439 DLPAKNYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
LP +N +I S G C A A ++AL+++ ++QQQ RV FD+AN RVGF+
Sbjct: 360 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 419
Query: 494 KC 495
C
Sbjct: 420 LC 421
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 121/393 (30%), Positives = 172/393 (43%), Gaps = 58/393 (14%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTECYQQSDPI------FDP 203
S G Y + GTPP+ S ++DTGSDI W C C C S F P
Sbjct: 61 SHSYGGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIP 120
Query: 204 KTSSSYSPLPCAAPQCKSLDVSA------CRANRCL------YQVAYGDGSFTVGDLVTE 251
K SSS L C P+C + S C CL Y + YG G+ T G ++E
Sbjct: 121 KESSSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGT-TGGVALSE 179
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGS--AGLLGLGGGMLSLTKQIKATSLAYCLV--- 306
T+ +S S +GC +F AG+ G G G+ SL Q+ +YCL+
Sbjct: 180 TLHL-HSLSKPNFLVGCS-----VFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHR 233
Query: 307 -DRDSPASGVL-----EFNSARGGDA-VTAPLIRNKKVDT------FYYVGLTGFSVGGQ 353
D D+ S L + +S + +A V P ++N KVD +YY+GL +VGG
Sbjct: 234 FDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGH 293
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN---LKPTSGVAL 410
V++P E G+GG+I+D GT T + +A+ L D F+R + +K
Sbjct: 294 HVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIG 353
Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--- 467
C++ S ++V P + L+F G + LP +NY V C A
Sbjct: 354 LRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFV-GGEVACLTVVTDGVAGPERV 412
Query: 468 -----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I+GN Q Q V +DL N R+GF KC
Sbjct: 413 GGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 134/467 (28%), Positives = 206/467 (44%), Gaps = 51/467 (10%)
Query: 55 PETL-EPFAEESETAAESFPLNSSSSFSLPLHSREILHK-TRHNDYRSLVLSRLERDSAR 112
P++L PF + + F ++ FSL EI+H+ +R + + ++ ER
Sbjct: 2 PQSLASPFVYLTILSLIHFAISKPDGFSL-----EIVHRYSRESPFYPGNITDYER---- 52
Query: 113 VNTLITKLQLAIYNVDRHELKPAEAQ-ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
IT+L + + + H L + PE F + SQ Y ++ +G+P
Sbjct: 53 ----ITRL-VELSKIRAHNLAITTSSGFSPEAFRLRI----SQDDTCYLVKVIIGSPGVP 103
Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRAN 230
+V DTGS + W QC PCT ++Q PIF+ S +Y LPC C + +V CR +
Sbjct: 104 LYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQHQFCTNNQNVFQCRDD 163
Query: 231 RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-----FVGSAGLLGL 285
+C+Y++AY GS T G + + + + GC DN+ G++GL
Sbjct: 164 KCVYRIAYAGGSATAGVAAQDILQSAENDRIP-FYFGCSRDNQNFSTFESSGKGGGIIGL 222
Query: 286 GGGMLSLTKQ---IKATSLAYC--LVDRDSP--ASGVLEF-NSARGG--DAVTAPLIRNK 335
+SL +Q I +YC L D SP A+ +L F N R ++ P + +
Sbjct: 223 NMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPR 282
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
+ Y++ L SV G +QIPP F + G GG I+D GTA+T + AY + +F
Sbjct: 283 GMPN-YFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAF 341
Query: 396 VRLAGNLKPTSGVALFD------TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
N G + CY G P+++ HF P YL V
Sbjct: 342 ----KNYFDQHGFQRVNIQLSGYICYKQQGHTFHNYPSMAFHFQGADFFVEPEYVYLT-V 396
Query: 450 DSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
G FC A P S +IIG + Q T+ +D AN ++ FTP C
Sbjct: 397 QDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFTPENC 443
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/362 (32%), Positives = 167/362 (46%), Gaps = 43/362 (11%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP---LPCAAPQCKS 221
+GTPP+ MVLDTGS ++W+QC ++ P S S LPC P CK
Sbjct: 88 IGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLCKP 147
Query: 222 L--DVSA---CRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
D S C AN C Y Y DG++ G+LV E ++F S + I LGC ++
Sbjct: 148 RVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCATQSDD- 206
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS-PASGVLEFNSARGGDAVTAPLIRN 334
+ G+LG+ G L Q K T +YC+ + + PASG G+ + R
Sbjct: 207 ---ARGILGMNLGRLGFPSQAKITKFSYCVPTKQAQPASGSFYL-----GNNPASSSFRY 258
Query: 335 KKVDTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+ TF Y + L G S+GG+ + IPPS+F+ + G G ++D G+ T
Sbjct: 259 VNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMIDSGSEFT 318
Query: 382 RLQTQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGK 436
L +AYN +R+ V+ G +K GVA D C+D + R V + F G
Sbjct: 319 YLVDEAYNVIREELVKKVGPKIKKGYMYGGVA--DICFDGDAIEIGRLVGDMVFEFEKGV 376
Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ +P + L VD G C + + +IIGN QQ V FDLAN RVGF
Sbjct: 377 QIVIPKERVLATVD-GGVHCLGMGRSERLGAGGNIIGNFHQQNLWVEFDLANRRVGFGEA 435
Query: 494 KC 495
C
Sbjct: 436 DC 437
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 167/356 (46%), Gaps = 30/356 (8%)
Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y R +GTPP+ + +DT +D W+ PCT C + +F P+
Sbjct: 80 PIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWI---PCTACDGCASTLFAPEK 136
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ + CAAP+CK + C + + + YG S +LV +T++ + V
Sbjct: 137 STTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSI-AANLVQDTITLA-TDPVPSYT 194
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
GC G G GL +LS T+ + ++ +YCL P+ L F+ S
Sbjct: 195 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 249
Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
R G PL++N + + YYV L VG + V IPP+ + G I D
Sbjct: 250 RLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFD 309
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG 435
GT TRL Y ++RD F R G + + FDTCY+ + VPT++ F G
Sbjct: 310 SGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIF-TG 364
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNR 487
+ LP N LI + T C A A +S L++I N+QQQ RV +D+ N+R
Sbjct: 365 MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 420
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 125/374 (33%), Positives = 172/374 (45%), Gaps = 58/374 (15%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS--- 221
VGTPP+ +MVLDTGS+++WL C+ Q + +F+P SSSY+P+PC +P CK+
Sbjct: 76 VGTPPQSVTMVLDTGSELSWLHCKK----QQNINSVFNPHLSSSYTPIPCMSPICKTRTR 131
Query: 222 ---LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----NEG 274
+ VS N C V+Y D + G+L ++T + SG GI G N
Sbjct: 132 DFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQ-PGIIFGSMDSGFSSNAN 190
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVTAPL 331
+ GL+G+ G LS Q+ +YC+ +D ASGVL F A G PL
Sbjct: 191 EDSKTTGLMGMNRGSLSFVTQMGFPKFSYCISGKD--ASGVLLFGDATFKWLGPLKYTPL 248
Query: 332 IR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
++ N + F Y V L G VG + +Q+P +F D G G +VD GT T L
Sbjct: 249 VKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGS 308
Query: 387 AYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSG-----LRSVR------VPTVSLHFG 433
Y +LR+ FV T GV L D + F G R R VP V++ F
Sbjct: 309 VYTALRNEFV------AQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVTMVF- 361
Query: 434 AGKALDLPAKNYLIPVDSAG--------TFCFAFAPTSSALSI----IGNVQQQGTRVSF 481
G + + + L V G +C F S L I IG+ QQ + F
Sbjct: 362 EGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFG-NSDLLGIEAYVIGHHHQQNVWMEF 420
Query: 482 DLANNRVGFTPNKC 495
DL N+RVGF KC
Sbjct: 421 DLVNSRVGFADTKC 434
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 166/352 (47%), Gaps = 39/352 (11%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY ++ +GTPP + VLDTGS+ W QC PC CY Q+ PIFDP SS++ +
Sbjct: 64 EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI----- 118
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNE 273
+C + D S C Y++ YG S+T G LVTETV+ G + +GCG +N
Sbjct: 119 RCDTHDHS------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 172
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVT 328
G G AG++GL G SL Q+ ++YC + + + F N+ GD V
Sbjct: 173 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSK---INFGANAIVAGDGVV 229
Query: 329 APLIRNKKVDT-FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
+ + K FYY+ L SVG ++ + F A G I++D G+ +T
Sbjct: 230 STTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPF---HALKGNIVIDSGSTLTYFPESY 286
Query: 388 YNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
N +R + ++ ++ P S + CY +S + P +++HF G L L N
Sbjct: 287 CNLVRKAVEQVVTAVRFPRSDIL----CY-YSKTIDI-FPVITMHFSGGADLVLDKYNMY 340
Query: 447 IPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ G FC A +P A I GN Q V +D ++ V F P C
Sbjct: 341 VASNTGGVFCLAIICNSPIEEA--IFGNRAQNNFLVGYDSSSLLVSFKPTNC 390
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 166/352 (47%), Gaps = 39/352 (11%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY ++ +GTPP + VLDTGS+ W QC PC CY Q+ PIFDP SS++ +
Sbjct: 58 EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI----- 112
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNE 273
+C + D S C Y++ YG S+T G LVTETV+ G + +GCG +N
Sbjct: 113 RCDTHDHS------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 166
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVT 328
G G AG++GL G SL Q+ ++YC + + + F N+ GD V
Sbjct: 167 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSK---INFGANAIVAGDGVV 223
Query: 329 APLIRNKKVDT-FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
+ + K FYY+ L SVG ++ + F A G I++D G+ +T
Sbjct: 224 STTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPF---HALKGNIVIDSGSTLTYFPESY 280
Query: 388 YNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
N +R + ++ ++ P S + CY +S + P +++HF G L L N
Sbjct: 281 CNLVRKAVEQVVTAVRFPRSDIL----CY-YSKTIDI-FPVITMHFSGGADLVLDKYNMY 334
Query: 447 IPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ ++ G FC A +P A I GN Q V +D ++ V F P C
Sbjct: 335 VASNTGGVFCLAIICNSPIEEA--IFGNRAQNNFLVGYDSSSLLVSFKPTNC 384
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 176/377 (46%), Gaps = 32/377 (8%)
Query: 147 PVVSGASQGSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IF 201
P+ SGA G +YF I +GTP P++F +V DTGSD+ W+ C + + +P +F
Sbjct: 107 PIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVF 166
Query: 202 DPKTSSSYSPLPCAAPQCK-----SLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVS 254
SSS+ +PC++ CK ++ C CL+ Y +G +G ETV+
Sbjct: 167 RANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVT 226
Query: 255 FGNSGSVK----GIALGCGHDNEGLFVGSAGLLGLGGGMLSLT---KQIKATSLAYCLVD 307
G + K + +GC G++GLG SL +I +YCLVD
Sbjct: 227 VGLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVD 286
Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKK-----VDTFYYVGLTGFSVGGQAVQIPPSLF 362
S +S F S + P +++ + ++ FY V ++G SVGG + I ++
Sbjct: 287 HLS-SSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIW 345
Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK---PTSGVALFDTCYDFSG 419
+ G GG+IVD GT++T L +AY+ + D+ + K P L + C++ G
Sbjct: 346 NV--TGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFEDKG 403
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTR 478
VP + +HF G P K+Y+I V + G C SI+GNV QQ
Sbjct: 404 FDRAAVPRLLIHFADGAIFKPPVKSYIIDV-AEGIKCLGIIKADFPGSSILGNVMQQNHL 462
Query: 479 VSFDLANNRVGFTPNKC 495
+DL ++GF P+ C
Sbjct: 463 WEYDLGRGKLGFGPSSC 479
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 165/357 (46%), Gaps = 35/357 (9%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y + +GTPP+ S V+D ++ W QC+ C+ C++Q P+FDP S++Y PC P
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 218 QCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHD 271
C+S+ D C N C YQ + G T G + T+T + G + + +A GC D
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGTAKA--SLAFGCVVASDID 166
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAV 327
G G +G++GLG SL Q + +YCL D+ + L S A GG A
Sbjct: 167 TMG---GPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAA 223
Query: 328 TAPLI----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
+ P + + +Y V L G G + +PPS +++D + I+ L
Sbjct: 224 STPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFL 275
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
AY +++ + G + V FD C+ SG S P + F G A+ + A
Sbjct: 276 VDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVAAS 334
Query: 444 NYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NYL+ + GT C A +++ LS++G++QQ+ FDL + F P C
Sbjct: 335 NYLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 119/358 (33%), Positives = 163/358 (45%), Gaps = 61/358 (17%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y + +GTPP FS++ DTGS + W QC PCTEC + P F P +SS++S LPCA
Sbjct: 87 AGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCA 146
Query: 216 APQCKSLD--VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
+ C+ L C A C+Y YG G FT G L TET+ G + S G+ GC +N
Sbjct: 147 SSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGA-SFPGVTFGCSTEN- 203
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS---ARGGDAVTAP 330
G+ S+G++GLG LSL Q+ +YCL + F S GG+ + P
Sbjct: 204 GVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNADAGDSPILFGSLAKVTGGNVQSTP 263
Query: 331 LIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
L+ N ++ ++YYV LTG +VG A +P ++
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVG--ATDLPMAM--------------------------- 294
Query: 389 NSLRDSFVRLAGNLKPTSGVAL-FDTCYD---FSGLRSVRVPTVSLHFGAGKALDLPAKN 444
NL +G FD C+D G V VPT+ L F G + ++
Sbjct: 295 -----------ANLTTVNGTRFGFDLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRS 343
Query: 445 Y--LIPVDSAG---TFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
Y ++ VDS G C P S L SIIGNV Q V +DL F P C
Sbjct: 344 YFGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 401
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/345 (30%), Positives = 168/345 (48%), Gaps = 20/345 (5%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y G+GTPP+Q S LD SD+ W C F+P S++ + +PC
Sbjct: 97 AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148
Query: 216 APQCKSLDVSACRA--NRCLYQVAYGDGSF-TVGDLVTETVSFGNSGSVKGIALGCGHDN 272
C+ C A + C Y YG G+ T G L TE +FG++ + G+ GCG N
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDT-RIDGVVFGCGLKN 207
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS--PASGVLEFNSA--RGGDAVT 328
G F G +G++GLG G LSL Q++ +Y DS S +L + A + ++
Sbjct: 208 VGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVDTQSFILFGDDATPQTSHTLS 267
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQA 387
L+ + + YYV L G V G+ + IP F++ ++ G GG+ + +T L+ A
Sbjct: 268 TRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAA 327
Query: 388 YNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
Y LR + G L +G AL D CY L +VP+++L F G ++L NY
Sbjct: 328 YKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELGNYF 386
Query: 447 IPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGF 490
+ G C P+S+ S++G++ Q GT + +D+ +++ F
Sbjct: 387 YMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 116/404 (28%), Positives = 190/404 (47%), Gaps = 46/404 (11%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS-----GEYFSRIGVGTPPRQFSMVLDTGSDIN 183
+ L P A I + P +S + GEY++ I +G+P ++ +++DTGS++
Sbjct: 65 QKSLFPYSAHIFQQHTKNPAALRSSTTTLGRKFGEYYTSIKLGSPGQEAILIVDTGSELT 124
Query: 184 WLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-----KSLDVSACRANRCLYQVAY 238
WLQC PC C D I+D S+SY P+ C Q + R ++C + Y
Sbjct: 125 WLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY 184
Query: 239 GDGSFTVGD-----LVTETVSFGNSGSVKGIALGCGH-DNEGLFVGSAGLLGLGGGMLSL 292
GDGSF+ G L+ ETV G +V+ A GC D E + G++G+LGL G ++L
Sbjct: 185 GDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMAL 244
Query: 293 TKQIK---ATSLAYCLVDRDSP--ASGVLEFNSA----RGGDAVTAPLIRNKKVDTFYYV 343
Q+ ++C DR S ++GV+ F +A + L ++ FY+V
Sbjct: 245 PMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHV 304
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNL 402
L G S+ + P +I+D G++ + ++ LR++F++ +L
Sbjct: 305 ALKGVSINSHELVFLPR--------GSVVILDSGSSFSSFVRPFHSQLREAFLKHRPPSL 356
Query: 403 KPTSGVALFD--TCY-----DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSA 452
K G + D TC+ D L +P++SL F G + +P+ L+PV +
Sbjct: 357 KHLEGDSFGDLGTCFKVSNDDIDELHRT-LPSLSLVFEDGVTIGIPSIGVLLPVARFQNH 415
Query: 453 GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
CFAF + +++IGN QQQ V +D+ +RVGF C
Sbjct: 416 VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 174/372 (46%), Gaps = 55/372 (14%)
Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKS---- 221
PP+ SMV+DTGS+++WL+C + +P+ FDP SSSYSP+PC++P C++
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSN----PNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 222 -LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC-----GHDNEG 274
L ++C +++ C ++Y D S + G+L E FGNS + + GC G D E
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARGGDAVTAPLI 332
+ GLLG+ G LS Q+ +YC+ D +L + N PLI
Sbjct: 198 D-TKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLI 256
Query: 333 R-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
R + + F Y V LTG V G+ + IP S+ D G G +VD GT T L
Sbjct: 257 RISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFTFLLGPV 316
Query: 388 YNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSV-----RVPTVSL 430
Y +LR F L T+G+ D CY S R R+PTVSL
Sbjct: 317 YTALRSDF------LNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSL 370
Query: 431 HF-GAGKALDLPAKNYLIPVDSAGT---FCFAFAPTS---SALSIIGNVQQQGTRVSFDL 483
F GA A+ Y +P +AG +CF F + +IG+ QQ + FDL
Sbjct: 371 VFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 430
Query: 484 ANNRVGFTPNKC 495
+R+G P +C
Sbjct: 431 QRSRIGLAPVQC 442
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 106/350 (30%), Positives = 165/350 (47%), Gaps = 21/350 (6%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC-AAP 217
+ + I +G PP +++DTGSD+ W+ C PC +CY Q+ P F P SS+Y C +AP
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAP 136
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNE 273
+ C Y + Y D S T G L E ++F S S + I GCG DN
Sbjct: 137 HAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNS 196
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYC---LVDRDSPASGVLEFNSAR-GGDAVTA 329
G F +G+LGLG G S+ + + +YC L + P + ++ N A+ GD
Sbjct: 197 G-FTKYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNPTYPHNILILGNGAKIEGDPTPL 255
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
+ +++ YY+ L S G + + I P F+ + GG ++D G + T L +AY
Sbjct: 256 QIFQDR-----YYLDLQAISFGEKLLDIEPGTFQRYRS-QGGTVIDTGCSPTILAREAYE 309
Query: 390 SLRDSFVRLAGN-LKPTSGVALFDT-CYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNYL 446
+L + L G L+ + T CY+ + L P V+ HF G L L ++
Sbjct: 310 TLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLF 369
Query: 447 IPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +S +FC A T +S+IG + QQ V ++L +V F C
Sbjct: 370 VSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 106/349 (30%), Positives = 168/349 (48%), Gaps = 24/349 (6%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y G+GTPP+Q S LD SD+ W C F+P S++ + +PC
Sbjct: 97 AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148
Query: 216 APQCKSLDVSACRA------NRCLYQVAYGDGSF-TVGDLVTETVSFGNSGSVKGIALGC 268
C+ C A + C Y YG G+ T G L TE +FG++ + G+ GC
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDT-RIDGVVFGC 207
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS--PASGVLEFNSA--RGG 324
G N G F G +G++GLG G LSL Q++ +Y DS S +L + A +
Sbjct: 208 GLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVDTQSFILFGDDATPQTS 267
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRL 383
++ L+ + + YYV L G V G+ + IP F++ ++ G GG+ + +T L
Sbjct: 268 HTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVL 327
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
+ AY LR + G L +G AL D CY L +VP+++L F G ++L
Sbjct: 328 EEAAYKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELEL 386
Query: 443 KNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGF 490
NY + G C P+S+ S++G++ Q GT + +D+ +++ F
Sbjct: 387 GNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 435
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 167/362 (46%), Gaps = 42/362 (11%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI-FDPKTSSSYSPLPCAAPQCKSLD 223
+GTPP+ MVLDTGS ++W+QC + + FDP SSS+S LPC P CK
Sbjct: 86 IGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLCKPRI 145
Query: 224 V-----SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+ C NR C Y Y DG++ G LV E ++F +S S + LGC +
Sbjct: 146 PDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAEAS----T 201
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---------------DSPASGVLEFNSAR 322
G+LG+ G S Q K + +YC+ R ++P SG ++ +
Sbjct: 202 DEKGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGNNPNSGRFQYINL- 260
Query: 323 GGDAVTAPLIRNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
P R+ +D Y + + G +G + I +LF D +G G I+D G+ T
Sbjct: 261 ---LTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSGSEFT 317
Query: 382 RLQTQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
L +AYN +R+ VRL G LK GV+ D C+D + + R+ ++ F K
Sbjct: 318 YLVDEAYNKVREEVVRLVGPKLKKGYVYGGVS--DMCFDGNPMEIGRL-IGNMVFEFEKG 374
Query: 438 LDLPAKNYLIPVD-SAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+++ + + D G C + +A +IIGN QQ V +DLAN R+G
Sbjct: 375 VEIVIDKWRVLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGKA 434
Query: 494 KC 495
C
Sbjct: 435 DC 436
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 160/353 (45%), Gaps = 38/353 (10%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y ++ VGTPP + +DTGSDI W QC PC CY Q PIFDP SS++
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFRE------- 473
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDN-- 272
C N C Y++ Y D +++ G L TETV+ G + +GCG DN
Sbjct: 474 ------QRCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTN 527
Query: 273 ---EGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGG 324
G S+G++GL G LSL Q+ ++YC + + + F N+ G
Sbjct: 528 LQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQ---GTSKINFGTNAIVAG 584
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
D A + KK + FYY+ L SV + + F A DG I +D GT +T
Sbjct: 585 DGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPF---HAEDGNIFIDSGTTLTYFP 641
Query: 385 TQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
N +R++ ++ +K P G CY +S + P +++HF G L L
Sbjct: 642 MSYCNLVREAVEQVVTAVKVPDMGSDNL-LCY-YSDTIDI-FPVITMHFSGGADLVLDKY 698
Query: 444 NYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N + + G FC A ++ ++ GN Q V +D ++N + F+P C
Sbjct: 699 NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNC 751
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 159/344 (46%), Gaps = 46/344 (13%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y ++ VGTPP + + +DTGSD+ W QC PC +CY Q DPIFDP SS+++
Sbjct: 82 YLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNE------- 134
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG-H--- 270
C C Y++ Y D +++ G L TETV+ G + +GCG H
Sbjct: 135 ------QRCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTD 188
Query: 271 -DNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGG 324
DN G S+G++GL G SL Q+ ++YC + + + F N+ G
Sbjct: 189 LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQ---GTSKINFGTNAIVAG 245
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
D A + KK + FYY+ L SV ++ + F A DG I++D G+ +T
Sbjct: 246 DGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPF---HAEDGNIVIDSGSTVTYFP 302
Query: 385 TQAYNSLRDSFVRLAGNLK---PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
N +R + ++ ++ P+ L CY FS + P +++HF G L L
Sbjct: 303 VSYCNLVRKAVEQVVTAVRVPDPSGNDML---CY-FSETIDI-FPVITMHFSGGADLVLD 357
Query: 442 AKNYLIPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFD 482
N + +S G FC A +PT A I GN Q V +D
Sbjct: 358 KYNMYMESNSGGLFCLAIICNSPTQEA--IFGNRAQNNFLVGYD 399
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 168/372 (45%), Gaps = 24/372 (6%)
Query: 138 QILPEDFSTPVVSGASQ----GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC 193
Q +P D G SQ +G Y VGTPP+ + VLD SD W+QC C C
Sbjct: 72 QAVPADGGENGGGGQSQDPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATC 131
Query: 194 -----YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSF--T 244
S P F SS+ + CA C+ L C A+ C Y YG G+ T
Sbjct: 132 GADAPAATSAPPFYAFLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTT 191
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
G L + +F G+ GC EG G++GLG G LSL Q++ +Y
Sbjct: 192 AGLLAVDAFAFATV-RADGVIFGCAVATEGDI---GGVIGLGRGELSLVSQLQIGRFSYY 247
Query: 305 LVDRDSPASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
L D+ G L+ R AV+ PL+ N+ + YYV L G V G+ + IP
Sbjct: 248 LAPDDAVDVGSFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRG 307
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSG 419
F++ G GG+++ +T L AY +R + G L+ G L D CY
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIG-LRAADGSELGLDLCYTSES 366
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTR 478
L + +VP+++L F G ++L NY + G C P+ + S++G++ Q GT
Sbjct: 367 LATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTH 426
Query: 479 VSFDLANNRVGF 490
+ +D++ +R+ F
Sbjct: 427 MIYDISGSRLVF 438
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 172/364 (47%), Gaps = 41/364 (11%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD- 223
VG+PP+Q +MVLDTGS+++WL C+ +F+P +SSSYSP+PC++P C++
Sbjct: 46 VGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPVCRTRTR 101
Query: 224 -----VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----NEG 274
V+ C V+Y D S G+L ++ G+S ++ G GC N
Sbjct: 102 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-ALPGTLFGCMDSGFSSNSE 160
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVTAPL 331
+ GL+G+ G LS Q+ +YC+ RDS SGVL F + G+ PL
Sbjct: 161 EDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS--SGVLLFGDSHLSWLGNLTYTPL 218
Query: 332 IR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
++ + + F Y V L G VG + + +P S+F D G G +VD GT T L
Sbjct: 219 VQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGP 278
Query: 387 AYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDF-SGLRSVRVPTVSLHF-GAGKAL 438
Y +LR+ F+ + G L P D CY +G + +P VSL F GA +
Sbjct: 279 VYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMFRGAEMVV 338
Query: 439 DLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFDLANNRVGFT 491
Y +P G +C F S L I IG+ QQ + FDL +RVGF
Sbjct: 339 GGEVLLYKVPGMMKGKEWVYCLTFG-NSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFV 397
Query: 492 PNKC 495
+C
Sbjct: 398 ETRC 401
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 190/404 (47%), Gaps = 46/404 (11%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS-----GEYFSRIGVGTPPRQFSMVLDTGSDIN 183
+ L P A I + P +S + GEY++ I +G+P ++ +++DTGS++
Sbjct: 65 QKSLFPYSAHIFQQHTKNPAALRSSTTTLGRKFGEYYTSIKLGSPGQEAILIVDTGSELT 124
Query: 184 WLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-----KSLDVSACRANRCLYQVAY 238
WL+C PC C D I+D S SY P+ C Q + R ++C + Y
Sbjct: 125 WLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY 184
Query: 239 GDGSFTVGD-----LVTETVSFGNSGSVKGIALGCGH-DNEGLFVGSAGLLGLGGGMLSL 292
GDGSF+ G L+ ETV G +V+ A GC D E + G++G+LGL G ++L
Sbjct: 185 GDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMAL 244
Query: 293 TKQIK---ATSLAYCLVDRDSP--ASGVLEFNSA----RGGDAVTAPLIRNKKVDTFYYV 343
Q+ ++C DR S ++GV+ F +A + L ++ FY+V
Sbjct: 245 PMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHV 304
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNL 402
L G S+ + + P +I+D G++ + ++ LR++F++ +L
Sbjct: 305 ALKGVSINSHELVLLPR--------GSVVILDSGSSFSSFVRPFHSQLREAFLKHRPPSL 356
Query: 403 KPTSGVALFD--TCY-----DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSA 452
K G + D TC+ D L +P++SL F G + +P+ L+PV +
Sbjct: 357 KHLEGDSFGDLGTCFKVSNDDIDELHRT-LPSLSLVFEDGVTIGIPSIGVLLPVARYQNH 415
Query: 453 GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
CFAF + +++IGN QQQ V +D+ +RVGF C
Sbjct: 416 VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 174/372 (46%), Gaps = 55/372 (14%)
Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKS---- 221
PP+ SMV+DTGS+++WL+C + +P+ FDP SSSYSP+PC++P C++
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSS----NPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 222 -LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC-----GHDNEG 274
L ++C +++ C ++Y D S + G+L E FGNS + + GC G D E
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARGGDAVTAPLI 332
+ GLLG+ G LS Q+ +YC+ D +L + N PLI
Sbjct: 198 D-TKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLI 256
Query: 333 R-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
R + + F Y V LTG V G+ + IP S+ D G G +VD GT T L
Sbjct: 257 RISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPV 316
Query: 388 YNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSV-----RVPTVSL 430
Y +LR F L T+G+ D CY S +R R+PTVSL
Sbjct: 317 YTALRSHF------LNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSL 370
Query: 431 HF-GAGKALDLPAKNYLIP---VDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDL 483
F GA A+ Y +P V + +CF F + +IG+ QQ + FDL
Sbjct: 371 VFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 430
Query: 484 ANNRVGFTPNKC 495
+R+G P +C
Sbjct: 431 QRSRIGLAPVEC 442
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 191/456 (41%), Gaps = 77/456 (16%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
+ RD R + + ++ Y+ R L+ + P+ +G GEYF+ + V
Sbjct: 62 VNRDGLRRQRMNQRWGVSNYDRRRKGLETTTTTEV----EMPMRAGRDDALGEYFTEVKV 117
Query: 166 GTPPRQFSMVLDTGSDINWLQC-----------------------------------RPC 190
G+P ++F + DTGS+ W C R
Sbjct: 118 GSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRT 177
Query: 191 TECYQQSDP---IFDPKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGD 240
+ +S+P +F P S S+ + CA+ +CK SL + ++ CLY ++Y D
Sbjct: 178 KKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYAD 237
Query: 241 GSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG---LFVGSAGLLGLGGGMLSLT 293
GS G T+T++ G G + + +GC E + G+LGLG S
Sbjct: 238 GSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFI 297
Query: 294 KQIK---ATSLAYCLVDRDSP--ASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYV 343
+ +YCLVD S S L GG L K+ + FY V
Sbjct: 298 DKAAYEYGAKFSYCLVDHLSHRNVSSYLTI----GGHHNAKLLGEIKRTELILFPPFYGV 353
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
+ G S+GGQ ++IPP +++ + GG ++D GT +T L AY + ++ ++ +K
Sbjct: 354 NVVGISIGGQMLKIPPQVWDFNS--QGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVK 411
Query: 404 PTSG--VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
+G D C+D G VP + HF G + P K+Y+I V + C P
Sbjct: 412 RVTGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV-APLVKCIGIVP 470
Query: 462 TSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S+IGN+ QQ FDL+ N +GF P+ C
Sbjct: 471 IDGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/234 (38%), Positives = 125/234 (53%), Gaps = 16/234 (6%)
Query: 75 NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
N+ SS + H + + D R L RD ARV ++ +KL I + E+
Sbjct: 60 NTKSSLRVVHMHGACSHLSSNKDARLDHDEILRRDEARVESIHSKLSKNIAD----EVSK 115
Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-EC 193
A++ LP +G GS Y IG+GTP S++ DTGSD+ W QC PC C
Sbjct: 116 AKSTKLPAK------NGIILGSPNYIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSC 169
Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETV 253
Y Q +P F+P +SSSY + C++P C + + +C A+ CLY + YGDGS TVG L E
Sbjct: 170 YSQKEPKFNPSSSSSYHNVSCSSPMCGNPE--SCSASNCLYGIGYGDGSVTVGFLAKEKF 227
Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYC 304
+ NS + I GCG +N+G+F+GSAG+LGLG G S Q T +YC
Sbjct: 228 TLTNSDVLDDIYFGCGENNKGVFIGSAGILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 136/263 (51%), Gaps = 21/263 (7%)
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
C Y + YGDGSFT G+L E + FG VK GCG +N+GLF G +GL+GLG LS
Sbjct: 76 CNYAINYGDGSFTRGELGHEKLKFGTI-LVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 134
Query: 292 LTKQ---IKATSLAYCL--VDRDSPASGVLEFNSA--RGGDAVT-APLIRNKKVDTFYYV 343
L Q I +YCL +R S +L NS+ R ++ A +I N ++ FY++
Sbjct: 135 LISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFI 194
Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
LTG S+GG A+Q P G I+VD GT ITRL Y +L+ F++
Sbjct: 195 NLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 247
Query: 404 PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAP 461
P ++ DTC++ S + V +PT+ +HF L D+ Y + D A C A A
Sbjct: 248 PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD-ASQVCLALAS 306
Query: 462 TS--SALSIIGNVQQQGTRVSFD 482
++I+GN QQ+ RV +D
Sbjct: 307 LEYQDEVAILGNYQQKNLRVIYD 329
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 140/298 (46%), Gaps = 17/298 (5%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y R+ +GTP +Q MVLDT +D W+ C CT C S F P S++ L C+
Sbjct: 44 NYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSEA 100
Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
QC + +C A + CL+ +YG S LV + ++ N + G GC + G
Sbjct: 101 QCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLAND-VIPGFTFGCINAVSG 159
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TA 329
+ GLLGLG G +SL Q A +YCL S SG L+ ++ T
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+RN + YYV LTG SVG V IP D G I+D GT ITR Y
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
++RD F + P S + FDTC F+ P V+LHF G L LP +N LI
Sbjct: 280 AIRDEFRKQVNG--PISSLGAFDTC--FAATNEAEAPAVTLHF-EGLNLVLPMENSLI 332
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 167/370 (45%), Gaps = 35/370 (9%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S P+ SG + G Y R+ +GTP + MVLDT +D ++ C C S F P
Sbjct: 84 SAPIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC---SATTFSPN 140
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFT---VGD---LVTETV-- 253
S+SY PL C+ PQC + +C A C + +Y +++ V D L T+ +
Sbjct: 141 ASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLRLATDVIPS 200
Query: 254 -SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
SFG+ ++ G ++ + +LS T + + +YCL S
Sbjct: 201 YSFGSINAISGSSIPAQGLLGLGRGPLS--------LLSQTGSLYSGVFSYCLPSFKSYY 252
Query: 313 -SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
SG L+ ++ T PL+RN + + Y+V LTG +VG V P L D
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G I+D GT ITR YN++RD F + P S + FDTC F P ++L
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTG--PFSSLGAFDTC--FVKNYETLAPAITL 368
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLAN 485
HF L LP +N LI S C A A T + L++I N QQQ RV FD N
Sbjct: 369 HF-TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVN 427
Query: 486 NRVGFTPNKC 495
N+VG C
Sbjct: 428 NKVGIARELC 437
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/361 (32%), Positives = 178/361 (49%), Gaps = 29/361 (8%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSSSYSPLPC 214
+Y + +G+PP++ ++DTGSD+ W QC C +Q P ++ SS++ P+PC
Sbjct: 85 QYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPC 144
Query: 215 A--APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
A A C + V C + C + +YG G +G L TE+ +F SG+ +A GC
Sbjct: 145 ADKAGFCAANGVHLCGLDGSCTFIASYGAGR-VIGSLGTESFAF-ESGTTS-LAFGCVSL 201
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVL--EFNSARGG 324
G ++GL+GLG G LSL QI AT +YCL S AS L +++ GG
Sbjct: 202 TRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSGASSHLFVGASASLGG 261
Query: 325 DAVTAPLIRNKK---VDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEA----GDGGIIVDC 376
+ P +++ K TFYY+ L G +VG + + + F++ + GG+I+D
Sbjct: 262 GGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDT 321
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
G+ +T+L + AY +L++ GN L P + + C G + V VP + HFG
Sbjct: 322 GSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVAREGFQKV-VPALVFHFGG 380
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
G + +PA +Y PVD A C SIIGN QQQ + +DL R F
Sbjct: 381 GADMAVPAASYWAPVDKAAA-CMMILEGGYD-SIIGNFQQQDMHLLYDLRRGRFSFQTAD 438
Query: 495 C 495
C
Sbjct: 439 C 439
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 131/427 (30%), Positives = 199/427 (46%), Gaps = 50/427 (11%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
L RD AR + I + QLA + A + F+ P+ SGA G+G+YF R
Sbjct: 57 LGERARDDARRHAYI-RSQLA-------SRRRRAADVGASAFAMPLSSGAYTGTGQYFVR 108
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK 220
VGTP + F +V DTGSD+ W++CR P F S S++PL C++ C
Sbjct: 109 FRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDTCT 168
Query: 221 S---LDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG--------------NSGSV 261
S ++ C A+ C Y Y DGS G + T+ + +
Sbjct: 169 SYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKL 228
Query: 262 KGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGV 315
+G+ LGC +G F S G+L LG +S + A +YCLVD +P AS
Sbjct: 229 QGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASSY 288
Query: 316 LEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
L F G A PL+ +++V FY V + V G+A+ IP ++++ GG
Sbjct: 289 LTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGRG--GGA 346
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSL 430
I+D GT++T L T AY ++ L G L VA+ F+ CY+++ + +P + +
Sbjct: 347 ILDSGTSLTVLATPAYRAV---VAALGGRLAALPRVAMDPFEYCYNWTA-GAPEIPKLEV 402
Query: 431 HFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRV 488
F L+ PAK+Y+I D+A G C + +S+IGN+ QQ FDL + +
Sbjct: 403 SFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWL 460
Query: 489 GFTPNKC 495
F +C
Sbjct: 461 RFKHTRC 467
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 159/356 (44%), Gaps = 43/356 (12%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R+ +GTPP + +DTGSD+ W QC PC CY Q PIFDP SS++
Sbjct: 61 YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKE------- 113
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
C N C Y++ Y D S++ G L TETV+ G + ++GCG +N
Sbjct: 114 ------KRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSN 167
Query: 275 LFV-----GSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGG 324
L S+G++GL G SL Q+ ++YC S + + F N+ G
Sbjct: 168 LMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCF---SSQGTSKINFGTNAVVAG 224
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
D A + KK FYY+ L SVG + ++ + F A DG I +D GT T L
Sbjct: 225 DGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPF---HAQDGNIFIDSGTTYTYLP 281
Query: 385 TQAYNSLRDSFVRLAGNL----KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
T N +R++ P+S L CY++ + P ++LHF G L L
Sbjct: 282 TSYCNLVREAVAASVVAANQVPDPSSENLL---CYNWDTME--IFPVITLHFAGGADLVL 336
Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
N + + GTFC A ++ +I GN V +D + + F+P C
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNC 392
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/353 (30%), Positives = 154/353 (43%), Gaps = 42/353 (11%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y ++ VGTPP + +DTGSD+ W QC PCT CY Q PIFDP SS++
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKE------- 113
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
C N C Y++ Y D +++ G L TETV+ G + +GCGH++
Sbjct: 114 ------KRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVTA 329
+G++GL G SL Q+ ++YC S + + F N+ GD V +
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFA---SQGTSKINFGTNAIVAGDGVVS 224
Query: 330 -PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
+ YY+ L SVG V+ + F A +G II+D GT +T
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTF---HALEGNIIIDSGTTLTYFPVSYC 281
Query: 389 NSLR---DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
N +R D +V PT L CY + P +++HF G L L N
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDML---CYYTDTID--IFPVITMHFSGGADLVLDKYNM 336
Query: 446 LIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I + GTFC A P A I GN Q V +D ++ V F+P C
Sbjct: 337 YIETITRGTFCLAIICNNPPQDA--IFGNRAQNNFLVGYDSSSLLVSFSPTNC 387
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 130/425 (30%), Positives = 203/425 (47%), Gaps = 44/425 (10%)
Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
RD AR + I LA A F+ P+ SGA G+G+YF R VGT
Sbjct: 61 RDDARRHAYIRSQLLAASRTRGRRAAEVGASASASAFAMPLSSGAYTGTGQYFVRFRVGT 120
Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQS-DPIFDPKTSSSYSPLPCAAPQCKS---LD 223
P + F +V DTGSD+ W++C + + +F S S++P+ C++ C S
Sbjct: 121 PAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIACSSDTCTSYVPFS 180
Query: 224 VSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-----------VKGIALGCGH 270
++ C A+ C Y Y DGS G + T++ + SGS ++G+ LGC
Sbjct: 181 LANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRAKLQGVVLGCTA 240
Query: 271 DNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEFN--SAR 322
+G F S G+L LG +S + A +YCLVD +P A+ L F
Sbjct: 241 SYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPPGPE 300
Query: 323 GGDAVTA---------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
GG A ++ PL+ ++++ FY V + V G+A+ IP ++ D A GG I
Sbjct: 301 GGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADVW--DVARGGGAI 358
Query: 374 VDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
+D GT++T L T AY ++ + RLAG P + F+ CY+++ ++ +P + + F
Sbjct: 359 LDSGTSLTVLATPAYRAVVAALSERLAG--LPRVSMDPFEYCYNWTAA-ALEIPGLEVRF 415
Query: 433 GAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGF 490
L PAK+Y+ VD+A G C + +S+IGN+ QQ FDL + + F
Sbjct: 416 AGSARLQPPAKSYV--VDAAPGVKCIGVQEGAWPGVSVIGNILQQDHLWEFDLRDRWLRF 473
Query: 491 TPNKC 495
+C
Sbjct: 474 KHTRC 478
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 140/298 (46%), Gaps = 17/298 (5%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y R+ +GTP +Q MVLDT +D W+ C CT C S F P S++ L C+
Sbjct: 44 NYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSEA 100
Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
QC + +C A + CL+ +YG S LV + ++ N + G GC + G
Sbjct: 101 QCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLAND-VIPGFTFGCINAVSG 159
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TA 329
+ GLLGLG G +SL Q A +YCL S SG L+ ++ T
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+RN + YYV LTG SVG V IP D G I+D GT ITR Y
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
++RD F + P S + FDTC F+ P V+LHF G L LP +N LI
Sbjct: 280 AIRDEFRKQVNG--PISSLGAFDTC--FAETNEAEAPAVTLHF-EGLNLVLPMENSLI 332
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 175/368 (47%), Gaps = 51/368 (13%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C +C + DP F P+ S+SY L C
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC- 131
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHDNE 273
P C D C+Y+ Y + S + G L + +SFGN S + GC ++
Sbjct: 132 NPDCNCDD----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187
Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE--FNSARGGDAVTA 329
G LF A G++GLG G LS+ Q LVD+ GV+E F+ GG V
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQ---------LVDK-----GVIEDVFSLCYGGMEVGG 233
Query: 330 PLIRNKKV---------------DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+ K+ +Y + L V G+++++ P +F G G ++
Sbjct: 234 GAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVL 289
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTV 428
D GT +A+ +++D+ ++ +LK G D C+ +G + P +
Sbjct: 290 DSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 349
Query: 429 SLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
++ FG G+ L L +NYL G +C P + +++G + + T V++D N++
Sbjct: 350 AMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDK 409
Query: 488 VGFTPNKC 495
+GF C
Sbjct: 410 LGFLKTNC 417
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 166/359 (46%), Gaps = 38/359 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
+GTPP+ MVLDTGS ++W+QC + + FDP SSS+S LPC+ P CK
Sbjct: 78 IGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPRIP 136
Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+L S C +NR C Y Y DG+F G+LV E ++F N+ + LGC ++
Sbjct: 137 DFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS---- 191
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
G+LG+ G LS Q K + +YC+ + S G S GD + + +
Sbjct: 192 DDRGILGMNRGRLSFVSQAKISKFSYCIPPK-SNRPGFTPTGSFYLGDNPNSHGFKYVSL 250
Query: 338 DTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
TF Y V + G G + + I S+F D G G +VD G+ T L
Sbjct: 251 LTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLV 310
Query: 385 TQAYNSLR-DSFVRLAGNLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALD 439
AY+ +R + R+ LK G A D C+D + R + + F G +
Sbjct: 311 DAAYDKVRAEIMTRVGRRLKKGYVYGGTA--DMCFDGNVAMIPRLIGDLVFVFTRGVEIF 368
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+P + L+ V G C +S +A +IIGNV QQ V FD+ N RVGF C
Sbjct: 369 VPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 166/359 (46%), Gaps = 38/359 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
+GTPP+ MVLDTGS ++W+QC + + FDP SSS+S LPC+ P CK
Sbjct: 78 IGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPRIP 136
Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+L S C +NR C Y Y DG+F G+LV E ++F N+ + LGC ++
Sbjct: 137 DFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS---- 191
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
G+LG+ G LS Q K + +YC+ + S G S GD + + +
Sbjct: 192 DDRGILGMNRGRLSFVSQAKISKFSYCIPPK-SNRPGFTPTGSFYLGDNPNSHGFKYVSL 250
Query: 338 DTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
TF Y V + G G + + I S+F D G G +VD G+ T L
Sbjct: 251 LTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLV 310
Query: 385 TQAYNSLR-DSFVRLAGNLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALD 439
AY+ +R + R+ LK G A D C+D + R + + F G +
Sbjct: 311 DAAYDKVRAEIMTRVGRRLKKGYVYGGTA--DMCFDGNVAMIPRLIGDLVFVFTRGVEIL 368
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+P + L+ V G C +S +A +IIGNV QQ V FD+ N RVGF C
Sbjct: 369 VPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 124/408 (30%), Positives = 190/408 (46%), Gaps = 46/408 (11%)
Query: 98 YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ--ILPEDFSTPVVSGASQG 155
Y+++ L +D+A +TL RH A Q + P DF P + +
Sbjct: 57 YKNVKAESLAKDTALESTL-----------SRHAYLRARQQKALQPADFVPPPLI---RD 102
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+ + + +G PP +VLDTGSD+ W+QC PC CY+Q DPI++ S SY+ + C
Sbjct: 103 KSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCN 162
Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
P C SL + + CLYQ +Y DGS T G L E V+F + + GCG
Sbjct: 163 EPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCG 222
Query: 270 HDNEGLFVGS--AGLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDSP-ASGVLEFNSA 321
N S G+LGLG G++SL Q+ A S AYC + +P A G L F A
Sbjct: 223 LQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFGDA 282
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQA--VQIPPSLFEMDEAGDGGIIVDCGTA 379
+ P++ + FYYV L G +G + + I S FE G GG+I+D G+
Sbjct: 283 TYLNGDMTPMV----IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGST 338
Query: 380 ITRLQTQAYNSLRDSFV---RLAGNLKPTSGVALFDTCYDFSGLRSVRV-PTVSLHFGAG 435
++ + Y +R++ V + N+ P + C++ R + + PT+ L+ +
Sbjct: 339 LSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSP---DCFEGKIGRDLPLFPTLVLYLEST 395
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
L+ +L D FC F + LSIIG + QQ + ++L
Sbjct: 396 GILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYNL 440
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/358 (30%), Positives = 157/358 (43%), Gaps = 45/358 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCA 215
G Y+S I +G+PP+ FS+V+DTGSD+ W++C PC+ +C FD S++Y L CA
Sbjct: 1 GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDC----SSTFDRLASNTYKALTCA 56
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-----GIALGCGH 270
Y YGDGSFT GDL +T+ + S + G GCG
Sbjct: 57 DD----------------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGS 100
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD-------RDSP---ASGVLE 317
+GL G G+L L G LS QI +YCL+ + SP +E
Sbjct: 101 LLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVE 160
Query: 318 FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
G + +Y V L G SVG Q + + PS F + D I D G
Sbjct: 161 LKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQ--DKPTIFDSG 218
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
T +T L +S++ S + + + + D C+ +P ++ HF G
Sbjct: 219 TTLTMLPPGVCDSIKQSLASMVSGAEFVA-IKGLDACFRVPPSSGQGLPDITFHFNGGAD 277
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
NY+I + S C F PT+ +SI GN+QQQ V D+ N R+GF C
Sbjct: 278 FVTRPSNYVIDLGSLQ--CLIFVPTNE-VSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 175/368 (47%), Gaps = 51/368 (13%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C +C + DP F P+ S+SY L C
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC- 131
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHDNE 273
P C D C+Y+ Y + S + G L + +SFGN S + GC ++
Sbjct: 132 NPDCNCDD----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187
Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE--FNSARGGDAVTA 329
G LF A G++GLG G LS+ Q LVD+ GV+E F+ GG V
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQ---------LVDK-----GVIEDVFSLCYGGMEVGG 233
Query: 330 PLIRNKKV---------------DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+ K+ +Y + L V G+++++ P +F G G ++
Sbjct: 234 GAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVL 289
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTV 428
D GT +A+ +++D+ ++ +LK G D C+ +G + P +
Sbjct: 290 DSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 349
Query: 429 SLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
++ FG G+ L L +NYL G +C P + +++G + + T V++D N++
Sbjct: 350 AMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDK 409
Query: 488 VGFTPNKC 495
+GF C
Sbjct: 410 LGFLKTNC 417
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 175/367 (47%), Gaps = 49/367 (13%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C +C + DP F P+ SSSY L C
Sbjct: 77 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKC- 135
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDNE 273
P C D C+Y+ Y + S + G L + +SFGN + + GC +
Sbjct: 136 NPDCNCDD----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVET 191
Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAY---------CLVDRDSPASGV 315
G LF A G++GLG G LS+ Q+ SL Y ++ + SP +G+
Sbjct: 192 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPAGM 251
Query: 316 LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
+ +S D +P +Y + L V G+++++ P +F G G ++D
Sbjct: 252 VFSHS----DPFRSP---------YYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLD 294
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVS 429
GT +A+ +++D+ ++ +LK G D C+ +G + P +
Sbjct: 295 SGTTYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEID 354
Query: 430 LHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
+ FG G+ L L +NYL G +C P + +++G + + T V++D N+++
Sbjct: 355 MEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKL 414
Query: 489 GFTPNKC 495
GF C
Sbjct: 415 GFLKTNC 421
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 124/408 (30%), Positives = 189/408 (46%), Gaps = 46/408 (11%)
Query: 98 YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ--ILPEDFSTPVVSGASQG 155
Y+++ L +D+A +TL RH A Q + P DF P + +
Sbjct: 44 YKNVKAESLAKDTALESTL-----------SRHAYLRARQQKALQPADFVPPPLI---RD 89
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+ + + +G PP +VLDTGSD+ W+QC PC CY+Q DPI++ S SY+ + C
Sbjct: 90 KSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCN 149
Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
P C SL + + CLYQ AY DG+ T G L E V+F + + GCG
Sbjct: 150 EPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCG 209
Query: 270 HDNEGLFVGS--AGLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDSP-ASGVLEFNSA 321
N + G+LGLG G++SL Q+ A S AYC + +P A G L F A
Sbjct: 210 LQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDA 269
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGL--TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ P++ + FYYV L G VG + I S FE G GG+I+D G+
Sbjct: 270 TYLNGDMTPMV----IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGST 325
Query: 380 ITRLQTQAYNSLRDSFV---RLAGNLKPTSGVALFDTCYDFSGLRSVRV-PTVSLHFGAG 435
++ + Y +R++ V + N+ P + C++ R + + PT+ L+ +
Sbjct: 326 LSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSP---DCFEGKIERDLPLFPTLVLYLEST 382
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
L+ +L D FC F + LSIIG + QQ + ++L
Sbjct: 383 GILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYNL 427
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 109/353 (30%), Positives = 154/353 (43%), Gaps = 42/353 (11%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y ++ VGTPP + +DTGSD+ W QC PCT CY Q PIFDP SS++
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKE------- 113
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
C N C Y++ Y D +++ G L TETV+ G + +GCGH++
Sbjct: 114 ------KRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGD-AVT 328
+G++GL G SL Q+ ++YC S + + F N+ GD V+
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFA---SQGTSKINFGTNAIVAGDGVVS 224
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
+ YY+ L SVG V+ + F A +G II+D GT +T
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTF---HALEGNIIIDSGTTLTYFPVSYC 281
Query: 389 NSLR---DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
N +R D +V PT L CY + P +++HF G L L N
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDML---CYYTDTID--IFPVITMHFSGGADLVLDKYNM 336
Query: 446 LIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I + GTFC A P A I GN Q V +D ++ V F+P C
Sbjct: 337 YIETITRGTFCLAIICNNPPQDA--IFGNRAQNNFLVGYDSSSLLVFFSPTNC 387
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 170/385 (44%), Gaps = 52/385 (13%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP---IFDPKTSSSY 209
G Y + GTPP+ +++DTGSD+ W C C C + S+P IF PK+SSS
Sbjct: 88 GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147
Query: 210 SPLPCAAPQCKSLDVSACRANRCL---------------YQVAYGDGSFTVGDLVTETVS 254
L C P+C + S ++ RC Y V YG G T G +++ET+
Sbjct: 148 KVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLD 205
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSP 311
G V +GC + AG+ G G G SL Q+ +YCL+ R D+
Sbjct: 206 LPGKG-VPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTT 261
Query: 312 ASGVLEFNSARGGDAVTA-----PLIRNKKV------DTFYYVGLTGFSVGGQAVQIPPS 360
S L + TA P ++N KV +YY+GL +VGG+ V+IP
Sbjct: 262 ESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYK 321
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT--SGVALFDTCYDFS 418
GDGG I+D GT T ++ + + + F + + + T G+ C++ S
Sbjct: 322 YLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNIS 381
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------IIG 470
GL + P ++L F G ++LP NY+ + C +A I+G
Sbjct: 382 GLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILG 441
Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
N QQQ V +DL N R+GF C
Sbjct: 442 NFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 117/371 (31%), Positives = 173/371 (46%), Gaps = 41/371 (11%)
Query: 137 AQILPEDFSTPVVSGASQG---SGEYFSRIGVG--TPPRQFSMVLDTGSD-INWLQCRPC 190
+ +LP++ + G SQG + +Y G G PP ++ + D I W QC+PC
Sbjct: 47 SSLLPKNKCSASARGGSQGLPITQKYGPCSGSGHSQPPSPQEILAEMNPDSITWTQCKPC 106
Query: 191 TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVT 250
C + S FDP S +YS C + + N Y + YGD S +VG+
Sbjct: 107 VRCLKDSHRHFDPSASLTYSLGSC---------IPSTVGNT--YNMTYGDKSTSVGNYGC 155
Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLV 306
+T++ S GCG +NEG F G+ G+LGLG G LS Q + +YCL
Sbjct: 156 DTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLP 215
Query: 307 DRDSPASGVL-----EFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
+ DS S + +S + V P + +Y+V L SVG + + +P S+
Sbjct: 216 EEDSIGSLLFGEKATSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNVPSSV 275
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA----LFDTCYDF 417
F G I+D GT IT L +AY++L +F + ++G + DTCY+
Sbjct: 276 F-----ASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGDILDTCYNL 330
Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-----SALSIIGNV 472
SG + V +P + LHFG G + L K +I + A C AFA S S L+IIGN
Sbjct: 331 SGRKDVLLPEIVLHFGEGADVRLNGKR-VIWGNDASRLCLAFAGNSKSTMNSELTIIGNR 389
Query: 473 QQQGTRVSFDL 483
QQ V +D+
Sbjct: 390 QQVSLTVLYDI 400
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 109/355 (30%), Positives = 170/355 (47%), Gaps = 28/355 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
+ +G P ++DTGS+I W++C PC C QQ+ P+ DP SS+Y+ LPC
Sbjct: 99 FLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTM 158
Query: 219 CKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNS----GSVKGIALGCGHDNE 273
C + C R N+C Y ++Y G + G L TE + F +S +V + GC H+N
Sbjct: 159 CHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHEN- 217
Query: 274 GLFVGS--AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV--LEFNSARGGDAVTA 329
G + G+ GLG G+ S ++ + +YCL + P G L F + +
Sbjct: 218 GDYKDRRFTGVFGLGKGITSFVTRM-GSKFSYCLGNIADPHYGYNQLVFGEKANFEGYST 276
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL K V+ YYV L G SVG + + I + F M + + ++D GTA+T L A+
Sbjct: 277 PL---KVVNGHYYVTLEGISVGEKRLDIDSTAFSM-KGNEKSALIDSGTALTWLAESAFR 332
Query: 390 SLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYL 446
+L D+ VR L G L P + CY + + + P V+ HF G LDL ++
Sbjct: 333 AL-DNEVRQLLDGVLMPFWRGSF--ACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMF 389
Query: 447 IPVDSAGTFCFAFAPTSS------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ C A S+ + S+IG + QQ +++DL +N++ F C
Sbjct: 390 YQA-TPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRIDC 443
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 162/353 (45%), Gaps = 31/353 (8%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
+ I +G+PP + +DT SD+ W+QC PC CY QS PIFDP S ++ C Q
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144
Query: 219 --CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG------NSGSVKGIALGCGH 270
SL +A C Y + Y D + + G L E + F +S ++ + GCGH
Sbjct: 145 YSMPSLKFNA-NTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGH 203
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAV- 327
DN G + G+LGLG G SL + +YC D P+ VL G + +
Sbjct: 204 DNYGEPLVGTGILGLGYGEFSLVHRF-GKKFSYCFGSLDDPSYPHNVLVLGD-DGANILG 261
Query: 328 -TAPL-IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDGGIIVDCGTAITRLQ 384
T PL I N FYYV + SV G + I P +F + + G GG I+D G ++T L
Sbjct: 262 DTTPLEIHNG----FYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLV 317
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDT----CYDFSGLRSVR---VPTVSLHFGAGKA 437
+AY L++ + + V+ D CY+ + R + P V+ HF G
Sbjct: 318 EEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAE 377
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
L L K+ + + S FC A P + L+ IG QQ + +DL V F
Sbjct: 378 LSLDVKSLFMKL-SPNVFCLAVTPGN--LNSIGATAQQSYNIGYDLEAMEVSF 427
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 151/307 (49%), Gaps = 41/307 (13%)
Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
++ +LA + R E A ++ E TP++ GEY ++G+GTPP +F+ +D
Sbjct: 55 SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++ C LDV C + C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
Y + T G L + + G + +G+A GC + G ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226
Query: 293 TKQIKATSLAYCLVDRDSPASGVL----EFNSARGG-DAVTAPLIRNKKVDTFYYVGLTG 347
Q+ AYCL S G L + ++AR + + P+ R+ + ++YY+ L G
Sbjct: 227 VSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDG 286
Query: 348 FSVGGQAVQIP-----------------------PSLFEMDEAGDGGIIVDCGTAITRLQ 384
+G + + +P + + +A G+I+D + IT L+
Sbjct: 287 LLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLE 346
Query: 385 TQAYNSL 391
Y+ L
Sbjct: 347 ASLYDEL 353
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 169/367 (46%), Gaps = 39/367 (10%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--QSDPIFDPKTSSSYSPLPCAAPQCK 220
+ VGTPP+ +MVLDTGS+++WL C P +S F P+ S +++ +PC + QC+
Sbjct: 69 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCR 128
Query: 221 SLDVS---AC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDN 272
S D+ AC + +C ++Y DGS + G L TE + G ++ A GC D
Sbjct: 129 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRA-AFGCMATAFDT 187
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR------GGDA 326
V +AGLLG+ G LS Q +YC+ DRD +GVL +
Sbjct: 188 SPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPLNYTP 245
Query: 327 VTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
+ P + D Y V L G VGG+ + IP S+ D G G +VD GT T L
Sbjct: 246 LYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLG 305
Query: 386 QAYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLRS--VRVPTVSLHF-GAGK 436
AY++L+ F R P + FDTC+ R+ R+P V+L F GA
Sbjct: 306 DAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQM 365
Query: 437 ALDLPAKNYLIPVDSA---GTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDLANNRV 488
+ Y +P + G +C F P ++ +IG+ Q V +DL RV
Sbjct: 366 TVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRV 423
Query: 489 GFTPNKC 495
G P +C
Sbjct: 424 GLAPIRC 430
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 122/370 (32%), Positives = 177/370 (47%), Gaps = 45/370 (12%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
I VGTPP+ SMV+DTGS+++WL C T P F+P SSSY+P+ C++P C +
Sbjct: 70 ITVGTPPQNMSMVIDTGSELSWLHCNTNTTA-TIPYPFFNPNISSSYTPISCSSPTCTTR 128
Query: 222 -----LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
+ S N C ++Y D S + G+L ++T FG+S + GI GC + N
Sbjct: 129 TRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFN-PGIVFGCMNSSYSTN 187
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL---EFNSARGGDAVTA 329
+ GL+G+ G LSL Q+K +YC+ D SG+L E N + GG
Sbjct: 188 SESDSNTTGLMGMNLGSLSLVSQLKIPKFSYCISGSD--FSGILLLGESNFSWGGSLNYT 245
Query: 330 PLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
PL++ + + F Y V L G + + + I +LF D G G + D GT + L
Sbjct: 246 PLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQFSYLL 305
Query: 385 TQAYNSLRDSFV-RLAGNLKPTSG------VALFDTCYDFSGLRSV--RVPTVSLHF-GA 434
YN+LRD F+ + G L+ +A+ D CY +S +P+VSL F GA
Sbjct: 306 GPVYNALRDEFLNQTNGTLRALDDPNFVFQIAM-DLCYRVPVNQSELPELPSVSLVFEGA 364
Query: 435 -----GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLAN 485
G L ++ DS +CF F S L IIG+ QQ + FDL
Sbjct: 365 EMRVFGDQLLYRVPGFVWGNDSV--YCFTFG-NSDLLGVEAFIIGHHHQQSMWMEFDLVE 421
Query: 486 NRVGFTPNKC 495
+RVG +C
Sbjct: 422 HRVGLAHARC 431
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 169/367 (46%), Gaps = 39/367 (10%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--QSDPIFDPKTSSSYSPLPCAAPQCK 220
+ VGTPP+ +MVLDTGS+++WL C P +S F P+ S +++ +PC + QC+
Sbjct: 70 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCR 129
Query: 221 SLDVS---AC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDN 272
S D+ AC + +C ++Y DGS + G L TE + G ++ A GC D
Sbjct: 130 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRA-AFGCMATAFDT 188
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR------GGDA 326
V +AGLLG+ G LS Q +YC+ DRD +GVL +
Sbjct: 189 SPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPLNYTP 246
Query: 327 VTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
+ P + D Y V L G VGG+ + IP S+ D G G +VD GT T L
Sbjct: 247 LYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLG 306
Query: 386 QAYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLRS--VRVPTVSLHF-GAGK 436
AY++L+ F R P + FDTC+ R+ R+P V+L F GA
Sbjct: 307 DAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQM 366
Query: 437 ALDLPAKNYLIPVDSA---GTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDLANNRV 488
+ Y +P + G +C F P ++ +IG+ Q V +DL RV
Sbjct: 367 TVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRV 424
Query: 489 GFTPNKC 495
G P +C
Sbjct: 425 GLAPIRC 431
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 166/367 (45%), Gaps = 35/367 (9%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S P+ SG + G Y R+ +GTP + MVLDT +D ++ C C S F P
Sbjct: 84 SAPIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC---SATTFSPN 140
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFT---VGD---LVTETV-- 253
S+SY PL C+ PQC + +C A C + +Y +++ V D L T+ +
Sbjct: 141 ASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLRLATDVIPS 200
Query: 254 -SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
SFG+ ++ G ++ + +LS T + + +YCL S
Sbjct: 201 YSFGSINAISGSSIPAQGLLGLGRGPLS--------LLSQTGSLYSGVFSYCLPSFKSYY 252
Query: 313 -SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
SG L+ ++ T PL+RN + + Y+V LTG +VG V P L D
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G I+D GT ITR YN++RD F + P S + FDTC F P ++L
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTG--PFSSLGAFDTC--FVKNYETLAPAITL 368
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLAN 485
HF L LP +N LI S C A A T + L++I N QQQ RV FD N
Sbjct: 369 HF-TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVN 427
Query: 486 NRVGFTP 492
N+ + P
Sbjct: 428 NKGWYCP 434
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 151/350 (43%), Gaps = 36/350 (10%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y ++ VGTPP + V+DTGS+I W QC PC CY+Q+ PIFDP SS++
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKE------- 432
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
C + C Y+V Y D ++T G L T+TV+ G + +GCG +N
Sbjct: 433 ------KRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSW 486
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPL 331
G +GL G LSL Q+ ++YC + GG V+ +
Sbjct: 487 FRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKINFGTNAIVGGGGVVSTTM 546
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
FYY+ L SVG ++ + F A +G I++D GT +T N +
Sbjct: 547 FVTTARPGFYYLNLDAVSVGDTRIETLGTPF---HALEGNIVIDSGTTLTYFPESYCNLV 603
Query: 392 RDSFVRLAGNL---KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
R + + + PT L CY +S + P +++HF G L L N +
Sbjct: 604 RQAVEHVVPAVPAADPTGNDLL---CY-YSNTTEI-FPVITMHFSGGADLVLDKYNMFME 658
Query: 449 VDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S G FC A PT A I GN Q V +D ++ V F P C
Sbjct: 659 SYSGGLFCLAIICNNPTQEA--IFGNRAQNNFLVGYDSSSLLVSFKPTNC 706
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 150/335 (44%), Gaps = 50/335 (14%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
EY ++ +GTPP + VLDTGS++ W QC PC CY Q PIFDP SS++ C P
Sbjct: 64 EYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNTP 123
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA----LGCGHDN- 272
+ C Y++ Y D S+T G L TETV+ ++ V + +GC +N
Sbjct: 124 D-----------HSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNS 172
Query: 273 -EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPL 331
G S+G++GL G LSL Q+ AY P GV V+ +
Sbjct: 173 GSGFRPSSSGIVGLSRGSLSLISQMGG---AY-------PGDGV-----------VSTTM 211
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
YY+ L SVG ++ + F A +G I++D GT +T N +
Sbjct: 212 FAKTAKRGQYYLNLDAVSVGDTRIETVGTPF---HALNGNIVIDSGTPLTYFPVSYCNLV 268
Query: 392 RDSFVRLAGN---LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
R + R+ + P+ L CY +S + P +++HF G L L N +
Sbjct: 269 RKAVERVVTADRVVDPSRNDML---CY-YSNTIEI-FPVITVHFSGGADLVLDKYNMYME 323
Query: 449 VDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
++ G FC A + + ++I GN Q V +D
Sbjct: 324 LNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/420 (26%), Positives = 192/420 (45%), Gaps = 38/420 (9%)
Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ------ 154
+VLS + + +I L L+ N+ H KP + + A
Sbjct: 24 VVLSATDIPNHNHRPMIIPLHLSTSNISSHR-KPFTSNYHRRQLHNSDLPNAHMRLYDDL 82
Query: 155 -GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
+G Y +R+ +GTPP++F++++DTGS + ++ C C +C + DP F P++SS+Y P+
Sbjct: 83 LSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQ 142
Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
C P C D +C Y+ Y + S + G L + +SFGN + + GC
Sbjct: 143 C-NPSCNCDD----EGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETV 197
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
G LF A G++GLG G LS+ K++ S + C D ++ N
Sbjct: 198 ETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNIPPPP 257
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
D V A + +Y + L V G+ +++ P +F+ G G ++D GT L
Sbjct: 258 DMVFAH--SDPYRSAYYNIELKELHVAGKRLKLNPRVFD----GKHGTVLDSGTTYAYLP 311
Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGKA 437
+A+ + +D+ ++ LK G + D C+ D S L + P V++ FG G+
Sbjct: 312 EEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKI-FPEVNMVFGNGQK 370
Query: 438 LDLPAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L L +NYL +G +C F +++G + + T V++D N+++GF C
Sbjct: 371 LSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNC 430
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/373 (31%), Positives = 171/373 (45%), Gaps = 36/373 (9%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SG G+ +YF+ I VGTP ++F +V+DTGS++ W+ CR + + +F S S+
Sbjct: 97 SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKSF 155
Query: 210 SPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
+ C CK SL + C Y Y DGS G ET++ G
Sbjct: 156 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRM 215
Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL-----AYCLVDR--DS 310
+ G +GC G F G+ G+LGL S T ATSL +YCLVD +
Sbjct: 216 ARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTST--ATSLYGAKFSYCLVDHLSNK 273
Query: 311 PASGVLEFNSARGGDAV---TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
S L F S+R T PL ++ FY + + G S+G + IP ++ D
Sbjct: 274 NVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAINVIGISLGYDMLDIPSQVW--DAT 330
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS--GVALFDTCYDF-SGLRSVR 424
GG I+D GT++T L AY + R LK GV + + C+ F SG +
Sbjct: 331 SGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPI-EYCFSFTSGFNVSK 389
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFD 482
+P ++ H G + K+YL VD+A G C F + A ++IGN+ QQ FD
Sbjct: 390 LPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFD 447
Query: 483 LANNRVGFTPNKC 495
L + + F P+ C
Sbjct: 448 LMASTLSFAPSAC 460
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/351 (31%), Positives = 155/351 (44%), Gaps = 25/351 (7%)
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+ I +G PP +V+DTGSDI W+ C PCT C +FDP SS++SPL C P
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPL-CKTP-- 158
Query: 220 KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCGHD-NEG 274
D CR + + V Y D S G +TV F + + + GCGH+
Sbjct: 159 --CDFEGCRCDPIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIGHD 216
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--AVTAPLI 332
G G+LGL G SL ++ +YC+ + P + G D + P
Sbjct: 217 TDPGHNGILGLNNGPDSLVTKL-GQKFSYCIGNLADPYYNYHQLILGEGADLEGYSTPF- 274
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+ + FYYV + G SVG + + I P FEM E GG+I+D G+ IT L + L
Sbjct: 275 --EVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHKLLS 332
Query: 393 DSFVRLAGN--LKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIPV 449
L G + T + + C+ S R V P V+ HF G L L + ++ +
Sbjct: 333 KEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSGSFFNQL 392
Query: 450 DSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ FC P S S S+IG + QQ V +DL N V F C
Sbjct: 393 ND-NVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 118/402 (29%), Positives = 184/402 (45%), Gaps = 45/402 (11%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQFSMVLDTGSDI 182
R L+ A L + F VV + QGS G YF+R+ +GTPPR+F++ +DTGSD+
Sbjct: 48 RDHLRHAR---LLQGFVGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTGSDV 104
Query: 183 NWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKS---LDVSAC--RANRC 232
W+ C C+ C Q S FD +SS+ +PC+ P C S + C ++N+C
Sbjct: 105 LWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQC 164
Query: 233 LYQVAYGDGSFTVGDLVTETVSF----GNS---GSVKGIALGCGHDNEGLFVGS----AG 281
Y YGDGS T G V++T F G S S I GC G + G
Sbjct: 165 SYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDG 224
Query: 282 LLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
+ G G G LS+ Q+ + + ++CL DS G+L V +PL+ ++
Sbjct: 225 IFGFGQGELSVISQLSSHGITPRVFSHCLKGEDS-GGGILVLGEILEPGIVYSPLVPSQP 283
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
Y + L +V GQ + I P+ F + + G I+D GT + L +AY+ +
Sbjct: 284 ---HYNLDLQSIAVSGQLLPIDPAAFA--TSSNRGTIIDTGTTLAYLVEEAYDPFVSAIT 338
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS---AG 453
L T + + CY S S P VS +F G + L + YL+ + + A
Sbjct: 339 AAVSQLA-TPTINKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAA 397
Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+C F ++I+G++ + +DLA+ R+G+ C
Sbjct: 398 LWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 121/390 (31%), Positives = 172/390 (44%), Gaps = 60/390 (15%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDPI----FDPKTSSS 208
G Y + +GTPP+ VLDTGS + W C C+ C + DP F PK SS+
Sbjct: 86 GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSST 145
Query: 209 YSPLPCAAPQCKSL---DVSACRANRCL-------------YQVAYGDGSFTVGDLVTET 252
L C P+C L DV + R +C Y + YG G+ T G L+ +
Sbjct: 146 AKLLGCRNPKCGYLFGPDVES-RCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDN 203
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---D 309
++F +V +GC + +G+ G G G SL Q+ +YCLV D
Sbjct: 204 LNFPGK-TVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDD 259
Query: 310 SPASGVLEFNSARGGDAVT-----APLIRNKKVDT----FYYVGLTGFSVGGQAVQIPPS 360
+P S L + GD T P N ++ +YYV L VGG V+IP
Sbjct: 260 TPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYK 319
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-------NLKPTSGVALFDT 413
E G+GG IVD G+ T ++ YN + F+R G N++ SG++
Sbjct: 320 FLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLS---P 376
Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF-------AFAPTSSAL 466
C++ SG++++ P + F G + P NY V A CF A P ++
Sbjct: 377 CFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGP 436
Query: 467 SII-GNVQQQGTRVSFDLANNRVGFTPNKC 495
+II GN QQQ V +DL N R GF P C
Sbjct: 437 AIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 116/363 (31%), Positives = 171/363 (47%), Gaps = 30/363 (8%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
++GSG + + +G+PP +V+DTGS + W+QC PC C+QQS FDP S S+ L
Sbjct: 99 NRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTL 157
Query: 213 PCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVK--GIALG 267
C P ++ C R N+ Y++ Y G + G L E++ F + G +K I G
Sbjct: 158 GCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFG 217
Query: 268 CGHDNEGLFVGSA--GLLGLGGG-MLSLTKQIKATSLAYCLVDRDSPASG----VLEFNS 320
CGH N A G+ GLG +++ Q+ +YC+ D ++P VL S
Sbjct: 218 CGHMNIKTNNDDAYNGVFGLGAYPHITMATQL-GNKFSYCIGDINNPLYTHNHLVLGQGS 276
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
GD+ + YYV L SVG + ++I P+ F++ G GG+++D G
Sbjct: 277 YIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTY 331
Query: 381 TRLQTQAYNSLRDSFVRL-AGNLKPTSGVALFD-TCYDFSGLRS---VRVPTVSLHFGAG 435
T+L + L D V L G L+ F+ C F G+ S V P V+ HF G
Sbjct: 332 TKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLC--FKGVVSRDLVGFPAVTFHFAGG 389
Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRVSFDLANNRVGFTP 492
L L + + L FC A P++S LS+IG + QQ V FDL +V F
Sbjct: 390 ADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRR 448
Query: 493 NKC 495
C
Sbjct: 449 IDC 451
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 119/419 (28%), Positives = 184/419 (43%), Gaps = 56/419 (13%)
Query: 121 QLAIYNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
LA ++ R H LK + L + +S + G + + GTPP++ S ++DTG
Sbjct: 54 HLATASLSRAHHLKHGKTSPLTQ------ISLSPHSYGGHSIPLSFGTPPQKLSFLVDTG 107
Query: 180 SDINWLQCRP---CTEC-----YQQSDPIFDPKTSSSYSPLPCAAPQCKS-------LDV 224
S + W C CT C + PIF+PK SSS L C P+C + L
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGC 167
Query: 225 SACRAN--RCL-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
C N C Y + YG G+ + GD + E ++F ++ +GC G V
Sbjct: 168 PPCNGNSKNCSHACPPYSLQYGTGA-SSGDFLLENLNFPGK-TIHEFLVGCTTSAVGE-V 224
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----SPASGVLEFNSARGGDAVTAPLI 332
SA L G G M SL Q+ AYCL D + + +L+++ AP +
Sbjct: 225 TSAALAGFGRSMFSLPMQMGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFL 284
Query: 333 RNK-KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY--- 388
+N +YY+G+ +G + ++IP G GG+++D G A + +
Sbjct: 285 KNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKV 344
Query: 389 -NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY-- 445
N L+ + +L+ + + + CY+F+G +S+++P + F G + +P KNY
Sbjct: 345 TNELKKRMSKYRRSLEAEAEIGV-TPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFV 403
Query: 446 LIPVDS---------AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LIP S AGT F P S I+GN Q V FDL N R+GF C
Sbjct: 404 LIPEISLACFPLTTDAGTNTLEFTPGPSI--ILGNSQHVDYYVEFDLKNERLGFRQQTC 460
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 109/372 (29%), Positives = 173/372 (46%), Gaps = 38/372 (10%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSY 209
G G Y +++ +GTPPR+F++ +DTGSDI W+ C C+ C + S FD SS+
Sbjct: 80 GYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTA 139
Query: 210 SPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSF--------- 255
+ +PC+ P C S A + N+C Y Y DGS T G V++ + F
Sbjct: 140 ALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTP 199
Query: 256 GNSGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLV 306
N S I GC G + G+LG G G LS+ Q+ + + ++CL
Sbjct: 200 ANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL- 258
Query: 307 DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
D G+L V +PL+ ++ Y + L +V GQ + I P++F +
Sbjct: 259 KGDGNGGGILVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQVLSINPAVFATSD 315
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
G I+D GT ++ L +AY+ L ++ V A + TS ++ CY P
Sbjct: 316 --KRGTIIDSGTTLSYLVQEAYDPLVNA-VDTAVSQFATSFISKGSQCYLVLTSIDDSFP 372
Query: 427 TVSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
TVS +F G ++DL YL+ D A +C F ++I+G++ + V +DL
Sbjct: 373 TVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDL 432
Query: 484 ANNRVGFTPNKC 495
A ++G+T C
Sbjct: 433 ARQQIGWTNYDC 444
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/373 (31%), Positives = 171/373 (45%), Gaps = 36/373 (9%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
SG G+ +YF+ I VGTP ++F +V+DTGS++ W+ CR + + +F S S+
Sbjct: 75 SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKSF 133
Query: 210 SPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
+ C CK SL + C Y Y DGS G ET++ G
Sbjct: 134 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRM 193
Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL-----AYCLVDR--DS 310
+ G +GC G F G+ G+LGL S T ATSL +YCLVD +
Sbjct: 194 ARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTST--ATSLYGAKFSYCLVDHLSNK 251
Query: 311 PASGVLEFNSARGGDAV---TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
S L F S+R T PL ++ FY + + G S+G + IP ++ D
Sbjct: 252 NVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAINVIGISLGYDMLDIPSQVW--DAT 308
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS--GVALFDTCYDF-SGLRSVR 424
GG I+D GT++T L AY + R LK GV + + C+ F SG +
Sbjct: 309 SGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPI-EYCFSFTSGFNVSK 367
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFD 482
+P ++ H G + K+YL VD+A G C F + A ++IGN+ QQ FD
Sbjct: 368 LPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFD 425
Query: 483 LANNRVGFTPNKC 495
L + + F P+ C
Sbjct: 426 LMASTLSFAPSAC 438
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 163/366 (44%), Gaps = 32/366 (8%)
Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
P+ SG S Y R GTP + + +DT +D W+ C C C + F P
Sbjct: 93 PIASGRQITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTP--FAPPK 150
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
S+++ + C A QCK + C + C + YG S LV +TV+ + V
Sbjct: 151 STTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAASLVQDTVTLA-TDPVPAYT 208
Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS-- 320
GC G + G GL +L+ T+++ ++ +YCL P+ L F+
Sbjct: 209 FGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCL-----PSFKTLNFSGHX 263
Query: 321 -----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
A+ D V P +N + + YYV L VG + V IPP + G + D
Sbjct: 264 DLXPVAQPRDQV-YPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFD 322
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSLHFG 433
GT TRL AY ++R+ F R K + +L FDTCY + PT++ F
Sbjct: 323 SGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTV----PIVAPTITFMF- 377
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVG 489
+G + LP N LI + C A AP +S L++I N+QQQ RV FD+ N+R+G
Sbjct: 378 SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLG 437
Query: 490 FTPNKC 495
C
Sbjct: 438 VARELC 443
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 158/351 (45%), Gaps = 26/351 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS--DPIFDPKTSSSYSPLPCAA 216
+F VG PP ++DTGS + W+QC PC C P+F+P SS++ C
Sbjct: 68 FFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDD 127
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDN 272
C+ C +N+C+Y+ Y G+ + G L E ++F GN+ + IA GCGH+N
Sbjct: 128 RFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHEN 187
Query: 273 -EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPL 331
E L G+LGLG SL Q+ + +YC+ D + G + D + P
Sbjct: 188 GEQLESEFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGEDADILGDPT 246
Query: 332 -IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
I + + YY+ L G SVG + + I P +F+ G+I+D GT T L AY
Sbjct: 247 PIEFETENGIYYMNLEGISVGDKQLNIEPVVFKR-RGSRTGVILDTGTLYTWLADIAY-- 303
Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS---VRVPTVSLHFGAGKALDLPAKNYLI 447
R+ + + L P F + G + + P V+ HF G L + A +
Sbjct: 304 -RELYNEIKSILDPKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFY 362
Query: 448 PVDSAGT----FCFAFAPTSSA------LSIIGNVQQQGTRVSFDLANNRV 488
P+ + T FC + PT+ + IG + QQ +++DL +
Sbjct: 363 PMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNI 413
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 128/401 (31%), Positives = 180/401 (44%), Gaps = 66/401 (16%)
Query: 153 SQGSGEYFSRIGVGTPPRQ-FSMVLDTGSDINWLQCRP--CTEC---YQQSDPIF----- 201
S +Y +G+ P Q ++ +DTGSD+ W C P C C + + P+
Sbjct: 13 SNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSH 72
Query: 202 -----DPKTSSSYSPLP----CAAPQC--KSLDVSACRANRCL-YQVAYGDGSFTVGDLV 249
P S+++S + CA +C +++ S C + C + AYGDGSF + L
Sbjct: 73 RVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSF-IAHLH 131
Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAY 303
+T+S +K GC H G+ G G G+LSL Q+ S +Y
Sbjct: 132 RDTLSMSQL-FLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSY 187
Query: 304 CLV----DRD---SPASGVL----EFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
CLV D++ P+ +L +++S R + V ++RN K FY VGLTG SVG
Sbjct: 188 CLVSHSFDKERVRKPSPLILGHYDDYSSERV-EFVYTSMLRNPKHSYFYCVGLTGISVGK 246
Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
+ + P L +D GDGG++VD GT T L YNS+ F R G + +
Sbjct: 247 RTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEK 306
Query: 413 T----CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV----DSA----GTFCFAF 459
T CY GL V VPTV+ HF G + LP NY D A G
Sbjct: 307 TGLGPCYFLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMN 364
Query: 460 APTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ LS I+GN QQQG V +DL N RVGF +C
Sbjct: 365 GGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 111/372 (29%), Positives = 168/372 (45%), Gaps = 24/372 (6%)
Query: 138 QILPEDFSTPVVSGASQ----GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC 193
Q +P D G SQ +G Y VGTPP+ + VLD SD W+QC C C
Sbjct: 72 QAVPADGGENGGGGQSQDPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATC 131
Query: 194 -----YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSF--T 244
S P F SS+ + CA C+ L C A+ C Y YG G+ T
Sbjct: 132 GADAPAATSAPPFYAFLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTT 191
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
G L + +F G+ GC EG G++GLG G LS Q++ +Y
Sbjct: 192 AGLLAVDAFAFATV-RADGVIFGCAVATEGDI---GGVIGLGRGELSPVSQLQIGRFSYY 247
Query: 305 LVDRDSPASG--VLEFNSA--RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
L D+ G +L + A R AV+ PL+ ++ + YYV L G V G+ + IP
Sbjct: 248 LAPDDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRG 307
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSG 419
F++ G GG+++ +T L AY +R + L+ G L D CY
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI-ELRAADGSELGLDLCYTSES 366
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTR 478
L + +VP+++L F G ++L NY + G C P+ + S++G++ Q GT
Sbjct: 367 LATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTH 426
Query: 479 VSFDLANNRVGF 490
+ +D++ +R+ F
Sbjct: 427 MIYDISGSRLVF 438
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 163/346 (47%), Gaps = 19/346 (5%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTP + M +DT SD+ W+ C C C S +F+ S++Y L C A Q
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQ 157
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
CK + C C + + YG GS +L +T++ + +V G + GC G +
Sbjct: 158 CKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLA-TDAVPGYSFGCIQKATGGSLP 215
Query: 279 S---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-APLIR 333
+ GL +LS T+ + ++ +YCL S SG L + PL++
Sbjct: 216 AQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLK 275
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
N + + Y+V L VG + V +PP F + + G I D GT TRL T AY ++RD
Sbjct: 276 NPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRD 335
Query: 394 SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
+F G + + FDTCY + PT++ F G + LP N LI +
Sbjct: 336 AFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGS 390
Query: 454 TFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T C A A +S L++I N+QQQ R+ +D+ N+R+G C
Sbjct: 391 TTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 436
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 167/372 (44%), Gaps = 62/372 (16%)
Query: 173 SMVLDTGSDINWLQCRPCTECYQQS--DPIFDPKTSSSYSPLPCAAPQCKSLD------- 223
+M +DT DI W+QCRPC + +FDP S S + +PC + C++L
Sbjct: 166 TMAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRALGNYGNGCS 225
Query: 224 ------------VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
S C Y+VAY DG + G +T+ ++ S GC H
Sbjct: 226 NNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFLNFRFGCSHG 285
Query: 272 NEGLFVG-SAGLLGLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA- 326
G F G ++G + LGGG +LS T + + +YC V + S ASG L A
Sbjct: 286 VRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYC-VPKPS-ASGFLSLGGAINDGDS 343
Query: 327 --------VTAPLIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
VT PL+RN ++ T+Y V L G V G+ + +PP +F GG ++D
Sbjct: 344 DSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAGRRLNVPPVVFS------GGTLMDS 397
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLK-----------PTSGVALFDTCYDFSGLRSVRV 425
+T+L AY +LR +F + P G + DTCYDF GL +V V
Sbjct: 398 SAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTSSTPAGGEMILDTCYDFEGLDNVTV 457
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDL 483
PTVSL F G +DL ++ C AF PT + L IGNVQQQ V +D+
Sbjct: 458 PTVSLVFFGGAVVDLDPTTAVMMEG-----CLAFVPTPADFDLGFIGNVQQQTHEVLYDV 512
Query: 484 ANNRVGFTPNKC 495
VGF C
Sbjct: 513 GARNVGFRRGAC 524
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 150/315 (47%), Gaps = 29/315 (9%)
Query: 206 SSSYSPLPCAAPQCK---SLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSF----G 256
SS++ + C P C+ + VSAC +C Y +YGD S T G + +T +F G
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61
Query: 257 NSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV 315
+V +A GCG N GLFV + +G+ G G G SL Q+K +YCL S V
Sbjct: 62 VPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKVGRFSYCLTLVTESKSSV 121
Query: 316 LEFNSARGGDAVTA---------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
+ + D + A P+I N + TFYY+ L G +VG + S+F + +
Sbjct: 122 VILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVFALKK 181
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL----AGNLKPTSGVALFDTCYDF-SGLR 421
G GG ++D GT++T L + L++ V + P G L C+ G +
Sbjct: 182 DGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEVGDRL---CFRRPKGGK 238
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVS 480
V VP + LH AG +DLP NY + +G C + + +IGN QQQ V
Sbjct: 239 QVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQNMHVV 297
Query: 481 FDLANNRVGFTPNKC 495
+D+ NN++ F P +C
Sbjct: 298 YDVENNKLLFAPAQC 312
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 121/389 (31%), Positives = 176/389 (45%), Gaps = 43/389 (11%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFD 202
S P+ +Q EY +G PP+Q + ++DTGS++ W QC C C+ Q +D
Sbjct: 74 SAPIHWNETQYIAEYL----IGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYD 129
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
P S + P+ C C + C C AYG G+ G L TE +FG+ S
Sbjct: 130 PSRSRTAKPVACNDTACLLGSETRCARDGKACAVLTAYGAGAIG-GFLGTEVFTFGHGQS 188
Query: 261 VKG---IALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
+ +A GC G G++G++GLG G LSL Q+ +YCL S A+
Sbjct: 189 SENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSYCLTPYFSDAAN 248
Query: 315 VLEF-------NSARGGDAVTAPLIRN---KKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
S G A + P ++N D+FYY+ LTG +VG + +P + F++
Sbjct: 249 TSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDL 308
Query: 365 DE---AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYD--F 417
E A GG ++D G+ T L AY +LRD VR G + P +G D C
Sbjct: 309 REVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVA 368
Query: 418 SGLRSVRVPTVSLHFGAGKA----LDLPAKNYLIPVDSAGTFCFAFA---PTSS----AL 466
G VP + LHFG+G + +P +NY PVD + F+ P S+
Sbjct: 369 PGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNET 428
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+IIGN QQ + +DL + F P C
Sbjct: 429 TIIGNYMQQDMHLLYDLGQGVLSFQPADC 457
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 111/347 (31%), Positives = 167/347 (48%), Gaps = 21/347 (6%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTP + M +DT SD+ W+ C C C S +F+ S++Y L C A Q
Sbjct: 36 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQ 92
Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
CK + C C + + YG GS +L +T++ + +V G + GC G +
Sbjct: 93 CKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLA-TDAVPGYSFGCIQKATGGSLP 150
Query: 279 S---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-APLIR 333
+ GL +LS T+ + ++ +YCL S SG L + PL++
Sbjct: 151 AQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLK 210
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
N + + Y+V L VG + V +PP F + + G I D GT TRL T AY ++RD
Sbjct: 211 NPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRD 270
Query: 394 SFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+F R+ NL TS + FDTCY + PT++ F G + LP N LI +
Sbjct: 271 AFRNRVGRNLTVTS-LGGFDTCYTV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAG 324
Query: 453 GTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T C A A +S L++I N+QQQ R+ +D+ N+R+G C
Sbjct: 325 STTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 371
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 120/383 (31%), Positives = 183/383 (47%), Gaps = 42/383 (10%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPK 204
P+ SGA G+G+YF R VGTP + F +V DTGSD+ W++CR P F
Sbjct: 2 PLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRAS 61
Query: 205 TSSSYSPLPCAAPQCKS---LDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG--- 256
S S++PL C++ C S ++ C A+ C Y Y DGS G + T+ +
Sbjct: 62 ESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSG 121
Query: 257 -----------NSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SL 301
++G+ LGC +G F S G+L LG +S + A
Sbjct: 122 SGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRF 181
Query: 302 AYCLVDRDSP--ASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
+YCLVD +P AS L F G A PL+ +++V FY V + V G+A+
Sbjct: 182 SYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALD 241
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTC 414
IP ++++ GG I+D GT++T L T AY ++ L G L VA+ F+ C
Sbjct: 242 IPADVWDVGRG--GGAILDSGTSLTVLATPAYRAV---VAALGGRLAALPRVAMDPFEYC 296
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNV 472
Y+++ + +P + + F L+ PAK+Y+I D+A G C + +S+IGN+
Sbjct: 297 YNWTA-GAPEIPKLEVSFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNI 353
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQ FDL + + F +C
Sbjct: 354 LQQEHLWEFDLRDRWLRFKHTRC 376
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 118/374 (31%), Positives = 173/374 (46%), Gaps = 57/374 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
+ VG PP+ SMVLDTGS+++WL C+ +F+P +SS+YSP+PC++P C++
Sbjct: 69 LAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICRTR 124
Query: 222 ---LDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD---- 271
L + A + + C ++Y D + G+L ET G S + G GC
Sbjct: 125 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCMDSGLSS 183
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR----GGDAV 327
N S GL+G+ G LS Q+ + +YC+ DS SG L A G
Sbjct: 184 NSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDS--SGFLLLGDASYSWLGPIQY 241
Query: 328 TAPLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
T ++++ + F Y V L G VG + + +P S+F D G G +VD GT T L
Sbjct: 242 TPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFL 301
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALF------DTCY--------DFSGLRSVRVPTVS 429
Y +L++ F+ ++ F D CY +FSGL P VS
Sbjct: 302 MGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL-----PMVS 356
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT------FCFAFAPTSSALSI----IGNVQQQGTRV 479
L F G + + + L V+ AG+ +CF F S L I IG+ QQ +
Sbjct: 357 LMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG-NSDLLGIEAFVIGHHHQQNVWM 414
Query: 480 SFDLANNRVGFTPN 493
FDLA +RVGF N
Sbjct: 415 EFDLAKSRVGFAGN 428
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 122/375 (32%), Positives = 179/375 (47%), Gaps = 58/375 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--K 220
+ VGTPP+ SMVLDTGS+++WL+C T+ +Q + FDP SSSYSP+PC++ C +
Sbjct: 89 LTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTT---FDPNRSSSYSPVPCSSLTCTDR 144
Query: 221 SLDV---SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
+ D ++C +N+ C ++Y D S + G+L ++T GNS + G GC N
Sbjct: 145 TRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNS-DMPGTIFGCMDSSFSTN 203
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
+ GL+G+ G LS Q+ +YC+ D D SGVL A
Sbjct: 204 TEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSD--FSGVLLLGDANFSWLMPLNYT 261
Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
++ PL +V Y V L G V + + +P S+F D G G +VD GT T
Sbjct: 262 PLIQISTPLPYFDRVA--YTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTF 319
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSV------------RVPTV 428
L Y++LR+ F L TS + L D Y F G + +PTV
Sbjct: 320 LLGPVYSALRNEF------LNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTV 373
Query: 429 SLHF-GAGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS----IIGNVQQQGTRVS 480
SL F GA + Y +P + G+ +CF F S L+ +IG+ QQ +
Sbjct: 374 SLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFG-NSDLLAVEAYVIGHHHQQNVWME 432
Query: 481 FDLANNRVGFTPNKC 495
FDL +R+GF +C
Sbjct: 433 FDLEKSRIGFAQVQC 447
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 114/336 (33%), Positives = 149/336 (44%), Gaps = 45/336 (13%)
Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
M +DT D+ W+QC PC ECY Q + +FDP+ S + + +PC + C L R R
Sbjct: 164 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELG----RYGR 219
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGML 290
L Q + + C H G F S +G + LGGG
Sbjct: 220 WLLQQP------------VPVLRRLRRRQGQPRGRTC-HAVRGNFSASTSGTMSLGGGRQ 266
Query: 291 SLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA----VTAPLIRNKKV-DTFYY 342
SL Q AT + +YC+ D S SG L G PL+RN + T Y
Sbjct: 267 SLLSQTAATFGNAFSYCVPDPSS--SGFLSLGGPADGGGAGRFARTPLVRNPSIIPTLYL 324
Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGN 401
V L G VGG+ + +PP +F GG ++D IT+L AY +LR +F +A
Sbjct: 325 VRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY 378
Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
+ G A DTCYDF SV VP VSL F G + L A ++ C AF P
Sbjct: 379 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVP 432
Query: 462 TSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T AL IGNVQQQ V +D+ VGF C
Sbjct: 433 TPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 109/312 (34%), Positives = 150/312 (48%), Gaps = 29/312 (9%)
Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTET 252
P FD TSS+ C + C+ L V++C + C+Y Y D S T G + +
Sbjct: 23 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82
Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL------ 305
+FG SV G+A GCG N G+F + G+ G G G LSL Q+K + ++C
Sbjct: 83 FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGL 142
Query: 306 ----VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
V D PA + + RG T PLI+N TFYY+ L G +VG + +P S
Sbjct: 143 KQSTVLLDLPAD---LYKNGRGAVQST-PLIQNSANPTFYYLSLKGITVGSTRLPVPESA 198
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGL 420
F + G GG I+D GT+IT L Q Y +RD F ++ + P + + TC+
Sbjct: 199 FALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQ 256
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGT 477
VP + LHF G +DLP +NY+ V D AG C A +IIGN QQQ
Sbjct: 257 AKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNM 314
Query: 478 RVSFDLANNRVG 489
V +DL N G
Sbjct: 315 HVLYDLQNMHRG 326
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 115/389 (29%), Positives = 168/389 (43%), Gaps = 58/389 (14%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSD----PIFDPKTSSS 208
G Y + +GTPP+ VLDTGS + W C C+ C + D P F PK SS+
Sbjct: 90 GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSST 149
Query: 209 YSPLPCAAPQCKSL--------------DVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
L C P+C + + C Y + YG GS T G L+ + ++
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLN 208
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSP 311
F +V +GC + +G+ G G G SL Q+ +YCLV D+P
Sbjct: 209 FPGK-TVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTP 264
Query: 312 ASGVLEFNSARGGDAVTA----------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
S L + GD T P N +YY+ L VGG+ V+IP +
Sbjct: 265 QSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTF 324
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-------LAGNLKPTSGVALFDTC 414
E G+GG IVD G+ T ++ YN + FV+ A + + SG++ C
Sbjct: 325 LEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLS---PC 381
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF-------AFAPTSSALS 467
++ SG+++V P ++ F G + P +NY V A C A P ++ +
Sbjct: 382 FNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPA 441
Query: 468 II-GNVQQQGTRVSFDLANNRVGFTPNKC 495
II GN QQQ + +DL N R GF P C
Sbjct: 442 IILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/305 (33%), Positives = 141/305 (46%), Gaps = 22/305 (7%)
Query: 212 LPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG------I 264
+ CA C + +C R + C Y+ YGDG+ TVG TE +F +SG +
Sbjct: 1 MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
GCG N G +G++G G LSL Q+ +YCL S L F S G
Sbjct: 61 GFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFGSLSDG 120
Query: 325 ---DAV----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
DA T PL+++ + TFYYV TG +VG + ++IP S F + G GG+IVD G
Sbjct: 121 VYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSG 180
Query: 378 TAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALF--DTCYDFSGLRSVRVPTVSL 430
TA+T L + +F +RL A P GV S + VP + L
Sbjct: 181 TALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVL 240
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
HF G LDLP +NY++ G C A + S IGN+ QQ RV +DL +
Sbjct: 241 HF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSI 299
Query: 491 TPNKC 495
P +C
Sbjct: 300 APARC 304
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 117/369 (31%), Positives = 166/369 (44%), Gaps = 34/369 (9%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S P+ SG + G Y R+ +GTP + MVLDT +D ++ C C S F P
Sbjct: 84 SAPIASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGC---SATTFYPN 140
Query: 205 TSSSYSPLPCAAPQC---KSLDVSACRANRCLYQVAYGDGSFT---VGD---LVTETV-- 253
S+S+ PL C+ PQC + L A + C + +Y +F+ V D L T+ +
Sbjct: 141 VSTSFVPLDCSVPQCGQVRGLSCPATGSGACSFNQSYAGSTFSATLVQDSLRLATDVIPS 200
Query: 254 -SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
SFG+ ++ G ++ + +LS + I + +YCL S
Sbjct: 201 YSFGSINAISGSSVPAQGLLGLGRGPLS--------LLSQSGAIYSGVFSYCLPSFKSYY 252
Query: 313 -SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
SG L+ ++ T PL+ N + YYV LT SVG V +P L + +
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFNPSTGA 312
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G I+D GT ITR YN++RD F + P S + FDTC F P ++L
Sbjct: 313 GTIIDSGTVITRFVEPIYNAVRDEFRKQVTG--PFSSLGAFDTC--FVKNYETLAPAITL 368
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
HF L LP +N LI S C A A +S L++I N QQQ RV FD NN
Sbjct: 369 HF-TDLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTVNN 427
Query: 487 RVGFTPNKC 495
+VG C
Sbjct: 428 KVGIARELC 436
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 125/352 (35%), Positives = 171/352 (48%), Gaps = 23/352 (6%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y R+ +GTP + MVLDT +D W+ C CT C + TSS+Y L C+
Sbjct: 95 GNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTF---STNTSSTYGSLDCSM 151
Query: 217 PQCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
QC + +C A + C++ +YG S LV +++ N + A GC +
Sbjct: 152 AQCTQVRGFSCPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVND-VIPNFAFGCINSIS 210
Query: 274 GLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT- 328
G V GLLGLG G LSL Q + + +YCL S SG L+ A ++
Sbjct: 211 GGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKSIRY 270
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
PL+RN + YYV LTG SVG V I P L + G I+D GT ITR Y
Sbjct: 271 TPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIY 330
Query: 389 NSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
++RD F + +AG P S + FDTC F+ P V+LHF G L LP +N LI
Sbjct: 331 TAIRDEFRKQVAG---PFSSLGAFDTC--FAATNEAVAPAVTLHF-TGLNLVLPMENSLI 384
Query: 448 PVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ C A A +S L++I N+QQQ R+ FD+ N+R+G C
Sbjct: 385 HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 172/410 (41%), Gaps = 55/410 (13%)
Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
H++K ++ + F +P+ + G Y + + GTP + ++ DTGS + W C
Sbjct: 58 HQIKTPKSNSV---FKSPL---SPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS 111
Query: 190 ---CTEC-YQQSDPI----FDPKTSSSYSPLPCAAPQCKSL---DV-SACRA-----NRC 232
C+EC + + DP F PK SSS + C P+C + DV S CR+ C
Sbjct: 112 RYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENC 171
Query: 233 L-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
Y V YG GS T G L++ET+ F + + +GC + +G+ G G
Sbjct: 172 TQTCPAYVVQYGSGS-TAGLLLSETLDFPDK-XIPNFVVGCSFLS---IHQPSGIAGFGR 226
Query: 288 GMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSA---RGGDAVTA----PLIRNKKV 337
G SL Q+ AYCL R DSP SG L +S G T P + N
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
+YY+ + VG QAV++P G+GG I+D G+ T + + F +
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346
Query: 398 LAGNLKPTSGVALFD---TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
N + V C+D S +SV+ P + F G LP NY V S+G
Sbjct: 347 QLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGV 406
Query: 455 FCFAFA---------PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C I+G QQQ V +DL N R+GF C
Sbjct: 407 ACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 172/410 (41%), Gaps = 55/410 (13%)
Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
H++K ++ + F +P+ + G Y + + GTP + ++ DTGS + W C
Sbjct: 58 HQIKTPKSNSV---FKSPL---SPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS 111
Query: 190 ---CTEC-YQQSDPI----FDPKTSSSYSPLPCAAPQCKSL---DV-SACRA-----NRC 232
C+EC + + DP F PK SSS + C P+C + DV S CR+ C
Sbjct: 112 RYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENC 171
Query: 233 L-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
Y V YG GS T G L++ET+ F + + +GC + +G+ G G
Sbjct: 172 TQTCPAYVVQYGSGS-TAGLLLSETLDFPDK-KIPNFVVGCSFLS---IHQPSGIAGFGR 226
Query: 288 GMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSA---RGGDAVTA----PLIRNKKV 337
G SL Q+ AYCL R DSP SG L +S G T P + N
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286
Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
+YY+ + VG QAV++P G+GG I+D G+ T + + F +
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346
Query: 398 LAGNLKPTSGVALFD---TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
N + V C+D S +SV+ P + F G LP NY V S+G
Sbjct: 347 QLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGV 406
Query: 455 FCFAFA---------PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C I+G QQQ V +DL N R+GF C
Sbjct: 407 ACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 129/429 (30%), Positives = 197/429 (45%), Gaps = 39/429 (9%)
Query: 82 LPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILP 141
+P++ K + ++++ +D RV ++ L ++ KP A
Sbjct: 46 IPIYGNCSPFKNYSTSWENIIIDMASKDPERV-VYLSSLDASL------RRKPISA---- 94
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
P+ SG + G G Y R+ +G+P + F MVLDT +D W+ C CT C S +
Sbjct: 95 ----APIASGQAFGIGSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTGCTGC-SSSSTYY 149
Query: 202 DPKTSSSY-SPLPCAAPQCK----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
P+ S++Y + C AP+C +L + C + +Y +F+ LV +++ G
Sbjct: 150 SPQASTTYGGAVACYAPRCAQARGALPCPYTGSKACTFNQSYAGSTFS-ATLVQDSLRLG 208
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGML---SLTKQIKATSLAYCLVD-RDSPA 312
++ A GC + G + + GLLGLG G L S + ++ + +YCL + S
Sbjct: 209 ID-TLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPSFQSSYF 267
Query: 313 SGVLEFN-SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
SG L+ + + T PL++N + + YYV LTG +VG V +P D G
Sbjct: 268 SGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLAFDPNKGSG 327
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
I+D GT ITR Y+++RD F ++ G G FDTC F P + L
Sbjct: 328 TILDSGTVITRFVGPVYSAIRDEFRNQVKGPFFSRGG---FDTC--FVKTYENLTPLIKL 382
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
F G + LP +N LI G C A A +S L++I N QQQ RV FD NN
Sbjct: 383 RF-TGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRVLFDTVNN 441
Query: 487 RVGFTPNKC 495
RVG C
Sbjct: 442 RVGIARELC 450
>gi|356537173|ref|XP_003537104.1| PREDICTED: uncharacterized protein LOC100817302 [Glycine max]
Length = 328
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/141 (47%), Positives = 92/141 (65%)
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
+ I L+ + + GD G ++D G +TRL T AY + RD+FV NL GV++F+TC
Sbjct: 188 LNISEDLYRVTDLGDEGAVMDTGITVTRLPTVAYGAFRDAFVAQTTNLPRAPGVSIFNTC 247
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQ 474
YD +G +VRVPTV +F G+ L + +N+LIP D GTF FAFA + SALSIIGN+QQ
Sbjct: 248 YDLNGFVTVRVPTVLFYFSGGQILTILTQNFLIPADDVGTFYFAFAASPSALSIIGNIQQ 307
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
+G ++S D AN +GF N C
Sbjct: 308 EGIQISVDGANGFLGFGRNVC 328
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 176/375 (46%), Gaps = 59/375 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
+ VG+PP+ SMVLDTGS+++WL C+ +F+P +SS+YSP+PC++P C++
Sbjct: 65 LAVGSPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICRTR 120
Query: 222 ---LDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC-----GH 270
L + A + + C ++Y D + G+L +T G S + G GC
Sbjct: 121 TRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIG-SVTRPGTLFGCMDSGLSS 179
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR----GGDA 326
D+E S GL+G+ G LS Q+ + +YC+ DS SG+L A G
Sbjct: 180 DSEE-DAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDS--SGILLLGDASYSWLGPIQ 236
Query: 327 VTAPLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
T +++ + F Y V L G VG + + +P S+F D G G +VD GT T
Sbjct: 237 YTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTF 296
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALF------DTCY--------DFSGLRSVRVPTV 428
L Y +L++ F+ ++ F D CY +F+GL P +
Sbjct: 297 LMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGL-----PVI 351
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGT------FCFAFAPTSSALSI----IGNVQQQGTR 478
SL F G + + + L V+ AG+ +CF F S L I IG+ QQ
Sbjct: 352 SLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG-NSDLLGIEAFVIGHHHQQNVW 409
Query: 479 VSFDLANNRVGFTPN 493
+ FDLA +RVGF N
Sbjct: 410 MEFDLAKSRVGFAGN 424
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 172/372 (46%), Gaps = 53/372 (14%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
+ VG PP+ SMVLDTGS+++WL C+ +F+P +SS+YSP+PC++P C++
Sbjct: 69 LAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICRTR 124
Query: 222 ---LDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD---- 271
L + A + + C ++Y D + G+L ET G S + G GC
Sbjct: 125 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCMDSGLSS 183
Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARGGDAVTA 329
N S GL+G+ G LS Q+ + +YC+ DS +L S G T
Sbjct: 184 NSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSVFLLLGDASYSWLGPIQYTP 243
Query: 330 PLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
++++ + F Y V L G VG + + +P S+F D G G +VD GT T L
Sbjct: 244 LVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMG 303
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALF------DTCY--------DFSGLRSVRVPTVSLH 431
Y +L++ F+ ++ F D CY +FSGL P VSL
Sbjct: 304 PVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL-----PMVSLM 358
Query: 432 FGAGKALDLPAKNYLIPVDSAGT------FCFAFAPTSSALSI----IGNVQQQGTRVSF 481
F G + + + L V+ AG+ +CF F S L I IG+ QQ + F
Sbjct: 359 F-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG-NSDLLGIEAFVIGHHHQQNVWMEF 416
Query: 482 DLANNRVGFTPN 493
DLA +RVGF N
Sbjct: 417 DLAKSRVGFAGN 428
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 124/374 (33%), Positives = 166/374 (44%), Gaps = 39/374 (10%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPK 204
VVS S EY + +G+PPR + DTGSD+ W++C+ T FDP
Sbjct: 90 VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149
Query: 205 TSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--- 260
SS+Y + C C++L + C + C Y AYGDGS T G L TET +F + GS
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRS 209
Query: 261 -----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVDRDSP 311
V G+ GC G F + GG + +T+ ATSL +YCLV
Sbjct: 210 PRQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269
Query: 312 ASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
AS L F + A + PL+ VDT+Y V L VG + V A
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVAG-DVDTYYTVVLDSVKVGNKTVA---------SAA 319
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVR--- 424
IIVD GT +T L + D R L P S L CY+ +G R V
Sbjct: 320 SSRIIVDSGTTLTFLDPSLLGPIVDELSRRI-TLPPVQSPDGLLQLCYNVAG-REVEAGE 377
Query: 425 -VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSF 481
+P ++L FG G A+ L +N + V GT C A T+ +SI+GN+ QQ V +
Sbjct: 378 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 436
Query: 482 DLANNRVGFTPNKC 495
DL V F C
Sbjct: 437 DLDAGTVTFAGADC 450
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 171/357 (47%), Gaps = 28/357 (7%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C +C + DP F P SS+Y P+ C
Sbjct: 74 NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKC- 132
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
P C D +C Y+ Y + S + G + + VSFGN +K GC +
Sbjct: 133 NPSCNCDD----EGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVET 188
Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
G L+ A G++GLG G LS+ Q + S + C D ++ + +
Sbjct: 189 GDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQISPPPNM 248
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
V + N +Y + L V G+ +++ P +F+ G ++D GT
Sbjct: 249 VFSH--SNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH----GTVLDSGTTYAYFPEA 302
Query: 387 AYNSLRDSFVRLAGNLK--PTSGVALFDTCYDFSGLR----SVRVPTVSLHFGAGKALDL 440
A+++L+D+ ++ +LK P D C+ +G S P V++ FG+G+ L L
Sbjct: 303 AFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSL 362
Query: 441 PAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+NYL +G +C F + +++G + + T V++D N+++GF C
Sbjct: 363 SPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNC 419
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 157/357 (43%), Gaps = 22/357 (6%)
Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
S PV SG Q Y R G+GTP +Q + LDT +D W C PC C S F P
Sbjct: 67 SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122
Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
+SSSY+ LPCA+ C A G+ L+ SG +
Sbjct: 123 SSSSYASLPCASDWCPLFRRPAVPGEPGRV------GAAADVRLLQAASRTPRSGVLA-- 174
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA-R 322
A CG +G + L LS T +YCL R SG L +A +
Sbjct: 175 ATRCGWARTPSPATRSGPMSL----LSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQ 230
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
+ PL+ N + YYV +TG SVG V+ P F D + G ++D GT ITR
Sbjct: 231 PRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITR 290
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
Y +LRD F R + + FDTC++ + + P V+LH G G L LP
Sbjct: 291 WTAPVYAALRDEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPM 350
Query: 443 KNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+N LI + C A A +S ++++ N+QQQ RV D+A +RVGF C
Sbjct: 351 ENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/406 (29%), Positives = 179/406 (44%), Gaps = 78/406 (19%)
Query: 108 RDSARVNTLITKL-QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVG 166
RD +RV+ + +K Q A N+ H ++ ED G + + G
Sbjct: 92 RDESRVSFINSKFNQYAPENLKDHT---PNNKLFDED-------------GNFLVDVAFG 135
Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
TPP+ F+++LDTGS I W QC+ CT
Sbjct: 136 TPPQNFTLILDTGSSITWTQCKACT----------------------------------- 160
Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGL 285
N Y + YGD S +VG+ +T++ S + G G +N+G F G G+LGL
Sbjct: 161 VENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGRNNKGDFGSGVDGMLGL 217
Query: 286 GGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK----KVD 338
G G LS Q + +YCL + DS S + + ++ + N +
Sbjct: 218 GQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQES 277
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
+Y+V L+ SVG + + IP S+F G I+D T ITRL +AY++L+ +F +
Sbjct: 278 GYYFVNLSDISVGNERLNIPSSVFASP-----GTIIDSRTVITRLPQRAYSALKAAFKKA 332
Query: 399 AGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
++G + DTCY+ SG + V +P + LHFG G + L N + D +
Sbjct: 333 MAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDES-R 391
Query: 455 FCFAFAPTSSA-----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C AFA S + L+IIGN QQ V +D+ R+GF N C
Sbjct: 392 LCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 122/408 (29%), Positives = 177/408 (43%), Gaps = 70/408 (17%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC----------TECYQQSDPIFDPK 204
G +Y + G+G PP+ V+DTGSD+ W QC C C+ Q+ P ++
Sbjct: 74 GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133
Query: 205 TSSSYSPLPC---------AAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVS 254
S + +PC AP+ + C+ +YG G +G L T+ +
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192
Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RD 309
F +S SV +A GC G G++G++GLG G LSL Q+ AT +YCL RD
Sbjct: 193 FPSSSSVT-LAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPYFRD 251
Query: 310 SPASGVLEFNSARGGD--------------AVTAPLIRNKK---VDTFYYVGLTGFSVGG 352
+ + L T P +N K TFYY+ L G + G
Sbjct: 252 TVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGN 311
Query: 353 QAVQIPPSLFEMDEAGD----GGIIVDCGTAITRLQTQAYNSLRDSFVRL---AGNLKPT 405
V +P F++ EA GG ++D G+ TRL A+ +L R +G+L P
Sbjct: 312 ATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPP 371
Query: 406 S---GVAL---FDTCYDFSGLRSVRVPTVSLHF----GAGKALDLPAKNYLIPVDSAGTF 455
G AL + D L + VP + L F G G+ L +PA+ Y V+ A T+
Sbjct: 372 PAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVE-ASTW 430
Query: 456 CFAFAPTSSA--------LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A ++S +IIGN QQ RV +DLAN + F P C
Sbjct: 431 CMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/369 (31%), Positives = 176/369 (47%), Gaps = 43/369 (11%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--K 220
+ VGTPP+ +MV+DTGS+++WL C ++ S F+P SSSYSP+PC++ C +
Sbjct: 77 LTVGTPPQNVTMVIDTGSELSWLHCN-TSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTDQ 135
Query: 221 SLDVS---ACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
+ D +C +N+ C ++Y D S + G+L T+T G+SG + + GC N
Sbjct: 136 TRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG-IPNVVFGCMDSIFSSN 194
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
+ GL+G+ G LS Q+ +YC+ + D SG+L A + APL
Sbjct: 195 SEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD--FSGLLLLGDANF--SWLAPLN 250
Query: 333 RNKKVD----------TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
++ Y V L G V + + IP S+FE D G G +VD GT T
Sbjct: 251 YTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQFTF 310
Query: 383 LQTQAYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDF--SGLRSVRVPTVSLHF-G 433
L AY +LRD F+ + AG+L+ D CY + R +P+V+L F G
Sbjct: 311 LLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVFRG 370
Query: 434 AGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
A + Y +P + G CF F S L +IG++ QQ + FDL +
Sbjct: 371 AEMTVTGDRILYRVPGERRGNDSIHCFTFG-NSDLLGVEAFVIGHLHQQNVWMEFDLKKS 429
Query: 487 RVGFTPNKC 495
R+G +C
Sbjct: 430 RIGLAEIRC 438
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 177/372 (47%), Gaps = 44/372 (11%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPI---FDPKTSSSYSPLPCAA 216
+ VGTPP+ +MVLDTGS+++WL C R + + + F P+ S++++ +PC +
Sbjct: 67 LAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGS 126
Query: 217 PQCKSLDVSA---C--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
QC S D+ A C + +C ++Y DGS + G L T+ + G + ++ A GC
Sbjct: 127 TQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPLRS-AFGCMST 185
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR------ 322
+D+ V +AGLLG+ G LS Q +YC+ DRD +GVL +
Sbjct: 186 AYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPL 243
Query: 323 GGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+ P + D Y V L G VGG+A+ IP S+ D G G +VD GT T
Sbjct: 244 NYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQFT 303
Query: 382 RLQTQAYNSLRDSFVRLAGNL-----KPTSGV-ALFDTCYDFSGLR---SVRVPTVSLHF 432
L AY++L+ F++ L P+ DTC+ R S R+P V+L F
Sbjct: 304 FLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPPVTLLF 363
Query: 433 -GAGKALDLPAKNYLIPVD---SAGTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDL 483
GA ++ Y +P + + G +C F P ++ +IG+ Q V +DL
Sbjct: 364 NGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTA--YVIGHHHQMNLWVEYDL 421
Query: 484 ANNRVGFTPNKC 495
RVG P KC
Sbjct: 422 ERGRVGLAPVKC 433
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 111/352 (31%), Positives = 156/352 (44%), Gaps = 26/352 (7%)
Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+ I +G PP +V+DTGSDI W+ C PCT C +FDP SS++SPL C P
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPL-CKTP-- 158
Query: 220 KSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCGHD-NE 273
D C R + + V Y D S G +TV F + + + GCGH+ +
Sbjct: 159 --CDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNIGQ 216
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--AVTAPL 331
G G+LGL G SL +I +YC+ D P + G D + P
Sbjct: 217 DTDPGHNGILGLNNGPDSLATKI-GQKFSYCIGDLADPYYNYHQLILGEGADLEGYSTPF 275
Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
+ + FYYV + G SVG + + I P FEM + GG+I+D G+ IT L + L
Sbjct: 276 ---EVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSVHRLL 332
Query: 392 RDSFVRLAGN--LKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIP 448
L G + T + + C+ S R V P V+ HF G L L + ++
Sbjct: 333 SKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDSGSFFNQ 392
Query: 449 VDSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++ FC P S S S+IG + QQ V +DL N V F C
Sbjct: 393 LND-NVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQRIDC 443
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 120/381 (31%), Positives = 173/381 (45%), Gaps = 47/381 (12%)
Query: 158 EYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
EY + +GTP P++ ++ LDTGSD+ W QC C C+ Q P FD S + +PC+
Sbjct: 99 EYLIHLSIGTPRPQRVALTLDTGSDLVWTQC-ACHVCFAQPFPTFDALASQTTLAVPCSD 157
Query: 217 PQCKS--LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGS-------V 261
P C S +S C N C Y Y D S T G +V +T +F GN+GS V
Sbjct: 158 PICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAV 217
Query: 262 KGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVD----RDSP----- 311
+ GCG N+G+F + +G+ G G +SL Q+K ++C R SP
Sbjct: 218 PNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFSHCFTAIADARTSPVFLGG 277
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
A G + G + P + + YY+ L G +VG + + F G G
Sbjct: 278 APGPDNLGAHATGPVQSTPFANSNG--SLYYLTLKGITVGKTRLPLNALAFAGKGTGSGS 335
Query: 372 I--IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT-- 427
I+D GT I L Y SLR +FV A ++ F RS +P
Sbjct: 336 GGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFEAARSASLPPEA 395
Query: 428 -------VSLHFGAGKALDLPAKNYLIPV----DSAGT-FCFAF-APTSSALSIIGNVQQ 474
V LH AG DLP ++Y++ + D +G+ C + S L+IIGN QQ
Sbjct: 396 PAPALPKVVLHV-AGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGNFQQ 454
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
Q V++DL N++ F P +C
Sbjct: 455 QNMHVAYDLEKNKLVFVPARC 475
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 161/332 (48%), Gaps = 27/332 (8%)
Query: 188 RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS--ACRANRCLYQVAYGDGSFTV 245
R EC + P F P +SS++S LPCA+ C+ L C A C+Y YG G FT
Sbjct: 83 RAVHECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTA 141
Query: 246 GDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL 305
G L TET+ G + S G+A GC +N G+ S+G++GLG LSL Q+ +YCL
Sbjct: 142 GYLATETLHVGGA-SFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCL 199
Query: 306 -VDRDSPASGVLEFNSARGGDAVTAP-LIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSL 361
D D+ S +L + A+ ++P ++ N ++ ++YYV LTG +VG + + +
Sbjct: 200 RSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVTSTT 259
Query: 362 FEMDEAGD----GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA----LFDT 413
F GG IVD GT +T L + Y ++ +F+ T+ V FD
Sbjct: 260 FGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDL 319
Query: 414 CYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNY--LIPVDSAG---TFCFAFAPTSSA 465
C+D + G V VPT+ L F G + ++Y ++ VDS G C P S
Sbjct: 320 CFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEK 379
Query: 466 L--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L SIIGNV Q V +DL F P C
Sbjct: 380 LSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 168/364 (46%), Gaps = 45/364 (12%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI---FDPKTSSSYSPLPCAAPQCKS 221
+GTPP+ MVLDTGS ++W+ C ++ P FDP SSS+ LPC P CK
Sbjct: 75 IGTPPQLQQMVLDTGSQVSWIHCDNKKGPQKKQPPTTSSFDPSLSSSFFALPCNHPLCKP 134
Query: 222 L--DVSA---CRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
D+S C ANR C Y +Y DG+ G+LV E ++ S + I LGC + ++
Sbjct: 135 QVPDISLPTDCDANRLCHYSFSYTDGTVVEGNLVRENIALSPSLTTPPIILGCANQSDD- 193
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCL-VDRDSPASGVLEFNSARGGDAVTAPLIRN 334
+ G+LG+ G LS Q K T +Y + V + P SG L G+ + R
Sbjct: 194 ---ARGILGMNLGRLSFPNQAKITKFSYFVPVKQTQPGSGSLYL-----GNNPNSSCFRY 245
Query: 335 KKVDTF---------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
K+ TF + + + G S+GG+ + IPPS+F+ D G G I+D G+
Sbjct: 246 VKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPSVFKPDTTGFGQTIIDSGSE 305
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTS----GVALFDTCYDFSGLRSVR-VPTVSLHFGA 434
+ + +AYN +R+ V+ G+ GVA D C+D R V + F
Sbjct: 306 FSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVA--DICFDGDATEIGRLVGDMVFEFEK 363
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQ---QQGTRVSFDLANNRVGFT 491
G + +P + LI VD G CF + QQ V FDLA +RVGF
Sbjct: 364 GVEIVIPKERVLIEVD-GGVHCFGIGRAEGLGGGGNIIGNFYQQNLWVEFDLAKHRVGFR 422
Query: 492 PNKC 495
C
Sbjct: 423 GANC 426
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 122/421 (28%), Positives = 177/421 (42%), Gaps = 71/421 (16%)
Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL---- 185
H PA A + P + G Y +GTPP+ ++LDTGS + W+
Sbjct: 82 HPSVPATAALYPHSY------------GGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTS 129
Query: 186 --QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---------- 233
+CR C+ + P+F PK SSS + C P C+ + +A A +C
Sbjct: 130 SYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAAN 189
Query: 234 -----------YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
Y V YG GS T G L+ +T+ +V G LGC + +GL
Sbjct: 190 CPAAASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVPGFVLGC--SLVSVHQPPSGL 245
Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDR----DSPASGVLEFNSARGGDAVT-APLIRNKKV 337
G G G S+ Q+ +YCL+ R ++ SG L GG+ + PL+++
Sbjct: 246 AGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAG 305
Query: 338 D-----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
D +YY+ L G +VGG+AV++P F + AG GG IVD GT T L + +
Sbjct: 306 DKLPYGVYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVA 365
Query: 393 DSFVRLAGNLKPTS-----GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
D+ V G S G+ L G RS+ +P +S HF G + LP +NY +
Sbjct: 366 DAVVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV 425
Query: 448 PVDSAGT--FCFAF-----------APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
C A S I+G+ QQQ V +DL R+GF
Sbjct: 426 VAGRGAVEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQS 485
Query: 495 C 495
C
Sbjct: 486 C 486
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 166/397 (41%), Gaps = 58/397 (14%)
Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
VS + G Y + GTPP+ S + DTGS + W C C + S P DP T S
Sbjct: 122 VSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISK 181
Query: 209 YSP--------LPCAAPQCKSLD----VSACR-----ANRCL-----YQVAYGDGSFTVG 246
+ P + C P+C + S CR + +C Y + YG G+ T G
Sbjct: 182 FVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAG 240
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV 306
L++ET+ N V +GC + AG+ G G G SL Q++ ++CLV
Sbjct: 241 ILLSETLDLENK-RVPDFLVGCSVMSVH---QPAGIAGFGRGPESLPSQMRLKRFSHCLV 296
Query: 307 DR---DSPASGVLEFNSARGGDA------VTAPLIRNKKVDT-----FYYVGLTGFSVGG 352
R DSP S L +S D + AP N V +YY+ L +GG
Sbjct: 297 SRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGG 356
Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR------LAGNLKPTS 406
+ V+ P D G+GG I+D G+ T L + ++ D + A +++ S
Sbjct: 357 KPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQS 416
Query: 407 GVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA 465
G+ C++ S P V L F G L L A+NYL V G C +
Sbjct: 417 GL---RPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAV 473
Query: 466 LS-------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ I+G QQQ V +DLA R+GF KC
Sbjct: 474 VGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 116/386 (30%), Positives = 165/386 (42%), Gaps = 55/386 (14%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-----YQQSDPIFDPKTSSS 208
G Y + GTPP+ V+DTGS + W C C+EC + P F PK SSS
Sbjct: 81 GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSS 140
Query: 209 YSPLPCAAPQCKSL----DVSACR-----ANRCL-----YQVAYGDGSFTVGDLVTETVS 254
+ C P+C + S C+ A C Y + YG GS T G L++ET+
Sbjct: 141 SKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLD 199
Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSP 311
F N ++ +GC + G+ G G SL Q+ +YCLV D+P
Sbjct: 200 FPNKKTIPDFLVGCSIFS---IKQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTP 256
Query: 312 ASGVLEFNSARG-GDAVTA-----PLIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFE 363
S L ++ G G TA P ++N +YYV L +G V++P
Sbjct: 257 TSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLV 316
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR------LAGNLKPTSGVALFDTCYDF 417
G+GG IVD GT T ++ Y + F + +A ++ +G+ CY+
Sbjct: 317 PGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLR---PCYNI 373
Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------II 469
SG +S+ VP + F G + LP NY VDS G C + A I+
Sbjct: 374 SGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDS-GVICLTIVSDNVAGPGLGGGPAIIL 432
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
GN QQ+ V FDL N + GF C
Sbjct: 433 GNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 156/385 (40%), Gaps = 53/385 (13%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-----YQQSDPIFDPKTSSS 208
G Y + GTPP+ V+DTGS + W C C+ C P F PK SSS
Sbjct: 90 GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSS 149
Query: 209 YSPLPCAAPQCKSL--------------DVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
+ + C +C L C + Y + YG GS T G L++ET+
Sbjct: 150 SNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETLD 208
Query: 255 FGNSGSVKGIALGCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---D 309
F + ++ G +GC LF G+ G G SL Q+ +YCLV D
Sbjct: 209 FPHKKTIPGFLVGCS-----LFSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDD 263
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDT--------FYYVGLTGFSVGGQAVQIPPSL 361
+PAS L ++ G D P + +YYV L +G V++P
Sbjct: 264 TPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKF 323
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYDFS 418
G+GG IVD GT T ++ Y + F + + + V C++ S
Sbjct: 324 LVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNIS 383
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------IIG 470
G +SV VP HF G + LP NY VDS G C + + S I+G
Sbjct: 384 GEKSVSVPEFIFHFKGGAKMALPLANYFSFVDS-GVICLTIVSDNMSGSGIGGGPAIILG 442
Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
N QQ+ V FDL N R GF C
Sbjct: 443 NYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 88/268 (32%), Positives = 123/268 (45%), Gaps = 48/268 (17%)
Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
C Y + YGDGSFT G+L E + FG + VK GCG +N+GLF G +GL+GLG LS
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFG-TILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 191
Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
L Q N ++ FY++ LTG S+G
Sbjct: 192 LISQTS-----------------------------------ENPQLYNFYFINLTGISIG 216
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
G A+Q P G I+VD GT ITRL Y +L+ F++ P ++
Sbjct: 217 GVALQAP-------SVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSIL 269
Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFA--PTSSALS 467
DTC++ S + V +PT+ +HF L D+ Y + D A C A A ++
Sbjct: 270 DTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD-ASQVCLALASLEYQDEVA 328
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I+GN QQ+ RV +D +VGF C
Sbjct: 329 ILGNYQQKNLRVIYDTKETKVGFALETC 356
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 165/358 (46%), Gaps = 38/358 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI-FDPKTSSSYSPLPCAAPQCK--- 220
+GTPP+ MVLDTGS ++W+QC+ ++ P FDP SSS+S LPC CK
Sbjct: 84 IGTPPQTQQMVLDTGSQLSWIQCK----VPPKTPPTAFDPLLSSSFSVLPCNHSLCKPRV 139
Query: 221 ---SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
+L S C NR C Y Y DG++ G+LV E +F +S + + LGC D+
Sbjct: 140 PDYTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLILGCATDSSD-- 196
Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA------SGVLEFNSARGGDAVTAP 330
+ G+LG+ G LS + K + +YC+ R S + S L N + G
Sbjct: 197 --TQGILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAGFKYVNL 254
Query: 331 LI-----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
+ R +D Y + + G + G+ + I S F D +G G ++D GT T L
Sbjct: 255 MTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTWFTFLV 314
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYDFSGLRSVR-VPTVSLHFGAGKALDL 440
+AY+ +++ V+LAG K G D C+D + R + ++ F G + +
Sbjct: 315 DEAYSKVKEEIVKLAGP-KLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGVEIVV 373
Query: 441 PAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ L V G C + A +IIGN QQ V FDL RVGF C
Sbjct: 374 EREKMLADV-GGGVQCLGIGRSDLLGVASNIIGNFHQQDLWVEFDLVGRRVGFGRTDC 430
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/362 (30%), Positives = 153/362 (42%), Gaps = 78/362 (21%)
Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
P D + V+SG G Y I +GTPP + DTGSD+ W QC PC +CY+Q +P+
Sbjct: 15 PNDIQSNVISGG----GSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPL 70
Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
FDPK S +Y L + + +FT+G TE G+ S
Sbjct: 71 FDPKKSKTYKTLGYLSSE-----------------------TFTIGS--TE----GDPAS 101
Query: 261 VKGIALGCGHDNEGLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV--DRDSPAS 313
G+A GCGH N G F G ++ L+ ++ +YCLV DS AS
Sbjct: 102 FPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDSTAS 160
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
+ F G AV V G P + A + II
Sbjct: 161 SKINF----GKSAV----------------------VSGSGTSSPAA------AEESNII 188
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT +T L Y + + ++ G T F CY SG++ + +PT++ HF
Sbjct: 189 IDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHF- 245
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G + LP N + CF+ P SS L+I GN+ Q V +DL NN+V F P
Sbjct: 246 IGADVQLPPLNTFVQAQE-DLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNKVSFKPT 303
Query: 494 KC 495
C
Sbjct: 304 DC 305
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 131/379 (34%), Positives = 181/379 (47%), Gaps = 60/379 (15%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-----PCTECYQQSDP-----IFDPKTSS 207
EY + +GTPP + + DTGSD+ WL C P + +D FDP S+
Sbjct: 99 EYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKST 158
Query: 208 SYSPLPCAAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--- 263
++ + C + C L ++C A+ +C Y +YGDGS T G L TET +F ++ +G
Sbjct: 159 TFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDGT 218
Query: 264 ------IALGCGHDNEGLFVGSA---GLLGLGGGMLSLTKQIKA-TSL----AYCLVDRD 309
+ GC FVGS+ GL+GLGGG LSL Q+ A TSL +YCLV
Sbjct: 219 TTRVANVNFGCST----TFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYCLVPYS 274
Query: 310 SPASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
AS L F + AVT PLI + +V +Y V L VG + + P D
Sbjct: 275 VKASSALNFGPRAAVTDPGAVTTPLIPS-QVKAYYIVELRSVKVGNKTFEAP------DR 327
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLK---PTSGVALFDTCYDFSGLR- 421
+ +IVD GT +T L +L D V+ L G +K S L C+D SG+R
Sbjct: 328 S---PLIVDSGTTLTFLP----EALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVRE 380
Query: 422 ---SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQG 476
+ +P V++ G G A+ L A+N + V GT C A + S SIIGN+ QQ
Sbjct: 381 GQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQE-GTLCLAVSAMSEQFPASIIGNIAQQN 439
Query: 477 TRVSFDLANNRVGFTPNKC 495
V +DL V F P C
Sbjct: 440 MHVGYDLDKGTVTFAPAAC 458
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 115/358 (32%), Positives = 169/358 (47%), Gaps = 35/358 (9%)
Query: 162 RIGVGTPPRQ-FSMVLDTGSDINWLQCRPCTECYQQSDP---IFDPKTSSSYSPLPCAAP 217
I VGTP Q S ++D S W QC PC P F P S+++SPLPC++
Sbjct: 91 NITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSSD 150
Query: 218 QCKSLDVSACRAN----------RC-LYQVAYG-DGSFTVGDLVTETVSFGNSGSVKGIA 265
C + C RC Y + YG + T G L T+T +FG + +V G+
Sbjct: 151 MCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGAT-AVPGVV 209
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPASGVLEFNSA 321
GC + G F G++G++G+G G LSL Q++ +Y L+ D A V+ F
Sbjct: 210 FGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRF--- 266
Query: 322 RGGDAV-------TAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGII 373
G DAV + PL+ + FYYV LTG V G + IP F++ G GG+I
Sbjct: 267 -GDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVI 325
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHF 432
+ T +T L+ AY+ +R + G AL D CY+ S + V+VP ++L F
Sbjct: 326 LSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVF 385
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
G +DL A NY + G C P+ S++G + Q GT + +D+ R+ F
Sbjct: 386 DGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAGRLTF 442
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 115/357 (32%), Positives = 169/357 (47%), Gaps = 35/357 (9%)
Query: 163 IGVGTPPRQ-FSMVLDTGSDINWLQCRPCTECYQQSDP---IFDPKTSSSYSPLPCAAPQ 218
I VGTP Q S ++D S W QC PC P F P S+++SPLPC++
Sbjct: 92 ITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSSDM 151
Query: 219 CKSLDVSACRAN----------RC-LYQVAYG-DGSFTVGDLVTETVSFGNSGSVKGIAL 266
C + C RC Y + YG + T G L T+T +FG + +V G+
Sbjct: 152 CLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGAT-AVPGVVF 210
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPASGVLEFNSAR 322
GC + G F G++G++G+G G LSL Q++ +Y L+ D A V+ F
Sbjct: 211 GCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRF---- 266
Query: 323 GGDAV-------TAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIV 374
G DAV + PL+ + FYYV LTG V G + IP F++ G GG+I+
Sbjct: 267 GDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVIL 326
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFG 433
T +T L+ AY+ +R + G AL D CY+ S + V+VP ++L F
Sbjct: 327 SSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFD 386
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
G +DL A NY + G C P+ S++G + Q GT + +D+ R+ F
Sbjct: 387 GGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAGRLTF 442
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 169/373 (45%), Gaps = 54/373 (14%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IFDPKTSSSYSPLPCAAPQ 218
+ +GTPP+ +MVLDTGS+++WL+C+ +P IF+P S +Y+ +PC++
Sbjct: 71 LTIGTPPQNITMVLDTGSELSWLRCK--------KEPNFTSIFNPLASKTYTKIPCSSQT 122
Query: 219 CK------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---- 268
CK +L V+ A C + ++Y D S G L ET FG S + GC
Sbjct: 123 CKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFG-SLTRPATVFGCMDSG 181
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--- 325
N + GL+G+ G LS Q+ +YC+ DS +G L AR
Sbjct: 182 SSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISGLDS--TGFLLLGEARYSWLKP 239
Query: 326 -------AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
++ PL +V Y V L G V + + +P S+F D G G +VD GT
Sbjct: 240 LNYTPLVQISTPLPYFDRVA--YSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGT 297
Query: 379 AITRLQTQAYNSLRDSF-VRLAGNLKPTSG-----VALFDTCYDFSGLRSV--RVPTVSL 430
T L Y++LR F ++ AG L+ + D CY S +P V L
Sbjct: 298 QFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKL 357
Query: 431 HF-GAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFD 482
F GA ++ Y +P + G +CF F S L I IG+ QQQ + +D
Sbjct: 358 MFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFG-NSDELGISSFLIGHHQQQNVWMEYD 416
Query: 483 LANNRVGFTPNKC 495
L N+R+GF +C
Sbjct: 417 LENSRIGFAELRC 429
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 170/364 (46%), Gaps = 45/364 (12%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD- 223
VG+PP+Q +MVLDTGS+++WL C+ +F+P +SSSYSP+PC++P C++
Sbjct: 1006 VGSPPQQVTMVLDTGSELSWLHCKKSPNL----TSVFNPLSSSSYSPIPCSSPICRTRTR 1061
Query: 224 -----VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----NEG 274
V+ C V+Y D S G+L ++ G+S ++ G GC N
Sbjct: 1062 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-ALPGTLFGCMDSGFSSNSE 1120
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS---ARGGDAVTAPL 331
+ GL+G+ G LS Q+ +YC+ RDS SGVL F + G+ PL
Sbjct: 1121 EDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS--SGVLLFGDLHLSWLGNLTYTPL 1178
Query: 332 IR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
++ + + F Y V L G VG + + +P S+F D G G +VD GT T L
Sbjct: 1179 VQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGP 1238
Query: 387 AYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDF-SGLRSVRVPTVSLHF-GAGKAL 438
Y +LR+ F+ + G L P D CY +G + +P+VSL F GA +
Sbjct: 1239 VYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFRGAEMVV 1298
Query: 439 DLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFDLANNRVGFT 491
Y +P G +C F S L I IG+ QQ + FDL V F
Sbjct: 1299 GGEVLLYRVPEMMKGNEWVYCLTFG-NSDLLGIEAFVIGHHHQQNVWMEFDL----VAFA 1353
Query: 492 PNKC 495
+ C
Sbjct: 1354 ADLC 1357
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 161/375 (42%), Gaps = 48/375 (12%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTEC-YQQSD----PIFDPKTSSSYSPLPC 214
+ GTPP++ S ++DTGSD+ W C CT C + +D PIFDPK SSS L C
Sbjct: 82 LSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDC 141
Query: 215 AAPQCKS-------LDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
P+C S L C N C Y YG G+ + G + E + F ++
Sbjct: 142 RNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGA-SSGYFLLENLKFPRK-TI 199
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASG--VL 316
+ LGC + + S L G G M SL Q+ AYCL D+ SG +L
Sbjct: 200 RNFLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLNSHDYDDTRNSGKLIL 258
Query: 317 EFNSARGGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
++ + P +++ FYY +G+ +G + ++IP G G+I+D
Sbjct: 259 DYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIID 318
Query: 376 CGTAITRLQTQ-----AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G T N L+ + +L+ + L CY+F+G +S+++P +
Sbjct: 319 SGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGL-TPCYNFTGHKSIKIPPLIY 377
Query: 431 HFGAGKALDLPAKNYL----------IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
F G + +P KNY +D+ GT P S I+GN Q V
Sbjct: 378 QFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSI--ILGNSQHVDYYVE 435
Query: 481 FDLANNRVGFTPNKC 495
+DL N+R GF C
Sbjct: 436 YDLKNDRFGFRRQTC 450
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 166/368 (45%), Gaps = 39/368 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF++I +G+PP+++ + +DTGSDI W+ C PC +C ++D ++D K SS+
Sbjct: 75 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKN 134
Query: 212 LPCAAPQCK-SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKG----- 263
+ C C + C A + C Y V YGDGS + GD V + ++ +G+++
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194
Query: 264 -IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
+ GCG + G + G++G G S+ Q+ A ++CL + +
Sbjct: 195 EVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMN--GG 252
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
G+ T PL+ N+ Y V L G V G+ + +PPSL + GDGG I
Sbjct: 253 GIFAIGEVESPVVKTTPLVPNQ---VHYNVILKGMDVDGEPIDLPPSLASTN--GDGGTI 307
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT + L YNSL + A V C+ F+ P V+LHF
Sbjct: 308 IDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFE 365
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNR 487
L + +YL + +CF + + + ++G++ V +DL N
Sbjct: 366 DSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 424
Query: 488 VGFTPNKC 495
+G+ + C
Sbjct: 425 IGWADHNC 432
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 159/356 (44%), Gaps = 40/356 (11%)
Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
+ +G P +V+DTGSDI W+ C PCT C +FDP SS++SPL C P
Sbjct: 104 NLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL-CKTP---- 158
Query: 222 LDVSACRANRCLYQVAYGD-----GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
C+ + + ++Y D G+F LV ET G S + + +GCGH N G
Sbjct: 159 CGFKGCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTS-QISDVIIGCGH-NIGFN 216
Query: 277 V--GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--AVTAPLI 332
G G+LGL G SL QI +YC+ + P + G D + P
Sbjct: 217 SDPGYNGILGLNNGPNSLATQI-GRKFSYCIGNLADPYYNYNQLRLGEGADLEGYSTPF- 274
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+ FYYV + G SVG + + I FEM G GG+I+D GT IT L A+ L
Sbjct: 275 --EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSAHKLLY 332
Query: 393 DSFVRLAGNLKPTSGVALFDT-----CYDFSGLRS---VRVPTVSLHFGAGKALDLPAKN 444
+ L LK + +F+ CY G+ S V P V+ HF G L L +
Sbjct: 333 NEVRNL---LKWSFRQVIFENAPWKLCY--YGIISRDLVGFPVVTFHFVDGADLALDTGS 387
Query: 445 YLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ D FC +P T+ + S+IG + QQ V +DL N V F C
Sbjct: 388 FFSQRDD--IFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 126/239 (52%), Gaps = 34/239 (14%)
Query: 147 PVVSGASQGSGEYFSRIGVG----TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
P+ SG + Y + I +G +P ++++DTGSD+ W+QC+PC+ CY Q DP+FD
Sbjct: 80 PLTSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFD 139
Query: 203 PKTSSSYSPLPCAAPQCK-----------SLDVSACRANRCLYQVAYGDGSFTVGDLVTE 251
P S++Y+ + C A C S + + +C Y +AYGDGSF+ G L T+
Sbjct: 140 PAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATD 199
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
TV+ G + S+ G GCG N GLF G+AGL+GLG LSL Q + +YCL
Sbjct: 200 TVALGGA-SLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAA 258
Query: 309 DS-PASGVLEFNSARGGDAV------TAP-----LIRNKKVDTFYYVGLTGFSVGGQAV 355
S ASG L GGD T P +I + FY++ +TG +VGG A+
Sbjct: 259 TSGDASGSLSLG---GGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL 314
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 161/356 (45%), Gaps = 33/356 (9%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y + + +GTPP+ S ++ + W QC PC C++Q P+F+ SS+Y P PC
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87
Query: 219 CKSLDVSACRAN-RCLYQV--AYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD-NEG 274
C+S+ S C + C Y+V +GD S G T+T + G + +A GC D N
Sbjct: 88 CESVPASTCSGDGVCSYEVETMFGDTSGIGG---TDTFAIGT--ATASLAFGCAMDSNIK 142
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSAR---GGDAVTA 329
+G++G++GLG SL Q+ AT+ +YCL + S +L SA+ G A T
Sbjct: 143 QLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAGGKSAATT 202
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ + Y + L G G + PP+ ++VD ++ L A+
Sbjct: 203 PLVNTSDDSSDYMIHLEGIKFGDVIIAPPPN--------GSVVLVDTIFGVSFLVDAAFQ 254
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGL-----RSVRVPTVSLHFGAGKALDLPAKN 444
+++ + G + FD C+ + S+ +P V L F AL +P
Sbjct: 255 AIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSK 314
Query: 445 YLIPVDSAGTFCFAFAPT-----SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
Y+ + GT C A + ++ LSI+G + Q+ FDL + F P C
Sbjct: 315 YMYDAGN-GTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADC 369
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 170/376 (45%), Gaps = 43/376 (11%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
++GSG + + +G+PP +V+DTGS + W+QC PC C+QQS FDP S S+ L
Sbjct: 99 NRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTL 157
Query: 213 PCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSV-------- 261
C P ++ C R N+ Y++ Y G + G L E++ F + G V
Sbjct: 158 GCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAIST 217
Query: 262 -------KGIALGCGHDNEGLFVGSA--GLLGLGGG-MLSLTKQIKATSLAYCLVDRDSP 311
I GCGH N A G+ GLG +++ Q+ +YC+ D ++P
Sbjct: 218 QISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQL-GNKFSYCIGDINNP 276
Query: 312 ASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
VL S GD+ + YYV L SVG + ++I P+ F++
Sbjct: 277 LYTHNHLVLGQGSYIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNAFKISSD 331
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRL-AGNLKPTSGVALFD-TCYDFSGLRS--- 422
G GG+++D G T+L + L D V L G L+ F+ C F G+ S
Sbjct: 332 GSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLC--FKGVVSRDL 389
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRV 479
V P V+ HF G L L + + L FC A P++S LS+IG + QQ V
Sbjct: 390 VGFPAVTFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNV 448
Query: 480 SFDLANNRVGFTPNKC 495
FDL +V F C
Sbjct: 449 GFDLEQMKVFFRRIDC 464
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 170/368 (46%), Gaps = 50/368 (13%)
Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS---- 221
GTP + +MVLDTGS+++WL C+ + IF+P S +Y+ +PC++P C++
Sbjct: 74 GTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPCSSPTCETRTRD 129
Query: 222 --LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
L VS A C + ++Y D S G+L ET G SV G A G + G S
Sbjct: 130 LPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVG---SVTGPATVFGCMDSGFSSNS 186
Query: 280 ------AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD-------- 325
GL+G+ G LS Q+ +YC+ DRDS SGVL A
Sbjct: 187 EEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISDRDS--SGVLLLGEASFSWLKPLNYTP 244
Query: 326 --AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
++ PL +V Y V L G V + + +P S+F D G G +VD GT T L
Sbjct: 245 LVEMSTPLPYFDRVA--YSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFL 302
Query: 384 QTQAYNSLRDSF-VRLAGNLKPTSG-----VALFDTCYDFSGLRSV--RVPTVSLHF-GA 434
Y++L+ F ++ G L+ + D CY R+ +P V+L F GA
Sbjct: 303 LGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFRGA 362
Query: 435 GKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFDLANNR 487
++ Y +P + G +CF F S +L I IG+ QQQ + +DL +R
Sbjct: 363 EMSVSGQRLLYRVPGEVRGKDSVWCFTFG-NSDSLGIESFVIGHHQQQNVWMEYDLEKSR 421
Query: 488 VGFTPNKC 495
+GF +C
Sbjct: 422 IGFAEVRC 429
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 166/368 (45%), Gaps = 39/368 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF++I +G+PP+++ + +DTGSDI W+ C PC +C ++D ++D KTSS+
Sbjct: 76 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 135
Query: 212 LPCAAPQCK-SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGS---VK 262
+ C C + C A + C Y V YGDGS + GD + + ++ GN + +
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195
Query: 263 GIALGCGHDNEGLF--VGSA--GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
+ GCG + G SA G++G G S+ Q+ A ++CL + +
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN--GG 253
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
G+ T P++ N+ Y V L G V G + +PPSL + GDGG I
Sbjct: 254 GIFAVGEVESPVVKTTPIVPNQ---VHYNVILKGMDVDGDPIDLPPSLASTN--GDGGTI 308
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT + L YNSL + A V C+ F+ P V+LHF
Sbjct: 309 IDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFE 366
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNR 487
L + +YL + +CF + + + ++G++ V +DL N
Sbjct: 367 DSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 425
Query: 488 VGFTPNKC 495
+G+ + C
Sbjct: 426 IGWADHNC 433
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 174/369 (47%), Gaps = 52/369 (14%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP+QF++++DTGS + ++ C C +C + DP FDP++SS+Y P+ C
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDN 272
C S V +C+Y+ Y + S + G L + +SFGN + + GC +
Sbjct: 140 IDCICDSDGV------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193
Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRD-----------SPASG 314
G LF A G++GLG G LSL Q+ S + C D SP S
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSD 253
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++ S D V +P +Y V L V G+ + + +F+ G G ++
Sbjct: 254 MIFTYS----DPVRSP---------YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVL 296
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTV 428
D GT L +A+++ +D+ + +LK G D C+ +G S + PTV
Sbjct: 297 DSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTV 356
Query: 429 SLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANN 486
+ F G+ L L +NY G +C F + +++G + + T V +D AN+
Sbjct: 357 DMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANS 416
Query: 487 RVGFTPNKC 495
++GF C
Sbjct: 417 KIGFWKTNC 425
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 166/368 (45%), Gaps = 39/368 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF++I +G+PP+++ + +DTGSDI W+ C PC +C ++D ++D KTSS+
Sbjct: 72 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 131
Query: 212 LPCAAPQCK-SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGS---VK 262
+ C C + C A + C Y V YGDGS + GD + + ++ GN + +
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191
Query: 263 GIALGCGHDNEGLF--VGSA--GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
+ GCG + G SA G++G G S+ Q+ A ++CL + +
Sbjct: 192 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN--GG 249
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
G+ T P++ N+ Y V L G V G + +PPSL + GDGG I
Sbjct: 250 GIFAVGEVESPVVKTTPIVPNQ---VHYNVILKGMDVDGDPIDLPPSLASTN--GDGGTI 304
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT + L YNSL + A V C+ F+ P V+LHF
Sbjct: 305 IDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFE 362
Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNR 487
L + +YL + +CF + + + ++G++ V +DL N
Sbjct: 363 DSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 421
Query: 488 VGFTPNKC 495
+G+ + C
Sbjct: 422 IGWADHNC 429
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 174/369 (47%), Gaps = 52/369 (14%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP+QF++++DTGS + ++ C C +C + DP FDP++SS+Y P+ C
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDN 272
C S V +C+Y+ Y + S + G L + +SFGN + + GC +
Sbjct: 140 IDCICDSDGV------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193
Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRD-----------SPASG 314
G LF A G++GLG G LSL Q+ S + C D SP S
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSD 253
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
++ S D V +P +Y V L V G+ + + +F+ G G ++
Sbjct: 254 MIFTYS----DPVRSP---------YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVL 296
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTV 428
D GT L +A+++ +D+ + +LK G D C+ +G S + PTV
Sbjct: 297 DSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTV 356
Query: 429 SLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANN 486
+ F G+ L L +NY G +C F + +++G + + T V +D AN+
Sbjct: 357 DMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANS 416
Query: 487 RVGFTPNKC 495
++GF C
Sbjct: 417 KIGFWKTNC 425
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 173/368 (47%), Gaps = 35/368 (9%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPP++F++ +DTGSDI W+ C C+ C Q S FD SS+ +
Sbjct: 76 GLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAAL 135
Query: 212 LPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
+PC+ P C S A R N+C Y YGDGS T G V++ + F G +V
Sbjct: 136 IPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVN 195
Query: 263 G---IALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
I GC G + G+ G G G LS+ Q+ + + ++CL D
Sbjct: 196 SSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL-KGDG 254
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
GVL V +PL+ ++ Y + L +V GQ + I P++F + G
Sbjct: 255 DGGGVLVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQLLPINPAVFSISN-NRG 310
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G IVDCGT + L +AY+ L + + A + + + CY S P+VSL
Sbjct: 311 GTIVDCGTTLAYLIQEAYDPLVTA-INTAVSQSARQTNSKGNQCYLVSTSIGDIFPSVSL 369
Query: 431 HFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
+F G ++ L + YL+ +D A +C F SI+G++ + V +D+A R
Sbjct: 370 NFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQR 429
Query: 488 VGFTPNKC 495
+G+ C
Sbjct: 430 IGWANYDC 437
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 99/257 (38%), Positives = 135/257 (52%), Gaps = 13/257 (5%)
Query: 249 VTETVSFGN-SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL-- 305
+TET +FG+ + + GIA GC +EG F +GL+GLG G LSL Q+ + Y L
Sbjct: 1 MTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS 60
Query: 306 -VDRDSPAS-GVLEFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPS 360
+ SP S G L + GD+ ++ PL+ N V FYYVGLTG SVGG+ VQIP
Sbjct: 61 DLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSG 120
Query: 361 LFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSG 419
F D + G GG+I D GT +T L AY +RD + G KP D G
Sbjct: 121 TFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGG 180
Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQG 476
+ P++ LHF G +DL +NYL + + C++ +S AL+IIGN+ Q
Sbjct: 181 SSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMD 240
Query: 477 TRVSFDLANN-RVGFTP 492
V FDL+ N R+ F P
Sbjct: 241 FHVVFDLSGNARMLFQP 257
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 157/366 (42%), Gaps = 48/366 (13%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
VVS + EY + V TPP + + DTGS + WL+C+ P SS
Sbjct: 65 VVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK---------LPAAHTPASS 115
Query: 208 SYSPLPCAAPQCKSL-DVSACRA-----NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
SY+ LPC A CK+L D ++CRA N C+Y+ A+ DGS T G + + +F
Sbjct: 116 SYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTR--- 172
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLV--DRDSPASG 314
+ GC EGL V GL+GL G +SL Q+ A + +YCLV S
Sbjct: 173 --LDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSS 230
Query: 315 VLEFNS----ARGGDAVTAPLI--RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
L F S + A T PL+ RNK +FY + L V G+ V + +
Sbjct: 231 SLNFGSHAIVSSSPGAATTPLVAGRNK---SFYTIALDSIKVAGKPVPL--------QTT 279
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR----SVR 424
+IVD GT +T L + L + + S L+ CYD
Sbjct: 280 TTKLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKS 339
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
+P V+L G G + LP N + + T C A + I+GNV QQ V FDL
Sbjct: 340 IPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLE 399
Query: 485 NNRVGF 490
V F
Sbjct: 400 RRTVSF 405
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 160/371 (43%), Gaps = 42/371 (11%)
Query: 144 FSTPVVSGASQG--------SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
+ P S AS G +G+Y ++ +GTPP ++DT SD+ W QC PC CY+
Sbjct: 8 YQVPKKSYASNGPFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYK 67
Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
Q +P+FDP +C S +C + C Y AY D S T G L E +
Sbjct: 68 QKNPMFDP------------LKECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIAT 115
Query: 255 FGNSGS---VKGIALGCGHDNEGLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV 306
F ++ V+ I GCGH+N G+F G + + + + CLV
Sbjct: 116 FSSTDGKPIVESIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLV 175
Query: 307 --DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
D SG + A G VT PL+ +++ T Y V L G SVG V S
Sbjct: 176 PFHADPHTSGTISLGEASDVSGEGVVTTPLV-SEEGQTPYLVTLEGISVGDTFVPFNSS- 233
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
EM G I++D GT T L + Y+ L + +++ NL P T +
Sbjct: 234 -EM--LSKGNIMIDSGTPETYLPQEFYDRLVEE-LKVQINLPPIHVDPDLGTQLCYKSET 289
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSF 481
++ P ++ HF LP + ++ P D G FCFA T+ L I GN Q + F
Sbjct: 290 NLEGPILTAHFEGADVKLLPLQTFIPPKD--GVFCFAMTGTTDGLYIFGNFAQSNVLIGF 347
Query: 482 DLANNRVGFTP 492
DL V F P
Sbjct: 348 DLDKRIVFFKP 358
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 176/373 (47%), Gaps = 44/373 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+R+ +G+PP+++ + +DTGSDI W+ C PCT C S F+P TSS+ S
Sbjct: 89 GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148
Query: 212 LPCAAPQCKS---LDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GN---S 258
+PC+ +C + + C+ + C Y YGDGS T G V++T+ F GN +
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
S I GC + G + G+ G G LS+ Q+ + + ++CL D
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD 268
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+ G+L V PL+ ++ Y + L V GQ + I SLF
Sbjct: 269 N-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTTSNT-- 322
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVRVP 426
G IVD GT + L AY D FV + + P+ S V+ + C+ S P
Sbjct: 323 QGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFP 378
Query: 427 TVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
TVSL+F G A+ + +NYL+ +D+ +C + ++I+G++ + +D
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438
Query: 483 LANNRVGFTPNKC 495
LAN R+G+T C
Sbjct: 439 LANMRMGWTDYDC 451
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 165/367 (44%), Gaps = 38/367 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI----FDPKTSSSYSPL 212
G YF++IG+GTP R F + +DTGSDI W+ C C C ++SD + +D SS+ +
Sbjct: 83 GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSV 142
Query: 213 PCAAPQCKSLDV-SACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GN--SGSVKG- 263
C+ C ++ S C + + C Y + YGDGS T G LV + V GN +GS G
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGT 202
Query: 264 IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRDSPASG 314
I GCG G S G++G G S Q+ + S A+CL + + G
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN--GGG 260
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+ T P++ Y V L VG +Q+ F D D G+I+
Sbjct: 261 IFAIGEVVSPKVKTTPMLSKS---AHYSVNLNAIEVGNSVLQLSSDAF--DSGDDKGVII 315
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT + L YN L + + L + F TC+ + R R PTV+ F
Sbjct: 316 DSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSF-TCFHYID-RLDRFPTVTFQFDK 373
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNRV 488
+L + + YL V T+CF + ++L+I+G++ V +D+ N +
Sbjct: 374 SVSLAVYPQEYLFQV-REDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVI 432
Query: 489 GFTPNKC 495
G+T + C
Sbjct: 433 GWTNHNC 439
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 165/373 (44%), Gaps = 59/373 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--- 219
+ VG+PP+ +MVLDTGS+++WL C+ + F+P SSSY+P PC + C
Sbjct: 64 LTVGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSICTTR 119
Query: 220 -KSLDVSA-CRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+ L + A C N C V+Y D S G L ET S + G GC D+ G
Sbjct: 120 TRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-PGTLFGC-MDSAGY 177
Query: 276 FV------GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT- 328
+ GL+G+ G LSL Q+ +YC+ D A GVL G DA +
Sbjct: 178 TSDINEDSKTTGLMGMNRGSLSLVTQMSLPKFSYCISGED--ALGVLLLGD--GTDAPSP 233
Query: 329 ---APLIRNKKVDTF-----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
PL+ + Y V L G V + +Q+P S+F D G G +VD GT
Sbjct: 234 LQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQF 293
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSVRVPTV 428
T L Y+SL+D F L+ T GV D CY + VP V
Sbjct: 294 TFLLGSVYSSLKDEF------LEQTKGVLTRIEDPNFVFEGAMDLCYH-APASFAAVPAV 346
Query: 429 SLHFGAGKALDLPAKNYLIPVD--SAGTFCFAFAPTSSALSI----IGNVQQQGTRVSFD 482
+L F +G + + + L V S +CF F S L I IG+ QQ + FD
Sbjct: 347 TLVF-SGAEMRVSGERLLYRVSKGSDWVYCFTFG-NSDLLGIEAYVIGHHHQQNVWMEFD 404
Query: 483 LANNRVGFTPNKC 495
L +RVGFT C
Sbjct: 405 LLKSRVGFTQTTC 417
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 159/343 (46%), Gaps = 38/343 (11%)
Query: 169 PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR 228
PR+ +++DTGSD+ W QC+ + + PL AP C
Sbjct: 52 PRK--LIVDTGSDLIWTQCKLSSSTAAAA--------RHGSPPLSRTAPARTGAFTRTCT 101
Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEGLFVGSAGLLGLGG 287
A+ + VG L +ET +FG +V + GCG + G +G+ G+LGL
Sbjct: 102 AS-----------AAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSP 150
Query: 288 GMLSLTKQIKATSLAYCLV----DRDSPA--SGVLEFNSARGGDAVTAPLIRNKKVDT-F 340
LSL Q+K +YCL + SP + + + + + I + V+T +
Sbjct: 151 ESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVY 210
Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
YYV L G S+G + + +P + M G GG IVD G+ + L A+ +++++ + +
Sbjct: 211 YYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVR 270
Query: 401 NLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
V ++ C+ + + +V+VP + LHF G A+ LP NY AG
Sbjct: 271 LPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQE-PRAGL 329
Query: 455 FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A T+ S +SIIGNVQQQ V FD+ +++ F P +C
Sbjct: 330 MCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 120/376 (31%), Positives = 164/376 (43%), Gaps = 65/376 (17%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--- 219
+ +G+PP+ +MVLDTGS+++WL C+ + F+P SSSY+P PC + C
Sbjct: 63 LTIGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSVCMTR 118
Query: 220 -KSLDVSA-CRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
+ L + A C N C V+Y D S G L ET S + G GC D+ G
Sbjct: 119 TRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-PGTLFGC-MDSAGY 176
Query: 276 F------VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
+ GL+G+ G LSL Q+ +YC+ D A GVL GD +A
Sbjct: 177 TSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCISGED--AFGVLLL-----GDGPSA 229
Query: 330 P-------LIRNKKVDTF-----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
P L+ + Y V L G V + +Q+P S+F D G G +VD G
Sbjct: 230 PSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSG 289
Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSVRV 425
T T L YNSL+D F L+ T GV D CY + V
Sbjct: 290 TQFTFLLGPVYNSLKDEF------LEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAA-V 342
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSA--GTFCFAFAPTSSALSI----IGNVQQQGTRV 479
P V+L F +G + + + L V +CF F S L I IG+ QQ +
Sbjct: 343 PAVTLVF-SGAEMRVSGERLLYRVSKGRDWVYCFTFG-NSDLLGIEAYVIGHHHQQNVWM 400
Query: 480 SFDLANNRVGFTPNKC 495
FDL +RVGFT C
Sbjct: 401 EFDLVKSRVGFTETTC 416
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 176/373 (47%), Gaps = 44/373 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+R+ +G+PP+++ + +DTGSDI W+ C PCT C S F+P TSS+ S
Sbjct: 89 GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148
Query: 212 LPCAAPQCKS---LDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GN---S 258
+PC+ +C + + C+ + C Y YGDGS T G V++T+ F GN +
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTA 208
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
S I GC + G + G+ G G LS+ Q+ + + ++CL D
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD 268
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+ G+L V PL+ ++ Y + L V GQ + I SLF
Sbjct: 269 N-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTTSNT-- 322
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVRVP 426
G IVD GT + L AY D FV + + P+ S V+ + C+ S P
Sbjct: 323 QGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFP 378
Query: 427 TVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
TVSL+F G A+ + +NYL+ +D+ +C + ++I+G++ + +D
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438
Query: 483 LANNRVGFTPNKC 495
LAN R+G+T C
Sbjct: 439 LANMRMGWTDYDC 451
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 169/362 (46%), Gaps = 39/362 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C C + DP F P SS+Y P+ C
Sbjct: 85 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN 144
Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDN 272
C V+ C+Y+ Y + S + G L + +SFGN V + GC +
Sbjct: 145 MDCNCDHDGVN------CVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENVE 198
Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT-- 328
G L+ A G++GLG G LS+ Q+ ++ + D S G + GG A+
Sbjct: 199 TGDLYSQRADGIMGLGRGQLSIVDQLVDKNV---INDSFSLCYGGMHV----GGGAMVLG 251
Query: 329 ----APLIRNKKVD----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
P + + D +Y + L V G+ +++ PS F+ G ++D GT
Sbjct: 252 GIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKH----GTVLDSGTTY 307
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGA 434
L +A+ + RD+ ++ + NLK G D C+ +G S P V + F
Sbjct: 308 AYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSN 367
Query: 435 GKALDLPAKNYLIP-VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
G+ L L +NYL G +C + +++G + + T V++D N ++GF
Sbjct: 368 GQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWKT 427
Query: 494 KC 495
C
Sbjct: 428 NC 429
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 175/373 (46%), Gaps = 45/373 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-----YQQSDPIFDPKTSSSYSP 211
G Y++R+ +G PP+ F + +DTGSD+ W+ C C C Q FDP +S++ S
Sbjct: 81 GLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASL 140
Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF-------GNS 258
+ C+ C +S D SAC ++N+C Y YGDGS T G V + + S
Sbjct: 141 VSCSDQICALGVQSSD-SACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTS 199
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
S + GC G S G+ G G LS+ Q+ + +A +CL D
Sbjct: 200 NSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDD 259
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
S G+L + V PL+ ++ Y + L SV GQ + I P++F +
Sbjct: 260 S-GGGILVLGEIVEPNVVYTPLVPSQP---HYNLNLQSISVNGQVLPISPAVFATSSS-- 313
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL--KPTSGVALF-DTCYDFSGLRSVRVP 426
G I+D GT + L +AYN +FV N+ + T V L + CY S S P
Sbjct: 314 QGTIIDSGTTLAYLAEEAYN----AFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVSDIFP 369
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFD 482
VSL+F G +L L A++YLI +S G +C F ++I+G++ + +D
Sbjct: 370 QVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYD 429
Query: 483 LANNRVGFTPNKC 495
LAN R+G+T C
Sbjct: 430 LANQRIGWTNYDC 442
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 155/351 (44%), Gaps = 26/351 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY--QQSDPIFDPKTSSSYSPLPCAA 216
+ VG PP ++DTGS + W+QC+PC C P+F+P SS++ C
Sbjct: 96 FLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDD 155
Query: 217 PQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
C+ C +N+C+Y+ Y G+ + G L E ++F GN+ + IA GCG++
Sbjct: 156 RFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYE 215
Query: 272 N-EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
N E L G+LGLG SL Q+ + +YC+ D + G + D + P
Sbjct: 216 NGEQLESHFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGEDADILGDP 274
Query: 331 L-IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
I + ++ YY+ L G SVG + I P +F+ G+I+D GT T L AY
Sbjct: 275 TPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKR-RGPRTGVILDSGTLYTWLADIAY- 332
Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS---VRVPTVSLHFGAGKALDLPAKNYL 446
R+ + + L P F + G S + P V+ HF G L + A +
Sbjct: 333 --RELYNEIKSILDPKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMF 390
Query: 447 IPVDSAGT---FCFAFAPTS------SALSIIGNVQQQGTRVSFDLANNRV 488
P+ T FC + PT + IG + QQ + +DL +
Sbjct: 391 YPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNI 441
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 123/421 (29%), Positives = 179/421 (42%), Gaps = 71/421 (16%)
Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL---- 185
H PA A + P + G Y +GTPP+ ++LDTGS + W+
Sbjct: 50 HPSVPATAALYPHSY------------GGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTS 97
Query: 186 --QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---------- 233
+CR C+ + P+F PK SSS + C P C+ + +A A +C
Sbjct: 98 SYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAAN 157
Query: 234 -----------YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
Y V YG GS T G L+ +T+ +V G LGC + + +GL
Sbjct: 158 CPAAASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVPGFVLGCSLVS--VHQPPSGL 213
Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDR----DSPASGVLEFNSARGGDAVT-APLIRNKKV 337
G G G S+ Q+ +YCL+ R ++ SG L GG+ + PL+++
Sbjct: 214 AGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAG 273
Query: 338 D-----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
D +YY+ L G +VGG+AV++P F + AG GG IVD GT T L + +
Sbjct: 274 DKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVA 333
Query: 393 DSFVRLAGNLKPTSGVAL----FDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
D+ V G S A C+ G RS+ +P +S HF G + LP +NY +
Sbjct: 334 DAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV 393
Query: 448 PVDSAGT--FCFAFAPTSSALS-----------IIGNVQQQGTRVSFDLANNRVGFTPNK 494
C A S S I+G+ QQQ V +DL R+GF
Sbjct: 394 VAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQS 453
Query: 495 C 495
C
Sbjct: 454 C 454
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 165/359 (45%), Gaps = 32/359 (8%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++D+GS + ++ C C +C DP F P SS+YSP+ C
Sbjct: 85 NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144
Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDN 272
C S N+C Y+ Y + S + G L + VSFG +K GC +
Sbjct: 145 VDCTCDS------DKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSE 198
Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRD-SPASGVLEFNSARGG 324
G LF A G++GLG G LS+ Q + S + C D + VL A G
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPG 258
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
T N +Y + L V G+A+++ P +F+ G G ++D GT L
Sbjct: 259 MIYTH---SNAVRSPYYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAYLP 311
Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKAL 438
QA+ + +D+ LK G D C+ +G ++ P V + FG G+ L
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKL 371
Query: 439 DLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NYL G +C F +++G + + T V++D N ++GF C
Sbjct: 372 SLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 113/209 (54%), Gaps = 16/209 (7%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
+Y + +GTPP + DTGSD+ WLQC PCT CY+Q +P+FD ++SS++S + C +
Sbjct: 58 DYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117
Query: 218 QCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
C L ++C ++ C Y +Y DGS T G L ET++ G + KG+ GCGH+
Sbjct: 118 SCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHN 177
Query: 272 NEGLFV-GSAGLLGLGGGMLSLTKQIKAT----SLAYCLV--DRDSPASGVLEFNSAR-- 322
N G F G++GLG G LSL QI ++ + CLV + + S + F
Sbjct: 178 NNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEV 237
Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSV 350
G V+ PL+ +FY+V L G SV
Sbjct: 238 LGNGVVSTPLVSKTTYQSFYFVTLLGISV 266
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 165/359 (45%), Gaps = 32/359 (8%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++D+GS + ++ C C +C DP F P SS+YSP+ C
Sbjct: 85 NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144
Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDN 272
C S N+C Y+ Y + S + G L + VSFG +K GC +
Sbjct: 145 VDCTCDS------DKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSE 198
Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRD-SPASGVLEFNSARGG 324
G LF A G++GLG G LS+ Q + S + C D + VL A G
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPG 258
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
T N +Y + L V G+A+++ P +F+ G G ++D GT L
Sbjct: 259 MIYTH---SNAVRSPYYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAYLP 311
Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKAL 438
QA+ + +D+ LK G D C+ +G ++ P V + FG G+ L
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKL 371
Query: 439 DLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NYL G +C F +++G + + T V++D N ++GF C
Sbjct: 372 SLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 167/367 (45%), Gaps = 38/367 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI----FDPKTSSSYSPL 212
G YF++IG+GTP R F + +DTGSDI W+ C C C ++SD + +D SS+ +
Sbjct: 83 GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSV 142
Query: 213 PCAAPQCKSLDV-SACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GN--SGSVKG- 263
C+ C ++ S C + + C Y + YGDGS T G LV + V GN +GS G
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGT 202
Query: 264 IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRDSPASG 314
I GCG G S G++G G S Q+ + S A+CL + + G
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN--GGG 260
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+ T P++ Y V L VG +++ + F D D G+I+
Sbjct: 261 IFAIGEVVSPKVKTTPMLSKS---AHYSVNLNAIEVGNSVLELSSNAF--DSGDDKGVII 315
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
D GT + L YN L + + L + F TC+ ++ + R PTV+ F
Sbjct: 316 DSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYTD-KLDRFPTVTFQFDK 373
Query: 435 GKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNRV 488
+L + + YL V T+CF + ++L+I+G++ V +D+ N +
Sbjct: 374 SVSLAVYPREYLFQV-REDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVI 432
Query: 489 GFTPNKC 495
G+T + C
Sbjct: 433 GWTNHNC 439
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 168/359 (46%), Gaps = 32/359 (8%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C C + DP F P S +Y P+ C
Sbjct: 86 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC- 144
Query: 216 APQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
P C C N+C+Y Y + S + G L + VSFGN + + GC +D
Sbjct: 145 TPDCN------CDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCEND 198
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
G L+ A G++GLG G LS+ K++ + S + C D ++ +
Sbjct: 199 ETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGISPPE 258
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
D V + +Y + L V G+ +Q+ P +F+ G G ++D GT L
Sbjct: 259 DMVFTH--SDPDRSPYYNINLKEMHVAGKKLQLNPKVFD----GKHGTVLDSGTTYAYLP 312
Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKAL 438
A+ + + + ++ +LK +G D C+ +G+ ++ P V + F G L
Sbjct: 313 ETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKL 372
Query: 439 DLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NYL G +C F+ +++G + + T V +D N+++GF C
Sbjct: 373 SLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 175/371 (47%), Gaps = 44/371 (11%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLP 213
YF+R+ +G+PP+++ + +DTGSDI W+ C PCT C S F+P TSS+ S +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 214 CAAPQCKS---LDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GN---SGS 260
C+ +C + + C+ + C Y YGDGS T G V++T+ F GN + S
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
I GC + G + G+ G G LS+ Q+ + + ++CL D+
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN- 295
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
G+L V PL+ ++ Y + L V GQ + I SLF G
Sbjct: 296 GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTTSNT--QG 350
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVRVPTV 428
IVD GT + L AY D FV + + P+ S V+ + C+ S PTV
Sbjct: 351 TIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTV 406
Query: 429 SLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLA 484
SL+F G A+ + +NYL+ +D+ +C + ++I+G++ + +DLA
Sbjct: 407 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 466
Query: 485 NNRVGFTPNKC 495
N R+G+T C
Sbjct: 467 NMRMGWTDYDC 477
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 163/360 (45%), Gaps = 33/360 (9%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y R +GTP + M +DT SD+ W+ C C C S +F+ S++Y L C A Q
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQ 157
Query: 219 CKSL--------------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
CK + C C + + YG GS +L +T++ + +V G
Sbjct: 158 CKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLA-TDAVPGY 215
Query: 265 ALGCGHDNEGLFVGS---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNS 320
+ GC G + + GL +LS T+ + ++ +YCL S SG L
Sbjct: 216 SFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGP 275
Query: 321 ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
+ PL++N + + Y+V L VG + V +PP F + + G I D GT
Sbjct: 276 VGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTV 335
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
TRL T AY ++RD+F G + + FDTCY + PT++ F G +
Sbjct: 336 FTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMF-TGMNVT 390
Query: 440 LPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP N LI + T C A A +S L++I N+QQQ R+ +D+ N+R+G C
Sbjct: 391 LPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 450
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 102/358 (28%), Positives = 160/358 (44%), Gaps = 30/358 (8%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y + +GTPP+ S V+D ++ W QC PC C++Q P+FDP SS++ LPC +
Sbjct: 55 GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 217 PQCKSLDVSA--CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
C+S+ S+ C ++ C+Y+ G T G T+T + G + G D
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAIGAAKETLGFGCVVMTDKRL 173
Query: 275 LFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPL 331
+G +G++GLG SL Q+ T+ +YCL + S A G A G ++ T +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233
Query: 332 IR------NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
I+ + + +Y V L G GG +Q S +++D + + L
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASS-------SGSTVLLDTVSRASYLAD 286
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
AY +L+ + G S +D C FS + P + F G AL +P NY
Sbjct: 287 GAYKALKKALTAAVGVQPVASPPKPYDLC--FSKAVAGDAPELVFTFDGGAALTVPPANY 344
Query: 446 LIPVDSAGTFCFAFAPTSS--------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+ GT C ++S SI+G++QQ+ V FDL + F P C
Sbjct: 345 LL-ASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADC 401
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 166/371 (44%), Gaps = 40/371 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G YF++IG+G PP+ + + +DTGSDI W+ C C +C +SD ++DP++S+S +
Sbjct: 79 AGLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSAT 138
Query: 211 PLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GN--SGS 260
+ C C + + C + C Y V YGDGS T G V + + F GN + S
Sbjct: 139 RIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSS 198
Query: 261 VKG-IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
G + GCG G S+ G+LG G S+ Q+ A A+CL +
Sbjct: 199 ANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL--DNV 256
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G+ T P++ N+ Y V + VGG +++P +F D
Sbjct: 257 KGGGIFAIGEVVSPKVNTTPMVPNQP---HYNVVMKEIEVGGNVLELPTDIF--DTGDRR 311
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G I+D GT + L Y S+ V LK + F TC+ ++G + P V
Sbjct: 312 GTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF-TCFQYTGNVNEGFPVVKF 370
Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLA 484
HF +L + +YL + +CF + + ++++G++ V +DL
Sbjct: 371 HFNGSLSLTVNPHDYLFQIHEE-VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLE 429
Query: 485 NNRVGFTPNKC 495
N +G+T C
Sbjct: 430 NQAIGWTDYNC 440
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 127/422 (30%), Positives = 175/422 (41%), Gaps = 56/422 (13%)
Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
KL ++ H LK + + TPV + G Y + GTP + F VLDT
Sbjct: 52 KLAVSTSITRAHHLKNHKPN---KSLETPV---HPKTYGGYSIDLEFGTPSQTFPFVLDT 105
Query: 179 GSDINWLQCRP---CTECYQQSD-PIFDPKTSSSYSPLPCAAPQCKSL---DVSA--CRA 229
GS + WL C C++C S+ P F PK SSS + C P+C + DV + CR
Sbjct: 106 GSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCCRQ 165
Query: 230 -----NRC-----LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
N C Y V YG GS T G L++E ++F + LGC +
Sbjct: 166 DKAAFNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFP-TKKYSDFLLGCSVVS---VYQP 220
Query: 280 AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG------VLEFNSARGGD---AVTAP 330
AG+ G G G SL Q+ T +YCL+ S VLE S+R G P
Sbjct: 221 AGIAGFGRGEESLPSQMNLTRFSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTP 280
Query: 331 LIRN---KKVDTF---YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
++N KK F YY+ L VG + V++P L E + GDGG IVD G+ T ++
Sbjct: 281 FLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFME 340
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALF--DTCYDFS-GLRSVRVPTVSLHFGAGKALDLP 441
++ + F + + F C+ + G + P + F G + LP
Sbjct: 341 RPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLP 400
Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALS--------IIGNVQQQGTRVSFDLANNRVGFTPN 493
NY V C A S I+GN QQQ V +DL N R GF
Sbjct: 401 VANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQ 460
Query: 494 KC 495
C
Sbjct: 461 SC 462
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 167/370 (45%), Gaps = 39/370 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +G+P + F + +DTGSDI W+ C C+ C S FD SS+ +
Sbjct: 81 GLYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 212 LPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN--------S 258
+ CA P C S C +AN+C Y YGDGS T G V++T+ F +
Sbjct: 141 VSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVA 200
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
S I GC G + G+ G G G LS+ Q+ + + ++CL +
Sbjct: 201 NSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+ GVL V +PL+ + Y + L +V GQ + I ++F +
Sbjct: 261 N-GGGVLVLGEILEPSIVYSPLVPSLP---HYNLNLQSIAVNGQLLPIDSNVFA--TTNN 314
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL-KPTSGVALFDTCYDFSGLRSVRVPTV 428
G IVD GT + L +AYN D+ KP ++ + CY S P V
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPI--ISKGNQCYLVSNSVGDIFPQV 372
Query: 429 SLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
SL+F G ++ L ++YL+ +DSA +C F +I+G++ + +DLAN
Sbjct: 373 SLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDLAN 432
Query: 486 NRVGFTPNKC 495
R+G+ C
Sbjct: 433 QRIGWADYNC 442
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 163/357 (45%), Gaps = 31/357 (8%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y SR+ +GTPP +FS+++DTGS + ++ C CT C DP F P SSSY PL C +
Sbjct: 33 GYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS 92
Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--IALGCGHDNEG 274
+C + C +R YQ Y + S + G L + + F NS + G + GC G
Sbjct: 93 -ECST---GFCDGSR-KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCETAETG 147
Query: 275 -LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPASGVLEFNSARGGD 325
L+ +A G++GLG G LS+ Q+ SL Y +D A + F + D
Sbjct: 148 DLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPK--D 205
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
V + +Y + L G VGG +++ P +F+ G G ++D GT
Sbjct: 206 MVFT--ASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAYFPG 259
Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKALD 439
A+ + + + G+LK G D CY +G S P+V FG G+++
Sbjct: 260 AAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVT 319
Query: 440 LPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NYL +G +C +++G + + V+++ +GF KC
Sbjct: 320 LSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKC 376
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 122/429 (28%), Positives = 177/429 (41%), Gaps = 65/429 (15%)
Query: 121 QLAIYNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
LA ++ R H LK + +FS S+ G Y + +GTP + +++DTG
Sbjct: 50 HLATTSISRAHHLKSPKT-----NFSLIKTPLFSRSYGGYSMSLSLGTPSQTVKLIMDTG 104
Query: 180 SDINWLQCRP---CTEC-YQQSD----PIFDPKTSSSYSPLPCAAPQCK----SLDVSAC 227
S + W C C C + +D P F P+ SSS + C P+C S S C
Sbjct: 105 SSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKC 164
Query: 228 -----RANRCL-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
+A C Y + YG GS T G L++ET++F N ++ GC +
Sbjct: 165 HNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFPNK-TISDFLAGCSLLSTR--- 219
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASG--VLEFNSARGGDAVTA--- 329
G+ G G SL Q+ +YCLV R DSP S +L+ + T
Sbjct: 220 QPEGIAGFGRSQESLPLQLGLKKFSYCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSY 279
Query: 330 -PLIRN------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
P +N +YYV L VG V++P S G+GG IVD G+ T
Sbjct: 280 TPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTF 339
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVPTVSLHFGAGKALD 439
++ + L F + N + V C+D SG +SV +P ++ F G +
Sbjct: 340 VEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQ 399
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSA-------------LSIIGNVQQQGTRVSFDLANN 486
LP NY VD G C ++A I+GN QQQ + +DL N+
Sbjct: 400 LPLSNYFAFVD-MGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLEND 458
Query: 487 RVGFTPNKC 495
R GF C
Sbjct: 459 RFGFKEQSC 467
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 166/358 (46%), Gaps = 30/358 (8%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++D+GS + ++ C C +C DP F P SS+YSP+ C+
Sbjct: 82 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCS 141
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
A D S +C Y+ Y + S + G L + VSFG +K GC +
Sbjct: 142 ADCTCDSDKS-----QCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSET 196
Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
G LF A G++GLG G LS+ Q + S + C D ++ D
Sbjct: 197 GDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPDM 256
Query: 327 VTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
V + R+ V + YY + L V G+A+++ P +F+ G ++D GT L
Sbjct: 257 VFS---RSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKH----GTVLDSGTTYAYLPE 309
Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKALD 439
QA+ + +D+ LK G D C+ +G S P V + FG G+ L
Sbjct: 310 QAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKLS 369
Query: 440 LPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NYL G +C F +++G + + T V++D N ++GF C
Sbjct: 370 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 427
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 165/366 (45%), Gaps = 46/366 (12%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP-------CAAP 217
+GTPP+ +VLDTGS ++W+QC + ++ P+ PKT+S L C P
Sbjct: 72 IGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSSSFSLLPCNHP 130
Query: 218 QCK------SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
CK +L S C NR C Y Y DG+ G+LV E +F S S + LGC
Sbjct: 131 ICKPRIPDFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGCAQ 189
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR------------DSPASGVLEF 318
+ + G+LG+ G LS Q K + +YC+ R D+P S ++
Sbjct: 190 AS----TENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKY 245
Query: 319 NSARGGDAVTAPLIRNK-KVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
+ +T P ++ +D Y + + + G+ + IPP+ F+ D G G ++D
Sbjct: 246 VTM-----LTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 300
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RVPTVSLHF 432
G+ +T L +AY +++ VRL G + V + D C+D V R+ +S F
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEF 360
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS---ALSIIGNVQQQGTRVSFDLANNRVG 489
G + + ++ G C + +IIG V QQ V +DLAN RVG
Sbjct: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420
Query: 490 FTPNKC 495
F +C
Sbjct: 421 FGGAEC 426
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 159/358 (44%), Gaps = 30/358 (8%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y + +GTPP+ S V+D ++ W QC PC C++Q P+FDP SS++ LPC +
Sbjct: 55 GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 217 PQCKSLDVSA--CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
C+S+ S+ C ++ C+Y+ G T G T+T + G + G D
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGAAKETLGFGCVVMTDKRL 173
Query: 275 LFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPL 331
+G +G++GLG SL Q+ T+ +YCL + S A G A G ++ T +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233
Query: 332 IR------NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
I+ + + +Y V L G GG +Q S +++D + + L
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASS-------SGSTVLLDTVSRASYLAD 286
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
AY +L+ + G S +D C F + P + F G AL +P NY
Sbjct: 287 GAYKALKKALTAAVGVQPVASPPKPYDLC--FPKAVAGDAPELVFTFDGGAALTVPPANY 344
Query: 446 LIPVDSAGTFCFAFAPTSS--------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L+ GT C ++S SI+G++QQ+ V FDL + F P C
Sbjct: 345 LL-ASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADC 401
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 119/419 (28%), Positives = 181/419 (43%), Gaps = 58/419 (13%)
Query: 121 QLAIYNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
LA ++ R H LK +A L + P GA + + GTPP++ S ++DTG
Sbjct: 54 HLATASMSRSHHLKHGKASPLIQTSLFPHSYGA------HTIPLSFGTPPQKLSFLMDTG 107
Query: 180 SDINWLQC---RPCTEC---YQQSDPIFDPKTSSSYSPLPCAAPQCK-------SLDVSA 226
S + W C CT C + PIF+P+ SSS L C P+C L
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPR 167
Query: 227 CRAN--RC-----LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--GHDNEGLFV 277
C N +C Y + YG G+ + G + E + F ++ +GC D E
Sbjct: 168 CNGNSKKCSHACPQYTLQYGTGAAS-GFFLLENLDFPGK-TIHKFLVGCTTSADREP--- 222
Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD---SPASG--VLEFNSARGGDAVTAPLI 332
S L G G M SL Q+ AYCL D + SG +L+++ AP
Sbjct: 223 SSDALAGFGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFX 282
Query: 333 RNK-KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY--- 388
+N +YY+G+ +G + ++IP GG+++D G A + + +
Sbjct: 283 KNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIV 342
Query: 389 -NSLRD--SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
N L+ S R + L+ +GV CY+F+G +S+++P + F G + +P NY
Sbjct: 343 TNELKKQMSKYRRSLELEAQTGVT---PCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNY 399
Query: 446 LIPVDSAGTFCFAF---APTSS------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ A CF +PTS+ I+GN QQ V FDL N R+GF C
Sbjct: 400 FLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 108/372 (29%), Positives = 172/372 (46%), Gaps = 43/372 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPP +F++ +DTGSD+ W+ C C C Q S FDP +SS+ S
Sbjct: 76 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSM 135
Query: 212 LPCAAPQC----KSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
+ C+ +C +S D + + + N+C Y YGDGS T G V++ + +
Sbjct: 136 IACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTN 195
Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
S + GC + G S G+ G G +S+ Q+ + +A +CL DS
Sbjct: 196 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL-KGDS 254
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G+L + V L+ + Y + L SV GQ +QI S+F +
Sbjct: 255 SGGGILVLGEIVEPNIVYTSLVPAQP---HYNLNLQSISVNGQTLQIDSSVFATSNS--R 309
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPT 427
G IVD GT + L +AY D FV P S V+ + CY + + P
Sbjct: 310 GTIVDSGTTLAYLAEEAY----DPFVSAITAAIPQSVRTVVSRGNQCYLITSSVTDVFPQ 365
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDL 483
VSL+F G ++ L ++YLI +S G +C F ++I+G++ + V +DL
Sbjct: 366 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 425
Query: 484 ANNRVGFTPNKC 495
A R+G+ C
Sbjct: 426 AGQRIGWANYDC 437
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 167/367 (45%), Gaps = 29/367 (7%)
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
P+ G+ +Y +G GTP +QF M LDT ++ + C+PC DP FD S
Sbjct: 137 PIDGSPDAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQS 196
Query: 207 SSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
++++ +PC +P C S + C A C + + + +G+F+ + ++ S +V+
Sbjct: 197 TTFTHVPCDSPDCPS--TANCSAGSVCPFNLFFVEGTFS-----QDVLTVAPSVAVQDFT 249
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR-DSPASGVLEFNSA 321
C + G L L SL ++ + + +YC+ DSP L ++
Sbjct: 250 FVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPDSPGFLSLGDDAT 309
Query: 322 RGGDAVT--APLIRNKKVD--TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
GD T APL+ + D Y++ + G S+G + IP F + IV+ G
Sbjct: 310 VRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTF----GNNASTIVEAG 365
Query: 378 TAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
T T L AY LRD+F + +A + G FDTCY+F+GL+ + VP V FG G
Sbjct: 366 TTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQELTVPLVEFKFGNGD 425
Query: 437 ALDLPAKNYL-IPVDSAGTF---CFAFAPTSSAL----SIIGNVQQQGTRVSFDLANNRV 488
+L + L + S G F C AF+ ++IG T V +D+A V
Sbjct: 426 SLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTV 485
Query: 489 GFTPNKC 495
GF P C
Sbjct: 486 GFIPESC 492
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 170/363 (46%), Gaps = 40/363 (11%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C C DP F P+ S +Y P+ C
Sbjct: 90 NGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCT 149
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHD 271
QC C +R C Y+ Y + S + G L + VSFGN S + GC +D
Sbjct: 150 W-QCN------CDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCEND 202
Query: 272 NEGLFVG--SAGLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
G + G++GLG G LS+ K++ + S + C GV GG
Sbjct: 203 ETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYG-----GMGVGGGAMVLGG 257
Query: 325 DAVTAPLI--RNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+ A ++ R+ V + YY + L V G+ + + P +F+ G G ++D GT
Sbjct: 258 ISPPADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYA 313
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGA 434
L A+ + + + ++ +LK SG D C+ D S + S P V + FG
Sbjct: 314 YLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQI-SKSFPVVEMVFGN 372
Query: 435 GKALDLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
G L L +NYL G +C F+ + +++G + + T V +D + ++GF
Sbjct: 373 GHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFWK 432
Query: 493 NKC 495
C
Sbjct: 433 TNC 435
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 168/360 (46%), Gaps = 34/360 (9%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C C DP F P+ S +Y P+ C
Sbjct: 90 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKCT 149
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHDNE 273
QC D +C Y+ Y + S + G L + VSFGN S + GC +D
Sbjct: 150 W-QCNCDD----DRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDET 204
Query: 274 GLFVG--SAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGDAVT 328
G + G++GLG G LS+ Q+ K S A+ L GV GG +
Sbjct: 205 GDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLC---YGGMGVGGGAMVLGGISPP 261
Query: 329 APLI--RNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
A ++ + V + YY + L V G+ + + P +F+ G G ++D GT L
Sbjct: 262 ADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYAYLPE 317
Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR------SVRVPTVSLHFGAGKA 437
A+ + + + ++ +LK SG D C FSG S P V + FG G
Sbjct: 318 SAFLAFKHAIMKETHSLKRISGPDPHYNDIC--FSGAEINVSQLSKSFPVVEMVFGNGHK 375
Query: 438 LDLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L L +NYL G +C F+ + +++G + + T V +D ++++GF C
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 435
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 118/428 (27%), Positives = 181/428 (42%), Gaps = 55/428 (12%)
Query: 88 EILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS-T 146
E+LH+ +D+ R A + L++ + D L AE + + D + T
Sbjct: 65 ELLHEVVTHDF--------ARARALASRLVSSNSPNRSSSDHRHL--AEEEEVEHDLAQT 114
Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKT 205
PV + G Y+S I +G+PP+ FS+V+DTGSD+ W++C PC+ +C FD
Sbjct: 115 PV---SFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDC----SSTFDRLA 167
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--- 262
S++Y L CA D+ R ++ F G + +T+ + S +
Sbjct: 168 SNTYKALTCAD------DLRLPVLLRLWRRL------FHSGRSLRDTLKMAGAASDELEE 215
Query: 263 --GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVD-------RDS 310
G GCG +GL G G+L L G LS QI +YCL+ + S
Sbjct: 216 FPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKS 275
Query: 311 P---ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
P +E G + +Y V L G SVG Q + + PS F
Sbjct: 276 PMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFL--NG 333
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
D I D GT +T L + +S++ S + + + + D C+ +P
Sbjct: 334 QDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVA-IKGLDACFRVPPSSGQGLPD 392
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
++ HF G NY+I + S C F PT+ +SI GN+QQQ V D+ N R
Sbjct: 393 ITFHFNGGADFVTRPSNYVIDLGSLQ--CLIFVPTNE-VSIFGNLQQQDFFVLHDMDNRR 449
Query: 488 VGFTPNKC 495
+GF C
Sbjct: 450 IGFKETDC 457
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 139/486 (28%), Positives = 201/486 (41%), Gaps = 74/486 (15%)
Query: 71 SFPLNSSSS-FSLPLHSREILHKTRHNDY---------RSLVLSRLERDSARVNTLITKL 120
SFP N+SSS +SLPL + H T H+ Y L S+ + T L
Sbjct: 124 SFPQNASSSHYSLPL-LFPLHHITIHHHYFIHPHPQHHHHPPLFTHHPSSSNSHPFHT-L 181
Query: 121 QLAI-YNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
QLA+ ++ R H LK P T V + G Y + GTPP+ F VLDT
Sbjct: 182 QLAVSTSITRAHHLKNHNN---PSSLKTLV---HPKTYGGYSIDLKFGTPPQTFPFVLDT 235
Query: 179 GSDINWLQCRP---CTECYQQSD---PIFDPKTSSSYSPLPCAAPQCKSL---DVSA--C 227
GS + WL C C++C S+ P F PK S S + C P+C + DV++ C
Sbjct: 236 GSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVTSHCC 295
Query: 228 R--------ANRC-----LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
+ N C Y V YG GS T G L++E ++F + +V +GC +
Sbjct: 296 KLAKAAFSNNNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFP-AKNVSDFLVGCSVVS-- 351
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSARGGDA----- 326
G+ G G G SL Q+ T +YCL+ +SP + L + G+
Sbjct: 352 -VYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDESPENSDLVMEATNSGEGKKTNG 410
Query: 327 ------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+ P + +YY+ L VG + V++P + E D GDGG IVD G+ +
Sbjct: 411 VSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGGFIVDSGSTL 470
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALF--DTCYDFS-GLRSVRVPTVSLHFGAGKA 437
T ++ ++ + + FV+ + F C+ + G + P + F G
Sbjct: 471 TFMERPIFDLVAEEFVKQVNYTRARELEKQFGLSPCFVLAGGAETASFPEMRFEFRGGAK 530
Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSA--------LSIIGNVQQQGTRVSFDLANNRVG 489
+ LP NY V C A I+GN QQQ V DL N R G
Sbjct: 531 MRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQNFYVECDLENERFG 590
Query: 490 FTPNKC 495
F C
Sbjct: 591 FRSQSC 596
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 156/352 (44%), Gaps = 39/352 (11%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP+ S +D ++ W QC C C++Q P+F P SS++ P PC CKS+
Sbjct: 60 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 119
Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVGSA 280
C ++ C Y G G TVG + T+T + G + + GC D G G +
Sbjct: 120 PKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPAS-LGFGCVVASDIDTMG---GPS 175
Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIR---N 334
G +GLG SL Q+K T +YCL D+ + L ++ GG A T P ++ N
Sbjct: 176 GFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGGGAWT-PFVKTSPN 234
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR---LQTQAYNSL 391
+ +Y + L G + +P G ++V TA+ R L Y
Sbjct: 235 DGMSQYYPIELEEIKAGDATITMP--------RGRNTVLVQ--TAVVRVSLLVDSVYQEF 284
Query: 392 RDSFVRLAGNLKPTSGV-ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
+ + + G + V A F+ C+ +G+ P + F AG AL +P NYL V
Sbjct: 285 KKAVMASVGAAPTATPVGAPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVG 342
Query: 451 SAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ T C + + L+I+G+ QQ+ + FDL + + F P C
Sbjct: 343 N-DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 393
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 167/391 (42%), Gaps = 63/391 (16%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----------PIFDPKT 205
G Y + GTP + V DTGS + W PCT Y SD P F PK
Sbjct: 88 GGYSVSLSFGTPSQTIPFVFDTGSSLVWF---PCTSRYLCSDCNFSGLDPTQIPRFIPKN 144
Query: 206 SSSYSPLPCAAPQCK-----SLDVSACRAN--RCL-----YQVAYGDGSFTVGDLVTETV 253
SSS + C P+C+ ++ C N C Y + YG GS T G L++E +
Sbjct: 145 SSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKL 203
Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DS 310
F + +V +GC + AG+ G G G SL Q+K S ++CLV R D+
Sbjct: 204 DFPDL-TVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKLKSFSHCLVSRRFDDT 259
Query: 311 PASGVLEFNSARGGDAVT-------APLIRNKKVDT-----FYYVGLTGFSVGGQAVQIP 358
+ L ++ G + + P +N V +YY+ L VG + V+IP
Sbjct: 260 NVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVKIP 319
Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN------LKPTSGVALFD 412
G+GG IVD G+ T ++ + + + F N L+ SG+A
Sbjct: 320 YKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSGIA--- 376
Query: 413 TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------- 465
C++ SG V VP + F G ++LP NY V +A T C ++
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGGGTG 436
Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I+G+ QQQ V +DL N+R GF KC
Sbjct: 437 PAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 179/373 (47%), Gaps = 44/373 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPPR+ + +DTGSD+ W+ C C C Q S FDP +SS+ S
Sbjct: 75 GLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSL 134
Query: 212 LPCAAPQCKS---LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN-------SG 259
+ C +C+S ++C R N+C Y YGDGS T G V++ + F + +
Sbjct: 135 ISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTN 194
Query: 260 SVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
S + GC G S G+ G G +S+ Q+ + +A +CL D+
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCL-KGDN 253
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
GVL + V +PL+ ++ Y + L SV GQ V+I PS+F + +
Sbjct: 254 SGGGVLVLGEIVEPNIVYSPLVPSQP---HYNLNLQSISVNGQIVRIAPSVFA--TSNNR 308
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---DTCYDFSGLRSVRV-P 426
G IVD GT + L +AYN FV + P S ++ + CY + +V + P
Sbjct: 309 GTIVDSGTTLAYLAEEAYN----PFVIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIFP 364
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
VSL+F G +L L ++YL+ + G +C F S +++I+G++ + +D
Sbjct: 365 QVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYD 424
Query: 483 LANNRVGFTPNKC 495
LA R+G+ C
Sbjct: 425 LAGQRIGWANYDC 437
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 165/366 (45%), Gaps = 46/366 (12%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP-------CAAP 217
+GTPP+ +VLDTGS ++W+QC + ++ P+ PKT+S L C P
Sbjct: 72 IGTPPQPTDLVLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSSSFSLLPCNHP 130
Query: 218 QCK------SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
CK +L S C NR C Y Y DG+ G+LV E +F S S + LGC
Sbjct: 131 ICKPRIPDFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGCAQ 189
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR------------DSPASGVLEF 318
+ + G+LG+ G LS Q K + +YC+ R D+P S ++
Sbjct: 190 AS----TENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKY 245
Query: 319 NSARGGDAVTAPLIRNK-KVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
+ +T P ++ +D Y + + + G+ + +PP+ F+ D G G ++D
Sbjct: 246 VTM-----LTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGSGQTMIDS 300
Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RVPTVSLHF 432
G+ +T L +AY +++ VRL G + V + D C+D V R+ +S F
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEF 360
Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS---ALSIIGNVQQQGTRVSFDLANNRVG 489
G + + ++ G C + +IIG V QQ V +DLAN RVG
Sbjct: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420
Query: 490 FTPNKC 495
F +C
Sbjct: 421 FGGAEC 426
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 113/394 (28%), Positives = 170/394 (43%), Gaps = 59/394 (14%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDPI----FDP 203
+++ G Y + GTP + V DTGS + WL C C+ C + DP F P
Sbjct: 83 SAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIP 142
Query: 204 KTSSSYSPLPCAAPQCKSL------------DVSACRANRCLYQVAYGDGSFTVGDLVTE 251
K SSS + C +P+C+ L + C Y + YG GS T G L+TE
Sbjct: 143 KNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITE 201
Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR--- 308
+ F + +V +GC + AG+ G G G +SL Q+ ++CLV R
Sbjct: 202 KLDFPDL-TVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFD 257
Query: 309 DSPASGVLEFNSARGGDAVTA------------PLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
D+ + L+ ++ G ++ + P + NK +YY+ L VG + V+
Sbjct: 258 DTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVK 317
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN------LKPTSGVAL 410
IP GDGG IVD G+ T ++ + + + F N L+ +G+
Sbjct: 318 IPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG- 376
Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--------- 461
C++ SG V VP + F G L+LP NY V + T C
Sbjct: 377 --PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGG 434
Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T A+ I+G+ QQQ V +DL N+R GF KC
Sbjct: 435 TGPAI-ILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 175/364 (48%), Gaps = 38/364 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-YQQS--DPIFDPKTSSSYSPLP 213
G Y SR+ +GTP ++F++++DTGS + ++ C CT C + Q+ DP F P SSSY +
Sbjct: 97 GYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVS 156
Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
C +P C + A R ++C Y+ Y + S + G L + + FGN ++ + GC
Sbjct: 157 CNSPDCITKMCDA-RVHQCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLLFGCETA 215
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQIKAT-------SLAYCLVDRDSPASGVLEFNSAR 322
G L++ A G++GLG G LS+ Q+ T SL Y +D + S VL
Sbjct: 216 ETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMD-EGGGSMVL------ 268
Query: 323 GGDAVTAPLIRNKKVD----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
G P + K D +Y + L+ V G ++ +P +F G G ++D GT
Sbjct: 269 -GAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFN----GRLGTVLDSGT 323
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRS----VRVPTVSLHF 432
L +A+++ +D+ + G+L+ G + D C+ +G S P V F
Sbjct: 324 TYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVF 383
Query: 433 GAGKALDLPAKNYLIP-VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
+ + L +NYL G +C F A +++G + + T V++D AN+++GF
Sbjct: 384 SGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQIGFF 443
Query: 492 PNKC 495
C
Sbjct: 444 KTNC 447
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 173/372 (46%), Gaps = 43/372 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPP +F++ +DTGSD+ W+ C C+ C Q S FDP +SS+ S
Sbjct: 73 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 132
Query: 212 LPCAAPQC----KSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
+ C+ +C +S D + + + N+C Y YGDGS T G V++ + +
Sbjct: 133 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 192
Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
S + GC + G S G+ G G +S+ Q+ + +A +CL DS
Sbjct: 193 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL-KGDS 251
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G+L + V L+ + Y + L +V GQ +QI S+F +
Sbjct: 252 SGGGILVLGEIVEPNIVYTSLVPAQP---HYNLNLQSIAVNGQTLQIDSSVFATSNS--R 306
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPT 427
G IVD GT + L +AY D FV P S V+ + CY + + P
Sbjct: 307 GTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTVVSRGNQCYLITSSVTEVFPQ 362
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDL 483
VSL+F G ++ L ++YLI +S G +C F ++I+G++ + V +DL
Sbjct: 363 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 422
Query: 484 ANNRVGFTPNKC 495
A R+G+ C
Sbjct: 423 AGQRIGWANYDC 434
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 165/375 (44%), Gaps = 46/375 (12%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G YF++IG+G+P + + + +DTGSDI W+ C CT C ++SD ++DPK S +
Sbjct: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125
Query: 211 PLPCAAPQCKSL---DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSV- 261
+ C C S + C+A N C Y ++YGDGS T G V + ++F GN +
Sbjct: 126 FVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTAT 185
Query: 262 --KGIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRD 309
I GCG G F S+ G++G G S+ Q+ A+ ++CL
Sbjct: 186 QNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL--DT 243
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+ G+ T PL+ N Y V L V G +Q+P F D
Sbjct: 244 NVGGGIFSIGEVVEPKVKTTPLVPNM---AHYNVILKNIEVDGDILQLPSDTF--DSENG 298
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVP 426
G ++D GT + L Y+ L + LK V L + +C+ ++G P
Sbjct: 299 KGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLK----VYLVEEQYSCFQYTGNVDSGFP 354
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRVS 480
V LHF +L + +YL +C + ++S ++++G+ V
Sbjct: 355 IVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVV 414
Query: 481 FDLANNRVGFTPNKC 495
+DL N +G+T C
Sbjct: 415 YDLENMTIGWTDYNC 429
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 165/376 (43%), Gaps = 66/376 (17%)
Query: 176 LDTGSDINWLQC---RPCTECYQQS--DPIFDPKTSSSYSPLPCAAPQCKSL-------- 222
+DTGSD+ W+ C C C + S + +F P+ SSS + CA CK+L
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 223 ------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHD 271
+ C Y + YG GS T G L+TET++ + ++ A+GC
Sbjct: 61 CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGCS-- 117
Query: 272 NEGLFVGS---AGLLGLGGGMLSLTKQ----IKATSLAYCL----VDRDSPASGVLEFNS 320
V S +G+ G G G LS+ Q I AYCL D ++ S ++ +
Sbjct: 118 ----IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDK 173
Query: 321 ARGGDAVT--APLIRNKKV------DTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGG 371
A + P + N + +YY+GL G S+GG+ + Q+P L D G+GG
Sbjct: 174 ALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGG 233
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-----RLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
I+D GT T + + + F R AG ++ +G+ L CYD +GL ++ +P
Sbjct: 234 TIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGL---CYDVTGLENIVLP 290
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS-------IIGNVQQQGTRV 479
+ HF G + LP NY S + C + L I+GN QQQ +
Sbjct: 291 EFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYL 350
Query: 480 SFDLANNRVGFTPNKC 495
+D NR+GFT C
Sbjct: 351 LYDREKNRLGFTQQTC 366
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 170/378 (44%), Gaps = 42/378 (11%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G +G Y++RIG+G+PP F + +DTGSDI W+ C C+ C ++SD +++PK
Sbjct: 64 NGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPK 123
Query: 205 TSSSYSPLPCAAPQCKSL---DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF----G 256
+SS+ + + C P C + + C+ + C Y+V YGDGS T G V + + G
Sbjct: 124 SSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVG 183
Query: 257 NSGSVK---GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYC 304
N + + I GCG G S+ G+LG G S+ Q+ AT A+C
Sbjct: 184 NHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHC 243
Query: 305 LVDRDS-PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
L DS G+ T P++ N+ Y V L G VG A+ +P LFE
Sbjct: 244 L---DSISGGGIFAIGEVVEPKLKTTPVVPNQ---AHYNVVLNGVKVGDTALDLPLGLFE 297
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
G I+D GT + L Y L + + +LK + F TC+ F
Sbjct: 298 TSYK--RGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDD 354
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGT 477
PTV+ F L + YL + +C + + + ++++G++ Q
Sbjct: 355 GFPTVTFKFEESLILTIYPHEYLFQIRDD-VWCVGWQNSGAQSKDGNEVTLLGDLVLQNK 413
Query: 478 RVSFDLANNRVGFTPNKC 495
V ++L N +G+T C
Sbjct: 414 LVYYNLENQTIGWTEYNC 431
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 148/343 (43%), Gaps = 37/343 (10%)
Query: 185 LQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR---CLYQVAYGDG 241
+QC+PC CY+Q DP+F+PK SSSY+ +PC + C LD C + C Y Y
Sbjct: 1 MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGH 60
Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATS 300
T G L + ++ G + GC + G A GL+GLG G LSL Q+
Sbjct: 61 GVTKGTLAIDKLAIGGD-VFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHR 119
Query: 301 LAYCLVDRDSPASGVLEFNSARGG-----DAVTAPLIRNKKVDTFYYVGLTGFSVGGQA- 354
YCL S SG L + D VT + + + ++YY+ L G +VG Q
Sbjct: 120 FMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTP 179
Query: 355 -----VQIPPSLFEMDEAGDG-------------GIIVDCGTAITRLQTQAYNSLRDSFV 396
PPS G G G+IVD + I+ L+T Y+ L D
Sbjct: 180 GTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLE 239
Query: 397 RLAGNLKPTSGVAL-FDTCY---DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
+ T + L D C+ + G+ V VPTVSL F G+ L+L V
Sbjct: 240 EEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFD-GRWLELDRDRLF--VTDG 296
Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C T S +SI+GN Q Q RV F+L ++ F C
Sbjct: 297 RMMCLMIGRT-SGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 129/461 (27%), Positives = 190/461 (41%), Gaps = 98/461 (21%)
Query: 116 LITKLQLAIYNVDRHELKPAEAQILPE----------DFSTPVVSGASQGSGEYFSRIGV 165
L L + +N H LK L S P+ G+ +Y +
Sbjct: 27 LTHSLSMIEFNTTHHLLKSTSTHSLSRFHRHKHHHHNQLSLPLSPGS-----DYTLSFNL 81
Query: 166 GTPPRQFSMVLDTGSDINWLQCRP--CTECYQQ----SDPIFDPKTSSSYS-PLPCAAPQ 218
G + ++ +DTGSD+ W C P C C + SDP P T+ S+S P+ C +
Sbjct: 82 GPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDP--SPPTNISHSTPISCNSHA 139
Query: 219 CK--------------------SLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGN 257
C S++ C + C + AYGDGS + L +T+S +
Sbjct: 140 CSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSL-IASLYRDTLSL-S 197
Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRD-- 309
+ + GC H F G+ G G G+LSL Q+ S +YCLV
Sbjct: 198 TLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFR 254
Query: 310 -----SPASGVL-EFNSAR--GGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
P+ +L +N + GD V ++ N K FY VGL G SVG + V
Sbjct: 255 SERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKTVPA 314
Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN-------LKPTSGVAL 410
P L +++ GDGG++VD GT T L + YNS+ + F R A ++ +G++
Sbjct: 315 PKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTGLS- 373
Query: 411 FDTCYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV----------DSAGTFCFAF 459
CY + + VP V+L F G ++ LP KNY + G F
Sbjct: 374 --PCYYLN--TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMN 429
Query: 460 APTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +S ++GN QQQG V +DL RVGF KC
Sbjct: 430 GGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 122/459 (26%), Positives = 192/459 (41%), Gaps = 61/459 (13%)
Query: 84 LHSREILHKTRHNDYRSLV---LSRLERDSARVNTLITKLQLAIYNVDR-HELKPAEAQI 139
L SR +L + N+ + + L+ + L+ LA ++ R H LK +A
Sbjct: 14 LFSRLVLASSSKNNIPATITIPLTPTFTKNPSTEPLLFLQHLATASMSRSHHLKHGKASP 73
Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC---RPCTEC--- 193
L + P G + + GTPP++ S ++DTGS + W C CT C
Sbjct: 74 LIQTSLFP------HSHGGHTIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFS 127
Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKS-------LDVSACRAN--RC-----LYQVAYG 239
+ PIF+P+ SSS L C P+C + L C N +C Y + YG
Sbjct: 128 NPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYG 187
Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGC--GHDNEGLFVGSAGLLGLGGGMLSLTKQIK 297
G+ + G + E + F ++ +GC D E S L G G M SL Q+
Sbjct: 188 TGAAS-GFFLLENLDFPGK-TIHKFLVGCTTSADRE---PSSDALAGFGRTMFSLPMQMG 242
Query: 298 ATSLAYCLVDRD---SPASG--VLEFNSARGGDAVTAPLIRNK-KVDTFYYVGLTGFSVG 351
AYCL D + SG +L+++ AP ++N +YY+G+ +G
Sbjct: 243 VKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIG 302
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSG 407
+ ++IP GG+++D G A + + N L+ + +L+ +
Sbjct: 303 NKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQ 362
Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF---------- 457
L CY+F+G +S+++P + F G + +P NY + A CF
Sbjct: 363 SGL-TPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNN 421
Query: 458 -AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
F P S I+GN QQ V FDL N R+GF C
Sbjct: 422 LEFTPGPSI--ILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 101/353 (28%), Positives = 157/353 (44%), Gaps = 27/353 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y + +GTPP+ S ++D G ++ W QC + C C++Q P+FD SS++ P PC A
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSF--TVGDLVTETVSFGNSGSVKGIALGCGHDNE-G 274
C+S+ +C + SF TVG + T+ V+ G + + + +A GC +E
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATAR-LAFGCAVASEMD 169
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA-----RGGDAVTA 329
GS+G +GLG LSL Q+ AT+ +YCL D+ S L ++ G A T
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKLAGAGKGAGTT 229
Query: 330 PLIR-----NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
P ++ N + Y + L G + +P S I V T +T L
Sbjct: 230 PFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQS--------GNTITVSTATPVTALV 281
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
Y LR + G V +D C+ + S P + L F G + +P +
Sbjct: 282 DSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVPVSS 340
Query: 445 YLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
YL + T C A +P +SI+G++QQ + FDL + F P C
Sbjct: 341 YLFDAGN-DTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 117/396 (29%), Positives = 177/396 (44%), Gaps = 48/396 (12%)
Query: 124 IYNVDRHELKPAEAQIL-------PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVL 176
I+ DR ++ A+I +D +P G + +G GTP ++F++++
Sbjct: 87 IFLQDRSRVRSINAKIFGQYSTQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLII 146
Query: 177 DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQV 236
DTGSD W+QC C+ + F+P SSSYS C S D + Y +
Sbjct: 147 DTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSCIP----STDTN--------YTM 194
Query: 237 AYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGG----MLSL 292
Y D S++ G V + V+ K GCG G F ++G+LGL G ++S
Sbjct: 195 KYEDNSYSKGVFVCDEVTLKPDVFPK-FQFGCGDSGGGEFGTASGVLGLAKGEQYSLISQ 253
Query: 293 TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA-PLIR-----NKKVDTFYYVGLT 346
T +YC ++ +L G A++A P ++ N Y+V L
Sbjct: 254 TASKFKKKFSYCFPPKEHTLGSLL-----FGEKAISASPSLKFTQLLNPPSGLGYFVELI 308
Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR---LAGNLK 403
G SV + + + SLF G I+D GT ITRL T AY +LR +F + ++
Sbjct: 309 GISVAKKRLNVSSSLF-----ASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSIS 363
Query: 404 PTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
P L DTCY+ G R++++P + LHF + L L C AFA
Sbjct: 364 PPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFAR 423
Query: 462 TS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S S ++IIGN QQ +V +D+ R+GF N C
Sbjct: 424 KSNPSHVTIIGNRQQVSLKVVYDIEGGRLGFG-NDC 458
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 174/371 (46%), Gaps = 41/371 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPPR F + +DTGSD+ W+ C C C Q S FDP +S + SP
Sbjct: 79 GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138
Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
+ C+ +C +S D S C + N C Y YGDGS T G V++ + F G+S
Sbjct: 139 ISCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
S + GC G V S G+ G G +S+ Q+ + +A +CL +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGEN 257
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
G+L + V PL+ ++ Y V L SV GQA+ I PS+F
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G I+D GT + L AY ++ ++ +++P V+ + CY + P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVITTSVGDIFPPV 369
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
SL+F G ++ L ++YLI ++ G +C F + ++I+G++ + +DL
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLV 429
Query: 485 NNRVGFTPNKC 495
R+G+ C
Sbjct: 430 GQRIGWANYDC 440
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 175/371 (47%), Gaps = 41/371 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y+++I +G+PPR F + +DTGSD+ W+ C C C Q S FDP +S + +P
Sbjct: 79 GLYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATP 138
Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
+ C+ +C +S D S C + N C Y YGDGS T G V++ + F G+S
Sbjct: 139 VSCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
S + GC G V S G+ G G +S+ Q+ + LA +CL +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGEN 257
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
G+L + V PL+ ++ Y V L SV GQA+ I PS+F
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G I+D GT + L AY ++ ++ +++P V+ + CY + + P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVIATSVADIFPPV 369
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
SL+F G ++ L ++YLI ++ G +C F + ++I+G++ + +DL
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLV 429
Query: 485 NNRVGFTPNKC 495
R+G+ C
Sbjct: 430 GQRIGWANYDC 440
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 167/363 (46%), Gaps = 35/363 (9%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLPCAA 216
Y + +G+PP + + DTGS+I W+QC CT CY+Q P+F+P SS+Y+ C
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167
Query: 217 PQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG-----I 264
+CK + C+++ C Y ++Y D SF+ G + T+ ++F + G +
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227
Query: 265 ALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-SPASGVLE 317
GCG++N + G++GLG M SL Q+ +YC+ D +G +E
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCISTPDVQKPNGTIE 287
Query: 318 FNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGI 372
R G A + + + N + + + G V V+ P +F+ E G GG+
Sbjct: 288 I---RFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGL 344
Query: 373 IVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
I+D GT T L A ++L + LA + + S + + CY+ + VP +
Sbjct: 345 IMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSN-SNYSLCYNAANFLLTYVPAIE 403
Query: 430 LHFGAGKALDLPAKNYLIPVDSAG-TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
L F K P +D+ +C A T S +SIIG Q + ++ +DL N V
Sbjct: 404 LKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMFGT-SGISIIGIYQHRDIKIGYDLKYNLV 462
Query: 489 GFT 491
FT
Sbjct: 463 SFT 465
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 174/371 (46%), Gaps = 41/371 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPPR F + +DTGSD+ W+ C C C Q S FDP +S + SP
Sbjct: 79 GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138
Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
+ C+ +C +S D S C + N C Y YGDGS T G V++ + F G+S
Sbjct: 139 ISCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
S + GC G V S G+ G G +S+ Q+ + +A +CL +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGEN 257
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
G+L + V PL+ ++ Y V L SV GQA+ I PS+F
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G I+D GT + L AY ++ ++ +++P V+ + CY + P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVITTSVGDIFPPV 369
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
SL+F G ++ L ++YLI ++ G +C F + ++I+G++ + +DL
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLV 429
Query: 485 NNRVGFTPNKC 495
R+G+ C
Sbjct: 430 GQRIGWANYDC 440
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 170/378 (44%), Gaps = 42/378 (11%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G +G Y++RIG+G+PP F + +DTGSDI W+ C C+ C ++SD +++PK
Sbjct: 64 NGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPK 123
Query: 205 TSSSYSPLPCAAPQCKSL---DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF----G 256
+SS+ + + C P C + + C+ + C Y+V YGDGS T G V + + G
Sbjct: 124 SSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVG 183
Query: 257 NSGSVK---GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYC 304
N + + I GCG G S+ G+LG G S+ Q+ AT A+C
Sbjct: 184 NHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHC 243
Query: 305 LVDRDS-PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
L DS G+ P++ N+ Y V L G VG A+ +P LFE
Sbjct: 244 L---DSISGGGIFAIGEVVEPKLXNTPVVPNQ---AHYNVVLNGVKVGDTALDLPLGLFE 297
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
+ G I+D GT + L Y L + + +LK + F TC+ F
Sbjct: 298 T--SYKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDD 354
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGT 477
PTV+ F L + YL + +C + + + ++++G++ Q
Sbjct: 355 GFPTVTFKFEESLILTIYPHEYLFQIRDD-VWCVGWQNSGAQSKDGNEVTLLGDLVLQNK 413
Query: 478 RVSFDLANNRVGFTPNKC 495
V ++L N +G+T C
Sbjct: 414 LVYYNLENQTIGWTEYNC 431
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 165/349 (47%), Gaps = 19/349 (5%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
Y + +GTPP+ S ++D G ++ W QC + C C++Q P+FD SS++ P PC A
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 218 QCKSLDVSACRANRCLYQVAYGDGSF--TVGDLVTETVSFGNSGSVKGIALGCGHDNE-G 274
C+S+ +C + SF TVG + T+ V+ G + + + +A GC +E
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATAR-LAFGCAVASEMD 169
Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA-----RGGDAVTA 329
GS+G +GLG LSL Q+ AT+ +YCL D+ S L ++ G A T
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKLAGAGKGAGTT 229
Query: 330 PLIRNKKVDTFYYVGLT-GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
P + K T + GL+ + + +A++ + M ++G+ I+V T +T L Y
Sbjct: 230 PFV---KTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGN-TIMVSTATPVTALVDSVY 285
Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
LR + G V +D C+ + S P + L F G + +P +YL
Sbjct: 286 RDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVPVSSYLFD 344
Query: 449 VDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ T C A +P +SI+G++QQ + FDL + F P C
Sbjct: 345 AGN-DTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 108/375 (28%), Positives = 168/375 (44%), Gaps = 47/375 (12%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD------PIFDPKTSSSY 209
+G Y+++I +GTPP + + +DTGSD+ WL C PCT C ++ +DP SS+
Sbjct: 34 TGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTD 93
Query: 210 SPLPCAAPQC----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF---GNSGSVK 262
L C C S +VS A C Y YGDGS T G + + ++F N+ V
Sbjct: 94 GALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVN 153
Query: 263 GIA---LGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDS 310
G A GCG G + S+ GL+G G +S+ Q+ + A+CL D+
Sbjct: 154 GTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCL-QGDN 212
Query: 311 PASGVLEFNSARGGDAVTAPLI-RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
G + S + P++ RN Y VG+ +V G+ V P S F+
Sbjct: 213 QGGGTIVIGSVSEPNISYTPIVSRNH-----YAVGMQNIAVNGRNVTTPAS-FDTTSTSA 266
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-SVRVPTV 428
GG+I+D GT + L AY FV + +S + C + PTV
Sbjct: 267 GGVIMDSGTTLAYLVDPAYT----QFVNAVSTFE-SSMFSSHSQCLQLAWCSLQADFPTV 321
Query: 429 SLHFGAGKALDLPAKNYLI--PVDSA-GTFCFAFAPTSS-----ALSIIGNVQQQGTRVS 480
L F AG ++L +NYL P+ + +C + +++ + SI+G++ + V
Sbjct: 322 KLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVV 381
Query: 481 FDLANNRVGFTPNKC 495
+D N VG+ C
Sbjct: 382 YDNDNRVVGWKSFDC 396
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 170/377 (45%), Gaps = 41/377 (10%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G +G YF++IG+GTP + + + +DTGSDI W+ C C C +SD ++D K
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205
Query: 205 TSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SGS 260
S++ + C C D + C+ +CLY V YGDGS T G V + V + SG+
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265
Query: 261 VK------GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCL 305
+ + GCG+ G S+ G+LG G S+ Q+ ++ ++CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325
Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+ D G+ PL++N+ Y V + VGG + +P F
Sbjct: 326 DNVD--GGGIFAIGEVVEPKVNITPLVQNQ---AHYNVVMKEIEVGGDPLDVPSDAF--- 377
Query: 366 EAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
E+GD G I+D GT + + Y L + + +L+ + F TC+D++G
Sbjct: 378 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF-TCFDYTGNVDDG 436
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTR 478
PTV+LHF +L + YL V +C + + + L+++G++
Sbjct: 437 FPTVTLHFDKSISLTVYPHEYLFQVKEF-EWCIGWQNSGAQTKDGKDLTLLGDLVLSNKL 495
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL +G+ C
Sbjct: 496 VVYDLEKQGIGWVEYNC 512
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 173/371 (46%), Gaps = 56/371 (15%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++D+GS + ++ C C +C DP F P SSSYSP+ C
Sbjct: 86 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC- 144
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
++D + C +++ C Y+ Y + S + G L + VSFG +K GC +
Sbjct: 145 -----NVDCT-CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENS 198
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRD----------SPASG 314
G LF A G++GLG G LS+ Q + + S + C D PA
Sbjct: 199 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAPS 258
Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
+ F+ + D + +P +Y + L V G+A+++ +F G ++
Sbjct: 259 DMVFSHS---DPLRSP---------YYNIELKEIHVAGKALRVDSRVFNSKH----GTVL 302
Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTV 428
D GT L QA+ + +D+ +LK G D C+ +G ++ P V
Sbjct: 303 DSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDV 362
Query: 429 SLHFGAGKALDLPAKNYLI---PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLA 484
+ FG G+ L L +NYL VD G +C F +++G + + T V++D
Sbjct: 363 DMVFGNGQKLSLTPENYLFRHSKVD--GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRH 420
Query: 485 NNRVGFTPNKC 495
N ++GF C
Sbjct: 421 NEKIGFWKTNC 431
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 166/379 (43%), Gaps = 42/379 (11%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ-------QSDPIFD 202
SG G+ +YF+ + VGTP ++F +V+DTGS++ W+ CR Y+ ++ +F
Sbjct: 79 SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCR-----YRGRGKGKVKNRRVFR 133
Query: 203 PKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
+ S S+ + C CK SL + C Y Y DGS G ET++
Sbjct: 134 AEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 193
Query: 256 ----GNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTK---QIKATSLAYCLVD 307
G ++G+ +GC G A G+LGL S T + L+YCLVD
Sbjct: 194 GLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVD 253
Query: 308 R--DSPASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPS 360
+ S L F + + R +D FY + + G S+G + IP
Sbjct: 254 HLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQ 313
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCY-DFS 418
++ D GG I+D GT++T L AY + R LK + + C+ S
Sbjct: 314 VW--DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTS 371
Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTSS-ALSIIGNVQQQG 476
G ++P ++ H G + K+YL VD+A G C F + A +++GN+ QQ
Sbjct: 372 GFNESKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFMSAGTPATNVVGNIMQQN 429
Query: 477 TRVSFDLANNRVGFTPNKC 495
FDL + + F P+ C
Sbjct: 430 YLWEFDLMASTLSFAPSTC 448
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 116/413 (28%), Positives = 187/413 (45%), Gaps = 54/413 (13%)
Query: 125 YNVDRHELKPAE----AQILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQFSM 174
+ ++ H+L+ + A++L + F VV + QGS G YF+++ +G+PPR+F++
Sbjct: 23 HGLELHQLRARDRLRHARLL-QGFVGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNV 81
Query: 175 VLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKS---LDVSA 226
+DTGSD+ W+ C C C + S FD +SS+ + C+ P C S +
Sbjct: 82 QIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQ 141
Query: 227 C--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GSVKGIALGCGHDNEGLFV 277
C + ++C Y YGDGS T G V++T+ F G S S I GC G
Sbjct: 142 CSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLT 201
Query: 278 GS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVT 328
+ G+ G G G LS+ Q+ + ++CL D G+L V
Sbjct: 202 KTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL-KGDGSGGGILVLGEILEPGIVY 260
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
+PL+ ++ Y + L +V GQ + I P+ F + G IVD GT + L +AY
Sbjct: 261 SPLVPSQP---HYNLNLLSIAVNGQLLPIDPAAFATSNS--QGTIVDSGTTLAYLVAEAY 315
Query: 389 NSLRDSFVRLAGNLKPTSGVALF---DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
D FV + S + + CY S S P S +F G ++ L ++Y
Sbjct: 316 ----DPFVSAVNAIVSPSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDY 371
Query: 446 LIPVDSAG---TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LIP S+G +C F ++I+G++ + +DL R+G+ C
Sbjct: 372 LIPFGSSGGSAMWCIGFQKV-QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 423
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 166/370 (44%), Gaps = 39/370 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +G+P ++F + +DTGSDI W+ C C+ C S FD SS+ +
Sbjct: 81 GLYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 212 LPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN--------S 258
+ C P C S C +AN+C Y YGDGS T G V++T+ F +
Sbjct: 141 VSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVA 200
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
S I GC G + G+ G G G LS+ Q+ + + ++CL +
Sbjct: 201 NSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
+ GVL V +PL+ ++ Y + L +V GQ + I ++F +
Sbjct: 261 N-GGGVLVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQLLPIDSNVFATTN--N 314
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL-KPTSGVALFDTCYDFSGLRSVRVPTV 428
G IVD GT + L +AYN + KP ++ + CY S P V
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPI--ISKGNQCYLVSNSVGDIFPQV 372
Query: 429 SLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
SL+F G ++ L ++YL+ +D A +C F +I+G++ + +DLAN
Sbjct: 373 SLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLAN 432
Query: 486 NRVGFTPNKC 495
R+G+ C
Sbjct: 433 QRIGWADYDC 442
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 172/369 (46%), Gaps = 52/369 (14%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++DTGS + ++ C C +C + DP F P SS+Y + C
Sbjct: 10 NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN 69
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDNE 273
C D +C+Y+ Y + S + G L + +SFGN ++ + GC +
Sbjct: 70 I-DCNCDD----EKQQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENMET 124
Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAY---------CLVDRDSPASGV 315
G L+ A G++G+G G LS+ + + SL Y ++ SP S +
Sbjct: 125 GDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPPSNM 184
Query: 316 LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
+ S D V +P +Y + L V G+ + + P++F+ G G I+D
Sbjct: 185 VFSQS----DPVRSP---------YYNIDLKEIHVAGKPLPLNPTVFD----GKHGTILD 227
Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTV 428
GT L A+ S +D+ ++ +LKP G D C+ D S L S P V
Sbjct: 228 SGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSS-SFPAV 286
Query: 429 SLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANN 486
+ FG G+ L L +NYL G +C F +++G + + T V +D N+
Sbjct: 287 EMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENS 346
Query: 487 RVGFTPNKC 495
++GF C
Sbjct: 347 KIGFWKTNC 355
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 171/392 (43%), Gaps = 50/392 (12%)
Query: 131 ELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC 190
+LK + + E + P ++G YF+++ +GTPPR +++ +DTGSD+ W+ C PC
Sbjct: 14 KLKSSAVSLPVEGVADPYIAGL------YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPC 67
Query: 191 TECYQQSD---PI--FDPKTSSSYSPLPCAAPQC---KSLDVSACR-ANRCLYQVAYGDG 241
C SD PI +D K S+S S +PC+ P C + S C N+C Y YGDG
Sbjct: 68 IGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDG 127
Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIK 297
S T+G LV + + + + + + GCG G S G++G G LS Q+
Sbjct: 128 SGTLGYLVEDVLHYMVNATAT-VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLA 186
Query: 298 ATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
A+CL D G+L + D PL+ + Y V L SV
Sbjct: 187 KQGKTPNVFAHCL-DGGERGGGILVLGNVIEPDIQYTPLVPYM---SHYNVVLQSISVNN 242
Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
+ I P LF D G I D GT + L +AY + + V L VA F
Sbjct: 243 ANLTIDPKLFSNDVM--QGTIFDSGTTLAYLPDEAYQAFTQA-VSLV--------VAPFL 291
Query: 413 TC-YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS- 467
C S P V L+F G ++ L YLI SA +C + SA S
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYF-EGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESE 350
Query: 468 ----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I G++ + V +DL R+G+ P C
Sbjct: 351 LQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 84/231 (36%), Positives = 125/231 (54%), Gaps = 24/231 (10%)
Query: 97 DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
D+ + +L D RV ++ +++ H ++ ++ QI P+ SG + +
Sbjct: 13 DWNRRLQKQLILDDLRVRSMQNRIRRV---ASTHNVEASQTQI-------PLSSGINLQT 62
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
Y +G+G+ + ++++DT SD+ W+QC PC CY Q PIF P TSSSY + C +
Sbjct: 63 LNYIVTMGLGS--KNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 217 PQCKSL-----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
C+SL + AC ++ C Y V YGDGS+T GDL E +SFG SV GC
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGV-SVSDFVFGC 179
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVL 316
G +N+GLF G +GL+GLG LSL Q AT +YCL ++ +SG L
Sbjct: 180 GRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSL 230
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 123/424 (29%), Positives = 179/424 (42%), Gaps = 74/424 (17%)
Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL---- 185
H+ PA A + P + G Y +GTPP+ ++LDTGS + W+
Sbjct: 86 HKSIPATAALYPHSY------------GGYAFTASLGTPPQPLPVLLDTGSQLTWVPCTS 133
Query: 186 --QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC----KSLDVSACRA---------- 229
CR C+ + + P+F PK SSS + C P C + V+ CRA
Sbjct: 134 NYDCRNCSSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTP 193
Query: 230 --NRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLG 286
N C Y V YG GS T G L+ +T+ +V G LGC + +GL G G
Sbjct: 194 ASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVSGFVLGC--SLVSVHQPPSGLAGFG 249
Query: 287 GGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD---AVTAPLIRNKKVD----- 338
G S+ Q+ + +YCL+ R + + + GGD PL+++ D
Sbjct: 250 RGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSGSLVLGGDNDGMQYVPLVKSAAGDKQPYA 309
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
+YY+ L+G +VGG+AV++P F + AG GG IVD GT T L + + D+ V
Sbjct: 310 VYYYLALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAA 369
Query: 399 AGNLKPTS-----GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA- 452
G S G+ L G +S+ +P +SLHF G + LP +NY + A
Sbjct: 370 VGGRYKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQLPLENYFVVAGRAP 429
Query: 453 -----------GTFCFAFA----------PTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
C A I+G+ QQQ V +DL R+GF
Sbjct: 430 VPGAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFR 489
Query: 492 PNKC 495
C
Sbjct: 490 RQPC 493
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 101/329 (30%), Positives = 155/329 (47%), Gaps = 19/329 (5%)
Query: 176 LDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQ 235
+DT SD+ W+ C C C S +F+ S++Y L C A QCK + C C +
Sbjct: 1 MDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFN 57
Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS---AGLLGLGGGMLSL 292
+ YG GS +L +T++ + +V G + GC G + + GL +LS
Sbjct: 58 LTYG-GSSLAANLSQDTITLA-TDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQ 115
Query: 293 TKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSV 350
T+ + ++ +YCL S SG L + PL++N + + Y+V L V
Sbjct: 116 TQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRV 175
Query: 351 GGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL 410
G + V +PP F + + G I D GT TRL T AY ++RD+F G + +
Sbjct: 176 GRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGG 235
Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSAL 466
FDTCY + PT++ F G + LP N LI + T C A A +S L
Sbjct: 236 FDTCYTV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVL 290
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++I N+QQQ R+ +D+ N+R+G C
Sbjct: 291 NVIANLQQQNHRLLYDVPNSRLGVARELC 319
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 155/352 (44%), Gaps = 39/352 (11%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP+ S +D ++ W QC C C++Q P+F P SS++ P PC CKS+
Sbjct: 30 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 89
Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVGSA 280
C ++ C + G G TVG + T+T + G + + GC D G G +
Sbjct: 90 PKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTAAPAS-LGFGCVVASDIDTMG---GPS 145
Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIR---N 334
G +GLG SL Q+K T +YCL D+ + L ++ GG A T P ++ N
Sbjct: 146 GFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGGGAWT-PFVKTSPN 204
Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR---LQTQAYNSL 391
+ +Y + L G + +P G ++V TA+ R L Y
Sbjct: 205 DGMSQYYPIELEEIKAGDATITMP--------RGRNTVLVQ--TAVVRVSLLVDSVYQEF 254
Query: 392 RDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
+ + + G + V F+ C+ +G+ P + F AG AL +P NYL V
Sbjct: 255 KKAVMASVGAAPTATPVGEPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVG 312
Query: 451 SAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ T C + + L+I+G+ QQ+ + FDL + + F P C
Sbjct: 313 N-DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 363
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 168/368 (45%), Gaps = 48/368 (13%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
IG G + +VLDT S + W++C C +Q P+FDP SSSY PL +P C++
Sbjct: 80 IGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAP 139
Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGC-----GHDNEGLF 276
+ ++C + + G+ VG T+T+ GN + + +A GC G D +G F
Sbjct: 140 NPVLPAGDKCSFHLP-GEAHGYVG---TDTIILGNPTLPIHSVAFGCAQSTEGFDTKGTF 195
Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR-DSPA-SGVLEFNS----------A 321
AG LG+G SL QIK + +YCL+ SP +G + F +
Sbjct: 196 ---AGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHH 252
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAI 380
R T P + + D+ YYV L G S+ G + I ++FE G GG VD GT +
Sbjct: 253 RIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQV 312
Query: 381 TRLQTQAYNSLRDSFVRLAGNL------KPTSGVALFDTCY-DFSGLRSVRVPTVSLHFG 433
T L AY + ++ + P F C+ + G+ S +P ++L F
Sbjct: 313 THLVPAAYAVVEEAVAHMVQQWGYKRVRDPN-----FSLCFREHPGIWS-HIPKLTLDFE 366
Query: 434 AGKA-----LDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNR 487
+ L++ ++N + VD+ CF TS + +++G +QQ TR FDL N
Sbjct: 367 GPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANT 426
Query: 488 VGFTPNKC 495
+ F C
Sbjct: 427 ITFHRESC 434
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 122/410 (29%), Positives = 168/410 (40%), Gaps = 58/410 (14%)
Query: 122 LAIYNVDRHELKPAEAQILPEDFSTPVVSGA--SQGSGEYFSRIGVGTPPRQFSMVLDTG 179
L+ YN +IL + FS +S S + +G PP V+DTG
Sbjct: 54 LSPYNSKDTIWDHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLAVMDTG 113
Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAY- 238
S + W+ C PC+ C QQS PIFDP SS+YS L C+ +C DV C Y V Y
Sbjct: 114 SSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCS--ECNKCDVV---NGECPYSVEYV 168
Query: 239 GDGS----FTVGDLVTETVSFGNSGSVKGIALGCGHD-----NEGLFVGSAGLLGLGGGM 289
G GS + L ET+ + V + GCG N + G G+ GLG G
Sbjct: 169 GSGSSQGIYAREQLTLETID-ESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGR 227
Query: 290 LSLTKQIKATSLAYCLVDRDSPASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
SL +YC+ + + VL + GD+ T +I + YYV L
Sbjct: 228 FSLLPSF-GKKFSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVI-----NGLYYVNL 281
Query: 346 TGFSVGGQAVQIPPSLFEMD-EAGDGGIIVDCGTAITRLQTQAY-------NSLRDSFVR 397
S+GG+ + I P+LFE + G+I+D G T L + +L + +
Sbjct: 282 EAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLV 341
Query: 398 LAGNLKPTSGVALFDTCY------DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
LA K + CY D SG P V+ HF G LDL + I +
Sbjct: 342 LAQQDKHNP----YTLCYSGVVSQDLSGF-----PLVTFHFAEGAVLDLDVTSMFIQT-T 391
Query: 452 AGTFCFAFAPTS------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
FC A P + + S IG + QQ V +DL RV F C
Sbjct: 392 ENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 179/373 (47%), Gaps = 44/373 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPPR+F + +DTGSD+ W+ C C C Q S FDP++SS+ S
Sbjct: 75 GLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSL 134
Query: 212 LPCAAPQCKS----LDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
+ C+ +C+S D S + + N+C Y YGDGS T G V++ + F +
Sbjct: 135 ISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTN 194
Query: 260 SVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
S + GC G S G+ G G +S+ Q+ +A +CL D+
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCL-KGDN 253
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
GVL + V +PL++++ Y + L SV GQ V I P++F + +
Sbjct: 254 SGGGVLVLGEIVEPNIVYSPLVQSQP---HYNLNLQSISVNGQIVPIAPAVFA--TSNNR 308
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---DTCYDFSGLRSVRV-P 426
G IVD GT + L +AYN FV L P S ++ + CY + +V + P
Sbjct: 309 GTIVDSGTTLAYLAEEAYN----PFVNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFP 364
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFD 482
VSL+F G +L L ++YL+ + G +C F +++I+G++ + +D
Sbjct: 365 QVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDKIFVYD 424
Query: 483 LANNRVGFTPNKC 495
LA R+G+ C
Sbjct: 425 LAGQRIGWANYDC 437
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 166/387 (42%), Gaps = 65/387 (16%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+ VG PP+ +MVLDTGS+++WL+C R + Q+ F+ SS+Y+ C++P+C
Sbjct: 66 VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPEC 125
Query: 220 ----KSLDV----SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
+ L V + +N C ++Y D S G L +T G + V+ + GC
Sbjct: 126 QWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRAL-FGCVTS 184
Query: 269 --------GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
D+E + GLLG+ G LS Q AYC+ D P VL
Sbjct: 185 YSSATATNSSDSEA----ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVL---- 236
Query: 321 ARGGDAVT-------APLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
GGD PLI+ ++ + F Y V L G VG + IP S+ D G
Sbjct: 237 --GGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTG 294
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA------LFDTCYDFSGLR- 421
G +VD GT T L AY L+ F+ L G + FD C+ S R
Sbjct: 295 AGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARV 354
Query: 422 ---SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSI 468
S +P V L GA A+ Y +P + G +C F + A +
Sbjct: 355 AAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYV 414
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
IG+ QQ V +DL N RVGF P +C
Sbjct: 415 IGHHHQQNVWVEYDLQNGRVGFAPARC 441
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 152/354 (42%), Gaps = 52/354 (14%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP+ S ++D PC+ P SS++ P PC CKS+
Sbjct: 73 IGTPPQPASAIIDVAGP------APCSF----------PNASSTFRPEPCGTDACKSIPT 116
Query: 225 SACRANRCLYQVAYGD--GSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVG 278
S C +N C Y+ G T+G + T+T + G + + GC G D G G
Sbjct: 117 SNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGT--ATASLGFGCVVASGIDTMG---G 171
Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRN 334
+GL+GLG SL Q+ T +YCL DS + L S A GG++ T P ++
Sbjct: 172 PSGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKT 231
Query: 335 KKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
D +Y + L G G A+ +PPS ++V ++ L AY +L
Sbjct: 232 SPGDDMSQYYPIQLDGIKAGDAAIALPPS--------GNTVLVQTLAPMSFLVDSAYQAL 283
Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG-KALDLPAKNYLIPV- 449
+ + G + + FD C+ +GL + P + F G AL +P YLI V
Sbjct: 284 KKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVG 343
Query: 450 DSAGTFCFAFAPTS--------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ GT C A TS L+I+G++QQ+ T DL + F P C
Sbjct: 344 EEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADC 397
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 170/377 (45%), Gaps = 41/377 (10%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G +G YF++IG+GTP + + + +DTGSDI W+ C C C +SD ++D K
Sbjct: 65 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 124
Query: 205 TSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SGS 260
S++ + C C D + C+ +CLY V YGDGS T G V + V + SG+
Sbjct: 125 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 184
Query: 261 VK------GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCL 305
+ + GCG+ G S+ G+LG G S+ Q+ ++ ++CL
Sbjct: 185 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 244
Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+ D G+ PL++N+ Y V + VGG + +P F
Sbjct: 245 DNVD--GGGIFAIGEVVEPKVNITPLVQNQ---AHYNVVMKEIEVGGDPLDVPSDAF--- 296
Query: 366 EAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
E+GD G I+D GT + + Y L + + +L+ + F TC+D++G
Sbjct: 297 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF-TCFDYTGNVDDG 355
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTR 478
PTV+LHF +L + YL V +C + + + L+++G++
Sbjct: 356 FPTVTLHFDKSISLTVYPHEYLFQVKEF-EWCIGWQNSGAQTKDGKDLTLLGDLVLSNKL 414
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL +G+ C
Sbjct: 415 VVYDLEKQGIGWVEYNC 431
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 127/267 (47%), Gaps = 18/267 (6%)
Query: 244 TVGDLVTETVSFGNSGSVKG-IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLA 302
+ G L TET +FG + + GCG G G++G++G+ G LS+ KQ+ T +
Sbjct: 3 STGVLATETFTFGAHQNFSANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFS 62
Query: 303 YCLVDRDSPASGVLEFNSARG-------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
YCL + + F + G T PL++N D +YYV + G S+G + +
Sbjct: 63 YCLTPFTDHKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRL 122
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--T 413
+P ++ + G GG ++D T + L A+ L+ + + G P + ++ D
Sbjct: 123 DVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVME--GMKLPAANRSIDDYPV 180
Query: 414 CYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF--APTSSALSI 468
C++ + V+VP + LHF + LP +Y S G C A AP A ++
Sbjct: 181 CFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPFEGAPNV 239
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
IGNVQQQ V +DL N + + P KC
Sbjct: 240 IGNVQQQNMHVLYDLGNRKFSYAPTKC 266
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 170/392 (43%), Gaps = 50/392 (12%)
Query: 131 ELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC 190
+LK + + E + P ++G YF+++ +GTPPR +++ +DTGSD+ W+ C PC
Sbjct: 14 KLKSSAVSLPVEGVADPYIAGL------YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPC 67
Query: 191 TECYQQSD---PI--FDPKTSSSYSPLPCAAPQC---KSLDVSACR-ANRCLYQVAYGDG 241
C SD PI +D K S+S S +PC+ P C + S C N+C Y YGDG
Sbjct: 68 IGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDG 127
Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIK 297
S T+G LV + + + + + + GCG G S G++G G LS Q+
Sbjct: 128 SGTLGYLVEDVLHYMVNATAT-VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLA 186
Query: 298 ATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
A+CL D G+L + D PL+ Y V L SV
Sbjct: 187 KQGKTPNVFAHCL-DGGERGGGILVLGNVIEPDIQYTPLVPYMY---HYNVVLQSISVNN 242
Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
+ I P LF D G I D GT + L +AY + + V L VA F
Sbjct: 243 ANLTIDPKLFSNDVM--QGTIFDSGTTLAYLPDEAYQAFTQA-VSLV--------VAPFL 291
Query: 413 TC-YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS- 467
C S P V L+F G ++ L YLI SA +C + SA S
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYF-EGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESE 350
Query: 468 ----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I G++ + V +DL R+G+ P C
Sbjct: 351 LQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 167/371 (45%), Gaps = 43/371 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +G+PP +F++ +DTGSDI W+ C C+ C S FD S +
Sbjct: 98 GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 212 LPCAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GS 260
+ C+ P C S+ + C N+C Y YGDGS T G +T+T F G S S
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217
Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
I GC G S G+ G G G LS+ Q+ + + ++CL D
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGS 276
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
GV V +PL+ ++ Y + L V GQ + + ++FE G
Sbjct: 277 GGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIGVNGQMLPLDAAVFEASNT--RG 331
Query: 372 IIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
IVD GT +T L +AY N++ +S +L T ++ + CY S S P+
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-----TPIISNGEQCYLVSTSISDMFPS 386
Query: 428 VSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
VSL+F G ++ L ++YL D A +C F +I+G++ + +DLA
Sbjct: 387 VSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLA 446
Query: 485 NNRVGFTPNKC 495
R+G+ C
Sbjct: 447 RQRIGWASYDC 457
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 168/371 (45%), Gaps = 43/371 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +G+PP +F++ +DTGSDI W+ C C+ C S FD S +
Sbjct: 98 GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 212 LPCAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GS 260
+ C+ P C S+ + C N+C Y YGDGS T G +T+T F G S S
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217
Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
I GC G S G+ G G G LS+ Q+ + + ++CL D
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGS 276
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
GV V +PL+ ++ Y + L V GQ + + ++FE + G
Sbjct: 277 GGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIGVNGQMLPLDAAVFE--ASNTRG 331
Query: 372 IIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
IVD GT +T L +AY N++ +S +L T ++ + CY S S P+
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-----TPIISNGEQCYLVSTSISDMFPS 386
Query: 428 VSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
VSL+F G ++ L ++YL D A +C F +I+G++ + +DLA
Sbjct: 387 VSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLA 446
Query: 485 NNRVGFTPNKC 495
R+G+ C
Sbjct: 447 RQRIGWASYDC 457
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 134/460 (29%), Positives = 199/460 (43%), Gaps = 90/460 (19%)
Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
LSRL R S L +L ++ + P A + P + G Y
Sbjct: 47 LSRLARAS-----LARASRLRGHHQGQAASSPVRAALYPHSY------------GGYAFS 89
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD--------PIFDPKTSSSYSPLPC 214
+ +GTPP+ ++LDTGS + W+ PCT YQ + P+F PK+SSS + C
Sbjct: 90 LSLGTPPQPLPVLLDTGSHLTWV---PCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSC 146
Query: 215 AAPQCKSL-----------DVSACR----------ANRC-LYQVAYGDGSFTVGDLVTET 252
++P C + D + CR N C Y V YG GS T G LV++T
Sbjct: 147 SSPSCLWIHSKSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGS-TAGLLVSDT 205
Query: 253 VSFGNSGSV-KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR--- 308
+ G+ + A+GC + +GL G G G S+ Q+ +YCL+ R
Sbjct: 206 LRLSPRGAASRNFAVGC--SLASVHQPPSGLAGFGRGAPSVPAQLGVNKFSYCLLSRRFD 263
Query: 309 -DSPASGVLEFNSARGGDAVT----APLIRN----KKVDTFYYVGLTGFSVGGQAVQIPP 359
D+ SG L ++ G A APL++N +YY+ LTG +VGG++V +P
Sbjct: 264 DDAAISGELVLGASSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGIAVGGKSVALPA 323
Query: 360 -SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFDTC 414
+L + G GG I+D GT T L + + + V G K G C
Sbjct: 324 RALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKDVEGALGLRPC 383
Query: 415 YDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG-----TFCFAFAPTSSALS- 467
+ +G R++ +P +SLHF G + LP +NY + A C A S+ S
Sbjct: 384 FALPAGARTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASG 443
Query: 468 ------------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I+G+ QQQ +V +DL NR+GF C
Sbjct: 444 GAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPC 483
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 106/204 (51%), Gaps = 7/204 (3%)
Query: 296 IKATSLAYCLVDRDSPASGVLEFNSARGG--DAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
+K +YCL D + VL S DA++ PL+ N +FYY+ L G VGG
Sbjct: 1 MKEAKFSYCLTSMDDSKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGT 60
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-NLKPTSGVALFD 412
+ I S+F++ + G GG+I+D GT IT L+ +++L+ F+ + L +S L D
Sbjct: 61 QLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGL-D 119
Query: 413 TCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
C+ S V VP + HF G L+LPA++Y+I G C A S+ +SI GN
Sbjct: 120 VCFSLPSETTQVEVPKLVFHFKGGD-LELPAESYMIADSKLGVACLAMG-ASNGMSIFGN 177
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
VQQQ V+ DL + F P +C
Sbjct: 178 VQQQNILVNHDLEKETISFVPTQC 201
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 116/401 (28%), Positives = 164/401 (40%), Gaps = 81/401 (20%)
Query: 168 PPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPI----FDPKTSSSYSPLPCAAPQCKS 221
PP+ S+ +DTGSD+ W C P C C + D P +S + + C +P C +
Sbjct: 83 PPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSA 142
Query: 222 LDVSA-----CRANRCLYQV----------------AYGDGSFTVGDLVTETVSFGNSGS 260
S C RC ++ AYGDGS V L +++S S
Sbjct: 143 AHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSL-VARLYRDSLSMPASSP 201
Query: 261 V--KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVD----- 307
+ GC H G VG AG G G+LSL Q+ + S +YCLV
Sbjct: 202 LVLHNFTFGCAHTALGEPVGVAGF---GRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDA 258
Query: 308 ----RDSP-ASGVLEFNSARG-------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
R SP G + + G+ V ++ N K FY VGL G +VG + +
Sbjct: 259 DRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKI 318
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-------VRLAGNLKPTSGV 408
+P L +D G+GG++VD GT T L Y SL F + A ++ +G+
Sbjct: 319 PVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGL 378
Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV--------DSAGTFCFAF- 459
CY +S + +VP V+LHF + LP NY C
Sbjct: 379 G---PCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLM 434
Query: 460 -----APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A + + +GN QQQG V +DL +RVGF KC
Sbjct: 435 NGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 155/391 (39%), Gaps = 69/391 (17%)
Query: 170 RQFSMVLDTGSDINWLQCRP--CTECYQQSDP-IFDPKTSSSYSPLPCAAPQCKS----- 221
+ S+ +DTGSDI W C P C C + +P P S S + C + C +
Sbjct: 103 QTLSVYMDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCKSRACSTAHNSP 162
Query: 222 ---------------LDVSACRANRC-LYQVAYGDGSFTVG----DLVTETVSFGNSGSV 261
++ S C C + AYGDGS +L+ + S S+
Sbjct: 163 STSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLIMPSTS-NKPFSL 221
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDR------- 308
K GC H G +G AG G G LSL Q+ S +YCLV
Sbjct: 222 KDFTFGCAHSALGEPIGVAGF---GFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTKL 278
Query: 309 DSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
P+ +L R D +T P++ N K FY V + SVG V+ P +L +
Sbjct: 279 HHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNALIRI 338
Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFDTCYDFSGL 420
D G+GG++VD GT T L T YNS+ R G + T CY G
Sbjct: 339 DRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLEGN 398
Query: 421 RSVR----VPTVSLHFGAGKALDLPAKNYLIPV---------DSAGTFCFAFAPTSSA-- 465
R VP ++ HFG ++ LP +NY G S
Sbjct: 399 GVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESEGG 458
Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +GN QQQG +V +DL RVGF P KC
Sbjct: 459 PGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 114/397 (28%), Positives = 168/397 (42%), Gaps = 65/397 (16%)
Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----------PI 200
+++ G Y + GTP + V DTGS L C PCT Y S P
Sbjct: 83 SAKSYGGYSVSLSFGTPSQTIPFVFDTGSS---LVCLPCTSRYLCSGCDFSGLDPTLIPR 139
Query: 201 FDPKTSSSYSPLPCAAPQCKSL------------DVSACRANRCLYQVAYGDGSFTVGDL 248
F PK SSS + C +P+C+ L + C Y + YG GS T G L
Sbjct: 140 FIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVL 198
Query: 249 VTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR 308
+TE + F + +V +GC + AG+ G G G +SL Q+ ++CLV R
Sbjct: 199 ITEKLDFPDL-TVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSR 254
Query: 309 ---DSPASGVLEFNSARGGDAVTA------------PLIRNKKVDTFYYVGLTGFSVGGQ 353
D+ + L+ ++ G ++ + P + NK +YY+ L VG +
Sbjct: 255 RFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRK 314
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN------LKPTSG 407
V+IP GDGG IVD G+ T ++ + + + F N L+ +G
Sbjct: 315 HVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETG 374
Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP------ 461
+ C++ SG V VP + F G L+LP NY V + T C
Sbjct: 375 LG---PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNP 431
Query: 462 ---TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T A+ I+G+ QQQ V +DL N+R GF KC
Sbjct: 432 SGGTGPAI-ILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 165/372 (44%), Gaps = 44/372 (11%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC------TECYQQSDPIFDPKTSSS 208
G EY G GTP +Q + D S ++ ++C+PC E D FDP SSS
Sbjct: 134 GVFEYTVLAGYGTPAQQLPLFFDV-SGMSNMRCKPCFSGSSGGETTTTCDVAFDPSMSSS 192
Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
+ + C +P C SA C + + F G +V +T++ S + + A+GC
Sbjct: 193 FRSVLCGSPDCGGHSCSA--GGSCTFTLQNSTFVFGNGTIVMDTLTLSPSATFENFAVGC 250
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT-----------SLAYCL-VDRDSP----- 311
+ LF + +G LSL++ AT + +YCL D D+
Sbjct: 251 MQLDNDLFTDG---VAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYCLPADTDTHGFLTI 307
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
A + +++ G V PL+ N FYYV L ++ G+ + IPP+LF + G
Sbjct: 308 APALSDYSDHAGVKYV--PLVTNPTGPNFYYVDLVAIAINGEDLPIPPALFTGN-----G 360
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
++D +A T L Y +LRD F + +P DTCY+F+ ++ +P ++L
Sbjct: 361 TMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGLDTCYNFTLAENIYLPDITLR 420
Query: 432 FGAGKALDLPAKNYLIPVDSA-------GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDL 483
F G+ +DL + ++ G FA AP + + +G+ Q+ + +D+
Sbjct: 421 FSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNFPWNYLGSQVQRTKEIVYDV 480
Query: 484 ANNRVGFTPNKC 495
V F P++C
Sbjct: 481 RGGMVAFVPSRC 492
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 169/362 (46%), Gaps = 38/362 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP++F++++D+GS + ++ C C +C DP F P SSSYSP+ C
Sbjct: 85 NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKC- 143
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
++D + C +++ C Y+ Y + S + G L + VSFG +K GC +
Sbjct: 144 -----NVDCT-CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENS 197
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
G LF A G++GLG G LS+ Q + + S + C D ++ GG
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMV-----LGG 252
Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+I + +Y + L V G+A+++ +F G ++D GT
Sbjct: 253 MLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKH----GTVLDSGTTYA 308
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAG 435
L QA+ + +++ +LK G + D C+ +G ++ P V + FG G
Sbjct: 309 YLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 368
Query: 436 KALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ L L +NYL G +C F +++G + + T V++D N ++GF
Sbjct: 369 QKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKT 428
Query: 494 KC 495
C
Sbjct: 429 NC 430
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 170/360 (47%), Gaps = 34/360 (9%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP+ F++++DTGS + ++ C C +C + DP F P SS+Y P+ C
Sbjct: 78 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC- 136
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
+LD + C +R C+Y+ Y + S + G L + VSFGN + + GC +
Sbjct: 137 -----TLDCN-CDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENV 190
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
G L+ A G++GLG G LS+ K + + S + C D ++ +
Sbjct: 191 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPS 250
Query: 325 DAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
D V A ++ V + YY + L V G+ + + PS+F+ G G ++D GT L
Sbjct: 251 DMVFA---QSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAYL 303
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKA 437
+A+ + +++ V+ + SG D C+ +G+ S P V + FG G
Sbjct: 304 PEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHK 363
Query: 438 LDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NY+ G +C F +++G + + T V +D ++GF C
Sbjct: 364 YSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNC 423
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/418 (27%), Positives = 191/418 (45%), Gaps = 52/418 (12%)
Query: 123 AIYNVDRHELKPAEA----QILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQF 172
A + ++ +LK ++ +IL S VV QG+ G YF+R+ +G+PP+ F
Sbjct: 38 ASHKLELSQLKERDSFRHRRILQSTTSGGVVDFPVQGTFNPFLVGLYFTRVQLGSPPKDF 97
Query: 173 SMVLDTGSDINWLQCRPCTEC-----YQQSDPIFDPKTSSSYSPLPCAAPQC-----KSL 222
+ +DTGSD+ W+ C C C Q FDP +S++ + + C+ +C S
Sbjct: 98 YVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVSCSDQRCTAGIQSSD 157
Query: 223 DVSACRANRCLYQVAYGDGS----FTVGDLVTETVSFGNSGSVKGI--------ALGCGH 270
+ + R N+C Y YGDGS + V DL+ +SG + I + C
Sbjct: 158 SLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSVSFMCST 217
Query: 271 DNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSA 321
G S G+ G G +S+ Q+ + + ++CL DS GVL
Sbjct: 218 LQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDS-GGGVLVLGEI 276
Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+ V PL+ ++ Y + L SV GQ + I PS+F + + G IVD GT +
Sbjct: 277 VEPNIVYTPLVPSQP---HYNLYLQSISVAGQTLAIDPSVF--GASSNQGTIVDSGTTLA 331
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
L AY+ + + +L + ++ + CY + + P VSL+F G +L L
Sbjct: 332 YLAEGAYDPFVSAITSVV-SLNARTYLSKGNQCYLVTSSVNDVFPQVSLNFAGGASLILN 390
Query: 442 AKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++YL+ +S G +C F T ++I+G++ + +D+AN RVG+T C
Sbjct: 391 PQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDC 448
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 170/377 (45%), Gaps = 42/377 (11%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G +G YF++IG+GTP + + + +DTGSDI W+ C C C +SD ++D K
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205
Query: 205 TSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SGS 260
S++ + C C D + C+ +CLY V YGDGS T G V + V + SG+
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265
Query: 261 VK------GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCL 305
+ + GCG+ G S+ G+LG G S+ Q+ ++ ++CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325
Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+ D G+ PL++N+ Y V + VGG + +P F
Sbjct: 326 DNVD--GGGIFAIGEVVEPKVNITPLVQNQ---AHYNVVMKEIEVGGDPLDVPSDAF--- 377
Query: 366 EAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
E+GD G I+D GT + + Y L + + +L+ + F TC+D++G
Sbjct: 378 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF-TCFDYTGNVDDG 436
Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTR 478
PTV+LHF +L + YL + +C + + + L+++G++
Sbjct: 437 FPTVTLHFDKSISLTVYPHEYLFQHEFE--WCIGWQNSGAQTKDGKDLTLLGDLVLSNKL 494
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL +G+ C
Sbjct: 495 VVYDLEKQGIGWVEYNC 511
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 166/362 (45%), Gaps = 38/362 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTP ++F++++D+GS + ++ C C +C DP F P SS+YSP+ C
Sbjct: 88 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKC- 146
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
++D + C R C Y+ Y + S + G L + +SFG +K GC +
Sbjct: 147 -----NVDCT-CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 200
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
G LF A G++GLG G LS+ Q + + S + C D G + GG
Sbjct: 201 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGGTMVL----GG 255
Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
++ N +Y + L V G+A+++ P +F G ++D GT
Sbjct: 256 MPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKH----GTVLDSGTTYA 311
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAG 435
L QA+ + +D+ +LK G D C+ +G ++ P V + FG G
Sbjct: 312 YLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNG 371
Query: 436 KALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ L L +NYL G +C F +++G + + T V++D N ++GF
Sbjct: 372 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 431
Query: 494 KC 495
C
Sbjct: 432 NC 433
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 120/373 (32%), Positives = 168/373 (45%), Gaps = 54/373 (14%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
+ VGTPP+ SMV+DTGS+++WL C T Y + FDP S+SY +PC++P C +
Sbjct: 35 LTVGTPPQNVSMVIDTGSELSWLHCNK-TLSYPTT---FDPTRSTSYQTIPCSSPTCTNR 90
Query: 222 -----LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
+ S N C ++Y D S + G+L ++ G+S + G+ GC N
Sbjct: 91 TQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSS-DISGLVFGCMDSVFSSN 149
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL---EFNSARGGDAVTA 329
S GL+G+ G LS Q+ +YC+ D SG+L E N
Sbjct: 150 SDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCISGTD--FSGLLLLGESNLTWSVPLNYT 207
Query: 330 PLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
PLI+ + + F Y V L G V + + IP S FE D G G +VD GT T L
Sbjct: 208 PLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFTFLL 267
Query: 385 TQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLR--------SVRV----PTVSL 430
YN+LR +F L TS V L D + F G S RV PTV+L
Sbjct: 268 GPVYNALRSAF------LNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTL 321
Query: 431 HF-GAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALS----IIGNVQQQGTRVSFD 482
F GA + Y +P + G C +F S L +IG+ QQ + FD
Sbjct: 322 VFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFG-NSDLLGVEAYVIGHHHQQNVWMEFD 380
Query: 483 LANNRVGFTPNKC 495
L +R+G +C
Sbjct: 381 LEKSRIGLAQVRC 393
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 170/360 (47%), Gaps = 34/360 (9%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP+ F++++DTGS + ++ C C +C + DP F P++SS+Y P+ C
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC- 167
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
++D + C +R C+Y+ Y + S + G L + +SFGN + + GC +
Sbjct: 168 -----TIDCN-CDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 221
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
G L+ A G++GLG G LS+ K++ + S + C D ++ +
Sbjct: 222 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGGISPPS 281
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
D A + +Y + L V G+ + + ++F+ G G ++D GT L
Sbjct: 282 DMTFA--YSDPDRSPYYNIDLKEMHVAGKRLPLNANVFD----GKHGTVLDSGTTYAYLP 335
Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGKA 437
A+ + +D+ V+ +LK SG D C+ D S L S P V + FG G
Sbjct: 336 EAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQL-SKSFPVVDMVFGNGHK 394
Query: 438 LDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NY+ G +C F + +++G + + T V +D ++GF C
Sbjct: 395 YSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 454
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 166/369 (44%), Gaps = 46/369 (12%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
+ VG+PP+ +MVLDTGS+++WL C+ Q + +F+P +S +YS +PC +P CK
Sbjct: 73 LTVGSPPQNVTMVLDTGSELSWLHCKKT----QFLNSVFNPLSSKTYSKVPCLSPTCKTR 128
Query: 221 ----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH----DN 272
++ VS C V+Y D + G+L ET G S + GC N
Sbjct: 129 TRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLG-SLTKPATIFGCMDSGFSSN 187
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
+ GL+G+ G LS Q+ +YC+ DS +GVL +A
Sbjct: 188 SEEDSKTTGLIGMNRGSLSFVNQMGYPKFSYCISGFDS--AGVLLLGNASFPWLKPLSYT 245
Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
++ PL +V Y V L G V + + +P S+F D G G +VD GT T
Sbjct: 246 PLVQISTPLPYFDRVA--YTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTF 303
Query: 383 LQTQAYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCY--DFSGLRSVRVPTVSLHF-G 433
L Y +L++ F+ + G LK + D CY D S +P VSL F G
Sbjct: 304 LLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQG 363
Query: 434 AGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
A ++ Y +P + G +CF F S L +IG+ QQ + FDL +
Sbjct: 364 AEMSVSGERLLYRVPGEVRGRDSVWCFTFG-NSDLLGVEAFVIGHHHQQNVWMEFDLEKS 422
Query: 487 RVGFTPNKC 495
R+G +C
Sbjct: 423 RIGLADVRC 431
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 172/375 (45%), Gaps = 46/375 (12%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLP 213
YF+++G+G P + + + +DTGSD+ W+ CRPC+ C ++S ++DP+ SS+ S +
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61
Query: 214 CAAPQC---KSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGSVK 262
C+ P C + + C N C Y +YGDGS + G V + + + G + +
Sbjct: 62 CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121
Query: 263 GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
+ GC G S G++G G LS+ Q+ A ++CL
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGG 181
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
++ A G T PL+ + Y V L G SV ++P + D G+I
Sbjct: 182 ILVIGGIAEPGMTYT-PLVPDS---VHYNVVLRGISVNSN--RLPIDAEDFSSTNDTGVI 235
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
+D GT + + AYN + +R A + P + C+ SG S P V+L+F
Sbjct: 236 MDSGTTLAYFPSGAYNVFVQA-IREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNF- 293
Query: 434 AGKALDLPAKNYLI-----PVDSAGTFCFAFAPTSSA--------LSIIGNVQQQGTRVS 480
G A++L NYL+ P + +C + +SS+ L+I+G++ + V
Sbjct: 294 EGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVV 353
Query: 481 FDLANNRVGFTPNKC 495
+DL N+R+G+ C
Sbjct: 354 YDLDNSRIGWMSYNC 368
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 46/375 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+R+ +G P ++F + +DTGSDI W+ C PCT C S F+P +SS+ S
Sbjct: 89 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 148
Query: 212 LPCAAPQCKS--------LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
+ C+ +C + S +++ C Y YGDGS T G V++T+ F GN
Sbjct: 149 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 208
Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVD 307
+ S I GC + G + G+ G G LS+ Q+ + + ++CL
Sbjct: 209 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 268
Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
D+ G+L V PL+ ++ Y + L +V GQ + I SLF
Sbjct: 269 SDN-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIAVNGQKLPIDSSLFTTSNT 324
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVR 424
G IVD GT + L AY D FV +A + P+ S V+ C+ S
Sbjct: 325 --QGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 378
Query: 425 VPTVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVS 480
PTV+L+F G A+ + +NYL+ VD++ +C + ++I+G++ +
Sbjct: 379 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 438
Query: 481 FDLANNRVGFTPNKC 495
+DLAN R+G+ C
Sbjct: 439 YDLANMRMGWADYDC 453
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 174/357 (48%), Gaps = 36/357 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCT-ECYQQSDPIFDPKTSSSYSPLPC 214
G Y +GTPP++ + + DTGSD+ W +C CT C Q P + P SS+++ LPC
Sbjct: 89 GAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPC 148
Query: 215 AAPQC---KSLDVSACRAN--RCLYQVAYG----DGSFTVGDLVTETVSFGNSGSVKGIA 265
+ C +S V+ C A C Y+ +YG D +T G L ET + G + +V +
Sbjct: 149 SDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG-ADAVPSVR 207
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARG 323
GC +EG + +GL+GLG G LSL Q+ A++ YCL S AS +L S G
Sbjct: 208 FGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDASKASPLLFGSLASLTG 267
Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
+ L+ + TFY V L S+G P + E + G++ D GT +T L
Sbjct: 268 AQVQSTGLLAST---TFYAVNLRSISIGSATT---PGVGEPE-----GVVFDSGTTLTYL 316
Query: 384 QTQAYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSG---LRSVRVPTVSLHFGAGKAL 438
AY+ + +F+ ++ T G F+ C+ L + VPT+ LHF G +
Sbjct: 317 AEPAYSEAKAAFLSQTSLDQVEDTDG---FEACFQKPANGRLSNAAVPTMVLHFD-GADM 372
Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP NY++ V+ G C+ S +LSIIGN+ Q V D+ + + F P C
Sbjct: 373 ALPVANYVVEVED-GVVCW-IVQRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 161/361 (44%), Gaps = 38/361 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y +RI +GTPP+ F++++DTGS + ++ C C +C + DP F P SS+Y PL C+
Sbjct: 90 GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149
Query: 217 P-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
C S C+Y Y + S + G L + VSFG +K GC +
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203
Query: 274 GLFVG--SAGLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
G + G++GLG G LS+ Q + S + C D ++ GG +
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMV-----LGGIS 258
Query: 327 VTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
A ++ + +Y + L + G+ + I P +F+ G G I+D GT L
Sbjct: 259 PPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYL 314
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGK 436
A+ + +D+ ++ +LK G D C+ D S L S P V L F G
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQL-SKTFPAVDLVFSNGN 373
Query: 437 ALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
L L +NYL A G +C F + +++G + + T V +D + ++GF
Sbjct: 374 RLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433
Query: 495 C 495
C
Sbjct: 434 C 434
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 46/375 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+R+ +G P ++F + +DTGSDI W+ C PCT C S F+P +SS+ S
Sbjct: 87 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 146
Query: 212 LPCAAPQCKS--------LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
+ C+ +C + S +++ C Y YGDGS T G V++T+ F GN
Sbjct: 147 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 206
Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVD 307
+ S I GC + G + G+ G G LS+ Q+ + + ++CL
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 266
Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
D+ G+L V PL+ ++ Y + L +V GQ + I SLF
Sbjct: 267 SDN-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIAVNGQKLPIDSSLFTTSNT 322
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVR 424
G IVD GT + L AY D FV +A + P+ S V+ C+ S
Sbjct: 323 --QGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 376
Query: 425 VPTVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVS 480
PTV+L+F G A+ + +NYL+ VD++ +C + ++I+G++ +
Sbjct: 377 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 436
Query: 481 FDLANNRVGFTPNKC 495
+DLAN R+G+ C
Sbjct: 437 YDLANMRMGWADYDC 451
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 165/385 (42%), Gaps = 67/385 (17%)
Query: 159 YFSRIGVGTPP--RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP--- 213
Y +GVGT + + +D + +W+QC PC C Q +P+FDP S ++ P+
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHN 160
Query: 214 ---CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL 266
C P D RC + +AY +G+ G L +T SF N + GI
Sbjct: 161 AVLCRPPYHPLQD------GRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVF 214
Query: 267 GCGH-----DNEGLFVGSAGLLGLGGG-----MLSLTKQIKATS---LAYCLVDRDSPAS 313
GC + D G AG+LG+G G + +Q+ +YC + + A
Sbjct: 215 GCANRIARFDTHGAL---AGVLGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAY 271
Query: 314 GVLEFNS----------ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP---PS 360
L F + R AV AP ++ YYV L G SVG A+++P P
Sbjct: 272 SFLRFGNDIPSQPPAGVHRQSMAVLAPTTTSEA----YYVKLAGISVG--ALRVPGVTPE 325
Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYD 416
+FE D+ G GG +D GT +T + AY ++R R + G L C
Sbjct: 326 MFERDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVRGHLQRNRARFVQSPGHHL---CVH 382
Query: 417 FSGLRSVRVPTVSLHFGAGKALDL-PAKNYLI---PVDSAGTFCFAFAPTSSALSIIGNV 472
+ R+P+++LHF G L + P +L+ P C P + +++IG +
Sbjct: 383 RTPAIEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPDAE-MTVIGAM 441
Query: 473 QQQGTRVSFDLANN--RVGFTPNKC 495
QQ TR FDL NN V F P C
Sbjct: 442 QQIDTRFIFDLHNNIPIVSFNPEDC 466
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 170/370 (45%), Gaps = 39/370 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSP 211
G Y++R+ +GTPPR F + +DTGSD+ W+ C C C S P+ FDP +S + S
Sbjct: 50 GLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASL 109
Query: 212 LPCAAPQC-----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN--SGSVKG- 263
+ C+ +C S V + + N C Y YGDGS T G V++ + F GSV
Sbjct: 110 ISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNN 169
Query: 264 ----IALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDS 310
I GC G S G+ G G +S+ Q I + ++CL DS
Sbjct: 170 SSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDS 229
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G+L + V PL+ ++ Y + + SV GQ + I PS+F +
Sbjct: 230 -GGGILVLGEIVEPNIVYTPLVPSQP---HYNLNMQSISVNGQTLAIDPSVFGTSSS--Q 283
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G I+D GT + L AY+ + + +++P ++ + CY S + P VS
Sbjct: 284 GTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVRPY--LSKGNHCYLISSSINDIFPQVS 341
Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
L+F G ++ L ++YLI S G +C F ++I+G++ + +D+AN
Sbjct: 342 LNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDIAN 401
Query: 486 NRVGFTPNKC 495
R+G+ C
Sbjct: 402 QRIGWANYDC 411
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 46/375 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+R+ +G P ++F + +DTGSDI W+ C PCT C S F+P +SS+ S
Sbjct: 3 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 62
Query: 212 LPCAAPQCKS--------LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
+ C+ +C + S +++ C Y YGDGS T G V++T+ F GN
Sbjct: 63 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 122
Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVD 307
+ S I GC + G + G+ G G LS+ Q+ + + ++CL
Sbjct: 123 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 182
Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
D+ G+L V PL+ ++ Y + L +V GQ + I SLF
Sbjct: 183 SDN-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIAVNGQKLPIDSSLFTTSNT 238
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVR 424
G IVD GT + L AY D FV +A + P+ S V+ C+ S
Sbjct: 239 --QGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 292
Query: 425 VPTVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVS 480
PTV+L+F G A+ + +NYL+ VD++ +C + ++I+G++ +
Sbjct: 293 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 352
Query: 481 FDLANNRVGFTPNKC 495
+DLAN R+G+ C
Sbjct: 353 YDLANMRMGWADYDC 367
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 161/361 (44%), Gaps = 38/361 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y +RI +GTPP+ F++++DTGS + ++ C C +C + DP F P SS+Y PL C+
Sbjct: 90 GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149
Query: 217 P-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
C S C+Y Y + S + G L + VSFG +K GC +
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203
Query: 274 GLFVG--SAGLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
G + G++GLG G LS+ Q + S + C D ++ GG +
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMV-----LGGIS 258
Query: 327 VTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
A ++ + +Y + L + G+ + I P +F+ G G I+D GT L
Sbjct: 259 PPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYL 314
Query: 384 QTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGK 436
A+ + +D+ ++ +LK G D C+ D S L S P V L F G
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQL-SKTFPAVDLVFSNGN 373
Query: 437 ALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
L L +NYL A G +C F + +++G + + T V +D + ++GF
Sbjct: 374 RLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433
Query: 495 C 495
C
Sbjct: 434 C 434
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 158/368 (42%), Gaps = 56/368 (15%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPK 204
VVS S EY + +G+PPR + DTGSD+ W++C+ T FDP
Sbjct: 90 VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149
Query: 205 TSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--- 260
SS+Y + C C++L + C + C Y AYGDGS T G L TET +F + G+
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRS 209
Query: 261 -----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVDRDSP 311
+ G+ GC G F + GG + +T+ ATSL +YCLV
Sbjct: 210 PRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269
Query: 312 ASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
AS L F + A + PL+ NK V + A
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVGNKTVAS---------------------------AA 302
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVR--- 424
IIVD GT +T L + D R L P S L CY+ +G R V
Sbjct: 303 SSRIIVDSGTTLTFLDPSLLGPIVDELSRRI-TLPPVQSPDGLLQLCYNVAG-REVEAGE 360
Query: 425 -VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSF 481
+P ++L FG G A+ L +N + V GT C A T+ +SI+GN+ QQ V +
Sbjct: 361 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 419
Query: 482 DLANNRVG 489
DL VG
Sbjct: 420 DLDAGTVG 427
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 10/131 (7%)
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVR----VP 426
IIVD GT +T L + D R L P S L CY+ +G R V +P
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRI-TLPPVQSPDGLLQLCYNVAG-REVEAGESIP 496
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLA 484
++L FG G A+ L +N + V GT C A T+ +SI+GN+ QQ V +DL
Sbjct: 497 DLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLD 555
Query: 485 NNRVGFTPNKC 495
V F C
Sbjct: 556 AGTVTFAVADC 566
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 82/252 (32%), Positives = 132/252 (52%), Gaps = 22/252 (8%)
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA----- 312
+ + GCG + G VG++GL+GL G +SL Q+ +YCL + SP
Sbjct: 92 RALGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFAERKTSPMLFGAM 151
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
+ + ++N+ G T ++RN +DTFYY V L G S+G + +++P + ++ G GG
Sbjct: 152 ADLRKYNTT--GPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPDGTGG 209
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFS---GLRSVRVPT 427
IVD G+ + L +A+++++ + + A L +G V ++ C+ + +V+ P
Sbjct: 210 TIVDSGSTMAHLAGKAFDAVKKAVLE-AVKLPVFNGTVEDYELCFAVPSGVAMAAVKTPP 268
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDL 483
+ LHF G A+ LP NY AG C A A + + +SIIGNVQQQ V FD+
Sbjct: 269 LVLHFDGGAAMALPRDNYFQ-EPRAGLMCLAVARSPEDLGAPISIIGNVQQQNMHVLFDV 327
Query: 484 ANNRVGFTPNKC 495
N + F P KC
Sbjct: 328 HNQKFSFAPTKC 339
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/389 (30%), Positives = 168/389 (43%), Gaps = 72/389 (18%)
Query: 175 VLDTGSDINWLQCRPC----------TECYQQSDPIFDPKTSSSYSPLPC---------A 215
V+DTGSD+ W QC C C+ Q+ P ++ S + +PC
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 216 APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
AP+ + C+ +YG G +G L T+ +F +S SV +A GC
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVT-LAFGCVSQTRI 194
Query: 274 --GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSARGGD---- 325
G G++G++GLG G LSL Q+ AT +YCL RD+ + L
Sbjct: 195 SPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPYFRDTVSPSHLFVGDGELAGLRAA 254
Query: 326 ----------AVTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD--- 369
T P +N K TFYY+ L G + G V +P F++ EA
Sbjct: 255 AGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLREAAPKVW 314
Query: 370 -GGIIVDCGTAITRLQTQAYNSLRDSFVRL---AGNLKPTS---GVALFDTCY----DFS 418
GG ++D G+ TRL A+ +L R +G+L P G AL + C D
Sbjct: 315 AGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGAL-ELCVEAGDDGD 373
Query: 419 GLRSVRVPTVSLHF----GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--------L 466
L + VP + L F G G+ L +PA+ Y V+ A T+C A ++S
Sbjct: 374 SLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVE-ASTWCMAVVSSASGNATLPTNET 432
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+IIGN QQ RV +DLAN + F P C
Sbjct: 433 TIIGNFMQQDMRVLYDLANGLLSFQPANC 461
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 122/411 (29%), Positives = 166/411 (40%), Gaps = 87/411 (21%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP- 213
G+Y +G+ + S+ +DTGSD+ W C P C C + PK S PLP
Sbjct: 74 GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGK------PKIQS---PLPK 124
Query: 214 ---------------------------CAAPQC--KSLDVSACRANRCL-YQVAYGDGSF 243
CA +C +S+++S C + C + AYGDGS
Sbjct: 125 IANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL 184
Query: 244 TVGDLVTETVSFGNSG-----SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
V L +++S +V+ GC H G VG AG G G+LS+ Q+
Sbjct: 185 -VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLAT 240
Query: 299 TS------LAYCLV------DR-DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
S +YCLV DR P+ +L + + L+ N K FY VGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300
Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
G SVG + P L ++DE G GG++VD GT T L Y S+ F G +
Sbjct: 301 AGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360
Query: 406 SGVALFDT----CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAGTFCFAFA 460
+ +T CY + SV VP V LHF G + LP KNY G
Sbjct: 361 ARRIEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 418
Query: 461 PTSSALSI----------------IGNVQQQGTRVSFDLANNRVGFTPNKC 495
L + +GN QQQG V +DL NRVGF +C
Sbjct: 419 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 469
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 152/356 (42%), Gaps = 36/356 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP+ S ++D ++ W QC C+ C++Q P+F P SS++ P PC CKS
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108
Query: 225 SACRANRCLYQVAYG---DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE-GLFVGSA 280
S C + C Y+ D T+G + TET + G + +A GC ++ G++
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGT--ATASLAFGCVVASDIDTMDGTS 166
Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKK 336
G +GLG SL Q+K T +YCL R + S L S A G TAP I+
Sbjct: 167 GFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKTSP 226
Query: 337 VDT---FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQTQAYNSLR 392
D +Y + L G + A GGI+V + + L AY + +
Sbjct: 227 DDDSHHYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVDSAYRAFK 277
Query: 393 DSFVRLAGN---LKPTSGVALFDTCY-DFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLI 447
+ G + FD C+ +G P + F G G AL +P YLI
Sbjct: 278 KAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKYLI 337
Query: 448 PV-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
V + T C A + +S++G++QQ+ +DL + F P C
Sbjct: 338 DVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADC 393
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 163/367 (44%), Gaps = 35/367 (9%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +G+PP +F++ +DTGSDI W+ C C+ C S FD S +
Sbjct: 98 GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGS 157
Query: 212 LPCAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GS 260
+ C+ P C S+ + C N+C Y YGDGS T G +T+T F G S S
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217
Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
I GC G S G+ G G G LS+ Q+ + + ++CL D
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGS 276
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
GV V +PL+ ++ Y + L V GQ + I ++FE G
Sbjct: 277 GGGVFVLGEILVPGMVYSPLLPSQP---HYNLNLLSIGVNGQILPIDAAVFEASNT--RG 331
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
IVD GT +T L +AY+ ++ L T ++ + CY S S P VSL+
Sbjct: 332 TIVDTGTTLTYLVKEAYDPFLNAISNSVSQLV-TLIISNGEQCYLVSTSISDMFPPVSLN 390
Query: 432 FGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
F G ++ L ++YL D A +C F +I+G++ + +DLA R+
Sbjct: 391 FAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRI 450
Query: 489 GFTPNKC 495
G+ C
Sbjct: 451 GWANYDC 457
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 174/377 (46%), Gaps = 46/377 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++G+G P + + + +DTGSD+ W+ CRPC+ C ++S ++DP+ SS+ S
Sbjct: 27 GLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSL 86
Query: 212 LPCAAPQC---KSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGS 260
+ C+ P C + + C N C Y +YGDGS + G V + + + G + +
Sbjct: 87 VSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146
Query: 261 VKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSP 311
+ GC G S G++G G LS+ Q+ A ++CL
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 206
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
++ A G T PL+ + Y V L G SV + ++P + D G
Sbjct: 207 GGILVIGGIAEPGMTYT-PLVPDS---VHYNVVLRGISV--NSNRLPIDAEDFSSTNDTG 260
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
+I+D GT + + AYN + +R A + P + C+ SG S P V+L+
Sbjct: 261 VIMDSGTTLAYFPSGAYNVFVQA-IREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLN 319
Query: 432 FGAGKALDLPAKNYLI-----PVDSAGTFCFAFAPTSSA--------LSIIGNVQQQGTR 478
F G A++L NYL+ P + +C + +SS+ L+I+G++ +
Sbjct: 320 F-EGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKL 378
Query: 479 VSFDLANNRVGFTPNKC 495
V +DL N+R+G+ C
Sbjct: 379 VVYDLDNSRIGWMSYNC 395
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 167/362 (46%), Gaps = 38/362 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +G+PP++F++++DTGS + ++ C C +C DP F P+ SS+Y P+ C
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145
Query: 216 APQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
A C C N +C Y+ Y + S + G L + +SFG + + GC
Sbjct: 146 A-DCN------CDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETM 198
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
G L+ A G++GLG G LS+ Q + + S + C D ++ GG
Sbjct: 199 ESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMV-----LGG 253
Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+ ++ + +Y + L V G+ +++ P F+ G G I+D GT
Sbjct: 254 ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYA 309
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPT----VSLHFGAG 435
+AY + +D+ ++ LK SG D C+ +G +P V + F G
Sbjct: 310 YFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369
Query: 436 KALDLPAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ + L +NYL +G +C F + +++G + + T V+++ N+ +GF
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429
Query: 494 KC 495
C
Sbjct: 430 NC 431
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 107/369 (28%), Positives = 167/369 (45%), Gaps = 43/369 (11%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLP 213
YF+++ +G+PP +F++ +DTGSDI W+ C C+ C S FD S + +
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164
Query: 214 CAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GSVK 262
C+ P C S+ + C N+C Y YGDGS T G +T+T F G S S
Sbjct: 165 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 224
Query: 263 GIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
I GC G S G+ G G G LS+ Q+ + + ++CL D
Sbjct: 225 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGSGG 283
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
GV V +PL+ ++ Y + L V GQ + + ++FE + G I
Sbjct: 284 GVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIGVNGQMLPLDAAVFE--ASNTRGTI 338
Query: 374 VDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
VD GT +T L +AY N++ +S +L T ++ + CY S S P+VS
Sbjct: 339 VDTGTTLTYLVKEAYDLFLNAISNSVSQLV-----TPIISNGEQCYLVSTSISDMFPSVS 393
Query: 430 LHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
L+F G ++ L ++YL D A +C F +I+G++ + +DLA
Sbjct: 394 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 453
Query: 487 RVGFTPNKC 495
R+G+ C
Sbjct: 454 RIGWASYDC 462
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 168/396 (42%), Gaps = 65/396 (16%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQSDPIFDPKTSSSYSPLPCAA-P 217
+ VG PP+ +MVLDTGS+++WL C P T Q+ F+ SS+Y+ C++ P
Sbjct: 63 VAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSSSP 122
Query: 218 QC----KSLDV----SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI----- 264
+C + L V + +N C ++Y D S G L +T G + V+ +
Sbjct: 123 ECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGVLAADTFLLGGAPPVRALFGCIT 182
Query: 265 -------ALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS 313
A G G+ N+ S+ GLLG+ G LS Q AYC+ D P
Sbjct: 183 SYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTGTLRFAYCIAPGDGP-- 240
Query: 314 GVLEFNSARGGDAVTA-------PLIRNKKVDTF-----YYVGLTGFSVGGQAVQIPPSL 361
G+L G A++A PLI + + Y V L G VG + IP S+
Sbjct: 241 GLLVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRVAYSVQLEGIRVGAALLPIPKSV 300
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG------VALFDTCY 415
D G G +VD GT T L AY L+ F+ L G FD C+
Sbjct: 301 LAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGEPDFVFQGAFDACF 360
Query: 416 DFSGLR------SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPT 462
S R S +P V L GA A+ Y++P + G +C F +
Sbjct: 361 RASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRGEGGSEAVWCLTFGNS 420
Query: 463 SSA---LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
A +IG+ QQ V +DL N+RVGF P +C
Sbjct: 421 DMAGMSAYVIGHHHQQNVWVEYDLQNSRVGFAPARC 456
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 180/370 (48%), Gaps = 40/370 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSP 211
G YF+R+ +G+PP++F + +DTGSD+ W+ C C C Q S P+ FDP +SS+ S
Sbjct: 66 GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 125
Query: 212 LPCAAPQCKSLDVSACRA------NRCLYQVAYGDGSFTVGDLVTETVSF----GNS--G 259
+ C+ +C SL V + A N+C+Y YGDGS T G V++ ++F G+S
Sbjct: 126 ISCSDQRC-SLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 184
Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
S I GC G S G+ G G +S+ Q+ + + ++CL
Sbjct: 185 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 244
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
++ D V +PL+ ++ Y + L SV G+++ I P +F + +
Sbjct: 245 GGGILVLGEIVE-EDIVYSPLVPSQP---HYNLNLQSISVNGKSLAIDPEVFAT--STNR 298
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G IVD GT + L +AY+ + ++ +++P ++ CY + PTVS
Sbjct: 299 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL--LSKGTQCYLITSSVKGIFPTVS 356
Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
L+F G +++L ++YL+ +S G +C F ++I+G++ + +DLA
Sbjct: 357 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 416
Query: 486 NRVGFTPNKC 495
R+G+ C
Sbjct: 417 QRIGWANYDC 426
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 180/370 (48%), Gaps = 40/370 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSP 211
G YF+R+ +G+PP++F + +DTGSD+ W+ C C C Q S P+ FDP +SS+ S
Sbjct: 81 GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 140
Query: 212 LPCAAPQCKSLDVSACRA------NRCLYQVAYGDGSFTVGDLVTETVSF----GNS--G 259
+ C+ +C SL V + A N+C+Y YGDGS T G V++ ++F G+S
Sbjct: 141 ISCSDQRC-SLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 199
Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
S I GC G S G+ G G +S+ Q+ + + ++CL
Sbjct: 200 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 259
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
++ D V +PL+ ++ Y + L SV G+++ I P +F + +
Sbjct: 260 GGGILVLGEIVE-EDIVYSPLVPSQP---HYNLNLQSISVNGKSLAIDPEVFA--TSTNR 313
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G IVD GT + L +AY+ + ++ +++P ++ CY + PTVS
Sbjct: 314 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL--LSKGTQCYLITSSVKGIFPTVS 371
Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
L+F G +++L ++YL+ +S G +C F ++I+G++ + +DLA
Sbjct: 372 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 431
Query: 486 NRVGFTPNKC 495
R+G+ C
Sbjct: 432 QRIGWANYDC 441
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 153/349 (43%), Gaps = 29/349 (8%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ-SDPIFDPKTSSSYSPLPCAAP 217
+ +G PP ++DTGS + W+QC PC C QQ P+FDP SS+Y L C
Sbjct: 102 FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNI 161
Query: 218 QCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNS----GSVKGIALGCGHDN 272
C+ C +++C+Y Y +G +VG + TE + FG+S +V + GC H N
Sbjct: 162 ICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRN 221
Query: 273 EGLFVGS--AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP--ASGVLEFNSARGGDAVT 328
G + G+ GLG G+ S+ Q+ + +YC+ + P + L + + +
Sbjct: 222 -GNYKDRRFTGVFGLGSGITSVVNQM-GSKFSYCIGNIADPDYSYNQLVLSEGVNMEGYS 279
Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
PL VD Y V L G SVG + I PS F+ E +I+D GTA T L Y
Sbjct: 280 TPL---DVVDGHYQVILEGISVGETRLVIDPSAFKRTEK-QRRVIIDSGTAPTWLAENEY 335
Query: 389 NSLRDSFVRLAGN-LKPTSGVALFDTCYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNYL 446
+L L L P + CY G V P V+ HF G L
Sbjct: 336 RALEREVRNLLDRFLTPFMRESFL--CYKGKVGQDLVGFPAVTFHFAEGADL-------- 385
Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
VD+ + S+IG + QQ V++DL +++ F C
Sbjct: 386 -VVDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 172/369 (46%), Gaps = 38/369 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +GTPP +F++ +DTGSDI W+ C C C + S FD +SSS S
Sbjct: 77 GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136
Query: 212 LPCAAPQCKS---LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---G 259
+ C+ P C S + C ++N+C Y YGDGS T G V+E++ F G S
Sbjct: 137 VSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIAN 196
Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
S + GC G S G+ G G G LS+ Q+ A + ++CL +
Sbjct: 197 SSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL-KGEG 255
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G+L V +PL+ ++ Y L SV GQ + I PS+F + +
Sbjct: 256 NGGGILVLGEVLEPGIVYSPLVPSQPHYNLY---LQSISVNGQTLPIDPSVFA--TSINR 310
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G I+D GT + L +AY + ++ ++ PT ++ + CY S P VS
Sbjct: 311 GTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPT--ISKGNQCYLVSTSVGEIFPLVS 368
Query: 430 LHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
L+F ++ L + YL+ + D A +C F ++I+G++ + +DLA
Sbjct: 369 LNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQ 428
Query: 487 RVGFTPNKC 495
R+G+ C
Sbjct: 429 RIGWASYDC 437
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 156/372 (41%), Gaps = 39/372 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP---IFDPKTSSSY 209
G Y + GTPP+ +++DTGSD+ W C C C + S+P IF PK+SSS
Sbjct: 88 GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVKGIALG 267
L C P+C + S ++ RC + + F + L
Sbjct: 148 KVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLNFLRFWDHRRSQFHRRMLC 206
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSARGG 324
H + + G G G SL Q+ +YCL+ R D+ S L +
Sbjct: 207 PLHQST-----RREISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESDS 261
Query: 325 DAVTA-----PLIRNKKV------DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
TA P ++N KV +YY+GL +VGG+ V+IP GDGG I
Sbjct: 262 GEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTI 321
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT--SGVALFDTCYDFSGLRSVRVPTVSLH 431
+D GT T ++ + + + F + + + T G+ C++ SGL + P ++L
Sbjct: 322 IDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLK 381
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------IIGNVQQQGTRVSFDL 483
F G ++LP NY+ + C +A I+GN QQQ V +DL
Sbjct: 382 FRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEYDL 441
Query: 484 ANNRVGFTPNKC 495
N R+GF C
Sbjct: 442 RNERLGFRQQSC 453
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 152/355 (42%), Gaps = 35/355 (9%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP+ S ++D ++ W QC C+ C++Q P+F P SS++ P PC CKS
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108
Query: 225 SACRANRCLYQVAYG---DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE-GLFVGSA 280
S C + C Y+ D T+G + TET + G + +A GC ++ G++
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGT--ATASLAFGCVVASDIDTMDGTS 166
Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKK 336
G +GLG SL Q+K T +YCL R + S L S A G TAP I+
Sbjct: 167 GFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKTSP 226
Query: 337 VDT---FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQTQAYNSLR 392
D +Y + L G + A GGI+V + + L AY + +
Sbjct: 227 DDDSHHYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVDSAYRAFK 277
Query: 393 DSFVRLAGNL--KPTSGVAL-FDTCY-DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
+ G +P + FD C+ +G P + F AL +P YLI
Sbjct: 278 KAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLID 337
Query: 449 V-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
V + T C A + +S++G++QQ+ +DL + F P C
Sbjct: 338 VGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADC 392
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 167/362 (46%), Gaps = 38/362 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +G+PP++F++++DTGS + ++ C C +C DP F P+ SS+Y P+ C
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145
Query: 216 APQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
A C C N +C Y+ Y + S + G L + +SFG + + GC
Sbjct: 146 A-DCN------CDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETM 198
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
G L+ A G++GLG G LS+ Q + + S + C D ++ GG
Sbjct: 199 ESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMV-----LGG 253
Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
+ ++ + +Y + L V G+ +++ P F+ G G I+D GT
Sbjct: 254 ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYA 309
Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPT----VSLHFGAG 435
+AY + +D+ ++ LK SG D C+ +G +P V + F G
Sbjct: 310 YFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369
Query: 436 KALDLPAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
+ + L +NYL +G +C F + +++G + + T V+++ N+ +GF
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429
Query: 494 KC 495
C
Sbjct: 430 NC 431
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 107/203 (52%), Gaps = 26/203 (12%)
Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
L D AR N+L + + A + A A E P+ SG + Y + I +
Sbjct: 107 LAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAE---VPLTSGIRFQTLNYVTTIAL 163
Query: 166 GTPPR------QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
G ++++DTGSD+ W+QC+PC+ CY Q DP+FDP S+SY+ +PC A C
Sbjct: 164 GGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASAC 223
Query: 220 KSLDVSAC----------------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
++ +A ++ RC Y +AYGDGSF+ G L T+TV+ G + SV G
Sbjct: 224 EASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDG 282
Query: 264 IALGCGHDNEGLFVGSAGLLGLG 286
GCG N GLF G+AGL+GLG
Sbjct: 283 FVFGCGLSNRGLFGGTAGLMGLG 305
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 5/129 (3%)
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVS 429
+++D GT ITRL Y ++R F R G + + +L D CY+ +G V+VP ++
Sbjct: 346 VLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLT 405
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
L G + + A L G+ C A A S IIGN QQ+ RV +D +
Sbjct: 406 LRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGS 465
Query: 487 RVGFTPNKC 495
R+GF C
Sbjct: 466 RLGFADEDC 474
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 168/399 (42%), Gaps = 85/399 (21%)
Query: 168 PPRQFSMVLDTGSDINWLQCRP--CTECY---QQSDPIFDPKTSSSYS---PLP------ 213
PP+ ++ +DTGSD+ W C P C C Q + P K + S S P
Sbjct: 85 PPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSCQSPACSAAHAS 144
Query: 214 ------CAAPQC--KSLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
CA +C ++ S C + C + AYGDGSF V +L +T+S +S ++
Sbjct: 145 MSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSF-VANLYQQTLSL-SSLHLQNF 202
Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLV------DR-DSP 311
GC H G+ G G G+LSL Q+ S +YCLV DR P
Sbjct: 203 TFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDGDRLRRP 259
Query: 312 ASGVLEFNSARGGDAVTAP------------LIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
+ +L R D +T ++ N K +Y VGL G SVG + V P
Sbjct: 260 SPLIL----GRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKRTVPAPE 315
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS-------FVRLAGNLKPTSGVALFD 412
L +DE G+GG++VD GT T L YN++ + F + A ++ +G+
Sbjct: 316 ILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTGLG--- 372
Query: 413 TCYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG--------TFCFAFAPTS 463
CY +GL ++P + LHF G + LP KNY G C
Sbjct: 373 PCYYLNGLS--QIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLMNGE 430
Query: 464 SALSI-------IGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +GN QQQG V +DL RVGF +C
Sbjct: 431 DETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 167/370 (45%), Gaps = 46/370 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF++I +G+PP+++ + +DTGSDI W+ C+PC EC +++ +FD SS+
Sbjct: 72 GLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKK 131
Query: 212 LPCAAPQCKSLDVS-ACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKG----- 263
+ C C + S +C+ A C Y + Y D S + G+ + + ++ +G ++
Sbjct: 132 VGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQ 191
Query: 264 -IALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
+ GCG D G S G++G G S+ Q+ AT ++CL +
Sbjct: 192 EVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL--DNVKGG 249
Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
G+ T P++ N+ Y V L G V G A+ +PPS+ +GG I
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQ---MHYNVMLMGMDVDGTALDLPPSIMR-----NGGTI 301
Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT--CYDFSGLRSVRVPTVSLH 431
VD GT + Y+SL ++ + +P + DT C+ FS V P VS
Sbjct: 302 VDSGTTLAYFPKVLYDSLIETILA----RQPVKLHIVEDTFQCFSFSENVDVAFPPVSFE 357
Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAP------TSSALSIIGNVQQQGTRVSFDLAN 485
F L + +YL ++ +CF + + + ++G++ V +DL N
Sbjct: 358 FEDSVKLTVYPHDYLFTLEKE-LYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLEN 416
Query: 486 NRVGFTPNKC 495
+G+ + C
Sbjct: 417 EVIGWADHNC 426
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 173/369 (46%), Gaps = 40/369 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPPR+F++ +DTGSD+ W+ C C C + S+ FDP SSS S
Sbjct: 82 GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141
Query: 212 LPCAAPQCKS--LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNS-------GSV 261
+ C+ +C S S C N C Y YGDGS T G +++ +SF S
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSS 201
Query: 262 KGIALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDSPA 312
GC + G G+ GLG G LS+ Q+ LA +CL D
Sbjct: 202 APFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL-KGDKSG 260
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G++ + D V PL+ ++ Y V L +V GQ + I PS+F + GDG I
Sbjct: 261 GGIMVLGQIKRPDTVYTPLVPSQP---HYNVNLQSIAVNGQILPIDPSVFTI-ATGDGTI 316
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDT--CYDFSGLRSVRVPTVS 429
I D GT + L +AY+ F++ N G + +++ C++ + P VS
Sbjct: 317 I-DTGTTLAYLPDEAYS----PFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVS 371
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT--FCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
L F G ++ L + YL S+G+ +C F S ++I+G++ + V +DL
Sbjct: 372 LSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQ 431
Query: 487 RVGFTPNKC 495
R+G+ C
Sbjct: 432 RIGWAEYDC 440
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 164/372 (44%), Gaps = 43/372 (11%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G YF+ I +GTPP+++ + +DTGSDI W+ C C +C ++S +DPK SSS S
Sbjct: 81 TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGS 140
Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
+ C C + + C AN C Y V YGDGS T G VT+ + F G+ +
Sbjct: 141 TVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQP 200
Query: 263 G---IALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRD 309
G + GCG +G +GS+ G+LG G S+ Q+ A A+CL
Sbjct: 201 GNATVTFGCGA-QQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCL--DT 257
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
G+ + T PL+ + Y V L VGG +Q+P +FE E
Sbjct: 258 IKGGGIFAIGNVVQPKVKTTPLVADMP---HYNVNLKSIDVGGTTLQLPAHVFETGER-- 312
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G I+D GT +T L + + + ++ V F C+ + G PT++
Sbjct: 313 KGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIV-FHNVQDF-MCFQYPGSVDDGFPTIT 370
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDL 483
HF AL + Y P + +C F + + ++G++ V +DL
Sbjct: 371 FHFEDDLALHVYPHEYFFP-NGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDL 429
Query: 484 ANNRVGFTPNKC 495
N +G+T C
Sbjct: 430 ENQVIGWTDYNC 441
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 177/369 (47%), Gaps = 37/369 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +G+PPR+F++ +DTGSDI W+ C C +C + S FDP +SS+ S
Sbjct: 84 GLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSL 143
Query: 212 LPCAAPQCKSL---DVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---G 259
+ C+ P C SL + C ++N+C Y YGDGS T G V++ + F G+S
Sbjct: 144 VSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIAN 203
Query: 260 SVKGIALGCGHDNEGLF--VGSA--GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
S I GC G V A G+ G G LS+ Q+ + + ++CL +
Sbjct: 204 SSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCL-KGEG 262
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G L + + +PL+ ++ + Y + L SV GQ + I P++F + +
Sbjct: 263 DGGGKLVLGEILEPNIIYSPLVPSQ---SHYNLNLQSISVNGQLLPIDPAVFA--TSNNQ 317
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
G IVD GT +T L AY+ + + T ++ + CY S P VSL
Sbjct: 318 GTIVDSGTTLTYLVETAYDPFVSAITATVSS-STTPVLSKGNQCYLVSTSVDEIFPPVSL 376
Query: 431 HFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
+F G ++ L YL+ + D A +C F + ++I+G++ + +DLA+
Sbjct: 377 NFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQ 436
Query: 487 RVGFTPNKC 495
R+G+ C
Sbjct: 437 RIGWANYDC 445
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 116/387 (29%), Positives = 164/387 (42%), Gaps = 65/387 (16%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
+ VG PP+ +MVLDTGS+++WL+C R + Q+ F+ SS+Y+ C++P+C
Sbjct: 64 VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPEC 123
Query: 220 ----KSLDV----SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
+ L V + + C ++Y D S G L +T G + V + GC
Sbjct: 124 QWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLLGGAPPVXAL-FGCVTS 182
Query: 269 --------GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
D+E + GLLG+ G LS Q AYC+ D P VL
Sbjct: 183 YSSATATNSSDSEA----ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVL---- 234
Query: 321 ARGGDAVT-------APLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
GGD PLI+ ++ + F Y V L G VG + IP S+ D G
Sbjct: 235 --GGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTG 292
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA------LFDTCYDFSGLR- 421
G +VD GT T L AY L+ F+ L G + FD C+ S R
Sbjct: 293 AGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARV 352
Query: 422 ---SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSI 468
S +P V L GA A+ Y +P + G +C F + A +
Sbjct: 353 AAASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYV 412
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
IG+ QQ V +DL N RVGF P +C
Sbjct: 413 IGHHHQQNVWVEYDLQNGRVGFAPARC 439
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 170/359 (47%), Gaps = 32/359 (8%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP+ F++++DTGS + ++ C C +C + DP F P++SS+Y P+ C
Sbjct: 81 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC- 139
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
++D + C ++R C+Y+ Y + S + G L + +SFGN + + GC +
Sbjct: 140 -----TIDCN-CDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENV 193
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
G L+ A G++GLG G LS+ K + + S + C D ++ +
Sbjct: 194 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLGGISPPS 253
Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
D A + +Y + L V G+ + + ++F+ G G ++D GT L
Sbjct: 254 DMAFA--YSDPVRSPYYNIDLKEIHVAGKRLPLNANVFD----GKHGTVLDSGTTYAYLP 307
Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKAL 438
A+ + +D+ V+ +LK SG D C+ +G+ S P V + F G+
Sbjct: 308 EAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKY 367
Query: 439 DLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L +NY+ G +C F + +++G + + T V +D ++GF C
Sbjct: 368 TLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNC 426
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 160/356 (44%), Gaps = 28/356 (7%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
+++ + +GTP R FS+++DTGS I ++ C+ C+ C + + FDP S++ L C P
Sbjct: 13 FYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPL 72
Query: 219 CKSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--GHDNEGL 275
C S C +RC Y Y + S + G ++ +T F +S S + GC G E
Sbjct: 73 CNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPVRLVFGCENGETGEIY 132
Query: 276 FVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVL---EFNSARGGDAVTA 329
+ G++G+G + Q+ K + L P G+L + G + V
Sbjct: 133 RQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLC-FGYPKDGILLLGDVTLPEGANTVYT 191
Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
PL+ + + +Y V + G +V GQ + S+F+ G ++D GT T L T A+
Sbjct: 192 PLLTHLHLH-YYNVKMDGITVNGQTLAFDASVFDRGY----GTVLDSGTTFTYLPTDAFK 246
Query: 390 SLRDS---FVRLAGNLKPTSGV--ALFDTCY-----DFSGLRSVRVPTVSLHFGAGKALD 439
++ + +V G L+ T G D C+ F L P FG G L
Sbjct: 247 AMAKAVGDYVEKKG-LQSTPGADPQYNDICWKGAPDQFKDLDKY-FPPAEFVFGGGAKLT 304
Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
LP YL + +C ++ +++G V + V++D N++VGFT C
Sbjct: 305 LPPLRYLF-LSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTTMAC 359
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 131/454 (28%), Positives = 189/454 (41%), Gaps = 88/454 (19%)
Query: 116 LITKLQLAIYNVDRHELKPAE---AQILPEDFSTPVVSGASQGSGEYFS-RIGVGTPPRQ 171
L L A +N H LK A+ S P+ S GS S +G +
Sbjct: 29 LTHTLSKAQFNSTHHLLKSTSTRSAKRFRRQLSLPL----SPGSDYTLSFNLGPQAQAQP 84
Query: 172 FSMVLDTGSDINWLQCRP--CTECY-QQSDPIFDPKT---------------SSSYSPLP 213
++ +DTGSD+ W C P C C + ++P P T S++++ P
Sbjct: 85 ITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVAVSCKSPACSAAHNLAP 144
Query: 214 ----CAAPQC--KSLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
CAA +C +S++ S C +C + AYGDGS + L +T+S +S ++
Sbjct: 145 PSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL-IARLYRDTLSL-SSLFLRNFTF 202
Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRD-------SPAS 313
GC H G+ G G G+LSL Q+ S +YCLV P+
Sbjct: 203 GCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVRKPSP 259
Query: 314 GVL------EFNSARGGDA--VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+L E GG A V ++ N K FY V L G +VG + + P L ++
Sbjct: 260 LILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAVGKRTIPAPEMLRRVN 319
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-------NLKPTSGVALFDTCYDFS 418
GDGG++VD GT T L YNS+ D F R G ++ +G+A CY +
Sbjct: 320 NRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIEEKTGLA---PCYYLN 376
Query: 419 GLRSVRVPTVSLHFGAGK--ALDLPAKNYLIPVD----------SAGTFCFAFAPTSSAL 466
+ VP ++L F GK ++ LP KNY G + L
Sbjct: 377 SV--ADVPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGCLMLMNGGDEADL 434
Query: 467 S-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S +GN QQQG V +DL RVGF +C
Sbjct: 435 SGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 117/378 (30%), Positives = 165/378 (43%), Gaps = 51/378 (13%)
Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP---IFDPKTSSSYSPLPC 214
EY I VGTPP + + DTGSD+ W++C+ + P F P SS+Y + C
Sbjct: 109 EYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGC 168
Query: 215 AAPQCKSLDVSA-CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSG------------- 259
C++L +A C + C Y +YGDGS G L TET +F
Sbjct: 169 DTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNN 228
Query: 260 --------SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLV 306
+ + GC G F L+GLGGG +SL Q+ AT+ +YCL
Sbjct: 229 NSSSHGQVEIAKLDFGCSTTTTGTFRADG-LVGLGGGPVSLASQLGATTSLGRKFSYCLA 287
Query: 307 D-RDSPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
++ AS L F S A + PLI +V+T+Y + L +V G + P +
Sbjct: 288 PYANTNASSALNFGSRAVVSEPGAASTPLITG-EVETYYTIALDSINVAG--TKRPTT-- 342
Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR- 421
A IIVD GT +T L + L R + S + D CYD SG+R
Sbjct: 343 ----AAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYDISGVRG 398
Query: 422 --SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGT 477
++ +P V+L G G + L N + V G C A TS ++SI+GN+ QQ
Sbjct: 399 EDALGIPDVTLVLGGGGEVTLKPDNTFVVVQE-GVLCLALVATSERQSVSILGNIAQQNL 457
Query: 478 RVSFDLANNRVGFTPNKC 495
V +DL V F C
Sbjct: 458 HVGYDLEKGTVTFAAADC 475
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 169/367 (46%), Gaps = 39/367 (10%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--K 220
+ VGTPP+ SMV+DTGS+++WL C T F+ S SY P+PC++ C +
Sbjct: 35 LTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPCSSSTCTNQ 93
Query: 221 SLDVS---ACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
+ D S +C +N C ++Y D S + G+L ++T G S + G+ GC N
Sbjct: 94 TRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGAS-DIPGMVFGCMDSVFSSN 152
Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT---A 329
+ GL+G+ G LS Q+ +YC+ D SG+L + AV
Sbjct: 153 SDEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISGTD--FSGMLLLGESNFTWAVPLNYT 210
Query: 330 PLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
PL++ + + F Y V L G V + + IP S+FE D G G +VD GT T L
Sbjct: 211 PLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLL 270
Query: 385 TQAYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDFSGLRSV--RVPTVSLHF-GAG 435
AY +LR F+ + G L+ D CY + V R+PTVSL F GA
Sbjct: 271 GPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSLVFNGAE 330
Query: 436 KALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS----IIGNVQQQGTRVSFDLANNRV 488
+ Y +P + G C +F S L +IG+ QQ + FDL +R+
Sbjct: 331 MTVADERVLYRVPGEIRGNDSVHCLSFG-NSDLLGVEAYVIGHHHQQNVWMEFDLERSRI 389
Query: 489 GFTPNKC 495
G +C
Sbjct: 390 GLAQVRC 396
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 163/379 (43%), Gaps = 44/379 (11%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
SG G Y++++G+GTP + + + +DTGSDI W+ C C EC + S +++ K
Sbjct: 77 SGRPDTVGLYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIK 136
Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
S S +PC C ++ +S C AN C Y YGDGS T G V + V + SG
Sbjct: 137 DSVSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSG 196
Query: 260 SVK------GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
++ + GCG G ++ G+LG G S+ Q+ AT A+
Sbjct: 197 DLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAH 256
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
CL + G+ PLI N+ Y V +T VG + +P F
Sbjct: 257 CLDGIN--GGGIFAIGHVVQPKVNMTPLIPNQP---HYNVNMTAVQVGEDFLHLPTEEF- 310
Query: 364 MDEAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
EAGD G I+D GT + L Y L + +LK V TC+ +SG
Sbjct: 311 --EAGDRKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLK-VHIVRDEYTCFQYSGSVD 367
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQG 476
P V+ HF L + YL P + G +C + + ++++G++
Sbjct: 368 DGFPNVTFHFENSVFLKVHPHEYLFPFE--GLWCIGWQNSGMQSRDRRNMTLLGDLVLSN 425
Query: 477 TRVSFDLANNRVGFTPNKC 495
V +DL N +G+T C
Sbjct: 426 KLVLYDLENQAIGWTEYNC 444
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 165/389 (42%), Gaps = 89/389 (22%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC----------TECYQQSDPIFDPK 204
G +Y + G+G PP+ V+DTGSD+ W QC C C+ Q+ P ++
Sbjct: 74 GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133
Query: 205 TSSSYSPLPC---------AAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVS 254
S + +PC AP+ + C+ +YG G +G L T+ +
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192
Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
F +S SV +A GC G G++G++GLG G LSL +DSP
Sbjct: 193 FPSSSSVT-LAFGCVSQTRISPGALTGASGIIGLGRGALSLNP-------------KDSP 238
Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD-- 369
S TFYY+ L G + G V +P F++ EA
Sbjct: 239 FS-------------------------TFYYLPLVGLAAGNATVALPAGAFDLREAAPKV 273
Query: 370 --GGIIVDCGTAITRLQTQAYNSLRDSFVRL---AGNLKPTS---GVAL---FDTCYDFS 418
GG ++D G+ TRL A+ +L R +G+L P G AL + D
Sbjct: 274 WAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDGD 333
Query: 419 GLRSVRVPTVSLHF----GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--------L 466
L + VP++ L F G G+ L +PA+ Y V+ A T+C A ++S
Sbjct: 334 SLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVE-ASTWCMAVVSSASGNATLPTNET 392
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+IIGN QQ RV +DLAN + F P C
Sbjct: 393 TIIGNFMQQDMRVLYDLANGLLSFQPANC 421
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 164/368 (44%), Gaps = 36/368 (9%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
G G Y ++ +GTPP + +DTGS++ W+ C C +C+ QS IF+P SS+Y PC
Sbjct: 94 GDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPC 153
Query: 215 AAPQCKSLDVSACRANRCLY------QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
+ QC++ S N CLY Q+ +G V D +T T S G + C
Sbjct: 154 DSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAV-DTMTLTSSDGRPFPLPYSDFVC 212
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDS--PAS---GVLEFNS 320
G+ F G G++GLG G LSLT ++ S +YCL D S P+ G+ F S
Sbjct: 213 GNSIYKTFAG-VGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQPSKINFGLQSFIS 271
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD---GGIIVDCG 377
+ V+ L ++ YYV L G SVG + L+ +D+ G +++D G
Sbjct: 272 DDDLEVVSTTLGHHRHSGN-YYVTLEGISVGEKRQD----LYYVDDPFAPPVGNMLIDSG 326
Query: 378 TAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--------FSGLRSVRVPTV 428
T T L Y+ L + + N + + F D F ++ P +
Sbjct: 327 TMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFWYYPELKFPKI 386
Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS-IIGNVQQQGTRVSFDLANNR 487
++HF ++L N I V + CFAFA T S + G+ QQ + +DL
Sbjct: 387 TIHFTDAD-VELSDDNSFIRV-AEDVVCFAFAATQPGQSTVYGSWQQMNFILGYDLKRGT 444
Query: 488 VGFTPNKC 495
V F C
Sbjct: 445 VSFKRTDC 452
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 172/369 (46%), Gaps = 40/369 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPPR+F++ +DTGSD+ W+ C C C + S+ FDP SSS S
Sbjct: 82 GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141
Query: 212 LPCAAPQCKS--LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNS-------GSV 261
+ C+ +C S S C N C Y YGDGS T G +++ +SF S
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSS 201
Query: 262 KGIALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDSPA 312
GC + G G+ GLG G LS+ Q+ LA +CL D
Sbjct: 202 APFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL-KGDKSG 260
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G++ + D V PL+ ++ Y V L +V GQ + I PS+F + GDG I
Sbjct: 261 GGIMVLGQIKRPDTVYTPLVPSQP---HYNVNLQSIAVNGQILPIDPSVFTI-ATGDGTI 316
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDT--CYDFSGLRSVRVPTVS 429
I D GT + L +AY+ F++ N G + +++ C++ + P VS
Sbjct: 317 I-DTGTTLAYLPDEAYS----PFIQAIANAVSQYGRPITYESYQCFEITAGDVDVFPEVS 371
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT--FCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
L F G ++ L YL S+G+ +C F S ++I+G++ + V +DL
Sbjct: 372 LSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQ 431
Query: 487 RVGFTPNKC 495
R+G+ C
Sbjct: 432 RIGWAEYDC 440
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 60/123 (48%), Positives = 82/123 (66%), Gaps = 3/123 (2%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
GSGEY + +GTPP + + DTGSD+ W QC PC +CY+QS PIFDP S+S+S +PC
Sbjct: 88 GSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPC 147
Query: 215 AAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
+ CK++D S C A C Y YGD ++T GDL E ++ G+S SVK + +GCGH++
Sbjct: 148 NSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSS-SVKSV-IGCGHESG 205
Query: 274 GLF 276
G F
Sbjct: 206 GGF 208
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 168/361 (46%), Gaps = 35/361 (9%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP+ F++++D+GS + ++ C C +C + DP F P+ SS+Y P+ C
Sbjct: 91 NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCN 150
Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDNE 273
C D +C+Y+ Y + S + G L + +SFGN + + GC
Sbjct: 151 M-DCNCDD----DKEQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVET 205
Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQIK-----ATSLAYCLVDRDSPASGVLEFNSARGGDA 326
G L+ A G++GLG G LSL Q+ + S C D ++ G D
Sbjct: 206 GDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMI----LGGFDY 261
Query: 327 VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
+ + + D +Y + LTG V G+ + + +F+ G+ G ++D GT L
Sbjct: 262 PSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFD----GEHGAVLDSGTTYAYLP 317
Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY------DFSGLRSVRVPTVSLHFGAGK 436
A+ + ++ +R LK G DTC+ D S L + P+V + F +G+
Sbjct: 318 DAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKI-FPSVEMIFKSGQ 376
Query: 437 ALDLPAKNYLIPVDSA-GTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
+ L +NY+ G +C P +++G + + T V +D N++VGF
Sbjct: 377 SWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTN 436
Query: 495 C 495
C
Sbjct: 437 C 437
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 111/406 (27%), Positives = 176/406 (43%), Gaps = 47/406 (11%)
Query: 121 QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
L ++ RH A A LP +G +G YF++IG+GTP + + + +DTGS
Sbjct: 48 NLRAHDARRHGRSLAAAVDLPLG-----GNGLPTETGLYFTQIGIGTPAKSYYVQVDTGS 102
Query: 181 DINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKSLD---VSACR-ANR 231
DI W+ C C C ++S ++DP SSS + + C C + + +C A
Sbjct: 103 DILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAAP 162
Query: 232 CLYQVAYGDGSFTVGDLVTETVSF----GNSGSV---KGIALGCGHDNEGLFVGSA---- 280
C Y ++YGDGS T G VT+ + + GNS + I GCG G S+
Sbjct: 163 CQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALD 222
Query: 281 GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
G+LG G S+ Q+ A A+CL + G+ T PL+
Sbjct: 223 GILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTIN--GGGIFAIGDVVQPKVSTTPLVPGM 280
Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
Y V L VGG +Q+P ++F++ E+ G I+D GT + L YN++
Sbjct: 281 P---HYNVNLEAIDVGGVKLQLPTNIFDIGES--KGTIIDSGTTLAYLPGVVYNAIMSKV 335
Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
G++ P F C+ +SG P ++ HF G L++ +YL + +
Sbjct: 336 FAQYGDM-PLKNDQDFQ-CFRYSGSVDDGFPIITFHFEGGLPLNIHPHDYLF--QNGELY 391
Query: 456 CFAF------APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C F + ++G++ V +DL N +G+T C
Sbjct: 392 CMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNC 437
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 168/392 (42%), Gaps = 81/392 (20%)
Query: 173 SMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYS-PLPCAAPQC---------- 219
++ +DTGSD+ W C P C C + P P +++ S + C +P C
Sbjct: 64 TLYMDTGSDLVWFPCAPFKCILC--EGKPNASPPVNTTRSVAVSCKSPACSAAHNLASPS 121
Query: 220 ----------KSLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
+S++ S C +C + AYGDGS + L +T+S +S ++ GC
Sbjct: 122 DLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL-IARLYRDTLSL-SSLFLRNFTFGC 179
Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRD-------SPASGV 315
+ G+ G G G+LSL Q+ S +YCLV P+ +
Sbjct: 180 AYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVRKPSPLI 236
Query: 316 L-------EFNSARGGDA--VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
L E GG A V P++ N K FY VGL G SVG + V P L ++
Sbjct: 237 LGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGKRIVPAPEMLRRVNN 296
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-------NLKPTSGVALFDTCYDFSG 419
GDGG++VD GT T L YNS+ D F R G ++ +G+A CY +
Sbjct: 297 RGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLA---PCYYLNS 353
Query: 420 LRSVRVPTVSLHFGAGK-ALDLPAKNYLIPV----DSA------GTFCFAFAPTSSALS- 467
+ VP ++L F G ++ LP KNY D+A G + LS
Sbjct: 354 V--AEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDEAELSG 411
Query: 468 ----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GN QQQG V +DL RVGF +C
Sbjct: 412 GPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/296 (32%), Positives = 142/296 (47%), Gaps = 32/296 (10%)
Query: 120 LQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ--GSGEYFSRIGVGTPPRQFSMVLD 177
+ L Y+ R + ++LPE S P+ SG + G Y++RI +GTPP+QF + +D
Sbjct: 1 MSLDHYHTLRKHDQRRLRRMLPEVVSFPI-SGDNDIFAMGLYYTRISLGTPPQQFYVDVD 59
Query: 178 TGSDINWLQCRPCTECYQQSD-PI----FDPKTSSSYSPLPCAAPQCKSLDVS-ACRANR 231
TGS++ W++C PCT C D P+ FDP+ S++ + C +C L+ C R
Sbjct: 60 TGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPER 119
Query: 232 --CLYQVAYGDGSFTVGDLVTETVSFG-----NSGSVKGIA---LGCGHDNEGLFVGSAG 281
C Y + YGDGS T G + + +F NS + G A GCG G + G
Sbjct: 120 LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSW-SVDG 178
Query: 282 LLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
LLG G +SL Q I A+CL D G L + R D V P++ +
Sbjct: 179 LLGFGPTTVSLPNQLAQQNISVNIFAHCL-QGDVSGRGSLVIGTIREPDLVYTPMVFGED 237
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
Y V L + G+ V P S F+++ GG+I+D GT +T L AY+ R
Sbjct: 238 ---HYNVQLLNIGISGRNVTTPAS-FDLEYT--GGVIIDSGTTLTYLVQPAYDEFR 287
>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
Length = 484
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 157/370 (42%), Gaps = 48/370 (12%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSD-INWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
G+ EY G GTP ++ + DT + LQC PC +D FDP SSS S +P
Sbjct: 134 GAFEYHVVAGFGTPMQKLPVGFDTTTTGATLLQCTPCG---SGADHAFDPSASSSVSQVP 190
Query: 214 CAAPQCK--------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
C +P C S +S N L + + T+ + TV ++GIA
Sbjct: 191 CGSPDCPFHGCSGRPSCTLSVSFNNTLLGNATFFTDTLTLTPSSSATVDKFRFACLEGIA 250
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRDSPAS----GV 315
G D GSAG+L L SL ++ A+S +YCL PAS G
Sbjct: 251 PGPAED------GSAGILDLSRNSHSLPSRLVASSPPHAVAFSYCL-----PASTADVGF 299
Query: 316 LEFNSAR----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
L + + G PL + Y V L G +GG + IPP+ D+
Sbjct: 300 LSLGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGLGLGGPDLPIPPAAIAGDD----- 354
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
I++ T T L+ Q Y LRDSF + + DTCY+F+GL + VP V+L
Sbjct: 355 TILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSLDTCYNFTGLDAFSVPAVTLK 414
Query: 432 FGAGKALDLPAKNYLIPVDSAGTF---CFAFAPTSSAL---SIIGNVQQQGTRVSFDLAN 485
F G +DL + D F C AF ++IG++ Q T V +D+
Sbjct: 415 FAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSMAQMSTEVVYDVRG 474
Query: 486 NRVGFTPNKC 495
+VGF P +C
Sbjct: 475 GKVGFVPYRC 484
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 170/362 (46%), Gaps = 37/362 (10%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +R+ +GTPP+ F++++D+GS + ++ C C +C + DP F P+ SS+Y P+ C
Sbjct: 90 NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKC- 148
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
++D + C +R C+Y+ Y + S + G L + +SFGN + + GC
Sbjct: 149 -----NMDCN-CDDDREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 202
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQIK-----ATSLAYCLVDRDSPASGVLEFNSARGG 324
G L+ A G++GLG G LSL Q+ + S C D ++ G
Sbjct: 203 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMI----LGGF 258
Query: 325 DAVTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
D + + + D +Y + LTG V G+ + + +F+ G+ G ++D GT
Sbjct: 259 DYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFD----GEHGAVLDSGTTYAY 314
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVR-----VPTVSLHFGAG 435
L A+ + ++ +R LK G DTC+ + V P+V + F +G
Sbjct: 315 LPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSG 374
Query: 436 KALDLPAKNYLIPVDSA-GTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
++ L +NY+ G +C P +++G + + T V +D N++VGF
Sbjct: 375 QSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRT 434
Query: 494 KC 495
C
Sbjct: 435 NC 436
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 119/389 (30%), Positives = 154/389 (39%), Gaps = 71/389 (18%)
Query: 173 SMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSP------LPCAAPQCKSLDV 224
S+ LDTGSD+ W C P C C + P + +S+ P +PCA+P C +
Sbjct: 99 SLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHS 158
Query: 225 SA-----CRANRC------------------LYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
SA C A RC LY AYGDGS V L V S +V
Sbjct: 159 SAPPADLCAAARCPLDDIETGSCAASHACPPLY-YAYGDGSL-VARLRRGRVGIAASVAV 216
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLA----YCLVDR----DSP-A 312
+ C H G VG AG G G LSL Q+ +L+ YCLV D P
Sbjct: 217 ENFTFACAHTALGEPVGVAGF---GRGPLSLPAQLAPAALSGRFSYCLVAHSFRADRPIR 273
Query: 313 SGVLEFNSARGGDA------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
L + G D V PL+ N K FY V L SVGG + P L +
Sbjct: 274 PSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGR 333
Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT-----CYDFSGLR 421
AGDGG++VD GT T L + Y + + F R + A D CY +
Sbjct: 334 AGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDA 393
Query: 422 SV-------RVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFA-----PTSSAL 466
S VP +++HF + LP +NY + S C
Sbjct: 394 SAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPA 453
Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GN QQQG V +D+ RVGF +C
Sbjct: 454 GTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 157/383 (40%), Gaps = 61/383 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+ VGTPP+ +MVLDTGS+++WL C + D FD SSSY+P+PC++P C L
Sbjct: 67 VAVGTPPQNVTMVLDTGSELSWLLCN-----GSRHDAPFDASASSSYAPVPCSSPACTWL 121
Query: 223 --DVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVKGIALGCGHDNEGL 275
D+ C ++ C ++Y D S G L +T G+S ++ G +
Sbjct: 122 GRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSPMPALFGCITSYSSSTDPS 181
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR-- 333
GLLG+ G LS Q AYC+ P G+L GG+ PL
Sbjct: 182 ETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQGP--GILLL----GGNDTETPLTSPP 235
Query: 334 ------------NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
++ + F Y V L G VG + IP L D G G +VD G
Sbjct: 236 QQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQTMVDSG 295
Query: 378 TAITRLQTQAYNSLRDSFVR-----LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--- 424
T T L AY +L+ F L G L P FD C+ + R
Sbjct: 296 TRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGTEARVSAAAA 355
Query: 425 ---VPTVSLHFGAGKALDLPAKNYLIPV------DSAGTFCFAFAPTSSA---LSIIGNV 472
+P V L + + A+ L V + G +C F + A +IG+
Sbjct: 356 GGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMAGVSAYVIGHH 415
Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
QQ V +DL N R+GF +C
Sbjct: 416 HQQDVWVEYDLRNARLGFAAARC 438
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 44/372 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y+++IG+GTP + + + +DTGSDI W+ C C EC + S +++ S +
Sbjct: 76 GLYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKL 135
Query: 212 LPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK---- 262
+PC C ++ + C AN C Y YGDGS T G V + V + SG +K
Sbjct: 136 VPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAA 195
Query: 263 --GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
+ GCG G S G+LG G S+ Q+ T A+CL +
Sbjct: 196 NGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL--DGT 253
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD- 369
G+ PLI N+ Y V +T VG + + +P +F EAGD
Sbjct: 254 NGGGIFVIGHVVQPKVNMTPLIPNQP---HYNVNMTAVQVGHEFLSLPTDVF---EAGDR 307
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
G I+D GT + L Y L + +LK + + TC+ +S P V+
Sbjct: 308 KGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEY-TCFQYSDSLDDGFPNVT 366
Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS------SALSIIGNVQQQGTRVSFDL 483
HF L + YL P + G +C + + ++++G++ V +DL
Sbjct: 367 FHFENSVILKVYPHEYLFPFE--GLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDL 424
Query: 484 ANNRVGFTPNKC 495
N +G+T C
Sbjct: 425 ENQAIGWTEYNC 436
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/184 (35%), Positives = 102/184 (55%), Gaps = 13/184 (7%)
Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
++ +LA + R E A ++ E TP++ GEY ++G+GTPP +F+ +D
Sbjct: 55 SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107
Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++ C LDV C + C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
Y + T G L + + G + +G+A GC + G ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226
Query: 293 TKQI 296
Q+
Sbjct: 227 VSQL 230
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 57/129 (44%), Gaps = 5/129 (3%)
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
G+I+D + IT L+ Y+ L + +RL + G+ L D V VP
Sbjct: 236 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPA 295
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
V+L F G+ L L +G C + ++SI+GN QQQ +V ++L
Sbjct: 296 VALAFD-GRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRG 354
Query: 487 RVGFTPNKC 495
RV F + C
Sbjct: 355 RVTFVQSPC 363
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 163/381 (42%), Gaps = 61/381 (16%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G Y++ I +GTPP+ + + +DTGSDI W+ C C +C +S ++DPK SS+ S
Sbjct: 83 TGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGS 142
Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN---SGSVK- 262
+ C C + + C AN C Y V YGDGS T+G VT+ + F G +
Sbjct: 143 MVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQP 202
Query: 263 ---GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRD 309
+ GCG +G +GS+ G+LG G S+ Q+ A+CL
Sbjct: 203 ANASVIFGCGA-QQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCL--DT 259
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
G+ T PL+ +K Y V L VGG +Q+P +FE E
Sbjct: 260 IKGGGIFSIGDVVQPKVKTTPLVADKP---HYNVNLKTIDVGGTTLQLPAHIFEPGEK-- 314
Query: 370 GGIIVDCGTAITRL--------QTQAYNSLRD-SFVRLAGNLKPTSGVALFDTCYDFSGL 420
G I+D GT +T L +N +D +F + G L C+ + G
Sbjct: 315 KGTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITFHDVQGFL-----------CFQYPGS 363
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQ 474
PT++ HF AL + Y + +C F +S + ++G++
Sbjct: 364 VDDGFPTITFHFEDDLALHVYPHEYFF-ANGNDVYCVGFQNGASQSKDGKDIVLMGDLVL 422
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
V +DL N +G+T C
Sbjct: 423 SNKLVIYDLENRVIGWTDYNC 443
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 149/359 (41%), Gaps = 42/359 (11%)
Query: 170 RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA 229
+ + + LD G ++W+QC PC C Q P+FDP S ++S +P
Sbjct: 109 QNYQLALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLAN 168
Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSF--GNSGSV--KGIALGCGHDNEGLF--VGSAGLL 283
C + +AY D + G L +T SF GN V I GC H E AG+L
Sbjct: 169 GACGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGIL 228
Query: 284 GLGGG-----MLSLTKQI---KATSLAYCLVDRDSPASGVLEF----------NSARGGD 325
GLG G + TKQ+ +YC L F N R
Sbjct: 229 GLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSHPPPNVHRQST 288
Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
V AP ++ Y+V L G SVG + + P++F + G GG +VD GT +T
Sbjct: 289 PVLAPAHNSEA----YFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFI 344
Query: 385 TQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
AY +++R R ++ G +TC +P+++LHF G L +
Sbjct: 345 HSAYVHIDHAVRQHLQRRGAHIVVVRG----NTCVQQPAPHHDVLPSMTLHFENGAWLRV 400
Query: 441 PAKNYLIPVDSAGTF--CFAFAPTSSALSIIGNVQQQGTRVSFDLANN--RVGFTPNKC 495
++ +P G CF F +S+ L++IG QQ R FDL + + F P C
Sbjct: 401 MPEHVFMPFVVGGHHYQCFGFV-SSTDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 166/375 (44%), Gaps = 50/375 (13%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y+++IG+GTP + + + +DTGSDI W+ C C +C ++S +++ S S
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKL 137
Query: 212 LPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK---- 262
+ C C + +S C+AN C Y YGDGS T G V + V + + +G +K
Sbjct: 138 VSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197
Query: 263 --GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
+ GCG G S G+LG G S+ Q+ ++ A+CL R+
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN- 256
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD- 369
G+ PL+ N+ Y V +T VG + + IP LF + GD
Sbjct: 257 -GGGIFAIGRVVQPKVNMTPLVPNQP---HYNVNMTAVQVGQEFLNIPADLF---QPGDR 309
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVP 426
G I+D GT + L Y L V+ + +P V + D C+ +SG P
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFP 365
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRVS 480
V+ HF L + +YL P + G +C + ++ ++++G++ V
Sbjct: 366 NVTFHFENSVFLRVYPHDYLFPYE--GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVL 423
Query: 481 FDLANNRVGFTPNKC 495
+DL N +G+T C
Sbjct: 424 YDLENQLIGWTEYNC 438
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 157/375 (41%), Gaps = 49/375 (13%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G Y++ + +GTPP++F + +DTGSDI W+ C C +C +S ++DPK SS+ S
Sbjct: 85 TGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGS 144
Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN---SGSVK- 262
+ C C + C AN C Y V YGDGS TVG V + + F G +
Sbjct: 145 TVMCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQP 204
Query: 263 ---GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
+ GCG G S+ G+LG G S+ Q+ A+CL
Sbjct: 205 ANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCL--DTI 262
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G+ T PL+ +K Y V L VGG +++P +F+ E
Sbjct: 263 KGGGIFAIGDVVQPKVKTTPLVADKP---HYNVNLKTIDVGGTTLELPADIFKPGEK--R 317
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT----CYDFSGLRSVRVP 426
G I+D GT +T L + + + + D C+++SG P
Sbjct: 318 GTIIDSGTTLTYLPELVFKKV------MLAVFNKHQDITFHDVQDFLCFEYSGSVDDGFP 371
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVS 480
T++ HF AL + Y P + +C F + + ++G++ V
Sbjct: 372 TLTFHFEDDLALHVYPHEYFFP-NGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVV 430
Query: 481 FDLANNRVGFTPNKC 495
+DL N +G+T C
Sbjct: 431 YDLENRVIGWTDYNC 445
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 171/370 (46%), Gaps = 42/370 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC--YQQS---------DPIFDPKT 205
G Y SR+ +GTPP +F++++DTGS + ++ C CT C +Q S DP F P+
Sbjct: 38 GYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPEN 97
Query: 206 SSSYSPLPCAAPQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
SSSY + C + C + C +N +C Y+ Y + S + G L + + FG + ++
Sbjct: 98 SSSYQKIGCRSSDCIT---GLCDSNSHQCKYERMYAEMSTSKGVLGKDLLDFGPASRLQS 154
Query: 264 --IALGCGHDNEG-LFVGSA-GLLGLGGGMLSLTKQIKAT-------SLAYCLVDRDSPA 312
++ GC G L++ A G++GLG G LS+ Q+ SL Y +D +
Sbjct: 155 QLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMD-EGGG 213
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
S VL A G + + +Y + LT V G ++++ ++F G G
Sbjct: 214 SMVLGAIPAPSGMVFAKS---DPRRSNYYNLELTEIQVQGASLKLDSNVFN----GKFGT 266
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----P 426
I+D GT L +A+ + D+ V G+L+ G D CY +G + + P
Sbjct: 267 ILDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFP 326
Query: 427 TVSLHFGAGKALDLPAKNYLIP-VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
V F + + L +NYL G +C F A +++G + + V++D N
Sbjct: 327 LVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVTYDRYN 386
Query: 486 NRVGFTPNKC 495
+++GF C
Sbjct: 387 HQIGFLKTNC 396
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 165/377 (43%), Gaps = 35/377 (9%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFD 202
VV + QG+ + S G F++ +DTGSDI W+ C C+ C Q S FD
Sbjct: 57 VVDFSVQGTSDPNSVGMYGXXXXXFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFD 116
Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSFG- 256
SS+ + +PC+ C S A R N+C Y YGDGS T G V++ + F
Sbjct: 117 TVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNL 176
Query: 257 ------NSGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL----- 301
S I GC G + G+ G G G LS+ Q+ + +
Sbjct: 177 IMGQPPAVNSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVF 236
Query: 302 AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
++CL D G+L V +PL+ ++ Y + L +V GQ + I P++
Sbjct: 237 SHCL-KGDGNGGGILVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQPLPINPAV 292
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
F + GG IVDCGT + L +AY+ L + + A + + + CY S
Sbjct: 293 FSISN-NRGGTIVDCGTTLAYLIQEAYDPLVTA-INTAVSQSARQTNSKGNQCYLVSTSI 350
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
P VSL+F G ++ L + YL+ +D A +C F SI+G++ +
Sbjct: 351 GDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDLVLKDKI 410
Query: 479 VSFDLANNRVGFTPNKC 495
V +D+A R+G+ C
Sbjct: 411 VVYDIAQQRIGWANYDC 427
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 166/381 (43%), Gaps = 48/381 (12%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
SG G Y+++IG+GTPP+ + + +DTGSDI W+ C C EC +S+ ++D K
Sbjct: 76 SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIK 135
Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
SSS +PC CK ++ ++ C AN C Y YGDGS T G V + V + SG
Sbjct: 136 ESSSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 195
Query: 260 SVK------GIALGCGHDNEGLFVGS-----AGLLGLGGGMLSLTKQIKATS-----LAY 303
+K I GCG G S G+LG G S+ Q+ ++ A+
Sbjct: 196 DLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAH 255
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
CL G+ PL+ ++ Y V +T VG + + +
Sbjct: 256 CL--NGVNGGGIFAIGHVVQPKVNMTPLLPDQP---HYSVNMTAVQVGHAFLSLST---D 307
Query: 364 MDEAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--TCYDFSGL 420
GD G I+D GT + L Y L + +LK + L D TC+ +S
Sbjct: 308 TSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRT---LHDEYTCFQYSES 364
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT------SSALSIIGNVQQ 474
P V+ +F G +L + +YL P S +C + + S ++++G++
Sbjct: 365 VDDGFPAVTFYFENGLSLKVYPHDYLFP--SGDFWCIGWQNSGTQSRDSKNMTLLGDLVL 422
Query: 475 QGTRVSFDLANNRVGFTPNKC 495
V +DL N +G+T C
Sbjct: 423 SNKLVFYDLENQVIGWTEYNC 443
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 125/410 (30%), Positives = 165/410 (40%), Gaps = 86/410 (20%)
Query: 158 EYFSRIGVGTP--PRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP 213
+Y + VG P S+ LDTGSD+ W C P C C ++ P + SPLP
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATP-----GGNHSSPLP 141
Query: 214 ---------CAAPQCKSLDVSA-----CRANRC-----------------LYQVAYGDGS 242
CA+P C + SA C A RC LY AYGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY-YAYGDGS 200
Query: 243 FTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT--- 299
V +L V S +V+ C H VG AG G G LSL Q+ +
Sbjct: 201 L-VANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSG 256
Query: 300 SLAYCLVD---------RDSPASGVLEFNSARGG----DAVTAPLIRNKKVDTFYYVGLT 346
+YCLV R SP ++A G D V PL+ N K FY V L
Sbjct: 257 RFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALE 316
Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG------ 400
SVGG+ +Q P L ++D G+GG++VD GT T L + + + D F R
Sbjct: 317 AVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTR 376
Query: 401 --NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTF 455
+ +G+A CY +S VP V+LHF + LP +NY + S
Sbjct: 377 AEGAEAQTGLA---PCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432
Query: 456 CFAFAPT----------SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C +GN QQQG V +D+ RVGF +C
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 125/410 (30%), Positives = 165/410 (40%), Gaps = 86/410 (20%)
Query: 158 EYFSRIGVGTP--PRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP 213
+Y + VG P S+ LDTGSD+ W C P C C ++ P + SPLP
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATP-----GGNHSSPLP 141
Query: 214 ---------CAAPQCKSLDVSA-----CRANRC-----------------LYQVAYGDGS 242
CA+P C + SA C A RC LY AYGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY-YAYGDGS 200
Query: 243 FTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT--- 299
V +L V S +V+ C H VG AG G G LSL Q+ +
Sbjct: 201 L-VANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSG 256
Query: 300 SLAYCLVD---------RDSPASGVLEFNSARGG----DAVTAPLIRNKKVDTFYYVGLT 346
+YCLV R SP ++A G D V PL+ N K FY V L
Sbjct: 257 RFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALE 316
Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG------ 400
SVGG+ +Q P L ++D G+GG++VD GT T L + + + D F R
Sbjct: 317 AVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTR 376
Query: 401 --NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTF 455
+ +G+A CY +S VP V+LHF + LP +NY + S
Sbjct: 377 AEGAEAQTGLA---PCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432
Query: 456 CFAFAPT----------SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C +GN QQQG V +D+ RVGF +C
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 162/359 (45%), Gaps = 39/359 (10%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS--------SYS 210
Y +G+GTP + + +DTGS +W+ C C C+ ++++ S
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 140
Query: 211 PLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
L + P C+ + C ++V+Y DGS + G L +T++F + + G + GC
Sbjct: 141 LLGGSDPHCQDSE----NYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNM 196
Query: 271 DNEGL--FVGSAGLLGLGGGMLSLTKQIKAT--SLAYCLVDRD------SPASGVLEFNS 320
D+ G F GLLG+G G +S+ KQ T +YCL + S +G
Sbjct: 197 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLGK 256
Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
A D ++ KK ++V LT SV G+ + + PS+F G++ D G+
Sbjct: 257 VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK-----GVVFDSGSE 311
Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT---CYDFSGLRSVRVPTVSLHFGAGK 436
++ + +A + L L LK G A ++ CYD + +P +SLHF G
Sbjct: 312 LSYIPDRALSVLSQRIRELL--LK--RGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 367
Query: 437 ALDLPAKNYLIP--VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
DL + + V +C AFAPT S +SIIG++ Q V +DL +G P+
Sbjct: 368 RFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 174/381 (45%), Gaps = 48/381 (12%)
Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI------- 200
+++G+S Y+++IGVG P + + ++DTGSDI W +C+ C C + + I
Sbjct: 77 MLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIM 136
Query: 201 ------FDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTET 252
+DP+ S + SP C+ P C + +CR N C Y ++Y D S + G +
Sbjct: 137 QGPITLYDPELSITASPATCSDPLCS--EGGSCRGNNNSCAYDISYEDTSSSTGIYFRDV 194
Query: 253 VSFGNSGSVK-GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAY-----CLV 306
V G+ S+ + LGC GL+ G++G G +S+ Q+ A + +Y CL
Sbjct: 195 VHLGHKASLNTTMFLGCATSISGLW-PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLS 253
Query: 307 DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
++ + + V P++ N D Y V L SV +A+ I S FE +
Sbjct: 254 GEKEGGGILVLGKNDEFPEMVYTPMLAN---DIVYNVKLVSLSVNSKALPIEASEFEYNA 310
Query: 367 -AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-----SGVALFDTCYDFSGL 420
G+GG I+D GT+ ++A + + + PT SG F + D + +
Sbjct: 311 TVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAI-PTAPLESSGSPCFISISDRNSV 369
Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDS-----------AGTFCFAFAPTSSALSII 469
V P V+L F G ++L A NYL V S C +++ +S +I+
Sbjct: 370 E-VDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVCISWSVGNS--TIL 426
Query: 470 GNVQQQGTRVSFDLANNRVGF 490
G+ + V +D+ +R+G+
Sbjct: 427 GDAILKDKVVVYDMEKSRIGW 447
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 117/399 (29%), Positives = 164/399 (41%), Gaps = 76/399 (19%)
Query: 167 TPPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIF----DPKTSSS------------ 208
PP+ S+ LDTGSD+ W C+P C C +++ P+ SS+
Sbjct: 91 NPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACS 150
Query: 209 --YSPLP----CAAPQC--KSLDVSACRANRC-LYQVAYGDGSFTVGDLVTETVSF---G 256
+S LP CA C +S++ S C + C + AYGDGS V L +++
Sbjct: 151 AAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSL-VARLYHDSIKLPLAT 209
Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA------TSLAYCLV---- 306
S S+ GC H VG AG G G+LSL Q+ + +YCLV
Sbjct: 210 PSLSLHNFTFGCAHTALAEPVGVAGF---GRGVLSLPAQLASFAPQLGNRFSYCLVSHSF 266
Query: 307 --DR-DSPASGVLEFNSARGGDA-------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
DR P+ +L + + V ++ N K FY VGL G S+G + +
Sbjct: 267 NSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIP 326
Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFD 412
P L +D G GG++VD GT T L YNS+ F G + K
Sbjct: 327 APEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGLG 386
Query: 413 TCYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG--------TFCFAFAP-- 461
CY + + V +P++ LHF G ++ LP KNY G C
Sbjct: 387 PCYYYDTV--VNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGG 444
Query: 462 -----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
T + +GN QQ G V +DL RVGF KC
Sbjct: 445 EEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 173/371 (46%), Gaps = 40/371 (10%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+R+ +G P +++ + +DTGSDI W+ C PCT C S F+P +SS+ S
Sbjct: 87 GLYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSR 146
Query: 212 LPCAAPQCKS---LDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
+PC+ +C + + C+++ C Y YGDGS T G V++T+ F GN
Sbjct: 147 IPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQ 206
Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQ-----IKATSLAYCLVD 307
+ S + GC + G + + G+ G G LS+ Q + + ++CL
Sbjct: 207 TANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKG 266
Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
D+ G+L V PL+ ++ Y + L +V GQ + I SLF
Sbjct: 267 SDN-GGGILVLGEIVEPGLVFTPLVPSQP---HYNLNLESIAVSGQKLPIDSSLFATSNT 322
Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
G IVD GT + L AY+ ++ + A + S V+ C+ + PT
Sbjct: 323 --QGTIVDSGTTLVYLVDGAYDPFINA-IAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPT 379
Query: 428 VSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
+L+F G ++ + +NYL+ VD+ +C + S ++I+G++ + +DLA
Sbjct: 380 ATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQ-RSQGITILGDLVLKDKIFVYDLA 438
Query: 485 NNRVGFTPNKC 495
N R+G+ C
Sbjct: 439 NMRMGWADYDC 449
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 40/372 (10%)
Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPIFDPKTSSSY 209
+ G +Y +G GTP +Q +M DTG I+ ++C RP C + FDP SS++
Sbjct: 140 APGFHDYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLAS--FDPSRSSTF 197
Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
+P+PC +P C+S S + L F G + + ++ S SV GC
Sbjct: 198 APVPCGSPDCRSGCSSGSTPSCPLTSF-----PFLSGAVAQDVLTLTPSASVDDFTFGCV 252
Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA 326
+ G +G+AGLL L S+ ++ A + +YCL + + G L A
Sbjct: 253 EGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSHGFLAIGEADVPHN 312
Query: 327 VT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
T APL+ + Y + L G S+GG+ + IPP A +++D
Sbjct: 313 RTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPIPPHAATASAA----MVLDTALPY 368
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-SVRVPTVSLHF-----GA 434
T ++ Y LRD+F R + DTCY+F+G+R V +P V L F G
Sbjct: 369 TYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGIGGGG 428
Query: 435 GKALDLPAKNYLIPVDSAGTF----CFAFAPTSS-------ALSIIGNVQQQGTRVSFDL 483
G + + + + G F C AFA S ++G + Q V D+
Sbjct: 429 GGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGDAEAPLAMVMGTLAQSSMEVVHDV 488
Query: 484 ANNRVGFTPNKC 495
++GF P C
Sbjct: 489 PGGKIGFIPGSC 500
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 131/265 (49%), Gaps = 19/265 (7%)
Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
G + T G L T+T +FG + +V G+ GC + G F G++G++G+G G LSL Q++
Sbjct: 124 GSAANTSGYLATDTFTFGAT-AVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF 182
Query: 299 TSLAYCLV----DRDSPASGVLEFNSARGGDAV-------TAPLIRNKKVDTFYYVGLTG 347
+Y L+ D A V+ F G DAV + PL+ + FYYV LTG
Sbjct: 183 GKFSYQLLAPEATDDGSADSVIRF----GDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTG 238
Query: 348 FSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS 406
V G + IP F++ G GG+I+ T +T L+ AY+ +R + G
Sbjct: 239 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNG 298
Query: 407 GVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA 465
AL D CY+ S + V+VP ++L F G +DL A NY + G C P+
Sbjct: 299 SAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG 358
Query: 466 LSIIGNVQQQGTRVSFDLANNRVGF 490
S++G + Q GT + +D+ R+ F
Sbjct: 359 -SVLGTLLQTGTNMIYDVDAGRLTF 382
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 119/415 (28%), Positives = 178/415 (42%), Gaps = 90/415 (21%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--------QSDP--IFDPKTS 206
G Y + +GTPP+ ++LDTGS ++W+ PCT YQ + P +F PK S
Sbjct: 87 GGYAFTVSLGTPPQPLPVLLDTGSHLSWV---PCTSSYQCRNCSSLSAASPLHVFHPKNS 143
Query: 207 SSYSPLPCAAPQCKSL----DVSACRA-----------------NRC-LYQVAYGDGSFT 244
SS + C P C + +S CRA N C Y V YG GS T
Sbjct: 144 SSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGS-T 202
Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
G L+++T+ +V+ +GC + +GL G G G S+ Q+ T +YC
Sbjct: 203 AGLLISDTLRTPGR-AVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYC 259
Query: 305 LVDRDSPASGVLEFNSARGGDAVT--------------APLIRNKKV----DTFYYVGLT 346
L+ R + N+A G+ + APL R+ +YY+ LT
Sbjct: 260 LLSRR------FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALT 313
Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS 406
+VGG++VQ+P F + GG IVD GT + + + + V G S
Sbjct: 314 AITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRS 372
Query: 407 GVAL----FDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLI---PVDSAG----- 453
V C+ G +++ +P +SLHF G ++LP +NY + P S G
Sbjct: 373 KVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMA 432
Query: 454 -TFCFAF---APTSSALS---------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A PTSS + I+G+ QQQ + +DL R+GF +C
Sbjct: 433 EAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 165/375 (44%), Gaps = 50/375 (13%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y+++IG+GTP + + + +DTGSDI W+ C C +C ++S +++ S S
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKL 137
Query: 212 LPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK---- 262
+ C C + +S C+AN C Y YGDGS T G V + V + + +G +K
Sbjct: 138 VSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197
Query: 263 --GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
+ GCG G S G+LG G S+ Q+ ++ A+CL R+
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN- 256
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD- 369
G+ PL+ N+ Y V +T VG + + IP LF + GD
Sbjct: 257 -GGGIFAIGRVVQPKVNMTPLVPNQP---HYNVNMTAVQVGQEFLTIPADLF---QPGDR 309
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVP 426
G I+D GT + L Y L V+ + +P V + D C+ +SG P
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFP 365
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRVS 480
V+ HF L + +YL P G +C + ++ ++++G++ V
Sbjct: 366 NVTFHFENSVFLRVYPHDYLFP--HEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVL 423
Query: 481 FDLANNRVGFTPNKC 495
+DL N +G+T C
Sbjct: 424 YDLENQLIGWTEYNC 438
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 167/372 (44%), Gaps = 48/372 (12%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC----------YQQSDPIFDPKT 205
+G Y +R+ +GTP ++F++++D+GS + ++ C C +C + DP F P
Sbjct: 89 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 148
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK- 262
SS+YSP+ C ++D + C R C Y+ Y + S + G L + +SFG +K
Sbjct: 149 SSTYSPVKC------NVDCT-CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 201
Query: 263 -GIALGCGHDNEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASG 314
GC + G LF A G++GLG G LS+ Q + + S + C D G
Sbjct: 202 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGG 260
Query: 315 VLEFNSARGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
+ GG ++ N +Y + L V G+A+++ P +F G
Sbjct: 261 TMVL----GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKH----G 312
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV---- 425
++D GT L QA+ + +D+ +LK G D C+ +G ++
Sbjct: 313 TVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVF 372
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDL 483
P V + FG G+ L L +NYL G +C F +++G + + T V++D
Sbjct: 373 PDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 432
Query: 484 ANNRVGFTPNKC 495
N ++GF C
Sbjct: 433 HNEKIGFWKTNC 444
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 167/372 (44%), Gaps = 48/372 (12%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC----------YQQSDPIFDPKT 205
+G Y +R+ +GTP ++F++++D+GS + ++ C C +C + DP F P
Sbjct: 88 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 147
Query: 206 SSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK- 262
SS+YSP+ C ++D + C R C Y+ Y + S + G L + +SFG +K
Sbjct: 148 SSTYSPVKC------NVDCT-CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 200
Query: 263 -GIALGCGHDNEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASG 314
GC + G LF A G++GLG G LS+ Q + + S + C D G
Sbjct: 201 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGG 259
Query: 315 VLEFNSARGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
+ GG ++ N +Y + L V G+A+++ P +F G
Sbjct: 260 TMVL----GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKH----G 311
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV---- 425
++D GT L QA+ + +D+ +LK G D C+ +G ++
Sbjct: 312 TVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVF 371
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDL 483
P V + FG G+ L L +NYL G +C F +++G + + T V++D
Sbjct: 372 PDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 431
Query: 484 ANNRVGFTPNKC 495
N ++GF C
Sbjct: 432 HNEKIGFWKTNC 443
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 160/387 (41%), Gaps = 54/387 (13%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP----IFDPKTSSS 208
G Y + GTP + S V+DTGS + W C CT C + DP F PK SSS
Sbjct: 88 GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147
Query: 209 YSPLPCAAPQCKSLDVSACRANRC---------------LYQVAYGDGSFTVGDLVTETV 253
+ C P+C + S R RC Y + YG G+ TVG L+ E++
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRT-RCPGCDQNSANCTKACPTYAIQYGLGT-TVGLLLLESL 205
Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DS 310
F + +GC + +G+ G G G SL KQ+ +YCL+ DS
Sbjct: 206 VFAER-TEPDFVVGCSILSSR---QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDS 261
Query: 311 PASGVLEF-------NSARGGDAVTA----PLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
P S + + GG + T P+ N +YYV L VG + V++P
Sbjct: 262 PKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPY 321
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYD 416
S G+GG IVD G+ T ++ + ++ F R N + V C++
Sbjct: 322 SFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFN 381
Query: 417 FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------I 468
SG+ SV +P++ F G ++LP NY V C + S I
Sbjct: 382 LSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSII 441
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GN Q Q +DL N R GF +C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 169/370 (45%), Gaps = 43/370 (11%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSPLP 213
Y++R+ +G+PPR F + +DTGSD+ W+ C C C S P+ FDP +S + S +
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 214 CAAPQC-----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN--SGSVKG--- 263
C+ +C S V A + N+C Y YGDGS T G V++ + F GSV
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209
Query: 264 --IALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPA 312
I GC G G+ G G +S+ Q+ + + ++CL DS
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDS-G 268
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
G+L + V PL+ ++ Y + L V GQ + I PS+F + + G
Sbjct: 269 GGILVLGEIVEPNIVYTPLVPSQP---HYNLNLQSIYVNGQTLAIDPSVFA--TSSNQGT 323
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALF--DTCYDFSGLRSVRVPTVS 429
I+D GT + L AY D F+ + + P+ L + CY S + P VS
Sbjct: 324 IIDSGTTLAYLTEAAY----DPFISAITSTVSPSVSPYLSKGNQCYLTSSSINDVFPQVS 379
Query: 430 LHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
L+F G ++ L ++YLI ++ A +C F ++I+G++ + +D+A
Sbjct: 380 LNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYDIAG 439
Query: 486 NRVGFTPNKC 495
R+G+ C
Sbjct: 440 QRIGWANYDC 449
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 164/384 (42%), Gaps = 45/384 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--------------------CYQQ 196
G Y + +GTP +++VLDT +D+ W+ CR +
Sbjct: 123 GMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEA 182
Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGDGSFTVG----DL 248
S + P SSS+ + C+ +C L + C +A C Y DG+ T+G +
Sbjct: 183 SKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEK 242
Query: 249 VTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATSLAYC 304
T TVS G + G+ LGC G V + G+L LG G +S ++C
Sbjct: 243 ATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQRFSFC 302
Query: 305 LVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
L+ +S AS L F + G + ++ N V Y +TG VGG+ + IP
Sbjct: 303 LLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERLDIPD 362
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD--F 417
+++ + GG+I+D T++T L +AY + + R +L + F+ CY F
Sbjct: 363 EVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCYKWTF 422
Query: 418 SG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP-TSSALSIIGN 471
+G +V +P+ ++ G L+ AK+ ++P G C AF I+GN
Sbjct: 423 TGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPGILGN 482
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
V Q D + ++ F +KC
Sbjct: 483 VFMQEYIWEIDHGDGKIRFRKDKC 506
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 98/335 (29%), Positives = 155/335 (46%), Gaps = 42/335 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPP +F++ +DTGSD+ W+ C C+ C Q S FDP +SS+ S
Sbjct: 23 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 82
Query: 212 LPCAAPQC----KSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
+ C+ +C +S D + + + N+C Y YGDGS T G V++ + +
Sbjct: 83 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 142
Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
S + GC + G S G+ G G +S+ Q+ + +A +CL DS
Sbjct: 143 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL-KGDS 201
Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
G+L + V L+ + Y + L +V GQ +QI S+F +
Sbjct: 202 SGGGILVLGEIVEPNIVYTSLVPAQP---HYNLNLQSIAVNGQTLQIDSSVFATSNS--R 256
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPT 427
G IVD GT + L +AY D FV P S V+ + CY + + P
Sbjct: 257 GTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTAVSRGNQCYLITSSVTEVFPQ 312
Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAF 459
VSL+F G ++ L ++YLI +S G +C F
Sbjct: 313 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGF 347
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 115/403 (28%), Positives = 182/403 (45%), Gaps = 47/403 (11%)
Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQFSMVLDTGSDI 182
R L+ A L + F VV + QGS G YF+++ +G+PPR+F++ +DTGSD+
Sbjct: 33 RDRLRHAR---LLQGFVGGVVDFSVQGSPDPYLVGLYFTKVKLGSPPREFNVQIDTGSDV 89
Query: 183 NWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKS---LDVSAC--RANRC 232
W+ C C C + S FD +SS+ + C+ P C S V+ C + N+C
Sbjct: 90 LWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQTNQC 149
Query: 233 LYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL---GCGHDNEGLFVGS----AG 281
Y Y DGS T G V++T+ F G S V AL GC G + G
Sbjct: 150 SYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDG 209
Query: 282 LLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
+ G G G LS+ Q+ + ++CL ++ G V +PL+ ++
Sbjct: 210 IFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIGGGILVLGEILEPG-MVYSPLVPSQP 268
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
Y + L +V G+ + I PS+F + G IVD GT + L +AY+ S V
Sbjct: 269 ---HYNLNLQSIAVNGKLLPIDPSVFATSNS--QGTIVDSGTTLAYLVAEAYDPFV-SAV 322
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD-SAG-- 453
+ + T ++ + CY S S P S +F G ++ L ++YLIP S G
Sbjct: 323 NVIVSPSVTPIISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGS 382
Query: 454 -TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+C F ++I+G++ + +DL R+G+ C
Sbjct: 383 VMWCIGFQKV-QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 424
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 115/456 (25%), Positives = 190/456 (41%), Gaps = 58/456 (12%)
Query: 94 RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE------DFSTP 147
R N +R++ L R + + + R + K E+ LPE F P
Sbjct: 57 RRNYFRAMEAKDLFRHQQMIKMMGNGSGTGSASSRRRQAK--ESSKLPEVMSATSMFELP 114
Query: 148 VVSGASQGS-GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR------------------ 188
+ S + G Y + GTP +++VLDT +D+ W+ CR
Sbjct: 115 MRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAG 174
Query: 189 ----PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGD 240
E +++ + P SSS+ + C+ +C L + C +A C Y D
Sbjct: 175 DDGAAAKEARRKN--WYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQD 232
Query: 241 GSFTVG----DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQ 295
G+ T+G + T TVS G + G+ LGC G V + G+L LG G +S
Sbjct: 233 GTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVH 292
Query: 296 IK---ATSLAYCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTG 347
++CL+ +S AS L F + G + ++ N V Y +TG
Sbjct: 293 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTG 352
Query: 348 FSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
VGG+ + IP +++ ++ GG+I+D T++T L +AY ++ + R +L
Sbjct: 353 IFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYE 412
Query: 408 VALFDTCYD--FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
+ F+ CY F+G +V VP +++ G L+ AK+ ++P G C AF
Sbjct: 413 LDGFEYCYRWTFAGDGVDLTHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFR 472
Query: 461 PT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I+GNV Q D ++ F +KC
Sbjct: 473 KLPRGGPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 118/412 (28%), Positives = 178/412 (43%), Gaps = 84/412 (20%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWL------QCRPCTECYQQSD-PIFDPKTSSSY 209
G Y + +GTPP+ ++LDTGS ++W+ QCR C+ S +F PK SSS
Sbjct: 87 GGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSS 146
Query: 210 SPLPCAAPQCKSL----DVSACRA-----------------NRC-LYQVAYGDGSFTVGD 247
+ C P C + +S CRA N C Y V YG GS T G
Sbjct: 147 RLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGS-TAGL 205
Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD 307
L+++T+ +V+ +GC + + +GL G G G S+ Q+ T +YCL+
Sbjct: 206 LISDTLRTPGR-AVRNFVIGCSLAS--VHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLLS 262
Query: 308 RDSPASGVLEFNSARGGDAVT--------------APLIRNKKV----DTFYYVGLTGFS 349
R + N+A G+ + APL R+ +YY+ LT +
Sbjct: 263 RR------FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAIT 316
Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
VGG++VQ+P F + GG IVD GT + + + + V G S V
Sbjct: 317 VGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVV 375
Query: 410 L----FDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLI---PVDSAG------TF 455
C+ G +++ +P +SLHF G ++LP +NY + P S G
Sbjct: 376 EEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAI 435
Query: 456 CFAF---APTSSALS---------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
C A PTSS + I+G+ QQQ + +DL R+GF +C
Sbjct: 436 CLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 166/378 (43%), Gaps = 55/378 (14%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G Y++ I +GTPP+Q+ + +DTGSDI W+ C C +C ++SD ++DPK SSS S
Sbjct: 80 TGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGS 139
Query: 211 PLPCAAPQCKSL---DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--- 263
+ C C + + C N C Y V YGDGS T G V++++ + N S G
Sbjct: 140 TVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQY-NQVSGDGQTR 198
Query: 264 -----IALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDR 308
+ GCG +G +GS G++G G S+ Q+ A ++CL
Sbjct: 199 HANASVIFGCGA-QQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL--D 255
Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
G+ + PL+ + Y V L +VGG +Q+P +FE E
Sbjct: 256 TIKGGGIFAIGDVVQPKVKSTPLVPDMP---HYNVNLESINVGGTTLQLPSHMFETGEK- 311
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR---- 424
G I+D GT +T L Y +D + T+ F + DF ++ +
Sbjct: 312 -KGTIIDSGTTLTYLPELVY---KDVLAAVFAKHPDTT----FHSVQDFLCIQYFQSVDD 363
Query: 425 -VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGT 477
P ++ HF L++ +Y + +CF F + + ++G++
Sbjct: 364 GFPKITFHFEDDLGLNVYPHDYFFQ-NGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNK 422
Query: 478 RVSFDLANNRVGFTPNKC 495
V +DL N VG+T C
Sbjct: 423 VVVYDLENQVVGWTDYNC 440
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 156/330 (47%), Gaps = 38/330 (11%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++++ +GTPPR F + +DTGSD+ W+ C C C Q S FDP +S + SP
Sbjct: 79 GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138
Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
+ C+ +C +S D S C + N C Y YGDGS T G V++ + F G+S
Sbjct: 139 ISCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
S + GC G V S G+ G G +S+ Q+ + +A +CL +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGEN 257
Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
G+L + V PL+ ++ Y V L SV GQA+ I PS+F
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311
Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
G I+D GT + L AY ++ ++ +++P V+ + CY + P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVITTSVGDIFPPV 369
Query: 429 SLHFGAGKALDLPAKNYLIPVDS-AGTFCF 457
SL+F G ++ L ++YLI ++ A CF
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVASALCF 399
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/353 (28%), Positives = 161/353 (45%), Gaps = 41/353 (11%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+G+GTP ++V DT SD+ W QC+PC C Q+ ++DP + +Y+ L ++
Sbjct: 92 LGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSSS------ 145
Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-- 280
Y Y SFT G TET + GN +V I GCG N+G + A
Sbjct: 146 -----------YNYTYSKQSFTSGYFATETFALGNV-TVANITFGCGTRNQGYYDNVAGV 193
Query: 281 -GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA-------RGGDAVTAPLI 332
G+ G G +SL Q+ +YC +P S + + A + P++
Sbjct: 194 FGVGRGGRGGVSLLNQLGIDRFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMV 253
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+ + + Y+V L G +VG V + + E G +++D + +T L Y +R
Sbjct: 254 ADPVLKSGYFVKLVGVTVGATLVDVAGA--SSAEGGGRALVIDSTSPVTVLDEATYGPVR 311
Query: 393 DSFV-RLA----GNLKPTSGVALFDTCYDFSGLRSVRVP---TVSLHFGAGKA-LDLPAK 443
+ V +LA N ++GV L D C++ + + P T++LHF G A L LP
Sbjct: 312 RALVAQLAPLKEANANASAGVGL-DLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPA 370
Query: 444 NYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+YL + G C P+SS + ++G+ T V +DLA N V F P C
Sbjct: 371 SYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQPLDC 423
>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
Length = 555
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 163/393 (41%), Gaps = 57/393 (14%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR--------------------------PC 190
G Y + GTP +++VLDT +D+ W+ CR
Sbjct: 138 GMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKTMSVGGDDDVVAA 197
Query: 191 TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA----NRCLYQVAYGDGSFTVG 246
+ + P SSS+ + C+ QC L + C++ C Y DG+ T+G
Sbjct: 198 LAKKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSYYQKTQDGTVTIG 257
Query: 247 ----DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKAT-- 299
+ T TVS G + G+ LGC G V + G+L LG G +S I A
Sbjct: 258 IYGNEKATVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSFA--IHAVLR 315
Query: 300 ---SLAYCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
++CL+ +S AS L F + G + ++ N V Y +T VG
Sbjct: 316 FGGRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVG 375
Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
G+ + IP ++ +D+ G+I+D T++T L +AY L + R +L P A F
Sbjct: 376 GERLDIPDDVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLAHL-PRESFAGF 434
Query: 412 DTCYD--FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA--PT 462
+ CY F+G +V +P V++ G L+ AK+ ++P G C AF P
Sbjct: 435 EYCYRWTFTGDGVDPAHNVTIPKVTVEMTGGARLEPEAKSVVMPEVGHGVACLAFRKLPW 494
Query: 463 SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
IIGNV Q D + F +KC
Sbjct: 495 GGGPCIIGNVLMQEYIWEIDHSKATFRFRKDKC 527
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 166/386 (43%), Gaps = 49/386 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR----------------------PCTECY 194
G Y + GTP +++VLDT +D+ W+ CR E
Sbjct: 125 GMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEAR 184
Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGDGSFTVG---- 246
+++ + P SSS+ + C+ +C L + C +A C Y DG+ T+G
Sbjct: 185 RKN--WYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGK 242
Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATSLA 302
+ T TVS G + G+ LGC G V + G+L LG G +S +
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFS 302
Query: 303 YCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
+CL+ +S AS L F + G + ++ N V Y +TG VGG+ + I
Sbjct: 303 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDI 362
Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD- 416
P +++ ++ GG+I+D T++T L +AY ++ + R +L + F+ CY
Sbjct: 363 PQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRW 422
Query: 417 -FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSII 469
F+G +V VP +++ G L+ AK+ ++P G C AF I+
Sbjct: 423 TFAGDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGIL 482
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
GNV Q D ++ F +KC
Sbjct: 483 GNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 164/388 (42%), Gaps = 49/388 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR------------------------PCTE 192
G Y + +GTP +++VLDT +D+ W+ CR
Sbjct: 122 GMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAA 181
Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGDGSFTVG-- 246
+ S + P SSS+ + C+ +C L + C +A C Y DG+ T+G
Sbjct: 182 KKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIY 241
Query: 247 --DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATS 300
+ T TVS G + G+ LGC G V + G+L LG G +S
Sbjct: 242 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQR 301
Query: 301 LAYCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
++CL+ +S AS L F + G + ++ N V Y +TG VGG+ +
Sbjct: 302 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGGERL 361
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
IP +++ + GG+I+D T++T L +AY + + R +L + F+ CY
Sbjct: 362 DIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCY 421
Query: 416 D--FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP-TSSALS 467
F+G +V +P+ ++ G L+ AK+ ++P G C AF
Sbjct: 422 KWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPG 481
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I+GNV Q D + ++ F +KC
Sbjct: 482 ILGNVFMQEYIWEIDHGDGKIRFRKDKC 509
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 154/362 (42%), Gaps = 39/362 (10%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y + +GTPP+ S ++D ++ W QC C C++Q P+F P SS++ P PC
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121
Query: 219 CKSLDVSACRANRCLYQ----VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
C+S+ +C + C Y+ G+ T G T+T + G + +V+ +A GC ++
Sbjct: 122 CESIPTRSCSGDVCSYKGPPTQLRGN---TSGFAATDTFAIGTA-TVR-LAFGCVVASDI 176
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTA 329
G +G +GLG SL Q+K T +YCL R++ S L S A G TA
Sbjct: 177 DTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGKSSRLFLGSSAKLAGGESTSTA 236
Query: 330 PLIRNKKVDT---FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQT 385
P I+ D +Y + L G + A GGI+V + + L
Sbjct: 237 PFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVD 287
Query: 386 QAYNSLRDSFVRLAGN---LKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGAGKALDLP 441
AY + + + G + FD C+ +G P + F AL +P
Sbjct: 288 SAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVP 347
Query: 442 AKNYLIPV-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
YLI V + T C A + +S++G++QQ+ +DL + F P
Sbjct: 348 PAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPA 407
Query: 494 KC 495
C
Sbjct: 408 DC 409
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 161/376 (42%), Gaps = 40/376 (10%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G + G Y+++IG+GTP R + + +DTGSDI W+ C C EC ++S ++D K
Sbjct: 89 TGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIK 148
Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
S + + C C +++ S C AN C Y Y DGS + G V + V + SG
Sbjct: 149 ESLTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSG 208
Query: 260 SVK------GIALGCGHDNEGLFVGSA---GLLGLGGGMLSLTKQIKATS-----LAYCL 305
++ + GC G G+LG G S+ Q+ ++ A+CL
Sbjct: 209 DLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL 268
Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+ G+ T PL+ N+ T Y V + VGG + +P +F++
Sbjct: 269 DGLN--GGGIFAIGHIVQPKVNTTPLVPNQ---THYNVNMKAVEVGGYFLNLPTDVFDVG 323
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
+ G I+D GT + L Y+ L +LK + F TC+ +S
Sbjct: 324 DK--KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF-TCFQYSESLDDGF 380
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRV 479
P V+ HF L + YL D G +C + + ++++G++ V
Sbjct: 381 PAVTFHFENSLYLKVHPHEYLFSYD--GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLV 438
Query: 480 SFDLANNRVGFTPNKC 495
+DL N +G+T C
Sbjct: 439 LYDLENQVIGWTEYNC 454
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 91/311 (29%), Positives = 153/311 (49%), Gaps = 36/311 (11%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
+G Y +RI +GTPP+ F++++DTGS + ++ C C +C + DP F+P+ SS+Y P+ C
Sbjct: 87 NGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSC- 145
Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
++D + C R C+Y+ Y + S + G L + +SFGN + + GC +
Sbjct: 146 -----NIDCT-CDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQ 199
Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPASGVLEFNSAR 322
G L+ A G++GLG G LS+ Q+ + SL Y +D A + +
Sbjct: 200 ETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISPPS 259
Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
G + +R++ +Y + L V G+ + + PS+F+ G G ++D GT
Sbjct: 260 GMVFAESDPVRSQ----YYNIDLKAIHVAGKQLHLDPSIFD----GKHGTVLDSGTTYAY 311
Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAG 435
L A+ + +D+ ++ +LK G D C+ D S L S P V + F G
Sbjct: 312 LPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQL-SNTFPAVEMVFSNG 370
Query: 436 KALDLPAKNYL 446
+ L L +NYL
Sbjct: 371 QKLSLSPENYL 381
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 161/394 (40%), Gaps = 80/394 (20%)
Query: 158 EYFSRIGVGTP--PRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP 213
+Y + VG P S+ LDTGSD+ W C P C C ++ P + SPLP
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATP-----GGNHSSPLP 141
Query: 214 ---------CAAPQCKSLDVSA-----CRANRC-----------------LYQVAYGDGS 242
CA+P C + SA C A RC LY AYGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY-YAYGDGS 200
Query: 243 FTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLA 302
V +L V S +V+ C H VG AG G G LSL Q+ A SL+
Sbjct: 201 L-VANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQL-APSLS 255
Query: 303 YCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
D+ A G E D V PL+ N K FY V L SVGG+ +Q P L
Sbjct: 256 G---STDAAAIGASET------DFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELG 306
Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG--------NLKPTSGVALFDTC 414
++D G+GG++VD GT T L + + + D F R + +G+A C
Sbjct: 307 DVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLA---PC 363
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFAPT--------- 462
Y +S VP V+LHF + LP +NY + S C
Sbjct: 364 YHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGED 422
Query: 463 -SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GN QQQG V +D+ RVGF +C
Sbjct: 423 GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 456
>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
Length = 507
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 152/342 (44%), Gaps = 47/342 (13%)
Query: 173 SMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKSLDV---SAC 227
++VLDT SD+ W+QC P +DP SS+Y L C + C L AC
Sbjct: 125 TVVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGAC 184
Query: 228 RANRCLYQVAYGDGSF------TVGDLVTETVSFGNSGSVKGIALGCGHDN-----EG-L 275
N+C Y+V T G + + + G+ GC H EG +
Sbjct: 185 VNNQCQYRVPIPSSPASSSSSGTYGSDLLKLTADPADGASMSFKFGCSHGEAKQGGEGSI 244
Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDS--PASGVLEFN----SARGGDA 326
+AG++ LGGG SL Q A ++ +YC+ +S P VL S GG A
Sbjct: 245 DNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGDLSGAGGYA 304
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
VT P++R +V T Y V L +V GQ + + PS+F G ++D TAITRL
Sbjct: 305 VT-PMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVFA------SGSVLDSRTAITRLPPT 357
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
AY +LR++F + DTCYDF+G V VP V+L L N +
Sbjct: 358 AYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVAL---------LLDGNAV 408
Query: 447 IPVDSAGTF---CFAFAPTSS--ALSIIGNVQQQGTRVSFDL 483
+ +D G C F + I+GNVQQQ V +++
Sbjct: 409 VALDRQGILFHDCLVFTSNTDDRMPGILGNVQQQTMEVLYNV 450
>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 521
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 162/358 (45%), Gaps = 55/358 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
+ VG+PP++ +MVLDTGS+++WL C+ + IF+P SSSY+P PC +P C +
Sbjct: 40 LTVGSPPQRVTMVLDTGSELSWLHCKKLPNL----NFIFNPLVSSSYTPTPCTSPICTTQ 95
Query: 222 ----LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVKGIALGCGHDNEGL 275
++ +C AN+ + + +F VG + FG ++G+ G D +
Sbjct: 96 TRDLINPVSCDANKLCHII-----TFFVGGPAQRGMVFGCMDTGTSSG-------DEDS- 142
Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE--FNSARGGDAVTAPLIR 333
+ GL+G+ G LS + Q++ +YC+ ++DS VLE N R G PL++
Sbjct: 143 --KTTGLMGMDLGSLSFSNQMRLPKFSYCISNKDSTGVLVLENIANPPRLGPLHYTPLVK 200
Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
++ F S F D G G +VD T T L+ Y +L++
Sbjct: 201 KTTPLPYFNRNCCLFQK---------SAFLPDHTGAGQTMVDSATQFTFLRQPVYTALKN 251
Query: 394 SFVRLAGNLKPTSGVALF------DTCYDFSGLRSVRV-PTVSLHFGAGKALDLPAKNYL 446
F N+ G F D C+ ++ V P V+L F G L + + L
Sbjct: 252 EFAIQTKNILTPLGDPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFD-GAELRVTGERLL 310
Query: 447 IPVDSAGT-----FCFAFAPTSSALS----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
V + +CF F S L IIG+ Q+ + +DLAN+R+GF+ C
Sbjct: 311 YKVSNVAKSNSWIYCFTFG-NSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 165/378 (43%), Gaps = 42/378 (11%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G +G Y++++G+G+P ++F + +DTGSDI W+ C CT C ++S ++DP
Sbjct: 63 NGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPN 122
Query: 205 TSSSYSPLPCAAPQCK---SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
S + + +PC C S +S C+ + C Y + YGDGS T G V ++++F SG
Sbjct: 123 GSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSG 182
Query: 260 SVK------GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
++ + GCG G ++ G++G G S+ Q+ A+ ++
Sbjct: 183 NLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSH 242
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
CL G+ T PL+ Y V L V G+ + +P LF
Sbjct: 243 CL--DSHHGGGIFSIGQVMEPKFNTTPLVPRM---AHYNVILKDMDVDGEPILLPLYLF- 296
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
D G I+D GT + L YN L + LK F TC+ +S
Sbjct: 297 -DSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQF-TCFHYSDKLDE 354
Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGT 477
P V HF G +L + +YL + +C + +S+ L +IG++
Sbjct: 355 GFPVVKFHF-EGLSLTVHPHDYLF-LYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNK 412
Query: 478 RVSFDLANNRVGFTPNKC 495
V +DL N +G+T C
Sbjct: 413 LVVYDLENMVIGWTNFNC 430
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 100/401 (24%), Positives = 169/401 (42%), Gaps = 44/401 (10%)
Query: 122 LAIYNVDRHELKPAEAQILP-EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
L ++ +RH + A LP F+ P G+G Y++ IG+GTP ++ + LDTGS
Sbjct: 51 LQTHDENRHRRRNLMAAELPLGGFNIPY------GTGLYYTDIGIGTPAVKYYVQLDTGS 104
Query: 181 DINWLQCRPCTECYQQSDPI-----FDPKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLY 234
W+ C +C +SD + +DP++S S + C C S C RC Y
Sbjct: 105 KAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR--PPCNMTLRCPY 162
Query: 235 QVAYGDGSFTVGDLVTETVS----FGNSG---SVKGIALGCGHDNEGLFVGSA----GLL 283
Y DG T+G L T+ + +GN + + GCG G SA G++
Sbjct: 163 ITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGII 222
Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
G G + Q+ A ++CL + G+ T P+++N +V
Sbjct: 223 GFGNSNQTALSQLAAAGKTKKIFSHCL--DSTNGGGIFAIGEVVEPKVKTTPIVKNNEV- 279
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
++ V L +V G +Q+P ++F + G +D G+ + L Y+ L
Sbjct: 280 -YHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGSTLVYLPEIIYSEL--ILAVF 334
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
A + T G C+ F G + P ++ HF LD+ +YL+ + +CF
Sbjct: 335 AKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYE-GNQYCFG 393
Query: 459 FAPTS----SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
F + I+G++ V +D+ +G+T + C
Sbjct: 394 FQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHNC 434
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 161/376 (42%), Gaps = 40/376 (10%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G + G Y+++IG+GTP R + + +DTGSDI W+ C C EC ++S ++D K
Sbjct: 89 TGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIK 148
Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
S + + C C +++ S C AN C Y Y DGS + G V + V + SG
Sbjct: 149 ESLTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSG 208
Query: 260 SVK------GIALGCGHDNEGLFVGSA---GLLGLGGGMLSLTKQIKATS-----LAYCL 305
++ + GC G G+LG G S+ Q+ ++ A+CL
Sbjct: 209 DLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL 268
Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
+ G+ T PL+ N+ T Y V + VGG + +P +F++
Sbjct: 269 DGLN--GGGIFAIGHIVQPKVNTTPLVPNQ---THYNVNMKAVEVGGYFLNLPTDVFDVG 323
Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
+ G I+D GT + L Y+ L +LK + F TC+ +S
Sbjct: 324 DK--KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF-TCFQYSESLDDGF 380
Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRV 479
P V+ HF L + YL D G +C + + ++++G++ V
Sbjct: 381 PAVTFHFENSLYLKVHPHEYLFSYD--GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLV 438
Query: 480 SFDLANNRVGFTPNKC 495
+DL N +G+T C
Sbjct: 439 LYDLENQVIGWTEYNC 454
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 165/386 (42%), Gaps = 58/386 (15%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
SG G Y+++IG+GTPP+ + + +DTGSDI W+ C C EC +S ++D K
Sbjct: 74 SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIK 133
Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
SSS +PC CK ++ ++ C AN C Y YGDGS T G V + V + SG
Sbjct: 134 ESSSGKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 193
Query: 260 SVK------GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
+K I GCG G S G+LG G S+ Q+ ++ A+
Sbjct: 194 DLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAH 253
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLF 362
CL N GG + KV+ T +SV AVQ+ +
Sbjct: 254 CL-------------NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFL 300
Query: 363 EM--DEAGDG---GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--TCY 415
+ D + G G I+D GT + L Y L + +LK + L D TC+
Sbjct: 301 SLSTDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQT---LHDEYTCF 357
Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT------SSALSII 469
+S P V+ F G +L + +YL P S +C + + S ++++
Sbjct: 358 QYSESVDDGFPAVTFFFENGLSLKVYPHDYLFP--SVNFWCIGWQNSGTQSRDSKNMTLL 415
Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
G++ V +DL N +G+ C
Sbjct: 416 GDLVLSNKLVFYDLENQAIGWAEYNC 441
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 154/370 (41%), Gaps = 38/370 (10%)
Query: 138 QILPEDFSTPVVSGASQ----GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC 193
Q +P D G SQ +G Y VGTPP+ + VLD SD W+QC C C
Sbjct: 72 QAVPADGGENGGGGQSQDPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATC 131
Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKSL----DVSACRANRCLYQVAYGDGSF--TVGD 247
+ +P +AP + D A C Y YG G+ T G
Sbjct: 132 -------------GADAPAATSAPPFYAFLSFHDTRAPTTPPCGYSYVYGGGAANTTAGL 178
Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD 307
L + +F G+ GC EG G++GLG G LS Q++ +Y L
Sbjct: 179 LAVDAFAFATV-RADGVIFGCAVATEGDI---GGVIGLGRGELSPVSQLQIGRFSYYLAP 234
Query: 308 RDSPASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
D+ G L+ R AV+ PL+ ++ + YYV L G V G+ + IP F+
Sbjct: 235 DDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFD 294
Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRS 422
+ G GG+++ +T L AY +R + L+ G L D CY L +
Sbjct: 295 LQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI-ELRAADGSELGLDLCYTSESLAT 353
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSF 481
+VP+++L F G ++L NY + G C P+ + S++G++ Q VS
Sbjct: 354 AKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQ----VSL 409
Query: 482 DLANNRVGFT 491
R FT
Sbjct: 410 LSCRRRADFT 419
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 89/170 (52%), Gaps = 3/170 (1%)
Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
VT PLI N +FYY+ L SVG + I S FE+ + G GG+I+D GT IT ++
Sbjct: 23 VTTPLITNPLQPSFYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGTTITYIEEN 82
Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNY 445
A++SL+ F SG D C+ SG V +P + HF G L+LP +NY
Sbjct: 83 AFDSLKKEFTSQTKLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGGD-LELPGENY 141
Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+I S G C A S+ +SI GN+QQQ V+ DL + F P +C
Sbjct: 142 MIADSSLGVACLAMG-ASNGMSIFGNIQQQNILVNHDLQKETITFIPTQC 190
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 118/416 (28%), Positives = 171/416 (41%), Gaps = 84/416 (20%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-----RPCTECYQ--QSDPIFDPKTSSSYSP 211
Y + +GTPP+ F + LDTGSD+ W+ C C +C + P F P S+S +
Sbjct: 25 YLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPSESTSNTR 84
Query: 212 LPCAAPQCKSLDVSACRANRCL--------------------YQVAYGDGSFTVGDLVTE 251
C + C + S R + C + YG G+ +G L +
Sbjct: 85 DLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSRD 144
Query: 252 TVSFGNSGSVKGIALGCGHDNEGL------FVGSA-----GLLGLGGGMLSLTKQIK--A 298
+V+ GS G G G VGS+ G+ G G G LSL Q+
Sbjct: 145 SVTL--HGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREPLGIAGFGRGALSLPSQLGFLG 202
Query: 299 TSLAYCLV--------DRDSP-ASGVLEFNSAR-GGDAVTAPLIRNKKVDTFYYVGLTGF 348
++C + + SP G L +SA G V P++ + FYYVGL G
Sbjct: 203 KGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSATYPNFYYVGLEGV 262
Query: 349 SVG----GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG---- 400
+G G A+ PPSL +D G+GG++VD GT T+L Y S+ S + A
Sbjct: 263 VLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYASVLASLISAAPPYER 322
Query: 401 --NLKPTSGVALFDTCYDFSGLRSV----RVPTVSLHFGAGKALDLPAKNYLIPV----D 450
+L+ +G FD C+ R+ +P ++LH G L LP + PV D
Sbjct: 323 SRDLEARTG---FDLCFKVPCARAPCADDELPPITLHLAGGARLALPKLSSYYPVTAIRD 379
Query: 451 SAGTFCFAF-----------APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
S C F +++G+ Q Q V +DLA RVGF P C
Sbjct: 380 SVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRVGFRPRDC 435
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 149/359 (41%), Gaps = 87/359 (24%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
+ VG+PP+ +MVLDTGS+++WL C+ + +FDP SSSYSP+PC +P
Sbjct: 379 LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSP----- 429
Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
CR T T S + GL
Sbjct: 430 ---TCR---------------------TRTHS-----------------------KTTGL 442
Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD----------AVTAPLI 332
+G+ G LS Q+ +YC+ +DS SG+L F + ++ PL
Sbjct: 443 IGMNRGSLSFVTQMGLQKFSYCISGQDS--SGILLFGESSFSWLKALKYTPLVQISTPLP 500
Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
+V Y V L G V +Q+P S++ D G G +VD GT T L Y +L+
Sbjct: 501 YFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 558
Query: 393 DSFVR-LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--VPTVSLHF-GAGKALDLPAK 443
+ FVR +LK D CY R +PTV+L F GA ++
Sbjct: 559 NEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERL 618
Query: 444 NYLIP---VDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
Y +P S +CF F S L IIG+ QQ + FDLA +RVGF +C
Sbjct: 619 MYRVPGVIRGSDSVYCFTFG-NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 676
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 159/387 (41%), Gaps = 54/387 (13%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP----IFDPKTSSS 208
G Y + GTP + S V+DTGS + W C CT C + DP F PK SSS
Sbjct: 88 GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147
Query: 209 YSPLPCAAPQCKSLDVSACRANRC---------------LYQVAYGDGSFTVGDLVTETV 253
+ C P+C + S R RC Y + YG G+ TVG L+ E++
Sbjct: 148 AKIVGCLNPKCGFVMDSEVR-TRCPGCDQNSANCTKACPTYAIQYGLGT-TVGLLLLESL 205
Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DS 310
F + +GC + +G+ G G G SL KQ+ +YCL+ DS
Sbjct: 206 VFAER-TEPDFVVGCSILSSR---QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDS 261
Query: 311 PASGVLEF-------NSARGGDAVTA----PLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
P S + + GG + T P+ N +YYV L VG + V+ P
Sbjct: 262 PKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPY 321
Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYD 416
S G+GG IVD G+ T ++ + ++ F R N + V C++
Sbjct: 322 SFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFN 381
Query: 417 FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------I 468
SG+ SV +P++ F G ++LP NY V C + S I
Sbjct: 382 LSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSII 441
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
+GN Q Q +DL N R GF +C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 163/387 (42%), Gaps = 57/387 (14%)
Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
+G YF+ I +GTPP+++ + +DTGSDI W+ C C++C ++S +DPK SSS S
Sbjct: 84 TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGS 143
Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
+ C C + + C AN C Y V YGDGS T G +T+ + F G+ +
Sbjct: 144 TVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQP 203
Query: 263 G---IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
G I GCG G S G+LG G S+ Q+ A A+CL D+
Sbjct: 204 GNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCL---DT 260
Query: 311 PASG-------------VLEFNSARGGDAVTAPLIRNKKV---DTFYYVGLTGFSVGGQA 354
G F A G + PL + Y V L VGG
Sbjct: 261 IKGGGIFAIGNVVQPKCYFVFFFAHG--LLNIPLFLLVMILLSRPHYNVNLKSIDVGGTT 318
Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
+Q+P +FE E G I+D GT +T L + + D ++ + C
Sbjct: 319 LQLPAHVFETGEK--KGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFL--C 374
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSI 468
+ +SG PT++ HF AL + Y P + +C F + + +
Sbjct: 375 FQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFP-NGNDIYCVGFQNGALQSKDGKDIVL 433
Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
+G++ V +DL N +G+T C
Sbjct: 434 MGDLVLSNKLVVYDLENQVIGWTDYNC 460
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 172/377 (45%), Gaps = 60/377 (15%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCTECYQQSDPIFDPKTSSSYSPLPCA 215
G Y+ + +G PPR + + +DTGSD+ WLQC PC C + P++ P + +PC
Sbjct: 56 GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112
Query: 216 APQCKSLD-----VSACRA--NRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVK-GIA 265
C +L C + +C Y++ Y D ++G LVT++ + NS V+ G+A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172
Query: 266 LGCGHDNEGLFVGSA-------GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
GCG+D + VGS+ G+LGLG G +SL Q+K + +CL R
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTR---GG 226
Query: 314 GVLEFNS--ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
G L F A AP+ R+ + +Y G GG+ + + P M+
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARSTSRN-YYSPGSANLYFGGRPLGVRP----ME------ 275
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD----FSGLRSVR-- 424
++ D G++ T Q Y +L D+ L+ NLK +L C+ F + V+
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL-PLCWKGKKPFKSVLDVKKE 334
Query: 425 VPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCFAFAPTSSA----LSIIGNVQQQGTR 478
TV L F GK +++P +NYLI V G C S L+I+G++ Q
Sbjct: 335 FKTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQM 393
Query: 479 VSFDLANNRVGFTPNKC 495
V +D ++G+ C
Sbjct: 394 VIYDNERGQIGWIRAPC 410
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 111/418 (26%), Positives = 170/418 (40%), Gaps = 87/418 (20%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G YF+++ +G+P ++F + +DTGSDI WL C C C + S FD +SS+ +
Sbjct: 69 GLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAAL 128
Query: 212 LPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVG---------DLVTETVSFGN 257
+ C+ P C S C +AN+C Y YGDGS T G D++ F N
Sbjct: 129 VSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFSN 188
Query: 258 SGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLA-----YCLVDR 308
S S + GC G + G+ G G G LS+ Q+ + +A +CL +
Sbjct: 189 SSST--VVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQ 246
Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
S G+L + V PL+ + Y + L +V GQ + I +F
Sbjct: 247 GS-GGGILVLGEILEPNIVYTPLV---PLQPHYNLNLQSIAVNGQILPIDQDVFA--TGN 300
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDS---------FVRLAGNLKPTSG------------ 407
+ G IVD GT + L +AY+ ++ F N+K G
Sbjct: 301 NRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTHFNEPTNNIKYEDGNNNHQSRVKRHY 360
Query: 408 -------------------VALFDTCYDFSGLRSVRVPT--------VSLHFGAGKALDL 440
V+ F G + VPT VSL+F G ++ L
Sbjct: 361 YDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQCYLVPTSLGDIFPLVSLNFMGGASMVL 420
Query: 441 PAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ YLI +D A +C F +I+G++ + +DLAN R+G+T C
Sbjct: 421 KPEQYLIHYGFLDGAAMWCIGFQKVQKGYTILGDLVLKDKIFVYDLANQRIGWTDYDC 478
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/133 (44%), Positives = 80/133 (60%), Gaps = 5/133 (3%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
+D PV S G+GE+ ++ +G P +S +LDTGSD+ W QC PC++CY+Q PI+
Sbjct: 8 KDVQAPV----SAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIY 63
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
DP SS+Y + C + C +L SAC + C Y YGD S T G L ET + +S S+
Sbjct: 64 DPSLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTL-SSQSI 122
Query: 262 KGIALGCGHDNEG 274
IA GCG DNEG
Sbjct: 123 PHIAFGCGQDNEG 135
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 172/377 (45%), Gaps = 60/377 (15%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCTECYQQSDPIFDPKTSSSYSPLPCA 215
G Y+ + +G PPR + + +DTGSD+ WLQC PC C + P++ P + +PC
Sbjct: 56 GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112
Query: 216 APQCKSLD-----VSACRA--NRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVK-GIA 265
C +L C + +C Y++ Y D ++G LVT++ + NS V+ G+A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172
Query: 266 LGCGHDNEGLFVGSA-------GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
GCG+D + VGS+ G+LGLG G +SL Q+K + +CL R
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTR---GG 226
Query: 314 GVLEFNS--ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
G L F A AP+ R+ + +Y G GG+ + + P M+
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARSTSRN-YYSPGSANLYFGGRPLGVRP----ME------ 275
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD----FSGLRSVR-- 424
++ D G++ T Q Y +L D+ L+ NLK +L C+ F + V+
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL-PLCWKGKKPFKSVLDVKKE 334
Query: 425 VPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCFAFAPTSSA----LSIIGNVQQQGTR 478
TV L F GK +++P +NYLI V G C S L+I+G++ Q
Sbjct: 335 FRTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQM 393
Query: 479 VSFDLANNRVGFTPNKC 495
V +D ++G+ C
Sbjct: 394 VIYDNERGQIGWIRAPC 410
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 160/385 (41%), Gaps = 59/385 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR-----PCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
+ VGTPP+ +MVLDTGS+++WL C P T P F+ SSSY +PC +
Sbjct: 59 VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT-------PAFNASGSSSYGAVPCPST 111
Query: 218 QC----KSLDVSA-CR---ANRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVKGIALG 267
C + L V C +N C ++Y D S G L T+T ++ G G G
Sbjct: 112 ACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFG 171
Query: 268 C----------GHDNEGLFVGSA--GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV 315
C + G V A GLLG+ G LS Q AYC+ + P +
Sbjct: 172 CITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLL 231
Query: 316 LEFNSARGGDAVTAPLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
L + PLI ++ + F Y V L G VG + IP S+ D G G
Sbjct: 232 LGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAG 291
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG------VALFDTCYDFSGLR--- 421
+VD GT T L AY +L+ F A L G FD C+ R
Sbjct: 292 QTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAA 351
Query: 422 -SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSIIG 470
S +P V L GA A+ Y++P + G +C F + A +IG
Sbjct: 352 ASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIG 411
Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
+ QQ V +DL N RVGF P +C
Sbjct: 412 HHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 160/385 (41%), Gaps = 59/385 (15%)
Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR-----PCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
+ VGTPP+ +MVLDTGS+++WL C P T P F+ SSSY +PC +
Sbjct: 59 VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT-------PAFNASGSSSYGAVPCPST 111
Query: 218 QC----KSLDVSA-CR---ANRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVKGIALG 267
C + L V C +N C ++Y D S G L T+T ++ G G G
Sbjct: 112 ACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFG 171
Query: 268 C----------GHDNEGLFVGSA--GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV 315
C + G V A GLLG+ G LS Q AYC+ + P +
Sbjct: 172 CITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLL 231
Query: 316 LEFNSARGGDAVTAPLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
L + PLI ++ + F Y V L G VG + IP S+ D G G
Sbjct: 232 LGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAG 291
Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG------VALFDTCYDFSGLR--- 421
+VD GT T L AY +L+ F A L G FD C+ R
Sbjct: 292 QTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAA 351
Query: 422 -SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSIIG 470
S +P V L GA A+ Y++P + G +C F + A +IG
Sbjct: 352 ASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIG 411
Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
+ QQ V +DL N RVGF P +C
Sbjct: 412 HHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 172/377 (45%), Gaps = 60/377 (15%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCTECYQQSDPIFDPKTSSSYSPLPCA 215
G Y+ + +G PPR + + +DTGSD+ WLQC PC C + P++ P + +PC
Sbjct: 56 GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112
Query: 216 APQCKSLD-----VSACRA--NRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVK-GIA 265
C +L C + +C Y++ Y D ++G LVT++ + NS V+ G+A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172
Query: 266 LGCGHDNEGLFVGSA-------GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
GCG+D + VGS+ G+LGLG G +SL Q+K + +CL R
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTR---GG 226
Query: 314 GVLEFNS--ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
G L F A AP+ R+ + +Y G GG+ + + P M+
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARSTSRN-YYSPGSANLYFGGRPLGVRP----ME------ 275
Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD----FSGLRSVR-- 424
++ D G++ T Q Y +L D+ L+ NLK +L C+ F + V+
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL-PLCWKGKKPFKSVLDVKKE 334
Query: 425 VPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCFAFAPTSSA----LSIIGNVQQQGTR 478
TV L F GK +++P +NYLI V G C S L+I+G++ Q
Sbjct: 335 FRTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQM 393
Query: 479 VSFDLANNRVGFTPNKC 495
V +D ++G+ C
Sbjct: 394 VIYDNERGQIGWIRAPC 410
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 151/360 (41%), Gaps = 35/360 (9%)
Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSD-INWLQCRPCTE---CYQQSDPIFDPKTSSSYS 210
G+ EY G GTP +QF++ DT + LQC+PC C+ FDP SSS +
Sbjct: 141 GAFEYHVTAGFGTPVQQFTVGFDTTTTGATQLQCKPCAADEPCHHA----FDPSASSSIA 196
Query: 211 PLPCAAPQC---KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
+PC +P C K +C + + G+ +F L + + + G
Sbjct: 197 HVPCGSPDCPFNKGCSGHSCTLSVSINNTLLGNATFFTDKLTLTPWNIVDDFRFVCLEAG 256
Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQI-----KATSLAYCLVDRDSPASGVLEFNSAR 322
D++ S G+L L SL + A + +YCL S G L + +
Sbjct: 257 FRPDDD-----STGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYPSDV-GFLSLGATK 310
Query: 323 ----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
G PL N+ Y V L G +GG + +P + AG GG I++ T
Sbjct: 311 PELLGRKVSYTPLRSNRHNGNLYVVELVGLGLGGVDLPVPRAAI----AG-GGTILELHT 365
Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
T L+ + Y +LRD F + DTCY+F+ L S VP V+L F G
Sbjct: 366 TFTYLKPKVYAALRDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEF 425
Query: 439 DLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
DL + + F C AF ++IG++ Q T V +D+ +VGF P +C
Sbjct: 426 DLWIDEMMYFPEPGSYFSVGCLAFVAQDGG-AVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 154/372 (41%), Gaps = 41/372 (11%)
Query: 159 YFSRIGVG--------TPPRQFSMVLDTGSDINWLQCRPCTE----CYQQSDPIFDPKTS 206
+ +++GVG T + + +DTG++++W+QC C C+ DP + S
Sbjct: 80 FLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQS 139
Query: 207 SSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
SY P+ C Q + + C+ C Y V YG GS+T G+L ET +F G ++K
Sbjct: 140 KSYKPVSCN--QHSFCEPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALK 197
Query: 263 GIALGCGHDNEGLFVG-------SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA 312
I+ GC D+ + +G+LG+G G S Q+ + S +YC+ ++
Sbjct: 198 SISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTHN 257
Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
+ + + I K Y+V L G SV G + I + + + G G
Sbjct: 258 TYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGC 317
Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF-------DTCYD-FSGLRSVR 424
I+D GT T L +++L + L+ +L + + D CY+ S
Sbjct: 318 IIDAGTLATLLVKPIFDTLHTA---LSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKN 374
Query: 425 VPTVSLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
+P V+ H P +L + FC + S +IIG QQ + +D
Sbjct: 375 LPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSK-TIIGAYQQMKQKFVYDT 433
Query: 484 ANNRVGFTPNKC 495
+ F P C
Sbjct: 434 KARVLSFGPEDC 445
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 156/362 (43%), Gaps = 39/362 (10%)
Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
Y + +GTPP+ S ++D ++ W QC C C++Q P+F P SS++ P PC
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104
Query: 219 CKSLDVSACRANRCLYQ----VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
C+S+ +C + C Y+ G+ T G T+T + G + +V+ +A GC ++
Sbjct: 105 CESIPTRSCSGDVCSYKGPPTQLRGN---TSGFAATDTFAIGTA-TVR-LAFGCVVASDI 159
Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---RGGDAV-TA 329
G +G +GLG SL Q+K T +YCL R++ S L S+ G ++ TA
Sbjct: 160 DTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGKSSRLFLGSSAKLAGSESTSTA 219
Query: 330 PLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQT 385
P I+ D +Y + L G + A GGI+V + + L
Sbjct: 220 PFIKTSPDDDGSNYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVD 270
Query: 386 QAYNSLRDSFVRLAGN---LKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGAGKALDLP 441
AY + + + G + FD C+ +G P + F AL +P
Sbjct: 271 SAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVP 330
Query: 442 AKNYLIPV-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
YLI V + T C A + +S++G++QQ+ +DL + F P
Sbjct: 331 PAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPA 390
Query: 494 KC 495
C
Sbjct: 391 DC 392
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/133 (44%), Positives = 80/133 (60%), Gaps = 5/133 (3%)
Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
+D PV S G+GE+ ++ +G P +S +LDTGSD+ W QC PC++CY+Q PI+
Sbjct: 8 KDVQAPV----SAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIY 63
Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
DP SS+Y + C + C +L SAC + C Y YGD S T G L ET + +S S+
Sbjct: 64 DPSLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTL-SSQSI 122
Query: 262 KGIALGCGHDNEG 274
IA GCG DNEG
Sbjct: 123 PHIAFGCGQDNEG 135
>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
gi|194703714|gb|ACF85941.1| unknown [Zea mays]
Length = 208
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/221 (37%), Positives = 113/221 (51%), Gaps = 21/221 (9%)
Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKK 336
+GLGGG SL Q T + +YCL S +SG L +A G V P++R+ +
Sbjct: 1 MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQ 59
Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
V TFY V L VGG+ + IP S+F G ++D GT ITRL AY++L +F
Sbjct: 60 VPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFK 113
Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
P + DTC+DFSG SV +P+V+L F G + L A ++ + C
Sbjct: 114 AGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNC 167
Query: 457 FAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
AFA S S+L IIGNVQQ+ V +D+ VGF C
Sbjct: 168 LAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 159/373 (42%), Gaps = 59/373 (15%)
Query: 178 TGSDINWL------QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
+GS + W+ +CR C+ + P+F PK SSS + C P C+ + +A A +
Sbjct: 79 SGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATK 138
Query: 232 CL---------------------YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
C Y V YG GS T G L+ +T+ +V G LGC
Sbjct: 139 CRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVPGFVLGC-- 194
Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR----DSPASGVLEFNSARGGDA 326
+ +GL G G G S+ Q+ +YCL+ R ++ SG L GG+
Sbjct: 195 SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEG 254
Query: 327 VT-APLIRNKKVD-----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
+ PL+++ D +YY+ L G +VGG+AV++P F + AG GG IVD GT
Sbjct: 255 MQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTF 314
Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDF-SGLRSVRVPTVSLHFGAG 435
T L + + D+ V G S A C+ G RS+ +P +S HF G
Sbjct: 315 TYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGG 374
Query: 436 KALDLPAKNYLIPVDSAGT--FCFAFAPTSSALS-----------IIGNVQQQGTRVSFD 482
+ LP +NY + C A S S I+G+ QQQ V +D
Sbjct: 375 AVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYD 434
Query: 483 LANNRVGFTPNKC 495
L R+GF C
Sbjct: 435 LEKERLGFRRQSC 447
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 159/398 (39%), Gaps = 80/398 (20%)
Query: 173 SMVLDTGSDINWLQCRP--CTECYQQSD-----PIFDPKTSSSYSPLPCAAPQC------ 219
S+ LDTGSD+ W C+P C C +++ PK S + +P+ C + C
Sbjct: 94 SLYLDTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTATPVSCKSSACSAVHSN 153
Query: 220 --------------KSLDVSACRANRC-LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
+S+++S CR + C + AYGDGS + L +++ S I
Sbjct: 154 LPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSL-IARLYRDSIRLPLSNQTNLI 212
Query: 265 ----ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLV----DRDS 310
GC H +G AG G G+LSL Q+ S +YCLV D D
Sbjct: 213 FNNFTFGCAHTTLAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDR 269
Query: 311 ---PASGVL----------EFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
P+ +L N + V ++ N + FY VGL G S+G + +
Sbjct: 270 VRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCVGLEGISIGRKKIPA 329
Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT---- 413
P L ++D G GG++VD GT T L Y+ + F G + + V +T
Sbjct: 330 PDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERASVIEENTGLSP 389
Query: 414 CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV--------DSAGTFCFAFAPTSS 464
CY F V LHF G G ++ LP +NY C
Sbjct: 390 CYYFDNNVVNVP-RVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKRKVGCLMLMNGGD 448
Query: 465 ALSI-------IGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ +GN QQQG V +DL N RVGF +C
Sbjct: 449 EAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/392 (28%), Positives = 168/392 (42%), Gaps = 66/392 (16%)
Query: 153 SQGSGEY--FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
+Q G Y + +G G R + + LD +++ W+QC+P E + Q P F+P S S+
Sbjct: 78 TQVGGMYSVVTSVGTGAGRRTYVLALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFR 137
Query: 211 PLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTV-GDLVTETVSFGNSGS--- 260
LP C + A R +R C + DGS G L ET++F SG
Sbjct: 138 RLPGNNAFC----LPAPRGHRRTVQDPCKFHSIRLDGSADARGVLSNETLAFAASGQQQT 193
Query: 261 -VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTK--------QIKATSLAYCL-- 305
V G+ +GC H+++G S AG+LGLG SL ++ +YCL
Sbjct: 194 EVTGVVIGCTHNSKGFNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPS 253
Query: 306 -------------VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
D D P + + D+ T+ R Y+V LTG SV G
Sbjct: 254 HGSSSSDHHTFLRFDDDVPNTQHMVSTKIMYMDSTTSRDFRA------YFVSLTGISVAG 307
Query: 353 QAVQIPPSLFEMDEAGD---GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
+ +Q LF+ G G D GT + AYN L+D+ VR +LKP G+
Sbjct: 308 KPLQDVKELFKRHVHGQVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVR---HLKPL-GLQ 363
Query: 410 L----FDTCYDFSGLRSVRVPTVSLHFGAGKA-LDLPAKNYLIPVDSAGTFCFAFAPTSS 464
+ + C+ + +PTV L F +A L LP + + V C A S
Sbjct: 364 IVSGQYHLCFRATSQLWQHLPTVMLQFAETEARLVLPPQRLFVAVGY--DICLAVV-RSY 420
Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTP-NKC 495
++IIG +QQ R +D+ + R+ F P N C
Sbjct: 421 DITIIGAMQQVDKRFVYDVRHGRIYFVPENAC 452
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 114/393 (29%), Positives = 178/393 (45%), Gaps = 49/393 (12%)
Query: 124 IYNVDRHELKPAEAQIL-------PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVL 176
I+ DR ++ A+IL +D +P + G + +G G P + ++++
Sbjct: 87 IFLQDRSRVRSINARILGQYSTEESKDGGSPESMHSLNEDGFFLVNVGFGKPQQNLNLII 146
Query: 177 DTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLY 234
DTGSD W++C C+ C+ + P F+P SSSYS C + + + N Y
Sbjct: 147 DTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSC---------IPSTKTN---Y 194
Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGG----ML 290
+ Y D S++ G V + V+ K GCG G F ++G+LGL G ++
Sbjct: 195 TMNYEDNSYSKGVFVCDEVTLKPDVFPK-FQFGCGDSGGGDFGSASGVLGLAQGEQYSLI 253
Query: 291 SLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA-PLIR-----NKKVDTFYYVG 344
S T +YC ++ G L F G A++A P ++ N + Y+V
Sbjct: 254 SQTASKFKKKFSYCFPHNEN-TRGSLLF----GEKAISASPSLKFTRLLNPSSGSVYFVE 308
Query: 345 LTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR---LAGN 401
L G SV + + + SLF G I+D GT IT L T AY +LR +F + +
Sbjct: 309 LIGISVAKKRLNVSSSLF-----ASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPS 363
Query: 402 LKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
+ P DTCY+ G R++++P + LHF + L L C AF
Sbjct: 364 VSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAF 423
Query: 460 APTS--SALSIIGNVQQQGTRVSFDLANNRVGF 490
A S S ++IIGN QQ +V +D+ R+GF
Sbjct: 424 ARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGF 456
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 160/388 (41%), Gaps = 62/388 (15%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF-------- 201
+G SG YF++IG+GTP + + + +DTGSDI W+ C CT C ++SD
Sbjct: 65 NGHPSESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPS 124
Query: 202 ------------DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLV 249
D TS+ P+P P+ C Y+VAYGDGS T G V
Sbjct: 125 SSSTSNRVTCNQDFCTSTYDGPIPGCTPEL-----------LCEYRVAYGDGSSTAGYFV 173
Query: 250 TETV-------SFGNSGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKA 298
+ V +F + + I GCG G ++ G+LG G S+ Q+ +
Sbjct: 174 RDHVVLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLAS 233
Query: 299 TS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
+ A+CL + + G+ T PL+ + Y V + V +
Sbjct: 234 SGKVKRVFAHCLDNIN--GGGIFAIGEVVQPKVRTTPLVPQQ---AHYNVFMKAIEVDNE 288
Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT 413
+ +P +F+ D G I+D GT + Y L LK + F T
Sbjct: 289 VLNLPTDVFDTDLR--KGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQF-T 345
Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LS 467
C+++ G PTV+ HF +L + YL +DS +C + + + +
Sbjct: 346 CFEYDGNVDDGFPTVTFHFEDSLSLTVYPHEYLFDIDS-NKWCVGWQNSGAQSRDGKDMI 404
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
++G++ Q V +DL N +G+T C
Sbjct: 405 LLGDLVLQNRLVMYDLENQTIGWTEYNC 432
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 165/379 (43%), Gaps = 43/379 (11%)
Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
+G +G YF+++G+G+PP+ + + +DTGSDI W+ C C+ C ++SD ++DPK
Sbjct: 61 NGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPK 120
Query: 205 TSSSYSPLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSG- 259
S + + C C + + C++ C Y + YGDGS T G V + +++ +
Sbjct: 121 GSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVND 180
Query: 260 ------SVKGIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
I GCG G S+ G++G G S+ Q+ A+ ++
Sbjct: 181 NLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSH 240
Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
CL + G+ T PL+ Y V L V +Q+P +F
Sbjct: 241 CL--DNIRGGGIFAIGEVVEPKVSTTPLVPRM---AHYNVVLKSIEVDTDILQLPSDIF- 294
Query: 364 MDEAGDG-GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
++G+G G I+D GT + L Y+ L + LK F +C+ ++G
Sbjct: 295 --DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQF-SCFQYTGNVD 351
Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQG 476
P V LHF +L + +YL G +C + + + ++++G++
Sbjct: 352 RGFPVVKLHFEDSLSLTVYPHDYLFQFKD-GIWCIGWQKSVAQTKNGKDMTLLGDLVLSN 410
Query: 477 TRVSFDLANNRVGFTPNKC 495
V +DL N +G+T C
Sbjct: 411 KLVIYDLENMAIGWTDYNC 429
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 91/328 (27%), Positives = 149/328 (45%), Gaps = 23/328 (7%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
G Y + +GTPP+ S V+D ++ W QC PC C++Q P+FDP SS++ LPC +
Sbjct: 55 GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 217 PQCKSLDVSA--CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
C+S+ S+ C ++ C+Y+ G T G T+T + G + G D
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGAAKETLGFGCVVMTDKRL 173
Query: 275 LFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPL 331
+G +G++GLG SL Q+ T+ +YCL + S A G A G ++ T +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233
Query: 332 IR------NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
I+ + + +Y V L G GG +Q S +++D + + L
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGST-------VLLDTVSRASYLAD 286
Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
AY +L+ + G S +D C F + P + F G AL +P NY
Sbjct: 287 GAYKALKKALTAAVGVQPVASPPKPYDLC--FPKAVAGDAPELVFTFDGGAALTVPPANY 344
Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQ 473
L+ GT C +S++L++ G ++
Sbjct: 345 LL-ASGNGTVCLTIG-SSASLNLTGELE 370
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 97/353 (27%), Positives = 154/353 (43%), Gaps = 38/353 (10%)
Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
+GTPP++F++++DTGS + ++ C C +C DP F P S +Y P+ C P C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKC-NPDC----T 56
Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--IALGCGHDNEG-LFVGSA- 280
++C Y+ Y + S + G L + VSFGN +K GC + G LF A
Sbjct: 57 CDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFSQHAD 116
Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI-------- 332
G++GLG G LS+ Q+ + + D S G +E GG A+ I
Sbjct: 117 GIMGLGRGDLSIVDQLVEKGV---INDSFSLCYGGMEV----GGGAMVLGQISPPSDMVF 169
Query: 333 --RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
+ +Y + L G V G+ + I P +F+ G G I+D GT L A+
Sbjct: 170 SHSDPDRSPYYNIELRGLHVAGKKLDINPQVFD----GKHGTILDSGTTYAYLPEAAFLP 225
Query: 391 LRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKALDLPAKN 444
+ LK G D C+ +G + P+V + F G+ L +N
Sbjct: 226 FIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPEN 285
Query: 445 YLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
YL G +C F +++G + + T V++D +++VGF C
Sbjct: 286 YLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNC 338
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 164/364 (45%), Gaps = 34/364 (9%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y++ IG+G P ++ +++DTGSDI W++C PC C + D I++ SS+ S
Sbjct: 81 GLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSV 140
Query: 212 LPCAAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF---GNSGSVKGIA 265
C+ P C +V R+ C Y +Y D S +VG V + + + G + + I
Sbjct: 141 SSCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIF 200
Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS 320
GC + G + G++G G ++ QI ++CL + G+LEF
Sbjct: 201 FGCATNITGSWP-VDGIMGFGLISKTVPNQIATQRNMSRVFSHCL-GGEKHGGGILEFGE 258
Query: 321 A-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM--DEAGDGGIIVDCG 377
A + V PL+ V T Y V L SV + + I P F + + G+I+D G
Sbjct: 259 APNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSG 315
Query: 378 TAITRLQTQAYNSLRDSFVRL-AGNLKPT-SGVALFDTCYDFSGL-RSVRVPTVSLHFGA 434
T L T+A L L L P G+ F Y SGL P V+L F
Sbjct: 316 TTFVLLTTKANRMLFQEIKSLTTAKLGPKLEGLECF---YLKSGLTMETSFPNVTLTFSG 372
Query: 435 GKALDLPAKNYLIPVD---SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
G + L NYL+ + +C+A++ ++ L+I G + + V +D+ N R+G+
Sbjct: 373 GSTMKLKPDNYLVMAEYKKKRNGYCYAWS-SADGLTIFGEIVLKDKLVFYDVENRRIGWK 431
Query: 492 PNKC 495
C
Sbjct: 432 GQNC 435
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 90/328 (27%), Positives = 142/328 (43%), Gaps = 32/328 (9%)
Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTV 245
C C C++Q P+F P SS++ P PC CKS+ C ++ C Y G G TV
Sbjct: 54 NCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTV 113
Query: 246 GDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
G + T+T + G + + A G + G +G +GLG SL Q+K T +YC
Sbjct: 114 GIVATDTFAIGTAAPARPPASGASWRATSTPWAGPSGFIGLGRTPWSLVAQMKLTRFSYC 173
Query: 305 LVDRDSPASGVLEFNSA---RGGDAVTAPLIR---NKKVDTFYYVGLTGFSVGGQAVQIP 358
L D+ + L ++ GG A T P ++ N + +Y + L G + +P
Sbjct: 174 LAPHDTGKNSRLFLGASAKLAGGGAWT-PFVKTSPNDGMSQYYPIELEEIKAGDATITMP 232
Query: 359 PSLFEMDEAGDGGIIVDCGTAITR---LQTQAYNSLRDSFVRLAGNLKPTSGV-ALFDTC 414
G ++V TA+ R L Y + + + G + V A F+ C
Sbjct: 233 --------RGRNTVLVQ--TAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAPFEVC 282
Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-------SALS 467
+ +G+ P + F AG AL +P NYL V + T C + + L+
Sbjct: 283 FPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DTVCLSVMSIALLNITALDGLN 339
Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
I+G+ QQ+ + FDL + + F P C
Sbjct: 340 ILGSFQQENVHLLFDLDKDMLSFEPADC 367
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 154/357 (43%), Gaps = 26/357 (7%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPI---FDPKTSSSYSPL 212
GEY +G P Q LDT + + W+QC C ++C + + F S +Y
Sbjct: 73 GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEME 132
Query: 213 PCAAPQCKSLD-VSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIA 265
PC + C SL C ++ C Y++ YGD T G L +++ F S V +
Sbjct: 133 PCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFLN 192
Query: 266 LGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS-GVLEFNS 320
GC +E G G +GL LSL Q+ +YCLV ++ S + F S
Sbjct: 193 FGC---SEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSTSKMYFGS 249
Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
PL+ YYV + G S+G +F++ E DG II D G
Sbjct: 250 LPVTSGGQTPLLYPNS--DAYYVKVLGISIGNDEPHFD-GVFDVYEVRDGWII-DTGITY 305
Query: 381 TRLQTQAYNSLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKAL 438
+ L+T A++SL F+ L + F+ C++ + P V++HF G L
Sbjct: 306 SSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHFD-GADL 364
Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
L ++ + ++ G FC A + S +SI+GN Q Q V +DL + F P C
Sbjct: 365 ILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDC 421
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 113/400 (28%), Positives = 158/400 (39%), Gaps = 86/400 (21%)
Query: 174 MVLDTGSDINWLQCRP--CTECYQQSDPIF-----DPKTSSSYSPLPCAAPQC------- 219
+ LDTGSD+ W C+P C C +++ PK S + +P+ C + C
Sbjct: 95 LYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAHSNL 154
Query: 220 -------------KSLDVSACRANRC-LYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
+S++ S C+ + C + AYGDGS + L +++S S V
Sbjct: 155 PSSDLCAISNCPLESIETSDCQKHSCPQFYYAYGDGSL-IARLYRDSISLPLSNPTNLIV 213
Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLV--------- 306
GC H +G AG G G+LSL Q+ S +YCLV
Sbjct: 214 NNFTFGCAHTALAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRL 270
Query: 307 -----------DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
D D V N R V ++ N + FY VGL G S+G + +
Sbjct: 271 RRPSPLILGRYDHDEKERRVNGVNKPR---FVYTSMLDNLEHPYFYCVGLEGISIGRKKI 327
Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT-- 413
P L ++D G GG++VD GT T L Y S+ F G + + V DT
Sbjct: 328 PAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGL 387
Query: 414 --CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV----------DSAGTFCFAFA 460
CY F +V LHF G G ++ LP +NY G
Sbjct: 388 SPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNG 446
Query: 461 PTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
+ LS +GN QQQG V +DL N RVGF +C
Sbjct: 447 GEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 164/375 (43%), Gaps = 48/375 (12%)
Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
G Y+++IG+GTP + + + +DTG+D+ W+ C C EC +S+ +++ K SSS
Sbjct: 71 GLYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKL 130
Query: 212 LPCAAPQCKSLD---VSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK-- 262
+PC CK ++ ++ C + + C Y YGDGS T G V + V F SG +K
Sbjct: 131 VPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTA 190
Query: 263 ----GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDR 308
+ GCG G S G+LG G S+ Q+ ++ A+CL
Sbjct: 191 SANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL--N 248
Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
G+ T PL+ ++ Y V +T VG + + E ++
Sbjct: 249 GVNGGGIFAIGHVVQPTVNTTPLLPDQP---HYSVNMTAIQVGHTFLNLSTDASEQRDS- 304
Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--TCYDFSGLRSVRVP 426
G I+D GT + L Y L + NLK + L D TC+ +SG P
Sbjct: 305 -KGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQT---LHDEYTCFQYSGSVDDGFP 360
Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT------SSALSIIGNVQQQGTRVS 480
V+ +F G +L + +YL S +C + + S ++++G++ V
Sbjct: 361 NVTFYFENGLSLKVYPHDYLFL--SENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVF 418
Query: 481 FDLANNRVGFTPNKC 495
+DL N +G+T C
Sbjct: 419 YDLENQVIGWTEYNC 433
>gi|300078619|gb|ADJ67210.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
Length = 84
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 56/84 (66%), Positives = 64/84 (76%), Gaps = 1/84 (1%)
Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
DTC+D SG V+VPTV+LHF G + LPA NYLIPVDS G+FCFAFA T S LSIIGN
Sbjct: 1 DTCFDLSGKTEVKVPTVALHF-RGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGN 59
Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
+QQQG RV +DLA +RVGF P C
Sbjct: 60 IQQQGFRVVYDLAGSRVGFAPRGC 83
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/400 (24%), Positives = 168/400 (42%), Gaps = 44/400 (11%)
Query: 122 LAIYNVDRHELKPAEAQILP-EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
L ++ +RH + A LP F+ P G+G Y++ IG+GTP ++ + LDTGS
Sbjct: 27 LQTHDENRHRRRNLMAAELPLGGFNIPY------GTGLYYTDIGIGTPAVKYYVQLDTGS 80
Query: 181 DINWLQCRPCTECYQQSDPI-----FDPKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLY 234
W+ C +C +SD + +DP++S S + C C S C RC Y
Sbjct: 81 KAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR--PPCNMTLRCPY 138
Query: 235 QVAYGDGSFTVGDLVTETVS----FGNSG---SVKGIALGCGHDNEGLFVGSA----GLL 283
Y DG T+G L T+ + +GN + + GCG G SA G++
Sbjct: 139 ITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGII 198
Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
G G + Q+ A ++CL + G+ T P+++N +V
Sbjct: 199 GFGNSNQTALSQLAAAGKTKKIFSHCL--DSTNGGGIFAIGEVVEPKVKTTPIVKNNEV- 255
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
++ V L +V G +Q+P ++F + G +D G+ + L Y+ L
Sbjct: 256 -YHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGSTLVYLPEIIYSEL--ILAVF 310
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
A + T G C+ F G + P ++ HF LD+ +YL+ + +CF
Sbjct: 311 AKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYE-GNQYCFG 369
Query: 459 FAPTS----SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
F + I+G++ V +D+ +G+T +
Sbjct: 370 FQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHN 409
>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
Length = 340
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 123/265 (46%), Gaps = 26/265 (9%)
Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
D FDP SSS++ +PC +P+C C C + + +G+ + G LV +T++
Sbjct: 30 DVAFDPSRSSSFAAIPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLSP 85
Query: 258 SGSVKGIALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQI--------KATSLAYCLV 306
S + G GC G D + F G+ GL+ L SL ++ + +YCL
Sbjct: 86 SATFAGFTFGCIEVGADAD-TFDGAVGLIDLSRSSHSLASRVISNGATTTTTAAFSYCLP 144
Query: 307 DRDSPAS-GVLEFNSAR----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
S S G L ++R GGD AP+ N Y+V L G SVGG+ + +PP++
Sbjct: 145 SLSSTRSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAV 204
Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
G +++ T T L AY +LRD+F + DTCY+ +GL
Sbjct: 205 LAAH-----GTLLEAATEFTFLAPAAYAALRDAFRNDMAQYPAAPPFRVLDTCYNLTGLA 259
Query: 422 SVRVPTVSLHFGAGKALDLPAKNYL 446
S+ VP V+L F G L+L + +
Sbjct: 260 SLAVPAVALRFAGGTELELDVRQTM 284
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/400 (24%), Positives = 168/400 (42%), Gaps = 44/400 (11%)
Query: 122 LAIYNVDRHELKPAEAQILP-EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
L ++ +RH + A LP F+ P G+G Y++ IG+GTP ++ + LDTGS
Sbjct: 51 LQTHDENRHRRRNLMAAELPLGGFNIPY------GTGLYYTDIGIGTPAVKYYVQLDTGS 104
Query: 181 DINWLQCRPCTECYQQSDPI-----FDPKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLY 234
W+ C +C +SD + +DP++S S + C C S C RC Y
Sbjct: 105 KAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR--PPCNMTLRCPY 162
Query: 235 QVAYGDGSFTVGDLVTETVS----FGNSG---SVKGIALGCGHDNEGLFVGSA----GLL 283
Y DG T+G L T+ + +GN + + GCG G SA G++
Sbjct: 163 ITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGII 222
Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
G G + Q+ A ++CL + G+ T P+++N +V
Sbjct: 223 GFGNSNQTALSQLAAAGKTKKIFSHCL--DSTNGGGIFAIGEVVEPKVKTTPIVKNNEV- 279
Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
++ V L +V G +Q+P ++F + G +D G+ + L Y+ L
Sbjct: 280 -YHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGSTLVYLPEIIYSEL--ILAVF 334
Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
A + T G C+ F G + P ++ HF LD+ +YL+ + +CF
Sbjct: 335 AKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYE-GNQYCFG 393
Query: 459 FAPTS----SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
F + I+G++ V +D+ +G+T +
Sbjct: 394 FQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHN 433
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,721,407,498
Number of Sequences: 23463169
Number of extensions: 337237611
Number of successful extensions: 836930
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2548
Number of HSP's successfully gapped in prelim test: 1930
Number of HSP's that attempted gapping in prelim test: 825596
Number of HSP's gapped (non-prelim): 5502
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)