BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 037264
(249 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/249 (80%), Positives = 231/249 (92%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDFVTET+TLGSA VDN+AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA++FSYCLV
Sbjct: 236 GDFVTETITLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLV 295
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
DRDS+S STLEF+S+LPPNAV+APLLRNH LDTFYY+GLTG+SVGG+L+ I E+AF+IDE
Sbjct: 296 DRDSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDE 355
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
SGNGG+IVDSGTA+TRLQT+ YN+LRDAFV+ TR L T+G+ALFDTCYD SS+ +VEVP
Sbjct: 356 SGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVP 415
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TVSFHFP+GK LPLPAKNYL+P+DS GTFCFAFAPT+SSLSIIGNVQQQGTRV ++L N
Sbjct: 416 TVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNH 475
Query: 241 LIGFTPNKC 249
L+GF PNKC
Sbjct: 476 LVGFVPNKC 484
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/249 (79%), Positives = 225/249 (90%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDFVTET+TLGSASVDN+AIGCGHNNEGLF+GAAGLLGLGGG LSFPSQINAS+FSYCLV
Sbjct: 231 GDFVTETITLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLV 290
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
DRDSDS STLEF+S+L P+A+TAPLLRN ELDTFYY+G+TG+SVGG+LL I E+ F++DE
Sbjct: 291 DRDSDSASTLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDE 350
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
SGNGGII+DSGTAVTRLQT YNALRDAFV+GT+ L T VALFDTCYD S ++SVEVP
Sbjct: 351 SGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVP 410
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TV+FH GKVLPLPA NYLIPVDS+GTFCFAFAPTSS+LSIIGNVQQQGTRV F+L NS
Sbjct: 411 TVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANS 470
Query: 241 LIGFTPNKC 249
L+GF P +C
Sbjct: 471 LVGFEPRQC 479
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/250 (78%), Positives = 229/250 (91%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TL GSAS++N+AIGCGH+NEGLFVGAAGLLGLGGGSLSFPSQINAS+FSYCL
Sbjct: 242 GDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCL 301
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
V+RD+DS STLEF+S +P ++VTAPLLRN++LDTFYYLG+TGI VGG +L I ++F++D
Sbjct: 302 VNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVD 361
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
ESGNGGIIVDSGTAVTRLQ++ YN+LRD+FVRGT+ L T GVALFDTCYD SSRSSVEV
Sbjct: 362 ESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEV 421
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PTVSFHFP+GK L LPAKNYLIPVDS GTFCFAFAPT+S+LSIIGNVQQQGTRVS++L N
Sbjct: 422 PTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSN 481
Query: 240 SLIGFTPNKC 249
SL+GF+PN C
Sbjct: 482 SLVGFSPNGC 491
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 196/249 (78%), Positives = 217/249 (87%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDFVTETVTLGS S+ NIAIGCGHNNEGLF+GAAGLLGLGGGSLSFPSQ+NAS+FSYCLV
Sbjct: 238 GDFVTETVTLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLV 297
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
DRDSDSTSTL+F+S + P+AVTAPL RN LDTF+YLGLTG+SVGG +LPI ET+F++ E
Sbjct: 298 DRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSE 357
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
GNGGIIVDSGTAVTRLQT YN LRDAFV+ T L GVALFDTCYD SS+S VEVP
Sbjct: 358 DGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVP 417
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TVSFHF G LPLPAKNYLIPVDS GTFCFAFAPT S+LSI+GN QQQGTRV F+L NS
Sbjct: 418 TVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANS 477
Query: 241 LIGFTPNKC 249
L+GF+PNKC
Sbjct: 478 LVGFSPNKC 486
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 196/249 (78%), Positives = 217/249 (87%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDFVTETVTLGS S+ NIAIGCGHNNEGLF+GAAGLLGLGGGSLSFPSQ+NAS+FSYCLV
Sbjct: 238 GDFVTETVTLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLV 297
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
DRDSDSTSTL+F+S + P+AVTAPL RN LDTF+YLGLTG+SVGG +LPI ET+F++ E
Sbjct: 298 DRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSE 357
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
GNGGIIVDSGTAVTRLQT YN LRDAFV+ T L GVALFDTCYD SS+S VEVP
Sbjct: 358 DGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVP 417
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TVSFHF G LPLPAKNYLIPVDS GTFCFAFAPT S+LSI+GN QQQGTRV F+L NS
Sbjct: 418 TVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANS 477
Query: 241 LIGFTPNKC 249
L+GF+PNKC
Sbjct: 478 LVGFSPNKC 486
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 363 bits (932), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 183/249 (73%), Positives = 222/249 (89%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDF TET+T+GS V N+A+GCGH+NEGLFVGAAGLLGLGGG L+ PSQ+N ++FSYCLV
Sbjct: 235 GDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLV 294
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
DRDSDS ST++F +SL P+AV APLLRNH+LDTFYYLGLTGISVGG+LL I +++F++DE
Sbjct: 295 DRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDE 354
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
SG+GGII+DSGTAVTRLQTE YN+LRD+FV+GT L GVA+FDTCY+ S++++VEVP
Sbjct: 355 SGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVP 414
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TV+FHFP GK+L LPAKNY+IPVDS GTFC AFAPT+SSL+IIGNVQQQGTRV+F+L NS
Sbjct: 415 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 474
Query: 241 LIGFTPNKC 249
LIGF+ NKC
Sbjct: 475 LIGFSSNKC 483
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 362 bits (929), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 182/249 (73%), Positives = 222/249 (89%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDF TET+T+GS V N+A+GCGH+NEGLFVGAAGLLGLGGG L+ PSQ+N ++FSYCLV
Sbjct: 238 GDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLV 297
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
DRDSDS ST+EF +SLPP+AV APLLRNH+LDTFYYLGLTGISVGG+LL I +++F++DE
Sbjct: 298 DRDSDSASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDE 357
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
SG+GGII+DSGTAVTRLQT YN+LRD+F++GT L GVA+FDTCY+ S+++++EVP
Sbjct: 358 SGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVP 417
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TV+FHFP GK+L LPAKNY+IPVDS GTFC AFAPT+SSL+IIGNVQQQGTRV+F+L NS
Sbjct: 418 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 477
Query: 241 LIGFTPNKC 249
LIGF+ NKC
Sbjct: 478 LIGFSSNKC 486
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 358 bits (918), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 186/249 (74%), Positives = 221/249 (88%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G+F TETVTLGSA+V+N+AIGCGHNNEGLFVGAAGLLGLGGG LSFP+Q+NA++FSYCLV
Sbjct: 236 GEFATETVTLGSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLV 295
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
+RDSD+ STLEF+S LP NA TAPL+RN ELDTFYYLGL GISVGG+ LPI E++F++D
Sbjct: 296 NRDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDA 355
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
G GGII+DSGTAVTRL++E Y+ALRDAFV+G + + +GV+LFDTCYD SSR SVE+P
Sbjct: 356 IGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIP 415
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TVSF FPEG+ LPLPA+NYLIPVDS GTFCFAFAPT+SSLSIIGNVQQQGTRV F++ NS
Sbjct: 416 TVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANS 475
Query: 241 LIGFTPNKC 249
L+GF+ + C
Sbjct: 476 LVGFSVDSC 484
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 357 bits (915), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 185/249 (74%), Positives = 220/249 (88%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G+F TETVTLG+A+V+N+AIGCGHNNEGLFVGAAGLLGLGGG LSFP+Q+NA++FSYCLV
Sbjct: 236 GEFATETVTLGTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLV 295
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
+RDSD+ STLEF+S LP N VTAPL RN ELDTFYYLGL GISVGG+ LPI E+ F++D
Sbjct: 296 NRDSDAVSTLEFNSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDA 355
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
G GGII+DSGTAVTRL++E Y+ALRDAFV+G + + +GV+LFDTCYD SSR SV+VP
Sbjct: 356 IGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVP 415
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TVSFHFPEG+ LPLPA+NYLIPVDS GTFCFAFAPT+SSLSI+GNVQQQGTRV F++ NS
Sbjct: 416 TVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANS 475
Query: 241 LIGFTPNKC 249
L+GF+ + C
Sbjct: 476 LVGFSADSC 484
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 328 bits (840), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 171/251 (68%), Positives = 212/251 (84%), Gaps = 2/251 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TE+V+ G S SV N+A+GCGH+NEGLFVGAAGLLGLGGG LS +Q+ A++FSYCL
Sbjct: 248 GDFATESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 307
Query: 60 VDRDSDSTSTLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
V+RDS +STL+F+S+ L ++VTAPL++N ++DTFYY+GL+G+SVGG ++ I E+ F++
Sbjct: 308 VNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRL 367
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
DESGNGGIIVD GTA+TRLQT+ YN LRDAFVR T+ L T VALFDTCYD S ++SV
Sbjct: 368 DESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVR 427
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
VPTVSFHF +GK LPA NYLIPVDS GT+CFAFAPT+SSLSIIGNVQQQGTRV+F+L
Sbjct: 428 VPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLA 487
Query: 239 NSLIGFTPNKC 249
N+ +GF+PNKC
Sbjct: 488 NNRMGFSPNKC 498
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 327 bits (838), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 171/251 (68%), Positives = 212/251 (84%), Gaps = 2/251 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TE+V+ G S SV N+A+GCGH+NEGLFVGAAGLLGLGGG LS +Q+ A++FSYCL
Sbjct: 107 GDFATESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 166
Query: 60 VDRDSDSTSTLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
V+RDS +STL+F+S+ L ++VTAPL++N ++DTFYY+GL+G+SVGG ++ I E+ F++
Sbjct: 167 VNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRL 226
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
DESGNGGIIVD GTA+TRLQT+ YN LRDAFVR T+ L T VALFDTCYD S ++SV
Sbjct: 227 DESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVR 286
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
VPTVSFHF +GK LPA NYLIPVDS GT+CFAFAPT+SSLSIIGNVQQQGTRV+F+L
Sbjct: 287 VPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLA 346
Query: 239 NSLIGFTPNKC 249
N+ +GF+PNKC
Sbjct: 347 NNRMGFSPNKC 357
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 327 bits (837), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 167/249 (67%), Positives = 207/249 (83%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G++VTETV+ G+ SV+ +AIGCGH+NEGLFVG+AGLLGLGGG LS SQI A++FSYCLV
Sbjct: 244 GEYVTETVSFGAGSVNRVAIGCGHDNEGLFVGSAGLLGLGGGPLSLTSQIKATSFSYCLV 303
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
DRDS +STLEF+S P ++V APLL+N +++TFYY+ LTG+SVGG+++ + F +D+
Sbjct: 304 DRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQ 363
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
SG GG+IVDSGTA+TRL+T+ YN++RDAF R T L P +GVALFDTCYD SS SV VP
Sbjct: 364 SGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVP 423
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
TVSFHF + LPAKNYLIPVD GT+CFAFAPT+SS+SIIGNVQQQGTRVSF+L NS
Sbjct: 424 TVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANS 483
Query: 241 LIGFTPNKC 249
L+GF+PNKC
Sbjct: 484 LVGFSPNKC 492
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 325 bits (834), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 176/253 (69%), Positives = 207/253 (81%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G+F TET+TLG A + N+AIGCGH+NEGLFVGAAGLLGLGGGSLSFPSQ+ N FSY
Sbjct: 233 GNFATETLTLGGAPLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSY 292
Query: 58 CLVDRDSDSTSTLEFDSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLVDRDS+S+STL+F + PN AV AP+L+N LDTFYY+ L+GISVGG +L IS++ F
Sbjct: 293 CLVDRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVF 352
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
ID SGNGG+IVDSGTAVTRLQT Y++LRDAF GT+ L TDGV+LFDTCYD SS+ S
Sbjct: 353 GIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKES 412
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V+VPTV FHF G + LPAKNYL+PVDS GTFCFAFAPTSSSLSI+GN+QQQG RVSF+
Sbjct: 413 VDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFD 472
Query: 237 LRNSLIGFTPNKC 249
N+ +GF NKC
Sbjct: 473 RANNQVGFAVNKC 485
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 169/250 (67%), Positives = 213/250 (85%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDFVTET++ G S +V++IA+GCGH+NEGLFVGAAGLLGLGGG LS SQ+ A++FSYCL
Sbjct: 246 GDFVTETMSFGGSGTVNSIALGCGHDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCL 305
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
V+RDS ++STL+F+S+ ++V APLL++ ++DTFYY+GL+G+SVGG+LL I + FK+D
Sbjct: 306 VNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLD 365
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
+SG+GG+IVD GTA+TRLQ+E YN+LRD+FV +R L T GVALFDTCYD S +SSV+V
Sbjct: 366 DSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKV 425
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PTVSFHF GK LPA NYLIPVDS GT+CFAFAPT+SSLSIIGNVQQQGTRVSF+L N
Sbjct: 426 PTVSFHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLAN 485
Query: 240 SLIGFTPNKC 249
+ +GF+ NKC
Sbjct: 486 NRVGFSTNKC 495
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 318 bits (815), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 169/250 (67%), Positives = 204/250 (81%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TETV+ G S SVD +AIGCGH+NEGLFVGAAGL+GLGGG LS SQI AS+FSYCL
Sbjct: 247 GDFATETVSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPLSLTSQIKASSFSYCL 306
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
V+RDS +STLEF+S+ P ++VTAP+ +N ++DTFYY+G+TG+SVGG+ L I + F++D
Sbjct: 307 VNRDSVDSSTLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVD 366
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
SG GGIIVD GTAVTRLQT+ YNALRD FV+ T+ L T G ALFDTCY+ SSR+SV V
Sbjct: 367 GSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRV 426
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PTV+F F GK LPLP NYLIPVDS GTFC AFAPT++SLSIIGNVQQQGTRV+++L N
Sbjct: 427 PTVAFLFDGGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLAN 486
Query: 240 SLIGFTPNKC 249
S + F+ KC
Sbjct: 487 SQVSFSSRKC 496
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 313 bits (802), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 170/250 (68%), Positives = 201/250 (80%), Gaps = 2/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TLG SA V ++AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+TFSYCL
Sbjct: 256 GDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCL 315
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VDRDS S+STL+F + VTAPL+R+ TFYY+GL+G+SVGG +L I +AF +D
Sbjct: 316 VDRDSPSSSTLQFGDAADAE-VTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMD 374
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
+G GG+IVDSGTAVTRLQ+ Y ALRDAFVRGT++L T GV+LFDTCYD S R+SVEV
Sbjct: 375 STGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 434
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
P VS F G L LPAKNYLIPVD GT+C AFAPT++++SIIGNVQQQGTRVSF+
Sbjct: 435 PAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 494
Query: 240 SLIGFTPNKC 249
S +GFT NKC
Sbjct: 495 STVGFTTNKC 504
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 313 bits (801), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 172/250 (68%), Positives = 208/250 (83%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+FV ET+T G S ++N+A+GCGH+NEGLFVG+AGLLGLGGGSLS SQ+ AS+FSYCL
Sbjct: 242 GEFVIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCL 301
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VDRDS S+S LEF+S+ P ++V APLL++ ++DTFYY+GLTG+SVGG LL I F++D
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMD 361
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
+SG GGIIVDSGTA+TRLQT+ YN LRDAFV T L T+G ALFDTCYD SS+S V +
Sbjct: 362 DSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTI 421
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PTVSF F GK L LP KNYLIPVDS GTFCFAFAPT+SSLSIIGNVQQQGTRV ++L N
Sbjct: 422 PTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLAN 481
Query: 240 SLIGFTPNKC 249
S++GF+P+KC
Sbjct: 482 SVVGFSPHKC 491
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 312 bits (800), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 171/250 (68%), Positives = 201/250 (80%), Gaps = 2/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TLG SA V ++AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+TFSYCL
Sbjct: 252 GDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCL 311
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VDRDS S+STL+F + VTAPL+R+ TFYY+GL+GISVGG +L I +AF +D
Sbjct: 312 VDRDSPSSSTLQFGDAADAE-VTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMD 370
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
+G GG+IVDSGTAVTRLQ+ Y ALRDAFVRGT++L T GV+LFDTCYD S R+SVEV
Sbjct: 371 GTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 430
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
P VS F G L LPAKNYLIPVD GT+C AFAPT++++SIIGNVQQQGTRVSF+
Sbjct: 431 PAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 490
Query: 240 SLIGFTPNKC 249
S +GFT NKC
Sbjct: 491 STVGFTSNKC 500
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 312 bits (800), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 171/250 (68%), Positives = 208/250 (83%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+FVTET+T G S ++++A+GCGH+NEGLFVG+AGLLGLGGG LS SQ+ AS+FSYCL
Sbjct: 242 GEFVTETLTFGNSGMINDVAVGCGHDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCL 301
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VDRDS S+S LEF+S+ P ++V APLL++ ++DTFYY+GLTG+SVGG LL I F++D
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMD 361
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
+SG GGIIVDSGTA+TRLQT+ YN LRDAFV T L T+G ALFDTCYD SS+S V +
Sbjct: 362 DSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTI 421
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PTVSF F GK L LP KNYLIPVDS GTFCFAFAPT+SSLSIIGNVQQQGTRV ++L N
Sbjct: 422 PTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLAN 481
Query: 240 SLIGFTPNKC 249
S++GF+P+KC
Sbjct: 482 SVVGFSPHKC 491
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 310 bits (794), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 171/252 (67%), Positives = 199/252 (78%), Gaps = 3/252 (1%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TLG S V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+ASTFSYCL
Sbjct: 258 GDFATETLTLGDSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCL 317
Query: 60 VDRDSDSTSTLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
VDRDS + STL+F + + VTAPL+R+ TFYY+ L+GISVGG L I +AF +
Sbjct: 318 VDRDSPAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAM 377
Query: 119 DE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
D SG+GG+IVDSGTAVTRLQ+ Y ALRDAFVRGT +L T GV+LFDTCYD S R+SV
Sbjct: 378 DATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSV 437
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
EVP VS F G L LPAKNYLIPVD GT+C AFAPT++++SIIGNVQQQGTRVSF+
Sbjct: 438 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDT 497
Query: 238 RNSLIGFTPNKC 249
++GFTPNKC
Sbjct: 498 AKGVVGFTPNKC 509
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 306 bits (785), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 170/252 (67%), Positives = 197/252 (78%), Gaps = 3/252 (1%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TLG S V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+ASTFSYCL
Sbjct: 255 GDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCL 314
Query: 60 VDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
VDRDS + STL+F D + VTAPL+R+ TFYY+ L+GISVGG L I +AF +
Sbjct: 315 VDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAM 374
Query: 119 DE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
D SG+GG+IVDSGTAVTRLQ+ Y ALRDAFV+G +L T GV+LFDTCYD S R+SV
Sbjct: 375 DATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSV 434
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
EVP VS F G L LPAKNYLIPVD GT+C AFAPT++++SIIGNVQQQGTRVSF+
Sbjct: 435 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDT 494
Query: 238 RNSLIGFTPNKC 249
+GFTPNKC
Sbjct: 495 ARGAVGFTPNKC 506
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 306 bits (784), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 170/252 (67%), Positives = 197/252 (78%), Gaps = 3/252 (1%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TLG S V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+ASTFSYCL
Sbjct: 75 GDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCL 134
Query: 60 VDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
VDRDS + STL+F D + VTAPL+R+ TFYY+ L+GISVGG L I +AF +
Sbjct: 135 VDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAM 194
Query: 119 DE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
D SG+GG+IVDSGTAVTRLQ+ Y ALRDAFV+G +L T GV+LFDTCYD S R+SV
Sbjct: 195 DATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSV 254
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
EVP VS F G L LPAKNYLIPVD GT+C AFAPT++++SIIGNVQQQGTRVSF+
Sbjct: 255 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDT 314
Query: 238 RNSLIGFTPNKC 249
+GFTPNKC
Sbjct: 315 ARGAVGFTPNKC 326
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 306 bits (783), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 166/250 (66%), Positives = 201/250 (80%), Gaps = 2/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TLG SA V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+TFSYCL
Sbjct: 252 GDFATETLTLGDSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCL 311
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VDRDS S+STL+F S P AVTAPL+R+ +TFYY+ L+GISVGG+ L I +AF +D
Sbjct: 312 VDRDSPSSSTLQFGDSEQP-AVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMD 370
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
++G+GG+IVDSGTAVTRLQ+ Y ALR+AFV+GT++L GV+LFDTCYD + RSSV+V
Sbjct: 371 DAGSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQV 430
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
P V+ F G L LPAKNYLIPVD+ GT+C AFA TS +SIIGNVQQQG RVSF+
Sbjct: 431 PAVALWFEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAK 490
Query: 240 SLIGFTPNKC 249
+ +GFT +KC
Sbjct: 491 NTVGFTADKC 500
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 300 bits (769), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 161/252 (63%), Positives = 203/252 (80%), Gaps = 3/252 (1%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ T+TVT G S ++++A+GCGH+NEGLF GAAGLLGLGGG+LS +Q+ A++FSYCL
Sbjct: 249 GELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCL 308
Query: 60 VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
VDRDS +S+L+F+S L TAPLLRN ++DTFYY+GL+G SVGG + + + F +
Sbjct: 309 VDRDSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDV 368
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
D SG+GG+I+D GTAVTRLQT+ YN+LRDAF++ T L T ++LFDTCYDFSS SSV
Sbjct: 369 DASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSV 428
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
+VPTV+FHF GK L LPAKNYLIPVD NGTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488
Query: 238 RNSLIGFTPNKC 249
N +IG + NKC
Sbjct: 489 ANKIIGLSGNKC 500
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 300 bits (768), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 166/253 (65%), Positives = 198/253 (78%), Gaps = 5/253 (1%)
Query: 1 GDFVTETVTLG---SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
GDF TET+TLG SA+V ++AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+ FSY
Sbjct: 289 GDFATETLTLGGDGSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSY 348
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAF 116
CLVDRDS S STL+F +S + VTAPL+R+ +TFYY+ L GISVGG+ L I AF
Sbjct: 349 CLVDRDSPSASTLQFGAS-DSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAF 407
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+DE G+GG+IVDSGTAVTRLQ+ Y+ALRDAFVRGT+AL GV+LFDTCYD + RSS
Sbjct: 408 AMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSS 467
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V+VP VS F G L LPAKNYLIPVD GT+C AFA T ++SI+GNVQQQG RVSF+
Sbjct: 468 VQVPAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFD 527
Query: 237 LRNSLIGFTPNKC 249
+ +GF+PNKC
Sbjct: 528 TAKNTVGFSPNKC 540
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 297 bits (760), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 161/252 (63%), Positives = 202/252 (80%), Gaps = 3/252 (1%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G++ T+TVT G S V+++A+GCGH+NEGLF GAAGLLGLGGG+LS +QI A +FSYCL
Sbjct: 251 GNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCL 310
Query: 60 VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
VDRDS +S+L+F+S + TAPLLRN ++DTFYY+GL+G SVGG + I + F++
Sbjct: 311 VDRDSAKSSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEV 370
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
D SG GG+I+D GTAVTRLQT+ YN+LRDAFV+ T T ++LFDTCYDFSS S+V
Sbjct: 371 DASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTV 430
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
+VPTV+FHF GK L LPAKNYLIP+D GTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 431 KVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 490
Query: 238 RNSLIGFTPNKC 249
N+LIG + NKC
Sbjct: 491 ANNLIGLSANKC 502
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 295 bits (755), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 158/252 (62%), Positives = 203/252 (80%), Gaps = 3/252 (1%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ T+TVT G S ++N+A+GCGH+NEGLF GAAGLLGLGGG LS +Q+ A++FSYCL
Sbjct: 249 GELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCL 308
Query: 60 VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
VDRDS +S+L+F+S L TAPLLRN ++DTFYY+GL+G SVGG+ + + + F +
Sbjct: 309 VDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDV 368
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
D SG+GG+I+D GTAVTRLQT+ YN+LRDAF++ T L + ++LFDTCYDFSS S+V
Sbjct: 369 DASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTV 428
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
+VPTV+FHF GK L LPAKNYLIPVD +GTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488
Query: 238 RNSLIGFTPNKC 249
++IG + NKC
Sbjct: 489 SKNVIGLSGNKC 500
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 295 bits (755), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 158/252 (62%), Positives = 203/252 (80%), Gaps = 3/252 (1%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ T+TVT G S ++N+A+GCGH+NEGLF GAAGLLGLGGG LS +Q+ A++FSYCL
Sbjct: 249 GELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCL 308
Query: 60 VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
VDRDS +S+L+F+S L TAPLLRN ++DTFYY+GL+G SVGG+ + + + F +
Sbjct: 309 VDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDV 368
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
D SG+GG+I+D GTAVTRLQT+ YN+LRDAF++ T L + ++LFDTCYDFSS S+V
Sbjct: 369 DASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTV 428
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
+VPTV+FHF GK L LPAKNYLIPVD +GTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488
Query: 238 RNSLIGFTPNKC 249
++IG + NKC
Sbjct: 489 SKNVIGLSGNKC 500
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 281 bits (718), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 147/250 (58%), Positives = 199/250 (79%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ TET++ G S S+ N+ IGCGH+NEGLF G AGL+GLGGG++S SQ+ AS+FSYCL
Sbjct: 238 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 297
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
V+ DSDS+STLEF+S++P +++T+PL++N ++ Y+ + GISVGG LPIS T F+ID
Sbjct: 298 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 357
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
ESG GGIIVDSGT ++RL ++ Y +LR+AFV+ T +LSP G+++FDTCY+FS +S+VEV
Sbjct: 358 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 417
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PT++F EG L LPA+NYLI +D+ GT+C AF T SSLSIIG+ QQQG RVS++L N
Sbjct: 418 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 477
Query: 240 SLIGFTPNKC 249
SL+GF+ NKC
Sbjct: 478 SLVGFSTNKC 487
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 280 bits (717), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 155/255 (60%), Positives = 191/255 (74%), Gaps = 6/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G + TET+T G+ S+ N+AIGCGH+N GLFVGAAGLLGLG GSLSFP+Q+ T FSY
Sbjct: 241 GSYATETLTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSY 300
Query: 58 CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
CLVDRDS+S+ TLEF S+P ++ PL+ N L TFYYL + ISVGG +L + A
Sbjct: 301 CLVDRDSESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEA 360
Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F+IDE+ G GGII+DSGTAVTRLQT Y+ALRDAF+ GT+ L DG+++FDTCYD S+
Sbjct: 361 FRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSAL 420
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
SV +P V FHF G LPAKN LIP+DS GTFCFAFAP S+LSI+GN+QQQG RVS
Sbjct: 421 QSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVS 480
Query: 235 FNLRNSLIGFTPNKC 249
F+ NSL+GF ++C
Sbjct: 481 FDSANSLVGFAIDQC 495
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 279 bits (714), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 155/255 (60%), Positives = 191/255 (74%), Gaps = 6/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G + TET+T G+ S+ N+AIGCGH+N GLFVGAAGLLGLG GSLSFP+Q+ T FSY
Sbjct: 95 GSYATETLTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSY 154
Query: 58 CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
CLVDRDS+S+ TLEF S+P ++ PL+ N L TFYYL + ISVGG +L + A
Sbjct: 155 CLVDRDSESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEA 214
Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F+IDE+ G GGII+DSGTAVTRLQT Y+ALRDAF+ GT+ L DG+++FDTCYD S+
Sbjct: 215 FRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSAL 274
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
SV +P V FHF G LPAKN LIP+DS GTFCFAFAP S+LSI+GN+QQQG RVS
Sbjct: 275 QSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVS 334
Query: 235 FNLRNSLIGFTPNKC 249
F+ NSL+GF ++C
Sbjct: 335 FDSANSLVGFAIDQC 349
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 279 bits (714), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 146/250 (58%), Positives = 198/250 (79%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ TET++ G S S+ N+ IGCGH+NEGLF G AGL+GLGGG++S SQ+ AS+FSYCL
Sbjct: 238 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 297
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
V+ DSDS+STLEF+S +P +++T+PL++N ++ Y+ + GISVGG LPIS T F+ID
Sbjct: 298 VNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 357
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
ESG GGIIVDSGT ++RL ++ Y +LR+AFV+ T +LSP G+++FDTCY+FS +S+VEV
Sbjct: 358 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 417
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PT++F EG L LPA+NYLI +D+ GT+C AF T SSLSIIG+ QQQG RVS++L N
Sbjct: 418 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 477
Query: 240 SLIGFTPNKC 249
S++GF+ NKC
Sbjct: 478 SIVGFSTNKC 487
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 275 bits (702), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 159/255 (62%), Positives = 185/255 (72%), Gaps = 6/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G F TE +T G+ SV N+AIGCGH+N GLFVGAAGLLGLG G LSFPSQ+ T FSY
Sbjct: 284 GSFATEMLTFGTTSVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSY 343
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
CLVDR S+S+ TLEF S+P ++ PLL N L TFYY+ L ISVGG LL +
Sbjct: 344 CLVDRFSESSGTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDV 403
Query: 116 FKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F+IDE SG GG IVDSGTAVTRLQT Y+A+RDAFV GTR L +GV++FDTCYD S
Sbjct: 404 FRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGL 463
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
V VPTV FHF G L LPAKNY+IP+D GTFCFAFAP +S LSI+GN+QQQG RVS
Sbjct: 464 PLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVS 523
Query: 235 FNLRNSLIGFTPNKC 249
F+ NSL+GF +C
Sbjct: 524 FDTANSLVGFALRQC 538
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/250 (58%), Positives = 193/250 (77%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ TET + S S+ N+ IGCGH+NEGLFVGA GL+GLGGG++S SQ+ A++FSYCL
Sbjct: 274 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEATSFSYCL 333
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VD DS+S+STL+F++ P +++T+PL++N TF Y+ + G+SVGG LPIS ++F+ID
Sbjct: 334 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 393
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
ESG+GGIIVDSGT +T + ++ Y+ LRDAFV T+ L P GV+ FDTCYD SS+S+VEV
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PT++F P L LPAKN LI VDS GTFC AF P++ LSIIGNVQQQG RVS++L N
Sbjct: 454 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513
Query: 240 SLIGFTPNKC 249
SL+GF+ +KC
Sbjct: 514 SLVGFSTDKC 523
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 269 bits (687), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 146/250 (58%), Positives = 193/250 (77%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ TET + S S+ N+ IGCGH+NEGLFVGAAGL+GLGGG++S SQ+ A++FSYCL
Sbjct: 274 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 333
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VD DS+S+STL+F++ P +++T+PL++N TF Y+ + G+SVGG LPIS ++F+ID
Sbjct: 334 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 393
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
ESG+GGIIVDSGT +T + ++ Y+ LRDAFV T+ L P GV+ FDTCYD SS+S+VEV
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PT++F P L LPAKN L VDS GTFC AF P++ LSIIGNVQQQG RVS++L N
Sbjct: 454 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513
Query: 240 SLIGFTPNKC 249
SL+GF+ +KC
Sbjct: 514 SLVGFSTDKC 523
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 269 bits (687), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 156/255 (61%), Positives = 186/255 (72%), Gaps = 6/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G F TET+T G+ SV N+AIGCGH N GLF+GAAGLLGLG G+LSFP+QI TFSY
Sbjct: 244 GSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSY 303
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
CLVDR+SDS+ L+F S+P ++ PL +N L TFYYL +T ISVGG LL I
Sbjct: 304 CLVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEV 363
Query: 116 FKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F+IDE SG+GG I+DSGT VTRL T Y+A+RDAFV GT L TD V++FDTCYD S
Sbjct: 364 FRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFDTCYDLSGL 423
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
V VPTV FHF G L LPAKNYLIP+D+ GTFCFAFAP +SS+SI+GN QQQ RVS
Sbjct: 424 QFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVS 483
Query: 235 FNLRNSLIGFTPNKC 249
F+ NSL+GF ++C
Sbjct: 484 FDSANSLVGFAFDQC 498
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 261 bits (668), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/165 (74%), Positives = 147/165 (89%)
Query: 85 LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L RN +LDT+YY+GL GISVGG+LL I ET+F++D +GNGGIIVDSGTAVTRLQ++ YN
Sbjct: 1 LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60
Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD 204
+RDAFV+GT+ L T+ V+LFDTCYD SS++SVEVPTV+FHF EGKVL LPAKNYL+PVD
Sbjct: 61 VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120
Query: 205 SNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S GTFCFAFAPT SSLSIIGN+QQQGTRVSF+L NSL+GF+PN+C
Sbjct: 121 SVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 256 bits (653), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 143/250 (57%), Positives = 186/250 (74%), Gaps = 1/250 (0%)
Query: 1 GDFVTETVT-LGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ TET+T + S S+ NI+IGCGH+NEGLFVGA GL+GLGGG++S SQ+ AS+FSYCL
Sbjct: 87 GELATETLTFVHSNSIPNISIGCGHDNEGLFVGADGLIGLGGGAISISSQLKASSFSYCL 146
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
VD DS S STL+F++ P +++ +PL++N +F Y+ + G+SVGG LPIS + F+ID
Sbjct: 147 VDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEID 206
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
ESG GGIIVDSGT +T+L ++ Y LR+AF+ T L P ++ FDTCYD SS+S+VEV
Sbjct: 207 ESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEV 266
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
PT++F P L LPAKN LI VDS GTFC AF + LSIIGN QQQG RVS++L N
Sbjct: 267 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTN 326
Query: 240 SLIGFTPNKC 249
SL+GF+ NKC
Sbjct: 327 SLVGFSTNKC 336
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 254 bits (649), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/253 (53%), Positives = 173/253 (68%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G ET+T G + N+AIGCGH+N+G+FVGAAGLLGLG G +SF Q+ TFSY
Sbjct: 221 GTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSY 280
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R S+ L+F ++P A PL+ N +FYY+GL+G+ VGG +PISE F
Sbjct: 281 CLVSRGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVF 340
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
K+ E G+GG+++D+GTAVTRL T Y A RDAF+ T L GV++FDTCYD S
Sbjct: 341 KLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVS 400
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F G +L LPA+N+LIPVD G+FCFAFAP+SS LSIIGN+QQ+G +S +
Sbjct: 401 VRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVD 460
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 461 GANGFVGFGPNVC 473
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 253 bits (645), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 139/253 (54%), Positives = 177/253 (69%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T G V ++AIGCGH N G+FVGAAGLLGLGGGS+SF Q+ T FSY
Sbjct: 227 GTLALETLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSY 286
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R +DS+ +L F +LP A PL+RN +FYY+GL G+ VGG +PISE F
Sbjct: 287 CLVSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVF 346
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
++ E G+GG+++D+GTAVTRL T Y A RDAF+ T L GVA+FDTCYD S
Sbjct: 347 RLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVS 406
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F G +L LPA+N+LIP+D GTFCFAFAP++S LSI+GN+QQ+G ++SF+
Sbjct: 407 VRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFD 466
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 467 GANGYVGFGPNIC 479
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 251 bits (642), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/255 (55%), Positives = 182/255 (71%), Gaps = 6/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GD +++ ++ + GCGH+NEGLFVGAAGLLGLG G LSFPSQ+++ FSYCLV
Sbjct: 103 GDLASDSFSVSRGRTSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLV 162
Query: 61 DRDS--DSTSTLEF-DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
RD+ ++S L F DS+LP +A A LL+N +LDTFYY GL+GIS+GG LL I TA
Sbjct: 163 SRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTA 222
Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+ S G GG+I+DSGT+VTRL T Y +RDAF T+ L +LFDTCYDFS+
Sbjct: 223 FKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSAL 282
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+SV +PTVSFHF G + LP NYL+PVD++GTFCFAF+ TS LSIIGN+QQQ RV+
Sbjct: 283 TSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVA 342
Query: 235 FNLRNSLIGFTPNKC 249
+L +S +GF P +C
Sbjct: 343 IDLDSSRVGFAPRQC 357
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 251 bits (641), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 126/254 (49%), Positives = 170/254 (66%), Gaps = 6/254 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ ET+TLG +V +AIGCGH N GLFVGAAGLLGLG G++S Q+ + FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSY 280
Query: 58 CLVDRDSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL R + +L ++P AV PL+RN++ +FYY+GLTGI VGG+ LP+ ++
Sbjct: 281 CLASRGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSL 340
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F++ E G GG+++D+GTAVTRL E Y ALR AF AL + V+L DTCYD S +
Sbjct: 341 FQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYA 400
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
SV VPTVSF+F +G VL LPA+N L+ V FC AFAP+SS +SI+GN+QQ+G +++
Sbjct: 401 SVRVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITV 459
Query: 236 NLRNSLIGFTPNKC 249
+ N +GF PN C
Sbjct: 460 DSANGYVGFGPNTC 473
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 250 bits (639), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 142/255 (55%), Positives = 181/255 (70%), Gaps = 6/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GD +++ + + GCGH+NEGLFVGAAGLLGLG G LSFPSQ+++ FSYCLV
Sbjct: 103 GDLASDSFLVSRGRTSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLV 162
Query: 61 DRDS--DSTSTLEF-DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
RD+ ++S L F DS+LP +A A LL+N +LDTFYY GL+GIS+GG LL I TA
Sbjct: 163 SRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTA 222
Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+ S G GG+I+DSGT+VTRL T Y +RDAF T+ L +LFDTCYDFS+
Sbjct: 223 FKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSAL 282
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+SV +PTVSFHF G + LP NYL+PVD++GTFCFAF+ TS LSIIGN+QQQ RV+
Sbjct: 283 TSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVA 342
Query: 235 FNLRNSLIGFTPNKC 249
+L +S +GF P +C
Sbjct: 343 IDLDSSRVGFAPRQC 357
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 250 bits (639), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 149/255 (58%), Positives = 179/255 (70%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF TET+T + +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ FSY
Sbjct: 199 GDFATETLTFRGNKIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSY 258
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG-DLLPISET 114
CLVDR + S S++ F D+++ A PL+RN +LDTFYY+GL GISVGG + +S +
Sbjct: 259 CLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPS 318
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D +GNGG+I+DSGT+VTRL Y ALRDAF G R L +LFDTCYD S +
Sbjct: 319 LFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQ 378
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
SSV+VPTV HF G + LPA NYLIPVD NG+FCFAFA T S LSIIGN+QQQG RV
Sbjct: 379 SSVKVPTVVLHF-RGADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVV 437
Query: 235 FNLRNSLIGFTPNKC 249
++L S IGF P C
Sbjct: 438 YDLAGSRIGFAPRGC 452
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 250 bits (639), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 126/254 (49%), Positives = 169/254 (66%), Gaps = 6/254 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ ET+TLG +V +AIGCGH N GLFVGAAGLLGLG G++S Q+ + FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSY 280
Query: 58 CLVDRDSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL R + +L ++P AV PL+RN++ +FYY+GLTGI VGG+ LP+ +
Sbjct: 281 CLASRGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGL 340
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F++ E G GG+++D+GTAVTRL E Y ALR AF AL + V+L DTCYD S +
Sbjct: 341 FQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYA 400
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
SV VPTVSF+F +G VL LPA+N L+ V FC AFAP+SS +SI+GN+QQ+G +++
Sbjct: 401 SVRVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITV 459
Query: 236 NLRNSLIGFTPNKC 249
+ N +GF PN C
Sbjct: 460 DSANGYVGFGPNTC 473
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 249 bits (636), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 150/259 (57%), Positives = 182/259 (70%), Gaps = 11/259 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF TET+T A VD++A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ FSY
Sbjct: 227 GDFSTETLTFHGARVDHVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSY 286
Query: 58 CLVDRDSDSTS-----TLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
CLVDR S +S T+ F + ++P AV PLL N +LDTFYYL L GISVGG +P
Sbjct: 287 CLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 346
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+SE+ FK+D +GNGG+I+DSGT+VTRL Y ALRDAF G L +LFDTC+D
Sbjct: 347 VSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFDTCFD 406
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S ++V+VPTV FHF G+V LPA NYLIPV++ G FCFAFA T SLSIIGN+QQQG
Sbjct: 407 LSGMTTVKVPTVVFHFTGGEV-SLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQG 465
Query: 231 TRVSFNLRNSLIGFTPNKC 249
RV+++L S +GF C
Sbjct: 466 FRVAYDLVGSRVGFLSRAC 484
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 248 bits (634), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/253 (54%), Positives = 176/253 (69%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T G V N+AIGCGH N G+FVGAAGLLGLGGGS+S Q+ T FSY
Sbjct: 229 GTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 288
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R +DS +LEF ++P A PL+RN +FYY+ L+G+ VGG +PISE F
Sbjct: 289 CLVSRGTDSAGSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVF 348
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+++E GNGG+++D+GTAVTR+ T Y A RDAF+ T L GV++FDTCY+ + S
Sbjct: 349 QLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVS 408
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F G +L LPA+N+LIPVD GTFCFAFA + S LSIIGN+QQ+G ++SF+
Sbjct: 409 VRVPTVSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFD 468
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 469 GANGFVGFGPNVC 481
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 248 bits (632), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 148/259 (57%), Positives = 181/259 (69%), Gaps = 11/259 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF TET+T A VD++ +GCGH+NEGLFVGAAGLLGLG G LSFPSQ + FSY
Sbjct: 229 GDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSY 288
Query: 58 CLVDR-----DSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
CLVDR S ST+ F + ++P +V PLL N +LDTFYYL L GISVGG +P
Sbjct: 289 CLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 348
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+SE+ FK+D +GNGG+I+DSGT+VTRL Y ALRDAF G L +LFDTC+D
Sbjct: 349 VSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSLFDTCFD 408
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S ++V+VPTV FHF G+V LPA NYLIPV++ G FCFAFA T SLSIIGN+QQQG
Sbjct: 409 LSGMTTVKVPTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 467
Query: 231 TRVSFNLRNSLIGFTPNKC 249
RV+++L S +GF C
Sbjct: 468 FRVAYDLVGSRVGFLSRAC 486
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 246 bits (629), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 128/252 (50%), Positives = 165/252 (65%), Gaps = 5/252 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+TLG +V+ +AIGCGH N GLFVGAAGLLGLG G +S Q+ + FSY
Sbjct: 215 GALALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSY 274
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL R + S L ++P AV PL+RN + +FYY+GL+GI VG + LP+ E F+
Sbjct: 275 CLASRGAGSL-VLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQ 333
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ E G GG+++D+GTAVTRL E Y ALRDAFV AL GV+L DTCYD S +SV
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSV 393
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VPTVSF+F L LPA+N L+ VD G +C AFAP+SS SI+GN+QQ+G +++ +
Sbjct: 394 RVPTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGPSILGNIQQEGIQITVDS 452
Query: 238 RNSLIGFTPNKC 249
N IGF P C
Sbjct: 453 ANGYIGFGPTTC 464
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 246 bits (629), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 148/259 (57%), Positives = 182/259 (70%), Gaps = 11/259 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF TET+T A VD++ +GCGH+NEGLFVGAAGLLGLG G LSFPSQ FSY
Sbjct: 226 GDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSY 285
Query: 58 CLVDRDSDSTS-----TLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
CLVDR S +S T+ F ++++P +V PLL N +LDTFYYL L GISVGG +P
Sbjct: 286 CLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 345
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+SE+ FK+D +GNGG+I+DSGT+VTRL Y ALRDAF G L +LFDTC+D
Sbjct: 346 VSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFD 405
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S ++V+VPTV FHF G+V LPA NYLIPV++ G FCFAFA T SLSIIGN+QQQG
Sbjct: 406 LSGMTTVKVPTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 464
Query: 231 TRVSFNLRNSLIGFTPNKC 249
RV+++L S +GF C
Sbjct: 465 FRVAYDLVGSRVGFLSRAC 483
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 127/260 (48%), Positives = 167/260 (64%), Gaps = 12/260 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+TLG +V+ +AIGCGH N GLFVGAAGLLGLG G +S Q+ + FSY
Sbjct: 213 GTLALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSY 272
Query: 58 CLVDR--------DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
CL R D+ + L ++P AV PL+RN + +FYY+G++GI VG + L
Sbjct: 273 CLASRGGSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERL 332
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
P+ + F++ E G GG+++D+GTAVTRL E Y ALRDAFV AL GV+L DTCY
Sbjct: 333 PLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCY 392
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
D S +SV VPTVSF+F L LPA+N L+ VD G +C AFAP+SS LSI+GN+QQ+
Sbjct: 393 DLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGLSILGNIQQE 451
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
G +++ + N IGF P C
Sbjct: 452 GIQITVDSANGYIGFGPATC 471
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 243 bits (619), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 142/255 (55%), Positives = 177/255 (69%), Gaps = 6/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
GDF TET+T + +A+GCGH+NEGLF+GAAGLLGLG GSLSFPSQ A FSY
Sbjct: 241 GDFSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSY 300
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISET 114
CLVDR + T S+L F +++P +A+ PLL N +LDTFYY+ L GISVGG L I +
Sbjct: 301 CLVDRSASGTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPAS 360
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F++D +GNGG+I+DSGT+VTRL Y+ +RDAF GT L G +LFDTCYD S
Sbjct: 361 VFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGL 420
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+V+VPT+ FHF G + LPA NYLIPVDS+ TFCFAFA + LSIIGN+QQQG RV
Sbjct: 421 KTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVV 480
Query: 235 FNLRNSLIGFTPNKC 249
F+ + +GF C
Sbjct: 481 FDSLANRVGFKAGSC 495
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 242 bits (618), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 136/253 (53%), Positives = 176/253 (69%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G ET+TLG V N+AIGCGH N+G+FVGAAGLLGLGGGS+SF Q++ + FSY
Sbjct: 130 GTLALETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSY 189
Query: 58 CLVDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R ++S LEF S ++P A PL+RN ++YY+GL+G+ VG +PISE F
Sbjct: 190 CLVSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIF 249
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
++ E GNGG+++D+GTAVTR T Y A RDAF+ T L GV++FDTCY+ S
Sbjct: 250 ELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLS 309
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F G +L LPA N+LIPVD GTFCFAFAP+ S LSI+GN+QQ+G ++S +
Sbjct: 310 VRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVD 369
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 370 GANEFVGFGPNVC 382
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 242 bits (617), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 142/255 (55%), Positives = 179/255 (70%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+FVTET+T V+ +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ + FSY
Sbjct: 130 GEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSY 189
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S +S + +S++ A PLL N LDTFYY+ L GISVGG + I+ +
Sbjct: 190 CLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITAS 249
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D +GNGG+I+D GT+VTRL Y ALRDAF G +L +LFDTCYD S +
Sbjct: 250 HFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGK 309
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
++V+VPTV HF G + LPA NYLIPVD +G FCFAFA T+S LSIIGN+QQQG RV
Sbjct: 310 TTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVV 368
Query: 235 FNLRNSLIGFTPNKC 249
++L +S +GF+P C
Sbjct: 369 YDLASSRVGFSPRGC 383
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 241 bits (616), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 142/255 (55%), Positives = 179/255 (70%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+FVTET+T V+ +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ + FSY
Sbjct: 217 GEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSY 276
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S +S + +S++ A PLL N LDTFYY+ L GISVGG + I+ +
Sbjct: 277 CLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITAS 336
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D +GNGG+I+D GT+VTRL Y ALRDAF G +L +LFDTCYD S +
Sbjct: 337 HFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGK 396
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
++V+VPTV HF G + LPA NYLIPVD +G FCFAFA T+S LSIIGN+QQQG RV
Sbjct: 397 TTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVV 455
Query: 235 FNLRNSLIGFTPNKC 249
++L +S +GF+P C
Sbjct: 456 YDLASSRVGFSPRGC 470
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 241 bits (614), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 144/255 (56%), Positives = 178/255 (69%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
G+F TET+T V + +GCGH+NEGLFVGAAGLLGLG G LSFPSQI S FSY
Sbjct: 234 GEFSTETLTFRGTRVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSY 293
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CL DR + S S++ F DS++ PLL N +LDTFYY+ L GISVGG + IS +
Sbjct: 294 CLGDRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISAS 353
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D +GNGG+I+DSGT+VTRL Y ALRDAF+ G L +LFDTC+D S +
Sbjct: 354 LFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGK 413
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G +PLPA NYLIPVD++G+FCFAFA T+S LSIIGN+QQQG RV
Sbjct: 414 TEVKVPTVVLHF-RGADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVV 472
Query: 235 FNLRNSLIGFTPNKC 249
++L S +GF P C
Sbjct: 473 YDLATSRVGFAPRGC 487
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 240 bits (612), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 147/266 (55%), Positives = 181/266 (68%), Gaps = 19/266 (7%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GDF TET+T G A V +A+GCGH+NEGLFV AAGLLGLG GSLSFP+QI+ +FS
Sbjct: 229 GDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFS 288
Query: 57 YCLVDRDSDS-----TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDL 108
YCLVDR S + +ST+ F S + V + P+++N ++TFYY+ L GISVGG
Sbjct: 289 YCLVDRTSSANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGAR 348
Query: 109 LP-ISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA---LSPTDGVA 163
+P ++ + ++D SG GG+IVDSGT+VTRL Y+ALRDAF RG A LSP G +
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAF-RGAAAGLRLSP-GGFS 406
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
LFDTCYD S R V+VPTVS HF G LP +NYLIPVDS GTFCFAFA T +SII
Sbjct: 407 LFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSII 466
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+QQQG RV F+ + FTP C
Sbjct: 467 GNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 240 bits (612), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 133/253 (52%), Positives = 171/253 (67%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T+G + ++AIGCGH N+G+F+GAAGLLGLGGGS+SF Q+ T FSY
Sbjct: 230 GTLALETLTVGQVMIRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSY 289
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R + ST LEF +LP A L+RN +FYY+GL GI VGG + + E F
Sbjct: 290 CLVSRGTGSTGALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETF 349
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
++ E G G+++D+GTAVTR T Y A RD+F T L GV++FDTCYD + S
Sbjct: 350 QLTEYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFES 409
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F +G VL LPA+N+LIPVD GTFC AFAP+ S LSIIGN+QQ+G ++SF+
Sbjct: 410 VRVPTVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFD 469
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 470 GANGFVGFGPNIC 482
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 239 bits (610), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 141/255 (55%), Positives = 178/255 (69%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G+F TET+T V +A+GCGH+NEGLF+GAAGLLGLG G LSFPSQI + FSY
Sbjct: 236 GEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSY 295
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S S + F DS++ A PL+ N +LDTFYY+ L G+SVGG +P I+ +
Sbjct: 296 CLVDRSASSKPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITAS 355
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D +GNGG+I+DSGT+VTRL Y ALRDAF G L +LFDTC+D S +
Sbjct: 356 LFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGK 415
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G + LPA NYLIPVD++G+FCFAFA T S LSI+GN+QQQG RV
Sbjct: 416 TEVKVPTVVLHF-RGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVV 474
Query: 235 FNLRNSLIGFTPNKC 249
++L S +GF P C
Sbjct: 475 YDLAASRVGFAPRGC 489
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 238 bits (608), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 175/253 (69%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T G V N+AIGCGH+N G+FVGAAGLLGLGGGS+SF Q++ T FSY
Sbjct: 130 GTLALETLTFGRTVVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSY 189
Query: 58 CLVDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R +++ LEF S ++P A PL+RN +FYY+ L G+ VG +P+SE F
Sbjct: 190 CLVSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVF 249
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+++E G+GG+++D+GTAVTR T Y A R+AF+ T+ L GV++FDTCY+ S
Sbjct: 250 QLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLS 309
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F G +L +PA N+LIPVD GTFCFAFAP+ S LSI+GN+QQ+G ++S +
Sbjct: 310 VRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVD 369
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 370 EANEFVGFGPNIC 382
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 144/262 (54%), Positives = 186/262 (70%), Gaps = 14/262 (5%)
Query: 1 GDFVTETVTLGSAS------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---N 51
G+F T+ V+L S S ++ I +GCGH+NEG FVGAAGLLGLG G LSFP+Q+ N
Sbjct: 145 GEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQVDPQN 204
Query: 52 ASTFSYCLVDRDSDST--STLEF-DSSLPP-NAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCL DR++DST S+L F ++++PP A P N + TFYYL +TGISVGG
Sbjct: 205 GGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGT 264
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+L I +AF++D GNGG+I+DSGT+VTRLQ Y +LRDAF GT L+PT G +LFDT
Sbjct: 265 ILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDT 324
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
CYD S +SV+VPTV+ HF G L LPA NYLIPVD++ TFC AFA T+ SIIGN+Q
Sbjct: 325 CYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGP-SIIGNIQ 383
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQG RV ++ ++ +GF P++C
Sbjct: 384 QQGFRVIYDNLHNQVGFVPSQC 405
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/255 (55%), Positives = 175/255 (68%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF TET+T V +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ FSY
Sbjct: 215 GDFSTETLTFRRTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSY 274
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S S++ F DS++ A PL+ N +LDTFYY+ L GISVGG +P I+ +
Sbjct: 275 CLVDRSASSKPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITAS 334
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D++GNGG+I+DSGT+VTRL Y A RDAF G L +LFDTC+D S +
Sbjct: 335 LFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGK 394
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G + LPA NYLIPVD++G FC AFA T LSIIGN+QQQG RV
Sbjct: 395 TEVKVPTVVLHF-RGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVV 453
Query: 235 FNLRNSLIGFTPNKC 249
++L S +GF P+ C
Sbjct: 454 YDLAGSRVGFAPHGC 468
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 147/266 (55%), Positives = 180/266 (67%), Gaps = 19/266 (7%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GDF TET+T G A V IA+GCGH+NEGLFV AAGLLGLG GSLSFP+QI+ +FS
Sbjct: 231 GDFATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFS 290
Query: 57 YCLVDRDSDS-----TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDL 108
YCLVDR S + +ST+ F S + V A P+++N ++TFYY+ L GISVGG
Sbjct: 291 YCLVDRTSSANPASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGAR 350
Query: 109 LP-ISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVA 163
+ ++++ ++D SG GG+IVDSGT+VTRL Y+ALRDAF G R LSP G +
Sbjct: 351 VSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLR-LSP-GGFS 408
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
LFDTCYD S R V+VPTVS HF G LP +NYLIPVDS GTFCFAFA T +SII
Sbjct: 409 LFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSII 468
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+QQQG RV F+ +GF P C
Sbjct: 469 GNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 121/244 (49%), Positives = 166/244 (68%), Gaps = 2/244 (0%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G + ETV+ S+ VD +++GC + N+G FVG+ G GLG GSLSFPS+INAS+ SYCL
Sbjct: 275 GVLINETVSFESSGWVDRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCL 334
Query: 60 VD-RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
V+ +D S+STLEF+S +V A LL+N + + YY+GL GI VGG+ + + + F I
Sbjct: 335 VESKDGYSSSTLEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTI 394
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
D GNGG+IV S + +T L+ +TYN +RDAFV T+ L FDTCY+ SS ++VE
Sbjct: 395 DPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVE 454
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
+P + F +GK LP ++YL VD NGTFCFAFAP+ S SI+G +QQ GTRV+F+L
Sbjct: 455 LPILEFEVNDGKSWLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLV 514
Query: 239 NSLI 242
NS +
Sbjct: 515 NSFV 518
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 237 bits (605), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 142/255 (55%), Positives = 174/255 (68%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G+F TET+T V +A+GCGH+NEGLFVGAAGLLGLG G LSFP+Q FSY
Sbjct: 235 GEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSY 294
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S +S + S++ AV PL+ N +LDTFYYL LTGISVGG + I+ +
Sbjct: 295 CLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITAS 354
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D +GNGG+I+DSGT+VTRL Y +LRDAF G L +LFDTC+D S +
Sbjct: 355 LFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGK 414
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G + LPA NYLIPVD+NG FCFAFA T S LSIIGN+QQQG RV
Sbjct: 415 TEVKVPTVVMHF-RGADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVV 473
Query: 235 FNLRNSLIGFTPNKC 249
F++ S IGF C
Sbjct: 474 FDVAASRIGFAARGC 488
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 237 bits (605), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 140/255 (54%), Positives = 175/255 (68%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
GDF TET+T V +A+GCGH+NEGLFVGAAGLLGLG G LSFP Q FSY
Sbjct: 231 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 290
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S +S + ++++ A PLL N +LDTFYY+GL GISVGG +P ++ +
Sbjct: 291 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 350
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D+ GNGG+I+DSGT+VTRL Y A+RDAF G + L +LFDTC+D S+
Sbjct: 351 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNM 410
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF V LPA NYLIPVD+NG FCFAFA T LSIIGN+QQQG RV
Sbjct: 411 NEVKVPTVVLHFRRADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVV 469
Query: 235 FNLRNSLIGFTPNKC 249
++L +S +GF P C
Sbjct: 470 YDLASSRVGFAPGGC 484
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 237 bits (605), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 142/255 (55%), Positives = 176/255 (69%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
GDF TET+T V +A+GCGH+NEGLF+GAAGLLGLG G LSFP Q FSY
Sbjct: 218 GDFSTETLTFRRTRVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSY 277
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD-LLPISET 114
CLVDR + + S++ F DS++ A PL++N +LDTFYYL L GISVGG + +S +
Sbjct: 278 CLVDRSASAKPSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSAS 337
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F++D +GNGG+I+DSGT+VTRL Y ALRDAF G L +LFDTC+D S
Sbjct: 338 LFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGL 397
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G + LPA NYLIPVD++G+FCFAFA T S LSIIGN+QQQG RVS
Sbjct: 398 TEVKVPTVVLHF-RGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVS 456
Query: 235 FNLRNSLIGFTPNKC 249
F+L S +GF P C
Sbjct: 457 FDLAGSRVGFAPRGC 471
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 237 bits (604), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 143/255 (56%), Positives = 178/255 (69%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
GDF TET+T A+V +AIGCGH+NEGLFVGAAGLLGLG G LSFP+Q + FSY
Sbjct: 219 GDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSY 278
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG-DLLPISET 114
CL DR + + S++ F DS++ A PL++N +LDTFYY+ L GISVGG + IS +
Sbjct: 279 CLTDRTASAKPSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISAS 338
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F++D +GNGG+I+DSGT+VTRL Y +LRDAF G L +LFDTCYD S
Sbjct: 339 FFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGL 398
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
S V+VPTV HF G + LPA NYL+PVD++G+FCFAFA T S LSIIGN+QQQG RV
Sbjct: 399 SEVKVPTVVLHF-RGADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVV 457
Query: 235 FNLRNSLIGFTPNKC 249
F+L S +GF P C
Sbjct: 458 FDLAGSRVGFAPRGC 472
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 237 bits (604), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 140/255 (54%), Positives = 176/255 (69%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
GDF TET+T V +A+GCGH+NEGLFVGAAGLLGLG G LSFP Q FSY
Sbjct: 231 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 290
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S +S + ++++ A PLL N +LDTFYY+GL GISVGG +P ++ +
Sbjct: 291 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 350
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D+ GNGG+I+DSGT+VTRL Y A+RDAF G + L +LFDTC+D S+
Sbjct: 351 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNM 410
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G + LPA NYLIPVD+NG FCFAFA T LSIIGN+QQQG RV
Sbjct: 411 NEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVV 469
Query: 235 FNLRNSLIGFTPNKC 249
++L +S +GF P C
Sbjct: 470 YDLASSRVGFAPGGC 484
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 236 bits (603), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 149/267 (55%), Positives = 180/267 (67%), Gaps = 20/267 (7%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GDF TET+T G A V +A+GCGH+NEGLFV AAGLLGLG GSLSFP+QI+ +FS
Sbjct: 229 GDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFS 288
Query: 57 YCLVDRDSDSTSTLEFDSSL------PPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
YCLVDR S S+S S PP+A A P++RN ++TFYY+ L GISVGG
Sbjct: 289 YCLVDRTSSSSSGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGA 348
Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV 162
+P ++E+ ++D S G GG+IVDSGT+VTRL +Y+ALRDAF G R LSP G
Sbjct: 349 RVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLR-LSP-GGF 406
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+LFDTCYD R V+VPTVS HF G LP +NYLIPVDS GTFCFAFA T +SI
Sbjct: 407 SLFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSI 466
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGN+QQQG RV F+ +GF P C
Sbjct: 467 IGNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 236 bits (602), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 121/252 (48%), Positives = 161/252 (63%), Gaps = 11/252 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ ET+TLG +V +AIGCGH N GLFVGAAGLLGLG G++S Q+ + FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSY 280
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL R + +L T + R +FYY+GLTGI VGG+ LP+ ++ F+
Sbjct: 281 CLASRGAGGAGSLVLGR-------TEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQ 333
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ E G GG+++D+GTAVTRL E Y ALR AF AL + V+L DTCYD S +SV
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASV 393
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VPTVSF+F +G VL LPA+N L+ V FC AFAP+SS +SI+GN+QQ+G +++ +
Sbjct: 394 RVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITVDS 452
Query: 238 RNSLIGFTPNKC 249
N +GF PN C
Sbjct: 453 ANGYVGFGPNTC 464
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 236 bits (602), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 140/255 (54%), Positives = 176/255 (69%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
GDF TET+T V +A+GCGH+NEGLFVGAAGLLGLG G LSFP Q FSY
Sbjct: 231 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 290
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
CLVDR + S +S + ++++ A PLL N +LDTFYY+ L GISVGG +P ++ +
Sbjct: 291 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAAS 350
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
FK+D+ GNGG+I+DSGT+VTRL Y A+RDAF G +AL +LFDTC+D S+
Sbjct: 351 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNM 410
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G + LPA NYLIPVD+NG FCFAFA T LSIIGN+QQQG RV
Sbjct: 411 NEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVV 469
Query: 235 FNLRNSLIGFTPNKC 249
++L +S +GF P C
Sbjct: 470 YDLASSRVGFAPGGC 484
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 132/252 (52%), Positives = 167/252 (66%), Gaps = 21/252 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T G V ++AIGCGH N G+FVGAAGLLGLGGGS+SF Q+ T FSY
Sbjct: 288 GTLALETLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSY 347
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CLV +A PL+RN +FYY+GL G+ VGG +PISE F+
Sbjct: 348 CLV------------------SAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFR 389
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ E G+GG+++D+GTAVTRL T Y A RDAF+ T L GVA+FDTCYD SV
Sbjct: 390 LTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSV 449
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VPTVSF+F G +L LPA+N+LIP+D GTFCFAFAP++S LSI+GN+QQ+G ++SF+
Sbjct: 450 RVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDG 509
Query: 238 RNSLIGFTPNKC 249
N +GF PN C
Sbjct: 510 ANGYVGFGPNIC 521
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 146/269 (54%), Positives = 178/269 (66%), Gaps = 21/269 (7%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GDFVTET+T G A V +A+GCGH+NEGLFV AAGLLGLG G LSFP+QI+ +FS
Sbjct: 218 GDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFS 277
Query: 57 YCLVDR---------DSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
YCLVDR S +ST+ F S +A P++RN ++TFYY+ L GISVG
Sbjct: 278 YCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVG 337
Query: 106 GDLLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTD 160
G +P ++E+ ++D S G GG+IVDSGT+VTRL +Y+ALRDAF G LSP
Sbjct: 338 GARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSP-G 396
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
G +LFDTCYD R V+VPTVS HF G LP +NYLIPVDS GTFCFAFA T +
Sbjct: 397 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV 456
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN+QQQG RV F+ +GF P C
Sbjct: 457 SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 233 bits (595), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 146/269 (54%), Positives = 178/269 (66%), Gaps = 21/269 (7%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GDFVTET+T G A V +A+GCGH+NEGLFV AAGLLGLG G LSFP+QI+ +FS
Sbjct: 75 GDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFS 134
Query: 57 YCLVDR---------DSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
YCLVDR S +ST+ F S +A P++RN ++TFYY+ L GISVG
Sbjct: 135 YCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVG 194
Query: 106 GDLLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTD 160
G +P ++E+ ++D S G GG+IVDSGT+VTRL +Y+ALRDAF G LSP
Sbjct: 195 GARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSP-G 253
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
G +LFDTCYD R V+VPTVS HF G LP +NYLIPVDS GTFCFAFA T +
Sbjct: 254 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV 313
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN+QQQG RV F+ +GF P C
Sbjct: 314 SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 233 bits (593), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 142/262 (54%), Positives = 183/262 (69%), Gaps = 14/262 (5%)
Query: 1 GDFVTETVTLGSAS------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST 54
G+F T+ V+L S S ++ I +GCGH+NEG FVGAAGLLGLG G LSFP+QIN+
Sbjct: 124 GEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSEN 183
Query: 55 ---FSYCLVDRDSDST--STLEF-DSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGD 107
FSYCL RD+DST S+L F D+++PP V P N + TFYYL +TGISVGG
Sbjct: 184 GGRFSYCLTGRDTDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGS 243
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+L I +AF++D GNGG+I+DSGT+VTRLQ Y +LR+AF GT L T +LFDT
Sbjct: 244 ILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDT 303
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
CY+ S SSV+VPTV+ HF G L LPA NYL+PVD++ TFC AFA T+ SIIGN+Q
Sbjct: 304 CYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQ 362
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQG RV ++ ++ +GF P++C
Sbjct: 363 QQGFRVIYDNLHNQVGFVPSQC 384
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 232 bits (592), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 139/255 (54%), Positives = 174/255 (68%), Gaps = 7/255 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF TET+T V +A+GCGH+NEGLF GAAGLLGLG G LSFP Q FSY
Sbjct: 207 GDFSTETLTFRRNRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSY 266
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG-DLLPISET 114
CLVDR + + +S + DS++ A PL++N +LDTFYYL L GISVGG + +S +
Sbjct: 267 CLVDRSASAKPSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSAS 326
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F++D +GNGG+I+DSGT+VTRL Y ALRDAF G L +LFDTC+D S
Sbjct: 327 LFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGL 386
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+ V+VPTV HF G + LPA NYLIPVD++G+FCFAFA T S LSIIGN+QQQG R+S
Sbjct: 387 TEVKVPTVVLHF-RGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRIS 445
Query: 235 FNLRNSLIGFTPNKC 249
++L S +GF P C
Sbjct: 446 YDLTGSRVGFAPRGC 460
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 231 bits (589), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/253 (53%), Positives = 174/253 (68%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T G + N+AIGCGH+N+G+FVGAAGLLGLGGG +SF Q+ T FSY
Sbjct: 223 GTLALETITFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSY 282
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R +S+ LEF ++P A PL+ N +FYY+GL+G+ VGG + ISE F
Sbjct: 283 CLVSRGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVF 342
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
K+ E G+GG+++D+GTAVTRL T Y A RD F+ T L GV++FDTCYD S
Sbjct: 343 KLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVS 402
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F G +L LPA+N+LIPVD GTFCFAFAP+SS LSIIGN+QQ+G ++S +
Sbjct: 403 VRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVD 462
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 463 GANGFVGFGPNVC 475
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 231 bits (588), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/252 (50%), Positives = 169/252 (67%), Gaps = 15/252 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T+G + + AIGCGH NEG+FVGAAGLLGLGGG +SF Q+ A T F Y
Sbjct: 217 GTLALETITIGRTVIQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGY 276
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CLV R ++P A+ PL+ N +FYY+ L+G++VGG +PISE F+
Sbjct: 277 CLVSR------------AMPVGAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQ 324
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ + G GG+++D+GTA+TRL T YNA RDAF+ T L GV++FDTCYD + +V
Sbjct: 325 LTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTV 384
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VPTVSF+F G++L PA+N+LIP D GTFCFAFAP+ S LSIIGN+QQ+G +VS +
Sbjct: 385 RVPTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDG 444
Query: 238 RNSLIGFTPNKC 249
N +GF PN C
Sbjct: 445 TNGFVGFGPNVC 456
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 229 bits (584), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 119/252 (47%), Positives = 158/252 (62%), Gaps = 24/252 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ ET+TLG +V +AIGCGH N GLFVGAAGLLGLG G++S Q+ + FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSY 280
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL R + +L +FYY+GLTGI VGG+ LP+ ++ F+
Sbjct: 281 CLASRGAGGAGSLA--------------------SSFYYVGLTGIGVGGERLPLQDSLFQ 320
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ E G GG+++D+GTAVTRL E Y ALR AF AL + V+L DTCYD S +SV
Sbjct: 321 LTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASV 380
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VPTVSF+F +G VL LPA+N L+ V FC AFAP+SS +SI+GN+QQ+G +++ +
Sbjct: 381 RVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITVDS 439
Query: 238 RNSLIGFTPNKC 249
N +GF PN C
Sbjct: 440 ANGYVGFGPNTC 451
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 124/225 (55%), Positives = 159/225 (70%), Gaps = 4/225 (1%)
Query: 29 LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTSTLEF-DSSLPPNAVTAP 84
+FVGAAGLLGLG G +SF Q+ TFSYCLV R ++S+ +LEF S+P A
Sbjct: 1 MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESSGSLEFGRESVPVGASWVS 60
Query: 85 LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L+ N +FYY+GL+G+ VGG +PISE F+++E G GG+++D+GTAVTRL YNA
Sbjct: 61 LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNA 120
Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD 204
RDAFV T L T GV++FDTCYD + +V VPT+SF+F G +L LPA+N+LIPVD
Sbjct: 121 FRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIPVD 180
Query: 205 SNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S GTFCFAFAP+SS LSIIGN+QQ+G +S + N IGF PN C
Sbjct: 181 SVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 227 bits (579), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 128/266 (48%), Positives = 167/266 (62%), Gaps = 18/266 (6%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST 54
G+ VT+ V L G + NI +GCGH+NEG F AAG+LGLG G LSFP+ ++AST
Sbjct: 103 GELVTDNVVLDDAFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDAST 162
Query: 55 ---FSYCLVDRDSD--STSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISV 104
FSYCL DR+SD STL F + P+ T P LRN + T+YY+ +TGISV
Sbjct: 163 RNIFSYCLPDRESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISV 222
Query: 105 GGDLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
GG+LL I + F++D GNGG I DSGT +TRL+ Y A+RDAF T L+
Sbjct: 223 GGNLLTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFK 282
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
+FDTCYDF+ +S+ VPTV+FHF + LP NY++PV +N FCFAFA S S+I
Sbjct: 283 IFDTCYDFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFA-ASMGPSVI 341
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQQ RV ++ + IG P++C
Sbjct: 342 GNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 227 bits (578), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 136/259 (52%), Positives = 177/259 (68%), Gaps = 14/259 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G+F TET++ GS +V+++AIGCGHNN+GLF GAAGLLGLG G LSFPSQ+ S FSY
Sbjct: 168 GEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSY 227
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL R+S + L F + ++ NA LL N +LDTFYY+ + GI VGG + I +
Sbjct: 228 CLPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSL 287
Query: 117 KIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALFDTCYD 170
+D S GNGG+I+DSGTAVTRL T YN +RDAF RA P+D G +LFDTCYD
Sbjct: 288 SLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAF----RAGMPSDAKMTSGFSLFDTCYD 343
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S RSS+ +P VSF F G + LPA+N ++PVD++GT+C AFAP S + SIIGN+QQQ
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQS 403
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+SF+ + +G N+C
Sbjct: 404 FRMSFDSTGNRVGIGANQC 422
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 134/262 (51%), Positives = 178/262 (67%), Gaps = 13/262 (4%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI--------N 51
GDF ++ TLG+ S ++A GCG +NEGLF GAAGLLGLG G LSFPSQI
Sbjct: 221 GDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSST 280
Query: 52 ASTFSYCLVDRD---SDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
A++FSYCLVDR + S+S+L F +++P A +PLL+N +LDTFYY + G+SVGG
Sbjct: 281 ANSFSYCLVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGA 340
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
LPIS + ++ +SG+GG+I+DSGT+VTR T Y +RDAF T L +LFDT
Sbjct: 341 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDT 400
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
CY+FS ++SV+VP + HF G L LP NYLIP+++ G+FC AFAPTS L IIGN+Q
Sbjct: 401 CYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQ 460
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ R+ F+L+ S + F P +C
Sbjct: 461 QQSFRIGFDLQKSHLAFAPQQC 482
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 226 bits (576), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 136/259 (52%), Positives = 177/259 (68%), Gaps = 14/259 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G+F TET++ GS +V+++AIGCGHNN+GLF GAAGLLGLG G LSFPSQ+ S FSY
Sbjct: 168 GEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSY 227
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL R+S + L F + ++ NA LL N +LDTFYY+ + GI VGG + I +
Sbjct: 228 CLPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSL 287
Query: 117 KIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALFDTCYD 170
+D S GNGG+I+DSGTAVTRL T YN +RDAF RA P+D G +LFDTCYD
Sbjct: 288 SLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAF----RAGMPSDAKMTSGFSLFDTCYD 343
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S RSS+ +P VSF F G + LPA+N ++PVD++GT+C AFAP S + SIIGN+QQQ
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQS 403
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+SF+ + +G N+C
Sbjct: 404 FRMSFDSTGNRVGIGANQC 422
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/262 (51%), Positives = 178/262 (67%), Gaps = 13/262 (4%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI--------N 51
GDF ++ TLG+ S ++A GCG +NEGLF GAAGLLGLG G LSFPSQI
Sbjct: 146 GDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSST 205
Query: 52 ASTFSYCLVDRD---SDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
A++FSYCLVDR + S+S+L F +++P A +PLL+N +LDTFYY + G+SVGG
Sbjct: 206 ANSFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGA 265
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
LPIS + ++ +SG+GG+I+DSGT+VTR T Y +RDAF T L +LFDT
Sbjct: 266 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSLFDT 325
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
CY+FS ++SV+VP + HF G L LP NYLIP+++ G+FC AFAPTS L IIGN+Q
Sbjct: 326 CYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQ 385
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ R+ F+L+ S + F P +C
Sbjct: 386 QQSFRIGFDLQKSHLAFAPQQC 407
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 225 bits (573), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 147/267 (55%), Positives = 179/267 (67%), Gaps = 20/267 (7%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GDF TET+T S A V +A+GCGH+NEGLFV AAGLLGLG GSLSFPSQI+ +FS
Sbjct: 236 GDFATETLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFS 295
Query: 57 YCLVDRDSDSTST------LEFDS-SLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGD 107
YCLVDR S S S + F S ++ P+A + P+++N ++TFYY+ L GISVGG
Sbjct: 296 YCLVDRTSSSASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGA 355
Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV 162
+P ++ + ++D S G GG+IVDSGT+VTRL Y ALRDAF G R LSP G
Sbjct: 356 RVPGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLR-LSP-GGF 413
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+LFDTCYD S V+VPTVS HF G LP +NYLIPVDS GTFCFAFA T +SI
Sbjct: 414 SLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSI 473
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGN+QQQG RV F+ +GF P C
Sbjct: 474 IGNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 133/253 (52%), Positives = 174/253 (68%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T V N+A+GCGH N G+F+GAAGLLG+GGGS+SF Q++ T F Y
Sbjct: 219 GTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGY 278
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R +DST +L F +LP A PL+RN +FYY+GL G+ VGG +P+ + F
Sbjct: 279 CLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 338
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ E+G+GG+++D+GTAVTRL T Y A RD F T L GV++FDTCYD S S
Sbjct: 339 DLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 398
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F EG VL LPA+N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +VSF+
Sbjct: 399 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 458
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 459 GANGFVGFGPNVC 471
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 133/253 (52%), Positives = 174/253 (68%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T V N+A+GCGH N G+F+GAAGLLG+GGGS+SF Q++ T F Y
Sbjct: 218 GTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGY 277
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R +DST +L F +LP A PL+RN +FYY+GL G+ VGG +P+ + F
Sbjct: 278 CLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 337
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ E+G+GG+++D+GTAVTRL T Y A RD F T L GV++FDTCYD S S
Sbjct: 338 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 397
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F EG VL LPA+N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +VSF+
Sbjct: 398 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 457
Query: 237 LRNSLIGFTPNKC 249
N +GF PN C
Sbjct: 458 GANGFVGFGPNVC 470
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 132/253 (52%), Positives = 174/253 (68%), Gaps = 4/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G ET+T G + NIAIGCGH N G+F+GAAGLLGLGGG++SF Q+ T FSY
Sbjct: 224 GTLALETLTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSY 283
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV R ++ST TLEF ++P A PL+RN +FYY+GL+G+ VGG +PI E F
Sbjct: 284 CLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIF 343
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
++ + G GG+++D+GTAVTRL Y A RD F+ T L +D V++FDTCY+ + S
Sbjct: 344 ELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVS 403
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V VPTVSF+F G +L LPA+N+LIPVD GTFCFAFA ++S LSIIGN+QQ+G ++S +
Sbjct: 404 VRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISID 463
Query: 237 LRNSLIGFTPNKC 249
N +GF P C
Sbjct: 464 GSNGFVGFGPTIC 476
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 216 bits (550), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 121/264 (45%), Positives = 154/264 (58%), Gaps = 16/264 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G ET+TLG +V+ + IGCGH N GLFVGAAGL+GLG G +S Q+ FSY
Sbjct: 261 GALALETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSY 320
Query: 58 CLVDR---------DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
CL R D L ++P AV PL+RN +FYY+GL+GI VG +
Sbjct: 321 CLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDER 380
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGV--ALF 165
LP+ F++ E G G +++D+GT VTRL E Y ALRDAFV P GV ++
Sbjct: 381 LPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVL 440
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
DTCYD S +SV VPTVSF F L L A+N L+ VD G +C AFAP+SS LSI+GN
Sbjct: 441 DTCYDLSGYASVRVPTVSFCFDGDARLILAARNVLLEVD-MGIYCLAFAPSSSGLSIMGN 499
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQ G +++ + N IGF P C
Sbjct: 500 TQQAGIQITVDSANGYIGFGPANC 523
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 214 bits (544), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 132/259 (50%), Positives = 168/259 (64%), Gaps = 10/259 (3%)
Query: 1 GDFVTETVTLG---SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---T 54
GD E+ LG S ++ NIA GCGH+N GLF G AGLLG+GGG+LSF SQI AS
Sbjct: 99 GDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPA 158
Query: 55 FSYCLVDRDSD---STSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
FSYCLVDR S +S L F +++P A PLL+N ++TFYY LTGISVGG LP
Sbjct: 159 FSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLP 218
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I F + +G GG I+DSGT+VTR+ Y LRDA+ +R L P GV L DTC++
Sbjct: 219 IPPAQFALTGNGTGGAILDSGTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFN 278
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
F +V++P++ HF G + LP N LIPVD +GTFC AFAP+S +S+IGNVQQQ
Sbjct: 279 FQGLPTVQIPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQT 338
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+ F+L+ SLI P +C
Sbjct: 339 FRIGFDLQRSLIAIAPREC 357
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 132/259 (50%), Positives = 167/259 (64%), Gaps = 10/259 (3%)
Query: 1 GDFVTETVTLG---SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---T 54
GD E+ LG S ++ NIA GCGH+N GLF G AGLLG+GGG+LSF SQI AS
Sbjct: 132 GDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPA 191
Query: 55 FSYCLVDRDSD---STSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
FSYCLVDR S +S L F +++P A PLL+N +DTFYY LTGISVGG LP
Sbjct: 192 FSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAILTGISVGGTALP 251
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I F + +G GG I+DSGT+VTR+ Y LRDA+ +R L P GV L DTC++
Sbjct: 252 IPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAPGVYLLDTCFN 311
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
F +V++P++ HF + LP N LIPVD +GTFC AFAP+S +S+IGNVQQQ
Sbjct: 312 FQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQT 371
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+ F+L+ SLI P +C
Sbjct: 372 FRIGFDLQRSLIAIAPREC 390
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 159/253 (62%), Gaps = 5/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
GDF TET++ G +V ++A+GCG NN+GLF GAAGLLGLG G LSFPSQ AS FSY
Sbjct: 169 GDFSTETLSFGEHAVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSY 228
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL R+S ++L F S++P A LL N LDT+YY+GL I V G + I AF
Sbjct: 229 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 288
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G GG+IVDSGTA++RL T Y ALRDAF R G++LFDTCYD SS +
Sbjct: 289 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKT 347
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
+P V F G +PLPA L+ VD GT+C AFAP + SIIGNVQQQ R+S +
Sbjct: 348 ATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISID 407
Query: 237 LRNSLIGFTPNKC 249
+ +G P++C
Sbjct: 408 NQKEQMGIAPDQC 420
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 159/253 (62%), Gaps = 5/253 (1%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
GDF TET++ G +V ++A+GCG NN+GLF GAAGLLGLG G LSFPSQ AS FSY
Sbjct: 102 GDFSTETLSFGEHAVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSY 161
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL R+S ++L F S++P A LL N LDT+YY+GL I V G + I AF
Sbjct: 162 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 221
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G GG+IVDSGTA++RL T Y ALRDAF R G++LFDTCYD SS +
Sbjct: 222 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKT 280
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
+P V F G +PLPA L+ VD GT+C AFAP + SIIGNVQQQ R+S +
Sbjct: 281 ATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISID 340
Query: 237 LRNSLIGFTPNKC 249
+ +G P++C
Sbjct: 341 NQKEQMGIAPDQC 353
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/266 (50%), Positives = 170/266 (63%), Gaps = 18/266 (6%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
GDF +ET+T A V +AIGCGH+NEGLF+ A+GLLGLG G LSFPSQI S +FS
Sbjct: 217 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 276
Query: 57 YCLVDRDSD------STSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
YCLVDR S +ST+ F + A A P+ RN + TFYY+ L G SVGG
Sbjct: 277 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 336
Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVA 163
+ +S++ +++ + G GG+I+DSGT+VTRL Y A+RDAF L SP G +
Sbjct: 337 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFS 395
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
LFDTCY+ S R V+VPTVS H G + LP +NYLIPVD++GTFCFA A T +SII
Sbjct: 396 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSII 455
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+QQQG RV F+ +GF P C
Sbjct: 456 GNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/266 (50%), Positives = 170/266 (63%), Gaps = 18/266 (6%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
GDF +ET+T A V +AIGCGH+NEGLF+ A+GLLGLG G LSFPSQI S +FS
Sbjct: 211 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 270
Query: 57 YCLVDRDSD------STSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
YCLVDR S +ST+ F + A A P+ RN + TFYY+ L G SVGG
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330
Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVA 163
+ +S++ +++ + G GG+I+DSGT+VTRL Y A+RDAF L SP G +
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFS 389
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
LFDTCY+ S R V+VPTVS H G + LP +NYLIPVD++GTFCFA A T +SII
Sbjct: 390 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSII 449
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+QQQG RV F+ +GF P C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 210 bits (534), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 125/260 (48%), Positives = 169/260 (65%), Gaps = 14/260 (5%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
G ET+T G S V +AIGCGH N GLFVGAAGLLGLG G +S Q+ + FS
Sbjct: 223 GVLAMETLTFGDSTPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFS 282
Query: 57 YCLVDRDSDS-TSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL R +D+ +L F D ++P AV PLLRN + +FYY+GLTG+ VGG+ LP+ +
Sbjct: 283 YCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQD 342
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYD 170
F + E G GG+++D+GTAVTRL + Y ALRDAF + G +P GV+L DTCYD
Sbjct: 343 GLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAP--GVSLLDTCYD 400
Query: 171 FSSRSSVEVPTVSFHF-PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
S +SV VPTV+ +F +G L LPA+N L+ + G +C AFA ++S LSI+GN+QQQ
Sbjct: 401 LSGYASVRVPTVALYFGRDGAALTLPARNLLVEM-GGGVYCLAFAASASGLSILGNIQQQ 459
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
G +++ + N +GF P+ C
Sbjct: 460 GIQITVDSANGYVGFGPSTC 479
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 210 bits (534), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 133/266 (50%), Positives = 170/266 (63%), Gaps = 18/266 (6%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
GDF +ET+T A V +AIGCGH+NEGLF+ A+GLLGLG G LSFP+QI S +FS
Sbjct: 211 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFS 270
Query: 57 YCLVDRDSD------STSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
YCLVDR S +ST+ F + A A P+ RN + TFYY+ L G SVGG
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330
Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVA 163
+ +S++ +++ + G GG+I+DSGT+VTRL Y A+RDAF L SP G +
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFS 389
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
LFDTCY+ S R V+VPTVS H G + LP +NYLIPVD++GTFCFA A T +SII
Sbjct: 390 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSII 449
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+QQQG RV F+ +GF P C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 133/269 (49%), Positives = 168/269 (62%), Gaps = 26/269 (9%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS-TFSY 57
GDF+ ET+T G + I+IGCGH+N+GLF AAG+LGLG G +SFP+QI+ + TFSY
Sbjct: 228 GDFIEETLTFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSY 287
Query: 58 CLVDRDSDS---TSTLEFDSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CLVD S +STL F + PP + T P + N + TFYY+ LTGISVGG +P
Sbjct: 288 CLVDFLSGPGSLSSTLTFGAGAVDTSPPVSFT-PTVLNLNMPTFYYVRLTGISVGGVRVP 346
Query: 111 -ISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV------ 162
++E ++D +G GG+IVDSGTAVTRL Y A RDAF RA++ G
Sbjct: 347 GVTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAF----RAVAVDLGQVSIGGP 402
Query: 163 -ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSL 220
FDTCY R +VPTVS HF + L KNYLIPVDS GT CFAFA T S+
Sbjct: 403 SGFFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSV 462
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN+QQQG R+ +++ +GF PN C
Sbjct: 463 SIIGNIQQQGFRIVYDI-GGRVGFAPNSC 490
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/258 (47%), Positives = 151/258 (58%), Gaps = 15/258 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF ET+TLGS S + A GCGH N GLF G+AGLLGLG +LSFPSQ + FSY
Sbjct: 226 GDFSQETLTLGSDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSY 285
Query: 58 CLVDRDSDSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
CL D S STST F S+P A PL+ N +FY++GL GISVGG+ L I
Sbjct: 286 CLPDFVS-STSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPA 344
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
G GG IVDSGT +TRL + Y+AL+ +F TR L ++ DTCYD SS
Sbjct: 345 VL-----GRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSY 399
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSLS--IIGNVQQQGT 231
S V +PT++FHF + + A L + S+G+ C AFA S S+S IIGN QQQ
Sbjct: 400 SQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRM 459
Query: 232 RVSFNLRNSLIGFTPNKC 249
RV+F+ IGF P C
Sbjct: 460 RVAFDTGAGRIGFAPGSC 477
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 199 bits (506), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 127/257 (49%), Positives = 160/257 (62%), Gaps = 20/257 (7%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
GDF +ET+T A V +AIGCGH+NEGLF+ A+GLLGLG G LSFPSQI S +FS
Sbjct: 212 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 271
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETA 115
YCLVDR S + P + TFYY+ L G SVGG + +S++
Sbjct: 272 YCLVDRTSSRRARPSRRWGGTP-----------RMATFYYVHLLGFSVGGARVKGVSQSD 320
Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFS 172
+++ + G GG+I+DSGT+VTRL Y A+RDAF L SP G +LFDTCY+ S
Sbjct: 321 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFSLFDTCYNLS 379
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
R V+VPTVS H G + LP +NYLIPVD++GTFCFA A T +SIIGN+QQQG R
Sbjct: 380 GRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFR 439
Query: 233 VSFNLRNSLIGFTPNKC 249
V F+ +GF P C
Sbjct: 440 VVFDGDAQRVGFVPKSC 456
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 194 bits (492), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 116/257 (45%), Positives = 146/257 (56%), Gaps = 13/257 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF ET+TLGS S N A GCGH N GLF G++GLLGLG SLSFPSQ + F+Y
Sbjct: 229 GDFSQETLTLGSDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAY 288
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL D S +++ S+P +AV PL+ N TFY++GL GISVGGD L I
Sbjct: 289 CLPDFGSSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAV 348
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
G G IVDSGT +TRL + YNAL+ +F TR L ++ DTCYD S S
Sbjct: 349 L-----GRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHS 403
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSS--SLSIIGNVQQQGTR 232
V +PT++FHF + + L+PV + G+ C AFA S +IIGN QQQ R
Sbjct: 404 QVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMR 463
Query: 233 VSFNLRNSLIGFTPNKC 249
V+F+ IGF C
Sbjct: 464 VAFDTGAGRIGFASGSC 480
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 116/265 (43%), Positives = 155/265 (58%), Gaps = 16/265 (6%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
G ET+TL G V +A+GCGH N GLF AAGLLGLG G +S Q+ + FS
Sbjct: 215 GVLALETLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFS 274
Query: 57 YCLVDRDSDSTS-----TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
YCL S S L + + P AV PL+RN + +FYY+G+ G+ V G+ L +
Sbjct: 275 YCLAGYYSGEGSGSGSLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQL 334
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYD 170
+ F + + G GG+++D+GTAVTRL E Y ALR AF +P GV+LFDTCYD
Sbjct: 335 QDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYD 394
Query: 171 FSSRSSVEVPTVSFHF------PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
S +SV VPTV+ +F E L LPA+N L+PVD GT+C AFA +S SI+G
Sbjct: 395 LSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILG 454
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N+QQQG ++ + + +GF P C
Sbjct: 455 NIQQQGIEITVDSASGYVGFGPATC 479
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 131/269 (48%), Positives = 164/269 (60%), Gaps = 22/269 (8%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQI-----NAS 53
GD V ET+T G ++IGCGH+N+GLF AAG+LGLG G +S P QI NAS
Sbjct: 228 GDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNAS 287
Query: 54 TFSYCLVDRDS---DSTSTLEFDSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLVD S +STL F + PP + T P + N + TFYY+ L G+SVGG
Sbjct: 288 -FSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFT-PTVLNQNMPTFYYVRLIGVSVGG 345
Query: 107 DLLP-ISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGV 162
+P ++E ++D +G GG+I+DSGT VTRL Y A RDAF +L T G
Sbjct: 346 VRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGP 405
Query: 163 A-LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSL 220
+ LFDTCY R+ V+VP VS HF G + L KNYLIPVDS GT CFAFA T S+
Sbjct: 406 SGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSV 465
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+IGN+ QQG RV ++L +GF PN C
Sbjct: 466 SVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 126/258 (48%), Positives = 156/258 (60%), Gaps = 21/258 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TF 55
GDF ET+T V +AIGCG +N+GLF AAG+LGLG GSLSFPSQI +F
Sbjct: 220 GDFGVETLTFPPGVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSF 279
Query: 56 SYCLVDRDSD-STSTLEFDSSLPPNAVTAP------LLRNHELDTFYYLGLTGISVGG-D 107
SYCL + + +STL F S T +L N + TFYY+GL GISVGG
Sbjct: 280 SYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVR 339
Query: 108 LLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF-VRGTRAL---SPTDGV 162
+ ++E+ ++D S G+GG+IVDSGTAVTRL Y A RDAF V + L SP
Sbjct: 340 VRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPF 399
Query: 163 ALFDTCY-DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN-GTFCFAFAPTS-SS 219
A FDTCY R +VP VS HF G + LP +NYLIPVDSN GT CFAFA +
Sbjct: 400 AFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRG 459
Query: 220 LSIIGNVQQQGTRVSFNL 237
+SIIGN+Q QG RV +++
Sbjct: 460 VSIIGNIQLQGFRVVYDV 477
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/271 (42%), Positives = 152/271 (56%), Gaps = 25/271 (9%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
GD T+T+ L V N+ +GCGH+NEGL AAGLLG G G LSFP+Q+ + FS
Sbjct: 182 GDLATDTLVLPDDTRVHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFS 241
Query: 57 YCLVDRDS---DSTSTLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
YCL DR S +S+S L F + LP A T PL N + YY+ + G SVGG+ +
Sbjct: 242 YCLGDRMSRARNSSSYLVFGRTPELPSTAFT-PLRTNPRRPSLYYVDMVGFSVGGERVAG 300
Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVR-----GTRALSPTDGVAL 164
S + ++ + G GG++VDSGTA++R + Y A+RDAFV G R L + ++
Sbjct: 301 FSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLR--NKFSV 358
Query: 165 FDTCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSS 218
FDTCYD + V VP++ HF + LP NYLIPV D FC
Sbjct: 359 FDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADD 418
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L+++GNVQQQG V F++ IGFTPN C
Sbjct: 419 GLNVLGNVQQQGFGVVFDVERGRIGFTPNGC 449
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 128/273 (46%), Positives = 161/273 (58%), Gaps = 26/273 (9%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQI-----NAS 53
GD V ET+T G ++IGCGH+N+GLF AAG+LGL G +S P QI NAS
Sbjct: 237 GDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNAS 296
Query: 54 TFSYCLVDRDS---DSTSTLEFDSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLVD S +STL F + PP + T P + N + TFYY+ L G+SVGG
Sbjct: 297 -FSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFT-PTVLNQNMPTFYYVRLIGVSVGG 354
Query: 107 DLLP-ISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDG 161
+P ++E ++D +G+GG+I+DSGT VTRL Y A RDAF G +S
Sbjct: 355 VRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGP 414
Query: 162 VALFDTCYDFSSRSS----VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
LFDTCY R+ V+VP VS HF G L L KNYLI VDS GT CFAFA T
Sbjct: 415 SGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTG 474
Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+S+IGN+ QQG RV +++ +GF PN C
Sbjct: 475 DRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 133/273 (48%), Positives = 169/273 (61%), Gaps = 28/273 (10%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQI-----NAS 53
GDF+ ET+T G V +++IGCGH+N+GLF AAG+LGLG G +S PSQI N +
Sbjct: 225 GDFIEETLTFAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVT 284
Query: 54 TFSYCLVD-------RDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISV 104
+FSYCL D R ST T+ ++ PP + T P ++N + TFYY+ L G+SV
Sbjct: 285 SFSYCLADFFLSSPGRSVSSTLTIGDGAAAGSPPPSFT-PTVQNLNMATFYYVRLVGVSV 343
Query: 105 GGDLLPI-SETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVR-----GTRALS 157
GG +P +E K+D +G GG+I+DSGTAVTRL Y A RDAF G ++
Sbjct: 344 GGVRVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIG 403
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
G FDTCY R+ ++VPTVS HF G L LP KNYLIPVDS GT CFAFA T
Sbjct: 404 GPSG--FFDTCYTMGGRA-MKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTG 460
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+SIIGN+QQQG RV +N+ +GF PN C
Sbjct: 461 DRSVSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 108/256 (42%), Positives = 141/256 (55%), Gaps = 9/256 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G +ET+T G ASV N+A GCG +NEG F AGL+GLG G LS SQ+ FSYCL
Sbjct: 183 GILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCL 242
Query: 60 VDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
D TSTL S NA T PL+ + +FYYL L GISVG LPI ++
Sbjct: 243 TTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKS 302
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F + + G+GG+I+DSGT +T L+ +N + F + G D C+ S
Sbjct: 303 TFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSG 362
Query: 175 SS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
S+ +EVP + FHF +G L LPA+NY+I S G C A +SS +SI GNVQQQ V
Sbjct: 363 STNIEVPKLVFHF-DGADLELPAENYMIGDSSMGVACLAMG-SSSGMSIFGNVQQQNMLV 420
Query: 234 SFNLRNSLIGFTPNKC 249
+L + F P +C
Sbjct: 421 LHDLEKETLSFLPTQC 436
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 177 bits (448), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 104/256 (40%), Positives = 137/256 (53%), Gaps = 9/256 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET T G S+ N+ GCG +NEG F +GL+GLG G LS SQ+ + FSYCL
Sbjct: 186 GTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCL 245
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
D TSTL S N +A PL++N +FYYL L GISVGG LPI E+
Sbjct: 246 TSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKES 305
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SS 173
F++ + G GG+I+DSGT +T L+ ++ ++ F G + CY+ S
Sbjct: 306 TFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSD 365
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
S +EVP + HF G L LP +NY+I S G C A +S +SI GNVQQQ V
Sbjct: 366 TSELEVPKLVLHF-TGADLELPGENYMIADSSMGVICLAMG-SSGGMSIFGNVQQQNMFV 423
Query: 234 SFNLRNSLIGFTPNKC 249
S +L + F P C
Sbjct: 424 SHDLEKETLSFLPTNC 439
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 176 bits (446), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 112/260 (43%), Positives = 143/260 (55%), Gaps = 14/260 (5%)
Query: 1 GDFVTETVTLGSA----SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTF 55
G TET T G + SV NI GCG +NEG F A+GL+GLG G LS SQ+ F
Sbjct: 194 GVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQRF 253
Query: 56 SYCLVDRDSDSTSTLEFDS----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
SYCL D S L S VT PLL+N +FYYL L ISVG L I
Sbjct: 254 SYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSI 313
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYD 170
++ F++ + GNGG+I+DSGT +T +Q + Y AL+ F+ T+ AL T L D C+
Sbjct: 314 EKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGL-DLCFS 372
Query: 171 FSSRSS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
S S+ VE+P + FHF +G L LPA+NY+I + G C A SS +SI GNVQQQ
Sbjct: 373 LPSGSTQVEIPKLVFHF-KGGDLELPAENYMIGDSNLGVACLAMG-ASSGMSIFGNVQQQ 430
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V+ +L I F P C
Sbjct: 431 NILVNHDLEKETISFVPTSC 450
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/251 (42%), Positives = 145/251 (57%), Gaps = 17/251 (6%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD--RDSDSTS 68
SV N+ +GCGH+NEGLF AAGLLG+ G+ SF +Q+ S F+YCL D R S+S
Sbjct: 200 SVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSS 259
Query: 69 TLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGD-LLPISETAFKIDES-GNG 124
L F + PP++V PL N + YY+ + G SVGG+ + S + +D + G G
Sbjct: 260 YLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRG 319
Query: 125 GIIVDSGTAVTRLQTETYNALRDAF-----VRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
G++VDSGT++TR + Y ALRDAF G R + G+++FD CYD + +
Sbjct: 320 GVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVG--RGISVFDACYDLRGVAVADA 377
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIGNVQQQGTRVSFNLR 238
P V HF G + LP +NYL+P +S CFA A LS+IGNV QQ RV F++
Sbjct: 378 PGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVE 437
Query: 239 NSLIGFTPNKC 249
N +GF PN C
Sbjct: 438 NERVGFEPNGC 448
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 111/264 (42%), Positives = 149/264 (56%), Gaps = 17/264 (6%)
Query: 1 GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
GDFV +T+T+ V N A GCGH+NEG F GA G+LGLG G LSF SQ+ +
Sbjct: 100 GDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYN 159
Query: 53 STFSYCLVDRDSDSTST---LEFDSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCLVD + T T L D+++P P+ P+L N ++ T+YY+ L GISVG +
Sbjct: 160 GKFSYCLVDWLAPPTQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDN 219
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTDGVALFD 166
LL IS T F ID G G I DSGT VT+L Y + A T A S D ++ D
Sbjct: 220 LLNISSTVFDIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLD 279
Query: 167 TCYD-FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
C F VP ++FHF EG + LP NY I ++S+ ++CFA +S ++IIG+
Sbjct: 280 LCLSGFPKDQLPTVPAMTFHF-EGGDMVLPPSNYFIYLESSQSYCFAMT-SSPDVNIIGS 337
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
VQQQ +V ++ +GF P C
Sbjct: 338 VQQQNFQVYYDTAGRKLGFVPKDC 361
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 105/253 (41%), Positives = 136/253 (53%), Gaps = 6/253 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET G ASV I GCG +N+G F AGL+GLG G LS SQ+ FSYCL
Sbjct: 183 GVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLGEPKFSYCL 242
Query: 60 VDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
D +S L + NA+T PL++N +FYYL L GISVG LLPI ++ F
Sbjct: 243 TSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFS 302
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SSRSS 176
I G+GG+I+DSGT +T L+ + AL+ F+ + G D C+ S+
Sbjct: 303 IQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDAST 362
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V+VP + FHF EG L LPA+NY+I G C +SS +SI GN QQQ V +
Sbjct: 363 VDVPQLVFHF-EGADLKLPAENYIIADSGLGVICLTMG-SSSGMSIFGNFQQQNIVVLHD 420
Query: 237 LRNSLIGFTPNKC 249
L I F P +C
Sbjct: 421 LEKETISFAPAQC 433
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 110/260 (42%), Positives = 143/260 (55%), Gaps = 14/260 (5%)
Query: 1 GDFVTETVTLGSA----SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTF 55
G TET T G + SV NI GCG +NEG F A+GL+GLG G LS SQ+ F
Sbjct: 194 GVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRF 253
Query: 56 SYCLVDRDSDSTSTLEFDS----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
SYCL D S L S VT PLL+N +FYYL L GISVG L I
Sbjct: 254 SYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSI 313
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYD 170
++ F++ + GNGG+I+DSGT +T ++ + + AL+ F+ T+ L T L D C+
Sbjct: 314 EKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGL-DLCFS 372
Query: 171 FSSRSS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
S S+ VE+P + FHF +G L LPA+NY+I + G C A SS +SI GNVQQQ
Sbjct: 373 LPSGSTQVEIPKIVFHF-KGGDLELPAENYMIGDSNLGVACLAMG-ASSGMSIFGNVQQQ 430
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V+ +L I F P C
Sbjct: 431 NILVNHDLEKETISFVPTSC 450
>gi|147866052|emb|CAN80962.1| hypothetical protein VITISV_022007 [Vitis vinifera]
Length = 150
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 83/146 (56%), Positives = 107/146 (73%)
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
VGG +PISE F++ E G+GG+++D+GTAVTRL T Y A RDAF+ T L GVA
Sbjct: 5 VGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA 64
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
+FDTCYD SV VPTVSF+F G +L LPA+N+LIP+D GTFCFAFAP++S LSI+
Sbjct: 65 IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSIL 124
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+QQ+G ++SF+ N +GF PN C
Sbjct: 125 GNIQQEGIQISFDGANGYVGFGPNIC 150
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 108/258 (41%), Positives = 154/258 (59%), Gaps = 14/258 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET+T GS S+ NI GCG NN+G G AGL+G+G G LS PSQ++ + FSYC+
Sbjct: 182 GSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM 241
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
S ++STL S N+VTA L+++ ++ TFYY+ L G+SVG LPI +
Sbjct: 242 TPIGSSNSSTLLLGSL--ANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPS 299
Query: 115 AFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF- 171
FK++ +G GGII+DSGT +T Y A+R AF+ LS +G + FD C+
Sbjct: 300 VFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMN-LSVVNGSSSGFDLCFQMP 358
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
S +S++++PT HF +G L LP++NY I SNG C A +S +SI GN+QQQ
Sbjct: 359 SDQSNLQIPTFVMHF-DGGDLVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNL 416
Query: 232 RVSFNLRNSLIGFTPNKC 249
V ++ NS++ F +C
Sbjct: 417 LVVYDTGNSVVSFLSAQC 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 108/258 (41%), Positives = 153/258 (59%), Gaps = 14/258 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET+T GS S+ NI GCG NN+G G AGL+G+G G LS PSQ++ + FSYC+
Sbjct: 182 GSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM 241
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
S ++STL S N+VTA L+ + ++ TFYY+ L G+SVG LPI +
Sbjct: 242 TPIGSSTSSTLLLGSL--ANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPS 299
Query: 115 AFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF- 171
FK++ +G GGII+DSGT +T Y A+R AF+ LS +G + FD C+
Sbjct: 300 VFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMN-LSVVNGSSSGFDLCFQMP 358
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
S +S++++PT HF +G L LP++NY I SNG C A +S +SI GN+QQQ
Sbjct: 359 SDQSNLQIPTFVMHF-DGGDLVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNL 416
Query: 232 RVSFNLRNSLIGFTPNKC 249
V ++ NS++ F +C
Sbjct: 417 LVVYDTGNSVVSFLFAQC 434
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 172 bits (437), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 111/264 (42%), Positives = 149/264 (56%), Gaps = 17/264 (6%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
GDFV +T+T+ V N A GCGH+NEG F GA G+LGLG G LSFPSQ+
Sbjct: 90 GDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFN 149
Query: 53 STFSYCLVDRDSDSTST---LEFDSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCLVD + T T L D+++P P LL N ++ T+YY+ L GISVGG
Sbjct: 150 GKFSYCLVDWLAPPTQTSPLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGK 209
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-SPTDGVALFD 166
LL IS TAF ID G G I DSGT VT+L E + + A T +D + D
Sbjct: 210 LLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLD 269
Query: 167 TCY-DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
C F+ VP+++FHF EG + LP NY I ++S+ ++CF+ +S ++IIG+
Sbjct: 270 LCLGGFAEGQLPTVPSMTFHF-EGGDMELPPSNYFIFLESSQSYCFSMV-SSPDVTIIGS 327
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ +V ++ IGF P C
Sbjct: 328 IQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/270 (39%), Positives = 147/270 (54%), Gaps = 21/270 (7%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
GD T+ + V N+ +GCGH+N GL AAGLLG+G G LSFP+Q+ + FS
Sbjct: 178 GDLATDRLVFPDDTHVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFS 237
Query: 57 YCLVDRDS---DSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-I 111
YCL DR S + +S L F + PP+ PL N + YY+ + G SVGG+ +
Sbjct: 238 YCLGDRLSRAQNGSSYLVFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGF 297
Query: 112 SETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA----LFD 166
S + ++ + G GGI+VDSGTA++R + Y A+RDAF A +A +FD
Sbjct: 298 SNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFD 357
Query: 167 TCYDF----SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSSS 219
CYD + ++V VP++ HF G + LP NYLIPV D FC
Sbjct: 358 ACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDG 417
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L+++GNVQQQG + F++ IGFTPN C
Sbjct: 418 LNVLGNVQQQGFGLVFDVERGRIGFTPNGC 447
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 169 bits (429), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 102/256 (39%), Positives = 137/256 (53%), Gaps = 9/256 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G +ET+T G SV +A GCG +NEG F +GL+GLG G LS SQ+ FSYCL
Sbjct: 183 GMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCL 242
Query: 60 VDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
D STL S A T PL++N +FYYL L GISVG LPI ++
Sbjct: 243 TSVDDTKASTLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKS 302
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F + E G+GG+I+DSGT +T L+ ++ + F G + C+ S
Sbjct: 303 TFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSG 362
Query: 175 SS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
S+ +EVP + FHF +G L LPA+NY+I S G C A +SS +SI GN+QQQ V
Sbjct: 363 STDIEVPKLVFHF-DGADLELPAENYMIADASMGVACLAMG-SSSGMSIFGNIQQQNMLV 420
Query: 234 SFNLRNSLIGFTPNKC 249
+L + F P +C
Sbjct: 421 LHDLEKETLSFLPTQC 436
>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
Length = 110
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 80/110 (72%), Positives = 92/110 (83%)
Query: 140 ETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNY 199
+ Y ++RDAF R T+ L +GVA+FDTCYD SS SV VPTVSFHF +V LPAKNY
Sbjct: 1 QAYESVRDAFKRLTQNLRSAEGVAIFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNY 60
Query: 200 LIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LIPVDS+GTFCFAFAPTSSSLSIIGNVQQQGTRVSF++ NSL+GF+PNKC
Sbjct: 61 LIPVDSDGTFCFAFAPTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/253 (41%), Positives = 135/253 (53%), Gaps = 6/253 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET T G ASV I GCG +N G + AGL+GLG G LS SQ+ FSYCL
Sbjct: 183 GVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCL 242
Query: 60 VD-RDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
DS STL S +A+ PL++N +FYYL L GISVG LLPI ++ F
Sbjct: 243 TSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFS 302
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS- 176
I + G+GG+I+DSGT +T L+ + AL+ F+ + G + C+ S
Sbjct: 303 IQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSP 362
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
VEVP + FHF EG L LP +NY+I + C +SS +SI GN QQQ V +
Sbjct: 363 VEVPQLVFHF-EGVDLKLPKENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHD 420
Query: 237 LRNSLIGFTPNKC 249
L I F P +C
Sbjct: 421 LEKETISFAPAQC 433
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/260 (40%), Positives = 145/260 (55%), Gaps = 23/260 (8%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
GDF ET+TL S SV N A GCGH N+GLF GAAGL+GLG S+ FP+Q + +
Sbjct: 77 GDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGLGKSSIGFPAQTSVAFG 136
Query: 54 -TFSYCLVDRDSDSTS-TLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
FSYCL S S L F + L + PL+ + + Y++ +TGI+VG +LL
Sbjct: 137 KVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPSQYFVSMTGINVGDELL 196
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
PIS T ++VDSGT ++R + Y LRDAF + L VA FDTC+
Sbjct: 197 PISAT-----------VMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAPFDTCF 245
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
S+ + +P ++ HF + L L + L PVD +G CFAFAP+SS S++GN QQQ
Sbjct: 246 RVSTVDDINIPLITLHFRDDAELRLSPVHILYPVD-DGVMCFAFAPSSSGRSVLGNFQQQ 304
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
R +++ S +G + +C
Sbjct: 305 NLRFVYDIPKSRLGISAFEC 324
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 166 bits (421), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 103/253 (40%), Positives = 135/253 (53%), Gaps = 6/253 (2%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET T G ASV I GCG +N G + AGL+GLG G LS SQ+ FSYCL
Sbjct: 183 GVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCL 242
Query: 60 VD-RDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
DS STL S +A+ PL++N +FYYL L GISVG LLPI ++ F
Sbjct: 243 TSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFS 302
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS- 176
I + G+GG+I+DSGT +T L+ + AL+ F+ + G + C+ S
Sbjct: 303 IQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSP 362
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V+VP + FHF EG L LP +NY+I + C +SS +SI GN QQQ V +
Sbjct: 363 VDVPQLVFHF-EGVDLKLPKENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHD 420
Query: 237 LRNSLIGFTPNKC 249
L I F P +C
Sbjct: 421 LEKETISFAPAQC 433
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 166 bits (421), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 146/255 (57%), Gaps = 8/255 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET T ++SV NIA GCG +N+G G AGL+G+G G LS PSQ+ FSYC+
Sbjct: 183 GYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCM 242
Query: 60 VDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
S S STL S+ +P + + L+ + T+YY+ L GI+VGGD L I + F
Sbjct: 243 TSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTF 302
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR-S 175
++ + G GG+I+DSGT +T L + YNA+ AF + + + TC+ S S
Sbjct: 303 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGS 362
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQGTRVS 234
+V+VP +S F +G VL L +N LI + G C A +S +SI GN+QQQ T+V
Sbjct: 363 TVQVPEISMQF-DGGVLNLGEQNILI-SPAEGVICLAMGSSSQLGISIFGNIQQQETQVL 420
Query: 235 FNLRNSLIGFTPNKC 249
++L+N + F P +C
Sbjct: 421 YDLQNLAVSFVPTQC 435
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 166 bits (421), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 146/257 (56%), Gaps = 12/257 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET+T GS S+ NI GCG NN+G G AGL+G+G G LS PSQ++ + FSYC+
Sbjct: 182 GSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM 241
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
S + S L S N+VTA L+++ ++ TFYY+ L G+SVG LPI +
Sbjct: 242 TPIGSSTPSNLLLGSL--ANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPS 299
Query: 115 AFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-S 172
AF ++ +G GGII+DSGT +T Y ++R F+ + FD C+ S
Sbjct: 300 AFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPS 359
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
S++++PT HF +G L LP++NY I SNG C A +S +SI GN+QQQ
Sbjct: 360 DPSNLQIPTFVMHF-DGGDLELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNML 417
Query: 233 VSFNLRNSLIGFTPNKC 249
V ++ NS++ F +C
Sbjct: 418 VVYDTGNSVVSFASAQC 434
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/256 (41%), Positives = 148/256 (57%), Gaps = 10/256 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET T ++SV NIA GCG +N+G G AGL+G+G G LS PSQ+ FSYC+
Sbjct: 182 GYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCM 241
Query: 60 VDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
S S STL S+ +P + + L+ + T+YY+ L GI+VGGD L I + F
Sbjct: 242 TSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTF 301
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-GVALFDTCYDFSSR- 174
++ + G GG+I+DSGT +T L + YNA+ AF LSP D + TC+ S
Sbjct: 302 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN-LSPVDESSSGLSTCFQLPSDG 360
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSIIGNVQQQGTRV 233
S+V+VP +S F +G VL L +N LI + G C A +S +SI GN+QQQ T+V
Sbjct: 361 STVQVPEISMQF-DGGVLNLGEENVLIS-PAEGVICLAMGSSSQQGISIFGNIQQQETQV 418
Query: 234 SFNLRNSLIGFTPNKC 249
++L+N + F P +C
Sbjct: 419 LYDLQNLAVSFVPTQC 434
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 106/267 (39%), Positives = 145/267 (54%), Gaps = 18/267 (6%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GD T+ + + + V+N+ +GCG +NEGLF AAGLLG+G G +S +Q+ S F
Sbjct: 178 GDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFE 237
Query: 57 YCLVDRDSDST--STLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-IS 112
YCL DR S ST S L F + PP+ LL N + YY+ + G SVGG+ + S
Sbjct: 238 YCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFS 297
Query: 113 ETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---ALFDTC 168
+ +D +G GG++VDSGTA++R + Y ALRDAF RA ++FD C
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDAC 357
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD------SNGTFCFAFAPTSSSLSI 222
YD R + P + HF G + LP +NY +PVD ++ C F LS+
Sbjct: 358 YDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSV 417
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGNVQQQG RV F++ IGF P C
Sbjct: 418 IGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 163 bits (413), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 113/268 (42%), Positives = 148/268 (55%), Gaps = 19/268 (7%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
GD E+ T+ S+ VD + GCGH N GLF GAAGLLGLG G LSF SQ+ A
Sbjct: 242 GDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 301
Query: 53 --STFSYCLVDRDSDSTSTLEF--DSSL-----PPNAVTAPLLRNHELDTFYYLGLTGIS 103
TFSYCLVD SD S + F D +L P TA + DTFYY+ LTG+
Sbjct: 302 GGHTFSYCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVL 361
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGV 162
VGG+LL IS + E G+GG I+DSGT ++ Y +R AF+ R + + P
Sbjct: 362 VGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDF 421
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLS 221
+ CY+ S EVP +S F +G V PA+NY I +D +G C A T + +S
Sbjct: 422 PVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMS 481
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN QQQ V+++L N+ +GF P +C
Sbjct: 482 IIGNFQQQNFHVAYDLHNNRLGFAPRRC 509
>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
Length = 144
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/144 (54%), Positives = 103/144 (71%)
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G +PISE F+++E G GG+++D+GTAVTRL T Y+A RDAF+ T L + V++F
Sbjct: 1 GVRVPISEDVFRLNELGEGGVVMDTGTAVTRLPTAAYDAFRDAFIGQTTNLPRSSDVSIF 60
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
DTCYD SV VPT+SF+F G +L LPA+N+LIPV+ GTFCFAFAP+ S LSIIGN
Sbjct: 61 DTCYDLYGFVSVRVPTISFYFLGGPILTLPARNFLIPVNDVGTFCFAFAPSPSGLSIIGN 120
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQ+G +S + N +GF PN C
Sbjct: 121 IQQEGIEISVDGVNGFVGFGPNIC 144
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 139/256 (54%), Gaps = 34/256 (13%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GD TET+ A V +A+GCGH+NEGLFV AAGLLGLG G LS P+Q FS
Sbjct: 234 GDLATETLWFARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFS 293
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YC D D + + + H + G V G + E +
Sbjct: 294 YCFQGSDLDHRTIIR-------------TVHQH---------VGGARVRG----VGERSL 327
Query: 117 KIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDFSS 173
++D S G GG+I+DSGT+VTRL Y A+R+AF G L+P G +LFDTCYD
Sbjct: 328 RLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAP-GGFSLFDTCYDLRG 386
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
R V+VPTVS H G + LP +NYLIPVD+ GTFC A A T +SI+GN+QQQG RV
Sbjct: 387 RRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRV 446
Query: 234 SFNLRNSLIGFTPNKC 249
F+ + P C
Sbjct: 447 VFDGDRQRVALVPKSC 462
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 104/267 (38%), Positives = 144/267 (53%), Gaps = 18/267 (6%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G+ T+ + + + V+N+ +GCG +NEGLF AAGLLG+ G +S +Q+ S F
Sbjct: 178 GELATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFE 237
Query: 57 YCLVDRDSDST--STLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-IS 112
YCL DR S ST S L F + PP+ LL N + YY+ + G SVGG+ + S
Sbjct: 238 YCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFS 297
Query: 113 ETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---ALFDTC 168
+ +D +G GG++VDSGTA++R + Y ALRDAF RA ++FD C
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDAC 357
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD------SNGTFCFAFAPTSSSLSI 222
YD R + P + HF G + LP +NY +PVD ++ C F LS+
Sbjct: 358 YDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSV 417
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGNVQQQG RV F++ IGF P C
Sbjct: 418 IGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 159 bits (403), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 101/270 (37%), Positives = 144/270 (53%), Gaps = 21/270 (7%)
Query: 1 GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF TET T+ S V+N+ GCGH N GLF GA+GLLGLG G LSF SQ+
Sbjct: 183 GDFATETFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQ 242
Query: 52 A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
+ +FSYCLVDR+SD+ + + + + P L + + +DTFYY+ +
Sbjct: 243 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIK 302
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
I VGG++L I E+ + + G GG IVDSGT ++ Y ++DAFV+ +
Sbjct: 303 SIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQ 362
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
+ D CY+ S +++P F +G V P +NY I +D C A T S+
Sbjct: 363 DFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSA 422
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LSIIGN QQQ V ++ + S +G+ P C
Sbjct: 423 LSIIGNYQQQNFHVLYDTKKSRLGYAPMNC 452
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 109/265 (41%), Positives = 145/265 (54%), Gaps = 16/265 (6%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
GD E+ T+ S VD + GCGH N GLF GAAGLLGLG G LSF SQ+ A
Sbjct: 243 GDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 302
Query: 53 -STFSYCLVDRDSDSTSTLEFDS-----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
TFSYCLV+ SD+ S + F + P TA + DTFYY+ L G+ VGG
Sbjct: 303 GHTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGG 362
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALF 165
DLL IS + + + G+GG I+DSGT ++ Y +R AFV L P +
Sbjct: 363 DLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVL 422
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIG 224
+ CY+ S EVP +S F +G V PA+NY + +D +G C A T + +SIIG
Sbjct: 423 NPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIG 482
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ V ++L+N+ +GF P +C
Sbjct: 483 NFQQQNFHVVYDLQNNRLGFAPRRC 507
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 144/257 (56%), Gaps = 10/257 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF ETVTL +++ I GCGHN EG F GA GL+GLG G LS PSQ+N+S FSY
Sbjct: 96 GDFAFETVTLNGSTLARIGFGCGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSY 155
Query: 58 CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CLVD+ + T S + F +++ A PLL+N + ++YY+G+ ISVG +P +A
Sbjct: 156 CLVDQSTTGTFSPITFGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSA 215
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS--S 173
F+ID +G GG+I+DSGT +T + + + R + CYD S S
Sbjct: 216 FRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVS 275
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNVQQQGTR 232
SS+ +P+++ H +P N + VD+ G T C A + TS SIIGNVQQQ
Sbjct: 276 ASSLTLPSMTVHLTNVD-FEIPVSNLWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNL 333
Query: 233 VSFNLRNSLIGFTPNKC 249
+ ++ NS +GF C
Sbjct: 334 IVTDVANSRVGFLATDC 350
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/280 (43%), Positives = 142/280 (50%), Gaps = 45/280 (16%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
GDF TET+T S A V +A+GCGH+NEGLFV AAGLLGLG GSLSFPSQI+ +FS
Sbjct: 236 GDFATETLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFS 295
Query: 57 YCLVD---------------------RDSDSTSTLEFDSSLPPNAVTAPLLR---NHELD 92
YCLVD R + L D P + LLR H+
Sbjct: 296 YCLVDRTSSSASATSRSSTVTFGSGARGALGRRVLHPDGEEPQDGDV--LLRAAHGHQRR 353
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGT---AVTRLQTETYNALRDAF 149
G + D +G GG+IVDSG A R A R
Sbjct: 354 RRARPGRGRVRPPPD-----------PSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRA 402
Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF 209
LSP G +LFDTCYD S V+VPTVS HF G LP +NYLIPVDS GTF
Sbjct: 403 AAAGLRLSP-GGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTF 461
Query: 210 CFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
CFAFA T +SIIGN+QQQG RV F+ +GF P C
Sbjct: 462 CFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/267 (35%), Positives = 139/267 (52%), Gaps = 19/267 (7%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G +ET T G+A+ V ++A GCG+ N G ++G++GLG G LS SQ+ S F
Sbjct: 180 GVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRF 239
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTA----------PLLRNHELDTFYYLGLTGISVG 105
SYCL S S L F N A PL+ N L + Y++ L GIS+G
Sbjct: 240 SYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLG 299
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL- 164
LPI F I++ G GG+ +DSGT++T LQ + Y+A+R V R L PT+ +
Sbjct: 300 QKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIG 359
Query: 165 FDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+TC+ + SV VP + HF G + +P +NY++ + G C A S +I
Sbjct: 360 LETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATI 418
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGN QQQ + +++ NSL+ F P C
Sbjct: 419 IGNYQQQNMHILYDIANSLLSFVPAPC 445
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/261 (37%), Positives = 140/261 (53%), Gaps = 15/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G ET TL + ++A GCG NEG F AGL+GLG G LS SQ+ + FSYCL
Sbjct: 189 GVLAAETFTLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCL 248
Query: 60 VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
D S S L S + + T PL+RN +FYY+ L G++VG + +
Sbjct: 249 TSLDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITL 308
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
+AF + + G GG+IVDSGT++T L+ + Y AL+ AF + L DG + DTC++
Sbjct: 309 PSSAFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMK-LPAADGSGIGLDTCFE 367
Query: 171 --FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
S VEVP + FH +G L LPA+NY++ +G C S LSIIGN QQ
Sbjct: 368 APASGVDQVEVPKLVFHL-DGADLDLPAENYMVLDSGSGALCLTVM-GSRGLSIIGNFQQ 425
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q + +++ + + F P +C
Sbjct: 426 QNIQFVYDVGENTLSFAPVQC 446
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/267 (35%), Positives = 139/267 (52%), Gaps = 19/267 (7%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G +ET T G+A+ V ++A GCG+ N G ++G++GLG G LS SQ+ S F
Sbjct: 180 GVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRF 239
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTA----------PLLRNHELDTFYYLGLTGISVG 105
SYCL S S L F N A PL+ N L + Y++ L GIS+G
Sbjct: 240 SYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLG 299
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL- 164
LPI F I++ G GG+ +DSGT++T LQ + Y+A+R V R L PT+ +
Sbjct: 300 QKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIG 359
Query: 165 FDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+TC+ + SV VP + HF G + +P +NY++ + G C A S +I
Sbjct: 360 LETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATI 418
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGN QQQ + +++ NSL+ F P C
Sbjct: 419 IGNYQQQNMHILYDIANSLLSFVPAPC 445
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 135/261 (51%), Gaps = 14/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G ET TL + +A GCG NEG F AGL+GLG G LS SQ+ FSYCL
Sbjct: 207 GVLAAETFTLAKTKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCL 266
Query: 60 VDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
D S S L D++ T PL++N +FYY+ L ++VG +P+
Sbjct: 267 TSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPL 326
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
+AF + + G GG+IVDSGT++T L+ + Y L+ AF + L DG A+ D C+
Sbjct: 327 PGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMK-LPVADGSAVGLDLCFK 385
Query: 171 --FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
S VEVP + HF G L LPA+NY++ ++G C S LSIIGN QQ
Sbjct: 386 APASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVM-GSRGLSIIGNFQQ 444
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q + +++ + F P +C
Sbjct: 445 QNIQFVYDVDKDTLSFAPVQC 465
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 102/254 (40%), Positives = 144/254 (56%), Gaps = 16/254 (6%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR 62
ET ++ S S+ NI GCGH+N+G F GL+G G GSLS SQ+ S FSYCLV R
Sbjct: 133 ETFSISSQSLPNITFGCGHDNQG-FDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSR 191
Query: 63 -DSDSTSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
DS TS L ++ A T PL+++ + YYL L GISVGG L I F I
Sbjct: 192 TDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTN-HYYLSLEGISVGGQSLAIPTGTFDI 250
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
G+GG+I+DSGT +T LQ Y+A+++A V L DG D C++ S+
Sbjct: 251 QSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSIN-LPQADG--QLDLCFNQQGSSNPG 307
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL---SIIGNVQQQGTRVSF 235
P+++FHF +G +P +NYL P ++ C A PT+S+L +I GNVQQQ ++ +
Sbjct: 308 FPSMTFHF-KGADYDVPKENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILY 366
Query: 236 NLRNSLIGFTPNKC 249
+ N+++ F P C
Sbjct: 367 DNENNVLSFAPTAC 380
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 102/264 (38%), Positives = 141/264 (53%), Gaps = 17/264 (6%)
Query: 1 GDFVTETVTLGS--ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSY 57
G +ET TLG + +A GCG NEG F AGL+GLG G LS SQ+ FSY
Sbjct: 188 GVLASETFTLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSY 247
Query: 58 CLVD-RDSDSTSTLEFDSSLPPNAV--------TAPLLRNHELDTFYYLGLTGISVGGDL 108
CL D D S L S + T PL++N +FYY+ LTG++VG
Sbjct: 248 CLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTR 307
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDT 167
+ + +AF I + G GG+IVDSGT++T L+ + Y AL+ AFV AL DG + D
Sbjct: 308 ITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFV-AQMALPTVDGSEIGLDL 366
Query: 168 CYDFSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
C+ ++ V+VP + HF G L LPA+NY++ ++G C AP S LSIIGN
Sbjct: 367 CFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAP-SRGLSIIGN 425
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + +++ + F P +C
Sbjct: 426 FQQQNFQFVYDVAGDTLSFAPVQC 449
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 142/263 (53%), Gaps = 16/263 (6%)
Query: 1 GDFVTETVTLGSASVDNIAI-----GCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAST 54
G ET T G ++ D I+I GCG++N G F AGL+GLG G LS SQ+
Sbjct: 198 GVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQK 257
Query: 55 FSYCLVDRDSDSTSTLEFDS--SLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGD 107
F+YCL D S+L S ++ P T PL++N +FYYL L GISVGG
Sbjct: 258 FAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGT 317
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
L I ++ F++ + G+GG+I+DSGT +T ++ + +L++ F+ G D
Sbjct: 318 QLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDL 377
Query: 168 CYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
C++ + + VEVP ++FHF +G L LP +NY+I G C A +S +SI GN+
Sbjct: 378 CFNLPAGTNQVEVPKLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGMSIFGNL 435
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V +L+ + F P +C
Sbjct: 436 QQQNFMVVHDLQEETLSFLPTQC 458
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 156 bits (395), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 95/258 (36%), Positives = 141/258 (54%), Gaps = 16/258 (6%)
Query: 6 ETVTLGSASVDNIAI-----GCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
ET T G ++ D I+I GCG++N G F AGL+GLG G LS SQ+ F+YCL
Sbjct: 458 ETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCL 517
Query: 60 VDRDSDSTSTLEFDS--SLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
D S+L S ++ P T PL++N +FYYL L GISVGG L I
Sbjct: 518 TAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIP 577
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF- 171
++ F++ + G+GG+I+DSGT +T ++ + +L++ F+ G D C++
Sbjct: 578 KSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLP 637
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
+ + VEVP ++FHF +G L LP +NY+I G C A +S +SI GN+QQQ
Sbjct: 638 AGTNQVEVPKLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGMSIFGNLQQQNF 695
Query: 232 RVSFNLRNSLIGFTPNKC 249
V +L+ + F P +C
Sbjct: 696 MVVHDLQEETLSFLPTQC 713
>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
thaliana]
Length = 142
Score = 156 bits (395), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 76/139 (54%), Positives = 98/139 (70%), Gaps = 1/139 (0%)
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
++ + FK+D+ GNGG+I+DSGT+VTRL Y A+RDAF G + L +LFDTC+D
Sbjct: 4 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 63
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S+ + V+VPTV HF G + LPA NYLIPVD+NG FCFAFA T LSIIGN+QQQG
Sbjct: 64 LSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 122
Query: 231 TRVSFNLRNSLIGFTPNKC 249
RV ++L +S +GF P C
Sbjct: 123 FRVVYDLASSRVGFAPGGC 141
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 156 bits (394), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 97/260 (37%), Positives = 139/260 (53%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ E ++ G SV N GCG NN+GLF G +G++GLG +LS SQ N + FSY
Sbjct: 226 GELGVEHLSFGGISVSNFVFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSY 285
Query: 58 CLVDRDSDSTSTL------EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
CL DS ++ +L +L P A T+ ++ N +L FY L LTGI VGG + I
Sbjct: 286 CLPTTDSGASGSLVIGNESSLFKNLTPIAYTS-MVSNPQLSNFYVLNLTGIDVGG--VAI 342
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+T+F GNGGI++DSGT +TRL YNAL+ F++ +++ DTC++
Sbjct: 343 QDTSF-----GNGGILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNL 397
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
+ V +PT+S HF L + A L C A A S + ++IIGN QQ+
Sbjct: 398 TGIEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQR 457
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV ++ + S IGF C
Sbjct: 458 NQRVIYDAKQSKIGFAREDC 477
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 156 bits (394), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 107/270 (39%), Positives = 146/270 (54%), Gaps = 21/270 (7%)
Query: 1 GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ S V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 285 GDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 344
Query: 52 A---STFSYCLVDRDSDS--TSTLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
+ +FSYCLVDR+SD+ +S L F D L P L+ E +DTFYY+ +
Sbjct: 345 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIK 404
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
I VGG++L I E + + G GG IVDSGT ++ +Y ++DAFV+ +
Sbjct: 405 SIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIK 464
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
+ D CY+ S +E+P F +G V P +NY I ++ C A T S+
Sbjct: 465 DFPILDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSA 524
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LSIIGN QQQ + ++ + S +G+ P KC
Sbjct: 525 LSIIGNYQQQNFHILYDTKKSRLGYAPMKC 554
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TL + + + GCG NEG F AGL+GLG G LS SQ+ FSYCL
Sbjct: 193 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 252
Query: 60 VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
D + S L S + + T PL++N +FYY+ L I+VG + +
Sbjct: 253 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 312
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
+AF + + G GG+IVDSGT++T L+ + Y AL+ AF AL DG + D C+
Sbjct: 313 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 371
Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
++ VEVP + FHF G L LPA+NY++ +G C S LSIIGN QQ
Sbjct: 372 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 430
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q + +++ + + F P +C
Sbjct: 431 QNFQFVYDVGHDTLSFAPVQC 451
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TL + + + GCG NEG F AGL+GLG G LS SQ+ FSYCL
Sbjct: 183 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 242
Query: 60 VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
D + S L S + + T PL++N +FYY+ L I+VG + +
Sbjct: 243 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 302
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
+AF + + G GG+IVDSGT++T L+ + Y AL+ AF AL DG + D C+
Sbjct: 303 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 361
Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
++ VEVP + FHF G L LPA+NY++ +G C S LSIIGN QQ
Sbjct: 362 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 420
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q + +++ + + F P +C
Sbjct: 421 QNFQFVYDVGHDTLSFAPVQC 441
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TL + + + GCG NEG F AGL+GLG G LS SQ+ FSYCL
Sbjct: 162 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 221
Query: 60 VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
D + S L S + + T PL++N +FYY+ L I+VG + +
Sbjct: 222 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 281
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
+AF + + G GG+IVDSGT++T L+ + Y AL+ AF AL DG + D C+
Sbjct: 282 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 340
Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
++ VEVP + FHF G L LPA+NY++ +G C S LSIIGN QQ
Sbjct: 341 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 399
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q + +++ + + F P +C
Sbjct: 400 QNFQFVYDVGHDTLSFAPVQC 420
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/269 (37%), Positives = 142/269 (52%), Gaps = 25/269 (9%)
Query: 1 GDFVTETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G ET T G+ S V ++ GCG+ N G +G++G G G+LS SQ+ + FS
Sbjct: 172 GVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFS 231
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAV---------TAPLLRNHELDTFYYLGLTGISVGGD 107
YCL S +TS L F + N+ + P + N L T Y+L +TGISV GD
Sbjct: 232 YCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGD 291
Query: 108 LLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RA-LSPTDGV 162
LLPI + F I+E+ G GG+I+DSGT VT L Y ++ AFV RA +P+D
Sbjct: 292 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-- 349
Query: 163 ALFDTCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
FDTC+ + R V +P + HF +G + LP +NY++ G C A P+
Sbjct: 350 -TFDTCFKWPPPPRRMVTLPEMVLHF-DGADMELPLENYMVMDGGTGNLCLAMLPSDDG- 406
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIG+ Q Q + ++L NSL+ F P C
Sbjct: 407 SIIGSFQHQNFHMLYDLENSLLSFVPAPC 435
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TL + + + GCG NEG F AGL+GLG G LS SQ+ FSYCL
Sbjct: 255 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 314
Query: 60 VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
D + S L S + + T PL++N +FYY+ L I+VG + +
Sbjct: 315 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 374
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
+AF + + G GG+IVDSGT++T L+ + Y AL+ AF AL DG + D C+
Sbjct: 375 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 433
Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
++ VEVP + FHF G L LPA+NY++ +G C S LSIIGN QQ
Sbjct: 434 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 492
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q + +++ + + F P +C
Sbjct: 493 QNFQFVYDVGHDTLSFAPVQC 513
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/255 (38%), Positives = 131/255 (51%), Gaps = 12/255 (4%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G F TET+TL S++V N GCG N GLF GAAGLLGLG LS PSQ FS
Sbjct: 224 GFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFS 283
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S S L F + PL + + FY L +T +SVGG+ L I + F
Sbjct: 284 YCL-PASSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIF 342
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
G ++DSGT +TRL + Y+AL AF + TDG ++FDTCYDFS +
Sbjct: 343 -----STSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNET 397
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQGTRVS 234
+++P V F G + + L PV+ C AFA + +I GN QQ+ +V
Sbjct: 398 IKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVV 457
Query: 235 FNLRNSLIGFTPNKC 249
++ +GF P+ C
Sbjct: 458 YDDAKGRVGFAPSGC 472
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 101/269 (37%), Positives = 142/269 (52%), Gaps = 25/269 (9%)
Query: 1 GDFVTETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G ET T G+ S V ++ GCG+ N G +G++G G G+LS SQ+ + FS
Sbjct: 175 GVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFS 234
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAV---------TAPLLRNHELDTFYYLGLTGISVGGD 107
YCL S +TS L F + N+ + P + N L T Y+L +TGISV GD
Sbjct: 235 YCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGD 294
Query: 108 LLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RA-LSPTDGV 162
LLPI + F I+E+ G GG+I+DSGT VT L Y ++ AFV RA +P+D
Sbjct: 295 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-- 352
Query: 163 ALFDTCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
FDTC+ + R V +P + HF +G + LP +NY++ G C A P+
Sbjct: 353 -TFDTCFKWPPPPRRMVTLPEMVLHF-DGADMELPLENYMVMDGGTGNLCLAMLPSDDG- 409
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIG+ Q Q + ++L NSL+ F P C
Sbjct: 410 SIIGSFQHQNFHMLYDLENSLLSFVPAPC 438
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 136/257 (52%), Gaps = 16/257 (6%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F +T+TL S +V GCG N+GLF AAGLLGLG G S P Q F+
Sbjct: 271 GFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFA 330
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
+CL R S T L+F + PP T P+L + TFYY+G+TGI VGG LLPI+ + F
Sbjct: 331 HCLPAR-STGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVF 388
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSR 174
G IVDSGT +TRL Y++LR A R V+L DTCYDF+
Sbjct: 389 AA-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 443
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTR 232
S V +PTVS F G L + A + V ++ C AFA + I+GN Q +
Sbjct: 444 SQVAIPTVSLLFQGGAALDVDASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFG 502
Query: 233 VSFNLRNSLIGFTPNKC 249
V++++ ++GF+P C
Sbjct: 503 VAYDIGKKVVGFSPGAC 519
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 136/257 (52%), Gaps = 16/257 (6%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F +T+TL S +V GCG N+GLF AAGLLGLG G S P Q F+
Sbjct: 267 GFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFA 326
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
+CL R S T L+F + PP T P+L + TFYY+G+TGI VGG LLPI+ + F
Sbjct: 327 HCLPAR-STGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVF 384
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSR 174
G IVDSGT +TRL Y++LR A R V+L DTCYDF+
Sbjct: 385 AA-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 439
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTR 232
S V +PTVS F G L + A + V ++ C AFA + I+GN Q +
Sbjct: 440 SQVAIPTVSLLFQGGAALDVDASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFG 498
Query: 233 VSFNLRNSLIGFTPNKC 249
V++++ ++GF+P C
Sbjct: 499 VAYDIGKKVVGFSPGAC 515
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 99/264 (37%), Positives = 135/264 (51%), Gaps = 17/264 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TL V +A GCG NEG F AGL+GLG G LS SQ+ FSYCL
Sbjct: 211 GVLATETFTLARQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCL 270
Query: 60 VDRDSDS--------TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
D + ++ S+ A T PL++N +FYY+ LTG++VG L +
Sbjct: 271 TSLDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLAL 330
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
+AF I + G GG+IVDSGT++T L+ Y ALR AFV +L D + D C+
Sbjct: 331 PSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFV-AHMSLPTVDASEIGLDLCFQ 389
Query: 171 -----FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
V+VP + HF G L LPA+NY++ ++G C S LSIIGN
Sbjct: 390 GPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVM-ASRGLSIIGN 448
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + +++ + F P +C
Sbjct: 449 FQQQNFQFVYDVAGDTLSFAPAEC 472
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/257 (39%), Positives = 136/257 (52%), Gaps = 16/257 (6%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F +T+TL S +V GCG N+GLF AAGLLGLG G S P Q F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFA 327
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
+CL R S T L+F + PP T P+L + TFYY+G+TGI VGG LLPI+ + F
Sbjct: 328 HCLPPR-STGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVF 385
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSR 174
G IVDSGT +TRL Y++LR A R V+L DTCYDF+
Sbjct: 386 AA-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 440
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTR 232
S V +PTVS F G L + A + V ++ C AFA + I+GN Q +
Sbjct: 441 SQVAIPTVSLLFQGGAALDVDASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFG 499
Query: 233 VSFNLRNSLIGFTPNKC 249
V++++ ++GF+P C
Sbjct: 500 VAYDIGKKVVGFSPGAC 516
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/266 (40%), Positives = 144/266 (54%), Gaps = 17/266 (6%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
GD E T+ S VD++ GCGH+N GLF GAAGLLGLG G+LSF SQ+ A
Sbjct: 246 GDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVY 305
Query: 53 -STFSYCLVDRDSDSTSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLVD S S + F D +L N DTFYY+ L G+ VG
Sbjct: 306 GHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVG 365
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVAL 164
G+ L IS + + + + G+GG I+DSGT ++ Y +R AFV R +A +
Sbjct: 366 GEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPV 425
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSII 223
CY+ S VEVP S F +G V PA+NY + +D +G C A T S++SII
Sbjct: 426 LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN QQQ V ++L+N+ +GF P +C
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/266 (40%), Positives = 144/266 (54%), Gaps = 17/266 (6%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
GD E T+ S VD++ GCGH+N GLF GAAGLLGLG G+LSF SQ+ A
Sbjct: 246 GDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVY 305
Query: 53 -STFSYCLVDRDSDSTSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLVD S S + F D +L N DTFYY+ L G+ VG
Sbjct: 306 GHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVG 365
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVAL 164
G+ L IS + + + + G+GG I+DSGT ++ Y +R AFV R +A +
Sbjct: 366 GEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPV 425
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSII 223
CY+ S VEVP S F +G V PA+NY + +D +G C A T S++SII
Sbjct: 426 LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN QQQ V ++L+N+ +GF P +C
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 103/265 (38%), Positives = 137/265 (51%), Gaps = 18/265 (6%)
Query: 1 GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G ET T G+ ++ I+ GCG+ N G +G++G G GSLS SQ+ + FS
Sbjct: 179 GVLANETFTFGTNDTRVTLPRISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFS 238
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLP 110
YCL S S L F + N+ A P + N L T Y+L +TGISVGG+ LP
Sbjct: 239 YCLTSFLSPVRSRLYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLP 298
Query: 111 ISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVALFD 166
I I D G GG I+DSGT +T L Y A+R+AFV T L ++ D
Sbjct: 299 IDPAVLAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLD 358
Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
TC+ + R SV +P + HF +G LP +NY++ S G C A A TSS SIIG
Sbjct: 359 TCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQNYMLVDPSTGGLCLAMA-TSSDGSIIG 416
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
+ Q Q V ++L NSL+ F P C
Sbjct: 417 SYQHQNFNVLYDLENSLLSFVPAPC 441
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 153 bits (386), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 142/259 (54%), Gaps = 13/259 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLF--VGAAGLLGLGGGSLSFPSQINA---STF 55
G F ET+T + + + G N G F G G+LGLG G +S PSQ+ + + F
Sbjct: 114 GYFSKETITATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKF 173
Query: 56 SYCLVDRDS--DSTSTLEF-DSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPI 111
SYCLVD S TST+ F D+++P V P++ N + T+YY+ + GISVGG LL I
Sbjct: 174 SYCLVDWLSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDI 233
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
++ ++ID G+GG I+DSGT +T LQ E +NAL A+ R + T L D C++
Sbjct: 234 DQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATGL-DLCFNT 292
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQG 230
S P ++ H +G L LP N I +++N C AFA ++I GN+QQQ
Sbjct: 293 RGTGSPVFPAMTIHL-DGVHLELPTANTFISLETN-IICLAFASALDFPIAIFGNIQQQN 350
Query: 231 TRVSFNLRNSLIGFTPNKC 249
+ ++L N IGF P C
Sbjct: 351 FDIVYDLDNMRIGFAPADC 369
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 152 bits (385), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 97/240 (40%), Positives = 129/240 (53%), Gaps = 8/240 (3%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDSTSTLE 71
+ N+A GCGH N G F GAAG++GLG G LS SQ I + FSYCLV S TS +
Sbjct: 180 IPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPML 239
Query: 72 F-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDS 130
DS+ LL N TFYY LTGISV G + F ID SG GG I+DS
Sbjct: 240 IGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDS 299
Query: 131 GTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVEVPTVSFHFPEG 189
GT +T L+T +NAL A ++ DG D C+ + ++ PT++FHF +G
Sbjct: 300 GTTLTYLETGAFNALVAA-LKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHF-KG 357
Query: 190 KVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LP +N + +D+ G+ C A A S+ SI+GN+QQQ + +L N +GF C
Sbjct: 358 ADYELPPENVFVALDTGGSICLAMA-ASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/265 (35%), Positives = 134/265 (50%), Gaps = 18/265 (6%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G ET T G+AS NI+ GCG N G ++G++G G G LS SQ+ S F
Sbjct: 176 GVLANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSRF 235
Query: 56 SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
SYCL S + S L F ++S + P + N L Y+L + GIS+G
Sbjct: 236 SYCLTSYLSPTPSRLYFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGT 295
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
LPI F I++ G GG+I+DSGT++T LQ + Y A+R + D D
Sbjct: 296 KRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLD 355
Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
TC+ + +V VP FHF +G + LP +NY++ + G C A APTS +IIG
Sbjct: 356 TCFQWPPPPNVTVTVPDFVFHF-DGANMTLPPENYMLIASTTGYLCLAMAPTSVG-TIIG 413
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ + +++ NS + F P C
Sbjct: 414 NYQQQNLHLLYDIANSFLSFVPAPC 438
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 103/260 (39%), Positives = 136/260 (52%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S +V GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 267 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 326
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+CL R S T L+F P A +T P+L ++ TFYY+G+TGI VGG LL I +
Sbjct: 327 HCLPAR-SSGTGYLDFGPGSPAAAGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 384
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGVALFDTCYDF 171
+ F G IVDSGT +TRL Y++LR AFV R V+L DTCYDF
Sbjct: 385 SVFA-----TAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDF 439
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
+ S V +PTVS F G +L + A + S C FA + I+GN Q +
Sbjct: 440 TGMSQVAIPTVSLLFQGGAILDVDASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLK 498
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V++++ ++GF+P C
Sbjct: 499 TFGVAYDIGKKVVGFSPGAC 518
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 141/268 (52%), Gaps = 24/268 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G TET T S+ I GCG NEG F +GL+GLG G LS SQ+ + FSYC
Sbjct: 196 GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYC 255
Query: 59 LVD-RDSDSTSTLEFDSSLPPNAV-------------TAPLLRNHELDTFYYLGLTGISV 104
L DS+++S+L F SL V T LLRN + +FYYL L GI+V
Sbjct: 256 LTSIEDSEASSSL-FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITV 314
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD--GV 162
G L + ++ F++ E G GG+I+DSGT +T L+ + L++ F +R P D G
Sbjct: 315 GAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFT--SRMSLPVDDSGS 372
Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
D C+ + ++ VP + FHF +G L LP +NY++ S G C A +S+ +S
Sbjct: 373 TGLDLCFKLPDAAKNIAVPKMIFHF-KGADLELPGENYMVADSSTGVLCLAMG-SSNGMS 430
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I GNVQQQ V +L + F P +C
Sbjct: 431 IFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 141/268 (52%), Gaps = 24/268 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G TET T S+ I GCG NEG F +GL+GLG G LS SQ+ + FSYC
Sbjct: 88 GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYC 147
Query: 59 LVD-RDSDSTSTLEFDSSLPPNAV-------------TAPLLRNHELDTFYYLGLTGISV 104
L DS+++S+L F SL V T LLRN + +FYYL L GI+V
Sbjct: 148 LTSIEDSEASSSL-FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITV 206
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD--GV 162
G L + ++ F++ E G GG+I+DSGT +T L+ + L++ F +R P D G
Sbjct: 207 GAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFT--SRMSLPVDDSGS 264
Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
D C+ + ++ VP + FHF +G L LP +NY++ S G C A +S+ +S
Sbjct: 265 TGLDLCFKLPDAAKNIAVPKMIFHF-KGADLELPGENYMVADSSTGVLCLAMG-SSNGMS 322
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I GNVQQQ V +L + F P +C
Sbjct: 323 IFGNVQQQNFNVLHDLEKETVSFVPTEC 350
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 142/268 (52%), Gaps = 24/268 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G TET T S+ I GCG NEG F +GL+GLG G LS SQ+ + FSYC
Sbjct: 197 GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYC 256
Query: 59 LVD-RDSDSTSTLEFDSSLPPNAV-------------TAPLLRNHELDTFYYLGLTGISV 104
L DS+++S+L F SL V T LLRN + +FYYL L GI+V
Sbjct: 257 LTSIEDSEASSSL-FIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITV 315
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD--GV 162
G L + ++ F++ E G GG+I+DSGT +T L+ + L++ F +R P D G
Sbjct: 316 GAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFT--SRMSLPVDDSGS 373
Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
D C+ ++ ++ VP + FHF +G L LP +NY++ S G C A +S+ +S
Sbjct: 374 TGLDLCFKLPNAAKNIAVPKLIFHF-KGADLELPGENYMVADSSTGVLCLAMG-SSNGMS 431
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I GNVQQQ V +L + F P +C
Sbjct: 432 IFGNVQQQNFNVLHDLEKETVTFVPTEC 459
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/273 (40%), Positives = 147/273 (53%), Gaps = 24/273 (8%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
GD E+ T+ S VD + GCGH N GLF GAAGLLGLG G LSF SQ+ A
Sbjct: 250 GDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 309
Query: 53 -STFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-PLLR----------NHELDTFYYLGLT 100
TFSYCLVD SD S + F A+ A P L+ + DTFYY+ L
Sbjct: 310 GHTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLK 369
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPT 159
G+ VGG+LL IS + + + G+GG I+DSGT ++ Y +R AF+ R +R+
Sbjct: 370 GVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLV 429
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG--TFCFAFAPT- 216
+ CY+ S EVP +S F +G V PA+NY I +D +G C A T
Sbjct: 430 PEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTP 489
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +SIIGN QQQ V ++L+N+ +GF P +C
Sbjct: 490 RTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 522
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/270 (35%), Positives = 145/270 (53%), Gaps = 21/270 (7%)
Query: 1 GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ + V+N+ GCGH N GLF GAAGLLGLG G LSF +Q+
Sbjct: 288 GDFALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQ 347
Query: 52 A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
+ +FSYCLVDR+S+S+ + + ++ P L + + +DTFYY+ +
Sbjct: 348 SLYGHSFSYCLVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIK 407
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
I VGG++L I E + + G GG I+DSGT +T Y +++AF+R + +
Sbjct: 408 SIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVE 467
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
CY+ S +E+P + F +G + P +NY I ++ C A T S+
Sbjct: 468 TFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSA 527
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LSIIGN QQQ + ++L+ S +G+ P KC
Sbjct: 528 LSIIGNYQQQNFHILYDLKKSRLGYAPMKC 557
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 151 bits (381), Expect = 3e-34, Method: Composition-based stats.
Identities = 100/271 (36%), Positives = 142/271 (52%), Gaps = 22/271 (8%)
Query: 1 GDFVTETVTLGSAS----------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI 50
GDF ET T+ S V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 289 GDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQL 348
Query: 51 NA---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGL 99
+ +FSYCLVDRDSD++ + + + +T P L + + +DTFYYL +
Sbjct: 349 QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQI 408
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
I VGG+ L I E + + G GG I+DSGT ++ Y +++AF+R +
Sbjct: 409 KSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLV 468
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
+ + CY+ S + P F +G V P +NY I + C A T S
Sbjct: 469 EDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKS 528
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+LSIIGN QQQ + ++ +NS +G+ P +C
Sbjct: 529 ALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 109/264 (41%), Positives = 142/264 (53%), Gaps = 15/264 (5%)
Query: 1 GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
GD E T+ S VD + +GCGH N GLF GAAGLLGLG G LSF SQ+ A
Sbjct: 244 GDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYG 303
Query: 53 STFSYCLVDRDSDSTSTLEF--DSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
FSYCLVD S S + F D+ L P + +TFYY+ L GI VGG++
Sbjct: 304 HAFSYCLVDHGSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEM 363
Query: 109 LPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
L I + + E G+GG I+DSGT ++ Y A+R AFV R +A +
Sbjct: 364 LDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLS 423
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGN 225
CY+ S VEVP S F +G V PA+NY I +D+ G C A T S++SIIGN
Sbjct: 424 PCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGN 483
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V ++L ++ +GF P +C
Sbjct: 484 YQQQNFHVLYDLHHNRLGFAPRRC 507
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 151 bits (381), Expect = 3e-34, Method: Composition-based stats.
Identities = 100/271 (36%), Positives = 142/271 (52%), Gaps = 22/271 (8%)
Query: 1 GDFVTETVTLGSAS----------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI 50
GDF ET T+ S V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 289 GDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQL 348
Query: 51 NA---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGL 99
+ +FSYCLVDRDSD++ + + + +T P L + + +DTFYYL +
Sbjct: 349 QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQI 408
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
I VGG+ L I E + + G GG I+DSGT ++ Y +++AF+R +
Sbjct: 409 KSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLV 468
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
+ + CY+ S + P F +G V P +NY I + C A T S
Sbjct: 469 EDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKS 528
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+LSIIGN QQQ + ++ +NS +G+ P +C
Sbjct: 529 ALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 101/267 (37%), Positives = 137/267 (51%), Gaps = 20/267 (7%)
Query: 1 GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G ET T G+ +V IA GCG+ N G +G++G G G LS SQ+ + FS
Sbjct: 176 GVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFS 235
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGD 107
YCL S S L F + N+ +A P + N L T YYL +TGISVGG+
Sbjct: 236 YCLTSFMSPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGE 295
Query: 108 LLPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVAL 164
LLPI + F I D G GG+I+DSG+ +T L Y+ + AF G + T +
Sbjct: 296 LLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADV 355
Query: 165 FDTCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
DTC+ + R V +P ++FHF EG + LP +NY++ G C A A S SI
Sbjct: 356 LDTCFVWPPPPRKIVTMPELAFHF-EGANMELPLENYMLIDGDTGNLCLAIA-ASDDGSI 413
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IG+ Q Q V ++ NSL+ FTP C
Sbjct: 414 IGSFQHQNFHVLYDNENSLLSFTPATC 440
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 97/256 (37%), Positives = 135/256 (52%), Gaps = 14/256 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSF---PSQINASTFS 56
G F ET+TL S V +N GCG NN GLF AAGL+GLG +S +Q FS
Sbjct: 225 GYFAKETLTLTSTDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFS 284
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
YCL + S ST L F A+ P+ + H + FY + + G+ VGG +PIS +
Sbjct: 285 YCL-PKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSV 343
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F G I+DSGT +TRL + Y+AL+ AF +G +++ DTCYD S S
Sbjct: 344 FS-----TSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYS 398
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
++++P V F F G+ L L + S C AFA S+++IIGNVQQ+ +V
Sbjct: 399 TIQIPKVGFVFKGGEELDLDGIGIMYGA-STSQVCLAFAGNQDPSTVAIIGNVQQKTLQV 457
Query: 234 SFNLRNSLIGFTPNKC 249
+++ IGF N C
Sbjct: 458 VYDVGGGKIGFGYNGC 473
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 93/261 (35%), Positives = 138/261 (52%), Gaps = 20/261 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ E ++ G SV + GCG NN+GLF G +GL+GLG LS SQ NA+ FSY
Sbjct: 157 GELGVEQLSFGGVSVSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSY 216
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNAV---TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL +S ++ +L +SS+ N +L N +L FY L LTGI V G
Sbjct: 217 CLPTTESGASGSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDG------ 270
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
A ++ GNGG+++DSGT +TRL + Y AL+ F++ G ++ DTC++ +
Sbjct: 271 -VALQVPSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLT 329
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQ 228
V +PT+S HF L + A Y++ D++ C A A S + +IIGN QQ
Sbjct: 330 GYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDAS-QVCLALASLSDAYDTAIIGNYQQ 388
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ RV ++ + S +GF C
Sbjct: 389 RNQRVIYDTKQSKVGFAEESC 409
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 100/262 (38%), Positives = 142/262 (54%), Gaps = 21/262 (8%)
Query: 5 TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
+ET T GSA+ D IA GC + + + G+AGL+GLG GSLS SQ+ A FSYCL
Sbjct: 188 SETFTFGSAAADQARVPGIAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL 247
Query: 60 VD-RDSDSTSTLEFDSSLPPNAV---TAPLLR---NHELDTFYYLGLTGISVGGDLLPIS 112
+D++STSTL S N + P + + T+YYL LTGIS+G L IS
Sbjct: 248 TPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSIS 307
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDTCYD 170
AF + G GG+I+DSGT +T L Y +R A V+ L DG D CY
Sbjct: 308 PDAFSLKADGTGGLIIDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDSTGLDLCYA 366
Query: 171 FSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGNVQ 227
+ +S +P+++ HF +G + LPA +Y+I +G +C A T ++S GN Q
Sbjct: 367 LPTPTSAPPAMPSMTLHF-DGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQ 423
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ + +++RN ++ F P KC
Sbjct: 424 QQNMHILYDVRNEMLSFAPAKC 445
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 149 bits (376), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 99/263 (37%), Positives = 138/263 (52%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ E + G SV N GCG NN+GLF GA+GL+GLG LS SQ NA+ FSY
Sbjct: 212 GELGIEKLGFGGISVSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSY 271
Query: 58 CL--VDRDSDSTSTLEFDSS-----LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL D+ S S + + S + P A T +L N +L FY L LTGI VGG L
Sbjct: 272 CLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTR-MLPNLQLSNFYILNLTGIDVGGVSLH 330
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+ ++F GNGG+I+DSGT ++RL Y AL+ F+ G ++ DTC++
Sbjct: 331 VQASSF-----GNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFN 385
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTSS--SLSIIGNV 226
+ V +PT+S +F L + A YL+ D++ C A A S + IIGN
Sbjct: 386 LTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDAS-RVCLALASLSDEYEMGIIGNY 444
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV ++ + S +GF C
Sbjct: 445 QQRNQRVLYDAKLSQVGFAKEPC 467
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 132/265 (49%), Gaps = 18/265 (6%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G ET T G+A+ NIA GCG N G ++G++G G G LS SQ+ S F
Sbjct: 176 GVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF 235
Query: 56 SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
SYCL S + S L F ++S + P + N L Y+L L IS+G
Sbjct: 236 SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGT 295
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
LLPI F I++ G GG+I+DSGT++T LQ + Y A+R V + D D
Sbjct: 296 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLD 355
Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
TC+ + +V VP + FHF + LP +NY++ + G C APT +IIG
Sbjct: 356 TCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGYLCLVMAPTGVG-TIIG 413
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ + +++ NS + F P C
Sbjct: 414 NYQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 132/265 (49%), Gaps = 18/265 (6%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G ET T G+A+ NIA GCG N G ++G++G G G LS SQ+ S F
Sbjct: 71 GVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF 130
Query: 56 SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
SYCL S + S L F ++S + P + N L Y+L L IS+G
Sbjct: 131 SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGT 190
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
LLPI F I++ G GG+I+DSGT++T LQ + Y A+R V + D D
Sbjct: 191 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLD 250
Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
TC+ + +V VP + FHF + LP +NY++ + G C APT +IIG
Sbjct: 251 TCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGYLCLVMAPTGVG-TIIG 308
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ + +++ NS + F P C
Sbjct: 309 NYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 140/263 (53%), Gaps = 23/263 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ E ++ G SV + GCG NN+GLF G +GL+GLG LS SQ NA+ FSY
Sbjct: 158 GELGVEALSFGGVSVSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSY 217
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLL--P 110
CL ++ S+ +L +SS+ NA +L N +L FY L LTGI VGG L P
Sbjct: 218 CLPTTEAGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAP 277
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+S GNGGI++DSGT +TRL + Y AL+ F++ G ++ DTC++
Sbjct: 278 LS--------FGNGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFN 329
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTSSSL--SIIGNV 226
+ V +PT+S F L + A Y++ D++ C A A S + +IIGN
Sbjct: 330 LTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDAS-QVCLALASLSDAYDTAIIGNY 388
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV ++ + S +GF C
Sbjct: 389 QQRNQRVIYDTKQSKVGFAEEPC 411
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/260 (39%), Positives = 134/260 (51%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S +V GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 327
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+CL R S T L+F P A +T P+L ++ TFYY+G+TGI VGG LL I +
Sbjct: 328 HCLPAR-SSGTGYLDFGPGSPAAAGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 385
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDF 171
+ F G IVDSGT +TRL Y++LR AF R V+L DTCYDF
Sbjct: 386 SVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDF 440
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
+ S V +PTVS F G L + A + S C FA + I+GN Q +
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLK 499
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V++++ ++GF+P C
Sbjct: 500 TFGVAYDIGKKVVGFSPGAC 519
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/271 (38%), Positives = 146/271 (53%), Gaps = 24/271 (8%)
Query: 1 GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---A 52
G +ETVTL S + NIA GCGH N G F A+GL+GLG G+LSF SQ+
Sbjct: 126 GTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFG 185
Query: 53 STFSYCLVD-RDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
FSYCLV RD+ S ++ F S + P++ N +++FYY+ L IS
Sbjct: 186 HKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDIS 245
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-V 162
+ G L I +F I G+GG+I DSGT +T L Y + A +R + DG
Sbjct: 246 IAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-LRSKISFPKIDGSS 304
Query: 163 ALFDTCYDFS-SRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS 218
A D CYD S S++S +++P + FHF EG LP +NY I + GT C A ++
Sbjct: 305 AGLDLCYDVSGSKASYKMKIPAMVFHF-EGADYQLPVENYFIAANDAGTIVCLAMVSSNM 363
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ I GN+ QQ RV +++ +S IG+ P++C
Sbjct: 364 DIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 148 bits (374), Expect = 2e-33, Method: Composition-based stats.
Identities = 103/270 (38%), Positives = 148/270 (54%), Gaps = 22/270 (8%)
Query: 1 GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ + V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 285 GDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 344
Query: 52 A---STFSYCLVDRDSDST--STLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
+ +FSYCLVDR+SD++ S L F D L PN + E +DTFYY+G+
Sbjct: 345 SIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIK 404
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
I V G++L I E + + + G GG I+DSGT +T Y +++AF++ + +
Sbjct: 405 SIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVE 464
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
G CY+ S +E+P F +G + P +NY I ++ + C A T S+
Sbjct: 465 GFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQIEPD-LVCLAILGTPKSA 523
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LSIIGN QQQ + ++++ S +G+ P KC
Sbjct: 524 LSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 134/255 (52%), Gaps = 15/255 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G TET TL + +V +N GCG NN+GLF GAAGL+GLG S SQ+ S FS
Sbjct: 104 GFLATETFTLAAGNVFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFS 163
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S +T L + L TA +L N T Y++ L GISVGG L +S T F
Sbjct: 164 YCL-PSTSSATGYLNIGNPLRTPGYTA-MLTNSRAPTLYFIDLIGISVGGTRLALSSTVF 221
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ + G I+DSGT +TRL Y ALR AF + ++ DTCYDFS ++
Sbjct: 222 Q-----SVGTIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTT 276
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVS 234
V PT+ H+ G + +P + V S+ C AFA S S + IIGNVQQ+ V+
Sbjct: 277 VTFPTIKLHY-TGLDVTIPGAG-VFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVT 334
Query: 235 FNLRNSLIGFTPNKC 249
++ IGF C
Sbjct: 335 YDNALKRIGFAAGAC 349
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 102/258 (39%), Positives = 134/258 (51%), Gaps = 18/258 (6%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G E TL ++ V D + GCG NN+GLF G AGLLGLG LSFPSQ + FS
Sbjct: 225 GFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFS 284
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
YCL S T L F S+ +V P+ + +FY L + I+VGG LPI T
Sbjct: 285 YCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTV 343
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F G ++DSGT +TRL + Y ALR +F T GV++ DTC+D S
Sbjct: 344 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 398
Query: 176 SVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGT 231
+V +P V+F F G V+ L +K Y+ + C AFA S S+ +I GNVQQQ
Sbjct: 399 TVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ---VCLAFAGNSDDSNAAIFGNVQQQTL 455
Query: 232 RVSFNLRNSLIGFTPNKC 249
V ++ +GF PN C
Sbjct: 456 EVVYDGAGGRVGFAPNGC 473
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 102/258 (39%), Positives = 134/258 (51%), Gaps = 18/258 (6%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G E TL ++ V D + GCG NN+GLF G AGLLGLG LSFPSQ + FS
Sbjct: 197 GFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFS 256
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
YCL S T L F S+ +V P+ + +FY L + I+VGG LPI T
Sbjct: 257 YCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTV 315
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F G ++DSGT +TRL + Y ALR +F T GV++ DTC+D S
Sbjct: 316 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 370
Query: 176 SVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGT 231
+V +P V+F F G V+ L +K Y+ + C AFA S S+ +I GNVQQQ
Sbjct: 371 TVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ---VCLAFAGNSDDSNAAIFGNVQQQTL 427
Query: 232 RVSFNLRNSLIGFTPNKC 249
V ++ +GF PN C
Sbjct: 428 EVVYDGAGGRVGFAPNGC 445
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/271 (38%), Positives = 145/271 (53%), Gaps = 24/271 (8%)
Query: 1 GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---A 52
G +ETVTL S + NIA GCGH N G F A+GL+GLG G+LSF SQ+
Sbjct: 126 GTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFG 185
Query: 53 STFSYCLVD-RDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
FSYCLV RD+ S ++ F S + P++ N +++FYY+ L IS
Sbjct: 186 HKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDIS 245
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-V 162
+ G L I +F I G+GG+I DSGT +T L Y + A +R + DG
Sbjct: 246 IAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-LRSKVSFPEIDGSS 304
Query: 163 ALFDTCYDFS-SRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS 218
A D CYD S S++S ++P + FHF EG LP +NY I + GT C A ++
Sbjct: 305 AGLDLCYDVSGSKASYKKKIPAMVFHF-EGADHQLPVENYFIAANDAGTIVCLAMVSSNM 363
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ I GN+ QQ RV +++ +S IG+ P++C
Sbjct: 364 DIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/257 (38%), Positives = 131/257 (50%), Gaps = 15/257 (5%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S +V GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 267 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 326
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
+CL R S T L+F + P +T + TFYY+GLTGI VGG LL I ++ F
Sbjct: 327 HCLPAR-STGTGYLDFGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVF 385
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG--TRALSPTDGVALFDTCYDFSSR 174
G IVDSGT +TRL Y++LR AF R V+L DTCYDF+
Sbjct: 386 -----ATAGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGM 440
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTR 232
S V +PTVS F G L + A + ++ C AFA + I+GN Q +
Sbjct: 441 SQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLKTFG 499
Query: 233 VSFNLRNSLIGFTPNKC 249
V++++ ++ F+P C
Sbjct: 500 VAYDIGKKVVSFSPGAC 516
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 141/284 (49%), Gaps = 42/284 (14%)
Query: 1 GDFVTETVTLGS---------ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF +ET T+ V ++ GCGH N+G F GA+GLLGLG G +SFPSQI
Sbjct: 264 GDFASETFTVNLTWPNGKEKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQ 323
Query: 52 A---STFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHEL-------------DT 93
+ +FSYCL D S++ +S L F LL NH L +T
Sbjct: 324 SIYGHSFSYCLTDLFSNTSVSSKLIFGED-------KELLNNHNLNFTTLLAGEETPDET 376
Query: 94 FYYLGLTGISVGGDLLPISETAFKIDES-----GNGGIIVDSGTAVTRLQTETYNALRDA 148
FYYL + I VGG++L ISE + GG I+DSG+ +T Y+ +++A
Sbjct: 377 FYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEA 436
Query: 149 FVRGTRALSPTDGVALFDTCYDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG 207
F + + + CY+ S + VE+P HF +G V PA+NY + +
Sbjct: 437 FEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDE 496
Query: 208 TFCFAF--APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C A P S L+IIGN+ QQ + ++++ S +G++P +C
Sbjct: 497 VICLAIMKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 540
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 95/270 (35%), Positives = 143/270 (52%), Gaps = 21/270 (7%)
Query: 1 GDFVTETVTLGSA---------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 88 GDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 147
Query: 52 A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
+ +FSYCLVDR+SD+ + + + ++ P L + + +DTFYY+ +
Sbjct: 148 SLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIK 207
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
I VGG+++ I E ++I G+GG I+DSGT ++ Y +++AF+ +
Sbjct: 208 SIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVK 267
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
+ + CY+ + ++P F +G V P +NY I ++ C A T S+
Sbjct: 268 DFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSA 327
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LSIIGN QQQ + ++ + S +GF P KC
Sbjct: 328 LSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 94/269 (34%), Positives = 142/269 (52%), Gaps = 20/269 (7%)
Query: 1 GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ + V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 288 GDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQ 347
Query: 52 A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
+ +FSYCLVDR+S+++ + + ++ P L ++ +DTFYY+ +
Sbjct: 348 SLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIN 407
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
+ V ++L I E + + G GG I+DSGT +T Y +++AFVR + +
Sbjct: 408 SVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVE 467
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
G+ CY+ S +E+P F +G V P +NY I +D + S+L
Sbjct: 468 GLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSAL 527
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN QQQ + ++++ S +G+ P KC
Sbjct: 528 SIIGNYQQQNFHILYDMKKSRLGYAPMKC 556
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 101/256 (39%), Positives = 132/256 (51%), Gaps = 14/256 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + TL S+ V D + GCG NN+GLF G AGLLGLG LSFPSQ + FS
Sbjct: 226 GFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFS 285
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
YCL S T L F S+ +V P+ + +FY L + I+VGG LPI T
Sbjct: 286 YCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTV 344
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F G ++DSGT +TRL + Y ALR +F T GV++ DTC+D S
Sbjct: 345 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 399
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
+V +P V+F F G V+ L +K + C AFA S S+ +I GNVQQQ V
Sbjct: 400 TVTIPKVAFSFSGGAVVELGSKGIFYAFKIS-QVCLAFAGNSDDSNAAIFGNVQQQTLEV 458
Query: 234 SFNLRNSLIGFTPNKC 249
++ +GF PN C
Sbjct: 459 VYDGAGGRVGFAPNGC 474
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/259 (37%), Positives = 132/259 (50%), Gaps = 18/259 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G + +T+TL ++ N GCG N GLF AAGLLGLG G S P Q F+Y
Sbjct: 184 GFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAY 243
Query: 58 CLVDRDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL S T L+ P NA P+L + TFYY+G+TGI VGG +LPI + F
Sbjct: 244 CL-PATSAGTGFLDLGPGAPAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVF 301
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFSSR 174
G +VDSGT +TRL Y LR AF + + L S ++ DTCYD +
Sbjct: 302 S-----TAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGH 356
Query: 175 S--SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQG 230
S+ +P VS F G L + A L D + C AFAP + + ++I+GN QQ+
Sbjct: 357 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQA-CLAFAPNADDTDVAIVGNTQQKT 415
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V +++ ++GF P C
Sbjct: 416 HGVLYDIGKKIVGFAPGAC 434
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/259 (37%), Positives = 132/259 (50%), Gaps = 18/259 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G + +T+TL ++ N GCG N GLF AAGLLGLG G S P Q F+Y
Sbjct: 249 GFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAY 308
Query: 58 CLVDRDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL S T L+ P NA P+L + TFYY+G+TGI VGG +LPI + F
Sbjct: 309 CL-PATSAGTGFLDLGPGAPAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVF 366
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFSSR 174
G +VDSGT +TRL Y LR AF + + L S ++ DTCYD +
Sbjct: 367 S-----TAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGH 421
Query: 175 S--SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQG 230
S+ +P VS F G L + A L D + C AFAP + + ++I+GN QQ+
Sbjct: 422 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQA-CLAFAPNADDTDVAIVGNTQQKT 480
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V +++ ++GF P C
Sbjct: 481 HGVLYDIGKKIVGFAPGAC 499
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 140/263 (53%), Gaps = 21/263 (7%)
Query: 5 TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
+ET T GS++ D +A GC + + + G+AGL+GLG GSLS SQ+ A FSYCL
Sbjct: 207 SETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL 266
Query: 60 VD-RDSDSTSTLEFDSSLPPNAV---TAPLLRN---HELDTFYYLGLTGISVGGDLLPIS 112
+D++STSTL S N + P + + + T+YYL LTGIS+G LPIS
Sbjct: 267 TPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPIS 326
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDTCYD 170
AF + G GG+I+DSGT +T L Y +R A L DG D C+
Sbjct: 327 PGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFA 386
Query: 171 FSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGNV 226
+ +S +P+++ HF +G + LPA +Y+I +G +C A T ++S GN
Sbjct: 387 LPAPTSAPPAVLPSMTLHF-DGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNY 443
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + +++R + F P KC
Sbjct: 444 QQQNMHILYDVREETLSFAPAKC 466
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/261 (39%), Positives = 134/261 (51%), Gaps = 20/261 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S ++ GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 35 GFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 94
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL----DTFYYLGLTGISVGGDLLPIS 112
+C R S T LEF P AV+A L L TFYY+G+TGI VGG LLPI
Sbjct: 95 HCFPAR-SSGTGYLEFGPGSSP-AVSAKLSTTPMLIDTGPTFYYVGMTGIRVGGKLLPIP 152
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGVALFDTCYD 170
++ F G IVDSGT +TRL Y++LR AF R ++L DTCYD
Sbjct: 153 QSVFA-----AAGTIVDSGTVITRLPPAAYSSLRSAFAASMAARGYKRAPALSLLDTCYD 207
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQ 228
+ S V +PTVS F G L + A +I S C FA ++ ++I+GN Q
Sbjct: 208 LTGASEVAIPTVSLLFQGGVSLDVDASG-IIYAASVSQACLGFAGNEAADDVAIVGNTQL 266
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ V +++ + ++GF P C
Sbjct: 267 KTFGVVYDIASKVVGFCPGAC 287
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/270 (35%), Positives = 143/270 (52%), Gaps = 21/270 (7%)
Query: 1 GDFVTETVTLGSA---------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 274 GDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 333
Query: 52 A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
+ +FSYCLVDR+SD+ + + + ++ P L + + +DTFYY+ +
Sbjct: 334 SLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIK 393
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
I VGG+++ I E ++I G+GG I+DSGT ++ Y +++AF+ +
Sbjct: 394 SIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVK 453
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
+ + CY+ + ++P F +G V P +NY I ++ C A T S+
Sbjct: 454 DFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSA 513
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LSIIGN QQQ + ++ + S +GF P KC
Sbjct: 514 LSIIGNYQQQNFHILYDTKKSRLGFAPTKC 543
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/257 (37%), Positives = 130/257 (50%), Gaps = 16/257 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G F +T+T+ ++ GCG N GLF AGL+GLG G S Q F+Y
Sbjct: 251 GFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAY 310
Query: 58 CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL + T L+F S NA P+L + + TFYY+G+TGI VGG +P++E+ F
Sbjct: 311 CLPAL-TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVF 368
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDFSSR 174
G +VDSGT +TRL Y AL AF V R G ++ DTCYDF+
Sbjct: 369 S-----TAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGL 423
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTR 232
S VE+PTVS F G L + + + S C AFA S++I+GN QQ+
Sbjct: 424 SDVELPTVSLVFQGGACLDVDVSGIVYAI-SEAQVCLAFASNGDDESVAIVGNTQQKTYG 482
Query: 233 VSFNLRNSLIGFTPNKC 249
V ++L +GF P C
Sbjct: 483 VLYDLGKKTVGFAPGSC 499
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/257 (37%), Positives = 130/257 (50%), Gaps = 16/257 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G F +T+T+ ++ GCG N GLF AGL+GLG G S Q F+Y
Sbjct: 251 GFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAY 310
Query: 58 CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL + T L+F S NA P+L + + TFYY+G+TGI VGG +P++E+ F
Sbjct: 311 CLPAL-TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVF 368
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDFSSR 174
G +VDSGT +TRL Y AL AF V R G ++ DTCYDF+
Sbjct: 369 S-----TAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGL 423
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTR 232
S VE+PTVS F G L + + + S C AFA S++I+GN QQ+
Sbjct: 424 SDVELPTVSLVFQGGACLDVDVSGIVYAI-SEAQVCLAFASNGDDESVAIVGNTQQKTYG 482
Query: 233 VSFNLRNSLIGFTPNKC 249
V ++L +GF P C
Sbjct: 483 VLYDLGKKTVGFAPGSC 499
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 100/243 (41%), Positives = 124/243 (51%), Gaps = 15/243 (6%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDSTSTLE 71
VD+ GCG +NEGLF G+AGL+GLG +SF Q I FSYCL S S L
Sbjct: 245 VDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL-PSTSSSLGHLT 303
Query: 72 FDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGIIV 128
F +S NA PL +TFY L + GISVGG LP +S + F GG I+
Sbjct: 304 FGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSII 358
Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
DSGT +TRL Y ALR AF +G + LFDTCYDFS + VP + F F
Sbjct: 359 DSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFAG 418
Query: 189 GKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
G + LP LI S C AFA + ++I GNVQQ+ V +++ IGF
Sbjct: 419 GVTVELPLVGILIG-RSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGA 477
Query: 247 NKC 249
C
Sbjct: 478 AGC 480
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/263 (39%), Positives = 141/263 (53%), Gaps = 20/263 (7%)
Query: 1 GDFVTETVTL----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---AS 53
GD ET++L G+ SV N A GCG N G F GAAGL+GLG G LS SQ++ A+
Sbjct: 128 GDLAFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFAN 187
Query: 54 TFSYCLVDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
FSYCLV +S S S L F S + N ++ N T+YY+ L I VGG L ++
Sbjct: 188 KFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLA 247
Query: 113 ETAFKIDES-GNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVAL-FDT 167
+ F ID+S G GG I+DSGT +T L Y+A+ ++FV R DG A D
Sbjct: 248 PSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPR----LDGSAYGLDL 303
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNV 226
C++ + S+ VP + F F +G + +N + VD++ T C A S SIIGN+
Sbjct: 304 CFNIAGVSNPSVPDMVFKF-QGADFQMRGENLFVLVDTSATTLCLAMG-GSQGFSIIGNI 361
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V ++L IGF C
Sbjct: 362 QQQNHLVVYDLEAKKIGFATADC 384
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 101/260 (38%), Positives = 132/260 (50%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S ++ GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 274 GFFAMDTLTLSSYDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFA 333
Query: 57 YCLVDRDSDSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+C R S T L+F P +T P+L ++ L TFYY+GLTGI VGG LL I
Sbjct: 334 HCFPAR-SSGTGYLDFGPGSSPAVSTKLTTPMLVDNGL-TFYYVGLTGIRVGGKLLSIPP 391
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDF 171
+ F G IVDSGT +TRL Y++LR AF R ++L DTCYDF
Sbjct: 392 SVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDF 446
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQ 229
+ S V +PTVS F G L + A +I S C FA + I+GN Q +
Sbjct: 447 TGMSQVAIPTVSLLFQGGASLDVDASG-IIYAASVSQACLGFAANEEDDDVGIVGNTQLK 505
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V +++ ++GF+P C
Sbjct: 506 TFGVVYDIGKKVVGFSPGAC 525
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 141/265 (53%), Gaps = 24/265 (9%)
Query: 5 TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
+ET T GS++ D +A GC + + + G+AGL+GLG GSLS SQ+ A FSYCL
Sbjct: 209 SETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL 268
Query: 60 VD-RDSDSTSTLEFDSSLPPNAV---TAPLLRN---HELDTFYYLGLTGISVGGDLLPIS 112
+D++STSTL S N + P + + + T+YYL LTGIS+G LPIS
Sbjct: 269 TPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPIS 328
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT----DGVALFDTC 168
AF + G GG+I+DSGT +T L Y +R A PT D L D C
Sbjct: 329 PGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGL-DLC 387
Query: 169 YDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIG 224
+ + +S +P+++ HF +G + LPA +Y+I +G +C A T ++S G
Sbjct: 388 FALPAPTSAPPAVLPSMTLHF-DGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFG 444
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ + +++R + F P KC
Sbjct: 445 NYQQQNMHILYDVREETLSFAPAKC 469
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 86/115 (74%), Positives = 103/115 (89%), Gaps = 1/115 (0%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
GDF TET+TL GSAS++N+AIGCGH+NEGLFVGAAGLLGLGGGSLSFPSQINAS+FSYCL
Sbjct: 140 GDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCL 199
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
V+RD+DS STLEF+S +P ++VTAPLLRN++LDTFYYLG+TGI +L I+ T
Sbjct: 200 VNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGESYKILQITCT 254
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 143 bits (361), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 93/269 (34%), Positives = 140/269 (52%), Gaps = 20/269 (7%)
Query: 1 GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ + V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 290 GDFALETFTVNLTTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQ 349
Query: 52 A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
+ +FSYCLVDR+S+++ + + ++ P L ++ +DTFYY+ +
Sbjct: 350 SLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIK 409
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
+ V ++L I E + + G GG I+DSGT +T Y +++AFVR + +
Sbjct: 410 SVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVE 469
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
G+ CY+ S +E+P F + V P +NY I +D S+L
Sbjct: 470 GLPPLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSAL 529
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN QQQ + ++++ S +G+ P KC
Sbjct: 530 SIIGNYQQQNFHILYDMKKSRLGYAPMKC 558
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 101/259 (38%), Positives = 132/259 (50%), Gaps = 18/259 (6%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD 61
+ET+++GS V+N GC + GL L+G G LSF SQ + STFSYCL
Sbjct: 216 SETLSVGSQQVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPS 275
Query: 62 RDSDS--TSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
S + S L +L + PLL N +FYY+GL GISVG +L+ I +
Sbjct: 276 LFSSAFTGSLLLGKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSL 335
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVALFDTCYDFSSRS 175
DES G I+DSGT +TRL YNA+RD+F L SPTD LFDTCY+ S
Sbjct: 336 DESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTD---LFDTCYNRPS-G 391
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPT----SSSLSIIGNVQQQG 230
VE P ++ HF + L LP N L P + +G+ C AF LS GN QQQ
Sbjct: 392 DVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQK 451
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+ ++ S +G C
Sbjct: 452 LRIVHDVAESRLGIASENC 470
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 12/252 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS SV N GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 215 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 274
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S S + P P+ ++ D+ Y++ +TGI+V G L +S +A+
Sbjct: 275 CLPTSSSSSGYLSIGSYN-PGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYS 333
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ I+DSGT +TRL T+ Y+AL A + ++ DTC+ S +
Sbjct: 334 SLPT-----IIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQASRL 387
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP VS F G L L A N L+ VDS T C AFAP S+ +IIGN QQQ V +++
Sbjct: 388 RVPQVSMAFAGGAALKLKATNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 445
Query: 238 RNSLIGFTPNKC 249
+NS IGF C
Sbjct: 446 KNSKIGFAAGGC 457
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/261 (39%), Positives = 133/261 (50%), Gaps = 21/261 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S ++ GCG NEGL+ AAGLLGLG G S P Q F+
Sbjct: 249 GFFAMDTLTLSSYDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFA 308
Query: 57 YCLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELD---TFYYLGLTGISVGGDLLPIS 112
+C R S T L+F SLP AV+A L +D TFYY+GLTGI VGG LL I
Sbjct: 309 HCFPAR-SSGTGYLDFGPGSLP--AVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIP 365
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT--RALSPTDGVALFDTCYD 170
++ F G IVDSGT +TRL Y++LR AF R ++L DTCYD
Sbjct: 366 QSVFTTS-----GTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYD 420
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQ 228
F+ S V +PTVS F G L + A +I S C FA + I+GN Q
Sbjct: 421 FTGMSEVAIPTVSLLFQGGASLDVHASG-IIYAASVSQACLGFAGNKEDDDVGIVGNTQL 479
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ V +++ ++GF P C
Sbjct: 480 KTFGVVYDIGKKVVGFCPGAC 500
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/270 (40%), Positives = 143/270 (52%), Gaps = 21/270 (7%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
GD E+ T+ S VD++ GCGH N GLF GAAGLLGLG G LSF SQ+ A
Sbjct: 245 GDLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVY 304
Query: 53 -STFSYCLVDRDSDSTSTLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGIS 103
TFSYCLVD SD S + F P TA + DTFYY+ L G+
Sbjct: 305 GHTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVL 364
Query: 104 VGGDLLPISETAF--KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTD 160
VGG+LL IS + E G+GG I+DSGT ++ Y +R AF+ R R+
Sbjct: 365 VGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIP 424
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS- 219
+ CY+ S EVP +S F +G V PA+NY I +D +G C A T +
Sbjct: 425 DFPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 484
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+SIIGN QQQ V ++L+N+ +GF P +C
Sbjct: 485 MSIIGNFQQQNFHVVYDLKNNRLGFAPRRC 514
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/228 (39%), Positives = 117/228 (51%), Gaps = 12/228 (5%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCLVDRDSDSTSTLE 71
VDN GCG NN+GLF G+AGL+GLG +SF Q A FSYCL S ST L
Sbjct: 255 VDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL-PATSSSTGRLS 313
Query: 72 FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSG 131
F ++ P +FY L +TGISVGG LP+S + F GG I+DSG
Sbjct: 314 FGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFS-----TGGAIIDSG 368
Query: 132 TAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKV 191
T +TRL Y ALR AF +G +++ DTCYD S +P + F F G
Sbjct: 369 TVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFAGGVT 428
Query: 192 LPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRVSFNL 237
+ LP + L V S C AFA S ++I GNVQQ+ V +++
Sbjct: 429 VQLPPQGILY-VASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 106/267 (39%), Positives = 140/267 (52%), Gaps = 21/267 (7%)
Query: 1 GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G ET T G+ S+ I+ GCG+ N GL +G++G G GSLS SQ+ + FS
Sbjct: 177 GVLANETFTFGTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLGSPRFS 236
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDL 108
YCL S S L F N+ A P + N L T Y+L +TGISVGG L
Sbjct: 237 YCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYL 296
Query: 109 LPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
LPI F I D G GG I+DSGT +T L Y+A+R AF + T L ++ D
Sbjct: 297 LPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDASVLD 356
Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAPTSSSLSI 222
TC+ + R SV +P + HF +G LP +NY++ VD + G C A A +SS SI
Sbjct: 357 TCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQNYML-VDPSTGGGLCLAMA-SSSDGSI 413
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IG+ Q Q V ++L NSL+ F P C
Sbjct: 414 IGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 126/264 (47%), Gaps = 21/264 (7%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
F + +G ASV ++ GCG N G+FV G+ G G+LS P+Q+ FSYC
Sbjct: 186 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 245
Query: 62 RDSDSTSTLEFDSSLPPN------------AVTAPLLRNHELD-TFYYLGLTGISVGGDL 108
S + +PPN + L+R H YY+ L G++VG
Sbjct: 246 ITGSEPSPVFL--GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTR 303
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
LPI E+ F + E G GG IVDSGT +T L YN + DAFV T+ +L C
Sbjct: 304 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLC 363
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSLSIIGN 225
+ + +VP + HF EG L LP +NY+ ++ G C A LS+IGN
Sbjct: 364 FSVPPGAKPDVPALVLHF-EGATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGN 421
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V ++L N ++ F P +C
Sbjct: 422 FQQQNMHVLYDLANDMLSFVPARC 445
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 97/259 (37%), Positives = 133/259 (51%), Gaps = 18/259 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G + +T+TLG +V + GCG N GLF AAGL+GLG G S P Q + F+Y
Sbjct: 253 GFYAQDTLTLGYDTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAY 312
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVT--APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
C + S T L+F P A P+L ++ TFYY+G+TGI VGG LL I T
Sbjct: 313 C-IPATSSGTGFLDFGPGAPAAANARLTPMLVDNG-PTFYYVGMTGIKVGGHLLSIPATV 370
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFSS 173
F + G +VDSGT +TRL Y LR AF +G L ++ DTCYD +
Sbjct: 371 FS-----DAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTG 425
Query: 174 -RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQG 230
+ S+ +P VS F G L + A L D + C AFA + ++I+GN QQ+
Sbjct: 426 YQGSIALPAVSLVFQGGACLDVDASGILYVADVSQA-CLAFAANDDDTDMTIVGNTQQKT 484
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V ++L ++GF P C
Sbjct: 485 YSVLYDLGKKVVGFAPGAC 503
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 126/264 (47%), Gaps = 21/264 (7%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
F + +G ASV ++ GCG N G+FV G+ G G+LS P+Q+ FSYC
Sbjct: 212 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 271
Query: 62 RDSDSTSTLEFDSSLPPN------------AVTAPLLRNHELD-TFYYLGLTGISVGGDL 108
S + +PPN + L+R H YY+ L G++VG
Sbjct: 272 ITGSEPSPVFL--GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTR 329
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
LPI E+ F + E G GG IVDSGT +T L YN + DAFV T+ +L C
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLC 389
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSLSIIGN 225
+ + +VP + HF EG L LP +NY+ ++ G C A LS+IGN
Sbjct: 390 FSVPPGAKPDVPALVLHF-EGATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGN 447
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V ++L N ++ F P +C
Sbjct: 448 FQQQNMHVLYDLANDMLSFVPARC 471
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 126/264 (47%), Gaps = 21/264 (7%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
F + +G ASV ++ GCG N G+FV G+ G G+LS P+Q+ FSYC
Sbjct: 212 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 271
Query: 62 RDSDSTSTLEFDSSLPPN------------AVTAPLLRNHELD-TFYYLGLTGISVGGDL 108
S + +PPN + L+R H YY+ L G++VG
Sbjct: 272 ITGSEPSPVFL--GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTR 329
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
LPI E+ F + E G GG IVDSGT +T L YN + DAFV T+ +L C
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLC 389
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSLSIIGN 225
+ + +VP + HF EG L LP +NY+ ++ G C A LS+IGN
Sbjct: 390 FSVPPGAKPDVPALVLHF-EGATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGN 447
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V ++L N ++ F P +C
Sbjct: 448 FQQQNMHVLYDLANDMLSFVPARC 471
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 132/248 (53%), Gaps = 14/248 (5%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCL--VDRDSDSTST 69
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C V+ ST
Sbjct: 241 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVL 300
Query: 70 LEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
L+ + L N A PL++N T YYL L GI+VG LP+ E+AF + +G GG
Sbjct: 301 LDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFAL-TNGTGG 359
Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
I+DSGT++T L + Y +RD F + + P + + TC+ S++ +VP +
Sbjct: 360 TIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQAKPDVPKLVL 418
Query: 185 HFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
HF EG + LP +NY+ +P D+ N C A + IGN QQQ V ++L+N++
Sbjct: 419 HF-EGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNM 477
Query: 242 IGFTPNKC 249
+ F +C
Sbjct: 478 LSFVAAQC 485
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 76/136 (55%), Gaps = 8/136 (5%)
Query: 98 GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-AL 156
G GI+VG LP+ E+AF + +G GG I+DSGT++T L + Y +RD F + +
Sbjct: 38 GRPGITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPV 96
Query: 157 SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAF 213
P + + TC+ S++ +VP + HF EG + LP +NY+ +P D+ N C A
Sbjct: 97 VPGNATGPY-TCFSAPSQAKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAI 154
Query: 214 APTSSSLSIIGNVQQQ 229
+ +IIGN QQQ
Sbjct: 155 NKGDET-TIIGNFQQQ 169
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 136/258 (52%), Gaps = 12/258 (4%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TLGS +V +A GCG N G ++GL+G+G G LS SQ+ + FSYC
Sbjct: 184 GVLATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF 243
Query: 60 VDRDSDSTSTLEFDSS--LPPNAVTAPLLRN-----HELDTFYYLGLTGISVGGDLLPIS 112
++ + S L SS L A T P + + ++YYL L GI+VG LLPI
Sbjct: 244 TPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPID 303
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF 171
F++ G+GG+I+DSGT T L+ + AL A R L G L C+
Sbjct: 304 PAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRVR-LPLASGAHLGLSLCFAA 362
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
+S +VEVP + HF +G + L ++Y++ S G C ++ +S++G++QQQ T
Sbjct: 363 ASPEAVEVPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNT 420
Query: 232 RVSFNLRNSLIGFTPNKC 249
+ ++L ++ F P KC
Sbjct: 421 HILYDLERGILSFEPAKC 438
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 99/260 (38%), Positives = 129/260 (49%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S +V GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 270 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 329
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+CL R S T L+F P T P+L ++ TFYY+G+TGI VGG LL I +
Sbjct: 330 HCLPAR-SSGTGYLDFGPGSPAAVGARQTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 387
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDF 171
+ F G IVDSGT +TRL Y++LR AF R ++L DTCYDF
Sbjct: 388 SVFS-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDF 442
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
+ S V +P VS F G L + A + S C FA + I+GN Q +
Sbjct: 443 TGMSEVAIPKVSLLFQGGAYLDVNASGIMYAA-SLSQVCLGFAANEDDDDVGIVGNTQLK 501
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V +++ +GF+P C
Sbjct: 502 TFGVVYDIGKKTVGFSPGAC 521
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 137/262 (52%), Gaps = 22/262 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
G+ TE + LG S +V+N GCG NN+GLF GA+GL+GLG SLS SQ +A FS
Sbjct: 227 GELGTEHLDLGNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFS 286
Query: 57 YCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLR---NHELDTFYYLGLTGISVGGDLLPI 111
YCL +++++ +L +SS+ N R N +L FY+L LTGI+VG
Sbjct: 287 YCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQL-PFYFLNLTGITVG------ 339
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
A + G G+++DSGT +TRL Y AL+D FV+ + DTC++
Sbjct: 340 -SVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNL 398
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
S VE+P + HF L + Y + D++ C A A S + + IIGN Q
Sbjct: 399 SGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDAS-QVCLAIASLSYENEVGIIGNYQ 457
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ RV ++ + S++GF C
Sbjct: 458 QKNQRVIYDTKGSMLGFAAEAC 479
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 96/244 (39%), Positives = 134/244 (54%), Gaps = 10/244 (4%)
Query: 14 SVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
S+ I GCG NN + AGLLGLG G LS SQ+ FSYCL + TS+L F
Sbjct: 138 SIPRIGFGCGVNNRATGMDQTAGLLGLGRGVLSLVSQLGTQKFSYCLTSIHENKTSSLLF 197
Query: 73 DSSL-----PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
S P PL++N L ++YYL L GI+VG LLPI E AF++ + G+GG+I
Sbjct: 198 GSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLPIPEFAFQLGKDGSGGMI 257
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS--VEVPTVSFH 185
+DSGT +T LQ + ++ L++AF+ T D C+ +++ V+VP + FH
Sbjct: 258 LDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLCFHLPVKNAAEVKVPKLIFH 317
Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
F +G L LP +NY++ G C A T SLSI GN+QQQ V +L+ S +
Sbjct: 318 F-KGLDLALPVENYMVSDPEMGLICLAIDAT-GSLSIFGNIQQQNMLVLHDLKKSTLSLV 375
Query: 246 PNKC 249
P +C
Sbjct: 376 PTQC 379
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 105/264 (39%), Positives = 137/264 (51%), Gaps = 20/264 (7%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---- 51
GD E T+ G+ VD +A GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 245 GDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYG 304
Query: 52 ASTFSYCLVDRDSDSTSTLEF--DSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCLV+ S + S + F D +L P + DTFYYL L I VGG+
Sbjct: 305 GHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGE 364
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
+ IS D GG I+DSGT ++ Y A+R AF+ R + + G +
Sbjct: 365 AVNISS-----DTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLS 419
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGN 225
CY+ S VEVP +S F +G PA+NY I ++ G C A T S +SIIGN
Sbjct: 420 PCYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGN 479
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V ++L ++ +GF P +C
Sbjct: 480 YQQQNFHVLYDLEHNRLGFAPRRC 503
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 95/261 (36%), Positives = 137/261 (52%), Gaps = 15/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G F E+ T+ +D +A GCG +N+G F A G+LGLG G LSF SQ+ + F+Y
Sbjct: 157 GVFAYESATVDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAY 216
Query: 58 CLV---DRDSDSTSTL---EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
CLV D S S+S + E S++ T P++ N + T YY+ + ++VGG LPI
Sbjct: 217 CLVNYLDPTSVSSSLIFGDELISTIHDMQYT-PIVSNPKSPTLYYVQIEKVTVGGKSLPI 275
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
S++A++ID GNGG I DSGT +T Y+ + AF G + V D C +
Sbjct: 276 SDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVH-YPRAESVQGLDLCVEL 334
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL---SIIGNVQQ 228
+ P+ + F +G V A+NY + V N C A A +S L + IGN+ Q
Sbjct: 335 TGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAPN-VRCLAMAGLASPLGGFNTIGNLLQ 393
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V ++ +LIGF P KC
Sbjct: 394 QNFFVQYDREENLIGFAPAKC 414
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 136/258 (52%), Gaps = 12/258 (4%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TLGS +V +A GCG N G ++GL+G+G G LS SQ+ + FSYC
Sbjct: 184 GVLATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF 243
Query: 60 VDRDSDSTSTLEFDSS--LPPNAVTAPLLRN-----HELDTFYYLGLTGISVGGDLLPIS 112
++ + S L SS L A T P + + ++YYL L GI+VG LLPI
Sbjct: 244 TPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPID 303
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF 171
F++ G+GG+I+DSGT T L+ + AL A R L G L C+
Sbjct: 304 PAVFRLTPMGDGGVIIDSGTTFTALEESAFVALARALASRVR-LPLASGAHLGLSLCFAA 362
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
+S +VEVP + HF +G + L ++Y++ S G C ++ +S++G++QQQ T
Sbjct: 363 ASPEAVEVPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNT 420
Query: 232 RVSFNLRNSLIGFTPNKC 249
+ ++L ++ F P KC
Sbjct: 421 HILYDLERGILSFEPAKC 438
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 100/260 (38%), Positives = 133/260 (51%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S +V GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 327
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+CL R S T L+F + A +T P+L + TFYY+G+TGI VGG LL I +
Sbjct: 328 HCLPAR-STGTGYLDFGAGSLAAARARLTTPMLTENG-PTFYYVGMTGIRVGGQLLSIPQ 385
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALR--DAFVRGTRALSPTDGVALFDTCYDF 171
+ F G IVDSGT +TRL Y++LR A R V+L DTCYDF
Sbjct: 386 SVF-----ATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 440
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
+ S V +PTVS F G L + A + ++ C AFA + I+GN Q +
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLK 499
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V++++ ++GF P C
Sbjct: 500 TFGVAYDIGKKVVGFYPGAC 519
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 107/273 (39%), Positives = 152/273 (55%), Gaps = 31/273 (11%)
Query: 1 GDFVTETVTLG------SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS- 53
GD E++++ S + ++ IGCGH+N+GLF GA GLLGLG G+LSFPSQ+ +S
Sbjct: 265 GDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSP 324
Query: 54 ---TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-----------PLLR-NHELDTFYYLG 98
+FSYCLVDR T+ L S++ A A P +R N+ ++TFYYLG
Sbjct: 325 IGQSFSYCLVDR----TNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLG 380
Query: 99 LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
+ GI + +LLPI F I +G+GG I+DSGT +T L + Y A+ AF+ R P
Sbjct: 381 IQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFL--ARISYP 438
Query: 159 -TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-PVDSNGTFCFAFAPT 216
D + CY+ + R++V PT+S F G L LP +NY I P C A PT
Sbjct: 439 RADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPT 498
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+SIIGN QQQ ++++++ +GF C
Sbjct: 499 -DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 530
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 139 bits (351), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 89/261 (34%), Positives = 129/261 (49%), Gaps = 20/261 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
GD E + LG+ V N GCG NN+GLF GA+GL+GLG LS SQ +A FSY
Sbjct: 159 GDLGMEQLNLGTTHVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSY 218
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAP-----LLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL +D++ +L + T P ++ N +L TFY+L LTGIS+GG
Sbjct: 219 CLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG------ 272
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
A + GI++DSGT +TRL Y L+ F++ ++ DTC++ +
Sbjct: 273 -VALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLN 331
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
V++PT+ F L + Y + D++ C A A S + IIGN QQ
Sbjct: 332 GYDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDAS-QVCLALASLSFDDEIPIIGNYQQ 390
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ RV +N + S +GF C
Sbjct: 391 RNQRVIYNTKESKLGFAAEAC 411
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/260 (38%), Positives = 134/260 (51%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S +V GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 327
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+CL R S T L+F + A +T P+L ++ TFYY+G+TGI VGG LL I +
Sbjct: 328 HCLPAR-STGTGYLDFGAGSLAAASARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 385
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALR--DAFVRGTRALSPTDGVALFDTCYDF 171
+ F G IVDSGT +TRL Y++LR A R V+L DTCYDF
Sbjct: 386 SVF-----ATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 440
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
+ S V +PTVS F G L + A + ++ C AFA + I+GN Q +
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLK 499
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V++++ ++GF P C
Sbjct: 500 TFGVAYDIGKKVVGFYPGAC 519
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 101/260 (38%), Positives = 135/260 (51%), Gaps = 19/260 (7%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
G F +T+TL S +V GCG NEGLF AAGLLGLG G S P Q F+
Sbjct: 266 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 325
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+CL R S T L+F + P A +T P+L ++ TFYY+G+TGI VGG LL I +
Sbjct: 326 HCLPAR-STGTGYLDFGAGSPAAASARLTTPMLTDNG-PTFYYIGMTGIRVGGQLLSIPQ 383
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALR--DAFVRGTRALSPTDGVALFDTCYDF 171
+ F G IVDSGT +TRL Y++LR A R V+L DTCYDF
Sbjct: 384 SVFA-----TAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 438
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
+ S V +PTVS F G L + A + ++ C AFA + I+GN Q +
Sbjct: 439 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLK 497
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V++++ ++GF P C
Sbjct: 498 TFGVAYDIGKKVVGFYPGVC 517
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 92/256 (35%), Positives = 126/256 (49%), Gaps = 17/256 (6%)
Query: 6 ETVT-LGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
ETV+ + ASV + GCG NN G+F G+ G G G LS PSQ+ FS+C
Sbjct: 187 ETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVS 246
Query: 64 SDSTSTLEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
ST+ FD LP + T PL++N TFYYL L GI+VG LP+ E+A
Sbjct: 247 GRKPSTVLFD--LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 304
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR 174
F + ++G GG I+DSGTA T L Y + D F + + P++ C+
Sbjct: 305 FAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPL 362
Query: 175 SSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
VP + HF EG + LP +NY+ G A ++IIGN QQQ V
Sbjct: 363 GKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHV 421
Query: 234 SFNLRNSLIGFTPNKC 249
++L+NS + F KC
Sbjct: 422 LYDLKNSKLSFVRAKC 437
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 92/256 (35%), Positives = 126/256 (49%), Gaps = 17/256 (6%)
Query: 6 ETVT-LGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
ETV+ + ASV + GCG NN G+F G+ G G G LS PSQ+ FS+C
Sbjct: 131 ETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVS 190
Query: 64 SDSTSTLEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
ST+ FD LP + T PL++N TFYYL L GI+VG LP+ E+A
Sbjct: 191 GRKPSTVLFD--LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 248
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR 174
F + ++G GG I+DSGTA T L Y + D F + + P++ C+
Sbjct: 249 FAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPL 306
Query: 175 SSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
VP + HF EG + LP +NY+ G A ++IIGN QQQ V
Sbjct: 307 GKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHV 365
Query: 234 SFNLRNSLIGFTPNKC 249
++L+NS + F KC
Sbjct: 366 LYDLKNSKLSFVRAKC 381
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 92/256 (35%), Positives = 126/256 (49%), Gaps = 17/256 (6%)
Query: 6 ETVT-LGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
ETV+ + ASV + GCG NN G+F G+ G G G LS PSQ+ FS+C
Sbjct: 187 ETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVS 246
Query: 64 SDSTSTLEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
ST+ FD LP + T PL++N TFYYL L GI+VG LP+ E+A
Sbjct: 247 GRKPSTVLFD--LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 304
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR 174
F + ++G GG I+DSGTA T L Y + D F + + P++ C+
Sbjct: 305 FAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPL 362
Query: 175 SSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
VP + HF EG + LP +NY+ G A ++IIGN QQQ V
Sbjct: 363 GKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHV 421
Query: 234 SFNLRNSLIGFTPNKC 249
++L+NS + F KC
Sbjct: 422 LYDLKNSKLSFVRAKC 437
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/250 (38%), Positives = 133/250 (53%), Gaps = 22/250 (8%)
Query: 15 VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
VD +A GC H G V GL+G G G LSFPSQ + S FSYCL + S+ +
Sbjct: 207 VDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFS 266
Query: 68 STLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
TL + P + T PLL N + YY+ + GI VGG +P+ +A D + G
Sbjct: 267 GTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGT 326
Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
IVD+GT TRL Y A+RD F RA P G + FDTCY+ ++ VPTV+F
Sbjct: 327 IVDAGTMFTRLSAPVYAAVRDVFRSRVRA--PVAGPLGGFDTCYNV----TISVPTVTFS 380
Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRN 239
F +G+V + LP +N +I S G C A A ++L+++ ++QQQ RV F++ N
Sbjct: 381 F-DGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVAN 439
Query: 240 SLIGFTPNKC 249
+GF+ C
Sbjct: 440 GRVGFSRELC 449
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 87/261 (33%), Positives = 137/261 (52%), Gaps = 21/261 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GD +E++ LG ++N+ GCG NN+GLF GA+GL+GLG S+S SQ + FSY
Sbjct: 234 GDLASESIVLGDTKLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSY 293
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL + ++ TL F D S+ N+ + PL++N +L +FY L LTG S+GG + +
Sbjct: 294 CLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGG--VELK 351
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+F GI++DSGT +TRL Y A++ F++ G ++ DTC++ +
Sbjct: 352 TLSF------GRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLT 405
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
S + +PT+ F L + Y + D++ C A A S + + IIGN QQ
Sbjct: 406 SYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 464
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ RV ++ +G C
Sbjct: 465 KNQRVIYDTTQERLGIAGENC 485
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/255 (35%), Positives = 127/255 (49%), Gaps = 22/255 (8%)
Query: 11 GSASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
G A+V ++A GCG N G+F G+ G G G+LS PSQ+ FS+C S+
Sbjct: 523 GQATVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAITGSEPSS 582
Query: 70 LEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
+ LP N + PL++N YYL L GI+VG LPI E+ F + +
Sbjct: 583 VLL--GLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQD 640
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---ALFDTCYDFS--SRSS 176
G GG I+DSGT +T L + Y + DAF R P D +L C+ FS R+
Sbjct: 641 GTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRL--PVDNATSSSLSRLCFSFSVPRRAK 698
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG--TFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+VP + HF EG L LP +NY+ + G C A L+IIGN QQQ V
Sbjct: 699 PDVPKLVLHF-EGATLDLPRENYMFEFEDAGGSVTCLAIN-AGDDLTIIGNYQQQNLHVL 756
Query: 235 FNLRNSLIGFTPNKC 249
++L +++ F P +C
Sbjct: 757 YDLVRNMLSFVPAQC 771
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/261 (36%), Positives = 136/261 (52%), Gaps = 21/261 (8%)
Query: 6 ETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
ET T GS D IA GC + + + G+AGL+GLG GS+S SQ+ A FSYCL
Sbjct: 183 ETFTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCLT 242
Query: 61 D-RDSDSTSTLEFDSSLPPN---AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISE 113
+D++STSTL S N +T P + + T+YYL LTGIS+G L I
Sbjct: 243 PFQDANSTSTLLLGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPP 302
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDTCYDF 171
AF + G GG+I+DSGT +T L Y +R A + L DG D C+
Sbjct: 303 NAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAA-IESLVTLPVADGSDSTGLDLCFAL 361
Query: 172 SSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGNVQQ 228
+S +S +P+++FHF +G + LP NY+I +G +C A T ++S GN QQ
Sbjct: 362 TSETSTPPSMPSMTFHF-DGADMVLPVDNYMI--LGSGVWCLAMRNQTVGAMSTFGNYQQ 418
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q + +++ + F P KC
Sbjct: 419 QNVHLLYDIHEETLSFAPAKC 439
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/248 (36%), Positives = 125/248 (50%), Gaps = 16/248 (6%)
Query: 13 ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C ST+
Sbjct: 170 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVL 229
Query: 72 FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
D LP + T PL++N TFYYL L GI+VG LP+ E+ F + ++G
Sbjct: 230 LD--LPADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGT 286
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
GG I+DSGTA+T L T Y +RDAF + + C R+ VP +
Sbjct: 287 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 346
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGT--FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
HF EG + LP +NY+ V+ G+ C A ++ IGN QQQ V ++L+NS
Sbjct: 347 LHF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSK 404
Query: 242 IGFTPNKC 249
+ F P +C
Sbjct: 405 LSFVPAQC 412
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/271 (36%), Positives = 139/271 (51%), Gaps = 34/271 (12%)
Query: 6 ETVTLGSAS------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
ET T GS+S V NIA GC + + + G+AGL+GLG GS+S SQ+ A FSYCL
Sbjct: 189 ETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQLGAGAFSYCL 248
Query: 60 VD-RDSDSTSTLEFDSSLPPNAVTA----------PLL---RNHELDTFYYLGLTGISVG 105
+D++STSTL L P+A A P + + T+YYL LTGISVG
Sbjct: 249 TPFQDANSTSTLL----LGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVG 304
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA-----FVRGTRALSPTD 160
L I AF + G GG+I+DSGT +T L Y +R A R A P
Sbjct: 305 ETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDH 364
Query: 161 GVALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSS 218
L D C+ +S +P+++ HF G + LP +NY+I +G +C A T
Sbjct: 365 STGL-DLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI--LGSGVWCLAMRNQTVG 421
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++S++GN QQQ V +++R + F P C
Sbjct: 422 AMSMVGNYQQQNIHVLYDVRKETLSFAPAVC 452
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/260 (37%), Positives = 137/260 (52%), Gaps = 20/260 (7%)
Query: 9 TLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD--- 61
T G A+V +A GCG N+G F G G++GLG G LSFP+Q + A TFSYCL+D
Sbjct: 169 TSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEG 228
Query: 62 -RDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
R S+S L A PL+ N TFYY+G+ I VG +LP+ + + ID
Sbjct: 229 GRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAID 288
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSS 176
GNGG ++DSG+ +T L+ Y L AF V R S + CY+ SS SS
Sbjct: 289 VLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSS 348
Query: 177 VE-----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQ 229
+ P ++ F +G L LP NYL+ V ++ C A PT S + +++GN+ QQ
Sbjct: 349 LAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFAFNVLGNLMQQ 407
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
G V F+ ++ IGF +C
Sbjct: 408 GYHVEFDRASARIGFARTEC 427
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/273 (38%), Positives = 151/273 (55%), Gaps = 31/273 (11%)
Query: 1 GDFVTETVTLG------SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS- 53
GD E++++ S + ++ IGCGH+N+GLF GA GLLGLG G+LSFPSQ+ +S
Sbjct: 181 GDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSP 240
Query: 54 ---TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-----------PLLR-NHELDTFYYLG 98
+FSYCLVDR T+ L S++ A A P +R N+ ++TFYYLG
Sbjct: 241 IGQSFSYCLVDR----TNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLG 296
Query: 99 LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
+ GI + +LLPI F I +G+GG I+DSGT +T L + Y A+ AF+ R P
Sbjct: 297 IQGIKIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFL--ARISYP 354
Query: 159 -TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNGTFCFAFAPT 216
D + CY+ + R++V P +S F G L LP +NY I D C A PT
Sbjct: 355 RADPFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPT 414
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+SIIGN QQQ ++++++ +GF C
Sbjct: 415 -DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 446
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 126/248 (50%), Gaps = 16/248 (6%)
Query: 13 ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C + ST+
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVL 248
Query: 72 FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
D LP + + PL++N TFYYL L GI+VG LP+ E+ F + ++G
Sbjct: 249 LD--LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTL-KNGT 305
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
GG I+DSGTA+T L T Y +RDAF + + C R+ VP +
Sbjct: 306 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGT--FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
HF EG + LP +NY+ V+ G+ C A ++ IGN QQQ V ++L+NS
Sbjct: 366 LHF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSK 423
Query: 242 IGFTPNKC 249
+ F P +C
Sbjct: 424 LSFVPAQC 431
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 126/248 (50%), Gaps = 16/248 (6%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C + ST+
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVL 248
Query: 72 FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
D LP + + PL++N TFYYL L GI+VG LP+ E+ F + ++G
Sbjct: 249 LD--LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGT 305
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
GG I+DSGTA+T L T Y +RDAF + + C R+ VP +
Sbjct: 306 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGT--FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
HF EG + LP +NY+ V+ G+ C A ++ IGN QQQ V ++L+NS
Sbjct: 366 LHF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSK 423
Query: 242 IGFTPNKC 249
+ F P +C
Sbjct: 424 LSFVPAQC 431
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 105/267 (39%), Positives = 139/267 (52%), Gaps = 21/267 (7%)
Query: 1 GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G ET T G+ S+ I+ GCG+ N G +G++G G GSLS SQ+ + FS
Sbjct: 177 GVLANETFTFGTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFS 236
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDL 108
YCL S S L F N+ A P + N L T Y+L +TGISVGG L
Sbjct: 237 YCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYL 296
Query: 109 LPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
LPI F I D G GG I+DSGT +T L Y+A+R AF + T L ++ D
Sbjct: 297 LPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDASVLD 356
Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAPTSSSLSI 222
TC+ + R SV +P + HF +G LP +NY++ VD + G C A A +SS SI
Sbjct: 357 TCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQNYML-VDPSTGGGLCLAMA-SSSDGSI 413
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IG+ Q Q V ++L NSL+ F P C
Sbjct: 414 IGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 135/252 (53%), Gaps = 13/252 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS SV N GCG +NEGLF +AGL+GL LS Q+ + +FSY
Sbjct: 231 GYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSY 290
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S ++ + P P++ + D+ Y++ L+G++V G L +S +
Sbjct: 291 CLPSSSSSGYLSIGSYN--PGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSS--- 345
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
E + I+DSGT +TRL T Y+AL A + D ++ DTC+ SS+
Sbjct: 346 --EYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCF-VGQASSL 402
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP VS F G L L A+N L+ VDS+ T C AFAP S+ +IIGN QQQ V +++
Sbjct: 403 RVPAVSMAFSGGAALKLSAQNLLVDVDSSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDV 460
Query: 238 RNSLIGFTPNKC 249
+++ IGF C
Sbjct: 461 KSNRIGFAAGGC 472
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 138/278 (49%), Gaps = 36/278 (12%)
Query: 1 GDFVTETVTLGS---------ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ V ++ GCGH N+G F GA GLLGLG G LSFPSQ+
Sbjct: 263 GDFALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQ 322
Query: 52 A---STFSYCLVDRDSDST--STLEFDSSLPPNAVTAPLLRNHEL-------------DT 93
+ +FSYCL D S+++ S L F LL +H L DT
Sbjct: 323 SIYGHSFSYCLTDLFSNTSVSSKLIFGED-------KELLNHHNLNFTKLLAGEETPDDT 375
Query: 94 FYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT 153
FYYL + I VGG++L I E + G GG I+DSG+ +T Y+ +++AF +
Sbjct: 376 FYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKI 435
Query: 154 RALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF 213
+ + CY+ S VE+P HF +G V PA+NY + + C A
Sbjct: 436 KLQQIAADDFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAI 495
Query: 214 --APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
P S L+IIGN+ QQ + ++++ S +G++P +C
Sbjct: 496 LKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 533
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 92/243 (37%), Positives = 125/243 (51%), Gaps = 15/243 (6%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTSTLE 71
VD+ GCG +NEGLF G+AGL+GLG +S Q +++ FSYCL S S L
Sbjct: 156 VDDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL-PATSSSLGHLT 214
Query: 72 FDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGIIV 128
F +S NA + PL ++FY L + ISVGG LP +S + F GG I+
Sbjct: 215 FGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSA-----GGSII 269
Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
DSGT +TRL Y ALR AF R + L DTCYD S + VP + F F
Sbjct: 270 DSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSG 329
Query: 189 GKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
G + L + ++ V+S C AFA S +++ GNVQQ+ V ++++ IGF
Sbjct: 330 GVTVELXHRG-ILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGA 388
Query: 247 NKC 249
C
Sbjct: 389 AGC 391
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 93/257 (36%), Positives = 139/257 (54%), Gaps = 21/257 (8%)
Query: 14 SVDNIAIGCGH-NNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS--DST 67
+ NI +GC + EGL GA+GLLG+ +SFPSQ++ A FS+C D+ + +S+
Sbjct: 251 KLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSS 310
Query: 68 STLEFDSS--LPPNAVTAPLLRNHELDT----FYYLGLTGISVGGDLLPISETAFKIDE- 120
+ F S + P PL++N + + +YY+GL GISV LP+S F ID+
Sbjct: 311 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 370
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS----RSS 176
+G+GG I+DSGTA T L+ + A+R F+ T L+ D + F CY+ +S S
Sbjct: 371 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 430
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSS-SLSIIGNVQQQGTR 232
+P+++ HF G + LP + LIPV S T C AF + +IIGN QQQ
Sbjct: 431 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGNYQQQNLW 490
Query: 233 VSFNLRNSLIGFTPNKC 249
V ++L +G P +C
Sbjct: 491 VEYDLEKLRLGIAPAQC 507
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 93/257 (36%), Positives = 139/257 (54%), Gaps = 21/257 (8%)
Query: 14 SVDNIAIGCGH-NNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS--DST 67
+ NI +GC + EGL GA+GLLG+ +SFPSQ++ A FS+C D+ + +S+
Sbjct: 252 KLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSS 311
Query: 68 STLEFDSS--LPPNAVTAPLLRNHELDT----FYYLGLTGISVGGDLLPISETAFKIDE- 120
+ F S + P PL++N + + +YY+GL GISV LP+S F ID+
Sbjct: 312 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 371
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS----RSS 176
+G+GG I+DSGTA T L+ + A+R F+ T L+ D + F CY+ +S S
Sbjct: 372 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 431
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSS-SLSIIGNVQQQGTR 232
+P+++ HF G + LP + LIPV S T C AF + +IIGN QQQ
Sbjct: 432 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQNLW 491
Query: 233 VSFNLRNSLIGFTPNKC 249
V ++L +G P +C
Sbjct: 492 VEYDLEKLRLGIAPAQC 508
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 125/249 (50%), Gaps = 24/249 (9%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F E +T+ + V DN GCG NN+GLF G+AGL+GLG +SF Q A FS
Sbjct: 241 GYFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFS 300
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDT------FYYLGLTGISVGGDLLP 110
YCL S ST L F A T L+ T FY L +T I+VGG LP
Sbjct: 301 YCL-PSTSSSTGHLSFGP-----AATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLP 354
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+S + F GG I+DSGT +TRL Y ALR AF +G +++ DTCYD
Sbjct: 355 VSSSTFS-----TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYD 409
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQ 228
S +PT+ F F G + LP + L V S C AFA S ++I GNVQQ
Sbjct: 410 LSGYKVFSIPTIEFSFAGGVTVKLPPQGILF-VASTKQVCLAFAANGDDSDVTIYGNVQQ 468
Query: 229 QGTRVSFNL 237
+ V +++
Sbjct: 469 RTIEVVYDV 477
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 86/260 (33%), Positives = 130/260 (50%), Gaps = 18/260 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
G+ E + G+ V + GCG NN+GLF G +GL+GLG LS SQ I FSY
Sbjct: 147 GELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTSGIFGGVFSY 206
Query: 58 CL--VDRDSDSTSTLEFDSSLPPNAV---TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL +R + L +SS+ N+ A ++ N +L FY++ LTGIS+GG
Sbjct: 207 CLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGG------ 260
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
A + G I+VDSGT +TRL Y AL+ F++ P ++ DTC++ S
Sbjct: 261 -VALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLS 319
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQ 229
+ V++PT+ HF L + V S+ + C A A ++I+GN QQ+
Sbjct: 320 AYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQK 379
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV ++ + + +GF C
Sbjct: 380 NLRVIYDTKETKVGFALETC 399
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G + ++T+ LGS++V + GC + G GL+GLGGG+ S SQ + FSY
Sbjct: 218 GTYSSDTLALGSSAVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 277
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL S S + S V P+LR+ ++ TFY + L I VGG L I +
Sbjct: 278 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 337
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F + G ++DSGT +TRL Y+AL AF G + P + DTC+DFS +S
Sbjct: 338 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 391
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
SV +P+V+ F G V+ L A ++ SN C AFA S SSL IIGNVQQ+ V
Sbjct: 392 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAANSDDSSLGIIGNVQQRTFEV 445
Query: 234 SFNLRNSLIGFTPNKC 249
+++ ++GF C
Sbjct: 446 LYDVGRGVVGFRAGAC 461
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/255 (40%), Positives = 139/255 (54%), Gaps = 15/255 (5%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS---TFSYCLVD 61
E+ TL S S+ +IA GCG NEG G L G LS SQ+ S FSYCLV
Sbjct: 207 ESFTLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVS 266
Query: 62 -RDSDS-TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
DS S TS L + NA T PL+++ TFYYL L GISVGG LL I++ F
Sbjct: 267 ITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTF 326
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
+ G GG+I+DSGT VT L+ Y+ ++ A + L DG + D C++ S S
Sbjct: 327 DLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSIN-LPQVDGSNIGLDLCFEPQSGS 385
Query: 176 SV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
S PT++FHF EG LP +NY I DS+G C A P S+ +SI GN+QQQ ++
Sbjct: 386 STSHFPTITFHF-EGADFNLPKENY-IYTDSSGIACLAMLP-SNGMSIFGNIQQQNYQIL 442
Query: 235 FNLRNSLIGFTPNKC 249
++ +++ F P C
Sbjct: 443 YDNERNVLSFAPTVC 457
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 130/256 (50%), Gaps = 13/256 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G F + + L S V +N GCG NN GLFVG AGL+GLG +LS SQ FS
Sbjct: 231 GFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFS 290
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
YCL S ST L F S + P L N + +FY+L L ISVGG L S +
Sbjct: 291 YCL-PSTSSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSAS 349
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F G I+DSGT ++RL Y+ LR +F + ++ DTCYDFS
Sbjct: 350 VFS-----TAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQY 404
Query: 175 SSVEVPTVSFHFPEGKVLPL-PAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
+V+VP ++ +F +G + L P+ + I S FA ++ ++I+GNVQQ+ V
Sbjct: 405 DTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDV 464
Query: 234 SFNLRNSLIGFTPNKC 249
+++ IGF P C
Sbjct: 465 VYDVAGGRIGFAPGGC 480
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G + ++T+ LGS++V + GC + G GL+GLGGG+ S SQ + FSY
Sbjct: 142 GTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 201
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL S S + S V P+LR+ ++ TFY + L I VGG L I +
Sbjct: 202 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 261
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F + G ++DSGT +TRL Y+AL AF G + P + DTC+DFS +S
Sbjct: 262 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 315
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
SV +P+V+ F G V+ L A ++ SN C AFA S SSL IIGNVQQ+ V
Sbjct: 316 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAGNSDDSSLGIIGNVQQRTFEV 369
Query: 234 SFNLRNSLIGFTPNKC 249
+++ ++GF C
Sbjct: 370 LYDVGRGVVGFRAGAC 385
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G + ++T+ LGS++V + GC + G GL+GLGGG+ S SQ + FSY
Sbjct: 288 GTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 347
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL S S + S V P+LR+ ++ TFY + L I VGG L I +
Sbjct: 348 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 407
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F + G ++DSGT +TRL Y+AL AF G + P + DTC+DFS +S
Sbjct: 408 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 461
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
SV +P+V+ F G V+ L A ++ SN C AFA S SSL IIGNVQQ+ V
Sbjct: 462 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAGNSDDSSLGIIGNVQQRTFEV 515
Query: 234 SFNLRNSLIGFTPNKC 249
+++ ++GF C
Sbjct: 516 LYDVGRGVVGFRAGAC 531
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/255 (34%), Positives = 136/255 (53%), Gaps = 20/255 (7%)
Query: 7 TVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCL---- 59
T+T +A GCG +N+GLF +AG++GL LS Q++ + FSYCL
Sbjct: 210 TLTPSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSF 269
Query: 60 -VDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
+S + L +S P T PL++N ++ + Y+LGLT I+V G L +S ++
Sbjct: 270 SAQPNSSVSGFLSIGASSLSSSPYKFT-PLVKNPKIPSLYFLGLTTITVAGKPLGVSASS 328
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSR 174
+ + I+DSGT +TRL YNAL+ +FV ++ + G ++ DTC+ S +
Sbjct: 329 YNVPT------IIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVK 382
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
VP + F G L L N L+ ++ GT C A A +S+ +SIIGN QQQ V+
Sbjct: 383 EMSTVPEIRIIFRGGAGLELKVHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQTFTVA 441
Query: 235 FNLRNSLIGFTPNKC 249
+++ NS IGF P C
Sbjct: 442 YDVANSKIGFAPGGC 456
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G + ++T+ LGS++V + GC + G GL+GLGGG+ S SQ + FSY
Sbjct: 218 GTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 277
Query: 58 CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL S S + S V P+LR+ ++ TFY + L I VGG L I +
Sbjct: 278 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 337
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F + G ++DSGT +TRL Y+AL AF G + P + DTC+DFS +S
Sbjct: 338 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 391
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
SV +P+V+ F G V+ L A ++ SN C AFA S SSL IIGNVQQ+ V
Sbjct: 392 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAGNSDDSSLGIIGNVQQRTFEV 445
Query: 234 SFNLRNSLIGFTPNKC 249
+++ ++GF C
Sbjct: 446 LYDVGRGVVGFRAGAC 461
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/238 (37%), Positives = 128/238 (53%), Gaps = 15/238 (6%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCL--VDRDSDSTST 69
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C V+ ST
Sbjct: 89 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVL 148
Query: 70 LEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
L+ + L N A PL++N TFYYL L GI+VG LP+ E+AF + +G GG
Sbjct: 149 LDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFAL-TNGTGG 207
Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
I+DSGT++T L + Y +RD F + + P + + TC+ S++ +VP +
Sbjct: 208 TIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQAKPDVPKLVL 266
Query: 185 HFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
HF EG + LP +NY+ +P D+ N C A +IIGN QQQ V ++L+N
Sbjct: 267 HF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYDLQN 322
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/263 (35%), Positives = 132/263 (50%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGSA-------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA- 52
G+ +T+TLG + + GCG ++ GLF A GL GLG +S SQ A
Sbjct: 225 GNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAK 284
Query: 53 --STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
+ FSYCL S + L S+ PPNA ++ + +FYYL L GI V G +
Sbjct: 285 YGAGFSYCLPS-SSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVR 343
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
+S F+ G ++DSGT +TRL + Y ALR +F R S AL DTC
Sbjct: 344 VSPAVFRTP-----GTVIDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTC 398
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNV 226
YDF+ R+ V++P+V+ F G L L L V + C AFA +S++I+GN+
Sbjct: 399 YDFTGRNKVQIPSVALLFDGGATLNLGFGEVLY-VANKSQACLAFASNGDDTSIAILGNM 457
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ V +++ N IGF C
Sbjct: 458 QQKTFAVVYDVANQKIGFGAKGC 480
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 88/256 (34%), Positives = 130/256 (50%), Gaps = 13/256 (5%)
Query: 1 GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STF 55
G+ +T+TLG +S + GCG ++ GLF A GL GLG +S SQ A + F
Sbjct: 273 GNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGF 332
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
SYCL ++ PP+A ++ + +FYYL L GI V G + ++
Sbjct: 333 SYCLPSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAV 392
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
FK G ++DSGT +TRL + Y+ALR +F R +++ DTCYDF+ R+
Sbjct: 393 FKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRT 447
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRV 233
V++P+V+ F G L L L V + C AFA +S+ I+GN+QQ+ V
Sbjct: 448 KVQIPSVALLFDGGATLNLGFGGVLY-VANRSQACLAFASNGDDTSVGILGNMQQKTFAV 506
Query: 234 SFNLRNSLIGFTPNKC 249
++L N IGF C
Sbjct: 507 VYDLANQKIGFGAKGC 522
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/255 (36%), Positives = 131/255 (51%), Gaps = 16/255 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
G + ++T+ LGS +V GC + G GL+GLGGG+ S SQ + FSY
Sbjct: 222 GTYSSDTLALGSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSY 281
Query: 58 CLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL S S TL +S V P+LR+ ++ TFY + + I VGG L I + F
Sbjct: 282 CLPATSSSSGFLTLGAGTS---GFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVF 338
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G I+DSGT +TRL Y+AL AF G + + DTC+DFS +SS
Sbjct: 339 ------SAGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSS 392
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
V +PTV+ F G V+ + + ++ SN C AFA S SSL IIGNVQQ+ V
Sbjct: 393 VSIPTVALVFSGGAVVDIASDGIMLQT-SNSILCLAFAANSDDSSLGIIGNVQQRTFEVL 451
Query: 235 FNLRNSLIGFTPNKC 249
+++ +GF C
Sbjct: 452 YDVGGGAVGFKAGAC 466
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/268 (36%), Positives = 140/268 (52%), Gaps = 25/268 (9%)
Query: 5 TETVTLGSAS------VDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSY 57
TET T GS++ V IA GC + + G +A GL+GLG GSLS SQ+ A FSY
Sbjct: 171 TETFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSY 230
Query: 58 CLVD-RDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
CL +D++STSTL S N V++ +YYL LTGIS+G LPI
Sbjct: 231 CLTPYQDTNSTSTLLLGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPP 290
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYDF 171
AF + G GG+I+DSGT +T L Y +R A V L TDG A D C++
Sbjct: 291 NAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAA-VLSLVTLPTTDGSAATGLDLCFEL 349
Query: 172 SSRSSV--EVPTVSFHFPEGKVLPLPAKNYLI----PVDSNGTFCFAFAPTSSS----LS 221
S +S +P+++ HF +G + LPA NY++ P + +C A + + +S
Sbjct: 350 PSSTSAPPSMPSMTLHF-DGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVS 408
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN QQQ + +++ + F P KC
Sbjct: 409 ILGNYQQQNMHILYDVGKETLSFAPAKC 436
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 90/253 (35%), Positives = 131/253 (51%), Gaps = 22/253 (8%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C ST+
Sbjct: 142 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVL 201
Query: 72 FDSSLPPN--------AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
D LP + T PL+ +N T YYL L GI+VG LP+ E+AF +
Sbjct: 202 LD--LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-T 258
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEV 179
+G GG I+DSGT++T L + Y +RD F + + P + + TC+ S++ +V
Sbjct: 259 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDV 317
Query: 180 PTVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
P + HF EG + LP +NY+ +P D+ N C A +IIGN QQQ V ++
Sbjct: 318 PKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYD 375
Query: 237 LRNSLIGFTPNKC 249
L+N+++ F +C
Sbjct: 376 LQNNMLSFVAAQC 388
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 132/264 (50%), Gaps = 22/264 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
G+ E +TLG +DN GCG NN+GLF GA+GL+GL LS SQ ++ S FSY
Sbjct: 238 GELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSY 297
Query: 58 CLVDRDSDSTSTLEFD-------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S+ +L ++ P + T +++N ++ FY+L LTGIS+GG L
Sbjct: 298 CLPTTGVGSSGSLTLGGADFSNFKNISPISYTR-MIQNPQMSNFYFLNLTGISIGGVNLN 356
Query: 111 ISETAFKIDESGNGGI--IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
+ S N G+ ++DSGT +TRL Y A + F + T G ++ +TC
Sbjct: 357 VPRL------SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTC 410
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPT--SSSLSIIGN 225
++ + V +PTV F F + + + V S+ + C AFA IIGN
Sbjct: 411 FNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGN 470
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV +N + S +GF C
Sbjct: 471 YQQKNQRVIYNSKESKVGFAGEPC 494
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 132/264 (50%), Gaps = 22/264 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
G+ E +TLG +DN GCG NN+GLF GA+GL+GL LS SQ ++ S FSY
Sbjct: 159 GELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSY 218
Query: 58 CLVDRDSDSTSTLEFD-------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S+ +L ++ P + T +++N ++ FY+L LTGIS+GG L
Sbjct: 219 CLPTTGVGSSGSLTLGGADFSNFKNISPISYTR-MIQNPQMSNFYFLNLTGISIGGVNLN 277
Query: 111 ISETAFKIDESGNGGI--IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
+ S N G+ ++DSGT +TRL Y A + F + T G ++ +TC
Sbjct: 278 VPRL------SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTC 331
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPT--SSSLSIIGN 225
++ + V +PTV F F + + + V S+ + C AFA IIGN
Sbjct: 332 FNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGN 391
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV +N + S +GF C
Sbjct: 392 YQQKNQRVIYNSKESKVGFAGEPC 415
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 90/253 (35%), Positives = 131/253 (51%), Gaps = 22/253 (8%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C ST+
Sbjct: 90 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVL 149
Query: 72 FDSSLPPN--------AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
D LP + T PL+ +N T YYL L GI+VG LP+ E+AF +
Sbjct: 150 LD--LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-T 206
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEV 179
+G GG I+DSGT++T L + Y +RD F + + P + + TC+ S++ +V
Sbjct: 207 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDV 265
Query: 180 PTVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
P + HF EG + LP +NY+ +P D+ N C A +IIGN QQQ V ++
Sbjct: 266 PKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYD 323
Query: 237 LRNSLIGFTPNKC 249
L+N+++ F +C
Sbjct: 324 LQNNMLSFVAAQC 336
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 90/253 (35%), Positives = 131/253 (51%), Gaps = 13/253 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ G+ SV N GCG +NEGLF +AGL+GL LS Q+ + +FSY
Sbjct: 211 GYLSKDTVSFGANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSY 270
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL + S+ L S P P++ N D+ Y++ L+G++V G L +S + +
Sbjct: 271 CL--PSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYT 328
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSS 176
+ I+DSGT +TRL T Y AL A + + ++ DTC++ +
Sbjct: 329 SLPT-----IIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKL 383
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
VP VS F G L L A N L+ VD T C AFAP S+ +IIGN QQQ V ++
Sbjct: 384 RAVPAVSMAFSGGATLKLSAGNLLVDVD-GATTCLAFAPARSA-AIIGNTQQQTFSVVYD 441
Query: 237 LRNSLIGFTPNKC 249
++++ IGF C
Sbjct: 442 VKSNRIGFAAAGC 454
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 101/271 (37%), Positives = 145/271 (53%), Gaps = 23/271 (8%)
Query: 1 GDFVTETVTL------GSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ GS+ +V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 248 GDFAVETFTVNLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 307
Query: 52 A---STFSYCLVDRDSDS--TSTLEFDS-----SLPPNAVTAPLLRNHEL-DTFYYLGLT 100
+ +FSYCLVDR+SD+ +S L F S P T+ + R L DTFYY+ +
Sbjct: 308 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIK 367
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
I V G++L I E + I G GG I+DSGT ++ Y +++ + P
Sbjct: 368 SIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVY 427
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
+ D C++ S S+++P + F +G V P +N I ++ + C A T S
Sbjct: 428 RDFPILDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAILGTPKS 486
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ SIIGN QQQ + ++ + S +G+ P KC
Sbjct: 487 AFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 517
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 88/252 (34%), Positives = 130/252 (51%), Gaps = 22/252 (8%)
Query: 14 SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
S+ + GCG NN G+F G+ G G G LS PSQ+ FS+C ST+
Sbjct: 142 SLPGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLL 201
Query: 73 DSSLPPN--------AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
D LP + T PL+ +N T YYL L GI+VG LP+ E+AF + +
Sbjct: 202 D--LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-TN 258
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEVP 180
G GG I+DSGT++T L + Y +RD F + + P + + TC+ S++ +VP
Sbjct: 259 GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDVP 317
Query: 181 TVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
+ HF EG + LP +NY+ +P D+ N C A +IIGN QQQ V ++L
Sbjct: 318 KLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYDL 375
Query: 238 RNSLIGFTPNKC 249
+N+++ F +C
Sbjct: 376 QNNMLSFVAAQC 387
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 133 bits (334), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 92/247 (37%), Positives = 127/247 (51%), Gaps = 16/247 (6%)
Query: 16 DNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCLVD-RDSDSTST 69
D+ GC G V GL+G G G LSF SQ A S FSYCL + S+ + T
Sbjct: 212 DHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGT 271
Query: 70 LEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES-GNGGII 127
L + P + T PLL N + YY+ + G+ V G +PI +A +D + G GG I
Sbjct: 272 LRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTI 331
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP 187
VD+GT TRL Y ALR+AF RG A + + FDTCY + S VP V+F F
Sbjct: 332 VDAGTMFTRLSPPAYAALRNAFRRGVSAPA-APALGGFDTCYYVNGTKS--VPAVAFVFA 388
Query: 188 EGKVLPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLI 242
G + LP +N +I S G C A A ++ L+++ ++QQQ RV F++ N +
Sbjct: 389 GGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRV 448
Query: 243 GFTPNKC 249
GF+ C
Sbjct: 449 GFSRELC 455
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 92/255 (36%), Positives = 129/255 (50%), Gaps = 17/255 (6%)
Query: 6 ETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD 61
ET++L S ++ A GCG N G F GL+GLG G LS SQ AS TFSYCL
Sbjct: 228 ETLSLTSTRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCL-P 286
Query: 62 RDSDSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
D+ + L + P + +++ + +FY++ L I +GG +LP+ T F
Sbjct: 287 SDNTTHGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTD 346
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
D G +DSGT +T L E Y ALRD F P FDTCYDF+ +S++
Sbjct: 347 D-----GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIF 401
Query: 179 VPTVSFHFPEGKVLPLPAKNYLI-PVDSN---GTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+P VSF F +G V L LI P D+ G F P++ +I+GN+QQ+ T V
Sbjct: 402 IPAVSFKFSDGSVFDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVI 461
Query: 235 FNLRNSLIGFTPNKC 249
+++ IGF C
Sbjct: 462 YDVAAEKIGFASASC 476
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 138/264 (52%), Gaps = 22/264 (8%)
Query: 5 TETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSYC 58
+ET T GS A V IA GC + G +A GL+GLG G LS SQ+ FSYC
Sbjct: 189 SETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYC 248
Query: 59 LVD-RDSDSTSTLEFDSSLPPNAV----TAPLLRNHE---LDTFYYLGLTGISVGGDLLP 110
L +D++STSTL S N + P + + ++TFYYL LTGIS+G L
Sbjct: 249 LTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS 308
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
I AF ++ G GG+I+DSGT +T L Y +R A V L TDG A D C
Sbjct: 309 IPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSADTGLDLC 367
Query: 169 YDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGN 225
+ S +S +P+++ HF G + LPA +Y++ D +G +C A T ++I+GN
Sbjct: 368 FMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGN 425
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + +++ + F P KC
Sbjct: 426 YQQQNMHILYDIGQETLSFAPAKC 449
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/258 (36%), Positives = 132/258 (51%), Gaps = 27/258 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G + ++T+ LGS ++ N GC H G GL+GLGGG+ S SQ + FSY
Sbjct: 221 GTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSY 280
Query: 58 CLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CL S S TL +S V P+LR+ + TFY + L I VGG L I + F
Sbjct: 281 CLPPTPSSSGFLTLGAGTS---GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF 337
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G+++DSGT +TRL Y+AL AF G + P ++ DTC+DFS +SS
Sbjct: 338 ------SAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSS 391
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQQGT 231
V +P+V+ F G V+ L D+NG C AFA S SS I+GNVQQ+
Sbjct: 392 VRLPSVALVFSGGAVVNL---------DANGIILGNCLAFAANSDDSSPGIVGNVQQRTF 442
Query: 232 RVSFNLRNSLIGFTPNKC 249
V +++ +GF C
Sbjct: 443 EVLYDVGGGAVGFKAGAC 460
>gi|356537173|ref|XP_003537104.1| PREDICTED: uncharacterized protein LOC100817302 [Glycine max]
Length = 328
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 93/141 (65%)
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
L ISE +++ + G+ G ++D+G VTRL T Y A RDAFV T L GV++F+TC
Sbjct: 188 LNISEDLYRVTDLGDEGAVMDTGITVTRLPTVAYGAFRDAFVAQTTNLPRAPGVSIFNTC 247
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
YD + +V VPTV F+F G++L + +N+LIP D GTF FAFA + S+LSIIGN+QQ
Sbjct: 248 YDLNGFVTVRVPTVLFYFSGGQILTILTQNFLIPADDVGTFYFAFAASPSALSIIGNIQQ 307
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+G ++S + N +GF N C
Sbjct: 308 EGIQISVDGANGFLGFGRNVC 328
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 138/264 (52%), Gaps = 22/264 (8%)
Query: 5 TETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSYC 58
+ET T GS A V IA GC + G +A GL+GLG G LS SQ+ FSYC
Sbjct: 129 SETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYC 188
Query: 59 LVD-RDSDSTSTLEFDSSLPPNAV----TAPLLRNHE---LDTFYYLGLTGISVGGDLLP 110
L +D++STSTL S N + P + + ++TFYYL LTGIS+G L
Sbjct: 189 LTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS 248
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
I AF ++ G GG+I+DSGT +T L Y +R A V L TDG A D C
Sbjct: 249 IPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSADTGLDLC 307
Query: 169 YDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGN 225
+ S +S +P+++ HF G + LPA +Y++ D +G +C A T ++I+GN
Sbjct: 308 FMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGN 365
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + +++ + F P KC
Sbjct: 366 YQQQNMHILYDIGQETLSFAPAKC 389
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 85/242 (35%), Positives = 125/242 (51%), Gaps = 12/242 (4%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S ++ + GCG +NEGLF AAG++GL LS +Q++ FSYCL S
Sbjct: 226 SQTLPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGG 285
Query: 69 TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
L P + P++RN + + Y+L L I+V G + ++ +++ I+
Sbjct: 286 FLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------II 339
Query: 129 DSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP 187
DSGT VTRL Y ALR+AFV+ +R ++ DTC+ S +S P + F
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQ 399
Query: 188 EGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
G L L A N LI D G C AFA +S+ ++IIGN QQQ +++++ S IGF P
Sbjct: 400 GGADLSLRAPNILIEAD-KGIACLAFA-SSNQIAIIGNHQQQTYNIAYDVSASKIGFAPG 457
Query: 248 KC 249
C
Sbjct: 458 GC 459
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/270 (35%), Positives = 130/270 (48%), Gaps = 24/270 (8%)
Query: 1 GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G + TE T S+ D + GCG N G +G++G G LS SQ++ F
Sbjct: 190 GVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRF 249
Query: 56 SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
SYCL S STL F D++ P T PLL++ + TFYY+ L G++VG
Sbjct: 250 SYCLTSYGSGRKSTLLFGSLSGGVYGDATGP--VQTTPLLQSLQNPTFYYVHLAGLTVGA 307
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDG 161
L I E+AF + G+GG+IVDSGTA+T L + AF + R +P DG
Sbjct: 308 RRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDG 367
Query: 162 VALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
V SS S V VP + FHF + L LP +NY++ G C A +
Sbjct: 368 VCFLVPAAWRRSSSTSQVPVPRMVFHFQDAD-LDLPRRNYVLDDHRKGRLCLLLADSGDD 426
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S IGN+ QQ RV ++L + F P +C
Sbjct: 427 GSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 101/271 (37%), Positives = 145/271 (53%), Gaps = 23/271 (8%)
Query: 1 GDFVTETVTL------GSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ GS+ +V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 227 GDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 286
Query: 52 A---STFSYCLVDRDSDS--TSTLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
+ +FSYCLVDR+SD+ +S L F D L PN + E +DTFYY+ +
Sbjct: 287 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIK 346
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
I V G++L I E + I G GG I+DSGT ++ Y +++ + P
Sbjct: 347 SILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVY 406
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
+ D C++ S +V++P + F +G V P +N I ++ + C A T S
Sbjct: 407 RDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAMLGTPKS 465
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ SIIGN QQQ + ++ + S +G+ P KC
Sbjct: 466 AFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 496
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/255 (34%), Positives = 132/255 (51%), Gaps = 16/255 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TLGS ++ GC + G F GL+GLGG + S SQ + FS
Sbjct: 222 GTYSSDTLTLGSNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFS 281
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S+ L ++ V P+LR+ ++ T+Y + L I VGG L I + F
Sbjct: 282 YCLPPTPG-SSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF 340
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G ++DSGT +TRL Y+AL AF G + P + DTC+DFS +SS
Sbjct: 341 ------SAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSS 394
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
V +P+V+ F G V+ L ++ +D+ +C AFA S SSL IGNVQQ+ V
Sbjct: 395 VSIPSVALVFSGGAVVNLDFNGIMLELDN---WCLAFAANSDDSSLGFIGNVQQRTFEVL 451
Query: 235 FNLRNSLIGFTPNKC 249
+++ +GF C
Sbjct: 452 YDVGGGAVGFRAGAC 466
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 106/203 (52%), Gaps = 5/203 (2%)
Query: 50 INASTFSYCLVDRDSDSTSTLEFDS--SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
+ + FSYCL D S L S +A++ PLL N +FYYL L GI VGG
Sbjct: 1 MKEAKFSYCLTSMDDSKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGT 60
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
L I ++ F + + G+GG+I+DSGT +T L+ ++ L+ F+ + D
Sbjct: 61 QLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLDV 120
Query: 168 CYDFSSRSS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
C+ S ++ VEVP + FHF G L LPA++Y+I G C A S+ +SI GNV
Sbjct: 121 CFSLPSETTQVEVPKLVFHFKGGD-LELPAESYMIADSKLGVACLAMG-ASNGMSIFGNV 178
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V+ +L I F P +C
Sbjct: 179 QQQNILVNHDLEKETISFVPTQC 201
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 101/271 (37%), Positives = 145/271 (53%), Gaps = 23/271 (8%)
Query: 1 GDFVTETVTL------GSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ GS+ +V+N+ GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 263 GDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 322
Query: 52 A---STFSYCLVDRDSDS--TSTLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
+ +FSYCLVDR+SD+ +S L F D L PN + E +DTFYY+ +
Sbjct: 323 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIK 382
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
I V G++L I E + I G GG I+DSGT ++ Y +++ + P
Sbjct: 383 SILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVY 442
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
+ D C++ S +V++P + F +G V P +N I ++ + C A T S
Sbjct: 443 RDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAMLGTPKS 501
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ SIIGN QQQ + ++ + S +G+ P KC
Sbjct: 502 AFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 138/276 (50%), Gaps = 28/276 (10%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
GDF ET+TL S + N GCG N G F GAAG++GLG G +S +Q+ ++
Sbjct: 93 GDFALETLTLRSSGGSSKAFPNFQFGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAIN 152
Query: 54 -TFSYCLVDRDSDS--TSTLEFDSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
FSYCLVD D DS TS L F SS A++ P++ N T+Y++GL GISVGG
Sbjct: 153 NKFSYCLVDFDDDSSKTSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQ 212
Query: 109 LPISETAF-------------KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA 155
L ++ A + E +GG I DSGT +T L Y+ ++ AF
Sbjct: 213 LSLATRAIDFLSVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSL 272
Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAF- 213
+ + FD CYD S + + P ++ F K P P KNY + VD+ T C A
Sbjct: 273 PTVDASSSGFDLCYDVSKSKNFKFPALTLAFKGTKFSP-PQKNYFVIVDTAETVACLAMG 331
Query: 214 APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S L IIGN+ QQ V ++ S I +P +C
Sbjct: 332 GSGSLGLGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/264 (36%), Positives = 138/264 (52%), Gaps = 22/264 (8%)
Query: 5 TETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSYC 58
+ET T GS + V IA GC + G +A GL+GLG G LS SQ+ FSYC
Sbjct: 187 SETFTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYC 246
Query: 59 LVD-RDSDSTSTLEFDSSLPPNAV----TAPLLRNHE---LDTFYYLGLTGISVGGDLLP 110
L +D++STSTL S N + P + + ++TFYYL LTGIS+G L
Sbjct: 247 LTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS 306
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
I AF ++ G GG+I+DSGT +T L Y +R A V L TDG A D C
Sbjct: 307 IPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSAATGLDLC 365
Query: 169 YDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGN 225
+ S +S +P+++ HF G + LPA +Y++ D +G +C A T ++I+GN
Sbjct: 366 FMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGN 423
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + +++ + F P KC
Sbjct: 424 YQQQNMHILYDIGQETLSFAPAKC 447
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 85/254 (33%), Positives = 139/254 (54%), Gaps = 18/254 (7%)
Query: 7 TVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL-VDR 62
T+T A GCG +N+GLF ++G++GL +S Q++ FSYCL
Sbjct: 216 TLTPSEAPSSGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSF 275
Query: 63 DSDSTSTLEFDSSLPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
+ ++S+L S+ +++T+ PL++N ++ + Y+L LT I+V G L +S +++
Sbjct: 276 SAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSY 335
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRS 175
+ I+DSGT +TRL YNAL+ +FV ++ + G ++ DTC+ S +
Sbjct: 336 NVPT------IIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKE 389
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
VP + F G L L A N L+ ++ GT C A A +S+ +SIIGN QQQ +V++
Sbjct: 390 MSTVPEIQIIFRGGAGLELKAHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQTFKVAY 448
Query: 236 NLRNSLIGFTPNKC 249
++ N IGF P C
Sbjct: 449 DVANFKIGFAPGGC 462
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/260 (37%), Positives = 136/260 (52%), Gaps = 20/260 (7%)
Query: 9 TLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD--- 61
T G A+V +A GCG N+G F G G++GLG G LSFP+Q + A TFSYCL+D
Sbjct: 168 TSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEG 227
Query: 62 -RDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
R S+S L A PL+ N TFYY+G+ I VG +LP+ + + ID
Sbjct: 228 GRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAID 287
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSS 176
GNGG ++DSG+ +T L+ Y L AF V R S + CY+ SS SS
Sbjct: 288 VLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSS 347
Query: 177 VE-----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQ 229
P ++ F +G L LP NYL+ V ++ C A PT S + +++GN+ QQ
Sbjct: 348 SAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFAFNVLGNLMQQ 406
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
G V F+ ++ IGF +C
Sbjct: 407 GYHVEFDRASARIGFARTEC 426
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/281 (34%), Positives = 136/281 (48%), Gaps = 36/281 (12%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
G F ET TL ++S + +IA GCG + G F GA+G++GLG G +SF SQ
Sbjct: 179 GFFSKETTTLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQ 238
Query: 50 IN---ASTFSYCLVDRD-----------SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFY 95
+ +FSYCL+D D ST + + S+ PLL N E TFY
Sbjct: 239 LGRRFGRSFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSM---MSFTPLLINPEAPTFY 295
Query: 96 YLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA 155
Y+ + G+ V G L I + + +DE GNGG ++DSGT +T L Y + AF R +
Sbjct: 296 YISIKGVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKL 355
Query: 156 LSPTDGVAL----FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
SPT G A FD C + + S P +S + P +NY I + S G C
Sbjct: 356 PSPTPGGASTRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDI-SEGIKCL 414
Query: 212 AFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A P S S+IGN+ QQG + F+ S +GF+ C
Sbjct: 415 AIQPVEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 133/259 (51%), Gaps = 17/259 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G+ E + LG+ +V+N GCG N+GLF GA+GL+GLG LS SQI+ FSY
Sbjct: 158 GEVGMEHLNLGNTTVNNFIFGCGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSY 217
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNAVTAPLLR--NHELDTFYYLGLTGISVGGDLLPISE 113
CL +++++ +L +SS+ N R ++ L FY+L LTGI+VGG + +
Sbjct: 218 CLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGG--VEVQA 275
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
+F D +I+DSGT ++RL Y AL+ FV+ + D+C++ S
Sbjct: 276 PSFGKDR-----MIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSG 330
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQG 230
V++P + +F L + V ++ + C A A P + IIGN QQ+
Sbjct: 331 YQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKN 390
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+ ++ + S++GF C
Sbjct: 391 QRIIYDTKGSMLGFAEEAC 409
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 95/257 (36%), Positives = 127/257 (49%), Gaps = 15/257 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G F E ++L S V +N GCG NN GLF G AGLLGL LS SQ FS
Sbjct: 240 GFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFS 299
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
YCL S ST L F S + P N + +FY+L + GISVG LPI ++
Sbjct: 300 YCL-PSSSSSTGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKS 358
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F G I+DSGT ++RL Y++++ F GV++ DTCYD S
Sbjct: 359 VFS-----TAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKY 413
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTR 232
+V+VP + +F G + L A +I V C AFA S ++IIGNVQQ+
Sbjct: 414 KTVKVPKIILYFSGGAEMDL-APEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIH 472
Query: 233 VSFNLRNSLIGFTPNKC 249
V ++ +GF P+ C
Sbjct: 473 VVYDDAEGRVGFAPSGC 489
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/259 (37%), Positives = 134/259 (51%), Gaps = 23/259 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNN---EGLFVGAA-GLLGLGGGSLSFPSQINA--- 52
G + ++T+ L S V+N GC + EGL GL+GLGGG+ S SQ A
Sbjct: 213 GTYGSDTLALNSTEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYG 272
Query: 53 STFSYCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
S FSYCL + S+ L +S + VT P+ R+ TFY++ L GI+VGGD + I
Sbjct: 273 SAFSYCL-PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAI 331
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
S T F G I+DSGT +TRL Y+AL AF G R ++ DTC+DF
Sbjct: 332 SPTVFA------AGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDF 385
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIGNVQQQG 230
+ + +V +P V F G V+ L A + C AFAP + + SIIGNVQQ+
Sbjct: 386 TGQDNVSIPAVELVFSGGAVVDLDADGIMY------GSCLAFAPATGGIGSIIGNVQQRT 439
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V ++ S++GF P C
Sbjct: 440 FEVLHDVGQSVLGFRPGAC 458
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/247 (40%), Positives = 125/247 (50%), Gaps = 29/247 (11%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS-T 69
+V GCGH G+F G GLL LG S+S SQ + FSYCL + S + T
Sbjct: 248 TVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLT 307
Query: 70 LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
L SS A T LL TFY + LTGISVGG + + +AF GG +VD
Sbjct: 308 LGGPSSASGFATTG-LLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVD 360
Query: 130 SGTAVTRLQTETYNALRDAFVRGTRA-----LSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
+GT +TRL Y ALR AF RG A +P +G+ DTCYDFS V +PTV+
Sbjct: 361 TGTVITRLPPTAYAALRSAF-RGAIAPCGYPSAPANGI--LDTCYDFSRYGVVTLPTVAL 417
Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLI 242
F G L L A L S+G C AFAP +I+GNVQQ+ V F+ S +
Sbjct: 418 TFSGGATLALEAPGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTV 469
Query: 243 GFTPNKC 249
GF P C
Sbjct: 470 GFMPGAC 476
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 96/255 (37%), Positives = 126/255 (49%), Gaps = 11/255 (4%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F TE +T+ + V N GCG N G F AGLLGLG G LS Q + F+
Sbjct: 137 GFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLGRGKLSLALQTSEKYNNLFT 196
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S ST L +P + PL + FY + + G+SVGG +LPI + F
Sbjct: 197 YCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVF 256
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
N G I+DSGT +TRLQ Y+AL F + + TDG ++ DTCYDFS S
Sbjct: 257 S-----NAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTDGFSILDTCYDFSGNES 311
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVS 234
+ VP +SF F G + + L +++ C AFAP + GN QQQ V
Sbjct: 312 ISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVV 371
Query: 235 FNLRNSLIGFTPNKC 249
+L IGF P+ C
Sbjct: 372 HDLAKGRIGFAPSGC 386
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 96/264 (36%), Positives = 130/264 (49%), Gaps = 21/264 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G F E+ T+ +D +A GCG +N+G F A G+LGLG G LSF SQ+ + F+Y
Sbjct: 158 GVFAYESATVDDVRIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAY 217
Query: 58 CLVDR-DSDSTSTL-----EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
CLV+ D S S+ E S++ T P++ N T YY+ + + VGG+ LPI
Sbjct: 218 CLVNYLDPTSVSSWLIFGDELISTIHDLQFT-PIVSNSRNPTLYYVQIEKVMVGGESLPI 276
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTC 168
S +A+ +D GNGG I DSGT VT Y + AF VR RA S V D C
Sbjct: 277 SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAAS----VQGLDLC 332
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL---SIIGN 225
D + P+ + G V NY + V N C A A SS+ + IGN
Sbjct: 333 VDVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPN-VQCLAMAGLPSSVGGFNTIGN 391
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+ QQ V ++ + IGF P KC
Sbjct: 392 LLQQNFLVQYDREENRIGFAPAKC 415
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 98/266 (36%), Positives = 141/266 (53%), Gaps = 25/266 (9%)
Query: 5 TETVTLGSAS------VDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSY 57
+ET T GS++ V IA GC + + G +A GL+GLG GSLS SQ+ FSY
Sbjct: 181 SETFTFGSSTPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSY 240
Query: 58 CLVD-RDSDSTSTLEFDSSLPPN----AVTAPLL---RNHELDTFYYLGLTGISVGGDLL 109
CL +D++STSTL S N + P + + + T+YYL LTGIS+G L
Sbjct: 241 CLTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTAL 300
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL---FD 166
I TA + G GG I+DSGT +T L Y +R A V L TDG + D
Sbjct: 301 SIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGGSAATGLD 359
Query: 167 TCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSII 223
C++ S +S +P+++ HF +G + LPA +Y++ +DSN +C A T +SI+
Sbjct: 360 LCFELPSSTSAPPTMPSMTLHF-DGADMVLPADSYMM-LDSN-LWCLAMQNQTDGGVSIL 416
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN QQQ + +++ + F P KC
Sbjct: 417 GNYQQQNMHILYDVGQETLTFAPAKC 442
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 91/252 (36%), Positives = 134/252 (53%), Gaps = 13/252 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS S+ N GCG +NEGLF +AGL+GL LS Q+ S +F+Y
Sbjct: 91 GYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTY 150
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S +L + P P++ + D+ Y++ L+G++V G+ L +S +A+
Sbjct: 151 CLPSSSSSGYLSLGSYN--PGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYS 208
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ I+DSGT +TRL T Y+AL A + S ++ DTC+ S V
Sbjct: 209 SLPT-----IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFK-GQASRV 262
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
P V+ F G L L A+N L+ VD + T C AFAP S+ +IIGN QQQ V +++
Sbjct: 263 SAPAVTMSFAGGAALKLSAQNLLVDVD-DSTTCLAFAPARSA-AIIGNTQQQTFSVVYDV 320
Query: 238 RNSLIGFTPNKC 249
++S IGF C
Sbjct: 321 KSSRIGFAAGGC 332
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 83/251 (33%), Positives = 126/251 (50%), Gaps = 18/251 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
G+ E + G+ V + GCG NN+GLF G +GL+GLG LS SQ I FSY
Sbjct: 90 GELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTSGIFGGVFSY 149
Query: 58 CL--VDRDSDSTSTLEFDSSLPPNAV---TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL +R + L +SS+ N+ A ++ N +L FY++ LTGIS+GG
Sbjct: 150 CLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGG------ 203
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
A + G I+VDSGT +TRL Y AL+ F++ P ++ DTC++ S
Sbjct: 204 -VALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLS 262
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQ 229
+ V++PT+ HF L + V S+ + C A A ++I+GN QQ+
Sbjct: 263 AYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQK 322
Query: 230 GTRVSFNLRNS 240
RV ++ + +
Sbjct: 323 NLRVIYDTKET 333
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 100/247 (40%), Positives = 125/247 (50%), Gaps = 29/247 (11%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS-T 69
+V GCGH G+F G GLL LG S+S SQ + FSYCL + S + T
Sbjct: 248 TVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLT 307
Query: 70 LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
L +S A T LL TFY + LTGISVGG + + +AF GG +VD
Sbjct: 308 LGGPTSASGFATTG-LLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVD 360
Query: 130 SGTAVTRLQTETYNALRDAFVRGTRA-----LSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
+GT +TRL Y ALR AF RG A +P +G+ DTCYDFS V +PTV+
Sbjct: 361 TGTVITRLPPTAYAALRSAF-RGAIAPYGYPSAPANGI--LDTCYDFSRYGVVTLPTVAL 417
Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLI 242
F G L L A L S+G C AFAP +I+GNVQQ+ V F+ S +
Sbjct: 418 TFSGGATLALEAPGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTV 469
Query: 243 GFTPNKC 249
GF P C
Sbjct: 470 GFMPGAC 476
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 94/243 (38%), Positives = 118/243 (48%), Gaps = 15/243 (6%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDSTSTLE 71
V + GCG +NEGLF G AGL+GL +SF Q I FSYCL S S L
Sbjct: 246 VHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPS-SLGHLT 304
Query: 72 FDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGIIV 128
F +S NA P ++FY L + GISVGG LP +S + F GG I+
Sbjct: 305 FGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSII 359
Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
DSGT +TRL Y ALR AF + G L DTCYDFS + VP + F F
Sbjct: 360 DSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAG 419
Query: 189 GKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
G + LP L +S C AFA + ++I GNVQQ+ V +++ IGF
Sbjct: 420 GVKVELPLVGILYG-ESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGA 478
Query: 247 NKC 249
C
Sbjct: 479 AGC 481
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 129 bits (325), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 93/258 (36%), Positives = 129/258 (50%), Gaps = 15/258 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+T+TL + + + GC G + A GL+GLG G LS SQ + STFSY
Sbjct: 176 ASLTQDTLTLANDVIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSY 235
Query: 58 CLVD-RDSDSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
CL + + S+ + +L P + T PLL+N + YY+ L GI VG ++ I +A
Sbjct: 236 CLPNSKSSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSA 295
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
D S G I DSGT TRL Y A+R+ F R + + T + FDTCY
Sbjct: 296 LAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATS-LGGFDTCYS----G 350
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGT 231
SV P+V+F F G + LP N LI S T C A A +S L++I ++QQQ
Sbjct: 351 SVVYPSVTFMF-AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNH 409
Query: 232 RVSFNLRNSLIGFTPNKC 249
RV +L NS +G + C
Sbjct: 410 RVLIDLPNSRLGISRETC 427
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 129 bits (325), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 124/261 (47%), Gaps = 24/261 (9%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
V + +TL + + GC + G + GLLGLG G +S SQ A FSYCL
Sbjct: 187 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 246
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P + T PLLRN + YY+ LTG+SVG +PI
Sbjct: 247 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 301
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
D + G I+DSGT +TR Y A+RD F + P + FDTC F+
Sbjct: 302 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 357
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
+ + E P ++ HF EG L LP +N LI S C + A +S L++I N+QQ
Sbjct: 358 ATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q R+ F+ NS +G C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 124/261 (47%), Gaps = 24/261 (9%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
V + +TL + + GC + G + GLLGLG G +S SQ A FSYCL
Sbjct: 187 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 246
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P + T PLLRN + YY+ LTG+SVG +PI
Sbjct: 247 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 301
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
D + G I+DSGT +TR Y A+RD F + P + FDTC F+
Sbjct: 302 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 357
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
+ + E P ++ HF EG L LP +N LI S C + A +S L++I N+QQ
Sbjct: 358 ATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q R+ F+ NS +G C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/252 (36%), Positives = 134/252 (53%), Gaps = 13/252 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS S+ N GCG +NEGLF +AGL+GL LS Q+ S +F+Y
Sbjct: 216 GYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTY 275
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S +L + P P++ + D+ Y++ L+G++V G+ L +S +A+
Sbjct: 276 CLPSSSSSGYLSLGSYN--PGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYS 333
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ I+DSGT +TRL T Y+AL A + S ++ DTC+ S V
Sbjct: 334 SLPT-----IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFK-GQASRV 387
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
P V+ F G L L A+N L+ VD + T C AFAP S+ +IIGN QQQ V +++
Sbjct: 388 SAPAVTMSFAGGAALKLSAQNLLVDVD-DSTTCLAFAPARSA-AIIGNTQQQTFSVVYDV 445
Query: 238 RNSLIGFTPNKC 249
++S IGF C
Sbjct: 446 KSSRIGFAAGGC 457
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 125/252 (49%), Gaps = 13/252 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS S N GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 228 GYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL ST L + P+ + + Y++ L+G+SVGG L +S
Sbjct: 288 CL--PTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPA--- 342
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
E + I+DSGT +TRL T Y AL A + ++ DTC+ S +
Sbjct: 343 --EYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTCFQ-GQASQL 399
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP V+ F G L L +N LI VD + T C AFAPT S+ +IIGN QQQ V +++
Sbjct: 400 RVPAVAMAFAGGATLKLATQNVLIDVD-DSTTCLAFAPTDST-TIIGNTQQQTFSVVYDV 457
Query: 238 RNSLIGFTPNKC 249
S IGF C
Sbjct: 458 AQSRIGFAAGGC 469
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/258 (37%), Positives = 127/258 (49%), Gaps = 32/258 (12%)
Query: 6 ETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD 61
E +T+GS + +N GCG + +GLF AAGLLGLG LS SQ FSYCL
Sbjct: 223 ERLTIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL-- 280
Query: 62 RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
S ST L F SS +A PL + +FY L LTGI+VGG L I + F
Sbjct: 281 PSSSSTGFLSFGSSQSKSAKFTPL--SSGPSSFYNLDLTGITVGGQKLAIPLSVFS---- 334
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
G I+DSGT VTRL Y+ALR AF + + +++ DTCYDFS +++VP
Sbjct: 335 -TAGTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPK 393
Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTF--------CFAFAPTSSS--LSIIGNVQQQGT 231
+ F G + VD G F C AFA + + +I GN QQ+
Sbjct: 394 IVISFSGG---------VDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNF 444
Query: 232 RVSFNLRNSLIGFTPNKC 249
V +++ +GF P C
Sbjct: 445 EVVYDVSGGKVGFAPASC 462
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 36/264 (13%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL GS ++ GCGH +GLF G GLLGLG S SQ +++ FS
Sbjct: 233 GVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFS 292
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
YCL + +++ + S P++ T PLL T+Y + L GISVGG L I
Sbjct: 293 YCL----PPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSID 348
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDT 167
+ F G +VD+GT VTRL Y+ALR AF A++P + DT
Sbjct: 349 ASVFA------SGAVVDTGTVVTRLPPTAYSALRSAF---RAAMAPYGYPSAPATGILDT 399
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGN 225
CYDF+ +V +PT+S F G + L L + C AFAPT S SI+GN
Sbjct: 400 CYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT------SGCLAFAPTGGDSQASILGN 453
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
VQQ+ V F+ S +GF P C
Sbjct: 454 VQQRSFEVRFD--GSTVGFMPASC 475
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 36/264 (13%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL GS ++ GCGH +GLF G GLLGLG S SQ +++ FS
Sbjct: 222 GVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFS 281
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
YCL + +++ + S P++ T PLL T+Y + L GISVGG L I
Sbjct: 282 YCL----PPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSID 337
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDT 167
+ F G +VD+GT VTRL Y+ALR AF A++P + DT
Sbjct: 338 ASVFA------SGAVVDTGTVVTRLPPTAYSALRSAF---RAAMAPYGYPSAPATGILDT 388
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGN 225
CYDF+ +V +PT+S F G + L L + C AFAPT S SI+GN
Sbjct: 389 CYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT------SGCLAFAPTGGDSQASILGN 442
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
VQQ+ V F+ S +GF P C
Sbjct: 443 VQQRSFEVRFD--GSTVGFMPASC 464
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 135/272 (49%), Gaps = 24/272 (8%)
Query: 1 GDFVTETVTLGSASVDNIAI----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G + TE T S+S + +++ GCG N G +G++G G LS SQ++ FS
Sbjct: 191 GVYATERFTFASSSGEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFS 250
Query: 57 YCLVDRDSDSTSTLEF----------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
YCL S STL F D + T LL++ + TFYY+ TG++VG
Sbjct: 251 YCLTPYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGT 310
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVT----RLQTETYNALRDAF-VRGTRALSPTDG 161
L I +AF + G+GG+IVDSGTA+T + TE A R + T + SP DG
Sbjct: 311 RRLRIPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDG 370
Query: 162 VA----LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
V + S+ + V VP ++FHF +G L LP +NY++ G+ C A +
Sbjct: 371 VCFATPMAAGGRRASAATVVSVPRMAFHF-QGADLELPRRNYVLDDPRRGSLCILLADSG 429
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S + IGN QQ RV ++L + F P +C
Sbjct: 430 DSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 84/247 (34%), Positives = 123/247 (49%), Gaps = 16/247 (6%)
Query: 18 IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD--SS 75
+ GCG + G VGA+GL+GL G++S SQ++ FSYCL TS + F +
Sbjct: 94 LGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFAERKTSPMLFGAMAD 153
Query: 76 LPPNAVTAP-----LLRNHELDTF-YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
L T P +LRN +DTF YY+ L G+S+G L + + I+ G GG IVD
Sbjct: 154 LRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPDGTGGTIVD 213
Query: 130 SGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS---RSSVEVPTVSFHF 186
SG+ + L + ++A++ A + + V ++ C+ S ++V+ P + HF
Sbjct: 214 SGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAAVKTPPLVLHF 273
Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL----SIIGNVQQQGTRVSFNLRNSLI 242
G + LP NY + G C A A + L SIIGNVQQQ V F++ N
Sbjct: 274 DGGAAMALPRDNYFQEPRA-GLMCLAVARSPEDLGAPISIIGNVQQQNMHVLFDVHNQKF 332
Query: 243 GFTPNKC 249
F P KC
Sbjct: 333 SFAPTKC 339
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 137/265 (51%), Gaps = 35/265 (13%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL S++V GCGH GLF G GLLGLG S Q + FS
Sbjct: 233 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 292
Query: 57 YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL + S + T L S P T LL + T+Y + LTGISVGG L +
Sbjct: 293 YCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 352
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
+AF GG +VD+GT +TRL Y ALR AF G + +P++G+ DTCY
Sbjct: 353 SAFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 404
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
+F+ +V +P V+ F G + L A L +F C AFAP+ S ++I+GNV
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVMLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 457
Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
QQ+ SF +R + +GF P+ C
Sbjct: 458 QQR----SFEVRIDGTSVGFKPSSC 478
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 14/253 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G T+TV+ GS + GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 228 GSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
CL + ST L + + + + LD + Y++ L+G+SVGG L +S +
Sbjct: 288 CL--PTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-- 343
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
E + I+DSGT +TRL T + AL A + ++ DTC++ S
Sbjct: 344 ---EYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE-GQASQ 399
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
+ VPTV+ F G + L +N LI VD + T C AFAPT S+ +IIGN QQQ V ++
Sbjct: 400 LRVPTVAMAFAGGASMKLTTRNVLIDVD-DSTTCLAFAPTDST-AIIGNTQQQTFSVIYD 457
Query: 237 LRNSLIGFTPNKC 249
+ S IGF+ C
Sbjct: 458 VAQSRIGFSAGGC 470
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 122/239 (51%), Gaps = 15/239 (6%)
Query: 21 GCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVD-RDSDSTSTLEFDSSL 76
GC G V GL+G G G LSF SQ S FSYCL + R S+ + TL+
Sbjct: 191 GCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIG 250
Query: 77 PPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
P + T PLL N + YY+ + GI VG ++ + ++A + G I+D+GT T
Sbjct: 251 QPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFT 310
Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
RL Y A+RDAF RG + FDTCY+ +V VPTV+F F + LP
Sbjct: 311 RLAAPVYAAVRDAF-RGRVRTPVAPPLGGFDTCYNV----TVSVPTVTFMFAGAVAVTLP 365
Query: 196 AKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+N +I S G C A A +++L+++ ++QQQ RV F++ N +GF+ C
Sbjct: 366 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 122/239 (51%), Gaps = 15/239 (6%)
Query: 21 GCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVD-RDSDSTSTLEFDSSL 76
GC G V GL+G G G LSF SQ S FSYCL + R S+ + TL+
Sbjct: 210 GCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIG 269
Query: 77 PPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
P + T PLL N + YY+ + GI VG ++ + ++A + G I+D+GT T
Sbjct: 270 QPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFT 329
Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
RL Y A+RDAF RG + FDTCY+ +V VPTV+F F + LP
Sbjct: 330 RLAAPVYAAVRDAF-RGRVRTPVAPPLGGFDTCYNV----TVSVPTVTFMFAGAVAVTLP 384
Query: 196 AKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+N +I S G C A A +++L+++ ++QQQ RV F++ N +GF+ C
Sbjct: 385 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/256 (35%), Positives = 131/256 (51%), Gaps = 15/256 (5%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
+T+TL + + N GC + G + A GL+GLG G LS SQ + STFSYCL
Sbjct: 175 LTQDTLTLATDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL 234
Query: 60 VD-RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
+ + S+ + +L + P T PLL+N + YY+ L GI VG ++ I +A
Sbjct: 235 PNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALA 294
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
D + G I DSGT TRL Y A+R+ F R + + T + FDTCY SV
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATS-LGGFDTCYS----GSV 349
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APT--SSSLSIIGNVQQQGTRV 233
P+V+F F G + LP N LI + C A APT +S L++I ++QQQ RV
Sbjct: 350 VFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRV 408
Query: 234 SFNLRNSLIGFTPNKC 249
++ NS +G + C
Sbjct: 409 LIDVPNSRLGISRETC 424
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 89/258 (34%), Positives = 128/258 (49%), Gaps = 24/258 (9%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDR 62
+++ L ++ + + GC + G + GLLGLG G +S SQ + + FSYC
Sbjct: 191 DSLGLAVDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCF--- 247
Query: 63 DSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
S + F SL P N T PLLRN T YY+ LTG+SVG L+P++
Sbjct: 248 --PSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPEL 305
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
D + G I+DSGT +TR Y A+RD F + + P + FDTC F++ +
Sbjct: 306 LAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG--PFATIGAFDTC--FAATN 361
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGT 231
P V+FHF G L LP +N LI + C A A +S L++I N+QQQ
Sbjct: 362 EDIAPPVTFHF-TGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNL 420
Query: 232 RVSFNLRNSLIGFTPNKC 249
R+ F++ NS +G C
Sbjct: 421 RIMFDVTNSRLGIARELC 438
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 93/253 (36%), Positives = 132/253 (52%), Gaps = 14/253 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G T+TV+ GS S + GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 228 GYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
CL + ST L + + + + LD + Y++ L+G+SVGG L +S +
Sbjct: 288 CL--PTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-- 343
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
E + I+DSGT +TRL T + AL A + ++ DTC++ S
Sbjct: 344 ---EYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE-GQASQ 399
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
+ VPTV F G + L +N LI VD + T C AFAPT S+ +IIGN QQQ V ++
Sbjct: 400 LRVPTVVMAFAGGASMKLTTRNVLIDVD-DSTTCLAFAPTDST-AIIGNTQQQTFSVIYD 457
Query: 237 LRNSLIGFTPNKC 249
+ S IGF+ C
Sbjct: 458 VAQSRIGFSAGGC 470
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 125/270 (46%), Gaps = 22/270 (8%)
Query: 1 GDFVTETVTLGSASVDN-------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS 53
G + TE T S+ + GCG N G +G++G G LS SQ++
Sbjct: 184 GVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIR 243
Query: 54 TFSYCLVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL S STL F S T PLL++ + TFYY+ TG++VG
Sbjct: 244 RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGA 303
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-----SPTDG 161
L I E+AF + G+GG+IVDSGTA+T L + AF + R +P DG
Sbjct: 304 RRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDG 363
Query: 162 VALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
V SS S + VP + HF +G L LP +NY++ G C A +
Sbjct: 364 VCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDD 422
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S IGN+ QQ RV ++L + P +C
Sbjct: 423 GSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 125/270 (46%), Gaps = 22/270 (8%)
Query: 1 GDFVTETVTLGSASVDN-------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS 53
G + TE T S+ + GCG N G +G++G G LS SQ++
Sbjct: 36 GVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIR 95
Query: 54 TFSYCLVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL S STL F S T PLL++ + TFYY+ TG++VG
Sbjct: 96 RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGA 155
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDG 161
L I E+AF + G+GG+IVDSGTA+T L + AF + R +P DG
Sbjct: 156 RRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDG 215
Query: 162 VALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
V SS S + VP + HF +G L LP +NY++ G C A +
Sbjct: 216 VCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDD 274
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S IGN+ QQ RV ++L + P +C
Sbjct: 275 GSTIGNLVQQDMRVLYDLEAETLSIAPARC 304
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/256 (35%), Positives = 129/256 (50%), Gaps = 15/256 (5%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
+T+TL S + N GC + G + A GL+GLG G LS SQ + STFSYCL
Sbjct: 175 LTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL 234
Query: 60 VD-RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
+ + S+ + +L + P T PLL+N + YY+ L GI VG ++ I +A
Sbjct: 235 PNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALA 294
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
D + G I DSGT TRL Y A+R+ F R + + T + FDTCY SV
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCYS----GSV 349
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRV 233
P+V+F F G + LP N LI + C A A +S L++I ++QQQ RV
Sbjct: 350 VFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRV 408
Query: 234 SFNLRNSLIGFTPNKC 249
++ NS +G + C
Sbjct: 409 LIDVPNSRLGISRETC 424
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 94/264 (35%), Positives = 127/264 (48%), Gaps = 29/264 (10%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYC 58
+ +T+ L V A GC G V GLLG G G LSF SQ + STFSYC
Sbjct: 119 NLTRDTIALSMDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYC 178
Query: 59 LVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
L S TL F SL PP T PLL+N + YY+ L GI VG ++ I
Sbjct: 179 L-----PSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDI 233
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
+A + + G I DSGT TRL Y A+R+ F + G +S G FDTCY
Sbjct: 234 PRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGG---FDTCY 290
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
+ PT++F F G + +P +N LI + T C A A +S L++I +
Sbjct: 291 SV----PIVPPTITFMF-SGMNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIAS 345
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ R+ F++ NS +G +C
Sbjct: 346 MQQQNHRILFDVPNSRLGVAREQC 369
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/256 (35%), Positives = 129/256 (50%), Gaps = 15/256 (5%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
+T+TL S + N GC + G + A GL+GLG G LS SQ + STFSYCL
Sbjct: 175 LTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL 234
Query: 60 VD-RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
+ + S+ + +L + P T PLL+N + YY+ L GI VG ++ I +A
Sbjct: 235 PNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALA 294
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
D + G I DSGT TRL Y A+R+ F R + + T + FDTCY SV
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCYS----GSV 349
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRV 233
P+V+F F G + LP N LI + C A A +S L++I ++QQQ RV
Sbjct: 350 VFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRV 408
Query: 234 SFNLRNSLIGFTPNKC 249
++ NS +G + C
Sbjct: 409 LIDVPNSRLGISRETC 424
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 130/273 (47%), Gaps = 25/273 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
G F E +TL +++ ++ ++ GCG G F GA G++GLG +SF SQ
Sbjct: 181 GFFSKEALTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQ 240
Query: 50 IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLG 98
+ S FSYCL+D T N + PLL N TFYY+
Sbjct: 241 LGRRFGSKFSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIA 300
Query: 99 LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
+ G+ V G LPI+ + + ID+ GNGG I+DSGT +T + Y + AF + + SP
Sbjct: 301 IKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSP 360
Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS 218
+ FD C + S + +P +SF+ G V P +NY I + C A P S
Sbjct: 361 AEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIET-GDQIKCLAVQPVSQ 419
Query: 219 S--LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S++GN+ QQG + F+ S +GFT C
Sbjct: 420 DGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 125/242 (51%), Gaps = 19/242 (7%)
Query: 16 DNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS---TFSYCLVDRDSDSTSTLE 71
N GCG NN GLF G AGL+GLG S S SQ+ S FSYCL S S++T
Sbjct: 120 KNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCL---PSTSSATGY 176
Query: 72 FDSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
+ P N TA +L + + T Y++ L GISVGG L +S T F+ + G I+D
Sbjct: 177 LNIGNPQNTPGYTA-MLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIID 230
Query: 130 SGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEG 189
SGT +TRL Y+AL+ A + V + DTCYDFS +SV P + HF G
Sbjct: 231 SGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHF-AG 289
Query: 190 KVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
+ +PA +S+ C AFA + S + IIGNVQQ V+++ IGF+
Sbjct: 290 LDVRIPATGVFFVFNSS-QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAG 348
Query: 248 KC 249
C
Sbjct: 349 AC 350
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 127/269 (47%), Gaps = 21/269 (7%)
Query: 1 GDFVTETVTLGSASVDNIAI----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G + TE T S+S + ++ GCG N G A+G++G G LS SQ++ FS
Sbjct: 186 GYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFS 245
Query: 57 YCLVDRDSDSTSTLEF----DSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDL 108
YCL S STL+F D L +A T P+L++ + TFYY+ TG++VG
Sbjct: 246 YCLTPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARR 305
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDGVA 163
L I +AF + G+GG+I+DSGTA+T + AF R SP DGV
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVC 365
Query: 164 LFDTCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
V VP + FHF +G L LP +NY++ G C +
Sbjct: 366 FAAPAVAAGGGRMARQVAVPRMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDG 424
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ IGN QQ RV ++L + F P +C
Sbjct: 425 ATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 93/250 (37%), Positives = 130/250 (52%), Gaps = 22/250 (8%)
Query: 15 VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
VD +A GC G V GL+G G G LSFPSQ + FSYCL + S+ +
Sbjct: 293 VDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFS 352
Query: 68 STLEFDSSLPPNAVTA-PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
STL + P + PLL N + YY+ + GI VGG + + +A D + G
Sbjct: 353 STLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGT 412
Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
IVD+GT TRL Y A+RD F RA P G + FDTCY+ ++ VPTV+F
Sbjct: 413 IVDAGTMFTRLSAPVYAAVRDVFRSRVRA--PVTGPLGGFDTCYNV----TISVPTVTFS 466
Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSIIGNVQQQGTRVSFNLRN 239
F +G+V + LP +N +I S+G C A A S L+++ ++QQQ RV F++ N
Sbjct: 467 F-DGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVAN 525
Query: 240 SLIGFTPNKC 249
+GF+ C
Sbjct: 526 GRVGFSRELC 535
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 120/263 (45%), Gaps = 15/263 (5%)
Query: 1 GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G TET T G+ N+ GCG G GA+G++G+ G LS Q++ + FSYC
Sbjct: 5 GVLATETFTFGAHQNFSANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFSYC 64
Query: 59 LVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
L TS + F + T PLL+N D +YY+ + GIS+G L +
Sbjct: 65 LTPFTDHKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRLDV 124
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
E + G GG ++DS T + L + L+ A + G + + + + C++
Sbjct: 125 PEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDDYPVCFEL 184
Query: 172 S---SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNV 226
S V+VP + HF + LP +Y S G C A AP + ++IGNV
Sbjct: 185 PRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPFEGAPNVIGNV 243
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V ++L N + P KC
Sbjct: 244 QQQNMHVLYDLGNRKFSYAPTKC 266
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 126/264 (47%), Gaps = 22/264 (8%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
++T+ LG ++ N GC + G + GLLGLG G ++ SQ + FSY
Sbjct: 181 LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSY 240
Query: 58 CLVDRDSDSTSTLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
CL S + F SL P + P+LRN + YY+ +TG+SVG +
Sbjct: 241 CL-----PSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWV 295
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ +F D + G +VDSGT +TR Y ALR+ F R A S + FDTC+
Sbjct: 296 KVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCF 355
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSIIGN 225
+ ++ P V+ H G L LP +N LI + C A A +S +++I N
Sbjct: 356 NTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIAN 415
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ RV F++ NS IGF C
Sbjct: 416 LQQQNIRVVFDVANSRIGFAKESC 439
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 93/250 (37%), Positives = 130/250 (52%), Gaps = 22/250 (8%)
Query: 15 VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
VD +A GC G V GL+G G G LSFPSQ + FSYCL + S+ +
Sbjct: 354 VDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFS 413
Query: 68 STLEFDSSLPPNAVTA-PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
STL + P + PLL N + YY+ + GI VGG + + +A D + G
Sbjct: 414 STLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGT 473
Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
IVD+GT TRL Y A+RD F RA P G + FDTCY+ ++ VPTV+F
Sbjct: 474 IVDAGTMFTRLSAPVYAAVRDVFRSRVRA--PVTGPLGGFDTCYNV----TISVPTVTFS 527
Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSIIGNVQQQGTRVSFNLRN 239
F +G+V + LP +N +I S+G C A A S L+++ ++QQQ RV F++ N
Sbjct: 528 F-DGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVAN 586
Query: 240 SLIGFTPNKC 249
+GF+ C
Sbjct: 587 GRVGFSRELC 596
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 130/262 (49%), Gaps = 23/262 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
G+ +T+TL + +V GCGHNN G F GLLGLG G S SQ+ A + FS
Sbjct: 225 GNLARDTLTLSPTDAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFS 284
Query: 57 YCLVDRDSDSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL S +T L F ++ P NA ++ +FYYL LTGI+V G + +
Sbjct: 285 YCLPSSPS-ATGYLSFSGAAAAAPTNAQFTEMVAGQH-PSFYYLNLTGITVAGRAIKVPP 342
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD----AFVRGTRALSPTDGVALFDTCY 169
+ F G I+DSGTA + L Y ALR A R RA S T +FDTCY
Sbjct: 343 SVFAT----AAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSST----IFDTCY 394
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQ 227
D + +V +P+V+ F +G + L L + C AF P +SL ++GN Q
Sbjct: 395 DLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQ 454
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ V +++ N +GF N C
Sbjct: 455 QRTLAVIYDVDNQKVGFGANGC 476
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 86/254 (33%), Positives = 122/254 (48%), Gaps = 13/254 (5%)
Query: 1 GDFVTETVTLGSASVD--NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTF 55
G TET++ D NI IGC G +G +G++GL +S SQ I F
Sbjct: 217 GTLATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLF 276
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
SYC + ST L F +P + +P+ + + Y + +TGISVGG L I +A
Sbjct: 277 SYC-IPSTPGSTGHLTFGGKVPNDVRFSPVSKTAP-SSDYDIKMTGISVGGRKLLIDASA 334
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
FKI + +DSG +TRL + Y+ALR F + D DTCYDFS+ S
Sbjct: 335 FKIAST------IDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYS 388
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
+V +P++S F G + + + V + +C AFA +SI GN QQ+ V F
Sbjct: 389 TVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVF 448
Query: 236 NLRNSLIGFTPNKC 249
+ IGF P C
Sbjct: 449 DGAKERIGFAPGGC 462
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 101/273 (36%), Positives = 145/273 (53%), Gaps = 25/273 (9%)
Query: 1 GDFVTETVTL-------GSA--SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ GS+ V N+ GCGH N GLF GA+GLLGLG G LSF SQ+
Sbjct: 253 GDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQ 312
Query: 52 A---STFSYCLVDRDSDS--TSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLT 100
+ +FSYCLVDR+S++ +S L F D L N + + + ++TFYY+ +
Sbjct: 313 SLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIK 372
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
I VGG L I E + I G+GG I+DSGT ++ Y +++ F + P
Sbjct: 373 SILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIF 432
Query: 160 DGVALFDTCYDFS--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
+ D C++ S +++ +P + F +G V PA+N I + S C A T
Sbjct: 433 RDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWL-SEDLVCLAILGTP 491
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+ SIIGN QQQ + ++ + S +GFTP KC
Sbjct: 492 KSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 127/269 (47%), Gaps = 21/269 (7%)
Query: 1 GDFVTETVTLGSASVDNIAI----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G + TE T S+S + ++ GCG N G A+G++G G LS SQ++ FS
Sbjct: 186 GYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFS 245
Query: 57 YCLVDRDSDSTSTLEF----DSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDL 108
YCL S STL+F D L +A T P+L++ + TFYY+ TG++VG
Sbjct: 246 YCLTPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARR 305
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDGVA 163
L I +AF + G+GG+I+DSGTA+T + AF R SP DGV
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVC 365
Query: 164 LFDTCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
V VP + FHF +G L LP +NY++ G C +
Sbjct: 366 FAAPAVAAGGGRMARQVAVPRMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDG 424
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ IGN QQ RV ++L + F P +C
Sbjct: 425 ATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/264 (32%), Positives = 126/264 (47%), Gaps = 22/264 (8%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
++T+ LG ++ N GC + G + GLLGLG G ++ SQ + FSY
Sbjct: 179 LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSY 238
Query: 58 CLVDRDSDSTSTLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
CL S + F SL P + P+LRN + YY+ +TG+SVG +
Sbjct: 239 CL-----PSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWV 293
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ +F D + G +VDSGT +TR Y ALR+ F R A S + FDTC+
Sbjct: 294 KVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCF 353
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSIIGN 225
+ ++ P V+ H G L LP +N LI + C A A +S +++I N
Sbjct: 354 NTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIAN 413
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ RV F++ NS +GF C
Sbjct: 414 LQQQNIRVVFDVANSRVGFAKESC 437
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 130/255 (50%), Gaps = 13/255 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F TET+TL S++V N GCG N GLF GAAGLLGLG L+ PSQ + FS
Sbjct: 164 GFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFS 223
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S S L + + PL + + FY L +TG+SVGG L I E+AF
Sbjct: 224 YCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAF 282
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G ++DSGT +TRL Y+ L AF T G ++FDTCYDFS +
Sbjct: 283 ------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDT 336
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTRVS 234
V +P V F G + + L PV+ C AFA S SI GNVQQ+ +V
Sbjct: 337 VRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVV 396
Query: 235 FNLRNSLIGFTPNKC 249
++ +GF P C
Sbjct: 397 YDGAKGRVGFAPGGC 411
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 126/273 (46%), Gaps = 25/273 (9%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
G F ET TL + A + IA GC G F GA G++GLG G +S SQ
Sbjct: 184 GFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQ 243
Query: 50 IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVT--------APLLRNHELDTFYYLG 98
+ + FSYCL+D D + T N V PL N TFYY+G
Sbjct: 244 LGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIG 303
Query: 99 LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
+ +SV G LPI+ + + +DE GNGG IVDSGT +T L Y + R R SP
Sbjct: 304 IESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSP 363
Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--T 216
+ FD C + S +P +SF V P +NY + D + C A T
Sbjct: 364 AEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDED-VKCLALQAVMT 422
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S S+IGN+ QQG + F+ + +GF+ + C
Sbjct: 423 PSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 130/255 (50%), Gaps = 13/255 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F TET+TL S++V N GCG N GLF GAAGLLGLG L+ PSQ + FS
Sbjct: 212 GFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFS 271
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S S L + + PL + + FY L +TG+SVGG L I E+AF
Sbjct: 272 YCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 330
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G ++DSGT +TRL Y+ L AF T G ++FDTCYDFS +
Sbjct: 331 ------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDT 384
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTRVS 234
V +P V F G + + L PV+ C AFA S SI GNVQQ+ +V
Sbjct: 385 VRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVV 444
Query: 235 FNLRNSLIGFTPNKC 249
++ +GF P C
Sbjct: 445 YDGAKGRVGFAPGGC 459
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/256 (36%), Positives = 125/256 (48%), Gaps = 15/256 (5%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
G+ V +T+TL S ++ GCG N GLF GL GLG +S PSQ S F+
Sbjct: 237 GNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFT 296
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
YCL S L + P NA TA L + +FYY+ L GI VGG + I TA
Sbjct: 297 YCLPS-SSSGRGYLSLGGAPPANAQFTA--LADGATPSFYYIDLVGIKVGGRAIRIPATA 353
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F ++DSGT +TRL Y LR AF R +++ DTCYDF+
Sbjct: 354 FAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHR 409
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
+ ++PTV F G + L L V C AFAP + SS++I+GN QQ+ V
Sbjct: 410 TAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLAFAPNADDSSIAILGNTQQKTFAV 468
Query: 234 SFNLRNSLIGFTPNKC 249
++++ N IGF C
Sbjct: 469 AYDVANQRIGFGAKGC 484
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 92/263 (34%), Positives = 124/263 (47%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G T+TV LG AS+D GCG +N GLF G AGL+GLG LS SQ FSY
Sbjct: 285 GVLATDTVALGGASLDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSY 344
Query: 58 CLVDRDS-DSTSTLEFDSSLPPNAVTAP-----LLRNHELDTFYYLGLTGISVGGDLLPI 111
CL S D++ +L T P ++ + FY+L +TG +VGG
Sbjct: 345 CLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG----- 399
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTD-GVALFDTCY 169
TA G +++DSGT +TRL Y +R F R A PT G ++ DTCY
Sbjct: 400 --TALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCY 457
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNV 226
D + V+VP ++ G + + A L V +G+ C A A S IIGN
Sbjct: 458 DLTGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNY 517
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV ++ S +GF C
Sbjct: 518 QQKNKRVVYDTVGSRLGFADEDC 540
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/255 (39%), Positives = 130/255 (50%), Gaps = 13/255 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F TET+TL S++V N GCG N GLF GAAGLLGLG L+ PSQ + FS
Sbjct: 224 GFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFS 283
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S S L + + PL + + FY L +TG+SVGG L I E+AF
Sbjct: 284 YCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 342
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G ++DSGT +TRL Y+ L AF T G ++FDTCYDFS +
Sbjct: 343 ------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDT 396
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTRVS 234
V +P V F G + + L PV+ C AFA S SI GNVQQ+ +V
Sbjct: 397 VRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVV 456
Query: 235 FNLRNSLIGFTPNKC 249
++ +GF P C
Sbjct: 457 YDGAKGRVGFAPGGC 471
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/254 (38%), Positives = 132/254 (51%), Gaps = 8/254 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
G T+ VT+G+ + N+A GCG++N G F GA GL+GLG G LS SQ+ + FSY
Sbjct: 176 GALSTDDVTIGTGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSY 235
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
CLV S TS L DS+L P+L N+ TFYY L GISV G + F
Sbjct: 236 CLVPLGSTKTSPLYIGDSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTF 295
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
I +G GG+I+DSGT +T L + +N + A ++ DG + C+ + +
Sbjct: 296 DIAATGRGGLILDSGTTLTYLDVDAFNPMVAA-LKAALPYPEADGSFYGLEYCFSTAGVA 354
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
+ PTV FHF G + L N I +D GT C A A +S+ SI GN+QQ +
Sbjct: 355 NPTYPTVVFHF-NGADVALAPDNTFIALDFEGTTCLAMA-SSTGFSIFGNIQQLNHVIVH 412
Query: 236 NLRNSLIGFTPNKC 249
+L N IGF C
Sbjct: 413 DLVNKRIGFKSANC 426
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/274 (36%), Positives = 144/274 (52%), Gaps = 27/274 (9%)
Query: 1 GDFVTETVTLG---------SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
GDF ET T+ V+N+ GCGH N GLF GA+GLLGLG G LSF SQ+
Sbjct: 255 GDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQ 314
Query: 52 A---STFSYCLVDRDSDS--TSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLT 100
+ +FSYCLVDR+SD+ +S L F D L N + + + ++TFYY+ +
Sbjct: 315 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIK 374
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSP 158
I VGG+ L I E + I G GG I+DSGT ++ Y +++ F + L
Sbjct: 375 SILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVF 434
Query: 159 TDGVALFDTCYDFS--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
D + D C++ S +++ +P + F +G V PA+N I + S C A T
Sbjct: 435 RD-FPVLDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWL-SEDLVCLAILGT 492
Query: 217 -SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+ SIIGN QQQ + ++ + S +GFTP KC
Sbjct: 493 PKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/256 (36%), Positives = 125/256 (48%), Gaps = 15/256 (5%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
G+ V +T+TL S ++ GCG N GLF GL GLG +S PSQ S F+
Sbjct: 237 GNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFT 296
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
YCL S L + P NA TA L + +FYY+ L GI VGG + I TA
Sbjct: 297 YCLPS-SSSGRGYLSLGGAPPANAQFTA--LADGATPSFYYIDLVGIKVGGRAIRIPATA 353
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F ++DSGT +TRL Y LR AF R +++ DTCYDF+
Sbjct: 354 FAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHR 409
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
+ ++PTV F G + L L V C AFAP + SS++I+GN QQ+ V
Sbjct: 410 TAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLAFAPNADDSSIAILGNTQQKTFAV 468
Query: 234 SFNLRNSLIGFTPNKC 249
++++ N IGF C
Sbjct: 469 TYDVANQRIGFGAKGC 484
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/257 (33%), Positives = 121/257 (47%), Gaps = 15/257 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
G +T+TL + V GCG + GLF A GL+GLG +S SQ + + FS
Sbjct: 234 GALARDTLTLTQSDVLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFS 293
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S + L P NA + H+ +FYY+ L G+ V G + +S F
Sbjct: 294 YCLPSSPS-AAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVF 352
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDFSSR 174
G ++DSGT +TRL Y ALR AF R G +++ DTCYDF+
Sbjct: 353 SA-----AGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGH 407
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTR 232
++V +P+V+ F G + L L V C AFAP + IIGN QQ+
Sbjct: 408 TTVRIPSVALVFAGGAAVGLDFSGVLY-VAKVSQACLAFAPNGDGADAGIIGNTQQKTLA 466
Query: 233 VSFNLRNSLIGFTPNKC 249
V +++ IGF N C
Sbjct: 467 VVYDVARQKIGFGANGC 483
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 82/251 (32%), Positives = 119/251 (47%), Gaps = 18/251 (7%)
Query: 14 SVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD-SDSTSTLE 71
+V +A GCG N G+F +G+ G G G LS PSQ+ FSYCL D ++S T
Sbjct: 201 AVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHDETESNKTSA 260
Query: 72 FDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
PPN + A P++ + TFYYL L GI+VG LP+ + F + + G
Sbjct: 261 VFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDG 320
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR---SSVEV 179
+GG ++DSGT VT + L++ FV L D + F V V
Sbjct: 321 SGGTVIDSGTGVTTFPAAVFEQLKNEFV-AQLPLPRYDNTSEVGNLLCFQRPKGGKQVPV 379
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSN-GTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
P + FH + LP +NY IP D++ G C + +IGN QQQ + +++
Sbjct: 380 PKLIFHLASAD-MDLPRENY-IPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVE 437
Query: 239 NSLIGFTPNKC 249
NS + F +C
Sbjct: 438 NSKLLFASAQC 448
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/257 (33%), Positives = 128/257 (49%), Gaps = 16/257 (6%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G TET+T+ + V +N IGCG N G F G AGLLGLG ++ PSQ +++ FS
Sbjct: 223 GFLATETLTITPSDVFENFVIGCGERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFS 282
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S ST L F + A P+ ++ Y L ++GISVGG LPI + F
Sbjct: 283 YCL-PASSSSTGHLSFGGGVSQAAKFTPI--TSKIPELYGLDVSGISVGGRKLPIDPSVF 339
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS- 175
+ G I+DSGT +T L + ++AL AF + T G + CYDFS +
Sbjct: 340 R-----TAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHAN 394
Query: 176 -SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTR 232
++ +P +S F G + + I + C AF + ++I GNVQQ+
Sbjct: 395 DNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYE 454
Query: 233 VSFNLRNSLIGFTPNKC 249
V +++ ++GF P C
Sbjct: 455 VVYDVAKGMVGFAPGGC 471
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 91/266 (34%), Positives = 126/266 (47%), Gaps = 24/266 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G T+TV LG AS+ GCG +N GLF G AGL+GLG LS SQ + FSY
Sbjct: 246 GVLATDTVALGGASLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSY 305
Query: 58 CLVDRDS-DSTSTLEF---DSSLPPNAVTAP-----LLRNHELDTFYYLGLTGISVGGDL 108
CL S D++ +L D + T P ++ + FY+L +TG +VGG
Sbjct: 306 CLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG-- 363
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFD 166
TA G +++DSGT +TRL Y A+R F+R G G ++ D
Sbjct: 364 -----TALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILD 418
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSII 223
TCYD + V+VP ++ G + + A L V +G+ C A A S II
Sbjct: 419 TCYDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPII 478
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN QQ+ RV ++ S +GF C
Sbjct: 479 GNYQQKNKRVVYDTLGSRLGFADEDC 504
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS SV N GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 223 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 282
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S S+ L S P P+ + D+ Y++ +TGI V G L +S +A+
Sbjct: 283 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 342
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ I+DSGT +TRL T Y+AL A + ++ DTC+ + +
Sbjct: 343 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 396
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP V+ F G L L A+N L+ VDS T C AFAP S+ +IIGN QQQ V +++
Sbjct: 397 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 454
Query: 238 RNSLIGFTPNKC 249
+NS IGF C
Sbjct: 455 KNSKIGFAAGGC 466
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 89/259 (34%), Positives = 124/259 (47%), Gaps = 25/259 (9%)
Query: 1 GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TF 55
G + +T+TL AS V GC H G GL+GLGGG+ S SQ A+ +F
Sbjct: 220 GTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSF 279
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
SYCL S VT +LR+ ++ TFY L I+VGG L +S +
Sbjct: 280 SYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSV 339
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F G +VDSGT +TRL Y+AL AF G + ++ DTC+DF+ ++
Sbjct: 340 FA------AGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQT 393
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPT--SSSLSIIGNVQQQG 230
+ +PTV+ F G + L D NG C AFA T + IIGNVQQ+
Sbjct: 394 QISIPTVALVFSGGAAIDL---------DPNGIMYGNCLAFAATGDDGTTGIIGNVQQRT 444
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V +++ +S +GF C
Sbjct: 445 FEVLYDVGSSTLGFRSGAC 463
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 92/265 (34%), Positives = 127/265 (47%), Gaps = 29/265 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ + VTL + S+ + GC G + GLLGLG G +S SQ + STFSY
Sbjct: 179 ANLSQDVVTLATDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSY 238
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S +L F SL P T PLL+N + YY+ L I VG ++
Sbjct: 239 CL-----PSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVD 293
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTC 168
I +A + + G I DSGT TRL Y A+RDAF + G ++ G FDTC
Sbjct: 294 IPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGG---FDTC 350
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIG 224
Y S + PT++F F G + LP N LI ++ C A A +S L++I
Sbjct: 351 YT----SPIVAPTITFMF-SGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIA 405
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N+QQQ R+ F++ NS +G C
Sbjct: 406 NMQQQNHRILFDVPNSRLGVAREPC 430
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 102/261 (39%), Positives = 129/261 (49%), Gaps = 20/261 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G F +T+ + ++ GCG N GLF AGLLGLG G S Q +FSY
Sbjct: 251 GFFAKDTLAVAQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSY 310
Query: 58 CLVDRDSDSTSTLEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
CL S +T LEF SS NA T P+L + + TFYY+GLTGI VGG L I
Sbjct: 311 CL-PASSAATGYLEFGPLSPSSSGSNAKTTPMLTD-KGPTFYYVGLTGIRVGGKQLGAIP 368
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA--LFDTCYD 170
E+ F N G +VDSGT +TRL Y AL AF A A + DTCYD
Sbjct: 369 ESVFS-----NSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYD 423
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
F+ S V +PTVS F G L L A + + S C FA S+ I+GN QQ
Sbjct: 424 FTGLSQVSLPTVSLVFQGGACLDLDASGIVYAI-SQSQVCLGFASNGDDESVGIVGNTQQ 482
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ V +++ ++GF P C
Sbjct: 483 RTYGVLYDVSKKVVGFAPGAC 503
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 127/249 (51%), Gaps = 17/249 (6%)
Query: 8 VTLGSASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TFSYCLVDRD 63
+TLGS+++ + GC + G F GL+GLGGG+ S SQ + FSYCL
Sbjct: 220 LTLGSSAMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTS 279
Query: 64 SDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
S TL SS V P+LR+ ++ T+Y + L I VG L + + F
Sbjct: 280 GSSGFLTLGTGSS---GFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVF------ 330
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTV 182
+ G ++DSGT +TRL Y+AL AF G + P + DTC+DFS +SS+ +PTV
Sbjct: 331 SAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTV 390
Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRVSFNLRNS 240
+ F G + L ++ + S+ C AF P SSL IIGNVQQ+ V +++
Sbjct: 391 TLVFSGGAAVDLAFDGIMLEISSS-IRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGG 449
Query: 241 LIGFTPNKC 249
+GF C
Sbjct: 450 AVGFKAGAC 458
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS SV N GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 223 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 282
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S S+ L S P P+ + D+ Y++ +TGI V G L +S +A+
Sbjct: 283 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 342
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ I+DSGT +TRL T Y+AL A + ++ DTC+ + +
Sbjct: 343 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 396
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP V+ F G L L A+N L+ VDS T C AFAP S+ +IIGN QQQ V +++
Sbjct: 397 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 454
Query: 238 RNSLIGFTPNKC 249
+NS IGF C
Sbjct: 455 KNSKIGFAAGGC 466
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS SV N GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 221 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 280
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S S+ L S P P+ + D+ Y++ +TGI V G L +S +A+
Sbjct: 281 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 340
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ I+DSGT +TRL T Y+AL A + ++ DTC+ + +
Sbjct: 341 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 394
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP V+ F G L L A+N L+ VDS T C AFAP S+ +IIGN QQQ V +++
Sbjct: 395 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 452
Query: 238 RNSLIGFTPNKC 249
+NS IGF C
Sbjct: 453 KNSKIGFAAGGC 464
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/261 (34%), Positives = 124/261 (47%), Gaps = 18/261 (6%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G F +T+TL V D GCG NN GLF AGL+GLG LS Q FS
Sbjct: 247 GFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFS 306
Query: 57 YCL-VDRDSDSTSTLE-----FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
YCL R S+ T S N +T + + TFY++ + GISVGG L
Sbjct: 307 YCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALS 366
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
IS F+ N G I+DSGT +TRL + Y +L+ F + ++L DTCYD
Sbjct: 367 ISPMLFQ-----NAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYD 421
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQ 228
S+ +S+ +P +SF+F + L LI + C AFA ++ I GN+QQ
Sbjct: 422 LSNYTSISIPKISFNFNGNANVDLEPNGILI-TNGASQVCLAFAGNGDDDTIGIFGNIQQ 480
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V +++ +GF C
Sbjct: 481 QTLEVVYDVAGGQLGFGYKGC 501
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 126/263 (47%), Gaps = 23/263 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
V +T+TL + N + GC ++ G + GL+GLG G +S SQ + + FSY
Sbjct: 195 ASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 254
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S + F SL P + PLLRN + YY+ LTG+SVG +P
Sbjct: 255 CL-----PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 309
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+ D + G I+DSGT +TR Y A+RD F R +S + FDTC
Sbjct: 310 VDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLGAFDTC-- 366
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
FS+ + P ++ H L LP +N LI + C + A ++ L++I N+
Sbjct: 367 FSADNENVAPKITLHMTSLD-LKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANL 425
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ R+ F++ NS IG P C
Sbjct: 426 QQQNLRILFDVPNSRIGIAPEPC 448
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS SV N GCG +NEGLF +AGL+GL LS Q+ S +FSY
Sbjct: 221 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 280
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S S+ L S P P+ + D+ Y++ +TGI V G L +S +A+
Sbjct: 281 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 340
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
+ I+DSGT +TRL T Y+AL A + ++ DTC+ + +
Sbjct: 341 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 394
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP V+ F G L L A+N L+ VDS T C AFAP S+ +IIGN QQQ V +++
Sbjct: 395 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 452
Query: 238 RNSLIGFTPNKC 249
+NS IGF C
Sbjct: 453 KNSKIGFAAAGC 464
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/257 (35%), Positives = 130/257 (50%), Gaps = 16/257 (6%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
V +++ LG + N + GC + G + GL+GLG G LS SQ + + FSYCL
Sbjct: 185 LVQDSLHLGPNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCL 244
Query: 60 VDRDSDSTS-TLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
S S +L+ P A+ T PLL N + YY+ LTGISVG L+PIS
Sbjct: 245 PSFKSYYFSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLA 304
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-RALSPTDGVALFDTCYDFSSRSS 176
D + G I+DSGT +TR Y A+RD F + + SP + FDTC F++ +
Sbjct: 305 FDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSP---LGAFDTC--FATNNE 359
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSIIGNVQQQGTR 232
V P ++ H G L LP +N LI + C A A +S +++I N+QQQ R
Sbjct: 360 VSAPAITLHL-SGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHR 418
Query: 233 VSFNLRNSLIGFTPNKC 249
+ F++ NS +G C
Sbjct: 419 ILFDINNSKLGIARELC 435
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 126/263 (47%), Gaps = 23/263 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
V +T+TL + N + GC ++ G + GL+GLG G +S SQ + + FSY
Sbjct: 121 ASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 180
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S + F SL P + PLLRN + YY+ LTG+SVG +P
Sbjct: 181 CL-----PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 235
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+ D + G I+DSGT +TR Y A+RD F R +S + FDTC
Sbjct: 236 VDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLGAFDTC-- 292
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
FS+ + P ++ H L LP +N LI + C + A ++ L++I N+
Sbjct: 293 FSADNENVAPKITLHMTSLD-LKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANL 351
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ R+ F++ NS IG P C
Sbjct: 352 QQQNLRILFDVPNSRIGIAPEPC 374
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 18/266 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIG--CGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G +ET T G+ ++ +G CG + G +GA G+LGL SLS +Q+ FSYC
Sbjct: 108 GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYC 167
Query: 59 LVDRDSDSTSTLEFDS--SLPPNAVTAPL-----LRNHELDTFYYLGLTGISVGGDLLPI 111
L TS L F + L + T P+ + N +YY+ L GIS+G L +
Sbjct: 168 LTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAV 227
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ + G GG IVDSG+ V L + A+++A + R V ++ C+
Sbjct: 228 PAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVL 287
Query: 172 SSRS------SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSII 223
R+ +V+VP + HF G + LP NY + G C A T+ S +SII
Sbjct: 288 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-GLMCLAVGKTTDGSGVSII 346
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQQ V F++++ F P +C
Sbjct: 347 GNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 93/258 (36%), Positives = 133/258 (51%), Gaps = 20/258 (7%)
Query: 6 ETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCLVD 61
ET++L S + A GCG N G F G GL+GLG G+LS PSQ A +TFSYCL
Sbjct: 254 ETLSLSSTRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPS 313
Query: 62 RDSDSTSTLEFDSSLPP------NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
D+ + L S+ P + +++ + + Y++ + I +GG +LP+ T
Sbjct: 314 YDT-THGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTV 372
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F D G + DSGT +T L E Y +LRD F P FDTCYDF+ +
Sbjct: 373 FTRD-----GTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHN 427
Query: 176 SVEVPTVSFHFPEGKVLPL-PAKNYLIPVDSN-GTFCFAFAPTSSSL--SIIGNVQQQGT 231
++ +P V+F F +G V L P + P D+ T C AF P S++ +IIGN QQ+GT
Sbjct: 428 AIFMPAVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGT 487
Query: 232 RVSFNLRNSLIGFTPNKC 249
V +++ IGF C
Sbjct: 488 EVIYDVAAEKIGFGQFTC 505
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/280 (34%), Positives = 137/280 (48%), Gaps = 31/280 (11%)
Query: 1 GDFVTETVTLGS-------ASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQIN 51
GDF + + L S ++A GC H+ +G V G+ G++G G+LS PSQ+
Sbjct: 89 GDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLK 148
Query: 52 ----ASTFSYCLVDRDSDSTST---LEFDSSLPPNAVT-APLLRNH---ELDTFYYLGLT 100
S FSYC + +T DS L + V+ PLL N YY+GLT
Sbjct: 149 DRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLT 208
Query: 101 GISVGGDLLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSP 158
ISV G L I E+AFK+D S G+GG ++DSGT TR+ + Y A R+AF R+ L
Sbjct: 209 SISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRK 268
Query: 159 TDGVAL-FDTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF 213
G A FD CY+ S+ SS+ VP V L L ++ +PV + G T C A
Sbjct: 269 KVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAI 328
Query: 214 APTSSS----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ S ++++GN QQ V ++ S +GF C
Sbjct: 329 LSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 93/266 (34%), Positives = 135/266 (50%), Gaps = 35/266 (13%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN-----NEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G ++ +TLGS + N + GC + + + G L + + +++ TF
Sbjct: 201 GTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTF 260
Query: 56 SYCLV------------DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
SYCL + S+S+L+F + L+++ + TFY++ L IS
Sbjct: 261 SYCLPSSSTSSGSLVLGKEAAVSSSSLKFTT----------LIKDPSIPTFYFVTLKAIS 310
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
VG + + T + GG I+DSGT +T L Y ALRDAF + +L PT V
Sbjct: 311 VGNTRISVPGTNI----ASGGGTIIDSGTTITHLVPSAYTALRDAFRQQLSSLQPTP-VE 365
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
DTCYD SS SSV+VPT++ H L LP +N LI +S G C AF+ T S SII
Sbjct: 366 DMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILITQES-GLACLAFSSTDSR-SII 422
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQQ R+ F++ NS +GF +C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 126/263 (47%), Gaps = 24/263 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ V +T+TL + N + GC ++ G + GL+GLG G +S SQ + + FSY
Sbjct: 196 ANLVQDTLTLSPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 255
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S + F SL P + PLLRN + YY+ LTG+SVG +P
Sbjct: 256 CL-----PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 310
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+ D + G I+DSGT +TR Y A+RD F + T G FDTC
Sbjct: 311 VDPVYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGA--FDTC-- 366
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
FS+ + P ++ H L LP +N LI + C + A ++ L++I N+
Sbjct: 367 FSADNENVTPKITLHMTSLD-LKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANL 425
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ R+ F++ NS IG P C
Sbjct: 426 QQQNLRILFDVPNSRIGIAPEPC 448
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 18/266 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIG--CGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G +ET T G+ ++ +G CG + G +GA G+LGL SLS +Q+ FSYC
Sbjct: 184 GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYC 243
Query: 59 LVDRDSDSTSTLEFDS--SLPPNAVTAPL-----LRNHELDTFYYLGLTGISVGGDLLPI 111
L TS L F + L + T P+ + N +YY+ L GIS+G L +
Sbjct: 244 LTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAV 303
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ + G GG IVDSG+ V L + A+++A + R V ++ C+
Sbjct: 304 PAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVL 363
Query: 172 SSRS------SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSII 223
R+ +V+VP + HF G + LP NY + G C A T+ S +SII
Sbjct: 364 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-GLMCLAVGKTTDGSGVSII 422
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQQ V F++++ F P +C
Sbjct: 423 GNVQQQNMHVLFDVQHHKFSFAPTQC 448
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 18/266 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIG--CGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G +ET T G+ ++ +G CG + G +GA G+LGL SLS +Q+ FSYC
Sbjct: 106 GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYC 165
Query: 59 LVDRDSDSTSTLEFDS--SLPPNAVTAPL-----LRNHELDTFYYLGLTGISVGGDLLPI 111
L TS L F + L + T P+ + N +YY+ L GIS+G L +
Sbjct: 166 LTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAV 225
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ + G GG IVDSG+ V L + A+++A + R V ++ C+
Sbjct: 226 PAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVL 285
Query: 172 SSRS------SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSII 223
R+ +V+VP + HF G + LP NY + G C A T+ S +SII
Sbjct: 286 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-GLMCLAVGKTTDGSGVSII 344
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQQ V F++++ F P +C
Sbjct: 345 GNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 92/257 (35%), Positives = 129/257 (50%), Gaps = 22/257 (8%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINAS---TF 55
G + ++T+TL + A+V GCGH G LF G GLLG G S Q + F
Sbjct: 228 GVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVF 287
Query: 56 SYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
SYCL + S + TL S + P T LL + T+Y + LTGISVGG L + +
Sbjct: 288 SYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPAS 347
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
AF G +VD+GT +TRL Y ALR AF G + + + DTCY F+
Sbjct: 348 AFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGY 401
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+V + +V+ F G + L A + S G FA + + S++I+GNVQQ+ S
Sbjct: 402 GTVNLTSVALTFSSGATMTLGADGIM----SFGCLAFASSGSDGSMAILGNVQQR----S 453
Query: 235 FNLR--NSLIGFTPNKC 249
F +R S +GF P+ C
Sbjct: 454 FEVRIDGSSVGFRPSSC 470
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 97/280 (34%), Positives = 136/280 (48%), Gaps = 31/280 (11%)
Query: 1 GDFVTETVTLGS-------ASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQIN 51
GDF + + L S ++A GC H+ +G V G+ G++G G+LS PSQ+
Sbjct: 190 GDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLK 249
Query: 52 ----ASTFSYCLVDRDSDSTST---LEFDSSLPPNAV-TAPLLRNH---ELDTFYYLGLT 100
S FSYC + +T DS L + V PLL N YY+GLT
Sbjct: 250 DRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLT 309
Query: 101 GISVGGDLLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSP 158
ISV G L I E+AFK+D S G+GG ++DSGT TR+ + Y A R+AF R+ L
Sbjct: 310 SISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRK 369
Query: 159 TDGVAL-FDTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF 213
G A FD CY+ S+ SS+ VP V L L ++ +PV + G T C A
Sbjct: 370 KVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAI 429
Query: 214 APTSSS----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ S ++++GN QQ V ++ S +GF C
Sbjct: 430 LSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 89/261 (34%), Positives = 124/261 (47%), Gaps = 18/261 (6%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G F + +TL V D GCG NN+GLF AGL+GLG LS Q FS
Sbjct: 247 GFFAKDKLTLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFS 306
Query: 57 YCL-VDRDSDSTSTLE-----FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
YCL R S+ T S N +T + + +Y++ + GISVGG L
Sbjct: 307 YCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALS 366
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
IS F+ N G I+DSGT +TRL + Y +L+ AF + ++L DTCYD
Sbjct: 367 ISPMLFQ-----NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYD 421
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQ 228
S+ +S+ +P +SF+F + L LI + C AFA S+ I GN+QQ
Sbjct: 422 LSNYTSISIPKISFNFNGNANVELDPNGILI-TNGASQVCLAFAGNGDDDSIGIFGNIQQ 480
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V +++ +GF C
Sbjct: 481 QTLEVVYDVAGGQLGFGYKGC 501
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 117/263 (44%), Gaps = 15/263 (5%)
Query: 1 GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G TET T G+ N+ GCG G A+G+LGL G LS Q+ + FSYC
Sbjct: 195 GVLATETFTFGAHHGVSANLTFGCGKLANGTIAEASGILGLSPGPLSMLKQLAITKFSYC 254
Query: 59 LVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
L TS + F + T PLL+N D +YY+ + G+SVG L +
Sbjct: 255 LTPFADRKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDV 314
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ I G GG ++DS T + L + L+ A + G + V + C++
Sbjct: 315 PQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDDYPVCFEL 374
Query: 172 S---SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNV 226
S V+VP + HF + LP NY S G C A AP + ++IGNV
Sbjct: 375 PRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQE-PSPGMMCLAVMQAPFEGAPNVIGNV 433
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ V +++ N + P KC
Sbjct: 434 QQQNMHVLYDVGNRKFSYAPTKC 456
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 87/250 (34%), Positives = 132/250 (52%), Gaps = 20/250 (8%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S ++ + GCG +N+GLF G++GL LS SQ++ FSYCL S S
Sbjct: 210 SQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNS 269
Query: 69 TLEF-----DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
E SSL P++ PLL+N + Y++ L I+V G L ++ +++K+
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT- 328
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEV- 179
I+DSGT +TRL T Y L++A+V ++ G++L DTC+ S EV
Sbjct: 329 -----IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA 383
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
P + F G L L N L+ +++ G C A A SSS++IIGN QQQ +V++++ N
Sbjct: 384 PDIRIIFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGN 441
Query: 240 SLIGFTPNKC 249
S +GF P C
Sbjct: 442 SRVGFAPGGC 451
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 87/250 (34%), Positives = 132/250 (52%), Gaps = 20/250 (8%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S ++ + GCG +N+GLF G++GL LS SQ++ FSYCL S S
Sbjct: 210 SQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNS 269
Query: 69 TLEF-----DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
E SSL P++ PLL+N + Y++ L I+V G L ++ +++K+
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT- 328
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEV- 179
I+DSGT +TRL T Y L++A+V ++ G++L DTC+ S EV
Sbjct: 329 -----IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA 383
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
P + F G L L N L+ +++ G C A A SSS++IIGN QQQ +V++++ N
Sbjct: 384 PDIRIIFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGN 441
Query: 240 SLIGFTPNKC 249
S +GF P C
Sbjct: 442 SRVGFAPGGC 451
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/266 (34%), Positives = 138/266 (51%), Gaps = 27/266 (10%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINAST 54
G+ ++T+TL S S IGCGH N+G F +G++GLG G LS SQ+ +S
Sbjct: 182 GNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSV 241
Query: 55 ---FSYCLVDRDSDS--TSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV S + +S L F S+ P + PLL + + +FY+L L +SVG
Sbjct: 242 GGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGN 301
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVA 163
+ + +++ +G G II+DSGT +T + + ++ L A V G RA P+
Sbjct: 302 ERIKFGDSSLG---TGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPS---G 355
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
CY S+ S ++VP ++ HF G + L N + V S+ C AFA T+S +SI
Sbjct: 356 FLSVCY--SATSDLKVPAITAHF-TGADVKLKPINTFVQV-SDDVVCLAFASTTSGISIY 411
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNV Q V +N++ + F P C
Sbjct: 412 GNVAQMNFLVEYNIQGKSLSFKPTDC 437
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 138/265 (52%), Gaps = 23/265 (8%)
Query: 5 TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
+ET T GS+ D IA GC + + + G+AGL+GLG G LS SQ+ A FSYCL
Sbjct: 189 SETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 248
Query: 60 VD-RDSDSTSTLEFDSSLPPNAVTAPLLR---------NHELDTFYYLGLTGISVGGDLL 109
+D+ S STL + A+ +R + T+YYL LTGISVG L
Sbjct: 249 TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAAL 308
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDT 167
PI AF + G GG+I+DSGT +T L Y +R A VR L TDG D
Sbjct: 309 PIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDL 367
Query: 168 CYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIG 224
C+ S S+ +P+++ HF G + LP +NY+I +D G +C A + T LS +G
Sbjct: 368 CFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLG 425
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ + ++++ + F P KC
Sbjct: 426 NYQQQNLHILYDVQKETLSFAPAKC 450
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/277 (32%), Positives = 125/277 (45%), Gaps = 45/277 (16%)
Query: 1 GDFVTETVTLG-------SASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINA 52
G+ T+ T G S + GCGH N+G+F G+ G G G S PSQ+N
Sbjct: 177 GEIATDRFTFGDSGGSGESLHTRRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNV 236
Query: 53 STFSYCLVDRDSDSTSTLEFDSSL-----PPNAV----------TAPLLRNHELDTFYYL 97
++FSYC TS E SSL P A+ T P+L+N + Y+L
Sbjct: 237 TSFSYCF-------TSMFESKSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFL 289
Query: 98 GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
L GISVG LP+ ET F+ I+DSG ++T L E Y A++ F L
Sbjct: 290 SLKGISVGKTRLPVPETKFR-------STIIDSGASITTLPEEVYEAVKAEFA-AQVGLP 341
Query: 158 PT--DGVALFDTCYDF---SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
P+ +G AL D C+ + VP+++ H EG LP NY+ C
Sbjct: 342 PSGVEGSAL-DLCFALPVTALWRRPAVPSLTLHL-EGADWELPRSNYVFEDLGARVMCIV 399
Query: 213 FAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++IGN QQQ T V ++L N + F P +C
Sbjct: 400 LDAAPGEQTVIGNFQQQNTHVVYDLENDRLSFAPARC 436
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 29/265 (10%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + TET+TL V + GCG + G + GLLGLGG S SQ ++ FS
Sbjct: 270 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 329
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGD 107
YCL S L + PPN+ ++ P+ R + TFY + LTGISVGG
Sbjct: 330 YCL-PPTSGGAGFLTLGA--PPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGA 386
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVAL 164
L I +AF + G+++DSGT +T L Y ALR AF R L P++G +
Sbjct: 387 PLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GV 439
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
DTCYDF+ ++V VPT+S F G + L A ++ VD G FA A T +++ IIG
Sbjct: 440 LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIG 496
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
NV Q+ V ++ +GF C
Sbjct: 497 NVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 128/261 (49%), Gaps = 16/261 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G F ET T+G V+++A GCG+ N+G FV A G+LGLG G+LSF SQ + F+Y
Sbjct: 132 GVFAYETATVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAY 191
Query: 58 CLVDRDSDST--STLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL S ++ S+L F + + PL+ N + YY+ + I GG+ L I
Sbjct: 192 CLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIP 251
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVALFDTCY 169
++A+KID GNGG I DSGT VT + Y + AF + RA G+ L C
Sbjct: 252 DSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPL---CV 308
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSIIGNVQQ 228
+ S P+ + F +G NY I V N C A +SS ++IGN+ Q
Sbjct: 309 NVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPN-IDCLAMLESSSDGFNVIGNIIQ 367
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V ++ IGF C
Sbjct: 368 QNYLVQYDREEHRIGFAHANC 388
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/253 (37%), Positives = 130/253 (51%), Gaps = 10/253 (3%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G E TL ++ V +++ GCG NN+GLF G AGLLGLG G LS P+Q + FS
Sbjct: 218 GFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S+ST L F S+ +V + + Y + + GISVG L I+ +F
Sbjct: 278 YCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSF 337
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G I+DSGT TRL T+ Y LR F + T G LFDTCYDF+ +
Sbjct: 338 STE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDT 392
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V PT++F F G V+ L +P+ + C AFA +I GNVQQ V ++
Sbjct: 393 VTYPTIAFSFAGGTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYD 451
Query: 237 LRNSLIGFTPNKC 249
+ +GF PN C
Sbjct: 452 VAGGRVGFAPNGC 464
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/255 (38%), Positives = 126/255 (49%), Gaps = 14/255 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G F TET+T+ S+ V N GCG +N GLF AAGLLGL S+S PSQ FS
Sbjct: 227 GFFATETLTISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFS 286
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S ST L F + A P+ + +FY + + GISV G LPI + F
Sbjct: 287 YCLPSTPS-STGYLNFGGKVSQTAGFTPI--SPAFSSFYGIDIVGISVAGSQLPIDPSIF 343
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
G I+DSGT +TRL Y AL++AF T+G L DTCYDFS+ ++
Sbjct: 344 T-----TSGAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTT 398
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
V P VS F G + + A L V+ C AFA S I GN QQ+ V
Sbjct: 399 VSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVV 458
Query: 235 FNLRNSLIGFTPNKC 249
++ +IGF C
Sbjct: 459 YDGAKGMIGFAAGAC 473
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 125/260 (48%), Gaps = 11/260 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINA---STF 55
++ + LG ++ N A GC G + GLLGLG G ++ SQ+ F
Sbjct: 172 ASLASDWLHLGKDAIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVF 231
Query: 56 SYCLVDRDSDSTS-TLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISE 113
SYCL S S +L ++ P V P+L+N + YY+ +TG+SVG + +
Sbjct: 232 SYCLPSYKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPA 291
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
+F D + G +VDSGT +TR Y ALR+ F R A S + FDTC++
Sbjct: 292 GSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDE 351
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS----LSIIGNVQQQ 229
++ P V+ H G L LP +N LI + C A A + ++++ N+QQQ
Sbjct: 352 VAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQ 411
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV F++ NS +GF C
Sbjct: 412 NLRVVFDVANSRVGFARESC 431
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/268 (33%), Positives = 131/268 (48%), Gaps = 20/268 (7%)
Query: 1 GDFVTETVTLG-SASVD-NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G +ET T G +A V + GCG + G VGA+GL+GL G +S SQ++ FSYC
Sbjct: 180 GVLASETFTFGVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYC 239
Query: 59 LVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-FYYLGLTGISVGGDLLP 110
L TS L F + T +LRN ++T +YY+ L G+S+G L
Sbjct: 240 LTPFAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLD 299
Query: 111 ISETAF-KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR---ALSPTDGVALFD 166
+ T+ I G+GG IVDSG+ ++ L+ + A++ A V R A + ++
Sbjct: 300 VPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDDYE 359
Query: 167 TCYDFS---SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLS 221
C+ + +V+ P + HF G + LP NY + G C A +P +S
Sbjct: 360 LCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRA-GLMCLAVGTSPDGFGVS 418
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGNVQQQ V F++RN F P KC
Sbjct: 419 IIGNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 29/265 (10%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + TET+TL V + GCG + G + GLLGLGG S SQ ++ FS
Sbjct: 190 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 249
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGD 107
YCL S L + PPN+ ++ P+ R + TFY + LTGISVGG
Sbjct: 250 YCL-PPTSGGAGFLTLGA--PPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGA 306
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVAL 164
L I +AF + G+++DSGT +T L Y ALR AF R L P++G +
Sbjct: 307 PLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GV 359
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
DTCYDF+ ++V VPT+S F G + L A ++ VD G FA A T +++ IIG
Sbjct: 360 LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIG 416
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
NV Q+ V ++ +GF C
Sbjct: 417 NVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/271 (33%), Positives = 127/271 (46%), Gaps = 27/271 (9%)
Query: 1 GDFVTETVTLGS-----ASVDN------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ 49
G +T+TLG+ AS +N GCG NN GLF A GL GLG G +S SQ
Sbjct: 177 GHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQ 236
Query: 50 INAST---FSYCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
FSYCL S++ L + P +A P+L +FYY+ L GI V
Sbjct: 237 AAGKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRV 296
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGV 162
G + +S G+IVDSGT +TRL Y+ALR AF+ G +
Sbjct: 297 AGRAIKVSSRPALWP----AGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRL 352
Query: 163 ALFDTCYDFSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-- 218
++ DTCYDF++ ++V +P V+ F G + + L V C AFAP +
Sbjct: 353 SILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGNGR 411
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S I+GN QQ+ V +++ IGF C
Sbjct: 412 SAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/275 (34%), Positives = 129/275 (46%), Gaps = 27/275 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEG------LFVGAAGLLGLGGGSLSFPSQ 49
G F ET TL S S + ++ GCG G F GA G++GLG GS+SF SQ
Sbjct: 183 GFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQ 242
Query: 50 IN---ASTFSYCLVDR--DSDSTSTLEFD---SSLPPNAVTA----PLLRNHELDTFYYL 97
+ + FSYCL+D TS L SLP T PL N TFYY+
Sbjct: 243 LGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYI 302
Query: 98 GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
+ I++ G LPI+ ++IDE GNGG +VDSGT +T L Y + + R + +
Sbjct: 303 TIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPN 362
Query: 158 PTDGVALFDTCYDFSSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
+ FD C + S S +P + F G V P +NY + + G C A
Sbjct: 363 AAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETE-EGVMCLAIRAV 421
Query: 217 SS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S S+IGN+ QQG + F+ S +GFT C
Sbjct: 422 ESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 93/266 (34%), Positives = 135/266 (50%), Gaps = 35/266 (13%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN-NEGLF----VGAAGLLGLGGGSLSFPSQINASTF 55
G ++ +TLGS + N + GC + +E + + G L + + +++ TF
Sbjct: 201 GTLASDAITLGSQYLPNFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTF 260
Query: 56 SYCLV------------DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
SYCL + S+S+L+F + L+++ TFY++ L IS
Sbjct: 261 SYCLPSSSTSSGSLVLGKEAAVSSSSLKFTT----------LIKDPSFPTFYFVTLKAIS 310
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
VG + + T + GG I+DSGT +T L Y LRDAF + +L PT V
Sbjct: 311 VGNTRISVPATNI----ASGGGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSLQPTP-VE 365
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
DTCYD SS SSV+VPT++ H L LP +N LI +S G C AF+ T S SII
Sbjct: 366 DMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILITQES-GLSCLAFSSTDSR-SII 422
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQQ R+ F++ NS +GF +C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 138/265 (52%), Gaps = 23/265 (8%)
Query: 5 TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
+ET T GS+ D IA GC + + + G+AGL+GLG G LS SQ+ A FSYCL
Sbjct: 189 SETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 248
Query: 60 VD-RDSDSTSTLEFDSSLPPNAVTAPLLR---------NHELDTFYYLGLTGISVGGDLL 109
+D+ S STL + A+ +R + T+YYL LTGISVG L
Sbjct: 249 TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAAL 308
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDT 167
PI AF + G GG+I+DSGT +T L Y +R A VR L TDG D
Sbjct: 309 PIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDL 367
Query: 168 CYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIG 224
C+ S S+ +P+++ HF G + LP +NY+I +D G +C A + T LS +G
Sbjct: 368 CFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLG 425
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ + ++++ + F P KC
Sbjct: 426 NYQQQNLHILYDVQKETLSFAPAKC 450
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 93/250 (37%), Positives = 131/250 (52%), Gaps = 22/250 (8%)
Query: 15 VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
VD IA GC G V + GL+G G LSFPSQ + S FSYCL + S+ +
Sbjct: 321 VDAIAAYTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFS 380
Query: 68 STLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
TL + P + T PLL N + YY+ + GI VGG + + +A D + G
Sbjct: 381 GTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVAVPASALAFDPASGHGT 440
Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
IVD+GT TRL Y A+ D F RA P G + FDTCY+ ++ VPTV+F
Sbjct: 441 IVDAGTMFTRLSAPVYAAVCDVFRSRVRA--PVAGPLGGFDTCYNV----TISVPTVTFL 494
Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFA--PTSS---SLSIIGNVQQQGTRVSFNLRN 239
F +G+V + LP +N +I +G C A A P+ S L+++ ++QQQ RV F++ N
Sbjct: 495 F-DGRVSVTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVLFDVAN 553
Query: 240 SLIGFTPNKC 249
+GF+ C
Sbjct: 554 GRVGFSRELC 563
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/265 (37%), Positives = 138/265 (52%), Gaps = 23/265 (8%)
Query: 5 TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
+ET T GS+ D IA GC + + + G+AGL+GLG G LS SQ+ A FSYCL
Sbjct: 194 SETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 253
Query: 60 VD-RDSDSTSTLEFDSSLPPNAVTAPLLR---------NHELDTFYYLGLTGISVGGDLL 109
+D+ S STL + A+ +R + T+YYL LTGISVG L
Sbjct: 254 TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAAL 313
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDT 167
PI AF + G GG+I+DSGT +T L Y +R A VR L TDG D
Sbjct: 314 PIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDL 372
Query: 168 CYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIG 224
C+ S S+ +P+++ HF G + LP +NY+I +D G +C A + T LS +G
Sbjct: 373 CFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLG 430
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ + ++++ + F P KC
Sbjct: 431 NYQQQNLHILYDVQKETLSFAPAKC 455
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/259 (34%), Positives = 124/259 (47%), Gaps = 25/259 (9%)
Query: 1 GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TF 55
G + +T+TL AS V GC H G GL+GLGGG+ S SQ A+ +F
Sbjct: 220 GTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSF 279
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
SYCL S VT +LR+ ++ TFY L I+VGG L +S +
Sbjct: 280 SYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSV 339
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F G +VDSGT +TRL Y+AL AF G + ++ DTC+DF+ ++
Sbjct: 340 FA------AGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQT 393
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPT--SSSLSIIGNVQQQG 230
+ +PTV+ F G + L D NG C AFA T + IIGNVQQ+
Sbjct: 394 QISIPTVALVFSGGAAIDL---------DPNGIMYGNCLAFAATGDDGTTGIIGNVQQRT 444
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V +++ +S +GF C
Sbjct: 445 FEVLYDVGSSTLGFRSGAC 463
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 82/246 (33%), Positives = 126/246 (51%), Gaps = 15/246 (6%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S ++ GCG +N+GLF AAG++GL LS +Q++ FSYCL +S S+
Sbjct: 93 SQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSG 152
Query: 69 TLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
P + P+L + + + Y+L LT I+V G L ++ +++
Sbjct: 153 GGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------ 206
Query: 127 IVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFH 185
++DSGT +TRL Y ALR AFV+ + + ++ DTC+ S +S VP +
Sbjct: 207 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMI 266
Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIG 243
F G L L A + LI D G C AFA +S + ++IIGN QQQ +++++ S IG
Sbjct: 267 FQGGADLTLRAPSILIEAD-KGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIG 325
Query: 244 FTPNKC 249
F P C
Sbjct: 326 FAPGSC 331
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 87/248 (35%), Positives = 121/248 (48%), Gaps = 12/248 (4%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTL 70
S+S +A GC N G GA+G++GLG +LS SQI FSYCL D D+ ++ L
Sbjct: 204 SSSFAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPIL 263
Query: 71 --EFDSSLPPNAVTAPLLRN----HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
+ + LLRN +YY+ LTGI+VG LP++ + F +G G
Sbjct: 264 FGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAG 323
Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDGVAL-FDTCYDFSSRSSVEVPTV 182
G+IVDSGT T L Y LR AF+ T L+ G FD C++ + + VP +
Sbjct: 324 GVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE-AGAADTPVPRL 382
Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
F F G +P ++Y VD G C PT +S+IGNV Q V ++L +
Sbjct: 383 VFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPT-RGVSVIGNVMQMDLHVLYDLDGAT 441
Query: 242 IGFTPNKC 249
F P C
Sbjct: 442 FSFAPADC 449
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYC 58
D +T T S ++ GCG +N+GLF AAG++GL LS +Q++ FSYC
Sbjct: 225 DLLTLT---SSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYC 281
Query: 59 LVDRDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
L +S S+ P + P+L + + + Y+L LT I+V G L ++ +
Sbjct: 282 LPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMY 341
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRS 175
++ ++DSGT +TRL Y ALR AFV+ + + ++ DTC+ S +S
Sbjct: 342 RVPT------LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKS 395
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
VP + F G L L A + LI D G C AFA +S + ++IIGN QQQ +
Sbjct: 396 ISAVPEIKMIFQGGADLTLRAPSILIEAD-KGITCLAFAGSSGTNQIAIIGNRQQQTYNI 454
Query: 234 SFNLRNSLIGFTPNKC 249
++++ S IGF P C
Sbjct: 455 AYDVSTSRIGFAPGSC 470
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 97/273 (35%), Positives = 126/273 (46%), Gaps = 31/273 (11%)
Query: 1 GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGA------AGLLGLGGGSLSFPSQI-- 50
G+ E TL ++ + GC H GA AGLLGLG G S SQ
Sbjct: 216 GNLAQEAFTLSPSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRR 275
Query: 51 --NASTFSYCLVDRDSDSTSTLEFDSSLPP--NAVTAPLLR-NHELDTFYYLGLTGISVG 105
+ FSYCL R S S L ++ PP N PL+ N +L + Y + L GISV
Sbjct: 276 GNSGDVFSYCLPPRGS-SAGYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVS 334
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVA 163
G LPI +AF I G ++DSGT +T + Y LRD F R G + P V
Sbjct: 335 GAALPIDASAFYI------GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVE 388
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI--PVDSNGT----FCFAFAPTS 217
DTCYD + V P V+ F G + + A L+ VD++G C AF PT+
Sbjct: 389 SLDTCYDVTGHDVVTAPPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTN 448
Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN+QQ+ V F++ IGF N C
Sbjct: 449 LPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGC 481
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 133/265 (50%), Gaps = 37/265 (13%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL + +V GCGH G F G GLLGLG S Q + FS
Sbjct: 231 GVYSSDTLTLSPNDAVRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFS 289
Query: 57 YCLVDRDSDSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL R S +T L + PP T LL + T+Y + LTGISVGG L +
Sbjct: 290 YCLPTRPS-TTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPS 348
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
+ F GG +VD+GT +TRL Y ALR AF G + +P G+ DTCY
Sbjct: 349 SVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGI--LDTCY 400
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
+FS +V +P V+ F G + L A L +F C AFAP+ S ++I+GNV
Sbjct: 401 NFSGYGTVTLPNVALTFSGGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 453
Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
QQ+ SF +R + +GF P+ C
Sbjct: 454 QQR----SFEVRIDGTSVGFKPSSC 474
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 97/273 (35%), Positives = 136/273 (49%), Gaps = 26/273 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+ +G AS +A GC N G+ ++G++GLG LS SQ+ FSYCL
Sbjct: 178 GYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCL- 235
Query: 61 DRDSDSTSTLEFDSSLPP----NAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISET 114
D+D+ + SL N + PLL N E+ ++YY+ LTGI+VG LP++ T
Sbjct: 236 RSDADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTST 295
Query: 115 AFKIDESGN----GGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPT-DGVAL-FD 166
F GG IVDSGT +T L E Y ++ AF+ T L+ T +G FD
Sbjct: 296 TFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFD 355
Query: 167 TCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNY--LIPVDSNGTF---CFAFAPTSS 218
C+D ++ S V VPT+ F G + ++Y ++ VDS G C P S
Sbjct: 356 LCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASE 415
Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+SIIGNV Q V ++L + F P C
Sbjct: 416 KLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 448
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 96/172 (55%), Gaps = 3/172 (1%)
Query: 79 NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
VT PL+ N +FYY+ L ISVG L I ++ F++ + G+GG+I+DSGT +T ++
Sbjct: 21 KQVTTPLITNPLQPSFYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGTTITYIE 80
Query: 139 TETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAK 197
+++L+ F T+ G D C+ S ++ VE+P + FHF G L LP +
Sbjct: 81 ENAFDSLKKEFTSQTKLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGGD-LELPGE 139
Query: 198 NYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
NY+I S G C A S+ +SI GN+QQQ V+ +L+ I F P +C
Sbjct: 140 NYMIADSSLGVACLAMG-ASNGMSIFGNIQQQNILVNHDLQKETITFIPTQC 190
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 90/263 (34%), Positives = 132/263 (50%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
G+ +ET+T+ S S A GCGH++ G+F ++G++GLGGG LS SQ+ ++
Sbjct: 181 GNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTI 240
Query: 55 ---FSYCL--VDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL V DS +S + F +S + V+ PL++ DTFYYL L GISVG
Sbjct: 241 NGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSP-DTFYYLTLEGISVGK 299
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
LP + K E G IIVDSGT T L E Y+ L + + D +F
Sbjct: 300 KRLPYKGYS-KKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFS 358
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
CY+ + + + P ++ HF + V P ++ + CF APT S + ++GN+
Sbjct: 359 LCYN--TTAEINAPIITAHFKDANVELQPLNTFMRMQED--LVCFTVAPT-SDIGVLGNL 413
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q V F+LR + F C
Sbjct: 414 AQVNFLVGFDLRKKRVSFKAADC 436
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 95/253 (37%), Positives = 129/253 (50%), Gaps = 10/253 (3%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G E TL ++ V +++ GCG NN+GLF G AGLLGLG G LS P+Q + FS
Sbjct: 218 GFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S+ST L F S+ +V + + Y + + GISVG L I+ +F
Sbjct: 278 YCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSF 337
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+ G I+DSGT TRL T+ Y LR F + T G LFDTCYDF+ +
Sbjct: 338 STE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDT 392
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
V PT++F F V+ L +P+ + C AFA +I GNVQQ V ++
Sbjct: 393 VTYPTIAFSFAGSTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYD 451
Query: 237 LRNSLIGFTPNKC 249
+ +GF PN C
Sbjct: 452 VAGGRVGFAPNGC 464
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 119 bits (299), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 91/260 (35%), Positives = 125/260 (48%), Gaps = 25/260 (9%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G TET+ + S+ V N GC + G F G GLLGLG ++ PSQ + FS
Sbjct: 232 GFLATETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFS 291
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S ST L F + A + P+ + +L Y L GISV G LPI
Sbjct: 292 YCLPASPS-STGHLSFGVEVSQAAKSTPI--SPKLKQLYGLNTVGISVRGRELPI----- 343
Query: 117 KIDESGNGGI---IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
NG I I+DSGT T L + TY+AL AF + T+G + F CYDFS+
Sbjct: 344 ------NGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSN 397
Query: 174 --RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQ 229
++ +P +S F G + + +IPV+ C AFA T S +I GN QQ+
Sbjct: 398 IGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQK 457
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V +++ ++GF P C
Sbjct: 458 TYEVIYDVAKGMVGFAPKGC 477
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/264 (33%), Positives = 141/264 (53%), Gaps = 22/264 (8%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS- 53
GD +T++L S S I IGCG +N G F GA+ G++GLGGG +S +Q+ +S
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234
Query: 54 --TFSYCLV---DRDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLV +++S+++S L F D+++ V+ PL++ + FY+L L SVG
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVG 292
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
+ ++ D+ GN II+DSGT +T + ++ Y L A V + D F
Sbjct: 293 NKRVEFGGSSEGGDDEGN--IIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQF 350
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
CY S + + P ++ HF +G + L + + +P+ ++G CFAF P+ SI GN
Sbjct: 351 SLCYSLKS-NEYDFPIITVHF-KGADVELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGN 407
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+ QQ V ++L+ + F P C
Sbjct: 408 LAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 137/261 (52%), Gaps = 21/261 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GD +E++ LG ++N GCG NN+GLF G++GL+GLG S+S SQ + FSY
Sbjct: 183 GDLASESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSY 242
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL + ++ +L F DSS+ N+ + PL++N +L +FY L LTG S+GG + +
Sbjct: 243 CLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELK 300
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
++F GI++DSGT +TRL Y A++ F++ G ++ DTC++ +
Sbjct: 301 SSSF------GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLT 354
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
S + +P + F L + Y + D++ C A A S + + IIGN QQ
Sbjct: 355 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 413
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ RV ++ +G C
Sbjct: 414 KNQRVIYDTTQERLGIVGENC 434
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 137/261 (52%), Gaps = 21/261 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GD +E++ LG ++N GCG NN+GLF G++GL+GLG S+S SQ + FSY
Sbjct: 231 GDLASESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSY 290
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL + ++ +L F DSS+ N+ + PL++N +L +FY L LTG S+GG + +
Sbjct: 291 CLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELK 348
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
++F GI++DSGT +TRL Y A++ F++ G ++ DTC++ +
Sbjct: 349 SSSF------GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLT 402
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
S + +P + F L + Y + D++ C A A S + + IIGN QQ
Sbjct: 403 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 461
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ RV ++ +G C
Sbjct: 462 KNQRVIYDTTQERLGIVGENC 482
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 129/260 (49%), Gaps = 21/260 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G + ++L +D GCG +N+G F G +GL+GLG LS SQ FSY
Sbjct: 221 GVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSY 280
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL ++S+S+ +L D+S+ N+ V ++ + FY++ LTGI++GG +
Sbjct: 281 CLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV--- 337
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
ES G +IVDSGT +T L YNA++ F+ G ++ DTC++ +
Sbjct: 338 -------ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLT 390
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNVQQQ 229
V++P++ F F + + + L V S+ + C A A S SIIGN QQ+
Sbjct: 391 GFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQK 450
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV F+ S IGF C
Sbjct: 451 NLRVIFDTLGSQIGFAQETC 470
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 137/256 (53%), Gaps = 20/256 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TF 55
G + ++T+ LGS++V+N GC + G L AGL+GLGGG+ S +Q + F
Sbjct: 214 GTYSSDTLALGSSTVENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAF 273
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
SYCL S+ L +S V P+LR+ ++ ++Y + L I VGG L I +A
Sbjct: 274 SYCL-PPTPGSSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASA 332
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F + G I+DSGT +TRL Y+AL AF G + P + +FDTC+DFS +S
Sbjct: 333 F------SAGSIMDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQS 386
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
SV +PTV+ F G V+ L + ++ C AFA S +SL IIGNVQQ+ V
Sbjct: 387 SVSIPTVALVFSGGAVVDLASDGIIL------GSCLAFAANSDDTSLGIIGNVQQRTFEV 440
Query: 234 SFNLRNSLIGFTPNKC 249
+++ +GF C
Sbjct: 441 LYDVGGGAVGFKAGAC 456
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 121/257 (47%), Gaps = 28/257 (10%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
+V +A GCG N GLFV +G+ G G G S PSQ+ FSYCL +S +
Sbjct: 64 VAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKVGRFSYCLTLVTESKSSVVI 123
Query: 72 FDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
+ P+ + A P++ N + TFYYL L GI+VG LP ++ F + + G
Sbjct: 124 LGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVFALKKDG 183
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR-------- 174
+GG ++DSGT++T L + L++ V A P + +D + R
Sbjct: 184 SGGTVIDSGTSLTTLPEAVFELLQEELV----AQFP---LPRYDNTPEVGDRLCFRRPKG 236
Query: 175 -SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIGNVQQQGTR 232
V VP + H G + LP NY + +G C +++ +IGN QQQ
Sbjct: 237 GKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQNMH 295
Query: 233 VSFNLRNSLIGFTPNKC 249
V +++ N+ + F P +C
Sbjct: 296 VVYDVENNKLLFAPAQC 312
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/246 (34%), Positives = 115/246 (46%), Gaps = 17/246 (6%)
Query: 18 IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDS--- 74
+ GCG N+G +G++G G LS SQ+ FSYCL S STL F S
Sbjct: 217 LGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYASGRKSTLLFGSLRG 276
Query: 75 ----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDS 130
+ T LLR+ + TFYY+ TG++VG L I +AF + G+GG IVDS
Sbjct: 277 GVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDS 336
Query: 131 GTAVTRLQTETYNALRDAFVRGTR-------ALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
GTA+T + AF R + P DGV F R +V VP +
Sbjct: 337 GTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVC-FAAAASRVPRPAV-VPRMV 394
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIG 243
FH +G L LP +NY++ G C A + S + IGN QQ RV ++L +
Sbjct: 395 FHL-QGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLS 453
Query: 244 FTPNKC 249
F P +C
Sbjct: 454 FAPAQC 459
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/262 (32%), Positives = 136/262 (51%), Gaps = 23/262 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAST----F 55
G+ T T+ + N+A+GC + G F+GA+G+LGLG G +S +Q + F
Sbjct: 143 GNHKTRTI-----RIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIF 197
Query: 56 SYCLVD--RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-IS 112
SYCLVD R S+++S L + P++RN +FYY+ +TG++V G + I+
Sbjct: 198 SYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 257
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCY 169
+ + ID GN G I DSGT ++ L+ Y+ + A + RA +G F+ CY
Sbjct: 258 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FELCY 314
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQ 227
+ +R +P + F G V+ LP NY++ V N C A T++ +I+GN+
Sbjct: 315 NV-TRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAEN-VQCVALQKVTTTNGSNILGNLL 372
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ + ++L + IGF + C
Sbjct: 373 QQDHHIEYDLAKARIGFKWSPC 394
>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
gi|194703714|gb|ACF85941.1| unknown [Zea mays]
Length = 208
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/220 (37%), Positives = 114/220 (51%), Gaps = 19/220 (8%)
Query: 37 LGLGGGSLSFPSQINAS---TFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHEL 91
+GLGGG+ S SQ + FSYCL S S + S V P+LR+ ++
Sbjct: 1 MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQV 60
Query: 92 DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
TFY + L I VGG L I + F + G ++DSGT +TRL Y+AL AF
Sbjct: 61 PTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKA 114
Query: 152 GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
G + P + DTC+DFS +SSV +P+V+ F G V+ L A ++ SN C
Sbjct: 115 GMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL---SN---CL 168
Query: 212 AFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
AFA S SSL IIGNVQQ+ V +++ ++GF C
Sbjct: 169 AFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 127/269 (47%), Gaps = 27/269 (10%)
Query: 4 VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
++ET+ L V N +GC + AG+ G G G S PSQ+ + FSYCL+
Sbjct: 182 LSETLHLHGLIVPNFLVGCSVFSSR---QPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHK 238
Query: 63 --DSDSTSTLEFDSSLPPNAVTA-----PLLRNHELD------TFYYLGLTGISVGGDLL 109
D+ +S+L DS + TA PL++N ++ +YY+ L IS+GG +
Sbjct: 239 FDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSV 298
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVALF 165
I D+ GNGG I+DSGT T + TE + L + F+ RAL + ++
Sbjct: 299 KIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALM-VEALSGL 357
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS---- 221
C++ S +E+P + HF G + LP +NY + S CF + +
Sbjct: 358 KPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPG 417
Query: 222 -IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN Q Q V ++L+N +GF C
Sbjct: 418 MILGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 137/261 (52%), Gaps = 21/261 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GD +E++ LG ++N GCG NN+GLF G++GL+GLG S+S SQ + FSY
Sbjct: 231 GDLASESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSY 290
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL + ++ +L F DSS+ N+ + PL++N +L +FY L LTG S+GG + +
Sbjct: 291 CLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELK 348
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
++F GI++DSGT +TRL Y A++ F++ G ++ DTC++ +
Sbjct: 349 SSSF------GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLT 402
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
S + +P + F L + Y + D++ C A A S + + IIGN QQ
Sbjct: 403 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 461
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ RV ++ +G C
Sbjct: 462 KNQRVIYDSTQERLGIVGENC 482
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 129/260 (49%), Gaps = 21/260 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G + ++L +D GCG +N+G F G +GL+GLG LS SQ FSY
Sbjct: 220 GVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSY 279
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL ++S+S+ +L D+S+ N+ V ++ + FY++ LTGI++GG +
Sbjct: 280 CLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV--- 336
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
ES G +IVDSGT +T L YNA++ F+ G ++ DTC++ +
Sbjct: 337 -------ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLT 389
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNVQQQ 229
V++P++ F F + + + L V S+ + C A A S SIIGN QQ+
Sbjct: 390 GFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQK 449
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV F+ S IGF C
Sbjct: 450 NLRVIFDTLGSQIGFAQETC 469
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/271 (33%), Positives = 125/271 (46%), Gaps = 29/271 (10%)
Query: 1 GDFVTETVTLGS-----ASVDN------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ 49
G +T+TLG+ AS +N GCG NN GLF A GL GLG G +S SQ
Sbjct: 245 GHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQ 304
Query: 50 INAS---TFSYCLVDRDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISV 104
FSYCL S + L + + P +A P+L +FYY+ L GI V
Sbjct: 305 AAGKFGEGFSYCLPSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRV 364
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGV 162
G + +S + +IVDSGT +TRL Y ALR AF+ G +
Sbjct: 365 AGRAIRVSSPRVALP------LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRL 418
Query: 163 ALFDTCYDFSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SS 218
++ DTCYDF++ ++V +P V+ F G + + L V C AFAP
Sbjct: 419 SILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGDGR 477
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S I+GN QQ+ V +++ IGF C
Sbjct: 478 SAGILGNTQQRTLAVVYDVARQKIGFAAKGC 508
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 81/248 (32%), Positives = 131/248 (52%), Gaps = 18/248 (7%)
Query: 15 VDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAST----FSYCLVD--RDSDST 67
+ N+A+GC + G F+GA+G+LGLG G +S +Q + FSYCLVD R S+++
Sbjct: 184 IKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYLRGSNAS 243
Query: 68 STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGI 126
S L + P++RN +FYY+ +TG++V G + I+ + + ID GN G
Sbjct: 244 SFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGT 303
Query: 127 IVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
I DSGT ++ L+ Y+ + A + RA +G F+ CY+ +R +P +
Sbjct: 304 IFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FELCYNV-TRMEKGMPKLG 359
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTRVSFNLRNSL 241
F G V+ LP NY++ V N C A T++ +I+GN+ QQ + ++L +
Sbjct: 360 VEFQGGAVMELPWNNYMVLVAEN-VQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAKAR 418
Query: 242 IGFTPNKC 249
IGF + C
Sbjct: 419 IGFKWSPC 426
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/257 (35%), Positives = 131/257 (50%), Gaps = 20/257 (7%)
Query: 6 ETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS---YCLVD 61
ET++L SA ++ A GCG N G F GL+GLG G LS SQ AS + YCL
Sbjct: 213 ETLSLTSARALPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPS 272
Query: 62 RDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
++ S L ++ P + TA +++ + +FY++ L I VGG +LP+ F
Sbjct: 273 YNT-SHGYLTIGTTTPASGSDGVRYTA-MIQKQDYPSFYFVDLVSIVVGGFVLPVPPILF 330
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
D G ++DSGT +T L E Y ALRD F P FDTCYDF+ +++
Sbjct: 331 TRD-----GTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNA 385
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAPTSSSL--SIIGNVQQQGTR 232
+ +P VSF F +G L LI D + T C AF P S++ +I+GN QQ+ T
Sbjct: 386 IFMPLVSFKFSDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTE 445
Query: 233 VSFNLRNSLIGFTPNKC 249
+ +++ IGF C
Sbjct: 446 MIYDVAAEKIGFVSGSC 462
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 105/255 (41%), Positives = 131/255 (51%), Gaps = 13/255 (5%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G F TE +TL S + +NI GCG NN+GLF G+AGLLGLG LS SQ FS
Sbjct: 242 GFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFS 301
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S ST L F S NA PL +FY L TGISVGG L IS + F
Sbjct: 302 YCL-PSSSSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVF 360
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
G I+DSGT +TRL Y+ALR +F T +++ DTCYDFSS ++
Sbjct: 361 S-----TAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTT 415
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVS 234
+ VP + F F G + + A L S C AFA S + + I GNVQQ+ V
Sbjct: 416 ISVPKIGFSFSSGIEVDIDATGILY-ASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVF 474
Query: 235 FNLRNSLIGFTPNKC 249
++ +GF P C
Sbjct: 475 YDGSAGKVGFAPGGC 489
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/258 (32%), Positives = 118/258 (45%), Gaps = 27/258 (10%)
Query: 11 GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
G +V ++ GCG N G F G+ G G G LS P Q+ S+FSYC +S ST
Sbjct: 195 GKVTVPDLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTT-IFESKST 253
Query: 70 LEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
F P + + A P L NH +YYL L GI+VG L + E+AF +
Sbjct: 254 PVFLGGAPADGLRAHATGPILSTPFLPNHP--EYYYLSLKGITVGKTRLAVPESAFVVKA 311
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT------CYDFSS- 173
G+GG I+DSGTA+T + +L +AFV A P + DT C+ S
Sbjct: 312 DGSGGTIIDSGTAITAFPRAVFRSLWEAFV----AQVPLPHTSYNDTGEPTLQCFSTESV 367
Query: 174 --RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
S V VP ++ H EG LP +NY+ + C ++IGN QQQ
Sbjct: 368 PDASKVPVPKMTLHL-EGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNM 426
Query: 232 RVSFNLRNSLIGFTPNKC 249
+ +L + + P +C
Sbjct: 427 HIVHDLAGNKLVIEPAQC 444
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 129/264 (48%), Gaps = 25/264 (9%)
Query: 11 GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAST-------FSYCLVDR 62
++++ ++ GC + V ++G LGL GS SFP+QI + + FSYC +R
Sbjct: 107 AASTLGDVIFGCASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNR 166
Query: 63 DS--DSTSTLEF-DSSLPPNAVTAPLLRNH----ELDTFYYLGLTGISVGGDLLPISETA 115
+S+ + F DS +P + L + FYY+GL GISVGG+LL I +A
Sbjct: 167 AEHLNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSA 226
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSR 174
FKID GNGG DSGT V+ L + AL +AF R L+ T G + CYD ++
Sbjct: 227 FKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAG 286
Query: 175 SSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF----APTSSSLSIIGN 225
+ P V+ HF + L + +P+ T C AF A +++IGN
Sbjct: 287 DARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGN 346
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + +L S IGF P C
Sbjct: 347 YQQQDYLIEHDLERSRIGFAPANC 370
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 92/263 (34%), Positives = 130/263 (49%), Gaps = 25/263 (9%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + TET+TL V + GCG + G + GLLGLGG S SQ ++ FS
Sbjct: 214 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 273
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVGGDLL 109
YCL S L + ++ TA P+ R + TFY + LTGISVGG L
Sbjct: 274 YCL-PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPL 332
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVALFD 166
+ +AF + G+++DSGT +T L Y ALR AF R L P++G A+ D
Sbjct: 333 AVPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-AVLD 385
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
TCYDF+ ++V VPT++ F G + L ++ +G FA A T ++ IIGNV
Sbjct: 386 TCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVL---VDGCLAFAGAGTDDTIGIIGNV 442
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q+ V ++ +GF C
Sbjct: 443 NQRTFEVLYDSGKGTVGFRAGAC 465
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 142/264 (53%), Gaps = 22/264 (8%)
Query: 1 GDFVTETVTLGSASVDNIA-----IGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS- 53
GD +T++L S S ++ IGCG +N G F GA+ G++GLGGG +S +Q+ +S
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234
Query: 54 --TFSYCLV---DRDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLV +++S+++S L F D+++ V+ PL++ + FY+L L SVG
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVG 292
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
+ ++ D+ GN II+DSGT +T + ++ Y L A V + D F
Sbjct: 293 NKRVEFGGSSEGGDDEGN--IIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQF 350
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
CY S + + P ++ HF +G + L + + +P+ ++G CFAF P+ SI GN
Sbjct: 351 SLCYSLKS-NEYDFPIITAHF-KGADIELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGN 407
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+ QQ V ++L+ + F P C
Sbjct: 408 LAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 125/274 (45%), Gaps = 33/274 (12%)
Query: 4 VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV--- 60
++ET+ L S S N +GC + AG+ G G G S PSQ+ FSYCL+
Sbjct: 177 LSETLHLHSLSKPNFLVGCSVFSSH---QPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHR 233
Query: 61 -DRDSDSTSTL-----EFDSSLPPNA-VTAPLLRNHELD------TFYYLGLTGISVGGD 107
D D+ +S+L + DS NA V P ++N ++D +YYLGL I+VGG
Sbjct: 234 FDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGH 293
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVA 163
+ + E GNGG+I+DSGT T + E + L D F+R R D +
Sbjct: 294 HVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIG 353
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS-- 221
L C++ S +V P + +F G + LP +NY V C +
Sbjct: 354 L-RPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGE-VACLTVVTDGVAGPER 411
Query: 222 ------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN Q Q V ++LRN +GF KC
Sbjct: 412 VGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 128/259 (49%), Gaps = 12/259 (4%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET T G+ + V ++A GCG +N G ++GL+G+G G LS SQ+ + FSYC
Sbjct: 201 GVLATETFTFGAGTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCF 260
Query: 60 VDRDSDSTSTLEF---DSSLPPNAVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISE 113
+ +TS+ F +SL P A + P + ++YYL L GI+VG LLPI
Sbjct: 261 TPFNDTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDP 320
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY---D 170
F++ SG GG+I+DSGT T L+ + L A + C+
Sbjct: 321 AVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQ 380
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
+V+VP + HF +G + LP + ++ G C ++ +S++G++QQQ
Sbjct: 381 GRGPEAVDVPRLVLHF-DGADMELPRSSAVVEDRVAGVACLGIV-SARGMSVLGSMQQQN 438
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V +++ ++ F P C
Sbjct: 439 MHVRYDVGRDVLSFEPANC 457
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 135/265 (50%), Gaps = 22/265 (8%)
Query: 1 GDFVTETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ----INA 52
G + ++T+ LGS S V GC H G+ AGL+GLGGG+ S SQ
Sbjct: 235 GTYSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGT 294
Query: 53 STFSYCLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
+ FSYCL S S TL + V P+LR+ ++ FY + L I VGG L I
Sbjct: 295 TAFSYCLPPTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSI 354
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP---TDGVALFDTC 168
T F + G+I+DSGT VTRL Y++L AF G + P + G DTC
Sbjct: 355 PTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTC 408
Query: 169 YDFSSRSSVEVPTVSFHF--PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIG 224
+D S +SSV +PTV+ F G V+ L A L+ ++++ FC AF TS S IIG
Sbjct: 409 FDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIG 468
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
NVQQ+ +V +++ +GF C
Sbjct: 469 NVQQRTFQVLYDVAGGAVGFKAGAC 493
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 112/252 (44%), Gaps = 45/252 (17%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G+ E + G+ V + GCG NN+GLF G +GL+GLG LS SQ +
Sbjct: 147 GELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTS--------- 197
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
N +L FY++ LTGIS+GG A +
Sbjct: 198 --------------------------ENPQLYNFYFINLTGISIGG-------VALQAPS 224
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
G I+VDSGT +TRL Y AL+ F++ P ++ DTC++ S+ V++P
Sbjct: 225 VGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIP 284
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQGTRVSFNL 237
T+ HF L + V S+ + C A A ++I+GN QQ+ RV ++
Sbjct: 285 TIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDT 344
Query: 238 RNSLIGFTPNKC 249
+ + +GF C
Sbjct: 345 KETKVGFALETC 356
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 115 bits (289), Expect = 1e-23, Method: Composition-based stats.
Identities = 96/274 (35%), Positives = 132/274 (48%), Gaps = 36/274 (13%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
G++ +T+TL + V GCG NNEG F GA G+LGLG G LS SQ + F
Sbjct: 203 GNYGCDTMTLEPSDVFQKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVF 262
Query: 56 SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
SYCL + +S +S+L+F S V P E +Y++ L ISV
Sbjct: 263 SYCLPEENSIGSLLFGEKATSQSSSLKFTS-----LVNGPGTSGLEESGYYFVKLLDISV 317
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
G L I + F + G I+DSGT +TRL Y+AL+ AF + ++G
Sbjct: 318 GNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 372
Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS- 219
+ DTCY+ S R V +P HF +G + L K + D++ C AFA S S
Sbjct: 373 ENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGKRVVWGNDAS-RLCLAFAGNSKST 431
Query: 220 ----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L+IIGN QQ V +++R IGF N C
Sbjct: 432 MNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/257 (34%), Positives = 120/257 (46%), Gaps = 20/257 (7%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQIN---ASTF 55
G +VT+T+T+ + V + GC H G F AG+L LGGG S Q + F
Sbjct: 250 GTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAF 309
Query: 56 SYCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
SYC+ S +L ++SL PL++N TFY + L I V G L +
Sbjct: 310 SYCIPKPSSAGFLSLGGPVEASL--KFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPP 367
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFS 172
TAF G ++DSG VT+L + Y ALR AF A P V DTCYDF+
Sbjct: 368 TAFAT------GAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFT 421
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
V+VP VS F G L L + ++ +G FA P S+ IGNVQQQ
Sbjct: 422 RFPDVKVPKVSLVFAGGATLDLEPASIIL----DGCLAFAATPGEESVGFIGNVQQQTYE 477
Query: 233 VSFNLRNSLIGFTPNKC 249
V +++ +GF C
Sbjct: 478 VLYDVGGGKVGFRRGAC 494
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/262 (35%), Positives = 129/262 (49%), Gaps = 30/262 (11%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
G + ET+TL +V++ GCG + G GLLGLGG +S Q + FS
Sbjct: 225 GVYSNETLTLAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFS 284
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
YCL +S++ L S PP+ V P+ TFY + +TGISVGG L I
Sbjct: 285 YCLPALNSEA-GFLVLGS--PPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHI 341
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCY 169
++AF+ GG+I+DSGT T L YNAL A + +A L P+D FDTCY
Sbjct: 342 PQSAFR------GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD---FDTCY 392
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQ 227
+F+ S++ VP V+F F G + L N ++ D C AF + L IIGNV
Sbjct: 393 NFTGYSNITVPRVAFTFSGGATIDLDVPNGILVND-----CLAFQESGPDDGLGIIGNVN 447
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ V ++ +GF C
Sbjct: 448 QRTLEVLYDAGRGNVGFRAGAC 469
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/259 (32%), Positives = 128/259 (49%), Gaps = 15/259 (5%)
Query: 5 TETVTLGSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
TET+T A SV IA GCG +N GL + G +GLG GSLS +Q+ FSYCL D
Sbjct: 187 TETLTFPGAPGVSVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTD 246
Query: 62 RDSDSTSTLEFDSSLPPNAV--------TAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+ S + +L A + PL+++ + T+YY+ L GIS+G LPI
Sbjct: 247 FFNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPN 306
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
F + + G+GG+IVDSGT T L + + D V G + +L C+ ++
Sbjct: 307 GTFDLRDDGSGGMIVDSGTTFTFLVESAFRVVVD-HVAGVLRQPVVNASSLDSPCFPAAT 365
Query: 174 --RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQG 230
+ +P + HF G + L NY+ +FC A + S+ +SI+GN QQQ
Sbjct: 366 GEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQN 425
Query: 231 TRVSFNLRNSLIGFTPNKC 249
++ F++ + F P C
Sbjct: 426 IQMLFDITVGQLSFMPTDC 444
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/247 (35%), Positives = 124/247 (50%), Gaps = 14/247 (5%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
+ET TLG +V + GC EG + AGL+GLG G LS SQ++A TF YCL D+
Sbjct: 199 SETFTLGGDAVPGVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLT-ADA 257
Query: 65 DSTSTLEFDSSLPPNAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISETAFKIDESG 122
S L F + A + L TFY + L I++G +A G
Sbjct: 258 SKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIG--------SATTAGVGG 309
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTV 182
GG++ DSGT +T L Y + AF+ T +L+P +G F+ CY+ S+ +P +
Sbjct: 310 PGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYE-KPDSARLIPAM 368
Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLI 242
HF G + LP NY++ VD +G C+ S SLSIIGN+ Q V ++R S++
Sbjct: 369 VLHFDGGADMALPVANYVVEVD-DGVVCWV-VQRSPSLSIIGNIMQMNYLVLHDVRKSVL 426
Query: 243 GFTPNKC 249
F P C
Sbjct: 427 SFQPANC 433
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 127/258 (49%), Gaps = 16/258 (6%)
Query: 5 TETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
TET+TLG +S V +A GCG +N G + + G +GLG G+LS +Q+ FSYCL
Sbjct: 163 TETLTLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLT 222
Query: 61 D---RDSDSTSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
D DS L + L P T PLL++ + + Y++ L GIS+G LPI
Sbjct: 223 DFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNG 282
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFS 172
F + G GG+IVDSGT T L + R+ R R L P + +L C+
Sbjct: 283 TFDLRGDGTGGMIVDSGTTFTILAESGF---REVVGRVARVLGQPPVNASSLDAPCFPAP 339
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGT 231
+ +P + HF G + L NY+ + + +FC A T+ S S++GN QQQ
Sbjct: 340 AGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNI 399
Query: 232 RVSFNLRNSLIGFTPNKC 249
++ F+ + F P C
Sbjct: 400 QMLFDTTVGQLSFLPTDC 417
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 119/266 (44%), Gaps = 31/266 (11%)
Query: 11 GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
G S + GCGH N+G+F G+ G G G S PSQ+ ++FSYC ++S
Sbjct: 207 GGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSSL 266
Query: 70 LEF-----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
+ + L + PLLR+ + Y+L L I+VG +PI E ++ E+
Sbjct: 267 VTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREAS-- 324
Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGT-RALSPTDGVALFDTCYDFSSRSS------- 176
I+DSG ++T L + Y A++ FV +S +G AL D C+ S ++
Sbjct: 325 -AIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSAL-DLCFALPSAAAPKSAFGW 382
Query: 177 ----------VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLSII 223
V VP + FH G LP +NY+ C + +I
Sbjct: 383 RWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVI 442
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN QQQ T V ++L N ++ F P +C
Sbjct: 443 GNYQQQNTHVVYDLENDVLSFAPARC 468
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 78/244 (31%), Positives = 121/244 (49%), Gaps = 11/244 (4%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + + +GC +G F + G+L LG ++SF S+ A FSYCLVD ++
Sbjct: 225 AKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 284
Query: 67 TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
TS L F A PLL + + FY + + + V G+ L I + +D NGG
Sbjct: 285 TSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDR--NGGA 342
Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
I+DSGT++T L T Y A+ A + L P + F+ CY+++ ++E+P + HF
Sbjct: 343 ILDSGTSLTILATPAYRAVVTALSKHLAGL-PRVTMDPFEYCYNWTDAGALEIPKMEVHF 401
Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
L PAK+Y+I + G C S +S+IGN+ QQ F+LR+ + F
Sbjct: 402 AGSARLEPPAKSYVIDA-APGVKCIGVQEGSWPGVSVIGNILQQEHLWEFDLRDRWLRFK 460
Query: 246 PNKC 249
+C
Sbjct: 461 HTRC 464
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/261 (33%), Positives = 132/261 (50%), Gaps = 15/261 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G + ++L +D GCG +N+G F G +GL+GLG LS SQ FS
Sbjct: 252 GVLAHDRLSLAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFS 311
Query: 57 YCLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
YCL ++SDS+ +L DSS+ N+ V A ++ + FY++ LTGI+VGG +
Sbjct: 312 YCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEV-- 369
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
E++ G G I+DSGT +T L YNA++ F+ G ++ DTC++
Sbjct: 370 -ESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNM 428
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNVQQ 228
+ V+VP++ F G + + + L V S+ + C A AP S +IIGN QQ
Sbjct: 429 TGLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQ 488
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ RV F+ S +GF C
Sbjct: 489 KNLRVIFDTSGSQVGFAQETC 509
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 93/271 (34%), Positives = 132/271 (48%), Gaps = 23/271 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+ +G AS +A GC N G+ ++G++GLG LS SQ+ FSYCL
Sbjct: 142 GYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLR 200
Query: 61 DRDSDSTSTLEFDS--SLPPNAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISETAF 116
S + F S + + +L N E+ ++YY+ LTGI+VG LP++ T F
Sbjct: 201 SDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVTSTTF 260
Query: 117 KIDESGN----GGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPT-DGVAL-FDTC 168
GG IVDSGT +T L E Y ++ AF+ T L+ T +G FD C
Sbjct: 261 GFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLC 320
Query: 169 YDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNY--LIPVDSNGTF---CFAFAPTSS-- 218
+D ++ S V VPT+ F G + ++Y ++ VDS G C P S
Sbjct: 321 FDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKL 380
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+SIIGNV Q V ++L + F P C
Sbjct: 381 SISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/250 (34%), Positives = 119/250 (47%), Gaps = 28/250 (11%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S +V N GC H G GL+GLGG + S SQ A+ FSYCL S +
Sbjct: 233 SDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGG 292
Query: 69 TLEFDSSLPPNAVT----APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
L ++ + + PL+R + TFY + L I+V G L + + F +G
Sbjct: 293 FLTLGAAAGGTSSSRYSRTPLVR-FNVPTFYGVFLQAITVAGTKLNVPASVF------SG 345
Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
+VDSGT +T+L Y ALR AF + +A V + DTC+DFS +V VP V+
Sbjct: 346 ASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTL 405
Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSS--SLSIIGNVQQQGTRVSFNLRN 239
F G V+ L D +G F C AF T+ I+GNVQQ+ + F++
Sbjct: 406 TFSRGAVMDL---------DVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGG 456
Query: 240 SLIGFTPNKC 249
S +GF P C
Sbjct: 457 STLGFRPGAC 466
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 92/261 (35%), Positives = 126/261 (48%), Gaps = 16/261 (6%)
Query: 1 GDFVTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G +TET T G +A+ IA GC +EG F +GL+GLG G LS +Q+N F Y
Sbjct: 72 GILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYR 131
Query: 59 LVDRDSDSTSTLEFDSSLPPNA------VTAPLLRNHELD--TFYYLGLTGISVGGDLLP 110
L D + S + F S ++ PLL N + FYY+GLTGISVGG L+
Sbjct: 132 L-SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ 190
Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
I F D S G GG+I DSGT +T L Y +RD + P D
Sbjct: 191 IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLIC 250
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNV 226
S+ P++ HF G + L +NYL + NG C++ +S +L+IIGN+
Sbjct: 251 FTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNI 310
Query: 227 QQQGTRVSFNLR-NSLIGFTP 246
Q V F+L N+ + F P
Sbjct: 311 MQMDFHVVFDLSGNARMLFQP 331
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 129/268 (48%), Gaps = 21/268 (7%)
Query: 1 GDFVTETVTLGSASVDNIAI--GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G+ +ET T G ++++ GCG G GA+G+LG+ LS SQ+ FSYC
Sbjct: 177 GELASETFTFGEHRRVSVSLDFGCGKLTSGSLPGASGILGISPDRLSLVSQLQIPRFSYC 236
Query: 59 L---VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL------DTFYYLGLTGISVGGDLL 109
L +DR++ S + L T P+ + + +YY+ L GISVG L
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR--ALSPTDGVALFDT 167
+ ++F I G+GG VDSG L + AL++A V + ++ TD ++
Sbjct: 297 NVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYEL 356
Query: 168 CYDF------SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
C+ + ++V+VP + +HF G + L +Y++ V S G C + + + +
Sbjct: 357 CFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEV-SAGRMCLVIS-SGARGA 414
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN QQQ V F++ N F P +C
Sbjct: 415 IIGNYQQQNMHVLFDVENHEFSFAPTQC 442
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 131/260 (50%), Gaps = 21/260 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----F 55
GD E +T+GS+SV ++ IGCGH + G F A+G++GLGGG LS SQ++ ++ F
Sbjct: 180 GDLGFEKITIGSSSVKSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 238
Query: 56 SYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
SYCL S + + F + P V+ PL+ + + T+YY+ L IS+G +
Sbjct: 239 SYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTV-TYYYITLEAISIGNE----R 293
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-- 170
AF + G +I+DSGT +T L E Y+ + + ++ +A D D C+D
Sbjct: 294 HMAF----AKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDG 349
Query: 171 FSSRSSVEVPTVSFHFPEG-KVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
++ +S+ +P ++ HF G V LP + D+ A ++ IIGN+ Q
Sbjct: 350 INAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQA 409
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
+ ++L + F P C
Sbjct: 410 NFLIGYDLEAKRLSFKPTVC 429
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 81/257 (31%), Positives = 127/257 (49%), Gaps = 14/257 (5%)
Query: 5 TETVTLGSA------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
TET+T+GS+ SV ++A GCG +N G + + G +GLG G+LS +Q+ FSYC
Sbjct: 160 TETLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYC 219
Query: 59 LVDRDSDSTSTLEFDSSL------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
L D + + + F +L P + PLL++ + Y++ L GIS+G LPI
Sbjct: 220 LTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIP 279
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
F + GNGG++VDSGT T L + + D V P + +L C+ S
Sbjct: 280 NGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDR-VAQLLGQPPVNASSLDSPCFP-S 337
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
+P + HF G + L NY+ + + +FC + S+ S +GN QQQ +
Sbjct: 338 PDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQ 397
Query: 233 VSFNLRNSLIGFTPNKC 249
+ F++ + F P C
Sbjct: 398 MLFDMTVGQLSFLPTDC 414
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 126/260 (48%), Gaps = 12/260 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+ +G AS ++A GC N G+ +G+ GLG G+LS Q+ FSYCL
Sbjct: 174 GYLATETLKVGDASFPSVAFGCSTEN-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLR 232
Query: 61 DRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
+ S + F S N + P + N + ++YY+ LTGI+VG LP++ + F
Sbjct: 233 SGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTF 292
Query: 117 KIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-SR 174
++G GG IVDSGT +T L + Y ++ AF+ T ++ +G D C+ +
Sbjct: 293 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGG 352
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQQ 229
+ VP++ F G +P + DS G+ C P +S+IGNV Q
Sbjct: 353 GGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQM 412
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
+ ++L + F+P C
Sbjct: 413 DMHLLYDLDGGIFSFSPADC 432
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 92/247 (37%), Positives = 122/247 (49%), Gaps = 16/247 (6%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S ++ GCG N G F GLLGLG G LS PSQ AS FSYCL +S +T
Sbjct: 252 SRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTG 310
Query: 69 TLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
L ++ + A +LR + +FY++ L I +GG +LP+ F GG
Sbjct: 311 YLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGG 365
Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFH 185
++DSGT +T L + Y LRD F +P + D CYDF+ S V VP VSF
Sbjct: 366 TLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFR 425
Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQQGTRVSFNLRNSLI 242
F +G V L +I +D N C AFA + LSIIGN QQ+ V +++ I
Sbjct: 426 FGDGAVFELDFFGVMIFLDEN-VGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKI 484
Query: 243 GFTPNKC 249
GF P C
Sbjct: 485 GFVPASC 491
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 92/261 (35%), Positives = 126/261 (48%), Gaps = 16/261 (6%)
Query: 1 GDFVTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G +TET T G +A+ IA GC +EG F +GL+GLG G LS +Q+N F Y
Sbjct: 191 GILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYR 250
Query: 59 LVDRDSDSTSTLEFDSSLPPNA------VTAPLLRNHELD--TFYYLGLTGISVGGDLLP 110
L D + S + F S ++ PLL N + FYY+GLTGISVGG L+
Sbjct: 251 L-SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ 309
Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
I F D S G GG+I DSGT +T L Y +RD + P D
Sbjct: 310 IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLIC 369
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNV 226
S+ P++ HF G + L +NYL + NG C++ +S +L+IIGN+
Sbjct: 370 FTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNI 429
Query: 227 QQQGTRVSFNLR-NSLIGFTP 246
Q V F+L N+ + F P
Sbjct: 430 MQMDFHVVFDLSGNARMLFQP 450
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 113 bits (282), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 92/261 (35%), Positives = 126/261 (48%), Gaps = 16/261 (6%)
Query: 1 GDFVTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G +TET T G +A+ IA GC +EG F +GL+GLG G LS +Q+N F Y
Sbjct: 191 GILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYR 250
Query: 59 LVDRDSDSTSTLEFDSSLPPNA------VTAPLLRNHELD--TFYYLGLTGISVGGDLLP 110
L D + S + F S ++ PLL N + FYY+GLTGISVGG L+
Sbjct: 251 L-SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ 309
Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
I F D S G GG+I DSGT +T L Y +RD + P D
Sbjct: 310 IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLIC 369
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNV 226
S+ P++ HF G + L +NYL + NG C++ +S +L+IIGN+
Sbjct: 370 FTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNI 429
Query: 227 QQQGTRVSFNLR-NSLIGFTP 246
Q V F+L N+ + F P
Sbjct: 430 MQMDFHVVFDLSGNARMLFQP 450
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/247 (32%), Positives = 125/247 (50%), Gaps = 12/247 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G T+T T G+ +V + GC + G F GA+G++G+G G+LS SQ+ FSY L+
Sbjct: 131 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 190
Query: 61 ----DRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
D + S + F D ++P + PLL + FYY+ LTG+ V G+ L I
Sbjct: 191 APEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIP 250
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYD 170
F + +G GG+I+ S T VT L+ Y+ +R A V L +G A D CY+
Sbjct: 251 AGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYN 309
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
SS + V+VP ++ F G + L A NY + G C P+ S++G + Q G
Sbjct: 310 ASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTG 368
Query: 231 TRVSFNL 237
T + +++
Sbjct: 369 TNMIYDV 375
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 125/258 (48%), Gaps = 16/258 (6%)
Query: 4 VTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
+TET T G +A+ IA GC +EG F +GL+GLG G LS +Q+N F Y L
Sbjct: 1 MTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRL-S 59
Query: 62 RDSDSTSTLEFDSSLPPNA------VTAPLLRNHELDT--FYYLGLTGISVGGDLLPISE 113
D + S + F S ++ PLL N + FYY+GLTGISVGG L+ I
Sbjct: 60 SDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPS 119
Query: 114 TAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
F D S G GG+I DSGT +T L Y +RD + P D
Sbjct: 120 GTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTG 179
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNVQQQ 229
S+ P++ HF G + L +NYL + NG C++ +S +L+IIGN+ Q
Sbjct: 180 GSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQM 239
Query: 230 GTRVSFNLR-NSLIGFTP 246
V F+L N+ + F P
Sbjct: 240 DFHVVFDLSGNARMLFQP 257
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/261 (30%), Positives = 124/261 (47%), Gaps = 13/261 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+ +G AS ++A GC N G+ +G+ GLG G+LS Q+ FSYCL
Sbjct: 174 GYLATETLKVGDASFPSVAFGCSTEN-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLR 232
Query: 61 DRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
+ S + F S N + P + N + ++YY+ LTGI+VG LP++ + F
Sbjct: 233 SGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTF 292
Query: 117 KIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD--FSS 173
++G GG IVDSGT +T L + Y ++ AF+ T ++ +G D C+
Sbjct: 293 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGG 352
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQ 228
+ VP++ F G +P + DS G+ C P +S+IGNV Q
Sbjct: 353 GGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQ 412
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ ++L + F P C
Sbjct: 413 MDMHLLYDLDGGIFSFAPADC 433
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/253 (35%), Positives = 123/253 (48%), Gaps = 13/253 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G +TV+ GS S GCG +NEGLF +AGL+GL LS Q+ S FSY
Sbjct: 225 GYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSY 284
Query: 58 CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
CL S + L S P P+ + + Y++ L+GISV G L + + ++
Sbjct: 285 CL-PTSSAAAGYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYR 343
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV-ALFDTCYDFSSRSS 176
+ I+DSGT +TRL Y AL A + +P ++ DTC+ S +
Sbjct: 344 SLPT-----IIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFR-GSAAG 397
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
+ VP V F G L L N LI VD + T C AFAPT + +IIGN QQQ V ++
Sbjct: 398 LRVPRVDMAFAGGATLALSPGNVLIDVD-DSTTCLAFAPTGGT-AIIGNTQQQTFSVVYD 455
Query: 237 LRNSLIGFTPNKC 249
+ S IGF C
Sbjct: 456 VAQSRIGFAAGGC 468
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/251 (32%), Positives = 119/251 (47%), Gaps = 27/251 (10%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G TET TLGS +V +A GCG N G ++GL+G+G G LS SQ+
Sbjct: 184 GVLATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLG-------- 235
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
V R S T+PL GI+VG LLPI F++
Sbjct: 236 VTRPRRSCRARAAARGGGAPTTTSPL--------------EGITVGDTLLPIDPAVFRLT 281
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVE 178
G+GG+I+DSGT T L+ + AL A R L G L C+ +S +VE
Sbjct: 282 PMGDGGVIIDSGTTFTALEERAFVALARALASRVR-LPLASGAHLGLSLCFAAASPEAVE 340
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
VP + HF +G + L ++Y++ S G C ++ +S++G++QQQ T + ++L
Sbjct: 341 VPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLE 398
Query: 239 NSLIGFTPNKC 249
++ F P KC
Sbjct: 399 RGILSFEPAKC 409
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/254 (32%), Positives = 127/254 (50%), Gaps = 12/254 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G T+T T G+ +V + GC + G F GA+G++G+G G+LS SQ+ FSY L+
Sbjct: 191 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 250
Query: 61 ----DRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
D + S + F D ++P + PLL + FYY+ LTG+ V G+ L I
Sbjct: 251 APEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIP 310
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYD 170
F + +G GG+I+ S T VT L+ Y+ +R A V L +G A D CY+
Sbjct: 311 AGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYN 369
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
SS + V+VP ++ F G + L A NY + G C P+ S++G + Q G
Sbjct: 370 ASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTG 428
Query: 231 TRVSFNLRNSLIGF 244
T + +++ + F
Sbjct: 429 TNMIYDVDAGRLTF 442
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/262 (33%), Positives = 122/262 (46%), Gaps = 27/262 (10%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
G + ET+ L +V + GCGH+ +G GLLGLGG S Q + FS
Sbjct: 221 GVYSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFS 280
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVT-------APLLRNHELDTFYYLGLTGISVGGDLL 109
YCL ++ P V P++R E TFY + +TGI+VGG+ +
Sbjct: 281 YCLPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEE--TFYVVNMTGITVGGEPI 338
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ +AF +GG+I+DSGT VT LQ YNAL+ AF R A P DTCY
Sbjct: 339 DVPPSAF------SGGMIIDSGTVVTELQHTAYNALQAAF-RKAMAAYPLVRNGELDTCY 391
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQ 227
DFS S+V +P V+ F G + L N ++ D C AF + I+GNV
Sbjct: 392 DFSGYSNVTLPKVALTFSGGATIDLDVPNGILLDD-----CLAFQESGPDDQPGILGNVN 446
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ V ++ +GF C
Sbjct: 447 QRTLEVLYDAGRGRVGFRAAVC 468
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/266 (33%), Positives = 123/266 (46%), Gaps = 33/266 (12%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
G + ++ +TLG V GC H ++G AG L LGGGS SF Q +
Sbjct: 160 GTYSSDDLTLGPYDVVRGFLFGCAHADQGSTFSYDVAGTLALGGGSQSFVQQTASQYSRV 219
Query: 55 FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVG 105
FSYC+ STS+ F ++L P V+ PLL + + TFY + L I V
Sbjct: 220 FSYCV----PPSTSSFGFIMFGVPPQRAALVPTFVSTPLLSSSTMSPTFYRVLLRSIIVA 275
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G LP+ T F ++DS T ++R+ Y ALR AF P V++
Sbjct: 276 GRPLPVPPTVFSASS------VIDSATVISRIPPTAYQALRAAFRSAMTMYRPAPPVSIL 329
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SII 223
DTCYDFS S+ +P+++ F G + L A L+ C AFAPT+S I
Sbjct: 330 DTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL------QGCLAFAPTASDRMPGFI 383
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQ+ V +++ I F C
Sbjct: 384 GNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/267 (33%), Positives = 130/267 (48%), Gaps = 26/267 (9%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAST 54
GD +T+TL S S NI IGCG NN + GA+ G++G G G SF +Q+ +ST
Sbjct: 175 GDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSST 234
Query: 55 ---FSYCLV------DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGI 102
FSYCL + S++TS L F + + VT P+L+ + +TFYYL L
Sbjct: 235 GGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKK-DPETFYYLTLEAF 293
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
SVG + I +E G II+DSGT +T L + Y+ L A V + D
Sbjct: 294 SVGNRRVEIGGVPNGDNE---GNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPT 350
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+ CY + + P ++ HF V P ++ D G FC AF +S +I
Sbjct: 351 QTLNLCYSVKAE-GYDFPIITMHFKGADVDLHPISTFVSVAD--GVFCLAFE-SSQDHAI 406
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ QQ V ++L+ ++ F P+ C
Sbjct: 407 FGNLAQQNLMVGYDLQQKIVSFKPSDC 433
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/254 (32%), Positives = 127/254 (50%), Gaps = 12/254 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G T+T T G+ +V + GC + G F GA+G++G+G G+LS SQ+ FSY L+
Sbjct: 191 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 250
Query: 61 ----DRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
D + S + F D ++P + PLL + FYY+ LTG+ V G+ L I
Sbjct: 251 APEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIP 310
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYD 170
F + +G GG+I+ S T VT L+ Y+ +R A V L +G A D CY+
Sbjct: 311 AGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYN 369
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
SS + V+VP ++ F G + L A NY + G C P+ S++G + Q G
Sbjct: 370 ASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTG 428
Query: 231 TRVSFNLRNSLIGF 244
T + +++ + F
Sbjct: 429 TNMIYDVDAGRLTF 442
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 115/255 (45%), Gaps = 22/255 (8%)
Query: 11 GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYC---LVDRDSDS 66
G + + GCGH N+G+F G+ G G G S PSQ+N ++FSYC + D S S
Sbjct: 199 GGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSS 258
Query: 67 TSTL---------EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
TL ++ + T L++N + Y++ L GISVGG + + E+ +
Sbjct: 259 VVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLR 318
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
I+DSG ++T L + Y A++ FV + G A D C+ +
Sbjct: 319 ------SSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALW 372
Query: 178 E---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
VP ++ H G LP NY+ + C + +IGN QQQ T V
Sbjct: 373 RRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVV 432
Query: 235 FNLRNSLIGFTPNKC 249
++L N ++ F P +C
Sbjct: 433 YDLENDVLSFAPARC 447
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/260 (32%), Positives = 122/260 (46%), Gaps = 15/260 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G + ++L + GCG +N+G F G +GL+GLG LS SQ FSY
Sbjct: 206 GVLAHDRLSLAGEDIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSY 265
Query: 58 CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL ++S S+ +L D+S+ N+ V ++ + FY LTGI+VGG+ +
Sbjct: 266 CLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGE--DVQ 323
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
F G G IVDSGT +T L Y A+R FV ++ DTC+D +
Sbjct: 324 SPGFS--AGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLT 381
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSLS--IIGNVQQQ 229
V+VP++ F G + + +K L V + + C A A S IIGN QQ+
Sbjct: 382 GLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQK 441
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV F+ S IGF C
Sbjct: 442 NLRVIFDTVGSQIGFAQETC 461
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 93/280 (33%), Positives = 130/280 (46%), Gaps = 37/280 (13%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQIN 51
G ET TL S + GC H +F +G AGLLGLG G S SQ
Sbjct: 213 GSLAEETFTLSPPSPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTR 272
Query: 52 AS------TFSYCLVDRDSDSTS-TLEFDSSLPPNAVT----APLLRN-HELDTFYYLGL 99
S FSYCL R S + T+ ++ P + PL+ +L + Y + L
Sbjct: 273 RSINSGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNL 332
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALS 157
G+SV G + I +AF + G ++DSGT VT + Y LRD F G+ +
Sbjct: 333 AGVSVNGAAVDIPASAFSL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKML 386
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGT----FC 210
P + L DTCYD + + V P V+ F G + + A L+ + D +G C
Sbjct: 387 PEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLAC 446
Query: 211 FAFAPTSSS-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
AF PT+S+ L I+GN+QQ+ V F++ IGF PN C
Sbjct: 447 LAFLPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGPNGC 486
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 126/262 (48%), Gaps = 35/262 (13%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
G++ +T+TL + V GCG NNEG F GA G+LGLG G LS SQ + F
Sbjct: 151 GNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVF 210
Query: 56 SYCLVDRDS----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
SYCL + DS S S+L+F S V P E +Y++ L ISVG
Sbjct: 211 SYCLPEEDSIGSLLFGEKATSQSSLKFTS-----LVNGPGTSGLEESGYYFVKLLDISVG 265
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA-- 163
L + + F + G I+DSGT +T L Y+AL AF + ++G
Sbjct: 266 NKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKK 320
Query: 164 --LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-- 219
+ DTCY+ S R V +P + HF EG + L K + D++ C AFA S S
Sbjct: 321 GDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDAS-RLCLAFAGNSKSTM 379
Query: 220 ---LSIIGNVQQQGTRVSFNLR 238
L+IIGN QQ V ++++
Sbjct: 380 NSELTIIGNRQQVSLTVLYDIQ 401
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 92/247 (37%), Positives = 122/247 (49%), Gaps = 16/247 (6%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S ++ GCG N G F GLLGLG G LS PSQ AS FSYCL +S +T
Sbjct: 247 SRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTG 305
Query: 69 TLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
L ++ + A +LR + +FY++ L I +GG +LP+ F GG
Sbjct: 306 YLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGG 360
Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFH 185
++DSGT +T L + Y LRD F +P + D CYDF+ S V VP VSF
Sbjct: 361 TLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFR 420
Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQQGTRVSFNLRNSLI 242
F +G V L +I +D N C AFA + LSIIGN QQ+ V +++ I
Sbjct: 421 FGDGAVFELDFFGVMIFLDEN-VGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKI 479
Query: 243 GFTPNKC 249
GF P C
Sbjct: 480 GFVPASC 486
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/264 (31%), Positives = 124/264 (46%), Gaps = 32/264 (12%)
Query: 12 SASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS-- 68
+A+V NI GCG N GLF +G+ G G G LS PSQ+ FSYC + S
Sbjct: 204 AAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAMEESRVSPV 263
Query: 69 -------TLEFDSSLP-------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
+E ++ P P AP+ FY+L L G++VG LP + +
Sbjct: 264 ILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQ----PFYFLSLRGVTVGETRLPFNAS 319
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-- 172
F + G+GG +DSGTA+T + +LR+AFV L G D FS
Sbjct: 320 TFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFV-AQVPLPVAKGYTDPDNLLCFSVP 378
Query: 173 -SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-----FCFA-FAPTSSSLSIIGN 225
+ + VP + H EG LP +NY++ D +G+ C + +S+ +IIGN
Sbjct: 379 AKKKAPAVPKLILHL-EGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGN 437
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
QQQ + ++L ++ + F P +C
Sbjct: 438 FQQQNMHIVYDLESNKMVFAPARC 461
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/243 (33%), Positives = 124/243 (51%), Gaps = 14/243 (5%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
S ++ GCG ++EGLF AAG+LGLG LS Q+++ FSYCL R
Sbjct: 120 SQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFL 179
Query: 69 TLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
++ +SL +A P+ + + Y+L LT I+VGG L ++ +++ I
Sbjct: 180 SIG-KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------I 232
Query: 128 VDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
+DSGT +TRL Y + AFV+ + + G ++ DTC+ + + VP V F
Sbjct: 233 IDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIF 292
Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
G L L N L+ VD G C AFA ++ ++IIGN QQQ +V+ ++ + IGF
Sbjct: 293 QGGADLNLRPVNVLLQVD-EGLTCLAFA-GNNGVAIIGNHQQQTFKVAHDISTARIGFAT 350
Query: 247 NKC 249
C
Sbjct: 351 GGC 353
>gi|147776519|emb|CAN74010.1| hypothetical protein VITISV_003547 [Vitis vinifera]
Length = 429
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 93/177 (52%), Gaps = 9/177 (5%)
Query: 77 PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTR 136
P N T PLLRN T YY+ LTG+SVG L+P++ D + G I+DSGT +TR
Sbjct: 257 PKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITR 316
Query: 137 LQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPA 196
Y A+RD F + + P + FDTC F++ + P V+FHF G L LP
Sbjct: 317 FVEPVYAAIRDEFRKQVKG--PFATIGAFDTC--FAATNEDIAPPVTFHF-TGMDLKLPL 371
Query: 197 KNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+N LI + C A A +S L++I N+QQQ R+ F++ NS +G C
Sbjct: 372 ENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 428
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 137/272 (50%), Gaps = 34/272 (12%)
Query: 1 GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS- 53
G+ +T+TL S+ S IGCG +N F GA+ G++GLGGG S +Q+ +S
Sbjct: 153 GNLSVDTLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSI 212
Query: 54 --TFSYCLVDR--DSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL+ +S++TS L F D+++ V+ P+++ + FYYL L SVG
Sbjct: 213 DAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPI-VFYYLTLEAFSVGN 271
Query: 107 DLLPISETAFKIDESGNGG----IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
+ + + S NGG II+DSGT +T + T+ YN L A + + D
Sbjct: 272 KRI-------EFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPT 324
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---- 218
LF+ CY +S + P ++ HF V P ++ D G C AFA TS+
Sbjct: 325 RLFNLCYSVTS-DGYDFPIITTHFKGADVKLHPISTFVDVAD--GIVCLAFATTSAFIPS 381
Query: 219 -SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+SI GN+ QQ V ++L+ ++ F P C
Sbjct: 382 DVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDC 413
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 124/262 (47%), Gaps = 25/262 (9%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINA---ST 54
G + ++ +TL GS V GC H G+ GL+GLGG + S SQ A +
Sbjct: 230 GTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKS 289
Query: 55 FSYCLVDRDSDS-----TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
FSYCL + S + T P+LR+ ++ T+Y+ L I+VGG L
Sbjct: 290 FSYCLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL 349
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+S + F G +VDSGT +TRL Y AL AF G + + + + DTC+
Sbjct: 350 GLSPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCF 403
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
+F+ V +PTV+ F G V+ L A + S G C AFAPT + IGNVQ
Sbjct: 404 NFTGLDKVSIPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQ 457
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ V +++ + GF C
Sbjct: 458 QRTFEVLYDVGGGVFGFRAGAC 479
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 123/283 (43%), Gaps = 51/283 (18%)
Query: 1 GDFVTETVTLGSASVD--------NIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQIN 51
G+ T+ T G + D + GCGH N+G+F G+ G G G S PSQ+N
Sbjct: 189 GEIATDRFTFGGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLN 248
Query: 52 ASTFSYCLVDRDSDSTSTLEFDSSL------PPNAV-------------TAPLLRNHELD 92
+TFSYC TS E SSL P A+ T PLL+N
Sbjct: 249 VTTFSYCF-------TSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQP 301
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
+ Y+L L GISVG L + E + I+DSG ++T L Y A++ F
Sbjct: 302 SLYFLSLKGISVGKTRLAVPEAKLR-------STIIDSGASITTLPEAVYEAVKAEFA-A 353
Query: 153 TRALSPT---DGVALFDTCYDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSN 206
L PT +G AL D C+ + VP+++ H +G LP NY+ +
Sbjct: 354 QVGLPPTGVVEGSAL-DLCFALPVTALWRRPPVPSLTLHL-DGADWELPRGNYVFEDLAA 411
Query: 207 GTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C ++IGN QQQ T V ++L N + F P +C
Sbjct: 412 RVMCVVLDAAPGDQTVIGNFQQQNTHVVYDLENDWLSFAPARC 454
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 123/258 (47%), Gaps = 21/258 (8%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TF 55
G ++T+T+T+ G+ +V N GC H G F AG + LGGG+ S +Q S F
Sbjct: 230 GTYMTDTLTISGTTAVRNFRFGCSHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAF 289
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAV--TAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
SYC+ + ++ ++ V T PL+R+ + Y + L GI V G L I
Sbjct: 290 SYCVPQASASGFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPP 349
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
AF + G ++DS +T+L Y ALR AF RA + DTCYDF
Sbjct: 350 VAF------SAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLG 403
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI--IGNVQQQGT 231
++V VP VS F G V+ L +I C AF TSS L++ IGNVQQQ
Sbjct: 404 LTNVRVPAVSLVFGGGAVVVLDPPAVMI------GGCLAFTATSSDLALGFIGNVQQQTH 457
Query: 232 RVSFNLRNSLIGFTPNKC 249
V +++ +GF C
Sbjct: 458 EVLYDVAAGGVGFRRGAC 475
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 73/197 (37%), Positives = 100/197 (50%), Gaps = 13/197 (6%)
Query: 13 ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
ASV +A GCG N G+F G+ G G G LS PSQ+ FS+C + ST+
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVL 248
Query: 72 FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
D LP + + PL++N TFYYL L GI+VG LP+ E+ F + ++G
Sbjct: 249 LD--LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGT 305
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
GG I+DSGTA+T L T Y +RDAF + + C R+ VP +
Sbjct: 306 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365
Query: 184 FHFPEGKVLPLPAKNYL 200
HF EG + LP +NY+
Sbjct: 366 LHF-EGATMDLPRENYV 381
>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
Length = 191
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 86/164 (52%), Gaps = 1/164 (0%)
Query: 87 RNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
+ + L+TFYY+ + + VGG++L I E + + G GG I+DSGT ++ Y ++
Sbjct: 25 KENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIK 84
Query: 147 DAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN 206
AFV + D + CY+ S +E+P+ F +G + P +NY I ++
Sbjct: 85 QAFVNKVKRYPILDDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEPE 144
Query: 207 GTFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C A T S++SIIGN QQQ + ++ + S +GF P +C
Sbjct: 145 DIVCLAILGTPHSAMSIIGNYQQQNFHILYDTKRSRLGFAPRRC 188
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/271 (33%), Positives = 130/271 (47%), Gaps = 24/271 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+T+G + +A GC N ++G++GLG G LS SQ+ FSYCL
Sbjct: 184 GYLATETLTVGDGTFPKVAFGCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLR 241
Query: 61 DRDSD-STSTLEFDS--SLPPNAV--TAPLLRNHELD--TFYYLGLTGISVGGDLLPISE 113
+D S + F S L +V + PLL+N L T YY+ LTGI+V LP++
Sbjct: 242 SDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTG 301
Query: 114 TAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVAL-FDTC 168
+ F ++G GG IVDSGT +T L + Y ++ AF L +P G D C
Sbjct: 302 STFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLC 361
Query: 169 YDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYL--IPVDSNGTF---CFAFAPTSSSL 220
Y S+ +V VP ++ F G +P +NY + DS G C P + L
Sbjct: 362 YKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421
Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN+ Q + +++ + F P C
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 129/263 (49%), Gaps = 23/263 (8%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
G+ ET+TL S S IGCGHNN G+F G +G++GLG G +S +Q+ +S
Sbjct: 175 GELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSI 234
Query: 54 --TFSYCLVDR--DSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL+ DS+ TS L F D+++ V+ P ++ + FYYL L SVG
Sbjct: 235 GGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKK-DPQAFYYLTLEAFSVGN 293
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+ +D+S G II+DSGT +T L + Y L A + + D L +
Sbjct: 294 KRIEFEV----LDDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLN 349
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
CY +S + P ++ HF + P + D G C AF +S + I GN+
Sbjct: 350 LCYSITS-DQYDFPIITAHFKGADIKLNPISTFAHVAD--GVVCLAFT-SSQTGPIFGNL 405
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q V ++L+ +++ F P+ C
Sbjct: 406 AQLNLLVGYDLQQNIVSFKPSDC 428
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/257 (35%), Positives = 128/257 (49%), Gaps = 17/257 (6%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G +TV+L S+ S GCG +N GLF AAGL+GL LS SQ+ S +F+
Sbjct: 202 GYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFA 261
Query: 57 YCLVDRDSDSTSTLEFDS---SLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPIS 112
YCL + S L F S + P + + + LD + Y++ L G+SV G L +
Sbjct: 262 YCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVP 321
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+ E G+ I+DSGT +TRL T Y AL A V A ++ TC+
Sbjct: 322 SS-----EYGSLPTIIDSGTVITRLPTPVYTALSKA-VGAALAAPSAPAYSILQTCFK-G 374
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
+ + VP V+ F G L L N L+ V+ T C AFAPT S+ +IIGN QQQ
Sbjct: 375 QVAKLPVPAVNMAFAGGATLRLTPGNVLVDVNET-TTCLAFAPTDST-AIIGNTQQQTFS 432
Query: 233 VSFNLRNSLIGFTPNKC 249
V ++++ S IGF C
Sbjct: 433 VVYDVKGSRIGFAAGGC 449
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 90/271 (33%), Positives = 130/271 (47%), Gaps = 24/271 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+T+G + +A GC N ++G++GLG G LS SQ+ FSYCL
Sbjct: 184 GYLATETLTVGDGTFPKVAFGCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLR 241
Query: 61 DRDSD-STSTLEFDS--SLPPNAV--TAPLLRNHELD--TFYYLGLTGISVGGDLLPISE 113
+D S + F S L +V + PLL+N L T YY+ LTGI+V LP++
Sbjct: 242 SDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTG 301
Query: 114 TAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVAL-FDTC 168
+ F ++G GG IVDSGT +T L + Y ++ AF L +P G D C
Sbjct: 302 STFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLC 361
Query: 169 YDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYL--IPVDSNGTF---CFAFAPTSSSL 220
Y S+ +V VP ++ F G +P +NY + DS G C P + L
Sbjct: 362 YKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421
Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN+ Q + +++ + F P C
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 130/286 (45%), Gaps = 53/286 (18%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
G F ET +L ++S + ++A GCG G F GA G++GLG G +SF SQ
Sbjct: 180 GLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQ 239
Query: 50 IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVT--------------APLLRNHELD 92
+ + FSYCL+D + S PP + PLL N
Sbjct: 240 LGRRFGNKFSYCLMD----------YTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSP 289
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
TFYY+ L + V G L I + ++ID+SGNGG +VDSGT + L Y ++ A R
Sbjct: 290 TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRR 349
Query: 153 TR-----ALSPTDGVALFDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDS 205
+ AL+P FD C + S + E +P + F F G V P +NY I +
Sbjct: 350 VKLPIADALTPG-----FDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEE 404
Query: 206 NGTFCFAFAPTSSSL--SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C A + S+IGN+ QQG F+ S +GF+ C
Sbjct: 405 Q-IQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 126/259 (48%), Gaps = 15/259 (5%)
Query: 5 TETVTLGSA------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
TET+TLGS+ SV ++A GCG +N G + + G +GLG G+LS +Q+ FSYC
Sbjct: 171 TETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYC 230
Query: 59 LVD---RDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
L D DS L + L P + PLL++ + Y + L GI++G LPI
Sbjct: 231 LTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIP 290
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
F + + GG++VDSGT + L + + D V P + +L C+
Sbjct: 291 NKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVD-HVAQVLGQPPVNASSLDSPCFPAP 349
Query: 173 S--RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
+ R +P + HF G + L NY+ + +FC T+S+ S++GN QQQ
Sbjct: 350 AGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQN 409
Query: 231 TRVSFNLRNSLIGFTPNKC 249
++ F++ + F P C
Sbjct: 410 IQMLFDMTVGQLSFLPTDC 428
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 88/263 (33%), Positives = 125/263 (47%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G + + L ++ GCG +N+G F G +GL+GLG +S SQ FS
Sbjct: 216 GVLARDKLRLAGQDIEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFS 275
Query: 57 YCLVDRDSDSTSTLEF--DSSL----PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLL 109
YCL R+S S+ +L DSS P TA + + L FY+L LTGI+VGG
Sbjct: 276 YCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQ-- 333
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ F G +I+DSGT +T L YNA+R F+ ++ DTC+
Sbjct: 334 EVESPWFSA-----GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCF 388
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNV 226
+ + V+VP++ F F + + +K L V S+ + C A A S SIIGN
Sbjct: 389 NLTGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNY 448
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV F+ S IGF C
Sbjct: 449 QQKNLRVIFDTLGSQIGFAQETC 471
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 88/260 (33%), Positives = 125/260 (48%), Gaps = 27/260 (10%)
Query: 17 NIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF- 72
++A GC G GA+G++GLG G+LS SQ+ + FSYCL S ST+T
Sbjct: 178 SLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTSRLF 237
Query: 73 ------DSSLPPNAVTAPLLRNHELD---TFYYLGLTGISVGGDLLPISETAFKIDESGN 123
SS A + P L+N ++D TFYYL LTGI+VG L + E AF + +
Sbjct: 238 VGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVAT 297
Query: 124 G---GIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDFSSRSSVE 178
G G ++DSG+ T L Y ALRD V+ G + P G D C + +
Sbjct: 298 GLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGK 357
Query: 179 -VPTVSFHFPE-GKVLPLPAKNYLIPVDSNGTFCFAFA---PTSS----SLSIIGNVQQQ 229
VP + HF G + +P +NY PVD + F+ P S+ +IIGN QQ
Sbjct: 358 LVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQ 417
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
+ ++L ++ F P C
Sbjct: 418 DMHLLYDLEKGMLSFQPADC 437
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 121/261 (46%), Gaps = 21/261 (8%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
++T+ LG ++ A GC G + GLLGLG G +S SQ ++ FSYCL
Sbjct: 175 SDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCL 234
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P N PLL N + YY+ +TG+SVG + +
Sbjct: 235 -----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVP 289
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+F D + G ++DSGT +TR Y ALR+ F R A S + FDTC++
Sbjct: 290 AGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTD 349
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSIIGNVQQ 228
++ P V+ H G L LP +N LI + C A A ++++ N+QQ
Sbjct: 350 EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQ 409
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q RV ++ S +GF C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 87/268 (32%), Positives = 131/268 (48%), Gaps = 41/268 (15%)
Query: 5 TETVTLGS------ASVDNIAIGCG-HNNEGLFVG--AAGLLGLGGGSLSFPSQINAS-- 53
TET++ GS S N GCG NN ++ G+ GLG G LS SQ+ A
Sbjct: 183 TETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG 242
Query: 54 -TFSYCLVDRDSDSTSTLEFDSS--LPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
FSYCL+ DS STS L+F S + N V+ PL+ L T+Y+L L +++G ++
Sbjct: 243 HKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVV 302
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD--- 166
+T +G I++DSGT +T L+ YN +L T GV L
Sbjct: 303 STGQT--------DGNIVIDSGTPLTYLENTFYNNF-------VASLQETLGVKLLQDLP 347
Query: 167 ----TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLS 221
TC F +R+++ +P ++F F G + L KN LIP+ + C A P+S +S
Sbjct: 348 SPLKTC--FPNRANLAIPDIAFQF-TGASVALRPKNVLIPLTDSNILCLAVVPSSGIGIS 404
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ G++ Q +V ++L + F P C
Sbjct: 405 LFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 121/247 (48%), Gaps = 8/247 (3%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
TE T G +D + GCG N G F G +G++GLG G+LS SQ+ FSY DS
Sbjct: 186 TEAFTFGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDS 245
Query: 65 -DSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-D 119
D+ S + F P + ++ LL + + YY+ L GI V G L I F + +
Sbjct: 246 VDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRN 305
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVE 178
+ G+GG+ + VT L+ Y LR A V L +G AL D CY S + +
Sbjct: 306 KDGSGGVFLSITDLVTVLEEAAYKPLRQA-VASKIGLPAVNGSALGLDLCYTGESLAKAK 364
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQGTRVSFNL 237
VP+++ F G V+ L NY + G C P+S+ S++G++ Q GT + +++
Sbjct: 365 VPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDI 424
Query: 238 RNSLIGF 244
S + F
Sbjct: 425 NGSKLVF 431
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 121/247 (48%), Gaps = 8/247 (3%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
TE T G +D + GCG N G F G +G++GLG G+LS SQ+ FSY DS
Sbjct: 190 TEAFTFGDTRIDGVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDS 249
Query: 65 -DSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-D 119
D+ S + F P + ++ LL + + YY+ L GI V G L I F + +
Sbjct: 250 VDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRN 309
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVE 178
+ G+GG+ + VT L+ Y LR A V L +G AL D CY S + +
Sbjct: 310 KDGSGGVFLSITDLVTVLEEAAYKPLRQA-VASKIGLPAVNGSALGLDLCYTGESLAKAK 368
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQGTRVSFNL 237
VP+++ F G V+ L NY + G C P+S+ S++G++ Q GT + +++
Sbjct: 369 VPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDI 428
Query: 238 RNSLIGF 244
S + F
Sbjct: 429 NGSKLVF 435
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 87/257 (33%), Positives = 131/257 (50%), Gaps = 23/257 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
G + ET+T+ +V + GCGH+ +G GLLGLGG S Q + FS
Sbjct: 218 GVYSNETLTMAPGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFS 277
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
YCL + D L + + + V P++R E TFY + +TGI+VGG+ + + +
Sbjct: 278 YCLPAAN-DQAGFLALGAPVNDASGFVFTPMVR--EQQTFYVVNMTGITVGGEPIDVPPS 334
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
AF +GG+I+DSGT VT LQ Y AL+ AF R A P DTCY+F+
Sbjct: 335 AF------SGGMIIDSGTVVTELQHTAYAALQAAF-RKAMAAYPLLPNGELDTCYNFTGH 387
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNVQQQGTR 232
S+V VP V+ F G + L + ++ +D+ C AF A + I+GNV Q+
Sbjct: 388 SNVTVPRVALTFSGGATVDLDVPDGIL-LDN----CLAFQEAGPDNQPGILGNVNQRTLE 442
Query: 233 VSFNLRNSLIGFTPNKC 249
V +++ + +GF + C
Sbjct: 443 VLYDVGHGRVGFGADAC 459
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 87/271 (32%), Positives = 126/271 (46%), Gaps = 23/271 (8%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
G F ET +L + A + ++A GCG G F GA G++GLG G +SF SQ
Sbjct: 179 GLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQ 238
Query: 50 IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGI 102
+ + FSYCL+D T +AV+ PLL N TFYY+ L +
Sbjct: 239 LGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSV 298
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
V G L I + ++ID+SGNGG ++DSGT + L Y + A + + + +
Sbjct: 299 FVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELT 358
Query: 163 ALFDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
FD C + S + E +P + F F G V P +NY I + C A +
Sbjct: 359 PGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKV 417
Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+IGN+ QQG F+ S +GF+ C
Sbjct: 418 GFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 104/207 (50%), Gaps = 22/207 (10%)
Query: 54 TFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
TFSYCL S +L F +L P T PLL N + YY+ +TGI VG
Sbjct: 250 TFSYCL-----PSFKSLNFSGTLRLGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGK 304
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
++PI A D + G ++DSGT TRL Y A+RD R R +P + FD
Sbjct: 305 KVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRG-APLSSLGGFD 363
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSI 222
TCY+ ++V+ P V+F F G + LPA N +I T C A A ++ L++
Sbjct: 364 TCYN----TTVKWPPVTFMF-TGMQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTVLNV 418
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I ++QQQ R+ F++ N +GF +C
Sbjct: 419 IASMQQQNHRILFDVPNGRVGFAREQC 445
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 122/267 (45%), Gaps = 38/267 (14%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
G + ET+T +V + GCGH+ G GLLGLGG S Q + FS
Sbjct: 219 GVYSNETITFAPGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFS 278
Query: 57 YCLVD------------RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
YCL R S +T+T F V P+ T Y + +TGISV
Sbjct: 279 YCLPALNSEAGFLALGVRPSAATNTSAF--------VFTPMWHLPMDATSYMVNMTGISV 330
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
GG L I +AF+ GG+++DSGT VT L YNAL +A +R A P
Sbjct: 331 GGKPLDIPRSAFR------GGMLIDSGTIVTELPETAYNAL-NAALRKAFAAYPMVASED 383
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSI 222
FDTCY+F+ S+V VP V+ F G + L N ++ D C AF + L I
Sbjct: 384 FDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILVKD-----CLAFRESGPDVGLGI 438
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGNV Q+ V ++ + +GF C
Sbjct: 439 IGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 88/263 (33%), Positives = 126/263 (47%), Gaps = 19/263 (7%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
G ET+TL S + I GCGHNN G F G++GLGGG +SF SQI +S
Sbjct: 113 GVLAQETITLSSTKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSF 172
Query: 54 ---TFSYCLVDRDSD----STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FS CLV +D S +L S + V + L + T Y++ L GISVG
Sbjct: 173 GGKRFSQCLVPFHTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGN 232
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
L + ++ + E GN + +DSGT T L T+ Y+ L A VR A+ P
Sbjct: 233 TYLHFNGSSSQSVEKGN--VFLDSGTPPTILPTQLYDRLV-AQVRSEVAMKPVTNDLDLG 289
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
+ +++++ P ++ HF G V LP + ++ P D G FC F TSS + GN
Sbjct: 290 PQLCYRTKNNLRGPVLTAHFEGGDVKLLPTQTFVSPKD--GVFCLGFTNTSSDGGVYGNF 347
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q + F+L ++ F P C
Sbjct: 348 AQSNYLIGFDLDRQVVSFKPMDC 370
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 120/261 (45%), Gaps = 21/261 (8%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
++T+ LG ++ A GC G + GLLGLG G +S SQ + FSYCL
Sbjct: 175 SDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCL 234
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P N PLL N + YY+ +TG+SVG + +
Sbjct: 235 -----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVP 289
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+F D + G ++DSGT +TR Y ALR+ F R A S + FDTC++
Sbjct: 290 AGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTD 349
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSIIGNVQQ 228
++ P V+ H G L LP +N LI + C A A ++++ N+QQ
Sbjct: 350 EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQ 409
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q RV ++ S +GF C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 120/261 (45%), Gaps = 21/261 (8%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
++T+ LG ++ A GC G + GLLGLG G +S SQ + FSYCL
Sbjct: 175 SDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCL 234
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P N PLL N + YY+ +TG+SVG + +
Sbjct: 235 -----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVP 289
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+F D + G ++DSGT +TR Y ALR+ F R A S + FDTC++
Sbjct: 290 AGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTD 349
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSIIGNVQQ 228
++ P V+ H G L LP +N LI + C A A ++++ N+QQ
Sbjct: 350 EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQ 409
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q RV ++ S +GF C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/260 (35%), Positives = 126/260 (48%), Gaps = 20/260 (7%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G +T+ LG+ + +D GCG +N GLF G AGL+GLG LS SQ A FS
Sbjct: 282 GVLAQDTLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFS 341
Query: 57 YCLVDRDSDSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL + ST +L SS PN ++ + FY++ +TG +VGG ++
Sbjct: 342 YCL-PATTTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAA-LTA 399
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFS 172
F G G ++VDSGT +TRL Y A+R F R R P G ++ D CYD +
Sbjct: 400 PGF-----GAGNVLVDSGTVITRLAPSVYKAVRAEFAR--RFEYPAAPGFSILDACYDLT 452
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQ 229
R V VP ++ G + + A L V +G+ C A A P IIGN QQ+
Sbjct: 453 GRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQR 512
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV ++ S +GF C
Sbjct: 513 NKRVVYDTVGSRLGFADEDC 532
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 96/199 (48%), Gaps = 14/199 (7%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G ET T G+A+ NIA GCG N G ++G++G G G LS SQ+ S F
Sbjct: 176 GVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF 235
Query: 56 SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
SYCL S + S L F ++S + P + N L Y+L L IS+G
Sbjct: 236 SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGT 295
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
LLPI F I++ G GG+I+DSGT++T LQ + Y A+R V + D D
Sbjct: 296 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLD 355
Query: 167 TCYDFSSRSSVEVPTVSFH 185
TC+ + +V V F
Sbjct: 356 TCFQWPPPPNVTVTVPDFR 374
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/264 (36%), Positives = 125/264 (47%), Gaps = 29/264 (10%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYC 58
+ +T+ L + V GC G V GLLGLG G LSF SQ + STFSYC
Sbjct: 174 NLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYC 233
Query: 59 LVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
L S TL F +L P T PLL+N + YY+ L GI VG ++ I
Sbjct: 234 L-----PSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDI 288
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
+A + + G I DSGT TRL Y A+RD F + G +S G FDTCY
Sbjct: 289 PASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG---FDTCY 345
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
+ PT++F F G + LP N LI + T C A A +S L++I N
Sbjct: 346 T----GPIVAPTMTFMF-SGMNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIAN 400
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ R+ F++ NS IG C
Sbjct: 401 MQQQNHRILFDVPNSRIGVAREPC 424
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/264 (36%), Positives = 125/264 (47%), Gaps = 29/264 (10%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYC 58
+ +T+ L + V GC G V GLLGLG G LSF SQ + STFSYC
Sbjct: 174 NLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYC 233
Query: 59 LVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
L S TL F +L P T PLL+N + YY+ L GI VG ++ I
Sbjct: 234 L-----PSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDI 288
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
+A + + G I DSGT TRL Y A+RD F + G +S G FDTCY
Sbjct: 289 PASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG---FDTCY 345
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
+ PT++F F G + LP N LI + T C A A +S L++I N
Sbjct: 346 T----GPIVAPTMTFMF-SGMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIAN 400
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ R+ F++ NS IG C
Sbjct: 401 MQQQNHRILFDVPNSRIGVAREPC 424
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/267 (33%), Positives = 135/267 (50%), Gaps = 29/267 (10%)
Query: 1 GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAST 54
GD ET+TLGS + ++ IGCG NN F G ++G++GLG G +S +Q+ +
Sbjct: 176 GDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRS 235
Query: 55 ------FSYCLVDRDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCL S+ +S L F D+++ V+ P++ H+ FYYL L SVG
Sbjct: 236 SSIGRKFSYCLASM-SNISSKLNFGDAAVVSGDGTVSTPIV-THDPKVFYYLTLEAFSVG 293
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
+ + + ++F+ E GN II+DSGT +T L + Y+ L A V R P +
Sbjct: 294 NNRIEFTSSSFRFGEKGN--IIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQL 351
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+L CY S+ + P + HF G + L A N I V+ G C AF +S I
Sbjct: 352 SL---CYR-STFDELNAPVIMAHF-SGADVKLNAVNTFIEVE-QGVTCLAFI-SSKIGPI 404
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ QQ V ++L+ ++ F P C
Sbjct: 405 FGNMAQQNFLVGYDLQKKIVSFKPTDC 431
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/265 (35%), Positives = 123/265 (46%), Gaps = 26/265 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
V +TVTL + V A GC G V GLLGLG G LS +Q + STFSY
Sbjct: 182 ASLVQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSY 241
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S TL F SL P PLL+N + YY+ L I VG ++
Sbjct: 242 CL-----PSFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVD 296
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
I A + + G + DSGT TRL YNA+R+ F R +L FDTC
Sbjct: 297 IPPEALAFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTC 356
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIG 224
Y + + PT++F F G + LP N LI + C A AP +S L++I
Sbjct: 357 YT----APIVAPTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIA 411
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N+QQQ RV F++ NS +G C
Sbjct: 412 NMQQQNHRVLFDVPNSRLGVARELC 436
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/261 (35%), Positives = 121/261 (46%), Gaps = 25/261 (9%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TF 55
G ++ + +TL S V N GC H G F + +G + LGGG S SQ A+ F
Sbjct: 240 GTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAF 299
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHEL-DTFYYLGLTGISVGGDLLPIS 112
SYC+ D S +L + A PL+RN + T Y + L GI VGG L +
Sbjct: 300 SYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVP 359
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGVALFDTCYD 170
F GG ++DS +T+L Y ALR AF R A P G A DTCYD
Sbjct: 360 PVVFA------GGAVMDSSVIITQLPPTAYRALRLAF-RSAMAAYPRVAGGRAGLDTCYD 412
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQ 228
F +SV VP VS F G V+ L A ++ C AF PT +L IGNVQQ
Sbjct: 413 FVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFALGFIGNVQQ 466
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V +++ +GF C
Sbjct: 467 QTHEVLYDVGGGSVGFRRGAC 487
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 33/264 (12%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL S++V GCGH GLF G GLLGLG S Q + FS
Sbjct: 141 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 200
Query: 57 YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL + S + T + S P T LL + T+Y + LTGISVGG L +
Sbjct: 201 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 260
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
+AF VD+GT VTRL Y ALR AF G + +P++G+ DTCY
Sbjct: 261 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 312
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQ 227
+F+ +V +P V+ F G + L A L S G C AFAP+ S ++I+GNVQ
Sbjct: 313 NFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG--CLAFAPSGSDGGMAILGNVQ 366
Query: 228 QQGTRVSFNLR--NSLIGFTPNKC 249
Q+ SF +R + +GF P+ C
Sbjct: 367 QR----SFEVRIDGTSVGFKPSSC 386
>gi|300078619|gb|ADJ67210.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
Length = 84
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/84 (60%), Positives = 63/84 (75%), Gaps = 1/84 (1%)
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
DTC+D S ++ V+VPTV+ HF G + LPA NYLIPVDS+G+FCFAFA T S LSIIGN
Sbjct: 1 DTCFDLSGKTEVKVPTVALHF-RGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGN 59
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQG RV ++L S +GF P C
Sbjct: 60 IQQQGFRVVYDLAGSRVGFAPRGC 83
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/261 (35%), Positives = 121/261 (46%), Gaps = 25/261 (9%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TF 55
G ++ + +TL S V N GC H G F + +G + LGGG S SQ A+ F
Sbjct: 224 GTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAF 283
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHEL-DTFYYLGLTGISVGGDLLPIS 112
SYC+ D S +L + A PL+RN + T Y + L GI VGG L +
Sbjct: 284 SYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVP 343
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGVALFDTCYD 170
F GG ++DS +T+L Y ALR AF R A P G A DTCYD
Sbjct: 344 PVVFA------GGAVMDSSVIITQLPPTAYRALRLAF-RSAMAAYPRVAGGRAGLDTCYD 396
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQ 228
F +SV VP VS F G V+ L A ++ C AF PT +L IGNVQQ
Sbjct: 397 FVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFALGFIGNVQQ 450
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V +++ +GF C
Sbjct: 451 QTHEVLYDVGGGSVGFRRGAC 471
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 124/262 (47%), Gaps = 20/262 (7%)
Query: 1 GDFVTETVTLGSA-------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA- 52
GD +T+TL + +V GCGH+N G F GLLGLG G S PSQ+ A
Sbjct: 233 GDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAAR 292
Query: 53 --STFSYCLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
+ FSYCL S + L F ++ NA ++ + T YYL LTGI V G +
Sbjct: 293 YGAAFSYCLPSSPS-AAGYLSFGGAAARANAQFTEMVTGQD-PTSYYLNLTGIVVAGRAI 350
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDT 167
+ +AF G I+DSGTA +RL Y ALR +F G +FDT
Sbjct: 351 KVPASAFAT----AAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDT 406
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
CYDF+ +V +P V F +G + L L + C AF P + L I+GN Q
Sbjct: 407 CYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTCLAFVP-NHDLGILGNTQ 465
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ V +++ + IGF C
Sbjct: 466 QRTLAVIYDVGSQRIGFGRKGC 487
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 33/264 (12%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL S++V GCGH GLF G GLLGLG S Q + FS
Sbjct: 79 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 138
Query: 57 YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL + S + T + S P T LL + T+Y + LTGISVGG L +
Sbjct: 139 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 198
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
+AF VD+GT VTRL Y ALR AF G + +P++G+ DTCY
Sbjct: 199 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 250
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQ 227
+F+ +V +P V+ F G + L A L S G C AFAP+ S ++I+GNVQ
Sbjct: 251 NFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG--CLAFAPSGSDGGMAILGNVQ 304
Query: 228 QQGTRVSFNLR--NSLIGFTPNKC 249
Q+ SF +R + +GF P+ C
Sbjct: 305 QR----SFEVRIDGTSVGFKPSSC 324
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 35/265 (13%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL S++V GCGH GLF G GLLGLG S Q + FS
Sbjct: 233 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 292
Query: 57 YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL + S + T + S P T LL + T+Y + LTGISVGG L +
Sbjct: 293 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 352
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
+AF VD+GT VTRL Y ALR AF G + +P++G+ DTCY
Sbjct: 353 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 404
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
+F+ +V +P V+ F G + L A L +F C AFAP+ S ++I+GNV
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 457
Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
QQ+ SF +R + +GF P+ C
Sbjct: 458 QQR----SFEVRIDGTSVGFKPSSC 478
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 35/265 (13%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T+TL S++V GCGH GLF G GLLGLG S Q + FS
Sbjct: 233 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 292
Query: 57 YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL + S + T + S P T LL + T+Y + LTGISVGG L +
Sbjct: 293 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 352
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
+AF VD+GT VTRL Y ALR AF G + +P++G+ DTCY
Sbjct: 353 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 404
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
+F+ +V +P V+ F G + L A L +F C AFAP+ S ++I+GNV
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 457
Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
QQ+ SF +R + +GF P+ C
Sbjct: 458 QQR----SFEVRIDGTSVGFKPSSC 478
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/260 (35%), Positives = 122/260 (46%), Gaps = 24/260 (9%)
Query: 4 VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLV 60
V +T+TL + + GC + G GLLGLG G LS SQ + STFSYCL
Sbjct: 184 VQDTLTLAADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCL- 242
Query: 61 DRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
S ++ F SL P PLLRN + YY+ L I VG ++ I
Sbjct: 243 ----PSFKSINFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPP 298
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
A + + G I DSGT TRL Y A+R+ F R P + FDTCY+
Sbjct: 299 AALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNV-- 356
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNVQQQ 229
+ VPT++F F G + LP N +I + T C A A +S L++I N+QQQ
Sbjct: 357 --PIVVPTITFLF-SGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQ 413
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV F++ NS IG C
Sbjct: 414 NHRVLFDVPNSRIGIARELC 433
>gi|300078594|gb|ADJ67200.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
Length = 84
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/84 (60%), Positives = 63/84 (75%), Gaps = 1/84 (1%)
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
DTC+D S ++ V+VPTV+ HF G + LPA NYLIPVDS+G+FCFAFA T S LSIIGN
Sbjct: 1 DTCFDLSGKTEVKVPTVALHF-RGVDVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGN 59
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQG RV ++L S +GF P C
Sbjct: 60 IQQQGFRVVYDLAGSRVGFAPRGC 83
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 120/273 (43%), Gaps = 27/273 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++ET+ L V N +GC + AG+ G G G S PSQ+ FSYCL+
Sbjct: 197 GIMLSETLDLPGKGVPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQLGLKKFSYCLL 253
Query: 61 DRDSDST---STLEFDSSLPPNAVTA-----PLLRN------HELDTFYYLGLTGISVGG 106
R D T S+L D TA P ++N H +YYLGL I+VGG
Sbjct: 254 SRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGG 313
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVAL 164
+ I G+GG I+DSGT T ++ E + + F + ++ T +G+
Sbjct: 314 KHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITG 373
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS--- 221
C++ S ++ P ++ F G + LP NY+ + + C ++
Sbjct: 374 LRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFS 433
Query: 222 -----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN QQQ V ++LRN +GF C
Sbjct: 434 GGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 130/267 (48%), Gaps = 25/267 (9%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-----VGAAGLLGLGGGSLSFPSQI 50
GD ++T+T+GS AS IA GCGH+N G F G + S++
Sbjct: 183 GDLSSDTLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEV 242
Query: 51 NASTFSYCLVDRDSDST--STLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLV SDST S + F S V+ PL++ DTFYYL L G+SVG
Sbjct: 243 GGQ-FSYCLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTP-DTFYYLTLEGLSVG 300
Query: 106 GDLLPI---SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
+ + SE G II+DSGT +T L + Y + A + TD
Sbjct: 301 SETVAFKGFSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPN 360
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+F CY SS +++E+PT++ HF G + LP N + V + CF+ P SS+L+I
Sbjct: 361 GIFSLCY--SSVNNLEIPTITAHF-TGADVQLPPLNTFVQVQED-LVCFSMIP-SSNLAI 415
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ Q V ++L+N+ + F C
Sbjct: 416 FGNLAQINFLVGYDLKNNKVSFKQTDC 442
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/260 (35%), Positives = 122/260 (46%), Gaps = 24/260 (9%)
Query: 4 VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLV 60
V +T+TL + + GC + G GLLGLG G LS SQ + STFSYCL
Sbjct: 184 VQDTLTLATDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCL- 242
Query: 61 DRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
S ++ F SL P PLLRN + YY+ L I VG ++ I
Sbjct: 243 ----PSFKSINFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPP 298
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
A + + G I DSGT TRL Y A+R+ F R P + FDTCY+
Sbjct: 299 AALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNV-- 356
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNVQQQ 229
+ VPT++F F G + LP N +I + T C A A +S L++I N+QQQ
Sbjct: 357 --PIVVPTITFLF-SGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQ 413
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
RV F++ NS IG C
Sbjct: 414 NHRVLFDVPNSRIGIARELC 433
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/259 (33%), Positives = 129/259 (49%), Gaps = 18/259 (6%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
GD + ETVTLGS IGC N F + G++GLGGG +S Q+++S
Sbjct: 178 GDLIVETVTLGSYNDPFVHFPRTVIGCIRNTNVSF-DSIGIVGLGGGPVSLVPQLSSSIS 236
Query: 54 -TFSYCLVDRDSDSTSTLEF-DSSLPP-NAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
FSYCL SD +S L+F D+++ + + + + FYYL L SVG + +
Sbjct: 237 KKFSYCLAPI-SDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIE 295
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
++ + SG G II+DSGT T L + Y+ L A + D + F CY
Sbjct: 296 FRSSSSR--SSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYK 353
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S+ V+VP ++ HF G + L A N I V S+ C AF +S S +I GN+ QQ
Sbjct: 354 -STYDKVDVPVITAHF-SGADVKLNALNTFI-VASHRVVCLAFL-SSQSGAIFGNLAQQN 409
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V ++L+ ++ F P C
Sbjct: 410 FLVGYDLQRKIVSFKPTDC 428
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 94/267 (35%), Positives = 134/267 (50%), Gaps = 29/267 (10%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
GD +T+TL S S NI IGCGH N+G G +G +GLG G LSF SQ+N+S
Sbjct: 179 GDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSI 238
Query: 54 --TFSYCLVDRDSDS--TSTLEF-DSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV S+ + L F D S+ V+ P+ + Y L +SVG
Sbjct: 239 GGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAG---EIGYSTTLNALSVGD 295
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD---AFVRGTRALSPTDGVA 163
++ + K D GN I+DSGT +T L Y+ L + V+ RA SP
Sbjct: 296 HIIKFENSTSKNDNLGN--TIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQ-- 351
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSI 222
F CY ++ +++VP ++ HF G + L + N P+D + CFAF + +I
Sbjct: 352 -FKLCYK-ATLKNLDVPIITAHF-NGADVHLNSLNTFYPID-HEVVCFAFVSVGNFPGTI 407
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGN+ QQ V F+L+ ++I F P C
Sbjct: 408 IGNIAQQNFLVGFDLQKNIISFKPTDC 434
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/265 (36%), Positives = 137/265 (51%), Gaps = 27/265 (10%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
G+ +T+TLGS + NI IGCGHNN G F +G++GLGGG++S +Q+ S
Sbjct: 184 GNIAVDTLTLGSTDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSI 243
Query: 54 --TFSYCLV--DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV ++D TS + F ++ + V+ PL+ + +TFYYL L ISVG
Sbjct: 244 DGKFSYCLVPLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQ-ETFYYLTLKSISVGS 302
Query: 107 DLL--PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
+ P S++ SG G II+DSGT +T L TE Y+ L DA A D
Sbjct: 303 KEVQYPGSDSG-----SGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTG 357
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
CY S+ ++VP ++ HF +G + L N + + S CFAF S S SI G
Sbjct: 358 LSLCY--SATGDLKVPAITMHF-DGADVNLKPSNCFVQI-SEDLVCFAFR-GSPSFSIYG 412
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
NV Q V ++ + + F P C
Sbjct: 413 NVAQMNFLVGYDTVSKTVSFKPTDC 437
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 123/268 (45%), Gaps = 27/268 (10%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS 53
G F+ ++ T G +V +I GCG N G F+ G+ G G G LS PSQ+
Sbjct: 180 GHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVR 239
Query: 54 TFSYCLVDRDSDSTSTL------EFDSSLPPNAVTAPLLRNHELDT---FYYLGLTGISV 104
FSYC R +S + + + ++ P +R+ T Y L G++V
Sbjct: 240 QFSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTV 299
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
G LP+ E I G+G +DSGT +T + L+ AF+ +A P + A
Sbjct: 300 GKTRLPVPE----IKADGSGATFIDSGTDITTFPDAVFRQLKSAFI--AQAALPVNKTAD 353
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--S 221
D C+ + + + +P + FH EG LP +NY+ +G C A + TS + +
Sbjct: 354 EDDICFSWDGKKTAAMPKLVFHL-EGADWDLPRENYVTEDRESGQVCVAVS-TSGQMDRT 411
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IGN QQQ T + ++L + P +C
Sbjct: 412 LIGNFQQQNTHIVYDLAAGKLLLVPAQC 439
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 116/270 (42%), Gaps = 25/270 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDF+ E + ++ +GC + G V +A L G G S P Q+ F+YCL
Sbjct: 195 GDFLLENLNFPGKTIHEFLVGCTTSAVGE-VTSAALAGFGRSMFSLPMQMGVKKFAYCLN 253
Query: 61 DRDSDSTST-----LEFDSSLPPNAVTAPLLRNH-ELDTFYYLGLTGISVGGDLLPISET 114
D D T L++ AP L+N + +YYLG+ I +G LL I
Sbjct: 254 SHDYDDTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSK 313
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETY----NALRDAFVRGTRALSPTDGVALFDTCYD 170
G GG+++DSG A + + N L+ + R+L + + CY+
Sbjct: 314 YLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGV-TPCYN 372
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF-----------AFAPTSSS 219
F+ + S+++P + + F G + +P KNY + + CF F P S
Sbjct: 373 FTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPS- 431
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN Q V F+L+N +GF C
Sbjct: 432 -IILGNSQHVDYYVEFDLKNERLGFRQQTC 460
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 100/209 (47%), Gaps = 20/209 (9%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
V + +TL + + GC + G + GLLGLG G +S SQ A FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P + T PLLRN + YY+ LTG+SVG +PI
Sbjct: 194 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 248
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
D + G I+DSGT +TR Y A+RD F + P + FDTC F+
Sbjct: 249 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 304
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
+ + E P V+ HF EG L LP +N LI
Sbjct: 305 ATNEAEAPAVTLHF-EGLNLVLPMENSLI 332
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 91/263 (34%), Positives = 123/263 (46%), Gaps = 24/263 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
D V + +TL + SV + GC G V GLLGLG G LS Q + STFSY
Sbjct: 110 ADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSY 169
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S ++ F SL P PLLRN + YY+ L I VG ++
Sbjct: 170 CL-----PSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVD 224
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I +A + + G ++DSGT TRL Y A+RD F R + FDTCY
Sbjct: 225 IPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYT 284
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNV 226
+ PT++F F G + LP N+LI S T C A A +S L++I ++
Sbjct: 285 V----PIISPTITFMF-AGMNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASM 339
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ R+ F++ NS +G C
Sbjct: 340 QQQNHRILFDIPNSRVGVARESC 362
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/280 (34%), Positives = 131/280 (46%), Gaps = 35/280 (12%)
Query: 5 TETVTLG--SASVDNI--AIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
TE T G +S +N+ A GC G GA+G++GLG G LS PSQ+ + FSY
Sbjct: 178 TEVFTFGHGQSSENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSY 237
Query: 58 CLVDRDSDS--TSTL-----EFDSSLPPNAVTAPLLRNHE---LDTFYYLGLTGISVGGD 107
CL SD+ TSTL S A + P L+N + D+FYYL LTGI+VG
Sbjct: 238 CLTPYFSDAANTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTA 297
Query: 108 LLPISETAFKIDE---SGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGV 162
L + AF + E + GG ++DSG+ T L Y ALRD VR G + P G
Sbjct: 298 KLDVPAAAFDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGA 357
Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKV----LPLPAKNYLIPVDSNGTFCFAFA-- 214
D C + VP + HF G + +P +NY PVD + F+
Sbjct: 358 EGLDLCVGGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSG 417
Query: 215 -PTSS----SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
P S+ +IIGN QQ + ++L ++ F P C
Sbjct: 418 GPNSTLPLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADC 457
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 103/201 (51%), Gaps = 11/201 (5%)
Query: 55 FSYCLVDRDSDSTS-TLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPIS 112
FSYCL S S +L+ + P ++ PLLRN + YY+ LTG+SVG L+PI+
Sbjct: 241 FSYCLPSFKSYYFSGSLKLGPAGQPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIA 300
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+ + G I+DSGT +TR Y A+RD F + + P + FDTC F+
Sbjct: 301 PELLAFNPNTGAGTIIDSGTVITRFVQPIYTAIRDEFRK--QVAGPFSSLGAFDTC--FA 356
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
+ + P V+ HF G L LP +N LI + C A A +S L++I N+QQ
Sbjct: 357 ATNEAVAPAVTLHF-TGLNLVLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQ 415
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q R+ F++ NS +G C
Sbjct: 416 QNLRLLFDVPNSRLGIARELC 436
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 128/269 (47%), Gaps = 32/269 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
G + ++L +D GCG +N+G F G +GL+GLG LS SQ FS
Sbjct: 241 GVLAHDRLSLAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFS 300
Query: 57 YCL-VDRDSDSTSTLEF--DSSLPPNAV----------TAPLLRNHELDTFYYLGLTGIS 103
YCL + R+SD++ +L D S N+ + PLL+ FY + LTGI+
Sbjct: 301 YCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQG----PFYLVNLTGIT 356
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
VGG + T F + IVDSGT +T L YNA+R F+ G +
Sbjct: 357 VGGQ--EVESTGF------SARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFS 408
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSS--SL 220
+ DTC++ + V+VP+++ F G + + + L V S+ + C A A S
Sbjct: 409 ILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDET 468
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SIIGN QQ+ RV F+ S +GF C
Sbjct: 469 SIIGNYQQKNLRVVFDTSASQVGFAQETC 497
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 99/209 (47%), Gaps = 20/209 (9%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
V + +TL + + GC + G + GLLGLG G +S SQ A FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P + T PLLRN + YY+ LTG+SVG +PI
Sbjct: 194 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 248
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
D + G I+DSGT +TR Y A+RD F + P + FDTC F+
Sbjct: 249 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 304
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
+ E P V+ HF EG L LP +N LI
Sbjct: 305 ETNEAEAPAVTLHF-EGLNLVLPMENSLI 332
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 80/255 (31%), Positives = 114/255 (44%), Gaps = 16/255 (6%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINA---STF 55
G ++ + +TL + +V GC H +G F AAG++ LGGG S SQ + + F
Sbjct: 107 GAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAF 166
Query: 56 SYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
SYC+ SDS TL V P++R + TFY + L I+VGG L ++
Sbjct: 167 SYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPA 226
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F G ++DS TA+TRL Y ALR AF DTCYDF+
Sbjct: 227 VFA------AGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAPPKGYLDTCYDFTGV 280
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
++ +P +S F VLPL L N F ++G+VQQQ V
Sbjct: 281 VNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLGSVQQQTIEVL 336
Query: 235 FNLRNSLIGFTPNKC 249
+++ +GF C
Sbjct: 337 YDVGGGAVGFRQGAC 351
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/256 (33%), Positives = 128/256 (50%), Gaps = 19/256 (7%)
Query: 6 ETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD 61
+T+T S+S GCG N G F GLLGLG G LS PSQ S FSYCL
Sbjct: 229 DTLTFNSSSKFTGFTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPS 288
Query: 62 RDSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
++ + L ++ P P TA +++ + +FY++ L I++GG +LP+ + F
Sbjct: 289 YNT-TPGYLNIGATKPTSTVPVQYTA-MIKKPQYPSFYFIELVSINIGGYILPVPPSVFT 346
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
G ++DSGT +T L Y +LRD F + P DTCYDF+ + ++
Sbjct: 347 -----KTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAI 401
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLI-PVDSN---GTFCFAFAPTSSSLSIIGNVQQQGTRV 233
+P VSF+F +G V L +I P D+ G F P + SI+GN QQ+ V
Sbjct: 402 VIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEV 461
Query: 234 SFNLRNSLIGFTPNKC 249
+++ + IGF P C
Sbjct: 462 IYDVPSQKIGFIPISC 477
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 103/207 (49%), Gaps = 22/207 (10%)
Query: 54 TFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
TFSYCL S +L F +L PP T PLL N + YY+ +TGI VG
Sbjct: 254 TFSYCL-----PSFKSLNFSGTLRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGR 308
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
++PI A D + G ++DSGT TRL Y A+RD R R +P + FD
Sbjct: 309 KVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRR--RVGAPVSSLGGFD 366
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSI 222
TC++ ++V P V+ F +G + LP +N +I C A A ++ L++
Sbjct: 367 TCFN---TTAVAWPPVTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 422
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I ++QQQ RV F++ N +GF +C
Sbjct: 423 IASMQQQNHRVLFDVPNGRVGFARERC 449
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 83/258 (32%), Positives = 124/258 (48%), Gaps = 22/258 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + ++T++L S+ +V + GC H G GL+GLGG + S SQ A+ FS
Sbjct: 220 GTYGSDTLSLTSSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFS 279
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVT---APLLRNHELDTFYYLGLTGISVGGDLLPISE 113
YCL S L ++ ++ P++R + TFY + L GI+V G +L +
Sbjct: 280 YCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVR-FSVPTFYGVFLQGITVAGTMLNVPA 338
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
+ F +G +VDSGT +T+L Y ALR AF + +A V DTC+DFS
Sbjct: 339 SVF------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSG 392
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGT 231
+++ VPTV+ F G + L L C AF T+ I+GNVQQ+
Sbjct: 393 FNTITVPTVTLTFSRGAAMDLDISGILY------AGCLAFTATAHDGDTGILGNVQQRTF 446
Query: 232 RVSFNLRNSLIGFTPNKC 249
+ F++ IGF C
Sbjct: 447 EMLFDVGGRTIGFRSGAC 464
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 80/255 (31%), Positives = 114/255 (44%), Gaps = 16/255 (6%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINA---STF 55
G ++ + +TL + +V GC H +G F AAG++ LGGG S SQ + + F
Sbjct: 237 GAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAF 296
Query: 56 SYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
SYC+ SDS TL V P++R + TFY + L I+VGG L ++
Sbjct: 297 SYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPA 356
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
F G ++DS TA+TRL Y ALR AF DTCYDF+
Sbjct: 357 VFA------AGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGV 410
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
++ +P +S F VLPL L N F ++G+VQQQ V
Sbjct: 411 VNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLGSVQQQTIEVL 466
Query: 235 FNLRNSLIGFTPNKC 249
+++ +GF C
Sbjct: 467 YDVGGGAVGFRQGAC 481
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/263 (34%), Positives = 123/263 (46%), Gaps = 24/263 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
D V + +TL + SV + GC G V GLLGLG G LS Q + STFSY
Sbjct: 187 ADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSY 246
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S ++ F SL P PLLRN + YY+ L I VG ++
Sbjct: 247 CL-----PSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVD 301
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I +A + + G ++DSGT TRL Y A+RD F R + FDTCY
Sbjct: 302 IPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYT 361
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNV 226
+ PT++F F G + LP N+LI + T C A A +S L++I ++
Sbjct: 362 V----PIISPTITFMF-AGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASM 416
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ R+ F++ NS +G C
Sbjct: 417 QQQNHRILFDIPNSRVGVARESC 439
>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
Length = 157
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 94/162 (58%), Gaps = 10/162 (6%)
Query: 91 LDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
L T Y L LT I+VGG L ++ +++K+ I+DSGT +TRL Y AL+++FV
Sbjct: 2 LPTLYGLDLTAITVGGKPLGLAASSYKVPT------IIDSGTVITRLPMPVYTALKNSFV 55
Query: 151 R-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF 209
R ++ + G+++ DTC+ + + EVP + F G LPL A N LI +D G
Sbjct: 56 RIMSKKYAQAPGISILDTCFKGNVKEMSEVPEIQMIFGGGADLPLKAHNTLIELD-KGVT 114
Query: 210 CFAFAPTSSS--LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C A A +S + ++IIGN QQQ +V++++ NS IGF C
Sbjct: 115 CLAIAGSSENNPIAIIGNYQQQTFKVAYDVANSKIGFAAGGC 156
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 129/265 (48%), Gaps = 21/265 (7%)
Query: 1 GDFVTETVTLGSASVDNIA-----IGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
G+ +TVT+ S S +A IGCGH+N G F +G++GLG G S +Q+ +T
Sbjct: 172 GNLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPAT 231
Query: 55 ---FSYCLVDRDSDST---STLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVG 105
FSYCL+ + ST + L F S+ + V+ P+ + + TFY L L +SVG
Sbjct: 232 GGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVG 291
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
E A K+ G II+DSGT +T L + N+ A + D
Sbjct: 292 DTKFNFPEGASKL--GGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFL 349
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIG 224
D C+ ++ E+P V+ HF EG +PL +N + + S+ T C AF ++ I G
Sbjct: 350 DYCFA-TTTDDYEMPPVTMHF-EGADVPLQRENLFVRL-SDDTICLAFGSFPDDNIFIYG 406
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N+ Q V ++++N + F P C
Sbjct: 407 NIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/261 (34%), Positives = 125/261 (47%), Gaps = 21/261 (8%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
G ET+TL S S IGCG+ N G F G ++G++GLG G +S PSQ+ S
Sbjct: 160 GYLSVETLTLDSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSI 219
Query: 54 --TFSYCLVDRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
FSYCL +STS L F D+++ A+T P+++ + + YYL L SVG L
Sbjct: 220 GGKFSYCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKK-DAQSGYYLTLEAFSVGNKL 278
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
+ + +E G I++DSGT T L + Y A D F C
Sbjct: 279 IEFGGPTYGGNE---GNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLC 335
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
Y+ + E P ++ HF +G + L + I V S+G C AF P S +I GNV Q
Sbjct: 336 YNVAYHG-FEAPLITAHF-KGADIKLYYISTFIKV-SDGIACLAFIP--SQTAIFGNVAQ 390
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V +NL + + F P C
Sbjct: 391 QNLLVGYNLVQNTVTFKPVDC 411
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/263 (33%), Positives = 118/263 (44%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G T+TV LG ASVD GCG +N GLF G AGL+GLG LS SQ FSY
Sbjct: 266 GVLATDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSY 325
Query: 58 CL---VDRDSDSTSTLEFDSSLPPNAVTAPLLR---NHELDTFYYLGLTGISVGGDLLPI 111
CL D+ + +L D+S NA R + FY++ +TG SV
Sbjct: 326 CLPAATSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASV------- 378
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
A G +++DSGT +TRL Y A+R F R G +L D CY
Sbjct: 379 GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACY 438
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNV 226
+ + V+VP ++ G + + A L +G+ C A A S IIGN
Sbjct: 439 NLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNY 498
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV ++ S +GF C
Sbjct: 499 QQKNKRVVYDTVGSRLGFADEDC 521
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/263 (33%), Positives = 118/263 (44%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
G T+TV LG ASVD GCG +N GLF G AGL+GLG LS SQ FSY
Sbjct: 267 GVLATDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSY 326
Query: 58 CL---VDRDSDSTSTLEFDSSLPPNAVTAPLLR---NHELDTFYYLGLTGISVGGDLLPI 111
CL D+ + +L D+S NA R + FY++ +TG SV
Sbjct: 327 CLPAATSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASV------- 379
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
A G +++DSGT +TRL Y A+R F R G +L D CY
Sbjct: 380 GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACY 439
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNV 226
+ + V+VP ++ G + + A L +G+ C A A S IIGN
Sbjct: 440 NLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNY 499
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ RV ++ S +GF C
Sbjct: 500 QQKNKRVVYDTVGSRLGFADEDC 522
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 128/258 (49%), Gaps = 32/258 (12%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
G++ +T+TL + V GCG NNEG F GA G+LGLG G LS SQ + F
Sbjct: 238 GNYGCDTMTLEHSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVF 297
Query: 56 SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
SYCL + DS +S+L+F S V P E +Y++ L ISV
Sbjct: 298 SYCLPEEDSIGSLLFGEKATSQSSSLKFTS-----LVNGPGTSGLEESGYYFVKLLDISV 352
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
G L I + F + G I+DSGT +TRL Y+AL+ AF + ++G
Sbjct: 353 GNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 407
Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
+ DTCY+ S R V +P + HF EG + L K + D++ C AFA +S L
Sbjct: 408 KGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDAS-RLCLAFA-GNSEL 465
Query: 221 SIIGNVQQQGTRVSFNLR 238
+IIGN QQ V ++++
Sbjct: 466 TIIGNRQQVSLTVLYDIQ 483
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/260 (38%), Positives = 130/260 (50%), Gaps = 26/260 (10%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS----TF 55
G + ++T+TL A +V GCGH GLF G GLL LG +S SQ + + F
Sbjct: 592 GVYGSDTLTLTDADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVF 651
Query: 56 SYCLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISE 113
SYCL S + TL SS A T LL ++ TFY + LTGI VGG L +
Sbjct: 652 SYCLPPSPSSTGFLTLGGPSSASGFATTG-LLTAWDVPTFYMVMLTGIGVGGQQLSGVPA 710
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
+AF GG +VD+GT +TRL Y ALR AF +P G+ DTCY
Sbjct: 711 SAFA------GGTVVDTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGI--LDTCY 762
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
+F+ +V +PTVS F G L L A +L S+G FA +I+GNVQQ+
Sbjct: 763 NFTDYGTVTLPTVSLTFSGGATLKLDAPGFL----SSGCLAFATNSGDGDPAILGNVQQR 818
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V F+ S +GF P+ C
Sbjct: 819 SFAVRFD--GSSVGFMPHSC 836
>gi|413950927|gb|AFW83576.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 316
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 128/266 (48%), Gaps = 33/266 (12%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + + +GC + G F+ + G+L LG ++SF S+ A FSYCLVD ++
Sbjct: 53 AKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNA 112
Query: 67 TSTLEFD-----------------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
TS L F S+ P A PLL +H + FY + + G+SV G+LL
Sbjct: 113 TSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELL 172
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
I + + + GG I+DSGT++T L + Y A+ A + L P + FD CY
Sbjct: 173 RIPRLVWDVQK--GGGAILDSGTSLTVLVSPAYRAVVAALGKKLVGL-PRVAMDPFDYCY 229
Query: 170 DFSS-----RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
+++S +V VP ++ HF L P K+Y+I + G C +S+I
Sbjct: 230 NWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA-APGVKCIGLQEGDWPGVSVI 288
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ QQ F+L+N + F ++C
Sbjct: 289 GNILQQEHLWEFDLKNRRLRFKRSRC 314
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 120/282 (42%), Gaps = 37/282 (13%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G TE +T S +V ++ GC + G GA+G++GLG G LS PSQ+ + FSY
Sbjct: 160 GTLATENLTFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRFSY 218
Query: 58 CLVDRDSDSTSTLEF------------DSSLPPNAVTAPLLRNHELD---TFYYLGLTGI 102
CL D+ SS P T P +R+ D TFYYL LTGI
Sbjct: 219 CLTPYFEDTIEPSHMVVGASAGLINGSASSTP--VTTVPFVRSPSDDPFSTFYYLPLTGI 276
Query: 103 SVGGDLLPISETAFKIDESGNG---GIIVDSGTAVTRLQTETYNALRDAFVR--GTRALS 157
+ G L + AF + + G G +DSG +T L Y ALR R G +
Sbjct: 277 TAGKVKLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQ 336
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHF----PEGKVLPLPAKNYLIPVDSNGTFCFAF 213
P G FD C + VP + HF G L +P NY PVDS F
Sbjct: 337 PLAGTTGFDLCVALKDAERL-VPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVF 395
Query: 214 APTS------SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ + ++IGN QQ V ++L ++ F P C
Sbjct: 396 SSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADC 437
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/263 (34%), Positives = 122/263 (46%), Gaps = 24/263 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ V +T+TL + V + GC G GLLGLG G LS SQ + STFSY
Sbjct: 161 ANLVQDTITLATDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 220
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S +L F SL P PLL+N + YY+ L I VG ++
Sbjct: 221 CL-----PSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVD 275
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I A + + G I DSGT TRL Y A+RD F R + FDTCY+
Sbjct: 276 IPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYN 335
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
+ VPT++F F G + LP N LI + T C A A +S L++I N+
Sbjct: 336 V----PIVVPTITFIF-TGMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 390
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQQ RV +++ NS +G C
Sbjct: 391 QQQNHRVLYDVPNSRVGVARELC 413
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 115/262 (43%), Gaps = 28/262 (10%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFV---GAAGLLGLGGGSLSFPSQINAS--- 53
G ++++ +T+ A +V + GC H +G F AAG++ LGGG S SQ A+
Sbjct: 223 GTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGR 282
Query: 54 TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPIS 112
FS+C TL V P+L+N + TFY + L I+V G + +
Sbjct: 283 VFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 342
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
T F G +DS TA+TRL Y ALR AF P DTCYD +
Sbjct: 343 PTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMA 396
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAF--APTSSSLSIIGNVQ 227
S +P ++ F KN + +D +G C AF P IIGN+Q
Sbjct: 397 GVRSFALPRITLVF---------DKNAAVELDPSGVLFQGCLAFTAGPNDQVPGIIGNIQ 447
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q V +N+ +L+GF C
Sbjct: 448 LQTLEVLYNIPAALVGFRHAAC 469
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 134/263 (50%), Gaps = 22/263 (8%)
Query: 1 GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
G+ +T+TLGS+ + NI IGCGHNN G F +G++GLGGG +S Q+ S
Sbjct: 180 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSI 239
Query: 54 --TFSYCLVDRDS--DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV S D TS + F ++ + V+ PL+ +TFYYL L ISVG
Sbjct: 240 DGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGS 299
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+ + + ES G II+DSGT +T L TE Y+ L DA A D +
Sbjct: 300 KQI---QYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLS 356
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
CY S+ ++VP ++ HF +G + L + N + V S CFAF S S SI GNV
Sbjct: 357 LCY--SATGDLKVPVITMHF-DGADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNV 411
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q V ++ + + F P C
Sbjct: 412 AQMNFLVGYDTVSKTVSFKPTDC 434
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 115/262 (43%), Gaps = 28/262 (10%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFV---GAAGLLGLGGGSLSFPSQINAS--- 53
G ++++ +T+ A +V + GC H +G F AAG++ LGGG S SQ A+
Sbjct: 248 GTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGR 307
Query: 54 TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPIS 112
FS+C TL V P+L+N + TFY + L I+V G + +
Sbjct: 308 VFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 367
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
T F G +DS TA+TRL Y ALR AF P DTCYD +
Sbjct: 368 PTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMA 421
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAF--APTSSSLSIIGNVQ 227
S +P ++ F KN + +D +G C AF P IIGN+Q
Sbjct: 422 GVRSFALPRITLVF---------DKNAAVELDPSGVLFQGCLAFTAGPNDQVPGIIGNIQ 472
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q V +N+ +L+GF C
Sbjct: 473 LQTLEVLYNIPAALVGFRHAAC 494
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 134/263 (50%), Gaps = 22/263 (8%)
Query: 1 GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
G+ +T+TLGS+ + NI IGCGHNN G F +G++GLGGG +S Q+ S
Sbjct: 180 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSI 239
Query: 54 --TFSYCLVDRDS--DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV S D TS + F ++ + V+ PL+ +TFYYL L ISVG
Sbjct: 240 DGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGS 299
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+ + + ES G II+DSGT +T L TE Y+ L DA A D +
Sbjct: 300 KQI---QYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLS 356
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
CY S+ ++VP ++ HF +G + L + N + V S CFAF S S SI GNV
Sbjct: 357 LCY--SATGDLKVPVITMHF-DGADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNV 411
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q V ++ + + F P C
Sbjct: 412 AQMNFLVGYDTVSKTVSFKPTDC 434
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 85/267 (31%), Positives = 128/267 (47%), Gaps = 26/267 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
GDF +T+T+GS S AIGCGH+N G F +G++GLG G S Q+ ++
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233
Query: 55 ---FSYCL--VDRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL + D ++ L F S+ + AV+ P+ + + +FY L L +SVG
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGR 293
Query: 107 DLLPISETAFKIDES---GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
+ T + S G II+DSGT +T L + Y+ A D
Sbjct: 294 N-----NTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSI 222
+ C++ ++ +VP ++ HF EG L L +N LI V N C AFA + +SI
Sbjct: 349 FLEYCFE-TTTDDYKVPFIAMHF-EGANLRLQRENVLIRVSDN-VICLAFAGAQDNDISI 405
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ Q V +++ N + F P C
Sbjct: 406 YGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/269 (33%), Positives = 129/269 (47%), Gaps = 26/269 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G TE T S + + GC +G GA+GL+GLG G LS SQ A+ FSY
Sbjct: 176 GSLGTEAFTFQSGAA-KLGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSY 234
Query: 58 CLVD--RDSDSTSTLEFDSSLPPN----AVTA-PLLRNHE---LDTFYYLGLTGISVGGD 107
CL R+ ++S L +S + AVT+ P +++ E TFYYL L GISVG
Sbjct: 235 CLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGET 294
Query: 108 LLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRDAFVRG-TRALSPTDGV 162
LPI AF++ +GG+I+D+G+ VT L Y+AL D R R+L
Sbjct: 295 KLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPAD 354
Query: 163 ALFDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
D C +R V+ VP + FHF G + + A +Y PVD + T C
Sbjct: 355 TGLDLCV---ARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKS-TACMLIEEGGYE- 409
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++IGN QQQ + +++ + F C
Sbjct: 410 TVIGNFQQQDVHLLYDIGKGELSFQTADC 438
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 129/261 (49%), Gaps = 23/261 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----F 55
GD E +T+GS+SV ++ IGCGH + G F A+G++GLGGG LS SQ++ ++ F
Sbjct: 168 GDLGFEKITIGSSSVKSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 226
Query: 56 SYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
SYCL S + + F + P V+ PL+ + + T+YY+ L IS+G +
Sbjct: 227 SYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTV-TYYYITLEAISIGNE----R 281
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-- 170
AF + G +I+DSGT ++ L E Y+ + + ++ +A D +D C+D
Sbjct: 282 HMAF----AKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 337
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQ 228
+ +S +P ++ F G + L N V +N C P S + IIGN+
Sbjct: 338 INVATSSGIPIITAQFSGGANVNLLPVNTFQKV-ANNVNCLTLTPASPTDEFGIIGNLAL 396
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ ++L + F P C
Sbjct: 397 ANFLIGYDLEAKRLSFKPTVC 417
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 125/268 (46%), Gaps = 24/268 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G TE S + + +A GC +G GA+GL+GLG G LS SQ A+ FSY
Sbjct: 181 GTLGTEAFAFQSGTAE-LAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATKFSY 239
Query: 58 CLVD--RDSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
CL ++ +T L +S + +T ++ + FYYL L G++VG LPI
Sbjct: 240 CLTPYFHNNGATGHLFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPI 299
Query: 112 SETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRD---AFVRGTRALSPTDGVAL 164
T F + E +GG+I+DSG+ T L + Y+AL A + G+ P D
Sbjct: 300 PATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDA--- 356
Query: 165 FDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLS 221
D +R V VP V FHF G + +PA++Y PVD + S
Sbjct: 357 -DDGALCVARRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQS 415
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IGN QQQ RV ++L N F P C
Sbjct: 416 VIGNYQQQNMRVLYDLANGDFSFQPADC 443
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 92/178 (51%), Gaps = 10/178 (5%)
Query: 77 PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTR 136
P T PLL N + YY+ + GI VG ++ + ++A + G I+D+GT TR
Sbjct: 249 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 308
Query: 137 LQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPA 196
L Y A+RDAF RG + FDTCY+ +V VPTV+F F + LP
Sbjct: 309 LAAPVYAAVRDAF-RGRVRTPVAPPLGGFDTCYNV----TVSVPTVTFMFAGAVAVTLPE 363
Query: 197 KNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+N +I S G C A A +++L+++ ++QQQ RV F++ N +GF+ C
Sbjct: 364 ENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 421
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 132/268 (49%), Gaps = 36/268 (13%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
G+F +T++LG+ S + A+GCG N G F G GL+GLG G +S SQ++A
Sbjct: 140 GEFARDTISLGTTSDGSQKFPSFAVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAID 198
Query: 53 STFSYCLVDRDSDSTST-LEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISV 104
S FSYCLVD +S S S+ L F S + +T P + T+Y L + GI+V
Sbjct: 199 SKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPP---SDTYPTYYLLTVNGIAV 255
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
G + G I+DSGT +T + + Y + + + L DG ++
Sbjct: 256 AGQTM-----------GSPGTTIIDSGTTLTYVPSGVYGRVL-SRMESMVTLPRVDGSSM 303
Query: 165 -FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSS-SLS 221
D CYD SS + + P ++ + P P+ NY + VD +G T C A S +S
Sbjct: 304 GLDLCYDRSSNRNYKFPALTIRLAGATMTP-PSSNYFLVVDDSGDTVCLAMGSASGLPVS 362
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGNV QQG + ++ +S + F KC
Sbjct: 363 IIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/267 (31%), Positives = 128/267 (47%), Gaps = 26/267 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
GDF +T+T+GS S AIGCGH+N G F +G++GLG G S Q+ ++
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233
Query: 55 ---FSYCL--VDRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL + D ++ L F S+ + AV+ P+ + + +FY L L +SVG
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGR 293
Query: 107 DLLPISETAFKIDES---GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
+ T + S G II+DSGT +T L + Y+ A D
Sbjct: 294 N-----NTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSI 222
+ C++ ++ +VP ++ HF EG L L +N LI V N C AFA + +SI
Sbjct: 349 FLEYCFE-TTTDDYKVPFIAMHF-EGANLRLQRENVLIRVSDN-VICLAFAGAQDNDISI 405
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ Q V +++ N + F P C
Sbjct: 406 YGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 98/255 (38%), Positives = 129/255 (50%), Gaps = 13/255 (5%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
G F E ++L + V ++ GCG NN+GLF GAAGLLGLG LS SQ FS
Sbjct: 246 GFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFS 305
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S ST L F S +A PL +FY L LTGISVGG L IS + F
Sbjct: 306 YCL-PSSSSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVF 364
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
G I+DSGT +TRL Y+AL F + +++ DTC+DFS+ +
Sbjct: 365 S-----TAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDT 419
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
+ VP + F G V+ + K + V+ C AFA S S ++I GNVQQ+ V
Sbjct: 420 ISVPKIGLFFSGGVVVDID-KTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVV 478
Query: 235 FNLRNSLIGFTPNKC 249
++ +GF P C
Sbjct: 479 YDGAAGRVGFAPAGC 493
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 128/266 (48%), Gaps = 33/266 (12%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + + +GC + G F+ + G+L LG ++SF S+ A FSYCLVD ++
Sbjct: 198 AKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNA 257
Query: 67 TSTLEFD-----------------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
TS L F S+ P A PLL +H + FY + + G+SV G+LL
Sbjct: 258 TSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELL 317
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
I + + + GG I+DSGT++T L + Y A+ A + L P + FD CY
Sbjct: 318 RIPRLVWDVQK--GGGAILDSGTSLTVLVSPAYRAVVAALGKKLVGL-PRVAMDPFDYCY 374
Query: 170 DFSS-----RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
+++S +V VP ++ HF L P K+Y+I + G C +S+I
Sbjct: 375 NWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA-APGVKCIGLQEGDWPGVSVI 433
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ QQ F+L+N + F ++C
Sbjct: 434 GNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/267 (34%), Positives = 127/267 (47%), Gaps = 29/267 (10%)
Query: 1 GDFVTETVTLGSASVDNIAI-----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
G ET+TL S + + +A GCGHNN G GL+GLG G LS SQI +S
Sbjct: 149 GVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDREMGLIGLGRGPLSLISQIGSSLG 208
Query: 54 ----TFSYCLVDRDSDS--TSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
FS CLV ++D TS + F L V+ PL+ T Y+ L GISV
Sbjct: 209 AGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKD--GTGYFATLLGISV 266
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGV 162
LP S + + G I++DSGT +T L E Y+ L + VR AL P DG
Sbjct: 267 EDINLPFSNGS-SLGTITKGNILIDSGTTITYLPEEFYHRLIEQ-VRNKVALEPFRIDG- 323
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
++ CY + +++ PT++ HF G VL PA+ ++ D N FCFA T+
Sbjct: 324 --YELCYQ--TPTNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDN--FCFAVFDTNEEYVT 377
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN Q + F+L ++ F C
Sbjct: 378 YGNYAQSNYLIGFDLERQVVSFKATDC 404
>gi|326526699|dbj|BAK00738.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 182
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 93/166 (56%), Gaps = 8/166 (4%)
Query: 84 PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
P++ + D+ Y++ L+G++V G L +S + E + I+DSGT +TRL T Y+
Sbjct: 24 PMVSSTLDDSLYFIKLSGMTVAGKPLAVSSS-----EYSSLPTIIDSGTVITRLPTTVYD 78
Query: 144 ALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV 203
AL A + D ++ DTC+ SS+ VP VS F G L L A+N L+ V
Sbjct: 79 ALSKAVAGAMKGTKRADAYSILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLLVDV 137
Query: 204 DSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
DS+ T C AFAP S+ +IIGN QQQ V ++++++ IGF C
Sbjct: 138 DSSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSNRIGFAAGGC 181
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 109/206 (52%), Gaps = 14/206 (6%)
Query: 5 TETVTLGSASV-DNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR 62
TET T G V +N++ G +G F G AGL+GLG G LS SQ+ A F+YCL
Sbjct: 187 TETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLA-A 245
Query: 63 DSDSTSTLEFDS-----SLPPNAVTAPLLRNH--ELDTFYYLGLTGISVGGDLLPISETA 115
D + ST+ F S + + + PL+ N + DT YY+ L GISVGG LPI +
Sbjct: 246 DPNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGT 305
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
F I+ G+GG+ DSG T L+ Y +R A + L G DTC+ +++
Sbjct: 306 FAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAG---DDTCFVAANQQ 362
Query: 176 SV-EVPTVSFHFPEGKVLPLPAKNYL 200
+V ++P + HF +G + L +NYL
Sbjct: 363 AVAQMPPLVLHFDDGADMSLNGRNYL 388
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 78/249 (31%), Positives = 119/249 (47%), Gaps = 14/249 (5%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD 73
SV +A GCG +N GL + G +GLG GSLS +Q+ FSYCL D + S +
Sbjct: 209 SVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLF 268
Query: 74 SSLPPNAV----------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
SL A + PL++ + YY+ L GIS+G LPI F + + G+
Sbjct: 269 GSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGS 328
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS--RSSVEVPT 181
GG+IVDSGT T L + + + V G + +L C+ ++ + ++P
Sbjct: 329 GGMIVDSGTIFTVLVESAFRVVVN-HVAGVLNQPVVNASSLDSPCFPATAGEQQLPDMPD 387
Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIGNVQQQGTRVSFNLRNS 240
+ HF G + L NY+ + +FC A S+ SI+GN QQQ ++ F++
Sbjct: 388 MLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDITVG 447
Query: 241 LIGFTPNKC 249
+ F P C
Sbjct: 448 QLSFVPTDC 456
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 84/259 (32%), Positives = 122/259 (47%), Gaps = 19/259 (7%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINASTFSYCLV 60
D V ET G+ +V ++ GCGH+N G F G +G+LGL G S S++ S FSYC+
Sbjct: 153 DIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GSRFSYCIG 211
Query: 61 DR-DSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
D D T + L + + P H + FYY+ L GISVG L I+ F+
Sbjct: 212 DLFDPHYTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQR 268
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDFSS 173
ESG GG+++DSGT T L + ++ L + R R ++ T CY
Sbjct: 269 TESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ---VIYRTIPGWLCYKGRV 325
Query: 174 RSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQG 230
+ P ++FHF EG L L A N L + FC A ++ S+IG + QQ
Sbjct: 326 NEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQH 384
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V+++L + F C
Sbjct: 385 YNVAYDLIGKRVYFQRTDC 403
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 122/281 (43%), Gaps = 35/281 (12%)
Query: 1 GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ----I 50
G +TET+ L G+ ++ + A+GC + +G+ G G G+LS PSQ I
Sbjct: 89 GLLLTETLNLPLENGEGARAITHFAVGCSIVSS---QQPSGIAGFGRGALSMPSQLGEHI 145
Query: 51 NASTFSYCL----VDRDSDSTSTLEFDSSLPPNAVT--APLLRNH------ELDTFYYLG 98
F+YCL D ++ + + D +LP N P L N + +YY+G
Sbjct: 146 GKDRFAYCLQSHRFDEENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIG 205
Query: 99 LTGISVGGDLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRA 155
L G+S+GG L + + D GNGG I+DSGT T E + + F G R
Sbjct: 206 LRGVSIGGKRLKQLPSKLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRR 265
Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
+ CYD + ++ +P +FHF G + LP NY S + C
Sbjct: 266 AGEVEDKTGMGLCYDVTGLENIVLPEFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMIS 325
Query: 216 TSSSLS-------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ L I+GN QQQ + ++ + +GFT C
Sbjct: 326 SRGLLEVDSGPAVILGNDQQQDFYLLYDREKNRLGFTQQTC 366
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 90/267 (33%), Positives = 132/267 (49%), Gaps = 24/267 (8%)
Query: 1 GDFVTETVTLGSASVDNIAI-----GCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINAS- 53
G ET+TL S + +A+ GCGHNN G+F G++GLG G LS SQI +S
Sbjct: 148 GVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSF 207
Query: 54 ---TFSYCLVDRDSDS--TSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
FS CLV ++ TS + F L V+ PL+ + FY++ L GISV
Sbjct: 208 GGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVE 267
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS--PTDGVA 163
LP ++ + ++ G +++DSGT T L + Y+ L + VR AL P D
Sbjct: 268 DINLPFNDGS-SLEPITKGNMVIDSGTPTTLLPEDFYHRLVEE-VRNKVALDPIPIDPTL 325
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSI 222
+ CY + ++++ T++ HF VL P + + IPV +G FCFAF T S+ I
Sbjct: 326 GYQLCY--RTPTNLKGTTLTAHFEGADVLLTPTQIF-IPVQ-DGIFCFAFTSTFSNEYGI 381
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN Q + F+L L+ F C
Sbjct: 382 YGNHAQSNYLIGFDLEKQLVSFKATDC 408
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 84/259 (32%), Positives = 122/259 (47%), Gaps = 19/259 (7%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINASTFSYCLV 60
D V ET G+ +V ++ GCGH+N G F G +G+LGL G S S++ S FSYC+
Sbjct: 153 DIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GSRFSYCIG 211
Query: 61 DR-DSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
D D T + L + + P H + FYY+ L GISVG L I+ F+
Sbjct: 212 DLFDPHYTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQR 268
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDFSS 173
ESG GG+++DSGT T L + ++ L + R R ++ T CY
Sbjct: 269 TESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ---VIYRTIPGWLCYKGRV 325
Query: 174 RSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQG 230
+ P ++FHF EG L L A N L + FC A ++ S+IG + QQ
Sbjct: 326 NEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQH 384
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V+++L + F C
Sbjct: 385 YNVAYDLIGKRVYFQRTDC 403
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 92/251 (36%), Positives = 124/251 (49%), Gaps = 34/251 (13%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS---T 67
+V GCGH GLF G GLLGLG S Q + FSYCL + S + T
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLT 280
Query: 68 STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
+ S P T LL + T+Y + LTGISVGG L + +AF
Sbjct: 281 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------ 334
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCYDFSSRSSVEVPTVS 183
VD+GT VTRL Y ALR AF G + +P++G+ DTCY+F+ +V +P V+
Sbjct: 335 VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCYNFAGYGTVTLPNVA 392
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNVQQQGTRVSFNLR-- 238
F G + L A L +F C AFAP+ S ++I+GNVQQ+ SF +R
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQR----SFEVRID 441
Query: 239 NSLIGFTPNKC 249
+ +GF P+ C
Sbjct: 442 GTSVGFKPSSC 452
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 86/274 (31%), Positives = 131/274 (47%), Gaps = 40/274 (14%)
Query: 12 SASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
A + + +GC + G F+ + G+L LG ++SF S+ A FSYCLVD +
Sbjct: 248 QAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRN 307
Query: 66 STSTLEFD-----SSLPPNAVTA-------------------PLLRNHELDTFYYLGLTG 101
+TS L F SS PP+ PLL +H + FY + + G
Sbjct: 308 ATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNG 367
Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
ISV G+LL I + D + GG I+DSGT++T L + Y A+ A + L P
Sbjct: 368 ISVDGELLRIPRLVW--DVAKGGGAILDSGTSLTVLVSPAYRAVVAALNKKLAGL-PRVT 424
Query: 162 VALFDTCYDFSSRS-----SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP- 215
+ FD CY+++S S +V +P ++ HF L PAK+Y+I + G C
Sbjct: 425 MDPFDYCYNWTSPSTGEDLTVAMPELAVHFAGSARLQPPAKSYVIDA-APGVKCIGLQEG 483
Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S+IGN+ QQ F+L+N + F ++C
Sbjct: 484 EWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 517
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 92/268 (34%), Positives = 123/268 (45%), Gaps = 36/268 (13%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
G ++T+T+T+ S + N GC H G F A+G + LGGG S SQ + F
Sbjct: 241 GTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAF 300
Query: 56 SYCLVDRDSDSTSTLEFDS-SLPPNA---------VTAPLLRNHEL--DTFYYLGLTGIS 103
SYC+ S F S P N T PL+R+ + T Y + L GI
Sbjct: 301 SYCV-----PGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIE 355
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
V G L + F +GG ++DS +T+L Y ALR AF RA
Sbjct: 356 VAGRRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTG 409
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLS 221
DTC+DF S V VPTVS F G V+ L + L+ DS C AFAP ++ +L
Sbjct: 410 NLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLL--DS----CLAFAPMAADFALG 463
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGNVQQQ V +++ +GF C
Sbjct: 464 FIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 84/259 (32%), Positives = 122/259 (47%), Gaps = 19/259 (7%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINASTFSYCLV 60
D V ET G+ +V ++ GCGH+N G F G +G+LGL G S S++ S FSYC+
Sbjct: 185 DIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GSRFSYCIG 243
Query: 61 DR-DSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
D D T + L + + P H + FYY+ L GISVG L I+ F+
Sbjct: 244 DLFDPHYTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQR 300
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDFSS 173
ESG GG+++DSGT T L + ++ L + R R ++ T CY
Sbjct: 301 TESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ---VIYRTIPGWLCYKGRV 357
Query: 174 RSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQG 230
+ P ++FHF EG L L A N L + FC A ++ S+IG + QQ
Sbjct: 358 NEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQH 416
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V+++L + F C
Sbjct: 417 YNVAYDLIGKRVYFQRTDC 435
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 91/268 (33%), Positives = 129/268 (48%), Gaps = 28/268 (10%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQIN--- 51
G ETVTL S S+ I GCGHNN G F GL+GLGGG S SQI
Sbjct: 152 GVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLF 211
Query: 52 -ASTFSYCLVDRDSDST--STLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
FS CLV +D T S + F L VT PL++ + T YY+ L GISV
Sbjct: 212 GGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVE 271
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVAL 164
LP++ T K G ++VDSGT L + Y+ + V+ L P TD +L
Sbjct: 272 DTYLPMNSTIEK------GNMLVDSGTPPNILPQQLYDRVY-VEVKNKVPLEPITDDPSL 324
Query: 165 -FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV-DSNGTFCFAFAPTSSS-LS 221
CY ++++++ PT+++HF +L P + ++ P ++ G FC A ++S
Sbjct: 325 GPQLCY--RTQTNLKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPG 382
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I GN Q + F+L ++ F P C
Sbjct: 383 IYGNFAQTNYLIGFDLDRQIVSFKPTDC 410
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 82/252 (32%), Positives = 121/252 (48%), Gaps = 17/252 (6%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF- 72
SV IA GCG +N GL + G +GLG GSLS +Q+ FSYCL D + S S+ F
Sbjct: 177 SVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSSPVFF 236
Query: 73 ----------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-DES 121
S+ + PL+++ + YY+ L GIS+G LPI F + D+
Sbjct: 237 GSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDD 296
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
G+GG+IVDSGT T L + + D V G + +L C+ + E+P
Sbjct: 297 GSGGMIVDSGTIFTILVETGFRVVVD-HVAGVLGQPVVNASSLDRPCFPAPAAGVQELPD 355
Query: 182 VS---FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVSFNL 237
+ HF G + L NY+ + +FC T S+S S++GN QQQ ++ F++
Sbjct: 356 MPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDI 415
Query: 238 RNSLIGFTPNKC 249
+ F P C
Sbjct: 416 TVGQLSFMPTDC 427
>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
oleracea]
Length = 165
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 86/159 (54%), Gaps = 8/159 (5%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
+FY L + GISVGG L I +T F G ++DSGT ++RL + Y ALR AF
Sbjct: 12 SFYGLDIVGISVGGQKLAIPQTVFSTP-----GALIDSGTVISRLPPKAYAALRGAFKAK 66
Query: 153 TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
T V++ DTC+D + +V +PTVSF+F G V+ L +K L + C A
Sbjct: 67 MSQYKNTSAVSILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLYAFKMS-QVCLA 125
Query: 213 FAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
FA S ++ +I GNVQQQ V ++ +GF PN C
Sbjct: 126 FAGNSDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGC 164
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 125/264 (47%), Gaps = 24/264 (9%)
Query: 1 GDFVTETVTLGSASVD--NIAIGCGHNNEGLFV---GAAGLLGLGGGSLSFPSQIN---A 52
G TE++ GS +V GCG NN+ + G++GLG G LS SQ+
Sbjct: 179 GVLCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIG 238
Query: 53 STFSYCLVDRDSDSTSTLEF--DSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
FSYCL+ S ST L+F D+++ N V+ PL+ + ++Y+L L GI++G +L
Sbjct: 239 HKFSYCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKML 298
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDT 167
+ T + NG II+D GT +T L+ Y+ +R +S T D FD
Sbjct: 299 QVRTT-----DHTNGNIIIDLGTVLTYLEVNFYHNFV-TLLREALGISETKDDIPYPFDF 352
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGN 225
C F +++++ P + F F KV L KN D C A P + S+ GN
Sbjct: 353 C--FPNQANITFPKIVFQFTGAKVF-LSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGN 409
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+ Q +V ++ + + F P C
Sbjct: 410 LAQVDFQVEYDRKGKKVSFAPADC 433
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 135/269 (50%), Gaps = 32/269 (11%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNN----EGLFVGAAGLLGLGGGSLSFPSQIN 51
GD +T+TL S S I IGCGH N EGL A+G++G G G+ S SQ+
Sbjct: 180 GDISKDTLTLNSNDGSPISFPKIVIGCGHKNSLTTEGL---ASGIIGFGRGNFSIVSQLG 236
Query: 52 AS---TFSYCLVDRDSDS--TSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGIS 103
+S FSYCL S + +S L F D ++ V+ PL+++ + Y+ L S
Sbjct: 237 SSIGGKFSYCLASLFSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGN-YFTNLEAFS 295
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTD 160
VG ++ + +++ D GN ++DSG+ +T+L + Y+ L A V+ R PT
Sbjct: 296 VGDHIIKLKDSSLIPDNEGNA--VIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQ 353
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
++L CY ++ EVP ++ HF G + L A N I ++ + CFAF ++
Sbjct: 354 QLSL---CYK-TTLKKYEVPIITAHF-RGADVKLNAFNTFIQMN-HEVMCFAFNSSAFPW 407
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ GN+ QQ V ++ ++I F P C
Sbjct: 408 VVYGNIAQQNFLVGYDTLKNIISFKPTNC 436
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 92/269 (34%), Positives = 132/269 (49%), Gaps = 34/269 (12%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
G++ +T+TL + V GCG NN+G F G G+LGLG G LS SQ + F
Sbjct: 204 GNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVF 263
Query: 56 SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
SYCL + DS +S+L+F S V P + +Y++ L+ ISV
Sbjct: 264 SYCLPEEDSIGSLLFGEKATSQSSSLKFTS-----LVNGP--GTLQESGYYFVNLSDISV 316
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
G + L I + F + G I+DS T +TRL Y+AL+ AF + ++G
Sbjct: 317 GNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 371
Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
+ DTCY+ S R V +P + HF G + L N + D++ C AFA T S L
Sbjct: 372 KGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDAS-RLCLAFAGT-SEL 429
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IIGN QQ V ++++ IGF N C
Sbjct: 430 TIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 92/251 (36%), Positives = 124/251 (49%), Gaps = 34/251 (13%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS---T 67
+V GCGH GLF G GLLGLG S Q + FSYCL + S + T
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLT 280
Query: 68 STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
+ S P T LL + T+Y + LTGISVGG L + +AF
Sbjct: 281 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------ 334
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCYDFSSRSSVEVPTVS 183
VD+GT VTRL Y ALR AF G + +P++G+ DTCY+F+ +V +P V+
Sbjct: 335 VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCYNFAGYGTVTLPNVA 392
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNVQQQGTRVSFNLR-- 238
F G + L A L +F C AFAP+ S ++I+GNVQQ+ SF +R
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQR----SFEVRID 441
Query: 239 NSLIGFTPNKC 249
+ +GF P+ C
Sbjct: 442 GTSVGFKPSSC 452
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 123/278 (44%), Gaps = 51/278 (18%)
Query: 1 GDFVTETVTLGSA----SVDNIAIGCGHN--NEGLFVGA-AGLLGLGGGSLSFPSQINAS 53
G ++++ +TL A ++ GC H G F +G++ LG G+ S P+Q A+
Sbjct: 236 GTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKAT 295
Query: 54 ---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT----APLLRNHELDTFYYLGLTGISVGG 106
FSYCL S F +P A + P+LR+ Y + L I V G
Sbjct: 296 YGDVFSYCLPPTPVHSGF---FILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAG 352
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVA 163
LP+ F G ++DS T VTRL Y ALR AFV R RA +P +
Sbjct: 353 KRLPVPPAVFA------AGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEH-- 404
Query: 164 LFDTCYDFS-----SRSSVEVPTVSFHF--PEGKVLPLPAKNYLIPVDSNGTF---CFAF 213
DTCYDFS V++P ++ F P G V +D +G C AF
Sbjct: 405 -LDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVE----------LDPSGVLLDGCLAF 453
Query: 214 APTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
AP + IIGNVQQQ V +N+ + +GF C
Sbjct: 454 APNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/267 (31%), Positives = 127/267 (47%), Gaps = 23/267 (8%)
Query: 1 GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFP---SQINA 52
G F ETVT+G + ++ IGC + G++GLG S ++I
Sbjct: 218 GVFANETVTVGLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFG 277
Query: 53 STFSYCLVDRDSDSTST--LEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
+ FSYCLVD S S L F + LP T LL ++ FY + ++GISVGG
Sbjct: 278 NKFSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLG--YINAFYPVNVSGISVGG 335
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVA 163
+L IS + + +G GG+IVDSGT++T L E Y+ + DA + + P +
Sbjct: 336 SMLSISSDIWNV--TGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPE 393
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSI 222
L + C++ VP + HF +G + P K+Y+I V + G C SI
Sbjct: 394 LNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDV-AEGIKCLGIIKADFPGSSI 452
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+GNV QQ ++L +GF P+ C
Sbjct: 453 LGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 108/216 (50%), Gaps = 18/216 (8%)
Query: 44 LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
LS + +TFSYCL S + + TL + P + T PLL N + YY+ +TG
Sbjct: 239 LSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLLANPHRSSLYYVNMTG 298
Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALS 157
I VG ++ I +A D + G ++DSGT TRL Y ALRD R G A+S
Sbjct: 299 IRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVS 358
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
G FDTCY+ ++V P V+ F +G + LP +N +I T C A A
Sbjct: 359 SLGG---FDTCYN----TTVAWPPVTLLF-DGMQVTLPEENVVIHTTYGTTSCLAMAAAP 410
Query: 217 ---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++ L++I ++QQQ RV F++ N +GF C
Sbjct: 411 DGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/267 (35%), Positives = 126/267 (47%), Gaps = 36/267 (13%)
Query: 1 GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
G + TET+ LGS A V + GCG + G + GLLGLGG S SQ + FS
Sbjct: 224 GVYSTETLALGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFS 283
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNA--------VTAPLLR-NHELDTFYYLGLTGISVGGD 107
YCL +S + F + PN+ V P+ + ++ TFY + LTGISVGG
Sbjct: 284 YCLPPLNSGA----GFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGK 339
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA----LSPTDGVA 163
L I F GN IVDSGT +T + T Y ALR AF R A L P D +
Sbjct: 340 ALDIPPAVF---AKGN---IVDSGTVITGIPTTAYKALRTAF-RSAMAEYPLLPPAD--S 390
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSI 222
DTCY+F+ +V VP V+ F G + L + ++ D C AFA S I
Sbjct: 391 ALDTCYNFTGHGTVTVPKVALTFVGGATVDLDVPSGVLVED-----CLAFADAGDGSFGI 445
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGNV + V ++ +GF C
Sbjct: 446 IGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/268 (32%), Positives = 131/268 (48%), Gaps = 36/268 (13%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
G+F +T++LG+ S + A+GCG N G F G GL+GLG G +S SQ++A
Sbjct: 140 GEFARDTISLGTTSGGSQKFPSFAVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAID 198
Query: 53 STFSYCLVDRDSDSTST-LEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISV 104
S FSYCLVD +S S S+ L F S + +T P + T+Y L + GI+V
Sbjct: 199 SKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPP---SDTYPTYYLLTVNGIAV 255
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
G + G I+DSGT +T + + Y + + + L DG ++
Sbjct: 256 AGQTM-----------GSPGTTIIDSGTTLTYVPSGVYGRVL-SRMESMVTLPRVDGSSM 303
Query: 165 -FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSS-SLS 221
D CYD SS + + P ++ + P P+ NY + VD +G T C A +S
Sbjct: 304 GLDLCYDRSSNRNYKFPALTIRLAGATMTP-PSSNYFLVVDDSGDTVCLAMGSAGGLPVS 362
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGNV QQG + ++ +S + F KC
Sbjct: 363 IIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 132/292 (45%), Gaps = 48/292 (16%)
Query: 5 TETVTLGSASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
T+ T S+S +A GC G GA+G++GLG G+LS SQ+NA+ FSYCL
Sbjct: 188 TDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTP 247
Query: 62 --RDSDSTSTL--------------EFDSSLPPNAVTAPLLRNHE---LDTFYYLGLTGI 102
RD+ S S L T P +N + TFYYL L G+
Sbjct: 248 YFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGL 307
Query: 103 SVGGDLLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRA 155
+ G + + AF + E+ GG ++DSG+ TRL + AL +RG+ +
Sbjct: 308 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 367
Query: 156 LSPTDGV--ALFDTCY----DFSSRSSVEVPTVSFHFPE----GKVLPLPAKNYLIPVDS 205
L P + C D S ++ VP + F + G+ L +PA+ Y V++
Sbjct: 368 LVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEA 427
Query: 206 NGTFCFAFAPTSS--------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ T+C A ++S +IIGN QQ RV ++L N L+ F P C
Sbjct: 428 S-TWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/261 (31%), Positives = 131/261 (50%), Gaps = 25/261 (9%)
Query: 10 LGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS- 64
+ A + + +GC + G F + G+L LG ++SF S + FSYCLVD S
Sbjct: 220 VKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSYCLVDHLSP 279
Query: 65 -DSTSTLEF--DSSLP--------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
++TS L F +S+L P A PL+ + + FY + + ISV G+LL I
Sbjct: 280 RNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDGELLKIPR 339
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
+++D G GG+IVDSGT++T L Y A+ A + A P + F+ CY+++S
Sbjct: 340 DVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL-ARFPRVAMDPFEYCYNWTS 396
Query: 174 RSSV----EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQ 228
S ++P ++ HF L P+K+Y+I + G C +S+IGN+ Q
Sbjct: 397 PSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDA-APGVKCIGVQEGPWPGISVIGNILQ 455
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q F+L+N + F ++C
Sbjct: 456 QEHLWEFDLKNRRLRFKRSRC 476
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/276 (33%), Positives = 128/276 (46%), Gaps = 38/276 (13%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TF 55
G ++T+ +T+ S N GC H G F G +G + LGGG S SQ + F
Sbjct: 260 GTYMTDILTISPGTSFLNFRFGCSHGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAF 319
Query: 56 SYCLVDRDSDSTSTL-------EFDSSLPPNAVTAPLLRNHEL--DTFYYLGLTGISVGG 106
SYC+ + +L + DS P + VT PL+RN + T+Y + L GI V G
Sbjct: 320 SYCVPKPSASGFLSLGGAINDGDSDSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAG 379
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTR--------A 155
L + F +GG ++DS VT+L Y ALR AF +RG R +
Sbjct: 380 RRLNVPPVVF------SGGTLMDSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTS 433
Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
+P G + DTCYDF +V VPTVS F G V+ L ++ C AF P
Sbjct: 434 STPAGGEMILDTCYDFEGLDNVTVPTVSLVFFGGAVVDLDPTTAVMMEG-----CLAFVP 488
Query: 216 TSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
T + L IGNVQQQ V +++ +GF C
Sbjct: 489 TPADFDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 524
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/266 (36%), Positives = 132/266 (49%), Gaps = 23/266 (8%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN--- 51
GD +T+T+GS SV + GCGHNN G F + +GL+GLGGG LS SQ+
Sbjct: 184 GDLAVDTLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLI 243
Query: 52 ASTFSYCLVD--RDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV D +S + F S AV+ PL + + DTFYYL L +SVG
Sbjct: 244 GGRFSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPL-ASRQPDTFYYLTLESMSVGS 302
Query: 107 DLLP---ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
L S+ + ++ G II+DSGT +T L + Y L V D
Sbjct: 303 KKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNN 362
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
+F CY S+ S + +PT++ HF G L L N + V + FCFA P S L+I
Sbjct: 363 VFSLCY--SNLSGLRIPTITAHF-VGADLELKPLNTFVQVQED-LFCFAMIPV-SDLAIF 417
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN+ Q V ++L++ + F P C
Sbjct: 418 GNLAQMNFLVGYDLKSRTVSFKPTDC 443
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 132/292 (45%), Gaps = 48/292 (16%)
Query: 5 TETVTLGSASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
T+ T S+S +A GC G GA+G++GLG G+LS SQ+NA+ FSYCL
Sbjct: 171 TDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTP 230
Query: 62 --RDSDSTSTL--------------EFDSSLPPNAVTAPLLRNHE---LDTFYYLGLTGI 102
RD+ S S L T P +N + TFYYL L G+
Sbjct: 231 YFRDTVSPSHLFVGDGELAGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGL 290
Query: 103 SVGGDLLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRA 155
+ G + + AF + E+ GG ++DSG+ TRL + AL +RG+ +
Sbjct: 291 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 350
Query: 156 LSPTDGV--ALFDTCY----DFSSRSSVEVPTVSFHFPE----GKVLPLPAKNYLIPVDS 205
L P + C D S ++ VP + F + G+ L +PA+ Y V++
Sbjct: 351 LVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEA 410
Query: 206 NGTFCFAFAPTSS--------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ T+C A ++S +IIGN QQ RV ++L N L+ F P C
Sbjct: 411 S-TWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 461
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 108/216 (50%), Gaps = 18/216 (8%)
Query: 44 LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
LS + +TFSYCL S + + TL + P + T PLL N + YY+ +TG
Sbjct: 186 LSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLLANPHRSSLYYVNMTG 245
Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALS 157
I VG ++ I +A D + G ++DSGT TRL Y ALRD R G A+S
Sbjct: 246 IRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVS 305
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
G FDTCY+ ++V P V+ F +G + LP +N +I T C A A
Sbjct: 306 SLGG---FDTCYN----TTVAWPPVTLLF-DGMQVTLPEENVVIHTTYGTTSCLAMAAAP 357
Query: 217 ---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++ L++I ++QQQ RV F++ N +GF C
Sbjct: 358 DGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393
>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
Length = 183
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 85/161 (52%), Gaps = 12/161 (7%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
+FY L + I+VGG LPI T F G ++DSGT +TRL + Y ALR +F
Sbjct: 30 SFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAK 84
Query: 153 TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFC 210
T GV++ DTC+D S +V +P V+F F G V+ L +K Y+ + C
Sbjct: 85 MSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ---VC 141
Query: 211 FAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
AFA S S+ +I GNVQQQ V ++ +GF PN C
Sbjct: 142 LAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 182
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 128/269 (47%), Gaps = 26/269 (9%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFP---SQIN 51
G F ETVT+ + N+ IGC + +G F A G++GLG SF ++
Sbjct: 186 GFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKF 245
Query: 52 ASTFSYCLVDRDSDS--TSTLEFDSSLPPNA----VTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLVD S ++ L F SS A +T L +++FY + + GIS+G
Sbjct: 246 GGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG 305
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN----ALRDAFVRGTRALSPTDG 161
G +L I + D G GG I+DSG+++T L Y ALR + ++ +
Sbjct: 306 GAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV---EMD 360
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
+ + C++ + VP + FHF +G P K+Y+I ++G C F +
Sbjct: 361 IGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGT 419
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S++GN+ QQ F+L +GF P+ C
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 128/269 (47%), Gaps = 26/269 (9%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFP---SQIN 51
G F ETVT+ + N+ IGC + +G F A G++GLG SF ++
Sbjct: 115 GFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKF 174
Query: 52 ASTFSYCLVDRDSDS--TSTLEFDSSLPPNA----VTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLVD S ++ L F SS A +T L +++FY + + GIS+G
Sbjct: 175 GGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG 234
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN----ALRDAFVRGTRALSPTDG 161
G +L I + D G GG I+DSG+++T L Y ALR + ++ +
Sbjct: 235 GAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV---EMD 289
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
+ + C++ + VP + FHF +G P K+Y+I ++G C F +
Sbjct: 290 IGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGT 348
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S++GN+ QQ F+L +GF P+ C
Sbjct: 349 SVVGNIMQQNHLWEFDLGLKKLGFAPSSC 377
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 128/269 (47%), Gaps = 26/269 (9%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFP---SQIN 51
G F ETVT+ + N+ IGC + +G F A G++GLG SF ++
Sbjct: 186 GFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKF 245
Query: 52 ASTFSYCLVDRDSDS--TSTLEFDSSLPPNA----VTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLVD S ++ L F SS A +T L +++FY + + GIS+G
Sbjct: 246 GGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG 305
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN----ALRDAFVRGTRALSPTDG 161
G +L I + D G GG I+DSG+++T L Y ALR + ++ +
Sbjct: 306 GAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV---EMD 360
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
+ + C++ + VP + FHF +G P K+Y+I ++G C F +
Sbjct: 361 IGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGT 419
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S++GN+ QQ F+L +GF P+ C
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/254 (35%), Positives = 119/254 (46%), Gaps = 24/254 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ V +T+TL + V + GC G GLLGLG G LS SQ + STFSY
Sbjct: 176 ANLVQDTITLATDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 235
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S +L F SL P PLL+N + YY+ L I VG ++
Sbjct: 236 CL-----PSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVD 290
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I A + + G I DSGT TRL Y A+RD F R + FDTCY+
Sbjct: 291 IPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYN 350
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
+ VPT++F F G + LP N LI + T C A A +S L++I N+
Sbjct: 351 V----PIVVPTITFIF-TGMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 405
Query: 227 QQQGTRVSFNLRNS 240
QQQ RV +++ NS
Sbjct: 406 QQQNHRVLYDVPNS 419
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/266 (35%), Positives = 129/266 (48%), Gaps = 39/266 (14%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINAS---T 54
G + ++T+ L S +V + GC H+ E F G GL+GLGG + S SQ A+ +
Sbjct: 213 GTYSSDTLALSASDTVTDFHFGCSHHEED-FDGEKIDGLMGLGGDAQSLVSQTAATYGKS 271
Query: 55 FSYCLVDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLL 109
FSYCL + S L F + PN VT P+LR + T Y + L ISVGG L
Sbjct: 272 FSYCLPPTNRTS-GFLTFGA---PNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPL 327
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF------VRGTRALSPTDGVA 163
I + + G ++DSGT +T L Y+AL AF +R RA +P +
Sbjct: 328 GIQPSVL------SNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRA-AP---LG 377
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
+ DTCYDF+ +V +P VS G V+ L +I C AFA TS SII
Sbjct: 378 ILDTCYDFTGLVNVSIPAVSLVLDGGAVVDLDGNGIMI------QDCLAFAATSGD-SII 430
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQ+ V ++ + GF C
Sbjct: 431 GNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/268 (32%), Positives = 128/268 (47%), Gaps = 29/268 (10%)
Query: 2 DFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-----VGAAGLLGLGGGSLSFPSQIN 51
D +ET T+GS AS +A GCGH+N G F G + S++
Sbjct: 184 DLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG 243
Query: 52 ASTFSYCLVDRDSDST--STLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV SDST S + F S + V+ PL++ DTFYYL L G+S+G
Sbjct: 244 GQ-FSYCLVPLSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTP-DTFYYLTLEGMSLGS 301
Query: 107 DLLPISETAFKIDESG-----NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
+ ++ F ++S II+DSGT +T L + Y + A + + TD
Sbjct: 302 E--KVAFKGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDP 359
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
F CY S +E+PT++ HF G + LP N + + CF+ P SS+L+
Sbjct: 360 RGTFSLCY--SGVKKLEIPTITAHFI-GADVQLPPLNTFVQAQED-LVCFSMIP-SSNLA 414
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I GN+ Q V ++L+N+ + F P C
Sbjct: 415 IFGNLSQMNFLVGYDLKNNKVSFKPTDC 442
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 116/275 (42%), Gaps = 30/275 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++ET+ L + V + +GC + AG+ G G G S PSQ+ FS+CLV
Sbjct: 240 GILLSETLDLENKRVPDFLVGCSVMSVH---QPAGIAGFGRGPESLPSQMRLKRFSHCLV 296
Query: 61 DR---DSDSTSTL------EFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISVGG 106
R DS +S L E D S + + AP N + +YYL L I +GG
Sbjct: 297 SRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGG 356
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD----AFVRGTRALSPTDGV 162
+ D +GNGG I+DSG+ T L + A+ D V+ RA +
Sbjct: 357 KPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKD-VEAQ 415
Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
+ C++ S E P V F G L L A+NYL V G C + +
Sbjct: 416 SGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVG 475
Query: 222 -------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G QQQ V ++L IGF KC
Sbjct: 476 GGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/263 (35%), Positives = 121/263 (46%), Gaps = 24/263 (9%)
Query: 1 GDFVTETVTLGSAS---VDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINA---S 53
G + ++T+TL S + GC G GL+GLGG + SF SQ A S
Sbjct: 212 GTYGSDTLTLAGTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGS 271
Query: 54 TFSYCLV-DRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
FSYCL +S TL SS T P+LR+ + TFY L L GISVGG L I
Sbjct: 272 AFSYCLPPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEI 331
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR--ALSPTDGVALFDTCY 169
+ F + G IVDSGT +TRL Y AL AF G P L DTC+
Sbjct: 332 PSSVF------SAGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCF 385
Query: 170 DFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
DF+ ++ VP+V+ G V+ L + +G FA IIGNV
Sbjct: 386 DFTGHGEGNNFTVPSVALVLDGGAVVDLHPNG----IVQDGCLAFAATDDDGRTGIIGNV 441
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ V +++ S+ GF P C
Sbjct: 442 QQRTFEVLYDVGQSVFGFRPGAC 464
>gi|56202144|dbj|BAD73477.1| chloroplast nucleoid DNA binding protein-like [Oryza sativa
Japonica Group]
gi|125571574|gb|EAZ13089.1| hypothetical protein OsJ_03009 [Oryza sativa Japonica Group]
Length = 316
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 130/275 (47%), Gaps = 41/275 (14%)
Query: 12 SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
A + + +GC + G F+ + G+L LG ++SF S+ + FSYCLVD +
Sbjct: 44 KAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRN 103
Query: 66 STSTLEFD-----SSLPPNAVTA---------------------PLLRNHELDTFYYLGL 99
+TS L F SS P+ TA PL+ +H FY + +
Sbjct: 104 ATSYLTFGPNPAFSSRRPSEGTASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTV 163
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
G+SV G+LL I + +++ GG I+DSGT++T L Y A+ A + L P
Sbjct: 164 KGVSVAGELLKIPRAVWDVEQ--GGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGL-PR 220
Query: 160 DGVALFDTCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
+ FD CY+++S S +V P ++ HF L PAK+Y+I + G C
Sbjct: 221 VTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDA-APGVKCIGLQE 279
Query: 216 -TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LS+IGN+ QQ ++L+N + F ++C
Sbjct: 280 GPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 314
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 122/267 (45%), Gaps = 28/267 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ V +TVTL + + + GC G GLLGLG G LS SQ + STFSY
Sbjct: 181 ANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 240
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S +L F SL P PLL+N + YY+ L I VG ++
Sbjct: 241 CL-----PSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVD 295
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVALFD 166
I A + + G + DSGT TRL Y A+RD F R +A + FD
Sbjct: 296 IPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFD 355
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSI 222
TCY + PT++F F G + LP N LI + T C A A +S L++
Sbjct: 356 TCYTV----PIVAPTITFMF-SGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNV 410
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I N+QQQ RV +++ NS +G C
Sbjct: 411 IANMQQQNHRVLYDVPNSRLGVARELC 437
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 121/266 (45%), Gaps = 26/266 (9%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
G TET+TL S S+ NI GCGHNN G F GL G GG LS SQI ++
Sbjct: 180 GVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTL 239
Query: 54 ----TFSYCLVDRDSDS--TSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
FS CLV +D TS + F ++ + + V + L + T+Y++ L GISVG
Sbjct: 240 GSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVG 299
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
L P S ++ + G + +D+GT T L + YN L V+G + P + V
Sbjct: 300 DKLFPFSSSS---PMATKGNVFIDAGTPPTLLPRDFYNRL----VQGVKEAIPMEPVQDP 352
Query: 166 DTCYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
D RS+ ++ P ++ HF V P ++ P G +CFA P I
Sbjct: 353 DLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFISP--KEGVYCFAMQPIDGDTGIF 410
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN Q + F+L + F C
Sbjct: 411 GNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|414589629|tpg|DAA40200.1| TPA: hypothetical protein ZEAMMB73_727364, partial [Zea mays]
Length = 201
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 89/175 (50%), Gaps = 8/175 (4%)
Query: 82 TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
T PLL++ + TFYY+ TG++VG L I E+AF + G+GG+IVDSGTA+T L
Sbjct: 28 TTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAV 87
Query: 142 YNALRDAFVRGTRAL-----SPTDGVALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPL 194
+ AF + R +P DGV SS S + VP + HF +G L L
Sbjct: 88 LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLDL 146
Query: 195 PAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
P +NY++ G C A + S IGN+ QQ RV ++L + P +C
Sbjct: 147 PRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 201
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 21/266 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G TE+ S + ++A GC G A+GL+GLG G LS SQI A+ FSY
Sbjct: 178 GSLGTESFAFESGTT-SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSY 236
Query: 58 CLVD--RDSDSTSTL--EFDSSLPPNAVTAPLL---RNHELDTFYYLGLTGISVGGDLLP 110
CL S ++S L +SL + P + +++ TFYYL L GI+VG LP
Sbjct: 237 CLTPYFHSSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLP 296
Query: 111 -ISETAFKIDE----SGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVA 163
++ T F++ + GG+I+D+G+ +T+L + Y AL++ A G +L P +
Sbjct: 297 AVNSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDS 356
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
+ C V VP + FHF G + +PA +Y PVD C SII
Sbjct: 357 GLELCVAREGFQKV-VPALVFHFGGGADMAVPAASYWAPVDKAAA-CMMILEGGYD-SII 413
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN QQQ + ++LR F C
Sbjct: 414 GNFQQQDMHLLYDLRRGRFSFQTADC 439
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 117/278 (42%), Gaps = 39/278 (14%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCL 59
+T+++ + N GC H G+ G G G LS P+Q+ + FSYCL
Sbjct: 133 DTLSMSQLFLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCL 189
Query: 60 VDRDSDSTSTLE--------FD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
V D + +D SS V +LRN + FY +GLTGISVG +
Sbjct: 190 VSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTI 249
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVALF 165
E ++D G+GG++VDSGT T L YN++ F R + S +
Sbjct: 250 LAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGL 309
Query: 166 DTCYDFSSRSSVEVPTVSFHF-PEGKVLPLPAKNYLIPV--------DSNGTFCFAFAPT 216
CY VEVPTV++HF + LP NY G
Sbjct: 310 GPCYFLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGD 367
Query: 217 SSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ LS I+GN QQQG V ++L N +GF +C
Sbjct: 368 DTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/261 (34%), Positives = 127/261 (48%), Gaps = 29/261 (11%)
Query: 6 ETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TFS 56
+T+TL S S NI IGCGH N+G G +G +GL G LSF SQ+N+S FS
Sbjct: 161 DTLTLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFS 220
Query: 57 YCLVD--RDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
YCLV + +S L F D S V+ P+ + + Y++ L SVG ++ +
Sbjct: 221 YCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPI----KEENGYFVSLEAFSVGDHIIKL 276
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ D GN I+DSGT +T L + Y+ L + + D F+ CY
Sbjct: 277 ENS----DNRGNS--IIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQT 330
Query: 172 SSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQ 228
+S + + +V ++ HF G + L A N P+ ++ CFAF SSL+I GNV Q
Sbjct: 331 TSTTLLTKVLIITAHF-SGSEVHLNALNTFYPI-TDEVICFAFVSGGNFSSLAIFGNVVQ 388
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V F+L I F P C
Sbjct: 389 QNFLVGFDLNKKTISFKPTDC 409
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 118/249 (47%), Gaps = 25/249 (10%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINA---ST 54
G + ++ +TL GS V GC H G+ GL+GLGG + S SQ A +
Sbjct: 203 GTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKS 262
Query: 55 FSYCLVDRDSDS-----TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
F YCL + S + T P+LR+ ++ T+Y+ L I+VGG L
Sbjct: 263 FFYCLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL 322
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+S + F G +VDSGT +TRL Y AL AF G + + + + DTC+
Sbjct: 323 GLSPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCF 376
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
+F+ V +PTV+ F G V+ L A + S G C AFAPT + IGNVQ
Sbjct: 377 NFTGLDKVSIPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQ 430
Query: 228 QQGTRVSFN 236
Q+ V ++
Sbjct: 431 QRTFEVLYD 439
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 104/224 (46%), Gaps = 30/224 (13%)
Query: 50 INASTFSYCLVDRDSDSTSTLEFDSSL---------PPNAVTAPLLRNHELDTFYYLGLT 100
I TFSYCL S S F SL P T PLL + + YY+ +T
Sbjct: 238 IYEGTFSYCL---PSYYRSAANFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMT 294
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--------- 151
G+ +G +PI +A D + G ++DSGT RL Y A+RD R
Sbjct: 295 GVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRR 354
Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
G A + FDTCY+ S+V P V+ F G + LP +N +I T C
Sbjct: 355 GGGGASVSVSSLGGFDTCYNV---STVAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSC 411
Query: 211 FAFAPT-----SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A A + +++L++IG++QQQ RV F++ N+ +GF +C
Sbjct: 412 LAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERC 455
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 111/273 (40%), Gaps = 40/273 (14%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCLVDRDSDST 67
S+ + GC H+ G +G AG G GSLS P+Q+ + FSYCLV DST
Sbjct: 220 SLKDFTFGCAHSALGEPIGVAGF---GFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDST 276
Query: 68 S-----------TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
E D V P+L N + FY + + ISVG +
Sbjct: 277 KLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNALI 336
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVALFDTCYDFS 172
+ID GNGG++VDSGT T L T YN++ R + S T+ CY
Sbjct: 337 RIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLE 396
Query: 173 ----SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV-------DSNGTFCFAFAPTSSSL- 220
R + VP ++FHF + LP +NY C
Sbjct: 397 GNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESE 456
Query: 221 ----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +GN QQQG +V ++L +GF P KC
Sbjct: 457 GGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 118/269 (43%), Gaps = 38/269 (14%)
Query: 1 GDFVTETVTLGS---ASVDNIAIGCGHN--NEGLFVGA-AGLLGLGGGSLSFPSQINAS- 53
G +V++ +TL + +V GC H G F AG + LG G+ S SQ +
Sbjct: 256 GTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKTAGFMALGRGAQSLSSQTKGTF 315
Query: 54 ----TFSYCLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
FSYCL S +L P+L++ Y + L GI V G
Sbjct: 316 SKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPMLKSKMAPMIYMVRLIGIDVAGQR 375
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALF 165
LP+ F + + +DS T +TRL Y ALR AF +R RA++P
Sbjct: 376 LPVPPAVFAANAA------MDSRTIITRLPPTAYMALRAAFRAQMRAYRAVAPK---GQL 426
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSL-- 220
DTCYDF+ V +P V+ F +N + +D +G C AFAP ++
Sbjct: 427 DTCYDFTGVPMVRLPKVTLVF---------DRNAAVELDPSGVMLDSCLAFAPNANDFMP 477
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGNVQQQ V +N+ + +GF C
Sbjct: 478 GIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 120/267 (44%), Gaps = 28/267 (10%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
G TET+TL S S+ NI GCGHNN G F GL G GG LS SQI ++
Sbjct: 180 GVIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTL 239
Query: 54 ----TFSYCLVDRDSDS--TSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISV 104
FS CLV +D TS + F + V+ PL+ + T+Y++ L GISV
Sbjct: 240 GSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDD-PTYYFVTLDGISV 298
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
G L P S ++ + G + +D+GT T L + YN L V+G + P + V
Sbjct: 299 GDKLFPFSSSS---PMATKGNVFIDAGTPPTLLPRDFYNRL----VQGVKEAIPMEPVQD 351
Query: 165 FDTCYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
D RS+ ++ P ++ HF V P ++ P G +CFA P I
Sbjct: 352 PDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFISP--KEGVYCFAMQPIDGDTGI 409
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN Q + F+L + F C
Sbjct: 410 FGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 64/108 (59%), Positives = 79/108 (73%), Gaps = 4/108 (3%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
G F TET+T G+ SV N+AIGCGH N GLF+GAAGLLGLG G+LSFP+QI TFSY
Sbjct: 244 GSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSY 303
Query: 58 CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
CLVDR+SDS+ L+F S+P ++ PL +N L TFYYL +T IS+
Sbjct: 304 CLVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISI 351
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 128/275 (46%), Gaps = 41/275 (14%)
Query: 12 SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
A + + +GC + G F+ + G+L LG ++SF S+ + FSYCLVD +
Sbjct: 212 KAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRN 271
Query: 66 STSTL------EFDSSLPPNAVTA--------------------PLLRNHELDTFYYLGL 99
+TS L F S P + + PL+ +H FY + +
Sbjct: 272 ATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTV 331
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
G+SV G+LL I + +++ GG I+DSGT++T L Y A+ A + L P
Sbjct: 332 KGVSVAGELLKIPRAVWDVEQ--GGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGL-PR 388
Query: 160 DGVALFDTCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
+ FD CY+++S S +V P ++ HF L PAK+Y+I + G C
Sbjct: 389 VTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDA-APGVKCIGLQE 447
Query: 216 -TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LS+IGN+ QQ ++L+N + F ++C
Sbjct: 448 GPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 96/211 (45%), Gaps = 21/211 (9%)
Query: 50 INASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGI 102
I + FSYCL S + F SL P + T PLL N + YY+ LT I
Sbjct: 236 IYSGVFSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLHNPHRPSLYYVNLTAI 290
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
SVG +P+ + S G I+DSGT +TR YNA+RD F + + P +
Sbjct: 291 SVGRVYVPLPSELLAFNPSTGAGTIIDSGTVITRFVEPIYNAVRDEFRK--QVTGPFSSL 348
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--- 219
FDTC F P ++ HF + L LP +N LI S C A A S+
Sbjct: 349 GAFDTC--FVKNYETLAPAITLHFTDLD-LKLPLENSLIHSSSGSLACLAMAAAPSNVNS 405
Query: 220 -LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L++I N QQQ RV F+ N+ +G C
Sbjct: 406 VLNVIANFQQQNLRVLFDTVNNKVGIARELC 436
>gi|413923981|gb|AFW63913.1| hypothetical protein ZEAMMB73_837345 [Zea mays]
Length = 414
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 93/178 (52%), Gaps = 19/178 (10%)
Query: 54 TFSYCLVDRDSDSTSTLEFDS-----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
TFSY LV+ DSD+ S + F + P TA + DTFYY+ L G+ VGG+L
Sbjct: 5 TFSYRLVEHDSDAVSKVVFREDDLVLAHPELKYTAFTPTSSPADTFYYVKLKGVLVGGEL 64
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-GVALFDT 167
L IS + + + G+GG I+DSGT ++ Y A+ P+D G+ +
Sbjct: 65 LKISSDTWDVGKDGSGGTIIDSGTTLSYFVEPVYQAV------------PSDPGLLGAEP 112
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSIIG 224
CY+ S EVP +S FP+G V PA+NY + +D + C A TS + +SIIG
Sbjct: 113 CYNVSGMERPEVPELSLLFPDGAVWDFPAENYFVRLDPDDIMCLAVLGTSRTGMSIIG 170
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 123/268 (45%), Gaps = 38/268 (14%)
Query: 12 SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS--D 65
A + + +GC + G F + G+L LG +SF S + FSYCLVD S +
Sbjct: 216 KAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRN 275
Query: 66 STSTLEFDSSLPPN-------------------AVTAPLLRNHELDTFYYLGLTGISVGG 106
+TS L F PN A PLL + + FY + L ISV G
Sbjct: 276 ATSYLTFG----PNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+ L I + ++ GG+I+DSGT++T L Y A+ A +G L P + F+
Sbjct: 332 EFLKIPRAVWDVE--AGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGL-PRVTMDPFE 388
Query: 167 TCYDFSSRS----SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLS 221
CY+++S S V VP ++ HF L P K+Y+I + G C +S
Sbjct: 389 YCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDA-APGVKCIGLQEGPWPGIS 447
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IGN+ QQ F+++N + F ++C
Sbjct: 448 VIGNILQQEHLWEFDIKNRRLKFQRSRC 475
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 116/249 (46%), Gaps = 20/249 (8%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQIN----ASTFSYCLVD--RDSD 65
S+ GCGHNN G F GL+GLGGG S SQI FS CLV D
Sbjct: 173 VSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIK 232
Query: 66 STSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
+S + F L VT PL+ E DT Y++ L GISV P++ T G
Sbjct: 233 ISSRMSFGKGSQVLGNGVVTTPLVP-REKDTSYFVTLLGISVEDTYFPMNSTI------G 285
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTV 182
++VDSGT L + Y+ + A VR AL P T + ++++++ PT+
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVF-AEVRNKVALKPITDDPSLGTQLCYRTQTNLKGPTL 344
Query: 183 SFHFPEGKVLPLPAKNYLIPV-DSNGTFCFA-FAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
+FHF VL P + ++ P + G FC A + T+S + GN Q + F+L
Sbjct: 345 TFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQ 404
Query: 241 LIGFTPNKC 249
++ F P C
Sbjct: 405 VVSFKPTDC 413
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 122/271 (45%), Gaps = 27/271 (9%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQI------- 50
G V +T+TL SA+ GC + F GA GL+ L S S S++
Sbjct: 170 GTLVRDTLTLPPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATT 229
Query: 51 NASTFSYCLVDRDSDSTST-LEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVG 105
+A+ FSYCL + S+ L +S P + AP+ N Y++ L GISVG
Sbjct: 230 SAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVG 289
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G+ LP+ F G ++++ T T L Y ALRDAF R +
Sbjct: 290 GEDLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRVL 344
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC-------FAFAPTSS 218
DTCY+ + +S+ VPTV+ F G L L + + D + F A +
Sbjct: 345 DTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAF 404
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S+IG + Q+ T V ++LR +GF P +C
Sbjct: 405 PVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 96/207 (46%), Gaps = 22/207 (10%)
Query: 55 FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCL S + F SL P + T PLLRN + Y++ LTGI+VG
Sbjct: 241 FSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKV 295
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+P + D + G I+DSGT +TR YNA+RD F + + P + FDT
Sbjct: 296 NVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRK--QVTGPFSSLGAFDT 353
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSI 222
C F P ++ HF + L LP +N LI S C A A T + L++
Sbjct: 354 C--FVKNYETLAPAITLHFTDLD-LKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNV 410
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I N QQQ RV F+ N+ +G C
Sbjct: 411 IANYQQQNLRVLFDTVNNKVGIARELC 437
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 79/232 (34%), Positives = 111/232 (47%), Gaps = 20/232 (8%)
Query: 34 AGLLGLGGGSLSFPSQINASTFSYCLVD--RDSDSTSTLEFDSSLP----PNAVTAPLLR 87
+GL+GLG G LS SQ A+ FSYCL ++ +T L +S + +T ++
Sbjct: 152 SGLMGLGRGRLSLVSQTGATKFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMTTQFVK 211
Query: 88 NHELDTFYYLGLTGISVGGDLLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYN 143
+ FYYL L G++VG LPI T F + E +GG+I+DSG+ T L + Y+
Sbjct: 212 GPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYD 271
Query: 144 ALRD---AFVRGTRALSPTDGVALFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKN 198
AL A + G+ P D D +R V VP V FHF G + +PA++
Sbjct: 272 ALASELAARLNGSLVAPPPDA----DDGALCVARRDVGRVVPAVVFHFRGGADMAVPAES 327
Query: 199 YLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
Y PVD + S+IGN QQQ RV ++L N F P C
Sbjct: 328 YWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 379
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 103/221 (46%), Gaps = 28/221 (12%)
Query: 44 LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYY 96
+S I STFSYCL S +L F SL P LLRN + YY
Sbjct: 231 MSQAQSIYKSTFSYCL-----PSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYY 285
Query: 97 LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
+ L I VG ++ + A + S G I DSGT TRL Y A+R+ F + +
Sbjct: 286 VNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK-- 343
Query: 157 SPTDGVAL----FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
PT V FDTCY V+VPT++F F +G + +PA N ++ + T C A
Sbjct: 344 -PTTAVVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLA 397
Query: 213 FAP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A +S +++I ++QQQ RV ++ N +G +C
Sbjct: 398 MAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/257 (33%), Positives = 121/257 (47%), Gaps = 20/257 (7%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINAS----T 54
G++ T+ +TLG A V GCGH+ + G F A G+LGLG S Q +A
Sbjct: 225 GEYSTDALTLGPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGV 284
Query: 55 FSYCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
FS+CL ST L + +A V PLL + FY L T ISV G LL I
Sbjct: 285 FSHCLPPTGV-STGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPP 343
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
F+ G+I DSGT ++ LQ Y ALR AF V DTC++F+
Sbjct: 344 AVFR------EGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTG 397
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS-IIGNVQQQGTR 232
+V VPTVS F G + L A + ++ +D C AF + + +IG+V Q+
Sbjct: 398 YDNVTVPTVSLTFRGGATVHLDASSGVL-MDG----CLAFWSSGDEYTGLIGSVSQRTIE 452
Query: 233 VSFNLRNSLIGFTPNKC 249
V +++ +GF C
Sbjct: 453 VLYDMPGRKVGFRTGAC 469
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 119/271 (43%), Gaps = 25/271 (9%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSL-----SFPSQINAST 54
GD T+ + + + V+N+ +GCG +NEGLF AAGLLG + +P + S+
Sbjct: 178 GDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGRRAAARYPSRRRWPRRTAPSS 237
Query: 55 FSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRN-----HELDTFYYLGLTGISVG--GD 107
+ R + + ++ T+ + G + G G
Sbjct: 238 STASATGRRAQRAARTSCSAARRSRRPRRSPPCCRTRGARACTTWTWPGSASAARGSPGS 297
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---AL 164
P S + G +VDSGTA++R + Y ALRDAF RA ++
Sbjct: 298 RTPASRWTRRRGRGGV---VVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSV 354
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD------SNGTFCFAFAPTSS 218
FD CYD R + P + HF G + LP +NY +PVD ++ C F
Sbjct: 355 FDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD 414
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
LS+IGNVQQQG RV F++ IGF P C
Sbjct: 415 GLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 445
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 121/267 (45%), Gaps = 28/267 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ V +TVTL + + GC G GLLGLG G LS SQ + STFSY
Sbjct: 180 ANVVQDTVTLATDPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 239
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S +L F SL P PLL+N + YY+ L I VG ++
Sbjct: 240 CL-----PSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVD 294
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVALFD 166
I A + + G + DSGT TRL Y A+RD F R +A + FD
Sbjct: 295 IPPAALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFD 354
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSI 222
TCY + PT++F F G + LP N LI + T C A A +S L++
Sbjct: 355 TCYTV----PIVAPTITFMF-SGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNV 409
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I N+QQQ RV +++ NS +G C
Sbjct: 410 IANMQQQNHRVLYDVPNSRLGVARELC 436
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 103/221 (46%), Gaps = 28/221 (12%)
Query: 44 LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYY 96
+S I STFSYCL S +L F SL P LLRN + YY
Sbjct: 247 MSQAQSIYKSTFSYCL-----PSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYY 301
Query: 97 LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
+ L I VG ++ + A + S G I DSGT TRL Y A+R+ F + +
Sbjct: 302 VNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK-- 359
Query: 157 SPTDGVAL----FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
PT V FDTCY V+VPT++F F +G + +PA N ++ + T C A
Sbjct: 360 -PTTAVVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLA 413
Query: 213 FAP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A +S +++I ++QQQ RV ++ N +G +C
Sbjct: 414 MAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 454
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 109/234 (46%), Gaps = 12/234 (5%)
Query: 21 GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD--SSLPP 78
GCG +G +GA+G+LG+ LS SQ+ FSYCL +S L F + L
Sbjct: 202 GCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYTDRKSSPLFFGAWADLGR 261
Query: 79 NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
T P+ ++ L +YY+ L G+S+G L + F + + GG +VD G V +L
Sbjct: 262 YKTTGPIQKS--LTFYYYVPLVGLSLGTRRLDVPAATFALKQ---GGTVVDLGCTVGQLA 316
Query: 139 TETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS---RSSVEVPTVSFHFPEGKVLPLP 195
+ AL++A + V + C+ S +V+ P + +F G + LP
Sbjct: 317 EPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLP 376
Query: 196 AKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
NY + G C A P +SIIGNVQQQ + F++ +S F P C
Sbjct: 377 RDNYF-QEPTAGLMCLALVP-GGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/250 (29%), Positives = 112/250 (44%), Gaps = 10/250 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G + + D + GC EG G++GLG G LS SQ+ FSY L
Sbjct: 193 GLLAVDAFAFATVRADGVIFGCAVATEGDI---GGVIGLGRGELSLVSQLQIGRFSYYLA 249
Query: 61 DRDS-DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
D+ D S + F P AV+ PL+ N + YY+ L GI V G+ L I F
Sbjct: 250 PDDAVDVGSFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTF 309
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
+ G+GG+++ VT L Y +R A L DG L D CY S +
Sbjct: 310 DLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI-GLRAADGSELGLDLCYTSESLA 368
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVS 234
+ +VP+++ F G V+ L NY + G C P+ + S++G++ Q GT +
Sbjct: 369 TAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMI 428
Query: 235 FNLRNSLIGF 244
+++ S + F
Sbjct: 429 YDISGSRLVF 438
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/257 (31%), Positives = 120/257 (46%), Gaps = 24/257 (9%)
Query: 12 SASVDNIAIGC--GHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DS 64
A + + +GC H +G F + G+L LG ++SF S+ + FSYCLVD
Sbjct: 241 KAKLQGVVLGCTTAHAGQG-FEASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPR 299
Query: 65 DSTSTLEF-------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
++TS L F SS P PLL + + FY + + +SV G L I +
Sbjct: 300 NATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDGVALDIPAEVW- 358
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR--- 174
D NGG I+DSGT++T L T Y A+ A L P + FD CY++++R
Sbjct: 359 -DVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGL-PRVAMDPFDYCYNWTARGDG 416
Query: 175 -SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTR 232
+ VP ++ F L PAK+Y+I + G C + +S+IGN+ QQ
Sbjct: 417 GGDLAVPKLAVQFAGSARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHL 475
Query: 233 VSFNLRNSLIGFTPNKC 249
F+L N + F C
Sbjct: 476 WEFDLNNRWLRFRQTSC 492
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 117/278 (42%), Gaps = 35/278 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G +TE + +V + +GC + AG+ G G G +S PSQ+N FS+CLV
Sbjct: 196 GVLITEKLDFPDLTVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLV 252
Query: 61 DR---DSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-----FYYLGLTGISVG 105
R D++ T+ L+ D+ S P P +N + +YYL L I VG
Sbjct: 253 SRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVG 312
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSPT 159
+ I +G+GG IVDSG+ T ++ + + + F + L
Sbjct: 313 RKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKE 372
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
G+ C++ S + V VP + F F G L LP NY V + T C +
Sbjct: 373 TGLG---PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429
Query: 219 -------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G+ QQQ V ++L N GF KC
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 117/278 (42%), Gaps = 35/278 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G +TE + +V + +GC + AG+ G G G +S PSQ+N FS+CLV
Sbjct: 196 GVLITEKLDFPDLTVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLV 252
Query: 61 DR---DSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-----FYYLGLTGISVG 105
R D++ T+ L+ D+ S P P +N + +YYL L I VG
Sbjct: 253 SRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVG 312
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSPT 159
+ I +G+GG IVDSG+ T ++ + + + F + L
Sbjct: 313 RKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKE 372
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
G+ C++ S + V VP + F F G L LP NY V + T C +
Sbjct: 373 TGLG---PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429
Query: 219 -------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G+ QQQ V ++L N GF KC
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 126/262 (48%), Gaps = 21/262 (8%)
Query: 1 GDFVTETVTLGSASVDNI-----AIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINAST 54
GD ET+TLGS + + IGCG N G+ +G++GLG G +S +Q++ ST
Sbjct: 177 GDLSVETLTLGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPST 236
Query: 55 ---FSYCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDL 108
FSYCLV S ++S L F ++ + V+ PL + L FY+L L SVG +
Sbjct: 237 GGKFSYCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGL-VFYFLTLEAFSVGRNR 295
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
+ G G II+DSGT +T L Y+ L A + D + C
Sbjct: 296 IEFGSPG----SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLC 351
Query: 169 YDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
Y + + VP ++ HF G + L A N + V ++ CFAF PT + ++ GN+
Sbjct: 352 YKVTPDKLDASVPVITAHF-SGADVTLNAINTFVQV-ADDVVCFAFQPTETG-AVFGNLA 408
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ V ++L+ + + F C
Sbjct: 409 QQNLLVGYDLQMNTVSFKHTDC 430
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 97/206 (47%), Gaps = 20/206 (9%)
Query: 55 FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCL S + F SL P + T PLLR+ + YY+ TGISVG
Sbjct: 242 FSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRV 296
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
L+P + + G I+DSGT +TR YNA+R+ F + + T + FDT
Sbjct: 297 LVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDT 355
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSII 223
C F P ++ HF EG L LP +N LI + C A A +S L++I
Sbjct: 356 C--FVKTYETLAPPITLHF-EGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVI 412
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
N QQQ R+ F++ N+ +G C
Sbjct: 413 ANFQQQNLRILFDIVNNKVGIAREVC 438
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 117/265 (44%), Gaps = 32/265 (12%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
G + + +TLG V GC H + G AG L LGGGS S Q
Sbjct: 247 GTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRV 306
Query: 55 FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL + S+L F + L P+ V+ PLL + TFY + L I V G
Sbjct: 307 FSYCL----PPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAG 362
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
L + F ++DS T ++RL Y ALR AF V++ D
Sbjct: 363 RPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPVSILD 416
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIG 224
TCYDF+ S+ +P+++ F G + L A L+ G+ C AFAPT+S IG
Sbjct: 417 TCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLAFAPTASDRMPGFIG 470
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
NVQQ+ V +++ + F C
Sbjct: 471 NVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/255 (33%), Positives = 115/255 (45%), Gaps = 22/255 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNN---EGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G + + +TL A V N GCGH GLF G+LGLG S ++ FS
Sbjct: 51 GAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGG-VFS 106
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S P V P+ TF + L GI+VGG L + +AF
Sbjct: 107 YCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF 166
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYDFSSR 174
+GG+IVDSGT +T LQ+ Y ALR AF + A L P + DTCY+ +
Sbjct: 167 ------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGY 217
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+V VP ++ F G + L N ++ NG FA + S ++GNV Q+ V
Sbjct: 218 KNVVVPKIALTFTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVL 274
Query: 235 FNLRNSLIGFTPNKC 249
F+ S GF C
Sbjct: 275 FDTSTSKFGFRAKAC 289
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 89/261 (34%), Positives = 124/261 (47%), Gaps = 23/261 (8%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
V +++ L + + N + GC + G V A GLLGLG G LS SQ ++ FSYCL
Sbjct: 188 LVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCL 247
Query: 60 VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
S + F SL P + T PLLR+ + YY+ TGISVG L+P
Sbjct: 248 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFP 302
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+ + G I+DSGT +TR YNA+R+ F + + T + FDTC F
Sbjct: 303 SEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTC--FV 359
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
P ++ HF EG L LP +N LI + C A A +S L++I N QQ
Sbjct: 360 KTYETLAPPITLHF-EGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQ 418
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q R+ F+ N+ +G C
Sbjct: 419 QNLRILFDTVNNKVGIAREVC 439
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 103/220 (46%), Gaps = 26/220 (11%)
Query: 44 LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYY 96
+S + STFSYCL S +L F SL P LLRN + YY
Sbjct: 231 MSQAQSVYKSTFSYCL-----PSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYY 285
Query: 97 LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
+ L I VG ++ + A + S G I DSGT TRL Y A+R+ F + R
Sbjct: 286 VNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRK--RVK 343
Query: 157 SPTDGVAL---FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF 213
PT V FDTCY V+VPT++F F +G + +PA N ++ + T C A
Sbjct: 344 PPTAVVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLAM 398
Query: 214 AP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A +S +++I ++QQQ RV ++ N +G +C
Sbjct: 399 ASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/236 (29%), Positives = 104/236 (44%), Gaps = 24/236 (10%)
Query: 38 GLGGGSLSFPSQINASTFSYCLVDRDSDST---STLEFDSSLPPNAVTA-----PLLRN- 88
G G G S PSQ+ FSYCL+ R D T S+L D TA P ++N
Sbjct: 218 GFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNP 277
Query: 89 -----HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
H +YYLGL I+VGG + I G+GG I+DSGT T ++ E +
Sbjct: 278 KVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFE 337
Query: 144 ALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
+ F + ++ T +G+ C++ S ++ P ++ F G + LP NY+
Sbjct: 338 LVAAEFEKQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVA 397
Query: 202 PVDSNGTFCFAFAPTSSSLS--------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ + C ++ I+GN QQQ V ++LRN +GF C
Sbjct: 398 FLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 453
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/237 (35%), Positives = 109/237 (45%), Gaps = 24/237 (10%)
Query: 24 HNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPN 79
H G F + +G + LGGG S SQ A+ FSYC+ D S +L +
Sbjct: 245 HAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGA 304
Query: 80 AVTA--PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTR 136
A PL+RN + T Y + L GI VGG L + F GG ++DS +T+
Sbjct: 305 GRFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQ 358
Query: 137 LQTETYNALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPL 194
L Y ALR AF R A P G A DTCYDF +SV VP VS F G V+ L
Sbjct: 359 LPPTAYRALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRL 417
Query: 195 PAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A ++ C AF PT +L IGNVQQQ V +++ +GF C
Sbjct: 418 DAMGVMV------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
>gi|115448347|ref|NP_001047953.1| Os02g0720500 [Oryza sativa Japonica Group]
gi|113537484|dbj|BAF09867.1| Os02g0720500, partial [Oryza sativa Japonica Group]
Length = 172
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 87/175 (49%), Gaps = 24/175 (13%)
Query: 82 TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
T PLL T+Y + L GISVGG L I + F G +VD+GT VTRL
Sbjct: 15 TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------SGAVVDTGTVVTRLPPTA 68
Query: 142 YNALRDAFVRGTRALSP-----TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPA 196
Y+ALR AF A++P + DTCYDF+ +V +PT+S F G + L
Sbjct: 69 YSALRSAF---RAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGT 125
Query: 197 KNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L + C AFAPT S SI+GNVQQ+ V F+ S +GF P C
Sbjct: 126 SGILT------SGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 172
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 73/222 (32%), Positives = 101/222 (45%), Gaps = 19/222 (8%)
Query: 42 GSLSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHEL 91
G +S SQ + FSYCL S + F SL P N PLL N
Sbjct: 191 GPMSLLSQTGSRYNGVFSYCL-----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHR 245
Query: 92 DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
+ YY+ +TG+SVG L+ +F D S G ++DSGT +TR Y ALRD F R
Sbjct: 246 PSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAPVYAALRDEFRR 305
Query: 152 GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
A S + FDTC++ ++ P V+ H G L LP +N LI + C
Sbjct: 306 QVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPMENTLIHSSATPLACL 365
Query: 212 AFAPTSS----SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A A ++++ N+QQQ RV ++ S +GF C
Sbjct: 366 AMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 121/289 (41%), Gaps = 53/289 (18%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCL 59
+T++L S + N GC + G+ G G G LS P+Q+ + FSYCL
Sbjct: 163 DTLSLSSLFLRNFTFGCAYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCL 219
Query: 60 VDRDSDSTSTLEFDSSL----------------PPNAVTAPLLRNHELDTFYYLGLTGIS 103
V DS + + V P+L N + FY +GL GIS
Sbjct: 220 VSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGIS 279
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-------RAL 156
VG ++P E +++ G+GG++VDSGT T L YN++ D F RG R +
Sbjct: 280 VGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKI 339
Query: 157 SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK-VLPLPAKNYLIP-VDSN-------- 206
G+A CY + S EVP ++ F G + LP KNY +D
Sbjct: 340 EEKTGLA---PCYYLN--SVAEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRR 394
Query: 207 -GTFCFAFAPTSSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
G + LS +GN QQQG V ++L +GF +C
Sbjct: 395 VGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443
>gi|242059939|ref|XP_002459115.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
gi|241931090|gb|EES04235.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
Length = 153
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 85/157 (54%), Gaps = 12/157 (7%)
Query: 99 LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
+ GI VGG +P+ +A D + G IVD+GT TRL Y A+RDAF R RA P
Sbjct: 1 MVGIRVGGKPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDAFRRRVRA--P 58
Query: 159 TDG-VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-- 215
G + FDTCY+ +V VPTV+F F + LP +N +I S G C A A
Sbjct: 59 VAGPLGGFDTCYNV----TVSVPTVTFVFDGPVSVTLPEENVVIRSSSGGIACLAMAAGP 114
Query: 216 ---TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++L+++ ++QQQ RV F++ N +GF+ C
Sbjct: 115 PDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 151
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 85/255 (33%), Positives = 115/255 (45%), Gaps = 22/255 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G + + +TL A V N GCGH GLF G+LGLG S ++ FS
Sbjct: 173 GAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGG-VFS 228
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
YCL S P V P+ TF + L GI+VGG L + +AF
Sbjct: 229 YCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF 288
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYDFSSR 174
+GG+IVDSGT +T LQ+ Y ALR AF + A L P + DTCY+ +
Sbjct: 289 ------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGY 339
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
+V VP ++ F G + L N ++ NG FA + S ++GNV Q+ V
Sbjct: 340 KNVVVPKIALTFTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVL 396
Query: 235 FNLRNSLIGFTPNKC 249
F+ S GF C
Sbjct: 397 FDTSTSKFGFRAKAC 411
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 113/279 (40%), Gaps = 51/279 (18%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLV------DR 62
+ N GC H G VG AG G G LS P+Q+ + + FSYCLV DR
Sbjct: 204 LHNFTFGCAHTALGEPVGVAGF---GRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADR 260
Query: 63 --------------DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
D + + D V +L N + FY +GL GI+VG
Sbjct: 261 VRRPSPLILGRYSLDDEKKKRVGHDRG---EFVYTAMLDNPKHPYFYCVGLEGITVGNRK 317
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV----RGTRALSPTDGVAL 164
+P+ E ++D GNGG++VDSGT T L Y +L F R + + +
Sbjct: 318 IPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTG 377
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV--------DSNGTFCFAF--- 213
CY +S S+ +VP V+ HF + LP NY C
Sbjct: 378 LGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNG 436
Query: 214 ---APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A + + +GN QQQG V ++L +GF KC
Sbjct: 437 GDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475
>gi|168008086|ref|XP_001756738.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691976|gb|EDQ78335.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 174
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 86/170 (50%), Gaps = 8/170 (4%)
Query: 84 PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
PLL++ ++TFY++ L ++V G LPIS K++ GNGG I+D T TR +
Sbjct: 6 PLLKHPLVETFYFVNLVAVAVNGAKLPISSKVLKMNSEGNGGAILDMSTRFTRFPNSAF- 64
Query: 144 ALRDAFVRGTRALS--PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
D V+ +AL PT V F CY + ++ +PTV+ F G + LP +N +
Sbjct: 65 ---DHLVKALKALIRLPTMVVPRFQLCYSTVNTGTLIIPTVTLIFENGVRMRLPMENTFV 121
Query: 202 PVDSNG-TFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
V G C A P + ++IG+ QQQ + + S +GF P +C
Sbjct: 122 SVTEQGDVMCLAMVPGNPGTATVIGSAQQQNFLIVIDREASRLGFAPLQC 171
>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
Length = 205
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 85/166 (51%), Gaps = 17/166 (10%)
Query: 36 LLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF------------DSSLPPNAVTA 83
++GLG G LS SQ+ S FSYCL S S L F S LP +
Sbjct: 1 MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLP--VQST 58
Query: 84 PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
PL+ N L + Y++ L GIS+G LPI F I++ G GG+ +DSGT++T LQ + Y+
Sbjct: 59 PLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVYD 118
Query: 144 ALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSV--EVPTVSFHF 186
A+R V R L P + + +TC+ + +V VP + HF
Sbjct: 119 AVRRELVSVLRPLPPANDTEIGLETCFPWPPPPTVTMTVPDMELHF 164
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 130/274 (47%), Gaps = 38/274 (13%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
G++ +T+TL + V G G NN+G F G G+LGLG G LS SQ + F
Sbjct: 177 GNYGCDTMTLEPSDVFQKFQFGRGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVF 236
Query: 56 SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
SYCL + DS +S+L+F S V P + +Y++ L+ ISV
Sbjct: 237 SYCLPEEDSIGSLLFGEKATSQSSSLKFTS-----LVNGP--GTLQESGYYFVNLSDISV 289
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
G + L I + F + G I+DS T +TRL Y+AL+ AF + ++G
Sbjct: 290 GNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 344
Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS- 219
+ DTCY+ S R V +P + HF G + L N + D + C AFA S S
Sbjct: 345 KGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDES-RLCLAFAGNSKST 403
Query: 220 ----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L+IIGN QQ V ++++ IGF N C
Sbjct: 404 MNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437
>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 205
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 85/166 (51%), Gaps = 17/166 (10%)
Query: 36 LLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF------------DSSLPPNAVTA 83
++GLG G LS SQ+ S FSYCL S S L F S LP +
Sbjct: 1 MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLP--VQST 58
Query: 84 PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
PL+ N L + Y++ L GIS+G LPI F I++ G GG+ +DSGT++T LQ + Y+
Sbjct: 59 PLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVYD 118
Query: 144 ALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSV--EVPTVSFHF 186
A+R V R L P + + +TC+ + +V VP + HF
Sbjct: 119 AVRRELVSVLRPLPPANDTEIGLETCFPWPPPPTVTMTVPDMELHF 164
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 82/252 (32%), Positives = 113/252 (44%), Gaps = 16/252 (6%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G + + +TL A V N GCGH + G+LGLG S ++ FSYCL
Sbjct: 207 GAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYGG-VFSYCL 265
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
S P V P+ TF + L GI+VGG L + +AF
Sbjct: 266 PSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF--- 322
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYDFSSRSSV 177
+GG+IVDSGT +T LQ+ Y ALR AF + A L P + DTCY+ + +V
Sbjct: 323 ---SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGYKNV 376
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
VP ++ F G + L N ++ NG FA + S ++GNV Q+ V F+
Sbjct: 377 VVPKIALTFTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDT 433
Query: 238 RNSLIGFTPNKC 249
S GF C
Sbjct: 434 STSKFGFRAKAC 445
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 95/204 (46%), Gaps = 22/204 (10%)
Query: 55 FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCL S + F SL P + T PLLRN + Y++ LTGI+VG
Sbjct: 241 FSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKV 295
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+P + D + G I+DSGT +TR YNA+RD F + + P + FDT
Sbjct: 296 NVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRK--QVTGPFSSLGAFDT 353
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSI 222
C F P ++ HF + L LP +N LI S C A A T + L++
Sbjct: 354 C--FVKNYETLAPAITLHFTDLD-LKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNV 410
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTP 246
I N QQQ RV F+ N+ + P
Sbjct: 411 IANYQQQNLRVLFDTVNNKGWYCP 434
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 96/261 (36%), Positives = 128/261 (49%), Gaps = 24/261 (9%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + TET+TL SV + GCG +G F GLLGLGG S SQ + FS
Sbjct: 224 GVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFS 283
Query: 57 YCLVDRDSDSTSTLEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
YCL +S +T L + N PL E TFY + LTG+SVGG L I
Sbjct: 284 YCLPPGNS-TTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIP 342
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYD 170
T +GG+I+DSGT +T L Y+ALR AF A L P + + DTCY+
Sbjct: 343 PTVL------SGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYN 396
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQ 228
F+ ++V VPTV+ F G + L + ++ D C AFA +S + IIGNV Q
Sbjct: 397 FTGIANVTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFAGGASDGDVGIIGNVNQ 451
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ V ++ +GF P C
Sbjct: 452 RTFEVLYDSGRGHVGFRPGAC 472
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 76/235 (32%), Positives = 108/235 (45%), Gaps = 20/235 (8%)
Query: 32 GAAGLLGLGGGSLSFPSQINASTFSYCLVD--RDSDSTSTLEFDSSLPPNAVTAPLL--- 86
GA+GL+GLG G LS SQ A FSYCL ++ ++S L ++ + ++
Sbjct: 209 GASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGGGAVMSMA 268
Query: 87 -----RNHELDTFYYLGLTGISVGGDLLPISETAFKIDES----GNGGIIVDSGTAVTRL 137
+++ TFYYL L GI+VG L I TAF + E GG+I+DSG+ T L
Sbjct: 269 FVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSL 328
Query: 138 QTETYNALRDAFVR---GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPL 194
+ Y L R G+ P + C V VPT+ HF G + L
Sbjct: 329 VEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRV-VPTLVLHFSGGADMAL 387
Query: 195 PAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
P +NY P++ + T C A SIIGN QQQ + F++ + F C
Sbjct: 388 PPENYWAPLEKS-TACMAIV-RGYLQSIIGNFQQQNMHILFDVGGGRLSFQNADC 440
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 121/291 (41%), Gaps = 59/291 (20%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST------FSYCL 59
+T++L + + N GC H F G+ G G G LS P+Q+ + FSYCL
Sbjct: 192 DTLSLSTLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCL 248
Query: 60 V----------------------DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYL 97
V ++ S+ +EF V +L N + FY +
Sbjct: 249 VSHSFRSERIRKPSPLILGRYNDEKQSNGDEVVEF--------VYTSMLENPKHSYFYTV 300
Query: 98 GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GT 153
GL GISVG +P + ++++ G+GG++VDSGT T L + YN++ + F R
Sbjct: 301 GLKGISVGKKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSN 360
Query: 154 RALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---------- 203
R + CY ++ + V T+ F V+ LP KNY
Sbjct: 361 RRAPEIEQKTGLSPCYYLNTAAIVPAVTLRFVGMNSSVV-LPRKNYFYEFMDGGDGVRRK 419
Query: 204 DSNGTFCFAFAPTSSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ G F + +S ++GN QQQG V ++L +GF KC
Sbjct: 420 ERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 70/212 (33%), Positives = 106/212 (50%), Gaps = 16/212 (7%)
Query: 44 LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
LS + +TFSYCL S + + TL + P + T PLL N + YY+ +TG
Sbjct: 246 LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTG 305
Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
I VG ++PI D + G ++DSGT TRL Y A+RD R R +P
Sbjct: 306 IRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRR--RVGAPVSS 359
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----S 217
+ FDTC++ ++V P V+ F +G + LP +N +I C A A +
Sbjct: 360 LGGFDTCFN---TTAVAWPPVTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN 415
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ L++I ++QQQ RV F++ N +GF +C
Sbjct: 416 TVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 404
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 78/238 (32%), Positives = 102/238 (42%), Gaps = 20/238 (8%)
Query: 21 GCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSD---STSTLEFD 73
GC H+ G F G +G + LGGG S SQ ++ FSYC+ + S
Sbjct: 178 GCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTASAYGDAFSYCVPQPSASGFLSLGGAIGS 237
Query: 74 SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTA 133
S + PL+ TFY + L GI V G L + F + G ++DS
Sbjct: 238 SGSGSGFASTPLVATAN-PTFYVVRLQGIDVAGRRLNVPPAVF------SAGTLMDSSAV 290
Query: 134 VTRLQTETYNALRDAFVRGTRALS--PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKV 191
VT+L Y ALR AF R P G + DTCYDF +V VP VS F G V
Sbjct: 291 VTQLPPTAYRALRRAFRNAMRRYRRVPAGGKQILDTCYDFEGLGNVTVPAVSLVFSGGAV 350
Query: 192 LPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ L ++ G F P S L IGNVQQQ V +++ +GF C
Sbjct: 351 VRLEPMAVMM----EGCLAFVPTPADSDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 404
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 128/261 (49%), Gaps = 23/261 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----F 55
GD E +T+GS+SV ++ IGCGH + G F A+G++GLGGG LS SQ++ ++ F
Sbjct: 180 GDLGFEKITIGSSSVKSV-IGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 238
Query: 56 SYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
SYCL S + + F + P V+ PL+ + + T+YY+ L IS+G +
Sbjct: 239 SYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPV-TYYYVTLEAISIGNER---- 293
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-- 170
+ + G +I+DSGT ++ L E Y+ + + ++ +A D +D C+D
Sbjct: 294 ----HMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 349
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQ 228
+ +S +P ++ F G + L N V +N C P S + IIGN+
Sbjct: 350 INVATSSGIPIITAQFSGGANVNLLPVNTFQKV-ANNVNCLTLTPASPTDEFGIIGNLAL 408
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ ++L + F P C
Sbjct: 409 ANFLIGYDLEAKRLSFKPTVC 429
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 121/271 (44%), Gaps = 27/271 (9%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQI------- 50
G V +T+TL SA+ GC + F GA GL+ L S S S++
Sbjct: 170 GTLVRDTLTLPPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATT 229
Query: 51 NASTFSYCLVDRDSDSTST-LEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVG 105
+A+ FSYCL + S+ L +S P + AP+ N Y++ L GISVG
Sbjct: 230 SAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVG 289
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G+ LP+ F G ++++ T T L Y ALRDAF + +
Sbjct: 290 GEDLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVL 344
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC-------FAFAPTSS 218
DTCY+ + +S+ VP V+ F G L L + + D + F A +
Sbjct: 345 DTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAF 404
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S+IG + Q+ T V ++LR +GF P +C
Sbjct: 405 PVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 121/271 (44%), Gaps = 27/271 (9%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQI------- 50
G V +T+TL SA+ GC + F GA GL+ L S S S++
Sbjct: 258 GTLVRDTLTLPPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATT 317
Query: 51 NASTFSYCLVDRDSDSTST-LEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVG 105
+A+ FSYCL + S+ L +S P + AP+ N Y++ L GISVG
Sbjct: 318 SAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVG 377
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G+ LP+ F G ++++ T T L Y ALRDAF + +
Sbjct: 378 GEDLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVL 432
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC-------FAFAPTSS 218
DTCY+ + +S+ VP V+ F G L L + + D + F A +
Sbjct: 433 DTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAF 492
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S+IG + Q+ T V ++LR +GF P +C
Sbjct: 493 PVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 523
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 126/271 (46%), Gaps = 28/271 (10%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVG--AAGLLGLGGGSLSFPSQI------N 51
G V +T+TL SA+ +N A+GC + LF A G + L S +++
Sbjct: 228 GTIVMDTLTLSPSATFENFAVGCMQLDNDLFTDGVAVGNIDLSLSRHSLATRVLNSSPPG 287
Query: 52 ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGG 106
+ FSYCL D+D+ L +L + A PL+ N FYY+ L I++ G
Sbjct: 288 MAAFSYCL-PADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPNFYYVDLVAIAING 346
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+ LPI F +GNG +I DS +A T L Y ALRD F + P D
Sbjct: 347 EDLPIPPALF----TGNGTMI-DSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGLD 401
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN-------GTFCFAFAPTSS- 218
TCY+F+ ++ +P ++ F G+ + L + ++ + G FA AP +
Sbjct: 402 TCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNF 461
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +G+ Q+ + +++R ++ F P++C
Sbjct: 462 PWNYLGSQVQRTKEIVYDVRGGMVAFVPSRC 492
>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
Length = 216
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 95/206 (46%), Gaps = 16/206 (7%)
Query: 55 FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCL S + F SL P N PLL N + YY+ +TG+SVG
Sbjct: 15 FSYCL-----PSYRSYYFSGSLRLGAAGQPRNVRHTPLLTNPHRPSLYYVNVTGLSVGRT 69
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+ + +F D + G ++DSGT +TR Y ALR+ F R A S + FDT
Sbjct: 70 WVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDT 129
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSII 223
C++ ++ P V+ H G L LP +N LI + C A A ++++
Sbjct: 130 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 189
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
N+QQQ RV ++ S +GF C
Sbjct: 190 ANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
Length = 216
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 95/206 (46%), Gaps = 16/206 (7%)
Query: 55 FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCL S + F SL P N PLL N + YY+ +TG+SVG
Sbjct: 15 FSYCL-----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRT 69
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+ + +F D + G ++DSGT +TR Y ALR+ F R A S + FDT
Sbjct: 70 WVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDT 129
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSII 223
C++ ++ P V+ H G L LP +N LI + C A A ++++
Sbjct: 130 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 189
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
N+QQQ RV ++ S +GF C
Sbjct: 190 ANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 26/266 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINAS- 53
G ET+TL S + I GCGHNN G F G++GLGGG +S SQ+ +S
Sbjct: 160 GVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSF 219
Query: 54 ---TFSYCLVDRDSDST--STLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVG 105
FS CLV +D + S + F V+ PL+ + T Y++ L GISV
Sbjct: 220 GGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQD-KTPYFVTLLGISVE 278
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVAL 164
L + ++ +++ G + +DSGT T L T+ Y+ + A VR A+ P TD L
Sbjct: 279 NTYLHFNGSSQNVEK---GNMFLDSGTPPTILPTQLYDQVV-AQVRSEVAMKPVTDDPDL 334
Query: 165 F-DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
CY +++++ P ++ HF V P + ++ P D G FC F TSS +
Sbjct: 335 GPQLCY--RTKNNLRGPVLTAHFEGADVKLSPTQTFISPKD--GVFCLGFTNTSSDGGVY 390
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN Q + F+L ++ F P C
Sbjct: 391 GNFAQSNYLIGFDLDRQVVSFKPKDC 416
>gi|242044812|ref|XP_002460277.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
gi|241923654|gb|EER96798.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
Length = 369
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 89/172 (51%), Gaps = 10/172 (5%)
Query: 82 TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
T PLL N + YY+ +TGI VG ++PI A D + G ++DSGT TRL
Sbjct: 201 TTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPA 260
Query: 142 YNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
Y A+RD R R +P + FDTC++ ++V P V+ F +G + LP +N +I
Sbjct: 261 YVAVRDEVRR--RVGAPVSSLGGFDTCFNT---TAVAWPPVTLLF-DGMQVTLPEENVVI 314
Query: 202 PVDSNGTFCFAFAPT----SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C A A ++ L++I ++QQQ RV F++ N +GF +C
Sbjct: 315 HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 366
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 133/264 (50%), Gaps = 23/264 (8%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINA-- 52
G+ TE T+GS S + I GCG N G F +G++GLGGG+LS SQ+++
Sbjct: 185 GNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSII 244
Query: 53 -STFSYCLV--DRDSDSTSTLEF--DSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV S+ TS ++F DS + P V+ PL+ + + DT+YY+ L ISVG
Sbjct: 245 KGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLV-SKQPDTYYYVTLEAISVGN 303
Query: 107 DLLPISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
LP + + E GN +I+DSGT +T L +E + L +A +D LF
Sbjct: 304 KRLPYTNGLLNGNVEKGN--VIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLF 361
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
C F S +++P ++ HF + V P N + D + CF +S+ + I GN
Sbjct: 362 SVC--FRSAGDIDLPVIAVHFNDADVKLQPL-NTFVKADED-LLCFTMI-SSNQIGIFGN 416
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+ Q V ++L + F P C
Sbjct: 417 LAQMDFLVGYDLEKRTVSFKPTDC 440
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 117/267 (43%), Gaps = 21/267 (7%)
Query: 1 GDFVTETVTLG-----SASVDNIAIGCGH---NNEGLFVGAAGLLGLGGGSLSF---PSQ 49
G F T+++T+G ++N+ IGC N G+LGLG SF +
Sbjct: 189 GFFGTDSITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAAN 248
Query: 50 INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL---DTFYYLGLTGISVGG 106
+ FSYCLVD S + + NA +R EL FY + + GIS+GG
Sbjct: 249 KYGAKFSYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGG 308
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-- 164
+L I + D + GG ++DSGT +T L Y A+ +A + + G
Sbjct: 309 QMLKIPPQVW--DFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDA 366
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSI 222
+ C+D VP + FHF G P K+Y+I V + C P S+
Sbjct: 367 LEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV-APLVKCIGIVPIDGIGGASV 425
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGN+ QQ F+L + +GF P+ C
Sbjct: 426 IGNIMQQNHLWEFDLSTNTVGFAPSTC 452
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 117/261 (44%), Gaps = 29/261 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+ +G AS ++A GC N GL G L LG G FSYCL
Sbjct: 174 GYLATETLKVGDASFPSVAFGCSTEN-GL-----GQLDLGVGR-----------FSYCLR 216
Query: 61 DRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
+ S + F S N + P + N + ++YY+ LTGI+VG LP++ + F
Sbjct: 217 SGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTF 276
Query: 117 KIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD--FSS 173
++G GG IVDSGT +T L + Y ++ AF+ T ++ +G D C+
Sbjct: 277 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGG 336
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQ 228
+ VP++ F G +P + DS G+ C P +S+IGNV Q
Sbjct: 337 GGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQ 396
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ ++L + F P C
Sbjct: 397 MDMHLLYDLDGGIFSFAPADC 417
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 112/250 (44%), Gaps = 10/250 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G + + D + GC EG G++GLG G LS SQ+ FSY L
Sbjct: 193 GLLAVDAFAFATVRADGVIFGCAVATEG---DIGGVIGLGRGELSPVSQLQIGRFSYYLA 249
Query: 61 DRDS-DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
D+ D S + F P AV+ PL+ + + YY+ L GI V G+ L I F
Sbjct: 250 PDDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTF 309
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
+ G+GG+++ VT L Y +R A L DG L D CY S +
Sbjct: 310 DLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIE-LRAADGSELGLDLCYTSESLA 368
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVS 234
+ +VP+++ F G V+ L NY + G C P+ + S++G++ Q GT +
Sbjct: 369 TAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMI 428
Query: 235 FNLRNSLIGF 244
+++ S + F
Sbjct: 429 YDISGSRLVF 438
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/256 (33%), Positives = 124/256 (48%), Gaps = 14/256 (5%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
V +++ LG ++ + A GC ++ G + A GLLGLG G LS PSQ + + FSYCL
Sbjct: 200 LVQDSLRLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCL 259
Query: 60 VDRDSDSTS-TLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
S S +L+ + P + T PLL+N + YY+ LTG++VG +P+
Sbjct: 260 PSFQSSYFSGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLA 319
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
D + G I+DSGT +TR Y+A+RD F + P FDTC F
Sbjct: 320 FDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKG--PFFSRGGFDTC--FVKTYEN 375
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRV 233
P + F G + LP +N LI G C A A +S L++I N QQQ RV
Sbjct: 376 LTPLIKLRF-TGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRV 434
Query: 234 SFNLRNSLIGFTPNKC 249
F+ N+ +G C
Sbjct: 435 LFDTVNNRVGIARELC 450
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 112/249 (44%), Gaps = 23/249 (9%)
Query: 14 SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDS- 66
S+ NI GCGHNN G F GL G GG LS SQI ++ FS CLV +D
Sbjct: 92 SILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPS 151
Query: 67 -TSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
TS + F + V+ PL+ + T+Y++ L GISVG L P S ++ +
Sbjct: 152 ITSKIIFGPEAEVSGSDVVSTPLVTKDD-PTYYFVTLDGISVGDKLFPFSSSS---PMAT 207
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS--VEVP 180
G + +D+GT T L + YN L V+G + P + V D RS+ ++ P
Sbjct: 208 KGNVFIDAGTPPTLLPRDFYNRL----VQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGP 263
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
++ HF V P ++ P G +CFA P I GN Q + F+L
Sbjct: 264 ILTAHFDGADVQLKPLNTFISP--KEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGK 321
Query: 241 LIGFTPNKC 249
+ F C
Sbjct: 322 KVSFKAVDC 330
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 105/242 (43%), Gaps = 27/242 (11%)
Query: 34 AGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS-----TLEFDSS--LPPNAVTA--- 83
+G+ G G G S PSQ+N FSYCLV D T L+ S+ N ++
Sbjct: 230 SGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPF 289
Query: 84 ---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTE 140
P N +YYL L + VGG + I T + GNGG IVDSG+ T ++
Sbjct: 290 RSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERP 349
Query: 141 TYNALRDAFVRG-----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
YN + FV+ +RA L C++ S +V P ++F F G + P
Sbjct: 350 VYNLVAQEFVKQLEKNYSRAEDAETQSGL-SPCFNISGVKTVTFPELTFKFKGGAKMTQP 408
Query: 196 AKNYLIPVDSNGTFCF-------AFAPTSSSLSII-GNVQQQGTRVSFNLRNSLIGFTPN 247
+NY V C A P ++ +II GN QQQ + ++L N GF P
Sbjct: 409 LQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPR 468
Query: 248 KC 249
C
Sbjct: 469 SC 470
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 119/284 (41%), Gaps = 47/284 (16%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G TE T+G A GC + V AGLLG+ G+LSF SQ + FSY
Sbjct: 159 GALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSY 218
Query: 58 CLVDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPI 111
C+ DRD D+ L S LP P T L F Y + L GI VGG LPI
Sbjct: 219 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPI 277
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ D +G G +VDSGT T L + Y+AL+ F R T+ P AL D + F
Sbjct: 278 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLP----ALNDPNFAF 333
Query: 172 SSRSSVEVPTVSFHFPEGKVLP--LPAKN----------------YLIPVDS---NGTFC 210
E F P+G+ P LPA Y +P + +G +C
Sbjct: 334 Q-----EAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWC 388
Query: 211 FAF-----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
F P ++ +IG+ Q V ++L +G P +C
Sbjct: 389 LTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRVGLAPIRC 430
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 120/277 (43%), Gaps = 31/277 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G T+ +GS A GC ++ V +AGLLG+ G+LSF SQ + FSY
Sbjct: 177 GALATDVFAVGSGPPLRAAFGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSY 236
Query: 58 CLVDRDSDSTSTL---EFDSSLPPN--AVTAPLLRNHELDTFYY-LGLTGISVGGDLLPI 111
C+ DRD L + + LP N + P L D Y + L GI VGG LPI
Sbjct: 237 CISDRDDAGVLLLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPI 296
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT-DGVAL-----F 165
+ D +G G +VDSGT T L + Y+AL+ F R R L P D + F
Sbjct: 297 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAF 356
Query: 166 DTCYDF---SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAF---- 213
DTC+ S + +P V+ F G + + L V +G +C F
Sbjct: 357 DTCFRVPQGRSPPTARLPGVTLLF-NGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNAD 415
Query: 214 -APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
P + +IG+ Q V ++L +G P +C
Sbjct: 416 MVPIMA--YVIGHHHQMNVWVEYDLERGRVGLAPVRC 450
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 117/278 (42%), Gaps = 35/278 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++E + +V + +GC + AG+ G G G S PSQ+ +FS+CLV
Sbjct: 196 GILISEKLDFPDLTVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKLKSFSHCLV 252
Query: 61 DR---DSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-----FYYLGLTGISVG 105
R D++ T+ L D+ S P P +N + +YYL L I VG
Sbjct: 253 SRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVG 312
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSPT 159
+ I +GNGG IVDSG+ T ++ + + + F + L
Sbjct: 313 SKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKV 372
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
G+A C++ S + V VP + F F G + LP NY V + T C ++
Sbjct: 373 SGIA---PCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTV 429
Query: 219 -------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G+ QQQ V ++L N GF KC
Sbjct: 430 NPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 119/284 (41%), Gaps = 47/284 (16%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G TE T+G A GC + V AGLLG+ G+LSF SQ + FSY
Sbjct: 160 GALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSY 219
Query: 58 CLVDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPI 111
C+ DRD D+ L S LP P T L F Y + L GI VGG LPI
Sbjct: 220 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPI 278
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ D +G G +VDSGT T L + Y+AL+ F R T+ P AL D + F
Sbjct: 279 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLP----ALNDPNFAF 334
Query: 172 SSRSSVEVPTVSFHFPEGKVLP--LPAKN----------------YLIPVDS---NGTFC 210
E F P+G+ P LPA Y +P + +G +C
Sbjct: 335 Q-----EAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWC 389
Query: 211 FAF-----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
F P ++ +IG+ Q V ++L +G P +C
Sbjct: 390 LTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRVGLAPIRC 431
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 125/262 (47%), Gaps = 21/262 (8%)
Query: 1 GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSL-SFPSQINAS- 53
GD +TVT+GS+ S+ N+ IGCGH N G F A + GG S SQ+ S
Sbjct: 175 GDVAVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSI 234
Query: 54 --TFSYCLVDRDSDS--TSTLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCLV S++ TS + F ++ + + V + + + T+Y+L L ISVG
Sbjct: 235 NGKFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSK 294
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+ + T F +G G I++DSGT +T L + Y L +A D +
Sbjct: 295 KIQFTSTIFG---TGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSL 351
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
CY S SS +VP ++ HF G V L N + V S CFAFA + L+I GN+
Sbjct: 352 CYRDS--SSFKVPDITVHFKGGDV-KLGNLNTFVAV-SEDVSCFAFA-ANEQLTIFGNLA 406
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q V ++ + + F C
Sbjct: 407 QMNFLVGYDTVSGTVSFKKTDC 428
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 119/275 (43%), Gaps = 28/275 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G T+ +G A A GC +++ V AGLLG+ G+LSF +Q + FSY
Sbjct: 152 GALATDVFAVGDAPPLRSAFGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSY 211
Query: 58 CLVDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPI 111
C+ DRD D+ L S LP P T L F Y + L GI VGG LPI
Sbjct: 212 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPI 270
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT------DGVALF 165
+ D +G G +VDSGT T L + Y+A++ F++ T+ L P F
Sbjct: 271 PPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAF 330
Query: 166 DTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAPTS 217
DTC+ S +P V+ F G + + L V ++G +C F
Sbjct: 331 DTCFRVPKGRPPPSARLPPVTLLF-NGAQMSVAGDRLLYKVPGERRGADGVWCLTFGNAD 389
Query: 218 S---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +IG+ Q V ++L +G P KC
Sbjct: 390 MVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 424
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 82/262 (31%), Positives = 119/262 (45%), Gaps = 25/262 (9%)
Query: 3 FVTETVTLGSASVDNIAIGCGHN---NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
V ET G++ + ++ IGCGHN N G G+LGL G S +QI FSYC+
Sbjct: 190 LVFETTDEGTSQISDVIIGCGHNIGFNSD--PGYNGILGLNNGPNSLATQI-GRKFSYCI 246
Query: 60 --VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
+ + + L + P H FYY+ + GISVG L I+ F+
Sbjct: 247 GNLADPYYNYNQLRLGEGADLEGYSTPFEVYH---GFYYVTMEGISVGEKRLDIALETFE 303
Query: 118 IDESGNGGIIVDSGTAVTRL----QTETYNALRDAFVRGTRALSPTDGVALFDTC-YDFS 172
+ +G GG+I+DSGT +T L YN +R+ R + + A + C Y
Sbjct: 304 MKRNGTGGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFEN--APWKLCYYGII 361
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQ 227
SR V P V+FHF +G L L ++ D FC +P T+ S S+IG +
Sbjct: 362 SRDLVGFPVVTFHFVDGADLALDTGSFFSQRDD--IFCMTVSPASILNTTISPSVIGLLA 419
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ V ++L N + F C
Sbjct: 420 QQSYNVGYDLVNQFVYFQRIDC 441
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 124/264 (46%), Gaps = 18/264 (6%)
Query: 1 GDFVTETVTLGS---ASVD--NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
GD ET+TLGS +SV N IGCGHNN+G F G + GG +S
Sbjct: 187 GDLSVETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSI 246
Query: 54 --TFSYCLVDR--DSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL S+S+S L F D+++ AV+ PL+ + FYYL L SVG
Sbjct: 247 GGKFSYCLAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGD 306
Query: 107 DLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
+ + ++ +G G II+DSGT +T L E Y+ L A +A +D
Sbjct: 307 KRIEFVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFL 366
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
CY + ++VP ++ HF V P ++ + G CFAF +S +SI GN
Sbjct: 367 SLCYQTTPSGQLDVPVITAHFKGADVELNPISTFVQVAE--GVVCFAFH-SSEVVSIFGN 423
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+ Q V ++L + F P C
Sbjct: 424 LAQLNLLVGYDLMEQTVSFKPTDC 447
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/248 (35%), Positives = 123/248 (49%), Gaps = 22/248 (8%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
+ET TLGS +V I GC +EG + +GL+GLG G LS SQ+N FSYCL D+
Sbjct: 179 SETFTLGSDAVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTS-DA 237
Query: 65 DSTSTLEFDSSLPPNA--VTAPLLRNHELDTFYY-LGLTGISVGGDLLPISETAFKIDES 121
TS L F S A + PLLR T+YY + L IS+G A +
Sbjct: 238 AKTSPLLFGSGALTGAGVQSTPLLRT---STYYYTVNLESISIG---------AATTAGT 285
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
G+ GII DSGT V L Y ++A + T L+ G ++ C+ S P+
Sbjct: 286 GSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQ---TSGAVFPS 342
Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
+ HF +G + LP +NY VD + C+ S SLSI+GN+ Q + +++ S+
Sbjct: 343 MVLHF-DGGDMDLPTENYFGAVD-DSVSCW-IVQKSPSLSIVGNIMQMNYHIRYDVEKSM 399
Query: 242 IGFTPNKC 249
+ F P C
Sbjct: 400 LSFQPANC 407
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/264 (33%), Positives = 114/264 (43%), Gaps = 25/264 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
V +TVTL + V GC G + GLLGLG G LS +Q STFSY
Sbjct: 190 ASLVQDTVTLATDPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSY 249
Query: 58 CLVDRDSDSTSTLEFDSSL------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
CL S TL F P P +N + YY+ L I VG ++ I
Sbjct: 250 CL-----PSFKTLNFSGHXDLXPVAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDI 304
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCY 169
A + G + DSGT TRL Y A+R+ F R +L FDTCY
Sbjct: 305 PPEALAFNPXTGAGTVFDSGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCY 364
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
+ PT++F F G + LP N LI + C A AP +S L++I N
Sbjct: 365 TV----PIVAPTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIAN 419
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ RV F++ NS +G C
Sbjct: 420 MQQQNHRVLFDVPNSRLGVARELC 443
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 106/212 (50%), Gaps = 16/212 (7%)
Query: 44 LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
LS + +TFSYCL S + + TL + P + T PLL N + YY+ +TG
Sbjct: 246 LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTG 305
Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
+ VG ++PI D + G ++DSGT TRL Y A+RD R R +P
Sbjct: 306 VRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRR--RVGAPVSS 359
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----S 217
+ FDTC++ ++V P ++ F +G + LP +N +I C A A +
Sbjct: 360 LGGFDTCFN---TTAVAWPPMTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN 415
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ L++I ++QQQ RV F++ N +GF +C
Sbjct: 416 TVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 114/260 (43%), Gaps = 16/260 (6%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFP-SQIN-----AST 54
GD ++ +T+GS + IGCGH N G F G + GG SQ+
Sbjct: 179 GDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPR 238
Query: 55 FSYCLVD--RDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
FSYCL +++ T T+ F V+ PL+ DTFY+L L ISVG
Sbjct: 239 FSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSP-DTFYFLTLEAISVGKKRF 297
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ + GN II+DSGT +T L Y + R +A D + + CY
Sbjct: 298 KAANGISAMTNHGN--IIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCY 355
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
+ +P ++ HF G + L N PV N T C FAP ++ ++I GN+ Q
Sbjct: 356 SAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVT-CLTFAP-ATQVAIFGNLAQI 413
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V ++L N + F P C
Sbjct: 414 NFEVGYDLGNKRLSFEPKLC 433
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 116/267 (43%), Gaps = 21/267 (7%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEG---LFVGAAGLLGLGGGSLSFPSQIN- 51
G F T+T+T+ ++N+ IGC + E G+LGLG SF +
Sbjct: 243 GFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAY 302
Query: 52 --ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL---DTFYYLGLTGISVGG 106
+ FSYCLVD S + NA ++ EL FY + + GIS+GG
Sbjct: 303 EYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKRTELILFPPFYGVNVVGISIGG 362
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VAL 164
+L I + D + GG ++DSGT +T L Y + +A ++ + G
Sbjct: 363 QMLKIPPQVW--DFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFGA 420
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSI 222
D C+D VP + FHF G P K+Y+I V + C P S+
Sbjct: 421 LDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV-APLVKCIGIVPIDGIGGASV 479
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IGN+ QQ F+L + IGF P+ C
Sbjct: 480 IGNIMQQNHLWEFDLSTNTIGFAPSIC 506
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 118/258 (45%), Gaps = 36/258 (13%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
SA++ ++ GCGH+N G + G+LGLG G S + + FSYC D
Sbjct: 191 SAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRF-GTKFSYCFGSLD-------- 241
Query: 72 FDSSLPPNAV------------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF-KI 118
D S P N + T PL + FYY+ + ISV G +LPI F +
Sbjct: 242 -DPSYPHNVLVLGDDGANILGDTTPL---EIYNGFYYVTIEAISVDGIILPIDPWVFNRN 297
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGVALFDT-CYDFS-S 173
++G GG I+D+G ++T L E Y L++ + G + + +F CY+ +
Sbjct: 298 HQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLE 357
Query: 174 RSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
R VE P V+FHF +G L L K+ + + N FC A P +++ IG QQ
Sbjct: 358 RDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPN-VFCLAVTP--GNMNSIGATAQQSY 414
Query: 232 RVSFNLRNSLIGFTPNKC 249
+ ++L I F C
Sbjct: 415 NIGYDLEAKKISFERIDC 432
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/276 (30%), Positives = 124/276 (44%), Gaps = 29/276 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ T+T +GS+ + N+ GC + N GL+G+ GSLSF SQ+ FS
Sbjct: 165 GNLATDTFYIGSSGIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFS 224
Query: 57 YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
YC+ + D L F S L P T + + L F Y + L GI V LL
Sbjct: 225 YCISEYDFSGLLLLGDANF-SWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLL 283
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
PI E+ F+ D +G G +VDSGT T L Y ALRD F+ T R ++ V
Sbjct: 284 PIPESVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQG 343
Query: 164 LFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGT---FCFAFAPT 216
D CY + + +P+V+ F G + + Y +P + G CF F +
Sbjct: 344 AMDLCYRVPTNQTRLPPLPSVTLVF-RGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNS 402
Query: 217 S---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IG++ QQ + F+L+ S IG +C
Sbjct: 403 DLLGVEAFVIGHLHQQNVWMEFDLKKSRIGLAEIRC 438
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 76/249 (30%), Positives = 122/249 (48%), Gaps = 17/249 (6%)
Query: 12 SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
A + + +GC + +G F + G+L LG ++SF S+ A FSYCLVD +
Sbjct: 229 KAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 288
Query: 66 STSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
+TS L F ++ P+ PLL + ++ FY + + +SV G L I + + +
Sbjct: 289 ATSYLTFGPVGAAHSPS--RTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKK-- 344
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SSRSSVEVPT 181
NGG I+DSGT++T L T Y A+ A + A P + F+ CY++ ++R VP
Sbjct: 345 NGGAILDSGTSLTILATPAYKAVVAALSKQL-ARVPRVTMDPFEYCYNWTATRRPPAVPR 403
Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNS 240
+ F L P K+Y+I + G C +S+IGN+ QQ F+L N
Sbjct: 404 LEVRFAGSARLRPPTKSYVIDA-APGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDLANR 462
Query: 241 LIGFTPNKC 249
+ F ++C
Sbjct: 463 WLRFQESRC 471
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 87/272 (31%), Positives = 113/272 (41%), Gaps = 43/272 (15%)
Query: 14 SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCLVDRDSDST 67
+V N GC H G VG AG G G LS PSQ+ + FSYCLV S +
Sbjct: 205 NVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLATFSPQLGNRFSYCLVSH-SFAA 260
Query: 68 STLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
+ S L + LL N + FY +GL GISVG +P E K+D
Sbjct: 261 DRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNIRIPAPEFLTKVD 320
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-----RALSPTDGVALFDTCYDFSSR 174
E G+GG++VDSGT T L Y ++ F T RA + L CY +
Sbjct: 321 EGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGL-SPCYYY--E 377
Query: 175 SSVEVPTVSFHFP-EGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI----------- 222
+SV VP V HF E + LP KNY G L +
Sbjct: 378 NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAG 437
Query: 223 -----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+GN QQQG V ++L + +GF +C
Sbjct: 438 GPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 469
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 83/248 (33%), Positives = 116/248 (46%), Gaps = 27/248 (10%)
Query: 16 DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS--DSTSTL 70
D GCG +G + GL+GLG S S Q+ FSYCLV DS + S L
Sbjct: 120 DGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179
Query: 71 EFDSSLP---PNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNG-- 124
SS + V+ P+L LD T YY+ L I+VGG +P+ ESG+
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGG--VPV---VVYDKESGHNTS 234
Query: 125 -------GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSS 176
++DSGT T L Y A+R + + + PT G A D C++ S +S
Sbjct: 235 VGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEE--QVILPTLGNSAGLDLCFNSSGDTS 292
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
P+V+F+F L LP +N + V S C + + LSIIGN+QQQ + ++
Sbjct: 293 YGFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351
Query: 237 LRNSLIGF 244
L S I F
Sbjct: 352 LVASQISF 359
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/245 (33%), Positives = 110/245 (44%), Gaps = 32/245 (13%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
G + + +TLG V GC H + G AG L LGGGS S Q
Sbjct: 156 GTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRV 215
Query: 55 FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL + S+L F + L P+ V+ PLL + TFY + L I V G
Sbjct: 216 FSYCL----PPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAG 271
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
L + F ++DS T ++RL Y ALR AF V++ D
Sbjct: 272 RPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPVSILD 325
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIG 224
TCYDF+ S+ +P+++ F G + L A L+ G+ C AFAPT+S IG
Sbjct: 326 TCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLAFAPTASDRMPGFIG 379
Query: 225 NVQQQ 229
NVQQ+
Sbjct: 380 NVQQK 384
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 96/206 (46%), Gaps = 27/206 (13%)
Query: 55 FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVG 105
FSYC+ S S+L F ++L P V+ PLL + + TFY + L I V
Sbjct: 440 FSYCI----PPSPSSLGFITLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVA 495
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G LP+ T F ++ S T ++RL Y ALR AF R V++
Sbjct: 496 GRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAFRRAMTMYRTAPPVSIL 549
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SII 223
DTCYDF+ S+ +P+++ F G + L A L+ G C AFAPT++ I
Sbjct: 550 DTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL----QG--CLAFAPTATDRMPGFI 603
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQ+ V +++ I F C
Sbjct: 604 GNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 127/269 (47%), Gaps = 27/269 (10%)
Query: 1 GDFVTETVTLG-----SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPS---QIN 51
G F ET+T+G A + IGC + G F GA G+LGL SF S +
Sbjct: 177 GVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLY 236
Query: 52 ASTFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISV 104
+ FSYCLVD S+ ++ L F SS + R LD FY + + GIS+
Sbjct: 237 GAKFSYCLVDHLSNKNVSNYLIFGSS---RSTKTAFRRTTPLDLTRIPPFYAINVIGISL 293
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGV 162
G D+L I + D + GG I+DSGT++T L Y + R L +GV
Sbjct: 294 GYDMLDIPSQVW--DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGV 351
Query: 163 ALFDTCYDFSSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
+ + C+ F+S +V ++P ++FH G K+YL+ + G C F + +
Sbjct: 352 PI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDA-APGVKCLGFVSAGTPAT 409
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++IGN+ QQ F+L S + F P+ C
Sbjct: 410 NVIGNIMQQNYLWEFDLMASTLSFAPSAC 438
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 127/262 (48%), Gaps = 21/262 (8%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS- 53
G+ +T+TLGS S + IGCGHNN G F + GG +S SQ+ ++
Sbjct: 183 GNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTI 242
Query: 54 --TFSYCLVDRDSDST--STLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCLV S++T S L F S+ + V + L + + DTFY+L L +SVG +
Sbjct: 243 DGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSE 302
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+ ++F E G II+DSGT +T + ++ L A D +
Sbjct: 303 RIKFPGSSFGTSE---GNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSL 359
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
CY S + ++ P+++ HF +G + L N + V S+ CFAF P +S +I GN+
Sbjct: 360 CY--SIDADLKFPSITAHF-DGADVKLNPLNTFVQV-SDTVLCFAFNPINSG-AIFGNLA 414
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q V ++L + F P C
Sbjct: 415 QMNFLVGYDLEGKTVSFKPTDC 436
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 122/265 (46%), Gaps = 23/265 (8%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
G+ ET+T+ S S A GC H + G+F ++G++GLG LS SQ+ ++
Sbjct: 181 GNLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTI 240
Query: 55 ---FSYCL--VDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYL-GLTGISVG 105
FSYCL V DS +S + F S V+ PL+ DT+YYL L G SVG
Sbjct: 241 NGRFSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGP-DTYYYLITLEGFSVG 299
Query: 106 GDLLPISETAF-KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
L S F K E G IIVDSGT T L E Y L ++ + D +
Sbjct: 300 KKRL--SYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGI 357
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
CY+ ++ ++ P ++ HF + V P +L + CF PT S + I+G
Sbjct: 358 SSLCYN-TTVDQIDAPIITAHFKDANVELQPWNTFLRMQED--LVCFTVLPT-SDIGILG 413
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N+ Q V F+LR + F C
Sbjct: 414 NLAQVNFLVGFDLRKKRVSFKAADC 438
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 70/238 (29%), Positives = 107/238 (44%), Gaps = 25/238 (10%)
Query: 35 GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTL--------EFDSSLPPNAVT-APL 85
G+ G G G S P+Q+ + FSYCLV D T + N V AP
Sbjct: 207 GIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPF 266
Query: 86 LRNHELD---TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
++ L +YY+ L+ I VGG +PI + G+GG+IVDSG+ T ++ +
Sbjct: 267 TKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIF 326
Query: 143 N----ALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
+ L + RA D L CY+ + +S V+VP ++F F G + LP +
Sbjct: 327 DPVARELEKHMTKYKRAKEIEDSSGL-GPCYNITGQSEVDVPKLTFSFKGGANMDLPLTD 385
Query: 199 YLIPVDSNGTFCFAF-------APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
Y V ++G C T+ I+GN QQQ + ++L+ GF P +C
Sbjct: 386 YFSLV-TDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDLKKQRFGFKPQQC 442
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/245 (33%), Positives = 110/245 (44%), Gaps = 32/245 (13%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
G + + +TLG V GC H + G AG L LGGGS S Q
Sbjct: 247 GTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRV 306
Query: 55 FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL + S+L F + L P+ V+ PLL + TFY + L I V G
Sbjct: 307 FSYCL----PPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAG 362
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
L + F ++DS T ++RL Y ALR AF V++ D
Sbjct: 363 RPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPVSILD 416
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIG 224
TCYDF+ S+ +P+++ F G + L A L+ G+ C AFAPT+S IG
Sbjct: 417 TCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLAFAPTASDRMPGFIG 470
Query: 225 NVQQQ 229
NVQQ+
Sbjct: 471 NVQQK 475
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 96/206 (46%), Gaps = 27/206 (13%)
Query: 55 FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVG 105
FSYC+ S S+L F ++L P V+ PLL + + TFY + L I V
Sbjct: 531 FSYCI----PPSPSSLGFITLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVA 586
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G LP+ T F ++ S T ++RL Y ALR AF R V++
Sbjct: 587 GRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAFRRAMTMYRTAPPVSIL 640
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SII 223
DTCYDF+ S+ +P+++ F G + L A L+ G C AFAPT++ I
Sbjct: 641 DTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL----QG--CLAFAPTATDRMPGFI 694
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNVQQ+ V +++ I F C
Sbjct: 695 GNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/298 (27%), Positives = 120/298 (40%), Gaps = 62/298 (20%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------AST 54
+ +T++L S + N GC H G+ G G G LS P+Q++ +
Sbjct: 185 ANLYQQTLSLSSLHLQNFTFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNR 241
Query: 55 FSYCLVD-----------------RDSDSTS------TLEFDSSLPPNAVTAPLLRNHEL 91
FSYCLV R +D+ + ++EF V +L N +
Sbjct: 242 FSYCLVSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEF--------VYTSMLSNPKH 293
Query: 92 DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF-- 149
+Y +GL GISVG +P E ++DE GNGG++VDSGT T L YNA+ + F
Sbjct: 294 PYYYCVGLAGISVGKRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDK 353
Query: 150 --VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP-EGKVLPLPAKNYLIPVDSN 206
R + S + CY + S ++P + HF + LP KNY
Sbjct: 354 RVNRFHKRASEIETKTGLGPCYYLNGLS--QIPVLKLHFVGNNSDVVLPRKNYFYEFMDG 411
Query: 207 G--------TFCFAFAPTSSSLSI-------IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
G C + +GN QQQG V ++L +GF +C
Sbjct: 412 GDGIRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 127/269 (47%), Gaps = 27/269 (10%)
Query: 1 GDFVTETVTLG-----SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPS---QIN 51
G F ET+T+G A + IGC + G F GA G+LGL SF S +
Sbjct: 199 GVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLY 258
Query: 52 ASTFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISV 104
+ FSYCLVD S+ ++ L F SS + R LD FY + + GIS+
Sbjct: 259 GAKFSYCLVDHLSNKNVSNYLIFGSS---RSTKTAFRRTTPLDLTRIPPFYAINVIGISL 315
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGV 162
G D+L I + D + GG I+DSGT++T L Y + R L +GV
Sbjct: 316 GYDMLDIPSQVW--DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGV 373
Query: 163 ALFDTCYDFSSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
+ + C+ F+S +V ++P ++FH G K+YL+ + G C F + +
Sbjct: 374 PI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDA-APGVKCLGFVSAGTPAT 431
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++IGN+ QQ F+L S + F P+ C
Sbjct: 432 NVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 112/274 (40%), Gaps = 31/274 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++ET+ + N +GC + +G+ G G GS S PSQ+ F+YCL
Sbjct: 189 GLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLA 245
Query: 61 DR---DSDSTSTLEFDSS-LPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLP 110
R DS + L DS+ + + +T P + N+ +YYL + I VG +
Sbjct: 246 SRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVK 305
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVAL-- 164
+ GNGG I+DSG+ T + + F + TRA TD L
Sbjct: 306 VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRA---TDVETLTG 362
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA---------P 215
C+D S SV+ P + F F G LP NY V S+G C
Sbjct: 363 LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGG 422
Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G QQQ V ++L N +GF C
Sbjct: 423 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 127/267 (47%), Gaps = 29/267 (10%)
Query: 1 GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAST 54
G+ +TVTL G IGCG N G F +G++GLGGG +S SQ+ +S
Sbjct: 182 GNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSV 241
Query: 55 ---FSYCLVDRDSDS---TSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCLV S+S +S L F ++ + + V + L + DTFYYL L +SVG
Sbjct: 242 GGKFSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGD 301
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQ----TETYNALRDAFVRGTRALSPTDGV 162
+ ++F E II+DSGT++T TE A+ +A + G R D
Sbjct: 302 KKIEFGGSSFGGSEG---NIIIDSGTSLTLFPVNFFTEFATAVENAVINGERT---QDAS 355
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
L CY ++VP ++ HF V+ L N I + S+ C AF T S +I
Sbjct: 356 GLLSHCY--RPTPDLKVPVITAHFNGADVV-LQTLNTFILI-SDDVLCLAFNSTQSG-AI 410
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNV Q + ++++ + F P C
Sbjct: 411 FGNVAQMNFLIGYDIQGKSVSFKPTDC 437
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 112/274 (40%), Gaps = 31/274 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++ET+ + N +GC + +G+ G G GS S PSQ+ F+YCL
Sbjct: 189 GLLLSETLDFPDKXIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLA 245
Query: 61 DR---DSDSTSTLEFDSS-LPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLP 110
R DS + L DS+ + + +T P + N+ +YYL + I VG +
Sbjct: 246 SRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVK 305
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVAL-- 164
+ GNGG I+DSG+ T + + F + TRA TD L
Sbjct: 306 VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRA---TDVETLTG 362
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA---------P 215
C+D S SV+ P + F F G LP NY V S+G C
Sbjct: 363 LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGG 422
Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G QQQ V ++L N +GF C
Sbjct: 423 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
Length = 274
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 102/252 (40%), Gaps = 60/252 (23%)
Query: 11 GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYC---LVDRDSDS 66
G + + GCGH N+G+F G+ G G G S PSQ+N ++FSYC + D S S
Sbjct: 63 GGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSS 122
Query: 67 TSTL---------EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
TL ++ + T L++N + Y++ L GISVGG + + E+ +
Sbjct: 123 VVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLR 182
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
I+DSG ++T L + Y A++ FV
Sbjct: 183 ------SSTIIDSGASITTLPEDVYEAVKAEFVS-------------------------- 210
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
LP NY+ + C + +IGN QQQ T V ++L
Sbjct: 211 ---------------QLPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDL 255
Query: 238 RNSLIGFTPNKC 249
N ++ F P +C
Sbjct: 256 ENDVLSFAPARC 267
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 112/239 (46%), Gaps = 14/239 (5%)
Query: 13 ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-FSYCLVDRDSDSTSTLE 71
A V + GCGH+ L GLLGLG S S +Q FSYCL +S L
Sbjct: 219 AIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSKP-GFLA 277
Query: 72 FDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDS 130
F + P+ V P+ R TF + L GI+VGG L + +AF +GG+IVDS
Sbjct: 278 FGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAF------SGGMIVDS 331
Query: 131 GTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK 190
GT VT LQ+ Y ALR AF +A G DTCYD + +V VP ++ F G
Sbjct: 332 GTVVTVLQSTVYRALRAAFREAMKAYRLVHGD--LDTCYDLTGYKNVVVPKIALTFSGGA 389
Query: 191 VLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ L N ++ NG FA + ++GNV Q+ V F+ S GF C
Sbjct: 390 TINLDVPNGIL---VNGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 83/255 (32%), Positives = 120/255 (47%), Gaps = 40/255 (15%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
SA++ ++ GCGH+N G + G+LGLG G S + FSYC D
Sbjct: 191 SAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRF-GKKFSYCFGSLD-------- 241
Query: 72 FDSSLPPNAV------------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF-KI 118
D S P N + T PL + + FYY+ + ISV G +LPI F +
Sbjct: 242 -DPSYPHNVLVLGDDGANILGDTTPLEIH---NGFYYVTIEAISVDGIILPIDPRVFNRN 297
Query: 119 DESGNGGIIVDSGTAVTRLQTETY----NALRDAFV-RGTRA-LSPTDGVALFDTCYDFS 172
++G GG I+D+G ++T L E Y N + D F R T A +S D + + CY+ +
Sbjct: 298 HQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKM--ECYNGN 355
Query: 173 -SRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
R VE P V+FHF EG L L K+ + + N FC A P +L+ IG QQ
Sbjct: 356 FERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPN-VFCLAVTP--GNLNSIGATAQQ 412
Query: 230 GTRVSFNLRNSLIGF 244
+ ++L + F
Sbjct: 413 SYNIGYDLEAMEVSF 427
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 127/277 (45%), Gaps = 31/277 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++T +G++ + GC + N GL+G+ GSLSF SQ++ FS
Sbjct: 174 GNLASDTFYIGNSDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFS 233
Query: 57 YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGDL 108
YC+ D D L F +P N PL++ + L F Y + L GI V L
Sbjct: 234 YCISDSDFSGVLLLGDANFSWLMPLNY--TPLIQISTPLPYFDRVAYTVQLEGIKVSSKL 291
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVAL 164
LP+ ++ F D +G G +VDSGT T L Y+ALR+ F+ T R L + V
Sbjct: 292 LPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQ 351
Query: 165 --FDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGT---FCFAFAP 215
D CY S S +PTVS F G + + Y +P + G+ +CF F
Sbjct: 352 GGMDLCYRVPLSQTSLPWLPTVSLMF-RGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGN 410
Query: 216 T---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ + +IG+ QQ + F+L S IGF +C
Sbjct: 411 SDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 447
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 72/237 (30%), Positives = 98/237 (41%), Gaps = 23/237 (9%)
Query: 35 GLLGLGGGSLSFPSQINASTFSYCLVDR---DSDSTSTLEFDS------SLPPNAVTAPL 85
G+ G G S PSQ+ FSYCLV D+ ++S L D+ + P P
Sbjct: 232 GIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPF 291
Query: 86 LRN--HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
+N +YY+ L I +G + + GNGG IVDSGT T ++ Y
Sbjct: 292 QKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYE 351
Query: 144 ALRDAFVRGTRALSPTDGVAL---FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
+ F + + V C++ S SV VP FHF G + LP NY
Sbjct: 352 LVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYF 411
Query: 201 IPVDSNGTFCFAFAPTSSSLS--------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
VDS G C + S S I+GN QQ+ V F+L+N GF C
Sbjct: 412 SFVDS-GVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 99/281 (35%), Positives = 139/281 (49%), Gaps = 36/281 (12%)
Query: 1 GDFVTETVTLGSASVD--NIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN---AST 54
G ++TVT+G+ASV N+A GCG N G F +G++GLGGG+LSF SQ+
Sbjct: 170 GYLASDTVTVGNASVQIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKK 229
Query: 55 FSYCLVD---------RDSDSTSTLEFD-----SSLPPNAV---TAPLLRNHELDTFYYL 97
FSYCL+ DS +TS + F SS N V T PL+ N E T+YYL
Sbjct: 230 FSYCLLPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLV-NKEPSTYYYL 288
Query: 98 GLTGISVGGDLLPISETAFKID--ESGN------GGIIVDSGTAVTRLQTETYNALRDAF 149
+ I+VG L S ++ K +SG+ G II+DSGT +T L+ E Y AL A
Sbjct: 289 TIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAAL 348
Query: 150 VRGTRALSPTD-GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
V + D ++F C+ S + VE+P + HF G + L N + + G
Sbjct: 349 VEEIKMERVNDVKNSMFSLCFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAE-EGL 406
Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
CF PT + + I GN+ Q V ++L + F P C
Sbjct: 407 VCFTMLPT-NDVGIYGNLAQMNFVVGYDLGKRTVSFLPADC 446
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 92/280 (32%), Positives = 127/280 (45%), Gaps = 32/280 (11%)
Query: 1 GDFVTETVTLGSASVD-NIAIGCGHNNEG----LFVGAAGLLGLGGGSLSFPSQINASTF 55
G+ E G+++ D N+ GC + G GLLG+ GSLSF SQ+ F
Sbjct: 164 GNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKF 223
Query: 56 SYCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGD 107
SYC+ D L DS+ L P T PL+R + L F Y + LTGI V G
Sbjct: 224 SYCISGTDDFPGFLLLGDSNFTWLTPLNYT-PLIRISTPLPYFDRVAYTVQLTGIKVNGK 282
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGV- 162
LLPI ++ D +G G +VDSGT T L Y ALR F+ T + D V
Sbjct: 283 LLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVF 342
Query: 163 -ALFDTCYDFSS---RSSV--EVPTVSFHFPEGKVL----PLPAKNYLIPVDSNGTFCFA 212
D CY S RS + +PTVS F ++ PL + + V ++ +CF
Sbjct: 343 QGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFT 402
Query: 213 FAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
F + +IG+ QQ + F+L+ S IG P +C
Sbjct: 403 FGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVEC 442
>gi|326490700|dbj|BAJ90017.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493830|dbj|BAJ85377.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 459
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 82/257 (31%), Positives = 123/257 (47%), Gaps = 22/257 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGC-GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G ETV +GS V +GC N+ G VG G G G+LS SQ++ S FSY L
Sbjct: 169 GFLANETVAVGS-FVGAAILGCSAANSTGPLVGEVGSFGFNRGALSLVSQLSVSKFSYYL 227
Query: 60 VDRD---SDSTSTLEF-DSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLP-I 111
+ SDS S + D+++P + PLLR+ YY+ L+ I V G L I
Sbjct: 228 APDEAGSSDSESVVLLGDAAVPQTRGGGRSTPLLRSTAFPDVYYVKLSAIQVDGQALSGI 287
Query: 112 SETAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA----LFD 166
AF + G +GG+++ + +TRLQ + YNA+R A V A +G A +FD
Sbjct: 288 PAGAFDLAADGSSGGVVMGTLYPITRLQEDAYNAVRQALVSKINA-QEVNGSAFAGGVFD 346
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKV---LPLPAKNYLIPVDSNGTFCFAFAPTSSSL--- 220
CYD S +++ P ++ F G L L +Y + G CF P
Sbjct: 347 LCYDAQSVATLTFPKITLVFDGGNAPATLELTTVHYFFKDNVTGLQCFTMLPMPVGTPFG 406
Query: 221 SIIGNVQQQGTRVSFNL 237
S++G++ Q GT + +++
Sbjct: 407 SVLGSMVQAGTNMIYDV 423
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 117/247 (47%), Gaps = 15/247 (6%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + ++ +GC ++G F G+L LG +SF S+ A +FSYCLVD ++
Sbjct: 198 AQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNA 257
Query: 67 TSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
T L F +P T L FY + + + V G L I ++ + +GG
Sbjct: 258 TGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPA---EVWDPKSGG 314
Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS--SVEVPTVS 183
+I+DSGT +T L T Y A+ A + + D F+ CY++++ + E+P ++
Sbjct: 315 VILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLA 373
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLI 242
F L PAK+Y+I V G C +S+IGN+ QQ F+L+N +
Sbjct: 374 VQFTGCARLEPPAKSYVIDVKP-GVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEV 432
Query: 243 GFTPNKC 249
F P+ C
Sbjct: 433 RFMPSTC 439
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 77/250 (30%), Positives = 115/250 (46%), Gaps = 27/250 (10%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----FSYCLVD--RDSDST 67
V ++ GC + G F + GL+GLG G+LS SQ+ A+ FSYCLV ++S+
Sbjct: 212 VPRVSFGCSTGSAGSF-RSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSS 270
Query: 68 STLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
STL F + P A + PL+ + E+D++Y + L ++V G + + +
Sbjct: 271 STLSFGARAVVSDPGAASTPLVPS-EVDSYYTVALESVAVAGQ---------DVASANSS 320
Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE---VPT 181
IIVDSGT +T L L R R L CYD +S E +P
Sbjct: 321 RIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPD 380
Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVSFNLRN 239
V+ F G + L +N ++ GT C P S S +SI+GN+ QQ V ++L
Sbjct: 381 VTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDA 439
Query: 240 SLIGFTPNKC 249
+ F C
Sbjct: 440 RTVTFAAVDC 449
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 115/253 (45%), Gaps = 27/253 (10%)
Query: 11 GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----FSYCLV-DRDS 64
G V + GC + G F + GL+GLG G+ S SQ+ A+T SYCL+ D+
Sbjct: 211 GQVRVPRVNFGCSTASAGTFR-SDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDA 269
Query: 65 DSTSTLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
+S+STL F S P A + PL+ + ++D++Y + L ++VGG + ++
Sbjct: 270 NSSSTLNFGSRAVVSEPGAASTPLVPS-DVDSYYTVALESVAVGGQEVATHDSR------ 322
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE--- 178
IIVDSGT +T L L R + L CYD +S +
Sbjct: 323 ----IIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFG 378
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVSFN 236
+P V+ F G + L +N + GT C P S S +SI+GN+ QQ V ++
Sbjct: 379 IPDVTLRFGGGAAVTLRPEN-TFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYD 437
Query: 237 LRNSLIGFTPNKC 249
L + F C
Sbjct: 438 LDARTVTFAAADC 450
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 103/240 (42%), Gaps = 24/240 (10%)
Query: 34 AGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS-----TLEFDSS--LPPNAVTAPLL 86
+G+ G G G S PSQ+N FSYCLV D T L+ S+ N ++
Sbjct: 227 SGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPF 286
Query: 87 R-----NHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
R N +YY+ L + VGG + I + GNGG IVDSG+ T ++
Sbjct: 287 RSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPV 346
Query: 142 YNALRDAFVRGT-RALSPTDGVAL---FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
YN + F+R + S + V C++ S ++ P +F F G + P
Sbjct: 347 YNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLL 406
Query: 198 NYLIPVDSNGTFCF-------AFAPTSSSLSII-GNVQQQGTRVSFNLRNSLIGFTPNKC 249
NY V CF A P ++ +II GN QQQ V ++L N GF P C
Sbjct: 407 NYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 118/249 (47%), Gaps = 18/249 (7%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + ++ +GC +++G F A G+L LG +SF +Q A +FSYCLVD ++
Sbjct: 222 AQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNA 281
Query: 67 TSTLEFDSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
T L F P A L + E+ FY + + I V G L I + ++ +G
Sbjct: 282 TGYLAFGPGQVPRTPATQTKLFLDPEM-PFYGVKVDAIHVAGKALDIPAEVW---DAKSG 337
Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR---SSVEVPT 181
G+I+DSG +T L Y A+ A + + P F+ CY++++R + +P
Sbjct: 338 GVILDSGNTLTVLAAPAYKAVVAALSKHLDGV-PKVSFPPFEHCYNWTARRPGAPEIIPK 396
Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNS 240
++ F L PAK+Y+I V G C LS+IGN+ QQ F+L+N
Sbjct: 397 LAVQFAGSARLEPPAKSYVIDVKP-GVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNM 455
Query: 241 LIGFTPNKC 249
+ F + C
Sbjct: 456 QVRFKQSNC 464
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/258 (34%), Positives = 131/258 (50%), Gaps = 28/258 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ET TLG+ +V ++ GC +EG + +GL+GLG G LS SQ+NASTF YCL
Sbjct: 189 GFLARETFTLGADAVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLT 248
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL---DTFYYLGLTGISVGGDLLP-ISETAF 116
D+ S L F S ++T +++ L TFY + L IS+G P + E
Sbjct: 249 S-DASKASPLLFGSL---ASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPE- 303
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR--ALSPTDGVALFDTCYDFSSR 174
G++ DSGT +T L Y+ + AF+ T + TDG F+ C+ +
Sbjct: 304 --------GVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDG---FEACFQKPAN 352
Query: 175 ---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
S+ VPT+ HF +G + LP NY++ V+ +G C+ S SLSIIGN+ Q
Sbjct: 353 GRLSNAAVPTMVLHF-DGADMALPVANYVVEVE-DGVVCW-IVQRSPSLSIIGNIMQVNY 409
Query: 232 RVSFNLRNSLIGFTPNKC 249
V ++ S++ F P C
Sbjct: 410 LVLHDVHRSVLSFQPANC 427
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 128/279 (45%), Gaps = 35/279 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++T +G++++ GC +N GL+G+ GSLSF +Q+ FS
Sbjct: 145 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 204
Query: 57 YCLVDRDSD--------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
YC+ +DS S S L+ P ++ PL + Y + L GI V +
Sbjct: 205 YCISGQDSSGILLFGESSFSWLKALKYTPLVQISTPLPYFDRVA--YTVQLEGIKVANSM 262
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA----LSPTDGV-- 162
L + ++ + D +G G +VDSGT T L Y AL++ FVR T+A L + V
Sbjct: 263 LQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQ 322
Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAP 215
D CY + R+ +PTV+ F G + + A+ + V S+ +CF F
Sbjct: 323 GAMDLCYRVPLTRRTLPPLPTVTLMF-RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFG- 380
Query: 216 TSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S L IIG+ QQ + F+L S +GF +C
Sbjct: 381 -NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 121/273 (44%), Gaps = 43/273 (15%)
Query: 12 SASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS--D 65
A + + +GC + G F + G+L LG +SF S A FSYCLVD S +
Sbjct: 210 KAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRN 269
Query: 66 STSTLEFDSSLPPNAVTA---------------------------PLLRNHELDTFYYLG 98
+TS L F PN A PLL + + FY +
Sbjct: 270 ATSYLTFG----PNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVA 325
Query: 99 LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
+ +SV G L I + +D GG+I+DSGT++T L Y A+ A G L P
Sbjct: 326 VKAVSVAGQFLKIPRAVWDVD--AGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGL-P 382
Query: 159 TDGVALFDTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-T 216
+ F+ CY+++S S V +P ++ HF L P K+Y+I + G C
Sbjct: 383 RVTMDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDA-APGVKCIGLQEGP 441
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S+IGN+ QQ F+++N + F ++C
Sbjct: 442 WPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 128/279 (45%), Gaps = 35/279 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++T +G++++ GC +N GL+G+ GSLSF +Q+ FS
Sbjct: 152 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 211
Query: 57 YCLVDRDSD--------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
YC+ +DS S S L+ P ++ PL + Y + L GI V +
Sbjct: 212 YCISGQDSSGILLFGESSFSWLKALKYTPLVQISTPLPYFDRVA--YTVQLEGIKVANSM 269
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA----LSPTDGV-- 162
L + ++ + D +G G +VDSGT T L Y AL++ FVR T+A L + V
Sbjct: 270 LQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQ 329
Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAP 215
D CY + R+ +PTV+ F G + + A+ + V S+ +CF F
Sbjct: 330 GAMDLCYRVPLTRRTLPPLPTVTLMF-RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFG- 387
Query: 216 TSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S L IIG+ QQ + F+L S +GF +C
Sbjct: 388 -NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 121/275 (44%), Gaps = 28/275 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
G T+ +G A A GC +++ V AGLLG+ G+LSF +Q + FSY
Sbjct: 161 GALATDVFAVGEAPPLRSAFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSY 220
Query: 58 CLVDRDSDSTSTLEFDSSLP--P---NAVTAPLLRNHELDTFYY-LGLTGISVGGDLLPI 111
C+ DRD D+ L S LP P + P L D Y + L GI VGG LPI
Sbjct: 221 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPI 279
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-----SPTDGVA-LF 165
+ D +G G +VDSGT T L + Y+AL+ F++ T+ L P+
Sbjct: 280 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEAL 339
Query: 166 DTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAPTS 217
DTC+ + S +P V+ F G + + L V ++G +C F
Sbjct: 340 DTCFRVPAGRPPPSARLPPVTLLF-NGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNAD 398
Query: 218 S---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +IG+ Q V ++L +G P KC
Sbjct: 399 MVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 433
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/248 (33%), Positives = 120/248 (48%), Gaps = 19/248 (7%)
Query: 13 ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN---ASTFSYCLV--DRDSDS 66
A +A GCG N G F +G++GLGGGS+S SQ+ + FSYCLV S+
Sbjct: 207 AYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNY 266
Query: 67 TSTLEFDSSLP-----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
TS + F + + N V+ PLL +T+YYL L ISV LP T E
Sbjct: 267 TSKINFGNDINISGSNYNVVSTPLLPKKP-ETYYYLTLEAISVENKRLPY--TNLWNGEV 323
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
G II+DSGT +T L +E +N L A + +D LF+ C F ++E+P
Sbjct: 324 EKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNIC--FKDEKAIELPI 381
Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
++ HF G + L N V+ + CF P S+ ++I GN+ Q V ++L
Sbjct: 382 ITAHF-TGADVELQPVNTFAKVEED-LLCFTMIP-SNDIAIFGNLAQMNFLVGYDLEKKA 438
Query: 242 IGFTPNKC 249
+ F P C
Sbjct: 439 VSFLPTDC 446
>gi|361068719|gb|AEW08671.1| Pinus taeda anonymous locus CL1136Contig1_03 genomic sequence
gi|376338612|gb|AFB33836.1| hypothetical protein CL1136Contig1_03, partial [Pinus mugo]
gi|376338614|gb|AFB33837.1| hypothetical protein CL1136Contig1_03, partial [Pinus mugo]
gi|376338616|gb|AFB33838.1| hypothetical protein CL1136Contig1_03, partial [Pinus mugo]
gi|383135631|gb|AFG48834.1| Pinus taeda anonymous locus CL1136Contig1_03 genomic sequence
Length = 70
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/69 (63%), Positives = 51/69 (73%)
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
G +LFDTCYD S +V+VPTV FHF + LPA NYLIPVDS+ TFCFAFA + L
Sbjct: 2 GFSLFDTCYDLSGLKTVKVPTVVFHFQGRADVSLPATNYLIPVDSSATFCFAFAGNTGGL 61
Query: 221 SIIGNVQQQ 229
SIIGN+QQQ
Sbjct: 62 SIIGNIQQQ 70
>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/239 (33%), Positives = 113/239 (47%), Gaps = 27/239 (11%)
Query: 35 GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLR-NHE 90
GL+G+ GSLSF SQ++ FSYC+ D D L F +P N PL++ +
Sbjct: 133 GLMGMNRGSLSFVSQMDFPKFSYCISDSDFSGVLLLGDANFSWLMPLNY--TPLIQISTP 190
Query: 91 LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
L F Y + L GI V LLP+ ++ F D +G G +VDSGT T L Y+ALR
Sbjct: 191 LPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALR 250
Query: 147 DAFVRGT----RALSPTDGVAL--FDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
+ F+ T R L + V D CY S S +PTVS F G + +
Sbjct: 251 NEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMF-RGAEMKVSGDR 309
Query: 199 --YLIPVDSNGT---FCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
Y +P + G+ +CF F + + +IG+ QQ + F+L S IGF +C
Sbjct: 310 LLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 368
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 110/273 (40%), Gaps = 31/273 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G F+ E + ++ N +GC + + + L G G S P Q+ F+YCL
Sbjct: 185 GYFLLENLKFPRKTIRNFLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLN 243
Query: 61 DRDSDSTST-----LEFDSSLPPNAVTAPLLRNHELDTFYY-LGLTGISVGGDLLPISET 114
D D T L++ P L++ FYY LG+ I +G LL I
Sbjct: 244 SHDYDDTRNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSK 303
Query: 115 AFKIDESGNGGIIVDSGTA--------VTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
G G+I+DSG V ++ T N L+ + R+L L
Sbjct: 304 YLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVT---NELKKQMSKYRRSLEAETQTGL-T 359
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL----------IPVDSNGTFCFAFAPT 216
CY+F+ S+++P + + F G + +P KNY +D+NGT P
Sbjct: 360 PCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPD 419
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S I+GN Q V ++L+N GF C
Sbjct: 420 PS--IILGNSQHVDYYVEYDLKNDRFGFRRQTC 450
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 112/259 (43%), Gaps = 45/259 (17%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
F + T SA+V + GCGH N G+F G+ G G GSLS PSQ+ FS+C
Sbjct: 190 FASGTGEGSSAAVPGLVFGCGHANRGVFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTT 249
Query: 62 RDSDSTST--LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
TS L PP+A +PL R S
Sbjct: 250 ITGSKTSAVLLGLPGVAPPSA--SPLGRRRG---------------------SYRCRSTP 286
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR-SSV 177
S N SGT++T L TY A+R+ F + + P + F TC+ R
Sbjct: 287 RSSN------SGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPF-TCFSAPLRGPKP 339
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPV-------DSNGTFCFAFAPTSSSLSIIGNVQQQG 230
+VPT++ HF EG + LP +NY+ V +S+ C A I+GN+QQQ
Sbjct: 340 DVPTMALHF-EGATMRLPQENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNIQQQN 396
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V ++L+NS + F P +C
Sbjct: 397 MHVLYDLQNSKLSFVPAQC 415
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/264 (35%), Positives = 129/264 (48%), Gaps = 26/264 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ +T+TL + +V + GC G + A GLLGLG G LS SQ + STFSY
Sbjct: 198 ANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSY 257
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S +L F SL P PLL+N + Y++ L + VG ++
Sbjct: 258 CL-----PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVD 312
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCY 169
+ +F + S G I DSGT TRL T Y A+RDAF R R L+ T + FDTCY
Sbjct: 313 VPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCY 371
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
+ PT++F F G + LP N LI + T C A A +S L++I N
Sbjct: 372 TV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIAN 426
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ R+ +++ NS +G C
Sbjct: 427 LQQQNHRLLYDVPNSRLGVARELC 450
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/248 (32%), Positives = 115/248 (46%), Gaps = 27/248 (10%)
Query: 16 DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS--DSTSTL 70
D GC +G + GL+GLG S S Q+ FSYCLV DS + S L
Sbjct: 120 DGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179
Query: 71 EFDSSLP---PNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNG-- 124
SS + V+ P+L LD T YY+ L I++GG +P+ ESG+
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGG--VPV---VVYDKESGHNTS 234
Query: 125 -------GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSS 176
++DSGT T L Y A+R + + + PT G A D C++ S +S
Sbjct: 235 VGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEE--QVILPTLGNSAGLDLCFNSSGDTS 292
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
P+V+F+F L LP +N + V S C + + LSIIGN+QQQ + ++
Sbjct: 293 YGFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351
Query: 237 LRNSLIGF 244
L S I F
Sbjct: 352 LVASQISF 359
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 123/279 (44%), Gaps = 34/279 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G + +T+ +V +GC + + +GL G G G+ S P+Q+ FSYCL+
Sbjct: 212 GLLIADTLRAPGRAVPGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLL 269
Query: 61 DRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELD-----TFYYLGLTGISVGGDLLP 110
R D + + L PL+++ D +YYL L G++VGG +
Sbjct: 270 SRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVR 329
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALF 165
+ AF + +G+GG IVDSGT T L + + DA V R R+ DG+ L
Sbjct: 330 LPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGL- 388
Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF-------- 213
C+ + S+ +P +SFHF G V+ LP +NY + V G C A
Sbjct: 389 HPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV-VAGRGAVEAICLAVVTDFGGGS 447
Query: 214 ---APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S I+G+ QQQ V ++L +GF C
Sbjct: 448 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 486
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/272 (29%), Positives = 135/272 (49%), Gaps = 36/272 (13%)
Query: 3 FVTETVTLGS-ASVDNIAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
+ ETV G +V + A GC + E + GA+G+LGL G ++ P Q+ FS+
Sbjct: 199 LIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSH 258
Query: 58 CLVDRDS--DSTSTLEF-DSSLPPNAV--TAPLLRNHELD-TFYYLGLTGISVGGD---L 108
C DR S +ST + F ++ LP V T+ L N EL FY++ L G+S+ L
Sbjct: 259 CFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVL 318
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFD- 166
LP +I+DSG++ + ++ LR+AF++ +L +G + D
Sbjct: 319 LPRGSV-----------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDL 367
Query: 167 -TCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSS 218
TC+ S+ E+ P++S F +G + +P+ L+PV ++ CFAF
Sbjct: 368 GTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGP 427
Query: 219 S-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +++IGN QQQ V ++++ S +GF C
Sbjct: 428 NPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/259 (35%), Positives = 128/259 (49%), Gaps = 26/259 (10%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDR 62
+T+TL + +V + GC G + A GLLGLG G LS SQ + STFSYCL
Sbjct: 124 DTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCL--- 180
Query: 63 DSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
S +L F SL P PLL+N + Y++ L + VG ++ + +
Sbjct: 181 --PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGS 238
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCYDFSSR 174
F + S G I DSGT TRL T Y A+RDAF R R L+ T + FDTCY
Sbjct: 239 FTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCYTV--- 294
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQG 230
+ PT++F F G + LP N LI + T C A A +S L++I N+QQQ
Sbjct: 295 -PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQN 352
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+ +++ NS +G C
Sbjct: 353 HRLLYDVPNSRLGVARELC 371
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 121/270 (44%), Gaps = 33/270 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGC----GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G T+T +G+A+ ++ GC G + G G +GL+GLG S SQ+N + FS
Sbjct: 140 GIVATDTFAIGTATA-SLGFGCVVASGIDTMG---GPSGLIGLGRAPSSLVSQMNITKFS 195
Query: 57 YCLVDRDSDSTSTLEFDSSLP----PNAVTAPLLRNH---ELDTFYYLGLTGISVGGDLL 109
YCL DS S L SS N+ T P ++ ++ +Y + L GI G
Sbjct: 196 YCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAG---- 251
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ A + SGN ++V + ++ L Y AL+ + A + FD C+
Sbjct: 252 ---DAAIALPPSGN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCF 307
Query: 170 DFSSRSSVEVPTVSFHFPEG-KVLPLPAKNYLIPV-DSNGTFCFAFAPTS--------SS 219
+ S+ P + F F +G L +P YLI V + GT C A TS +
Sbjct: 308 PKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDEN 367
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L+I+G++QQ+ T +L + F P C
Sbjct: 368 LNILGSLQQENTHFLLDLEKKTLSFEPADC 397
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/264 (35%), Positives = 129/264 (48%), Gaps = 26/264 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
+ +T+TL + +V + GC G + A GLLGLG G LS SQ + STFSY
Sbjct: 184 ANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSY 243
Query: 58 CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
CL S +L F SL P PLL+N + Y++ L + VG ++
Sbjct: 244 CL-----PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVD 298
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCY 169
+ +F + S G I DSGT TRL T Y A+RDAF R R L+ T + FDTCY
Sbjct: 299 VPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCY 357
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
+ PT++F F G + LP N LI + T C A A +S L++I N
Sbjct: 358 TV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIAN 412
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
+QQQ R+ +++ NS +G C
Sbjct: 413 LQQQNHRLLYDVPNSRLGVARELC 436
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 110/273 (40%), Gaps = 38/273 (13%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST----FSYCLVDRDSDST 67
S +V+N C H G VG AG G G LS P+Q+ + FSYCLV +
Sbjct: 213 SVAVENFTFACAHTALGEPVGVAGF---GRGPLSLPAQLAPAALSGRFSYCLVAHSFRAD 269
Query: 68 STLE-----------FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
+ D + V PLL N + FY + L +SVGG +P
Sbjct: 270 RPIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELG 329
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDF 171
++ +G+GG++VDSGT T L ETY + + F R A A D CY +
Sbjct: 330 RVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYY 389
Query: 172 SSRSSV-------EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSS--- 218
+S VP ++ HF + LP +NY + S C
Sbjct: 390 DHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDG 449
Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+GN QQQG V +++ +GF +C
Sbjct: 450 GGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 121/270 (44%), Gaps = 33/270 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGC----GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G T+T +G+A+ ++ GC G + G G +GL+GLG S SQ+N + FS
Sbjct: 156 GIVATDTFAIGTATA-SLGFGCVVASGIDTMG---GPSGLIGLGRAPSSLVSQMNITKFS 211
Query: 57 YCLVDRDSDSTSTLEFDSSLP----PNAVTAPLLRNH---ELDTFYYLGLTGISVGGDLL 109
YCL DS S L SS N+ T P ++ ++ +Y + L GI G
Sbjct: 212 YCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAG---- 267
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ A + SGN ++V + ++ L Y AL+ + A + FD C+
Sbjct: 268 ---DAAIALPPSGN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCF 323
Query: 170 DFSSRSSVEVPTVSFHFPEG-KVLPLPAKNYLIPV-DSNGTFCFAFAPTS--------SS 219
+ S+ P + F F +G L +P YLI V + GT C A TS +
Sbjct: 324 PKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDEN 383
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L+I+G++QQ+ T +L + F P C
Sbjct: 384 LNILGSLQQENTHFLLDLEKKTLSFEPADC 413
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 120/268 (44%), Gaps = 27/268 (10%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E + S + I +GC ++ A G+LG+ G L FPSQ + FSYC+
Sbjct: 178 GNLVREKIAFSPSQTTPPIILGCATQSDD----ARGILGMNLGRLGFPSQAKITKFSYCV 233
Query: 60 VDRDSDSTSTLEFDSSLPP-------NAVT-APLLRNHELDTFYY-LGLTGISVGGDLLP 110
+ + S + + P N +T R LD Y L L GIS+GG L
Sbjct: 234 PTKQAQPASGSFYLGNNPASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLN 293
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTC 168
I + FK + G+G ++DSG+ T L E YN +R+ V+ G + + D C
Sbjct: 294 IPPSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADIC 353
Query: 169 YDFSSRSSVE----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
+D ++E V + F F +G + +P + L VD G C + + +
Sbjct: 354 FD---GDAIEIGRLVGDMVFEFEKGVQIVIPKERVLATVDG-GVHCLGMGRSERLGAGGN 409
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN QQ V F+L N +GF C
Sbjct: 410 IIGNFHQQNLWVEFDLANRRVGFGEADC 437
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/257 (32%), Positives = 116/257 (45%), Gaps = 19/257 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGG-SLSFPSQINAS---TFS 56
G FV + VTL GCG + G F A+G+LGL G S SQ + FS
Sbjct: 204 GVFVCDEVTLKPDVFPKFQFGCGDSGGGEFGTASGVLGLAKGEQYSLISQTASKFKKKFS 263
Query: 57 YCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
YC ++ S L E S P+ LL N Y++ L GISV L +S +
Sbjct: 264 YCFPPKEHTLGSLLFGEKAISASPSLKFTQLL-NPPSGLGYFVELIGISVAKKRLNVSSS 322
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDGVALFDTCYDF 171
F + G I+DSGT +TRL T Y ALR AF + ++SP L DTCY+
Sbjct: 323 LF-----ASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNL 377
Query: 172 S--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
++++P + HF + L L C AFA S S ++IIGN Q
Sbjct: 378 KGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQ 437
Query: 228 QQGTRVSFNLRNSLIGF 244
Q +V +++ +GF
Sbjct: 438 QVSLKVVYDIEGGRLGF 454
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/259 (35%), Positives = 128/259 (49%), Gaps = 26/259 (10%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDR 62
+T+TL + +V + GC G + A GLLGLG G LS SQ + STFSYCL
Sbjct: 72 DTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCL--- 128
Query: 63 DSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
S +L F SL P PLL+N + Y++ L + VG ++ + +
Sbjct: 129 --PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGS 186
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCYDFSSR 174
F + S G I DSGT TRL T Y A+RDAF R R L+ T + FDTCY
Sbjct: 187 FTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCYTV--- 242
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQG 230
+ PT++F F G + LP N LI + T C A A +S L++I N+QQQ
Sbjct: 243 -PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQN 300
Query: 231 TRVSFNLRNSLIGFTPNKC 249
R+ +++ NS +G C
Sbjct: 301 HRLLYDVPNSRLGVARELC 319
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 117/289 (40%), Gaps = 53/289 (18%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCL 59
+T++L S + N GC H G+ G G G LS P+Q+ + FSYCL
Sbjct: 188 DTLSLSSLFLRNFTFGCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCL 244
Query: 60 VDRDSDSTSTLE--------FDSSLPPNA-------VTAPLLRNHELDTFYYLGLTGISV 104
V DS + ++ V +L N + FY + L GI+V
Sbjct: 245 VSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAV 304
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-------RALS 157
G +P E +++ G+GG++VDSGT T L YN++ D F R R +
Sbjct: 305 GKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIE 364
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK--VLPLPAKNYLIPVDSN--------- 206
G+A CY + S +VP ++ F GK + LP KNY
Sbjct: 365 EKTGLA---PCYYLN--SVADVPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRK 419
Query: 207 -GTFCFAFAPTSSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
G + LS +GN QQQG V ++L +GF +C
Sbjct: 420 VGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/270 (31%), Positives = 124/270 (45%), Gaps = 29/270 (10%)
Query: 1 GDFVTETVTLGSASVDNI------AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN--- 51
GD +T+ + A+ D + GCG +GL G G+L L GSLSFPSQI
Sbjct: 71 GDLSVDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKY 130
Query: 52 ASTFSYCLVD---RDSDSTSTLEFDSSL----PPNAVTAPLLRNH---ELDTFYYLGLTG 101
+ FSYCL+ ++S S + F + P + L+ E +Y + L G
Sbjct: 131 GNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDG 190
Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
ISVG L +S +AF + + I DSGT +T L ++++ + +S +
Sbjct: 191 ISVGNQRLDLSPSAFLNGQ--DKPTIFDSGTTLTMLPPGVCDSIKQSLA---SMVSGAEF 245
Query: 162 VAL--FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
VA+ D C+ S +P ++FHF G NY+I D C F PT +
Sbjct: 246 VAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVI--DLGSLQCLIFVPT-NE 302
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+SI GN+QQQ V ++ N IGF C
Sbjct: 303 VSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
Length = 426
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 121/257 (47%), Gaps = 18/257 (7%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS- 64
E +G+ GC + G +G+LG G S SQ+ S FSY ++ D+
Sbjct: 158 EVTAVGTHITGRALFGCSLASTVPLDGESGVLGFSRGPYSLLSQLKISRFSYFMLPDDAD 217
Query: 65 --DSTSTLEF-DSSLPP--NAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKI 118
DS S L D ++P ++ + PLLRN YY+ LTGI V L I F +
Sbjct: 218 KPDSESVLLLGDDAVPQTNSSRSTPLLRNEAYPDLYYVKLTGIKVDDKSLSGIPAGTFDL 277
Query: 119 DESG-NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSR 174
+G +GG+++ + + +T LQ YNAL A ++ D VA CY+ S
Sbjct: 278 AANGCSGGVVMSTLSPITYLQPAAYNALTRALASKIKSQPVRPKADDVADLRLCYNIQSV 337
Query: 175 SSVEVPTVS--FHFPEGKVLP--LPAKNYLIPVDSNGTFCFAFAPT---SSSLSIIGNVQ 227
+++ P ++ FH +G+ P L +Y I +S G C PT S S++G++
Sbjct: 338 ANLTFPKITLVFHGVDGRPAPMELTTAHYFIRENSTGLQCLTMLPTPAGSPVSSVLGSLL 397
Query: 228 QQGTRVSFNLRNSLIGF 244
Q GT + ++LR + F
Sbjct: 398 QTGTHMIYDLRGGSLTF 414
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 126/272 (46%), Gaps = 45/272 (16%)
Query: 1 GDFVTETVTL---GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---T 54
G + TET+TL + V+N + GCG +G+F GLLGLGG S SQ +
Sbjct: 220 GVYSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGA 279
Query: 55 FSYCLVDRDS--------------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLT 100
FSYCL +S ++T+ +F PL TFY + LT
Sbjct: 280 FSYCLPAGNSTAGFLALGAPATGGNNTAGFQF----------TPLQVVET--TFYLVKLT 327
Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA---LS 157
GISVGG L I T F GG+I+DSGT VT L Y+ALR AF A L
Sbjct: 328 GISVGGKQLDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLP 381
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
P D L DTCYDF+ ++V VPTV+ F G + L + ++ +G F +
Sbjct: 382 PNDDEDL-DTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVL---LDGCLAFVAGASD 437
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGNV Q+ V ++ +GF C
Sbjct: 438 GDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|326523515|dbj|BAJ92928.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 459
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/257 (31%), Positives = 123/257 (47%), Gaps = 22/257 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGC-GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G ETV +GS V +GC N+ G VG G G G+LS SQ++ S FSY L
Sbjct: 169 GFLANETVAVGS-FVGAAILGCSAANSTGPLVGEVGSFGFNRGALSLVSQLSVSKFSYYL 227
Query: 60 VDRD---SDSTSTLEF-DSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLP-I 111
+ SDS S + D+++P + PLLR+ +Y+ L+ I V G L I
Sbjct: 228 APDEAGSSDSESVVLLGDAAVPQTRGGGRSTPLLRSTAFPDVHYVKLSAIQVDGQALSGI 287
Query: 112 SETAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA----LFD 166
AF + G +GG+++ + +TRLQ + YNA+R A V A +G A +FD
Sbjct: 288 PAGAFDLAADGSSGGVVMGTLYPITRLQEDAYNAVRQALVSKINA-QEVNGSAFAGGVFD 346
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKV---LPLPAKNYLIPVDSNGTFCFAFAPTSSSL--- 220
CYD S +++ P ++ F G L L +Y + G CF P
Sbjct: 347 LCYDAQSVATLTFPKITLVFDGGNAPATLELTTVHYFFKDNVTGLQCFTMLPMPVGTPFG 406
Query: 221 SIIGNVQQQGTRVSFNL 237
S++G++ Q GT + +++
Sbjct: 407 SVLGSMVQAGTNMIYDV 423
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 110/250 (44%), Gaps = 34/250 (13%)
Query: 33 AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDS-----SLPPNAVTAPLLR 87
A GLLG+ GSLSF +Q F+YC+ D L D S P PL+
Sbjct: 208 ATGLLGMNRGSLSFVTQTGTLRFAYCIAPGDGPGLLVLGGDGDGAALSAAPQLNYTPLIE 267
Query: 88 -NHELDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
+ L F Y + L GI VG LLPI ++ D +G G +VDSGT T L + Y
Sbjct: 268 MSQPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAY 327
Query: 143 NALRDAFVRGTRALSPTDG------VALFDTCYDFSSRSSVEVPTVSFHFPE------GK 190
L+ F+ T AL G FD C+ +S + V T S PE G
Sbjct: 328 APLKGEFLNQTSALLAPLGEPDFVFQGAFDACFR-ASEARVAAATASQLLPEVGLVLRGA 386
Query: 191 VLPLPAKN--YLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRN 239
+ + + Y++P + G +C F + S +IG+ QQ V ++L+N
Sbjct: 387 EVAVGGEKLLYMVPGERRGEGGSEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQN 446
Query: 240 SLIGFTPNKC 249
S +GF P +C
Sbjct: 447 SRVGFAPARC 456
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 77/250 (30%), Positives = 109/250 (43%), Gaps = 11/250 (4%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL--VDRD 63
ET G S NI GCG +N G F +G+LGLG G+ S ++ S FSYC +
Sbjct: 175 ETSDDGLISKQNIVFGCGQDNSG-FTKYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNP 233
Query: 64 SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
+ + L + PL YYL L IS G LL I F+ S
Sbjct: 234 TYPHNILILGNGAKIEGDPTPL---QIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRS-Q 289
Query: 124 GGIIVDSGTAVTRLQTETYNALRDA--FVRGTRALSPTDGVALFDTCYDFSSRSSVE-VP 180
GG ++D+G + T L E Y L + F+ G D CY+ + + + P
Sbjct: 290 GGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFP 349
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRN 239
V+FHF G L L ++ + +S +FC A T +S+IG + QQ V +NLR
Sbjct: 350 VVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRT 409
Query: 240 SLIGFTPNKC 249
+ F C
Sbjct: 410 MKVYFQRTDC 419
>gi|194702702|gb|ACF85435.1| unknown [Zea mays]
gi|414885969|tpg|DAA61983.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 163
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 83/163 (50%), Gaps = 14/163 (8%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD----A 148
+FYYL LTGI+V G + + + F + G I+DSGTA + L Y ALR A
Sbjct: 8 SFYYLNLTGITVAGRAIKVPPSVF----ATAAGTIIDSGTAFSCLPPSAYAALRSSVRSA 63
Query: 149 FVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
R RA S T +FDTCYD + +V +P+V+ F +G + L L +
Sbjct: 64 MGRYKRAPSST----IFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQ 119
Query: 209 FCFAFAPT--SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C AF P +SL ++GN QQ+ V +++ N +GF N C
Sbjct: 120 TCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 162
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 127/280 (45%), Gaps = 32/280 (11%)
Query: 1 GDFVTETVTLGSASVD-NIAIGCGHNNEG----LFVGAAGLLGLGGGSLSFPSQINASTF 55
G+ E G+++ D N+ GC + G GLLG+ GSLSF SQ+ F
Sbjct: 164 GNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKF 223
Query: 56 SYCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGD 107
SYC+ D L DS+ L P T PL+R + L F Y + LTGI V G
Sbjct: 224 SYCISGTDDFPGFLLLGDSNFTWLTPLNYT-PLIRISTPLPYFDRVAYTVQLTGIKVNGK 282
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALF 165
LLPI ++ D +G G +VDSGT T L Y ALR F+ T + D +F
Sbjct: 283 LLPIPKSVLLPDHTGAGQTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVF 342
Query: 166 ----DTCYD---FSSRSSV--EVPTVSFHFPEGKVL----PLPAKNYLIPVDSNGTFCFA 212
D CY F R+ + +PTVS F ++ PL + + ++ +CF
Sbjct: 343 QGTMDLCYRISPFRIRTGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFT 402
Query: 213 FAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
F + +IG+ QQ + F+L+ S IG P +C
Sbjct: 403 FGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVQC 442
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 134/272 (49%), Gaps = 36/272 (13%)
Query: 3 FVTETVTLGS-ASVDNIAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
+ ETV G +V + A GC + E + GA+G+LGL G ++ P Q+ FS+
Sbjct: 199 LIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSH 258
Query: 58 CLVDRDS--DSTSTLEF-DSSLPPNAV--TAPLLRNHELD-TFYYLGLTGISVGGD---L 108
C DR S +ST + F ++ LP V T+ L N EL FY++ L G+S+
Sbjct: 259 CFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVF 318
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFD- 166
LP +I+DSG++ + ++ LR+AF++ +L +G + D
Sbjct: 319 LPRGSV-----------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDL 367
Query: 167 -TCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSS 218
TC+ S+ E+ P++S F +G + +P+ L+PV ++ CFAF
Sbjct: 368 GTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGP 427
Query: 219 S-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +++IGN QQQ V ++++ S +GF C
Sbjct: 428 NPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 111/243 (45%), Gaps = 21/243 (8%)
Query: 17 NIAIGCG-HNNEGLF--VGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTSTL 70
N GCG +NN +F G++GLG G LS SQI FSYCL+ S STS L
Sbjct: 203 NSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTSTSKL 262
Query: 71 EFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
+F + V+ P++ L T+Y+L L ++V +P T +G +I
Sbjct: 263 KFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGST--------DGNVI 314
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP 187
+DSGT +T L Y + D ++ C+ + R + P ++F F
Sbjct: 315 IDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPY--RDNFVFPEIAFQFT 372
Query: 188 EGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
+V PA +++ D N T C AP+S S +SI G+ Q +V ++L + F P
Sbjct: 373 GARVSLKPANLFVMTEDRN-TVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQP 431
Query: 247 NKC 249
C
Sbjct: 432 TDC 434
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 109/270 (40%), Gaps = 25/270 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G F+ E + ++ +GC + + + L G G S P Q+ F+YCL
Sbjct: 193 GFFLLENLDFPGKTIHKFLVGCTTSADRE-PSSDALAGFGRTMFSLPMQMGVKKFAYCLN 251
Query: 61 DRDSDSTST-----LEFDSSLPPNAVTAPLLRNH-ELDTFYYLGLTGISVGGDLLPISET 114
D D T L++ AP L+N + +YYLG+ + +G LL I
Sbjct: 252 SHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGK 311
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETY----NALRDAFVRGTRALSPTDGVALFDTCYD 170
GG+++DSG A + + N L+ + R+L L CY+
Sbjct: 312 YLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGL-TPCYN 370
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF-----------AFAPTSSS 219
F+ S+++P + + F G + +P NY + CF F P S
Sbjct: 371 FTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPS- 429
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN QQ V F+L+N +GF C
Sbjct: 430 -IILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 116/249 (46%), Gaps = 13/249 (5%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G +ET TLGS +V I GC +EG + +GL+GLG G LS Q+ FSYCL
Sbjct: 180 GYMGSETFTLGSDAVQGIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLT 239
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
S S+ L +L V + L N + TFY + L IS+G A K
Sbjct: 240 SDPSTSSPLLFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIG---------AAKTPG 290
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
+G GII DSGT +T L Y + T L+ G ++ C F + P
Sbjct: 291 TGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGTDGYEVC--FQTSGGAVFP 348
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
++ HF +G + L +NY V+ + C+ + S +SI+GN+ Q + ++L S
Sbjct: 349 SMVLHF-DGGDMALKTENYFGAVN-DSVSCWLVQKSPSEMSIVGNIMQMDYHIRYDLDKS 406
Query: 241 LIGFTPNKC 249
++ F P C
Sbjct: 407 VLSFQPTNC 415
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 119/252 (47%), Gaps = 19/252 (7%)
Query: 12 SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
A + + +GC + +G F + G+L LG ++SF S+ + FSYCLVD +
Sbjct: 220 KAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRN 279
Query: 66 STSTLEFDSSLPPNAVTAP-------LLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
+TS L F + + LL + FY++ + ++V G+ L I +
Sbjct: 280 ATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVW-- 337
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
D NGG I+DSGT++T L T Y+A+ A + + P + F+ CY+++ S E
Sbjct: 338 DFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGV-PRVNMDPFEYCYNWTG-VSAE 395
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNL 237
+P + F L P K+Y+I + G C +S+IGN+ QQ F+L
Sbjct: 396 IPRMELRFAGAATLAPPGKSYVIDT-APGVKCIGVVEGAWPGVSVIGNILQQEHLWEFDL 454
Query: 238 RNSLIGFTPNKC 249
N + F ++C
Sbjct: 455 ANRWLRFKQSRC 466
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 110/263 (41%), Gaps = 30/263 (11%)
Query: 3 FVTETVTLGSASVD------NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-- 54
F + T G +VD + GC EGL V GL+GL G +S SQ++A T
Sbjct: 152 FADGSCTAGPVTVDAFTFSTRLDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPF 211
Query: 55 ---FSYCLV--DRDSDSTSTLEFDS----SLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCLV +S+L F S S P A T PL+ +FY + L I V
Sbjct: 212 AHKFSYCLVPYSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRN-KSFYTIALDSIKVA 270
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
G +P+ T K+ IVDSGT +T L + L A + L+
Sbjct: 271 GKPVPLQTTTTKL--------IVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLY 322
Query: 166 DTCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
CYD R+ +V P V+ G + LP N + + T C A +
Sbjct: 323 AVCYDVRRRAPEDVGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEF 382
Query: 222 IIGNVQQQGTRVSFNLRNSLIGF 244
I+GNV QQ V F+L + F
Sbjct: 383 ILGNVAQQNLHVGFDLERRTVSF 405
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 127/280 (45%), Gaps = 34/280 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ET +GS + GC +N + GL+G+ GSLSF +Q+ S FS
Sbjct: 155 GNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFS 214
Query: 57 YCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
YC+ DS S L D+S L P T +L++ L F Y + L GI VG +L
Sbjct: 215 YCISGSDS-SVFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKIL 273
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
+ ++ F D +G G +VDSGT T L Y AL++ F+ T R + D V
Sbjct: 274 SLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQG 333
Query: 164 LFDTCYDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGT------FCFAFA 214
D CY S + +P VS F G + + + L V+ G+ +CF F
Sbjct: 334 TMDLCYKVGSTTRPNFSGLPMVSLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG 392
Query: 215 PTSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPN-KC 249
S L I IG+ QQ + F+L S +GF N +C
Sbjct: 393 -NSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 431
>gi|222617032|gb|EEE53164.1| hypothetical protein OsJ_35998 [Oryza sativa Japonica Group]
Length = 384
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/244 (29%), Positives = 111/244 (45%), Gaps = 33/244 (13%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G T+T T G+ +V + GC + G F GA+G++G+G G+LS SQ+ FSY L+
Sbjct: 133 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 192
Query: 61 ----DRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
D + S + F D ++P + LD I
Sbjct: 193 APEATDDGSADSVIRFGDDAVPKT-------KRGRLDA-----------------IPAGT 228
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYDFSS 173
F + +G GG+I+ S T VT L+ Y+ +R A V L +G A D CY+ SS
Sbjct: 229 FDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYNASS 287
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
+ V+VP ++ F G + L A NY + G C P+ S++G + Q GT +
Sbjct: 288 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNM 346
Query: 234 SFNL 237
+++
Sbjct: 347 IYDV 350
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 78/251 (31%), Positives = 113/251 (45%), Gaps = 13/251 (5%)
Query: 6 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC---LVDR 62
+T G S NI GCG +N G F +G+LGLG G+ S ++ S FSYC L+D
Sbjct: 185 QTSDEGLISKPNIVFGCGQDNSG-FTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLID- 242
Query: 63 DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
+ + L + PL YYL L IS+G LL I F+ S
Sbjct: 243 PTYPHNFLILGNGARIEGDPTPL---QIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRS- 298
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDA--FVRGTRALSPTDGVALFDTCYDFSSRSSVE-V 179
GG ++D+G + T L E Y L + F+ G D + CY+ + + +
Sbjct: 299 KGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGF 358
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLR 238
P V+FHF G L L ++ + +S +FC A T +S+IG + QQ V +NLR
Sbjct: 359 PVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLR 418
Query: 239 NSLIGFTPNKC 249
+ F C
Sbjct: 419 TMKVYFQRTDC 429
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 125/275 (45%), Gaps = 28/275 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++ +GS+++ GC +N GL+G+ GSLSF +Q+ FS
Sbjct: 129 GNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS 188
Query: 57 YCLVDRDSDSTSTLEFDSSLPP--NAVTAPLLR-NHELDTF----YYLGLTGISVGGDLL 109
YC+ RDS S L DS L N PL++ + L F Y + L GI VG +L
Sbjct: 189 YCISGRDS-SGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKIL 247
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF---- 165
P+ ++ F D +G G +VDSGT T L Y ALR+ F+ T+ + G F
Sbjct: 248 PLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQG 307
Query: 166 --DTCYDFSSRSSV-EVPTVSFHFPEGK-VLPLPAKNYLIPVDSNG---TFCFAFAPTSS 218
D CY + + E+P VS F + V+ Y +P G +C F S
Sbjct: 308 AMDLCYRVPAGGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFG-NSD 366
Query: 219 SLSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L I IG+ QQ + F+L S +GF +C
Sbjct: 367 LLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 401
>gi|297605070|ref|NP_001056627.2| Os06g0118000 [Oryza sativa Japonica Group]
gi|55296430|dbj|BAD68553.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|215692556|dbj|BAG87976.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255676664|dbj|BAF18541.2| Os06g0118000 [Oryza sativa Japonica Group]
Length = 175
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 15/179 (8%)
Query: 74 SSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGT 132
++L P V+ PLL + + TFY + L I V G LP+ T F + ++DS T
Sbjct: 9 AALVPTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVF------SASSVIDSAT 62
Query: 133 AVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVL 192
++R+ Y ALR AF P V++ DTCYDFS S+ +P+++ F G +
Sbjct: 63 VISRIPPTAYQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATV 122
Query: 193 PLPAKNYLIPVDSNGTFCFAFAPTSSSLS--IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L A L+ G C AFAPT+S IGNVQQ+ V +++ I F C
Sbjct: 123 NLDAAGILL----QG--CLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 175
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 120/277 (43%), Gaps = 31/277 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGC----GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G ET GS + GC +N GL+G+ GSLSF +Q+ FS
Sbjct: 156 GHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKFS 215
Query: 57 YCLVDRDSD--------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
YC+ DS S L+ + P ++ PL + Y + L GI V +
Sbjct: 216 YCISGLDSTGFLLLGEARYSWLKPLNYTPLVQISTPLPYFDRVA--YSVQLEGIKVNNKV 273
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
LP+ ++ F D +G G +VDSGT T L Y+ALR F+ T R L+ V
Sbjct: 274 LPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQ 333
Query: 163 ALFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAP 215
D CY S SS +P V F G + + + Y +P + G +CF F
Sbjct: 334 GAMDLCYLIDSTSSTLPNLPVVKLMF-RGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGN 392
Query: 216 TSS---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ S +IG+ QQQ + ++L NS IGF +C
Sbjct: 393 SDELGISSFLIGHHQQQNVWMEYDLENSRIGFAELRC 429
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 77/246 (31%), Positives = 115/246 (46%), Gaps = 28/246 (11%)
Query: 17 NIAIGCGHNNEGLFVG-----AAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
I GCG N+ F G++GLG G LS SQ+ FSYCL+ S+S S
Sbjct: 207 KICFGCGFQNK--FTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNS 264
Query: 69 TLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
L+F + V+ PL+ +L FYYL L GI+VG + +T +G
Sbjct: 265 KLKFGEAAIVQGNGVVSTPLIIKPDL-PFYYLNLEGITVGAKTVKTGQT--------DGN 315
Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVEVPTVSF 184
II+DSG+ +T L+ YN + V+ T A+ + FD C+ + S P V F
Sbjct: 316 IIIDSGSTLTYLEESFYNEFV-SLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP-PDVVF 373
Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIG 243
HF G V+ L N L+ ++ N C P+ ++I GN+ Q V ++++ +
Sbjct: 374 HFTGGDVV-LKPMNTLVLIEDN-LICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVS 431
Query: 244 FTPNKC 249
F P C
Sbjct: 432 FAPTDC 437
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 122/273 (44%), Gaps = 35/273 (12%)
Query: 1 GDFVTETVTLGSASVDNI------AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS- 53
G + +T+ + A+ D + GCG +GL G G+L L GSLSFPSQI
Sbjct: 196 GRSLRDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKY 255
Query: 54 --TFSYCLVDRDSDST----------STLEFD---SSLPPNAVTAPLLRNHELDTFYYLG 98
FSYCL+ + + ++ + +E S P P+ E +Y +
Sbjct: 256 GNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPI---GESSIYYTVR 312
Query: 99 LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
L GISVG L +S + F + I DSGT +T L + ++++ + +S
Sbjct: 313 LDGISVGNQRLDLSPSTFL--NGQDKPTIFDSGTTLTMLPSGVCDSIKQSLA---SMVSG 367
Query: 159 TDGVAL--FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
+ VA+ D C+ S +P ++FHF G NY+I D C F PT
Sbjct: 368 AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVI--DLGSLQCLIFVPT 425
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +SI GN+QQQ V ++ N IGF C
Sbjct: 426 -NEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 457
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 123/277 (44%), Gaps = 31/277 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++ +GS+ + + GC + N + GL+G+ GSLSF SQ+ FS
Sbjct: 120 GNLASDVFHIGSSDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFS 179
Query: 57 YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGDL 108
YC+ D L S+P N PL++ + L F Y + L GI V L
Sbjct: 180 YCISGTDFSGLLLLGESNLTWSVPLNY--TPLIQISTPLPYFDRVAYTVQLEGIKVLDKL 237
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
LPI ++ F+ D +G G +VDSGT T L YNALR AF+ T R L D V
Sbjct: 238 LPIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQ 297
Query: 163 ALFDTCY--DFSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAP 215
D CY S R +PTV+ F G + + Y +P + G C +F
Sbjct: 298 GAMDLCYLVPLSQRVLPLLPTVTLVF-RGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGN 356
Query: 216 TS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +IG+ QQ + F+L S IG +C
Sbjct: 357 SDLLGVEAYVIGHHHQQNVWMEFDLEKSRIGLAQVRC 393
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 16/199 (8%)
Query: 55 FSYCLVDRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
FSYCL+ S+STS L+F S V+ PL+ +FY+L L +++G ++P
Sbjct: 248 FSYCLLPFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPT 307
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
T +G II+DSGT +T L+ YN + S D F C+ +
Sbjct: 308 GRT--------DGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPY 359
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQG 230
+ +P ++F F G + L KN LI + C A P+S S +SI GNV Q
Sbjct: 360 RDMT---IPVIAFQF-TGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFD 415
Query: 231 TRVSFNLRNSLIGFTPNKC 249
+V ++L + F P C
Sbjct: 416 FQVVYDLEGKKVSFAPTDC 434
>gi|449444520|ref|XP_004140022.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 229
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/232 (28%), Positives = 105/232 (45%), Gaps = 25/232 (10%)
Query: 37 LGLGGGSLSFPSQINAST--FSYCLVDRDSDSTSTLEF--------------DSSLPPNA 80
LG SL++ + NA+ FSYCLVD +D + F + LP
Sbjct: 3 LGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKM 62
Query: 81 VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTE 140
L +FY + L GIS G +L I + I+ G G I+DSGT++T L
Sbjct: 63 TYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGG--GTIIDSGTSLTILAAP 120
Query: 141 TYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
++ + +A + + + FD C++ S + P + FHF +G V P K+Y+
Sbjct: 121 AFDMVMEALTPRLKKFQQLE-IEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYI 179
Query: 201 IPVDSNGTF--CFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ V G F C F + +IIGN+ QQ F+ + +GF P++C
Sbjct: 180 VSV---GKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC 228
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 121/277 (43%), Gaps = 31/277 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++T +G++ + + GC + N GL+G+ GSLSF SQ+ FS
Sbjct: 123 GNLASDTFHMGASDIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPKFS 182
Query: 57 YCLVDRDSDSTSTL---EFDSSLPPN-----AVTAPLLRNHELDTFYYLGLTGISVGGDL 108
YC+ D L F ++P N ++ PL + Y + L GI V L
Sbjct: 183 YCISGTDFSGMLLLGESNFTWAVPLNYTPLVQISTPLPYFDRIA--YTVQLEGIKVSDRL 240
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
LPI ++ F+ D +G G +VDSGT T L Y ALR F+ T R L D V
Sbjct: 241 LPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQ 300
Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAP 215
D CY S R +PTVS F G + + + Y +P + G C +F
Sbjct: 301 GAMDLCYRVPISQRVLPRLPTVSLVF-NGAEMTVADERVLYRVPGEIRGNDSVHCLSFGN 359
Query: 216 TS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +IG+ QQ + F+L S IG +C
Sbjct: 360 SDLLGVEAYVIGHHHQQNVWMEFDLERSRIGLAQVRC 396
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 111/243 (45%), Gaps = 26/243 (10%)
Query: 33 AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD-SSLPPNAVTAPLLR-NHE 90
A GLLG+ GSLSF +Q F+YC+ D L D ++L P PL++ +
Sbjct: 197 ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVLGGDGAALAPQLNYTPLIQISRP 256
Query: 91 LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
L F Y + L GI VG LLPI ++ D +G G +VDSGT T L + Y L+
Sbjct: 257 LPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLK 316
Query: 147 DAFVRGTRALSPTDGVA--LFDTCYDFSSRSS-VEVPTVSFHFPEGKVLPLPAKN----- 198
F+ T AL G + +F +D R+S V S PE ++ A+
Sbjct: 317 GEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASXMLPEVGLVLRGAEVAVGGE 376
Query: 199 ---YLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
Y +P + G +C F + S +IG+ QQ V ++L+N +GF P
Sbjct: 377 KLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAP 436
Query: 247 NKC 249
+C
Sbjct: 437 ARC 439
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 127/280 (45%), Gaps = 34/280 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ET +GS + GC + N + GL+G+ GSLSF +Q+ S FS
Sbjct: 155 GNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFS 214
Query: 57 YCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
YC+ DS S L D+S L P T +L++ L F Y + L GI VG +L
Sbjct: 215 YCISGSDS-SGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKIL 273
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
+ ++ F D +G G +VDSGT T L Y AL++ F+ T R + D V
Sbjct: 274 SLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQG 333
Query: 164 LFDTCYDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGT------FCFAFA 214
D CY S + +P VS F G + + + L V+ G+ +CF F
Sbjct: 334 TMDLCYKVGSTTRPNFSGLPMVSLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG 392
Query: 215 PTSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPN-KC 249
S L I IG+ QQ + F+L S +GF N +C
Sbjct: 393 -NSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 431
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/241 (31%), Positives = 113/241 (46%), Gaps = 31/241 (12%)
Query: 35 GLLGLGGGSLSFPSQINASTFSYCLVDRDSD--------STSTLEFDSSLPPNAVTAPLL 86
GL+G+ GSLSF +Q+ FSYC+ +DS S S L+ P ++ PL
Sbjct: 441 GLIGMNRGSLSFVTQMGLQKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQISTPLP 500
Query: 87 RNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
+ Y + L GI V +L + ++ + D +G G +VDSGT T L Y AL+
Sbjct: 501 YFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 558
Query: 147 DAFVRGTRA----LSPTDGV--ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
+ FVR T+A L + V D CY + R+ +PTV+ F G + + A+
Sbjct: 559 NEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF-RGAEMSVSAER 617
Query: 199 YLIPVD-----SNGTFCFAFAPTSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNK 248
+ V S+ +CF F +S L IIG+ QQ + F+L S +GF +
Sbjct: 618 LMYRVPGVIRGSDSVYCFTFG--NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVR 675
Query: 249 C 249
C
Sbjct: 676 C 676
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/262 (29%), Positives = 117/262 (44%), Gaps = 26/262 (9%)
Query: 4 VTETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR 62
V ET G++ + ++ GCGHN + G G+LGL G S ++I FSYC+ D
Sbjct: 192 VFETTDEGTSRIPDVLFGCGHNIGQDTDPGHNGILGLNNGPDSLATKI-GQKFSYCIGDL 250
Query: 63 DSD--STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
+ L + P + + FYY+ + GISVG L I+ F++ +
Sbjct: 251 ADPYYNYHQLILGEGADLEGYSTPFEVH---NGFYYVTMEGISVGEKRLDIAPETFEMKK 307
Query: 121 SGNGGIIVDSGTAVT--------RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+ GG+I+D+G+ +T L E N L +F + T SP Y
Sbjct: 308 NRTGGVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSP-----WMQCFYGSI 362
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
SR V P V+FHF +G L L + ++ ++ N FC P S S S+IG +
Sbjct: 363 SRDLVGFPVVTFHFADGADLALDSGSFFNQLNDN-VFCMTVGPVSSLNLKSKPSLIGLLA 421
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ V ++L N + F C
Sbjct: 422 QQSYSVGYDLVNQFVYFQRIDC 443
>gi|222624328|gb|EEE58460.1| hypothetical protein OsJ_09701 [Oryza sativa Japonica Group]
Length = 360
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 107/254 (42%), Gaps = 42/254 (16%)
Query: 3 FVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
++T+ LG ++ N GC + G + GLLGLG G ++ SQ +
Sbjct: 141 LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGS-------- 192
Query: 61 DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYL-GLTGISVGGDLLPISETAFKID 119
++ LP L EL L +G G +F D
Sbjct: 193 ----------LYNGRLP--------LLPPELQVILLLRACSGFPAG---------SFAFD 225
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
+ G +VDSGT +TR Y ALR+ F R A S + FDTC++ ++
Sbjct: 226 AATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGA 285
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRVSF 235
P V+ H G L LP +N LI + C A A +S +++I N+QQQ RV F
Sbjct: 286 PAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVF 345
Query: 236 NLRNSLIGFTPNKC 249
++ NS +GF C
Sbjct: 346 DVANSRVGFAKESC 359
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/268 (24%), Positives = 111/268 (41%), Gaps = 21/268 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G F+ E + ++ +GC + + + L G G S P Q+ F+YCL
Sbjct: 193 GFFLLENLDFPGKTIHKFLVGCTTSADRE-PSSDALAGFGRTMFSLPMQMGVKKFAYCLN 251
Query: 61 DRDSDSTST-----LEFDSSLPPNAVTAPLLRNH-ELDTFYYLGLTGISVGGDLLPISET 114
D D T L++ AP +N + +YYLG+ + +G +L I
Sbjct: 252 SHDYDDTRNSGKLILDYSDGETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGK 311
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETY----NALRDAFVRGTRALSPTDGVALFDTCYD 170
GG+++DSG A + + + N L+ + R+L + CY+
Sbjct: 312 YLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLE-LEAQTGVTPCYN 370
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF---APTSS------SLS 221
F+ S+++P + + F G + +P NY + CF +PTS+
Sbjct: 371 FTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSI 430
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN QQ V F+L+N +GF C
Sbjct: 431 ILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 126/267 (47%), Gaps = 31/267 (11%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G ET+T S+S GCG N G F GLLGLG GSLS SQ + FS
Sbjct: 199 GVLARETLTFSSSSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFS 258
Query: 57 YCLVDRDSD----STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
YCL ++ S +P ++ + +FY++ L I++GG +LP+
Sbjct: 259 YCLPSYNTTPGYLSIGATPVTGQIPVQYTA--MVNKPDYPSFYFIELVSINIGGYVLPVP 316
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCY 169
+ F G ++DSGT +T L Y ALRD F ++G++ P D + DTCY
Sbjct: 317 PSEFT-----KTGTLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDEL---DTCY 368
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL----IPVDSN---GTFCFAFAPTSSSLSI 222
DF+ +S + +P VSF+F +G V L N+ P D+ G F P S+
Sbjct: 369 DFTGQSGILIPGVSFNFSDGAVFNL---NFFGIMTFPDDTKPAVGCLAFVSRPADMPFSV 425
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+G+ Q+ V +++ IGF P C
Sbjct: 426 VGSTTQRSAEVIYDVPAQKIGFIPASC 452
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 111/243 (45%), Gaps = 26/243 (10%)
Query: 33 AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD-SSLPPNAVTAPLLR-NHE 90
A GLLG+ GSLSF +Q F+YC+ D L D ++L P PL++ +
Sbjct: 199 ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVLGGDGAALAPQLNYTPLIQISRP 258
Query: 91 LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
L F Y + L GI VG LLPI ++ D +G G +VDSGT T L + Y L+
Sbjct: 259 LPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLK 318
Query: 147 DAFVRGTRALSPTDGVA--LFDTCYDFSSRSS-VEVPTVSFHFPEGKVLPLPAKN----- 198
F+ T AL G + +F +D R+S V S PE ++ A+
Sbjct: 319 GEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASQMLPEVGLVLRGAEVAVGGE 378
Query: 199 ---YLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
Y +P + G +C F + S +IG+ QQ V ++L+N +GF P
Sbjct: 379 KLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAP 438
Query: 247 NKC 249
+C
Sbjct: 439 ARC 441
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 103/234 (44%), Gaps = 10/234 (4%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G + + D + GC EG G++GLG G LS SQ+ FSY L
Sbjct: 177 GLLAVDAFAFATVRADGVIFGCAVATEGDI---GGVIGLGRGELSPVSQLQIGRFSYYLA 233
Query: 61 DRDS-DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
D+ D S + F P AV+ PL+ + + YY+ L GI V G+ L I F
Sbjct: 234 PDDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTF 293
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
+ G+GG+++ VT L Y +R A L DG L D CY S +
Sbjct: 294 DLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIE-LRAADGSELGLDLCYTSESLA 352
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQ 228
+ +VP+++ F G V+ L NY + G C P+ + S++G++ Q
Sbjct: 353 TAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQ 406
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 79/254 (31%), Positives = 119/254 (46%), Gaps = 29/254 (11%)
Query: 13 ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQIN----ASTFSYCLVD--RDSD 65
S+ GCGHNN G F GL+GLGGG S SQI FS CLV D
Sbjct: 71 VSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIK 130
Query: 66 STSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
+S + F L VT PL++ + T Y++ L GISV LP++ T K
Sbjct: 131 ISSRMSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEK----- 185
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-----PTDGVALFDTCYDFSSRSSV 177
G ++VDSGT L + Y+ + V+ L P+ G L CY +++++
Sbjct: 186 -GNMLVDSGTPPNILPQQLYDRVY-VEVKNNVPLELITNDPSLGPQL---CY--RTQTNL 238
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPV-DSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSF 235
+ PT+++HF +L P + ++ P ++ G FC A T+S+ + GN Q + F
Sbjct: 239 KGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGF 298
Query: 236 NLRNSLIGFTPNKC 249
+L ++ F C
Sbjct: 299 DLDRQVVSFKATDC 312
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/255 (29%), Positives = 122/255 (47%), Gaps = 23/255 (9%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + + +GC + +G F + G+L LG ++SF S+ A FSYCLVD ++
Sbjct: 229 AKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 288
Query: 67 TSTLEFDSSLPPNAVTA-----------PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
TS L F P A PLL + + FY + + + V G+ L I
Sbjct: 289 TSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADV 348
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
+ D + GG I+DSGT++T L T Y A+ A L P + F+ CY++++ +
Sbjct: 349 W--DVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGL-PRVSMDPFEYCYNWTA-A 404
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVS 234
++E+P + F L PAK+Y++ + G C + +S+IGN+ QQ
Sbjct: 405 ALEIPGLEVRFAGSARLQPPAKSYVVDA-APGVKCIGVQEGAWPGVSVIGNILQQDHLWE 463
Query: 235 FNLRNSLIGFTPNKC 249
F+LR+ + F +C
Sbjct: 464 FDLRDRWLRFKHTRC 478
>gi|361068717|gb|AEW08670.1| Pinus taeda anonymous locus CL1136Contig1_03 genomic sequence
Length = 70
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/69 (59%), Positives = 51/69 (73%)
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
G +LFDTCYD S +V+VPT+ FHF + LPA NYLIPVD++ FCF+FA +S L
Sbjct: 2 GFSLFDTCYDLSVLKTVKVPTLVFHFQGRADVSLPATNYLIPVDTSAIFCFSFAGNTSGL 61
Query: 221 SIIGNVQQQ 229
SIIGN+QQQ
Sbjct: 62 SIIGNIQQQ 70
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 126/277 (45%), Gaps = 31/277 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ET +GS + GC +N GL+G+ GSLSF +Q+ FS
Sbjct: 156 GNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKFS 215
Query: 57 YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
YC+ DRDS L F S L P T + + L F Y + L GI V +L
Sbjct: 216 YCISDRDSSGVLLLGEASF-SWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVL 274
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
+ ++ F D +G G +VDSGT T L Y+AL+ F+ T R L+ V
Sbjct: 275 SLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQG 334
Query: 164 LFDTCYDFS-SRSSV-EVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAPT 216
D CY +R+++ +P V+ F G + + + Y +P + G +CF F
Sbjct: 335 AMDLCYLIEPTRAALPNLPVVNLMF-RGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFG-N 392
Query: 217 SSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S SL I IG+ QQQ + ++L S IGF +C
Sbjct: 393 SDSLGIESFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 87.8 bits (216), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 109/218 (50%), Gaps = 8/218 (3%)
Query: 35 GLLGLGGGSLSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAVTAPLLRNHELDT 93
G +GL LS SQ+ FSYCLV ++ STS + F S + PLL +
Sbjct: 209 GNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSTSKMYFGSLPVTSGGQTPLLYPNS--D 266
Query: 94 FYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-G 152
YY+ + GIS+G D P + F + E +G II D+G + L+T+ +++L F+
Sbjct: 267 AYYVKVLGISIGNDE-PHFDGVFDVYEVRDGWII-DTGITYSSLETDAFDSLLAKFLTLK 324
Query: 153 TRALSPTDGVALFDTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
D F+ C++ + + +E P V+ HF +G L L ++ + ++ +G FC
Sbjct: 325 DFPQRKDDPKERFELCFELQNANDLESFPDVTVHF-DGADLILNVESTFVKIEDDGIFCL 383
Query: 212 AFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
A + S +SI+GN Q Q V ++L +I F P C
Sbjct: 384 ALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDC 421
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 103/256 (40%), Gaps = 54/256 (21%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLG--GGSLSFPSQINASTFSYC 58
G T+TV LG ASVD GCG +N GLF G AGL+GLG G P
Sbjct: 266 GVLATDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGPDGALAGLP----------- 314
Query: 59 LVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
D + PP FY++ +TG SV A
Sbjct: 315 --------------DGAPPP---------------FYFMNVTGASV-------GGAAVAA 338
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDFSSRSS 176
G +++DSGT +TRL Y A+R F R G +L D CY+ +
Sbjct: 339 AGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDE 398
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNVQQQGTRV 233
V+VP ++ G + + A L +G+ C A A S IIGN QQ+ RV
Sbjct: 399 VKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRV 458
Query: 234 SFNLRNSLIGFTPNKC 249
++ S +GF C
Sbjct: 459 VYDTVGSRLGFADEDC 474
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 115/262 (43%), Gaps = 26/262 (9%)
Query: 4 VTETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR 62
V ET G++ + ++ GCGHN G G+LGL G S +++ FSYC+ +
Sbjct: 191 VFETTDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKL-GQKFSYCIGNL 249
Query: 63 DSD--STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
+ L + P + FYY+ + GISVG L I+ F++ E
Sbjct: 250 ADPYYNYHQLILGEGADLEGYSTPF---EVYNGFYYVTMEGISVGEKRLDIAPETFEMKE 306
Query: 121 SGNGGIIVDSGTAVT--------RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+ GG+I+D+G+ +T L E N L +F + T SP Y
Sbjct: 307 NRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSP-----WMQCFYGSI 361
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
SR V P V+FHF +G L L + ++ ++ N FC P S S S+IG +
Sbjct: 362 SRDLVGFPVVTFHFSDGADLALDSGSFFNQLNDN-VFCMTVGPVSSLNIKSKPSLIGLLA 420
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQ V ++L N + F C
Sbjct: 421 QQSYNVGYDLVNQFVYFQRIDC 442
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 111/272 (40%), Gaps = 38/272 (13%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS-- 66
S +V+N C H VG AG G G LS P+Q+ S FSYCLV +
Sbjct: 215 SMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSYCLVAHSFRADR 271
Query: 67 ---TSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
+S L S A+ A PLL N + FY + L +SVGG +
Sbjct: 272 LIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPEL 331
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDTCYD 170
+D GNGG++VDSGT T L ++T+ + D F R A + CY
Sbjct: 332 GDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYH 391
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSS-------- 219
+S S VP V+ HF + LP +NY + S C +
Sbjct: 392 YSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGG 450
Query: 220 --LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+GN QQQG V +++ +GF +C
Sbjct: 451 GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 130/264 (49%), Gaps = 29/264 (10%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINAST 54
GD +++TL S S NI IGCGH N ++G++G+G G +S Q+ +S+
Sbjct: 180 GDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSS 239
Query: 55 ----FSYCLV--DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVG 105
FSYCL+ + DS+S+S L F + + V+ P+++ + + +Y+L L SVG
Sbjct: 240 VGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVG 299
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV 162
+ + E + + I++DSGT +T L + L V+ R P +
Sbjct: 300 NNRIEYGERS----NASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHL 355
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
+L CY+ + + + VP ++ HF G + L + P + +G CF F +S+ L I
Sbjct: 356 SL---CYNTTGKQ-LNVPDITAHF-NGADVKLNSNGTFFPFE-DGIMCFGFI-SSNGLEI 408
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTP 246
GN+ Q + ++L +I F P
Sbjct: 409 FGNIAQNNLLIDYDLEKEIISFKP 432
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/262 (32%), Positives = 130/262 (49%), Gaps = 20/262 (7%)
Query: 1 GDFVTETVTLGSA--SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINA---ST 54
G TE +GS S+ +A GCG++N G F +G++GLGGGSLS SQ+ +
Sbjct: 187 GYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNK 246
Query: 55 FSYCLV---DRDSDSTSTLEF-DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGD 107
FSYCLV ++ + S + F D+S + V+ PL+ + E +TFYYL L ISVG +
Sbjct: 247 FSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLV-SKEPETFYYLTLEAISVGNE 305
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
L E + G II+DSGT +T L ++ YN L + +D +F
Sbjct: 306 RLAY-ENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSI 364
Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
C F + +E+P ++ HF + V P + + CF P S+ ++I GN+
Sbjct: 365 C--FRDKIGIELPIITVHFTDADVELKPINTFAKAEED--LLCFTMIP-SNGIAIFGNLA 419
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q V ++L + + F P C
Sbjct: 420 QMNFLVGYDLDKNCVSFMPTDC 441
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 111/273 (40%), Gaps = 38/273 (13%)
Query: 11 GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS- 66
S +V+N C H VG AG G G LS P+Q+ S FSYCLV +
Sbjct: 214 ASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSYCLVAHSFRAD 270
Query: 67 ----TSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
+S L S A+ A PLL N + FY + L +SVGG +
Sbjct: 271 RLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPE 330
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDTCY 169
+D GNGG++VDSGT T L ++T+ + D F R A + CY
Sbjct: 331 LGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCY 390
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSS------- 219
+S S VP V+ HF + LP +NY + S C +
Sbjct: 391 HYSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDG 449
Query: 220 ---LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+GN QQQG V +++ +GF +C
Sbjct: 450 GGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 114/262 (43%), Gaps = 23/262 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G T+ V +G+A+ +A GC +E G++G +GLG +LS +Q+NA+ FSYCL
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCL 200
Query: 60 VDRDSDSTSTLEFDSSLP-----PNAVTAPLLR-----NHELDTFYYLGLTGISVGGDLL 109
D+ +S L +S A T P ++ N L Y L L I G
Sbjct: 201 APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAG---- 256
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ +SGN I V + T VT L Y LR A A V +D C+
Sbjct: 257 ---NATIAMPQSGN-TITVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCF 312
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNVQ 227
+S S P + F G + +P +YL N T C A +P +SI+G++Q
Sbjct: 313 PKASASG-GAPDLVLAFQGGAEMTVPVSSYLFDA-GNDTACVAILGSPALGGVSILGSLQ 370
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q + F+L + F P C
Sbjct: 371 QVNIHLLFDLDKETLSFEPADC 392
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 82/171 (47%), Gaps = 18/171 (10%)
Query: 84 PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
PL+RN + T Y + L GI VGG L + F GG ++DS +T+L Y
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAY 320
Query: 143 NALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
ALR AF R A P G A DTCYDF +SV VP VS F G V+ L A +
Sbjct: 321 RALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM 379
Query: 201 IPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ C AF PT +L IGNVQQQ V +++ +GF C
Sbjct: 380 V------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 82/171 (47%), Gaps = 18/171 (10%)
Query: 84 PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
PL+RN + T Y + L GI VGG L + F GG ++DS +T+L Y
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAY 320
Query: 143 NALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
ALR AF R A P G A DTCYDF +SV VP VS F G V+ L A +
Sbjct: 321 RALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM 379
Query: 201 IPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ C AF PT +L IGNVQQQ V +++ +GF C
Sbjct: 380 V------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|357444933|ref|XP_003592744.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
gi|355481792|gb|AES62995.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
Length = 65
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/65 (63%), Positives = 50/65 (76%)
Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
+F G +L LPA+N+LIPVDS GTFCFAFAP+SS LSIIGN+QQ+G +S + N IGF
Sbjct: 1 YFLGGPILTLPARNFLIPVDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGF 60
Query: 245 TPNKC 249
PN C
Sbjct: 61 GPNIC 65
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 108/242 (44%), Gaps = 22/242 (9%)
Query: 17 NIAIGCGHNNEG-LFVGAA--GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTS 68
++ +GCG G F GAA G++GLG G +S PS + + FS C + DS
Sbjct: 230 SVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAKAGLIQNCFSLCFDENDS---G 286
Query: 69 TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
+ F + + P L Y++G+ VG L + FK +V
Sbjct: 287 RILFGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGNSCL--KRSGFKA--------LV 336
Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
DSG++ T L +E YN L F + A + L+D CY+ SS+ ++P + FP
Sbjct: 337 DSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYNASSQELHDIPAIQLKFPR 396
Query: 189 GKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
+ + Y IP T FC + PT S IIG G R+ F++ N +G++ +
Sbjct: 397 NQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGSYGIIGQNFMIGYRMVFDIENLKLGWSNS 456
Query: 248 KC 249
C
Sbjct: 457 SC 458
>gi|383128174|gb|AFG44740.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
Length = 103
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 5/102 (4%)
Query: 85 LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L+ N ++Y++ L GISVGG L I+ G GG IVDSGT +TRL + YNA
Sbjct: 1 LVSNSIYTSYYFVVLNGISVGGQRLSITPAVL-----GKGGTIVDSGTIITRLVPQAYNA 55
Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
L+ +F T+ L + ++ DTCYD SS S V VP V+FHF
Sbjct: 56 LKTSFRSQTQNLPSAEPYSILDTCYDLSSYSQVRVPIVTFHF 97
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 82/171 (47%), Gaps = 18/171 (10%)
Query: 84 PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
PL+RN + T Y + L GI VGG L + F GG ++DS +T+L Y
Sbjct: 285 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAY 338
Query: 143 NALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
ALR AF R A P G A DTCYDF +SV VP VS F G V+ L A +
Sbjct: 339 RALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM 397
Query: 201 IPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ C AF PT +L IGNVQQQ V +++ +GF C
Sbjct: 398 V------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 24/182 (13%)
Query: 14 SVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
+ + GCGH N+G+F G+ G G G S PSQ+NA++FSYC +S +
Sbjct: 196 ATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKSSIVTL 255
Query: 73 DSSLPPNAV----------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
+ P A+ T PL +N + Y+L L GISVG LP+ ET F+
Sbjct: 256 GGA--PAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR----- 308
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVP 180
I+DSG ++T L E Y A++ F L P+ +G AL D C+ + P
Sbjct: 309 --STIIDSGASITTLPEEVYEAVKAEFA-AQVGLPPSGVEGSAL-DVCFALPVSALWRRP 364
Query: 181 TV 182
V
Sbjct: 365 AV 366
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 116/264 (43%), Gaps = 30/264 (11%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TF 55
G + ++ +T+ S V+ GC N +G F A G++ LG G S +Q +++ F
Sbjct: 238 GTYSSDVLTINSGDRVEGFRFGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAF 297
Query: 56 SYCLVDRDSDSTSTLEFDSSLPPNA----VTAPLLRNH-----ELDTFYYLGLTGISVGG 106
SYCL + T+ F +P A VT P+L+ T Y L I+V G
Sbjct: 298 SYCLPPTE---TTKGFFQIGVPIGASYRFVTTPMLKERGGASAAAATLYRALLLAITVDG 354
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALF 165
L + F G ++DS T +TRL Y ALR AF R ++P
Sbjct: 355 KELNVPAEVFA------AGTVMDSRTIITRLPVTAYGALRAAFRNRMRYRVAPPQ--EEL 406
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
DTCYD + +P ++ F V+ + L+ NG FA SS SI+GN
Sbjct: 407 DTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILL----NGCLAFASNDDDSSPSILGN 462
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
VQQQ +V ++ IGF C
Sbjct: 463 VQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|383128168|gb|AFG44737.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128170|gb|AFG44738.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128172|gb|AFG44739.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128176|gb|AFG44741.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128178|gb|AFG44742.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128180|gb|AFG44743.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128182|gb|AFG44744.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128184|gb|AFG44745.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128186|gb|AFG44746.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128188|gb|AFG44747.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128190|gb|AFG44748.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128192|gb|AFG44749.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128194|gb|AFG44750.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128196|gb|AFG44751.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128198|gb|AFG44752.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
gi|383128200|gb|AFG44753.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
Length = 103
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 5/102 (4%)
Query: 85 LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L+ N ++Y++ L GISVGG L I+ G GG IVDSGT +TRL + YNA
Sbjct: 1 LVSNSIYTSYYFVVLNGISVGGQRLSITPAVL-----GRGGTIVDSGTIITRLVPQAYNA 55
Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
L+ +F T+ L + ++ DTCYD SS S V VP V+FHF
Sbjct: 56 LKTSFRSQTQNLPSAEPYSILDTCYDLSSYSQVRVPIVTFHF 97
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 77/262 (29%), Positives = 115/262 (43%), Gaps = 23/262 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G T+ V +G+A+ +A GC +E G++G +GLG +LS +Q+NA+ FSYCL
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCL 200
Query: 60 VDRDSDSTSTLEFDSSLP-----PNAVTAPLLR-----NHELDTFYYLGLTGISVGGDLL 109
D+ +S L +S A T P ++ + L Y L L I G
Sbjct: 201 APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAG---- 256
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ +SGN I+V + T VT L Y LR A A V +D C+
Sbjct: 257 ---NATIAMPQSGN-TIMVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCF 312
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNVQ 227
+S S P + F G + +P +YL N T C A +P +SI+G++Q
Sbjct: 313 PKASASG-GAPDLVLAFQGGAEMTVPVSSYLFDA-GNDTACVAILGSPALGGVSILGSLQ 370
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q + F+L + F P C
Sbjct: 371 QVNIHLLFDLDKETLSFEPADC 392
>gi|295830689|gb|ADG39013.1| AT5G10770-like protein [Neslia paniculata]
Length = 159
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 81/161 (50%), Gaps = 10/161 (6%)
Query: 44 LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
LSFPSQ + FSYCL + T L F S+ +V P+ + +FY L +
Sbjct: 1 LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLSI 59
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
I+VGG LPI T F G ++DSGT +TRL + Y ALR F T
Sbjct: 60 VAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSEFKAKMSKYPTT 114
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
GV++ DTC+D S +V +P V+F F G V+ L +K L
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIL 155
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 24/260 (9%)
Query: 14 SVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
+V N+ GCG N+G+F +G+ G G +S PSQ+ + FS+C TS +
Sbjct: 216 AVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFSHCFTAIADARTSPVFL 275
Query: 73 DSSLPPNAV----TAPLLRN---HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
+ P+ + T P+ + + YYL L GI+VG LP++ AF +G+G
Sbjct: 276 GGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTGSGS 335
Query: 126 I--IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
I+DSGT + L Y +LR AFV + + A ++ F + S +P +
Sbjct: 336 GGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFEAARSASLPPEA 395
Query: 184 FHFPEGKVL--------PLPAKNYLIPV--DSNGT---FCFAF-APTSSSLSIIGNVQQQ 229
KV+ LP ++Y++ + D +G+ C + S L+IIGN QQQ
Sbjct: 396 PAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGNFQQQ 455
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
V+++L + + F P +C
Sbjct: 456 NMHVAYDLEKNKLVFVPARC 475
>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
Length = 555
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 122/266 (45%), Gaps = 24/266 (9%)
Query: 5 TETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINA-----STFSY 57
T TV+ G A + + +GC G V A G+L LG G +SF I+A FS+
Sbjct: 265 TVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSF--AIHAVLRFGGRFSF 322
Query: 58 CLVDRDS--DSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CL+ +S D++S L F + + P + +L N ++ Y +T + VGG+ L I
Sbjct: 323 CLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVGGERLDIP 382
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+ + ID+ G+I+D+ T+VT L E Y L A R L P + A F+ CY ++
Sbjct: 383 DDVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLAHL-PRESFAGFEYCYRWT 441
Query: 173 -------SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSII 223
+V +P V+ G L AK+ ++P +G C AF P II
Sbjct: 442 FTGDGVDPAHNVTIPKVTVEMTGGARLEPEAKSVVMPEVGHGVACLAFRKLPWGGGPCII 501
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNV Q + + F +KC
Sbjct: 502 GNVLMQEYIWEIDHSKATFRFRKDKC 527
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 114/269 (42%), Gaps = 36/269 (13%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
G TET+TL S S + IGCGHNN +G++GL G S +Q+
Sbjct: 139 GTLATETITLHSTSGEPFVMPETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYP 198
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
SYC TS + F + NA+ A + FYYL L +SVG
Sbjct: 199 GLMSYCF---SGQGTSKINFGA----NAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVG 251
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
+ T F E G I++DSGT +T N +R A V RA PT
Sbjct: 252 NTRIETMGTTFHALE---GNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGND 308
Query: 163 ALFDTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSL 220
L CY+ ++++ P ++ HF G L L N + ++ G FC A S +
Sbjct: 309 ML---CYN---SDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQE 362
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I GN Q V ++ + L+ F+P C
Sbjct: 363 AIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 31/245 (12%)
Query: 33 AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHEL 91
A GLLG+ G+LSF +Q F+YC+ + L D + P PL+ + L
Sbjct: 179 ATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPL 238
Query: 92 DTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
F Y + L GI VG LLPI ++ D +G G +VDSGT T L + Y AL+
Sbjct: 239 PYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKA 298
Query: 148 AFVRGTRALSPTDG------VALFDTCYDFSSRSSVEVPTVSFHFPE------GKVLPLP 195
F R L G FD C+ V S PE G + +
Sbjct: 299 EFTSQARLLLAPLGEPGFVFQGAFDACF---RGPEARVAAASGLLPEVGLVLRGAEVAVS 355
Query: 196 AKN--YLIPVDSNG------TFCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
+ Y++P + G +C F + S +IG+ QQ V ++L+N +GF
Sbjct: 356 GEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGF 415
Query: 245 TPNKC 249
P +C
Sbjct: 416 APARC 420
>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
Length = 330
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G +++T+ +V N IGC + + +GL G G G+ S PSQ+ + FSYCL+
Sbjct: 43 GLLISDTLRTPGRAVRNFVIGCSLAS--VHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 100
Query: 61 DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
R D + + + L APL R+ +YYL LT I+VGG
Sbjct: 101 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 160
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
+ + E AF + GG IVDSGT + + + A V R +R+ +G+
Sbjct: 161 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 219
Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNG------TFCFAF 213
L C+ ++E+P +S HF G V+ LP +NY + P S G C A
Sbjct: 220 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 278
Query: 214 ---APTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
PTSS + I+G+ QQQ + ++L +GF +C
Sbjct: 279 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 326
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 86.7 bits (213), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 112/281 (39%), Gaps = 51/281 (18%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSD 65
S S+ N GC H VG AG G G LS P+Q+ + + FSYCLV +
Sbjct: 211 SLSLHNFTFGCAHTALAEPVGVAGF---GRGVLSLPAQLASFAPQLGNRFSYCLVSHSFN 267
Query: 66 STSTLEFDSSL---------------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
S L S L V +L N + FY +GL GIS+G +P
Sbjct: 268 S-DRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIP 326
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-----RALSPTDGVALF 165
E ++D G+GG++VDSGT T L YN++ F RA D L
Sbjct: 327 APEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGL- 385
Query: 166 DTCYDFSSRSSVEVPTVSFHF--PEGKVLPLPAKNYLIPVDSNG--------TFCFAFAP 215
CY + + V +P++ HF E V+ LP KNY G C
Sbjct: 386 GPCYYYD--TVVNIPSLVLHFVGNESSVV-LPKKNYFYDFLDGGDGVRRKRRVGCLMLMN 442
Query: 216 TSSSLSI-------IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +GN QQ G V ++L +GF KC
Sbjct: 443 GGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 31/245 (12%)
Query: 33 AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHEL 91
A GLLG+ G+LSF +Q F+YC+ + L D + P PL+ + L
Sbjct: 195 ATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPL 254
Query: 92 DTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
F Y + L GI VG LLPI ++ D +G G +VDSGT T L + Y AL+
Sbjct: 255 PYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKA 314
Query: 148 AFVRGTRALSPTDG------VALFDTCYDFSSRSSVEVPTVSFHFPE------GKVLPLP 195
F R L G FD C+ V S PE G + +
Sbjct: 315 EFTSQARLLLAPLGEPGFVFQGAFDACF---RGPEARVAAASGLLPEVGLVLRGAEVAVS 371
Query: 196 AKN--YLIPVDSNG------TFCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
+ Y++P + G +C F + S +IG+ QQ V ++L+N +GF
Sbjct: 372 GEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGF 431
Query: 245 TPNKC 249
P +C
Sbjct: 432 APARC 436
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 96/237 (40%), Gaps = 23/237 (9%)
Query: 35 GLLGLGGGSLSFPSQINASTFSYCLVDR---DSDSTSTLEFDS------SLPPNAVTAPL 85
G+ G G S PSQ+ FSYCLV D+ ++S L D+ + P
Sbjct: 223 GIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPF 282
Query: 86 LRN--HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
L+N +YY+ L I +G + + GNGG IVDSGT T ++ Y
Sbjct: 283 LKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYE 342
Query: 144 ALRDAFVRGT---RALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
+ F + + + CY+ S S+ VP + F F G + LP NY
Sbjct: 343 LVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYF 402
Query: 201 IPVDSNGTFCFAFAPTSSSLS--------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
VDS G C + + I+GN QQ+ V F+L N GF C
Sbjct: 403 SIVDS-GVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 102/242 (42%), Gaps = 27/242 (11%)
Query: 34 AGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTLEFDSSLPPNAV----------- 81
+G+ G G G S P Q+ FSYCL+ R DS + + + P++
Sbjct: 228 SGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTP 287
Query: 82 --TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQT 139
P+ N +YY+ L I VG + + + GNGG IVDSG+ T ++
Sbjct: 288 FRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEK 347
Query: 140 ETYNALRDAFVRG----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
+ A+ F R TRA + + ++ C++ S SV +P++ F F G + LP
Sbjct: 348 PVFEAVATEFDRQMANYTRA-ADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELP 406
Query: 196 AKNYLIPVDSNGTFCFAFAPTS---SSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPN 247
NY V C S+LS I+GN Q Q ++L N GF
Sbjct: 407 VANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQ 466
Query: 248 KC 249
+C
Sbjct: 467 RC 468
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G +++T+ +V N IGC + + +GL G G G+ S PSQ+ + FSYCL+
Sbjct: 204 GLLISDTLRTPGRAVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 261
Query: 61 DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
R D + + + L APL R+ +YYL LT I+VGG
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 321
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
+ + E AF + GG IVDSGT + + + A V R +R+ +G+
Sbjct: 322 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 380
Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNG------TFCFAF 213
L C+ ++E+P +S HF G V+ LP +NY + P S G C A
Sbjct: 381 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439
Query: 214 ---APTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
PTSS + I+G+ QQQ + ++L +GF +C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/246 (29%), Positives = 103/246 (41%), Gaps = 20/246 (8%)
Query: 11 GSASVDNIAIGCGHNNEGLF---VGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS 64
G A+ GC + F A G +GLG G LS SQ+ FSYC+V S
Sbjct: 195 GGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSS 254
Query: 65 DSTSTLEFDSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
ST L+F S P N V+ P + N ++Y L L GI+VG + +
Sbjct: 255 TSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIG-------- 306
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
G II+DS +T L+ Y + D F+ C + +++ P
Sbjct: 307 GNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFV 364
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIG 243
FHF V+ L KN I +D+N C P S +SI GN Q +V ++L +
Sbjct: 365 FHFTGADVV-LGPKNMFIALDNN-LVCMTVVP-SKGISIFGNWAQVNFQVEYDLGEKKVS 421
Query: 244 FTPNKC 249
F P C
Sbjct: 422 FAPTNC 427
>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
Length = 490
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G +++T+ +V N IGC + + +GL G G G+ S PSQ+ + FSYCL+
Sbjct: 203 GLLISDTLRTPGRAVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 260
Query: 61 DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
R D + + + L APL R+ +YYL LT I+VGG
Sbjct: 261 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 320
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
+ + E AF + GG IVDSGT + + + A V R +R+ +G+
Sbjct: 321 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 379
Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNG------TFCFAF 213
L C+ ++E+P +S HF G V+ LP +NY + P S G C A
Sbjct: 380 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 438
Query: 214 ---APTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
PTSS + I+G+ QQQ + ++L +GF +C
Sbjct: 439 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 486
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 106/280 (37%), Gaps = 35/280 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++ET+ + ++ + GC + G+ G G S P Q+ FSYCLV
Sbjct: 192 GLLLSETINFPNKTISDFLAGCSLLSTR---QPEGIAGFGRSQESLPLQLGLKKFSYCLV 248
Query: 61 DR---DSDSTSTLEFDS------------SLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
R DS +S L D S P N +YY+ L I VG
Sbjct: 249 SRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVG 308
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
+ + + GNGG IVDSG+ T ++ + L F + + V
Sbjct: 309 KTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKL 368
Query: 166 ---DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP------- 215
C+D S SV +P ++F F G + LP NY VD G C
Sbjct: 369 TGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDM-GVVCLTIVSDNAAALG 427
Query: 216 ------TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S I+GN QQQ + ++L N GF C
Sbjct: 428 GDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 105/242 (43%), Gaps = 25/242 (10%)
Query: 33 AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHEL 91
A GLLG+ G+LSF +Q F+YC+ + L D + P PL+ + L
Sbjct: 195 ATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPL 254
Query: 92 DTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
F Y + L GI VG LLPI ++ D +G G +VDSGT T L + Y AL+
Sbjct: 255 PYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKA 314
Query: 148 AFVRGTRALSPTDG------VALFDTCYDFS----SRSSVEVPTVSFHFPEGKVLPLPAK 197
F R L G FD C+ + +S +P V +V K
Sbjct: 315 EFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEK 374
Query: 198 -NYLIPVDSNG------TFCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
Y++P + G +C F + S +IG+ QQ V ++L+N +GF P
Sbjct: 375 LLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPA 434
Query: 248 KC 249
+C
Sbjct: 435 RC 436
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 117/261 (44%), Gaps = 24/261 (9%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLS-FPSQINAS- 53
G F +T+TLGS + NI IGCG NN F + + GG Q+ S
Sbjct: 184 GKFAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSI 243
Query: 54 --TFSYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
FSYCLV ++D TS + F ++ P V+ PL+ DTFYYL L ISVG
Sbjct: 244 DGKFSYCLVP-ENDQTSKINFGTNAVVSGPGTVSTPLVVKSR-DTFYYLTLKSISVGSKN 301
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
+ ++ K G +++DSGT +T L + Y + +A A D C
Sbjct: 302 MQTPDSNIK------GNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLC 355
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
Y+ + + + +P ++ HF EG + L N V + C AF + I GNV Q
Sbjct: 356 YN--ATADLNIPVITMHF-EGADVKLYPYNSFFKV-TEDLVCLAFGMSFYRNGIYGNVAQ 411
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
+ V ++ + + F P C
Sbjct: 412 KNFLVGYDTASKTMSFKPTDC 432
>gi|295830681|gb|ADG39009.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830683|gb|ADG39010.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830685|gb|ADG39011.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830687|gb|ADG39012.1| AT5G10770-like protein [Capsella grandiflora]
Length = 159
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 10/158 (6%)
Query: 44 LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
LSFPSQ + FSYCL + T L F S+ +V P+ + ++FY L +
Sbjct: 1 LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPIATISDGNSFYGLNI 59
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
GI+VGG L I T F G ++DSGT +TRL + Y ALR +F
Sbjct: 60 VGITVGGQKLAIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
GV++ DTC+D S +V +P V+F F G V+ L +K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSK 152
>gi|295830679|gb|ADG39008.1| AT5G10770-like protein [Capsella grandiflora]
Length = 159
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 10/158 (6%)
Query: 44 LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
LSFPSQ + FSYCL + T L F S+ +V P+ + ++FY L +
Sbjct: 1 LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPIXTISDGNSFYGLNI 59
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
GI+VGG L I T F G ++DSGT +TRL + Y ALR +F
Sbjct: 60 VGITVGGQKLAIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
GV++ DTC+D S +V +P V+F F G V+ L +K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSK 152
>gi|194689804|gb|ACF78986.1| unknown [Zea mays]
Length = 158
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 86/158 (54%), Gaps = 8/158 (5%)
Query: 92 DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
D+ Y++ +TGI V G L +S +A+ + I+DSGT +TRL T Y+AL A
Sbjct: 8 DSLYFIKMTGIKVAGKPLSVSSSAYSSLPT-----IIDSGTVITRLPTGVYSALSKAVAG 62
Query: 152 GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
+ ++ DTC+ + + VP V+ F G L L A+N L+ VDS T C
Sbjct: 63 AMKGTPRASAFSILDTCFQ-GQAARLRVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCL 120
Query: 212 AFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
AFAP S+ +IIGN QQQ V ++++NS IGF C
Sbjct: 121 AFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 157
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 124/281 (44%), Gaps = 36/281 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLF------VGAAGLLGLGGGSLSFPSQINAST 54
G+ +T +GS + GC + GL + GL+G+ GSLSF +Q+ S
Sbjct: 151 GNLAHDTFVIGSVTRPGTLFGC--MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSK 208
Query: 55 FSYCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDL 108
FSYC+ DS L S L P T +L+ L F Y + L GI VG +
Sbjct: 209 FSYCISGSDSSGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKI 268
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDG-----V 162
L + ++ F D +G G +VDSGT T L Y AL++ F+ T++ L D
Sbjct: 269 LSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQ 328
Query: 163 ALFDTCYDFSSRSS---VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT------FCFAF 213
D CY S + +P +S F G + + + L V+ G+ +CF F
Sbjct: 329 GTMDLCYRVGSSTRPNFTGLPVISLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF 387
Query: 214 APTSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPN-KC 249
S L I IG+ QQ + F+L S +GF N +C
Sbjct: 388 G-NSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 427
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G +++T+ +V N IGC + + +GL G G G+ S PSQ+ + FSYCL+
Sbjct: 204 GLLISDTLRTPGRAVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 261
Query: 61 DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
R D + + + L APL R+ +YYL LT I+VGG
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 321
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
+ + E AF + GG IVDSGT + + + A V R +R+ +G+
Sbjct: 322 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 380
Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNGTFCFAFA----- 214
L C+ ++E+P +S HF G V+ LP +NY + P S G A A
Sbjct: 381 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439
Query: 215 ----PTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
PTSS + I+G+ QQQ + ++L +GF +C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 126/265 (47%), Gaps = 23/265 (8%)
Query: 1 GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGS----LSFPSQIN 51
GD ET+TLGS ++ IGCGHNN G F + GG +S S
Sbjct: 179 GDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSI 238
Query: 52 ASTFSYCL--VDRDSDSTSTLEF-DSSLPPN--AVTAPL--LRNHELDTFYYLGLTGISV 104
FSYCL + +S+S+S L F D+++ V+ PL L FY+L L SV
Sbjct: 239 GGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQ---VFYFLTLEAFSV 295
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
G + + S ++ SG+G II+DSGT +T L E Y L A + D L
Sbjct: 296 GDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKL 355
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
CY +S +++P ++ HF +G + L + +PV+ G CFAF +S +I G
Sbjct: 356 LSLCYKTTS-DELDLPVITAHF-KGADVELNPISTFVPVE-KGVVCFAFI-SSKIGAIFG 411
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N+ QQ V ++L + F P C
Sbjct: 412 NLAQQNLLVGYDLVKKTVSFKPTDC 436
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/254 (30%), Positives = 115/254 (45%), Gaps = 42/254 (16%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
G+ +ET+T+ S S A GCGH++ G+F ++G++GLGGG LS SQ+ ++
Sbjct: 181 GNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTI 240
Query: 55 ---FSYCL--VDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL V DS +S + F +S + V+ PL L G S
Sbjct: 241 NGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPL----------RLPYKGYS--- 287
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
K E G IIVDSGT T L E Y+ L + + D +F
Sbjct: 288 ----------KKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFS 337
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
CY+ + + + P ++ HF + V P ++ + CF APT S + ++GN+
Sbjct: 338 LCYN--TTAEINAPIITAHFKDANVELQPLNTFMRMQED--LVCFTVAPT-SDIGVLGNL 392
Query: 227 QQQGTRVSFNLRNS 240
Q V F+LR
Sbjct: 393 AQVNFLVGFDLRKK 406
Score = 57.4 bits (137), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 58/133 (43%), Gaps = 4/133 (3%)
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
K E G IIVDSGT T L E Y L ++ + D + CY+ ++
Sbjct: 411 KKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVDQ 469
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
++ P ++ HF + V P +L + CF PT S + I+GN+ Q V F+
Sbjct: 470 IDAPIITAHFKDANVELQPWNTFLRMQED--LVCFTVLPT-SDIGILGNLAQVNFLVGFD 526
Query: 237 LRNSLIGFTPNKC 249
LR + F C
Sbjct: 527 LRKKRVSFKAADC 539
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 120/274 (43%), Gaps = 36/274 (13%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E T S + +GC + G+LG+ G LSF SQ + FSYC+
Sbjct: 190 GNLVREKFTFSRSLFTPPLILGCATES----TDPRGILGMNRGRLSFASQSKITKFSYCV 245
Query: 60 VDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTF-------------YYLGLTGISV 104
R + T T F PN+ T R E+ TF Y + L GI +
Sbjct: 246 PTRVTRPGYTPTGSFYLGHNPNSNT---FRYIEMLTFARSQRMPNLDPLAYTVALQGIRI 302
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGV 162
GG L IS F+ D G+G ++DSG+ T L E Y+ +R VR G R
Sbjct: 303 GGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYG 362
Query: 163 ALFDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS- 217
+ D C+D +++E+ + F F +G + +P + L V+ G C A +
Sbjct: 363 GVADMCFD---GNAIEIGRLIGDMVFEFEKGVQIVVPKERVLATVEG-GVHCIGIANSDK 418
Query: 218 --SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++ +IIGN QQ V F+L N +GF C
Sbjct: 419 LGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADC 452
>gi|345292859|gb|AEN82921.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292861|gb|AEN82922.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292863|gb|AEN82923.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292865|gb|AEN82924.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292867|gb|AEN82925.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292869|gb|AEN82926.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292871|gb|AEN82927.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292873|gb|AEN82928.1| AT5G10770-like protein, partial [Capsella rubella]
Length = 161
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 10/158 (6%)
Query: 44 LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
LSFPSQ + FSYCL + T L F S+ +V P+ + ++FY L +
Sbjct: 1 LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPISTISDGNSFYGLNI 59
Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
GI+VGG L I T F G ++DSGT +TRL + Y ALR +F
Sbjct: 60 VGITVGGQKLAIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
GV++ DTC+D S +V +P V+F F G V+ L +K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSK 152
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 113/280 (40%), Gaps = 50/280 (17%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
S +VDN C H G VG AG G G LS P Q+ + FSYCLV +
Sbjct: 227 SVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLAPQLSGRFSYCLVSHSFRADR 283
Query: 69 TLEFDSSL---PPNA-------VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
+ + P+A V PLL N + FY + L +SVG + ++
Sbjct: 284 LIRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARV 343
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPTDGVALFDTCY 169
D +GNGG++VDSGT T L ETY + +AF R RA T CY
Sbjct: 344 DRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTG----LTPCY 399
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS----------NGTFCFAF------ 213
+++ S VP ++ HF + LP +NY + S + C
Sbjct: 400 HYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDV 458
Query: 214 ----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+GN QQQG V +++ +GF +C
Sbjct: 459 SGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/261 (30%), Positives = 116/261 (44%), Gaps = 28/261 (10%)
Query: 6 ETVTLGSASVDNIAIGCGH-----NNEGLFVGAAGLLGLGGG-SLSFPSQINASTFSYCL 59
ET+ G NI GCGH NN+ + G+ GLG ++ +Q+ + FSYC+
Sbjct: 202 ETLDEGKIKKSNITFGCGHMNIKTNNDDAY---NGVFGLGAYPHITMATQL-GNKFSYCI 257
Query: 60 VDRD----SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
D + + + L S + ++ + H YY+ L ISVG L I A
Sbjct: 258 GDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNA 312
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVALFDTCYD-F 171
FKI G+GG+++DSG T+L + L D V +G PT C+
Sbjct: 313 FKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR-KFEGLCFKGV 371
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQ 228
SR V P V+FHF G L L + + L FC A P++S +LS+IG + Q
Sbjct: 372 VSRDLVGFPAVTFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQ 430
Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
Q V F+L + F C
Sbjct: 431 QNYNVGFDLEQMKVFFRRIDC 451
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 123/265 (46%), Gaps = 25/265 (9%)
Query: 1 GDFVTETVTLGSASVD-----NIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN--- 51
G E +T S D +I GCGH+N G F G++G+GGG LS SQI
Sbjct: 169 GVLAREAITFSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLY 228
Query: 52 -ASTFSYCLV--DRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
+ FS CLV D+ ++ T+ F +S + V L + E T Y + L GISVG
Sbjct: 229 GSKRFSQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGD 288
Query: 107 DLLPI--SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
+ SET K G I++DSGT T + E Y L + ++ +L P +
Sbjct: 289 TFVRFNSSETLSK------GNIMIDSGTPATYIPQEFYERLVEE-LKVQSSLLPIEDDPD 341
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
T + S +++E P ++ HF V LP + ++ P D G FCFA A ++ I G
Sbjct: 342 LGTQLCYRSETNLEGPILTAHFEGADVQLLPIQTFIPPKD--GVFCFAMAGSTDGDYIFG 399
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N Q + F+L I F P C
Sbjct: 400 NFAQSNILMGFDLDRKTISFKPTDC 424
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 101/242 (41%), Gaps = 27/242 (11%)
Query: 34 AGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTLEFDSSLPPNAV----------- 81
+G+ G G G S P Q+ FSYCL+ R DS + + + P++
Sbjct: 228 SGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTP 287
Query: 82 --TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQT 139
P+ N +YY+ L I VG + + GNGG IVDSG+ T ++
Sbjct: 288 FRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGGTIVDSGSTFTFMEK 347
Query: 140 ETYNALRDAFVRG----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
+ A+ F R TRA + + ++ C++ S SV +P++ F F G + LP
Sbjct: 348 PVFEAVATEFDRQMANYTRA-ADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELP 406
Query: 196 AKNYLIPVDSNGTFCFAFAPTS---SSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPN 247
NY V C S+LS I+GN Q Q ++L N GF
Sbjct: 407 VANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQ 466
Query: 248 KC 249
+C
Sbjct: 467 RC 468
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 112/260 (43%), Gaps = 23/260 (8%)
Query: 6 ETVTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA----- 52
+TV LG + V N I GC G G+ G G G+LS SQ+++
Sbjct: 190 DTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249
Query: 53 STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
FS+CL + + L L P+ V +PL+ + Y L L I+V G LLPI
Sbjct: 250 KVFSHCLKGGE-NGGGVLVLGEILEPSIVYSPLVPSLP---HYNLNLQSIAVNGQLLPID 305
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
F + N G IVDSGT + L E YN DA S ++ + CY S
Sbjct: 306 SNVFA--TTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPI-ISKGNQCYLVS 362
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
+ P VS +F G + L ++YL+ +DS +C F +I+G++ +
Sbjct: 363 NSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLK 422
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
++L N IG+ C
Sbjct: 423 DKIFVYDLANQRIGWADYNC 442
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 123/279 (44%), Gaps = 34/279 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G + +T+ +V +GC + + +GL G G G+ S P+Q+ FSYCL+
Sbjct: 173 GLLIADTLRAPGRAVPGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLL 230
Query: 61 DRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDT-----FYYLGLTGISVGGDLLP 110
R D + + L PL+++ D +YYL L G++VGG +
Sbjct: 231 SRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVR 290
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALF 165
+ AF + +G+GG IVDSGT T L + + DA V R R+ D + L
Sbjct: 291 LPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGL- 349
Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAFAPTSSSLS 221
C+ + S+ +P +SFHF G V+ LP +NY + V G C A S S
Sbjct: 350 HPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV-VAGRGAVEAICLAVVTDFSGGS 408
Query: 222 -----------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G+ QQQ V ++L +GF C
Sbjct: 409 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 447
>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
Length = 440
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 118/271 (43%), Gaps = 42/271 (15%)
Query: 14 SVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQI-NA----STFSYCLVDRDSDS 66
S + + C ++ EGL G G+LGLG G + FP+Q+ NA F+ CL +
Sbjct: 154 STNGVVFDCAPHSLLEGLAKGVKGILGLGNGYVGFPTQLANAFSVPRKFAICLTSSTTSR 213
Query: 67 TSTLEFDSS---LP-----PNAVTAPLLRNH----------ELDTFYYLGLTGISVGGDL 108
DS LP V PLL+N E T Y++G+T I + G++
Sbjct: 214 GVIFFGDSPYVFLPGMDVSKRLVYTPLLKNPVSTSGSYFEGEPSTDYFIGVTSIKINGNV 273
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
+PI+ T I + G GG + + T+L+T YNAL AFV+ + VA F C
Sbjct: 274 VPINTTLLNITKDGKGGTKISTVDPYTKLETSIYNALTKAFVKSLAKVPRVKPVAPFKVC 333
Query: 169 YDFSSRSSVE----VPTVSFHFPEGKV---LPLPAKNYLIPVDSNGTFCFA-------FA 214
Y+ +S S VP + + N ++ ++ N C F
Sbjct: 334 YNRTSLGSTRVGRGVPPIELVLGNKNATTSWTIWGVNSMVAMN-NDVLCLGFLDGGVEFE 392
Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
PT+S +IG Q + + F++ N +GFT
Sbjct: 393 PTTS--IVIGAHQIEDNLLQFDIANKRLGFT 421
>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
Length = 484
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 112/259 (43%), Gaps = 30/259 (11%)
Query: 12 SASVDNIAIGCGHNNEGLFVG-----AAGLLGLGGGSLSFPSQINAST------FSYCLV 60
SA+VD C EG+ G +AG+L L S S PS++ AS+ FSYCL
Sbjct: 235 SATVDKFRFAC---LEGIAPGPAEDGSAGILDLSRNSHSLPSRLVASSPPHAVAFSYCLP 291
Query: 61 DRDSDSTSTLEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
+D L ++ P PL + Y + L G+ +GG LPI A
Sbjct: 292 ASTAD-VGFLSLGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGLGLGGPDLPIPPAAI 350
Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
D++ I++ T T L+ + Y LRD+F + + DTCY+F+ +
Sbjct: 351 AGDDT-----ILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSLDTCYNFTGLDA 405
Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSL---SIIGNVQQQG 230
VP V+ F G + L + D + F C AF ++IG++ Q
Sbjct: 406 FSVPAVTLKFAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSMAQMS 465
Query: 231 TRVSFNLRNSLIGFTPNKC 249
T V +++R +GF P +C
Sbjct: 466 TEVVYDVRGGKVGFVPYRC 484
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 123/279 (44%), Gaps = 34/279 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G + +T+ +V +GC + + +GL G G G+ S P+Q+ FSYCL+
Sbjct: 180 GLLIADTLRAPGRAVPGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLL 237
Query: 61 DRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDT-----FYYLGLTGISVGGDLLP 110
R D + + L PL+++ D +YYL L G++VGG +
Sbjct: 238 SRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVR 297
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALF 165
+ AF + +G+GG IVDSGT T L + + DA V R R+ D + L
Sbjct: 298 LPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGL- 356
Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAFAPTSSSLS 221
C+ + S+ +P +SFHF G V+ LP +NY + V G C A S S
Sbjct: 357 HPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV-VAGRGAVEAICLAVVTDFSGGS 415
Query: 222 -----------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G+ QQQ V ++L +GF C
Sbjct: 416 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 454
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/62 (77%), Positives = 57/62 (91%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
GDFVTETVT+G V N+A+GCGHNNEGLFVGAAGL+GLGGG LSFP+Q+N+++FSYCLV
Sbjct: 219 GDFVTETVTIGVNKVKNVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNSTSFSYCLV 278
Query: 61 DR 62
DR
Sbjct: 279 DR 280
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 117/276 (42%), Gaps = 31/276 (11%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
G + +TL SASVD+ GC + G +GAAGLL L S S S++ A TFS
Sbjct: 229 GAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFS 288
Query: 57 YCLVDRDSDSTSTLEF-DSSLPPN-----AVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
YCL + S L ++ +P N APL+ + Y + L G+S+GG +P
Sbjct: 289 YCLPLSTTSSHGFLAIGEADVPHNRTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIP 348
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I A + + +++D+ T ++ Y LRDAF R + DTCY+
Sbjct: 349 IPPHA----ATASAAMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYN 404
Query: 171 FSS-RSSVEVPTVSFHF-----PEGKVLPLPAKNYLIPVDSNGTF----CFAFAPTSSS- 219
F+ R V +P V F G + + + + G F C AFA S
Sbjct: 405 FTGVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDG 464
Query: 220 ------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++G + Q V ++ IGF P C
Sbjct: 465 DAEAPLAMVMGTLAQSSMEVVHDVPGGKIGFIPGSC 500
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 84/272 (30%), Positives = 117/272 (43%), Gaps = 34/272 (12%)
Query: 1 GDFVTETV----TLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
G +V++T+ LG + +DN I GC G G+ G G G LS S
Sbjct: 163 GYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVIS 222
Query: 49 Q-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q I FS+CL D L L P V +PL+ + Y L L I+
Sbjct: 223 QLSTRGITPRVFSHCL-KGDGSGGGILVLGEILEPGIVYSPLVPSQP---HYNLNLLSIA 278
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTD 160
V G LLPI AF S + G IVDSGT + L E Y D FV A+ S T
Sbjct: 279 VNGQLLPIDPAAFA--TSNSQGTIVDSGTTLAYLVAEAY----DPFVSAVNAIVSPSVTP 332
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAFAPTS 217
+ + CY S+ S P SF+F G + L ++YLIP S+G +C F
Sbjct: 333 ITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKV- 391
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++I+G++ + ++L IG+ C
Sbjct: 392 QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 423
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 103/224 (45%), Gaps = 16/224 (7%)
Query: 35 GLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
G+ G G LS SQ+++ FS+CL DS L + PN V PL+ +
Sbjct: 226 GIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGG-GILVLGEIVEPNVVYTPLVPSQ 284
Query: 90 ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
Y L L ISV G +LPIS F S + G I+DSGT + L E YNA A
Sbjct: 285 P---HYNLNLQSISVNGQVLPISPAVFA--TSSSQGTIIDSGTTLAYLAEEAYNAFVVA- 338
Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-- 207
V + S V + CY SS S P VS +F G L L A++YLI +S G
Sbjct: 339 VTNIVSQSTQSVVLKGNRCYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGT 398
Query: 208 -TFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+C F ++I+G++ + ++L N IG+T C
Sbjct: 399 TVWCIGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWTNYDC 442
>gi|326489434|dbj|BAK01698.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 429
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 108/252 (42%), Gaps = 37/252 (14%)
Query: 27 EGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSDSTST------------ 69
E L GAAG+ G LS P+Q A+ F+ CL SD +
Sbjct: 161 ESLPAGAAGVAGFSRLPLSLPTQFASLLKVANEFALCLPSGGSDGVAVFGGGPFQLLAAP 220
Query: 70 -LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID-ESGNGGII 127
+E L N + PLL+ H + YY +TGI+V L+P F +D SG GG +
Sbjct: 221 PVELAGRLRENPL--PLLK-HPYNGGYYFNITGIAVNQQLVPTPPGVFDLDASSGTGGAV 277
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS----RSSVEVPTVS 183
+ T T L+ + Y LR+AF T ++ D V FD CY S+ R V +
Sbjct: 278 FSTVTPYTALRWDIYWPLRNAFDAATSGIARADKVEPFDLCYQASALTVTRVGYGVANIE 337
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS----------IIGNVQQQGTRV 233
G+ LP + L+ V+ N T CFAF +SS S I+G Q + +
Sbjct: 338 LMLDGGRNWTLPGASSLVQVN-NQTVCFAFVQMASSSSMPAALDSPAVILGGHQMENNLL 396
Query: 234 SFNLRNSLIGFT 245
F+L F+
Sbjct: 397 MFDLVKETFAFS 408
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 83.6 bits (205), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 118/271 (43%), Gaps = 30/271 (11%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E +T S S + +GC + A G+LG+ G LSF SQ + FSYC+
Sbjct: 175 GNLVREKITFSRSQSTPPLILGCAEESSD----AKGILGMNLGRLSFASQAKLTKFSYCV 230
Query: 60 VDRDSDS--TSTLEFDSSLPPNA---------VTAPLLRNHELDTFYY-LGLTGISVGGD 107
R T T F PN+ + R LD Y + + GI +G
Sbjct: 231 PTRQVRPGFTPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQ 290
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
L I +AF+ D SG G ++DSG+ T L E YN +R+ VR G R +
Sbjct: 291 KLNIPISAFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVS 350
Query: 166 DTCYDFSSRSSVE----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---S 218
D C++ +++E + + F F +G + + + L V G C + +
Sbjct: 351 DMCFN---GNAIEIGRLIGNMVFEFDKGVEIVVEKERVLADV-GGGVHCVGIGRSEMLGA 406
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +IIGN QQ V F+L N +GF C
Sbjct: 407 ASNIIGNFHQQNIWVEFDLANRRVGFGKADC 437
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 96/220 (43%), Gaps = 50/220 (22%)
Query: 21 GCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPP 78
GC H G+ GL+GLGG + S SQ A
Sbjct: 207 GCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAA-------------------------- 240
Query: 79 NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
R+ ++ T+Y+ L I+VGG L +S + F G +VDSGT +TRL
Sbjct: 241 --------RSKKVPTYYFAALEDIAVGGKKLGLSPSVFA------AGSLVDSGTVITRLP 286
Query: 139 TETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
Y AL AF G + + + + DTC++F+ V +PTV+ F G V+ L A
Sbjct: 287 PAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHG 346
Query: 199 YLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRVSFN 236
+ S G C AFAPT + IGNVQQ+ V ++
Sbjct: 347 IV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 118/278 (42%), Gaps = 39/278 (14%)
Query: 5 TETVTLGSAS--VDNIAIGCGHNNEGLFVGA--AGLLGLGGGSLSFPSQIN---ASTFSY 57
T+T+ LG+ + + ++A GC + EG AG LG+G S QI S FSY
Sbjct: 163 TDTIILGNPTLPIHSVAFGCAQSTEGFDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSY 222
Query: 58 CLVD--RDSDSTSTLEFDSSLPPNAV----------TAPLLRNHELDTFYYLGLTGISVG 105
CL+ + F + +P + T P L + D+ YY+ L GIS+
Sbjct: 223 CLIGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLN 282
Query: 106 GDLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSP 158
G +P I + F+ G+GG VD+GT VT L Y + +A R P
Sbjct: 283 GTPIPGIRQAMFERRSDGSGGCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDP 342
Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKV------LPLPAKNYLIPVDSNGTFCFA 212
F C+ +P ++ F EG L + ++N + VD+ CF
Sbjct: 343 N-----FSLCFREHPGIWSHIPKLTLDF-EGPASRTVAHLEIVSRNLFLKVDNQPLVCFG 396
Query: 213 FAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
TS S +++G +QQ TR F+L + I F C
Sbjct: 397 VYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHRESC 434
>gi|302142046|emb|CBI19249.3| unnamed protein product [Vitis vinifera]
Length = 191
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 76/150 (50%), Gaps = 9/150 (6%)
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
VG L+P++ D + G I+DSGT +TR Y A+RD F + + P +
Sbjct: 46 VGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG--PFATIG 103
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSS 219
FDTC F++ + P V+FHF G L LP +N LI + C A A +S
Sbjct: 104 AFDTC--FAATNEDIAPPVTFHF-TGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSV 160
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L++I N+QQQ R+ F++ NS +G C
Sbjct: 161 LNVIANLQQQNLRIMFDVTNSRLGIARELC 190
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 111/276 (40%), Gaps = 31/276 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++E + + + +GC + AG+ G G G S PSQ+N + FSYCL+
Sbjct: 191 GFLLSENLNFPTKKYSDFLLGCSVVS---VYQPAGIAGFGRGEESLPSQMNLTRFSYCLL 247
Query: 61 DRDSDSTST------LEFDSSL--PPNAVT-APLL------RNHELDTFYYLGLTGISVG 105
D ++T LE SS N V+ P L +N +YY+ L I VG
Sbjct: 248 SHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVG 307
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG---TRALSPTDGV 162
+ + + + G+GG IVDSG+ T ++ ++ + F + TRA
Sbjct: 308 EKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQF 367
Query: 163 ALFDTCYDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP------ 215
L C+ + + P + F F G + LP NY V C
Sbjct: 368 GL-SPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGS 426
Query: 216 --TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
T I+GN QQQ V ++L N GF C
Sbjct: 427 GGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSC 462
>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
Length = 378
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 115/286 (40%), Gaps = 62/286 (21%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
+ +VDN C H G VG AG G G LS P Q++ + FSYCLV +
Sbjct: 97 AVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLV------SH 147
Query: 69 TLEFDSSLPPNA-------------------VTAPLLRNHELDTFYYLGLTGISVGGDLL 109
+ D + P+ V PLL N + FY + L +SVG +
Sbjct: 148 SFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARI 207
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPTD 160
++D +GNGG++VDSGT T L E Y + +AF R RA T
Sbjct: 208 QARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTG 267
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN----GTF-----CF 211
CY +++ S VP ++ HF + LP +NY + S GT C
Sbjct: 268 ----LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCL 322
Query: 212 AFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +GN QQQG V +++ +GF +C
Sbjct: 323 MLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 368
>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
Length = 507
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 79/246 (32%), Positives = 106/246 (43%), Gaps = 40/246 (16%)
Query: 21 GCGHNN-----EGLFVGA-AGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDS----- 66
GC H EG A AG++ LGGG S SQ + S FSYC+ +S
Sbjct: 230 GCSHGEAKQGGEGSIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFV 289
Query: 67 TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
D S P+LR + T Y + L I+V G L ++ + F G
Sbjct: 290 LGGGVGDLSGAGGYAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVFA------SGS 343
Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVALFDTCYDFSSRSSVEVPTVS 183
++DS TA+TRL Y ALR+AF R A+ +P G DTCYDF+ V VP V+
Sbjct: 344 VLDSRTAITRLPPTAYQALREAF-RSRMAMYREAPPQGN--LDTCYDFAGAFLVMVPRVA 400
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSL--SIIGNVQQQGTRVSFNLR 238
L N ++ +D G C F + I+GNVQQQ V +N+
Sbjct: 401 L---------LLDGNAVVALDRQGILFHDCLVFTSNTDDRMPGILGNVQQQTMEVLYNVG 451
Query: 239 NSLIGF 244
LI
Sbjct: 452 GVLISM 457
>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
Length = 330
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 127/266 (47%), Gaps = 20/266 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G F TET LG+ +V NI GCG N+G + AG+ G+G G +S +Q+ FSYC
Sbjct: 60 GYFATETFALGNVTVANITFGCGTRNQGYYDNVAGVFGVGRGGVSLLNQLGIDRFSYCFS 119
Query: 61 DRDSDSTSTLEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
+ +S + S A + P++ + L + Y++ L G++VG + ++
Sbjct: 120 SSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATRVDVAG 179
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALFDTC 168
+ E G +++DS + VT L TY +R A V L + GV L D C
Sbjct: 180 ASSA--EGGGRALVIDSTSPVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGL-DLC 236
Query: 169 YDFSSRSSVEVP---TVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSII 223
++ ++ + P T++ HF G L LP NYL + G C P+SS+ + ++
Sbjct: 237 FELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAKDSAGGLICLTMTPSSSNGVPVL 296
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
G+ T V ++L +++ F P C
Sbjct: 297 GSSALLDTLVLYDLAKNVVSFQPLDC 322
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 106/242 (43%), Gaps = 31/242 (12%)
Query: 35 GLLGLGGGSLSFPSQINASTFSYCLVDRDSDST-----STLEFDSSLPPNAVTAPLLRNH 89
GL+G+ GSLSF +Q+ FSYC+ +D+ +T ++ L P T + N
Sbjct: 197 GLMGMNRGSLSFVTQMGFPKFSYCISGKDASGVLLFGDATFKW---LGPLKYTPLVKMNT 253
Query: 90 ELDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNAL 145
L F Y + L GI VG L + + F D +G G +VDSGT T L Y AL
Sbjct: 254 PLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGSVYTAL 313
Query: 146 RDAFVRGTRALSP--TDGVALFDTCYDFSSRSSV-----EVPTVSFHFPEGKVLPLPAKN 198
R+ FV TR + D +F+ D R VP V+ F EG + + +
Sbjct: 314 RNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVTMVF-EGAEMSVSGER 372
Query: 199 YLIPVDSNG--------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
L V +G +C F + +IG+ QQ + F+L NS +GF
Sbjct: 373 LLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQNVWMEFDLVNSRVGFADT 432
Query: 248 KC 249
KC
Sbjct: 433 KC 434
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 79/254 (31%), Positives = 115/254 (45%), Gaps = 16/254 (6%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAST-FS 56
+ T + T G V++I GCGHNN G+F +G GL G +S + S FS
Sbjct: 112 EIATFSSTDGKPIVESIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFS 171
Query: 57 YCLV----DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
CLV D + T +L S + V L + E T Y + L GISVG +P +
Sbjct: 172 QCLVPFHADPHTSGTISLGEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFN 231
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
+ G I++DSGT T L E Y+ L + ++ L P T +
Sbjct: 232 SSEML----SKGNIMIDSGTPETYLPQEFYDRLVEE-LKVQINLPPIHVDPDLGTQLCYK 286
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
S +++E P ++ HF V LP + ++ P D G FCFA T+ L I GN Q
Sbjct: 287 SETNLEGPILTAHFEGADVKLLPLQTFIPPKD--GVFCFAMTGTTDGLYIFGNFAQSNVL 344
Query: 233 VSFNLRNSLIGFTP 246
+ F+L ++ F P
Sbjct: 345 IGFDLDKRIVFFKP 358
>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
Length = 337
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 116/278 (41%), Gaps = 38/278 (13%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
G + +TL SASVD+ GC + G +GAAGLL L S S S++ A TFS
Sbjct: 69 GAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSLASRLAAGAGGTFS 128
Query: 57 YCLVDRDSDSTSTLEF-DSSLPPN-----AVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
YCL + S L ++ +P N APL+ + Y + L G+S+GG +P
Sbjct: 129 YCLPLSTTSSHGFLVIGEADVPHNRSARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIP 188
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
I A +++D+ T ++ Y LRDAF R + DTCY+
Sbjct: 189 IPPHA---------AMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYN 239
Query: 171 FSS-RSSVEVPTVSFHF-------PEGKVLPLPAKNYLIPVDSNGTF----CFAFAPTSS 218
F+ R V +P V F + + ++ + G F C AFA S
Sbjct: 240 FTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPGNFFSVTCLAFAALPS 299
Query: 219 S-------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++G + Q V +++ IGF P C
Sbjct: 300 DGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPGSC 337
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 113/269 (42%), Gaps = 36/269 (13%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
G TETVT+ S S + IGCGHN+ +G++GL G S +Q+
Sbjct: 135 GTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYP 194
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
SYC S TS + F + NA+ A + YYL L +SVG
Sbjct: 195 GLMSYCFA---SQGTSKINFGT----NAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVG 247
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
+ T F E G II+DSGT +T N +R+A +V R PT
Sbjct: 248 DTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAVRTADPTGND 304
Query: 163 ALFDTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL- 220
L CY ++++ P ++ HF G L L N I + GTFC A +
Sbjct: 305 ML---CY---YTDTIDIFPVITMHFSGGADLVLDKYNMYIETITRGTFCLAIICNNPPQD 358
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I GN Q V ++ + L+ F+P C
Sbjct: 359 AIFGNRAQNNFLVGYDSSSLLVSFSPTNC 387
>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 410
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 111/251 (44%), Gaps = 26/251 (10%)
Query: 14 SVDNIAIGCGH-----NNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSYCLVDRDSD 65
SV I GC H +N+G +G+L L LSF + + + FSYCL +
Sbjct: 170 SVPGIMFGCAHSVTGFHNDGTL---SGVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTH 226
Query: 66 ST-STLEFDS---SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
+ S L F + SLPP+A T L+ H Y+L + GIS+G L I F +
Sbjct: 227 NPDSFLRFGADVPSLPPHAHTTTLV--HAGVPGYHLNIVGISLGNKRLHIDRHVF----A 280
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGVALFDTCYDFSSRS-SVE 178
GG ++ +TR+ Y A+ A V + L G+ C+D RS V+
Sbjct: 281 AGGGCSINPAVTITRIMELAYLAVEHALVAHMKELGSGRVKGMPGRSLCFDHMDRSVRVQ 340
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
+P +SFHF +G L A+ L V CF ++IG QQ TR +F++
Sbjct: 341 LPGMSFHFEDGAELRFAAEQ-LFDVRVMAA-CFLVVGRGHHQTVIGAAQQVDTRFTFDIA 398
Query: 239 NSLIGFTPNKC 249
+ F P C
Sbjct: 399 AGRLAFVPETC 409
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/263 (26%), Positives = 116/263 (44%), Gaps = 26/263 (9%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAS---- 53
D VT GS + I GCG G G++G G + SF SQ+ +
Sbjct: 188 DLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVK 246
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
+F++CL +++ + P T P+L Y + L I VG +L +S
Sbjct: 247 RSFAHCL--DNNNGGGIFAIGEVVSPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLQLS 301
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
AF D + G+I+DSGT + L YN L + + + L+ F TC+ +
Sbjct: 302 SDAF--DSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSF-TCFHYI 358
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PTSSSLSIIGNV 226
R PTV+F F + L + + YL V + T+CF + +SL+I+G++
Sbjct: 359 DRLD-RFPTVTFQFDKSVSLAVYPQEYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDM 416
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
V +++ N +IG+T + C
Sbjct: 417 ALSNKLVVYDIENQVIGWTNHNC 439
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 112/271 (41%), Gaps = 43/271 (15%)
Query: 17 NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL 76
N AIGC + + +GL G G G+ S PSQ+ FSYCL+ R D S + + L
Sbjct: 219 NFAIGC--SIVSVHQPPSGLAGFGRGAPSVPSQLKVPKFSYCLLSRRFDDNSAVSGELVL 276
Query: 77 PPNAVTA----------PLLRNH----ELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
V A PLL N +YYL LTGISVGG + + AF + SG
Sbjct: 277 GDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNLPSRAF-VPSSG 335
Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALFDTCYDFSSRS-- 175
GG I+DSGT T L + + A R R+ D + L C+
Sbjct: 336 -GGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGL-RPCFALPPGPGG 393
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYL-----------------IPVDSNGTFCFAFAPTSS 218
++E+P + F G V+ LP +NY + V S+ +
Sbjct: 394 AMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVVSDLPASGGDGAAAG 453
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G+ QQQ + ++L +GF C
Sbjct: 454 PAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 91/215 (42%), Gaps = 14/215 (6%)
Query: 42 GSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPP----NAVTAPLLRNHELDTFYYL 97
S + PS +A FSYCL SD L ++ P PL N Y +
Sbjct: 277 ASRAAPSSPDAVAFSYCLPSYPSD-VGFLSLGATKPELLGRKVSYTPLRSNRHNGNLYVV 335
Query: 98 GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
L G+ +GG LP+ A GG I++ T T L+ + Y ALRD F +
Sbjct: 336 ELVGLGLGGVDLPVPRAAI-----AGGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYP 390
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFA 214
DTCY+F++ SS VP V+ F G L + + F C AF
Sbjct: 391 VAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFV 450
Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++IG++ Q T V +++R +GF P +C
Sbjct: 451 AQDGG-AVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 109/257 (42%), Gaps = 24/257 (9%)
Query: 11 GSASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS 68
GS S + +AIGC + F + G+ GLG + S P Q+N S FSYCL
Sbjct: 138 GSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDLP 197
Query: 69 TLEFDSSLP----------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
+ ++ P T L N + T Y++ L GIS+GG LP T
Sbjct: 198 SYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPAVST---- 253
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
+SG G + VD+GT+ TRL+ + L D ++ + + G CY S +
Sbjct: 254 -KSG-GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTA 311
Query: 176 SVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
+ E +P + HF + + LP +YL S + +S++GN Q Q T
Sbjct: 312 ADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGISVLGNFQMQNTH 371
Query: 233 VSFNLRNSLIGFTPNKC 249
+ + N + F C
Sbjct: 372 MLLDTGNEKLSFVRADC 388
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 115/286 (40%), Gaps = 62/286 (21%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
+ +VDN C H G VG AG G G LS P Q++ + FSYCLV +
Sbjct: 222 AVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLV------SH 272
Query: 69 TLEFDSSLPPNA-------------------VTAPLLRNHELDTFYYLGLTGISVGGDLL 109
+ D + P+ V PLL N + FY + L +SVG +
Sbjct: 273 SFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARI 332
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPTD 160
++D +GNGG++VDSGT T L E Y + +AF R RA T
Sbjct: 333 QARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTG 392
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN----GTF-----CF 211
CY +++ S VP ++ HF + LP +NY + S GT C
Sbjct: 393 ----LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCL 447
Query: 212 AFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +GN QQQG V +++ +GF +C
Sbjct: 448 MLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 139/275 (50%), Gaps = 33/275 (12%)
Query: 1 GDFVTETVTLGSASVDNIAI-----GCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
GD TET+++ SAS ++ GCG+NN G F +G++GLGGG LS SQ+ +S
Sbjct: 176 GDVATETISIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI 235
Query: 54 --TFSYCLVDRDS--DSTSTLEFDSSLPPNA-------VTAPLLRNHELDTFYYLGLTGI 102
FSYCL + + + TS + ++ P++ ++ PL+ + E T+YYL L I
Sbjct: 236 SKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVISTPLV-DKEPRTYYYLTLEAI 294
Query: 103 SVGGDLLPISETAFKIDESG-----NGGIIVDSGTAVTRLQT---ETYNALRDAFVRGTR 154
SVG +P + +++ ++ G +G II+DSGT +T L + + + A + V G +
Sbjct: 295 SVGKKKIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAK 354
Query: 155 ALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA 214
+S G L C+ S + + +P ++ HF G + L N + V S C +
Sbjct: 355 RVSDPQG--LLSHCFK-SGSAEIGLPEITVHF-TGADVRLSPINAFVKV-SEDMVCLSMV 409
Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
PT + ++I GN Q V ++L + F C
Sbjct: 410 PT-TEVAIYGNFAQMDFLVGYDLETRTVSFQRMDC 443
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 115/268 (42%), Gaps = 24/268 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G V E +T S+ S + +GC + G+LG+ G SF SQ S FSYC+
Sbjct: 174 GSLVREKITFSSSQSTPPLILGCAEAS----TDEKGILGMNLGRRSFASQAKISKFSYCV 229
Query: 60 VDRDSDS--TSTLEFDSSLPPNA---------VTAPLLRNHELDTFYY-LGLTGISVGGD 107
R + + +ST F PN+ P R+ LD Y + + GI +G
Sbjct: 230 PTRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNA 289
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
L IS T F+ D SG G I+DSG+ T L E YN +R+ VR G + +
Sbjct: 290 RLNISATLFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVS 349
Query: 166 DTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
D C+D + + + F F +G + + L V G C + ++ +
Sbjct: 350 DMCFDGNPMEIGRLIGNMVFEFEKGVEIVIDKWRVLADV-GGGVHCIGIGRSEMLGAASN 408
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN QQ V ++L N IG C
Sbjct: 409 IIGNFHQQNLWVEYDLANRRIGLGKADC 436
>gi|376338606|gb|AFB33833.1| hypothetical protein CL1136Contig1_03, partial [Larix decidua]
gi|376338608|gb|AFB33834.1| hypothetical protein CL1136Contig1_03, partial [Larix decidua]
gi|376338610|gb|AFB33835.1| hypothetical protein CL1136Contig1_03, partial [Larix decidua]
Length = 71
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/70 (58%), Positives = 49/70 (70%), Gaps = 1/70 (1%)
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS-NGTFCFAFAPTSSS 219
G +LFDTCYD S +V+VPT+ FHF + LPA NYLI VDS + FCFAFA +
Sbjct: 2 GFSLFDTCYDLSGLKTVKVPTLDFHFKGRADVSLPATNYLILVDSASAVFCFAFAGNTGG 61
Query: 220 LSIIGNVQQQ 229
LSIIGN+QQQ
Sbjct: 62 LSIIGNIQQQ 71
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/226 (31%), Positives = 105/226 (46%), Gaps = 23/226 (10%)
Query: 35 GLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTSTLEF--DSSLP--PNAVTAPLLR 87
GL+GLG G LS SQ+ FSYC S+STS + F D+ + V+ PL+
Sbjct: 224 GLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLII 283
Query: 88 NHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
++YYL L G+S+G + SE+ +G I++DSGT+ T L+ YN
Sbjct: 284 KSIGPSYYYLNLEGVSIGNKKVKTSES------QTDGNILIDSGTSFTILKQSFYNK--- 334
Query: 148 AFVRGTRALSPTDGVALFDTCYDFSSRSS---VEVPTVSFHFPEGKVLPLPAKNYLIPVD 204
FV + + + V + Y+F + P V F F KV + A N L +
Sbjct: 335 -FVALVKEVYGVEAVKIPPLVYNFCFENKGKRKRFPDVVFLFTGAKVR-VDASN-LFEAE 391
Query: 205 SNGTFCFAFAPTSSSL-SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
N C PTS SI GN Q G +V ++L+ ++ F P C
Sbjct: 392 DNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADC 437
>gi|226494967|ref|NP_001141737.1| uncharacterized protein LOC100273869 [Zea mays]
gi|194705750|gb|ACF86959.1| unknown [Zea mays]
gi|195645950|gb|ACG42443.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 163
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 85/165 (51%), Gaps = 10/165 (6%)
Query: 91 LDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
+ FY + + G+SV G+LL I + +++ GG I+DSGT++T L + Y A+ A
Sbjct: 1 MRPFYAVAVNGVSVDGELLRIPRRVWDVEK--GGGAILDSGTSLTVLVSPAYRAVVAALS 58
Query: 151 RGTRALSPTDGVALFDTCYDFSSRS-----SVEVPTVSFHFPEGKVLPLPAKNYLIPVDS 205
R L P + FD CY+++S S +V VP ++ HF L P K+Y+I +
Sbjct: 59 RKLAGL-PRVAMDPFDYCYNWTSPSTGEDLAVAVPELALHFAGSARLQPPPKSYVIDA-A 116
Query: 206 NGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
G C +S+IGN+ QQ F+L+N + F ++C
Sbjct: 117 PGVKCIGLQEGDWPGVSVIGNIMQQEHLWEFDLKNRRLRFKRSRC 161
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 116/263 (44%), Gaps = 26/263 (9%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAS---- 53
D VT GS + I GCG G G++G G + SF SQ+ +
Sbjct: 20 DLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVK 78
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
+F++CL +++ + P T P+L Y + L I VG +L +S
Sbjct: 79 RSFAHCL--DNNNGGGIFAIGEVVSPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLELS 133
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
AF D + G+I+DSGT + L YN L + + L+ F TC+ ++
Sbjct: 134 SNAF--DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYT 190
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PTSSSLSIIGNV 226
+ PTV+F F + L + + YL V + T+CF + +SL+I+G++
Sbjct: 191 DKLD-RFPTVTFQFDKSVSLAVYPREYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDM 248
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
V +++ N +IG+T + C
Sbjct: 249 ALSNKLVVYDIENQVIGWTNHNC 271
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 109/257 (42%), Gaps = 24/257 (9%)
Query: 11 GSASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS 68
GS S + +AIGC + F + G+ GLG + S P Q+N S FSYCL
Sbjct: 161 GSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDLP 220
Query: 69 TLEFDSSLP----------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
+ ++ P T L N + T Y++ L GIS+GG LP T
Sbjct: 221 SYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPAVST---- 276
Query: 119 DESGNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
+SG G + VD+GT+ TRL+ + L D ++ + + G CY S +
Sbjct: 277 -KSG-GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTA 334
Query: 176 SVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
+ E +P + HF + + LP +YL S + +S++GN Q Q T
Sbjct: 335 ADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGISVLGNFQMQNTH 394
Query: 233 VSFNLRNSLIGFTPNKC 249
+ + N + F C
Sbjct: 395 MLLDTGNEKLSFVRADC 411
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 125/276 (45%), Gaps = 29/276 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ET LGS + GC +N GL+G+ GSLSF +Q+ FS
Sbjct: 158 GNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYPKFS 217
Query: 57 YCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
YC+ DS L ++S P P + T + + L F Y + L GI V +L
Sbjct: 218 YCISGFDSAGVLLLG-NASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVL 276
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALF-- 165
+ ++ F D +G G +VDSGT T L Y AL++ F+ TR + D +F
Sbjct: 277 SLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKVLNDDNFVFQG 336
Query: 166 --DTCYDF-SSRSSVE-VPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAPT 216
D CY SSR +++ +P VS F +G + + + Y +P + G +CF F +
Sbjct: 337 AMDLCYLLDSSRPNLQNLPVVSLMF-QGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNS 395
Query: 217 S---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IG+ QQ + F+L S IG +C
Sbjct: 396 DLLGVEAFVIGHHHQQNVWMEFDLEKSRIGLADVRC 431
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 103/252 (40%), Gaps = 48/252 (19%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCLV-----------------DRDSDSTSTLEFDSS 75
G+ G G G LS PSQ+ FS+C + D S L+F S
Sbjct: 164 GIAGFGRGVLSLPSQLGFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS- 222
Query: 76 LPPNAVTAPLLRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAV 134
LL+N +YY+GL I+VG + + + + D GNGG+I+DSGT
Sbjct: 223 ---------LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTY 273
Query: 135 TRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE------VPTVSFH 185
T L Y L + + RA + FD CY ++V +P++SFH
Sbjct: 274 THLPGPFYTQLLSMLQSIITYPRA-QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFH 332
Query: 186 FPEGKVLPLPAKNYLI----PVDSNGTFCFAFAPTSSSLS----IIGNVQQQGTRVSFNL 237
F L LP N+ P +S C S S + G+ QQQ +V ++L
Sbjct: 333 FSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDL 392
Query: 238 RNSLIGFTPNKC 249
IGF P C
Sbjct: 393 EKERIGFQPMDC 404
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/105 (57%), Positives = 72/105 (68%), Gaps = 9/105 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
GDF TET+T A VD++ +GCGH+NEGLFVGAAGLLGLG G LSFPSQ FSY
Sbjct: 226 GDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSY 285
Query: 58 CLVDR-----DSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYY 96
CLVDR S ST+ F ++++P +V PLL N +LDTFYY
Sbjct: 286 CLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYY 330
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 126/302 (41%), Gaps = 66/302 (21%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYC 58
+++++L S SV N GC H +G AG G G LS P+Q++ ++FSYC
Sbjct: 197 SDSLSLPSVSVANFTFGCAHTTLAEPIGVAGF---GRGRLSLPAQLSVHSPHLGNSFSYC 253
Query: 59 LVDRDSDSTSTLE---------FDSSLPPNA------------------VTAPLLRNHEL 91
LV DS D A V +L N +
Sbjct: 254 LVSHSFDSDRVRRPSPLILGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKH 313
Query: 92 DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF-- 149
FY + L GIS+G +P +ID++G GG++VDSGT T L + YN++ + F
Sbjct: 314 PYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDS 373
Query: 150 ------VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP-EGKVLPLPAKNYLIP 202
R R + P+ G++ CY + +V+VP + HF G + LP +NY
Sbjct: 374 RVGRVHERADR-VEPSSGMS---PCYYLN--QTVKVPALVLHFAGNGSTVTLPRRNYFYE 427
Query: 203 V----------DSNGTFCFAFAPTSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPN 247
G S L +I+GN QQQG V ++L N +GF
Sbjct: 428 FMDGGDGKEEKRKVGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKR 487
Query: 248 KC 249
KC
Sbjct: 488 KC 489
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 105/262 (40%), Gaps = 30/262 (11%)
Query: 1 GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGA--AGLLGLGGGSLSFPSQIN---AST 54
G V + ++L S V GC H G F + AG++ LG G S SQ +
Sbjct: 265 GTLVADQLSLSPTSQVPKFEFGCSHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQV 324
Query: 55 FSYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
FSYC S L P+L+ L Y + L I+V G L +
Sbjct: 325 FSYCFPPTASHKGFFVLGVPRRSSSRYAVTPMLKTPML---YQVRLEAIAVAGQRLDVPP 381
Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
T F G +DS T +TRL Y ALR AF P DTCYDF+
Sbjct: 382 TVFA------AGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTG 435
Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS---SSLSIIGNVQ 227
SS+ +PT+S F + +D +G C AFA T+ + IIG +Q
Sbjct: 436 VSSIMLPTISLVFDR--------TGAGVQLDPSGVLFGSCLAFASTAGDDRATGIIGFLQ 487
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q V +N+ +GF C
Sbjct: 488 LQTIEVLYNVAGGSVGFRRGAC 509
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 103/252 (40%), Gaps = 48/252 (19%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCLV-----------------DRDSDSTSTLEFDSS 75
G+ G G G LS PSQ+ FS+C + D S L+F S
Sbjct: 181 GIAGFGRGVLSLPSQLGFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS- 239
Query: 76 LPPNAVTAPLLRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAV 134
LL+N +YY+GL I+VG + + + + D GNGG+I+DSGT
Sbjct: 240 ---------LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTY 290
Query: 135 TRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE------VPTVSFH 185
T L Y L + + RA + FD CY ++V +P++SFH
Sbjct: 291 THLPGPFYTQLLSMLQSIITYPRA-QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFH 349
Query: 186 FPEGKVLPLPAKNYLI----PVDSNGTFCFAFAPTSSSLS----IIGNVQQQGTRVSFNL 237
F L LP N+ P +S C S S + G+ QQQ +V ++L
Sbjct: 350 FSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDL 409
Query: 238 RNSLIGFTPNKC 249
IGF P C
Sbjct: 410 EKERIGFQPMDC 421
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 115/287 (40%), Gaps = 63/287 (21%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
+ +VDN C H G VG AG G G LS P Q++ + FSYCLV +
Sbjct: 222 AVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLV------SH 272
Query: 69 TLEFDSSLPPNA--------------------VTAPLLRNHELDTFYYLGLTGISVGGDL 108
+ D + P+ V PLL N + FY + L +SVG
Sbjct: 273 SFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAAR 332
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPT 159
+ ++D +GNGG++VDSGT T L E Y + +AF R RA T
Sbjct: 333 IQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQT 392
Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN----GTF-----C 210
CY +++ S VP ++ HF + LP +NY + S GT C
Sbjct: 393 G----LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGC 447
Query: 211 FAFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +GN QQQG V +++ +GF +C
Sbjct: 448 LMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/269 (28%), Positives = 121/269 (44%), Gaps = 25/269 (9%)
Query: 1 GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E T S + + +GC + G+LG+ G LSF SQ S FSYC+
Sbjct: 176 GNLVKEKFTFSNSQTTPPLILGCAKES----TDVKGILGMNLGRLSFISQAKISKFSYCI 231
Query: 60 VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
R + ST F PN+ +T P R LD Y + L GI +G
Sbjct: 232 PTRSNRPGLASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQK 291
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
L I + F+ D G+G +VDSG+ T L Y+ +++ VR G+R +
Sbjct: 292 RLNIPSSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA 351
Query: 166 DTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSL 220
D C+D + + + + + F F G + + + L+ V G C +S ++
Sbjct: 352 DMCFDGNHQMVIGRLIGDLVFEFGRGVEILVEKQRLLVNV-GGGIHCVGIGRSSMLGAAS 410
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IIGNV QQ V F++ N +GF+ +C
Sbjct: 411 NIIGNVHQQNLWVEFDVANRRVGFSKAEC 439
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 106/278 (38%), Gaps = 47/278 (16%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSDSTS 68
V+N GC H +G AG G G LS P+Q+ + FSYCLV DS
Sbjct: 213 VNNFTFGCAHTALAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDS-D 268
Query: 69 TLEFDSSL------------------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
L S L P V +L N E FY +GL GIS+G +P
Sbjct: 269 RLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIP 328
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT--- 167
K+D G+GG++VDSGT T L Y ++ F ++ V DT
Sbjct: 329 APGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGLS 388
Query: 168 -CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV----------DSNGTFCFAFAPT 216
CY F + V G + LP +NY G
Sbjct: 389 PCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGE 448
Query: 217 SSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ LS +GN QQQG V ++L N +GF +C
Sbjct: 449 EAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 123/268 (45%), Gaps = 27/268 (10%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS- 53
GD ET+TL S S IGCG NN G F + + GG S +Q+ S
Sbjct: 175 GDLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSI 234
Query: 54 --TFSYCLVDRD------SDSTSTLEF-DSSLPP--NAVTAPLLR-NHELDTFYYLGLTG 101
FSYCLV S +S L F D ++ N ++ P+++ +H FYYL +
Sbjct: 235 GGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSF--FYYLTIEA 292
Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
SVG + + ++ ++E G II+DS T VT + ++ Y L A V D
Sbjct: 293 FSVGDKRVEFAGSSKGVEE---GNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDP 349
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
F CY+ SS + P ++ HF +L L A N + V + CFAFAP++ +
Sbjct: 350 NQQFSLCYNVSSDEEYDFPYMTAHFKGADIL-LYATNTFVEV-ARDVLCFAFAPSNGG-A 406
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I G+ QQ V ++L+ + F C
Sbjct: 407 IFGSFSQQDFMVGYDLQQKTVSFKSVDC 434
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 113/269 (42%), Gaps = 36/269 (13%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
G TETVT+ S S + IGCGHN+ +G++GL G S +Q+
Sbjct: 135 GTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYP 194
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
SYC S TS + F + NA+ A + YYL L +SVG
Sbjct: 195 GLMSYCFA---SQGTSKINFGT----NAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVG 247
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
+ T F E G II+DSGT +T N +R+A +V R PT
Sbjct: 248 DTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAVRTADPTGND 304
Query: 163 ALFDTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL- 220
L CY ++++ P ++ HF G L L N I + GTFC A +
Sbjct: 305 ML---CY---YTDTIDIFPVITMHFSGGADLVLDKYNMYIETITRGTFCLAIICNNPPQD 358
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I GN Q V ++ + L+ F+P C
Sbjct: 359 AIFGNRAQNNFLVGYDSSSLLVFFSPTNC 387
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/263 (26%), Positives = 116/263 (44%), Gaps = 26/263 (9%)
Query: 2 DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAS---- 53
D VT GS + I GCG G G++G G + SF SQ+ +
Sbjct: 188 DLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVK 246
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
+F++CL +++ + P T P+L Y + L I VG +L +S
Sbjct: 247 RSFAHCL--DNNNGGGIFAIGEVVSPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLELS 301
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
AF D + G+I+DSGT + L YN L + + L+ F TC+ ++
Sbjct: 302 SNAF--DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYT 358
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PTSSSLSIIGNV 226
+ PTV+F F + L + + YL V + T+CF + +SL+I+G++
Sbjct: 359 DKLD-RFPTVTFQFDKSVSLAVYPREYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDM 416
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
V +++ N +IG+T + C
Sbjct: 417 ALSNKLVVYDIENQVIGWTNHNC 439
>gi|361068721|gb|AEW08672.1| Pinus taeda anonymous locus CL1136Contig1_04 genomic sequence
gi|383173175|gb|AFG69965.1| Pinus taeda anonymous locus CL1136Contig1_04 genomic sequence
gi|383173176|gb|AFG69966.1| Pinus taeda anonymous locus CL1136Contig1_04 genomic sequence
Length = 80
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/80 (55%), Positives = 54/80 (67%), Gaps = 1/80 (1%)
Query: 86 LRNHELDTFYYLGLTGISVGGD-LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L N +LDTFYY+ L GISVGG L I + FK+D +GNGG+I+DSGT+VTRL Y A
Sbjct: 1 LSNPKLDTFYYVELVGISVGGRRLTSIPASVFKMDATGNGGVIIDSGTSVTRLVESAYTA 60
Query: 145 LRDAFVRGTRALSPTDGVAL 164
+RDAF GT L G +L
Sbjct: 61 MRDAFRAGTGNLKSAGGFSL 80
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 35/283 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
G + + +G + +A GC ++ G A+G++GLG G LS SQ++ F+YC
Sbjct: 179 GTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYC 238
Query: 59 LVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPI-- 111
L S L D+ NA + P+ R+ ++YYL L G+ +G + +
Sbjct: 239 LPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPP 298
Query: 112 ---------------------SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
+ TA + ++ G+I+D + +T L+ Y+ L +
Sbjct: 299 TTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLE 358
Query: 151 RGTRALSPTDGVALFDTCY---DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG 207
R T D C+ D + V VP V+ F +G+ L L +G
Sbjct: 359 VEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAF-DGRWLRLDKARLFAEDRESG 417
Query: 208 TFCFAFAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C + S+SI+GN QQQ +V +NLR + F + C
Sbjct: 418 MMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 117/271 (43%), Gaps = 31/271 (11%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E T S S + +GC + G+LG+ G LSF SQ S FSYC+
Sbjct: 165 GNLVREKFTFSKSLSTPPVILGCAQAS----TENRGILGMNHGRLSFISQAKISKFSYCV 220
Query: 60 VDR------------DSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYY-LGLTGISVG 105
R D+ ++S ++ + L P + ++P LD Y L + I +
Sbjct: 221 PSRTGSNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIA 275
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA-- 163
G L I AFK D G+G ++DSG+ +T L E Y +++ VR A+ V
Sbjct: 276 GKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAD 335
Query: 164 LFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--- 218
+ D C+D + V + +SF F G + + ++ G C +
Sbjct: 336 VADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGI 395
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IIG V QQ V ++L N +GF +C
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAEC 426
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 119/247 (48%), Gaps = 15/247 (6%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + + +GC +G F + G+L LG ++SF S+ A FSYCLVD ++
Sbjct: 135 AKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 194
Query: 67 TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
+S L F A PL+ + + FY + + + V G+ L I + +
Sbjct: 195 SSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGR--G 252
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
GG I+DSGT++T L T Y A+ A + G A P + F+ CY++++ + E+P +
Sbjct: 253 GGAILDSGTSLTVLATPAYRAVVAA-LGGRLAALPRVAMDPFEYCYNWTA-GAPEIPKLE 310
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLI 242
F L PAK+Y+I + G C + +S+IGN+ QQ F+LR+ +
Sbjct: 311 VSFAGSARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWL 369
Query: 243 GFTPNKC 249
F +C
Sbjct: 370 RFKHTRC 376
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 106/278 (38%), Gaps = 47/278 (16%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCLVDRDSDSTS 68
V+N GC H +G AG G G LS P+Q+ + FSYCLV DS
Sbjct: 213 VNNFTFGCAHTALAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDS-D 268
Query: 69 TLEFDSSL------------------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
L S L P V +L N E FY +GL GIS+G +P
Sbjct: 269 RLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIP 328
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT--- 167
K+D G+GG++VDSGT T L Y ++ F ++ V DT
Sbjct: 329 APGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGLS 388
Query: 168 -CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV----------DSNGTFCFAFAPT 216
CY F + V G + LP +NY G
Sbjct: 389 PCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGD 448
Query: 217 SSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ LS +GN QQQG V ++L N +GF +C
Sbjct: 449 EAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 119/247 (48%), Gaps = 15/247 (6%)
Query: 13 ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
A + + +GC +G F + G+L LG ++SF S+ A FSYCLVD ++
Sbjct: 226 AKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 285
Query: 67 TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
+S L F A PL+ + + FY + + + V G+ L I + +
Sbjct: 286 SSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGR--G 343
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
GG I+DSGT++T L T Y A+ A + G A P + F+ CY++++ + E+P +
Sbjct: 344 GGAILDSGTSLTVLATPAYRAV-VAALGGRLAALPRVAMDPFEYCYNWTA-GAPEIPKLE 401
Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLI 242
F L PAK+Y+I + G C + +S+IGN+ QQ F+LR+ +
Sbjct: 402 VSFAGSARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWL 460
Query: 243 GFTPNKC 249
F +C
Sbjct: 461 RFKHTRC 467
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 35/283 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
G + + +G + +A GC ++ G A+G++GLG G LS SQ++ F+YC
Sbjct: 179 GTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYC 238
Query: 59 LVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPI-- 111
L S L D+ NA + P+ R+ ++YYL L G+ +G + +
Sbjct: 239 LPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPP 298
Query: 112 ---------------------SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
+ TA + ++ G+I+D + +T L+ Y+ L +
Sbjct: 299 TTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLE 358
Query: 151 RGTRALSPTDGVALFDTCY---DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG 207
R T D C+ D + V VP V+ F +G+ L L +G
Sbjct: 359 VEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAF-DGRWLRLDKARLFAEDRESG 417
Query: 208 TFCFAFAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C + S+SI+GN QQQ +V +NLR + F + C
Sbjct: 418 MMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 105/262 (40%), Gaps = 44/262 (16%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
S +V+N C H VG AG G G LS P+Q+ S D + S +
Sbjct: 215 SMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGS--TDAAAIGASETD 269
Query: 72 FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSG 131
F V PLL N + FY + L +SVGG + +D GNGG++VDSG
Sbjct: 270 F--------VYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSG 321
Query: 132 TAVTRLQTETYNALRD-----------AFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
T T L ++T+ + D G A + G+A CY +S S VP
Sbjct: 322 TTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQT---GLA---PCYHYSP-SDRAVP 374
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSS----------LSIIGNVQ 227
V+ HF + LP +NY + S C + +GN Q
Sbjct: 375 PVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQ 434
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
QQG V +++ +GF +C
Sbjct: 435 QQGFEVVYDVDAGRVGFARRRC 456
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 117/263 (44%), Gaps = 25/263 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G T+T +G+A ++A GC ++ G +G++GLG S +Q + FSYCL
Sbjct: 139 GKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197
Query: 60 VDRDSDSTSTLEFDSSLP----PNAVTAPLL----RNHELDTFYYLGLTGISVGGDLLPI 111
D+ S L SS A + P + ++L +Y + L G+ G ++P+
Sbjct: 198 APHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPL 257
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ +++D+ + ++ L Y A++ A A V FD C+
Sbjct: 258 PPS--------GSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPK 309
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNV 226
S S P + F F G + +PA NYL+ NGT C A ++ + LS++G++
Sbjct: 310 SGASGA-APDLVFTFRGGAAMTVPATNYLLDYK-NGTVCLAMLSSARLNSTTELSLLGSL 367
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ F+L + F P C
Sbjct: 368 QQENIHFLFDLDKETLSFEPADC 390
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 118/263 (44%), Gaps = 25/263 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G T+T +G+A ++A GC ++ G +G++GLG S +Q + FSYCL
Sbjct: 139 GKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197
Query: 60 VDRDSDSTSTLEFDSSLP----PNAVTAPLL----RNHELDTFYYLGLTGISVGGDLLPI 111
D+ S L SS A + P + ++L +Y + L G+ G ++P+
Sbjct: 198 APHDAGRNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPL 257
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ +++D+ + ++ L Y A++ A A V FD C+
Sbjct: 258 PPS--------GSTVLLDTFSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFP- 308
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNV 226
S +S P + F F G + +PA NYL+ NGT C A ++ + LS++G++
Sbjct: 309 KSGASGAAPDLVFTFRGGAAMTVPATNYLLDYK-NGTVCLAMLSSARLNSTTELSLLGSL 367
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ F+L + F P C
Sbjct: 368 QQENIHFLFDLDKETLSFEPADC 390
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 110/260 (42%), Gaps = 23/260 (8%)
Query: 6 ETVTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA----- 52
+TV LG + V N I GC G G+ G G G+LS SQ+++
Sbjct: 190 DTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249
Query: 53 STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
FS+CL + + L L P+ V +PL+ + Y L L I+V G LLPI
Sbjct: 250 KVFSHCLKGGE-NGGGVLVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQLLPID 305
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
F + N G IVDSGT + L E YN A S ++ + CY S
Sbjct: 306 SNVFA--TTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPI-ISKGNQCYLVS 362
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
+ P VS +F G + L ++YL+ +D +C F +I+G++ +
Sbjct: 363 NSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLK 422
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
++L N IG+ C
Sbjct: 423 DKIFVYDLANQRIGWADYDC 442
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 109/271 (40%), Gaps = 57/271 (21%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFV-----------------GAAGLLGLGGGS 43
G T+TV LG ASVD GCG +N GL AAG L LGG +
Sbjct: 212 GVLATDTVALGGASVDGFVFGCGLSNRGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDT 271
Query: 44 LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
S+ NA+ SY + + D + PP FY++ +TG S
Sbjct: 272 SSY---RNATPVSY----------TRMIADPAQPP---------------FYFMNVTGAS 303
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDG 161
V A G +++DSGT +TRL Y A+R F R G
Sbjct: 304 V-------GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPP 356
Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--S 218
+L D CY+ + V+VP ++ G + + A L +G+ C A A S
Sbjct: 357 FSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSFED 416
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN QQ+ RV ++ S +GF C
Sbjct: 417 QTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 447
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 104/243 (42%), Gaps = 25/243 (10%)
Query: 32 GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHE 90
A GLLG+ GSLSF +Q F+YC+ L D P PL+ +
Sbjct: 197 AATGLLGMNRGSLSFVTQTATLRFAYCIAPGQGPGILLLGGDGGAAPPLNYTPLIEISQP 256
Query: 91 LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
L F Y + L GI VG LL I ++ D +G G +VDSGT T L + Y AL+
Sbjct: 257 LPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALK 316
Query: 147 DAFVRGTRALSPTDG------VALFDTCY----DFSSRSSVEVPTVSFHFPEGKVLPLPA 196
F+ R+L G FD C+ + S +S +P V +V
Sbjct: 317 AEFLNQARSLLAPLGEPGFVFQGAFDACFRGPEERVSAASRLLPEVGLVLRGAEVAVAGE 376
Query: 197 K-NYLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
K Y +P + G +C F + S +IG+ QQ V ++L+N +GF P
Sbjct: 377 KLLYSVPGERRGEEGAEAVWCLTFGNSDMAGMSAYVIGHHHQQDVWVEYDLQNGRVGFAP 436
Query: 247 NKC 249
+C
Sbjct: 437 ARC 439
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 137/275 (49%), Gaps = 33/275 (12%)
Query: 1 GDFVTETVTLGSASVDNIA-----IGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
GD TETV++ SAS ++ GCG+NN G F +G++GLGGG LS SQ+ +S
Sbjct: 176 GDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI 235
Query: 54 --TFSYCLVDRDS--DSTSTLEFDSSLPPNA-------VTAPLLRNHELDTFYYLGLTGI 102
FSYCL + + + TS + ++ P++ V+ PL+ L T+YYL L I
Sbjct: 236 SKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAI 294
Query: 103 SVGGDLLPISETAFKIDESG-----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTR 154
SVG +P + +++ ++ G +G II+DSGT +T L+ ++ A V G +
Sbjct: 295 SVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAK 354
Query: 155 ALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA 214
+S G L C+ S + + +P ++ HF G + L N + + S C +
Sbjct: 355 RVSDPQG--LLSHCFK-SGSAEIGLPEITVHF-TGADVRLSPINAFVKL-SEDMVCLSMV 409
Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
PT + ++I GN Q V ++L + F C
Sbjct: 410 PT-TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDC 443
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/250 (30%), Positives = 112/250 (44%), Gaps = 28/250 (11%)
Query: 17 NIAIGCGH-----NNEGLFVGAAGLLGLGGG-SLSFPSQINASTFSYCLVDRD----SDS 66
NI GCGH NN+ + G+ GLG ++ +Q+ + FSYC+ D + + +
Sbjct: 226 NITFGCGHMNIKTNNDDAY---NGVFGLGAYPHITMATQL-GNKFSYCIGDINNPLYTHN 281
Query: 67 TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
L S + ++ + H YY+ L ISVG L I AFKI G+GG+
Sbjct: 282 HLVLGQGSYIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNAFKISSDGSGGV 336
Query: 127 IVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVALFDTCYD-FSSRSSVEVPTV 182
++DSG T+L + L D V +G PT C+ SR V P V
Sbjct: 337 LIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR-KFEGLCFKGVVSRDLVGFPAV 395
Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQQGTRVSFNLRN 239
+FHF G L L + + L FC A P++S +LS+IG + QQ V F+L
Sbjct: 396 TFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQ 454
Query: 240 SLIGFTPNKC 249
+ F C
Sbjct: 455 MKVFFRRIDC 464
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 113/262 (43%), Gaps = 29/262 (11%)
Query: 14 SVDNIAIGCGHNNEGLFVGA----AGLLGLGGGSLSF--------PSQINASTFSYCL-- 59
V + IGC HN++G + AG+LGLG + S + FSYCL
Sbjct: 194 EVTGVVIGCTHNSKGFNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPS 253
Query: 60 -VDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPIS 112
SD + L FD +P + V+ ++ + Y++ LTGISV G L
Sbjct: 254 HGSSSSDHHTFLRFDDDVPNTQHMVSTKIMYMDSTTSRDFRAYFVSLTGISVAGKPLQDV 313
Query: 113 ETAFKIDESGN---GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
+ FK G G D+GT + YN L+DA VR + L + C+
Sbjct: 314 KELFKRHVHGQVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPLGLQIVSGQYHLCF 373
Query: 170 DFSSRSSVEVPTVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
+S+ +PTV F E + L LP + + V + C A S ++IIG +QQ
Sbjct: 374 RATSQLWQHLPTVMLQFAETEARLVLPPQRLFVAVGYD--ICLAVV-RSYDITIIGAMQQ 430
Query: 229 QGTRVSFNLRNSLIGFTP-NKC 249
R +++R+ I F P N C
Sbjct: 431 VDKRFVYDVRHGRIYFVPENAC 452
>gi|194690050|gb|ACF79109.1| unknown [Zea mays]
Length = 166
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 82/169 (48%), Gaps = 15/169 (8%)
Query: 84 PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
PLL+ FY + LTGI+VGG + T F + IVDSGT +T L YN
Sbjct: 7 PLLQGP----FYLVNLTGITVGGQ--EVESTGF------SARAIVDSGTVITSLVPSVYN 54
Query: 144 ALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV 203
A+R F+ G ++ DTC++ + V+VP+++ F G + + + L V
Sbjct: 55 AVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFV 114
Query: 204 DSNGT-FCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S+ + C A A S SIIGN QQ+ RV F+ S +GF C
Sbjct: 115 SSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 163
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/244 (28%), Positives = 111/244 (45%), Gaps = 32/244 (13%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCLV-----DRDSDSTSTLEFDSSL--PPNAVTAPL 85
G+ G G+LSFPSQ+ FS+C + + + S+ + D++L N P+
Sbjct: 164 GIAGFVRGTLSFPSQLGLLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPM 223
Query: 86 LRNHELDTFYYLGLTGISVG---GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
L++ +YY+GL I+VG +P++ F D GNGG+++DSGT T L Y
Sbjct: 224 LKSPMYPNYYYIGLEAITVGNVSATTVPLNLREF--DSQGNGGMLIDSGTTYTHLPEPFY 281
Query: 143 NALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSSVEV------PTVSFHFPEGKVLP 193
+ L F + RA + + A FD CY ++ P+++FHF
Sbjct: 282 SQLLSIFKAIITYPRA-TEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFV 340
Query: 194 LPAKNYLI----PVDSNGTFCFAFAPTSSS----LSIIGNVQQQGTRVSFNLRNSLIGFT 245
LP N+ P +S C F + S + G+ QQQ ++ ++L IGF
Sbjct: 341 LPQGNHFYAMSAPSNSTVVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQ 400
Query: 246 PNKC 249
P C
Sbjct: 401 PMDC 404
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 115/273 (42%), Gaps = 29/273 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN-------NEGLFVGAAGLLGLGGGSLSFPSQINAS 53
G ET +L A+ GC + NE GL+G+ GSLS +Q+
Sbjct: 149 GTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINED--AKTTGLMGMNRGSLSLVTQMVLP 206
Query: 54 TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTF-----YYLGLTGISVGGDL 108
FSYC+ D+ L S P PL+ + Y + L GI V L
Sbjct: 207 KFSYCISGEDAFGVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKL 266
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALF- 165
L + ++ F D +G G +VDSGT T L YN+L+D F+ T+ + D +F
Sbjct: 267 LQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFE 326
Query: 166 ---DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS--NGTFCFAFAPTSSSL 220
D CY + S VP V+ F G + + + L V + +CF F S L
Sbjct: 327 GAMDLCYH-APASLAAVPAVTLVF-SGAEMRVSGERLLYRVSKGRDWVYCFTFG-NSDLL 383
Query: 221 SI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I IG+ QQ + F+L S +GFT C
Sbjct: 384 GIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 117/271 (43%), Gaps = 31/271 (11%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E T S S + +GC + G+LG+ G LSF SQ S FSYC+
Sbjct: 165 GNLVREKFTFSKSLSTPPVILGCAQAS----TENRGILGMNRGRLSFISQAKISKFSYCV 220
Query: 60 VDR------------DSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYY-LGLTGISVG 105
R D+ ++S ++ + L P + ++P LD Y L + I +
Sbjct: 221 PSRTGSNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIA 275
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA-- 163
G L + AFK D G+G ++DSG+ +T L E Y +++ VR A+ V
Sbjct: 276 GKRLNVPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAD 335
Query: 164 LFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--- 218
+ D C+D + V + +SF F G + + ++ G C +
Sbjct: 336 VADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGI 395
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IIG V QQ V ++L N +GF +C
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAEC 426
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 80.9 bits (198), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 115/269 (42%), Gaps = 27/269 (10%)
Query: 1 GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
G +V++T LG + + N I GC G G+ G G G LS S
Sbjct: 178 GYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVIS 237
Query: 49 QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q+++ FS+CL DS L L P V +PL+ + Y L L I+
Sbjct: 238 QLSSHGITPRVFSHCLKGEDSGG-GILVLGEILEPGIVYSPLVPSQP---HYNLDLQSIA 293
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
V G LLPI AF S N G I+D+GT + L E Y+ A L+ T +
Sbjct: 294 VSGQLLPIDPAAFA--TSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLA-TPTIN 350
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSSL 220
+ CY S+ S P VSF+F G + L + YL+ + + +C F +
Sbjct: 351 KGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGI 410
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I+G++ + ++L + IG+ C
Sbjct: 411 TILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 112/266 (42%), Gaps = 33/266 (12%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
G VTETVT+ S S + IGCG NN G G AG++GL G S +Q+
Sbjct: 135 GTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYP 194
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
SYC + TS + F + NA+ A + FYYL L +SVG
Sbjct: 195 GLMSYCFAGK---GTSKINFGA----NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVG 247
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTDGVAL 164
+ T F + G I++DSG+ +T N +R A + A+ P +
Sbjct: 248 NTRIETVGTPF---HALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRFPRSDIL- 303
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSII 223
CY S++ P ++ HF G L L N + ++ G FC A S +I
Sbjct: 304 ---CY--YSKTIDIFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIEEAIF 358
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN Q V ++ + L+ F P C
Sbjct: 359 GNRAQNNFLVGYDSSSLLVSFKPTNC 384
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 112/266 (42%), Gaps = 33/266 (12%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
G VTETVT+ S S + IGCG NN G G AG++GL G S +Q+
Sbjct: 141 GTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYP 200
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
SYC + TS + F + NA+ A + FYYL L +SVG
Sbjct: 201 GLMSYCFAGK---GTSKINFGA----NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVG 253
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTDGVAL 164
+ T F + G I++DSG+ +T N +R A + A+ P +
Sbjct: 254 NTRIETVGTPF---HALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRFPRSDIL- 309
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSII 223
CY S++ P ++ HF G L L N + ++ G FC A S +I
Sbjct: 310 ---CY--YSKTIDIFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIEEAIF 364
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GN Q V ++ + L+ F P C
Sbjct: 365 GNRAQNNFLVGYDSSSLLVSFKPTNC 390
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 117/272 (43%), Gaps = 31/272 (11%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E + S + + +GC + A G+LG+ G LSFP Q + FSYC+
Sbjct: 177 GNLVREKLAFSPSQTTPPLILGCSSESRD----ARGILGMNLGRLSFPFQAKVTKFSYCV 232
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTF-------------YYLGLTGISVGG 106
R + + S N + R + TF Y + + GI +GG
Sbjct: 233 PTRQPANNNNFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGG 292
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVAL 164
L I + F+ + G+G +VDSG+ T L Y+ +R+ +R G R +
Sbjct: 293 RKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGV 352
Query: 165 FDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--- 217
D C+D +++E+ V+F F +G + +P + L V G C +
Sbjct: 353 ADMCFD---GNAMEIGRLLGDVAFEFEKGVEIVVPKERVLADV-GGGVHCVGIGRSERLG 408
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++ +IIGN QQ V F+L N IGF C
Sbjct: 409 AASNIIGNFHQQNLWVEFDLANRRIGFGVADC 440
>gi|297724111|ref|NP_001174419.1| Os05g0403000 [Oryza sativa Japonica Group]
gi|50878436|gb|AAT85210.1| hypothetical protein [Oryza sativa Japonica Group]
gi|222631539|gb|EEE63671.1| hypothetical protein OsJ_18489 [Oryza sativa Japonica Group]
gi|255676353|dbj|BAH93147.1| Os05g0403000 [Oryza sativa Japonica Group]
Length = 437
Score = 80.5 bits (197), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 73/267 (27%), Positives = 110/267 (41%), Gaps = 35/267 (13%)
Query: 13 ASVDNIAIGCGHN--NEGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSD 65
A+V CGH EGL GA G++ L +FP+Q+ + F+ CL +
Sbjct: 156 ATVPEFLFTCGHTFLTEGLANGATGMVSLSRARFAFPTQLARTFGFSRRFALCLPPASAA 215
Query: 66 ------------------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
S S+L + L TA E Y +GLTGI V G
Sbjct: 216 GVVVFGDAPYVFQPGVDLSKSSLIYTPLLVNAVRTAGKYTTGETSIEYLIGLTGIKVNGR 275
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+P++ T ID++G GG + + + T L+T Y A+ DAF T + VA F+
Sbjct: 276 DVPLNATLLAIDKNGVGGTTLSTASPYTVLETSIYKAVIDAFAAETATIPRVPAVAPFEL 335
Query: 168 CYD----FSSRSSVEVPTVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSSL-- 220
CYD S+R+ VPT+ V + N ++P G C +L
Sbjct: 336 CYDGRKVGSTRAGPAVPTIELVLQREAVSWIMYGANSMVPAK-GGALCLGVVDGGPALYP 394
Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFT 245
+IG + + F+L S +GF+
Sbjct: 395 SSVVIGGHMMEDNLLEFDLEGSRLGFS 421
>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
Length = 739
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/246 (34%), Positives = 123/246 (50%), Gaps = 20/246 (8%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TFSYCLVDR-DSDS 66
S S I IGCG NN G F G++GLGGG +S S I S +SYCLV + +S
Sbjct: 55 SVSFPKIPIGCGLNNAGTFDSKCFGIVGLGGGVVSLISHIGLSIDSKYSYCLVPLFEFNS 114
Query: 67 TSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
TS + F + V+ P++ DTFYYL L G+SVG + + + + GN
Sbjct: 115 TSKINFGENAVVEGLGTVSTPIIPG-SFDTFYYLKLEGMSVGSKRIDFVDASTSNELKGN 173
Query: 124 GGIIVDSGTAVTRLQTETYNALR---DAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
II+DSGT +T L Y L +A + R ++ TD + CY +++EVP
Sbjct: 174 --IIIDSGTTLTILLENFYTKLEAEVEAHINLER-VNSTDQI--LSLCYKSPPNNAIEVP 228
Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
++ HF G + L + N + V + + FAFAP +S SI GN+ Q V ++L
Sbjct: 229 IITTHFA-GVDIVLNSLNTFVSVFDDAMW-FAFAPVASG-SIFGNLAQMNHLVGYDLLRK 285
Query: 241 LIGFTP 246
+ F P
Sbjct: 286 TVSFKP 291
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 122/271 (45%), Gaps = 29/271 (10%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E T S + + +GC + G+LG+ G LSF SQ S FSYC+
Sbjct: 175 GNLVKEKFTFSNSQTTPPLILGCAKES----TDEKGILGMNLGRLSFISQAKISKFSYCI 230
Query: 60 VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
R + ST F PN+ +T P R LD Y + L GI +G
Sbjct: 231 PTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQK 290
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
L I + F+ D G+G +VDSG+ T L Y+ +++ VR G+R +
Sbjct: 291 RLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA 350
Query: 166 DTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---S 218
D C+D S+E+ + F F G + + ++ L+ V G C +S +
Sbjct: 351 DMCFD--GNHSMEIGRLIGDLVFEFGRGVEILVEKQSLLVNV-GGGIHCVGIGRSSMLGA 407
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +IIGNV QQ V F++ N +GF+ +C
Sbjct: 408 ASNIIGNVHQQNLWVEFDVTNRRVGFSKAEC 438
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 126/269 (46%), Gaps = 23/269 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAA---GLLGLGGGSLSFPSQINASTFSY 57
G F TET LG+ +V NI GCG N+G + A G+ G G +S +Q+ FSY
Sbjct: 158 GYFATETFALGNVTVANITFGCGTRNQGYYDNVAGVFGVGRGGRGGVSLLNQLGIDRFSY 217
Query: 58 CLVDRDSDSTSTLEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
C + +S + S A + P++ + L + Y++ L G++VG L+
Sbjct: 218 CFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATLVD 277
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALF 165
++ + E G +++DS + VT L TY +R A V L + GV L
Sbjct: 278 VAGASSA--EGGGRALVIDSTSPVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGL- 334
Query: 166 DTCYDFSSRSSVEVP---TVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-L 220
D C++ ++ + P T++ HF G L LP +YL + G C P+SS+ +
Sbjct: 335 DLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGV 394
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++G+ T V ++L +++ F P C
Sbjct: 395 PVLGSWALLDTLVLYDLAKNVVSFQPLDC 423
>gi|50511404|gb|AAT77327.1| hypothetical protein [Oryza sativa Japonica Group]
gi|222631431|gb|EEE63563.1| hypothetical protein OsJ_18380 [Oryza sativa Japonica Group]
Length = 480
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 117/272 (43%), Gaps = 31/272 (11%)
Query: 1 GDFVTETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G+ + T G S D + GC + EG F GA+G+LGL G+LS SQ+N F
Sbjct: 163 GNLAVQNFTFGDDSEDTAVKGVVTFGCSSSTEGDF-GASGVLGLNKGNLSLVSQLNLGRF 221
Query: 56 SYCLVDR--DSDSTSTLEFDSSLPPNAVTAP-----------------LLRNHELDTFYY 96
SY +D+ + +F + +T P +R+ LD Y+
Sbjct: 222 SYYFAPEVNTTDNNAADDFIVFGDDDGITVPGNSGGSRPRYTPFFTTGAVRSANLD-LYF 280
Query: 97 LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
+ LTGI VGG L + ++ S VT L+ Y L+ V +
Sbjct: 281 VELTGIRVGGKDLQLGGGGGGSAGGSLEAVLSTS-VPVTYLEKNAYGLLKKELVSALGSN 339
Query: 157 SPTDGVAL-FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
+ DG AL D CY ++P ++F F V+ L NYL + G C P
Sbjct: 340 NTEDGSALGLDLCYRSQHMDRAKIPDIAFVFGGNAVMKLQQWNYLYQDEDTGLECLTIPP 399
Query: 216 T---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
+ S LS+IG++ Q GT + ++L S +GF
Sbjct: 400 SPDDSDGLSLIGSMIQTGTYMIYDLHKSRLGF 431
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 121/278 (43%), Gaps = 39/278 (14%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN-------NEGLFVGAAGLLGLGGGSLSFPSQINAS 53
G ET +L A+ GC + NE GL+G+ GSLS +Q++
Sbjct: 150 GTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINED--SKTTGLMGMNRGSLSLVTQMSLP 207
Query: 54 TFSYCLVDRDS----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
FSYC+ D+ D+ S L++ + L ++P Y + L GI
Sbjct: 208 KFSYCISGEDALGVLLLGDGTDAPSPLQY-TPLVTATTSSPYFNR----VAYTVQLEGIK 262
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDG 161
V LL + ++ F D +G G +VDSGT T L Y++L+D F+ T+ + D
Sbjct: 263 VSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDP 322
Query: 162 VALF----DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAP 215
+F D CY + S VP V+ F G + + + L V S+ +CF F
Sbjct: 323 NFVFEGAMDLCYH-APASFAAVPAVTLVF-SGAEMRVSGERLLYRVSKGSDWVYCFTFG- 379
Query: 216 TSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S L I IG+ QQ + F+L S +GFT C
Sbjct: 380 NSDLLGIEAYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417
>gi|413953789|gb|AFW86438.1| hypothetical protein ZEAMMB73_078928 [Zea mays]
Length = 155
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 78/159 (49%), Gaps = 14/159 (8%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
TF + L GI+VGG L + +AF +GG+IVD GT +T LQ+ Y ALR AF +
Sbjct: 9 TFSTVTLAGINVGGKKLDLRPSAF------SGGMIVDCGTVITGLQSTAYRALRSAFRKA 62
Query: 153 TRA--LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
A L P + DTCY+ + +V VP ++ F G + L N + NG
Sbjct: 63 MEAYRLLPNGDL---DTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGSL---VNGCLA 116
Query: 211 FAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
FA + S ++GNV Q+ V F+ S GF C
Sbjct: 117 FAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 155
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 70/253 (27%), Positives = 110/253 (43%), Gaps = 22/253 (8%)
Query: 18 IAIGCGHNNEGLF--VGAAGLLGLGGGSL-----SFPSQI---NASTFSYCLVDRDSDST 67
I GC H E AG+LGLG G +F Q+ + FSYC
Sbjct: 207 IVFGCAHQTEHFKNQRAVAGILGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMY 266
Query: 68 STLEFDSSLP----PNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDE 120
S L F S +P PN + P+L Y++ L G+SVG + L ++ F+ +
Sbjct: 267 SYLRFGSDIPSHPPPNVHRQSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNA 326
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
G GG +VD GT +T Y + A + + V +TC + +P
Sbjct: 327 HGAGGCVVDIGTRMTAFIHSAYVHIDHAVRQHLQRRGAHIVVVRGNTCVQQPAPHHDVLP 386
Query: 181 TVSFHFPEGKVLPLPAKNYLIP--VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
+++ HF G L + ++ +P V + CF F +S+ L++IG QQ R F+L
Sbjct: 387 SMTLHFENGAWLRVMPEHVFMPFVVGGHHYQCFGFV-SSTDLTVIGARQQVNHRFIFDLH 445
Query: 239 NS--LIGFTPNKC 249
++ ++ F P C
Sbjct: 446 DTIPIMSFNPEDC 458
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 84/263 (31%), Positives = 118/263 (44%), Gaps = 19/263 (7%)
Query: 1 GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGS----LSFPSQIN 51
GD ET+TLGS ++ IGCGHNN+G F + GG +S S
Sbjct: 184 GDLSVETLTLGSTDGSSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSI 243
Query: 52 ASTFSYCLVD--RDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
FSYCL S+S+S L F D ++ V+ P++ + L FY+L L SVG
Sbjct: 244 GGKFSYCLAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLG-FYFLTLEAFSVGD 302
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+ + ++ G G II+DSGT +T L + Y L A D
Sbjct: 303 NRI-EFGSSSFESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLR 361
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
CY +S + VP ++ HF +G + L + I VD G CFAF +S I GN+
Sbjct: 362 LCYRTTSSDELNVPVITAHF-KGADVELNPISTFIEVDE-GVVCFAFR-SSKIGPIFGNL 418
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ V ++L + F P C
Sbjct: 419 AQQNLLVGYDLVKQTVSFKPTDC 441
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 79.7 bits (195), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 112/275 (40%), Gaps = 30/275 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G ++E + + +V + +GC + G+ G G G S P+Q+N + FSYCL+
Sbjct: 326 GFLLSENLNFPAKNVSDFLVGCSVVS---VYQPGGIAGFGRGEESLPAQMNLTRFSYCLL 382
Query: 61 DRDSDST---STLEFDSS-----LPPNAVTA------PLLRNHELDTFYYLGLTGISVGG 106
D + S L +++ N V+ P + +YY+ L I VG
Sbjct: 383 SHQFDESPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGE 442
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG---TRALSPTDGVA 163
+ + + D +G+GG IVDSG+ +T ++ ++ + + FV+ TRA
Sbjct: 443 KRVRVPRRMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEFVKQVNYTRARELEKQFG 502
Query: 164 LFDTCYDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--- 219
L C+ + + P + F F G + LP NY V C +
Sbjct: 503 L-SPCFVLAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQG 561
Query: 220 -----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+GN QQQ V +L N GF C
Sbjct: 562 GAVGPAVILGNYQQQNFYVECDLENERFGFRSQSC 596
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 79.7 bits (195), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 121/274 (44%), Gaps = 30/274 (10%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++ +GS+++ GC + N GL+G+ GSLSF +Q+ FS
Sbjct: 1089 GNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS 1148
Query: 57 YCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGDLLP 110
YC+ RDS + S N PL++ + L F Y + L GI VG +LP
Sbjct: 1149 YCISGRDSSGVLLFGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILP 1208
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF----- 165
+ ++ F D +G G +VDSGT T L Y ALR+ F+ T+ + G F
Sbjct: 1209 LPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGA 1268
Query: 166 -DTCYDFSSRSSV-EVPTVSFHFPEGK-VLPLPAKNYLIPVDSNG---TFCFAFAPTSSS 219
D CY ++ + +P+VS F + V+ Y +P G +C F S
Sbjct: 1269 MDLCYSVAAGGKLPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFG-NSDL 1327
Query: 220 LSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L I IG+ QQ + F+ L+ F + C
Sbjct: 1328 LGIEAFVIGHHHQQNVWMEFD----LVAFAADLC 1357
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 79.7 bits (195), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 115/270 (42%), Gaps = 28/270 (10%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E T S S + +GC + G+LG+ G LSF Q + FSYC+
Sbjct: 164 GNLVREKFTFSRSVSTPPLILGCATES----TDPRGILGMNLGRLSFAKQSKITKFSYCV 219
Query: 60 VDRDSDS--TSTLEFDSSLPPNA--------VTAPLLRNHELDTFYY-LGLTGISVGGDL 108
R + T T F P++ +T+ R D Y + + GI + G
Sbjct: 220 PPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKK 279
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFD 166
L IS F+ D G+G ++DSG+ T L +E Y+ +R VR G R + D
Sbjct: 280 LNISPAVFRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVAD 339
Query: 167 TCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SS 219
C+D S +VE+ + F F G + +P + L V G C + ++
Sbjct: 340 MCFD--SVKAVEIGRLIGEMVFEFERGVEVVIPKERVLADV-GGGVHCVGIGSSDKLGAA 396
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IIGN QQ V F+L +GF C
Sbjct: 397 SNIIGNFHQQNLWVEFDLVRRRVGFGKADC 426
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 102/276 (36%), Gaps = 45/276 (16%)
Query: 16 DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSDS--- 66
+N GC H +G AG G G LS P+Q+ + FSYCLV DS
Sbjct: 214 NNFTFGCAHTTLAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRV 270
Query: 67 --------------TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
+ P+ V +L N FY +GL GIS+G +P
Sbjct: 271 RRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCVGLEGISIGRKKIPAP 330
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT----C 168
+ K+D G+GG++VDSGT T L Y+ + F ++ V +T C
Sbjct: 331 DFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERASVIEENTGLSPC 390
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV--------DSNGTFCFAFAPTSSSL 220
Y F + V G + LP +NY C
Sbjct: 391 YYFDNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKRKVGCLMLMNGGDEA 450
Query: 221 SI-------IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +GN QQQG V ++L N +GF +C
Sbjct: 451 ELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 121/271 (44%), Gaps = 31/271 (11%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E + L S + I +GC + ++ A G+LG+ G LSFP+Q + FSY +
Sbjct: 165 GNLVRENIALSPSLTTPPIILGCANQSDD----ARGILGMNLGRLSFPNQAKITKFSYFV 220
Query: 60 -VDRDSDSTSTLEFDSSLPPNAVT---APLL--------RNHELDTFYY-LGLTGISVGG 106
V + + +L ++ PN+ LL R LD + L + GIS+GG
Sbjct: 221 PVKQTQPGSGSLYLGNN--PNSSCFRYVKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGG 278
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD----GV 162
L I + FK D +G G I+DSG+ + + + YN +R+ V+ + D GV
Sbjct: 279 KKLNIPPSVFKPDTTGFGQTIIDSGSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGV 338
Query: 163 ALFDTCYD-FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
A D C+D ++ V + F F +G + +P + LI VD G CF
Sbjct: 339 A--DICFDGDATEIGRLVGDMVFEFEKGVEIVIPKERVLIEVDG-GVHCFGIGRAEGLGG 395
Query: 222 IIGNVQ---QQGTRVSFNLRNSLIGFTPNKC 249
+ QQ V F+L +GF C
Sbjct: 396 GGNIIGNFYQQNLWVEFDLAKHRVGFRGANC 426
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 115/263 (43%), Gaps = 21/263 (7%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN--- 51
G ETVT S V +I GCGH+N G F G++GLGGG LS SQ
Sbjct: 138 GVLARETVTFSSTDGEPVVVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLY 197
Query: 52 -ASTFSYCLV--DRDSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
+ FS CLV D + T+ F S + V A L + E T Y + L GISVG
Sbjct: 198 GSKRFSQCLVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGD 257
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+ + + G I++DSGT T L E Y+ L + L P D
Sbjct: 258 TFVSFNSSEML----SKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNML-PIDDDPDLG 312
Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
T + S +++E P + HF V +P + ++ P D G FCFA A T+ I GN
Sbjct: 313 TQLCYRSETNLEGPILIAHFEGADVQLMPIQTFIPPKD--GVFCFAMAGTTDGEYIFGNF 370
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q + F+L + F C
Sbjct: 371 AQSNVLIGFDLDRKTVSFKATDC 393
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 112/245 (45%), Gaps = 26/245 (10%)
Query: 19 AIGCGHNNEGL--FVGAAGLLGLGGGSLSFPSQINAS--TFSYCLVDRDSD----STSTL 70
+ GC ++ G F GLLG+G G +S Q + + FSYCL + S+ S +T
Sbjct: 191 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTG 250
Query: 71 EF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
F + + ++ + +++ LT ISV G+ L +S + F G++
Sbjct: 251 YFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVV 305
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT---CYDFSSRSSVEVPTVSF 184
DSG+ ++ + + L R L G A ++ CYD S ++P +S
Sbjct: 306 FDSGSELSYIPDRALSVLSQRI----RELLLKRGAAEEESERNCYDMRSVDEGDMPAISL 361
Query: 185 HFPEGKVLPLPAKNYLIP--VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLI 242
HF +G L + + V +C AFAPT S +SIIG++ Q V ++L+ LI
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLI 420
Query: 243 GFTPN 247
G P+
Sbjct: 421 GIGPS 425
>gi|222634868|gb|EEE65000.1| hypothetical protein OsJ_19937 [Oryza sativa Japonica Group]
Length = 402
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 73/156 (46%), Gaps = 17/156 (10%)
Query: 98 GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
G GI VGG L + F GG ++DS +T+L Y ALR AF R A
Sbjct: 260 GTMGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAF-RSAMAAY 312
Query: 158 P--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
P G A DTCYDF +SV VP VS F G V+ L A ++ C AF P
Sbjct: 313 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVP 366
Query: 216 TSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
T +L IGNVQQQ V +++ +GF C
Sbjct: 367 TPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 402
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 110/255 (43%), Gaps = 50/255 (19%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCLVD----RDSDSTSTL---EFDSSLPPNAVTAPL 85
G+ G G G LS PSQ+ FS+C + R+ + TS+L + S + + P+
Sbjct: 181 GIAGFGKGILSLPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPM 240
Query: 86 LRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L++ FYY+GL G+S+G G + + ID GNGG+IVD+GT T L Y A
Sbjct: 241 LKSITNPNFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTA 300
Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV----------------EVPTVSFHFPE 188
+ LS V L++ YD R+ E+P ++FHF
Sbjct: 301 I----------LSSLASVILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLG 350
Query: 189 GKVLPLPAKN--YLI--PVDSNGTFCFAF----------APTSSSLSIIGNVQQQGTRVS 234
L LP + Y + P +S C F + +++G+ Q Q V
Sbjct: 351 DVKLTLPKDSCYYAVTAPKNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVV 410
Query: 235 FNLRNSLIGFTPNKC 249
+++ IGF P C
Sbjct: 411 YDMEAGRIGFQPKDC 425
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 117/262 (44%), Gaps = 28/262 (10%)
Query: 5 TETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
T+T +G+A+ ++A GC + N +GA+G++GLG S Q+NA+ FSYCL
Sbjct: 88 TDTFAIGTATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG 146
Query: 63 DSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
+ S L +S +A T PL+ + + Y + L GI GD++ I
Sbjct: 147 AAGKKSALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKF-GDVI--------I 197
Query: 119 DESGNGGII-VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY-----DFS 172
+ NG ++ VD+ V+ L ++A++ A A FD C+
Sbjct: 198 EPPPNGSVVLVDTIFGVSFLVDAAFHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAG 257
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
+ SS+ +P V F L +P Y+ NGT C A ++ + LSI+G +
Sbjct: 258 ANSSLPLPDVVLTFQGAAALTVPPSKYMYDA-GNGTVCLAMMSSAMLNLTTELSILGRLH 316
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ F+L + F P C
Sbjct: 317 QENIHFLFDLDKETLSFEPADC 338
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 121/271 (44%), Gaps = 28/271 (10%)
Query: 1 GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINA-- 52
G F ET+T+G A + + +GC + G A G+LGL SF S +
Sbjct: 184 GVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLF 243
Query: 53 -STFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISV 104
+ SYCLVD S+ ++ L F S + R LD FY + + GIS+
Sbjct: 244 GAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISI 303
Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGV 162
G D+L I + D + GG I+DSGT++T L Y + R L +G+
Sbjct: 304 GDDMLDIPTQVW--DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGI 361
Query: 163 ALFDTCYDFSSRSSV---EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
+ Y FSS S ++P ++FH G K+YL+ + G C F +
Sbjct: 362 PIE---YCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDA-APGVKCLGFMSAGTP 417
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +++GN+ QQ F+L S + F P+ C
Sbjct: 418 ATNVVGNIMQQNYLWEFDLMASTLSFAPSTC 448
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 110/258 (42%), Gaps = 53/258 (20%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCLVD----RDSDSTSTL---EFDSSLPPNAVTAPL 85
G+ G G G LS PSQ+ FS+C + R+ + TS+L + S + + P+
Sbjct: 181 GIAGFGKGILSLPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPM 240
Query: 86 LRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L++ FYY+GL G+S+G G + + ID GNGG+IVD+GT T L Y A
Sbjct: 241 LKSITNPNFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTA 300
Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV----------------EVPTVSFHFPE 188
+ LS V L++ YD R+ E+P ++FHF
Sbjct: 301 I----------LSSLASVILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLG 350
Query: 189 GKVLPLPAKN--YLI--PVDSNGTFCFAFAPTSSSL-------------SIIGNVQQQGT 231
L LP + Y + P +S C F + +++G+ Q Q
Sbjct: 351 DVKLTLPKDSCYYAVTAPKNSVVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNV 410
Query: 232 RVSFNLRNSLIGFTPNKC 249
V +++ IGF P C
Sbjct: 411 EVVYDMEAGRIGFQPKDC 428
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/263 (25%), Positives = 116/263 (44%), Gaps = 25/263 (9%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G T+T +G+A ++A GC ++ G +G++GLG S +Q + FSYCL
Sbjct: 139 GKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197
Query: 60 VDRDSDSTSTLEFDSSLP----PNAVTAPLL----RNHELDTFYYLGLTGISVGGDLLPI 111
D+ S L SS A + P + ++L +Y + L G+ G ++P+
Sbjct: 198 APHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPL 257
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ +++D+ + ++ L Y A++ A A V FD C+
Sbjct: 258 PPS--------GSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPK 309
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNV 226
S S P + F F G + + A NYL+ NGT C A ++ + LS++G++
Sbjct: 310 SGASGA-APDLVFTFRGGAAMTVAASNYLLDYK-NGTVCLAMLSSARLNSTTELSLLGSL 367
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
QQ+ F+L + F P C
Sbjct: 368 QQENIHFLFDLDKETLSFEPADC 390
>gi|125552155|gb|EAY97864.1| hypothetical protein OsI_19785 [Oryza sativa Indica Group]
Length = 508
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 116/272 (42%), Gaps = 31/272 (11%)
Query: 1 GDFVTETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
G+ + T G S D + GC + EG F GA+G+LGL GSLS SQ+N F
Sbjct: 191 GNLAVQNFTFGDDSEDTAVKGVVTFGCSSSTEGDF-GASGVLGLNKGSLSLVSQLNLGRF 249
Query: 56 SYCLVDR--DSDSTSTLEFDSSLPPNAVTAP-----------------LLRNHELDTFYY 96
SY +D+ + +F + +T P + + LD Y+
Sbjct: 250 SYYFAPEVNTTDNNAADDFIVFGDDDGITVPGTSGGSRPRYTPFFTTGAVSSANLD-LYF 308
Query: 97 LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
+ LTGI VGG L + ++ S VT L+ Y L+ V +
Sbjct: 309 VELTGIRVGGKDLQLGGGGGGSAGGSLEAVLSTS-VPVTYLEKNAYGLLKKELVSALGSN 367
Query: 157 SPTDGVAL-FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
+ DG AL D CY ++P ++F F V+ L NYL + G C P
Sbjct: 368 NTEDGSALGLDLCYRSQHMDRAKIPDIAFVFGGNAVMKLQQWNYLYQDEDTGLECLTILP 427
Query: 216 T---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
+ S LS+IG++ Q GT + ++L S +GF
Sbjct: 428 SPDDSDGLSLIGSMIQTGTYMIYDLHKSRLGF 459
>gi|224032957|gb|ACN35554.1| unknown [Zea mays]
Length = 144
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 69/123 (56%), Gaps = 3/123 (2%)
Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
I+DSGT +TRL T Y+AL A + ++ DTC+ + + + VP V+ F
Sbjct: 24 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAF 82
Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
G L L A+N L+ VDS T C AFAP S+ +IIGN QQQ V ++++NS IGF
Sbjct: 83 AGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAA 140
Query: 247 NKC 249
C
Sbjct: 141 GGC 143
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 105/228 (46%), Gaps = 23/228 (10%)
Query: 35 GLLGLGGGSLSFPSQ-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
G+ G G +S SQ I FS+CL D+ L + PN V +PL+++
Sbjct: 220 GIFGFGQQGMSVISQLSLQGIAPRVFSHCL-KGDNSGGGVLVLGEIVEPNIVYSPLVQSQ 278
Query: 90 ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
Y L L ISV G ++PI+ F S N G IVDSGT + L E YN F
Sbjct: 279 P---HYNLNLQSISVNGQIVPIAPAVFA--TSNNRGTIVDSGTTLAYLAEEAYN----PF 329
Query: 150 VRGTRALSPTDGVALF---DTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDS 205
V AL P ++ + CY ++ S+V++ P VS +F G L L ++YL+ +
Sbjct: 330 VNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNY 389
Query: 206 NG---TFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
G +C F S++I+G++ + ++L IG+ C
Sbjct: 390 IGEGSVWCIGFQRIPGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437
>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like, partial [Brachypodium distachyon]
Length = 364
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 118/283 (41%), Gaps = 36/283 (12%)
Query: 1 GDFVTETVTLGSASVD-NIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G T+ +GSA+ A GC ++ V +AGLLG+ G+LSF SQ FS
Sbjct: 73 GALATDVFAVGSATPSLRAAFGCMASAFDSSPDGVASAGLLGMNRGALSFVSQAGTRRFS 132
Query: 57 YCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
YC+ DRD D+ L S LP P T + L F Y + L GI VG L
Sbjct: 133 YCISDRD-DAGVLLLGHSDLPNFLPLNYTPLYQPSLPLPYFDRVAYSVQLLGILVGSKPL 191
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RAL-SPTDGV-A 163
PI + D +G G +VDSGT T L + Y AL+ F R + RAL P+
Sbjct: 192 PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYAALKAEFYRQSTPFLRALDEPSFAFQG 251
Query: 164 LFDTCYD----FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV----------DSNGTF 209
FDTC+ S +P+V+ F G + + L V D + +
Sbjct: 252 AFDTCFRVPRGMSPPPGRLLPSVTLRF-NGAEMVVGGDRLLYKVPGERRGGAGADDDAVW 310
Query: 210 CFAFAPTSS---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
C F +IG+ Q V ++L +G +C
Sbjct: 311 CLTFGNADMVPIMAYVIGHHHQMNLWVEYDLERGRVGLAQVRC 353
>gi|50878437|gb|AAT85211.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 435
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/253 (27%), Positives = 108/253 (42%), Gaps = 31/253 (12%)
Query: 22 CGHNN--EGLFVGAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDS-------DST 67
CG + +GL A G++ L + P+Q+ + F+ CL +S D+
Sbjct: 169 CGATSLTKGLGAAATGMMSLSRARFALPTQVASIFRFSRKFALCLAPAESSGVVVFGDAP 228
Query: 68 STLEFDSSLPPNAVTAPLLRNH------ELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
+ L + + PLL N + T Y++G+TGI V G +P++ T I +S
Sbjct: 229 YEFQPVMDLSKSLIYTPLLVNPVTTTGGDKSTEYFIGVTGIKVNGRAVPLNATLLAIAKS 288
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD----FSSRSSV 177
G GG + + T L+T Y A+ DAF T + VA F CYD S+R+
Sbjct: 289 GVGGTKLSMLSPYTVLETSIYKAVTDAFAAETAMIPRVPAVAPFKLCYDGTMVGSTRAGP 348
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF-----AFAPTSSSLSIIGNVQQQGTR 232
VPTV V + + +G CF AP +S +IG +
Sbjct: 349 AVPTVELVLQSKAVSWVVFGANSMVATKDGALCFGVVDGGVAPETS--VVIGGHMMEDNL 406
Query: 233 VSFNLRNSLIGFT 245
+ F+L S +GFT
Sbjct: 407 LEFDLEGSRLGFT 419
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 115/273 (42%), Gaps = 67/273 (24%)
Query: 5 TETVTLGSASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
T+ T S+S +A GC G GA+G++GLG G+LS L
Sbjct: 188 TDAFTFPSSSSVTLAFGCVSQTRISPGALTGASGIIGLGRGALS-------------LNP 234
Query: 62 RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
+DS TFYYL L G++ G + + AF + E+
Sbjct: 235 KDS-------------------------PFSTFYYLPLVGLAAGNATVALPAGAFDLREA 269
Query: 122 G----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV--ALFDTCY--- 169
GG ++DSG+ TRL + AL +RG+ +L P + C
Sbjct: 270 APKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAG 329
Query: 170 -DFSSRSSVEVPTVSFHFPEG----KVLPLPAKNYLIPVDSNGTFCFAFAPTSS------ 218
D S ++ VP++ F +G + L +PA+ Y V+++ T+C A ++S
Sbjct: 330 DDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVEAS-TWCMAVVSSASGNATLP 388
Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+IIGN QQ RV ++L N L+ F P C
Sbjct: 389 TNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 421
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 104/249 (41%), Gaps = 19/249 (7%)
Query: 13 ASVDNIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQ-----INASTFSYCLVDRD 63
AS I GC G G+LG G G LS SQ I FS+CL D
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL-KGD 261
Query: 64 SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
+ L L P+ V +PL+ + Y L L I+V G +L I+ F S
Sbjct: 262 GNGGGILVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQVLSINPAVFA--TSDK 316
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
G I+DSGT ++ L E Y+ L +A + T ++ CY + PTVS
Sbjct: 317 RGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFA-TSFISKGSQCYLVLTSIDDSFPTVS 375
Query: 184 FHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
F+F G + L YL+ D +C F ++I+G++ + V ++L
Sbjct: 376 FNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQ 435
Query: 241 LIGFTPNKC 249
IG+T C
Sbjct: 436 QIGWTNYDC 444
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 129/288 (44%), Gaps = 42/288 (14%)
Query: 1 GDFVTETVTLGS--ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
G V++T+ L A+ N A+GC + + +GL G G G+ S P+Q+ + FSYC
Sbjct: 199 GLLVSDTLRLSPRGAASRNFAVGC--SLASVHQPPSGLAGFGRGAPSVPAQLGVNKFSYC 256
Query: 59 LVDRDSDSTSTLEFDSSLPPNAV--------TAPLLRNH----ELDTFYYLGLTGISVGG 106
L+ R D + + + L ++ APLL+N +YYL LTGI+VGG
Sbjct: 257 LLSRRFDDDAAISGELVLGASSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGIAVGG 316
Query: 107 DLLPISETAF-KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTD 160
+ + A + G GG I+DSGT T L + + A V R R+ +
Sbjct: 317 KSVALPARALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKD-VE 375
Query: 161 GVALFDTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFA 214
G C+ + + ++++P +S HF G + LP +NY + + C A
Sbjct: 376 GALGLRPCFALPAGARTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVV 435
Query: 215 PTSSSLS-------------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SS S I+G+ QQQ +V ++L + +GF C
Sbjct: 436 SDVSSASGGAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPC 483
>gi|326518194|dbj|BAK07349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 435
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/258 (28%), Positives = 110/258 (42%), Gaps = 29/258 (11%)
Query: 15 VDNIAIGCGHNNEG---LFVGA-AGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSD 65
VD + GC H +G L G AG L L SF SQ+ A S FSYCL S
Sbjct: 176 VDKLTFGCAHTTDGFERLNHGVLAGALSLSRHPTSFLSQLTARRLADSRFSYCLFPGQSH 235
Query: 66 STST---LEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF--K 117
+ L F +P + T+ L + YY+G+T IS+ G + + AF +
Sbjct: 236 PNARHGFLRFGRDIPRHDHAHSTSLLFTGRGSGSMYYIGVTSISLNGKRIIGLQPAFFRR 295
Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR------GTRALSPTDGVALFDTCYDF 171
++ GG +VD GT +TRL E YN + V RA +P G L C F
Sbjct: 296 NPQTRRGGSVVDPGTPLTRLVREAYNIVEAELVAYMQTQGSRRAPAPVQGHRL---C--F 350
Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
S +P+++ + E + L ++ CF P ++++G QQ T
Sbjct: 351 VSWGHAHLPSMTINMNEDRAKLFIKPELLFLKVTHEHLCFLVVP-DEEMTVLGAAQQVDT 409
Query: 232 RVSFNLRNSLIGFTPNKC 249
R +F+L + + F C
Sbjct: 410 RFTFDLHANRLYFAQEHC 427
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 113/268 (42%), Gaps = 24/268 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E +T S S + +GC + G+LG+ G LSF SQ + FSYC+
Sbjct: 170 GNLVREKITFSTSQSTPPLILGCAEDASD----DKGILGMNLGRLSFASQAKITKFSYCV 225
Query: 60 VDRDSDS--TSTLEFDSSLPPNAVTAPLL---------RNHELDTFYY-LGLTGISVGGD 107
R T T F PN+ + R LD + + L GI +G
Sbjct: 226 PTRQVRPGFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNK 285
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
L I +AF+ D SG G ++DSG+ T L YN +R+ VR G R +
Sbjct: 286 KLNIPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVS 345
Query: 166 DTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
D C+D ++ + + F F +G + + L V G C + ++ +
Sbjct: 346 DMCFDGNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADV-GGGVHCVGIGRSEMLGAASN 404
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN QQ V F++ N +GF C
Sbjct: 405 IIGNFHQQNLWVEFDIANRRVGFGKADC 432
>gi|222632517|gb|EEE64649.1| hypothetical protein OsJ_19503 [Oryza sativa Japonica Group]
Length = 505
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 84/181 (46%), Gaps = 9/181 (4%)
Query: 74 SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTA 133
SS P PLL + + FY + + +SV G L I + D NGG I+DSGT+
Sbjct: 327 SSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDGVALDIPAEVW--DVGSNGGTIIDSGTS 384
Query: 134 VTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR----SSVEVPTVSFHFPEG 189
+T L T Y A+ A L P + FD CY++++R + VP ++ F
Sbjct: 385 LTVLATPAYKAVVAALSEQLAGL-PRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGS 443
Query: 190 KVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNK 248
L PAK+Y+I + G C + +S+IGN+ QQ F+L N + F
Sbjct: 444 ARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTS 502
Query: 249 C 249
C
Sbjct: 503 C 503
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 114/268 (42%), Gaps = 24/268 (8%)
Query: 1 GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E T S+ + + +GC ++ G+LG+ G LSF S S FSYC+
Sbjct: 168 GNLVREKFTFSSSQTTPPLILGCATDSSD----TQGILGMNLGRLSFSSLAKISKFSYCV 223
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPL-----------LRNHELDTFYY-LGLTGISVGGD 107
R S S S+ L PN +A R LD Y L + GI + G
Sbjct: 224 PPRRSQSGSSPTGSFYLGPNPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGK 283
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
L IS +AF+ D SG G ++DSGT T L E Y+ +++ V+ G +
Sbjct: 284 KLNISTSAFRADPSGAGQTLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGSL 343
Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
D C+D + + ++F F G + + + L V G C + + +
Sbjct: 344 DMCFDGDAMVIGRMIGNMAFEFENGVEIVVEREKMLADV-GGGVQCLGIGRSDLLGVASN 402
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIGN QQ V F+L +GF C
Sbjct: 403 IIGNFHQQDLWVEFDLVGRRVGFGRTDC 430
>gi|297740191|emb|CBI30373.3| unnamed protein product [Vitis vinifera]
Length = 218
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 87/221 (39%), Gaps = 24/221 (10%)
Query: 50 INASTFSYCLVDRDSDSTST-----LEFDSSLPPNAVTAPLLRNHELDTFYY-LGLTGIS 103
+ F+YCL D D T L++ P L++ FYY LG+ I
Sbjct: 1 MGVKKFAYCLNSHDYDDTRNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIK 60
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTE-----TYNALRDAFVRGTRALSP 158
+G LL I G G+I+DSG T N L+ + R+L
Sbjct: 61 IGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEA 120
Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL----------IPVDSNGT 208
L CY+F+ S+++P + + F G + +P KNY +D+NGT
Sbjct: 121 ETQTGL-TPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGT 179
Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
P S I+GN Q V ++L+N GF C
Sbjct: 180 NALEITPDPS--IILGNSQHVDYYVEYDLKNDRFGFRRQTC 218
>gi|356551755|ref|XP_003544239.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 249
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 88/196 (44%), Gaps = 17/196 (8%)
Query: 57 YCLVDRDSDSTS-TLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
YCL S S +L+ + P + T PLLRN + + YY+ LTGI+VG + +
Sbjct: 67 YCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLRNPQRPSLYYVNLTGINVGRVRVSLPTD 126
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
D + G I+DSGT +TR YNA+RD F + G T + +
Sbjct: 127 YLAFDPNKGSGTIIDSGTVITRFVXPVYNAIRDEFRYQVK------GPCFVKTYENLA-- 178
Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIGNVQQQGTRV 233
P + F G + LP +N LI G C A A +++ S + N QQQ RV
Sbjct: 179 -----PLIKLRF-TGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSALTNFQQQNLRV 232
Query: 234 SFNLRNSLIGFTPNKC 249
F+ N+ +G C
Sbjct: 233 LFDTVNNRVGIARELC 248
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 115/268 (42%), Gaps = 32/268 (11%)
Query: 1 GDFVTETVTLGSASVDN----------IAIGCGHNNEGLFV-GAA--GLLGLGGGSLSFP 47
G V + + L S S D+ + +GCG G ++ GAA G++GLG GS+S P
Sbjct: 199 GFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVP 258
Query: 48 SQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
S + + +FS C D + + T+ F + + PLL Y + +
Sbjct: 259 SLLAKAGLIRKSFSLCF---DVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESY 315
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
VG L ++ FK +VDSG + T L + YN + F + A +
Sbjct: 316 CVGNSCL--KQSGFKA--------LVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQG 365
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN-GTFCFAFAPTSSSLS 221
++ CY+ SS+ VP + F + L + Y +P + FC PT +
Sbjct: 366 GPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNYG 425
Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IIG G RV F++ N +G++ + C
Sbjct: 426 IIGQNYMTGYRVVFDMENLKLGWSSSNC 453
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 107/263 (40%), Gaps = 24/263 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
G T+TVT+ S S + IGCG NN G +GL G LS +Q+
Sbjct: 454 GTLATDTVTIHSTSGEPFVMAETIIGCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYP 513
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
SYC + TS + F ++ V+ + FYYL L +SVG +
Sbjct: 514 GLMSYCFA---GNGTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRI 570
Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
T F E G I++DSGT +T N +R A A+ D CY
Sbjct: 571 ETLGTPFHALE---GNIVIDSGTTLTYFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCY 627
Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA---PTSSSLSIIGNV 226
+S+ + + P ++ HF G L L N + S G FC A PT +I GN
Sbjct: 628 -YSNTTEI-FPVITMHFSGGADLVLDKYNMFMESYSGGLFCLAIICNNPTQE--AIFGNR 683
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q V ++ + L+ F P C
Sbjct: 684 AQNNFLVGYDSSSLLVSFKPTNC 706
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 103/249 (41%), Gaps = 44/249 (17%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS 53
G TETVT+ S S + IGC NN G ++G++GL GSLS SQ+ +
Sbjct: 141 GTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGGA 200
Query: 54 TFSYCLVDRDSDSTSTLEFDSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
P + V+ + YYL L +SVG +
Sbjct: 201 ----------------------YPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETV 238
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDGVALFDTCY 169
T F + NG I++DSGT +T N +R A R R + P+ L CY
Sbjct: 239 GTPF---HALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDML---CY 292
Query: 170 DFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQ 227
+++E+ P ++ HF G L L N + ++ G FC A + + ++I GN
Sbjct: 293 ---YSNTIEIFPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICNNPTQVAIFGNRA 349
Query: 228 QQGTRVSFN 236
Q V ++
Sbjct: 350 QNNFLVGYD 358
>gi|22165127|gb|AAM93743.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
Length = 265
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 115/262 (43%), Gaps = 28/262 (10%)
Query: 5 TETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
T+T +G+A+ ++A GC + N +GA+G++GLG S Q+NA+ FSYCL
Sbjct: 11 TDTFAIGTATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG 69
Query: 63 DSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
+ S L +S +A T PL+ + + Y + L GI GD++ I
Sbjct: 70 AAGKKSALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKF-GDVI--------I 120
Query: 119 DESGNGGII-VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY-----DFS 172
NG ++ VD+ V+ L + A++ A A FD C+
Sbjct: 121 APPPNGSVVLVDTIFGVSFLVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAG 180
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
+ SS+ +P V F L +P Y+ NGT C A ++ + LSI+G +
Sbjct: 181 ANSSLPLPDVVLTFQGAAALTVPPSKYMYDA-GNGTVCLAMMSSAMLNLTTELSILGRLH 239
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ F+L + F P C
Sbjct: 240 QENIHFLFDLDKETLSFEPADC 261
>gi|449432735|ref|XP_004134154.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
gi|449527085|ref|XP_004170543.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 435
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/268 (27%), Positives = 110/268 (41%), Gaps = 34/268 (12%)
Query: 12 SASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDS 64
+ S+ N CG EGL G G+ G G +S PSQ A+ F+ CL S
Sbjct: 151 AVSIPNFLFVCGSTFLLEGLAPGVTGMAGFGRNGISLPSQFAAAFSFNRKFAVCLSGSTS 210
Query: 65 ------------------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
D T++ + TA + E T Y++G+T I V
Sbjct: 211 SPGVIFSGNGPYHFLPNIDLTNSFTYTPLFINPVSTAGVSSAGEKSTEYFIGVTSIVVNS 270
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+P++ T KID +GNGG + + T L++ Y AL AF + VA F+
Sbjct: 271 KPVPLNTTLLKIDSNGNGGTKISTVNPFTVLESSIYKALVKAFTTEVSKVPRVGAVAPFE 330
Query: 167 TCYDF----SSRSSVEVPTVSFHFPEGKVL-PLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
CY S+R VPT+ KV+ + N ++ V+ + C F +
Sbjct: 331 VCYSSKSFPSTRLGAGVPTIDLVLQNKKVIWSMFGANSMVQVN-DEVLCLGFVDGGVDVR 389
Query: 222 ---IIGNVQQQGTRVSFNLRNSLIGFTP 246
+IG Q + + F+L S +GFTP
Sbjct: 390 TAIVIGAHQIEDKLLEFDLATSRLGFTP 417
>gi|297811183|ref|XP_002873475.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
lyrata]
gi|297319312|gb|EFH49734.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
lyrata]
Length = 292
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 71/248 (28%), Positives = 101/248 (40%), Gaps = 50/248 (20%)
Query: 6 ETVTLGSASV-DNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
E TL S+ D + GCG NN G + G AGLLG G L+F S
Sbjct: 90 EKFTLMSSDFFDGVNFGCGENNTGDYYEGVAGLLGNTSGHLTFGS--------------- 134
Query: 64 SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
+ + + P+ + D FYYL + GI+V L I
Sbjct: 135 ----------TGISKSVKFTPVSSSPSKD-FYYLNIEGITVCDKQLEIPS---------- 173
Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-GVALFDTCYDFSSRSSVEVPTV 182
++S T Y AL+ AF + T G + DTCYDF+ +V + +
Sbjct: 174 ----IESSTP------RAYAALKSAFKEKMSKYTITSSGDSELDTCYDFTGLKTVTITKI 223
Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSL 241
+F F G V+ L K L C AFA +++I G+VQQQ +V ++
Sbjct: 224 AFSFSGGTVVELDPKGILYSSSERSKLCLAFAEYPDDNVAIFGSVQQQTLQVVYDGVGGR 283
Query: 242 IGFTPNKC 249
+GF PN C
Sbjct: 284 VGFAPNGC 291
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 115/262 (43%), Gaps = 28/262 (10%)
Query: 5 TETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
T+T +G+A+ ++A GC + N +GA+G++GLG S Q+NA+ FSYCL
Sbjct: 119 TDTFAIGTATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG 177
Query: 63 DSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
+ S L +S +A T PL+ + + Y + L GI GD++ I
Sbjct: 178 AAGKKSALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKF-GDVI--------I 228
Query: 119 DESGNGGII-VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-----FS 172
NG ++ VD+ V+ L + A++ A A FD C+
Sbjct: 229 APPPNGSVVLVDTIFGVSFLVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAG 288
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
+ SS+ +P V F L +P Y+ NGT C A ++ + LSI+G +
Sbjct: 289 ANSSLPLPDVVLTFQGAAALTVPPSKYMYDA-GNGTVCLAMMSSAMLNLTTELSILGRLH 347
Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
Q+ F+L + F P C
Sbjct: 348 QENIHFLFDLDKETLSFEPADC 369
>gi|125572775|gb|EAZ14290.1| hypothetical protein OsJ_04214 [Oryza sativa Japonica Group]
Length = 465
Score = 76.6 bits (187), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 74/149 (49%), Gaps = 5/149 (3%)
Query: 5 TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
TE T G +D + GCG N G F G +G++GLG G+LS SQ+ FSY DS
Sbjct: 149 TEAFTFGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDS 208
Query: 65 -DSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-D 119
D+ S + F P + ++ LL + + YY+ L GI V G L I F + +
Sbjct: 209 VDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRN 268
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDA 148
+ G+GG+ + VT L+ Y LR A
Sbjct: 269 KDGSGGVFLSITDLVTVLEEAAYKPLRQA 297
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 76.6 bits (187), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 119/265 (44%), Gaps = 24/265 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCGHNN-----EGLFVGAAGLLGLGGGSLSFPSQI 50
G TETVT+ S S + IGCG +N G ++G++GL G LS SQ+
Sbjct: 495 GILATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQM 554
Query: 51 N---ASTFSYCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
+ SYC + TS + F ++ + + A + + + FYYL L +SV
Sbjct: 555 DLPYPGLISYCFSGQ---GTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVE 611
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
+L+ T F ++ G I +DSGT +T N +R+A + A+ D +
Sbjct: 612 DNLIATLGTPFHAED---GNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDN 668
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIG 224
CY +S + P ++ HF G L L N + + G FC A S+ ++ G
Sbjct: 669 LLCY-YSDTIDI-FPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFG 726
Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
N Q V ++ +++I F+P C
Sbjct: 727 NRAQNNFLVGYDPSSNVISFSPTNC 751
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 110/252 (43%), Gaps = 24/252 (9%)
Query: 1 GDFVTETVTLGSAS-----VDNIAIGCG-HN----NEGLFVGAAGLLGLGGGSLSFPSQI 50
G TETVT+ S S + IGCG HN N G ++G++GL G S SQ+
Sbjct: 156 GILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQM 215
Query: 51 N---ASTFSYCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
+ SYC + TS + F ++ + + A + + + FYYL L +SV
Sbjct: 216 DLPYPGLISYCFSGQ---GTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVE 272
Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
+ + T F ++ G I++DSG+ VT N +R A + A+ D
Sbjct: 273 DNRIETLGTPFHAED---GNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGND 329
Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIG 224
CY FS + P ++ HF G L L N + +S G FC A S + +I G
Sbjct: 330 MLCY-FSETIDI-FPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFG 387
Query: 225 NVQQQGTRVSFN 236
N Q V ++
Sbjct: 388 NRAQNNFLVGYD 399
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 120/275 (43%), Gaps = 32/275 (11%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-------AAGLLGLGGGSLSFPS 48
G+ ET T S ++ +I+ GC ++ + +G+LG+G G SF +
Sbjct: 177 GNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLA 236
Query: 49 Q---INASTFSYCLVDRDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q I+ FSYC+ ++ +T L F + N T +++ + Y++ L GIS
Sbjct: 237 QLGSISHGKFSYCITANNTHNT-YLRFGKHVVKSKNLQTTKIMQV-KPSAAYHVNLLGIS 294
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
V G L I++T + + G+ G I+D+GT T L ++ L A + LS +
Sbjct: 295 VNGVKLNITKTDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTAL---SNHLSSNQNLK 351
Query: 164 LF-------DTCYD-FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-PVDSNGTFCFAFA 214
+ D CY+ S +P V+FH + P +L + FC +
Sbjct: 352 RWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSML 411
Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ S +IIG QQ + ++ + ++ F P C
Sbjct: 412 -SDDSKTIIGAYQQMKQKFVYDTKARVLSFGPEDC 445
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 116/267 (43%), Gaps = 22/267 (8%)
Query: 1 GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
G F + +T+ S +V + C + G L L S PS++ S FS
Sbjct: 230 GTFSQDVLTVAPSVAVQDFTFVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFS 289
Query: 57 YCLVDR-DSDSTSTLEFDSSLPPNAVTA--PLLRNHELD--TFYYLGLTGISVGGDLLPI 111
YC+ DS +L D+++ + TA PLL + + D Y++ + G+S+G LPI
Sbjct: 290 YCMPQYPDSPGFLSLGDDATVRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPI 349
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT-DGVALFDTCYD 170
F N IV++GT T L + Y LRDAF + + + G FDTCY+
Sbjct: 350 PSGTF----GNNASTIVEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYN 405
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYL-IPVDSNGTF---CFAFAPTSSSL----SI 222
F+ + VP V F F G L + L + S G F C AF+ ++
Sbjct: 406 FTGLQELTVPLVEFKFGNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAV 465
Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
IG T V +++ +GF P C
Sbjct: 466 IGAYSLATTEVVYDVAGGTVGFIPESC 492
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/253 (27%), Positives = 106/253 (41%), Gaps = 38/253 (15%)
Query: 35 GLLGLGGGSLSFPSQINAST--FSYCLV----DRDSDSTSTLEFD----SSLPPNAVTAP 84
G+ G G G LS P Q+ S FS+C + + + +S L SS N P
Sbjct: 179 GIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTP 238
Query: 85 LLRNHELDTFYYLGLTGISVG-GD---LLPISETAFKIDESGNGGIIVDSGTAVTRLQTE 140
LL++ +YY+GL I++G GD +S +ID GNGG+++DSGT T L
Sbjct: 239 LLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEP 298
Query: 141 TYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSRSS-------VEVPTVSFHFPEGKV 191
Y+ L V G + FD CY +++ ++P+++FHF
Sbjct: 299 LYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVS 358
Query: 192 LPLPAKNYLI----PVDSNGTFCFAFAPTSSSLS-----------IIGNVQQQGTRVSFN 236
+ LP N P++S C + I G+ QQQ V ++
Sbjct: 359 VVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYD 418
Query: 237 LRNSLIGFTPNKC 249
L +GF P C
Sbjct: 419 LEKERLGFQPMDC 431
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 103/247 (41%), Gaps = 20/247 (8%)
Query: 18 IAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL 76
IA GCGH N E L G+LGLG S Q+ S FSYC+ D + + +
Sbjct: 179 IAFGCGHENGEQLESEFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGE 237
Query: 77 PPNAVTAPLLRNHELDT-FYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
+ + P E + YY+ L GISVG L I FK G+I+D+GT T
Sbjct: 238 DADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSRTGVILDTGTLYT 296
Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV---PTVSFHFPEGKVL 192
L Y R+ + L P F + R + E+ P V+FHF G L
Sbjct: 297 WLADIAY---RELYNEIKSILDPKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAEL 353
Query: 193 PLPAKNYLIPVDSNGT----FCFAFAPTSSS------LSIIGNVQQQGTRVSFNLRNSLI 242
+ A + P+ + T FC + PT+ + IG + QQ ++++L+ I
Sbjct: 354 AMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNI 413
Query: 243 GFTPNKC 249
C
Sbjct: 414 YLQRIDC 420
>gi|222623568|gb|EEE57700.1| hypothetical protein OsJ_08178 [Oryza sativa Japonica Group]
Length = 441
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 63/116 (54%), Gaps = 3/116 (2%)
Query: 134 VTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLP 193
+TRL T Y+AL A + S ++ DTC+ + S V P V+ F G L
Sbjct: 328 ITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQA-SRVSAPAVTMSFAGGAALK 386
Query: 194 LPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
L A+N L+ VD + T C AFAP S+ +IIGN QQQ V +++++S IGF C
Sbjct: 387 LSAQNLLVDVD-DSTTCLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 440
>gi|224127969|ref|XP_002329222.1| predicted protein [Populus trichocarpa]
gi|222871003|gb|EEF08134.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 125/289 (43%), Gaps = 47/289 (16%)
Query: 2 DFVTETVTLGS-ASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS----- 53
D++ TLGS +S+DN C +GL G GL LG +LS P QIN +
Sbjct: 139 DYLALLNTLGSLSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSP 198
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPN----------AVTAPLLRN----------HELD 92
F+ CL S L F S P N + PL+ N H L
Sbjct: 199 NCFAMCLSGSISQPGVAL-FGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDAVPNTHTLS 257
Query: 93 TFYYLGLTGISVGGDLLPISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
YY+GLT I V G ++ ++T ID +SG+GG + + T+LQ+ Y A AF+R
Sbjct: 258 PEYYVGLTAIKVNGKMVAFNKTLLAIDGQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLR 317
Query: 152 GTRA----LSPTDGVALFDTCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIP 202
+ L+ T V F CY + + + VP + V+ + N ++
Sbjct: 318 EAASSAFNLTTTKPVKPFSVCYPAGAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMVR 377
Query: 203 V--DSNGTFCFAF----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
V S +C F A S+ +IG +Q + + F+L++ +GF+
Sbjct: 378 VTKKSVDVWCLGFVDGGAIDGPSI-MIGGLQLEDNLLQFDLQSKKLGFS 425
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/252 (31%), Positives = 115/252 (45%), Gaps = 31/252 (12%)
Query: 15 VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----FSYCLVD-RDSDSTS 68
+ + GC G F A GL+GLGGG +S SQ+ A+T FSYCL +++++S
Sbjct: 238 IAKLDFGCSTTTTGTF-RADGLVGLGGGPVSLASQLGATTSLGRKFSYCLAPYANTNASS 296
Query: 69 TLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
L F S P A + PL+ E++T+Y + L I+V G P + +
Sbjct: 297 ALNFGSRAVVSEPGAASTPLITG-EVETYYTIALDSINVAGTKRPTT--------AAQAH 347
Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVALFDTCYDFS---SRSSVEV 179
IIVDSGT +T L + L R RA SP + D CYD S ++ +
Sbjct: 348 IIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEK---ILDLCYDISGVRGEDALGI 404
Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNL 237
P V+ G + L N + V G C A TS S+SI+GN+ QQ V ++L
Sbjct: 405 PDVTLVLGGGGEVTLKPDNTFVVVQ-EGVLCLALVATSERQSVSILGNIAQQNLHVGYDL 463
Query: 238 RNSLIGFTPNKC 249
+ F C
Sbjct: 464 EKGTVTFAAADC 475
>gi|242078855|ref|XP_002444196.1| hypothetical protein SORBIDRAFT_07g014645 [Sorghum bicolor]
gi|241940546|gb|EES13691.1| hypothetical protein SORBIDRAFT_07g014645 [Sorghum bicolor]
Length = 100
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 56/78 (71%), Gaps = 5/78 (6%)
Query: 77 PPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDES-GNGGIIVDSG 131
PP+A A P++RN ++TFYY+ L GIS+GG +P ++E+ ++ S G GG+IVDSG
Sbjct: 21 PPSASAASFTPMVRNPRMETFYYVQLVGISLGGARVPGVAESDLRLAPSTGRGGVIVDSG 80
Query: 132 TAVTRLQTETYNALRDAF 149
T+VTRL +Y+AL DAF
Sbjct: 81 TSVTRLARRSYSALHDAF 98
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 111/249 (44%), Gaps = 35/249 (14%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCL-----VDRDSDSTSTLEFDSSLPPNAVTA---- 83
G+ G G G LS PSQ+ FS+C V+ + S+ + S+L N +
Sbjct: 231 GIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFT 290
Query: 84 PLLRNHELDTFYYLGLTGISVGGDLLP--ISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
P+L YY+GL I++G ++ P + T + D GNGG++VDSGT T L
Sbjct: 291 PMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 350
Query: 142 YNALRDAF---VRGTRALSPTDGVALFDTCYDF----SSRSSVE------VPTVSFHFPE 188
Y+ L + RA + T+ FD CY ++ +S+E P+++FHF
Sbjct: 351 YSQLLTTLQSTITYPRA-TETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLN 409
Query: 189 GKVLPLPAKN--YLIPVDSNGTF--CFAFAPTSS----SLSIIGNVQQQGTRVSFNLRNS 240
L LP N Y + S+G+ C F + G+ QQQ +V ++L
Sbjct: 410 NATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKE 469
Query: 241 LIGFTPNKC 249
IGF C
Sbjct: 470 RIGFQAMDC 478
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 115/277 (41%), Gaps = 31/277 (11%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
G+ ++T GS+ I GC ++ N GL+G+ GSLS SQ+ FS
Sbjct: 158 GNLASDTFGFGSSFNPGIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKFS 217
Query: 57 YCLVDRDSDST-----STLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
YC+ D S + SL P ++ PL + Y + L GI + L
Sbjct: 218 YCISGSDFSGILLLGESNFSWGGSLNYTPLVQISTPLPYFDR--SAYTVRLEGIKISDKL 275
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
L IS F D +G G + D GT + L YNALRD F+ T RAL + V
Sbjct: 276 LNISGNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQ 335
Query: 163 ALFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAP 215
D CY S E+P+VS F EG + + L V ++ +CF F
Sbjct: 336 IAMDLCYRVPVNQSELPELPSVSLVF-EGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGN 394
Query: 216 TS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ IIG+ QQ + F+L +G +C
Sbjct: 395 SDLLGVEAFIIGHHHQQSMWMEFDLVEHRVGLAHARC 431
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 111/259 (42%), Gaps = 46/259 (17%)
Query: 1 GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-----VGAAGLLGLGGGSLSFPSQI 50
G +ET T+GS AS +A GCGH+N G F G + S++
Sbjct: 83 GYLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKV 142
Query: 51 NASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
FSYCLV SDST++ + + G + + G
Sbjct: 143 GGQ-FSYCLVPLSSDSTASSKIN-----------------------FGKSAVVSGSG--- 175
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
+ + +ES II+DSGT +T L + Y + A + + TD F CY
Sbjct: 176 -TSSPAAAEESN---IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY- 230
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
S +E+PT++ HF G + LP N + + CF+ P SS+L+I GN+ Q
Sbjct: 231 -SGVKKLEIPTITAHF-IGADVQLPPLNTFVQAQED-LVCFSMIP-SSNLAIFGNLSQMN 286
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V ++L+N+ + F P C
Sbjct: 287 FLVGYDLKNNKVSFKPTDC 305
>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 336
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 109/259 (42%), Gaps = 30/259 (11%)
Query: 4 VTETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC---L 59
V ET G + + ++ + CGHN G G+ GL G S ++I FSYC L
Sbjct: 92 VFETTDEGHSQIFDVLVRCGHNIGFNTDPGYNGIRGLNNGPNSLATKI-GQKFSYCVGNL 150
Query: 60 VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
D + + + + + P +H FYY+ L GI VG L I+ F+I
Sbjct: 151 ADPYYNYNQLILCEGA-DLEGYSTPFEVHH---GFYYVTLKGIIVGEKRLDIAPITFEIK 206
Query: 120 ESGNGGIIVDSGTAVTRL----QTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
+ GG+I DSGT +T L YN +R+ R L Y SR
Sbjct: 207 GNNTGGVIRDSGTTITYLVDSVHKLLYNEVRNLLSWSFRQLCH----------YGIISRD 256
Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQG 230
V P V+FHF +G L L ++ + N C +P T+ S S+I + QQ
Sbjct: 257 LVGFPVVTFHFADGADLALDTGSFFNQL--NSILCMTVSPASILNTTISPSVIELLAQQS 314
Query: 231 TRVSFNLRNSLIGFTPNKC 249
V ++L + + F C
Sbjct: 315 YNVGYDLLTNFVYFQRIDC 333
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 112/251 (44%), Gaps = 28/251 (11%)
Query: 14 SVDNIAIGCGHNNEGL--FVGAAGLLGLGGGSLSFPSQINAS--TFSYCLVDRDSD---- 65
+ + GC ++ G F GLLG+G G +S Q + FSYCL + S+
Sbjct: 186 KIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFF 245
Query: 66 STSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
S +T F + + ++ + +++ L ISV G+ L +S + F
Sbjct: 246 SKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----S 300
Query: 123 NGGIIVDSGTAVTRLQTETYNAL----RDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
G++ DSG+ ++ + + L R+ +R A ++ CYD S +
Sbjct: 301 RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESE-----RNCYDMRSVDEGD 355
Query: 179 VPTVSFHFPEGKVLPLPAKNYLIP--VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
+P +S HF +G L + + V +C AFAPT S +SIIG++ Q V ++
Sbjct: 356 MPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYD 414
Query: 237 LRNSLIGFTPN 247
L+ LIG P+
Sbjct: 415 LKRQLIGIGPS 425
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 118/263 (44%), Gaps = 18/263 (6%)
Query: 5 TETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFSYCL 59
T TV+ G A + + +GC G V A G+L LG G +SF ++ FS+CL
Sbjct: 244 TVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQRFSFCL 303
Query: 60 VDRDS--DSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
+ +S D++S L F + + P + +L N ++ Y +TG+ VGG+ L I +
Sbjct: 304 LSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERLDIPDE 363
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-- 172
+ + GG+I+D+ T+VT L E Y + A R L + F+ CY ++
Sbjct: 364 VWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCYKWTFT 423
Query: 173 -----SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNV 226
+V +P+ + G L AK+ ++P G C AF I+GNV
Sbjct: 424 GDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPGILGNV 483
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q + + I F +KC
Sbjct: 484 FMQEYIWEIDHGDGKIRFRKDKC 506
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 119/266 (44%), Gaps = 18/266 (6%)
Query: 2 DFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFS 56
+ T TV+ G A + + +GC G V A G+L LG G +SF ++ FS
Sbjct: 244 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQRFS 303
Query: 57 YCLVDRDS--DSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
+CL+ +S D++S L F + + P + +L N ++ Y +TG+ VGG+ L I
Sbjct: 304 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGGERLDI 363
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ + + GG+I+D+ T+VT L E Y + A R L + F+ CY +
Sbjct: 364 PDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCYKW 423
Query: 172 S-------SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
+ +V +P+ + G L AK+ ++P G C AF I+
Sbjct: 424 TFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPGIL 483
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNV Q + + I F +KC
Sbjct: 484 GNVFMQEYIWEIDHGDGKIRFRKDKC 509
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 99/224 (44%), Gaps = 16/224 (7%)
Query: 35 GLLGLGGGSLSFPSQ-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
G+ G G LS SQ I FS+CL + D L L PN + +PL+ +
Sbjct: 229 GIFGFGQQDLSVVSQLSSLGITPKVFSHCL-KGEGDGGGKLVLGEILEPNIIYSPLVPSQ 287
Query: 90 ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
+ Y L L ISV G LLPI F S N G IVDSGT +T L Y+ A
Sbjct: 288 ---SHYNLNLQSISVNGQLLPIDPAVFA--TSNNQGTIVDSGTTLTYLVETAYDPFVSA- 341
Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSN 206
+ T + S T ++ + CY S+ P VS +F G + L YL+ + D
Sbjct: 342 ITATVSSSTTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGA 401
Query: 207 GTFCFAFAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+C F + ++I+G++ + ++L + IG+ C
Sbjct: 402 AMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDC 445
>gi|413950928|gb|AFW83577.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 163
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 83/165 (50%), Gaps = 10/165 (6%)
Query: 91 LDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
+ FY + + G+SV G+LL I + + + GG I+DSGT++T L + Y A+ A
Sbjct: 1 MRPFYAVAVNGVSVDGELLRIPRLVWDVQK--GGGAILDSGTSLTVLVSPAYRAVVAALG 58
Query: 151 RGTRALSPTDGVALFDTCYDFSS-----RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS 205
+ L P + FD CY+++S +V VP ++ HF L P K+Y+I +
Sbjct: 59 KKLVGL-PRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA-A 116
Query: 206 NGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
G C +S+IGN+ QQ F+L+N + F ++C
Sbjct: 117 PGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 161
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 108/260 (41%), Gaps = 22/260 (8%)
Query: 10 LGSASVDNIAI--GCGHNNEGL-----FVGAAGLLGLGGGSLSFPSQINAST---FSYCL 59
L SA D I GC +N+ G++GL +S Q+N T FSYCL
Sbjct: 186 LQSAENDRIPFYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCL 245
Query: 60 ----VDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
+ S +TS L F + + + ++ P + + Y+L L +SV G+ + I
Sbjct: 246 NLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPN-YFLNLIDVSVAGNRMQIP 304
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYD 170
F + G GG I+DSGTAVT + Y + AF + CY
Sbjct: 305 PGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYK 364
Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQ 229
+ P+++FHF P YL V G FC A P S +IIG + Q
Sbjct: 365 QQGHTFHNYPSMAFHFQGADFFVEPEYVYLT-VQDRGAFCVALQPISPQQRTIIGALNQA 423
Query: 230 GTRVSFNLRNSLIGFTPNKC 249
T+ ++ N + FTP C
Sbjct: 424 NTQFIYDAANRQLLFTPENC 443
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 108/256 (42%), Gaps = 44/256 (17%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCLVD----RDSDSTSTLEF------DSSLPPNAVT 82
G+ G G G+LS PSQ+ FS+C + R+ + TS L +S V
Sbjct: 183 GIAGFGRGALSLPSQLGFLGKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVF 242
Query: 83 APLLRNHELDTFYYLGLTGISVG----GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
P+L + FYY+GL G+ +G G + + ID GNGG++VD+GT T+L
Sbjct: 243 TPMLTSATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLP 302
Query: 139 TETYNALRDAFVRG------TRALSPTDGVALFDTCYDF-SSRSSV---EVPTVSFHFPE 188
Y ++ + + +R L G FD C+ +R+ E+P ++ H
Sbjct: 303 DPFYASVLASLISAAPPYERSRDLEARTG---FDLCFKVPCARAPCADDELPPITLHLAG 359
Query: 189 GKVLPLPAKNYLIPV----DSNGTFCFAF-----------APTSSSLSIIGNVQQQGTRV 233
G L LP + PV DS C F +++G+ Q Q V
Sbjct: 360 GARLALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEV 419
Query: 234 SFNLRNSLIGFTPNKC 249
++L +GF P C
Sbjct: 420 VYDLAAGRVGFRPRDC 435
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 100/241 (41%), Gaps = 22/241 (9%)
Query: 18 IAIGCGHNNEGLF---VGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTST 69
+ IGCG G + V GL+GLG +S PS + + +FS C D D +
Sbjct: 235 VVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPSFLAKAGLIRNSFSMCF---DEDDSGR 291
Query: 70 LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
+ F P + P L T Y +G+ G VG L +T+F+ +VD
Sbjct: 292 IFFGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGSSCL--KQTSFRA--------LVD 341
Query: 130 SGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEG 189
+GT+ T L Y + + F R A + + CY SS +VP+V FP
Sbjct: 342 TGTSFTFLPNGVYERITEEFDRQVNATISSFNGYPWKYCYKSSSNHLTKVPSVKLIFPLN 401
Query: 190 KVLPLPAKNYLIP-VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNK 248
+ ++I + FC A PT + IG G RV F+ N +G++ +
Sbjct: 402 NSFVIHNPVFMIYGIQGITGFCLAIQPTEGDIGTIGQNFMAGYRVVFDRENMKLGWSHSS 461
Query: 249 C 249
C
Sbjct: 462 C 462
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 67/121 (55%), Gaps = 10/121 (8%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
G+ E ++ G SV N GCG NN+GLF G +GL+GLG +LS SQ N++ FSY
Sbjct: 237 GELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSY 296
Query: 58 CLVDRDSDSTSTLEFDS------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
CL D+ ++ +L + +L P A T ++ N +L FY L LTGI VG L +
Sbjct: 297 CLPPTDAGASGSLAMGNESSVFKNLTPIAYTR-MVPNPQLSNFYMLNLTGIDVGVWLFKL 355
Query: 112 S 112
Sbjct: 356 Q 356
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 103/242 (42%), Gaps = 28/242 (11%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PL 85
G+ G G G+LS SQ+ FS+C + + + + A+T+ P+
Sbjct: 234 GIAGFGRGTLSMVSQLGFLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPM 293
Query: 86 LRNHELDTFYYLGLTGISVGG-DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
L + FYY+GL I+VG + + + D GNGG+ +DSGT T L Y+
Sbjct: 294 LNSPMYPNFYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQ 353
Query: 145 LRDAFVRGTRALSPTDGVAL---FDTCYDFSS------RSSVEVPTVSFHFPEGKVLPLP 195
+ + ++ T G+ + FD CY S +P+++FHF L LP
Sbjct: 354 VL-SILQSTINYPRDTGMEMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLP 412
Query: 196 AKNYLIPVDSNG----TFCFAFAPTSS----SLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
N+ PV + G C F T + G+ QQQ V ++L IGF P
Sbjct: 413 QGNHFYPVSAPGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPM 472
Query: 248 KC 249
C
Sbjct: 473 DC 474
>gi|302797823|ref|XP_002980672.1| hypothetical protein SELMODRAFT_113025 [Selaginella moellendorffii]
gi|300151678|gb|EFJ18323.1| hypothetical protein SELMODRAFT_113025 [Selaginella moellendorffii]
Length = 152
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 70/149 (46%), Gaps = 10/149 (6%)
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF-DTCY 169
I +AFKID GNGG DSGT V+ L + AL +AF R L+ T G + CY
Sbjct: 1 IPRSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTNELCY 60
Query: 170 DFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF----APTSSSL 220
D ++ S P V+ HF + L + +P+ T C AF A +
Sbjct: 61 DVAAGYSRLPRAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGV 120
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++IGN QQQ + +L S IGF P C
Sbjct: 121 NVIGNYQQQDYLIEHDLERSRIGFAPANC 149
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 104/256 (40%), Gaps = 24/256 (9%)
Query: 12 SASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
S S +AIGC + F + G+ GLG + S P Q+N S FSYCL +
Sbjct: 218 SQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQEPDLPS 277
Query: 70 LEFDSSLP----------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
++ P T L N + T Y++ L IS+GG P T
Sbjct: 278 YLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTLYFVHLQNISIGGTRFPAVST----- 332
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
+SG G + VD+G + TRL+ + L D ++ + + G CY S ++
Sbjct: 333 KSG-GNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTAA 391
Query: 177 VE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
E +P + HF + + LP +YL S + +S++GN Q Q T +
Sbjct: 392 DESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIYKSNIKGGISVLGNFQMQNTHM 451
Query: 234 SFNLRNSLIGFTPNKC 249
+ N + F C
Sbjct: 452 LLDTGNEKLSFVRADC 467
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/269 (25%), Positives = 109/269 (40%), Gaps = 38/269 (14%)
Query: 9 TLGSASVDNIAIGCGHNNEGL------------FVGAAGLLGLGGGSLSFPSQINASTFS 56
T G A D AIG G G +G++GLG S +Q+N + FS
Sbjct: 143 TGGMAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFS 202
Query: 57 YCLVDRDSDS----TSTLEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
YCL + S + + + +SS P T+ ++ + +Y + L GI GG
Sbjct: 203 YCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAP 262
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
L + S +++D+ + + L Y AL+ A +D C
Sbjct: 263 L-------QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLC 315
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--------SL 220
FS + + P + F F G L +P NYL+ NGT C ++S
Sbjct: 316 --FSKAVAGDAPELVFTFDGGAALTVPPANYLL-ASGNGTVCLTIGSSASLNLTGELEGA 372
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SI+G++QQ+ V F+L+ + F P C
Sbjct: 373 SILGSLQQENVHVLFDLKEETLSFKPADC 401
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/246 (29%), Positives = 101/246 (41%), Gaps = 19/246 (7%)
Query: 18 IAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL 76
IA GCG+ N E L G+LGLG S Q+ S FSYC+ D + + +
Sbjct: 208 IAFGCGYENGEQLESHFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGE 266
Query: 77 PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
+ + P E + + YY+ L GISVG L I FK G+I+DSGT T
Sbjct: 267 DADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFK-RRGPRTGVILDSGTLYT 325
Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV---PTVSFHFPEGKVL 192
L Y R+ + L P F + R S E+ P V+FHF G L
Sbjct: 326 WLADIAY---RELYNEIKSILDPKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAEL 382
Query: 193 PLPAKNYLIPVDSNGT---FCFAFAPTS------SSLSIIGNVQQQGTRVSFNLRNSLIG 243
+ A + P+ T FC + PT + IG + QQ + ++L+ I
Sbjct: 383 AMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIY 442
Query: 244 FTPNKC 249
C
Sbjct: 443 LQRIDC 448
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 111/264 (42%), Gaps = 55/264 (20%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G TET+ +G AS + GC N G+ ++G++GLG LS SQ+ + FSYCL
Sbjct: 178 GYLATETLHVGGASFPGVTFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLR 236
Query: 61 DRDSDSTSTLEFDSSLPP---NAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISETA 115
S + F S N + PLL N E+ ++YY+ LTGI+VG LP+
Sbjct: 237 SNADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPM---- 292
Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD---FS 172
A+ L T V GTR FD C+D
Sbjct: 293 -----------------AMANLTT----------VNGTR--------FGFDLCFDATAAG 317
Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNY--LIPVDSNGTF---CFAFAPTSS--SLSIIGN 225
V VPT+ F G + ++Y ++ VDS G C P S S+SIIGN
Sbjct: 318 GGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVLPASEKLSISIIGN 377
Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
V Q V ++L + F P C
Sbjct: 378 VMQMDLHVLYDLDGGMFSFAPADC 401
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 115/281 (40%), Gaps = 43/281 (15%)
Query: 8 VTLGSASVDNIAIG----------CGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFS 56
VT G+ ++D +AIG C ++ G A+GL+GLG G LS SQ++ F
Sbjct: 179 VTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFM 238
Query: 57 YCLVDRDSDSTSTLEFDSSLPP-----NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
YCL S ++ L + + VT + + ++YYL L G++VG
Sbjct: 239 YCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGT 298
Query: 112 SETA-------------------FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
+ A + G+IVD + ++ L+T Y+ L D
Sbjct: 299 TRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEE 358
Query: 153 TRALSPTDGVAL-FDTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
R T + L D C+ V VPTVS F +G+ L L + ++G
Sbjct: 359 IRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF-DGRWLELDRDRLFV---TDGR 414
Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S +SI+GN Q Q RV FNLR I F C
Sbjct: 415 MMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 455
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 125/273 (45%), Gaps = 27/273 (9%)
Query: 1 GDFVTETVTLGSAS----VDNIAIGCGHNNEGLF--VGAAGLLGLGGGSLSFPSQINA-- 52
GD V++ T+ S N+++GCG ++ GL + +G +G G++SF Q++A
Sbjct: 89 GDLVSDIATMDSVRNRKVAANLSLGCGRDSGGLLELLDTSGFVGFDKGNVSFMGQLSALG 148
Query: 53 --STFSYCLVD---RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
S F YCL R + ++S+ + P++ N + Y++ L+ IS+
Sbjct: 149 YRSKFIYCLPSDTFRGKLVIGNYKLRNASISSSMAYTPMITNPQAAELYFINLSTISIDK 208
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-----SPTDG 161
+ + F +G GG ++D+ T ++ L ++ Y L A T L S D
Sbjct: 209 NKFQVPIQGFL--SNGTGGTVIDTTTFLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADA 266
Query: 162 VALFDTCYDFSSRSSVEVP-TVSFHFPEGKVLPLPAKNYLIPVDS-NGTFCFAFAPTSS- 218
+ + + CY+ S+ S P T+++HF G + + L DS N T C A + S
Sbjct: 267 LGV-ELCYNISANSDFPPPATLTYHFLGGAGVEVSTWFLLDDSDSVNNTICMAIGRSESV 325
Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+L++IG QQ V ++L GF C
Sbjct: 326 GPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 119/263 (45%), Gaps = 18/263 (6%)
Query: 5 TETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFSYCL 59
T TV+ G A + + +GC G V A G+L LG G +SF ++ FS+CL
Sbjct: 246 TVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFSFCL 305
Query: 60 VDRDS--DSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
+ +S D++S L F + + P + ++ N ++ Y +TGI VGG+ L I +
Sbjct: 306 LSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIPQE 365
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-- 172
+ ++ GG+I+D+ T+VT L E Y A+ A R L + F+ CY ++
Sbjct: 366 IWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRWTFA 425
Query: 173 -----SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNV 226
+V VP ++ G L AK+ ++P G C AF I+GNV
Sbjct: 426 GDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGILGNV 485
Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
Q + + F +KC
Sbjct: 486 LMQEYIWEIDHGKGKMRFRKDKC 508
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 115/281 (40%), Gaps = 43/281 (15%)
Query: 8 VTLGSASVDNIAIG----------CGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFS 56
VT G+ ++D +AIG C ++ G A+GL+GLG G LS SQ++ F
Sbjct: 62 VTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFM 121
Query: 57 YCLVDRDSDSTSTLEFDSSLPP-----NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
YCL S ++ L + + VT + + ++YYL L G++VG
Sbjct: 122 YCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGT 181
Query: 112 SETA-------------------FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
+ A + G+IVD + ++ L+T Y+ L D
Sbjct: 182 TRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEE 241
Query: 153 TRALSPTDGVAL-FDTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
R T + L D C+ V VPTVS F +G+ L L + ++G
Sbjct: 242 IRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF-DGRWLELDRDRLFV---TDGR 297
Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+S +SI+GN Q Q RV FNLR I F C
Sbjct: 298 MMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/257 (31%), Positives = 115/257 (44%), Gaps = 19/257 (7%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGG-SLSFPSQINAS---TFS 56
G FV + VTL GCG + G F A+G+LGL G S SQ + FS
Sbjct: 206 GVFVCDEVTLKPDVFPKFQFGCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFS 265
Query: 57 YCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
YC ++ S L E S P+ LL N + Y++ L GISV L +S +
Sbjct: 266 YCFPHNENTRGSLLFGEKAISASPSLKFTRLL-NPSSGSVYFVELIGISVAKKRLNVSSS 324
Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDGVALFDTCYDF 171
F + G I+DSGT +T L T Y ALR AF + ++SP DTCY+
Sbjct: 325 LF-----ASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNL 379
Query: 172 S--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
++++P + HF + L L C AFA S S ++IIGN Q
Sbjct: 380 KGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQ 439
Query: 228 QQGTRVSFNLRNSLIGF 244
Q +V +++ +GF
Sbjct: 440 QVSLKVVYDIEGGRLGF 456
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 120/266 (45%), Gaps = 18/266 (6%)
Query: 2 DFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFS 56
+ T TV+ G A + + +GC G V A G+L LG G +SF ++ FS
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFS 302
Query: 57 YCLVDRDS--DSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
+CL+ +S D++S L F + + P + ++ N ++ Y +TGI VGG+ L I
Sbjct: 303 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDI 362
Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
+ + ++ GG+I+D+ T+VT L E Y A+ A R L + F+ CY +
Sbjct: 363 PQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRW 422
Query: 172 S-------SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
+ +V VP ++ G L AK+ ++P G C AF I+
Sbjct: 423 TFAGDGVDLTHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGIL 482
Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
GNV Q + + F +KC
Sbjct: 483 GNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|224146829|ref|XP_002336347.1| predicted protein [Populus trichocarpa]
gi|222834772|gb|EEE73235.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 124/289 (42%), Gaps = 47/289 (16%)
Query: 2 DFVTETVTLGS-ASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS----- 53
D++ TLGS +S+DN C +GL G GL LG +LS P QIN +
Sbjct: 139 DYLALLNTLGSLSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSP 198
Query: 54 -TFSYCLVDRDSDSTSTLEFDSSLPPN----------AVTAPLLRN----------HELD 92
F+ CL S L F S P N + PL+ N H L
Sbjct: 199 NCFAMCLSGSISQPGVAL-FGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDAVPNTHTLS 257
Query: 93 TFYYLGLTGISVGGDLLPISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
YY+GLT I V G ++ ++T ID +SG+GG + + T+LQ+ Y A AF+R
Sbjct: 258 PEYYVGLTAIKVNGKMVTFNKTLLAIDAQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLR 317
Query: 152 GTRA----LSPTDGVALFDTCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIP 202
+ L+ T V F CY S+ + + VP + V+ + N ++
Sbjct: 318 EAASSAFNLTTTKPVKPFSVCYPASAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMMR 377
Query: 203 VDSNGT--FCFAF----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
V +C A S+ +IG +Q + + F+L++ +GF+
Sbjct: 378 VTKKSVDLWCLGVVDGGAIDGPSI-MIGGLQLEDNLLQFDLQSKKLGFS 425
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 106/228 (46%), Gaps = 23/228 (10%)
Query: 35 GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
G+ G G +S SQ+++ FS+CL D+ L + PN V +PL+ +
Sbjct: 220 GIFGFGQQGMSVISQLSSQGIAPRVFSHCL-KGDNSGGGVLVLGEIVEPNIVYSPLVPSQ 278
Query: 90 ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
Y L L ISV G ++ I+ + F S N G IVDSGT + L E YN F
Sbjct: 279 P---HYNLNLQSISVNGQIVRIAPSVFA--TSNNRGTIVDSGTTLAYLAEEAYN----PF 329
Query: 150 VRGTRALSPTDGVALF---DTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIP--- 202
V A+ P ++ + CY ++ S+V++ P VS +F G L L ++YL+
Sbjct: 330 VIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNF 389
Query: 203 VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +C F S S++I+G++ + ++L IG+ C
Sbjct: 390 IGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 71/270 (26%), Positives = 116/270 (42%), Gaps = 29/270 (10%)
Query: 1 GDFVTETV----TLGSASVDN----IAIGCGHNNEGLFVGAA----GLLGLGGGSLSFPS 48
G +V+E++ +G + + N + GC G + G+ G G G LS S
Sbjct: 176 GYYVSESMYFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVIS 235
Query: 49 QINA-----STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q++A FS+CL + + L L P V +PL+ + Y L L IS
Sbjct: 236 QLSARGITPKVFSHCL-KGEGNGGGILVLGEVLEPGIVYSPLVPSQP---HYNLYLQSIS 291
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG-TRALSPTDGV 162
V G LPI + F S N G I+DSGT + L E Y A +++++PT +
Sbjct: 292 VNGQTLPIDPSVFA--TSINRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPT--I 347
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSSS 219
+ + CY S+ P VS +F + L + YL+ + D +C F
Sbjct: 348 SKGNQCYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEG 407
Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++I+G++ + ++L IG+ C
Sbjct: 408 VTILGDLVMKDKIFVYDLARQRIGWASYDC 437
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 114/273 (41%), Gaps = 35/273 (12%)
Query: 1 GDFVTETVTLGSAS--------VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPS 48
GDF+ + +TL + + GCG N G G++G G + S S
Sbjct: 168 GDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIIS 227
Query: 49 QINA-----STFSYCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
Q+ A FS+CL + + + E +S P T P++ N Y + L G+
Sbjct: 228 QLAAGGSTKRIFSHCLDNMNGGGIFAVGEVES---PVVKTTPIVPNQ---VHYNVILKGM 281
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
V GD PI +G+GG I+DSGT + L YN+L + + V
Sbjct: 282 DVDGD--PIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQV--KLHMV 337
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PT 216
C+ F+S + P V+ HF + L + +YL + + +CF +
Sbjct: 338 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-MYCFGWQSGGMTTQD 396
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ + ++G++ V ++L N +IG+ + C
Sbjct: 397 GADVILLGDLVLSNKLVVYDLENEVIGWADHNC 429
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 108/269 (40%), Gaps = 38/269 (14%)
Query: 9 TLGSASVDNIAIGCGHNNEGL------------FVGAAGLLGLGGGSLSFPSQINASTFS 56
T G A D AIG G G +G++GLG S +Q+N + FS
Sbjct: 143 TGGKAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFS 202
Query: 57 YCLVDRDSDS----TSTLEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
YCL + S + + + +SS P T+ ++ + +Y + L GI GG
Sbjct: 203 YCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAP 262
Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
L + S +++D+ + + L Y AL+ A +D C
Sbjct: 263 L-------QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLC 315
Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--------SL 220
F + + P + F F G L +P NYL+ NGT C ++S
Sbjct: 316 --FPKAVAGDAPELVFTFDGGAALTVPPANYLL-ASGNGTVCLTIGSSASLNLTGELEGA 372
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
SI+G++QQ+ V F+L+ + F P C
Sbjct: 373 SILGSLQQENVHVLFDLKEETLSFKPADC 401
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 114/273 (41%), Gaps = 35/273 (12%)
Query: 1 GDFVTETVTLGSAS--------VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPS 48
GDF+ + +TL + + GCG N G G++G G + S S
Sbjct: 172 GDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIIS 231
Query: 49 QINA-----STFSYCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
Q+ A FS+CL + + + E +S P T P++ N Y + L G+
Sbjct: 232 QLAAGGSTKRIFSHCLDNMNGGGIFAVGEVES---PVVKTTPIVPNQ---VHYNVILKGM 285
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
V GD PI +G+GG I+DSGT + L YN+L + + V
Sbjct: 286 DVDGD--PIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQV--KLHMV 341
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PT 216
C+ F+S + P V+ HF + L + +YL + + +CF +
Sbjct: 342 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-MYCFGWQSGGMTTQD 400
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ + ++G++ V ++L N +IG+ + C
Sbjct: 401 GADVILLGDLVLSNKLVVYDLENEVIGWADHNC 433
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 109/248 (43%), Gaps = 24/248 (9%)
Query: 17 NIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDST 67
+I GC ++ G G+ G G LS SQ+N+ FS+CL D +
Sbjct: 213 SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD-NGG 271
Query: 68 STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
L + P V PL+ + Y L L I V G LPI + F S G I
Sbjct: 272 GILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTI 326
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFH 185
VDSGT + L Y+ +A T A+SP+ V+ + C+ SS PTVS +
Sbjct: 327 VDSGTTLAYLADGAYDPFVNAI---TAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 383
Query: 186 FPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSL 241
F G + + +NYL+ +D+N +C + ++I+G++ + ++L N
Sbjct: 384 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMR 443
Query: 242 IGFTPNKC 249
+G+T C
Sbjct: 444 MGWTDYDC 451
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 114/273 (41%), Gaps = 35/273 (12%)
Query: 1 GDFVTETVTLGSAS--------VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPS 48
GDFV + +TL + + GCG N G G++G G + S S
Sbjct: 171 GDFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVIS 230
Query: 49 QINA-----STFSYCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
Q+ A FS+CL + + + E +S P T PL+ N Y + L G+
Sbjct: 231 QLAAGGSVKRIFSHCLDNMNGGGIFAIGEVES---PVVKTTPLVPNQ---VHYNVILKGM 284
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
V G+ PI +G+GG I+DSGT + L YN+L + + V
Sbjct: 285 DVDGE--PIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQV--KLHMV 340
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PT 216
C+ F+S + P V+ HF + L + +YL + + +CF +
Sbjct: 341 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-MYCFGWQSGGMTTQD 399
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ + ++G++ V ++L N +IG+ + C
Sbjct: 400 GADVILLGDLVLSNKLVVYDLENEVIGWADHNC 432
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 110/269 (40%), Gaps = 27/269 (10%)
Query: 1 GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
G ++T+T LG + V N I GC G G+ G G G LS S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255
Query: 49 QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q+++ FS+CL D L P V +PL+ + Y L L I
Sbjct: 256 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIG 311
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
V G +LP+ F + S G IVD+GT +T L E Y+ +A L T ++
Sbjct: 312 VNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-TPIIS 368
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
+ CY S+ S P+VS +F G + L ++YL D +C F
Sbjct: 369 NGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ 428
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I+G++ + ++L IG+ C
Sbjct: 429 TILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 109/248 (43%), Gaps = 24/248 (9%)
Query: 17 NIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDST 67
+I GC ++ G G+ G G LS SQ+N+ FS+CL D +
Sbjct: 213 SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD-NGG 271
Query: 68 STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
L + P V PL+ + Y L L I V G LPI + F S G I
Sbjct: 272 GILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTI 326
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFH 185
VDSGT + L Y+ +A T A+SP+ V+ + C+ SS PTVS +
Sbjct: 327 VDSGTTLAYLADGAYDPFVNAI---TAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 383
Query: 186 FPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSL 241
F G + + +NYL+ +D+N +C + ++I+G++ + ++L N
Sbjct: 384 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMR 443
Query: 242 IGFTPNKC 249
+G+T C
Sbjct: 444 MGWTDYDC 451
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 66/233 (28%), Positives = 102/233 (43%), Gaps = 26/233 (11%)
Query: 35 GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
G++G G LS P+Q+ A FS+CL + + L P PL+ +
Sbjct: 144 GIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPD- 201
Query: 90 ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
Y + L GISV + LPI F + + G+I+DSGT + + YN A
Sbjct: 202 --SVHYNVVLRGISVNSNRLPIDAEDFS--STNDTGVIMDSGTTLAYFPSGAYNVFVQAI 257
Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-----PVD 204
T A +P + C+ S R S P V+ +F EG + L NYL+ P
Sbjct: 258 REATSA-TPVRVQGMDTQCFLVSGRLSDLFPNVTLNF-EGGAMELQPDNYLMWGGTAPTG 315
Query: 205 SNGTFCFAFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +C + +SSS L+I+G++ + V ++L NS IG+ C
Sbjct: 316 TTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 368
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 65/223 (29%), Positives = 96/223 (43%), Gaps = 14/223 (6%)
Query: 35 GLLGLGGGSLSFPSQ-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
G+ G G G LS SQ I FS+CL D + L L P+ V +PL+ +
Sbjct: 211 GIFGFGPGPLSVVSQLSSQGITPKVFSHCL-KGDGNGGGILVLGEILEPSIVYSPLVPSQ 269
Query: 90 ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
Y L L I+V G LPI+ F I + GG IVD GT + L E Y+ L A
Sbjct: 270 P---HYNLNLQSIAVNGQPLPINPAVFSISNN-RGGTIVDCGTTLAYLIQEAYDPLVTA- 324
Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSN 206
+ + S + + CY S+ P VS +F G + L + YL+ +D
Sbjct: 325 INTAVSQSARQTNSKGNQCYLVSTSIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGA 384
Query: 207 GTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+C F SI+G++ + V +++ IG+ C
Sbjct: 385 EMWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDC 427
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 34/273 (12%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E +T + + + +GC + G+LG+ G LSF SQ S FSYC+
Sbjct: 164 GNLVKEKITFSNTEITPPLILGCATESSD----DRGILGMNRGRLSFVSQAKISKFSYCI 219
Query: 60 VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
+ + T T F PN+ +T P R LD Y + + GI G
Sbjct: 220 PPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLK 279
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR-DAFVRGTRALSP---TDGVA 163
L IS + F+ D G+G +VDSG+ T L Y+ +R + R R L G A
Sbjct: 280 KLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTA 339
Query: 164 LFDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
D C+D + +P + F F G + +P + L+ V G C +S
Sbjct: 340 --DMCFD---GNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNV-GGGIHCVGIGRSSML 393
Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++ +IIGNV QQ V F++ N +GF C
Sbjct: 394 GAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 109/248 (43%), Gaps = 24/248 (9%)
Query: 17 NIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDST 67
+I GC ++ G G+ G G LS SQ+N+ FS+CL D +
Sbjct: 239 SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD-NGG 297
Query: 68 STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
L + P V PL+ + Y L L I V G LPI + F S G I
Sbjct: 298 GILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTI 352
Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFH 185
VDSGT + L Y+ +A T A+SP+ V+ + C+ SS PTVS +
Sbjct: 353 VDSGTTLAYLADGAYDPFVNAI---TAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 409
Query: 186 FPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSL 241
F G + + +NYL+ +D+N +C + ++I+G++ + ++L N
Sbjct: 410 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMR 469
Query: 242 IGFTPNKC 249
+G+T C
Sbjct: 470 MGWTDYDC 477
>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 73.2 bits (178), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 56/121 (46%), Gaps = 2/121 (1%)
Query: 92 DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
+TFYY+ L G+S+G L + F D GNGG I+DSGT T E Y + AF
Sbjct: 32 NTFYYIDLRGVSIGRKRLNLPSKLFSFDNKGNGGTIIDSGTTFTIFNEEFYKNITAAFAS 91
Query: 152 --GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF 209
G R S + CY+ S V +P +FHF G + LP NY S +
Sbjct: 92 QIGFRRASEVEARTGMRLCYNASGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSI 151
Query: 210 C 210
C
Sbjct: 152 C 152
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 109/269 (40%), Gaps = 27/269 (10%)
Query: 1 GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
G ++T+T LG + V N I GC G G+ G G G LS S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255
Query: 49 QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q+++ FS+CL D L P V +PLL + Y L L I
Sbjct: 256 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLLPSQP---HYNLNLLSIG 311
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
V G +LPI F + S G IVD+GT +T L E Y+ +A L T ++
Sbjct: 312 VNGQILPIDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLV-TLIIS 368
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
+ CY S+ S P VS +F G + L ++YL D +C F
Sbjct: 369 NGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQ 428
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I+G++ + ++L IG+ C
Sbjct: 429 TILGDLVLKDKVFVYDLARQRIGWANYDC 457
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 110/269 (40%), Gaps = 27/269 (10%)
Query: 1 GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
G ++T+T LG + V N I GC G G+ G G G LS S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255
Query: 49 QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q+++ FS+CL D L P V +PL+ + Y L L I
Sbjct: 256 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIG 311
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
V G +LP+ F + S G IVD+GT +T L E Y+ +A L T ++
Sbjct: 312 VNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-TPIIS 368
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
+ CY S+ S P+VS +F G + L ++YL D +C F
Sbjct: 369 NGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ 428
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I+G++ + ++L IG+ C
Sbjct: 429 TILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|125575539|gb|EAZ16823.1| hypothetical protein OsJ_32295 [Oryza sativa Japonica Group]
Length = 383
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 115/272 (42%), Gaps = 33/272 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
G T+ V +G+A+ ++A GC ++ + G +G +GL LS +Q+N + FS+C
Sbjct: 116 GKIGTDAVAIGTATAASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHC 175
Query: 59 LVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRN--HELDTFYYL-GLTGISVGGD 107
L D A+T P +++ ++ + YYL L GI G
Sbjct: 176 LAPHDGGGGKNSRLFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAG-- 233
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVAL 164
E + +SG +++ + + V+ L Y L+ A V G A P ++
Sbjct: 234 ----DEAIITVPQSGR-TVLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSI 288
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS------- 217
FD C+ S P V F L +P NYL+ V + T C A A ++
Sbjct: 289 FDLCFKRGGVSG--APDVVLTFQGAAALTVPPTNYLLDVGDD-TVCVAIASSARLNSTEV 345
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +SI+G +QQQ ++L + F C
Sbjct: 346 AGMSILGGLQQQNVHFLYDLEKETLSFEAADC 377
>gi|194699670|gb|ACF83919.1| unknown [Zea mays]
Length = 102
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
P + CY+ S EVP +S F +G V PA+NY I +D +G C A T
Sbjct: 7 PVPDFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTP 66
Query: 218 SS-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +SIIGN QQQ V+++L N+ +GF P +C
Sbjct: 67 RTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRC 99
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 110/269 (40%), Gaps = 27/269 (10%)
Query: 1 GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
G ++T+T LG + V N I GC G G+ G G G LS S
Sbjct: 201 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 260
Query: 49 QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q+++ FS+CL D L P V +PL+ + Y L L I
Sbjct: 261 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIG 316
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
V G +LP+ F + S G IVD+GT +T L E Y+ +A L T ++
Sbjct: 317 VNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-TPIIS 373
Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
+ CY S+ S P+VS +F G + L ++YL D +C F
Sbjct: 374 NGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ 433
Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+I+G++ + ++L IG+ C
Sbjct: 434 TILGDLVLKDKVFVYDLARQRIGWASYDC 462
>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
Length = 445
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 114/274 (41%), Gaps = 44/274 (16%)
Query: 14 SVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDS 66
S N+ CG EGL G G+ GLG ++ PSQ A+ F+ CL + +
Sbjct: 155 SFPNVIFTCGSTFLLEGLASGVTGIAGLGRKKIALPSQFAAAFSFKRKFALCL-SSSTRA 213
Query: 67 TSTLEF---------DSSLPPNAVTAPLLRNH----------ELDTFYYLGLTGISVGGD 107
T + F + + N + PL+ N E Y++G+ GI V G+
Sbjct: 214 TGVVFFGDGPYIMLPNKDVSQNLIYTPLILNPVSTAGASFEGEPSADYFIGVKGIKVNGE 273
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
+ ++ + I + G GG + + T L+T Y A+ AF + + VA F+
Sbjct: 274 DVKLNTSLLSIAKDGTGGTKISTTQPYTSLETSIYKAVIGAFGKAVAKVPRVTAVAPFEL 333
Query: 168 CYDFSSRSSVE----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--------- 214
C++ +S SS VP + P K + N ++ V S+ C F
Sbjct: 334 CFNSTSFSSTRVGPGVPQIDLVLPNNKAWTIFGANSMVQV-SDDVLCLGFVDGGPLHFVD 392
Query: 215 ---PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
P + + +IG Q + + F+L +S +GF+
Sbjct: 393 WGIPFTPTAIVIGGHQIEDNLLQFDLGSSTLGFS 426
>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 436
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 72/277 (25%), Positives = 112/277 (40%), Gaps = 48/277 (17%)
Query: 13 ASVDNIAIGCGHN--NEGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDS- 64
SV N CG N GL G G+ GLG +S PSQ +++ F+ CL ++
Sbjct: 144 VSVPNFLFICGSNVVQNGLAKGVKGMAGLGRTKVSLPSQFSSAFSFKNKFAICLGTQNGV 203
Query: 65 ----DSTSTLEFDSSLPPNAVTAPLLRNH----------ELDTFYYLGLTGISVGGDLLP 110
D FD S N + PL+ N E Y++G+ I V +
Sbjct: 204 LFFGDGPYLFNFDES--KNLIYTPLITNPVSTSPSSFLGEKSVEYFIGVKSIRVSSKNVK 261
Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
++ T ID++G GG + + T ++T Y A+ DAFV+ +S + VA F TC+
Sbjct: 262 LNTTLLSIDQNGFGGTKISTVNPYTIMETSIYKAVADAFVKALN-VSTVEPVAPFGTCFA 320
Query: 171 ----FSSRSSVEVPTVSFHFP-EGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS---- 221
SSR +VP++ E V + N ++ ++ C F S +
Sbjct: 321 SQSISSSRMGPDVPSIDLVLQNENVVWNIIGANAMVRINDKDVICLGFVDAGSDFAKTSQ 380
Query: 222 --------------IIGNVQQQGTRVSFNLRNSLIGF 244
IG Q + + F+L S +GF
Sbjct: 381 VGFVVGGSKPMTSITIGAHQLENNLLQFDLATSRLGF 417
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 115/272 (42%), Gaps = 33/272 (12%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
G T+ V +G+A+ ++A GC ++ + G +G +GL LS +Q+N + FS+C
Sbjct: 115 GKIGTDAVAIGTATAASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHC 174
Query: 59 LVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRN--HELDTFYYL-GLTGISVGGD 107
L D A+T P +++ ++ + YYL L GI G
Sbjct: 175 LAPHDGGGGKNSRLFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAG-- 232
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVAL 164
E + +SG +++ + + V+ L Y L+ A V G A P ++
Sbjct: 233 ----DEAIITVPQSGR-TVLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSI 287
Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS------- 217
FD C+ S P V F L +P NYL+ V + T C A A ++
Sbjct: 288 FDLCFKRGGVSG--APDVVLTFQGAAALTVPPTNYLLDVGDD-TVCVAIASSARLNSTEV 344
Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +SI+G +QQQ ++L + F C
Sbjct: 345 AGMSILGGLQQQNVHFLYDLEKETLSFEAADC 376
>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 55/120 (45%), Gaps = 2/120 (1%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
TFYY+ L G+S+G L + F D GNGG I+DSGT T E Y + AF
Sbjct: 33 TFYYIDLRGVSIGRKRLNLPSKLFSFDSKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92
Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
G R S + CY+ S V +P +FHF G + LP NY S + C
Sbjct: 93 IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSIC 152
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/233 (28%), Positives = 102/233 (43%), Gaps = 26/233 (11%)
Query: 35 GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
G++G G LS P+Q+ A FS+CL + + L P PL+ +
Sbjct: 171 GIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPDS 229
Query: 90 ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
Y + L GISV + LPI F + + G+I+DSGT + + YN A
Sbjct: 230 ---VHYNVVLRGISVNSNRLPIDAEDFS--STNDTGVIMDSGTTLAYFPSGAYNVFVQAI 284
Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-----PVD 204
T A +P + C+ S R S P V+ +F EG + L NYL+ P
Sbjct: 285 REATSA-TPVRVQGMDTQCFLVSGRLSDLFPNVTLNF-EGGAMELQPDNYLMWGGTAPTG 342
Query: 205 SNGTFCFAFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +C + +SSS L+I+G++ + V ++L NS IG+ C
Sbjct: 343 TTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|357440781|ref|XP_003590668.1| Basic 7S globulin [Medicago truncatula]
gi|355479716|gb|AES60919.1| Basic 7S globulin [Medicago truncatula]
Length = 434
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/252 (27%), Positives = 111/252 (44%), Gaps = 39/252 (15%)
Query: 27 EGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSDSTSTLEFDS---SLPP 78
EGL GA+G+ GLG L+ PSQ+ A F+ CL S S + F P
Sbjct: 170 EGLASGASGMAGLGRNKLALPSQLASAFSFAKKFAICL----SSSKGVVLFGDGPYGFLP 225
Query: 79 NAV-------TAPLLRN---------HELDTFYYLGLTGISVGGDLLPISETAFKIDES- 121
N V PLL N E Y++G+ I + G ++ + + ID S
Sbjct: 226 NVVFDSKSLTYTPLLINPFSTAAFAKSEPSAEYFIGVKTIKIDGKVVSLDTSLLSIDSSN 285
Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGT--RALSPTDGVALFDTCYD--FSSRSSV 177
G GG + + T L+ Y A+ DAFV+ + R + D VA F+ CY +R
Sbjct: 286 GAGGTKISTVDPYTVLEASIYKAVTDAFVKASAARNIKRVDSVAPFEFCYTNVTGTRLGA 345
Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNVQQQGTRV 233
+VPT+ + + + N ++ ++ + C F T +S+ +IG Q + +
Sbjct: 346 DVPTIELYLQNNVIWRIFGANSMVNIN-DEVLCLGFVIGGENTWASI-VIGGYQLENNLL 403
Query: 234 SFNLRNSLIGFT 245
F+L S +GF+
Sbjct: 404 QFDLAASKLGFS 415
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 118/257 (45%), Gaps = 31/257 (12%)
Query: 11 GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSD 65
G+A+ +I GC N G + A G++G G S + P+QI + FS+CL +
Sbjct: 192 GNATTSHIFFGCAINITGSWP-ADGIMGFGQISKTVPNQIATQRNMSRVFSHCL-GGEKH 249
Query: 66 STSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI--DE 120
LEF PN V PLL + T Y + L ISV +LPI F +
Sbjct: 250 GGGILEFGEE--PNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNS 304
Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSPT-DGVALFDTCYDFSSRSSVE 178
+ G+I+DSGT+ L T+ L T A L P +G+ C+ S +VE
Sbjct: 305 TNETGVIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEGLQ----CFYLKSGLTVE 360
Query: 179 V--PTVSFHFPEGKVLPLPAKNYLIPVD----SNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
P V+ F G + L NYL+ V+ NG +C+A++ ++ L+I G + +
Sbjct: 361 TSFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNG-YCYAWS-SADGLTIFGEIVLKDKL 418
Query: 233 VSFNLRNSLIGFTPNKC 249
V +++ N IG+ C
Sbjct: 419 VFYDVENRRIGWKGQNC 435
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 34/273 (12%)
Query: 1 GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
G+ V E +T + + + +GC + G+LG+ G LSF SQ S FSYC+
Sbjct: 164 GNLVKEKITFSNTEITPPLILGCATESSD----DRGILGMNRGRLSFVSQAKISKFSYCI 219
Query: 60 VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
+ + T T F PN+ +T P R LD Y + + GI G
Sbjct: 220 PPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLK 279
Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR-DAFVRGTRALSP---TDGVA 163
L IS + F+ D G+G +VDSG+ T L Y+ +R + R R L G A
Sbjct: 280 KLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTA 339
Query: 164 LFDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
D C+D + +P + F F G + +P + L+ V G C +S
Sbjct: 340 --DMCFD---GNVAMIPRLIGDLVFVFTRGVEIFVPKERVLVNV-GGGIHCVGIGRSSML 393
Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++ +IIGNV QQ V F++ N +GF C
Sbjct: 394 GAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 104/244 (42%), Gaps = 27/244 (11%)
Query: 17 NIAIGCGHNNEGLFVGAA---GLLGLGGGSLSFPS-----QINASTFSYCLVDRDSDSTS 68
I GCG G F+ AA GL GLG +S PS + +++FS C D
Sbjct: 213 QIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCF---GRDGIG 269
Query: 69 TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
+ F + PL N + T Y + +TGI+VG +L+ + + I
Sbjct: 270 RISFGDQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLEVST-----------IF 317
Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDGVALFDTCYDFSS-RSSVEVPTVSFHF 186
D+GT+ T L Y + D F +A D F+ CYD SS + ++ P++S
Sbjct: 318 DTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRT 377
Query: 187 PEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
G + P +I + + +C A S+ L+IIG G RV F+ ++G+
Sbjct: 378 VGGSLFPAIDPGQVISIQQHEYVYCLAIV-KSTKLNIIGQNFMTGVRVVFDRERKILGWK 436
Query: 246 PNKC 249
C
Sbjct: 437 KFNC 440
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 125/273 (45%), Gaps = 31/273 (11%)
Query: 1 GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS- 53
G+ TET+++ S+S + A GCG+NN G F + GG LS SQ+ +S
Sbjct: 176 GEVATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSI 235
Query: 54 --TFSYCLVDRDSDS---------TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
FSYCL + + T+++ S +T PL++ + +T+Y+L L I
Sbjct: 236 GKKFSYCLSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQK-DPETYYFLTLEAI 294
Query: 103 SVGGDLLPIS---ETAFKIDESGNGGIIVDSGTAVTRLQTETYN---ALRDAFVRGTRAL 156
+VG LP + + G II+DSGT +T L + Y+ A+ + V G + +
Sbjct: 295 TVGKTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRV 354
Query: 157 SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
S G+ C+ S + +PT++ HF V P +++ S C + PT
Sbjct: 355 SDPQGI--LTHCFK-SGDKEIGLPTITMHFTGADVKLSPINSFVKL--SEDIVCLSMIPT 409
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ ++I GN+ Q V ++L + F C
Sbjct: 410 -TEVAIYGNMVQMDFLVGYDLETKTVSFQRMDC 441
>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/290 (27%), Positives = 115/290 (39%), Gaps = 66/290 (22%)
Query: 17 NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSDSTSTL 70
N GC H +G AG G G LS P+Q+ ++FSYCLV DS
Sbjct: 211 NFTFGCAHTTLAEPIGVAGF---GRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVR 267
Query: 71 E---------------------------FDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
+ V +L N + FY + L GIS
Sbjct: 268 RPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVSLQGIS 327
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--------VRGTRA 155
+G +P +ID++G GG++VDSGT T L + YN++ + F R R
Sbjct: 328 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADR- 386
Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK-VLPLPAKNYLIPV----------D 204
+ P+ G++ CY + +V+VP + HF + + LP +NY
Sbjct: 387 VEPSSGMS---PCYYLN--QTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 441
Query: 205 SNGTFCFAFAPTSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
G S L +I+GN QQQG V ++L N +GF KC
Sbjct: 442 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 491
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 104/244 (42%), Gaps = 27/244 (11%)
Query: 17 NIAIGCGHNNEGLFVGAA---GLLGLGGGSLSFPS-----QINASTFSYCLVDRDSDSTS 68
I GCG G F+ AA GL GLG +S PS + +++FS C D
Sbjct: 213 QIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCF---GRDGIG 269
Query: 69 TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
+ F + PL N + T Y + +TGI+VG +L+ + + I
Sbjct: 270 RISFGDQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLEVST-----------IF 317
Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDGVALFDTCYDFSS-RSSVEVPTVSFHF 186
D+GT+ T L Y + D F +A D F+ CYD SS + ++ P++S
Sbjct: 318 DTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRT 377
Query: 187 PEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
G + P +I + + +C A S+ L+IIG G RV F+ ++G+
Sbjct: 378 VGGSLFPAIDPGQVISIQQHEYVYCLAIV-KSTKLNIIGQNFMTGVRVVFDRERKILGWK 436
Query: 246 PNKC 249
C
Sbjct: 437 KFNC 440
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 112/249 (44%), Gaps = 35/249 (14%)
Query: 35 GLLGLGGGSLSFPSQIN--ASTFSYCL-----VDRDSDSTSTLEFDSSLPPNAVTA---- 83
G+ G G G LS PSQ+ FS+C V+ + S+ + S+L N +
Sbjct: 159 GIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFT 218
Query: 84 PLLRNHELDTFYYLGLTGISVGGDLLP--ISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
P+L YY+GL I++G ++ P + T + D GNGG++VDSGT T L
Sbjct: 219 PMLNTPVYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPF 278
Query: 142 YNAL---RDAFVRGTRALSPTDGVALFDTCYDF----SSRSSVE------VPTVSFHFPE 188
Y+ L + + RA + T+ FD CY ++ +S+E P+++F+F
Sbjct: 279 YSQLLTILQSTITYPRA-TETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLN 337
Query: 189 GKVLPLPAKN--YLIPVDSNGTF--CFAFAPTSS----SLSIIGNVQQQGTRVSFNLRNS 240
L LP N Y + S+G+ C F + G+ QQQ +V ++L
Sbjct: 338 NATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKE 397
Query: 241 LIGFTPNKC 249
IGF C
Sbjct: 398 RIGFQAMDC 406
>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 55/120 (45%), Gaps = 2/120 (1%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
TFYY+ L G+S+G L + F D GNGG I+DSGT T E Y + AF
Sbjct: 33 TFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92
Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
G R S + CY+ S V +P +FHF G + LP NY S + C
Sbjct: 93 IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSIC 152
>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
Length = 500
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 108/255 (42%), Gaps = 37/255 (14%)
Query: 27 EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDS-----DSTSTLEFDSSL 76
GL GA+G+ GLG ++ PSQ+ ++ F++C D D + D+
Sbjct: 171 RGLAGGASGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSDGVIIFGDGPYSFLADNPS 230
Query: 77 PPNAV-------TAPLLRNH----------ELDTFYYLGLTGISVGGDLLPISETAFKID 119
PN V PLL NH E Y++G+ I + G ++ ++ + ID
Sbjct: 231 LPNVVFDSKSLTYTPLLINHVSTASAFLQGESSVEYFIGVKTIKIDGKVVSLNSSLLSID 290
Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT--RALSPTDGVALFDTCYDFSSRS-- 175
G GG + + T L+ Y A+ DAFV+ + R ++ D F+ CY F +
Sbjct: 291 NKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNITTEDSSPPFEFCYSFDNLPGT 350
Query: 176 --SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS---IIGNVQQQG 230
VPT+ + + N ++ ++ + C F +L +IG Q +
Sbjct: 351 PLGASVPTIELLLQNNVIWSMFGANSMVNIN-DEVLCLGFVNGGVNLRTSIVIGGYQLEN 409
Query: 231 TRVSFNLRNSLIGFT 245
+ F+L S +GF+
Sbjct: 410 NLLQFDLAASRLGFS 424
>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 154
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 52/110 (47%), Gaps = 2/110 (1%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
TFYY+ L G+S+G L + F D GNGG I+DSGT T E Y + AF
Sbjct: 33 TFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92
Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
G R S + CY+ S V +P +FHF G + LP NY
Sbjct: 93 IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYF 142
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 124/285 (43%), Gaps = 39/285 (13%)
Query: 1 GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
G + +T+ +V +GC + + +GL G G G+ S P+Q+ S FSYCL+
Sbjct: 212 GLLIADTLRAPGRAVSGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLSKFSYCLL 269
Query: 61 DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELD-----TFYYLGLTGISVGGDLLPIS 112
R D + + L + PL+++ D +YYL L+G++VGG + +
Sbjct: 270 SRRFDDNAAVSGSLVLGGDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVTVGGKAVRLP 329
Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALFDT 167
AF + +G+GG IVDSGT T L + + DA V R R+ +G+ L
Sbjct: 330 ARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGL-HP 388
Query: 168 CYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI-----PVD-------SNGTFCFAF- 213
C+ S+ +P +S HF G V+ LP +NY + PV + C A
Sbjct: 389 CFALPQGAKSMALPELSLHFKGGAVMQLPLENYFVVAGRAPVPGAGAGAGAAEAICLAVV 448
Query: 214 ---------APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
I+G+ QQQ V ++L +GF C
Sbjct: 449 TDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPC 493
>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 55/120 (45%), Gaps = 2/120 (1%)
Query: 93 TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
TFYY+ L G+S+G L + F D GNGG I+DSGT T E Y + AF
Sbjct: 33 TFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92
Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
G R S + CY+ S V +P +FHF G + LP NY S + C
Sbjct: 93 IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSIC 152
>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 117/274 (42%), Gaps = 29/274 (10%)
Query: 2 DFVTETVTLGS--ASVDNIAIGCGHNNEGLFVG--AAGLLGLGGGSLSFPSQINAS---- 53
DFV + GS +SV+ + GC HN + AG++ L SF Q++A
Sbjct: 174 DFVFDGSGPGSPISSVNGLVFGCAHNTHDFYNHDLWAGVMSLNRHPTSFIRQLSARGLAA 233
Query: 54 -TFSYCLVDRDS-DSTSTLEFDSSLP--PNAVTAPLLRNHELD---TFYYLGLTGISVGG 106
FSYCL R D L F + +P +A + PLL +Y + G
Sbjct: 234 PRFSYCLASRQHRDRRGFLRFGADIPDQSHARSTPLLHGDLAQGGGMYYVGVVGVSLGGR 293
Query: 107 DLLPISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
L I+ F+++ S GG I+D GT++T + T Y+ L + R+ A+F
Sbjct: 294 RLTAITPVMFELNRRSLRGGCIIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQH--AIF 351
Query: 166 DTCYDFSSRSSVE-----VPTVSFHF---PEGKVLPLPAKNYLIPVDSNGT--FCFAFAP 215
R E +P+V+ HF PE L + + + + T C A P
Sbjct: 352 SPGQKHCFRGKWESIHRHLPSVTLHFQFHPESVALFIRPELLFVAMTGERTDYVCLAIVP 411
Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+ +IIG Q TR +F+L+ + + F P +C
Sbjct: 412 YAER-TIIGAGQMLDTRFTFDLQQNRLFFAPEQC 444
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 114/276 (41%), Gaps = 40/276 (14%)
Query: 1 GDFVTETVTLGSASVD--------NIAIGCGHNNEGLFVGAA-----GLLGLGGGSLSFP 47
G FV + V S + D ++ GCG G + G+LG G + S
Sbjct: 176 GYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMI 235
Query: 48 SQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
SQ+ +S F++CL R+ + P PL+ N Y + +T +
Sbjct: 236 SQLASSGRVKKIFAHCLDGRNGGGI--FAIGRVVQPKVNMTPLVPNQP---HYNVNMTAV 290
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
VG + L I F+ + G I+DSGT + L Y L V+ + P V
Sbjct: 291 QVGQEFLNIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKV 344
Query: 163 ALFDT---CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
+ D C+ +S R P V+FHF L + +YL P + G +C + ++
Sbjct: 345 HIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLFPYE--GMWCIGWQNSAMQ 402
Query: 218 ----SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+++++G++ V ++L N LIG+T C
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438
>gi|242086416|ref|XP_002443633.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
gi|241944326|gb|EES17471.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
Length = 503
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 98/214 (45%), Gaps = 24/214 (11%)
Query: 53 STFSYCLVDR-DSDSTSTLEFDSSLPPNAVTA--PLLRN---HELDTFYYLGLTGISVGG 106
+ FSYCL S +L D+++ + VTA PL+ N EL + Y++ L G+S+G
Sbjct: 297 AAFSYCLPKSPSSQGYLSLAVDATVRHDKVTAHAPLVSNGGDPELASMYFIDLVGMSLGV 356
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGV 162
D +PI GN G+ +D GT T+L E Y LRD+F + +L DG
Sbjct: 357 DDIPIPPAG----SFGNNGVNLDLGTTFTKLTPEVYMTLRDSFRKQMSQNNHSLLGFDG- 411
Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT----FCFAFAPTSS 218
FDTC++ + + +P + F F G+ L + L D C AF+ +
Sbjct: 412 --FDTCFNLTGVRDLAMPLLWFKFSNGERLLIDLDQMLYYDDPAAAPFTMACLAFSSLDA 469
Query: 219 SLS---IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
S +IG T V +++ +GF P C
Sbjct: 470 GDSFSAVIGTHTLASTEVIYDVAGGKVGFIPRSC 503
>gi|242086414|ref|XP_002443632.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
gi|241944325|gb|EES17470.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
Length = 556
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 79/174 (45%), Gaps = 17/174 (9%)
Query: 88 NHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
N EL + Y++ L GIS+G + L I F GN +D GT T L + Y ALR+
Sbjct: 388 NPELASMYFIDLVGISLGDEDLSIPAGTF-----GNRSTNLDVGTTFTILAPDAYTALRE 442
Query: 148 AFVRGTRAL----SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV 203
+F R SPTD FDTC++F+ + + +P V F G +L + A L
Sbjct: 443 SFKRQMSQYNFSSSPTDIAGGFDTCFNFTDLNDLVIPNVQLKFSNGDMLVIDADQMLYYD 502
Query: 204 DSNGT-----FCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
D C AF+ S ++IG+ T V +++ +GF P C
Sbjct: 503 DDTDAAPFTMACLAFSSLDAGDSFAAVIGSYTLATTEVVYDVAGGQVGFIPWSC 556
>gi|449432733|ref|XP_004134153.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 432
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 110/267 (41%), Gaps = 34/267 (12%)
Query: 12 SASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVD--- 61
+ S+ N CG EGL G +G+ G G +S PSQ +A+ F+ CL
Sbjct: 148 AVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAAFSFNRKFAVCLSGSTR 207
Query: 62 ---------------RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
++ D T +L + TA + + E + Y++G+ I
Sbjct: 208 SPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGVSTSGEKSSEYFIGVKSIVFNS 267
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+PI+ T KID +GNGG + + T L++ YNAL R R + VA F
Sbjct: 268 KTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNALVKTITRELRNIPRVAAVAPFG 327
Query: 167 TCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIPVDSNGTFCFAFAP---TSS 218
CY S S +P++ KV+ + N ++ V+ C F +
Sbjct: 328 VCYKSKSFGSTRLGPGMPSIDLILQNKKVIWRIFGANSMVQVNEE-VLCLGFVDGGVEAR 386
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFT 245
+ +IG Q + + F+L S +GF+
Sbjct: 387 TAIVIGAYQMEDNLLEFDLATSRLGFS 413
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 113/276 (40%), Gaps = 40/276 (14%)
Query: 1 GDFVTETVTLGSASVD--------NIAIGCGHNNEGLFVGAA-----GLLGLGGGSLSFP 47
G FV + V S + D ++ GCG G + G+LG G + S
Sbjct: 176 GYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMI 235
Query: 48 SQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
SQ+ +S F++CL R+ + P PL+ N Y + +T +
Sbjct: 236 SQLASSGRVKKIFAHCLDGRNGGGI--FAIGRVVQPKVNMTPLVPNQP---HYNVNMTAV 290
Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
VG + L I F+ + G I+DSGT + L Y L V+ + P V
Sbjct: 291 QVGQEFLTIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKV 344
Query: 163 ALFDT---CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
+ D C+ +S R P V+FHF L + +YL P G +C + ++
Sbjct: 345 HIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLFP--HEGMWCIGWQNSAMQ 402
Query: 218 ----SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+++++G++ V ++L N LIG+T C
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438
>gi|383147800|gb|AFG55671.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
gi|383147802|gb|AFG55672.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
gi|383147804|gb|AFG55673.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
gi|383147806|gb|AFG55674.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
Length = 59
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/59 (54%), Positives = 42/59 (71%)
Query: 191 VLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
+L LP NY++PVD+ GT CFAFAPT S SI+GN+QQQ VS++ N IGF ++C
Sbjct: 1 ILSLPTNNYVVPVDNMGTHCFAFAPTDSGFSIMGNIQQQHIGVSYDTYNGQIGFALDQC 59
>gi|449527083|ref|XP_004170542.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Cucumis
sativus]
Length = 432
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 110/267 (41%), Gaps = 34/267 (12%)
Query: 12 SASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVD--- 61
+ S+ N CG EGL G +G+ G G +S PSQ +A+ F+ CL
Sbjct: 148 AVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAAFSFNRKFAVCLSGSTR 207
Query: 62 ---------------RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
++ D T +L + TA + + E + Y++G+ I
Sbjct: 208 SPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGVSTSGEKSSEYFIGVKSIVFNS 267
Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
+PI+ T KID +GNGG + + T L++ YNAL R R + VA F
Sbjct: 268 KTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNALVKTITRELRNIPRVAAVAPFG 327
Query: 167 TCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIPVDSNGTFCFAFAP---TSS 218
CY S S +P++ KV+ + N ++ V+ C F +
Sbjct: 328 VCYKSKSFGSTRLGPGMPSIDLILQNKKVIWRIFGANSMVQVNEE-VLCLGFVDGGVEAR 386
Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFT 245
+ +IG Q + + F+L S +GF+
Sbjct: 387 TAIVIGAYQMEDNLLEFDLATSRLGFS 413
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 84/196 (42%), Gaps = 19/196 (9%)
Query: 34 AGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTLEFDSSLPPNAV----------- 81
+G+ G G G S P Q+ FSYCL+ R DS + + + P++
Sbjct: 234 SGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTP 293
Query: 82 --TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQT 139
P+ N +YY+ L I VG + + + GNGG IVDSG+ T ++
Sbjct: 294 FRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEK 353
Query: 140 ETYNALRDAFVRG----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
+ A+ F R TRA + + ++ C++ S SV +P++ F F G + LP
Sbjct: 354 PVFEAVATEFDRQMANYTRA-ADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELP 412
Query: 196 AKNYLIPVDSNGTFCF 211
NY V C
Sbjct: 413 VANYFSLVGDLSVLCL 428
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 115/273 (42%), Gaps = 35/273 (12%)
Query: 1 GDFVTETV----TLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
G +V++T+ LG + V N I GC G G+ G G G LS S
Sbjct: 163 GYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVIS 222
Query: 49 Q-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
Q I FS+CL + L L P V +PL+ + Y L L I+
Sbjct: 223 QLSTHGITPRVFSHCL-KGEGIGGGILVLGEILEPGMVYSPLVPSQP---HYNLNLQSIA 278
Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTD 160
V G LLPI + F S + G IVDSGT + L E Y D FV + S T
Sbjct: 279 VNGKLLPIDPSVFA--TSNSQGTIVDSGTTLAYLVAEAY----DPFVSAVNVIVSPSVTP 332
Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV-DSNG---TFCFAFAPT 216
++ + CY S+ S P SF+F G + L ++YLIP S G +C F
Sbjct: 333 IISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKV 392
Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
++I+G++ + ++L IG+ C
Sbjct: 393 -QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 424
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.136 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,971,609,766
Number of Sequences: 23463169
Number of extensions: 171560387
Number of successful extensions: 425754
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 869
Number of HSP's successfully gapped in prelim test: 1033
Number of HSP's that attempted gapping in prelim test: 421022
Number of HSP's gapped (non-prelim): 2007
length of query: 249
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 110
effective length of database: 9,097,814,876
effective search space: 1000759636360
effective search space used: 1000759636360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 75 (33.5 bits)
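
The bit scores and E-values reported for each hit can be cross-checked against the statistical parameters printed above. The sketch below is not part of the BLAST output. It assumes the second "Lambda K H" row (0.267, 0.0410) holds the gapped parameters and uses the standard Karlin-Altschul conversions; the function names are illustrative. Compositional matrix adjustment can shift individual E-values slightly, so agreement is only approximate.

```python
import math

# Values copied from the report trailer above.
# Assumption: the second "Lambda K H" row is the gapped parameter set.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 1000759636360  # "effective search space"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Raw score 176 is the value shown in parentheses for the hits above.
bits = bit_score(176)
print(f"{bits:.1f} bits, E = {expect_value(bits):.0e}")
```

With these inputs a raw score of 176 comes out at about 72.4 bits and an E-value near 2e-10, in line with the "Score = 72.4 bits (176), Expect = 2e-10" lines in the alignment records.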