BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014537
(423 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 225/442 (50%), Positives = 304/442 (68%), Gaps = 21/442 (4%)
Query: 1 MATF---LSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
MA F LS + LC I A+ GF+V+LIHRDSP SPFYNS ET QR+ +A
Sbjct: 1 MAAFRSPLSFALAIALLCVSGFGCIYARKVGFTVDLIHRDSPLSPFYNSEETDLQRINNA 60
Query: 58 LTRSLNRLNHFNQNSSIS-SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
L RS++R++HF+ ++ S S KA+++D+ N YL+ +S+GTPP + + +ADTGSDLIW
Sbjct: 61 LRRSISRVHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTPPFKIMGIADTGSDLIW 120
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
TQC+PC +CY Q PLFDPK S TY+ C + QC+ L+Q +CSG CQY SYGD S
Sbjct: 121 TQCKPC--ERCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQSTCSGNICQYQYSYGDRS 178
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
++ GN+A++T+TL STTG V+ P GCG N G F+ K +GIVGLG G +SLISQM
Sbjct: 179 YTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMG 238
Query: 237 TTIAGKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAKT---FYVLTIDAIS 288
+++ GKFSYCLVP+S S+K+NFG+N +VSGPGV STPL ++T FY LT++A+S
Sbjct: 239 SSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMS 298
Query: 289 VGNQR-------LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
VGN+R LG +I+IDSGTTLT +P + SNL + + + +E + DP+G L
Sbjct: 299 VGNERIKFGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLS 358
Query: 342 LCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTN 401
+CYS S +VP +T HF GADVKL N FV+VS+D+VC F T+ + IYGN+ Q N
Sbjct: 359 VCYSATSDLKVPAITAHFTGADVKLKPINTFVQVSDDVVCLAFASTTSGISIYGNVAQMN 418
Query: 402 FLVGYDIEQQTVSFKPTDCTKQ 423
FLV Y+I+ +++SFKPTDCTK+
Sbjct: 419 FLVEYNIQGKSLSFKPTDCTKK 440
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 238/443 (53%), Positives = 310/443 (69%), Gaps = 24/443 (5%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA +S + I+ + + PI+A GF+VELI+RDSPKSPFYN ETP QR+ A+ R
Sbjct: 1 MAASVSLLAIVTLIFSGTLVPIDAAKDGFTVELINRDSPKSPFYNPRETPTQRIVSAVRR 60
Query: 61 SLNRLNHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
S++R++HF+ +NS I + A Q+++I N YL++ S+GTP + LA+ADTGSDLIWTQ
Sbjct: 61 SMSRVHHFSPTKNSDIFTDTA-QSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQ 119
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG---VNCQYSVSYGD 174
C+PC QCY QD+PLFDPK SSTY+ + CS+ QC L + SCSG C YS SYGD
Sbjct: 120 CKPC--DQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGD 177
Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
SF++GN+A +T+TLGST+G+ V LP GCG NNGG F K +GIVGLGGG ISLISQ
Sbjct: 178 RSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQ 237
Query: 235 MRTTIAGKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAI 287
+ +TI GKFSYCLVP+S S+K+NFG+NGIVSG GV STPL TFY LT++A+
Sbjct: 238 LGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAV 297
Query: 288 SVGNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
SVG++R+ G S +I+IDSGTTLT P+ + S L S + + PV DP+G L
Sbjct: 298 SVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGIL 357
Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
LCYS ++ + P +T HF GADVKL+ N FV+VS+ ++C F I NS I+GN+ Q
Sbjct: 358 SLCYSIDADLKFPSITAHFDGADVKLNPLNTFVQVSDTVLCFAFNPI-NSGAIFGNLAQM 416
Query: 401 NFLVGYDIEQQTVSFKPTDCTKQ 423
NFLVGYD+E +TVSFKPTDCT+
Sbjct: 417 NFLVGYDLEGKTVSFKPTDCTQD 439
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 227/413 (54%), Positives = 285/413 (69%), Gaps = 21/413 (5%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-ASQADIIP 86
GF+ +LIHRDSPKSPFYN +ET QRLR+A+ RS++R+ HF S +S A Q D+
Sbjct: 30 GFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTS 89
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N+ YL+ IS+GTPP +A+ADTGSDL+WTQC+PC CY Q PLFDPK SSTYK +
Sbjct: 90 NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPC--DDCYTQVDPLFDPKASSTYKDV 147
Query: 147 PCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
CSSSQC +L NQ SCS + C YS SYGD S++ GN+A +T+TLGST + V L I
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFG 258
GCG NN G FN K +GIVGLGGG +SLI+Q+ +I GKFSYCLVP++S +KINFG
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267
Query: 259 TNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTL 309
TN +VSG GVVSTPL +TFY LT+ +ISVG++ + G +I+IDSGTTL
Sbjct: 268 TNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTL 327
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
T LP + S L ++S I+A+ DP L LCYS +VP +T+HF GADV L S
Sbjct: 328 TLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPS 387
Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N FV++SED+VC F+G + S IYGN+ Q NFLVGYD +TVSFKPTDC K
Sbjct: 388 NCFVQISEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 439
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/413 (55%), Positives = 286/413 (69%), Gaps = 24/413 (5%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+ +LIHRDSPKSPFYN ET QRLR+A+ RS+NR+ HF + + + Q D+ N
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ +SIGTPP +A+ADTGSDL+WTQC PC CY Q PLFDPK SSTYK +
Sbjct: 87 SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144
Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CSSSQC +L NQ SCS + C YS+SYGD S++ GN+A +T+TLGS+ + + L I
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFGT 259
GCG NN G FN K +GIVGLGGG +SLI Q+ +I GKFSYCLVP++S +KINFGT
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGT 264
Query: 260 NGIVSGPGVVSTPL-TKA--KTFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTL 309
N IVSG GVVSTPL KA +TFY LT+ +ISVG++++ S +I+IDSGTTL
Sbjct: 265 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTL 324
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
T LP + S L ++S I+A+ DP L LCYS +VP +T+HF GADVKL S
Sbjct: 325 TLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSS 384
Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N FV+VSED+VC F+G + S IYGN+ Q NFLVGYD +TVSFKPTDC K
Sbjct: 385 NAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/413 (55%), Positives = 286/413 (69%), Gaps = 24/413 (5%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+ +LIHRDSPKSPFYN ET QRLR+A+ RS+NR+ HF + + + Q D+ N
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ +SIGTPP +A+ADTGSDL+WTQC PC CY Q PLFDPK SSTYK +
Sbjct: 87 SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144
Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CSSSQC +L NQ SCS + C YS+SYGD S++ GN+A +T+TLGS+ + + L I
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFGT 259
GCG NN G FN K +GIVGLGGG +SLI Q+ +I GKFSYCLVP++S +KINFGT
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGT 264
Query: 260 NGIVSGPGVVSTPL-TKA--KTFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTL 309
N IVSG GVVSTPL KA +TFY LT+ +ISVG++++ S +I+IDSGTTL
Sbjct: 265 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTL 324
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
T LP + S L ++S I+A+ DP L LCYS +VP +T+HF GADVKL S
Sbjct: 325 TLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSS 384
Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N FV+VSED+VC F+G + S IYGN+ Q NFLVGYD +TVSFKPTDC K
Sbjct: 385 NAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 211/429 (49%), Positives = 284/429 (66%), Gaps = 23/429 (5%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
LF LCF + S A + GFSVELIHRDSPKSP+Y +E YQ DA RS+NR NHF +
Sbjct: 11 LFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHFFK 69
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+S S+ +++ +IP+ YL+ S+GTPPT+ +ADTGSD++W QCEPC QCY Q
Sbjct: 70 DSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPC--EQCYNQ 124
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTL 189
+P+F+P SS+YK++PCSS C S+ SCS N CQY +SYGD S S G+L+ +T++L
Sbjct: 125 TTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSL 184
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
ST+G V+ P I GCGT+N G F ++GIVGLGGG +SLI+Q+ ++I GKFSYCLVP
Sbjct: 185 ESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVP 244
Query: 250 V------SSTKINFGTNGIVSGPGVVSTPLTKAK-TFYVLTIDAISVGNQRL-------- 294
+ +S+ ++FG +VSG GVVSTPL K FY LT+ A SVGN+R+
Sbjct: 245 LLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG 304
Query: 295 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVP 353
G +I+IDSGTTLT +P +NL S + +++ V DP LCYS S P
Sbjct: 305 GDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFP 364
Query: 354 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+T+HF+GADV+L + FV +++ IVC F+ I+GN+ Q N LVGYD++Q+TV
Sbjct: 365 IITVHFKGADVELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTV 424
Query: 414 SFKPTDCTK 422
SFKPTDCTK
Sbjct: 425 SFKPTDCTK 433
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 210/439 (47%), Positives = 283/439 (64%), Gaps = 23/439 (5%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M T LF LCF + S A + GFSVELIHRDSPKSP+Y +E YQ DA R
Sbjct: 1 MNTLCFLTLSLFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARR 59
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S+NR NHF ++S S+ +++ +IP+ YL+ S+GTPPT+ +ADTGSD++W QCE
Sbjct: 60 SINRANHFFKDSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE 116
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
PC QCY Q +P+F+P SS+YK++PC S C S+ SCS N CQY +SYGD S S
Sbjct: 117 PC--EQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQ 174
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G+L+ +T++L ST+G V+ P GCGT+N G F ++GIVGLGGG +SLI+Q+ ++I
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234
Query: 240 AGKFSYCLVPV------SSTKINFGTNGIVSGPGVVSTPLTKAK-TFYVLTIDAISVGNQ 292
GKFSYCLVP+ +S+ ++FG +VSG GVVSTPL K FY LT+ A SVGN+
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNK 294
Query: 293 RL--------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
R+ G +I+IDSGTTLT +P +NL S + +++ V DP LCY
Sbjct: 295 RVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCY 354
Query: 345 SFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 403
S S P +T HF+GAD++L + FV +++ IVC F+ I+GN+ Q N L
Sbjct: 355 SLKSNEYDFPIITAHFKGADIELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLL 414
Query: 404 VGYDIEQQTVSFKPTDCTK 422
VGYD++Q+TVSFKPTDCTK
Sbjct: 415 VGYDLQQKTVSFKPTDCTK 433
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 216/445 (48%), Positives = 286/445 (64%), Gaps = 29/445 (6%)
Query: 1 MATFLSCVFILFF-----LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR 55
MATF S +L F LC I A GF+ EL+HRDSPKSP YNS +T QR
Sbjct: 1 MATFQS---VLSFASAIALCVASFGCIYAHNAGFTTELVHRDSPKSPLYNSQQTHLQRWN 57
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
A+ RS++R++HF + ++ S K +++II N YL+ +S+GTPP E LA+ADTGSDLI
Sbjct: 58 KAMRRSVSRVHHFQRTAATVSPKEVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLI 117
Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSGVN-CQYSVSYG 173
WTQC PC +CY Q +PLFDPK S TY+ L C + QC +L + SCS CQYS YG
Sbjct: 118 WTQCTPC--DKCYKQIAPLFDPKSSKTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYG 175
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
D SF+NGNLA +TVTL ST G V P GCG N G F+ K +GI+GLGGG +SLIS
Sbjct: 176 DRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLIS 235
Query: 234 QMRTTIAGKFSYCLVPVS------STKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTID 285
QM +++ GKFSYCLVP S S+K++FG N +VSG GV STPL TFY LT++
Sbjct: 236 QMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLE 295
Query: 286 AISVGNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLL-SVMSSMIEAQPVADPT 337
A+SVG++++ G S +I+IDSGT+LT P + + +V +++I + D +
Sbjct: 296 AMSVGDKKIEFGGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDAS 355
Query: 338 GSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 397
G L CY +VP +T HF GADV L N F+ +S+D++C F T S I+GN+
Sbjct: 356 GLLSHCYRPTPDLKVPVITAHFNGADVVLQTLNTFILISDDVLCLAFNS-TQSGAIFGNV 414
Query: 398 MQTNFLVGYDIEQQTVSFKPTDCTK 422
Q NFL+GYDI+ ++VSFKPTDCT+
Sbjct: 415 AQMNFLIGYDIQGKSVSFKPTDCTQ 439
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 208/438 (47%), Positives = 287/438 (65%), Gaps = 37/438 (8%)
Query: 9 FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
+LF+LC FY +EA GGFSVE+IHRDS +SPF+ +ET +QR+ +A+ RS+NR N
Sbjct: 11 LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRAN 66
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
HF++ + KA++A I N+ YLI S+G PP + + DTGSD+IW QC+PC +
Sbjct: 67 HFHK-----AHKAAKATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPC--EK 119
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLA 183
CY Q + +FDP S+TYK LP SS+ C S+ SCS N C+Y++ YGDGS+S G+L+
Sbjct: 120 CYNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLS 179
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR---TTIA 240
ET+TLGST G +V GCG NN F K++GIVGLG G +SLI+Q+R ++I
Sbjct: 180 VETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIG 239
Query: 241 GKFSYCLVPVS--STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGV 296
KFSYCL +S S+K+NFG +VSG G VSTP+ K FY LT++A SVGN R+
Sbjct: 240 RKFSYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEF 299
Query: 297 STP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SF 346
++ +I+IDSGTTLT LP S L S ++ ++E V DP L LCY +F
Sbjct: 300 TSSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTF 359
Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVG 405
+ L+ P + HF GADVKL+ N F++V + + C F I++ + PI+GN+ Q NFLVG
Sbjct: 360 DELN-APVIMAHFSGADVKLNAVNTFIEVEQGVTCLAF--ISSKIGPIFGNMAQQNFLVG 416
Query: 406 YDIEQQTVSFKPTDCTKQ 423
YD++++ VSFKPTDC+KQ
Sbjct: 417 YDLQKKIVSFKPTDCSKQ 434
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 214/411 (52%), Positives = 278/411 (67%), Gaps = 21/411 (5%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+++LIHRDSPKSPFYNS+ET QR+R+A+ RS F+ + + S + Q+ I N
Sbjct: 25 GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDA--SPNSPQSFITSN 82
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
YL+ ISIGTPP LA+ADTGSDLIWTQC PC CY Q SPLFDPK SSTY+ +
Sbjct: 83 RGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC--EDCYQQTSPLFDPKESSTYRKVS 140
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSSSQC +L SCS C Y+++YGD S++ G++A +TVT+GS+ + V+L + G
Sbjct: 141 CSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIG 200
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFGTN 260
CG N G F+ +GI+GLGGG SL+SQ+R +I GKFSYCLVP +S +KINFGTN
Sbjct: 201 CGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTN 260
Query: 261 GIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTLTF 311
GIVSG GVVST + K T+Y L ++AISVG++++ G +IVIDSGTTLT
Sbjct: 261 GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTL 320
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 371
LP + L SV++S I+A+ V DP G L LCY +S +VP++T+HF+G DVKL N
Sbjct: 321 LPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKVPDITVHFKGGDVKLGNLNT 380
Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
FV VSED+ C F + I+GN+ Q NFLVGYD TVSFK TDC++
Sbjct: 381 FVAVSEDVSCFAFAA-NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQ 430
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 223/431 (51%), Positives = 286/431 (66%), Gaps = 22/431 (5%)
Query: 10 ILFFLCFY---VVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
+L LC + ++S + A+ GF+ +LIHRDSPKSPFYN +ETP QR+R+A+ RS NR+
Sbjct: 8 VLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRV 67
Query: 66 NHFNQNSSISSSKAS-QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
+HF S + +S S Q DI P YL+ +S+GTPP+ +AVADTGS+LIWTQC+PC
Sbjct: 68 SHFTDLSEMDASLNSPQTDITPCGGEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPC-- 125
Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGN 181
CY Q PLFDPK SSTYK + CSSSQC +L NQ SCS + C Y VSY DGS++ G
Sbjct: 126 DDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGK 185
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
A +T+TLGST + V L I GCG NN F +K++G+VGLGGG +SLI Q+ +I G
Sbjct: 186 FAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDG 245
Query: 242 KFSYCLVPVS--STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS 297
KFSYCLVP + ++KINFGTN +VSGPG VSTPL TFY LT+ +ISVG++ +
Sbjct: 246 KFSYCLVPENDQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNM--Q 303
Query: 298 TPD------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ 351
TPD +VIDSGTTLT LP Y + + ++S+I A D LCY+ +
Sbjct: 304 TPDSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLN 363
Query: 352 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
+P +T+HF GADVKL N F KV+ED+VC F IYGN+ Q NFLVGYD +
Sbjct: 364 IPVITMHFEGADVKLYPYNSFFKVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASK 423
Query: 412 TVSFKPTDCTK 422
T+SFKPTDC K
Sbjct: 424 TMSFKPTDCAK 434
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 203/434 (46%), Positives = 285/434 (65%), Gaps = 24/434 (5%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
V ++ FL F ++ A+ GGFSV+LIHRDSP SPF++ S+T +RL DA RS++R+
Sbjct: 12 VVVVGFL-FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGR 70
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
F + +S Q+ I+P+ YL+ + IGTPP +A+ DTGSDL WTQC PC + C
Sbjct: 71 FRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPC--THC 126
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFSNGNLATE 185
Y Q PLFDPK SSTY+ C +S C +L + +SCS C + SY DGSF+ GNLA+E
Sbjct: 127 YKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASE 186
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T+T+ ST G+ V+ PG FGCG ++GG+F+ ++GIVGLGGG++SLISQ+++TI G FSY
Sbjct: 187 TLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSY 246
Query: 246 CLVPVS-----STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRL---- 294
CL+PVS S++INFG +G VSG G VSTPL + TFY LT++ ISVG +RL
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKG 306
Query: 295 -----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
V +I++DSGTT TFLPQ + S L +++ I+ + V DP G LCY+ +
Sbjct: 307 YSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE 366
Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
P +T HF+ A+V+L N F+++ ED+VC T+ + + GN+ Q NFLVG+D+
Sbjct: 367 INAPIITAHFKDANVELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLR 425
Query: 410 QQTVSFKPTDCTKQ 423
++ VSFK DCT+
Sbjct: 426 KKRVSFKAADCTQH 439
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 216/436 (49%), Positives = 289/436 (66%), Gaps = 28/436 (6%)
Query: 10 ILFFLCFYVVSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
++FF+ F +S EA GGFS +LI RDSP SPFYN SET + RL+ A RS++R NHF
Sbjct: 15 VIFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFHRSISRANHF 74
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
N S+ + Q+ +I NN YL+ IS+GTPP +ADTGSDL+W QC+PC CY
Sbjct: 75 RANGV--STNSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPC--DSCY 130
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLN-QKSCSGVN-CQYSVSYGDGSFSNGNLATET 186
Q P+FDP S TY+ L C C++L Q CS N C YS SYGDGS ++G+LA +T
Sbjct: 131 EQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDT 190
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
+T+GSTTG+ V++P + FGCG NNGG F +G+VGLGGG +S+ISQ+R I G+FSYC
Sbjct: 191 LTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYC 250
Query: 247 LVPVS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLG---- 295
LVP+ S+K++FG+ GIVSG G VSTPL + TFY LT++++SVG+++L
Sbjct: 251 LVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGF 310
Query: 296 --VSTP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
V +P +I+IDSGTTLT LPQ + L S + S I +PV DP LCYS
Sbjct: 311 SKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYSN 370
Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
S ++P +T HF GAD++L N FV+V ED+ C +++ + I+GN+ Q NFLVGY
Sbjct: 371 LSGLRIPTITAHFVGADLELKPLNTFVQVQEDLFCFAMIPVSD-LAIFGNLAQMNFLVGY 429
Query: 407 DIEQQTVSFKPTDCTK 422
D++ +TVSFKPTDCTK
Sbjct: 430 DLKSRTVSFKPTDCTK 445
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 208/439 (47%), Positives = 282/439 (64%), Gaps = 31/439 (7%)
Query: 5 LSCVFILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
++ VF L FL V S + A+ GF+VELIHRDSPKSP YNSSET + R+ +AL RS
Sbjct: 1 MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R N+ + S ++A I N YL+ IS+GTPP +AVADTGSD+IWTQC+PC
Sbjct: 61 HR------NTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPC 114
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-SLNQKSCSG-VNCQYSVSYGDGSFSNG 180
S CY Q++P+FDP S+TYK++ CSS C+ S + SCS C YS++YGD S S G
Sbjct: 115 --SNCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQG 172
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
NLA +TVT+ ST+G+ VA P GCG +N G FN+ +GIVGLG G SL++Q+
Sbjct: 173 NLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATG 232
Query: 241 GKFSYCLVPV------SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGN 291
GKFSYCL+P+ STK+NFG+N VSG G VSTP+ + KTFY L ++A+SVG+
Sbjct: 233 GKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGD 292
Query: 292 QRL----GVST----PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
+ G S +I+IDSGTTLT+LP ++ S +S + DP+ L+ C
Sbjct: 293 TKFNFPEGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYC 352
Query: 344 YSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTN 401
++ + ++P VT+HF GADV L R N FV++S+D +C F +++ IYGNI Q+N
Sbjct: 353 FATTTDDYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSN 412
Query: 402 FLVGYDIEQQTVSFKPTDC 420
FLVGYDI+ VSF+P C
Sbjct: 413 FLVGYDIKNLAVSFQPAHC 431
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 215/443 (48%), Positives = 284/443 (64%), Gaps = 30/443 (6%)
Query: 4 FLSCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
F+ C I+ + F S EA+ GF+ + I RDSP SPFYN SET YQRL+ A RS+
Sbjct: 8 FVFCTLAIIILIHFSEHSHAEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSI 67
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
R NHF + +S Q+D+I YL+ IS+GTPP L +ADTGSDLIW QC PC
Sbjct: 68 LRGNHFR--AMRASPNDIQSDVISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
P CY Q PLFDPK S TYK+L C + C L Q+ SC N C YS SYGD S++ G
Sbjct: 126 P--NCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRG 183
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
+L+++T+T+GST G + PGI FGCG +NGG FN K G++GLGGG +SL+ Q+ + +
Sbjct: 184 DLSSDTLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG 243
Query: 241 GKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQR 293
G+FSYCLVP+S S+KINFG +G+VSG G VSTPL K TFY LT++ +SVG++
Sbjct: 244 GQFSYCLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSET 303
Query: 294 L-------------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
+ V +I+IDSGTTLT LPQ + +++ S +++ I Q DP G
Sbjct: 304 VAFKGFSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIF 363
Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQ 399
LCYS + ++P +T HF GADV+L N FV+V ED+VC F I +S + I+GN+ Q
Sbjct: 364 SLCYSSVNNLEIPTITAHFTGADVQLPPLNTFVQVQEDLVC--FSMIPSSNLAIFGNLAQ 421
Query: 400 TNFLVGYDIEQQTVSFKPTDCTK 422
NFLVGYD++ VSFK TDCT+
Sbjct: 422 INFLVGYDLKNNKVSFKQTDCTE 444
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 212/442 (47%), Positives = 293/442 (66%), Gaps = 26/442 (5%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M T + ++ C Y +S ++A GGFSVE+IHRDS +SP Y +ETP+QR+ +A+ R
Sbjct: 3 MITRYCSLALVLLWCLYNISFLKANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAVRR 62
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S+NR NHF + + S+ ++++ ++ + YL+R S+G+PP + L + DTGSD++W QCE
Sbjct: 63 SINRGNHFKK--AFVSTDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCE 120
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
PC CY Q +P+FDP S TYK+LPCSS+ C SL +CS N C+YS+ YGDGS S+
Sbjct: 121 PC--EDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSD 178
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G+L+ ET+TLGST G +V P GCG NNGG F + +GIVGLGGG +SLISQ+ ++I
Sbjct: 179 GDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSI 238
Query: 240 AGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQ 292
GKFSYCL P+ SS+K+NFG +VSG G VSTPL + FY LT++A SVG+
Sbjct: 239 GGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDN 298
Query: 293 RLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
R+ S +I+IDSGTTLT LPQ NL S +S +I+ + DP+ L L
Sbjct: 299 RIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSL 358
Query: 343 CYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQT 400
CY S +P +T HF+GADV+L+ + FV V + +VC F I++ + I+GN+ Q
Sbjct: 359 CYKTTSDELDLPVITAHFKGADVELNPISTFVPVEKGVVCFAF--ISSKIGAIFGNLAQQ 416
Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
N LVGYD+ ++TVSFKPTDCTK
Sbjct: 417 NLLVGYDLVKKTVSFKPTDCTK 438
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 208/442 (47%), Positives = 287/442 (64%), Gaps = 33/442 (7%)
Query: 9 FILFFLCFYVVSPI------EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
F+ +CF +SP + GFS+ LIHRDSP SP YN + T + RLR+A +RS+
Sbjct: 8 FVTIVICFISLSPFPLLGAAASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLRNAFSRSI 67
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R+N F + +S Q D++PN Y +++SIGTP E + +ADTGSDL W QC PC
Sbjct: 68 SRVNVFKTKAVDINS--FQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFS 178
P CY Q SPLFDP SS+Y+ + C S C +L+ +++C+ C+Y SYGD S++
Sbjct: 126 DP--CYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYT 183
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
NGNLATE T+GST+ + V L I FGCGT NGG F+ +GIVGLGGG +SL+SQ+ +
Sbjct: 184 NGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSI 243
Query: 239 IAGKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGN 291
I GKFSYCLVP+S ++KI FGT+ ++SGP VVSTPL + T+Y +T++AISVGN
Sbjct: 244 IKGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGN 303
Query: 292 QRL---------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
+RL V +++IDSGTTLTFL + + L V+ ++A+ V+DP G +
Sbjct: 304 KRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSV 363
Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTN 401
C+ +P + +HF ADVKL N FVK ED++C F I +N + I+GN+ Q +
Sbjct: 364 CFRSAGDIDLPVIAVHFNDADVKLQPLNTFVKADEDLLC--FTMISSNQIGIFGNLAQMD 421
Query: 402 FLVGYDIEQQTVSFKPTDCTKQ 423
FLVGYD+E++TVSFKPTDCTK
Sbjct: 422 FLVGYDLEKRTVSFKPTDCTKH 443
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 211/444 (47%), Positives = 289/444 (65%), Gaps = 30/444 (6%)
Query: 4 FLSCVFILFFLCFYVV-SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
F+ C+ + FL ++ S EA+ GF+ + I RDSP+SPFYN SET YQRL+ A RS+
Sbjct: 8 FVFCLLAIIFLIYFAKHSQAEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSI 67
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
R NHF + +S Q+++I +YL+ IS+GTPP L +ADTGSDLIW QC PC
Sbjct: 68 LRGNHFR--AIRASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
CY Q PLFDPK S TYK+L C++ C L Q+ SC N C S SYGD S++
Sbjct: 126 --DDCYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRR 183
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
+L++ET T+GST G + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + +
Sbjct: 184 DLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG 243
Query: 241 GKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQR 293
G+FSYCLVP+S S+KINFG + +VSG G VSTPL K TFY LT++ +S+G+++
Sbjct: 244 GQFSYCLVPLSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEK 303
Query: 294 LGV-------STP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
+ S+P +I+IDSGTTLT LP+ + +++ S ++ +I Q DP G+
Sbjct: 304 VAFKGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTF 363
Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQ 399
LCYS ++P +T HF GADV+L N FV+ ED+VC F I +S + I+GN+ Q
Sbjct: 364 SLCYSGVKKLEIPTITAHFIGADVQLPPLNTFVQAQEDLVC--FSMIPSSNLAIFGNLSQ 421
Query: 400 TNFLVGYDIEQQTVSFKPTDCTKQ 423
NFLVGYD++ VSFKPTDCTKQ
Sbjct: 422 MNFLVGYDLKNNKVSFKPTDCTKQ 445
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 206/432 (47%), Positives = 272/432 (62%), Gaps = 26/432 (6%)
Query: 9 FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
+LF+LC FY +EA GGFSVE+IHRDS +SPF++ +ET +QR+ +A+ RS+NR N
Sbjct: 11 LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRAN 66
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
H NQ S S + + +I YLI S+GTP + + DTGSD+IW QC+PC +
Sbjct: 67 HLNQ--SFVSPNSPETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPC--KK 122
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATE 185
CY Q +P+FD S TYK+LPC S+ C S+ CS +C YS+ Y DGS S G+L+ E
Sbjct: 123 CYEQTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVE 182
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T+TLGST G V PG GCG N K +GIVGLG G +SLI+Q+ + GKFSY
Sbjct: 183 TLTLGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSY 242
Query: 246 CLVP---VSSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGVSTP- 299
CLVP +S+K+NFG +VSG G VSTPL FY LT++A SVG R+ +P
Sbjct: 243 CLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPG 302
Query: 300 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSL-SQ 351
+I+IDSGTTLT LP G S L + ++ + Q V DP L LCY + L +
Sbjct: 303 SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDAS 362
Query: 352 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
VP +T HF GADV L+ N FV+V++D+VC F+ T + ++GN+ Q N LVGYD++
Sbjct: 363 VPVITAHFSGADVTLNAINTFVQVADDVVCFAFQP-TETGAVFGNLAQQNLLVGYDLQMN 421
Query: 412 TVSFKPTDCTKQ 423
TVSFK TDCTKQ
Sbjct: 422 TVSFKHTDCTKQ 433
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 367 bits (942), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 190/428 (44%), Positives = 273/428 (63%), Gaps = 21/428 (4%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
+LFF +++S + FS ELIHRDS KSP Y ++ +Q + +A RS+NR N
Sbjct: 9 LLFFSLCFIISFSHSLRNSFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLF 68
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
++S S ++ + N YL+ S+GTPP V DTGSD++W QC+PC QCY
Sbjct: 69 KDSL---SNTPESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPC--EQCYK 123
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
Q +P+F+P SS+YK++PCSS+ C S+ SC+ N C+Y++++ D S+S G L+ ET+T
Sbjct: 124 QTTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLT 183
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
L STTG +V+ P GCG NN G+F +T+GIVGLG G +SL +Q++++I GKFSYCL+
Sbjct: 184 LDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLL 243
Query: 249 PV-----SSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPD- 300
P+ ++K+NFG +VSG GVVSTP K + FY LT++A SVGN+R+ D
Sbjct: 244 PLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDD 303
Query: 301 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPE 354
I++DSGTTLT LP +NL S ++ +++ V DP L LCYS S P
Sbjct: 304 SEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPI 363
Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
+T HF+GAD+KL+ + F V++ +VC F + + PI+GN+ Q N LVGYD++Q VS
Sbjct: 364 ITAHFKGADIKLNPISTFAHVADGVVCLAFTS-SQTGPIFGNLAQLNLLVGYDLQQNIVS 422
Query: 415 FKPTDCTK 422
FKP+DC K
Sbjct: 423 FKPSDCIK 430
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 209/436 (47%), Positives = 289/436 (66%), Gaps = 26/436 (5%)
Query: 11 LFFLCFYV-VSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
+ LC Y+ +S + A GGFSVE+IHRDS +SP+Y +ET +QR+ +AL RS+NR NHF
Sbjct: 12 IVLLCLYINISFLNALDGGGFSVEIIHRDSSRSPYYRPTETQFQRVANALRRSINRANHF 71
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
N+ + ++S+ +++ +I + YL+ S+GTPP + L + DTGSD+IW QC+PC CY
Sbjct: 72 NKPNLVASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPC--EDCY 129
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSGVN--CQYSVSYGDGSFSNGNLATE 185
Q +P+FDP S TYK+LPCSS+ C S+ SCS N C+Y+++YGD S S G+L+ E
Sbjct: 130 NQTTPIFDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVE 189
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T+TLGST G +V P GCG NN G F + +GIVGLGGG +SLISQ+ ++I GKFSY
Sbjct: 190 TLTLGSTDGSSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSY 249
Query: 246 CLVPV-----SSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL---- 294
CL P+ SS+K+NFG +VSG G VSTP+ FY LT++A SVG+ R+
Sbjct: 250 CLAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGS 309
Query: 295 -----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
+I+IDSGTTLT LP+ NL S ++ IE + V DP+ L LCY S
Sbjct: 310 SSFESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSS 369
Query: 350 SQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
+ VP +T HF+GADV+L+ + F++V E +VC F+ + PI+GN+ Q N LVGYD
Sbjct: 370 DELNVPVITAHFKGADVELNPISTFIEVDEGVVCFAFRS-SKIGPIFGNLAQQNLLVGYD 428
Query: 408 IEQQTVSFKPTDCTKQ 423
+ +QTVSFKPTDCT++
Sbjct: 429 LVKQTVSFKPTDCTQE 444
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 363 bits (933), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 195/433 (45%), Positives = 273/433 (63%), Gaps = 26/433 (6%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
+LFF ++VS AQ GFSVELIHRDS KSP Y ++ YQ DA RS+NR NHF
Sbjct: 9 LLFFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANHFY 68
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
+ S + Q+ +IP+ YL+ S+GTPP + + DTGSD++W QCEPC +CY
Sbjct: 69 K---YSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPC--QECYN 123
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
Q +P+F+P SS+YK++PC S C S+ SC+ N C+YS YGD S S G+L+ +T+T
Sbjct: 124 QTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLT 183
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
L ST G V+ P I GCGTNN + ++GIVG G G S I+Q+ ++ GKFSYCL
Sbjct: 184 LESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLT 243
Query: 249 PV---------SSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRL--- 294
P+ +++K+NFG VSG GVV+TP+ K +TFY LT++A SVGN+R+
Sbjct: 244 PLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIG 303
Query: 295 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
G + +I+IDSGTTLT L + S L S + +++ + V DPT +L LCYS +
Sbjct: 304 GVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSVKAEG 363
Query: 351 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
P +T+HF+GADV L + FV V++ + C F+ + I+GN+ Q N +VGYD++
Sbjct: 364 YDFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQDHA-IFGNLAQQNLMVGYDLQ 422
Query: 410 QQTVSFKPTDCTK 422
Q+ VSFKP+DCTK
Sbjct: 423 QKIVSFKPSDCTK 435
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 363 bits (932), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 211/444 (47%), Positives = 283/444 (63%), Gaps = 33/444 (7%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
+FI F S +EA+ GFS LIHRDS SP YN +T + RLR++ RS++R N
Sbjct: 11 LFIAFISMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANR 70
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
F NS IS+ Q+DI+P YL+RISIG P E LA+ADTGSDLIW QC+PC C
Sbjct: 71 FKPNS-ISARALVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPC--EMC 127
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSG----VNCQYSVSYGDGSFSNGN 181
Y Q+SP+FDP+ SS+Y+++ C + C L+ +SC C Y+ SYGD SFS+G+
Sbjct: 128 YKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGH 187
Query: 182 LATETVTLGST---TGQAVA-LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
LA E +GST T A+A + FGCGT NGG F+ +GI+GLGGG +SL+SQ+
Sbjct: 188 LAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGP 247
Query: 238 TIAGKFSYCLVPVS-----STKINFGTNGIVSGPG--VVSTPL--TKAKTFYVLTIDAIS 288
++GKFSYCLVP S ++KINFG + +SG VVSTPL K +T+Y LT++AIS
Sbjct: 248 KLSGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAIS 307
Query: 289 VGNQRL--------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
V N+RL V +I+IDSGTTLTFL + +NL S + ++ + V+DP G
Sbjct: 308 VENKRLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLF 367
Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQ 399
+C+ ++P +T HF GADV+L N F KV ED++C F I +N + I+GN+ Q
Sbjct: 368 NICFKDEKAIELPIITAHFTGADVELQPVNTFAKVEEDLLC--FTMIPSNDIAIFGNLAQ 425
Query: 400 TNFLVGYDIEQQTVSFKPTDCTKQ 423
NFLVGYD+E++ VSF PTDCTKQ
Sbjct: 426 MNFLVGYDLEKKAVSFLPTDCTKQ 449
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 201/443 (45%), Positives = 271/443 (61%), Gaps = 34/443 (7%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
MA S V ++ FL V + A TG GF+VELIHRDSPKSP YN E Y R+ D
Sbjct: 1 MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L RS++ N+ + ++ +A I N YL+++S+GTPP +AVADTGSD+IWT
Sbjct: 59 LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
QCEPC + CY QD P+F+P S+TY+ + CSS C+ + SCS +C YS+SYGD
Sbjct: 112 QCEPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S S G+ A +T+T+GST+G+ VA P GCG +N G F++ +GIVGLG G SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229
Query: 236 RTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAI 287
+ + GKFSYCL P+ S K+NFG+N VSG G VSTP+ K K+FY L + A+
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289
Query: 288 SVGNQRLGVST--------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
SVG ST +I+IDSGTTLT LP N +S+ I Q DP
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349
Query: 340 LELCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 397
LE C+ + +VP + +HF GA+++L R N ++VS++++C F G N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q NFLVGYD+ ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 206/420 (49%), Positives = 281/420 (66%), Gaps = 27/420 (6%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GFSVE+IHRDS +SP Y +ETP+QR+ +A+ RS+NR NHFN+ S ++S+ +++ + +
Sbjct: 34 GFSVEMIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHFNKKSFVASTNTAESTVKAS 93
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
YL+ S+GTPP E L V DTGS + W QC+ C CY Q +P+FDP S TYK+LP
Sbjct: 94 QGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRC--EDCYEQTTPIFDPSKSKTYKTLP 151
Query: 148 CSSSQCAS-LNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CSS+ C S ++ SCS + C+Y++ YGDGS S G+L+ ET+TLGST G +V P
Sbjct: 152 CSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVI 211
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGT 259
GCG NN G F + +G+VGLGGG +SLISQ+ ++I GKFSYCL P+ SS+K+NFG
Sbjct: 212 GCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNFGD 271
Query: 260 NGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-----------GVSTPDIVIDS 305
+VSG G VSTPL T ++ FY LT++A SVG++R+ +I+IDS
Sbjct: 272 AAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDS 331
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGAD 363
GTTLT LPQ SNL S ++ I+A V+DP+ L LCY Q VP +T HF+GAD
Sbjct: 332 GTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAHFKGAD 391
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
V+L+ + FV+V+E +VC F + V I+GN+ Q N LVGYD+ +QTVSFKPTDCT++
Sbjct: 392 VELNPISTFVQVAEGVVCFAFHS-SEVVSIFGNLAQLNLLVGYDLMEQTVSFKPTDCTQE 450
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 196/436 (44%), Positives = 281/436 (64%), Gaps = 30/436 (6%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
V ++ FL F+++ A GGFSV+LIHRDSP SPF++ S+T +RL DA RS +R+
Sbjct: 12 VVVVGFL-FHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSASRVGR 70
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
F Q S +S Q+ ++P+ Y++ +SIGTPP +A+ DTGSDL WTQC PC + C
Sbjct: 71 FRQ--SAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPC--THC 126
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSC-SGVNCQYSVSYGDGSFSNGNLATE 185
Y Q P FDPK SSTY+ C +S C +L N +SC +G C + SY DGSF+ GNLA E
Sbjct: 127 YKQVVPFFDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVE 186
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T+T+ ST G+ V+ PG FGC +GG+F+ ++GIVGLG ++S+ISQ+++TI G+FSY
Sbjct: 187 TLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSY 246
Query: 246 CLVPV-----SSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLG-- 295
CL+PV S++INFG +GIVSG G VSTPL +Y++T++ SVG +RL
Sbjct: 247 CLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYK 306
Query: 296 -------VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
V +I++DSGTT T+LP + L ++ I+ + V DP G LCY+ +
Sbjct: 307 GFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TT 365
Query: 349 LSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVG 405
+ Q+ P +T HF+ A+V+L N F+++ ED+VC +V T+ + I GN+ Q NFLVG
Sbjct: 366 VDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGNLAQVNFLVG 423
Query: 406 YDIEQQTVSFKPTDCT 421
+D+ ++ VSFK DCT
Sbjct: 424 FDLRKKRVSFKAADCT 439
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 200/443 (45%), Positives = 270/443 (60%), Gaps = 34/443 (7%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
MA S V ++ FL V + A TG GF+VELIHRDSPKSP YN E Y R+ D
Sbjct: 1 MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L RS++ N+ + ++ +A I N YL+++S+GTPP +AVADTGSD+IWT
Sbjct: 59 LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
QC PC + CY QD P+F+P S+TY+ + CSS C+ + SCS +C YS+SYGD
Sbjct: 112 QCVPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S S G+ A +T+T+GST+G+ VA P GCG +N G F++ +GIVGLG G SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229
Query: 236 RTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAI 287
+ + GKFSYCL P+ S K+NFG+N VSG G VSTP+ K K+FY L + A+
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289
Query: 288 SVGNQRLGVST--------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
SVG ST +I+IDSGTTLT LP N +S+ I Q DP
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349
Query: 340 LELCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 397
LE C+ + +VP + +HF GA+++L R N ++VS++++C F G N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q NFLVGYD+ ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 358 bits (918), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 198/429 (46%), Positives = 264/429 (61%), Gaps = 22/429 (5%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
L LC Y + EA GFSVE+IHRDS +SPFY ++ET +QR+ +A+ RS+NR NHFNQ
Sbjct: 9 LVLLCLYNICFSEALKSGFSVEIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQ 68
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
S S++ S ++ ++ +YL+ S+GTPP + DT SD+IW QC+ C CY
Sbjct: 69 ISVYSNAVESPVTLL-DDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLC--ETCYND 125
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETV 187
SP+FDP S TYK+LPCSS+ C S+ SCS C+++V+Y DGS S G+L ETV
Sbjct: 126 TSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETV 185
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
TLGS V P GC N F+S GIVGLGGG +SL+ Q+ ++I+ KFSYCL
Sbjct: 186 TLGSYNDPFVHFPRTVIGCIRNTNVSFDS--IGIVGLGGGPVSLVPQLSSSISKKFSYCL 243
Query: 248 VPVS--STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVSTP---- 299
P+S S+K+ FG +VSG G VST + K FY LT++A SVGN R+ +
Sbjct: 244 APISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRS 303
Query: 300 ----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY-SFNSLSQVPE 354
+I+IDSGTT T LP S L S ++ +++ + DP LCY S VP
Sbjct: 304 SGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDVPV 363
Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
+T HF GADVKL+ N F+ S +VC F + S I+GN+ Q NFLVGYD++++ VS
Sbjct: 364 ITAHFSGADVKLNALNTFIVASHRVVCLAFLS-SQSGAIFGNLAQQNFLVGYDLQRKIVS 422
Query: 415 FKPTDCTKQ 423
FKPTDCTKQ
Sbjct: 423 FKPTDCTKQ 431
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 351 bits (900), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 200/442 (45%), Positives = 286/442 (64%), Gaps = 30/442 (6%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
+FL+ F FFLCF +S +A + GFS+ELIHRDS KSPFY ++ YQ + DA+ RS
Sbjct: 4 VSFLTLSF--FFLCF-SISFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDAVHRS 60
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
+NR+NH N+NS S+ +++ +I +Y++ S+GTPP + + DTGSD++W QCEP
Sbjct: 61 INRVNHSNKNSLASTPEST---VISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEP 117
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQYSVSYGDGSFSNG 180
C QCY Q +P F+P SS+YK++ CSS C S+ SC+ NC+YS++YG+ S S G
Sbjct: 118 C--EQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQG 175
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
+L+ ET+TL STTG+ V+ P GCGTNN G F ++G+VGLGGG SLI+Q+ +I
Sbjct: 176 DLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIG 235
Query: 241 GKFSYCLVPVS---------STKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISV 289
GKFSYCLV +S S+K+NFG IVSG V+STP+ K FY LTI+A SV
Sbjct: 236 GKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSV 295
Query: 290 GNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
G++R+ GV +I+IDS T +TF+P + L S + ++ + V DP L
Sbjct: 296 GDKRVEFAGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSL 355
Query: 343 CYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
CY+ +S + P +T HF+GAD+ L +N FV+V+ D++C F +N I+G+ Q
Sbjct: 356 CYNVSSDEEYDFPYMTAHFKGADILLYATNTFVEVARDVLCFAF-APSNGGAIFGSFSQQ 414
Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
+F+VGYD++Q+TVSFK DCT+
Sbjct: 415 DFMVGYDLQQKTVSFKSVDCTE 436
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 186/415 (44%), Positives = 266/415 (64%), Gaps = 24/415 (5%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
F + V + F F ++ A+ GGFSV+LIHRDSP SPF++ S+T +RL DA RS++
Sbjct: 9 FFNVVVVGFL--FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVS 66
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
R+ F + +S Q+ I+P+ YL+ + IGTPP +A+ DTGSDL WTQC PC
Sbjct: 67 RVGRFRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPC- 123
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFSNGN 181
+ CY Q PLFDPK SSTY+ C +S C +L + +SCS C + SY DGSF+ GN
Sbjct: 124 -THCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGN 182
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
LA+ET+T+ ST G+ V+ PG FGCG ++GG+F+ ++GIVGLGGG++SLISQ+++TI G
Sbjct: 183 LASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING 242
Query: 242 KFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV 296
FSYCL+PVS S++INFG +G VSG G VSTPL Y +++ V
Sbjct: 243 LFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGY----------SKKTEV 292
Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVT 356
+I++DSGTT TFLPQ + S L +++ I+ + V DP G LCY+ + P +T
Sbjct: 293 EEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEINAPIIT 352
Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
HF+ A+V+L N F+++ ED+VC T+ + + GN+ Q NFLVG+D+ ++
Sbjct: 353 AHFKDANVELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLRKK 406
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 84/134 (62%), Gaps = 6/134 (4%)
Query: 291 NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
+++ V +I++DSGTT T+LP + L ++ I+ + V DP G LCY+ ++
Sbjct: 410 SKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVD 468
Query: 351 QV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
Q+ P +T HF+ A+V+L N F+++ ED+VC +V T+ + I GN+ Q NFLVG+D
Sbjct: 469 QIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGNLAQVNFLVGFD 526
Query: 408 IEQQTVSFKPTDCT 421
+ ++ VSFK DCT
Sbjct: 527 LRKKRVSFKAADCT 540
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 194/437 (44%), Positives = 266/437 (60%), Gaps = 49/437 (11%)
Query: 8 VFILFF--LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
+ ILF+ LCF ++S A GFSVELIHRDS KSP Y ++ YQ + +A RS+NR
Sbjct: 6 LLILFYFSLCF-IISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRA 64
Query: 66 NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
NHF + + + Q+ +IP++ YL+ S+GTPP + +ADTGSD++W QCEPC
Sbjct: 65 NHFYKTAL---TNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPC--K 119
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
+CY Q +P F P SSTYK++PCSS C S Q GNL+ +
Sbjct: 120 ECYNQTTPKFKPSKSSTYKNIPCSSDLCKSGQQ---------------------GNLSVD 158
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T+TL S+TG ++ P GCGT+N F ++GIVGLGGG SLI+Q+ ++I KFSY
Sbjct: 159 TLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSY 218
Query: 246 CLVP-----VSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL---- 294
CL+P +++K+NFG +VSG GVVSTP+ K FY LT++A SVGN+R+
Sbjct: 219 CLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEG 278
Query: 295 ---GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 350
G +I+IDSGTTLT +P +NL S + +++ + V DPT LCYS S
Sbjct: 279 SSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYSVTSDGY 338
Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-----IYGNIMQTNFLVG 405
P +T HF+GADVKL + FV V++ IVC F + +P I+GN+ Q N LVG
Sbjct: 339 DFPIITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVG 398
Query: 406 YDIEQQTVSFKPTDCTK 422
YD++Q+ VSFKPTDC+K
Sbjct: 399 YDLQQKIVSFKPTDCSK 415
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 344 bits (883), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 190/433 (43%), Positives = 265/433 (61%), Gaps = 37/433 (8%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+FL+ +F F CF ++S A GF++ELIHRDS KSPFY ++ Y+R+ +A+ RS+
Sbjct: 5 SFLTLLFFTIF-CF-IISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSI 62
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
NR+NHF + S S+ Q+ + + YL+ SIGTPP + DTGSDL+W QCEPC
Sbjct: 63 NRVNHFYKYSLTSTP---QSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPC 119
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QCY Q +P+FDP +SS+Y+++PC S C S+ SC G L
Sbjct: 120 --KQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSCD---------------VRGYL 162
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
+ ET+TL STTG +V+ P GCG N G F+ ++GIVGLG G +SL SQ+ T+I GK
Sbjct: 163 SVETLTLDSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGK 222
Query: 243 FSYCL---VPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVS 297
FSYCL +P S++K+NFG IV G G ++TP+ K A++ Y LT++A SVGN+ +
Sbjct: 223 FSYCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFG 282
Query: 298 TP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
P +I+IDSGTT TFLP S ++ I + V DP G+ +LCY+
Sbjct: 283 GPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHG 342
Query: 351 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
+ P +T HF+GAD+KL + F+KVS+ I C F I + I+GN+ Q N LVGY++
Sbjct: 343 FEAPLITAHFKGADIKLYYISTFIKVSDGIACLAF--IPSQTAIFGNVAQQNLLVGYNLV 400
Query: 410 QQTVSFKPTDCTK 422
Q TV+FKP DCTK
Sbjct: 401 QNTVTFKPVDCTK 413
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 200/420 (47%), Positives = 263/420 (62%), Gaps = 33/420 (7%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G F+ LIHRDSP SP YN T + RL+ + RS++R N F NS +S++K + DIIP
Sbjct: 31 GSFTASLIHRDSPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNS-VSAAKTLEYDIIP 89
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y +RISIGTPP E L +ADTGSDLIW QC+PC +CY Q SP+F+PK SSTY+ +
Sbjct: 90 GGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPC--QECYKQKSPIFNPKQSSTYRRV 147
Query: 147 PCSSSQCASLN--QKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C + C +LN ++CS C YS SYGD SF+ G LATE +GST ++
Sbjct: 148 LCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNN---SIQ 204
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------SSTK 254
+ FGCG +NGG F+ +GIVGLGGG +SLISQ+ T I KFSYCLVP+ S K
Sbjct: 205 ELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGK 264
Query: 255 INFGTNGIVSGPGV-VSTPLT--KAKTFYVLTIDAISVGNQRLG---------VSTPDIV 302
I FG N +SG VSTPL + +TFY LT++AISVGN+RL V +I+
Sbjct: 265 IVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNII 324
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 362
IDSGTTLTFL + L V+ +E + V+DP G +C+ ++P +T+HF A
Sbjct: 325 IDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKIGIELPIITVHFTDA 384
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
DV+L N F K ED++C F I +N + I+GN+ Q NFLVGYD+++ VSF PTDC+
Sbjct: 385 DVELKPINTFAKAEEDLLC--FTMIPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 207/446 (46%), Positives = 271/446 (60%), Gaps = 38/446 (8%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
+ + FFL F V FSVELIHRDSP SP YN T RL A RS++R
Sbjct: 5 ILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 64
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
FN S + Q+ +I + + + I+IGTPP + A+ADTGSDL W QC+PC QC
Sbjct: 65 FNHQLSQTDL---QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC--QQC 119
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLA 183
Y ++ P+FD K SSTYKS PC S C +L+ ++ C N C+Y SYGD SFS G++A
Sbjct: 120 YKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVA 179
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
TETV++ S +G V+ PG FGCG NNGG F+ +GI+GLGGG +SLISQ+ ++I+ KF
Sbjct: 180 TETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKF 239
Query: 244 SYCLVPVSSTK-----INFGTNGIVSG----PGVVSTPLTKAK--TFYVLTIDAISVGNQ 292
SYCL S+T IN GTN I S GVVSTPL + T+Y LT++AISVG +
Sbjct: 240 SYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKK 299
Query: 293 R---------------LGVSTPDIVIDSGTTLTFLPQGYNSNLLS-VMSSMIEAQPVADP 336
+ L ++ +I+IDSGTTLT L G+ S V S+ A+ V+DP
Sbjct: 300 KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDP 359
Query: 337 TGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYG 395
G L C+ S +PE+T+HF GADV+LS N FVK+SED+VC + T V IYG
Sbjct: 360 QGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVC-LSMVPTTEVAIYG 418
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
N Q +FLVGYD+E +TVSF+ DC+
Sbjct: 419 NFAQMDFLVGYDLETRTVSFQHMDCS 444
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 186/441 (42%), Positives = 252/441 (57%), Gaps = 26/441 (5%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+ S V L F+ +S E + G FS++LIHRDSPKSP YN SETP +RL R
Sbjct: 7 LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
R F++ S S + + NN YL++ISIGTPP + + DTGSDL+WTQC
Sbjct: 63 FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
PC CY Q +P+FDP S+++K + C S QC L+ SCS C +S YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G +ATET+TL S +GQ ++ I FGCG NN G FN G+ G GG +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238
Query: 239 IAG--KFSYCLVPVSS-----TKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
+ KFS CLVP + +KI FG VSG VVSTPL T+Y +T+D ISV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 290 GNQRLGVSTP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
G++ S+ ++ ID+GT T LP+ + + L+ + I +PV DP +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358
Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
CY +L P +T HF GADV+L N F+ E + C + I I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418
Query: 403 LVGYDIEQQTVSFKPTDCTKQ 423
L+G+D++ + VSFK DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 186/441 (42%), Positives = 252/441 (57%), Gaps = 26/441 (5%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+ S V L F+ +S E + G FS++LIHRDSPKSP YN SETP +RL R
Sbjct: 7 LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
R F++ S S + + NN YL++ISIGTPP + + DTGSDL+WTQC
Sbjct: 63 FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
PC CY Q +P+FDP S+++K + C S QC L+ SCS C +S YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G +ATET+TL S +GQ ++ I FGCG NN G FN G+ G GG +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238
Query: 239 IAG--KFSYCLVPVSS-----TKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
+ KFS CLVP + +KI FG VSG VVSTPL T+Y +T+D ISV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 290 GNQRLGVSTP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
G++ S+ ++ ID+GT T LP+ + + L+ + I +PV DP +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358
Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
CY +L P +T HF GADV+L N F+ E + C + I I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418
Query: 403 LVGYDIEQQTVSFKPTDCTKQ 423
L+G+D++ + VSFK DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 183/440 (41%), Positives = 260/440 (59%), Gaps = 27/440 (6%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
F S + +LF CF VS + Q GFSVELIH S KSPFYN++E+ +QR+ + + S N
Sbjct: 3 FYSSLLLLF--CFCRVSVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNMKHSTN 60
Query: 64 RLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
R+++ N S +K + P + Y+I IGTPP + V DT +D IW QC PC
Sbjct: 61 RVHYLNHVFSFPPNKVPNIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPC 120
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSN 179
P C+ SP+FDP SSTYK++PCSS +C ++ CS + C+YS +YG ++S
Sbjct: 121 KP--CFNTTSPMFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQ 178
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G+L+ +T+TL S ++ I GCG N G +G +GLG G +S ISQ+ ++I
Sbjct: 179 GDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSI 238
Query: 240 AGKFSYCLVPVSST-----KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL 294
GKFSYCLVP+ S K++FG +VSG G VSTP+T + Y T++A+SVG+ +
Sbjct: 239 GGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHII 298
Query: 295 GVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY-- 344
+ +IDSGTTLT LP+ S L S+++SM++ + P +LCY
Sbjct: 299 KFENSTSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKA 358
Query: 345 SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNF 402
+ +L VP +T HF GADV L+ N F + ++VC F + N P I GNI Q NF
Sbjct: 359 TLKNL-DVPIITAHFNGADVHLNSLNTFYPIDHEVVCFAFVSVGN-FPGTIIGNIAQQNF 416
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
LVG+D+++ +SFKPTDCTK
Sbjct: 417 LVGFDLQKNIISFKPTDCTK 436
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 185/427 (43%), Positives = 257/427 (60%), Gaps = 26/427 (6%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
VV+PIE+Q GFSVELIH DS +SPFYN ET QR+ + +T S+ R ++ N S+S +
Sbjct: 16 VVTPIESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHN 75
Query: 78 KASQADIIPNNANY-LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+ IIP +Y ++ SIGTPP + V DTGSD IW QC+PC P C Q SP+F+
Sbjct: 76 DLPKPTIIPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKP--CLNQTSPIFN 133
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SSTYK++ CSS C + CS C+Y ++Y D S S G+++ +T+TL S
Sbjct: 134 PSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSND 193
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS- 252
G ++ P I GCG N +GI+G G G+ S++SQ+ ++I GKFSYCL + S
Sbjct: 194 GSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSK 253
Query: 253 ----TKINFGTNGIVSGPGVVSTPLTKAKTFYV----LTIDAISVGNQRLGVS----TPD 300
+K+ FG +VSG GVVSTPL ++ FYV ++A SVG+ + + PD
Sbjct: 254 ANISSKLYFGDMAVVSGHGVVSTPLIQS--FYVGNYFTNLEAFSVGDHIIKLKDSSLIPD 311
Query: 301 ----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEV 355
VIDSG+T+T LP S L + + SM++ + V DPT L LCY +VP +
Sbjct: 312 NEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPII 371
Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
T HFRGADVKL+ N F++++ +++C F +YGNI Q NFLVGYD + +SF
Sbjct: 372 TAHFRGADVKLNAFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISF 431
Query: 416 KPTDCTK 422
KPT+CTK
Sbjct: 432 KPTNCTK 438
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 325 bits (833), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 196/425 (46%), Positives = 265/425 (62%), Gaps = 38/425 (8%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
SVELIHRDSP SP YN T RL A RS++R N +I S Q+ +I +
Sbjct: 26 LSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRLN---NILSQTDLQSGLIGAD 82
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+ + I+IGTPP + A+ADTGSDL W QC+PC QCY ++ P+FD K SSTYKS PC
Sbjct: 83 GEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPC--QQCYKENGPIFDKKKSSTYKSEPC 140
Query: 149 SSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
S C +L+ ++ C C+Y SYGD SFS G++ATET+++ S +G V+ PG F
Sbjct: 141 DSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVF 200
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGT 259
GCG NNGG F+ +GI+GLGGG +SLISQ+ ++I+ KFSYCL S+T IN GT
Sbjct: 201 GCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGT 260
Query: 260 NGIVSG----PGVVSTPLT--KAKTFYVLTIDAISVGNQRL------------GV---ST 298
N I S GV+STPL + +T+Y LT++AISVG +++ G+ ++
Sbjct: 261 NSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETS 320
Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSFNSLS-QVPEVT 356
+I+IDSGTTLT L G+ + + ++ A+ V+DP G L C+ S +PE+T
Sbjct: 321 GNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFKSGSAEIGLPEIT 380
Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+HF GADV+LS N FVKVSED+VC + T V IYGN Q +FLVGYD+E +TVSF+
Sbjct: 381 VHFTGADVRLSPINAFVKVSEDMVC-LSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQ 439
Query: 417 PTDCT 421
DC+
Sbjct: 440 RMDCS 444
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 179/439 (40%), Positives = 270/439 (61%), Gaps = 25/439 (5%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M+ F I F+LC ++ A G S+E+IHRD KSP Y+ + T +QR + + R
Sbjct: 1 MSRFSVLTLIFFYLCCFIYFS-HASKKGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHR 59
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S+NR+N+F + S++ ++ + + P YLI S+GTPP + DTGS+++W QC+
Sbjct: 60 SINRVNYFTKEFSLNKNQPV-STLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ 118
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCS--GVNCQYSVSYGDGS 176
PC + C+ Q SP+F+P SS+YK++PC+SS C N SCS G C+YS++YG +
Sbjct: 119 PC--NTCFNQTSPIFNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDA 176
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM- 235
S G+L+ +++TL ST+G +V P I GCG N NS+++G+VG+G G +SLI Q+
Sbjct: 177 KSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVG 236
Query: 236 RTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAI 287
+++ KFSYCL+P SS+K+ FG + +VSG VVSTP+ K + +Y LT++A
Sbjct: 237 SSSVGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAF 296
Query: 288 SVGNQRL------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
SVGN R+ ST +I+IDSGT LT LP + S L+S ++ ++ + P L
Sbjct: 297 SVGNNRIEYGERSNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLS 356
Query: 342 LCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
LCY+ VP++T HF GADVKL+ + F + I+C F +N + I+GNI Q
Sbjct: 357 LCYNTTGKQLNVPDITAHFNGADVKLNSNGTFFPFEDGIMCFGFIS-SNGLEIFGNIAQN 415
Query: 401 NFLVGYDIEQQTVSFKPTD 419
N L+ YD+E++ +SFKPTD
Sbjct: 416 NLLIDYDLEKEIISFKPTD 434
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 202/449 (44%), Positives = 273/449 (60%), Gaps = 41/449 (9%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
T L C L + + S A SVELIHRDSP SP YN T RL A
Sbjct: 5 TLLYCS--LLAITIFFTSTSSAHRKNLSVELIHRDSPHSPLYNPQHTVSDRLNAAF---- 58
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
L +++ S+ Q+ +I N Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 59 --LRSISRSRRFSTKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPC 116
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDGSFS 178
QCY Q++PLFD K SSTYK+ C S C +L +++ C S C+Y SYGD SF+
Sbjct: 117 --QQCYKQNTPLFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFT 174
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G +ATET+++ S++G V+ PG FGCG NNGG F +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGEVATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234
Query: 239 IAGKFSYCLVPVSSTK-----INFGTNGIVSGP----GVVSTPLTKA--KTFYVLTIDAI 287
I KFSYCL S+T IN GTN + S P +++TPL + +T+Y LT++AI
Sbjct: 235 IGKKFSYCLSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAI 294
Query: 288 SVGNQRL------GVS-------TPDIVIDSGTTLTFLPQGYNSNLLSVM-SSMIEAQPV 333
+VG +L G S T +I+IDSGTTLT L G+ + +V+ S+ A+ V
Sbjct: 295 TVGKTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRV 354
Query: 334 ADPTGSLELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
+DP G L C+ S + +P +T+HF GADVKLS N FVK+SEDIVC + T V
Sbjct: 355 SDPQGILTHCFKSGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDIVC-LSMIPTTEVA 413
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
IYGN++Q +FLVGYD+E +TVSF+ DC+
Sbjct: 414 IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 198/449 (44%), Positives = 272/449 (60%), Gaps = 41/449 (9%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
TFL C L + F+ S A +VELIHRDSP SP YN T RL A RS+
Sbjct: 5 TFLYCS--LLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSI 62
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R F + + Q+ +I N Y + ISIGTPP++ A+ADTGSDL W QC+PC
Sbjct: 63 SRSRRFTTKTDL------QSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPC 116
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFS 178
QCY Q+SPLFD K SSTYK+ C S C +L +++ C C+Y SYGD SF+
Sbjct: 117 --QQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFT 174
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G++ATET+++ S++G +V+ PG FGCG NNGG F +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234
Query: 239 IAGKFSYCLVPVSSTK-----INFGTNGIVSGP----GVVSTPLTKA--KTFYVLTIDAI 287
I KFSYCL ++T IN GTN I S P ++TPL + +T+Y LT++A+
Sbjct: 235 IGKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAV 294
Query: 288 SVGNQRLGVS-------------TPDIVIDSGTTLTFLPQGYNSNL-LSVMSSMIEAQPV 333
+VG +L + T +I+IDSGTTLT L G+ + +V S+ A+ V
Sbjct: 295 TVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRV 354
Query: 334 ADPTGSLELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
+DP G L C+ S + +P +T+HF ADVKLS N FVK++ED VC + T V
Sbjct: 355 SDPQGLLTHCFKSGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVC-LSMIPTTEVA 413
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
IYGN++Q +FLVGYD+E +TVSF+ DC+
Sbjct: 414 IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 307 bits (787), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 182/440 (41%), Positives = 265/440 (60%), Gaps = 31/440 (7%)
Query: 1 MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
MA +S F ILF + F + I G F+ L HRDS SP SS + Y RL +A
Sbjct: 1 MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
RSL+R ++ S + Q+ I P + YL+ +SIGTPP + L +ADTGSDL W Q
Sbjct: 60 RRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQ 119
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
C PC +CY Q P+F+P S+++ +PC++ C +++ C GV C YS +YGD +
Sbjct: 120 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 176
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+S G+L E +T+GS++ ++V GCG + G F +G++GLGGG +SL+SQM
Sbjct: 177 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 229
Query: 237 TT--IAGKFSYC---LVPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
T I+ +FSYC L+ ++ KINFG N +VSGPGVVSTPL T+Y +T++AIS+
Sbjct: 230 QTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISI 289
Query: 290 GNQR--LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-- 345
GN+R +++IDSGTTLT LP+ ++S + +++A+ V DP GSL+LC+
Sbjct: 290 GNERHMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDG 349
Query: 346 FNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQT 400
N+ + +P +T HF GA+V L N F KV++++ C K T I GN+ Q
Sbjct: 350 INAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQA 409
Query: 401 NFLVGYDIEQQTVSFKPTDC 420
NFL+GYD+E + +SFKPT C
Sbjct: 410 NFLIGYDLEAKRLSFKPTVC 429
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 191/437 (43%), Positives = 257/437 (58%), Gaps = 27/437 (6%)
Query: 9 FILFFLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
F+ F L FY VS + EA GF+V+LIHRDSP SPFYN S TP QR+ +A RS++
Sbjct: 4 FVFFCLAFYSVSSLFSTEANESPSGFTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSIS 63
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
RLN + N ++K Q+ +I +N YL+R IGTPP ERLA ADTGSDLIW QC PC
Sbjct: 64 RLNRVS-NLLDQNNKLPQSVLILHNGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPC- 121
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDG-SFS 178
+ C+ Q +PLF P SST+ C S C L QK C SG C Y+ YGD SFS
Sbjct: 122 -ASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSG-ECIYTYKYGDQYSFS 179
Query: 179 NGNLATETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQM 235
G L+TET+ S G Q VA P FGCG NN +F S K TGI+GLG G +SL+SQ+
Sbjct: 180 EGLLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQI 239
Query: 236 RTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISV 289
I KFSYCL+P+ ST K+ FG I++G GVVSTP+ T+Y L ++A++V
Sbjct: 240 GDQIGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTV 299
Query: 290 GNQRLGVSTPD--IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
+ + + D ++IDSGT LT+L + + N + + + + V D L C+ +
Sbjct: 300 AQKTVPTGSTDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYR 359
Query: 348 SLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVG 405
PE+ F GA V L +N FV + + VC + + S + I+G+ Q +F V
Sbjct: 360 DNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVE 419
Query: 406 YDIEQQTVSFKPTDCTK 422
YD+E + VSF+PTDC+K
Sbjct: 420 YDLEGKKVSFQPTDCSK 436
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 178/432 (41%), Positives = 254/432 (58%), Gaps = 22/432 (5%)
Query: 9 FILFFLCFYVVSPIEAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
IL +S EA+ G GFSV+LIHRDSP SPFYN S TP +R+ +A RS++RL
Sbjct: 7 MILALFSLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQ 66
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
+ + +K ++ +IP+ YL+R IG+PP ERLA+ DTGS LIW QC PC
Sbjct: 67 RVSH--FLDENKLPESLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPC--HN 122
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLA 183
C+ Q++PLF+P SSTYK C S C L +Q+ C + C Y + YGD SFS G L
Sbjct: 123 CFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILG 182
Query: 184 TETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQMRTTIA 240
TET++ GST G Q V+ P FGCG NN ++ S K GI GLG G +SL+SQ+ I
Sbjct: 183 TETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG 242
Query: 241 GKFSYCLVPVSST---KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL 294
KFSYCL+P ST K+ FG+ I++ GVVSTPL T+Y L ++A+++G + +
Sbjct: 243 HKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVV 302
Query: 295 GVSTPD--IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
D IVIDSGT LT+L + +N ++ + + + + D L+ C+ + +
Sbjct: 303 STGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCFPNRANLAI 362
Query: 353 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
P++ F GA V L N + +++ +I+C +V + ++G+I Q +F V YD+E
Sbjct: 363 PDIAFQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEG 422
Query: 411 QTVSFKPTDCTK 422
+ VSF PTDC K
Sbjct: 423 KKVSFAPTDCAK 434
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 175/411 (42%), Positives = 250/411 (60%), Gaps = 20/411 (4%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD----I 84
F+++LIH DSP SPFYNSS T Q +R+A RS++R N + + S S ++ ++ I
Sbjct: 30 FTIDLIHHDSPPSPFYNSSMTRSQLIRNAAMRSISRANQLSLSLSHSLNQLKESSPEPII 89
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
IPNN NYL+RI IGTP ERLA+ADTGSDL W QC PC ++C+ Q++PL+DP SST+
Sbjct: 90 IPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFT 149
Query: 145 SLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
LPC S C L +Q CS +C Y+ +YGD S+S G L+++++ L Q
Sbjct: 150 LLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRL--MLLQLHYNSK 207
Query: 202 ITFGCGTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKIN 256
I FGCG N + KTTGIVGLG G +SL+SQ+ I KFSYCL+P SS +K+
Sbjct: 208 ICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLK 267
Query: 257 FGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQ--RLGVSTPDIVIDSGTTLTFL 312
FG IV G GVVSTPL FY L ++ I+VG + + G + +I+IDSG+TLT+L
Sbjct: 268 FGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYL 327
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVKLSRSNF 371
+ + + +S++ + + + C+++ +S P+V HF G DV L N
Sbjct: 328 EESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGGDVVLKPMNT 387
Query: 372 FVKVSEDIVCS-VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V + ++++CS V + + I+GN+ Q +F VGYDI+ VSF PTDC+
Sbjct: 388 LVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDCS 438
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 171/430 (39%), Positives = 248/430 (57%), Gaps = 39/430 (9%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
F+L CF +S + Q GF+VELIH S +SPFYN ET QR+ L S+NR+ +
Sbjct: 7 FVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYL 66
Query: 69 NQNSSISSSKASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
N S S +K + A Y++ SIGTPP + ++ DTG+D IW QC+PC P C
Sbjct: 67 NHVFSFSPNKIQDVPLSSFMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKP--C 124
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Q SP+F P SSTYK++PC+S C + DG + L +T+
Sbjct: 125 LNQTSPMFHPSKSSTYKTIPCTSPICKN-----------------ADGHY----LGVDTL 163
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
TL S G ++ I GCG N G +G +GL G +S ISQ+ ++I GKFSYCL
Sbjct: 164 TLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCL 223
Query: 248 VPV-----SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD-- 300
VP+ S+K++FG VSG G VSTP+ K + Y ++++A SVG+ + + D
Sbjct: 224 VPLFSKENVSSKLHFGDKSTVSGLGTVSTPI-KEENGYFVSLEAFSVGDHIIKLENSDNR 282
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LSQVPEV 355
+IDSGTT+T LP+ S L SV+ M++ + V DP+ LCY S L++V +
Sbjct: 283 GNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLII 342
Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
T HF G++V L+ N F ++++++C F G +S+ I+GN++Q NFLVG+D+ ++T+
Sbjct: 343 TAHFSGSEVHLNALNTFYPITDEVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTI 402
Query: 414 SFKPTDCTKQ 423
SFKPTDCTK
Sbjct: 403 SFKPTDCTKH 412
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 189/440 (42%), Positives = 271/440 (61%), Gaps = 33/440 (7%)
Query: 8 VFILF-FLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
VF++F L Y S I EA G GFS++LIHRDSP SPFY+ S TP +R+ +A RS
Sbjct: 5 VFMVFMLLALYSPSSISTREAGEGLRGFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRS 64
Query: 62 ---LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
LNR++HF +++ S +IP N YL+ + IGTPP ERLA+ADTGSDLIW Q
Sbjct: 65 SSRLNRVSHFLDENNLPESL-----LIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQ 119
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDG 175
C PC C+ QD+PLF+P SST+K+ C S C S+ +Q+ C V C YS SYGD
Sbjct: 120 CSPC--QNCFPQDTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDK 177
Query: 176 SFSNGNLATETVTLGST-TGQAVALPGITFGCGTNNGGLFNS--KTTGIVGLGGGDISLI 232
SF+ G + TET++ GST Q V+ P FGCG N F++ K TG+VGLGGG +SL+
Sbjct: 178 SFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLV 237
Query: 233 SQMRTTIAGKFSYCLVPVSS---TKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDA 286
SQ+ I KFSYCL+P SS +K+ FG+ IV+ GVVSTPL +FY L ++A
Sbjct: 238 SQLGPQIGYKFSYCLLPFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEA 297
Query: 287 ISVGNQRL--GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
+++G + + G + +I+IDSGT LT+L Q + +N ++ + ++ + D + C+
Sbjct: 298 VTIGQKVVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCF 357
Query: 345 SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNF 402
+ ++ +P + F GA V L N +K+ + +++C +V + + I+GN+ Q +F
Sbjct: 358 PYRDMT-IPVIAFQFTGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDF 416
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
V YD+E + VSF PTDCTK
Sbjct: 417 QVVYDLEGKKVSFAPTDCTK 436
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 292 bits (747), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 172/433 (39%), Positives = 245/433 (56%), Gaps = 30/433 (6%)
Query: 14 LCFYVVSPIEAQT-----GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
L Y++S + ++ GFS++LIHRDSP SPFY S TP R+ + RS+ +LN
Sbjct: 9 LALYLLSTVSSREVSEGQRGFSIDLIHRDSPLSPFYKPSLTPSDRIINTALRSIYQLNR- 67
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+S ++ K + IPN+ YL+R IGTPP ERLA+ADT SDLIW QC PC C+
Sbjct: 68 ASHSDLNEKKTLERVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSPC--ETCF 125
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
QD+PLF+P SST+ +L C S C S N C V C Y+ +YGDGS + G L TE+
Sbjct: 126 PQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTES 185
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
+ GS Q V P FGCG+NN + ++K TGIVGLG G +SL+SQ+ I KFS
Sbjct: 186 IHFGS---QTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFS 242
Query: 245 YCLVPVSST---KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST 298
YCL+P +ST K+ FG + ++G GVVSTPL ++Y L + I++G + L V T
Sbjct: 243 YCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRT 302
Query: 299 PD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQV 352
D I+ID GT LT+L + N ++++ + D + C+ +
Sbjct: 303 TDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITF 362
Query: 353 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGI-TNSVPIYGNIMQTNFLVGYDIE 409
P++ F GA V LS N F + + +++C +V ++GN+ Q +F V YD +
Sbjct: 363 PKIVFQFTGAKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRK 422
Query: 410 QQTVSFKPTDCTK 422
+ VSF P DC+K
Sbjct: 423 GKKVSFAPADCSK 435
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 288 bits (736), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 184/416 (44%), Positives = 249/416 (59%), Gaps = 32/416 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+ L RDSP SP +N S + Y L DA RS +R + + S+ ++ IIP+
Sbjct: 27 GFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPD 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ +L+ I IGTPP +A+ADTGSDL WTQC PC +C+ Q P+F+P+ SS+Y+ +
Sbjct: 87 SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPC--RECFNQSQPIFNPRRSSSYRKVS 144
Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+S C SL C +C Y SYGD SF+ G+LA++ +T+GS LP G
Sbjct: 145 CASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIG 199
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---KFSYCLVPVSSTK-----INF 257
CG NGG F T+GI+GLGGG +SL+SQMR TIAG +FSYCL S I+F
Sbjct: 200 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMR-TIAGVKPRFSYCLPTFFSNANITGTISF 258
Query: 258 GTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL----GVST----PDIVIDSGT 307
G +VSG VVSTPL TFY LT++AISVG +R G+S +I+IDSGT
Sbjct: 259 GRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGT 318
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADV 364
TLT LP+ + S ++ +I+A+ V DP+G LELCYS + +P +T HF GADV
Sbjct: 319 TLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADV 378
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
KL N F V++++ C F T V I+GN+ Q NF VGYD+ + +SF+P C
Sbjct: 379 KLLPVNTFAPVADNVTCLTFAPATQ-VAIFGNLAQINFEVGYDLGNKRLSFEPKLC 433
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 169/430 (39%), Positives = 254/430 (59%), Gaps = 27/430 (6%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
IL + F + I G F+ L HRDS SP SS + Y RL +A RSL+R
Sbjct: 11 LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
++ + + QA + P + YL+ +SIGTPP + + +ADTGSDL+W QC PC +CY
Sbjct: 70 LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCY 127
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
Q P+FDP S+++ +PC+S C +++ C C YS +YGD +++ G+L E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSY 245
T+GS++ ++V GCG + G +G++GLGGG +SL+SQM T I+ +FSY
Sbjct: 188 TIGSSSVKSV------IGCG-HESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
Query: 246 C---LVPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTP- 299
C L+ ++ KINFG N +VSGPGVVSTPL T+Y +T++AIS+GN+R S
Sbjct: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQ 300
Query: 300 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY----SFNSLSQVPE 354
+++IDSGTTL+FLP+ ++S + +++A+ V DP +LC+ + + S +P
Sbjct: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQ 411
+T F GA+V L N F KV+ ++ C T+ I GN+ NFL+GYD+E +
Sbjct: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
Query: 412 TVSFKPTDCT 421
+SFKPT CT
Sbjct: 421 RLSFKPTVCT 430
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 170/440 (38%), Positives = 245/440 (55%), Gaps = 38/440 (8%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M + + + +C ++ + T GFSV LI ++S ++ P +RL +
Sbjct: 1 MVVYPTSFHLATIICLMLLPLHISATEGFSVNLIRKNSS-----HAHVLPLRRLMEL--- 52
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S++ + Q+ I +YL+ +SIGTPP + +ADTGSDL WT C
Sbjct: 53 -----------SAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTWTSCV 101
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSN 179
PC + CY Q +P+FDP+ S+TY+++ C S C L+ CS C Y+ +Y + +
Sbjct: 102 PC--NNCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITR 159
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G LA ET+TL ST G++V L GI FGCG NN G FN GI+GLGGG +SLISQM ++
Sbjct: 160 GVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSF 219
Query: 240 AGK-FSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGN 291
GK FS CLVP S+K++FG VSG GVVSTPL + KT Y +T+ ISV N
Sbjct: 220 GGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVEN 279
Query: 292 QRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELC 343
L V ++ +DSGT T LP +++ + S + +PV DP +LC
Sbjct: 280 TYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLC 339
Query: 344 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 403
Y + + P +T HF GADVKLS + F+ + + C F ++ +YGN Q+N+L
Sbjct: 340 YRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDGVFCLGFTNTSSDGGVYGNFAQSNYL 399
Query: 404 VGYDIEQQTVSFKPTDCTKQ 423
+G+D+++Q VSFKP DCTK
Sbjct: 400 IGFDLDRQVVSFKPKDCTKH 419
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 177/440 (40%), Positives = 253/440 (57%), Gaps = 27/440 (6%)
Query: 3 TFLSCVFILFFLCFYV--VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
T LS + FL + S ++A+ F+ ELIHRDSP SP +N+SET RL +A+ R
Sbjct: 9 TLLSFALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNSPLFNASETTDIRLANAVER 68
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC- 119
S +R+N FN S S + A I+ +N ++L++ISIG PPTE L TGSDL+W C
Sbjct: 69 SADRVNRFNDLISNSITAAEFPSIL-DNGDFLMKISIGIPPTELLVNVATGSDLVWIPCL 127
Query: 120 --EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS-YGDGS 176
+PC C D FDP SSTYK++PC S +C N +C +C YS S
Sbjct: 128 SFKPC-THNC---DLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDS 183
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+G+LA +T+TL STTG++ LP F CG GG + GI+GLG G +SL++++
Sbjct: 184 CPDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIGG--DYPGVGILGLGHGSLSLLNRIS 241
Query: 237 TTIAGKFSYCLVPVSS---TKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGN 291
I GKFS+C+VP SS +K++FG +VSG + ST L T Y L+ ISVGN
Sbjct: 242 HLIDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGN 301
Query: 292 QRL---GVSTPDIV----IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELC 343
+ + G+ + + +DSGT T+ P+ + S L + I+ +P+ DPT L LC
Sbjct: 302 KSISAGGIGSDYYMNGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLC 361
Query: 344 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNF 402
Y ++ P +T+HF G V+LS SN F++++EDIVC F + ++G QTN
Sbjct: 362 YRYSPDFSPPTITMHFEGGSVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNL 421
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
L+GYD++ +SF TDCTK
Sbjct: 422 LIGYDLDAGFLSFLKTDCTK 441
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 172/435 (39%), Positives = 240/435 (55%), Gaps = 56/435 (12%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
+ L ++ IEA G F+V+LI R NSS+ + R+
Sbjct: 9 LLAILLLVFIFPSIEAHNGRFTVKLIPR--------NSSQVLFNRI-------------- 46
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+Q + ++ +YL+ +SIGTPP + A DTGSDLIW QC PC + CY
Sbjct: 47 ----------TAQTPVSVHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPC--TNCY 94
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATET 186
Q +P+FDP+ SSTY ++ S C+ L SCS NC Y+ SY D S + G LA ET
Sbjct: 95 KQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQET 154
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSY 245
+TL STTG+ VAL G+ FGCG NN G+FN K GI+GLG G +SL+SQ+ ++ GK FS
Sbjct: 155 LTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQ 214
Query: 246 CLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAKT---FYVLTIDAISVGNQRL--- 294
CLVP ++ ++FG V G GVVSTPL T FY +T+ ISV + L
Sbjct: 215 CLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFN 274
Query: 295 ------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFN 347
++ ++VIDSGT T LP+ + L+ + + + P+ DPT +LCY
Sbjct: 275 DGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP 334
Query: 348 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG-ITNSVPIYGNIMQTNFLVGY 406
+ + +T HF GADV L+ + F+ V + I C F +N IYGN Q+N+L+G+
Sbjct: 335 TNLKGTTLTAHFEGADVLLTPTQIFIPVQDGIFCFAFTSTFSNEYGIYGNHAQSNYLIGF 394
Query: 407 DIEQQTVSFKPTDCT 421
D+E+Q VSFK TDCT
Sbjct: 395 DLEKQLVSFKATDCT 409
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 278 bits (710), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 178/436 (40%), Positives = 262/436 (60%), Gaps = 58/436 (13%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
++LIHRDSP SP + + T RL+ + R+++R Q+ + Q D++P+
Sbjct: 29 LDLIHRDSPLSPLHTPNLTFSDRLQASFLRAISR-----QSRHVDF----QTDLLPSGGE 79
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++ +SIGTPP LA+ADTGSDL W Q +PC QCY Q P+FDP S+T+ LPC++
Sbjct: 80 YMMNLSIGTPPFPILAIADTGSDLTWLQSKPC--DQCYPQKGPIFDPSNSTTFHKLPCTT 137
Query: 151 SQCASLNQ--KSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C +L++ +SC+ C Y+ SYGD S++ G LA++TVT+G+ +V + + FGCG
Sbjct: 138 APCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNA---SVQIRNVAFGCG 194
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------------SSTKI 255
T NGG F+ + +GIVGLGGG++S +SQ+ TI KFSYCL+P+ ++++I
Sbjct: 195 TRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRI 254
Query: 256 NFGTNGIVSGP---GVV--STPLTKAK--TFYVLTIDAISVGNQRL-------------- 294
FG N + S GVV +TPL + T+Y LTI+AI+VG ++L
Sbjct: 255 VFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDS 314
Query: 295 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY-SFNS 348
V +I+IDSGTTLTFL + + L + + I+ + V D S+ LC+ S
Sbjct: 315 GSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKSGKE 374
Query: 349 LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
++P + +HFR GADV+L N FV+ E +VC TN V IYGN+ Q NF+VGYD
Sbjct: 375 EVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLP-TNDVGIYGNLAQMNFVVGYD 433
Query: 408 IEQQTVSFKPTDCTKQ 423
+ ++TVSF P DC+KQ
Sbjct: 434 LGKRTVSFLPADCSKQ 449
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 171/422 (40%), Positives = 239/422 (56%), Gaps = 33/422 (7%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSK 78
+P EA GFS +LIH++SP SPFY S+ + + N+L F Q S K
Sbjct: 21 TPTEAYNKGFSFKLIHKNSPNSPFYKSNN--FHK---------NKLRSFYQVPKKSFVQK 69
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
+ + NN +YL+++++G+PP + + DTGSDL+W QC PC CY Q SP+F+P
Sbjct: 70 SPYTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPC--GGCYRQKSPMFEPL 127
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S TY +PC S QC+ C YS SY D S + G LA E +T ST G V
Sbjct: 128 RSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVV 187
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPV-----SS 252
+ I FGCG +N G FN GI+G+GGG +SL+SQ+ T K FS CLVP +S
Sbjct: 188 VGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTS 247
Query: 253 TKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVG------NQRLGVSTPDIVID 304
INFG VSG GVV+TPL + +T Y++T++ ISVG N +S +I+ID
Sbjct: 248 GTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMID 307
Query: 305 SGTTLTFLPQGYNSNL---LSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 361
SGT T++PQ + L L V SS++ + DP +LCY + + P +T HF G
Sbjct: 308 SGTPATYIPQEFYERLVEELKVQSSLLPIE--DDPDLGTQLCYRSETNLEGPILTAHFEG 365
Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
ADV+L F+ + + C G T+ I+GN Q+N L+G+D++++T+SFKPTDCT
Sbjct: 366 ADVQLLPIQTFIPPKDGVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCT 425
Query: 422 KQ 423
Q
Sbjct: 426 NQ 427
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 254/441 (57%), Gaps = 43/441 (9%)
Query: 1 MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
MA +S F ILF + F + I G F+ L HRDS SP SS + Y RL +A
Sbjct: 1 MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
RSL+R ++ S + Q+ II GTPP + L +ADTGSDL W Q
Sbjct: 60 RRSLSRSAALLNRAATSGAVGLQSSII------------GTPPVDYLGIADTGSDLTWAQ 107
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
C PC +CY Q P+F+P S+++ +PC++ C +++ C GV C YS +YGD +
Sbjct: 108 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 164
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+S G+L E +T+GS++ ++V GCG + G F +G++GLGGG +SL+SQM
Sbjct: 165 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 217
Query: 237 TT--IAGKFSYC---LVPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
T I+ +FSYC L+ ++ KINFG N +VSGPGVVSTPL T+Y +T++AIS+
Sbjct: 218 QTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISI 277
Query: 290 GNQR--LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--- 344
GN+R +++IDSGTTL+FLP+ ++S + +++A+ V DP +LC+
Sbjct: 278 GNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 337
Query: 345 -SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQT 400
+ + S +P +T F GA+V L N F KV+ ++ C T+ I GN+
Sbjct: 338 INVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALA 397
Query: 401 NFLVGYDIEQQTVSFKPTDCT 421
NFL+GYD+E + +SFKPT CT
Sbjct: 398 NFLIGYDLEAKRLSFKPTVCT 418
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 169/439 (38%), Positives = 245/439 (55%), Gaps = 28/439 (6%)
Query: 8 VFILFFLCFY-VVSPIEAQT--GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
VF LC + + S EA GFS+ LIHR+SP SPFYN S TP +R+++ + RS R
Sbjct: 5 VFCFLLLCSHSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSFAR 64
Query: 65 LNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
S + ++ IP+ YL+R IGTPP ER A+ADTGSDLIW QC PC
Sbjct: 65 SKR-RLRLSQNDDRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQCAPC 123
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFS 178
+C Q++PLFDP+ SST+K++PC S C L +Q++C G + C Y YGD +
Sbjct: 124 --EKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLV 181
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGLFNSK-TTGIVGLGGGDISLISQMR 236
+G L E++ GS A+ P +TFGC +NN + SK G+VGLG G +SLISQ+
Sbjct: 182 SGILGFESINFGSKN-NAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLG 240
Query: 237 TTIAGKFSYCLVPVSS---TKINFGTNGIVSG-PGVVSTPL---TKAKTFYVLTIDAISV 289
I KFSYC P+SS +K+ FG + IV GVVSTPL + ++Y L ++ +S+
Sbjct: 241 YQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSI 300
Query: 290 GNQRLGVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
GN+++ S +I+IDSGT+ T L Q + + ++++ + + V P C+
Sbjct: 301 GNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFE 360
Query: 346 FN-SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-IYGNIMQTNFL 403
+ P+V F GA V++ SN F +++C V ++ I+GN Q +
Sbjct: 361 NKGKRKRFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQ 420
Query: 404 VGYDIEQQTVSFKPTDCTK 422
V YD++ VSF P DC K
Sbjct: 421 VEYDLQGGMVSFAPADCAK 439
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 153/370 (41%), Positives = 219/370 (59%), Gaps = 20/370 (5%)
Query: 72 SSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
S++ + + Q+ I +YL+ +SIGTPP + +ADTGSDL WT C PC ++CY Q
Sbjct: 6 SAMEKTVSPQSPIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPC--NKCYKQR 63
Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLG 190
+P+FDP+ S++Y+++ C S C L+ CS +C Y+ +Y + + G LA ET+TL
Sbjct: 64 NPIFDPQKSTSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLS 123
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP 249
ST G++V L GI FGCG NN G FN + GI+GLGGG +S ISQ+ ++ GK FS CLVP
Sbjct: 124 STKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVP 183
Query: 250 VS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL-------- 294
S+K++ G VSG GVVSTPL + KT Y +T+ ISVGN L
Sbjct: 184 FHTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQ 243
Query: 295 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVP 353
V ++ +DSGT T LP L++ + S + +PV D +LCY + + P
Sbjct: 244 SVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRGP 303
Query: 354 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+T HF G DVKL + FV + + C F ++ +YGN Q+N+L+G+D+++Q V
Sbjct: 304 VLTAHFEGGDVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVV 363
Query: 414 SFKPTDCTKQ 423
SFKP DCTK
Sbjct: 364 SFKPMDCTKH 373
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 171/433 (39%), Positives = 230/433 (53%), Gaps = 28/433 (6%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
V LFFL ++ GFS++LI R SP SP YNS T + ++ A RS+ R
Sbjct: 5 VLTLFFLVSTMLVDASKSLMGFSIDLIPRHSPISPLYNSQMTQTELVKSAALRSITRSKR 64
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
N IS + IP++ YL+R S+GTP ERLA+ DTGSDL W QC PC C
Sbjct: 65 VNFIGQISPPLSPIITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPC--KTC 122
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC-SGVNCQYSVSYGDGSFSNGNLAT 184
Y Q++PLFDP SSTY +PC S C NQ+ C S C Y YG SF+ G L
Sbjct: 123 YPQEAPLFDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGY 182
Query: 185 ETVTLGST-TGQAVA-LPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQMRTTIA 240
+T++ ST GQ A P FGC + F +K G VGLG G +SL SQ+ I
Sbjct: 183 DTISFSSTGMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIG 242
Query: 241 GKFSYCLVPVSST---KINFG----TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQR 293
KFSYC+VP SST K+ FG TN +VS P +++ ++YVL ++ I+VG ++
Sbjct: 243 HKFSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMIN---PSYPSYYVLNLEGITVGQKK 299
Query: 294 L--GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ 351
+ G +I+IDS LT L QG ++ +S + I + D E C +
Sbjct: 300 VLTGQIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLN 359
Query: 352 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDI 408
PE HF GADV L N F+ + ++VC KGI+ I+GN Q NF V YD+
Sbjct: 360 FPEFVFHFTGADVVLGPKNMFIALDNNLVCMTVVPSKGIS----IFGNWAQVNFQVEYDL 415
Query: 409 EQQTVSFKPTDCT 421
++ VSF PT+C+
Sbjct: 416 GEKKVSFAPTNCS 428
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 172/449 (38%), Positives = 238/449 (53%), Gaps = 35/449 (7%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+F S + IL + + I+A F+ ELIH DSP SPF+N+SET RL AL RS
Sbjct: 12 SFTSLIIILSTVFLSSFAIIQADKFSFTAELIHIDSPNSPFFNASETTTHRLAKALQRSA 71
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
NR+ N S +S + A I + NYL+++ IGTPPTE A DTGS++IW C C
Sbjct: 72 NRVARLNPLS--NSDEGVHASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINC 129
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG-SFSNGN 181
C+ Q S +F+P SSTY+ PC S QC + + S C YS + NG
Sbjct: 130 --KDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGR 187
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+A +T+TL S+ G+ LP F CG + F G++GLG G +SL S++ G
Sbjct: 188 IAVDTMTLTSSDGRPFPLPYSDFVCGNSIYKTF--AGVGVIGLGRGALSLTSKLYHLSDG 245
Query: 242 KFSYCLVPVSS---TKINFGTNGIVSGPG--VVSTPLTKAKTF--YVLTIDAISVGNQRL 294
KFSYCL S +KINFG +S VVST L + Y +T++ ISVG +R
Sbjct: 246 KFSYCLADYYSKQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQ 305
Query: 295 GVSTPD---------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS------ 339
+ D ++IDSGT T LP+ + L S +S I P P S
Sbjct: 306 DLYYVDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSM 365
Query: 340 -----LELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPI 393
L C+ + + P++TIHF ADV+LS N F++V+ED+VC F +
Sbjct: 366 DNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTV 425
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
YG+ Q NF++GYD+++ TVSFK TDC+K
Sbjct: 426 YGSWQQMNFILGYDLKRGTVSFKRTDCSK 454
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 156/375 (41%), Positives = 219/375 (58%), Gaps = 27/375 (7%)
Query: 70 QNSSISSSKAS--QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+NSS S K S Q+ + + YL+ +SIGTPP + A ADTGSDL+W QC PC ++C
Sbjct: 37 RNSSHDSYKPSTIQSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQCIPC--TKC 94
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATE 185
Y Q +P+FDP+ SS+Y ++ C + C L+ CS C Y+ SY D S + G LA E
Sbjct: 95 YKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQE 154
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---K 242
T+TL STTG+ VA GI FGCG NN G FN + G++GLG G +SLISQ+ +++
Sbjct: 155 TLTLTSTTGEPVAFQGIIFGCGHNNSG-FNDREMGLIGLGRGPLSLISQIGSSLGAGGNM 213
Query: 243 FSYCLVPVSS-----TKINFGTNGIVSGPGVVSTPL-TKAKTFYVLTIDAISVGNQRL-- 294
FS CLVP ++ +++NFG V G G VSTPL +K T Y T+ ISV + L
Sbjct: 214 FSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPF 273
Query: 295 -------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
++ +I+IDSGTT+T+LP+ + L+ + + + +P ELCY
Sbjct: 274 SNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFR--IDGYELCYQTP 331
Query: 348 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
+ P +TIHF G DV L+ + F+ V +D C YGN Q+N+L+G+D
Sbjct: 332 TNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDNFCFAVFDTNEEYVTYGNYAQSNYLIGFD 391
Query: 408 IEQQTVSFKPTDCTK 422
+E+Q VSFK TDCTK
Sbjct: 392 LERQVVSFKATDCTK 406
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 167/432 (38%), Positives = 238/432 (55%), Gaps = 40/432 (9%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLR---------DALTRSLNRLNHFNQNSSI 74
A GGFSV+ IHRDS +SP+ + + +P+ R + L RS + + S
Sbjct: 28 AGEGGFSVDFIHRDSARSPYRHPALSPHARALAAARRSLRGEVLGRSYSGASPAAAPVSA 87
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP- 133
+ ++ II + YL+ +++GTPPT+ LA+ADTGSDL+W C S + D+
Sbjct: 88 ADGGV-ESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCS---SSGGGLADADA 143
Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVT 188
+F P SSTY L C S+ C +L+Q SC CQY SYGDGS + G L+TET +
Sbjct: 144 GGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFS 203
Query: 189 L--GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFS 244
G GQ V +P + FGC T + G F S G+VGLG G SL+SQ+ T I K S
Sbjct: 204 FVDGGGKGQ-VRVPRVNFGCSTASAGTFRSD--GLVGLGAGAFSLVSQLGATTHIDRKLS 260
Query: 245 YCLVPV----SSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGVST 298
YCL+P SS+ +NFG+ +VS PG STPL + ++Y + +++++VG Q +
Sbjct: 261 YCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHD 320
Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VP 353
I++DSGTTLTFL L++ + I+ Q V P L+LCY S+ +P
Sbjct: 321 SRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIP 380
Query: 354 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQ 410
+VT+ F GA V L N F + E +C V ++ S P I GNI Q NF VGYD++
Sbjct: 381 DVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDA 440
Query: 411 QTVSFKPTDCTK 422
+TV+F DC +
Sbjct: 441 RTVTFAAADCAR 452
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 164/417 (39%), Positives = 226/417 (54%), Gaps = 40/417 (9%)
Query: 24 AQTGGFSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
A GF+++LI +SP SPFY S E RL S
Sbjct: 3 ADNSGFTIQLIRHNSPNYSPFYKSDELHMHRL---------------------GSNGVFT 41
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ NN +YL+++++GTPP + + DTGSDL+W QC PC CY Q SP+F+P S+T
Sbjct: 42 RVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPC--QGCYRQKSPMFEPLRSNT 99
Query: 143 YKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
Y +PC S +C SL SCS C YS +Y D S + G LA ETVT ST G+ V +
Sbjct: 100 YTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGD 159
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVSST-----KI 255
I FGCG +N G FN GI+GLGGG +SL+SQ K FS CLVP + I
Sbjct: 160 IVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTI 219
Query: 256 NFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVG------NQRLGVSTPDIVIDSGT 307
+FG VSG GV +TPL + +T Y++T++ ISVG N +S +I+IDSGT
Sbjct: 220 SFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDSGT 279
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 366
T+LPQ + L+ + P+ DP +LCY + + P + HF GADV+L
Sbjct: 280 PATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEGPILIAHFEGADVQL 339
Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
F+ + + C G T+ I+GN Q+N L+G+D++++TVSFK TDC+ Q
Sbjct: 340 MPIQTFIPPKDGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCSNQ 396
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 153/430 (35%), Positives = 245/430 (56%), Gaps = 44/430 (10%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISS 76
P + + GF V L H D K+ T ++RLR + R NRL+ N ++ +
Sbjct: 43 PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNAMVLAAANATV 96
Query: 77 SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+A ++ N +L++++IG+PP A+ DTGSDLIWTQC+PC QC+ Q +P+FD
Sbjct: 97 GDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIFD 154
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
PK SS++ + CSS C +L +CS C+Y +YGD S + G LA ET T G +T
Sbjct: 155 PKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQ 214
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-- 254
+++PG+ FGCG +N G S+ G+VGLG G +SL+SQ++ KF+YCL + +K
Sbjct: 215 ISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKPS 271
Query: 255 -INFGTNGIV----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV---------- 296
+ G+ + S + +TPL K +FY L++ ISVG +L +
Sbjct: 272 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 331
Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSFNSLS--- 350
+ ++IDSGTT+T++ NS S+ + I PV D TG L+LC++ + +
Sbjct: 332 GSGGVIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQV 388
Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
+VP++T HF+GAD++L N+ + S+ + + G + + I+GN+ Q NF+V +D+++
Sbjct: 389 EVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQE 448
Query: 411 QTVSFKPTDC 420
+T+SF PT C
Sbjct: 449 ETLSFLPTQC 458
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 154/431 (35%), Positives = 246/431 (57%), Gaps = 46/431 (10%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
P + + GF V L H D K+ T ++RLR + R NRL+ N ++++ A+
Sbjct: 298 PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNA-MVLAAANAT 350
Query: 81 QAD-----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
D ++ N +L++++IG+PP A+ DTGSDLIWTQC+PC QC+ Q +P+F
Sbjct: 351 VGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIF 408
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
DPK SS++ + CSS C +L +CS C+Y +YGD S + G LA ET T G +T
Sbjct: 409 DPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTED 468
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK- 254
+++PG+ FGCG +N G S+ G+VGLG G +SL+SQ++ KF+YCL + +K
Sbjct: 469 QISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKP 525
Query: 255 --INFGTNGIV----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV--------- 296
+ G+ + S + +TPL K +FY L++ ISVG +L +
Sbjct: 526 SSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHD 585
Query: 297 -STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSF---NSL 349
+ ++IDSGTT+T++ NS S+ + I PV D TG L+LC++ +
Sbjct: 586 DGSGGVIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQ 642
Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
+VP++T HF+GAD++L N+ + S+ + + G + + I+GN+ Q NF+V +D++
Sbjct: 643 VEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQ 702
Query: 410 QQTVSFKPTDC 420
++T+SF PT C
Sbjct: 703 EETLSFLPTQC 713
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 166/415 (40%), Positives = 236/415 (56%), Gaps = 33/415 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G ++L+ DSP SPF + + +R + A+ RS +RL S+ KA +A +
Sbjct: 54 GLRIDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQM--SVDEVKAVEAPVYAG 111
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N +L++++IGTP A+ DTGSDL WTQC+PC + CY Q +P++DP SSTY +P
Sbjct: 112 NGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPC--TDCYPQPTPIYDPSQSSTYSKVP 169
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CSSS C +L SCSG NC+Y SYGD S + G L+ E+ TL S +LP I FGCG
Sbjct: 170 CSSSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQ-----SLPHIAFGCG 224
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKINFGTNGI 262
N G S+ G+VG G G +SLISQ+ ++ KFSYCLV P ++ + G
Sbjct: 225 QENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTAS 284
Query: 263 VSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTL 309
++ V STPL +++ TFY L+++ ISVG Q L ++ T ++IDSGTT+
Sbjct: 285 LNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTV 344
Query: 310 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGADVK 365
T+L Q GY+ +V+SS+ Q G L+LC+ S +S S P +T HF GAD
Sbjct: 345 TYLEQSGYDVVKKAVISSINLPQVDGSNIG-LDLCFEPQSGSSTSHFPTITFHFEGADFN 403
Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L + N+ S I C +N + I+GNI Q N+ + YD E+ +SF PT C
Sbjct: 404 LPKENYIYTDSSGIACLAMLP-SNGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 205/353 (58%), Gaps = 29/353 (8%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + +N+ YL+++ +GTPP E AV DTGS++ WTQC PC CY Q++P+FDP SS
Sbjct: 371 ADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPC--VHCYKQNAPIFDPSKSS 428
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C +C Y V Y D +++ G LAT+TVT+ ST+G+ +
Sbjct: 429 TFK-------------EKRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAE 475
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
GCG NN F G VGL G +SLI+QM G SYC ++KINFGTN
Sbjct: 476 TIIGCGRNNS-WFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKINFGTNA 534
Query: 262 IVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLG-VSTP------DIVIDSGTTLTF 311
IV G GVVST + T FY L +DA+SVG+ R+ + TP +IVIDSGTTLT+
Sbjct: 535 IVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTY 594
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
P+ Y + + + ++ A P ADPTG+ LCY N+ P +T+HF GAD+ L + N
Sbjct: 595 FPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYSNTTEIFPVITMHFSGGADLVLDKYN 654
Query: 371 FFVK-VSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
F++ S + C ++ I+GN Q NFLVGYD VSFKPT+C+
Sbjct: 655 MFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 146/423 (34%), Positives = 214/423 (50%), Gaps = 83/423 (19%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F L + +++ + + GF+++LIHR S S
Sbjct: 3 LATTMIAIF-LQIITYFLFTTTASSPHGFTIDLIHRRSNAS------------------- 42
Query: 61 SLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
+S +S+++A AD + + YL+++ IGTPP E AV DTGS+LIWTQ
Sbjct: 43 ----------SSRVSNTQAGSPYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQ 92
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
C PC CY Q +P+FDP SST+K C++ + C Y + Y D S++
Sbjct: 93 CLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPDHS-----------CPYKLVYDDKSYT 139
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRT 237
G LATETVT+ ST+G +P GC NN G F ++GIVGL G +SLISQM
Sbjct: 140 QGTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQM-- 197
Query: 238 TIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
G G GVVST + T + Y L +DA+SVG+ R+
Sbjct: 198 ----------------------GGAYPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRI 235
Query: 295 G-VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
V TP +IVIDSGT LT+ P Y + + + ++ A V DP+ + LCY N
Sbjct: 236 ETVGTPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSN 295
Query: 348 SLSQVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLV 404
++ P +T+HF GAD+ L + N +++++ + C ++ V I+GN Q NFLV
Sbjct: 296 TIEIFPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLV 355
Query: 405 GYD 407
GYD
Sbjct: 356 GYD 358
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 164/428 (38%), Positives = 235/428 (54%), Gaps = 32/428 (7%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN--RLNHFNQNSSISSSKASQA 82
+ GGFSV+ IHRDS +SPF S P+ R A RSL L + +S + +A
Sbjct: 26 EAGGFSVDFIHRDSARSPFAQPSLPPHARALAAARRSLRGAALGRYVGGASPAPGPVPEA 85
Query: 83 D------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
D II + YL+ +++GTPP + LA+ADTGSDL+W C + +F
Sbjct: 86 DGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFH 145
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P S+TY L C S+ C +L+Q SC CQY +YGDGS + G L+TET + + G
Sbjct: 146 PSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGG 205
Query: 196 A---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPV 250
V +P ++FGC T + G F S G+VGLG G +SL+SQ+ IA +FSYCLVP
Sbjct: 206 GEGQVRVPRVSFGCSTGSAGSFRSD--GLVGLGAGALSLVSQLGAAARIARRFSYCLVPP 263
Query: 251 -----SSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLG-VSTPDIV 302
SS+ ++FG +VS PG STPL ++ ++Y + +++++V Q + ++ I+
Sbjct: 264 YAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSRII 323
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VPEVTI 357
+DSGTTLTFL L++ + I P L+LCY SQ +P+VT+
Sbjct: 324 VDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVTL 383
Query: 358 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVS 414
F GA V L N F + E +C V ++ S P I GNI Q NF VGYD++ +TV+
Sbjct: 384 RFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVT 443
Query: 415 FKPTDCTK 422
F DCT+
Sbjct: 444 FAAVDCTR 451
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 161/434 (37%), Positives = 235/434 (54%), Gaps = 57/434 (13%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F+ LCF + + + GF+++LIHR
Sbjct: 3 LATTIIVLFLQISLCF-LFTTTASPPHGFTMDLIHR------------------------ 37
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
R N ++ S+ S + A+ + +N+ YL+++ +GTPP E A+ DTGS++ WTQC
Sbjct: 38 ---RSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCL 94
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PC CY Q++P+FDP SST+K +K C G +C Y V Y D +++ G
Sbjct: 95 PC--VHCYEQNAPIFDPSKSSTFK-------------EKRCDGHSCPYEVDYFDHTYTMG 139
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
LATET+TL ST+G+ +P GCG NN F +G+VGL G SLI+QM
Sbjct: 140 TLATETITLHSTSGEPFVMPETIIGCGHNN-SWFKPSFSGMVGLNWGPSSLITQMGGEYP 198
Query: 241 GKFSYCLVPVSSTKINFGTNGIVSGPGVVSTP--LTKAKT-FYVLTIDAISVGNQRLGVS 297
G SYC ++KINFG N IV+G GVVST +T AK FY L +DA+SVGN R+
Sbjct: 199 GLMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETM 258
Query: 298 -------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
+IVIDSGTTLT+ P Y + + + ++ A ADPTG+ LCY+ +++
Sbjct: 259 GTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTID 318
Query: 351 QVPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
P +T+HF G D+ L + N +++ + + C ++ I+GN Q NFLVGYD
Sbjct: 319 IFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYD 378
Query: 408 IEQQTVSFKPTDCT 421
VSF PT+C+
Sbjct: 379 SSSLLVSFSPTNCS 392
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 254 bits (648), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 161/422 (38%), Positives = 238/422 (56%), Gaps = 38/422 (9%)
Query: 26 TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISSSKAS 80
T GF V L H DS K+ T +R++ + R +RL N +S+ S
Sbjct: 44 TNGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQL 97
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+A I N YLI ++IGTPP AV DTGSDLIWTQC+PC ++CY Q +P+FDPK S
Sbjct: 98 EAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TRCYKQPTPIFDPKKS 155
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S++ + C SS C++L +CS C+Y SYGD S + G LATET T G + + V++
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVH 213
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INF 257
I FGCG +N G + +G+VGLG G +SL+SQ++ +FSYCL P+ TK +
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQ---RFSYCLTPIDDTKESVLLL 270
Query: 258 GTNGIVS-GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP----------DIVI 303
G+ G V VV+TPL K +FY L+++AISVG+ RL + ++I
Sbjct: 271 GSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVII 330
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTIHFR 360
DSGTT+T++ Q L S + + L+LC+S S S ++P++ HF+
Sbjct: 331 DSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFK 390
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
G D++L N+ + S V + G ++ + I+GN+ Q N LV +D+E++T+SF PT C
Sbjct: 391 GGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
Query: 421 TK 422
+
Sbjct: 451 DQ 452
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 176/432 (40%), Positives = 232/432 (53%), Gaps = 47/432 (10%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISS 76
++A GF+ ELI RDSP SPFYN+ L A TRS N H++ N S
Sbjct: 30 VKADNFGFTAELIRRDSPNSPFYNA-------LEAAATRSTNASQHYDAQIGRFNLMSDS 82
Query: 77 SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
ASQ+++ + NYLI+IS+GTPP E LA+AD DL W C+ C Q +D F
Sbjct: 83 YYASQSELNFSKGNYLIKISVGTPPAEILALADITGDLTWLPCKTC---QDCTKDGFTFF 139
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY---SVSYGDGSFSN-GNLATETVTLGST 192
P SSTY S C S QC N C C Y + S +N G +A +T++ S+
Sbjct: 140 PSESSTYTSAACESYQCQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSS 199
Query: 193 TGQAVALPGITFGCGT--NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
+GQA++ P F CGT +N + GIVGLG G S+ SQM+ I G FS CLVP
Sbjct: 200 SGQALSYPNTNFICGTFIDNWHYIGA---GIVGLGRGLFSMTSQMKHLINGTFSQCLVPY 256
Query: 251 S---STKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLG---VSTP--D 300
S S+KINFG G+VSG GVVSTP+ Y L ++A+SVG R+ S P +
Sbjct: 257 SSKQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSN 316
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLS--QVPEVTI 357
I ID TT T LP + N+ + + I P+ + L LCY S P +T+
Sbjct: 317 IYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLSLCYKSESDHDFDAPPITM 376
Query: 358 HFRGADVKLSRSNFFVKVSEDIVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIE 409
HF ADV+LS N FV++ ++VC F K IT++V YG+ Q NF+VGYD++
Sbjct: 377 HFTNADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAV--YGSWQQMNFIVGYDLK 434
Query: 410 QQTVSFKPTDCT 421
TVSFK DCT
Sbjct: 435 SSTVSFKQADCT 446
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 159/421 (37%), Positives = 237/421 (56%), Gaps = 37/421 (8%)
Query: 26 TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISSSKASQ 81
T GF V L H DS K+ T +R++ + R +RL N S++ S +
Sbjct: 45 TKGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLE 98
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A I N YL+ ++IGTPP AV DTGSDLIWTQC+PC +QCY Q +P+FDPK SS
Sbjct: 99 APIHAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TQCYKQPTPIFDPKKSS 156
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
++ + C SS C+++ +CS C+Y SYGD S + G LATET T G + + V++
Sbjct: 157 SFSKVSCGSSLCSAVPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVHN 214
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFG 258
I FGCG +N G + +G+VGLG G +SL+SQ++ +FSYCL P+ TK + G
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEP---RFSYCLTPMDDTKESILLLG 271
Query: 259 TNGIVS-GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP----------DIVID 304
+ G V VV+TPL K +FY L+++ ISVG+ RL + ++ID
Sbjct: 272 SLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIID 331
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRG 361
SGTT+T++ Q L S + + L+LC+S S S ++P++ HF+G
Sbjct: 332 SGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKG 391
Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
D++L N+ + S V + G ++ + I+GN+ Q N LV +D+E++T+SF PT C
Sbjct: 392 GDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCD 451
Query: 422 K 422
+
Sbjct: 452 Q 452
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 153/361 (42%), Positives = 209/361 (57%), Gaps = 65/361 (18%)
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
N + ++S Q+++I +YL+ IS+GTPP L +ADTGSDLIW QC PC CY
Sbjct: 7 NTGNQLASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC--DDCY 64
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q PLFDPK S TYK+L G L++ET T
Sbjct: 65 KQVEPLFDPKKSKTYKTL---------------------------------GYLSSETFT 91
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+GST G + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + + G+FSYCLV
Sbjct: 92 IGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLV 151
Query: 249 PVS-----STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVI 303
P+S S+KINFG + +VSG G S+P ++ +I+I
Sbjct: 152 PLSSDSTASSKINFGKSAVVSGSG-TSSPAAAEES---------------------NIII 189
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 363
DSGTTLT LP+ + +++ S ++ +I Q DP G+ LCYS ++P +T HF GAD
Sbjct: 190 DSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEIPTITAHFIGAD 249
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
V+L N FV+ ED+VC F I +S + I+GN+ Q NFLVGYD++ VSFKPTDCTK
Sbjct: 250 VQLPPLNTFVQAQEDLVC--FSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCTK 307
Query: 423 Q 423
Q
Sbjct: 308 Q 308
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 152/418 (36%), Positives = 223/418 (53%), Gaps = 38/418 (9%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
EA+ GF + L H DS K+ T +Q L A+ R RL + ++ +
Sbjct: 35 EAKVTGFQIMLEHVDSGKN------LTKFQLLERAIERGSRRLQRLE--AMLNGPSGVET 86
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + YL+ +SIGTP A+ DTGSDLIWTQC+PC +QC+ Q +P+F+P+ SS+
Sbjct: 87 SVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ +LPCSS C +L+ +CS CQY+ YGDGS + G++ TET+T GS V++P I
Sbjct: 145 FSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGT 259
TFGCG NN G G+VG+G G +SL SQ+ T KFSYC+ P+ S + + G+
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPSNLLLGS 256
Query: 260 --NGIVSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS-----------TPDIVIDS 305
N + +G P ++ TFY +T++ +SVG+ RL + T I+IDS
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDS 316
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGA 362
GTTLT+ ++ S I V + +LC+ S S Q+P +HF G
Sbjct: 317 GTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG 376
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
D++L N+F+ S ++C + + I+GNI Q N LV YD VSF C
Sbjct: 377 DLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 162/438 (36%), Positives = 229/438 (52%), Gaps = 57/438 (13%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
+L+ +F+LF + +S IEAQ GF+++L + S N
Sbjct: 18 YLAIIFLLFHVLH--LSSIEAQNDGFTIKLFRKTS------------------------N 51
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
+ + QA I +L+ I IGTPP + + DTGSDLIW QC PC
Sbjct: 52 NIQNI-----------VQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPC- 99
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNL 182
CY Q P+FDP SSTY ++ C S C L+ CS C Y+ YGD S + G L
Sbjct: 100 -LGCYKQIKPMFDPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVL 158
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG- 241
A +T T S TG+ V+L FGCG NN G FN G++GLGGG SLISQ+ G
Sbjct: 159 AQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGK 218
Query: 242 KFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL 294
KFS CLVP S++++FG V G GVV+TPL + T Y +T+ ISV +
Sbjct: 219 KFSQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYF 278
Query: 295 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSL 349
+ ++++DSGT LPQ + + + + + +P+ DP+ +LCY +
Sbjct: 279 PMNSTIGKANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN 338
Query: 350 SQVPEVTIHFRGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVP-IYGNIMQTNFLVG 405
+ P +T HF GA+V L+ F+ ++ I C TNS P +YGN Q+N+L+G
Sbjct: 339 LKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIG 398
Query: 406 YDIEQQTVSFKPTDCTKQ 423
+D+++Q VSFKPTDCTKQ
Sbjct: 399 FDLDRQVVSFKPTDCTKQ 416
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 151/355 (42%), Positives = 208/355 (58%), Gaps = 33/355 (9%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + + YL+++ +GTPP E A DTGSDLIWTQC PC + CY Q +P+FDP SS
Sbjct: 52 ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C+G +C Y + Y D ++S G LATETVT+ ST+G+ +P
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
T GCG +N F +G+VGL G SLI+QM G SYC ++KINFGTN
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215
Query: 262 IVSGPGVVSTP--LTKAKT-FYVLTIDAISVGN---QRLGVS----TPDIVIDSGTTLTF 311
IV+G GVVST LT AK Y L +DA+SVG+ + +G + +I+IDSGTTLT+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
P Y + + + + A ADPTG+ LCY +++ P +T+HF GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 371 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+++ ++ C I N+ P I+GN Q NFLVGYD VSF PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 223/421 (52%), Gaps = 52/421 (12%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
IEAQ GF+V+LI + S H + N+ Q
Sbjct: 26 IEAQNDGFTVKLIRKSS----------------------------HLSSNNI---QDIVQ 54
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A I YL+ + IGTPP + DTGSDLIW QC PC CY Q +P+FDP SS
Sbjct: 55 APINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPC--LGCYNQINPMFDPLKSS 112
Query: 142 TYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
TY ++ C S C CS C Y+ Y D S + G LA ETVTL S TG+ ++L
Sbjct: 113 TYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQ 172
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCLVP-----VSSTK 254
GI FGCG NN G FN G++GLGGG SL+SQ+ G KFS CLVP S++
Sbjct: 173 GILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQ 232
Query: 255 INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP----DIVIDSGT 307
++FG V G GVV+TPL + + T Y +T+ ISV + L +++ ++++DSGT
Sbjct: 233 MSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDSGT 292
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 366
LPQ + + + + +P+ DP+ +LCY + + P +T HF GA++ L
Sbjct: 293 PPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLL 352
Query: 367 SRSNFFVKVSED---IVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ F+ + + + C NS P IYGN QTN+L+G+D+++Q VSFKPTDCTK
Sbjct: 353 TPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDCTK 412
Query: 423 Q 423
Q
Sbjct: 413 Q 413
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 150/355 (42%), Positives = 207/355 (58%), Gaps = 33/355 (9%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + + YL+++ +GTPP E A DTGSDLIWTQC PC + CY Q +P+FDP SS
Sbjct: 52 ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C+G +C Y + Y D ++S G LATETVT+ ST+G+ +P
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
T GCG +N F +G+VGL G SLI+QM G SYC ++KINFGTN
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215
Query: 262 IVSGPGVVSTP--LTKAKT-FYVLTIDAISVGN---QRLGVS----TPDIVIDSGTTLTF 311
IV+G GVVST LT AK Y L +DA+SVG+ + +G + +I+IDSGTTLT+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
P Y + + + + A ADPTG+ LCY +++ P +T+HF GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 371 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+++ ++ C I N+ P I+GN Q NFLVGYD V F PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 212/357 (59%), Gaps = 32/357 (8%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + + + YL+R+ +GTPP E +A DTGSDLIWTQC PCP CY Q +P+FDP SS
Sbjct: 52 ADTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCP--NCYTQFAPIFDPSKSS 109
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C G +C Y + Y D S+S G LATETVT+ ST+G+ +
Sbjct: 110 TFK-------------EKRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAE 156
Query: 202 ITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
+ GCG NN L + + ++GIVGL G SLISQM I G SYC ++KINF
Sbjct: 157 TSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINF 216
Query: 258 GTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLG-VSTP------DIVIDSGTT 308
GTN +V+G G V+ + K + FY L +DA+SVG++R+ + TP +I IDSGTT
Sbjct: 217 GTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTT 276
Query: 309 LTFLPQGY-NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKL 366
T+LP Y N +V +S++ A V DP+ LCY+++++ P +T+HF GAD+ L
Sbjct: 277 YTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTMEIFPVITLHFAGGADLVL 336
Query: 367 SRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ N +V+ ++ C + S+P I+GN N LVGYD +SF PT+C+
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 244 bits (622), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 159/417 (38%), Positives = 243/417 (58%), Gaps = 43/417 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
GF V L H DS K+ T +R+R + R NRL + ++SS + +A ++P
Sbjct: 39 GFRVRLKHVDSGKN------LTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLP 92
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L++++IGTPP A+ DTGSDLIWTQC+PC +QC+ Q +P+FDPK SS++ L
Sbjct: 93 GNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPC--TQCFHQSTPIFDPKKSSSFSKL 150
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS C +L Q SC+ C+Y SYGD S + G LA+ET+T G ++P + FGC
Sbjct: 151 SCSSQLCEALPQSSCNN-GCEYLYSYGDYSSTQGILASETLTFGK-----ASVPNVAFGC 204
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV--- 263
G +N G S+ G+VGLG G +SL+SQ++ KFSYCL V TK + G +
Sbjct: 205 GADNEGSGFSQGAGLVGLGRGPLSLVSQLKEP---KFSYCLTTVDDTKTSTLLMGSLASV 261
Query: 264 --SGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTT 308
S + +TPL + +FY L+++ ISVG+ RL + + ++IDSGTT
Sbjct: 262 NASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTT 321
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEVTIHFRGAD 363
+T+L + + + ++ I PV D +GS L++C++ S S +VP++ HF GAD
Sbjct: 322 ITYLEESAFNLVAKEFTAKINL-PV-DSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGAD 379
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++L N+ + S V + G ++ + I+GN+ Q N LV +D+E++T+SF PT C
Sbjct: 380 LELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/418 (35%), Positives = 219/418 (52%), Gaps = 38/418 (9%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
E + GF + L H DS K+ T ++ L A+ R RL + ++ +
Sbjct: 35 EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + YL+ +SIGTP A+ DTGSDLIWTQC+PC +QC+ Q +P+F+P+ SS+
Sbjct: 87 PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ +LPCSS C +L +CS +CQY+ YGDGS + G++ TET+T GS V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGT 259
TFGCG NN G G+VG+G G +SL SQ+ T KFSYC+ P+ S+ + G+
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSNSSTLLLGS 256
Query: 260 --NGIVSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS-----------TPDIVIDS 305
N + +G P ++ TFY +T++ +SVG+ L + T I+IDS
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDS 316
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGA 362
GTTLT+ + S + V + +LC+ S S Q+P +HF G
Sbjct: 317 GTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGG 376
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
D+ L N+F+ S ++C + + I+GNI Q N LV YD VSF C
Sbjct: 377 DLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/418 (35%), Positives = 220/418 (52%), Gaps = 38/418 (9%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
E + GF + L H DS K+ T ++ L A+ R RL + ++ +
Sbjct: 35 EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + YL+ +SIGTP A+ DTGSDLIWTQC+PC +QC+ Q +P+F+P+ SS+
Sbjct: 87 PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ +LPCSS C +L +CS +CQY+ YGDGS + G++ TET+T GS V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGT 259
TFGCG NN G G+VG+G G +SL SQ+ T KFSYC+ P+ +S+ + G+
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTSSTLLLGS 256
Query: 260 --NGIVSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS-----------TPDIVIDS 305
N + +G P ++ TFY +T++ +SVG+ L + T I+IDS
Sbjct: 257 LANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDS 316
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGA 362
GTTLT+ + S + V + +LC+ S S Q+P +HF G
Sbjct: 317 GTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGG 376
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
D+ L N+F+ S ++C + + I+GNI Q N LV YD VSF C
Sbjct: 377 DLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 241 bits (614), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 159/419 (37%), Positives = 244/419 (58%), Gaps = 43/419 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
GF +L H DS K+ T ++R++ + R +RL F + ++SS + A ++P
Sbjct: 39 GFRAKLKHVDSGKNL------TKFERIQHGVKRGRHRLQRFKAMALVASSNSEIDAPVLP 92
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L++++IGTPP A+ DTGSDLIWTQC+PC +QC+ Q +P+FDPK SS++ L
Sbjct: 93 GNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPTPIFDPKKSSSFSKL 150
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS C +L Q +CS C+Y YGD S + G LA+ET+T G V++P + FGC
Sbjct: 151 SCSSKLCEALPQSTCSD-GCEYLYGYGDYSSTQGMLASETLTFGK-----VSVPEVAFGC 204
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV--- 263
G +N G S+ +G+VGLG G +SL+SQ++ KFSYCL V TK + G +
Sbjct: 205 GEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEP---KFSYCLTSVDDTKASTLLMGSLASV 261
Query: 264 --SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTT 308
S + +TPL + +FY L+++ ISVG+ L + + ++IDSGTT
Sbjct: 262 KASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTT 321
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEVTIHFRGAD 363
+T+L Q + +S I PV D +GS LE+C++ S S +VP++ HF GAD
Sbjct: 322 ITYLEQSAFDLVAKEFTSQINL-PV-DNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGAD 379
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
++L N+ + + V + G ++ + I+GNI Q N LV +D+E++T+SF PT C +
Sbjct: 380 LELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDE 438
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 241 bits (614), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 164/436 (37%), Positives = 229/436 (52%), Gaps = 62/436 (14%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F L + +++++ + GF+++LIHR S S +R
Sbjct: 3 LATTMIAIF-LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SR 45
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
N + + AD + + YL+++ IGTPP E AV DTGS+ IWTQC
Sbjct: 46 VFN-----------TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCL 94
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PC CY Q +P+FDP SST+K + C + + C Y + YG S++ G
Sbjct: 95 PC--VHCYNQTAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKG 141
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
L TETVT+ ST+GQ +P GCG NN G F G+VGL G SLI+QM
Sbjct: 142 TLVTETVTIHSTSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYP 200
Query: 241 GKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLG-V 296
G SYC ++KINFG N IV+G GVVST + T FY L +DA+SVGN R+ V
Sbjct: 201 GLMSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 260
Query: 297 STP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
TP +IVIDSG+TLT+ P+ Y + + + ++ A V P + LCY ++
Sbjct: 261 GTPFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTID 317
Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVG 405
P +T+HF GAD+ L + N +V + + C I NS I+GN Q NFLVG
Sbjct: 318 IFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVG 375
Query: 406 YDIEQQTVSFKPTDCT 421
YD VSFKPT+C+
Sbjct: 376 YDSSSLLVSFKPTNCS 391
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 159/417 (38%), Positives = 243/417 (58%), Gaps = 43/417 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
GF + L H DS K+ T +QR++ + R+ +RL N +SS A + ++
Sbjct: 42 GFRITLKHVDSDKN------LTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLS 95
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L+ ++IGTPP A+ DTGSDLIWTQC+PC +QC+ Q SP+FDPK SS++ L
Sbjct: 96 GNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPSPIFDPKKSSSFSKL 153
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS C +L Q SCS +C+Y +YGD S + G +ATET T G V++P + FGC
Sbjct: 154 SCSSQLCKALPQSSCSD-SCEYLYTYGDYSSTQGTMATETFTFGK-----VSIPNVGFGC 207
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FGTNGIV 263
G +N G ++ +G+VGLG G +SL+SQ++ KFSYCL + TK + G+ V
Sbjct: 208 GEDNEGDGFTQGSGLVGLGRGPLSLVSQLK---EAKFSYCLTSIDDTKTSTLLMGSLASV 264
Query: 264 SG--PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTT 308
+G + +TPL + +FY L+++ ISVG RL + T ++IDSGTT
Sbjct: 265 NGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTT 324
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEVTIHFRGAD 363
+T+L + + +S + PV D +G+ LELCY+ S + +VP++ +HF GAD
Sbjct: 325 ITYLEESAFDLVKKEFTSQM-GLPV-DNSGATGLELCYNLPSDTSELEVPKLVLHFTGAD 382
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++L N+ + S V + G + + I+GN+ Q N V +D+E++T+SF PT+C
Sbjct: 383 LELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNC 439
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 144/375 (38%), Positives = 212/375 (56%), Gaps = 31/375 (8%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
N L ++ +S + + AD + + + YL+++ +GTPP E +A DTGSD+IWTQC PC
Sbjct: 393 NFLVGYDSSSLLLQGASPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPC 452
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
P CY Q +P+FDP SST++ ++ C+G +C Y + Y D ++S G L
Sbjct: 453 P--NCYSQFAPIFDPSKSSTFR-------------EQRCNGNSCHYEIIYADKTYSKGIL 497
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTT 238
ATETVT+ ST+G+ + GCG +N L F S ++GIVGL G +SLISQM
Sbjct: 498 ATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLP 557
Query: 239 IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLG- 295
G SYC ++KINFGTN IV+G G V+ + K FY L +DA+SV + +
Sbjct: 558 YPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIAT 617
Query: 296 VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
+ TP +I IDSGTTLT+ P Y + + + ++ A V D LCY +++
Sbjct: 618 LGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTI 677
Query: 350 SQVPEVTIHFR-GADVKLSRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGY 406
P +T+HF GAD+ L + N +++ ++ I C S+P ++GN Q NFLVGY
Sbjct: 678 DIFPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGY 737
Query: 407 DIEQQTVSFKPTDCT 421
D +SF PT+C+
Sbjct: 738 DPSSNVISFSPTNCS 752
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 158/423 (37%), Positives = 225/423 (53%), Gaps = 57/423 (13%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F+ CF + + + G F+++LI R S S F RL
Sbjct: 18 LATTMIVLFLQIITCFLFTTTVSSPHG-FTIDLIQRRSNSSSF---------RL------ 61
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S N+L + AD + + YL+++ +GTPP E A DTGSDLIWTQC
Sbjct: 62 SKNQLQ----------GASPYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCM 111
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PCP CY Q P+FDP SST+ N++ C G +C Y + Y D ++S G
Sbjct: 112 PCP--DCYSQFDPIFDPSKSSTF-------------NEQRCHGKSCHYEIIYEDNTYSKG 156
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMR 236
LATETVT+ ST+G+ + T GCG +N L F S ++GIVGL G SLISQM
Sbjct: 157 ILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMD 216
Query: 237 TTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRL 294
G SYC ++KINFGTN IV+G G V+ + K FY L +DA+SV + R+
Sbjct: 217 LPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRI 276
Query: 295 G-VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
+ TP +IVIDSG+T+T+ P Y + + + ++ A V DP+G+ LCY
Sbjct: 277 ETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSE 336
Query: 348 SLSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVC-SVFKGITNSVPIYGNIMQTNFLV 404
++ P +T+HF GAD+ L + N +++ S + C ++ I+GN Q NFLV
Sbjct: 337 TIDIFPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLV 396
Query: 405 GYD 407
GYD
Sbjct: 397 GYD 399
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 161/426 (37%), Positives = 223/426 (52%), Gaps = 61/426 (14%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
L + +++++ + GF+++LIHR S S +R N
Sbjct: 6 LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SRVFN------- 42
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ + AD + + YL+++ IGTPP E AV DTGS+ IWTQC PC CY Q
Sbjct: 43 ----TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQ 96
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
+P+FDP SST+K + C + + C Y + YG S++ G L TETVT+
Sbjct: 97 TAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKGTLVTETVTIH 145
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
ST+GQ +P GCG NN G F G+VGL G SLI+QM G SYC
Sbjct: 146 STSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK 204
Query: 251 SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLG-VSTP------D 300
++KINFG N IV+G GVVST + T FY L +DA+SVGN R+ V TP +
Sbjct: 205 GTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGN 264
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 360
IVIDSG+TLT+ P+ Y + + + ++ A V P + LCY ++ P +T+HF
Sbjct: 265 IVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTIDIFPVITMHFS 321
Query: 361 -GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVGYDIEQQTVSF 415
GAD+ L + N +V + + C I NS I+GN Q NFLVGYD VSF
Sbjct: 322 GGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSF 379
Query: 416 KPTDCT 421
KPT+C+
Sbjct: 380 KPTNCS 385
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 149/418 (35%), Positives = 226/418 (54%), Gaps = 40/418 (9%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ GF V L H DS + T ++RL+ A+ R RL + ++ S + +A +
Sbjct: 38 EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
N +L+ ++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDP+ SS++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
LPCSS C +L SCS C+Y SYGD S + G LATET T G + + I F
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDAS-----VSKIGF 202
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFGTN 260
GCG +N G S+ G+VGLG G +SLISQ+ KFSYCL + +K + G+
Sbjct: 203 GCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSE 259
Query: 261 GIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGT 307
V + TPL + +FY L+++ ISVG+ L + + ++IDSGT
Sbjct: 260 ATVKS--AIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGT 317
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADV 364
T+T+L + L S ++ A + LELC++ S +VP++ HF G D+
Sbjct: 318 TITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDL 377
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
KL + N+ ++ S V + G ++ + I+GN Q N +V +D+E++T+SF P C +
Sbjct: 378 KLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 149/418 (35%), Positives = 225/418 (53%), Gaps = 40/418 (9%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ GF V L H DS + T ++RL+ A+ R RL + ++ S + +A +
Sbjct: 38 EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
N +L+ ++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDP+ SS++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
LPCSS C +L SCS C+Y SYGD S + G LATET T G + + I F
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDAS-----VSKIGF 202
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFGTN 260
GCG +N G S+ G+VGLG G +SLISQ+ KFSYCL + +K + G+
Sbjct: 203 GCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSE 259
Query: 261 GIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGT 307
V + TPL + +FY L+++ ISVG+ L + + ++IDSGT
Sbjct: 260 ATVKS--AIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGT 317
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADV 364
T+T+L + L S ++ A + LELC++ S VP++ HF G D+
Sbjct: 318 TITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDL 377
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
KL + N+ ++ S V + G ++ + I+GN Q N +V +D+E++T+SF P C +
Sbjct: 378 KLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 161/436 (36%), Positives = 235/436 (53%), Gaps = 54/436 (12%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN---SSISSSKASQADI 84
GFSVE IHRDS +SPF++ S T R+ +A RS R +++ S+ +++
Sbjct: 34 GFSVEFIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSEL 93
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP----------- 133
YL+ ++IGTPPT +A+ADTGSDLIW C Y D P
Sbjct: 94 TSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCS-------YGGDGPGLAAARDADAQ 146
Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP S+T++ + C S C+ L + SC + C+YS SYGDGS ++G L+TET T
Sbjct: 147 PPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFT 206
Query: 189 LGSTTGQ-----AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAG 241
G + + FGC T G +S G+VGLGGGD+SL+SQ+ T++
Sbjct: 207 FADAPGARGDGTTTRVANVNFGCSTTFVG--SSVGDGLVGLGGGDLSLVSQLGADTSLGR 264
Query: 242 KFSYCLVPVS---STKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV 296
+FSYCLVP S S+ +NFG V+ PG V+TPL ++ K +Y++ + ++ VGN+
Sbjct: 265 RFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTF-- 322
Query: 297 STPD---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-- 351
PD +++DSGTTLTFLP+ L+ ++ I+ P P L LC+ + + +
Sbjct: 323 EAPDRSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQ 382
Query: 352 ----VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLV 404
+P+VT+ GA V L N FV+V E +C ++ P I GNI Q N V
Sbjct: 383 VAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNIAQQNMHV 442
Query: 405 GYDIEQQTVSFKPTDC 420
GYD+++ TV+F P C
Sbjct: 443 GYDLDKGTVTFAPAAC 458
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 161/458 (35%), Positives = 242/458 (52%), Gaps = 59/458 (12%)
Query: 1 MATFLSCVFILFFLCFYV----VSPIEAQTGG---------FSVELIHRDSPKSPFYNSS 47
MA+ S + I+ L V VSP + + G F V L H DS +
Sbjct: 1 MASSGSHMIIVILLALAVSSALVSPAASTSRGLDRRPEKTWFRVSLRHVDS------GGN 54
Query: 48 ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
T ++RL+ A+ R RL + ++ S + +A + N +L++++IGTP A+
Sbjct: 55 YTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGNGEFLMKLAIGTPAETYSAI 113
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
DTGSDLIWTQC+PC C+ Q +P+FDPK SS++ LPCSS CA+L SCS C+
Sbjct: 114 MDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSD-GCE 170
Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
Y SYGD S + G LATET G + + I FGCG +N G S+ G+VGLG G
Sbjct: 171 YLYSYGDYSSTQGVLATETFAFGDAS-----VSKIGFGCGEDNDGSGFSQGAGLVGLGRG 225
Query: 228 DISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGVVSTPLTK---AKTF 279
+SLISQ+ KFSYCL + +K G + ++ G ++TPL + +F
Sbjct: 226 PLSLISQLGEP---KFSYCLTSMDDSK---GISSLLVGSEATMKNAITTPLIQNPSQPSF 279
Query: 280 YVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 329
Y L+++ ISVG+ L + + ++IDSGTT+T+L + L S ++
Sbjct: 280 YYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLK 339
Query: 330 AQPVADPTGS--LELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF 384
D +GS L+LC++ S VP++ HF GAD+KL N+ + S V +
Sbjct: 340 LD--VDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGADLKLPAENYIIADSGLGVICLT 397
Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
G ++ + I+GN Q N +V +D+E++T+SF P C +
Sbjct: 398 MGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 141/413 (34%), Positives = 215/413 (52%), Gaps = 38/413 (9%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G V+L DS K+ T Y+ ++ A+ R R+ N + + SS + +
Sbjct: 41 GLRVDLEQVDSGKN------LTKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAG 92
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ ++IGTP + A+ DTGSDLIWTQCEPC +QC+ Q +P+F+P+ SS++ +LP
Sbjct: 93 DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLP 150
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S C L ++C+ CQY+ YGDGS + G +ATET T + ++P I FGCG
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTF-----ETSSVPNIAFGCG 205
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
+N G G++G+G G +SL SQ+ G+FSYC+ S+ + G+
Sbjct: 206 EDNQGFGQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSPSTLALGSAASGV 262
Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTF 311
G ST L + T+Y +T+ I+VG LG+ T ++IDSGTTLT+
Sbjct: 263 PEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTY 322
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGADVKLSR 368
LPQ + + + I V + + L C+ S S QVPE+++ F G + L
Sbjct: 323 LPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGE 382
Query: 369 SNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N + +E ++C + + I+GNI Q V YD++ VSF PT C
Sbjct: 383 QNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 156/434 (35%), Positives = 238/434 (54%), Gaps = 56/434 (12%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSKASQADIIP 86
GF + L H DS K+ T Q+++ + R +RLN + ++ +SK + I
Sbjct: 44 GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIK 97
Query: 87 -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ +L+ +SIG P + A+ DTGSDLIWTQC+PC ++C+ Q +P+FDP+ SS
Sbjct: 98 APTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSS 155
Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
+Y + CSS C +L + +C+ C+Y +YGD S + G LATET T ++
Sbjct: 156 SYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN----SI 211
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
GI FGCG N G S+ +G+VGLG G +SLISQ++ T KFSYCL + ++
Sbjct: 212 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSL 268
Query: 255 -INFGTNGIVSGPGV-VSTPLTKAKT---------FYVLTIDAISVGNQRLGVS------ 297
I +GIV+ G + +TK + FY L + I+VG +RL V
Sbjct: 269 FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFEL 328
Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ 351
T ++IDSGTT+T+L + L +S + + PV D +GS L+LC+ ++
Sbjct: 329 AEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAK 386
Query: 352 ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
VP++ HF+GAD++L N+ V S V + G +N + I+GN+ Q NF V +D+
Sbjct: 387 NIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDL 446
Query: 409 EQQTVSFKPTDCTK 422
E++TVSF PT+C K
Sbjct: 447 EKETVSFVPTECGK 460
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 137/353 (38%), Positives = 185/353 (52%), Gaps = 55/353 (15%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NN YL++ISIGTPP + + DTGSDL+WTQC PC CY Q +P+FDP S+++K +
Sbjct: 20 NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEV 77
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C S QC L+ T T L I FGC
Sbjct: 78 SCESQQCRLLD--------------------------TPTSILN-----------IVFGC 100
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG--KFSYCLVPVSS-----TKINFGT 259
G NN G FN G+ G GG +SL SQ+ +T+ KFS CLVP + +KI FG
Sbjct: 101 GHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGP 160
Query: 260 NGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTP-------DIVIDSGTTLT 310
VSG VVSTPL T+Y +T+D ISVG++ S+ ++ ID+GT T
Sbjct: 161 EAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPT 220
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 370
LP+ + + L+ + I +PV DP +LCY +L P +T HF GADV+L N
Sbjct: 221 LLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLN 280
Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
F+ E + C + I I+GN +Q NFL+G+D++ + VSFK DCTKQ
Sbjct: 281 TFISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTKQ 333
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 136/392 (34%), Positives = 208/392 (53%), Gaps = 33/392 (8%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
T Y+ ++ A+ R R+ N + + SS + + + YL+ ++IGTP + A+
Sbjct: 56 TKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAGSGEYLMNVAIGTPASSLSAIM 113
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSDLIWTQCEPC +QC+ Q +P+F+P+ SS++ +LPC S C L +SC +CQY
Sbjct: 114 DTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSESCYN-DCQY 170
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
+ YGDGS + G +ATET T + ++P I FGCG +N G G++G+G G
Sbjct: 171 TYGYGDGSSTQGYMATETFTF-----ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGP 225
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSGPGVVSTPLTKAK---TFYVL 282
+SL SQ+ G+FSYC+ S+ + G+ G ST L + T+Y +
Sbjct: 226 LSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYI 282
Query: 283 TIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
T+ I+VG LG+ T ++IDSGTTLT+LPQ + + + I P
Sbjct: 283 TLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSP 342
Query: 333 VADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGIT 388
V + + L C+ S QVPE+++ F G + L N + +E ++C ++
Sbjct: 343 VDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEENVLISPAEGVICLAMGSSSQ 402
Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ I+GNI Q V YD++ VSF PT C
Sbjct: 403 QGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 156/436 (35%), Positives = 239/436 (54%), Gaps = 60/436 (13%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF + L H DS K+ T Q+++ + R +RLN + ++ AS D N
Sbjct: 45 GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAV--ASNPDDTNN 96
Query: 88 --------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+ +L+ +SIG P + A+ DTGSDLIWTQC+PC ++C+ Q +P+FDP+
Sbjct: 97 IKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEK 154
Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
SS+Y + CSS C +L + +C+ +C+Y +YGD S + G LATET T
Sbjct: 155 SSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDEN---- 210
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--- 254
++ GI FGCG N G S+ +G+VGLG G +SLISQ++ T KFSYCL + ++
Sbjct: 211 SISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASS 267
Query: 255 ---INFGTNGIVSGPGV-VSTPLTKAKT---------FYVLTIDAISVGNQRLGVS---- 297
I +GIV+ G + +TK + FY L + I+VG +RL V
Sbjct: 268 SLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTF 327
Query: 298 ------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSL 349
T ++IDSGTT+T+L + L +S + + PV D +GS L+LC+ +
Sbjct: 328 ELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPNA 385
Query: 350 SQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
++ VP++ HF+GAD++L N+ V S V + G +N + I+GN+ Q NF V +
Sbjct: 386 AKNIAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLH 445
Query: 407 DIEQQTVSFKPTDCTK 422
D+E++TV+F PT+C K
Sbjct: 446 DLEKETVTFVPTECGK 461
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 135/369 (36%), Positives = 195/369 (52%), Gaps = 33/369 (8%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + +Y+ IS+GTP +ADTGSDLIW QC+PC C+ Q P+FDP+ S
Sbjct: 30 ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+Y ++ C + C SL +KSCS NC YS YGDGS + G L++ETVTL ST G+ +A
Sbjct: 88 SSYTTMSCGDTLCDSLPRKSCS-PNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
I FGCG N G FN +G+VGLG G++S +SQ+ KFSYCLVP ++ +
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205
Query: 256 NFGTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVST------PD-- 300
FG G TP+ ++FY + + IS+ + L + PD
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-----LSQVP 353
++ DSGTTLT LP +L + S + + + L+LCY + ++P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIP 325
Query: 354 EVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
+ HF GAD +L N+F+ ++ IVC + IYGN+MQ NF V YDI
Sbjct: 326 AMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSS 385
Query: 412 TVSFKPTDC 420
+ + P+ C
Sbjct: 386 KIGWAPSQC 394
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 135/369 (36%), Positives = 195/369 (52%), Gaps = 33/369 (8%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + +Y+ IS+GTP +ADTGSDLIW QC+PC C+ Q P+FDP+ S
Sbjct: 30 ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+Y ++ C + C SL +KSCS +C YS YGDGS + G L++ETVTL ST G+ +A
Sbjct: 88 SSYTTMSCGDTLCDSLPRKSCS-PDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
I FGCG N G FN +G+VGLG G++S +SQ+ KFSYCLVP ++ +
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205
Query: 256 NFGTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVST------PD-- 300
FG G TP+ ++FY + + IS+ + L + PD
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVP 353
++ DSGTTLT LP +L + S I + + L+LCY + ++P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKMKIP 325
Query: 354 EVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
+ HF GAD +L N+F+ ++ IVC + IYGN+MQ NF V YDI
Sbjct: 326 AMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSS 385
Query: 412 TVSFKPTDC 420
+ + P+ C
Sbjct: 386 KIGWAPSQC 394
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 159/432 (36%), Positives = 231/432 (53%), Gaps = 45/432 (10%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A GGFSVE IHRDSP+SPF++ + T + R A RS+ R ++S S+S AD
Sbjct: 29 ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88
Query: 84 -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
++ + YL+ +++G+PP LA+ADTGSDL+W +C+ P +Q
Sbjct: 89 DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP SSTY + C + C +L + +C G NC Y +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200
Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGK 242
+ + V + G+ FGC T G F + +G G +SL++Q+ T++ +
Sbjct: 201 FDDGGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGG--GAVSLVTQLGGATSLGRR 258
Query: 243 FSYCLVPVS---STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLG-V 296
FSYCLVP S S+ +NFG V+ PG STPL T+Y + +D++ VGN+ +
Sbjct: 259 FSYCLVPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASA 318
Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQ 351
++ I++DSGTTLTFL ++ +S I PV P G L+LCY+ +
Sbjct: 319 ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGES 378
Query: 352 VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDI 408
+P++T+ F GA V L N FV V E +C T P I GN+ Q N VGYD+
Sbjct: 379 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDL 438
Query: 409 EQQTVSFKPTDC 420
+ TV+F DC
Sbjct: 439 DAGTVTFAGADC 450
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 154/457 (33%), Positives = 244/457 (53%), Gaps = 69/457 (15%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+ +SC+ +L L + + G+ + L H DS ++ T
Sbjct: 8 LQALMSCLVLLTSLAV-------SASSGYRLALTHVDS--------------KIGLTKTE 46
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
+ R H ++ ++S A+ + YL+ ++IGTPP +A+ADTGSDL WTQC+
Sbjct: 47 LMRRAAHRSRLRALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQ 106
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS-LNQKSCSGVN--CQYSVSYGDGSF 177
PC C+ QD+P++DP SST+ +PCSS+ C L ++CS + C+Y SY DG++
Sbjct: 107 PC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAY 164
Query: 178 SNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
S G L TET+TLGS+ GQAV++ + FGCGT+NGG + +TG VGLG G +SL++Q+
Sbjct: 165 SAGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQLG 223
Query: 237 TTIAGKFSYCLVPVSSTKIN----FGTNG-IVSGPGVV-STPLTKA---KTFYVLTIDAI 287
GKFSYCL ++ ++ GT + GPG V STPL ++ + YV+++ I
Sbjct: 224 ---VGKFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGI 280
Query: 288 SVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV---- 333
++G+ RL + ST +V+DSGTT + LP+ ++ ++ ++ PV
Sbjct: 281 TLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASS 340
Query: 334 ------ADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFK 385
P G +L + +P++ +HF GAD++L R N+ ED C
Sbjct: 341 LDSPCFPAPAGERQLPF-------MPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIV 393
Query: 386 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
G T++ + GN Q N + +D+ +SF PTDC+K
Sbjct: 394 GTTSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDCSK 430
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 161/460 (35%), Positives = 233/460 (50%), Gaps = 62/460 (13%)
Query: 16 FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
+V + A+ GFSVE IHRDS KSPF++ + TP+ R A RS R + +
Sbjct: 27 LFVSPAVGAEEDGFSVEFIHRDSVKSPFHDPALTPHGRALAAARRSAARAAELHHLLARR 86
Query: 76 SSKASQ--------ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE------- 120
SS A A+++ YL+ I +GTPP LA+ADTGSDL+W +C+
Sbjct: 87 SSGAPSPGTGAGVVAEVVSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNN 146
Query: 121 -PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDGSF 177
PPS F P SSTY + C + C +L+ SCS +C+Y SYGDGS
Sbjct: 147 STAPPSV-------YFVPSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSR 199
Query: 178 SNGNLATETVTLGSTTGQA-----------------VALPGITFGCGTNNGGLFNSKTTG 220
++G L+TET T + + V + + FGC T G F +
Sbjct: 200 ASGQLSTETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLV 259
Query: 221 IVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTK----INFGTNGIVSGPGVVSTPLT 274
+G G +SL SQ+ T++ KFSYCL P ++T +NFG+ +VS PG STPL
Sbjct: 260 GLGG--GPVSLASQLGATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLI 317
Query: 275 --KAKTFYVLTIDAISV-GNQR-LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
+ +T+Y + +D+I+V G +R + I++DSGTTLT+L + L+ ++ I+
Sbjct: 318 TGEVETYYTIALDSINVAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKL 377
Query: 331 QPVADPTGSLELCYSFNSLS-----QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF 384
P L+LCY + + +P+VT+ G +V L N FV V E ++C
Sbjct: 378 PRAESPEKILDLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLAL 437
Query: 385 KGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ SV I GNI Q N VGYD+E+ TV+F DC K
Sbjct: 438 VATSERQSVSILGNIAQQNLHVGYDLEKGTVTFAAADCAK 477
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 141/410 (34%), Positives = 221/410 (53%), Gaps = 54/410 (13%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQAD----------IIPNNANYLIRISIGTPPTE 103
+RDAL R ++R Q+ S+ + +++D +PN YL+ +SIGTPP
Sbjct: 49 VRDALRRDMHR----QQSRSLFGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPPLS 104
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASL--NQK 159
A+ADTGSDLIWTQC PC QC+ Q +PL++P S+T+ LPC+S S CA + +
Sbjct: 105 YPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGKA 164
Query: 160 SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
G C Y+ +YG G ++ G +ET T GS +PGI FGC + +N +
Sbjct: 165 PPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNG-SA 222
Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTK 275
G+VGLG G +SL+SQ+ AG+FSYCL P S++ + G + ++G GV STP
Sbjct: 223 GLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVA 279
Query: 276 A------KTFYVLTIDAISVGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNS 318
+ T+Y L + IS+G + L +S PD ++IDSGTT+T L
Sbjct: 280 SPAKAPMSTYYYLNLTGISLGAKALSIS-PDAFSLKADGTGGLIIDSGTTITSLVNAAYQ 338
Query: 319 NLLSVMSSMIEAQPV--ADPTGSLELCYSF----NSLSQVPEVTIHFRGADVKLSRSNFF 372
+ + + S++ + +D TG L+LCY+ ++ +P +T+HF GAD+ L ++
Sbjct: 339 QVRAAVQSLVTLPAIDGSDSTG-LDLCYALPTPTSAPPAMPSMTLHFDGADMVLPADSYM 397
Query: 373 VKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ S + C + T+ ++ +GN Q N + YD+ + +SF P C+
Sbjct: 398 ISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCS 446
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 142/418 (33%), Positives = 209/418 (50%), Gaps = 40/418 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-----ASQADI 84
S L+ RD+ Y S + D + R R + S ++ + S++ +
Sbjct: 60 SFALVRRDAVTGSTYPSRR---HAVLDLVARDNARAEYLASRLSPAAYQPTGFSGSESKV 116
Query: 85 I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ + Y +R+ IG+PPTE+ V D+GSD+IW QC+PC +CY Q PLFDP S
Sbjct: 117 VSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPATS 174
Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
+T+ ++PC S+ C +L C SG C Y VSYGDGS++ G LA ET+TLG T A
Sbjct: 175 ATFSAVPCGSAVCRTLRTSGCGDSG-GCDYEVSYGDGSYTKGALALETLTLGGT-----A 228
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
+ G+ GCG N GLF G++GLG G +SL+ Q+ G FSYCL + + G
Sbjct: 229 VEGVAIGCGHRNRGLFVG-AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGAGSLVLG 287
Query: 259 TNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD----IVIDS 305
+ V G V PL + A +FY + + I VG++RL + T D +V+D+
Sbjct: 288 RSEAVP-EGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDT 346
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-A 362
GT +T LPQ + L + + A P A L+ CY + + +VP V+ +F G A
Sbjct: 347 GTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAA 406
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ L N ++V I C F ++ I GNI Q + D + F PT C
Sbjct: 407 TLTLPARNLLLEVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 155/438 (35%), Positives = 229/438 (52%), Gaps = 55/438 (12%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
P G V L H D+ + + T Q LR A RS +R++ ++ S KA+
Sbjct: 49 PAAGLLDGLRVPLTHVDA------HGNYTKLQLLRRAARRSHHRMSRLVARTATGSVKAA 102
Query: 81 -----QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC +C+ Q +P+F
Sbjct: 103 AAPDLQVPVHAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPC--VECFNQSTPVF 160
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTT 193
DP SSTY +LPCSSS C+ L +C+ +C Y+ +YGD S + G LA ET TL T
Sbjct: 161 DPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK 220
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
LPG+ FGCG N G ++ G+VGLG G +SL+SQ+ GKFSYCL + T
Sbjct: 221 -----LPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGL---GKFSYCLTSLDDT 272
Query: 254 K---INFGTNGIV-----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV------ 296
+ G+ + S + +TPL K +FY +T+ A++VG+ R+ +
Sbjct: 273 SKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFA 332
Query: 297 ----STPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS 350
T +++DSGT++T+L QGY + + M PVAD + L+LC+ +
Sbjct: 333 VQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQM--KLPVADGSAVGLDLCFKAPASG 390
Query: 351 ----QVPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLV 404
+VP++ +HF GAD+ L N+ V S +C G + + I GN Q N
Sbjct: 391 VDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMG-SRGLSIIGNFQQQNIQF 449
Query: 405 GYDIEQQTVSFKPTDCTK 422
YD+++ T+SF P C K
Sbjct: 450 VYDVDKDTLSFAPVQCAK 467
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 142/401 (35%), Positives = 217/401 (54%), Gaps = 41/401 (10%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADI---IPNNANYLIRISIGTPPTERLAVADT 110
+RDAL R ++R F + + S + A +PN Y++ ++IGTPP A+ADT
Sbjct: 48 VRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGTPPLSYPAIADT 107
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQ 167
GSDLIWTQC PC SQC+ Q ++P S+T+ LPC+S S CA+L S G +C
Sbjct: 108 GSDLIWTQCAPC-GSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPPGCSCM 166
Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
Y+ +YG G ++ G + ET T GST +PGI FGC + +N + G+VGLG G
Sbjct: 167 YNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNG-SAGLVGLGRG 224
Query: 228 DISLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTKA------K 277
+SL+SQ+ AG FSYCL P S++ + G + ++G GV++TP +
Sbjct: 225 SMSLVSQLG---AGMFSYCLTPFQDANSTSTLLLGPSAALNGTGVLTTPFVASPSKAPMS 281
Query: 278 TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
T+Y L + IS+G L + T ++IDSGTT+T L + + + S+
Sbjct: 282 TYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESL 341
Query: 328 IEAQPVADPTGS--LELCYSFNSLS----QVPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
+ PVAD + S L+LC++ S + +P +T HF GAD+ L N+ + + + C
Sbjct: 342 VTL-PVADGSDSTGLDLCFALTSETSTPPSMPSMTFHFDGADMVLPVDNYMI-LGSGVWC 399
Query: 382 SVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ T ++ +GN Q N + YDI ++T+SF P C+
Sbjct: 400 LAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 140/352 (39%), Positives = 196/352 (55%), Gaps = 31/352 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NN +YL+++++GTPP + + DT SDL+W QC PC CY Q +P+FDP
Sbjct: 27 NNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPC--QGCYKQKNPMFDPL-------- 76
Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
+C S SCS C Y +Y D S + G LA E T ST G+ + + I FG
Sbjct: 77 ----KECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPI-VESIIFG 131
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPV-----SSTKINFGT 259
CG NN G+FN G++GLGGG +SL+SQM K FS CLVP +S I+ G
Sbjct: 132 CGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGE 191
Query: 260 NGIVSGPGVVSTPLT--KAKTFYVLTIDAISVG------NQRLGVSTPDIVIDSGTTLTF 311
VSG GVV+TPL + +T Y++T++ ISVG N +S +I+IDSGT T+
Sbjct: 192 ASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGTPETY 251
Query: 312 LPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 370
LPQ + L+ + I P+ DP +LCY + + P +T HF GADVKL
Sbjct: 252 LPQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLEGPILTAHFEGADVKLLPLQ 311
Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
F+ + + C G T+ + I+GN Q+N L+G+D++++ V FKPTD TK
Sbjct: 312 TFIPPKDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKPTDFTK 363
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 139/425 (32%), Positives = 207/425 (48%), Gaps = 49/425 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 88 ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C YSV+YGDGS++ G LA ET+TLG T
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
A+ G+ GCG N GLF G++GLG G +SLI Q+ G FSYCL +++
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCL----ASRGAG 288
Query: 258 GTNGIVSGP------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD-- 300
G +V G G V PL + A +FY + + I VG +RL + T D
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGA 348
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
+V+D+GT +T LP+ + L + A P + L+ CY + + +VP V+
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVS 408
Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+F +GA + L N V+V + C F ++ + I GNI Q + D V F
Sbjct: 409 FYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGF 468
Query: 416 KPTDC 420
P C
Sbjct: 469 GPNTC 473
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 138/425 (32%), Positives = 207/425 (48%), Gaps = 49/425 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 88 ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C YSV+YGDGS++ G LA ET+TLG T
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
A+ G+ GCG N GLF G++GLG G +SL+ Q+ G FSYCL +++
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL----ASRGAG 288
Query: 258 GTNGIVSGP------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD-- 300
G +V G G V PL + A +FY + + I VG +RL + T D
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGA 348
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
+V+D+GT +T LP+ + L + A P + L+ CY + + +VP V+
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVS 408
Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+F +GA + L N V+V + C F ++ + I GNI Q + D V F
Sbjct: 409 FYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGF 468
Query: 416 KPTDC 420
P C
Sbjct: 469 GPNTC 473
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 139/363 (38%), Positives = 207/363 (57%), Gaps = 44/363 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ +SIG P + A+ DTGSDLIWTQC+PC ++C+ Q +P+FDP+ SS+Y + CSS
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSSSYSKVGCSSGL 58
Query: 153 CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C +L + +C+ C+Y +YGD S + G LATET T ++ GI FGCG N
Sbjct: 59 CNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN----SISGIGFGCGVEN 114
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK------INFGTNGIVS 264
G S+ +G+VGLG G +SLISQ++ T KFSYCL + ++ I +GIV+
Sbjct: 115 EGDGFSQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVN 171
Query: 265 GPGV-VSTPLTKAKT---------FYVLTIDAISVGNQRLGVS----------TPDIVID 304
G + +TK + FY L + I+VG +RL V T ++ID
Sbjct: 172 KTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIID 231
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VPEVTIHF 359
SGTT+T+L + L +S + + PV D +GS L+LC+ ++ VP++ HF
Sbjct: 232 SGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAKNIAVPKMIFHF 289
Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+GAD++L N+ V S V + G +N + I+GN+ Q NF V +D+E++TVSF PT+
Sbjct: 290 KGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTE 349
Query: 420 CTK 422
C K
Sbjct: 350 CGK 352
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 151/409 (36%), Positives = 217/409 (53%), Gaps = 33/409 (8%)
Query: 32 ELIHRDSPKSPFY-NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
ELIHR+ P SP N+S+T + A+ R R +++ ++ + + N
Sbjct: 21 ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERRAQLSKHI-LAEGRLFSTPVASGNGE 79
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
YLI IS G+PP + + DTGSDLIWTQC PC C S +FDP SSTY ++ C+S
Sbjct: 80 YLIDISFGSPPQKASVIVDTGSDLIWTQCLPC--ETCNAAASVIFDPVKSSTYDTVSCAS 137
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
+ C+SL +SC+ +C+Y YGDGS ++G L+TETVT+ +P + FGCG N
Sbjct: 138 NFCSSLPFQSCT-TSCKYDYMYGDGSSTSGALSTETVTV-----GTGTIPNVAFGCGHTN 191
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG-IVSGPGVV 269
G F + GIVGLG G +SLISQ + + KFSYCLVP+ STK + G + GV
Sbjct: 192 LGSF-AGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVA 250
Query: 270 STPL---TKAKTFYVLTIDAISVGNQR----LGVSTPD------IVIDSGTTLTFLPQGY 316
T L T TFY + ISV + +G + D ++DSGTTLT+L G
Sbjct: 251 YTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGA 310
Query: 317 NSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNF 371
+ L++ + + + P + GS L+ C+S ++ P +T HF+GAD +L N
Sbjct: 311 FNALVAALKAEV---PFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENV 367
Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
FV + + + I GNI Q N L+ +D+ Q V FK +C
Sbjct: 368 FVALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 154/459 (33%), Positives = 235/459 (51%), Gaps = 60/459 (13%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA FL V+IL L + +S + G +EL H D Y +E R+R A R
Sbjct: 1 MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50
Query: 61 SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
S R+N F S + + ++A + + A YL+ I+IGTPP AV D
Sbjct: 51 SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110
Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
TGSDLIWTQC+ PC +C+ Q +PL+ P S+TY ++ C S C +L CS
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y SYGDG+ ++G LATET TLGS T A+ G+ FGCGT N G ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FGTNGIVSGPGVVSTPLT------- 274
G G +SL+SQ+ T +FSYC P ++T + G++ +S +TP
Sbjct: 224 GRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFLGSSARLSS-AAKTTPFVPSPSGGA 279
Query: 275 -KAKTFYVLTIDAISVGNQRLGVS------TP----DIVIDSGTTLTFLPQGYNSNLLSV 323
+ ++Y L+++ I+VG+ L + TP ++IDSGTT T L + L
Sbjct: 280 RRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARA 339
Query: 324 MSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
++S + + L LC++ S +VP + +HF GAD++L R ++ V+ V
Sbjct: 340 LASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVA 399
Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + + G++ Q N + YD+E+ +SF+P C
Sbjct: 400 CLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 151/457 (33%), Positives = 234/457 (51%), Gaps = 62/457 (13%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
L L F VV A +G SV + IH D T Q +RDAL R ++R
Sbjct: 27 LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77
Query: 67 ----------HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
++ ++ A +PN YL+ ++IGTPP AVADTGSDLIW
Sbjct: 78 SRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIW 137
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYSVSY 172
TQC PC +QC+ Q +PL++P S+T+ LPC+S S CA + C Y+ +Y
Sbjct: 138 TQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYNQTY 196
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
G G ++ G +ET T GS+ +PG+ FGC + +N + G+VGLG G +SL+
Sbjct: 197 GTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSLSLV 254
Query: 233 SQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTKA------KTFYVL 282
SQ+ AG+FSYCL P S++ + G + ++G GV STP + T+Y L
Sbjct: 255 SQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYL 311
Query: 283 TIDAISVGNQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
+ IS+G + L +S PD ++IDSGTT+T L + + + S++ P
Sbjct: 312 NLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLP 371
Query: 333 VADPTGS--LELCYSFNSLSQ-----VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK 385
D + S L+LC++ + + +P +T+HF GAD+ L ++ + S + C +
Sbjct: 372 TVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGADMVLPADSYMISGS-GVWCLAMR 430
Query: 386 GITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
T+ ++ +GN Q N + YD+ ++T+SF P C+
Sbjct: 431 NQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 467
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 156/459 (33%), Positives = 234/459 (50%), Gaps = 60/459 (13%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA FL V+IL L + +S + G +EL H D Y +E R+R A R
Sbjct: 1 MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50
Query: 61 SLNRLNHFNQNSSISSSKAS-----------QADIIPNNANYLIRISIGTPPTERLAVAD 109
S R+N F SS A +A + + A YL+ I+IGTPP AV D
Sbjct: 51 SHRRVNGFLGAIEGPSSTARLGIDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110
Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCS--GV 164
TGSDLIWTQC+ PC +C+ Q +PL+ P S+TY ++ C S C +L CS
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y SYGDG+ ++G LATET TLGS T A+ G+ FGCGT N G ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FGTNGIVSGPGVVSTPLT------- 274
G G +SL+SQ+ T +FSYC P ++T + G++ +S +TP
Sbjct: 224 GRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFLGSSARLSS-AAKTTPFVPSPSGGA 279
Query: 275 -KAKTFYVLTIDAISVGNQRLGVS------TP----DIVIDSGTTLTFLPQGYNSNLLSV 323
+ ++Y L+++ I+VG+ L + TP ++IDSGTT T L + L
Sbjct: 280 RRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEESAFVALARA 339
Query: 324 MSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
++S + + L LC++ S +VP + +HF GAD++L R ++ V+ V
Sbjct: 340 LASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVA 399
Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + + G++ Q N + YD+E+ +SF+P C
Sbjct: 400 CLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 159/461 (34%), Positives = 226/461 (49%), Gaps = 56/461 (12%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVEL--IHRDSPKSPFYNSSETPYQRLRDALT 59
A S ++ L F ++ G VEL +H D S T Q +R AL
Sbjct: 7 AQMASLAVLIISLVFAALASDSDAAAGVRVELTRVHADP--------SVTASQFVRGALR 58
Query: 60 RSLNRLNHFNQNSSISSSKASQADI--IPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
R ++R N + SS A P YL+ ++IGTPP A+ADTGSDLIWT
Sbjct: 59 RDMHRHNARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWT 118
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCS----GVNCQYSVS 171
QC PC SQC+ Q +PL++P S+T+ LPC+SS CA+ + + G C Y+V+
Sbjct: 119 QCAPC-TSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVT 177
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
YG G +++ +ET T GST +PGI FGC T + G S +G+VGLG G +SL
Sbjct: 178 YGSG-WTSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSL 236
Query: 232 ISQMRTTIAGKFSYCLVPVSSTK------------INFGTNGIVSGPGVVSTPLTKAKTF 279
+SQ+ KFSYCL P T +N GT G+ S P V S TF
Sbjct: 237 VSQLGVP---KFSYCLTPYQDTNSTSTLLLGPSASLN-GTAGVSSTPFVASPSTAPMNTF 292
Query: 280 YVLTIDAISVGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
Y L + IS+G L + PD ++IDSGTT+T L + + + S++
Sbjct: 293 YYLNLTGISLGTTALSIP-PDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV 351
Query: 329 EAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHFRGADVKLSRSNFFVKVSEDIVCS 382
P D + L+LC+ S + +P +T+HF GAD+ L ++ + + C
Sbjct: 352 TL-PTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCL 410
Query: 383 VFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ T+ V I GN Q N + YDI Q+T+SF P C+
Sbjct: 411 AMQNQTDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCSA 451
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 134/416 (32%), Positives = 203/416 (48%), Gaps = 40/416 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 88 ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C YSV+YGDGS++ G LA ET+TLG T
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
A+ G+ GCG N GLF G++GLG G +SL+ Q+ G FSYCL +++
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL----ASRGAG 288
Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGT 307
G +V G +A +FY + + I VG +RL + T D +V+D+GT
Sbjct: 289 GAGSLVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 348
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
+T LP+ + L + A P + L+ CY + + +VP V+ +F +GA +
Sbjct: 349 AVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVL 408
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N V+V + C F ++ + I GNI Q + D V F P C
Sbjct: 409 TLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 156/461 (33%), Positives = 237/461 (51%), Gaps = 67/461 (14%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNR-- 64
L L F VV A +G SV + IH D T Q +RDAL R ++R
Sbjct: 27 LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77
Query: 65 -----------LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
L + +S + S ++ D+ PN YL+ ++IGTPP AVADTGSD
Sbjct: 78 SRSFGRDRDRELAESDGRTSTTVSARTRKDL-PNGGEYLMTLAIGTPPLPYAAVADTGSD 136
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYS 169
LIWTQC PC +QC+ Q +PL++P S+T+ LPC+S S CA + C Y
Sbjct: 137 LIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYY 195
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
+YG G ++ G +ET T GS+ +PG+ FGC + +N + G+VGLG G +
Sbjct: 196 QTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSL 253
Query: 230 SLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTKA------KTF 279
SL+SQ+ AG+FSYCL P S++ + G + ++G GV STP + T+
Sbjct: 254 SLVSQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTY 310
Query: 280 YVLTIDAISVGNQRLGVS------TPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMI 328
Y L + IS+G + L +S PD ++IDSGTT+T L Y +V S ++
Sbjct: 311 YYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLV 370
Query: 329 EAQPVADPTGS--LELCYSFNSLSQ-----VPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
P D + S L+LC++ + + +P +T+HF GAD+ L ++ + S + C
Sbjct: 371 TTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGADMVLPADSYMISGS-GVWC 429
Query: 382 SVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ T+ ++ +GN Q N + YD+ ++T+SF P C+
Sbjct: 430 LAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 470
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 147/433 (33%), Positives = 225/433 (51%), Gaps = 56/433 (12%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-I 85
GG V L H D+ + + + Q L+ A RS +R++ ++ + A D+ +
Sbjct: 38 GGLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQV 91
Query: 86 P---NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
P N +L+ ++IGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP SST
Sbjct: 92 PVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSST 149
Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
Y ++PCSS+ C+ L +C S C Y+ +YGD S + G LA+ET TLG + LPG
Sbjct: 150 YATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGK---EKKKLPG 206
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+ FGCG N G ++ G+VGLG G +SL+SQ+ KFSYCL +S G +
Sbjct: 207 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLD---KFSYCL---TSLDDGDGKSP 260
Query: 262 IVSGPG------------VVSTPLTK---AKTFYVLTIDAISVGNQRLGV---------- 296
++ G V +TPL K +FY +++ ++VG+ R+ +
Sbjct: 261 LLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDD 320
Query: 297 STPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS---- 350
T +++DSGT++T+L QGY + + ++ M A P D + L+LC+ +
Sbjct: 321 GTGGVIVDSGTSITYLELQGYRALKKAFVAQM--ALPTVDGSEIGLDLCFQGPAKGVDEV 378
Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
QVP++ +HF GAD+ L N+ V S + + + I GN Q NF YD+
Sbjct: 379 QVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVA 438
Query: 410 QQTVSFKPTDCTK 422
T+SF P C K
Sbjct: 439 GDTLSFAPVQCNK 451
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 152/430 (35%), Positives = 225/430 (52%), Gaps = 55/430 (12%)
Query: 31 VEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN--HFNQNSSISSSKASQADIIP 86
VEL IH D S T Q +RDAL R ++R N +SS ++ ++ I P
Sbjct: 30 VELTRIHADP--------SVTASQFVRDALRRDMHRHNARQLAASSSNGTTVSAPTQISP 81
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
YL+ ++IGTPP A+ADTGSDLIWTQC PC SQC+ Q +PL++P S+T+ L
Sbjct: 82 TAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPC-SSQCFQQPTPLYNPSSSTTFAVL 140
Query: 147 PCSS--SQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPG 201
PC+S S CA+ + G C Y+++YG G +++ +ET T GS+T +PG
Sbjct: 141 PCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVPG 199
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINF 257
I FGC +GG S +G+VGLG G +SL+SQ+ KFSYCL P S++ +
Sbjct: 200 IAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVP---KFSYCLTPYQDTNSTSTLLL 256
Query: 258 G-------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------- 301
G T G+ S P V S T+Y L + IS+G L + T +
Sbjct: 257 GPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGG 316
Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQ----V 352
+IDSGTT+T L + + + S++ P D TG L+LC+ S + +
Sbjct: 317 FIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGGSAATG-LDLCFELPSSTSAPPTM 374
Query: 353 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQ 411
P +T+HF GAD+ L ++ + + ++ C + T+ V I GN Q N + YD+ Q+
Sbjct: 375 PSMTLHFDGADMVLPADSYMM-LDSNLWCLAMQNQTDGGVSILGNYQQQNMHILYDVGQE 433
Query: 412 TVSFKPTDCT 421
T++F P C+
Sbjct: 434 TLTFAPAKCS 443
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 211 bits (536), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 156/433 (36%), Positives = 225/433 (51%), Gaps = 57/433 (13%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
G V L H D+ + + + +Q LR A RS ++RL ++SSKA+
Sbjct: 40 GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 93
Query: 81 -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP
Sbjct: 94 LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 151
Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SSTY ++PCSS+ C+ L C S C Y+ +YGD S + G LATET TL +
Sbjct: 152 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 206
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
LPG+ FGCG N G S+ G+VGLG G +SL+SQ+ KFSYCL + T +
Sbjct: 207 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPL 263
Query: 259 TNGIVSG--------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------S 297
G ++G V +TPL K +FY +++ AI+VG+ R+ +
Sbjct: 264 LLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDG 323
Query: 298 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 351
T +++DSGT++T+L QGY + + + M A P AD +G L+LC+ + +
Sbjct: 324 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 381
Query: 352 VPEVTIHFR-GADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
VP + HF GAD+ L N+ V +C G + + I GN Q NF YD+
Sbjct: 382 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 440
Query: 410 QQTVSFKPTDCTK 422
T+SF P C K
Sbjct: 441 HDTLSFAPVQCNK 453
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 156/433 (36%), Positives = 225/433 (51%), Gaps = 57/433 (13%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
G V L H D+ + + + +Q LR A RS ++RL ++SSKA+
Sbjct: 30 GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 83
Query: 81 -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP
Sbjct: 84 LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 141
Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SSTY ++PCSS+ C+ L C S C Y+ +YGD S + G LATET TL +
Sbjct: 142 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 196
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
LPG+ FGCG N G S+ G+VGLG G +SL+SQ+ KFSYCL + T +
Sbjct: 197 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPL 253
Query: 259 TNGIVSG--------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------S 297
G ++G V +TPL K +FY +++ AI+VG+ R+ +
Sbjct: 254 LLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDG 313
Query: 298 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 351
T +++DSGT++T+L QGY + + + M A P AD +G L+LC+ + +
Sbjct: 314 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 371
Query: 352 VPEVTIHFR-GADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
VP + HF GAD+ L N+ V +C G + + I GN Q NF YD+
Sbjct: 372 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 430
Query: 410 QQTVSFKPTDCTK 422
T+SF P C K
Sbjct: 431 HDTLSFAPVQCNK 443
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 136/380 (35%), Positives = 215/380 (56%), Gaps = 42/380 (11%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+SS A A + A YL+ ++IGTPP +A+ADTGSDL WTQC+PC C+ QD+P+
Sbjct: 77 TSSDAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPI 134
Query: 135 FDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGS 191
+D +SS++ +PC+S+ C + + ++C+ + C+Y +YGDG++S G L TET+T
Sbjct: 135 YDTAVSSSFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPG 194
Query: 192 TTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
G V++ GI FGCG +NGGL +NS TG VGLG G +SL++Q+ GKFSYCL
Sbjct: 195 APG--VSVGGIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQLGV---GKFSYCLTDF 247
Query: 251 SSTKIN----FGTNGIVSGP----GVVSTPLTKA---KTFYVLTIDAISVGNQRLGV--- 296
+T + FG ++ P V STPL ++ T+Y ++++ IS+G+ RL +
Sbjct: 248 FNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNG 307
Query: 297 -------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--- 346
+ +++DSGTT TFL + ++ ++ ++ QPV + + C+
Sbjct: 308 TFDLRDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLR-QPVVNASSLDSPCFPAATG 366
Query: 347 -NSLSQVPEVTIHFR-GADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
L +P++ +HF GAD++L R N+ F + ++ + V I GN Q N
Sbjct: 367 EQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNI 426
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
+ +DI +SF PTDC K
Sbjct: 427 QMLFDITVGQLSFMPTDCGK 446
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 155/435 (35%), Positives = 219/435 (50%), Gaps = 56/435 (12%)
Query: 28 GFSVEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI- 84
G VEL +H D S T Q +R AL R ++R N + SS A
Sbjct: 31 GVRVELTRVHADP--------SVTASQFVRGALRRDMHRHNARKLALAASSGATVSAPTQ 82
Query: 85 -IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
P YL+ ++IGTPP A+ADTGSDLIWTQC PC SQC+ Q +PL++P S+T+
Sbjct: 83 NSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTF 141
Query: 144 KSLPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
LPC+SS CA+ + + G C Y+V+YG G +++ +ET T GST
Sbjct: 142 AVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQS 200
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--- 254
+PGI FGC T + G S +G+VGLG G +SL+SQ+ KFSYCL P T
Sbjct: 201 RVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTS 257
Query: 255 ---------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----- 300
+N GT G+ S P V S TFY L + IS+G L + PD
Sbjct: 258 TLLLGPSASLN-GTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIP-PDAFLLN 315
Query: 301 ------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ- 351
++IDSGTT+T L + + + S++ P D + + L+LC+ S +
Sbjct: 316 ADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSAATGLDLCFMLPSSTSA 374
Query: 352 ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYD 407
+P +T+HF GAD+ L ++ + + C + T+ V I GN Q N + YD
Sbjct: 375 PPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYD 434
Query: 408 IEQQTVSFKPTDCTK 422
I Q+T+SF P C+
Sbjct: 435 IGQETLSFAPAKCSA 449
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 154/427 (36%), Positives = 217/427 (50%), Gaps = 39/427 (9%)
Query: 22 IEAQTGGFSVELIHRDSPKSPF-YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+++ TG +V L HR P SP T +RL R+ F+ S +
Sbjct: 51 VKSSTGAATVPLHHRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGGVNGSRGGA 110
Query: 81 QADIIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
D+ ++A YLI + +G+P + + DTGSD+ W QC+PC SQC
Sbjct: 111 -GDVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPC--SQC 167
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATE 185
+ Q PLFDP SSTY CSS+ CA L Q+ CS CQY+V+YGDGS + G +++
Sbjct: 168 HSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSD 227
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T+ LGS A+ FGC G FN +T G++GLGGG SL+SQ T FSY
Sbjct: 228 TLALGSN-----AVRKFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSY 281
Query: 246 CLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVST---- 298
CL P +S+ F T G + G V TP+ ++ TFY + I AI VG ++L + T
Sbjct: 282 CL-PATSSSSGFLTLGAGT-SGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFS 339
Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVT 356
++DSGT LT LP S L S + ++ P A P+G L+ C+ F+ S V P V
Sbjct: 340 AGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVA 399
Query: 357 IHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 413
+ F GA V ++ ++ S I+C F ++ S+ I GN+ Q F V YD+ V
Sbjct: 400 LVFSGGAVVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAV 459
Query: 414 SFKPTDC 420
FK C
Sbjct: 460 GFKAGAC 466
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/353 (37%), Positives = 186/353 (52%), Gaps = 31/353 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ NY++ + +GTP ++ V DTGSD W QC PC +CY Q PLFDP SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKEPLFDPAKSSTYANV 217
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+ S CA L+ C+G +C Y+V YGDGS++ G A +T+T+ A+ G FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF KT G++GLG G SL Q G F+YCL +++ GT + GP
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-----GTGYLDFGP 326
Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
G TP+ K +TFY + + I VG Q++ V ST ++DSGT +T LP
Sbjct: 327 GSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPA 386
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRS 369
+ L S ++ A+ G L+ CY F LS V P V++ F+ GA + + S
Sbjct: 387 TAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVS 446
Query: 370 NFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+SE VC F G SV I GN Q + V YD+ ++TV F P C
Sbjct: 447 GIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 146/426 (34%), Positives = 222/426 (52%), Gaps = 61/426 (14%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
GFSVE IHRDS KS F++ + TP RLR A RS+ R H + ++ +++ +
Sbjct: 3 GFSVEFIHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDSD 62
Query: 82 ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ ++P N YL+ + + TPP LA+ADTGS L+W +C+ P
Sbjct: 63 ADVVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK-----------LPAAHT 111
Query: 138 KMSSTYKSLPCSSSQCASL-NQKSC----SGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
SS+Y LPC + C +L + SC SG N C Y ++ DGS + G + + T +
Sbjct: 112 PASSSYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFST 171
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVP 249
+ FGC T GL + G+VGL G ISL+SQ+ +T A KFSYCLVP
Sbjct: 172 R---------LDFGCATRTEGL-SVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVP 221
Query: 250 -----VSSTKINFGTNGIV-SGPGVVSTPLT--KAKTFYVLTIDAISVGNQ--RLGVSTP 299
S+ +NFG++ IV S PG +TPL + K+FY + +D+I V + L +T
Sbjct: 222 YSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTT 281
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS------QVP 353
+++DSGT LT+LP+ L++ +++ I+ V P +CY + +P
Sbjct: 282 KLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKSIP 341
Query: 354 EVTIHF-RGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIE 409
+VT+ G +V+L N F V+ VC + + +P I GN+ Q N VG+D+E
Sbjct: 342 DVTLVLGGGGEVRLPWGNTFVVENKGTTVCLAL--VESHLPEFILGNVAQQNLHVGFDLE 399
Query: 410 QQTVSF 415
++TVSF
Sbjct: 400 RRTVSF 405
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/353 (37%), Positives = 186/353 (52%), Gaps = 31/353 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ NY++ + +GTP ++ V DTGSD W QC PC +CY Q PLFDP SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKGPLFDPAKSSTYANV 217
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+ S CA L+ C+G +C Y+V YGDGS++ G A +T+T+ A+ G FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF KT G++GLG G SL Q G F+YCL +++ GT + GP
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-----GTGYLDFGP 326
Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
G TP+ K +TFY + + I VG Q++ V ST ++DSGT +T LP
Sbjct: 327 GSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPA 386
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRS 369
+ L S ++ A+ G L+ CY F LS V P V++ F+ GA + + S
Sbjct: 387 TAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVS 446
Query: 370 NFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+SE VC F G SV I GN Q + V YD+ ++TV F P C
Sbjct: 447 GIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 142/425 (33%), Positives = 205/425 (48%), Gaps = 46/425 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--------ISSSKASQ 81
S L+ RD+ Y S P + D ++R R + S S
Sbjct: 59 SFALVRRDAVTGATYPS---PRHAVLDLVSRDNARAEYLASRLSPAYQPTDFFGSESKVV 115
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ + + Y +R+ IG+PPTE+ V D+GSD+IW QC+PC +CY Q PLFDP S+
Sbjct: 116 SGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPASSA 173
Query: 142 TYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
T+ ++ C S+ C +L C SG C+Y VSYGDGS++ G LA ET+TLG T A+
Sbjct: 174 TFSAVSCGSAICRTLRTSGCGDSG-GCEYEVSYGDGSYTKGTLALETLTLGGT-----AV 227
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV--SSTKINF 257
G+ GCG N GLF G++GLG G +SL+ Q+ G FSYCL S +
Sbjct: 228 EGVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAAD 286
Query: 258 GTNGIVSG------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD-- 300
+V G G V PL + A +FY + + I VG++RL + T D
Sbjct: 287 AAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGG 346
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
+V+D+GT +T LPQ + L + A P A L+ CY + + +VP V+
Sbjct: 347 GGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVS 406
Query: 357 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+F G A + L N ++V I C F ++ + I GNI Q + D + F
Sbjct: 407 FYFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGF 466
Query: 416 KPTDC 420
P C
Sbjct: 467 GPATC 471
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 143/390 (36%), Positives = 205/390 (52%), Gaps = 30/390 (7%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
+ +R + +S R+ NSS SS A D+ P+ Y++ IS+GTP
Sbjct: 10 EAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
A+ADTGSDL+W Q EPC + C +FDP+ SST++ + CSS C L G +
Sbjct: 70 AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCTELPGSCEPGSS 125
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C YS YG G + G A +T++LG+T+G + P GCG N G G+VGL
Sbjct: 126 ACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVS----STKINFGTNGIVSGPGVVSTPLTKAK--- 277
G G +SL SQ+ I KFSYCLV ++ S+ + FG + + G G+ ST +T
Sbjct: 183 GQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTY 242
Query: 278 -TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
T+Y+LT++ I+V Q +G S +IDSGTTLT++P G +LS M SM+ V
Sbjct: 243 PTYYLLTVNGIAVAGQTMG-SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS 301
Query: 337 TGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVP 392
+ L+LCY S N + P +TI GA + SN+F+ V + D VC + G +P
Sbjct: 302 SMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTVC-LAMGSAGGLP 360
Query: 393 --IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN+MQ + + YD +SF C
Sbjct: 361 VSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 143/390 (36%), Positives = 206/390 (52%), Gaps = 30/390 (7%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
+ +R + +S R+ NSS SS A D+ P+ Y++ IS+GTP
Sbjct: 10 EAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
A+ADTGSDL+W Q EPC + C +FDP+ SST++ + CSS CA L G +
Sbjct: 70 AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCAELPGSCEPGSS 125
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C YS YG G + G A +T++LG+T+ + P GCG N G G+VGL
Sbjct: 126 TCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVS----STKINFGTNGIVSGPGVVSTPLTKAK--- 277
G G +SL SQ+ I KFSYCLV ++ S+ + FG + + G G+ ST +T
Sbjct: 183 GQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTY 242
Query: 278 -TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
T+Y+LT++ I+V Q +G S +IDSGTTLT++P G +LS M SM+ V
Sbjct: 243 PTYYLLTVNGIAVAGQTMG-SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS 301
Query: 337 TGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVP 392
+ L+LCY S N + P +TI GA + SN+F+ V + D VC + G + +P
Sbjct: 302 SMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTVC-LAMGSASGLP 360
Query: 393 --IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN+MQ + + YD +SF C
Sbjct: 361 VSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 151/421 (35%), Positives = 213/421 (50%), Gaps = 38/421 (9%)
Query: 26 TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQA 82
+GG +V L HR P SP S++ P L + L R R + + S + + S A
Sbjct: 58 SGGITVPLHHRHGPCSPV-PSNKMP-ASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSDA 115
Query: 83 DIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
+P + Y+I + IG+P + DTGSD+ W QC+PC SQC+ + LF
Sbjct: 116 ATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLF 173
Query: 136 DPKMSSTYKSLPCSSSQCASLNQ----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
DP SSTY CSS+ C L+Q CS CQY VSY DGS + G +++T+TLGS
Sbjct: 174 DPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLGS 233
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
A+ G FGC + G F+ +T G++GLGG SL+SQ T FSYCL P
Sbjct: 234 N-----AIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTP 288
Query: 252 STKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVST----PDIVID 304
+ F T G S G V TP+ T+ T+Y + ++AI VG Q+L + T V+D
Sbjct: 289 GSS-GFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMD 347
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 361
SGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F G
Sbjct: 348 SGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 407
Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
A V L + +++ D C F ++ S+ GN+ Q F V YD+ V F+
Sbjct: 408 AVVNLDFNGIMLEL--DNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGA 465
Query: 420 C 420
C
Sbjct: 466 C 466
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 145/420 (34%), Positives = 226/420 (53%), Gaps = 47/420 (11%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G+ + L H DS T + +R A+ RS R ++S A+ +
Sbjct: 22 GYRLVLTHVDS------KGGYTKTELMRRAVHRSRLR--------ALSGYDATSPRLHSV 67
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
YL+ ++IG PP +A+ADTGSDL WTQC+PC C+ QD+P++DP SST+ LP
Sbjct: 68 QVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPVYDPSASSTFSPLP 125
Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS+ C + ++C+ + C+Y +YGDG++S G L TET+TLG ++ V++ G+ FGC
Sbjct: 126 CSSATCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSA-PVSVGGVAFGC 184
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN----FGTNG- 261
GT+NGG + +TG VGLG G +SL++Q+ GKFSYCL ++ ++ GT
Sbjct: 185 GTDNGG-DSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSALDSPFLLGTLAE 240
Query: 262 IVSGPGVV-STPLTKA---KTFYVLTIDAISVGNQRL----------GVSTPDIVIDSGT 307
+ GP V STPL ++ + Y +++ IS+G+ RL G T +++DSGT
Sbjct: 241 LAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGT 300
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADV 364
T T L + ++ ++ ++ QP + + C+ +P++ +HF GAD+
Sbjct: 301 TFTILAESGFREVVGRVARVL-GQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADM 359
Query: 365 KLSRSNFFVKVSED-IVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+L R N+ ED C G T S + GN Q N + +D +SF PTDC+K
Sbjct: 360 RLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDCSK 419
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 138/366 (37%), Positives = 196/366 (53%), Gaps = 43/366 (11%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L+ +SIGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP SSTY ++
Sbjct: 70 GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATV 127
Query: 147 PCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
PCSS+ C+ L C S C Y+ +YGD S + G LATET TL + LPG+ FG
Sbjct: 128 PCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFG 182
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG N G S+ G+VGLG G +SL+SQ+ KFSYCL + T + G ++G
Sbjct: 183 CGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAG 239
Query: 266 --------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STPDIVID 304
V +TPL K +FY +++ AI+VG+ R+ + T +++D
Sbjct: 240 ISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVD 299
Query: 305 SGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----QVPEVTIH 358
SGT++T+L QGY + + + M A P AD +G L+LC+ + +VP + H
Sbjct: 300 SGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFH 357
Query: 359 FR-GADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
F GAD+ L N+ V +C G + + I GN Q NF YD+ T+SF
Sbjct: 358 FDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVGHDTLSFA 416
Query: 417 PTDCTK 422
P C K
Sbjct: 417 PVQCNK 422
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 143/425 (33%), Positives = 215/425 (50%), Gaps = 45/425 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK------ASQ 81
GF ++L H D+ +S T Q L A+ RS R+ Q++++S + A++
Sbjct: 27 GFQLKLTHVDA------GTSYTKPQLLSRAIARSKARVAAL-QSAAVSPAPVADPITAAR 79
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ ++ YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P FD K S+
Sbjct: 80 VLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCAAQPTPYFDVKRSA 137
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
TY++LPC SS+CA+L+ SC C Y YGD + + G LA ET T G+ + V
Sbjct: 138 TYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAAN 197
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFG 258
I+FGCG+ N G + ++G+VG G G +SL+SQ+ + +FSYCL S +++ FG
Sbjct: 198 ISFGCGSLNAGEL-ANSSGMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSPTPSRLYFG 253
Query: 259 ------TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TP 299
+ SG V STP Y L++ IS+G +RL + T
Sbjct: 254 VFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTG 313
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEV 355
++IDSGT++T+L Q + ++S I + D L+ C+ + N VP+
Sbjct: 314 GVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDF 373
Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
HF GA++ L N+ + S + T+ I GN Q N + YDI +SF
Sbjct: 374 VFHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLHLLYDIANSFLSF 433
Query: 416 KPTDC 420
P C
Sbjct: 434 VPAPC 438
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 143/420 (34%), Positives = 208/420 (49%), Gaps = 37/420 (8%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-------NSSISSSKASQADII 85
++HR P SP P + L R +R++ ++ +++ S AS+ +
Sbjct: 68 VVHRHGPCSPLQARGGEPSHA--EILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVSL 125
Query: 86 P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P ANY++ + +GTP + L V DTGSDL W QC+PC CY Q PLFDP
Sbjct: 126 PARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPC--DGCYQQHDPLFDPS 183
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S+TY ++PC + +C L+ SCS C+Y V YGD S ++GNLA +T+TLG ++ + +
Sbjct: 184 QSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSS 243
Query: 199 --LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
L FGCG ++ GLF K G+ GLG +SL SQ FSYCL P SST
Sbjct: 244 DQLQEFVFGCGDDDTGLFG-KADGLFGLGRDRVSLASQAAAKYGAGFSYCL-PSSSTAEG 301
Query: 257 FGTNGIVSGPGVVSTPL-TKAKT--FYVLTIDAISVGNQRLGVS-----TPDIVIDSGTT 308
+ + G + P T + T++ T FY L + I V + + VS TP VIDSGT
Sbjct: 302 YLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTV 361
Query: 309 LTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GAD 363
+T LP + L S + ++ + A L+ CY F + Q+P V + F GA
Sbjct: 362 ITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGAT 421
Query: 364 VKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ L ++ C F G S+ I GN+ Q F V YD+ Q + F C+
Sbjct: 422 LNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 134/416 (32%), Positives = 199/416 (47%), Gaps = 53/416 (12%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 88 ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C YSV+YGDGS++ G LA ET+TLG T
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
A+ G+ GCG N GLF G++GLG G +SL+ Q+ G FSYCL +
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLA-------SR 285
Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGT 307
G G S A +FY + + I VG +RL + T D +V+D+GT
Sbjct: 286 GAGGAGS----------LASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 335
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
+T LP+ + L + A P + L+ CY + + +VP V+ +F +GA +
Sbjct: 336 AVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVL 395
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N V+V + C F ++ + I GNI Q + D V F P C
Sbjct: 396 TLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 140/414 (33%), Positives = 203/414 (49%), Gaps = 36/414 (8%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQADIIPNN 88
++HR P SP P + L R +R++ ++ ++ S AS+ +P +
Sbjct: 121 VVHRHGPCSPLLARGGEPSHA--EILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPAH 178
Query: 89 -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
ANY++ + +GTP + L V DTGSDL W QC+PC + CY Q PLFDP S+
Sbjct: 179 RGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPC--NNCYKQHDPLFDPSQST 236
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
TY ++PC + +C L+ +CS C+Y V YGD S ++GNLA +T+TLG ++ Q L G
Sbjct: 237 TYSAVPCGAQEC--LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQ---LQG 291
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
FGCG ++ GLF + G+ GLG +SL SQ FSYCL P S + + G
Sbjct: 292 FVFGCGDDDTGLFG-RADGLFGLGRDRVSLASQAAARYGAGFSYCL-PSSWRAEGYLSLG 349
Query: 262 IVSGP--GVVSTPLTKAKT--FYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFL 312
+ P + +T++ T FY L + I V + + V+ P VIDSGT +T L
Sbjct: 350 SAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRL 409
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRS 369
P S L S + + A L+ CY F + Q+P V + F GA + L
Sbjct: 410 PSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFG 469
Query: 370 NFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ C F G SV I GN+ Q F V YD+ Q + F C+
Sbjct: 470 GVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 135/391 (34%), Positives = 206/391 (52%), Gaps = 34/391 (8%)
Query: 56 DALTRSLNRLNHFNQNSSISS--SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
+A+ RS R+ + S + S+ Q+ + N YL+ +++G+PP + DTGSD
Sbjct: 2 EAVQRSHERVAFYTLKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSD 61
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVS 171
L W QC PC CY Q P FDP S +++ C+ + C ++L K+C+ CQY +
Sbjct: 62 LNWVQCLPC--RVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYT 119
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
YGD S +NG+LA ET++L + G ++P FGCGT N G F + G+VGLG G +SL
Sbjct: 120 YGDQSNTNGDLAFETISLNNGAGTQ-SVPNFAFGCGTQNLGTF-AGAAGLVGLGQGPLSL 177
Query: 232 ISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTID 285
SQ+ T A KFSYCLV +S++ + FG+ I + + T + + T+Y + ++
Sbjct: 178 NSQLSHTFANKFSYCLVSLNSLSASPLTFGS--IAAAANIQYTSIVVNARHPTYYYVQLN 235
Query: 286 AISVGNQRLGVSTPDI------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
+I VG Q L ++ P + +IDSGTT+T L S +L S + +
Sbjct: 236 SIEVGGQPLNLA-PSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRL 294
Query: 334 ADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKV--SEDIVCSVFKGITN 389
L+LC++ +S VP++ F+GAD ++ N FV V S +C G +
Sbjct: 295 DGSAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGG-SQ 353
Query: 390 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GNI Q N LV YD+E + + F DC
Sbjct: 354 GFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 155/425 (36%), Positives = 218/425 (51%), Gaps = 56/425 (13%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A GGFSVE IHRDSP+SPF++ + T + R A RS+ R ++S S+S AD
Sbjct: 29 ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88
Query: 84 -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
++ + YL+ +++G+PP LA+ADTGSDL+W +C+ P +Q
Sbjct: 89 DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP SSTY + C + C +L + +C G NC Y +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200
Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGK 242
+ + V + G+ FGC T G F + +G G +SL++Q+ T++ +
Sbjct: 201 FDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGG--GAVSLVTQLGGATSLGRR 258
Query: 243 FSYCLVPVS---STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP 299
FSYCLVP S S+ +NFG V+ PG STPL KT S + R
Sbjct: 259 FSYCLVPHSVNASSALNFGALADVTEPGAASTPLVGNKTV-------ASAASSR------ 305
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPE 354
I++DSGTTLTFL ++ +S I PV P G L+LCY+ + +P+
Sbjct: 306 -IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPD 364
Query: 355 VTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQ 411
+T+ F GA V L N FV V E +C T P I GN+ Q N VGYD++
Sbjct: 365 LTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAG 424
Query: 412 TVSFK 416
TV K
Sbjct: 425 TVGNK 429
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 77/146 (52%), Gaps = 9/146 (6%)
Query: 284 IDAISVGNQRLG-VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
+DA +VGN+ + ++ I++DSGTTLTFL ++ +S I PV P G L+L
Sbjct: 421 LDAGTVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQL 480
Query: 343 CYS-----FNSLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IY 394
CY+ + +P++T+ F GA V L N FV V E +C T P I
Sbjct: 481 CYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSIL 540
Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN+ Q N VGYD++ TV+F DC
Sbjct: 541 GNLAQQNIHVGYDLDAGTVTFAVADC 566
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 144/430 (33%), Positives = 216/430 (50%), Gaps = 54/430 (12%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDA-------LTRSLNRLNHFNQNSSISSSKAS 80
G V L H D+ + + T Q LR A ++R + R SS + + A
Sbjct: 38 GLRVALTHVDA------HGNYTKLQLLRRAARRSRHRMSRLVARTTGVPVMSSKAVAPAL 91
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC +C+ Q +P+FDP S
Sbjct: 92 QVPVHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPC--VECFNQSTPVFDPSSS 149
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
STY +LPCSS+ C+ L C+ C Y+ +YGD S + G LA ET TL T LP
Sbjct: 150 STYAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTK-----LP 204
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INF 257
+ FGCG N G ++ G+VGLG G +SL+SQ+ KFSYCL + T +
Sbjct: 205 DVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLN---KFSYCLTSLDDTSKSPLLL 261
Query: 258 GTNGIV-----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STP 299
G+ + + V +TPL + +FY + + ++VG+ + + T
Sbjct: 262 GSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTG 321
Query: 300 DIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----QVP 353
+++DSGT++T+L QGY + + + M P AD +G L+ C+ + +VP
Sbjct: 322 GVIVDSGTSITYLELQGYRALKKAFAAQM--KLPAADGSGIGLDTCFEAPASGVDQVEVP 379
Query: 354 EVTIHFRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
++ H GAD+ L N+ V S +C G + + I GN Q N YD+ + T
Sbjct: 380 KLVFHLDGADLDLPAENYMVLDSGSGALCLTVMG-SRGLSIIGNFQQQNIQFVYDVGENT 438
Query: 413 VSFKPTDCTK 422
+SF P C K
Sbjct: 439 LSFAPVQCAK 448
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/363 (35%), Positives = 185/363 (50%), Gaps = 37/363 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+R+++GTP DTGSDL+WTQC PC C+ QD P+ DP SSTY +LPC
Sbjct: 83 EYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPC--RDCFDQDLPVLDPAASSTYAALPCG 140
Query: 150 SSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALP 200
+++C +L SC GV C Y+ YGD S + G +AT+ T G + +G+++
Sbjct: 141 AARCRALPFTSC-GVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTR 199
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
+TFGCG N G+F S TGI G G G SL SQ+ T FSYC + +K + T
Sbjct: 200 RLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFESKSSLVTL 256
Query: 261 G---------IVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---VIDS 305
G SG V +TP+ K + Y L++ ISVG RL V +IDS
Sbjct: 257 GGSPAALYSHAHSGE-VRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDS 315
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIHFR 360
G ++T LP+ + + ++ + P +L+LC++ + VP +T+H
Sbjct: 316 GASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHLE 375
Query: 361 GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
GAD +L RSN+ F + ++C V + GN Q N V YD+E +SF P
Sbjct: 376 GADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRLSFAPAR 435
Query: 420 CTK 422
C +
Sbjct: 436 CDR 438
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 127/304 (41%), Positives = 176/304 (57%), Gaps = 46/304 (15%)
Query: 3 TFLSCVFILFFLCFYVVSP-IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
T+ + ++ L F + P IEA GGF+ +LI R+S K
Sbjct: 2 TYPRKIHLISILLFVFIFPHIEAHNGGFTGKLIPRNSSK--------------------- 40
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
+ FN+N+ Q+ + N+ +YL+ +SIGTPP + A ADTGSDLIW QC P
Sbjct: 41 ----DFFNRNTI-------QSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIP 89
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSN 179
C + CY Q +P+FD + SST+ ++ C S C+ L SCS +NC+Y+ SY DGS +
Sbjct: 90 C--TNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQ 147
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G LA ET+TL STTG+ VA G+ FGCG NN G FN K GI+GLG G +SL+SQ+ +++
Sbjct: 148 GVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSL 207
Query: 240 AGK-FSYCLVPVS-----STKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVG 290
G FS CLVP + S+ ++FG V G GVVSTPL T ++FY +T+ ISV
Sbjct: 208 GGNMFSQCLVPFNTNPSISSPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISVE 267
Query: 291 NQRL 294
+ L
Sbjct: 268 DINL 271
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 128/362 (35%), Positives = 187/362 (51%), Gaps = 36/362 (9%)
Query: 86 PNNANY---LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
P +A Y L+ I +GTPP + + + DTGSDL W Q EPC C+ Q P+FDP SST
Sbjct: 17 PESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPC--RACFEQADPIFDPSKSST 74
Query: 143 YKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
Y + CSSS CA L Q + NC Y+ YGDGS + G + ET+T T G+ V
Sbjct: 75 YNKIACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVK-- 132
Query: 201 GITFGCGTNNGGLF-NSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTK 254
FG N G F ++ GI+GLG G +S+ SQ+ + + KFSYCLV ++
Sbjct: 133 ---FGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETST 189
Query: 255 INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVS----------TPDI 301
+ FG + SG V TP+ T+Y + + ISVG L + +
Sbjct: 190 MYFGDAAVPSGE-VQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGT 248
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 359
+IDSGTT+T+L Q + L++ +S + TG L+LC++ P +TIH
Sbjct: 249 IIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATG-LDLCFNTRGTGSPVFPAMTIHL 307
Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
G ++L +N F+ + +I+C F + + I+GNI Q NF + YD++ + F P
Sbjct: 308 DGVHLELPTANTFISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPA 367
Query: 419 DC 420
DC
Sbjct: 368 DC 369
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 138/390 (35%), Positives = 220/390 (56%), Gaps = 33/390 (8%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
T + R H ++ ++S A+ + YL+ ++IGTPP +A+ADTGSDL WTQ
Sbjct: 34 TELMRRAAHRSRLQALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQ 93
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQKSCSGVN--CQYSVSYGDG 175
C+PC C+ QD+P++DP SST+ +PCSS+ C + ++CS + C+Y SY DG
Sbjct: 94 CQPC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPSSPCRYIYSYSDG 151
Query: 176 SFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
++S G L TET+T+GS+ GQ V++ + FGCGT+NGG + +TG VGLG G +SL++Q
Sbjct: 152 AYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQ 210
Query: 235 MRTTIAGKFSYCLVPVSSTKIN----FGTNG-IVSGPGVV-STPLTKA---KTFYVLTID 285
+ GKFSYCL ++ ++ GT + GPG V STPL ++ + Y + +
Sbjct: 211 LG---VGKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQ 267
Query: 286 AISVGNQRLGV--STPDI--------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
IS+G+ RL + T D+ ++DSGTT T L + ++ ++ ++ QP +
Sbjct: 268 GISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLL-GQPPVN 326
Query: 336 PTGSLELCY-SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVP 392
+ C+ S + +P++ +HF GAD++L R N+ +D C G ++
Sbjct: 327 ASSLDSPCFPSPDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWS 386
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
GN Q N + +D+ +SF PTDC+K
Sbjct: 387 RLGNFQQQNIQMLFDMTVGQLSFLPTDCSK 416
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 141/434 (32%), Positives = 211/434 (48%), Gaps = 58/434 (13%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-------- 81
S+ L+ RD Y S LR A+ + R N + + S A Q
Sbjct: 105 SLALVRRDEVTGSTYPS-------LRHAVLDLVARDNARAEYLATRLSPAYQPPGFSGSE 157
Query: 82 ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ + + YL+R+S+G+PPTE+ V D+GSD++W QC+PC +CY+Q PLFDP
Sbjct: 158 SKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPC--LECYVQADPLFDP 215
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S+T+ + C S+ C L +C C+Y VSY DGS++ G LA ET+TLG T
Sbjct: 216 ATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLGGT-- 273
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---VS 251
A+ G+ GCG N GLF G++GLG G +SL+ Q+ + G FSYCL
Sbjct: 274 ---AVEGVVIGCGHRNRGLFVG-AAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYG 329
Query: 252 STKINFGTNGIVSG------PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV------ 296
S + +V G G V PL +A +FY + + I VG++RL +
Sbjct: 330 SGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQ 389
Query: 297 ----STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSL 349
D+V+D+GTT+T LPQ Y + + + ++ A P A S L+ CY +
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGY 449
Query: 350 S--QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
+ +VP V+ F G A + L+ N ++V I C F ++ + I GN Q +
Sbjct: 450 ASVRVPTVSFCFDGDARLILAARNVLLEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITV 509
Query: 407 DIEQQTVSFKPTDC 420
D + F P +C
Sbjct: 510 DSANGYIGFGPANC 523
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 137/373 (36%), Positives = 195/373 (52%), Gaps = 44/373 (11%)
Query: 86 PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
P YL+ ++IGTPP A+ADTGSDLIWTQC PC SQC+ Q +PL++P S+T+
Sbjct: 27 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTFAV 85
Query: 146 LPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
LPC+SS CA+ + + G C Y+V+YG G +++ +ET T GST +
Sbjct: 86 LPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARV 144
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
PGI FGC T + G S +G+VGLG G +SL+SQ+ KFSYCL P T
Sbjct: 145 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 201
Query: 255 -------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD------- 300
+N GT G+ S P V S TFY L + IS+G L + PD
Sbjct: 202 LLGPSASLN-GTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIP-PDAFSLNAD 259
Query: 301 ----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--- 351
++IDSGTT+T L + + + S++ P D + L+LC+ S +
Sbjct: 260 GTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAPP 318
Query: 352 -VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIE 409
+P +T+HF GAD+ L ++ + + C + T+ V I GN Q N + YDI
Sbjct: 319 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 378
Query: 410 QQTVSFKPTDCTK 422
Q+T+SF P C+
Sbjct: 379 QETLSFAPAKCSA 391
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 177/342 (51%), Gaps = 16/342 (4%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
ANY+I + GTP + + DTGS++ W QC+PC S CY Q PLFDP +SSTY+++ C
Sbjct: 14 ANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVS-CYPQQEPLFDPTLSSTYRNISC 72
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
+S+ C L+ + CSG C Y V+YGDGS + G LATET TL + FGCG
Sbjct: 73 TSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGN----VFNNFIFGCGQ 128
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGV 268
NN GLF + G++GLG SL SQ+ T++ FSYCL SS + PG
Sbjct: 129 NNQGLF-TGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPLRTPGY 187
Query: 269 VSTPL-TKAKTFYVLTIDAISVGNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNLLS 322
+ ++A T Y + + ISVG RL +S+ +IDSGT +T LP L +
Sbjct: 188 TAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLPPTAYGALRT 247
Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIV 380
+ + A L+ CY F+ + V P + +H+ G DV + + F +S V
Sbjct: 248 AFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVFYVISSSQV 307
Query: 381 CSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F G ++S + I GN+ Q V YD + + F C
Sbjct: 308 CLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 148/461 (32%), Positives = 235/461 (50%), Gaps = 64/461 (13%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLN 66
V I +LC V+ A G V+L H D+ K E P + L R A+ RS R
Sbjct: 9 VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61
Query: 67 HFN--QNSSI---SSSKASQADIIPNNA-------NYLIRISIGTPPTERLAVADTGSDL 114
+ +N S ++A + + P A Y++ +++GTPP A+ DTGSDL
Sbjct: 62 ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
IWTQC+ C + C Q PLF P+MSS+Y+ + C+ C + SC + C Y SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
DG+ + G ATE T S++G+ ++P + FGCGT N G N+ +GIVG G +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237
Query: 234 QMRTTIAGKFSYCLVPVSSTK---INFGTNGIV------SGPGVVSTPLTKAK---TFYV 281
Q+ +FSYCL P +S++ + FG+ V +GP V +TP+ ++ TFY
Sbjct: 238 QLSIR---RFSYCLTPYASSRKSTLQFGSLADVGLYDDATGP-VQTTPILQSAQNPTFYY 293
Query: 282 LTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+ ++VG +RL + PD ++IDSGT LT P + ++ S +
Sbjct: 294 VAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQLRL- 352
Query: 332 PVADPTGSLE-LCYSFNSLS----------QVPEVTIHFRGADVKLSRSNFFVK-VSEDI 379
P A+ + + +C++ +++ VP + HF+GAD+ L R N+ ++
Sbjct: 353 PFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGH 412
Query: 380 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+C + + GN +Q + V YD+E++T+SF P +C
Sbjct: 413 LCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 143/445 (32%), Positives = 219/445 (49%), Gaps = 61/445 (13%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------- 73
+ A + + L+HRD + ++ TP Q L L R + R ++
Sbjct: 61 VAASSSTLHIRLLHRDR-----FAANATPAQLLARRLQRDVLRAAWIISKAAANGTPPPV 115
Query: 74 --ISSSKASQADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+SS++ A ++ P + Y+ +I++GTP E L DT SDL W QC+PC +CY
Sbjct: 116 AGLSSARGFVAPVVSRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPC--RRCY 173
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATE 185
Q P+FDP+ S++Y+ + +++ C +L + C Y+V YGDGS + G+ E
Sbjct: 174 PQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEE 233
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T+T V LP I+ GCG +N GLF + GI+GLG G +S +Q+ G FSY
Sbjct: 234 TLTFAG----GVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHN--GTFSY 287
Query: 246 CLV-----PVS-STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-G 295
CLV P S S+ + FG + + P V TP TFY + + ISVG R+ G
Sbjct: 288 CLVDFLSGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPG 347
Query: 296 VSTPD-----------IVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSL 340
V+ D +++DSGT +T L + + +V + + + P+G
Sbjct: 348 VTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVS-IGGPSGFF 406
Query: 341 ELCYSF--NSLSQVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGI-TNSVPIYG 395
+ CY+ + +VP V++HF G+ +VKL N+ + V S VC F +SV I G
Sbjct: 407 DTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIG 466
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
NI Q F + YDI + V F P C
Sbjct: 467 NIQQQGFRIVYDIGGR-VGFAPNSC 490
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 132/370 (35%), Positives = 184/370 (49%), Gaps = 45/370 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ +++GTPP DTGSDL+WTQC PC C+ Q PL DP SSTY +LPC
Sbjct: 91 EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFHQGLPLLDPAASSTYAALPCG 148
Query: 150 SSQCASLNQKSCSG----------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA- 198
+ +C +L SC G +C Y YGD S + G +AT+ T G G +
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208
Query: 199 LP--GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
LP +TFGCG N G+F S TGI G G G SL SQ+ T FSYC + +K +
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVT---TFSYCFTSMFESKSS 265
Query: 257 FGTNG-------------IVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD 300
T G +SG V +TPL K + Y L++ ISVG RL V
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGE-VRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAK 324
Query: 301 I---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF--NSLSQ--- 351
+ +IDSG ++T LP+ + + ++ + P GS L+LC++ +L +
Sbjct: 325 LRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWRRPP 384
Query: 352 VPEVTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
VP +T+H GAD +L R N+ F ++ ++C V + GN Q N V YD+E
Sbjct: 385 VPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQNTHVVYDLEN 444
Query: 411 QTVSFKPTDC 420
+SF P C
Sbjct: 445 DWLSFAPARC 454
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 146/414 (35%), Positives = 220/414 (53%), Gaps = 37/414 (8%)
Query: 29 FSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
F ELI+R+ SP + + +TP + A+ R R ++ ++ + + +
Sbjct: 28 FRAELIYREHQSSPLRSETLKTPSEIFIAAVKRGHERRARLAKHV-LAGDQLFETPVASG 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N YLI IS G PP + A+ DTGSDL W QC PC CY S FDP S++YK+L
Sbjct: 87 NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPC--KSCYETLSAKFDPSKSASYKTLG 144
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S+ C L +SC+ +CQY YGDGS ++G L+T+ VT+G TG+ +P + FGCG
Sbjct: 145 CGSNFCQDLPFQSCA-ASCQYDYMYGDGSSTSGALSTDDVTIG--TGK---IPNVAFGCG 198
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN--FGTNGIVSG 265
+N G F +VGLG G +SL+SQ+ T KFSYCLVP+ STK + + + ++G
Sbjct: 199 NSNLGTFAGAGG-LVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTLAG 257
Query: 266 PGVVSTPL---TKAKTFYVLTIDAISVGNQRLG--VSTPDI--------VIDSGTTLTFL 312
GV TP+ TFY + ISV + + +T DI ++DSGTTLT+L
Sbjct: 258 -GVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYL 316
Query: 313 P-QGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ--VPEVTIHFRGADVKL 366
+N +++++ A P + GS LE C+S ++ P V HF GADV L
Sbjct: 317 DVDAFN----PMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFNGADVAL 372
Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ N F+ + + + + I+GNI Q N ++ +D+ + + FK +C
Sbjct: 373 APDNTFIALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 142/429 (33%), Positives = 208/429 (48%), Gaps = 40/429 (9%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
I + G +V L HR P SP +S + P + + L R R H
Sbjct: 45 ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102
Query: 70 ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
Q S +SSS ++ + Y+I + +GTP + DTGSD+ W QC PCP
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
C+ Q LFDP SSTY+++ C++++CA L Q+ C N CQY V YGDGS +NG
Sbjct: 163 CHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
+ +T+TL +G + A+ G FGC G F+ +T G++GLGGG SL+SQ
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHLESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278
Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP 299
FSYCL P S + G G V+T + ++K TFY + I+VG ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLS-P 337
Query: 300 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 352
+ V+DSGT +T LP S L S + ++ A L+ C+ F +Q +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
P V + F GA + L + + + G + I GN+ Q F V YD+
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454
Query: 412 TVSFKPTDC 420
T+ F+ C
Sbjct: 455 TLGFRSGAC 463
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 148/461 (32%), Positives = 235/461 (50%), Gaps = 64/461 (13%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLN 66
V I +LC V+ A G V+L H D+ K E P + L R A+ RS R
Sbjct: 9 VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61
Query: 67 HFN--QNSSI---SSSKASQADIIPNNA-------NYLIRISIGTPPTERLAVADTGSDL 114
+ +N S ++A + + P A Y++ +++GTPP A+ DTGSDL
Sbjct: 62 ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
IWTQC+ C + C Q PLF P+MSS+Y+ + C+ C + SC + C Y SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
DG+ + G ATE T S++G+ ++P + FGCGT N G N+ +GIVG G +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237
Query: 234 QMRTTIAGKFSYCLVPVSSTK---INFGTNGIV------SGPGVVSTPLTKAK---TFYV 281
Q+ +FSYCL P +S++ + FG+ V +GP V +TP+ ++ TFY
Sbjct: 238 QLSIR---RFSYCLTPYASSRKSTLQFGSLADVGLYDDATGP-VQTTPILQSAQNPTFYY 293
Query: 282 LTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+ ++VG +RL + PD ++IDSGT LT P + ++ S +
Sbjct: 294 VAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQLRL- 352
Query: 332 PVADPTGSLE-LCYSFNSLS----------QVPEVTIHFRGADVKLSRSNFFVK-VSEDI 379
P A+ + + +C++ +++ VP + HF+GAD+ L R N+ ++
Sbjct: 353 PFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGH 412
Query: 380 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+C + + GN +Q + V YD+E++T+SF P +C
Sbjct: 413 LCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 135/356 (37%), Positives = 190/356 (53%), Gaps = 43/356 (12%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
IGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP SSTY ++PCSS+ C+ L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 230
Query: 157 NQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN 215
C S C Y+ +YGD S + G LATET TL + LPG+ FGCG N G
Sbjct: 231 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFGCGDTNEGDGF 285
Query: 216 SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG--------PG 267
S+ G+VGLG G +SL+SQ+ KFSYCL + T + G ++G
Sbjct: 286 SQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 342
Query: 268 VVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLP- 313
V +TPL K +FY +++ AI+VG+ R+ + T +++DSGT++T+L
Sbjct: 343 VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 402
Query: 314 QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----QVPEVTIHFR-GADVKLS 367
QGY + + + M A P AD +G L+LC+ + +VP + HF GAD+ L
Sbjct: 403 QGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLP 460
Query: 368 RSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N+ V +C G + + I GN Q NF YD+ T+SF P C K
Sbjct: 461 AENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNK 515
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 135/421 (32%), Positives = 206/421 (48%), Gaps = 42/421 (9%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL----NHFNQNSSISSSKASQADI 84
+ ++L+HRD K P +N+S R + R R+ H + +A +D+
Sbjct: 66 YKLKLVHRD--KVPTFNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKPTYAEEAFGSDV 123
Query: 85 I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ + Y +RI +G+PP + V D+GSD+IW QCEPC +QCY Q P+F+P S
Sbjct: 124 VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 181
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+Y + C+S+ C+ ++ C C+Y VSYGDGS++ G LA ET+T G T + VA+
Sbjct: 182 SSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAI- 240
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINF 257
GCG +N G+F G++GLG G +S + Q+ G FSYCLV SS + F
Sbjct: 241 ----GCGHHNQGMF-VGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQF 295
Query: 258 GTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVID 304
G + G V PL +A++FY + + + VG R+ +S +V+D
Sbjct: 296 GREAVPVGAAWV--PLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMD 353
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGA 362
+GT +T LP + P A + CY F +S +VP V+ +F G
Sbjct: 354 TGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGG 413
Query: 363 DV-KLSRSNFFVKVSEDI--VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ L NF + V +D+ C F ++ + I GNI Q + D V F P
Sbjct: 414 PILTLPARNFLIPV-DDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNV 472
Query: 420 C 420
C
Sbjct: 473 C 473
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 140/423 (33%), Positives = 207/423 (48%), Gaps = 42/423 (9%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
GF ++L H D+ +S T Q L A+ RS R+ + + A++
Sbjct: 28 GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ ++ YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P FD K S+TY
Sbjct: 82 VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
++LPC SS+CASL+ SC C Y YGD + + G LA ET T G+ V I
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINFG-- 258
FGCG+ N G + ++G+VG G G +SL+SQ+ + +FSYCL + + +++ FG
Sbjct: 200 FGCGSLNAGDL-ANSSGMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVY 255
Query: 259 ----TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TPDI 301
+ SG V STP Y L++ AIS+G + L + T +
Sbjct: 256 ANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGV 315
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVTI 357
+IDSGT++T+L Q + + S I + D L+ C+ + N VP++
Sbjct: 316 IIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVF 375
Query: 358 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
HF A++ L N+ + S + T I GN Q N + YDI +SF P
Sbjct: 376 HFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVP 435
Query: 418 TDC 420
C
Sbjct: 436 APC 438
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 145/455 (31%), Positives = 207/455 (45%), Gaps = 61/455 (13%)
Query: 20 SPIEAQTG----GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR---LNHFNQNS 72
+P E + G G + ++HR P SP ++ P D L NR + H +
Sbjct: 72 APREHKHGATSSGTRMTIVHRHGPCSPLADAHGKPPSH-EDILAADQNRAESIQHRVSTT 130
Query: 73 SISSSKASQADIIPNN-------------------------------ANYLIRISIGTPP 101
+ ++ P+ NY++ + +GTP
Sbjct: 131 ATGRGNPKRSRRAPSRRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPA 190
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
+ V DTGSD W QC+PC CY Q LFDP SSTY ++ C++ C+ L+ + C
Sbjct: 191 SRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANISCAAPACSDLDTRGC 249
Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
SG NC Y V YGDGS+S G A +T+TL S A+ G FGCG N GLF + G+
Sbjct: 250 SGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGCGERNEGLFG-EAAGL 304
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPL--TKAK 277
+GLG G SL Q G F++CL SS ++FG + ++TP+
Sbjct: 305 LGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGP 364
Query: 278 TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ- 331
TFY + + I VG Q L + +T ++DSGT +T LP S+L S +S + A+
Sbjct: 365 TFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARG 424
Query: 332 -PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI 387
A L+ CY F +SQV P V++ F+ GA + + S S VC F
Sbjct: 425 YKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAAN 484
Query: 388 TNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ V I GN F V YDI ++ V F P C
Sbjct: 485 EDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 126/365 (34%), Positives = 184/365 (50%), Gaps = 27/365 (7%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A + YL + +GTP + DTGSDL W QC PC +CY Q+ LF P S+
Sbjct: 4 APVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GKCYSQNDALFLPNTST 61
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
++ L C S+ C L C+ C Y SYGDGS + G+ +T+T+ GQ +P
Sbjct: 62 SFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPN 121
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKIN 256
FGCG +N G F + GI+GLG G +S SQ+++ GKFSYCLV P ++ +
Sbjct: 122 FAFGCGHDNEGSF-AGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLL 180
Query: 257 FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP--DI--------VI 303
FG + P V P+ K T+Y + ++ ISVG+ L +S+ DI +
Sbjct: 181 FGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIF 240
Query: 304 DSGTTLTFLPQGYNSNLLSVM--SSMIEAQPVADPTGSLELCYS---FNSLSQVPEVTIH 358
DSGTT+T L + +L+ M S+M ++ + D L+LC S + L VP +T H
Sbjct: 241 DSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDD-ISRLDLCLSGFPKDQLPTVPAMTFH 299
Query: 359 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
F G D+ L SN+F+ + + V I G++ Q NF V YD + + F P
Sbjct: 300 FEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYYDTAGRKLGFVPK 359
Query: 419 DCTKQ 423
DC +
Sbjct: 360 DCVGR 364
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 127/358 (35%), Positives = 185/358 (51%), Gaps = 27/358 (7%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
YL + +GTP + DTGSDL W QC PC CY Q+ LF P S+++ L C
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GTCYSQNDSLFIPNTSTSFTKLAC 58
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
+ C L C+ C Y SYGDGS S G+ +T+T+ GQ +P FGCG
Sbjct: 59 GTELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGH 118
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKINFGTNGIV 263
+N G F + GI+GLG G +S SQ++T GKFSYCLV P ++ + FG +
Sbjct: 119 DNEGSF-AGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVP 177
Query: 264 SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP--DI--------VIDSGTTLT 310
+ PGV L K T+Y + ++ ISVG + L +S+ DI + DSGTT+T
Sbjct: 178 TFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVT 237
Query: 311 FLPQGYNSNLLSVM-SSMIEAQPVADPTGSLELC---YSFNSLSQVPEVTIHFRGADVKL 366
L + +L+ M +S ++ +D + L+LC ++ L VP +T HF G D++L
Sbjct: 238 QLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGGDMEL 297
Query: 367 SRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
SN+F+ + E F +++ V I G+I Q NF V YD + + F P C +
Sbjct: 298 PPSNYFIFL-ESSQSYCFSMVSSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSCVGR 354
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 146/422 (34%), Positives = 204/422 (48%), Gaps = 45/422 (10%)
Query: 31 VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
+ L HR P +P +S +P L D L R + + S +++ A S+
Sbjct: 67 LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 125
Query: 82 ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
A +P N Y++ +S+GTP + DTGSD+ W QC+PCP CY Q PL
Sbjct: 126 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 185
Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
FDP SS+Y ++PC+++ C+ +L CSG C Y VSYGDGS + G +++T+TL +
Sbjct: 186 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 245
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
AL G FGCG GLF + G++GLG SL+SQ +T G FSYCL P +
Sbjct: 246 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN 300
Query: 253 TKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
+ G S G +TPL A T+Y++ + ISVG Q L + V+D+
Sbjct: 301 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDT 360
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-R 360
GT +T LP S L S + + P A TG L+ CY F V P ++I F
Sbjct: 361 GTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGG 420
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
GA + L S C F G + I GN+ Q +F V +D TV F P
Sbjct: 421 GAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPA 473
Query: 419 DC 420
C
Sbjct: 474 SC 475
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 145/412 (35%), Positives = 221/412 (53%), Gaps = 47/412 (11%)
Query: 45 NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPT 102
+ S T Q +R AL R ++R N +S SS A + P +L+ ++IGTPP
Sbjct: 38 DPSVTASQFVRAALHRDMHRHNARKLAAS-SSDGTVSAPVSPTTVPGEFLMTLAIGTPPL 96
Query: 103 ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS 162
LA+ADTGSDLIWTQC PC QC+ Q +PL++P S+T+ +LPC+SS L +C+
Sbjct: 97 PFLAIADTGSDLIWTQCAPC-SRQCFQQPTPLYNPSSSTTFSALPCNSS--LGLCAPACA 153
Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGI 221
C Y+++YG G ++ TET T GS+T V +PGI FGC + G S +G+
Sbjct: 154 ---CMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGL 209
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVV-STPLTKA 276
VGLG G +SL+SQ+ A KFSYCL P S++ + G + ++ GVV STP +
Sbjct: 210 VGLGRGSLSLVSQLG---APKFSYCLTPYQDTNSTSTLLLGPSASLNDTGVVSSTPFVAS 266
Query: 277 KT--FYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVM 324
+ +Y L + IS+G L + T ++IDSGTT+T L + + +
Sbjct: 267 PSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAV 326
Query: 325 SSMIEAQPVADPTGS--LELCYSFNSLS----QVPEVTIHFRGADVKLSRSNFFVKVSED 378
S++ P D + + L+LC+ S + +P +T+HF GAD+ L N+ + +S+
Sbjct: 327 LSLVTL-PTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFDGADMVLPADNYMMSLSDP 385
Query: 379 IV-----CSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
C + T++ V I GN Q N + YD+ ++T+SF P C+
Sbjct: 386 DSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCS 437
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 142/429 (33%), Positives = 208/429 (48%), Gaps = 40/429 (9%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
I + G +V L HR P SP +S + P + + L R R H
Sbjct: 45 ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102
Query: 70 ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
Q S +SSS ++ + Y+I + +GTP + DTGSD+ W QC PCP
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
CY Q LFDP SSTY+++ C++++CA L Q+ C N CQY V YGDGS +NG
Sbjct: 163 CYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
+ +T+TL +G + A+ G FGC G F+ +T G++GLGGG SL+SQ
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHVESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278
Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP 299
FSYCL P S + G G V+T + +++ TFY + I+VG ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLS-P 337
Query: 300 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 352
+ V+DSGT +T LP S L S + ++ A L+ C+ F +Q +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
P V + F GA + L + + + G + I GN+ Q F V YD+
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454
Query: 412 TVSFKPTDC 420
T+ F+ C
Sbjct: 455 TLGFRSGAC 463
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 145/422 (34%), Positives = 205/422 (48%), Gaps = 33/422 (7%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
EA G + L H SP + + + + + R +RLN ++ + S S
Sbjct: 65 EALKPGVKIRLDHIHGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSN 124
Query: 82 ADIIPNN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ P + NY++ GTP L + DTGSD+ W QC+PC S CY Q P+F+P
Sbjct: 125 LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPC--SDCYSQVDPIFEP 182
Query: 138 KMSSTYKSLPCSSSQCASL-NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+ SS+YK L C SS C L C C Y ++YGDGS S G+ + ET+TLGS +
Sbjct: 183 QQSSSYKHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS--- 239
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-VSSTKI 255
P FGCG N GLF + G++GLG +S SQ ++ G+FSYCL VSST
Sbjct: 240 --FPSFAFGCGHTNTGLFKG-SAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTST 296
Query: 256 NFGTNGIVSGPGVVS-TPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSG 306
+ G S P + PL + +FY + ++ ISVG +RL + + ++DSG
Sbjct: 297 GSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSG 356
Query: 307 TTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GA 362
T +T L PQ Y++ L + S P A P L+ CY +S SQV P +T HF+ A
Sbjct: 357 TVITRLVPQAYDA-LKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNA 415
Query: 363 DVKLSRSNFFVKVSED--IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPT 418
DV +S + D VC F + S+ I GN Q V +D + F P
Sbjct: 416 DVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPG 475
Query: 419 DC 420
C
Sbjct: 476 SC 477
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 146/422 (34%), Positives = 204/422 (48%), Gaps = 45/422 (10%)
Query: 31 VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
+ L HR P +P +S +P L D L R + + S +++ A S+
Sbjct: 56 LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 114
Query: 82 ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
A +P N Y++ +S+GTP + DTGSD+ W QC+PCP CY Q PL
Sbjct: 115 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 174
Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
FDP SS+Y ++PC+++ C+ +L CSG C Y VSYGDGS + G +++T+TL +
Sbjct: 175 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 234
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
AL G FGCG GLF + G++GLG SL+SQ +T G FSYCL P +
Sbjct: 235 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN 289
Query: 253 TKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
+ G S G +TPL A T+Y++ + ISVG Q L + V+D+
Sbjct: 290 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDT 349
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-R 360
GT +T LP S L S + + P A TG L+ CY F V P ++I F
Sbjct: 350 GTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGG 409
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
GA + L S C F G + I GN+ Q +F V +D TV F P
Sbjct: 410 GAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPA 462
Query: 419 DC 420
C
Sbjct: 463 SC 464
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 135/404 (33%), Positives = 188/404 (46%), Gaps = 99/404 (24%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GFS++LIHRDSP SPFYN S TP +R+ DA S N+N K ++ +IPN
Sbjct: 28 GFSIDLIHRDSPLSPFYNPSLTPSERITDAALSS-------NEN------KLPESILIPN 74
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N YL+R+ IGTPP ERL +ADTGSD IW QC P
Sbjct: 75 NGEYLMRLYIGTPPVERLVIADTGSDFIWVQCS--------------------------P 108
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG-QAVALPGITFGC 206
C + QC LN Y + SF+ + TET++ ST G Q V+ P FGC
Sbjct: 109 CQNCQCVYLN-------------IYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGC 155
Query: 207 GTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
G NN F S K TG+VGL G +SL+SQ+ I KFSY + FG+ I++
Sbjct: 156 GANNNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGYKFSY---------LKFGSEAIIT 206
Query: 265 GPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 322
GVVSTPL + Y L ++ +++G + + T
Sbjct: 207 TNGVVSTPLIIKPSLPLYFLNLEVVTIGQKVVPTET------------------------ 242
Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSED--IV 380
+ + V D + C+ + VP + F GA V L N +K+ + +
Sbjct: 243 -----LGVESVQDLPFPFKFCFPYRDNMTVPAIAFQFTGASVALRPKNLLIKLQDRNMLX 297
Query: 381 CSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+V + + + I+G I Q +F V YD++ + VS PTDCTK
Sbjct: 298 LAVVPSASSLSVISIFGIIAQFDFQVLYDLDGKKVSVAPTDCTK 341
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 144/417 (34%), Positives = 200/417 (47%), Gaps = 32/417 (7%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
S+E++HR P N + + L + +R++ + S + +P
Sbjct: 63 LSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATLPV 122
Query: 87 ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ +Y + + +GTP E + DTGSDL WTQCEPC + CY Q P DP S
Sbjct: 123 QSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKT-CYKQKEPRLDPTKS 181
Query: 141 STYKSLPCSSSQCASLNQ---KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++YK++ CSS+ C L+ +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSN---- 237
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
FGCG N GLF G++GLG +SL SQ FSYCL SS+K
Sbjct: 238 VFKNFLFGCGQQNSGLFRG-AAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYL 296
Query: 258 GTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTL 309
G VS V TPL+ K+ FY L I +SVG +L + ST VIDSGT +
Sbjct: 297 SFGGQVS-KTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVI 355
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGA-DVKL 366
T LP S L S ++ P D + CY F N ++P+V + F+G ++ +
Sbjct: 356 TRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDI 415
Query: 367 SRSNFFVKVSE-DIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S V+ VC F G + V I+GN Q + V YD + V F P+ C
Sbjct: 416 DVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 147/419 (35%), Positives = 198/419 (47%), Gaps = 43/419 (10%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R H + S + KA+ A +
Sbjct: 66 LRLTHRHGPCAPLRASSLAA-PSVADTLRADQRRAEHILRRVSGRGAPQLWDYKAAAATV 124
Query: 85 IPN------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
N +NY++ S+GTP + DTGSDL W QC+PC CY Q PLFDP
Sbjct: 125 PANWGYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPA 184
Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
SS+Y ++PC S CA L +CS C Y VSYGDGS + G +++T+TL +
Sbjct: 185 QSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAAN---- 240
Query: 197 VALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
+ G FGCG +GGLF + G++G G SL+ Q G FSYCL P S+
Sbjct: 241 ATVQGFLFGCGHAQSGGLF-TGIDGLLGFGREQPSLVQQTAGAYGGVFSYCL-PTKSSTT 298
Query: 256 NFGTNGIVSG--PGVVST---PLTKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSG 306
+ T G SG PG +T P A T+YV+ + ISVG Q L V V+D+G
Sbjct: 299 GYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTG 358
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGAD 363
T +T LP + L S S + + P A P G L+ CYSF V V + F GA
Sbjct: 359 TVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGAT 418
Query: 364 VKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ L C F G S+ I GN+ Q +F V I+ +V F+P+ C
Sbjct: 419 MTLGADGIM-----SFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVGFRPSSC 470
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 144/428 (33%), Positives = 219/428 (51%), Gaps = 56/428 (13%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
P+ TG LIH+DS S YQ L R+ + R R F +
Sbjct: 37 KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 76
Query: 79 ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
QA+++ ++ +L+ S+G PP +L DTGSDL+W QC PC + C+ Q +P+FD
Sbjct: 77 EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 134
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P SSTY L S C + QK + +N C Y+ SY DGS S+GNLATE + ++
Sbjct: 135 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 194
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
V + + FGCG +N G F+ + +GI+GL GD S++S++ +FSYC+ +
Sbjct: 195 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL----GSRFSYCIGDLFDP-- 248
Query: 256 NFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
++ N +V G GV STP FY +T++ ISVG RL ++ P+
Sbjct: 249 HYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDIN-PEVFQRTESGQGG 307
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYS---FNSLSQVPEV 355
+V+DSGTT TFL + L + + ++ Q V T LCY L PE+
Sbjct: 308 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPEL 367
Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQT 412
HF GAD+ L ++ FV+ ++D+ C +V + + N + G + Q ++ V YD+ +
Sbjct: 368 AFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKR 427
Query: 413 VSFKPTDC 420
V F+ TDC
Sbjct: 428 VYFQRTDC 435
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 152/437 (34%), Positives = 210/437 (48%), Gaps = 45/437 (10%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLNR-------L 65
V+SP A T S+ + HR S N T RL A S++
Sbjct: 51 VLSP-RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLTT 109
Query: 66 NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
NH +Q+ S + + + NY++ + +GTP + + DTGSDL WTQC+PC +
Sbjct: 110 NHVSQSQSTDLPAKDGSTL--GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT 167
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNG 180
CY Q P+F+P S++Y ++ CSS+ C SL N SCS NC Y + YGD SFS G
Sbjct: 168 -CYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVG 226
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
LA + TL S+ G+ FGCG NN GLF + G++GLG +S SQ T
Sbjct: 227 FLAKDKFTLTSSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYN 281
Query: 241 GKFSYCLVPVSST---KINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRL 294
FSYCL P S++ + FG+ GI V TP +T +FY L I AI+VG Q+L
Sbjct: 282 KIFSYCL-PSSASYTGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKL 338
Query: 295 GV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
+ STP +IDSGT +T LP + L S + + P L+ C+ +
Sbjct: 339 PIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGF 398
Query: 350 SQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLV 404
V P+V F GA V+L F VC F G ++ + I+GN+ Q V
Sbjct: 399 KTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEV 458
Query: 405 GYDIEQQTVSFKPTDCT 421
YD V F P C+
Sbjct: 459 VYDGAGGRVGFAPNGCS 475
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 136/360 (37%), Positives = 196/360 (54%), Gaps = 41/360 (11%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y+++IS+GTPP + A+ DTGSDL W QC PC ++C+ Q PLF P SS+Y + C
Sbjct: 6 GEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPC--ARCFEQPDPLFIPLASSSYSNASC 63
Query: 149 SSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ S C +L + +CS N C YS SYGDGS + G+ A ETVTL +T L I FGCG
Sbjct: 64 TDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGST-----LARIGFGCG 118
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST----KINFGTNGIV 263
N G F + G++GLG G +SL SQ+ ++ FSYCLV S+T I FG
Sbjct: 119 HNQEGTF-AGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAEN 177
Query: 264 SGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP------------DIVIDSGTT 308
S TPL + + ++Y + +++ISVGN+R V TP +++DSGTT
Sbjct: 178 SRASF--TPLLQNEDNPSYYYVGVESISVGNRR--VPTPPSAFRIDANGVGGVILDSGTT 233
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQ----VPEVTIHFRGAD 363
+T+ +L+ + I + P ADPT L LCY +S+S +P +T+H D
Sbjct: 234 ITYWRLAAFIPILAELRRQI-SYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVD 292
Query: 364 VKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
++ SN +V V + VC+ ++ I GN+ Q N L+ D+ V F TDC+
Sbjct: 293 FEIPVSNLWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 214/388 (55%), Gaps = 48/388 (12%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+SS A A + A YL+ ++IGTPP +A+ADTGSDL WTQC+PC C+ QD+P+
Sbjct: 79 TSSNAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPC--KLCFPQDTPI 136
Query: 135 FDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTL 189
+D S+++ +PC+S+ C + + ++C+ C+Y +Y DG++S G L TET+T
Sbjct: 137 YDTAASASFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTF 196
Query: 190 GSTT----GQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
++ G V++ G+ FGCG +NGGL +NS TG VGLG G +SL++Q+ GKFS
Sbjct: 197 AGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQLGV---GKFS 251
Query: 245 YCLVPVSSTKIN----FGTNGIVSGP------GVVSTPLTKA---KTFYVLTIDAISVGN 291
YCL +T + FG+ ++ P V STPL + + Y ++++ IS+G+
Sbjct: 252 YCLTDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGD 311
Query: 292 QRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
RL + + +++DSGT T L + +++ ++ ++ QPV + +
Sbjct: 312 ARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLN-QPVVNASSLDS 370
Query: 342 LCYSFNS----LSQVPEVTIHFR-GADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIY 394
C+ + L +P++ +HF GAD++L R N+ F + S ++ + I
Sbjct: 371 PCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSIL 430
Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
GN Q N + +DI +SF PTDC+K
Sbjct: 431 GNFQQQNIQMLFDITVGQLSFVPTDCSK 458
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 143/432 (33%), Positives = 205/432 (47%), Gaps = 59/432 (13%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA 89
S+E+IH+ P S +L RS +R +Q+ S +S S+ P +
Sbjct: 67 SLEVIHKHGPCS-----------KLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADG 115
Query: 90 ---------------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
NY++ + +GTP + + DTGSDL WTQCEPC CY
Sbjct: 116 GKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCY 174
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLA 183
Q P+F+P S++Y ++ CSS C L N SCS C Y + YGD S+S G A
Sbjct: 175 HQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFA 234
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
+ + L ST FGCG NN GLF G++GLG +SL+SQ F
Sbjct: 235 QDKLALTSTD----VFNNFLFGCGQNNRGLFVG-VAGLIGLGRNALSLVSQTAQKYGKLF 289
Query: 244 SYCLVPVSSTK--INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-- 296
SYCL SS+ + FG+ G S V TP ++ +FY L + AISVG ++L
Sbjct: 290 SYCLPSTSSSTGYLTFGSGGGTS-KAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSA 348
Query: 297 ---STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 351
ST +IDSGT ++ LP S+L + + P A P L+ CY F+
Sbjct: 349 SVFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVD 408
Query: 352 VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDI 408
VP++ ++F GA++ L S F ++ VC F G +++ + I GN+ Q F V YD+
Sbjct: 409 VPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDV 468
Query: 409 EQQTVSFKPTDC 420
+ F P C
Sbjct: 469 AGGRIGFAPGGC 480
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 144/446 (32%), Positives = 207/446 (46%), Gaps = 56/446 (12%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A + G + ++HR P SP ++ P + L NR+ + S +++ +
Sbjct: 83 ASSSGTRMTIVHRHGPCSPLADAHGKPPSH-DEILAADQNRVESIHHRVSTTATVRGKPK 141
Query: 84 IIPN---------------------------------NANYLIRISIGTPPTERLAVADT 110
P+ NY++ I +GTP + V DT
Sbjct: 142 RRPSPSRRQQQPSAPAPAASLSSSTASLPASSGRALGTGNYVVTIGLGTPASRYTVVFDT 201
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSV 170
GSD W QC+PC CY Q LFDP SSTY ++ C++ C+ L + CSG +C YSV
Sbjct: 202 GSDTTWVQCQPCV-VVCYKQQEKLFDPARSSTYANVSCAAPACSDLYTRGCSGGHCLYSV 260
Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
YGDGS+S G A +T+TL S A+ G FGCG N GLF + G++GLG G S
Sbjct: 261 QYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGCGERNEGLFG-EAAGLLGLGRGKTS 315
Query: 231 LISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDA 286
L Q G F++CL SS ++FG + +TP+ TFY + +
Sbjct: 316 LPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTG 375
Query: 287 ISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADPTGS 339
I VG Q L + ST ++DSGT +T LP S+L S +S + A+ A
Sbjct: 376 IRVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSL 435
Query: 340 LELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKG--ITNSVPIY 394
L+ CY F +S+V P+V++ F+ GA + ++ S S VC F + V I
Sbjct: 436 LDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIV 495
Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN F V YDI ++TV F P C
Sbjct: 496 GNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 144/428 (33%), Positives = 219/428 (51%), Gaps = 56/428 (13%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
P+ TG LIH+DS S YQ L R+ + R R F +
Sbjct: 5 KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 44
Query: 79 ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
QA+++ ++ +L+ S+G PP +L DTGSDL+W QC PC + C+ Q +P+FD
Sbjct: 45 EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P SSTY L S C + QK + +N C Y+ SY DGS S+GNLATE + ++
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
V + + FGCG +N G F+ + +GI+GL GD S++S++ +FSYC+ +
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL----GSRFSYCIGDLFDP-- 216
Query: 256 NFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
++ N +V G GV STP FY +T++ ISVG RL ++ P+
Sbjct: 217 HYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDIN-PEVFQRTESGQGG 275
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEV 355
+V+DSGTT TFL + L + + ++ Q V T LCY L PE+
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPEL 335
Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQT 412
HF GAD+ L ++ FV+ ++D+ C +V + + N + G + Q ++ V YD+ +
Sbjct: 336 AFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKR 395
Query: 413 VSFKPTDC 420
V F+ TDC
Sbjct: 396 VYFQRTDC 403
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 144/428 (33%), Positives = 219/428 (51%), Gaps = 56/428 (13%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
P+ TG LIH+DS S YQ L R+ + R R F +
Sbjct: 5 KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAFIXDEI----- 46
Query: 79 ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
QA+++ ++ +L+ S+G PP +L DTGSDL+W QC PC + C+ Q +P+FD
Sbjct: 47 --QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P SSTY L S C + QK + +N C Y+ SY DGS S+GNLATE + ++
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
V + + FGCG +N G F+ + +GI+GL GD S++S++ +FSYC+ +
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL----GSRFSYCIGDLFDP-- 216
Query: 256 NFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
++ N +V G GV STP FY +T++ ISVG RL ++ P+
Sbjct: 217 HYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDIN-PEVFQRTESGQGG 275
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEV 355
+V+DSGTT TFL + L + + ++ Q V T LCY L PE+
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPEL 335
Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQT 412
HF GAD+ L ++ FV+ ++D+ C +V + + N + G + Q ++ V YD+ +
Sbjct: 336 AFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKR 395
Query: 413 VSFKPTDC 420
V F+ TDC
Sbjct: 396 VYFQRTDC 403
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 153/414 (36%), Positives = 213/414 (51%), Gaps = 30/414 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G +V L HR P SP + T +RLR R+ F+ I S A+
Sbjct: 54 GVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGAGDIEQSDAATVPTTL 113
Query: 87 NNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ Y+I + IG+P + DTGSD+ W QC+PC SQC+ + LFDP SST
Sbjct: 114 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLFDPSSSST 171
Query: 143 YKSLPCSSSQCASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
Y CSS+ CA L+Q C CQY V+YGD S + G +++T+TLGS+ A
Sbjct: 172 YSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSS-----A 226
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
+ FGC + G FN +T G++GLGGG SL SQ T FSYCL P S + F
Sbjct: 227 MTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSS-GFL 285
Query: 259 TNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVST----PDIVIDSGTTLTF 311
T G S G V TP+ T+ T+YV+ +++I VG+Q+L + T ++DSGT +T
Sbjct: 286 TLGTGSS-GFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTIITR 344
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSR 368
LP S L S + ++ P A P+G L+ C+ F+ S +P VT+ F GA V L+
Sbjct: 345 LPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAF 404
Query: 369 SNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+++S I C F G +S+ I GN+ Q F V YD+ V FK C
Sbjct: 405 DGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 178/348 (51%), Gaps = 21/348 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC + CY Q LFDP SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 233
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 234 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF + G++GLG G SL Q G F++CL P ST + G S P
Sbjct: 290 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL-PARSTGTGYLDFGAGSPP 347
Query: 267 GVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSN 319
+TP+ TFY + + I VG + L + + ++DSGT +T LP S+
Sbjct: 348 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSS 407
Query: 320 LLSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 374
L S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 408 LRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 467
Query: 375 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VS VC F G + V I GN F V YDI ++ V F P C
Sbjct: 468 VSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 147/423 (34%), Positives = 206/423 (48%), Gaps = 40/423 (9%)
Query: 30 SVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISSSKA---- 79
S+ + HR S N T RL A S++ +L+ +S SK+
Sbjct: 33 SLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLP 92
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
++ + NY++ + +GTP + + DTGSDL WTQC+PC + CY Q P+F+P
Sbjct: 93 AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKEPIFNPSK 151
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S++Y ++ CSS+ C SL N SCS NC Y + YGD SFS G LA E TL ++
Sbjct: 152 STSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSD- 210
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST- 253
G+ FGCG NN GLF + G++GLG +S SQ T FSYCL P S++
Sbjct: 211 ---VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-PSSASY 265
Query: 254 --KINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRLGV-----STPDIVI 303
+ FG+ GI V TP +T +FY L I AI+VG Q+L + STP +I
Sbjct: 266 TGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALI 323
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR- 360
DSGT +T LP + L S + + P L+ C+ + V P+V F
Sbjct: 324 DSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSG 383
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
GA V+L F VC F G ++ + I+GN+ Q V YD V F P
Sbjct: 384 GAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 443
Query: 419 DCT 421
C+
Sbjct: 444 GCS 446
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 178/348 (51%), Gaps = 21/348 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC + CY Q LFDP SSTY ++
Sbjct: 179 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 237
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 238 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 293
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF + G++GLG G SL Q G F++CL P ST + G S P
Sbjct: 294 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL-PARSTGTGYLDFGAGSPP 351
Query: 267 GVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSN 319
+TP+ TFY + + I VG + L + + ++DSGT +T LP S+
Sbjct: 352 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSS 411
Query: 320 LLSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 374
L S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 412 LRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 471
Query: 375 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VS VC F G + V I GN F V YDI ++ V F P C
Sbjct: 472 VSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 141/417 (33%), Positives = 199/417 (47%), Gaps = 36/417 (8%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN----SSISSSK---------A 79
++HR P SP ++ + + L NR + +++S K A
Sbjct: 91 IVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPA 150
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
S + NY++ I +GTP V DTGSD W QCEPC CY Q LFDP
Sbjct: 151 SSGSAL-GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYKQQEKLFDPAR 208
Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
SSTY ++ C++ C+ L K CSG +C Y V YGDGS+S G A +T+TL S A+
Sbjct: 209 SSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AI 264
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INF 257
G FGCG N GL+ + G++GLG G SL Q G F++C SS ++F
Sbjct: 265 KGFRFGCGERNEGLYG-EAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDF 323
Query: 258 GTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLT 310
G + + ++TP+ TFY + + I VG + L + +T ++DSGT +T
Sbjct: 324 GPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVIT 383
Query: 311 FLPQGYNSNLLSVM-SSMIEAQPVADPTGS-LELCYSFNSLSQV--PEVTIHFR-GADVK 365
LP S+L S S+M E P S L+ CY F +S+V P V++ F+ GA +
Sbjct: 384 RLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLD 443
Query: 366 LSRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ S S C F G + V I GN F V YDI ++ V F P C
Sbjct: 444 VHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 142/424 (33%), Positives = 212/424 (50%), Gaps = 44/424 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNRLNHFNQNSSISSSKA 79
G L H SP SP SS+ P+ R+ +R + + SS+ +
Sbjct: 41 GLHQTLHHPQSPCSPAPLSSDLPFSAFITHDAARIAGLASRLATKDKDWVAASSVPLASG 100
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+ + NY+ R+ +GTP T + V D+GS L W QC PC S C+ Q PL+DP+
Sbjct: 101 ASVGV----GNYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVS-CHPQAGPLYDPRA 155
Query: 140 SSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
SSTY ++PCS+ QC A+LN SCSG CQY SYGDGSFS G L+ +TV+L S+
Sbjct: 156 SSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSG 215
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPV 250
+ PG +GCG +N GLF + G++GL +SL+SQ+ ++ F+YCL
Sbjct: 216 ----SFPGFYYGCGQDNVGLFG-RAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAA 270
Query: 251 SSTKINFGTNGIVSGPG------VVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD---- 300
S+ ++FG+N PG +VS+ L + Y +++ +SV L V + +
Sbjct: 271 SAGYLSFGSNSDNKNPGKYSYTSMVSSSLD--ASLYFVSLAGMSVAGSPLAVPSSEYGSL 328
Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIH 358
+IDSGT +T LP + L + + + A + L+ C+ VP V +
Sbjct: 329 PTIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYS-ILQTCFKGQVAKLPVPAVNMA 387
Query: 359 FR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
F GA ++L+ N V V+E C F T+S I GN Q F V YD++ + F
Sbjct: 388 FAGGATLRLTPGNVLVDVNETTTCLAFA-PTDSTAIIGNTQQQTFSVVYDVKGSRIGFAA 446
Query: 418 TDCT 421
C+
Sbjct: 447 GGCS 450
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 149/430 (34%), Positives = 208/430 (48%), Gaps = 40/430 (9%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISS 76
A T S+ + HR S N T RL A S++ +L+ +S
Sbjct: 54 RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSE 113
Query: 77 SKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
SK+ ++ + NY++ + +GTP + + DTGSDL WTQC+PC + CY Q
Sbjct: 114 SKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKE 172
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
P+F+P S++Y ++ CSS+ C SL N SCS NC Y + YGD SFS G LA E
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKF 232
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
TL ++ G+ FGCG NN GLF + G++GLG +S SQ T FSYCL
Sbjct: 233 TLTNSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL 287
Query: 248 VPVSST---KINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRLGV----- 296
P S++ + FG+ GI V TP +T +FY L I AI+VG Q+L +
Sbjct: 288 -PSSASYTGHLTFGSAGISR--SVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF 344
Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
STP +IDSGT +T LP + L S + + P L+ C+ + V P+
Sbjct: 345 STPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPK 404
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 411
V F GA V+L F VC F G ++ + I+GN+ Q V YD
Sbjct: 405 VAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGG 464
Query: 412 TVSFKPTDCT 421
V F P C+
Sbjct: 465 RVGFAPNGCS 474
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 194 bits (492), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 148/427 (34%), Positives = 221/427 (51%), Gaps = 42/427 (9%)
Query: 21 PIEAQTGGFSVELIHRDSPKSP-----FYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
P A++ GFS +I R + F ++ ++RL +RS ++++ Q+SS S
Sbjct: 22 PAHAESRGFSGTMIRRGRTDTTTAAINFTQAALESHRRLSFLASRS-SQVDK-PQSSSAS 79
Query: 76 SSKASQADIIP-----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ D +P Y + SIGTPP + A+ADTGSDLIWT+C+
Sbjct: 80 QLSNNDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAG--GGAAWG 137
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-----GVNCQYSVSYG---DGSFSNGNL 182
S + P SST+ LPCS CA+L S + G C Y +YG D F+ G L
Sbjct: 138 GSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFL 197
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
+ET TLG A+PG+ FGC T G + + G+VGLG G +SL+SQ+ AG
Sbjct: 198 GSETFTLGGD-----AVPGVGFGCTTALEGDYG-EGAGLVGLGRGPLSLVSQLD---AGT 248
Query: 243 FSYCLVPVSS--TKINFGTNGIV--SGPGVVSTPLTKAKTFYVLTIDAISVGNQRLG--V 296
F YCL +S + + FG + +G GV ST L + TFY + + +I++G+
Sbjct: 249 FMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVG 308
Query: 297 STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQVPE 354
+V DSGTTLT+L + Y + +S PV G E CY +S +P
Sbjct: 309 GPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYG-FEACYEKPDSARLIPA 367
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+ +HF GAD+ L +N+ V+V + +VC V + + S+ I GNIMQ N+LV +D+ + +
Sbjct: 368 MVLHFDGGADMALPVANYVVEVDDGVVCWVVQ-RSPSLSIIGNIMQMNYLVLHDVRKSVL 426
Query: 414 SFKPTDC 420
SF+P +C
Sbjct: 427 SFQPANC 433
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 174/351 (49%), Gaps = 24/351 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ LN CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
G N GLF + G++GLG G SL Q G F++CL P ST ++FG +
Sbjct: 291 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSLA 348
Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
+ ++TP+ TFY + + I VG Q L + +T ++DSGT +T LP
Sbjct: 349 AARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408
Query: 317 NSNL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
S+L + A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 409 YSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGI 468
Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S VC F + V I GN F V YDI ++ V F P C
Sbjct: 469 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/351 (36%), Positives = 180/351 (51%), Gaps = 26/351 (7%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 233
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ + CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 234 SCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
G N GLF + G++GLG G SL Q G F++CL P ST ++FG
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSPA 347
Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
+ + +TP+ TFY + + I VG + L + +T ++DSGT +T LP
Sbjct: 348 A--RLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAA 405
Query: 317 NSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
S+L S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 406 YSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGI 465
Query: 372 FVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S VC F + V I GN F V YDI ++ VSF P C
Sbjct: 466 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 138/417 (33%), Positives = 204/417 (48%), Gaps = 43/417 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPY--QRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
SV L+HR P +P SS+ P +RLR + RS ++ ++ N SI + D +
Sbjct: 60 SVPLVHRHGPCAPSTRSSDEPSLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSL- 118
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y++ + +GTP ++ + DTGSDL W QC PC + CY Q PLFDP SSTY +
Sbjct: 119 ---EYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPI 175
Query: 147 PCSSSQCASLNQK---------SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
PC++ C L + S G C Y+++YGDGS + G + ET+T+ V
Sbjct: 176 PCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAP----GV 231
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
+ FGCG + G N K G++GLGG SL+ Q + G FSYCL P ++ + F
Sbjct: 232 TVKDFHFGCGHDQDGP-NDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCL-PAANDQAGF 289
Query: 258 GTNG--IVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLT 310
G + G V TP+ + +TFYV+ + I+VG + + V + ++IDSGT +T
Sbjct: 290 LALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVT 349
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSR 368
L + L + + A P+ P G L+ CY+F S VP V + F G
Sbjct: 350 ELQHTAYAALQAAFRKAMAAYPLL-PNGELDTCYNFTGHSNVTVPRVALTFSGG------ 402
Query: 369 SNFFVKVSEDIV---CSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + V + I+ C F+ G N I GN+ Q V YD+ V F C
Sbjct: 403 ATVDLDVPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 126/367 (34%), Positives = 185/367 (50%), Gaps = 32/367 (8%)
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+Q+ + NY++ + +GTP + + DTGSDL WTQC+PC S CY Q P+FDP
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPSA 201
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S TY ++ C+S+ C+ L N CS NC Y + YGD SF+ G A +T+TL
Sbjct: 202 SKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTL----T 257
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---S 251
Q G FGCG NN GLF KT G++GLG +S++ Q FSYCL P S
Sbjct: 258 QNDVFDGFMFGCGQNNRGLFG-KTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL-PTSRGS 315
Query: 252 STKINFGT-NGIVSGP----GVVSTPL--TKAKTFYVLTIDAISVGNQRLGVS-----TP 299
+ + FG NG+ + G+ TP ++ TFY + + ISVG + L +S
Sbjct: 316 NGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNA 375
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 357
+IDSGT +T LP +L S + P A L+ CY ++ + +P+++
Sbjct: 376 GTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISF 435
Query: 358 HFRG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
+F G A+V L + + VC F G +++ I+GNI Q V YD+ +
Sbjct: 436 NFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLG 495
Query: 415 FKPTDCT 421
F C+
Sbjct: 496 FGYKGCS 502
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 178/348 (51%), Gaps = 21/348 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC + CY Q LFDP SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 234
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 235 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF + G++GLG G SL Q G F++CL P ST + G S P
Sbjct: 291 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL-PPRSTGTGYLDFGAGSPP 348
Query: 267 GVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSN 319
+TP+ TFY + + I VG + L + + ++DSGT +T LP S+
Sbjct: 349 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSS 408
Query: 320 LLSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 374
L S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 409 LRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 468
Query: 375 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VS VC F G + V I GN F V YDI ++ V F P C
Sbjct: 469 VSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 139/422 (32%), Positives = 205/422 (48%), Gaps = 43/422 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRL--RDALTRSL--NRLNHFNQNSSISSSKASQADII 85
S+ L+HRD+ Y S+ L RD RL+ + + S S I
Sbjct: 70 SLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVS--GIS 127
Query: 86 PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ Y +R+ +G+PPTE+ V D+GSD+IW QC PC ++CY Q PLFDP S+++ +
Sbjct: 128 EGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPC--AECYQQADPLFDPAASASFTA 185
Query: 146 LPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+PC S C +L S + C+Y VSYGDGS++ G LA ET+T G +T + G+
Sbjct: 186 VPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDST----PVQGV 241
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
GCG N GLF G++GLG G +SL+ Q+ G FSYCL +S + G +
Sbjct: 242 AIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCL---ASRGADAGAGSL 297
Query: 263 VSGP------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD----IVI 303
V G G V PL + +FY + + + VG +RL + T D +V+
Sbjct: 298 VFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVM 357
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTIHF- 359
D+GT +T LP + L +S I P A L+ CY + + +VP V ++F
Sbjct: 358 DTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFG 417
Query: 360 -RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
GA + L N V++ + C F + + I GNI Q + D V F P+
Sbjct: 418 RDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPS 477
Query: 419 DC 420
C
Sbjct: 478 TC 479
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/405 (33%), Positives = 194/405 (47%), Gaps = 56/405 (13%)
Query: 49 TPYQRLRDALTRSLNRLNHF-NQNSSISSSKASQADIIPNN-------ANYLIRISIGTP 100
T ++ LR RS R H + +++ A + P YL+ ++ GTP
Sbjct: 38 THWELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTP 97
Query: 101 PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS 160
P E DTGSD+ WTQC+ CP S C+ Q PLFDP SS++ SLPCSS C +
Sbjct: 98 PQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACET--TPP 155
Query: 161 CSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGG 212
C G N C YS+SYGDGS S G + E T S TG+ + A+PG+ FGCG N G
Sbjct: 156 CGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRG 215
Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-PGVV-- 269
+F S TGI G G G +SL SQ++ G FS+C ++ +K T+ ++ G PGV
Sbjct: 216 VFTSNETGIAGFGRGSLSLPSQLKV---GNFSHCFTTITGSK----TSAVLLGLPGVAPP 268
Query: 270 -STPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
++PL + + Y STP +SGT++T LP + ++ +
Sbjct: 269 SASPLGRRRGSYRCR-------------STPR-SSNSGTSITSLPPRTYRAVREEFAAQV 314
Query: 329 EAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSED------- 378
+ V C+S VP + +HF GA ++L + N+ +V +D
Sbjct: 315 KLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSS 374
Query: 379 -IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
I+C I I GNI Q N V YD++ +SF P C +
Sbjct: 375 RIICLAV--IEGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQCDQ 417
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 182/348 (52%), Gaps = 25/348 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ ++ +GTP T V DTGS L W QC PC S C+ Q PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYASVRCS 191
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+SQC A+LN +CS N C Y SYGD SFS G+L+T+TV+ GST P
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTR-----YPSFY 246
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
+GCG +N GLF ++ G++GL +SL+ Q+ ++ FSYCL P +++
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-PTAASTGYLSIGPYN 304
Query: 264 SGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
+G TP+ + + Y +T+ +SVG L VS + +IDSGT +T LP
Sbjct: 305 TGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTA 364
Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFV 373
++ L ++ + A L+ C+ S +VP V + F GA +KL+ N +
Sbjct: 365 VHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGASMKLTTRNVLI 424
Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V + C F T+S I GN Q F V YD+ Q + F C+
Sbjct: 425 DVDDSTTCLAF-APTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 133/431 (30%), Positives = 212/431 (49%), Gaps = 38/431 (8%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH-FNQNSSISS 76
+V AQ +LIH S SP++N + + +R + S R+ + + Q
Sbjct: 23 IVEAYNAQPKQLVTKLIHWGSILSPYFNPNASVAERAERIVKTSATRIAYLYAQIKGDIH 82
Query: 77 SKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+ +++P+ +L+ S+G P T +LA+ DTGS+++W +C PC +C Q+ PL
Sbjct: 83 MNDFELNLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPC--KRCTQQNGPL 140
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
DP SSTY SLPC+++ C C+ +N C Y++SY G S G LATE + S+
Sbjct: 141 LDPSKSSTYASLPCTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSD 200
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
A+P + FGC NG + + TG+ GLG G S +++M KFSYCL ++
Sbjct: 201 EGVNAVPSVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRM----GSKFSYCLGNIADP 256
Query: 254 KINFGTNGIVSGPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGVST---------PD 300
++G N +V G STPL Y +T++ ISVG +RL + +
Sbjct: 257 --HYGYNQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKS 314
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVT 356
+IDSGT LT+L + L + + +++ + GS CY ++SQ P VT
Sbjct: 315 ALIDSGTALTWLAESAFRALDNEVRQLLDGVLMPFWRGSFA-CYK-GTVSQDLIGFPVVT 372
Query: 357 IHFR-GADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIE 409
HF GAD+ L + F + + DI+C S + S + G + Q + + YD+
Sbjct: 373 FHFSGGADLDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLN 432
Query: 410 QQTVSFKPTDC 420
+ F+ DC
Sbjct: 433 SNKLFFQRIDC 443
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 182/348 (52%), Gaps = 25/348 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ ++ +GTP T V DTGS L W QC PC S C+ Q PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYTSVRCS 191
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+SQC A+LN +CS N C Y SYGD SFS G L+T+TV+ GST+ P
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS-----YPSFY 246
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
+GCG +N GLF ++ G++GL +SL+ Q+ ++ FSYCL P +++
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-PTAASTGYLSIGPYN 304
Query: 264 SGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
+G TP+ + + Y +T+ +SVG L VS + +IDSGT +T LP
Sbjct: 305 TGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTA 364
Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFV 373
++ L ++ + A L+ C+ S +VP V + F GA +KL+ N +
Sbjct: 365 VHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGASMKLTTRNVLI 424
Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V + C F T+S I GN Q F V YD+ Q + F C+
Sbjct: 425 DVDDSTTCLAF-APTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 178/352 (50%), Gaps = 26/352 (7%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPARSSTYANV 233
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C L+ + CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 234 SCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVS 264
G N GLF + G++GLG G SL Q G F++CL SS ++FG +
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 348
Query: 265 GPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYN 317
++TP+ TFY + + I VG Q L + +T ++DSGT +T LP
Sbjct: 349 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAY 408
Query: 318 SNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSN 370
S+L S S + A+ A L+ CY F +SQV P V++ F+G DV S
Sbjct: 409 SSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIM 468
Query: 371 FFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ VS+ VC F + V I GN F V YDI ++ V F P C
Sbjct: 469 YAASVSQ--VCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 152/463 (32%), Positives = 226/463 (48%), Gaps = 66/463 (14%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
+L L +++ A + IH D P +SE +R AL R ++R F
Sbjct: 6 VLLILACTILASDAAAAVRVGLTRIHAD----PEVTASEF----VRGALRRDMHRHARFA 57
Query: 70 QNSSISSSKAS---------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
+ SS A+ Q D+ N Y++ +SIGTPP A+ADTGSDLIWTQC
Sbjct: 58 REQLAPSSAAAAGLTVGAPTQKDLR-NGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116
Query: 121 PCPPS------QCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQYSVS 171
PC + QC+ Q L++P S+T+ LPC+S S CA++ S G C Y+ +
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQT 176
Query: 172 YGDGSFSNGNLATETVTLG-STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
YG G ++ G + ET T G S+T AV +P I FGC + +N + G+VGLG G +S
Sbjct: 177 YGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNG-SAGLVGLGRGSMS 234
Query: 231 LISQMRTTIAGKFSYCLVPV-------------SSTKINFGTNGIVSGPGVVSTPLTKAK 277
L+SQ+ AG FSYCL P S+ GT + S P V
Sbjct: 235 LVSQLG---AGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMS 291
Query: 278 TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFL-PQGYNSNLLSVMSS 326
T+Y L + ISVG L + T ++IDSGTT+T L Y +V S
Sbjct: 292 TYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSL 351
Query: 327 MIEAQPVA---DPTGSLELCYSFNSLS---QVPEVTIHFR-GADVKLSRSNFFVKVSEDI 379
++ P+A D + L+LC++ + + +P +T+HF GAD+ L N+ + + +
Sbjct: 352 LVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI-LGSGV 410
Query: 380 VCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
C + T ++ + GN Q N V YD+ ++T+SF P C+
Sbjct: 411 WCLAMRNQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVCS 453
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 151/424 (35%), Positives = 211/424 (49%), Gaps = 48/424 (11%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
G +V L HR P SP + + P L + L R R + + S + D+
Sbjct: 126 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 180
Query: 85 ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+P N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q P
Sbjct: 181 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 238
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
LFDP SSTY C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LG
Sbjct: 239 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 298
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--V 248
S+ A+ FGC G FN +T G++GLGGG SL+SQ T+ FSYCL
Sbjct: 299 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPT 352
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDI 301
P SS + G G G V TP+ ++ TFY + + AI VG ++L + +
Sbjct: 353 PSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGT 412
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
V+DSGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F
Sbjct: 413 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 472
Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L S + C F G ++ S+ I GN+ Q F V YD+ + V F+
Sbjct: 473 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 527
Query: 417 PTDC 420
C
Sbjct: 528 AGAC 531
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 125/367 (34%), Positives = 185/367 (50%), Gaps = 32/367 (8%)
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+Q+ + NY++ + +GTP + + DTGSDL WTQC+PC S CY Q P+FDP
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPST 201
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S TY ++ C+S+ C+SL N CS NC Y + YGD SF+ G A + +TL
Sbjct: 202 SKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTL----T 257
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---S 251
Q G FGCG NN GLF KT G++GLG +S++ Q FSYCL P S
Sbjct: 258 QNDVFDGFMFGCGQNNKGLFG-KTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL-PTSRGS 315
Query: 252 STKINFGT-NGIVSGP----GVVSTPL--TKAKTFYVLTIDAISVGNQRLGVS-----TP 299
+ + FG NG+ + G+ TP ++ +Y + + ISVG + L +S
Sbjct: 316 NGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNA 375
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 357
+IDSGT +T LP +L S + P A L+ CY ++ + +P+++
Sbjct: 376 GTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISF 435
Query: 358 HFRG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
+F G A+V+L + + VC F G +S+ I+GNI Q V YD+ +
Sbjct: 436 NFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLG 495
Query: 415 FKPTDCT 421
F C+
Sbjct: 496 FGYKGCS 502
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 141/424 (33%), Positives = 211/424 (49%), Gaps = 46/424 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
GF L H D+ N+ T Q L A+ RS R+ ++ + + + ++
Sbjct: 30 GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 83
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ YL+ + IG+PP A+ DTGSDLIWTQC PC C Q +P F+P S++Y SL
Sbjct: 84 SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 141
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PCSS+ C +L C C Y YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 142 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 200
Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG---- 258
G N G LFN +G+VG G G +SL+SQ+ + +FSYCL +++++ FG
Sbjct: 201 GNMNAGTLFNG--SGMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 255
Query: 259 ---TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
TN SGP V STP T Y L + ISV L + T +
Sbjct: 256 LNSTNTSSSGP-VQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGV 314
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQVPEVT 356
+IDSGTT+TFL Q + + + + + A P+ + + C+ + + +PE+
Sbjct: 315 IIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMV 374
Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+HF GAD++L N+ V + ++ I G+ NF + YD+E +SF
Sbjct: 375 LHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFV 434
Query: 417 PTDC 420
P C
Sbjct: 435 PAPC 438
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 150/424 (35%), Positives = 211/424 (49%), Gaps = 48/424 (11%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
G +V L HR P SP + + P L + L R R + + S + D+
Sbjct: 56 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110
Query: 85 ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+P N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
LFDP SSTY C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
S+ A+ FGC G FN +T G++GLGGG SL+SQ T+ FSYCL P
Sbjct: 229 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPT 282
Query: 251 SSTK--INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDI 301
S+ + G G G V TP+ ++ TFY + + AI VG ++L + +
Sbjct: 283 PSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGT 342
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
V+DSGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402
Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L S + C F G ++ S+ I GN+ Q F V YD+ + V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457
Query: 417 PTDC 420
C
Sbjct: 458 AGAC 461
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 137/409 (33%), Positives = 198/409 (48%), Gaps = 54/409 (13%)
Query: 60 RSLNRLNHFNQNSSI----SSSKASQADIIPN-------NANYLIRISIGTPPTERLAVA 108
RSL R ++ ++ +S +A+ A + P + YL+ ++IGTPP +
Sbjct: 373 RSLTRREVLHRMAARLLFSASGRAASARVDPGPYANGVPDTEYLVHLAIGTPPQPVQLIL 432
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
DTGSDL+WTQC PCP C+ + DP SST+ LPCSS C +L SC N
Sbjct: 433 DTGSDLVWTQCRPCP--VCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGN 490
Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTNNGGLFNSKTTGI 221
C Y +Y DGS + G+L ET T + TGQA +P + FGCG N G+F S TGI
Sbjct: 491 QTCVYVYAYADGSITTGHLDAETFTFAAADGTGQAT-VPDLAFGCGLFNNGIFTSNETGI 549
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVV-STPLTK 275
G G G +SL SQ++ FS+C + SS + N G V STPL +
Sbjct: 550 AGFGRGALSLPSQLKVD---NFSHCFTAITGSEPSSVLLGLPANLYSDADGAVQSTPLVQ 606
Query: 276 ---AKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLS 322
+ Y L++ I+VG+ RL + T +IDSGT +T LPQ +
Sbjct: 607 NFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHD 666
Query: 323 VMSSMIEAQPVADPTGS--LELCYSF----NSLSQVPEVTIHFRGADVKLSRSNFFVKVS 376
++ + PV + T S LC+SF + VP++ +HF GA + L R N+ +
Sbjct: 667 AFTAQVRL-PVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEFE 725
Query: 377 E---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ + C + + I GN Q N V YD+ + +SF P C +
Sbjct: 726 DAGGSVTCLAINA-GDDLTIIGNYQQQNLHVLYDLVRNMLSFVPAQCNR 773
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 141/395 (35%), Positives = 208/395 (52%), Gaps = 43/395 (10%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQAD--IIPN--NANYLIRISIGTPPTERLAVAD 109
++ A+ RS RL S++++ + + + P+ + YLI+++IGTP A+ D
Sbjct: 1 MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMD 60
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQY 168
TGSDL+WT+C PC + C SSTY + C SS C + SC+ +C+Y
Sbjct: 61 TGSDLVWTKCNPC--TDCSTSSIYDP--SSSSTYSKVLCQSSLCQPPSIFSCNNDGDCEY 116
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
YGD S ++G L+ ET ++ S +LP ITFGCG +N G K G+VG G G
Sbjct: 117 VYPYGDRSSTSGILSDETFSISSQ-----SLPNITFGCGHDNQGF--DKVGGLVGFGRGS 169
Query: 229 ISLISQMRTTIAGKFSYCLVPVS----STKINFGTNGIVSGPGVVSTPLTKAKT--FYVL 282
+SL+SQ+ ++ KFSYCLV + ++ + G + V STPL ++ + Y L
Sbjct: 170 LSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYYL 229
Query: 283 TIDAISVGNQRLGVST----------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
+++ ISVG Q L + T ++IDSGTTLTFL Q + M S I P
Sbjct: 230 SLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINL-P 288
Query: 333 VADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITN 389
AD G L+LC++ S P +T HF+GAD + + N+ F + DIVC TN
Sbjct: 289 QAD--GQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLFPDSTSDIVCLAMM-PTN 345
Query: 390 S----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S + I+GN+ Q N+ + YD E +SF PT C
Sbjct: 346 SNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 138/447 (30%), Positives = 216/447 (48%), Gaps = 62/447 (13%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN---------QNSS 73
+A G + L H D+ K + + +R A+ RS R + S
Sbjct: 28 DAFAGDVRLHLTHVDAGKQ------MSRRELIRRAMQRSKARAAALSVARSGSGRVPGKS 81
Query: 74 ISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
+ Q +P + YLI ++IGTPP A+ DTGSDLIWTQC PC + C
Sbjct: 82 AQQGEQHQQPGVPVRPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPC--ASCLA 139
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
Q PLF P SS+Y + CS C + SC + C Y +YGDG+ + G ATE T
Sbjct: 140 QPDPLFAPAASSSYVPMRCSGQLCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFT 199
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
S++G+ +++P + FGCGT N G N+ +GIVG G +SL+SQ+ +FSYCL
Sbjct: 200 FASSSGEKLSVP-LGFGCGTMNVGSLNNG-SGIVGFGRDPLSLVSQLSIR---RFSYCLT 254
Query: 249 PVSSTK---INFG--TNGIVSGPG-----VVSTPLTKAK---TFYVLTIDAISVGNQRLG 295
P +ST+ + FG ++G+ G V +T L +++ TFY + ++VG +RL
Sbjct: 255 PYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLR 314
Query: 296 VS------TPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
+ PD +++DSGT LT P + +L + + + + +C++
Sbjct: 315 IPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFA 374
Query: 346 -----------FNSLSQVPEVTIHFRGADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPI 393
++ VP + HF+GAD++L R N+ + +C + +S
Sbjct: 375 TPMAAGGRRASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGAT 434
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN +Q + V YD+E +T+SF P C
Sbjct: 435 IGNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 141/424 (33%), Positives = 211/424 (49%), Gaps = 46/424 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
GF L H D+ N+ T Q L A+ RS R+ ++ + + + ++
Sbjct: 27 GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 80
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ YL+ + IG+PP A+ DTGSDLIWTQC PC C Q +P F+P S++Y SL
Sbjct: 81 SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 138
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PCSS+ C +L C C Y YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 139 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 197
Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG---- 258
G N G LFN +G+VG G G +SL+SQ+ + +FSYCL +++++ FG
Sbjct: 198 GNMNAGTLFNG--SGMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 252
Query: 259 ---TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
TN SGP V STP T Y L + ISV L + T +
Sbjct: 253 LNSTNTSSSGP-VQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGV 311
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQVPEVT 356
+IDSGTT+TFL Q + + + + + A P+ + + C+ + + +PE+
Sbjct: 312 IIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMV 371
Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+HF GAD++L N+ V + ++ I G+ NF + YD+E +SF
Sbjct: 372 LHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFV 431
Query: 417 PTDC 420
P C
Sbjct: 432 PAPC 435
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 137/383 (35%), Positives = 204/383 (53%), Gaps = 26/383 (6%)
Query: 50 PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAV 107
P L A +S RL+ ++S ++Q + ++ Y + SIGTPP E A+
Sbjct: 39 PAINLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQLDSGGGAYDMTFSIGTPPQELSAL 98
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVN 165
ADTGSDLIW +C C ++C Q SP + P SS++ LPCS S C+ L CS G
Sbjct: 99 ADTGSDLIWAKCGAC--TRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAE 156
Query: 166 CQYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
C Y SYG S ++ G L +ET TLGS A+PGI FGC T +G+
Sbjct: 157 CDYKYSYGLASDPHHYTQGYLGSETFTLGSD-----AVPGIGFGC-TTMSEGGYGSGSGL 210
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLV--PVSSTKINFGTNGIVSGPGVVSTPLTKAKT- 278
VGLG G +SL+SQ+ G FSYCL ++ + FG+ G ++G GV STPL + T
Sbjct: 211 VGLGRGPLSLVSQLNV---GAFSYCLTSDAAKTSPLLFGS-GALTGAGVQSTPLLRTSTY 266
Query: 279 FYVLTIDAISVGNQ-RLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT 337
+Y + +++IS+G G + I+ DSGTT+ FL + + + S +A
Sbjct: 267 YYTVNLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGR 326
Query: 338 GSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 397
E+C+ S + P + +HF G D+ L N+F V + + C + + + S+ I GNI
Sbjct: 327 DGYEVCFQ-TSGAVFPSMVLHFDGGDMDLPTENYFGAVDDSVSCWIVQK-SPSLSIVGNI 384
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
MQ N+ + YD+E+ +SF+P +C
Sbjct: 385 MQMNYHIRYDVEKSMLSFQPANC 407
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 143/447 (31%), Positives = 199/447 (44%), Gaps = 55/447 (12%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS--- 77
PI A + V ++HR P SP + + L NR+ + S +++
Sbjct: 65 PITATSSAARVPIVHRHGPCSPLAGAHAGKPPSHAEILAADQNRVESLHHRVSSTTTGLG 124
Query: 78 -KASQADIIPNN------------------------ANYLIRISIGTPPTERLAVADTGS 112
K P + ANY++ I +GTPP+ V DTGS
Sbjct: 125 GKPRTKKKTPGHSSVPASSSSSSSSVPASSGLSLGTANYVVPIGLGTPPSRFTVVFDTGS 184
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
D W QC PC S CY Q LFDP SSTY ++ C+ CA L+ C+ +C Y + Y
Sbjct: 185 DTTWVQCRPCVVS-CYKQKDRLFDPAKSSTYANVSCADPACADLDASGCNAGHCLYGIQY 243
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
GDGS++ G A +T+ + A+ G FGCG N GLF +T G++GLG G S+
Sbjct: 244 GDGSYTVGFFAKDTLAVAQD-----AIKGFKFGCGEKNRGLFG-QTAGLLGLGRGPTSIT 297
Query: 233 SQMRTTIAGKFSYCLVPVSSTKINF----GTNGIVSGPGVVSTPL--TKAKTFYVLTIDA 286
Q G FSYCL P SS + + SG +TP+ K TFY + +
Sbjct: 298 VQAYEKYGGSFSYCL-PASSAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTG 356
Query: 287 ISVGNQRLGV------STPDIVIDSGTTLTFLPQ--GYNSNLLSVMSSMIEAQPVADPTG 338
I VG ++LG S ++DSGT +T LP + + A
Sbjct: 357 IRVGGKQLGAIPESVFSNSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYS 416
Query: 339 SLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPI 393
L+ CY F LSQV P V++ F+ GA + L S +S+ VC F G SV I
Sbjct: 417 ILDTCYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFASNGDDESVGI 476
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN Q + V YD+ ++ V F P C
Sbjct: 477 VGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 146/429 (34%), Positives = 204/429 (47%), Gaps = 48/429 (11%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
+ + +G +V L HR P SP + + P L D L R R + + S K Q
Sbjct: 50 VRSSSGATTVPLHHRHGPCSPL-PTKKMP--SLEDRLHRDQLRAAYIKRKFSGDVKKDGQ 106
Query: 82 AD--------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
+P N YLI + +G+P + + D+GSD+ W QC+PC Q
Sbjct: 107 GAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPC--LQ 164
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLA 183
C+ Q PLFDP +SSTY CSS+ CA L Q S CQY V Y DGS + G +
Sbjct: 165 CHSQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYS 224
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
++T+ LGS T + FGC G FN T G++GLGGG SL SQ T F
Sbjct: 225 SDTLALGSNT-----ISNFQFGCSHVESG-FNDLTDGLMGLGGGAPSLASQTAGTFGTAF 278
Query: 244 SYCLVPVSSTK----INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST- 298
SYCL P S+ + GT+G V P + S+P+ TFY + ++AI VG +L + T
Sbjct: 279 SYCLPPTPSSSGFLTLGAGTSGFVKTPMLRSSPV---PTFYGVRLEAIRVGGTQLSIPTS 335
Query: 299 ---PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--P 353
+V+DSGT +T LP+ S L S + ++ A P ++ C+ F+ S V P
Sbjct: 336 VFSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLP 395
Query: 354 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 411
V + F G V +N + + C F ++ S I GN+ Q F V YD+
Sbjct: 396 SVALVFSGGAVVNLDANGIILGN----CLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGG 451
Query: 412 TVSFKPTDC 420
V FK C
Sbjct: 452 AVGFKAGAC 460
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 143/418 (34%), Positives = 203/418 (48%), Gaps = 34/418 (8%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADI- 84
G ++ L+HR P SP + + ++ RD L R+ N + + S+ + Q+ +
Sbjct: 58 GATLPLVHRHGPCSPVMSKEKPSHEETLGRDQL-RAANIHAKLSSPRNSSAKELQQSGVT 116
Query: 85 IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP ++ Y+I +S+GTP ++ DTGSD+ W QC PC C Q LFDP
Sbjct: 117 IPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDP 176
Query: 138 KMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
S+TY + CSS+QCA L + C +CQY V Y D S + G ++ TLG TT
Sbjct: 177 AKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSD--TLGLTTSD 234
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
AV FGC G F + G++GLGG SL+SQ T FSYCL P SS+
Sbjct: 235 AVK--NFQFGCSHRANG-FVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAG 291
Query: 256 NFGTNGIVSGPGVVS----TPLTK--AKTFYVLTIDAISVGNQRLGVSTPDI----VIDS 305
F T G +G S TPL + TFY + + AI+V +L V V+DS
Sbjct: 292 GFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVDS 351
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGA 362
GT +T LP L + ++A P A P G L+ C+ F+ + +VP VT+ F RGA
Sbjct: 352 GTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRGA 411
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ L S F + G T I GN+ Q F + +D+ T+ F+P C
Sbjct: 412 VMDLDVSGIFYAGCLAFTATAQDGDTG---ILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 135/416 (32%), Positives = 197/416 (47%), Gaps = 31/416 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADII 85
G ++ L HR P SP + + ++ RD L + + ++ ++++ A I
Sbjct: 57 GSTLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTI 116
Query: 86 PNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P ++ Y+I ++IGTP ++ DTGSD+ W QC PC C Q LFDP
Sbjct: 117 PTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPA 176
Query: 139 MSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
MS+TY + C S+QCA L + C CQY V YGDGS + G ++T++L S+
Sbjct: 177 MSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSD--- 233
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
A+ FGC G F + G++GLGG SL+SQ T FSYCL P SS+
Sbjct: 234 -AVKSFQFGCSHRAAG-FVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGG 291
Query: 257 F---GTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGT 307
F G G S TP+ + TFY + + I+V L V V+DSGT
Sbjct: 292 FLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASVVDSGT 351
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
+T LP L + ++A P A P GSL+ C+ F+ + VP VT+ F RGA +
Sbjct: 352 VITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAM 411
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L S + G T I GN+ Q F + +D+ +T+ F+ C
Sbjct: 412 DLDISGILYAGCLAFTATAHDGDTG---ILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 136/384 (35%), Positives = 206/384 (53%), Gaps = 33/384 (8%)
Query: 57 ALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDL 114
A RS RL+ +S+ ++Q+ + ++ Y + S+GTPP A+ADTGSDL
Sbjct: 45 AAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMTFSMGTPPQTLSALADTGSDL 104
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVN-----C 166
IW +C C +C + S + P SS++ LPCSS+ C +L +S C G C
Sbjct: 105 IWAKCGAC--KRCAPRGSASYYPTKSSSFSKLPCSSALCRTLESQSLATCGGTRARGAVC 162
Query: 167 QYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
Y SYG S ++ G + +ET TLGS A+ GI FGC T +G+V
Sbjct: 163 SYRYSYGLSSNPHHYTQGYMGSETFTLGSD-----AVQGIGFGC-TTMSEGGYGSGSGLV 216
Query: 223 GLGGGDISLISQMRTTIAGKFSYCLV--PVSSTKINFGTNGIVSGPGVVSTPLT--KAKT 278
GLG G +SL+ Q++ G FSYCL P +S+ + FG G ++GPGV STPL K T
Sbjct: 217 GLGRGKLSLVRQLK---VGAFSYCLTSDPSTSSPLLFGA-GALTGPGVQSTPLVNLKTST 272
Query: 279 FYVLTIDAISVGNQRL-GVSTPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADP 336
FY + +D+IS+G + G I+ DSGTTLTFL + Y ++S V
Sbjct: 273 FYTVNLDSISIGAAKTPGTGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGT 332
Query: 337 TGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 396
G E+C+ + + P + +HF G D+ L N+F V++ + C + + + + I GN
Sbjct: 333 DG-YEVCFQTSGGAVFPSMVLHFDGGDMALKTENYFGAVNDSVSCWLVQKSPSEMSIVGN 391
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
IMQ ++ + YD+++ +SF+PT+C
Sbjct: 392 IMQMDYHIRYDLDKSVLSFQPTNC 415
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 134/386 (34%), Positives = 191/386 (49%), Gaps = 28/386 (7%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVADTG 111
RD L R H + NSS + +P Y + + +GTP + + DTG
Sbjct: 94 RDQLRVKSIRAKH-SMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTG 152
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----CQ 167
SDL WTQCEPC C+ Q+ FDP S++YK+L CSS C S+ ++S G + C
Sbjct: 153 SDLTWTQCEPCS-GGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCL 211
Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
Y V YG G ++ G LATET+T+ + GCG NGG F S T G++GLG
Sbjct: 212 YGVKYGTG-YTVGFLATETLTITPSD----VFENFVIGCGERNGGRF-SGTAGLLGLGRS 265
Query: 228 DISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI 287
++L SQ +T FSYCL SS+ + G VS + +K Y L + I
Sbjct: 266 PVALPSQTSSTYKNLFSYCLPASSSSTGHLSFGGGVSQAAKFTPITSKIPELYGLDVSGI 325
Query: 288 SVGNQRLGVS-----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
SVG ++L + T +IDSGTTLT+LP +S L S M+ + T L+
Sbjct: 326 SVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQP 385
Query: 343 CYSFNSLSQ----VPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVFK--GITNSVPIY 394
CY F+ + +P+++I F G +V + S F+ + + VC FK G V I+
Sbjct: 386 CYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIF 445
Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN+ Q + V YD+ + V F P C
Sbjct: 446 GNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 126/354 (35%), Positives = 182/354 (51%), Gaps = 33/354 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI +GTP E V DTGSD+ W QC PC S+CY Q P+FDP SST+KSL
Sbjct: 161 SGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPC--SECYQQSDPIFDPTSSSTFKSLT 218
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CS +CASL+ +C C Y VSYGDGSF+ GN AT+TVT G++ + + GCG
Sbjct: 219 CSDPKCASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTF----GESGKVNDVALGCG 274
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
+N GLF + GG +S+ +Q++ A FSYCLV S K ++F N +
Sbjct: 275 HDNEGLFTGAAGLLGLGGGA-LSMTNQIK---AKSFSYCLVDRDSAKSSSLDF--NSVQI 328
Query: 265 GPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTF 311
G G + PL +K TFY + + SVG Q++ + + +++D GT +T
Sbjct: 329 GAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTR 388
Query: 312 LP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLS 367
L Q YNS + + + + P + CY F+SLS +VP VT HF G + L
Sbjct: 389 LQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLP 448
Query: 368 RSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + + + C F ++S+ I GN+ Q + YD+ + C
Sbjct: 449 AKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 149/424 (35%), Positives = 210/424 (49%), Gaps = 48/424 (11%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
G +V L HR P SP + + P L + L R R + + S + D+
Sbjct: 56 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110
Query: 85 ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+P N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
LFDP SSTY C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
S+ A+ FGC G FN +T G++GLGGG SL+SQ T+ FSYCL P
Sbjct: 229 SS-----AVKSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPT 282
Query: 251 SSTK--INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDI 301
S+ + G G G V TP+ ++ TFY + + AI VG ++L + +
Sbjct: 283 PSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGT 342
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
V+DSGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402
Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L S + C F ++ S+ I GN+ Q F V YD+ + V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457
Query: 417 PTDC 420
C
Sbjct: 458 AGAC 461
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 132/368 (35%), Positives = 190/368 (51%), Gaps = 42/368 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + I +G+PP + A+ DTGSDL+W QC+PC SQCY Q P++DP SST+ CS+
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPC--SQCYSQSDPIYDPSASSTFAKTSCST 61
Query: 151 SQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S C SL CS C Y YGD S + G+ A ET+TL S+ G + A P FGCG
Sbjct: 62 SSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGR 121
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIV 263
N G F GIVGLG G ISL +Q+ + I KFSYCLV ++ + FG++
Sbjct: 122 LNSGSFGG-AAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSA-S 179
Query: 264 SGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI------------------- 301
+G G +STP+ + T+Y + ++ ISVG ++L ++T I
Sbjct: 180 TGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVN 239
Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
+ DSGTTLT L S + S +S + V + +LCY + + P +
Sbjct: 240 SGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKFPAL 299
Query: 356 TIHFRGADVKLSRSNFFVKV--SEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
T+ F+G + N+FV V +E + C ++ + + I GN+MQ N+ V YD T
Sbjct: 300 TLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRGTST 359
Query: 413 VSFKPTDC 420
+S P C
Sbjct: 360 ISMSPAQC 367
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 132/350 (37%), Positives = 189/350 (54%), Gaps = 30/350 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +GTP T V DTGS L W QC PC S C+ Q PL+DP+ SSTY ++PCS
Sbjct: 133 NYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLYDPRASSTYATVPCS 191
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+SQC A+LN +CS N C Y SYGD SFS G L+ +TV+ GS + P
Sbjct: 192 ASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS-----YPNFY 246
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL-VPVSSTKINFG--TN 260
+GCG +N GLF ++ G++GL +SL+ Q+ ++ FSYCL P S+ ++ G T+
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIGPYTS 305
Query: 261 GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
G S + S+ L + Y +T+ +SVG L VS + +IDSGT +T LP
Sbjct: 306 GHYSYTPMASSSLD--ASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLPTA 363
Query: 316 -YNSNLLSVMSSMIEAQPVADPTGS-LELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNF 371
Y + +V ++M+ Q + P S L+ C+ S +VP V + F GA +KL+ N
Sbjct: 364 VYTALSKAVAAAMVGVQ--SAPAFSILDTCFQGQASQLRVPAVAMAFAGGATLKLATQNV 421
Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ V + C F T+S I GN Q F V YD+ Q + F C+
Sbjct: 422 LIDVDDSTTCLAFA-PTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 173/344 (50%), Gaps = 17/344 (4%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY+I + GTP + V DTGSD+ W QC+PC +CY Q PLFDP +SSTY+++
Sbjct: 13 SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCA-VRCYAQQEPLFDPSLSSTYRNVS 71
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C+ C L+ + CS C Y V YGDGS + G LA +T L A FGCG
Sbjct: 72 CTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFML----TPAQKFKNFIFGCG 127
Query: 208 TNNGGLFNSKTTGIVGLG-GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
NN GLF T G+VGLG SL SQ+ ++ FSYCL SS + P
Sbjct: 128 QNNTGLFQG-TAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNPQNTP 186
Query: 267 GVVSTPL-TKAKTFYVLTIDAISVGNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNL 320
G + T+ T Y + + ISVG RL +S+ +IDSGT +T LP S L
Sbjct: 187 GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRLPPTAYSAL 246
Query: 321 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED 378
+ + + + +A L+ CY F+ + V P + +HF G DV++ + F +
Sbjct: 247 KTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATGVFFVFNSS 306
Query: 379 IVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VC F G T+S + I GN+ Q V YD E + + F C
Sbjct: 307 QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 149/461 (32%), Positives = 217/461 (47%), Gaps = 55/461 (11%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M+ L+ I L +P T +L H D + T ++RL R
Sbjct: 6 MSELLAYALIFTLLFTAAATPTAGLT--MRADLTHVDKGRG------FTRWERLSRMAVR 57
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQC 119
S R Q + A +P++ YLI +IGTP +R+A+ DTGSDL+WTQC
Sbjct: 58 SRARAASLYQRGG-HYGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQC 116
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---ASLNQKSCS--GVNCQYSVSYGD 174
PCP C+ Q PLFDP +SST++++ C C + L+ +C+ C Y SYGD
Sbjct: 117 TPCP--VCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGD 174
Query: 175 GSFSNGNLATETVTLGSTTGQA---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
S + G + +T T S G+ VA+ G+ FGCG N G+F S +GI G G G +SL
Sbjct: 175 KSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSL 234
Query: 232 ISQMRTTIAGKFSYCLVPVSSTKIN------FGT--NGIV---SGPGVVSTPLTKA---K 277
SQ+R G+FSYCL T+ N GT NG+ SGP STP+ +
Sbjct: 235 PSQLRV---GRFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGP-FRSTPIIHSPSFP 290
Query: 278 TFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
TFY L+++ I+VG RL V + VIDSGT +T P L + +
Sbjct: 291 TFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQ 350
Query: 328 IEAQPVADPTGSLE--LCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSED-IVC 381
+ P D T + LC+ + VP++ H AD+ L R N+ + ++ ++C
Sbjct: 351 LPL-PRYDNTSEVGNLLCFQRPKGGKQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMC 409
Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ G + + GN Q N + YD+E + F C K
Sbjct: 410 LMINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQCDK 450
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 140/423 (33%), Positives = 208/423 (49%), Gaps = 38/423 (8%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA----- 82
GF+ LIH DSP SPFYN + T R+ + RS +RLN+ + +S +
Sbjct: 7 GFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSLSP 66
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS- 141
++ YL+ +IG P ++ + DT + LIW QC C SQC + L +SS
Sbjct: 67 TLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNC-NSQCEPEKRGLTTKFLSSK 125
Query: 142 --TYKSLPCSSSQCASLNQ-KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
TY+ PC S+ C SL ++C+ + C+Y + YGD ++G L++++ ++ G
Sbjct: 126 SFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGML 185
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SS 252
V + + FGC TG VGL +SLISQ+ KFSYCLVP S+
Sbjct: 186 VDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIK---KFSYCLVPFNNLGST 242
Query: 253 TKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS--------TPDIV 302
+K+ FG+ + SG TPL + +YV + IS+GN +
Sbjct: 243 SKMYFGSLPVTSGG---QTPLLYPNSDAYYVKVL-GISIGNDEPHFDGVFDVYEVRDGWI 298
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF---NSLSQVPEVTIH 358
ID+G T + L +LL+ ++ + Q DP ELC+ N L P+VT+H
Sbjct: 299 IDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVH 358
Query: 359 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
F GAD+ L+ + FVK+ +D I C + V I GN N+ VGYD+E Q +SF P
Sbjct: 359 FDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAP 418
Query: 418 TDC 420
DC
Sbjct: 419 VDC 421
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 134/355 (37%), Positives = 175/355 (49%), Gaps = 32/355 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ I +GTP V DTGSD W QCEPC CY Q LFDP SST ++
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYEQQEKLFDPARSSTDANI 240
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L K CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 241 SCAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AIKGFRFGC 296
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF + G++GLG G SL Q G F++C SS GT + GP
Sbjct: 297 GERNEGLFG-EAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSS-----GTGYLDFGP 350
Query: 267 G---VVSTPLT------KAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
G VST LT TFY + + I VG + L + +T ++DSGT +T L
Sbjct: 351 GSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRL 410
Query: 313 PQGYNSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVK 365
P S+L S +S I A+ A L+ CY F +SQV P V++ F+G DV
Sbjct: 411 PPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVD 470
Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S + VS+ + + V I GN F V YDI ++ V F P C
Sbjct: 471 ASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 141/422 (33%), Positives = 202/422 (47%), Gaps = 41/422 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETP--YQRLRDALTR--SLNRLNHFNQNSSISSSKASQADII 85
++ ++HR P SP P + L D R S++R + + ++ + +
Sbjct: 74 ALNVVHRQGPCSPLQARGAPPPHAELLNDDQARVDSIHRKIAAAASPVLDQARGKKGVTL 133
Query: 86 P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P NY++ + +GTP + V DTGSDL W QC PC S CY Q PLFDP
Sbjct: 134 PAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPC--SDCYEQKDPLFDPA 191
Query: 139 MSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
SSTY ++PC+S +C L+ +SCS C+Y V YGD S ++G LA +T+TL Q+
Sbjct: 192 RSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTL----TQSD 247
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
LPG FGCG + GLF + G+VGLG +SL SQ + FSYCL P S + +
Sbjct: 248 VLPGFVFGCGEQDTGLFG-RADGLVGLGREKVSLSSQAASKYGAGFSYCL-PSSPSAAGY 305
Query: 258 GTNGIVSGPGVVSTPLTKAKT------FYVLTIDAISVGNQRLGV-----STPDIVIDSG 306
+ G GP + T +T FY + + + V + + V S VIDSG
Sbjct: 306 LSLG---GPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSG 362
Query: 307 TTLTFLPQGYNSNLLSVMS-SMIEAQPVADPTGS-LELCYSF--NSLSQVPEVTIHFR-G 361
T +T LP + L S + SM P S L+ CY F ++ ++P V + F G
Sbjct: 363 TVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGG 422
Query: 362 ADVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
A V L S + KVS+ + G I GN Q V YD+ +Q + F
Sbjct: 423 AAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANG 482
Query: 420 CT 421
C+
Sbjct: 483 CS 484
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 180/355 (50%), Gaps = 28/355 (7%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y IR+S+GTPP V DTGSD++W QC PC CY Q +FDP SSTY +L C
Sbjct: 35 GEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPC--VSCYHQCDEVFDPYKSSTYSTLGC 92
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
+S QC +L+ C G C Y V YGDGSFS G AT+ V+L ST+G V L I GCG
Sbjct: 93 NSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCG 152
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKINFGTNGI 262
+N G F ++GLG G +S +Q+ + G+FSYCL + + FG +
Sbjct: 153 HDNEGYFVGAAG-LLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSSLIFG-DAA 210
Query: 263 VSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTL 309
V GV TP + TFY L + ISVG L + T ++IDSGT++
Sbjct: 211 VPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSV 270
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKL 366
T L ++L + + + CY+ + LS VP VT+HF+ GAD+KL
Sbjct: 271 TRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADLKL 330
Query: 367 SRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
SN+ V V + C F G T I GNI Q F V YD V F P+ C
Sbjct: 331 PASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQVGFVPSQC 384
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 135/351 (38%), Positives = 186/351 (52%), Gaps = 30/351 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q PLFDP SSTY
Sbjct: 48 NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADPLFDPSSSSTYSPF 105
Query: 147 PCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LGS+ A+
Sbjct: 106 SCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS-----AVRSFQ 160
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNG 261
FGC G FN +T G++GLGGG SL+SQ T+ FSYCL P SS + G G
Sbjct: 161 FGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAG 219
Query: 262 IVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ 314
G V TP+ ++ TFY + + AI VG ++L + + V+DSGT +T LP
Sbjct: 220 GSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPP 279
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
S L S + ++ P A P+G L+ C+ F+ S V P V + F GA V L S
Sbjct: 280 TAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGI 339
Query: 372 FVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ C F G ++ S+ I GN+ Q F V YD+ + V F+ C
Sbjct: 340 ILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 141/417 (33%), Positives = 199/417 (47%), Gaps = 41/417 (9%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
+++L H DS + ++TP L R R++ N ++ SS + +
Sbjct: 54 LTLDLHHLDS-----LSLNKTPTDLFNLRLHRDTLRVHALNSRAAGFSSSVVSG-LSQGS 107
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y R+ +GTPP V DTGSD++W QC PC +CY Q P+F+P S ++ +PC
Sbjct: 108 GEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPC--RKCYSQSDPIFNPYKSKSFAGIPC 165
Query: 149 SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
SS C L+ CS C Y VSYGDGSF+ G+ ATET+T VAL GC
Sbjct: 166 SSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVAL-----GC 220
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G +N GLF ++GLG G +S SQ KFSYCLV S++ + +V G
Sbjct: 221 GHHNEGLFVGAAG-LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASS---KPSSMVFGD 276
Query: 267 GVVS-----TPLT---KAKTFYVLTIDAISVGNQRLGVSTPD-----------IVIDSGT 307
+S TPL K TFY + + ISVG R+ +P ++IDSGT
Sbjct: 277 AAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGT 336
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVK 365
++T L + + L + CY + S +VP V +HFRGAD+
Sbjct: 337 SVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRGADMA 396
Query: 366 LSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
L +N+ + V E+ C F G + + I GNI Q F V YD+ + F P CT
Sbjct: 397 LPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 453
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 141/438 (32%), Positives = 219/438 (50%), Gaps = 53/438 (12%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD--- 83
G S+ELIHR+S T Q L + L R R+ + ++ K +A
Sbjct: 54 GTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEASSTD 113
Query: 84 --------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
++ + Y +R+ +GTP V DTGSDL W QC+PC CY Q P+F
Sbjct: 114 LNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIF 171
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLG 190
DP+ SS+++ +PC S C +L SCSG C Y V+YGDGSFS G+ +++ TLG
Sbjct: 172 DPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG 231
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-----RTTIAGKFSY 245
T +A++ + FGCG +N GL + G++GLG G +S SQ+ ++ A FSY
Sbjct: 232 -TGSKAMS---VAFGCGFDNEGL-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSY 286
Query: 246 CLV----PV--SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV 296
CLV P+ SS+ + FG I S + +PL K TFY + +SVG +L +
Sbjct: 287 CLVDRSNPMTRSSSSLIFGAAAIPSTAAL--SPLLKNPKLDTFYYAAMIGVSVGGAQLPI 344
Query: 297 STPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
S ++IDSGT++T P + + + P A + CY+F
Sbjct: 345 SLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNF 404
Query: 347 NSLS--QVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNF 402
+ + VP + +HF GAD++L +N+ + + + C F + + I GNI Q +F
Sbjct: 405 SGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSF 464
Query: 403 LVGYDIEQQTVSFKPTDC 420
+G+D+++ ++F P C
Sbjct: 465 RIGFDLQKSHLAFAPQQC 482
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 141/432 (32%), Positives = 207/432 (47%), Gaps = 49/432 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
SV L+HR P +P S P +RLR R+ + + ++ S A
Sbjct: 18 SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 77
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP N+ Y++ + IGTP ++ + DTGSDL W QC+PC +CY Q PLFDP
Sbjct: 78 IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 137
Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
SS+Y S+PC S C L + C+GV+ C+Y + YG+ + + G +TET+
Sbjct: 138 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 197
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
TL V + FGCG + G + K G++GLGG SL+SQ + G FSYCL
Sbjct: 198 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 252
Query: 248 VPVSSTKINFGTNGI-------VSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS 297
P S F T G + G+ TP+ + TFY++T+ ISVG L +
Sbjct: 253 PPTSG-GAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 311
Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQ 351
+ +VIDSGT +T LP + L S S + + P+ G L+ CY F +
Sbjct: 312 PSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHAN 371
Query: 352 --VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
VP +++ F GA + L+ + + + G N++ I GN+ Q F V YD
Sbjct: 372 VTVPTISLTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 429
Query: 409 EQQTVSFKPTDC 420
+ TV F+ C
Sbjct: 430 GKGTVGFRAGAC 441
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 144/415 (34%), Positives = 203/415 (48%), Gaps = 59/415 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
F+VE + R K P YN +T YQ + LT + S ASQ +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y RI +GTP E V DTGSD+ W QCEPC + CY Q P+F+P SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S+ QC+ L +C C Y VSYGDGSF+ G LAT+TVT G++ + + GCG
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSG 265
+N GLF + GG +S+ +QM+ T FSYCLV S K ++F N + G
Sbjct: 274 DNEGLFTGAAGLLGLGGGV-LSITNQMKAT---SFSYCLVDRDSGKSSSLDF--NSVQLG 327
Query: 266 PGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLT 310
G + PL + K TFY + + SVG ++ V PD +++D GT +T
Sbjct: 328 GGDATAPLLRNKKIDTFYYVGLSGFSVGGEK--VVLPDAIFDVDASGSGGVILDCGTAVT 385
Query: 311 FLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKL 366
L Q YNS + + + + + + CY F+SLS +VP V HF G + L
Sbjct: 386 RLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDL 445
Query: 367 SRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V + C F ++S+ I GN+ Q + YD+ + + C
Sbjct: 446 PAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 129/423 (30%), Positives = 201/423 (47%), Gaps = 39/423 (9%)
Query: 27 GGFSVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-- 81
G + ++L+HRD + Y+ S + R++ R + + + SS +
Sbjct: 69 GKWKLKLVHRDKITAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFG 128
Query: 82 ADII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
A+++ + Y IRI +G+PP E+ V D+GSD++W QC+PC +QCY Q P+FDP
Sbjct: 129 AEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPC--TQCYHQTDPVFDP 186
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S+++ +PCSSS C + C C+Y V YGDGS++ G LA ET+T G T + V
Sbjct: 187 ADSASFMGVPCSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNV 246
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTK 254
A+ GCG N G+F ++GLGGG +SL+ Q+ G FSYCLV S+
Sbjct: 247 AI-----GCGHRNRGMFVGAAG-LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGS 300
Query: 255 INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DI 301
+ FG + G + PL +A +FY + + + VG ++ +S +
Sbjct: 301 LEFGRGAMPVGAAWI--PLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGV 358
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHF 359
V+D+GT +T +P P A + CY+ N +VP V+ +F
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYF 418
Query: 360 RGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
G + L NF + V + C F + + I GNI Q + +D V F P
Sbjct: 419 AGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGP 478
Query: 418 TDC 420
C
Sbjct: 479 NVC 481
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 141/432 (32%), Positives = 207/432 (47%), Gaps = 49/432 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
SV L+HR P +P S P +RLR R+ + + ++ S A
Sbjct: 98 SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 157
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP N+ Y++ + IGTP ++ + DTGSDL W QC+PC +CY Q PLFDP
Sbjct: 158 IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 217
Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
SS+Y S+PC S C L + C+GV+ C+Y + YG+ + + G +TET+
Sbjct: 218 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 277
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
TL V + FGCG + G + K G++GLGG SL+SQ + G FSYCL
Sbjct: 278 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 332
Query: 248 VPVSSTKINFGTNGI-------VSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS 297
P S F T G + G+ TP+ + TFY++T+ ISVG L +
Sbjct: 333 PPTSG-GAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 391
Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQ 351
+ +VIDSGT +T LP + L S S + + P+ G L+ CY F +
Sbjct: 392 PSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHAN 451
Query: 352 --VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
VP +++ F GA + L+ + + + G N++ I GN+ Q F V YD
Sbjct: 452 VTVPTISLTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 509
Query: 409 EQQTVSFKPTDC 420
+ TV F+ C
Sbjct: 510 GKGTVGFRAGAC 521
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 183/354 (51%), Gaps = 32/354 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ +G+PP + DTGS L W QC+PC C+ Q PLF+P S+TY+ L
Sbjct: 117 SGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPC-VVYCHSQVDPLFEPSASNTYRPLY 175
Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CSSS+C A+LN C SGV C Y+ SYGD S+S G L+ + +TL T Q LP
Sbjct: 176 CSSSECSLLKAATLNDPLCTASGV-CVYTASYGDASYSMGYLSRDLLTL--TPSQ--TLP 230
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
T+GCG +N GLF K GIVGL +S+++Q+ FSYCL +S+ F +
Sbjct: 231 SFTYGCGQDNEGLFG-KAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSI 289
Query: 261 GIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLP 313
G +S TP+ + + Y L + AI+V + +GV+ +IDSGT +T LP
Sbjct: 290 GKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLP 349
Query: 314 ----QGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADVKL 366
+ +MS E P L+ C+ S S+S PE+ + F+ GAD+ L
Sbjct: 350 ISIYAALREAFVKIMSRRYEQAPAYS---ILDTCFKGSLKSMSGAPEIRMIFQGGADLSL 406
Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N ++ + I C F +N + I GN Q + + YD+ + F P C
Sbjct: 407 RAPNILIEADKGIACLAFAS-SNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/363 (35%), Positives = 179/363 (49%), Gaps = 40/363 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFG 258
FGCG N G+F S TGI G G G +SL SQ++ G FS+C V+ K ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLP 252
Query: 259 TNGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDS 305
+ SG G V STPL + TFY L++ I+VG+ RL V T +IDS
Sbjct: 253 ADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDS 312
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGAD 363
GT +T LP + ++ ++ V+ T C S + VP++ +HF GA
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGAT 372
Query: 364 VKLSRSNFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ L R N+ +V + I+C ++ +G V GN Q N V YD++ +SF P
Sbjct: 373 MDLPRENYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430
Query: 420 CTK 422
C K
Sbjct: 431 CDK 433
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/363 (35%), Positives = 179/363 (49%), Gaps = 40/363 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFG 258
FGCG N G+F S TGI G G G +SL SQ++ G FS+C V+ K ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLP 252
Query: 259 TNGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDS 305
+ SG G V STPL + TFY L++ I+VG+ RL V T +IDS
Sbjct: 253 ADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGTIIDS 312
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGAD 363
GT +T LP + ++ ++ V+ T C S + VP++ +HF GA
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGAT 372
Query: 364 VKLSRSNFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ L R N+ +V + I+C ++ +G V GN Q N V YD++ +SF P
Sbjct: 373 MDLPRENYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430
Query: 420 CTK 422
C K
Sbjct: 431 CDK 433
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 146/448 (32%), Positives = 219/448 (48%), Gaps = 67/448 (14%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA FL V+IL L + +S + G +EL H D Y +E R+R A R
Sbjct: 1 MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50
Query: 61 SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
S R+N F S + + ++A + + A YL+ I+IGTPP AV D
Sbjct: 51 SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110
Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
TGSDLIWTQC+ PC +C+ Q +PL+ P S+TY ++ C S C +L CS
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y SYGDG+ ++G LATET TLGS T A+ G+ FGCGT N G ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTI 284
G G +SL+SQ+ T + C ++ T ++PL
Sbjct: 224 GRGPLSLVSQLGVTRPRR--SCRARAAARGGGAPTT---------TSPL----------- 261
Query: 285 DAISVGNQRLGVS------TP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
+ I+VG+ L + TP ++IDSGTT T L + L ++S + +
Sbjct: 262 EGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRVRLPLAS 321
Query: 335 DPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
L LC++ S +VP + +HF GAD++L R ++ V+ V + +
Sbjct: 322 GAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMS 381
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ G++ Q N + YD+E+ +SF+P C
Sbjct: 382 VLGSMQQQNTHILYDLERGILSFEPAKC 409
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 136/370 (36%), Positives = 189/370 (51%), Gaps = 51/370 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PC C+ Q P FD SST LPC
Sbjct: 34 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPC--VSCFDQPLPYFDTSRSSTNALLPCE 91
Query: 150 SSQC---------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+QC LNQ + C Y SYGD S + G LA + T + T +LP
Sbjct: 92 STQCKLDPTVTVCVKLNQTVQT---CAYYTSYGDNSVTIGLLAADKFTFVAGT----SLP 144
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
G+TFGCG NN G+FNS TGI G G G +SL SQ++ G FS+C + S+ +
Sbjct: 145 GVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLL 201
Query: 256 NFGTNGIVSGPGVV-STPLTK-AK-----TFYVLTIDAISVGNQRLGV---------STP 299
+ + +G G V +TPL + AK T Y L++ I+VG+ RL V T
Sbjct: 202 DLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTG 261
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 357
+IDSGT++T LP + ++ I+ V C+S S ++ VP++ +
Sbjct: 262 GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVL 321
Query: 358 HFRGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
HF GA + L R N+ +V +D I+C ++ KG + I GN Q N V YD++
Sbjct: 322 HFEGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNM 379
Query: 413 VSFKPTDCTK 422
+SF C K
Sbjct: 380 LSFVAAQCDK 389
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 141/422 (33%), Positives = 197/422 (46%), Gaps = 41/422 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-----SSISSSKASQADI 84
S+E+IHR P +++ T + L + +R++ + S+ + S+A
Sbjct: 62 SLEVIHRHGPCGDEVSNAPTA----AEMLVKDQSRVDFIHSKIAGELESVDRLRGSKATK 117
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP + NY++ + +GTP + DTGSDL WTQC+PC CY Q P+F P
Sbjct: 118 IPAKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPC-ARYCYNQKDPVFVP 176
Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
S+TY ++ CSS C+ L NQ CS C Y + YGD SFS G A ET+TL S
Sbjct: 177 SQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTS 236
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
T + FGCG NN GLF S G++GLG IS++ Q FSYCL S
Sbjct: 237 TD----VIENFLFGCGQNNRGLFGS-AAGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTS 291
Query: 252 STKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV-----STPDIVI 303
S+ G G + TP+TKA FY + I + VG ++ + ST +I
Sbjct: 292 SSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAII 351
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG 361
DSGT +T LP S L S + P A L+ CY + S Q+P+V F+G
Sbjct: 352 DSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKG 411
Query: 362 A-DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
++ L S VC F G + +V I GN+ Q V YD+ + F
Sbjct: 412 GEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYN 471
Query: 419 DC 420
C
Sbjct: 472 GC 473
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 129/356 (36%), Positives = 179/356 (50%), Gaps = 37/356 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI +GTP E V DTGSD+ W QCEPC S CY Q P+F+P SSTYKSL
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--SDCYQQSDPVFNPTSSSTYKSLT 216
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CS+ QC+ L +C C Y VSYGDGSF+ G LAT+TVT G++ + + GCG
Sbjct: 217 CSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINDVALGCG 272
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
+N GLF + GG +S+ +QM+ T FSYCLV S K ++F N +
Sbjct: 273 HDNEGLFTGAAGLLGLGGGA-LSITNQMKAT---SFSYCLVDRDSGKSSSLDF--NSVQL 326
Query: 265 GPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTL 309
G G + PL K TFY + + SVG Q+ V PD +++D GT +
Sbjct: 327 GSGDATAPLLRNQKIDTFYYVGLSGFSVGGQK--VMMPDAIFDVDASGSGGVILDCGTAV 384
Query: 310 TFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VK 365
T L Q YNS + + + + CY F+SLS +VP V HF G +
Sbjct: 385 TRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLD 444
Query: 366 LSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N+ + V ++ C F ++S+ I GN+ Q + YD+ + + C
Sbjct: 445 LPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 147/402 (36%), Positives = 200/402 (49%), Gaps = 37/402 (9%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+I RD + E+ Y +L S N N ++ + S+ +++ I + NY
Sbjct: 87 EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ I IGTP + V DTGSDL WTQCEPC S CY Q P F+P SSTY+++ CSS
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
C + +SCS NC YS+ YGD SF+ G LA E TL ++ L + FGCG NN
Sbjct: 192 MCE--DAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGV 268
GLF+ ++GLG G +SL +Q TT FSYCL +S + FG+ GI V
Sbjct: 246 GLFDGVAG-LLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI--SESV 302
Query: 269 VSTPLTKAKTFYVLTID--AISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLL 321
TP++ + + ID ISVG++ L + ST +IDSGT T LP + L
Sbjct: 303 KFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELR 362
Query: 322 SVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVSED 378
SV + + G + CY F L V TI F G V+L S + +
Sbjct: 363 SVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPIKIS 422
Query: 379 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VC F G + I+GN+ QT V YD+ V F P C
Sbjct: 423 QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 140/450 (31%), Positives = 219/450 (48%), Gaps = 74/450 (16%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------------- 73
G V L H D+ + + + Q L+ A RS +R++ ++
Sbjct: 44 GLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAG 97
Query: 74 -ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
S K Q + N +L+ +S+GTP A+ DTGSDL+WTQC+PC +C+ Q +
Sbjct: 98 DGSGGKDLQVPVHAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPC--VECFNQTT 155
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ--------YSVSYGDGSFSNGNLAT 184
P+FDP SSTY +LPCSS+ CA L +C+ + Y+ +YGD S + G LAT
Sbjct: 156 PVFDPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLAT 215
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
ET TL +PG+ FGCG N G ++ G+VGLG G +SL+SQ+ +FS
Sbjct: 216 ETFTLARQK-----VPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGID---RFS 267
Query: 245 YCLV---------PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQ 292
YCL P+ + + P +TPL K +FY +++ ++VG+
Sbjct: 268 YCLTSLDDAAGRSPLLLGSAAGISASAATAP-AQTTPLVKNPSQPSFYYVSLTGLTVGST 326
Query: 293 RLGV----------STPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SL 340
RL + T +++DSGT++T+L + Y + + ++ M + P D + L
Sbjct: 327 RLALPSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHM--SLPTVDASEIGL 384
Query: 341 ELCYSFNSLS-------QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
+LC+ + + QVP++ +HF GAD+ L N+ V S + + +
Sbjct: 385 DLCFQGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLS 444
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
I GN Q NF YD+ T+SF P +C K
Sbjct: 445 IIGNFQQQNFQFVYDVAGDTLSFAPAECNK 474
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 143/415 (34%), Positives = 203/415 (48%), Gaps = 59/415 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
F+VE + R K P YN +T YQ + LT + S ASQ +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y RI +GTP + V DTGSD+ W QCEPC + CY Q P+F+P SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S+ QC+ L +C C Y VSYGDGSF+ G LAT+TVT G++ + + GCG
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSG 265
+N GLF + GG +S+ +QM+ T FSYCLV S K ++F N + G
Sbjct: 274 DNEGLFTGAAGLLGLGGGV-LSITNQMKAT---SFSYCLVDRDSGKSSSLDF--NSVQLG 327
Query: 266 PGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLT 310
G + PL + K TFY + + SVG ++ V PD +++D GT +T
Sbjct: 328 GGDATAPLLRNKKIDTFYYVGLSGFSVGGEK--VVLPDAIFDVDASGSGGVILDCGTAVT 385
Query: 311 FLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKL 366
L Q YNS + + + + + + CY F+SLS +VP V HF G + L
Sbjct: 386 RLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDL 445
Query: 367 SRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V + C F ++S+ I GN+ Q + YD+ + + C
Sbjct: 446 PAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 145/427 (33%), Positives = 200/427 (46%), Gaps = 39/427 (9%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS- 80
EA G + L H SP + + + L + R RLN +S + S
Sbjct: 64 EALKPGVKIRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSN 123
Query: 81 ---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
Q+ NY++ GTP L + DTGSDL W QC+PC + CY Q +F+P
Sbjct: 124 LPLQSGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPC--ADCYSQVDAIFEP 181
Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
K SS+YK+LPC S+ C L N C C Y ++YGDGS S G+ + ET+TLGS
Sbjct: 182 KQSSSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSD 241
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
+ Q A FGCG N GLF ++G++GLG +S SQ ++ G+F+YCL P
Sbjct: 242 SFQNFA-----FGCGHTNTGLFKG-SSGLLGLGQNSLSFPSQSKSKYGGQFAYCL-PDFG 294
Query: 253 TKINFGTNGIVSG---PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV-----STPDI 301
+ + G+ + G V TPL TFY + ++ ISVG RL +
Sbjct: 295 SSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGST 354
Query: 302 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 358
++DSGT +T LPQ YN+ L + S P A P L+ CY + SQV P +T H
Sbjct: 355 IVDSGTVITRLLPQAYNA-LKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFH 413
Query: 359 FR-GADVKLSRSNFFVKVSE--DIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 413
F+ ADV +S V V VC F + + I GN Q V +D +
Sbjct: 414 FQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRI 473
Query: 414 SFKPTDC 420
F C
Sbjct: 474 GFASGSC 480
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 139/400 (34%), Positives = 199/400 (49%), Gaps = 41/400 (10%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
+ +R RS R +S+ + S + D +P YL+ ++IGTPP DT
Sbjct: 52 ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
GSDL+WTQC+PC + C+ Q P +D SST+ C S+QC L+ VN
Sbjct: 111 GSDLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C +S SYGD S + G L ETV+ + ++PG+ FGCG NN G+F S TGI G G
Sbjct: 168 CAFSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223
Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVV-STPLTK---A 276
G +SL SQ++ G FS+C VS K + + +G G V +TPL K
Sbjct: 224 RGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAH 280
Query: 277 KTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
TFY L++ I+VG+ RL V T +IDSGT T LP + ++
Sbjct: 281 PTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAH 340
Query: 328 IEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSV 383
++ V ++ TG L LC+S L + VP++ +HF GA + L R N+ + + CS+
Sbjct: 341 VKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFEAKDGGNCSI 399
Query: 384 -FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
I + I GN Q N V YD++ +SF C K
Sbjct: 400 CLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDK 439
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 144/427 (33%), Positives = 200/427 (46%), Gaps = 50/427 (11%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
S+ L H D+ +S++TP Q + L R R+ ++++ S A ++
Sbjct: 61 ALSLHLHHIDA-----LSSNKTPEQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFS 115
Query: 84 ------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ + Y RI +GTP V DTGSD++W QC PC +CY Q P+FDP
Sbjct: 116 SSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPC--RKCYTQADPVFDP 173
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
S TY +PC + C L+ C+ N CQY VSYGDGSF+ G+ +TET+T T
Sbjct: 174 TKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVT 233
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
VAL GCG +N GLF ++GLG G +S Q KFSYCLV S++
Sbjct: 234 RVAL-----GCGHDNEGLFIGAAG-LLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASA- 286
Query: 256 NFGTNGIVSGPGVVS-----TPLT---KAKTFYVLTIDAISVGNQ----------RLGVS 297
+ +V G VS TPL K TFY L + ISVG RL +
Sbjct: 287 --KPSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAA 344
Query: 298 -TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 354
++IDSGT++T L + L A + C+ + L++ VP
Sbjct: 345 GNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPT 404
Query: 355 VTIHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
V +HFRGADV L +N+ + V C F G + + I GNI Q F V +D+ V
Sbjct: 405 VVLHFRGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRV 464
Query: 414 SFKPTDC 420
F P C
Sbjct: 465 GFAPRGC 471
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 174/351 (49%), Gaps = 24/351 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ LN CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
G N GLF + G++GLG G SL Q G F++CL P ST ++FG +
Sbjct: 291 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSLA 348
Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
+ ++TP+ TFY + + I VG Q L + +T ++DSGT +T LP
Sbjct: 349 AASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408
Query: 317 NSNL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
S+L + A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 409 YSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGI 468
Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S VC F + V I GN F V YDI ++ V F P C
Sbjct: 469 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 135/417 (32%), Positives = 202/417 (48%), Gaps = 44/417 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADIIPNN 88
S++++H+ P S N L + L +R++ + S S K + A +P
Sbjct: 66 SLKVVHKHGPCSQL-NQQNGNAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAKLPTK 124
Query: 89 A-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ NY++ I +G+P + + + DTGSDL W +C + FDP S+
Sbjct: 125 SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSA----------AETFDPTKST 174
Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+Y ++ CS+ C+S+ N C+ C Y + YGDGS+S G L E +T+GST
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTD--- 231
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-I 255
FGCG + GLF K G++GLG +S++SQ FSYCL SST +
Sbjct: 232 -IFNNFYFGCGQDVDGLFG-KAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFL 289
Query: 256 NFGTNGIVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTL 309
+FG++ S TPL+ +FY L + I+VG Q+L + ST +IDSGT +
Sbjct: 290 SFGSSQSKSAK---FTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVV 346
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVKL 366
T LP S L S + + P+ P L+ CY F+ +VP++ I F G DV +
Sbjct: 347 TRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVDVDV 406
Query: 367 SRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
++ FV VC F G T + I+GN Q NF V YD+ V F P C+
Sbjct: 407 DQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 146/402 (36%), Positives = 201/402 (50%), Gaps = 37/402 (9%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+I RD + E+ Y +L S N N ++ + S+ +++ I + NY
Sbjct: 87 EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ I IGTP + V DTGSDL WTQCEPC S CY Q P F+P SSTY+++ CSS
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
C + +SCS NC YS+ YGD SF+ G LA E TL ++ L + FGCG NN
Sbjct: 192 MCE--DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGV 268
GLF+ ++GLG G +SL +Q TT FSYCL +S + FG+ GI V
Sbjct: 246 GLFDGVAG-LLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI--SESV 302
Query: 269 VSTPLTKAKTFYVLTID--AISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLL 321
TP++ + + ID ISVG++ L + ST +IDSGT T LP + L
Sbjct: 303 KFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELR 362
Query: 322 SVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGAD-VKLSRSNFFVKVSED 378
SV + + G + CY F L V P + F G+ V+L S + +
Sbjct: 363 SVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKIS 422
Query: 379 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VC F G + I+GN+ QT V YD+ V F P C
Sbjct: 423 QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 134/433 (30%), Positives = 208/433 (48%), Gaps = 52/433 (12%)
Query: 23 EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+ + GG + ++++HRD + +S+ RL L R R+ + S +
Sbjct: 64 DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 120
Query: 81 QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
+ D + + Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q
Sbjct: 121 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 178
Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
P+FDP S+++ + CSSS C L C C+Y VSYGDGS++ G LA ET+T G
Sbjct: 179 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 238
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV- 250
T ++VA+ GCG N G+F ++GLGGG +S + Q+ G FSYCLV
Sbjct: 239 TMVRSVAI-----GCGHRNRGMFVGAAG-LLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG 292
Query: 251 --SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP------ 299
SS + FG + +G V PL +A +FY + + + VG R+ +S
Sbjct: 293 TDSSGSLVFGREALPAGAAWV--PLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTE 350
Query: 300 ----DIVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-- 349
+V+D+GT +T LP Q + L+ +++ A VA + CY
Sbjct: 351 LGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVS 406
Query: 350 SQVPEVTIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
+VP V+ +F G + L NF + + + C F T+ + I GNI Q + +D
Sbjct: 407 VRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFD 466
Query: 408 IEQQTVSFKPTDC 420
V F P C
Sbjct: 467 GANGYVGFGPNIC 479
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 173/351 (49%), Gaps = 24/351 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 174 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPVRSSTYANV 232
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ LN CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 233 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 288
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
G N GLF + G++GLG G SL Q G F++CL P ST ++FG
Sbjct: 289 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSPA 346
Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
+ ++TP+ TFY + + I VG Q L + +T ++DSGT +T LP
Sbjct: 347 AASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPA 406
Query: 317 NSNL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
S+L + A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 407 YSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGI 466
Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S VC F + V I GN F V YDI ++ V F P C
Sbjct: 467 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 148/424 (34%), Positives = 199/424 (46%), Gaps = 50/424 (11%)
Query: 31 VELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQAD 83
+ L H+ P +P SS TP + D L R + + S + SKA A
Sbjct: 67 LRLTHKHGPCAPSRASSLATP--SVADTLRADQRRAEYILRRVSGRGTPQLWDSKAEAAT 124
Query: 84 I-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
+P N NY++ +S+GTP + DTGSDL W QC PC CY Q PLF
Sbjct: 125 ATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLF 184
Query: 136 DPKMSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
DP SS+Y ++PC C L SCS C Y VSYGDGS + G +++T+TL
Sbjct: 185 DPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPND 244
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
A+ G FGCG G + G++GLG + SL+ Q T G FSYCL P +
Sbjct: 245 ----AVRGFFFGCGHAQSGFTGND--GLLGLGREEASLVEQTAGTYGGVFSYCL-PTRPS 297
Query: 254 KINFGTNGIVSG---PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----DIVI 303
+ T G SG PG +T L A T+YV+ + ISVG Q+L V + V+
Sbjct: 298 TTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVV 357
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
D+GT +T LP + L S S + + P A TG L+ CY+F+ V P V + F
Sbjct: 358 DTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTF 417
Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L C F G + I GN+ Q +F V I+ +V FK
Sbjct: 418 SGGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 470
Query: 417 PTDC 420
P+ C
Sbjct: 471 PSSC 474
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 131/348 (37%), Positives = 188/348 (54%), Gaps = 29/348 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + S+GTPP + A+ADTGSDLIW +C + C Q SP + P SST+ LPCS
Sbjct: 91 YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150
Query: 151 SQCASLNQKS-----CSGVNCQYSVSYG----DGSFSNGNLATETVTLGSTTGQAVALPG 201
C+ L S +G C Y SYG D ++ G LA ET TLG A A+P
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG-----ADAVPS 205
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS--TKINFGT 259
+ FGC T +G+VGLG G +SL+SQ+ A F YCL +S + + FG+
Sbjct: 206 VRFGC-TTASEGGYGSGSGLVGLGRGPLSLVSQLN---ASTFMYCLTSDASKASPLLFGS 261
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL-GVSTPD-IVIDSGTTLTFLPQGYN 317
++G V ST L + TFY + + +IS+G+ GV P+ +V DSGTTLT+L +
Sbjct: 262 LASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPEGVVFDSGTTLTYLAEPAY 321
Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFN-----SLSQVPEVTIHFRGADVKLSRSNFF 372
S + S V D G E C+ S + VP + +HF GAD+ L +N+
Sbjct: 322 SEAKAAFLSQTSLDQVEDTDG-FEACFQKPANGRLSNAAVPTMVLHFDGADMALPVANYV 380
Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V+V + +VC + + + S+ I GNIMQ N+LV +D+ + +SF+P +C
Sbjct: 381 VEVEDGVVCWIVQR-SPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 130/357 (36%), Positives = 179/357 (50%), Gaps = 35/357 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTPP V DTGSD++W QC+PC ++CY Q +FDP S ++ +P
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPC--TKCYSQTDQIFDPSKSKSFAGIP 184
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C S C L+ CS N CQY VSYGDGSF+ G+ +TET+T + A+P + G
Sbjct: 185 CYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-----RRAAVPRVAIG 239
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG +N GLF ++GLG G +S +Q T KFSYCL +++ + IV G
Sbjct: 240 CGHDNEGLFVGAAG-LLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASA---KPSSIVFG 295
Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
VS TPL K TFY + + ISVG + G+S ++IDSG
Sbjct: 296 DSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSG 355
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
T++T L + +L A + CY + LS+ VP V +HFRGADV
Sbjct: 356 TSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRGADV 415
Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L +N+ V V C F G + + I GNI Q F V +D+ V F P C
Sbjct: 416 SLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 133/421 (31%), Positives = 197/421 (46%), Gaps = 40/421 (9%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
++HR P SP + A L R R++ ++ S + ++AS
Sbjct: 73 VVHRHGPCSPVQARPRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132
Query: 81 ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
Q I NY++ + +GTP + + DTGSDL W QC+PC + CY Q PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
FDP +SSTY ++ C + +C L+ CS C+Y V YGD S ++GNL +T+TL ++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
LPG FGCG N GLF + G+ GLG +SL SQ + F+YCL P SS+
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-PSSSS 304
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGV------STPDIVIDS 305
+ + G T L T FY + + I VG + + + + VIDS
Sbjct: 305 GRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDS 364
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFR-GA 362
GT +T LP + L + + + A L+ CY F + +Q+P V + F GA
Sbjct: 365 GTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGA 424
Query: 363 DVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V L + + KVS+ + +S+ I GN Q F V YD+ Q + F C
Sbjct: 425 TVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484
Query: 421 T 421
+
Sbjct: 485 S 485
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 127/360 (35%), Positives = 178/360 (49%), Gaps = 39/360 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI IG+P + V DTGSD+ W QC PC + CY Q PLFDP +SS+Y ++P
Sbjct: 193 SGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPC--ADCYAQSDPLFDPALSSSYATVP 250
Query: 148 CSSSQCASLNQKSCS------GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C S C +L+ +C +C Y V+YGDGS++ G+ ATET+TLG AV
Sbjct: 251 CDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSAAVH--D 308
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG 258
+ GCG +N GLF ++ LGGG +S SQ+ T +FSYCLV S++ + FG
Sbjct: 309 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQISAT---EFSYCLVDRDSPSASTLQFG 364
Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP-----------DIVID 304
S V+ PL ++ TFY + ++ ISVG + L P +++D
Sbjct: 365 ----ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVD 420
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-G 361
SGT +T L S L +A P A + CY S QVP V++ F G
Sbjct: 421 SGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFEGG 480
Query: 362 ADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++KL N+ + V C F +V I GN+ Q V +D + TV F P C
Sbjct: 481 GELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 177/354 (50%), Gaps = 31/354 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ +G+P + DTGS L W QC+PC C++Q PLFDP S TYKSL
Sbjct: 10 SGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSLS 68
Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C+SSQC A+LN C S C Y+ SYGD S+S G L+ + +TL + LP
Sbjct: 69 CTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLP 124
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
G +GCG ++ GLF + GI+GLG +S++ Q+ + FSYCL
Sbjct: 125 GFVYGCGQDSEGLFG-RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSIGK 183
Query: 261 GIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLP 313
++G TP+T + Y L + AI+VG + LGV+ +IDSGT +T LP
Sbjct: 184 ASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITRLP 243
Query: 314 QG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEVTIHFR-GADVKL 366
+ + +MSS P L+ C+ N + VPEV + F+ GAD+ L
Sbjct: 244 MSVYTPFQQAFVKIMSSKYARAPGFS---ILDTCFKGNLKDMQSVPEVRLIFQGGADLNL 300
Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N ++V E + C F G N V I GN Q F V +DI + F C
Sbjct: 301 RPVNVLLQVDEGLTCLAFAG-NNGVAIIGNHQQQTFKVAHDISTARIGFATGGC 353
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 139/400 (34%), Positives = 198/400 (49%), Gaps = 41/400 (10%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
+ +R RS R +S+ + S + D +P YL+ ++IGTPP DT
Sbjct: 52 ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
GS L+WTQC+PC + C+ Q P +D SST+ C S+QC L+ VN
Sbjct: 111 GSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C YS SYGD S + G L ETV+ + ++PG+ FGCG NN G+F S TGI G G
Sbjct: 168 CAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223
Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVV-STPLTK---A 276
G +SL SQ++ G FS+C VS K + + +G G V +TPL K
Sbjct: 224 RGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAH 280
Query: 277 KTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
TFY L++ I+VG+ RL V T +IDSGT T LP + ++
Sbjct: 281 PTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAH 340
Query: 328 IEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSV 383
++ V ++ TG L LC+S L + VP++ +HF GA + L R N+ + + CS+
Sbjct: 341 VKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFEAKDGGNCSI 399
Query: 384 -FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
I + I GN Q N V YD++ +SF C K
Sbjct: 400 CLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDK 439
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 133/421 (31%), Positives = 197/421 (46%), Gaps = 40/421 (9%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
++HR P SP + A L R R++ ++ S + ++AS
Sbjct: 73 VVHRHGPCSPVQARRRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132
Query: 81 ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
Q I NY++ + +GTP + + DTGSDL W QC+PC + CY Q PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
FDP +SSTY ++ C + +C L+ CS C+Y V YGD S ++GNL +T+TL ++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
LPG FGCG N GLF + G+ GLG +SL SQ + F+YCL P SS+
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-PSSSS 304
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGV------STPDIVIDS 305
+ + G T L T FY + + I VG + + + + VIDS
Sbjct: 305 GRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDS 364
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFR-GA 362
GT +T LP + L + + + A L+ CY F + +Q+P V + F GA
Sbjct: 365 GTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGA 424
Query: 363 DVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V L + + KVS+ + +S+ I GN Q F V YD+ Q + F C
Sbjct: 425 TVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484
Query: 421 T 421
+
Sbjct: 485 S 485
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 140/435 (32%), Positives = 207/435 (47%), Gaps = 53/435 (12%)
Query: 29 FSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQ--------NSSISSSKA 79
+SV+++HRDS N++ + +RL + L R R+ Q N + S
Sbjct: 114 WSVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHE 173
Query: 80 SQADIIPN------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ A++ + Y RI +GTP E+ V DTGSD++W QCEPC S+C
Sbjct: 174 NVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPC--SKC 231
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P +S+++ +L C+S+ C+ L+ +C G C Y VSYGDGS++ G+ ATE +
Sbjct: 232 YSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSYGDGSYTIGSFATEML 291
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T G+T+ + VA+ GCG +N GLF ++GLG G +S SQ+ T FSYCL
Sbjct: 292 TFGTTSVRNVAI-----GCGHDNAGLFVGAAG-LLGLGAGLLSFPSQLGTQTGRAFSYCL 345
Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI 301
V SS + FG + G + TPL TFY + + +ISVG L PD+
Sbjct: 346 VDRFSESSGTLEFGPESVPLGS--ILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDV 403
Query: 302 ------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
++DSGT +T L + + P A+ + CY + L
Sbjct: 404 FRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGL 463
Query: 350 S--QVPEVTIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
VP V HF GA + L N+ + + C F T+ + I GNI Q V
Sbjct: 464 PLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVS 523
Query: 406 YDIEQQTVSFKPTDC 420
+D V F C
Sbjct: 524 FDTANSLVGFALRQC 538
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 139/438 (31%), Positives = 215/438 (49%), Gaps = 53/438 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQA 82
+ L+HRDS + ++E +RL+ R+ ++ N + +S+ + A
Sbjct: 64 LHIHLLHRDS-FAVNATAAELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVA 122
Query: 83 DII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
++ P + Y+ +I++GTP + L DT SDL W QC+PC +CY Q P+FDP+
Sbjct: 123 PVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPRH 180
Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG----SFSNGNLATETVTLGST 192
S++Y + + C +L + C Y+V YGDG S S G+L ET+T
Sbjct: 181 STSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG 240
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGKFSYCLV--- 248
QA ++ GCG +N GLF + GI+GLG G IS+ Q+ FSYCLV
Sbjct: 241 VRQAY----LSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFI 296
Query: 249 --PVS-STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVSTPD- 300
P S S+ + FG + + P TP TFY + + +SVG R+ GV+ D
Sbjct: 297 SGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDL 356
Query: 301 ----------IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFN 347
+++DSGTT+T L + Y + + ++ V+ P+G + CY+
Sbjct: 357 QLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVG 416
Query: 348 SLS--QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNF 402
+ +VP V++HF G +V L N+ + V S VC F G + SV + GNI+Q F
Sbjct: 417 GRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGF 476
Query: 403 LVGYDIEQQTVSFKPTDC 420
V YD+ Q V F P +C
Sbjct: 477 RVVYDLAGQRVGFAPNNC 494
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 129/357 (36%), Positives = 179/357 (50%), Gaps = 35/357 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSS C L+ C+ C Y VSYGDGSF+ G+ +TET+T + VAL G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG +N GLF ++GLG G +S Q KFSYCLV S++ + +V G
Sbjct: 252 CGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASS---KPSSVVFG 307
Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
VS TPL K TFY + + ISVG R+ GV+ ++IDSG
Sbjct: 308 NAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSG 367
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
T++T L + + +A A + C+ +++++ VP V +HFRGADV
Sbjct: 368 TSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV 427
Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L +N+ + V + C F G + I GNI Q F V YD+ V F P C
Sbjct: 428 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 140/428 (32%), Positives = 201/428 (46%), Gaps = 46/428 (10%)
Query: 30 SVELIHRDSP---------------KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSI 74
S+E++H+ P + N + ++ L+++L R N + S
Sbjct: 62 SLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDST 121
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+ S + I +ANY + + +GTP + V DTGSDL WTQCEPC S CY Q +
Sbjct: 122 TLPAKSGSLI--GSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGS-CYKQQDAI 178
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK------SCSGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP SS+Y ++ C+SS C L S S C Y + YGD S S G L+ E +T
Sbjct: 179 FDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLT 238
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+ +T + FGCG +N GLF S + G++GLG IS + Q + FSYCL
Sbjct: 239 ITATD----IVDDFLFGCGQDNEGLF-SGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLP 293
Query: 249 PVSST--KINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRL-GVSTPDI- 301
SS+ + FG + + + TPL+ TFY L I ISVG +L VS+
Sbjct: 294 STSSSLGHLTFGASA-ATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFS 352
Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEV 355
+IDSGT +T L + L S +E PVA+ G + CY F+ + VP++
Sbjct: 353 AGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKI 412
Query: 356 TIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQT 412
F G V+L + S VC F G N + I+GN+ Q V YD+E
Sbjct: 413 DFEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGR 472
Query: 413 VSFKPTDC 420
+ F C
Sbjct: 473 IGFGAAGC 480
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 139/415 (33%), Positives = 214/415 (51%), Gaps = 53/415 (12%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
Q +RDAL R ++R F + + SSS +S A + PN Y++ ++IGTPP
Sbjct: 45 QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
A+ADTGSDL+WTQC PC +C+ Q SPL++P S T++ LPCSS+ CA+ + +
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163
Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
+ G C+Y+ +YG G +++G +ET T GS+ V +PGI FGC + +N
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWN-- 220
Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN-------FGTNGIVSGPGVVS 270
G GL G +S + AG FSYCL P TK ++G GV S
Sbjct: 221 --GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRS 278
Query: 271 TPLTKA------KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
TP + T+Y L + ISVG L + T ++IDSGTT+T L
Sbjct: 279 TPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVD 338
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHF-RGADVKLS 367
+ + + S+++ PV D + + L+LC++ S S +P +T+HF GAD+ L
Sbjct: 339 AAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLP 397
Query: 368 RSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N+ + + + C + T+ + GN Q N + YD++++T+SF P C+
Sbjct: 398 VENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 196/392 (50%), Gaps = 32/392 (8%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTG 111
+ ++ L+++L R N S ++ +++ + +ANY++ + +GTP + V DTG
Sbjct: 9 KYIQSRLSKNLGRENTVKDLDS--TTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTG 66
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN----QKSCSG---V 164
SDL WTQCEPC S CY Q +FDP SS+Y ++ C+SS C L + CS
Sbjct: 67 SDLTWTQCEPCAGS-CYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDA 125
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
+C Y YGD S S G L+ E +T+ +T + FGCG +N GLFN + G++GL
Sbjct: 126 SCIYDAKYGDNSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFNG-SAGLMGL 180
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSST--KINFGTNGIVSGPGVVSTPLTKA---KTF 279
G IS++ Q + FSYCL SS+ + FG + + ++ TPL+ +F
Sbjct: 181 GRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLTFGASA-ATNASLIYTPLSTISGDNSF 239
Query: 280 YVLTIDAISVGNQRL-GVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
Y L I +ISVG +L VS+ +IDSGT +T L + L S +E PV
Sbjct: 240 YGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPV 299
Query: 334 ADPTGSLELCYSFNSLSQ--VPEVTIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGIT 388
A+ G L+ CY + + VP + F G V+L SE VC F G
Sbjct: 300 ANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSD 359
Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N + ++GN+ Q V YD++ + F C
Sbjct: 360 NDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 122/355 (34%), Positives = 183/355 (51%), Gaps = 28/355 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ +GTPP + DTGS L W QC+PC C+ Q PL+DP +S TYK L
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLS 180
Query: 148 CSSSQC-----ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C+S +C A+LN C + C Y+ SYGD SFS G L+ + +TL S+ LP
Sbjct: 181 CASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLP 236
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFG 258
T+GCG +N GLF + GI+GL +S+++Q+ T FSYCL S+ F
Sbjct: 237 QFTYGCGQDNQGLFG-RAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFL 295
Query: 259 TNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP----DIVIDSGTTLTF 311
+ G +S TP+ +K + Y L + AI+V + L ++ +IDSGT +T
Sbjct: 296 SIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITR 355
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQVPEVTIHFR-GADVKLS 367
LP + L ++ + P S L+ C+ S S+S VPE+ + F+ GAD+ L
Sbjct: 356 LPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLR 415
Query: 368 RSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ ++ + I C F G TN + I GN Q + + YD+ + F P C
Sbjct: 416 APSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 146/451 (32%), Positives = 213/451 (47%), Gaps = 58/451 (12%)
Query: 18 VVSPIEAQT---GGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSS 73
VV P + +T +S+ L+HRD+ K ++E Y +R++ L R R+ N
Sbjct: 45 VVQPAKEETLEIKPWSIPLVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLE 104
Query: 74 IS-------------------SSKASQADII----PNNANYLIRISIGTPPTERLAVADT 110
++ + Q+ ++ + Y RI +G P ++L V DT
Sbjct: 105 LAVNGIKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDT 164
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYS 169
GSD+ W QCEPC S CY Q P+++P +SS+YK + C ++ C L+ CS +C Y
Sbjct: 165 GSDVTWIQCEPC--SDCYQQSDPIYNPALSSSYKLVGCQANLCQQLDVSGCSRNGSCLYQ 222
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
VSYGDGS++ GN ATET+TLG Q VA+ GCG +N GLF ++GLGGG +
Sbjct: 223 VSYGDGSYTQGNFATETLTLGGAPLQNVAI-----GCGHDNEGLFVGAAG-LLGLGGGSL 276
Query: 230 SLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLT 283
S SQ+ FSYCLV SS+ + FG + + G V P+ K TFY ++
Sbjct: 277 SFPSQLTDENGKIFSYCLVDRDSESSSTLQFGRAAVPN--GAVLAPMLKNSRLDTFYYVS 334
Query: 284 IDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
+ ISVG + L +S +++DSGT +T L +L + + P
Sbjct: 335 LSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPS 394
Query: 334 ADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITN 389
D + CY +S VP V HF G + L N+ V V S C F ++
Sbjct: 395 TDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSS 454
Query: 390 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S+ I GNI Q V +D V F C
Sbjct: 455 SLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 130/374 (34%), Positives = 194/374 (51%), Gaps = 40/374 (10%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A++ + + YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P F P
Sbjct: 80 AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S+TY+ +PC S CA+L +C + C Y YGD + + G LA+ET T G+ V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197
Query: 198 ALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---T 253
+ + FGCG N+G L NS +G+VGLG G +SL+SQ+ + +FSYCL S +
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPS 252
Query: 254 KINF-------GTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVS------ 297
++NF GTN SG V STPL + Y +++ IS+G +RL +
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAI 312
Query: 298 ----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NS 348
T + IDSGT+LT+L Q Y++ ++S + P D LE C+ + +
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSV 372
Query: 349 LSQVPEVTIHFR-GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
VP++ +HF GA++ + N+ + + +C ++ I GN Q N + Y
Sbjct: 373 AVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILY 431
Query: 407 DIEQQTVSFKPTDC 420
DI +SF P C
Sbjct: 432 DIANSLLSFVPAPC 445
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 130/374 (34%), Positives = 194/374 (51%), Gaps = 40/374 (10%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A++ + + YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P F P
Sbjct: 80 AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S+TY+ +PC S CA+L +C + C Y YGD + + G LA+ET T G+ V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197
Query: 198 ALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---T 253
+ + FGCG N+G L NS +G+VGLG G +SL+SQ+ + +FSYCL S +
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPS 252
Query: 254 KINF-------GTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVS------ 297
++NF GTN SG V STPL + Y +++ IS+G +RL +
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAI 312
Query: 298 ----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NS 348
T + IDSGT+LT+L Q Y++ ++S + P D LE C+ + +
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSV 372
Query: 349 LSQVPEVTIHFR-GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
VP++ +HF GA++ + N+ + + +C ++ I GN Q N + Y
Sbjct: 373 AVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILY 431
Query: 407 DIEQQTVSFKPTDC 420
DI +SF P C
Sbjct: 432 DIANSLLSFVPAPC 445
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 138/424 (32%), Positives = 202/424 (47%), Gaps = 40/424 (9%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-----NQNSSISS--SKASQADII 85
++HR P SP + P D L + R++ N+ S++ S ++ I
Sbjct: 91 VMHRHGPCSPLQTPGDAPSDA--DLLDQDQARVDSILGMITNETSAVGPGVSLPAERGIS 148
Query: 86 PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY++ + +GTP + V DTGSDL W QC PC CY Q PLF P SST+ +
Sbjct: 149 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSA 208
Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVA-- 198
+ C + +C + ++SC G C Y V YGD S + G+L +T+TLG+ A A
Sbjct: 209 VRCGARECRA--RQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAEN 266
Query: 199 ---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
LPG FGCG NN GLF + G+ GLG G +SL SQ FSYCL SS+
Sbjct: 267 DNKLPGFVFGCGENNTGLFG-QADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAP 325
Query: 256 NFGTNGI-VSGPGVVS-TPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSG 306
+ + G V P TP+ T +FY + + I V + + VS+P + ++DSG
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSG 385
Query: 307 TTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIHFR 360
T +T L P+ Y + + +S+M + P S L+ CY F N+ +P V + F
Sbjct: 386 TVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFA 445
Query: 361 GA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
G V S + KV++ + G S I GN Q V YD+ +Q + F
Sbjct: 446 GGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAA 505
Query: 418 TDCT 421
C+
Sbjct: 506 KGCS 509
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 131/361 (36%), Positives = 183/361 (50%), Gaps = 39/361 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGS L+WTQC+PC + C+ Q P +D SST+ C
Sbjct: 34 EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCD 91
Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
S+QC L+ VN C YS SYGD S + G L ETV+ + ++PG+ F
Sbjct: 92 STQC-KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVF 146
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGT 259
GCG NN G+F S TGI G G G +SL SQ++ G FS+C VS K +
Sbjct: 147 GCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPA 203
Query: 260 NGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDSG 306
+ +G G V +TPL K TFY L++ I+VG+ RL V T +IDSG
Sbjct: 204 DLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSG 263
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTIHFRGA 362
T T LP + ++ ++ V ++ TG L LC+S L + VP++ +HF GA
Sbjct: 264 TAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGA 322
Query: 363 DVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ L R N+ + + CS+ I + I GN Q N V YD++ +SF C
Sbjct: 323 TMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCD 382
Query: 422 K 422
K
Sbjct: 383 K 383
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 139/415 (33%), Positives = 214/415 (51%), Gaps = 53/415 (12%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
Q +RDAL R ++R F + + SSS +S A + PN Y++ ++IGTPP
Sbjct: 45 QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
A+ADTGSDL+WTQC PC +C+ Q SPL++P S T++ LPCSS+ CA+ + +
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163
Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
+ G C+Y+ +YG G +++G +ET T GS+ V +PGI FGC + +N
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWN-- 220
Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN-------FGTNGIVSGPGVVS 270
G GL G +S + AG FSYCL P TK ++G GV S
Sbjct: 221 --GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRS 278
Query: 271 TPLTKA------KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
TP + T+Y L + ISVG L + T ++IDSGTT+T L
Sbjct: 279 TPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVD 338
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHF-RGADVKLS 367
+ + + S+++ PV D + + L+LC++ S S +P +T+HF GAD+ L
Sbjct: 339 AAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLP 397
Query: 368 RSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N+ + + + C + T+ + GN Q N + YD++++T+SF P C+
Sbjct: 398 VENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 139/415 (33%), Positives = 214/415 (51%), Gaps = 53/415 (12%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
Q +RDAL R ++R F + + SSS +S A + PN Y++ ++IGTPP
Sbjct: 50 QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 109
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
A+ADTGSDL+WTQC PC +C+ Q SPL++P S T++ LPCSS+ CA+ + +
Sbjct: 110 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 168
Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
+ G C+Y+ +YG G +++G +ET T GS+ V +PGI FGC + +N
Sbjct: 169 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWN-- 225
Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN-------FGTNGIVSGPGVVS 270
G GL G +S + AG FSYCL P TK ++G GV S
Sbjct: 226 --GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRS 283
Query: 271 TPLTKA------KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
TP + T+Y L + ISVG L + T ++IDSGTT+T L
Sbjct: 284 TPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVD 343
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHF-RGADVKLS 367
+ + + S+++ PV D + + L+LC++ S S +P +T+HF GAD+ L
Sbjct: 344 AAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLP 402
Query: 368 RSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N+ + + + C + T+ + GN Q N + YD++++T+SF P C+
Sbjct: 403 VENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 456
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 139/406 (34%), Positives = 191/406 (47%), Gaps = 41/406 (10%)
Query: 45 NSSETPYQRLRDALTRSLNRLN------HFNQNSSISSSKASQADIIPNNANYLIRISIG 98
+S++TP Q L R R+ H +++ S S + + + + Y RI +G
Sbjct: 66 SSNKTPEQLFHLRLQRDAKRVEALLNQIHARRSAGSSFSSSIISGLAQGSGEYFTRIGVG 125
Query: 99 TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
TP V DTGSD++W QC PC +CY Q +FDP S TY +PC + C L+
Sbjct: 126 TPARYVYMVLDTGSDVVWLQCAPC--RKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDS 183
Query: 159 KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS 216
CS N CQY VSYGDGSF+ G+ +TET+T VAL GCG +N GLF
Sbjct: 184 PGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVAL-----GCGHDNEGLFTG 238
Query: 217 KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-----T 271
++GLG G +S Q KFSYCLV S++ + ++ G VS T
Sbjct: 239 AAG-LLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASA---KPSSVIFGDSAVSRTAHFT 294
Query: 272 PLT---KAKTFYVLTIDAISVGNQ----------RLGVS-TPDIVIDSGTTLTFLPQGYN 317
PL K TFY L + ISVG RL + ++IDSGT++T L +
Sbjct: 295 PLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAY 354
Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKV 375
L A + C+ + L++ VP V +HFRGADV L +N+ + V
Sbjct: 355 IALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGADVSLPATNYLIPV 414
Query: 376 SED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F G + + I GNI Q F + YD+ V F P C
Sbjct: 415 DNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 126/350 (36%), Positives = 182/350 (52%), Gaps = 28/350 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+FDPK SS+Y ++ CS
Sbjct: 116 NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCS 174
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S QC A+LN CS N C Y SYGD SFS G L+ +TV+ G A ++P
Sbjct: 175 SPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFG-----ANSVPNFY 229
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
+GCG +N GLF ++ G++GL +SL+ Q+ T+ FSYCL SS+ + + G
Sbjct: 230 YGCGQDNEGLFG-RSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSS--GYLSIGSY 286
Query: 264 SGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
+ G TP+ T + Y +++ ++V + L VS+ + +IDSGT +T LP
Sbjct: 287 NPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLPTS 346
Query: 316 -YNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFR-GADVKLSRSNF 371
Y + +V ++M + A L+ C+ + L VP V++ F GA +KLS N
Sbjct: 347 VYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNL 406
Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V V C F S I GN Q F V YD++ + F C+
Sbjct: 407 LVDVDGATTCLAF-APARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 195/384 (50%), Gaps = 33/384 (8%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVADTGSDL 114
T ++ L N ++++ S AS + P + NY+ R+ +GTP + V DTGS L
Sbjct: 102 TVTVASLYRANDDAAVDGSLAS-VPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSL 160
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQY 168
W QC PC S C+ Q P+FDPK SS+Y ++ CS+ QC A+LN +CS + C Y
Sbjct: 161 TWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIY 219
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
SYGD SFS G L+ +TV+ GS + +P +GCG +N GLF ++ G++GL
Sbjct: 220 QASYGDSSFSVGYLSKDTVSFGSNS-----VPNFYYGCGQDNEGLFG-RSAGLMGLARNK 273
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-TPL---TKAKTFYVLTI 284
+SL+ Q+ T+ FSYCL S+ + + PG S TP+ T + Y + +
Sbjct: 274 LSLLYQLAPTLGYSFSYCL---PSSSSSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKL 330
Query: 285 DAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
++V + L VS+ + +IDSGT +T LP L ++ ++ AD
Sbjct: 331 SGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSI 390
Query: 340 LELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 397
L+ C+ + S +VP V++ F GA +KLS N V V C F S I GN
Sbjct: 391 LDTCFVGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTTCLAF-APARSAAIIGNT 449
Query: 398 MQTNFLVGYDIEQQTVSFKPTDCT 421
Q F V YD++ + F CT
Sbjct: 450 QQQTFSVVYDVKSNRIGFAAGGCT 473
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 143/395 (36%), Positives = 192/395 (48%), Gaps = 43/395 (10%)
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
+ LTRS +R + S+ QA ++ + Y IRIS+GTPP V DTG
Sbjct: 24 NGLTRSRSR-----DRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTG 78
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS 171
SD++W QC PC CY Q +FDP SSTY +L CS+ QC +L+ +C C Y V
Sbjct: 79 SDILWLQCAPC--VNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIGTCQANKCLYQVD 136
Query: 172 YGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
YGDGSF+ G T+ V+L ST+G V L I GCG +N G F ++GLG G +S
Sbjct: 137 YGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAG-LLGLGKGPLS 195
Query: 231 LISQMRTTIAGKFSYCLVP-----VSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVL 282
+Q+ G+FSYCL + + FG V G TP + TFY L
Sbjct: 196 FPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFG-EAAVPPAGARFTPQDSNMRVPTFYYL 254
Query: 283 TIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
+ ISVG L + T ++IDSGT++T L N+ S+ +
Sbjct: 255 KMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQ---NAAYASLRDAFRAGTS 311
Query: 333 VADPTGSLEL---CYSFNSLS--QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFK 385
PT L CY + L+ VP VT+HF+G D+KL SN+ + V + + C F
Sbjct: 312 DLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFA 371
Query: 386 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
G T I GNI Q F V YD V F P+ C
Sbjct: 372 GTTGP-SIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 145/426 (34%), Positives = 208/426 (48%), Gaps = 48/426 (11%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
GF L H D+ ++ T Q L AL RS R+ ++++ A A +
Sbjct: 30 GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ ++ YL+ + IGTP A+ DTGSDLIWTQC PC C Q +P FDP S+TY+
Sbjct: 84 LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
SL C+S C +L C C Y YGD + + G LA ET T G T V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTN 260
GCG N GL + +G+VG G G +SL+SQ+ + +FSYCL PV S ++ FG
Sbjct: 201 GCGNLNAGLL-ANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPS-RLYFGVY 255
Query: 261 GIV-----SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
+ S V STP T Y L + ISVG L + T
Sbjct: 256 ATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGT 315
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 355
+IDSGTT+T+L + + + +S I P+ + T + L+ C+ + + +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374
Query: 356 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
+HF GAD +L N+ V S + ++ I G+ NF V YD+E +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434
Query: 415 FKPTDC 420
F P C
Sbjct: 435 FVPAPC 440
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 148/439 (33%), Positives = 212/439 (48%), Gaps = 56/439 (12%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNR---------LNHFNQ 70
G + L H SP SP S+ P+ R+ +R N L H ++
Sbjct: 42 GLHLTLHHPQSPCSPAPLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHR 101
Query: 71 NSSISSSKASQAD-----IIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
SQA + P + NY+ R+ +GTP T + V DTGS L W QC P
Sbjct: 102 KKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSP 161
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDG 175
C S C+ Q P+FDP+ S TY ++ CSSS+C A+LN +CS N C Y SYGD
Sbjct: 162 CSVS-CHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDS 220
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S+S G L+ +TV+ GS + PG +GCG +N GLF ++ G++GL +SL+ Q+
Sbjct: 221 SYSVGYLSKDTVSFGSGS-----FPGFYYGCGQDNEGLFG-RSAGLIGLAKNKLSLLYQL 274
Query: 236 RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-TPLTKA---KTFYVLTIDAISVGN 291
++ FSYCL P SS + + G + PG S TP+ + + Y +T+ ISV
Sbjct: 275 APSLGYAFSYCL-PTSSAAAGYLSIGSYN-PGQYSYTPMASSSLDASLYFVTLSGISVAG 332
Query: 292 QRLGV------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY 344
L V S P I IDSGT +T LP + L +++ + + PT S L+ C+
Sbjct: 333 APLAVPPSEYRSLPTI-IDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCF 391
Query: 345 SFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
++ +VP V + F GA + LS N + V + C F T I GN Q F
Sbjct: 392 RGSAAGLRVPRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFA-PTGGTAIIGNTQQQTF 450
Query: 403 LVGYDIEQQTVSFKPTDCT 421
V YD+ Q + F C+
Sbjct: 451 SVVYDVAQSRIGFAAGGCS 469
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 143/428 (33%), Positives = 207/428 (48%), Gaps = 45/428 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL---RDALTRSLN-RLNHFNQNSSISSSKASQAD 83
G +EL H SP SP ++ P+ + DA SL RL + S + A
Sbjct: 42 GLHLELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADAG 101
Query: 84 IIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ + A NY+ R+ +GTP T+ + V DTGS L W QC PC S C+ Q
Sbjct: 102 LAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQ 160
Query: 131 DSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
P+F+PK SSTY S+ CS+ QC A+LN +CS N C Y SYGD SFS G L+
Sbjct: 161 SGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSK 220
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
+TV+ GST+ LP +GCG +N GLF ++ G++GL +SL+ Q+ ++ F+
Sbjct: 221 DTVSFGSTS-----LPNFYYGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFT 274
Query: 245 YCLVPVSSTKINFGTNGIVSGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD 300
YCL S+ + + PG S TP+ + + Y + + ++V L VS+
Sbjct: 275 YCL---PSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSA 331
Query: 301 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPE 354
+IDSGT +T LP S L +++ ++ A L+ C+ S P
Sbjct: 332 YSSLPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPA 391
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
VT+ F GA +KLS N V V + C F S I GN Q F V YD++ +
Sbjct: 392 VTMSFAGGAALKLSAQNLLVDVDDSTTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRI 450
Query: 414 SFKPTDCT 421
F C+
Sbjct: 451 GFAAGGCS 458
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 137/391 (35%), Positives = 190/391 (48%), Gaps = 36/391 (9%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
RL L R N H ++++ + A Q ++ + Y +R+ IG PP++ V
Sbjct: 107 RLDLVLKRVSNSDLHPAESNAEFEANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSD+ W QC PC S+CY Q P+FDP S++Y + C + QC SL+ C C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLY 224
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
VSYGDGS++ G ATETVTLG+ + VA+ GCG NN GLF G++GLGGG
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGTAAVENVAI-----GCGHNNEGLF-VGAAGLLGLGGGK 278
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTID 285
+S +Q+ T FSYCLV S ++ VV+ PL + TFY L +
Sbjct: 279 LSFPAQVNAT---SFSYCLVNRDSDAVSTLEFNSPLPRNVVTAPLRRNPELDTFYYLGLK 335
Query: 286 AISVGNQRLGVSTPDIVI------------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
ISVG + L + P+ + DSGT +T L L + P
Sbjct: 336 GISVGGEALPI--PESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPK 393
Query: 334 ADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKV-SEDIVCSVFKGITN 389
A+ + CY +S QVP V+ HF G ++ L N+ + V S C F T+
Sbjct: 394 ANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTS 453
Query: 390 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S+ I GN+ Q VG+DI V F C
Sbjct: 454 SLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 128/357 (35%), Positives = 178/357 (49%), Gaps = 35/357 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSS C L+ C+ C Y VSYGDGSF+ G+ +TET+T + VAL G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG +N GLF ++GLG G +S Q KFSYCLV S++ + +V G
Sbjct: 252 CGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASS---KPSSVVFG 307
Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
VS TPL K TFY + + ISVG R+ GV+ ++IDSG
Sbjct: 308 NAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSG 367
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
T++T L + + + A + C+ +++++ VP V +HFRGADV
Sbjct: 368 TSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV 427
Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L +N+ + V + C F G + I GNI Q F V YD+ V F P C
Sbjct: 428 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 127/373 (34%), Positives = 182/373 (48%), Gaps = 34/373 (9%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
Q SSS S + + Y R+ +GTPP V DTGSD++W QC PC +CY
Sbjct: 128 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 183
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK S ++ S+ C S C L+ C S +C Y V+YGDGSF+ G +TET+T
Sbjct: 184 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 243
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+ +P + GCG +N GLF ++GLG G +S +Q KFSYCLV
Sbjct: 244 F-----RGTRVPKVALGCGHDNEGLFVGAAG-LLGLGRGRLSFPTQTGLRFGRKFSYCLV 297
Query: 249 PVSS----TKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD- 300
S+ + + FG + + V TPL K TFY L + ISVG R+ T
Sbjct: 298 DRSASSKPSSVVFGQSAVSR--TAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASL 355
Query: 301 ----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
++IDSGT++T L + +L + A + C+ + +
Sbjct: 356 FKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGKT 415
Query: 351 Q--VPEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
+ VP V +HFRGADV L +N+ + V + + C F G + + I GNI Q F V +D
Sbjct: 416 EVKVPTVVMHFRGADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFD 475
Query: 408 IEQQTVSFKPTDC 420
+ + F C
Sbjct: 476 VAASRIGFAARGC 488
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 145/436 (33%), Positives = 206/436 (47%), Gaps = 55/436 (12%)
Query: 29 FSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQN---------------- 71
+SV+L+HRDS N++ + +RL + L R R+ Q
Sbjct: 71 WSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYE 130
Query: 72 --SSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ +++ S+ + + + Y RI IGTP E+ V DTGSD++W QCEPC +C
Sbjct: 131 NVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--REC 188
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P S ++ ++ C S+ C+ L+ C G C Y VSYGDGS++ G+ ATET+
Sbjct: 189 YSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETL 248
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T G+T+ Q VA+ GCG +N GLF ++GLG G +S +Q+ T FSYCL
Sbjct: 249 TFGTTSIQNVAI-----GCGHDNVGLFVGAAG-LLGLGAGSLSFPAQLGTQTGRAFSYCL 302
Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD- 300
V SS + FG + G + TPL TFY L++ AISVG L S P
Sbjct: 303 VDRDSESSGTLEFGPESVPIGS--IFTPLVANPFLPTFYYLSMVAISVGGVILD-SVPSE 359
Query: 301 ------------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
I+IDSGT +T L L + + P AD + CY ++
Sbjct: 360 AFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSA 419
Query: 349 LSQV--PEVTIHF-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLV 404
L V P V HF GA L N + + S C F +++ I GNI Q V
Sbjct: 420 LQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRV 479
Query: 405 GYDIEQQTVSFKPTDC 420
+D V F C
Sbjct: 480 SFDSANSLVGFAIDQC 495
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 133/421 (31%), Positives = 202/421 (47%), Gaps = 39/421 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETP---YQRLRDALTRS---LNRLNHFNQNSSISSSKASQAD 83
SV L+HR P +P SS+ P RLR RS ++R++ S +
Sbjct: 57 SVPLVHRHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDADVSIPTHLG 116
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
++ Y++ + +GTP ++ + DTGSDL W QC+PC + CY Q PLFDP SSTY
Sbjct: 117 GSVDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTY 176
Query: 144 KSLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+PC++ C L G C ++++YGDGS + G + ET+ L
Sbjct: 177 APIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAP---- 232
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
VA+ FGCG + G N K G++GLGG SL+ Q + G FSYCL P + ++
Sbjct: 233 GVAVKDFRFGCGHDQDGA-NDKYDGLLGLGGAPESLVVQTASVYGGAFSYCL-PALNNQV 290
Query: 256 --------NFGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGVS----TPDIV 302
+ G+V+ G V TP+ + +TFYV+ + I+VG + + V + ++
Sbjct: 291 GFLALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAFSGGMI 350
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
IDSGT +T L + L + + A P+ G L+ CY F+ S V P+V + F
Sbjct: 351 IDSGTVVTELQHTAYNALQAAFRKAMAAYPLVR-NGELDTCYDFSGYSNVTLPKVALTFS 409
Query: 361 -GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
GA + L N + +D + G + I GN+ Q V YD + V F+
Sbjct: 410 GGATIDLDVPNGILL--DDCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAV 467
Query: 420 C 420
C
Sbjct: 468 C 468
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 131/412 (31%), Positives = 197/412 (47%), Gaps = 39/412 (9%)
Query: 30 SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLN-----HFNQNSSISSSKA 79
S+E++H+ P S N S+TP+ + + + +N + Q+SS+S +
Sbjct: 70 SLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDS 129
Query: 80 ----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
+++ + + NY + + +GTP + + DTGSDL WTQCEPC S CY Q +F
Sbjct: 130 VTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQDAIF 188
Query: 136 DPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVT 188
DP S++Y ++ C+S+ C L N+ CS C Y + YGD SFS G + E ++
Sbjct: 189 DPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLS 248
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+ +T + FGCG NN GLF + G++GLG IS + Q FSYCL
Sbjct: 249 VTATD----IVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAVYRKIFSYCLP 303
Query: 249 PVSST--KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDI 301
SS+ +++FGT + +++ +FY L I ISVG +L V ST
Sbjct: 304 ATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGA 363
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 359
+IDSGT +T LP + L S + P A L+ CY + +P++ F
Sbjct: 364 IIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSF 423
Query: 360 RGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 408
G V+L S VC F G + V IYGN+ Q V YD+
Sbjct: 424 AGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 140/423 (33%), Positives = 209/423 (49%), Gaps = 41/423 (9%)
Query: 30 SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
S+E++H+ P S P +S + Q L +R + + +N + S+ KAS+A +
Sbjct: 76 SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 135
Query: 86 PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+A NY++ + +G+P + + DTGSDL WTQCEPC CY Q +FDP
Sbjct: 136 SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPC-VGYCYQQREHIFDPST 194
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S +Y ++ C S C L N CS C Y + YGDGS+S G A E ++L ST
Sbjct: 195 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 253
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSS 252
FGCG NN GLF T G++GL +SL+SQ FSYCL S+
Sbjct: 254 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST 309
Query: 253 TKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----STPDIVID 304
++FG+ G V TP + +FY L + ISVG ++L + ST +ID
Sbjct: 310 GYLSFGS-GDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIID 368
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-G 361
SGT ++ LP S++ V ++ P L+ CY + +VP++ ++F G
Sbjct: 369 SGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGG 428
Query: 362 ADVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
A++ L+ + +KVS+ VC F G + + V I GN+ Q V YD + V F P
Sbjct: 429 AEMDLAPEGIIYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAP 486
Query: 418 TDC 420
+ C
Sbjct: 487 SGC 489
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 176/366 (48%), Gaps = 38/366 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSSTYKSLPC 148
YL+ +S+GTPP DTGSDL+WTQC PC C+ Q +P+ DP SST+ +LPC
Sbjct: 89 EYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPC--LDCFEQGAAPVLDPAASSTHAALPC 146
Query: 149 SSSQCASLNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGI 202
+ C +L SC G +C Y YGD S + G LAT++ T G +A +
Sbjct: 147 DAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRV 206
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFG 258
TFGCG N G+F + TGI G G G SL SQ+ T FSYC + TK + G
Sbjct: 207 TFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFDTKSSSVVTLG 263
Query: 259 --------TNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----VI 303
T+ V +T L K + Y + + ISVG R+ V + +I
Sbjct: 264 AAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSSTII 323
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIH 358
DSG ++T LP+ + + S + A + +L+LC++ + VP +T+H
Sbjct: 324 DSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRPAVPALTLH 383
Query: 359 FR-GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GAD +L R N+ F + ++C V + GN Q N V YD+E +SF
Sbjct: 384 LDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDLENDVLSFA 443
Query: 417 PTDCTK 422
P C K
Sbjct: 444 PARCDK 449
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 147/430 (34%), Positives = 206/430 (47%), Gaps = 59/430 (13%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISS---------SKA 79
FSV+L H D+ +NS TP L R R+ + + + S +
Sbjct: 60 FSVQLHHVDALS---FNS--TPETLFTTRLQRDAARVEAISYLAETAGTGKRVGTGFSSS 114
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+ + + Y RI +GTPP V DTGSD++W QC PC +CY Q P+FDP+
Sbjct: 115 VISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPC--KRCYAQSDPVFDPRK 172
Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S ++ S+ C S C L+ C+ C Y VSYGDGSF+ G+ +TET+T T V
Sbjct: 173 SRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARV 232
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
AL GCG +N GLF ++GLG G +S SQ KFSYCLV S++
Sbjct: 233 AL-----GCGHDNEGLFVGAAG-LLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASS--- 283
Query: 258 GTNGIVSGPGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP--------- 299
+ +V G VS TPL K TFY + + ISVG R+ G++
Sbjct: 284 KPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGN 343
Query: 300 -DIVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLSQ-- 351
++IDSGT++T L + + + S++ A P SL + C+ + ++
Sbjct: 344 GGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRA-----PQFSLFDTCFDLSGKTEVK 398
Query: 352 VPEVTIHFRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
VP V +HFRGADV L SN+ + V + C F G + I GNI Q F V YD+
Sbjct: 399 VPTVVLHFRGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAG 458
Query: 411 QTVSFKPTDC 420
V F P C
Sbjct: 459 SRVGFAPHGC 468
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 134/415 (32%), Positives = 201/415 (48%), Gaps = 37/415 (8%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
++I +D + F +S T + +R++ T + S+ S+ ++ + + NY
Sbjct: 59 DMITKDEERVRFLHSRLTNKESVRNSATT-----DKLRGGPSLVSTTPLKSGLSIGSGNY 113
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC--- 148
++I +GTP + DTGS L W QC+PC C++Q P+F P S TYK+LPC
Sbjct: 114 YVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSTSKTYKALPCSSS 172
Query: 149 --SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
SS + ++LN CS C Y SYGD SFS G L+ + +TL T G +
Sbjct: 173 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSEAPSSGFVY 229
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS--------STKIN 256
GCG +N GLF +++GI+GL IS++ Q+ FSYCL S ++
Sbjct: 230 GCGQDNQGLFG-RSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLS 288
Query: 257 FGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTL 309
G + + S P TPL K + + Y L + I+V + LGVS +IDSGT +
Sbjct: 289 IGASSLTSSP-YKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVI 347
Query: 310 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADVK 365
T LP YN+ S + M + A L+ C+ S +S VPE+ I FR GA ++
Sbjct: 348 TRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLE 407
Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N V++ + C +N + I GN Q F V YD+ + F P C
Sbjct: 408 LKAHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 130/367 (35%), Positives = 183/367 (49%), Gaps = 44/367 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 34 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 91
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 92 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 148
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFG 258
FGCG N G+F S TGI G G G +SL SQ++ G FS+C + S+ ++
Sbjct: 149 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLP 205
Query: 259 TNGIVSGPGVV-STPLTK-AK-----TFYVLTIDAISVGNQRLGV---------STPDIV 302
+ +G G V +TPL + AK T Y L++ I+VG+ RL V T +
Sbjct: 206 ADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTI 265
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
IDSGT++T LP + ++ I+ V C+S S ++ VP++ +HF
Sbjct: 266 IDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFE 325
Query: 361 GADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
GA + L R N+ +V +D I+C ++ KG + I GN Q N V YD++ +SF
Sbjct: 326 GATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLSF 383
Query: 416 KPTDCTK 422
C K
Sbjct: 384 VAAQCDK 390
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 182/359 (50%), Gaps = 38/359 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 137 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 192
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 193 SSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDT-----K 247
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
L + FGCG NN GLF +G++GLG +SL+SQ T G FSYCL + +S +
Sbjct: 248 LENLVFGCGRNNKGLFGG-ASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTL 306
Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
+FG + V + V TPL + ++FY+L + S+G L + I+IDSGT
Sbjct: 307 SFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGILIDSGTV 366
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
+T LP + + P A L+ C++ S +P + + F G +
Sbjct: 367 ITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELE 426
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V ++ +FVK +VC ++ N V I GN Q N V YD Q+ + +C
Sbjct: 427 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 139/447 (31%), Positives = 201/447 (44%), Gaps = 46/447 (10%)
Query: 6 SCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
SC+ LFFL P+ + T L H D + T + LR + RS R
Sbjct: 11 SCMLPYLFFLAILFAWPVTSAT--LRAHLSHVDDGRG------FTKRELLRRMVVRSRAR 62
Query: 65 LNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVA-DTGSDLIWTQCE 120
+ S ++ A+ N N+ YLI +SIG P ++ + + DTGSD++WTQCE
Sbjct: 63 AANLCPYSGATARPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE 122
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PC ++C+ Q P FD S+T +S+ CS C + ++ C C Y YGDGS S G
Sbjct: 123 PC--AECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFG 180
Query: 181 NLATETVTLGSTTGQA-VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
+ ++ T G V +P I FGCG N G F TGI G G G +SL SQ++
Sbjct: 181 HFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVR- 239
Query: 240 AGKFSYCLV---PVSSTKINFGTNG---------IVSGPGVVSTPLTKAKTFYVLTIDAI 287
+FSYC S+ + G G I+S P V S P + YVL+ +
Sbjct: 240 --QFSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGV 297
Query: 288 SVGNQRLGVSTPDI--------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
+VG RL V P+I IDSGT +T P L S + A PV
Sbjct: 298 TVGKTRLPV--PEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADE 354
Query: 340 LELCYSFN--SLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYG 395
++C+S++ + +P++ H GAD L R N+ + E + +V + G
Sbjct: 355 DDICFSWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIG 414
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N Q N + YD+ + P C K
Sbjct: 415 NFQQQNTHIVYDLAAGKLLLVPAQCDK 441
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 143/444 (32%), Positives = 215/444 (48%), Gaps = 67/444 (15%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
GGFSVELIHRDS KSPF++ T + R A R S +SS D+
Sbjct: 25 GGFSVELIHRDSIKSPFHDPKLTRHDRFL-AAARRSRARAAALLASDVSS------DLFY 77
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM----------------- 129
+ YL +++GTPP LAVADTGSDL+W +C + +
Sbjct: 78 GDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPP 137
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
+ F+P SS+Y + C C +L SC+G + C + SY DG+ + G LA +T
Sbjct: 138 EAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAADT 197
Query: 187 VTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T G+ + I FGC T G + G+VGLG G +SL SQ+ KFS+
Sbjct: 198 FTFGGNINNDTTSTASIDFGCATGTAGR-EFQADGMVGLGAGPLSLASQL----GRKFSF 252
Query: 246 CL----VPVSSTKINFGTNGIVSGPGVVSTPL----TKAKTFYVLTIDAISVGNQRL--G 295
CL + +S+ +NFG +VS PG +TPL + A +Y ++ID++ V Q +
Sbjct: 253 CLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGT 312
Query: 296 VSTPDIVIDSGTTLTFLPQG-----YNSNLLSVM--SSMIEAQPVADPTGSLELCYSFNS 348
S +++D+GT LTFL + +L VM + + A P P +LELCY +
Sbjct: 313 TSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPP---PDETLELCYDVSR 369
Query: 349 LSQV----PEVTIHF---RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-----VPIYGN 396
+ V P+VT+ G +V+L+ FV V E ++C +T S + + GN
Sbjct: 370 VKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLC--LAVVTTSPELQPLSVLGN 427
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
+ + VG D++ +T +F +C
Sbjct: 428 VALQDLHVGIDLDARTATFATANC 451
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/431 (31%), Positives = 202/431 (46%), Gaps = 49/431 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--------NQNSSISSSKASQ 81
SV L+HR P +P S P L + L R R N+ +++S +
Sbjct: 44 SVPLVHRHGPCAPSAASGGKP--SLAERLRRDRARANYIVTKAAGGRTAATAVSDAVGGG 101
Query: 82 ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
IP ++ Y++ + IGTP +++ + DTGSDL W QC+PC +CY Q PL
Sbjct: 102 GTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPL 161
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQ-------KSCSGVNCQYSVSYGDGSFSNGNLATETV 187
FDP SS+Y S+PC S C L S + C+Y + YG+ + + G +TET+
Sbjct: 162 FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETL 221
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
TL V + FGCG + G + K G++GLGG SL+SQ + G FSYCL
Sbjct: 222 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 276
Query: 248 VPVSSTK--INFGT----NGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS- 297
P S + G + + G + TP+ + TFYV+T+ ISVG L V
Sbjct: 277 PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPP 336
Query: 298 ---TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ- 351
+ +VIDSGT +T LP + L S S + + P+ L+ CY F +
Sbjct: 337 SAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNV 396
Query: 352 -VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
VP + + F GA + L+ + + + G +++ I GN+ Q F V YD
Sbjct: 397 TVPTIALTFSGGATIDLATPAGVLV--DGCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSG 454
Query: 410 QQTVSFKPTDC 420
+ TV F+ C
Sbjct: 455 KGTVGFRAGAC 465
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 139/425 (32%), Positives = 200/425 (47%), Gaps = 50/425 (11%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQAD------ 83
V+L H D+ +S ETP L R +R+ +++ S+ ++A
Sbjct: 80 VQLHHLDA-----LSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSS 134
Query: 84 -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
+ + Y R+ +GTP V DTGSD++W QC PC +CY Q P+F+P
Sbjct: 135 SVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPC--KKCYSQTDPVFNPT 192
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
S ++ ++PC S C L+ CS C Y VSYGDGSF+ G +TET+T T
Sbjct: 193 KSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGR 252
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-- 254
VAL GCG +N GLF ++GLG G +S SQ+ + KFSYCLV S++
Sbjct: 253 VAL-----GCGHDNEGLFIGAAG-LLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKP 306
Query: 255 --INFGTNGIVSGPG---VVSTPLTKAKTFYVLTIDAISVGNQRL-GVSTP--------- 299
+ FG + I +VS P K TFY + + +SVG R+ G++
Sbjct: 307 SYMVFGDSAISRTARFTPLVSNP--KLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGN 364
Query: 300 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVT 356
++IDSGT++T L + L A + C+ + ++ VP V
Sbjct: 365 GGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVV 424
Query: 357 IHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+HFRGADV L SN+ + V C F G + + I GNI Q F V YD+ V F
Sbjct: 425 LHFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGF 484
Query: 416 KPTDC 420
P C
Sbjct: 485 APRGC 489
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 132/424 (31%), Positives = 210/424 (49%), Gaps = 48/424 (11%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR----LNHFNQNSSISSSKASQADI 84
+ ++L+HRD K P +N+ R + R R L +++A +D+
Sbjct: 68 YKLKLVHRD--KVPTFNTYHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYAAEAFGSDV 125
Query: 85 I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ + Y +RI +G+PP + V D+GSD+IW QCEPC +QCY Q P+F+P S
Sbjct: 126 VSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 183
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S++ + C+S+ C+ ++ +C C+Y VSYGDGS++ G LA ET+T G T + VA+
Sbjct: 184 SSFSGVSCASTVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAI- 242
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINF 257
GCG +N G+F ++GLGGG +S + Q+ G FSYCLV SS + F
Sbjct: 243 ----GCGHHNQGMFVGAAG-LLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEF 297
Query: 258 GTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVID 304
G + G V PL +A++FY + + + VG R+ +S +V+D
Sbjct: 298 GREAMPVGAAWV--PLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMD 355
Query: 305 SGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIH 358
+GT +T LP + + ++ +++ P A + CY F +S +VP V+ +
Sbjct: 356 TGTAVTRLPTVAYEAFRDGFIAQTTNL----PRASGVSIFDTCYDLFGFVSVRVPTVSFY 411
Query: 359 FRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
F G + L NF + V + C F ++ + I GNI Q + D V F
Sbjct: 412 FSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFG 471
Query: 417 PTDC 420
P C
Sbjct: 472 PNVC 475
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 134/423 (31%), Positives = 198/423 (46%), Gaps = 39/423 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETPY------------QRLRDALTRSLNRLNHFNQNSSISSS 77
S+E++H+ P S +S + +R++ +R L N+ + S+
Sbjct: 66 SLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDST 125
Query: 78 KA-SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+++ + +A+Y + + +GTP + + DTGS L WTQCEPC S CY Q P+FD
Sbjct: 126 TLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGS-CYKQQDPIFD 184
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SS+Y ++ C+SS C CS +C Y V YGD S S G L+ E +T+ +T
Sbjct: 185 PSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD 244
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVS 251
+ FGCG +N GLF T G++GL IS + Q + FSYCL P S
Sbjct: 245 ----IVHDFLFGCGQDNEGLFRG-TAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPSS 299
Query: 252 STKINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRL-GVSTPDI-----V 302
+ FG + + + TP ++ +FY L I ISVG +L VS+ +
Sbjct: 300 LGHLTFGASA-ATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSI 358
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
IDSGT +T LP + L S + PVA T L+ CY F+ + VP + F
Sbjct: 359 IDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFA 418
Query: 361 GA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
G V+L S +C F G N + I+GN+ Q V YD+E + F
Sbjct: 419 GGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGA 478
Query: 418 TDC 420
C
Sbjct: 479 AGC 481
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 134/431 (31%), Positives = 200/431 (46%), Gaps = 55/431 (12%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP----- 86
++HRD+ + + T + L+ L R R ++ + + P
Sbjct: 68 RVVHRDT-----FAVNATAGELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSGL 122
Query: 87 --NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ Y +I +GTP T+ L V DTGSD++W QC PC +CY Q P+FDP+ SS+Y
Sbjct: 123 AQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYG 180
Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
++ C ++ C L+ C C Y V+YGDGS + G+ TET+T G VA +
Sbjct: 181 AVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG--GARVAR--V 236
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-------- 254
GCG +N GLF + ++GLG G +S +Q+ FSYCLV +S+
Sbjct: 237 ALGCGHDNEGLFVAAAG-LLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSH 295
Query: 255 ----INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVSTPD------ 300
++FG G V TP+ + +TFY + + ISVG R+ GV+ D
Sbjct: 296 RSSTVSFGA-GSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPS 354
Query: 301 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSF--NSLSQ 351
+++DSGT++T L + S L + P G + CY + +
Sbjct: 355 TGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVK 414
Query: 352 VPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
VP V++HF GA+ L N+ + V S C F G V I GNI Q F V +D +
Sbjct: 415 VPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGD 474
Query: 410 QQTVSFKPTDC 420
Q V F P C
Sbjct: 475 GQRVGFAPKGC 485
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 138/430 (32%), Positives = 201/430 (46%), Gaps = 55/430 (12%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR---LNHFNQNSSISSSKASQADI 84
GF L H D+ + T Q L A+ RS R L ++ + ++ +
Sbjct: 29 GFQATLTHIDA------GAGYTEAQLLSRAVRRSKARVAALQSLATTTAADAITVARILV 82
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ + YL+ + IGTPP A+ DTGSDLIWTQC PC C Q +P FDP S +Y
Sbjct: 83 LASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPC--MLCVDQPTPFFDPAQSPSYA 140
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
LPC+S C +L C C Y YGD + + G L+ ET T G T V +P I F
Sbjct: 141 KLPCNSPMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFG-TNDTRVTVPRIAF 199
Query: 205 GCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGT 259
GCG N G LFN +G+VG G G +SL+SQ+ + +FSYCL PV S ++ FG
Sbjct: 200 GCGNLNAGSLFNG--SGMVGFGRGPLSLVSQLGSP---RFSYCLTSFMSPVPS-RLYFGA 253
Query: 260 NGIV------SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TP 299
+ +G V STP T Y L + ISVG + L + T
Sbjct: 254 YATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTG 313
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCYSF----NSLS 350
++IDSG+T+T+L + + + + A +AD L+ C+ + +
Sbjct: 314 GVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLAD---VLDTCFVWPPPPRKIV 370
Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
+PE+ HF GA+++L N+ + + + ++ I G+ NF V YD E
Sbjct: 371 TMPELAFHFEGANMELPLENYMLIDGDTGNLCLAIAASDDGSIIGSFQHQNFHVLYDNEN 430
Query: 411 QTVSFKPTDC 420
+SF P C
Sbjct: 431 SLLSFTPATC 440
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 139/436 (31%), Positives = 204/436 (46%), Gaps = 51/436 (11%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--NQNSSISSSKA 79
+E + S+ L+HR P +P S P + + L RS R N+ + S+ A
Sbjct: 48 LEPSSATVSMSLVHRYGPCAP-SQYSNVPTPSISETLRRSRARTNYIMSQASKSMGMGMA 106
Query: 80 SQAD------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
S D IP ++ Y++ + GTP ++ + DTGSD+ W QC PC ++
Sbjct: 107 STPDDDDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTK 166
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGN 181
CY Q PLFDP SSTY + C++ C L N + G C YSV Y DGS S G
Sbjct: 167 CYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGV 226
Query: 182 LATETVTLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+ ET+TL PGIT FGCG + G + K G++GLGG +SL+ Q
Sbjct: 227 YSNETLTLA---------PGITVEDFHFGCGRDQRGP-SDKYDGLLGLGGAPVSLVVQTS 276
Query: 237 TTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGN 291
+ G FSYCL ++S + G+ + V TP+ TFY++T+ ISVG
Sbjct: 277 SVYGGAFSYCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGG 336
Query: 292 QRLGVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
+ L + ++IDSGT T LP+ + L + + ++A P+ P+ + CY+F
Sbjct: 337 KPLHIPQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLV-PSDDFDTCYNFT 395
Query: 348 SLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLV 404
S VP V F GA + L N + D + G + + I GN+ Q V
Sbjct: 396 GYSNITVPRVAFTFSGGATIDLDVPNGILV--NDCLAFQESGPDDGLGIIGNVNQRTLEV 453
Query: 405 GYDIEQQTVSFKPTDC 420
YD + V F+ C
Sbjct: 454 LYDAGRGNVGFRAGAC 469
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 141/430 (32%), Positives = 209/430 (48%), Gaps = 47/430 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRL--RD-ALTRSLNRLNHFNQNSSISSSKASQADIIP 86
S++L+HRD+ + S L RD A L R + + S +SS S I+
Sbjct: 58 SLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS 117
Query: 87 N-NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ + YL+R+ IG+PP E+ VADTGSD+IW QC PC S CY Q PLFDP S+++
Sbjct: 118 HGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPC--SDCYAQGDPLFDPANSASFSP 175
Query: 146 LPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVAL 199
+PC+S C + + G C+Y VSYGD S++NG LA ET+TL G T Q VA+
Sbjct: 176 VPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGVAM 235
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGT 259
GCG N GLF ++ G++GLG G +SL+ Q+ G FSYCL S + +
Sbjct: 236 -----GCGHENRGLF-AEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSG 289
Query: 260 NGIV----SGP-GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDI 301
+ ++ + P G V PL + A +FY + ++ + V +RL + +
Sbjct: 290 SLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGV 349
Query: 302 VIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 358
V+D+GT +T LP + Y + + + E P A + CY + + +VP V ++
Sbjct: 350 VMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTVALY 409
Query: 359 F-------RGADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
F A + L N V V + C F + + I GNI Q + D
Sbjct: 410 FGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSAS 469
Query: 411 QTVSFKPTDC 420
V F P C
Sbjct: 470 GYVGFGPATC 479
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 135/416 (32%), Positives = 195/416 (46%), Gaps = 46/416 (11%)
Query: 30 SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLNHFN--------QNSSI-- 74
S+E++H+ P S + S TP+ D L + R+ + N Q+SS+
Sbjct: 71 SLEVVHKHGPCSQLNDHDGKAKSTTPHS---DILNQDKERVKYINSRLSKNLGQDSSVEE 127
Query: 75 --SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
S++ +++ + + NY + + +GTP + + DTGSDL WTQCEPC S CY Q
Sbjct: 128 LDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQD 186
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATE 185
+FDP S++Y ++ C+S+ C L N CS C Y + YGD SFS G + E
Sbjct: 187 VIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRE 246
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
+T+ +T + FGCG NN GLF + G++GLG IS + Q FSY
Sbjct: 247 RLTVTATD----VVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAKYRKIFSY 301
Query: 246 CLVPVSSTKINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRLGV-----S 297
CL SS+ + +G + TP +++ +FY L I AI+VG +L V S
Sbjct: 302 CLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFS 361
Query: 298 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
T +IDSGT +T LP L S + P A L+ CY + +P +
Sbjct: 362 TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTI 421
Query: 356 TIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 408
F G VKL S VC F G + V IYGN+ Q V YD+
Sbjct: 422 EFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 144/426 (33%), Positives = 207/426 (48%), Gaps = 48/426 (11%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
GF L H D+ ++ T Q L AL RS R+ ++++ A A +
Sbjct: 30 GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ ++ YL+ + IGTP A+ DTGSDLIWTQC PC C Q +P FDP S+TY+
Sbjct: 84 LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
SL C+S C +L C C Y YGD + + G LA ET T G T V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTN 260
GCG N G + +G+VG G G +SL+SQ+ + +FSYCL PV S ++ FG
Sbjct: 201 GCGNLNAGSL-ANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPS-RLYFGVY 255
Query: 261 GIV-----SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
+ S V STP T Y L + ISVG L + T
Sbjct: 256 ATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGT 315
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 355
+IDSGTT+T+L + + + +S I P+ + T + L+ C+ + + +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374
Query: 356 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
+HF GAD +L N+ V S + ++ I G+ NF V YD+E +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434
Query: 415 FKPTDC 420
F P C
Sbjct: 435 FVPAPC 440
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 127/429 (29%), Positives = 210/429 (48%), Gaps = 68/429 (15%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVS--GPGVVSTPLTK 275
+VGLG G +SL+SQ+ +F+YCL P +S K+ G + + ++ P+ +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272
Query: 276 A---KTFYVLTIDAISVGNQRL----------------------------GVSTPD---- 300
++Y L +D + +G++ + V+ D
Sbjct: 273 DPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRY 332
Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVP 353
++ID +T+TFL L++ + I + L+LC+ +F+ + VP
Sbjct: 333 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVP 391
Query: 354 EVTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
V + F G ++L ++ F + E ++C V + SV I GN Q N V Y++ +
Sbjct: 392 AVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRG 451
Query: 412 TVSFKPTDC 420
V+F + C
Sbjct: 452 RVTFVQSPC 460
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 134/426 (31%), Positives = 195/426 (45%), Gaps = 41/426 (9%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP------ 86
++HR P SP + P D L R++ ++ + ++ Q +P
Sbjct: 22 VMHRHGPCSPLQTPDDAPSDA--DLLEHDQARVDSIHRMIANETAVVGQDVSLPAERGIS 79
Query: 87 -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY++ + +GTP + V DTGSDL W QC PC CY Q PLF P SST+ +
Sbjct: 80 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSA 139
Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA--- 198
+ C +C Q SCS C Y V YGD S + G+L +T+TLG+T +
Sbjct: 140 VRCGEPECPRARQ-SCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENN 198
Query: 199 ---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK- 254
LPG FGCG NN GLF K G+ GLG G +SL SQ FSYCL SS
Sbjct: 199 SNKLPGFVFGCGENNTGLFG-KADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAH 257
Query: 255 --INFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVST------PDIVID 304
++ GT + L ++ T FY + + I V + + VS+ +++D
Sbjct: 258 GYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVD 317
Query: 305 SGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIH 358
SGT +T L P+ Y++ + +S+M + P S L+ CY F N+ +P V +
Sbjct: 318 SGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALV 377
Query: 359 FRGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
F G V S + KV++ + G S I GN Q V YD+ +Q + F
Sbjct: 378 FAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGF 437
Query: 416 KPTDCT 421
C+
Sbjct: 438 AAKGCS 443
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 200/430 (46%), Gaps = 37/430 (8%)
Query: 19 VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS--LNRLNHFNQNSSISS 76
VS ++ F + L+HRD + + RDA+ + + RL+H +++
Sbjct: 62 VSGYKSDNNTFKLNLLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRRLSH-GAPAAVKD 120
Query: 77 SKASQA----DII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
S+ A D+I + Y +RI +G+PP + V D+GSD++W QC+PC S+CY
Sbjct: 121 SRYKVANFATDVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPC--SRCY 178
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDP SS++ + C S C L C+ C+Y VSYGDGS++ G LA ET+T
Sbjct: 179 QQSDPVFDPADSSSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLT 238
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+G + VA+ GCG N G+F ++GLGGG +S I Q+ G FSYCLV
Sbjct: 239 VGQVMIRDVAI-----GCGHTNQGMFIGAAG-LLGLGGGSMSFIGQLGGQTGGAFSYCLV 292
Query: 249 PV---SSTKINFGTNGIVSGPGVVSTPLT-KAKTFYVLTIDAISVGNQRLGV-------- 296
S+ + FG + G +S +A +FY + + I VG R+ V
Sbjct: 293 SRGTGSTGALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLT 352
Query: 297 --STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QV 352
T +V+D+GT +T P ++ P A + CY N +V
Sbjct: 353 EYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRV 412
Query: 353 PEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
P V+ +F G + L NF + V C F + + I GNI Q + +D
Sbjct: 413 PTVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGAN 472
Query: 411 QTVSFKPTDC 420
V F P C
Sbjct: 473 GFVGFGPNIC 482
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 144/432 (33%), Positives = 209/432 (48%), Gaps = 55/432 (12%)
Query: 30 SVELIHRDSPKSPFYNSS---ETP--YQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
SV L HR P +P +S+ + P +RLR R+ + L + +S +
Sbjct: 55 SVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMSEGGGAS--- 111
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP ++ Y++ + IGTP ++ + DTGSDL W QC+PC S CY Q PLFDP
Sbjct: 112 IPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDP 171
Query: 138 KMSSTYKSLPCSSSQCASL----------NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
SST+ ++PC+S C L N S C Y++ YG+G+ + G +TET+
Sbjct: 172 SKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETL 231
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
LGS+ + FGCG++ G ++ K G++GLGG SL+SQ + G FSYCL
Sbjct: 232 ALGSS----AVVKSFRFGCGSDQHGPYD-KFDGLLGLGGAPESLVSQTASVYGGAFSYCL 286
Query: 248 VPVSSTKINFGTNGIV-----SGPGVVSTPLT----KAKTFYVLTIDAISVGNQRLGVST 298
P++S F T G S G V TP+ K TFYV+T+ ISVG + L +
Sbjct: 287 PPLNS-GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIP- 344
Query: 299 PDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF--NSLS 350
P + ++DSGT +T +P L + S + P+ P S L+ CY+F +
Sbjct: 345 PAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTV 404
Query: 351 QVPEVTIHF-RGADVKLS-RSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
VP+V + F GA V L S V ED + G S I GN+ V YD
Sbjct: 405 TVPKVALTFVGGATVDLDVPSGVLV---EDCLAFADAG-DGSFGIIGNVNTRTIEVLYDS 460
Query: 409 EQQTVSFKPTDC 420
+ + F+ C
Sbjct: 461 GKGHLGFRAGAC 472
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 127/429 (29%), Positives = 210/429 (48%), Gaps = 68/429 (15%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVS--GPGVVSTPLTK 275
+VGLG G +SL+SQ+ +F+YCL P +S K+ G + + ++ P+ +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272
Query: 276 A---KTFYVLTIDAISVGNQRL----------------------------GVSTPD---- 300
++Y L +D + +G++ + V+ D
Sbjct: 273 DPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRY 332
Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVP 353
++ID +T+TFL L++ + I + L+LC+ +F+ + VP
Sbjct: 333 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVP 391
Query: 354 EVTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
V + F G ++L ++ F + E ++C V + SV I GN Q N V Y++ +
Sbjct: 392 AVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRG 451
Query: 412 TVSFKPTDC 420
V+F + C
Sbjct: 452 RVTFVQSPC 460
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 127/357 (35%), Positives = 177/357 (49%), Gaps = 35/357 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSS C L+ C+ C Y VSYGDGSF+ G+ +TET+T + VAL G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG +N GLF ++GLG G +S Q KFSYCLV S++ + +V G
Sbjct: 252 CGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASS---KPSSVVFG 307
Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
VS TPL K TFY + + ISVG R+ GV+ ++IDSG
Sbjct: 308 NAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSG 367
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
T++T L + + + A + C+ +++++ VP V +HFR ADV
Sbjct: 368 TSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRRADV 427
Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L +N+ + V + C F G + I GNI Q F V YD+ V F P C
Sbjct: 428 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 128/427 (29%), Positives = 201/427 (47%), Gaps = 59/427 (13%)
Query: 23 EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+ + GG + ++++HRD + +S+ RL L R R+ + S +
Sbjct: 125 DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 181
Query: 81 QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
+ D + + Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q
Sbjct: 182 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 239
Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
P+FDP S+++ + CSSS C L C C+Y VSYGDGS++ G LA ET+T G
Sbjct: 240 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 299
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
T ++VA+ GCG N G+F ++GLGGG +S + Q+ G FSYCLV +
Sbjct: 300 TMVRSVAI-----GCGHRNRGMFVGAAG-LLGLGGGSMSFVGQLGGQTGGAFSYCLVSAA 353
Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP----------DI 301
+ V P +A +FY + + + VG R+ +S +
Sbjct: 354 WVPL-------------VRNP--RAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGV 398
Query: 302 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEV 355
V+D+GT +T LP Q + L+ +++ A VA + CY +VP V
Sbjct: 399 VMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVSVRVPTV 454
Query: 356 TIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+ +F G + L NF + + + C F T+ + I GNI Q + +D V
Sbjct: 455 SFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYV 514
Query: 414 SFKPTDC 420
F P C
Sbjct: 515 GFGPNIC 521
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 126/355 (35%), Positives = 171/355 (48%), Gaps = 33/355 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP V DTGSD W QC+PC + CY Q PLFDP S+TY ++
Sbjct: 92 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 150
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSSS C+ L CSG +C Y + YGDGS++ G A +T+TL T + FGC
Sbjct: 151 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 205
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF + G++GLG G SL Q G F+YCL P +S GT + GP
Sbjct: 206 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSA----GTGFLDLGP 259
Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
G + TP+ + TFY + + I VG L + ST ++DSGT +T LP
Sbjct: 260 GAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPP 319
Query: 315 GYNSNLLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVK 365
+ L S S ++ A P S L+ CY +P V++ F+G DV
Sbjct: 320 SAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVD 379
Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S + VS+ + V I GN Q V YDI ++ V F P C
Sbjct: 380 ASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 144/452 (31%), Positives = 223/452 (49%), Gaps = 78/452 (17%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
G +EL H D+ + F S R+R A RS R+N + ++ ++D
Sbjct: 29 GIRLELTHVDA-RGDFTGS-----DRVRRAADRSHRRVNGLLAAAPPPAASTLRSDGGGG 82
Query: 84 ----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDS 132
+ + A YL+ +IGTPP AV DTGSDLIWTQC+ PC +C+ Q +
Sbjct: 83 GACAATAAASVHASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPC--RRCFPQPA 140
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-------------NQKSCSGVNCQYSVSYGDGSFSN 179
PL+ P S TY ++ C S C +L + + C Y SYGDGS ++
Sbjct: 141 PLYAPARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTD 200
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTT 238
G LATET T G+ T + + FGCGT+N GG NS +G+VG+G G +SL+SQ+ T
Sbjct: 201 GVLATETFTFGAGT----TVHDLAFGCGTDNLGGTDNS--SGLVGMGRGPLSLVSQLGVT 254
Query: 239 IAGKFSYCLVP----VSSTKINFGTNGIVSGPGVVSTPLT------KAKTFYVLTIDAIS 288
KFSYC P +S+ + G++ +S P STP + ++Y L+++ I+
Sbjct: 255 ---KFSYCFTPFNDTTTSSPLFLGSSASLS-PAAKSTPFVPSPSGPRRSSYYYLSLEGIT 310
Query: 289 VGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
VG+ L + ++IDSGTT T L + L +++ + +
Sbjct: 311 VGDTLLPIDPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHL 370
Query: 339 SLELCYSF-----NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF-KGITNS-- 390
L +C++ VP + +HF GAD++L RS+ V ED V V GI ++
Sbjct: 371 GLSVCFAAPQGRGPEAVDVPRLVLHFDGADMELPRSS---AVVEDRVAGVACLGIVSARG 427
Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ + G++ Q N V YD+ + +SF+P +C +
Sbjct: 428 MSVLGSMQQQNMHVRYDVGRDVLSFEPANCGE 459
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 126/355 (35%), Positives = 171/355 (48%), Gaps = 33/355 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP V DTGSD W QC+PC + CY Q PLFDP S+TY ++
Sbjct: 157 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 215
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSSS C+ L CSG +C Y + YGDGS++ G A +T+TL T + FGC
Sbjct: 216 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 270
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N GLF + G++GLG G SL Q G F+YCL P +S GT + GP
Sbjct: 271 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSA----GTGFLDLGP 324
Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
G + TP+ + TFY + + I VG L + ST ++DSGT +T LP
Sbjct: 325 GAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPP 384
Query: 315 GYNSNLLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVK 365
+ L S S ++ A P S L+ CY +P V++ F+G DV
Sbjct: 385 SAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVD 444
Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S + VS+ + V I GN Q V YDI ++ V F P C
Sbjct: 445 ASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 166/324 (51%), Gaps = 17/324 (5%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DTGSD+ W QC+PCP QCY Q LF P S+TYK LPC+S+ C L SC +C
Sbjct: 6 DTGSDITWIQCDPCP--QCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSSC 63
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
Y VSYGD S + G+ A ET+TL S V++P FGCG N GLFN G++GLG
Sbjct: 64 NYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNG-AAGLMGLGK 122
Query: 227 GDISLISQMRTTIAGKFSYCLVPVSST----KINFGTNGIVSGPGVVSTPLTKAK---TF 279
I +Q FSYCL VSST ++FG ++ V TPL + +
Sbjct: 123 SSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDY-DVRFTPLVDSSSGPSQ 181
Query: 280 YVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
Y +++ I+VG++ L +S +++DSGT ++ Q L + ++ A
Sbjct: 182 YFVSMTGINVGDELLPISA-TVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAP 240
Query: 340 LELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 396
+ C+ +++ +P +T+HFR A+++LS + V + ++C F ++ + GN
Sbjct: 241 FDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFAPSSSGRSVLGN 300
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
Q N YDI + + +C
Sbjct: 301 FQQQNLRFVYDIPKSRLGISAFEC 324
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 129/434 (29%), Positives = 203/434 (46%), Gaps = 56/434 (12%)
Query: 29 FSVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
V + HRD+ P P QRL R + ++ + S S IP
Sbjct: 27 LHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IP 80
Query: 87 -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ Y + +GTP T+ + V DTGSDL+W QC PC +CY Q +FDP+ SSTY+
Sbjct: 81 FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRR 138
Query: 146 LPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
+PCSS QC +L C +G C+Y V+YGDGS S G+LAT+ + + T +
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVN 194
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS--STKINFG 258
+T GCG +N GLF+S G++G+G G IS+ +Q+ F YCL + ST+ ++
Sbjct: 195 NVTLGCGRDNEGLFDS-AAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253
Query: 259 TNGIVSGP------GVVSTPLTKAKTFYVLTIDAISVGNQR--------LGVSTP----D 300
G P ++S P + + Y + + SVG +R L + T
Sbjct: 254 VFGRTPEPPSTAFTALLSNP--RRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGG 311
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 355
+V+DSGT ++ + + L + A + G + CY + P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371
Query: 356 TIHFR-GADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
+HF GAD+ L N+F+ + + C F+ + + + GN+ Q F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431
Query: 408 IEQQTVSFKPTDCT 421
+E++ + F P CT
Sbjct: 432 VEKERIGFAPKGCT 445
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/412 (31%), Positives = 201/412 (48%), Gaps = 49/412 (11%)
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------------------NYLIRISI 97
D+ + R++ ++ +++S S A++ D P A YL+ + +
Sbjct: 96 DSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYL 155
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---- 153
GTPP + DTGSDL W QC PC C+ Q P+FDP S +Y+++ C +C
Sbjct: 156 GTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQSGPIFDPAASISYRNVTCGDDRCRLVS 213
Query: 154 --ASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
A + C C Y YGD S + G+LA E T+ T + G+ FGCG
Sbjct: 214 PPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGH 273
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP---VSSTKINFG-TNGIV 263
N GLF+ ++GLG G +S SQ+R G FSYCLV + +KI FG + ++
Sbjct: 274 RNRGLFHGAAG-LLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALL 332
Query: 264 SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQ- 314
+ P + T P T A TFY L + +I VG + + +S+ + +IDSGTTL++ P+
Sbjct: 333 AHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSGTTLSYFPEP 392
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 371
Y + + + M + P+ L CY+ + +VPE+++ F GA + N+
Sbjct: 393 AYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENY 452
Query: 372 FVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
F+++ E I+C G S + I GN Q NF V YD+E + F P C
Sbjct: 453 FIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRCA 504
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 147/448 (32%), Positives = 215/448 (47%), Gaps = 80/448 (17%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA- 89
++++HRDS S +++ + L++ L R R++ N +++ S+A++ P N
Sbjct: 70 LQVVHRDSLSSS--SNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGS 127
Query: 90 ------------------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Y R+ +GTPP V DTGSD++W QC PC +
Sbjct: 128 SIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPC--A 185
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
+CY Q PLF+P SSTY+ +PC++ C L+ C C+Y VSYGDGSF+ G+ +T
Sbjct: 186 KCYGQTDPLFNPAASSTYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFST 245
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
ET+T + VAL GCG +N GLF ++GLG G +S SQ + +FS
Sbjct: 246 ETLTFRGQVIRRVAL-----GCGHDNEGLFIGAAG-LLGLGRGSLSFPSQTGAQFSKRFS 299
Query: 245 YCLVPVS----STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS 297
YCLV S ++ + FG I + TPL K TFY + + ISVG +RL S
Sbjct: 300 YCLVDRSASGTASSLIFGKAAIPK--SAIFTPLLSNPKLDTFYYVELVGISVGGRRL-TS 356
Query: 298 TP------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL----- 340
P ++IDSGT++T L S S+M +A V TG+L
Sbjct: 357 IPASVFRMDATGNGGVIIDSGTSVTRLVD-------SAYSTMRDAFRVG--TGNLKSAGG 407
Query: 341 ----ELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVP 392
+ CY + L +VP + HF+ GA + L +N+ + V S C F G T +
Sbjct: 408 FSLFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLS 467
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GNI Q + V +D V FK C
Sbjct: 468 IIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 140/445 (31%), Positives = 213/445 (47%), Gaps = 60/445 (13%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------ISSSKASQ 81
V L+HRDS + +E +RL+ R+ ++ N + +S+ +
Sbjct: 70 MHVRLLHRDS-FAVNATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLV 128
Query: 82 ADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A ++ P + +Y+ +I++GTP E L DT SDL W QC+PC +CY Q P+FDP+
Sbjct: 129 APVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPR 186
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG------SFSNGNLATETVTL 189
S++Y + + C +L + C Y+V YGDG S S G+L ET+T
Sbjct: 187 HSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTF 246
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGKFSYCLV 248
QA ++ GCG +N GLF + GI+GL G IS+ Q+ FSYCLV
Sbjct: 247 AGGVRQAY----LSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLV 302
Query: 249 -----PVS-STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVST 298
P S S+ + FG + + P TP TFY + + +SVG R+ GV+
Sbjct: 303 DFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTE 362
Query: 299 PD-----------IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCY 344
D +++DSGTT+T L + Y + + ++ V+ P+G + CY
Sbjct: 363 RDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCY 422
Query: 345 SFNSLS------QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYG 395
+ + +VP V++HF G ++ L N+ + V S VC F G + SV + G
Sbjct: 423 TVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIG 482
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
NI+Q F V YDI Q V F P C
Sbjct: 483 NILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 137/437 (31%), Positives = 211/437 (48%), Gaps = 46/437 (10%)
Query: 28 GFSVELIHR------DSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
G +++++HR D P ++ +R R + RS+ R + ++ +++ ++
Sbjct: 54 GSTLQIVHRACLQTGDDIAVPDHHHYTGILRRDRHRV-RSIYRRLTAAETTTTTTTIPAR 112
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ + Y++ I IGTPP + DTGSDL W QC PCP S CY Q PLFDP SS
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSS 172
Query: 142 TYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
TY +PCS+ +C + Q C +C+YSV YGD S ++G+LA ET TL + A A
Sbjct: 173 TYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAA 232
Query: 200 PGITFGCGTNNGGLFNSK---TTGIVGLGGGDISLISQMRTTI---AGKFSYCLVPVSST 253
G+ FGC +FN G++GLG GD S++SQ R +I G FSYCL P S+
Sbjct: 233 TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSS 292
Query: 254 KINFGTNGIVSGP-----GVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTPDI--- 301
G + P + TPL ++ ++ YV+ + +SV + +
Sbjct: 293 TGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLG 352
Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLSQV--PEV 355
VIDSGT +T +P L + + + P GS++L CY V P V
Sbjct: 353 AVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKML-PEGSMKLLDTCYDVTGQDVVTAPRV 411
Query: 356 TIHF-RGADVKLSRSNFFVKV-SED-------IVCSVFKGITNS--VPIYGNIMQTNFLV 404
+ F GA + + S + + +ED + C F TNS + I GN+ Q + V
Sbjct: 412 ALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFL-PTNSAGLVIVGNMQQRAYNV 470
Query: 405 GYDIEQQTVSFKPTDCT 421
+D++ + F P C+
Sbjct: 471 VFDVDGGRIGFGPNGCS 487
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 140/465 (30%), Positives = 222/465 (47%), Gaps = 67/465 (14%)
Query: 8 VFILFFLCFYVVSPIEAQT----GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
V +L Y P+ + V L H D+ K + SE +R A+ RS
Sbjct: 7 VLVLAIASLYYACPVASAAFVGDDDVRVALKHVDAGKQ--LSRSEL----IRRAMQRSKA 60
Query: 64 RLNHFN--QNSSISSSKASQAD-----------IIPN-NANYLIRISIGTPPTERLAVAD 109
R + +N + S+ + + D + P+ + Y++ ++IGTPP A+ D
Sbjct: 61 RAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSALLD 120
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQY 168
TGSDLIWTQC PC + C Q PLF P S++Y+ + C+ C+ + C + C Y
Sbjct: 121 TGSDLIWTQCAPC--ASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTY 178
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
+YGDG+ + G ATE T S+ G + + FGCG+ N G N+ +GIVG G
Sbjct: 179 RYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNG-SGIVGFGRNP 237
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTK--------INFGTNGIVSGPGVVSTPLTKA---K 277
+SL+SQ+ +FSYCL S + ++ G G +GP V +TPL ++
Sbjct: 238 LSLVSQLSIR---RFSYCLTSYGSGRKSTLLFGSLSGGVYGDATGP-VQTTPLLQSLQNP 293
Query: 278 TFYVLTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSM 327
TFY + + ++VG +RL + PD +++DSGT LT LP + ++
Sbjct: 294 TFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQ 353
Query: 328 IEAQPVADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRSNFFV-KV 375
+ P A+ G+ E +C+ S S VP + HF+ AD+ L R N+ +
Sbjct: 354 LRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDH 411
Query: 376 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ +C + + GN++Q + V YD+E +T+SF P C
Sbjct: 412 RKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 146/443 (32%), Positives = 204/443 (46%), Gaps = 57/443 (12%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK--- 78
+ T SV L H D+ S F ++S +LR L R R+ +++S+ +
Sbjct: 57 VSESTTSLSVHLSHVDALSS-FSDASPVDLFKLR--LQRDSLRVKSITSLAAVSTGRNAT 113
Query: 79 ------------ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
A + + + Y +R+ +GTP T V DTGSD++W QC PC
Sbjct: 114 KRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KA 171
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNL 182
CY Q +FDPK S T+ ++PC S C L+ S C C Y VSYGDGSF+ G+
Sbjct: 172 CYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDF 231
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
+TET+T + + GCG +N GLF ++GLG G +S SQ ++ GK
Sbjct: 232 STETLTF-----HGARVDHVPLGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKSRYNGK 285
Query: 243 FSYCLVPVSSTK--------INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGN 291
FSYCLV +S+ I FG + + V TPL K TFY L + ISVG
Sbjct: 286 FSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTS--VFTPLLTNPKLDTFYYLQLLGISVGG 343
Query: 292 QRL-GVSTPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
R+ GVS ++IDSGT++T L Q L A
Sbjct: 344 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSLF 403
Query: 341 ELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNI 397
+ C+ + ++ +VP V HF G +V L SN+ + V +E C F G S+ I GNI
Sbjct: 404 DTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNI 463
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q F V YD+ V F C
Sbjct: 464 QQQGFRVAYDLVGSRVGFLSRAC 486
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 129/395 (32%), Positives = 203/395 (51%), Gaps = 61/395 (15%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
L H++ S+ SS A + A YL+ ++IGTPP +A+ADTGSDL WTQC+PC
Sbjct: 59 LLHYSTLST--SSDPGPARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPC-- 114
Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNL 182
C+ QD+P++D SS++ LPCSS+ C + CS C+Y +Y DG++
Sbjct: 115 KLCFGQDTPIYDTTTSSSFSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAY----- 169
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRTTIAG 241
S +++ GI FGCG +NGGL +NS TG VGLG G +SL++Q+ G
Sbjct: 170 --------SPECAGISVGGIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQLG---VG 216
Query: 242 KFSYCLVPVSSTKIN----FGTNGIVSGPG-------VVSTPLTKA---KTFYVLTIDAI 287
KFSYCL +T ++ FG+ ++ V STPL ++ + Y ++++ I
Sbjct: 217 KFSYCLTDFFNTSLSSPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGI 276
Query: 288 SVGNQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
S+G+ RL + + +++DSGT T L + G+ + V + QPV +
Sbjct: 277 SLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAG--VLGQPVVN 334
Query: 336 PTGSLELCY-----SFNSLSQVPEVTIHFR-GADVKLSRSNF--FVKVSEDIVCSVFKGI 387
+ C+ L +P++ +HF GAD++L R N+ F + ++
Sbjct: 335 ASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTE 394
Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ S + GN Q N + +DI +SF PTDC+K
Sbjct: 395 SASGSVLGNFQQQNIQMLFDITVGQLSFMPTDCSK 429
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 131/371 (35%), Positives = 188/371 (50%), Gaps = 38/371 (10%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A++ ++ ++ YL+ + IGTP A+ DTGSDLIWTQC PC C Q +P FDP
Sbjct: 80 AARILVLASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPA 137
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SSTY+SL CS+ C +L C C Y YGD + + G LA ET T G T V
Sbjct: 138 NSSTYRSLGCSAPACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFG-TNDTRVT 196
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTK 254
LP I+FGCG N G + +G+VG G G +SL+SQ+ + +FSYCL PV S +
Sbjct: 197 LPRISFGCGNLNAGSL-ANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVRS-R 251
Query: 255 INFGTNGIV---SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------- 297
+ FG + + V STP T Y L + ISVG RL +
Sbjct: 252 LYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG 311
Query: 298 TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--- 351
T +IDSGTT+T+L + Y + + + + P+ D T + L+ C+ + +
Sbjct: 312 TGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSV 371
Query: 352 -VPEVTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
+P++ +HF GAD +L N+ V S +C + ++ I G+ NF V YD+E
Sbjct: 372 TLPQLVLHFDGADWELPLQNYMLVDPSTGGLC-LAMATSSDGSIIGSYQHQNFNVLYDLE 430
Query: 410 QQTVSFKPTDC 420
+SF P C
Sbjct: 431 NSLLSFVPAPC 441
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 139/435 (31%), Positives = 199/435 (45%), Gaps = 53/435 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
+SVE++HRD+ ++ Y+R R+A L R + R N++
Sbjct: 74 WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133
Query: 78 KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
++ D + + Y RI +GTP E+ V DTGSD+ W QCEPC +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P S+++ ++ C S+ C+ L+ C C Y SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T G+T+ VA+ GCG N GLF ++GLG G +S +Q+ T FSYCL
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTGHTFSYCL 305
Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI 301
V SS + FG + G + TPL K TFY L++ AISVG L P++
Sbjct: 306 VDRESDSSGPLQFGPKSVPVGS--IFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEV 363
Query: 302 ------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
+IDSGT +T L + + P D + CY + L
Sbjct: 364 FRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFDTCYDLSGL 423
Query: 350 S--QVPEVTIHF-RGADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVG 405
VP V HF GA + L N+ + + C F +SV I GN Q + V
Sbjct: 424 QFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVS 483
Query: 406 YDIEQQTVSFKPTDC 420
+D V F C
Sbjct: 484 FDSANSLVGFAFDQC 498
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 183/367 (49%), Gaps = 41/367 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTPP + + D+GSDL+W QC PC QCY QDSPL+ P SST+ +P
Sbjct: 61 SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPC--RQCYAQDSPLYVPSNSSTFSPVP 118
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C SS C + + C Y Y D S S G A E+ T+ V +
Sbjct: 119 CLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV-----DGVRIDK 173
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS-STKIN 256
+ FGCG++N G F + G++GLG G +S SQ+ KF+YCLV P S S+ +
Sbjct: 174 VAFGCGSDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSSLI 232
Query: 257 FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----DI------VI 303
FG I + + TP+ K+ T Y + I+ ++VG + L +S D+ +
Sbjct: 233 FGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIF 292
Query: 304 DSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF- 359
DSGTTLT+ P Y S++L+ S + P A+ L+LC + Q P TI F
Sbjct: 293 DSGTTLTYWFPSAY-SHILAAFDSGVH-YPRAESVQGLDLCVELTGVDQPSFPSFTIEFD 350
Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVSFK 416
GA + N+FV V+ ++ C G+ + + + GN++Q NF V YD E+ + F
Sbjct: 351 DGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLIGFA 410
Query: 417 PTDCTKQ 423
P C+
Sbjct: 411 PAKCSSH 417
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 136/417 (32%), Positives = 204/417 (48%), Gaps = 36/417 (8%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ---NSSISSSKASQAD 83
G S++L+HR P +P + +S P + L R R++ Q + +++SS
Sbjct: 59 GSSSLKLVHRFGPCNP-HRTSTAPASSFNEILRRDKLRVDSIIQARRSMNLTSSVEHMKS 117
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+P ++Y++ + IGTP E + DTGS LIWTQC+PC CY + P+FD
Sbjct: 118 SVPFYGLSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPC--KACYPK-VPVFD 174
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P S+++K LPCSS C S+ Q CS C Y +Y D S S G LATET++ +
Sbjct: 175 PTKSASFKGLPCSSKLCQSIRQ-GCSSPKCTYLTAYVDNSSSTGTLATETISF---SHLK 230
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTK 254
I GC G + +GI+GL ISL SQ FSYC+ P S+
Sbjct: 231 YDFKNILIGCSDQVSGE-SLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTGH 289
Query: 255 INFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDI----VIDSGTT 308
+ FG G V V +P++K + Y + + ISVG ++L + IDSG
Sbjct: 290 LTFG--GKVPN-DVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAV 346
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---D 363
LT LP S L SV M++ P+ D L+ CY F++ S V P +++ F G D
Sbjct: 347 LTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMD 406
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ +S + V S+ + C F + + V I+GN Q + V +D ++ + F P C
Sbjct: 407 IDVSGIMWQVPGSK-VYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 129/364 (35%), Positives = 183/364 (50%), Gaps = 37/364 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +Y ++
Sbjct: 138 GSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYDQSGQVFDPRRSRSYGAV 195
Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CS+ C L+ C C Y V+YGDGS + G+ ATET+T G VA I
Sbjct: 196 GCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR--IAL 251
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVS-STKIN 256
GCG +N GLF + ++GLG G +S +Q+ FSYCLV P S S+ +
Sbjct: 252 GCGHDNEGLFVAAAG-LLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVT 310
Query: 257 FGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
FG+ + S TP+ K +TFY + + ISVG R+ GV+ D +
Sbjct: 311 FGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGV 370
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPEVTIH 358
++DSGT++T L + S L + ++ SL + CY S + +VP V++H
Sbjct: 371 IVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMH 430
Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
F GA+ L N+ + V S+ C F G V I GNI Q F V +D + Q V F
Sbjct: 431 FAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFV 490
Query: 417 PTDC 420
P C
Sbjct: 491 PKGC 494
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 179/365 (49%), Gaps = 38/365 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200
Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C++ C L+ C C Y V+YGDGS + G+ ATET+T S +P +
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---------VSSTKI 255
GCG +N GLF + ++GLG G +S SQ+ FSYCLV S+ +
Sbjct: 257 GCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTV 315
Query: 256 NFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GVSTPD----------- 300
FG+ + TP+ K +TFY + + ISVG R+ GV+ D
Sbjct: 316 TFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGG 375
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLS--QVPEVTI 357
+++DSGT++T L + + L + ++ SL + CY + L +VP V++
Sbjct: 376 VIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSM 435
Query: 358 HFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
HF GA+ L N+ + V S C F G V I GNI Q F V +D + Q + F
Sbjct: 436 HFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGF 495
Query: 416 KPTDC 420
P C
Sbjct: 496 VPKGC 500
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 133/416 (31%), Positives = 209/416 (50%), Gaps = 53/416 (12%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-----------IIPNNANYLIRISI 97
T Q L + L R R+ + ++ K +A ++ + Y +R+ +
Sbjct: 1 THEQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGL 60
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GTP V DTGSDL W QC+PC CY Q P+FDP+ SS+++ +PC S C +L
Sbjct: 61 GTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIFDPRNSSSFQRIPCLSPLCKALE 118
Query: 158 QKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
SCSG C Y V+YGDGSFS G+ +++ TLG T +A++ + FGCG +N G
Sbjct: 119 VHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSKAMS---VAFGCGFDNEG 174
Query: 213 LFNSKTTGIVGLGGGDISLISQM-----RTTIAGKFSYCLV----PV--SSTKINFGTNG 261
L + G++GLG G +S SQ+ ++ A FSYCLV P+ SS+ + FG
Sbjct: 175 L-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAA 233
Query: 262 IVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD----------IVIDSGTT 308
I S + +PL K TFY + +SVG +L +S ++IDSGT+
Sbjct: 234 IPSTAAL--SPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTS 291
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVK 365
+T P + + + P A + CY+F+ + VP + +HF GAD++
Sbjct: 292 VTRFPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQ 351
Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L +N+ + + + C F + + I GNI Q +F +G+D+++ ++F P C
Sbjct: 352 LPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 123/355 (34%), Positives = 178/355 (50%), Gaps = 38/355 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG P + V DTGSD+ W QC PC + CY Q P+F+P S++Y L
Sbjct: 141 SGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPC--ADCYHQADPIFEPASSTSYSPLS 198
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C C Y VSYGDGS++ G+ TET+TLGS + VA+ GCG
Sbjct: 199 CDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDNVAI-----GCG 253
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
NN GLF ++GLGGG +S SQ+ A FSYCLV S++ + F + +
Sbjct: 254 HNNEGLFIGAAG-LLGLGGGKLSFPSQIN---ASSFSYCLVDRDSDSASTLEFNSALL-- 307
Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTL 309
P ++ PL + + TFY + + +SVG + L + P+ I+IDSGT +
Sbjct: 308 -PHAITAPLLRNRELDTFYYVGMTGLSVGGELLSI--PESMFEMDESGNGGIIIDSGTAV 364
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADV-KL 366
T L + L + PV + CY + + +VP VT H G V L
Sbjct: 365 TRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPL 424
Query: 367 SRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+N+ + V D C F ++++ I GN+ Q VG+D+ V F+P C
Sbjct: 425 PATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 129/426 (30%), Positives = 201/426 (47%), Gaps = 40/426 (9%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL--NHFNQNSSISSSKASQADII 85
G +L H DS + + +E + + + R+ +L + +++ AS + ++
Sbjct: 30 GLRADLTHIDSGRG--FTRNELLRRMVLRSRARAAKQLCPSRSGTPVRVTAPVASGSHVV 87
Query: 86 PNNANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
YLI IGTP +++A+ DTGSD++WTQC PC C+ Q P FD S T
Sbjct: 88 -GYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPC--FDCFTQPLPRFDTSASDTVH 144
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
+ C+ C +L +C C Y V+YGD S + G LA ++ T G V +P + F
Sbjct: 145 GVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVF 204
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG--- 258
GCG N G F+S TGI G G G +SL Q+ + FSYC + ST + G
Sbjct: 205 GCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVS---SFSYCFTTIFESKSTPVFLGGAP 261
Query: 259 TNGI---VSGPGVVSTP-LTKAKTFYVLTIDAISVGNQRLGVSTPDIV----------ID 304
+G+ +GP ++STP L +Y L++ I+VG RL V V ID
Sbjct: 262 ADGLRAHATGP-ILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIID 320
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLEL-CYSFNSLSQ-----VPEVTI 357
SGT +T P+ +L + + + + TG L C+S S+ VP++T+
Sbjct: 321 SGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTL 380
Query: 358 HFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
H GAD +L R N+ + + D +C V + + GN Q N + +D+ + +
Sbjct: 381 HLEGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIE 440
Query: 417 PTDCTK 422
P C K
Sbjct: 441 PAQCDK 446
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 177/371 (47%), Gaps = 49/371 (13%)
Query: 90 NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY+ IS+G +P + DTGSDL W QC+PC S CY Q PLFDP S+TY +
Sbjct: 143 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 200
Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ C++S CA S C Y+++YGDGSFS G LAT+TV LG +
Sbjct: 201 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS- 259
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
L G FGCG +N GLF T G++GLG ++SL+SQ + G FSYCL P +++
Sbjct: 260 ----LGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGGVFSYCL-PAATSG 313
Query: 255 INFGTNGIVSGPGVVS-----TPLTKAKT--------FYVLTIDAISVGNQRL---GVST 298
G+ + G S TP+ + FY L + +VG L G+
Sbjct: 314 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGA 373
Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLSQ--VPE 354
+++IDSGT +T L P Y + M A A P S L+ CY + VP
Sbjct: 374 SNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPL 433
Query: 355 VTIHFR-GADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIE 409
+T+ GADV + + V +D VC ++ + PI GN Q N V YD
Sbjct: 434 LTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTL 493
Query: 410 QQTVSFKPTDC 420
+ F DC
Sbjct: 494 GSRLGFADEDC 504
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 128/434 (29%), Positives = 201/434 (46%), Gaps = 56/434 (12%)
Query: 29 FSVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
V + HRD+ P P QRL R + ++ + S S IP
Sbjct: 27 LHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IP 80
Query: 87 -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ Y + +GTP T+ + V DTGSDL+W QC PC +CY Q +FDP+ SSTY+
Sbjct: 81 FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRR 138
Query: 146 LPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
+PCSS QC +L C +G C+Y V+YGDGS S G LAT+ + + T +
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDT----YVN 194
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS--STKINFG 258
+T GCG +N GLF+S G++G+ G IS+ +Q+ F YCL + ST+ ++
Sbjct: 195 NVTLGCGRDNEGLFDS-AAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253
Query: 259 TNGIVSGP------GVVSTPLTKAKTFYVLTIDAISVGNQR--------LGVSTP----D 300
G P ++S P + + Y + + SVG +R L + T
Sbjct: 254 VFGRTPEPPSTAFTALLSNP--RRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGG 311
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 355
+V+DSGT ++ + + L + A + G + CY + P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371
Query: 356 TIHFR-GADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
+HF GAD+ L N+F+ + + C F+ + + + GN+ Q F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431
Query: 408 IEQQTVSFKPTDCT 421
+E++ + F P CT
Sbjct: 432 VEKERIGFAPKGCT 445
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 138/416 (33%), Positives = 197/416 (47%), Gaps = 61/416 (14%)
Query: 54 LRDALTRSLNRLNHF-----NQNSSISSSKASQADIIPNNA------NYLIRISIG---- 98
LR L +R N F N ++ +S+++ A++ + NY+ I++G
Sbjct: 137 LRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSS 196
Query: 99 -TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASL 156
+P + DTGSDL W QC+PC S CY Q PLFDP S+TY ++ C++S C ASL
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAAVRCNASACAASL 254
Query: 157 NQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
SC G N C Y+++YGDGSFS G LAT+TV LG + L G FGCG +
Sbjct: 255 KAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS-----LDGFVFGCGLS 309
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV 269
N GLF T G++GLG ++SL+SQ G FSYCL +S +G +S G
Sbjct: 310 NRGLFGG-TAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGD----ASGSLSLGGDA 364
Query: 270 S-----TPLTKAKT--------FYVLTIDAISVGNQRL---GVSTPDIVIDSGTTLTFLP 313
S TP+ + FY L + +VG L G+ +++IDSGT +T L
Sbjct: 365 SSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLA 424
Query: 314 QGYNSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSR 368
+ + + A P A L+ CY + VP +T+ GA+V +
Sbjct: 425 PSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDA 484
Query: 369 SNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ V +D VC ++ + PI GN Q N V YD + F DC
Sbjct: 485 AGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 136/426 (31%), Positives = 199/426 (46%), Gaps = 59/426 (13%)
Query: 39 PKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP----------- 86
P+ Y Y+ L L R R N ++ S++D+ P
Sbjct: 88 PRETIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLS 147
Query: 87 ---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ Y R+ +G P + V DTGSD+ W QC+PC + CY Q P+FDP
Sbjct: 148 TPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDP 205
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
SSTY + C S QC+SL SC C Y V+YGDGS++ G+ ATE+V+ G++
Sbjct: 206 TASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG---- 261
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PVSSTK 254
++ + GCG +N GLF G++GLGGG +SL +Q++ T FSYCLV S+
Sbjct: 262 SVKNVALGCGHDNEGLF-VGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSST 317
Query: 255 INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD----------- 300
++F N G V+ PL K + TFY + + +SVG Q VS P+
Sbjct: 318 LDF--NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQM--VSIPESTFRLDESGNG 373
Query: 301 -IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
I++D GT +T L Q YN L M + + + CY + + +VP V+
Sbjct: 374 GIIVDCGTAITRLQTQAYNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVS 432
Query: 357 IHF-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
HF G L +N+ + V S C F T+S+ I GN+ Q V +D+ +
Sbjct: 433 FHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMG 492
Query: 415 FKPTDC 420
F P C
Sbjct: 493 FSPNKC 498
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 125/378 (33%), Positives = 187/378 (49%), Gaps = 44/378 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
YL+ ++ GTPP E L +ADTGSDLIW QC PP+ C + P F S+T
Sbjct: 53 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 112
Query: 145 SLPCSSSQCASL-----NQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+PCS++QC + + SCS V C Y+ Y DGS + G LA +T T+ + T
Sbjct: 113 VVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 172
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
A+ G+ FGCGT N G S T G++GLG G +S +Q + A FSYCL+ + +
Sbjct: 173 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 232
Query: 257 FGTNGIVSG-----PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI------- 301
++ + G TPL A TFY + + AI VGN+ L V +
Sbjct: 233 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 292
Query: 302 ---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV--- 352
VIDSG+TLT+L G +L+S ++ + + LELCY+ +S S +
Sbjct: 293 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSLAPA 352
Query: 353 ----PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVG 405
P +TI F +G ++L N+ V V++D+ C + + + + GN+MQ + V
Sbjct: 353 NGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVE 412
Query: 406 YDIEQQTVSFKPTDCTKQ 423
+D + F T+C
Sbjct: 413 FDRASARIGFARTECVAH 430
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 133/361 (36%), Positives = 176/361 (48%), Gaps = 37/361 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ +GTP T V DTGSD++W QC PC CY Q P+F+P S T+ ++P
Sbjct: 133 SGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPC--KVCYNQSDPVFNPAKSKTFATVP 190
Query: 148 CSSSQCASLNQKS-CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S C L+ S C C Y VSYGDGSF+ G+ +TET+T VAL
Sbjct: 191 CGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVAL---- 246
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--------I 255
GCG +N GLF ++GLG G +S SQ + GKFSYCLV +S+ I
Sbjct: 247 -GCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTI 304
Query: 256 NFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL-GVSTPD----------IV 302
FG NG V V + LT K TFY L + ISVG R+ GVS ++
Sbjct: 305 VFG-NGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVI 363
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR 360
IDSGT++T L Q L A + C+ + ++ +VP V HF
Sbjct: 364 IDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFT 423
Query: 361 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
G +V L SN+ + V ++ C F G S+ I GNI Q F V YD+ V F
Sbjct: 424 GGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRA 483
Query: 420 C 420
C
Sbjct: 484 C 484
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 131/361 (36%), Positives = 174/361 (48%), Gaps = 37/361 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ +GTP T V DTGSD++W QC PC CY Q +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189
Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S C L+ S C C Y VSYGDGSF+ G+ +TET+T + +
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--------I 255
GCG +N GLF ++GLG G +S SQ + GKFSYCLV +S+ I
Sbjct: 245 LGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTI 303
Query: 256 NFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL-GVSTPD----------IV 302
FG N V V + LT K TFY L + ISVG R+ GVS ++
Sbjct: 304 VFG-NAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVI 362
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR 360
IDSGT++T L Q L A + C+ + ++ +VP V HF
Sbjct: 363 IDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFG 422
Query: 361 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
G +V L SN+ + V +E C F G S+ I GNI Q F V YD+ V F
Sbjct: 423 GGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRA 482
Query: 420 C 420
C
Sbjct: 483 C 483
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 140/435 (32%), Positives = 202/435 (46%), Gaps = 62/435 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
FS++L RDS +N+ Y+ L L+R +R+ + S+ ++D+ P
Sbjct: 76 FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131
Query: 87 -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ Y R+ +G P V DTGSD+ W QC+PC + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+FDP+ SS++ SLPC S QC +L C C Y VSYGDGSF+ G TET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETL 249
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T G++ + + GCG +N GLF + GG +SL SQM+ A FSYCL
Sbjct: 250 TFGNSG----MINDVAVGCGHDNEGLFVGSAGLLGLGGGP-LSLTSQMK---ASSFSYCL 301
Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD---- 300
V S+ + + V+ PL K+ TFY + + +SVG Q L + P+
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSI-PPNLFQM 360
Query: 301 -------IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSL 349
I++DSGT +T L Q YN ++ + + P T L CY +S
Sbjct: 361 DDSGYGGIIVDSGTAITRLQTQAYN----TLRDAFVSRTPYLKKTNGFALFDTCYDLSSQ 416
Query: 350 SQV--PEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
S+V P V+ F G ++L N+ + V S C F T+S+ I GN+ Q V
Sbjct: 417 SRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVH 476
Query: 406 YDIEQQTVSFKPTDC 420
YD+ V F P C
Sbjct: 477 YDLANSVVGFSPHKC 491
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 134/428 (31%), Positives = 201/428 (46%), Gaps = 51/428 (11%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-------- 83
L+HRD ++ + T + L L R R + + ++
Sbjct: 77 RLVHRDD-----FSVNATAAELLAYRLERDAKRAARLSAAAGPANGTRRGGGGVVAPVVS 131
Query: 84 -IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +
Sbjct: 132 GLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYEQSGQVFDPRRSRS 189
Query: 143 YKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
Y ++ C++ C L+ C C Y V+YGDGS + G+ ATET+T G VA
Sbjct: 190 YNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR- 246
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK------ 254
+ GCG +N GLF + ++GLG G +S +Q+ FSYCLV +S+
Sbjct: 247 -VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRS 304
Query: 255 --INFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GVSTPD-------- 300
+ FG+ + S TP+ K +TFY + + ISVG R+ GV+ D
Sbjct: 305 STVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSG 364
Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPE 354
+++DSGT++T L + S L ++ SL + CY S + +VP
Sbjct: 365 RGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPT 424
Query: 355 VTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
V++HF GA+ L N+ + V S+ C F G V I GNI Q F V +D + Q
Sbjct: 425 VSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQR 484
Query: 413 VSFKPTDC 420
V+F P C
Sbjct: 485 VAFTPKGC 492
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 134/389 (34%), Positives = 182/389 (46%), Gaps = 32/389 (8%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
RL L R N H ++ + S A Q ++ + Y +R+ IG PP++ V
Sbjct: 107 RLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSD+ W QC PC S+CY Q P+FDP S++Y + C QC SL+ C C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPISSNSYSPIRCDEPQCKSLDLSECRNGTCLY 224
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
VSYGDGS++ G ATETVTLGS + VA+ GCG NN GLF G++GLGGG
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGSAAVENVAI-----GCGHNNEGLF-VGAAGLLGLGGGK 278
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTID 285
+S +Q+ T FSYCLV S ++ + PL + TFY L +
Sbjct: 279 LSFPAQVNAT---SFSYCLVNRDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLK 335
Query: 286 AISVGNQRLGVSTPDIVI----------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
ISVG + L + + DSGT +T L L + P A+
Sbjct: 336 GISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKAN 395
Query: 336 PTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKV-SEDIVCSVFKGITNSV 391
+ CY +S V T+ FR G ++ L N+ + V S C F T+S+
Sbjct: 396 GVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSL 455
Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN+ Q VG+DI V F C
Sbjct: 456 SIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 125/362 (34%), Positives = 185/362 (51%), Gaps = 40/362 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + +S+GTPP A+ DTGSDL WTQC PC + C+ Q +PL+DP SST+ LPC+S
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPC-TTACFAQPTPLYDPARSSTFSKLPCAS 154
Query: 151 SQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---LPGITFG 205
C +L ++C+ C Y Y G F+ G LA +T+ +G G A G+ FG
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFG 213
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINFGTNGI 262
C T NGG + +GIVGLG +SL+SQ+ G+FSYCL ++ I FG
Sbjct: 214 CSTANGGDMDGA-SGIVGLGRSALSLLSQIGV---GRFSYCLRSDADAGASPILFGALAN 269
Query: 263 VSGPGVVSTPL-------TKAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDS 305
V+G V ST L + +Y + + I+VG+ L V++ +++DS
Sbjct: 270 VTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDS 329
Query: 306 GTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-SQVPEVTIHFR 360
GTT T+L + GY LS + ++ V+ +LC+ + + VP + F
Sbjct: 330 GTTFTYLAEAGYTMLRQAFLSQTAGLLTR--VSGAQFDFDLCFEAGAADTPVPRLVFRFA 387
Query: 361 -GADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
GA+ + R ++F V E V + T V + GN+MQ + V YD++ T SF P
Sbjct: 388 GGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATFSFAPA 447
Query: 419 DC 420
DC
Sbjct: 448 DC 449
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 131/412 (31%), Positives = 206/412 (50%), Gaps = 54/412 (13%)
Query: 54 LRDALTRSLNRLNHFN--QNSSISSSKASQ---ADIIP----NNANYLIRISIGTPPTER 104
+R A+ RS R + +N + S K Q A ++P + Y++ ++IGTPP
Sbjct: 50 IRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAIGTPPQPV 109
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A+ DTGSDLIWTQC PC + C Q PLF P S++Y+ + C+ + C+ + SC
Sbjct: 110 SALLDTGSDLIWTQCAPC--ASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCERP 167
Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT--FGCGTNNGGLFNSKTTGI 221
+ C Y +YGDG+ + G ATE T S+ G + + FGCG+ N G N+ +GI
Sbjct: 168 DTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNG-SGI 226
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--------INFGTNGIVSGPGVVSTPL 273
VG G +SL+SQ+ +FSYCL +S + ++ G G +G V +TPL
Sbjct: 227 VGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQSTLLFGSLSDGVYGDATG-RVQTTPL 282
Query: 274 TKA---KTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNL 320
++ TFY + ++VG +RL + PD +++DSGT LT LP + +
Sbjct: 283 LQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEV 342
Query: 321 LSVMSSMIEAQPVADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRS 369
+ + P A+ G+ E +C+ S S VP + +HF+GAD+ L R
Sbjct: 343 VRAFRQQLRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRR 400
Query: 370 NFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + +C + + GN++Q + V YD+E +T+S P C
Sbjct: 401 NYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 132/419 (31%), Positives = 199/419 (47%), Gaps = 40/419 (9%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ---ADII 85
+ ++L HRD K P + P +R ++ ++R R++ + S S + +D++
Sbjct: 71 WKLKLFHRD--KLPLNFDPDHP-RRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVV 127
Query: 86 ----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +RI +G+PP + V D+GSD++W QC+PC S+CY Q P+FDP S+
Sbjct: 128 SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPC--SECYQQSDPVFDPAGSA 185
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
TY + C SS C L+ C+ C+Y VSYGDGS++ G LA ET+T G V +
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR-----VLIRN 240
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG 258
I GCG N G+F ++GLGGG +S + Q+ G FSYCLV S+ + FG
Sbjct: 241 IAIGCGHMNRGMFIGAAG-LLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 299
Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDS 305
+ G V PL +A +FY + + + VG R+ + +V+D+
Sbjct: 300 RGAMPVGAAWV--PLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDT 357
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
GT +T LP P +D + CY+ N +VP V+ +F G
Sbjct: 358 GTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGP 417
Query: 364 V-KLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ L NF + V E C F + + I GNI Q + D V F PT C
Sbjct: 418 ILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 140/466 (30%), Positives = 210/466 (45%), Gaps = 76/466 (16%)
Query: 12 FFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN 71
F C +++ A++ +L H DS + T ++ LR + RS RL
Sbjct: 19 LFPCVLLLTFSLAESAALRADLTHVDSGRG------FTKHELLRRMVARSKARL------ 66
Query: 72 SSISSSKASQADIIP--------NNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPC 122
+S+ SS A P ++ YLI + IGTP +R+ + DTGSDL+WTQC C
Sbjct: 67 ASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-C 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS---LNQKSCSGVN--CQYSVSYGDGSF 177
+ C+ Q P+F +S T+ +PCS C L C+ + C Y+ Y D S
Sbjct: 126 --TVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSI 183
Query: 178 SNGNLATETVTLGS--TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ G +A +T T + A A+P I FGCG N GLF +GI G G G +SL SQ+
Sbjct: 184 TTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQL 243
Query: 236 RTTIAGKFSYCLVPVSSTKIN-------------FGTNGIVS---GPGVVSTPLTKAKTF 279
+ +FSYC + ++++ T I S PG P+ ++ F
Sbjct: 244 KVR---RFSYCFTAMEESRVSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPV-GSQPF 299
Query: 280 YVLTIDAISVGNQRL----------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 329
Y L++ ++VG RL G + IDSGT +TF PQ +L + +
Sbjct: 300 YFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVP 359
Query: 330 ---AQPVADPTGSLELCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSED----- 378
A+ DP LC+S + + VP++ +H GAD +L R N+ + +D
Sbjct: 360 LPVAKGYTDPDN--LLCFSVPAKKKAPAVPKLILHLEGADWELPRENYVLDNDDDGSGAG 417
Query: 379 -IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+C V NS I GN Q N + YD+E + F P C K
Sbjct: 418 RKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPARCDK 463
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 124/360 (34%), Positives = 184/360 (51%), Gaps = 40/360 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ IG+P + V DTGSD+ W QC PC CY Q+ +FDP+ SS+++ L
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQAVALPGIT 203
CS+ QC L+ K+C+ + C Y VSYGDGSF+ G+LA++ +V+ G T+ +
Sbjct: 69 CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS-------PVV 121
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-----VSSTKINFG 258
FGCG +N GLF ++GLG G +S SQ+ + KFSYCLV +S+ + FG
Sbjct: 122 FGCGHDNEGLFVGAAG-LLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177
Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP-----------DIVID 304
+ + + T L K TFY + IS+G L + + ++ID
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 361
SGT++T LP + + S + P A + CY F++L+ V P V+ HF G
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297
Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
A V+L SN+ V V + C F + + I GNI Q V D++ V F P C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 124/360 (34%), Positives = 184/360 (51%), Gaps = 40/360 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ IG+P + V DTGSD+ W QC PC CY Q+ +FDP+ SS+++ L
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET--VTLGSTTGQAVALPGIT 203
CS+ QC L+ K+C+ + C Y VSYGDGSF+ G+LA+++ V+ G T+ +
Sbjct: 69 CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS-------PVV 121
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-----VSSTKINFG 258
FGCG +N GLF ++GLG G +S SQ+ + KFSYCLV +S+ + FG
Sbjct: 122 FGCGHDNEGLFVGAAG-LLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177
Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP-----------DIVID 304
+ + + T L K TFY + IS+G L + + ++ID
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 361
SGT++T LP + + S + P A + CY F++L+ V P V+ HF G
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297
Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
A V+L SN+ V V + C F + + I GNI Q V D++ V F P C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 141/462 (30%), Positives = 225/462 (48%), Gaps = 57/462 (12%)
Query: 5 LSCVFILFFLCFYVV------SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
+S + LFF + S ++ + ++L H S KSP NS+ + +
Sbjct: 1 MSLFWFLFFSAHLAIASSLKDSGLKHKQPDMQLKLYHMTSLKSP-PNSTSLLFAYM---F 56
Query: 59 TRSLNRLNHFNQN-SSISSSKASQADIIPNNA-------------NYLIRISIGTPPTER 104
+ R+ +F+ + S + AS + P A NY +++ +G+P
Sbjct: 57 AKDEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYY 116
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC-----SSSQCASLNQK 159
+ DTGS W QC+PC C++Q+ P+F+P S TYK++PC SS + A+LN+
Sbjct: 117 TMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEP 175
Query: 160 SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
+CS + C Y SYGD SFS G L+ + +TL T Q L +GCG +N GLF +
Sbjct: 176 TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQ--TLSSFVYGCGQDNQGLFG-R 230
Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-------INFGTNGIVSGPGVVS 270
T GI+GL ++S++SQ+ FSYCL ST ++ GT+ +
Sbjct: 231 TDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKF 290
Query: 271 TPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLPQGYNSNLLSV 323
TPL K + Y + +++I+V + LGV+ +IDSGT +T LP + L +
Sbjct: 291 TPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNA 350
Query: 324 MSSMIEAQPVADPTGS-LELCY--SFNSLSQV-PEVTIHFR-GADVKLSRSNFFVKVSED 378
+++ + P S L+ C+ S +S+V P++ I F+ GAD++L N V++
Sbjct: 351 YVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG 410
Query: 379 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I C G ++S+ I GN Q V YD+ V F P C
Sbjct: 411 ITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 129/363 (35%), Positives = 175/363 (48%), Gaps = 39/363 (10%)
Query: 90 NYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
NY+ I++G + L V DTGSDL W QCEPCP S CY Q PLFDP S T+ ++PC
Sbjct: 179 NYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPC 238
Query: 149 SSSQCASLNQKSCSGV-------------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
S CA+ + K +G C Y++SYGDGSFS G LA +T+ LG+TT
Sbjct: 239 GSPACAA-SLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTT-- 295
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-- 253
L G FGCG +N GLF T G++GLG D+SL+SQ G FSYCL P ++T
Sbjct: 296 --KLDGFVFGCGLSNRGLFGG-TAGLMGLGRTDLSLVSQTAARFGGVFSYCL-PATTTST 351
Query: 254 -KINFGTNGIVSGPGVVSTPLTKAKT---FYVLTIDAISVGNQRL----GVSTPDIVIDS 305
++ G S P + T + T FY + I +VG G ++++DS
Sbjct: 352 GSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNVLVDS 411
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GA 362
GT +T L + + + E P A L+ CY + VP +T+ GA
Sbjct: 412 GTVITRLAPSVYKAVRAEFARRFE-YPAAPGFSILDACYDLTGRDEVNVPLLTLTLEGGA 470
Query: 363 DVKLSRSNFFVKVSED--IVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
V + + V +D VC + + PI GN Q N V YD + F
Sbjct: 471 QVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADE 530
Query: 419 DCT 421
DCT
Sbjct: 531 DCT 533
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 181/355 (50%), Gaps = 50/355 (14%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y I++G+PP + V DTGSDL W +C+PC P C S FD S+TYK+L C+
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCAD 57
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTN 209
YS YGDGSF+ G+L+ +T+ + G+ + + PG FGCG+
Sbjct: 58 ----------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSL 101
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV------PVSSTKINFGTNGI- 262
GL S GI+ L G +S SQ+ KFSYCL+ + + + FG +
Sbjct: 102 LKGLI-SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVE 160
Query: 263 VSGPG------VVSTPLTKAKTFYVLTIDAISVGNQRLGVS--------TPDIVIDSGTT 308
+ PG + TP+ ++ +Y + +D ISVGNQRL +S + DSGTT
Sbjct: 161 LKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKPTIFDSGTT 220
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFR-GADVK 365
LT LP G ++ ++SM+ G L+ C+ +S +P++T HF GAD
Sbjct: 221 LTMLPPGVCDSIKQSLASMVSGAEFVAIKG-LDACFRVPPSSGQGLPDITFHFNGGADFV 279
Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
SN+ + + + C +F TN V I+GN+ Q +F V +D++ + + FK TDC
Sbjct: 280 TRPSNYVIDLGS-LQCLIFV-PTNEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 123/355 (34%), Positives = 176/355 (49%), Gaps = 24/355 (6%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A I+P Y++ + +GTP + DTGSDL WTQCEPC C+ Q+ P FDP S+
Sbjct: 131 ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPC-LGGCFPQNQPKFDPTTST 189
Query: 142 TYKSLPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+YK++ CSS C + + + C C Y + YG G ++ G LATET+ + S+
Sbjct: 190 SYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATETLAIASSD--- 245
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
FGC + G FN TTG++GLG I+L SQ FSYCL P S +
Sbjct: 246 -VFKNFLFGCSEESRGTFNG-TTGLLGLGRSPIALPSQTTNKYKNLFSYCL-PASPSSTG 302
Query: 257 FGTNGIVSGPGVVSTPLT-KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLP 313
+ G+ STP++ K K Y L ISV + L + S +IDSGTT TFLP
Sbjct: 303 HLSFGVEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTFLP 362
Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFRGA-DVKLSR 368
S L S M+ + + T S + CY F+++ +P ++I F G +V++
Sbjct: 363 SPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDV 422
Query: 369 SNFFVKVSE-DIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S + V+ VC F G + I+GN Q + V YD+ + V F P C
Sbjct: 423 SGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 167/351 (47%), Gaps = 25/351 (7%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N NY++ I +GTP V DTGSD W QC+PC + CY Q PLF P S+TY ++
Sbjct: 161 NTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCV-AYCYQQKEPLFTPTKSATYANI 219
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+SS C+ L+ + CSG +C Y+V YGDGS++ G A +T+TLG T + FGC
Sbjct: 220 SCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYDT-----VKDFRFGC 274
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF----GTNGI 262
G N GLF K G++GLG G S+ Q +G F+YC +P +S+ F
Sbjct: 275 GEKNRGLFG-KAAGLMGLGRGKTSVPVQAYDKYSGVFAYC-IPATSSGTGFLDFGPGAPA 332
Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYN 317
+ + + TFY + + I VG L + S ++DSGT +T LP
Sbjct: 333 AANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPPSAY 392
Query: 318 SNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGA---DVKLSRS 369
L S + +E A L+ CY +P V++ F+G DV S
Sbjct: 393 EPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDASGI 452
Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ VS+ + + I GN Q + V YD+ ++ V F P C
Sbjct: 453 LYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 139/435 (31%), Positives = 201/435 (46%), Gaps = 62/435 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
FS++L RDS +N+ Y+ L L+R +R+ + S+ ++D+ P
Sbjct: 76 FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131
Query: 87 -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ Y R+ +G P V DTGSD+ W QC+PC + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+FDP+ SS++ SLPC S QC +L C C Y VSYGDGSF+ G ET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETL 249
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T G++ + + GCG +N GLF + GG +SL SQM+ A FSYCL
Sbjct: 250 TFGNSG----MINNVAVGCGHDNEGLFVGSAGLLGLGGGS-LSLTSQMK---ASSFSYCL 301
Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD---- 300
V S+ + + V+ PL K+ TFY + + +SVG Q L + P+
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSI-PPNLFQM 360
Query: 301 -------IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSL 349
I++DSGT +T L Q YN ++ + + P T L CY +S
Sbjct: 361 DDSGYGGIIVDSGTAITRLQTQAYN----TLRDAFVSRTPYLKKTNGFALFDTCYDLSSQ 416
Query: 350 SQV--PEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
S+V P V+ F G ++L N+ + V S C F T+S+ I GN+ Q V
Sbjct: 417 SRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVH 476
Query: 406 YDIEQQTVSFKPTDC 420
YD+ V F P C
Sbjct: 477 YDLANSVVGFSPHKC 491
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 170/352 (48%), Gaps = 32/352 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG PP++ + DTGSD+ W QC PC + CY Q P+F+P S+++ +L
Sbjct: 146 SGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPC--ADCYQQADPIFEPASSASFSTLS 203
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C++ QC SL+ C C Y VSYGDGS++ G+ TET+TLGS VA+ GCG
Sbjct: 204 CNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI-----GCG 258
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
NN GLF + GG +S SQ+ T FSYCLV S + P
Sbjct: 259 HNNEGLFVGAAGLLGLGGGS-LSFPSQINAT---SFSYCLVDRDSESASTLEFNSTLPPN 314
Query: 268 VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLTFL 312
VS PL + TFY + + +SVG + VS P+ +++DSGT +T L
Sbjct: 315 AVSAPLLRNHHLDTFYYVGLTGLSVGGEL--VSIPESAFQIDESGNGGVIVDSGTAITRL 372
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRS 369
++L P + + CY +S +VP V+ HF G ++ L
Sbjct: 373 QTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAK 432
Query: 370 NFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ V + SE C F +S+ I GN+ Q V YD+ V F P C
Sbjct: 433 NYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 124/360 (34%), Positives = 180/360 (50%), Gaps = 40/360 (11%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
GF ++L H D+ +S T Q L A+ RS R+ + + A++
Sbjct: 28 GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ ++ YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P FD K S+TY
Sbjct: 82 VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
++LPC SS+CASL+ SC C Y YGD + + G LA ET T G+ V I
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199
Query: 204 FGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINFG- 258
FGCG+ N G L NS +G+VG G G +SL+SQ+ + +FSYCL + + +++ FG
Sbjct: 200 FGCGSLNAGDLANS--SGMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGV 254
Query: 259 -----TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TPD 300
+ SG V STP Y L++ AIS+G + L + T
Sbjct: 255 YANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGG 314
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 360
++IDSGT++T+L Q + + S I + D L+ C+ + V FR
Sbjct: 315 VIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLDTCFQWPPPPNVTVTVPDFR 374
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 177/356 (49%), Gaps = 36/356 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ +G P + V DTGSD+ W QC+PC + CY Q P++DP +S++Y ++
Sbjct: 159 GSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPC--ADCYAQSDPVYDPSVSTSYATV 216
Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C S +C L+ +C S +C Y V+YGDGS++ G+ ATET+TLG + + +
Sbjct: 217 GCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDS----APVSNVAI 272
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNG 261
GCG +N GLF ++ LGGG +S SQ+ T FSYCLV SS+ + FG
Sbjct: 273 GCGHDNEGLFVGAAG-LLALGGGPLSFPSQISATT---FSYCLVDRDSPSSSTLQFGD-- 326
Query: 262 IVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTT 308
S V+ PL ++ TFY + + ISVG + L + + +++DSGT
Sbjct: 327 --SEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTA 384
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVK 365
+T L G L ++ P A + CY S QVP V + F G ++K
Sbjct: 385 VTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGELK 444
Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N+ + V + C F G + V I GN+ Q V +D + TV F C
Sbjct: 445 LPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 137/434 (31%), Positives = 207/434 (47%), Gaps = 52/434 (11%)
Query: 25 QTGGFSVELIHRDSPKSPF--YNSSETPYQRLRDALTRSL-NRLNHF----NQNSSISSS 77
+ G +E+ H+DS +N + + D RSL +R+ N + S+ +
Sbjct: 62 ENGATILEMKHKDSCSGKILDWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDDSVDAP 121
Query: 78 KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ I NY++ + +G + + DTGSDL W QC+PC +CY Q P+F+P
Sbjct: 122 IPLTSGIRLQTLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--KRCYNQQDPVFNP 177
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS-GV------NCQYSVSYGDGSFSNGNLATETVTLG 190
S +Y+++ CSS C SL + + GV +C Y V+YGDGS++ G L TE + LG
Sbjct: 178 STSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDLG 237
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
++T A+ FGCG NN GLF +G+VGLG +SLISQ G FSYCL P+
Sbjct: 238 NST----AVNNFIFGCGRNNQGLFGG-ASGLVGLGRSSLSLISQTSAMFGGVFSYCL-PI 291
Query: 251 SSTKIN----FGTNGIVSGPGVVSTPLTKAKT-------FYVLTIDAISVGNQRLGVSTP 299
+ T+ + G N V +TP++ + FY L + I+VG+ + V P
Sbjct: 292 TETEASGSLVMGGNSSVYKN---TTPISYTRMIPNPQLPFYFLNLTGITVGS--VAVQAP 346
Query: 300 D-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-- 352
++IDSGT +T LP L P A L+ C++ + +V
Sbjct: 347 SFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEI 406
Query: 353 PEVTIHFRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYD 407
P + +HF G +V ++ +FVK VC ++ N V I GN Q N V YD
Sbjct: 407 PNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYD 466
Query: 408 IEQQTVSFKPTDCT 421
+ + F CT
Sbjct: 467 TKGSMLGFAAEACT 480
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 126/393 (32%), Positives = 193/393 (49%), Gaps = 43/393 (10%)
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
D +TR L+ L N ++ ++S A Q ++ + Y R+ IG+P + V DTG
Sbjct: 129 DGVTR-LD-LRPANGSAVFAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTG 186
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYS 169
SD+ W QC+PC + CY Q P+FDP +S++Y ++ C S +C L+ +C C Y
Sbjct: 187 SDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYE 244
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
V+YGDGS++ G+ ATET+TLG +T + + GCG +N GLF ++ LGGG +
Sbjct: 245 VAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAG-LLALGGGPL 299
Query: 230 SLISQMRTTIAGKFSYCLV----PVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVL 282
S SQ+ A FSYCLV P +ST + FG + G V+ PL ++ TFY +
Sbjct: 300 SFPSQIS---ASTFSYCLVDRDSPAAST-LQFGDGAAEA--GTVTAPLVRSPRTSTFYYV 353
Query: 283 TIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+ ISVG Q L + + +++DSGT +T L + L +
Sbjct: 354 ALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSL 413
Query: 332 PVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCSVFKGI 387
P + CY + + +VP V++ F G ++L N+ + V C F
Sbjct: 414 PRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPT 473
Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+V I GN+ Q V +D + V F P C
Sbjct: 474 NAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 141/419 (33%), Positives = 205/419 (48%), Gaps = 32/419 (7%)
Query: 22 IEAQTGGFSVELIHRDSPKS--PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA 79
+ +G +V L HR P S P N+ RD L + + N S +
Sbjct: 50 VAPSSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEG 109
Query: 80 SQADIIP------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
S + + YLI + +G+P + + DTGSD+ W QC+PC SQC+ Q
Sbjct: 110 SDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPC--SQCHSQADS 167
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
LFDP SSTY + C+S+ CA L Q+ CS CQY+V YGDGS +G +++T+ LGS+T
Sbjct: 168 LFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSST 227
Query: 194 GQAVALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
+ FGC + +G L +T G++GLGGG SL +Q T FSYCL P
Sbjct: 228 -----VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG 282
Query: 253 TKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----VIDS 305
+ F T G + VV TP+ T+ ++Y + + AI VG ++L + ++DS
Sbjct: 283 SS-GFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSIMDS 341
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGAD 363
GT +T LP+ S L S + ++ P A P G + C+ F+ S V P V + F G
Sbjct: 342 GTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGA 401
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V S+ + S C F ++ S+ I GN+ Q F V YD+ V FK C
Sbjct: 402 VVDLASDGIILGS----CLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 177/358 (49%), Gaps = 37/358 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ IG+P E V DTGSD+ W QC+PC + CY Q P+FDP +S++Y ++
Sbjct: 165 GSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAV 222
Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C S +C L+ +C C Y V+YGDGS++ G+ ATET+TLG +T + +
Sbjct: 223 SCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST----PVTNVAI 278
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTN 260
GCG +N GLF ++ LGGG +S SQ+ A FSYCLV P +ST + FG +
Sbjct: 279 GCGHDNEGLFVGAAG-LLALGGGPLSFPSQIS---ASTFSYCLVDRDSPAAST-LQFGAD 333
Query: 261 GIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV-----------STPDIVIDSG 306
G + V+ PL ++ TFY + + ISVG Q L + + +++DSG
Sbjct: 334 GAEA--DTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSG 391
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD- 363
T +T L + L + P + CY + + +VP V++ F G
Sbjct: 392 TAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGA 451
Query: 364 VKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++L N+ + V C F +V I GN+ Q V +D + V F P C
Sbjct: 452 LRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 175/356 (49%), Gaps = 34/356 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI IGTP E+ V DTGSD++W QCEPC +CY Q P+F+P S ++ ++
Sbjct: 5 SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--RECYSQADPIFNPSSSVSFSTVG 62
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S+ C+ L+ C G C Y VSYGDGS++ G+ ATET+T G+T+ Q VA+ GCG
Sbjct: 63 CDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAI-----GCG 117
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---VSSTKINFGTNGIVS 264
+N GLF ++GLG G +S +Q+ T FSYCLV SS + FG +
Sbjct: 118 HDNVGLFVGAAG-LLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPI 176
Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD-------------IVIDSGTT 308
G + TPL TFY L++ AISVG L S P I+IDSGT
Sbjct: 177 GS--IFTPLVANPFLPTFYYLSMVAISVGGVILD-SVPSEAFRIDETTGRGGIIIDSGTA 233
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVK 365
+T L L + + P AD + CY ++L V P V HF GA
Sbjct: 234 VTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFI 293
Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N + + S C F +++ I GNI Q V +D V F C
Sbjct: 294 LPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 131/380 (34%), Positives = 193/380 (50%), Gaps = 56/380 (14%)
Query: 84 IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
++ N+A Y + +SIGTPP +ADTGS LIWTQC PC ++C + +P F P SST
Sbjct: 82 LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139
Query: 143 YKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
+ LPC+SS C L +C+ C Y YG G F+ G LATET+ +G + P
Sbjct: 140 FSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVG-----GASFP 193
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINF 257
G+ FGC T NG + ++GIVGLG +SL+SQ+ G+FSYCL + I F
Sbjct: 194 GVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQVGV---GRFSYCLRSDADAGDSPILF 248
Query: 258 GTNGIVSGPGVVSTPLTK-----AKTFYVLTIDAISVGNQRLGVSTPDI----------- 301
G+ V+G V STPL + + ++Y + + I+VG L V++
Sbjct: 249 GSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLV 308
Query: 302 ---VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSL---SQ 351
++DSGTTLT+L +GY + +S M A G+ +LC+ + S
Sbjct: 309 GGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSG 368
Query: 352 VPEVTIHFR---GADVKLSRSNFFVKVSED------IVCSVFKGITN--SVPIYGNIMQT 400
VP T+ R GA+ + R ++ V+ D + C + + S+ I GN+MQ
Sbjct: 369 VPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQM 428
Query: 401 NFLVGYDIEQQTVSFKPTDC 420
+ V YD++ SF P DC
Sbjct: 429 DLHVLYDLDGGMFSFAPADC 448
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 144/433 (33%), Positives = 201/433 (46%), Gaps = 58/433 (13%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
FS++L P+ N Y+ L L R R+N N ++ S +++D+ P
Sbjct: 77 FSLQL----HPRETLLNEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPT 132
Query: 88 N---------------------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
Y R+ +G P V DTGSD+ W QC+PC S
Sbjct: 133 ETELLRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPC--SD 190
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
CY Q P+FDP SS+Y L C + QC L +C C Y VSYGDGSF+ G TET
Sbjct: 191 CYQQSDPIFDPTASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTET 250
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
V+ G+ + VA+ GCG +N GLF + G++GLGGG +SL SQ++ T FSYC
Sbjct: 251 VSFGAGSVNRVAI-----GCGHDNEGLF-VGSAGLLGLGGGPLSLTSQIKAT---SFSYC 301
Query: 247 LVPVSSTKIN-FGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVSTPD--- 300
LV S K + N G VV+ L K TFY + + +SVG + + V P+
Sbjct: 302 LVDRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVP-PETFA 360
Query: 301 --------IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 350
+++DSGT +T L Q YNS + +P A+ + CY +SL
Sbjct: 361 VDQSGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRP-AEGVALFDTCYDLSSLQS 419
Query: 351 -QVPEVTIHFRGADV-KLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
+VP V+ HF G L N+ + V C F T+S+ I GN+ Q V +D
Sbjct: 420 VRVPTVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFD 479
Query: 408 IEQQTVSFKPTDC 420
+ V F P C
Sbjct: 480 LANSLVGFSPNKC 492
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 125/355 (35%), Positives = 180/355 (50%), Gaps = 38/355 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y R+ +G P + V DTGSD+ W QC+PC + CY Q P+FDP SSTY + C
Sbjct: 18 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPTASSTYAPVTC 75
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S QC+SL SC C Y V+YGDGS++ G+ ATE+V+ G++ ++ + GCG
Sbjct: 76 QSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG----SVKNVALGCGH 131
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSG 265
+N GLF G++GLGGG +SL +Q++ T FSYCLV S+ ++F N G
Sbjct: 132 DNEGLF-VGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSSTLDF--NSAQLG 185
Query: 266 PGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLT 310
V+ PL K + TFY + + +SVG Q VS P+ I++D GT +T
Sbjct: 186 VDSVTAPLMKNRKIDTFYYVGLSGMSVGGQM--VSIPESTFRLDESGNGGIIVDCGTAIT 243
Query: 311 FLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKL 366
L Q YN L M + + + CY + + +VP V+ HF G L
Sbjct: 244 RLQTQAYNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNL 302
Query: 367 SRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+N+ + V S C F T+S+ I GN+ Q V +D+ + F P C
Sbjct: 303 PAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 138/428 (32%), Positives = 203/428 (47%), Gaps = 46/428 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
SV L HR P SP +S + L R R ++ + S S+ A+ D
Sbjct: 61 SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 120
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
+P + Y+I + +G+P + V DTGSD+ W QCEPCP PS C+ LF
Sbjct: 121 SVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 180
Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
DP SSTY + CS++ CA L + +G + CQY V YGDGS + G +++ +TL
Sbjct: 181 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 239
Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL-- 247
+G V + G FGC G + KT G++GLGG SL+SQ FSYCL
Sbjct: 240 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPA 296
Query: 248 VPVSSTKINFGTNGIVSGPGV---VSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI 301
P SS + G G G +TP+ ++K T+Y ++ I+VG ++LG+S P +
Sbjct: 297 TPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS-PSV 355
Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
++DSGT +T LP + L S + + A+P G L+ C++F L +V P
Sbjct: 356 FAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPT 415
Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQT 412
V + F G V ++ V C F + + GN+ Q F V YD+
Sbjct: 416 VALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGV 471
Query: 413 VSFKPTDC 420
F+ C
Sbjct: 472 FGFRAGAC 479
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 128/358 (35%), Positives = 174/358 (48%), Gaps = 37/358 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP S ++ ++P
Sbjct: 142 SGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPC--IKCYSQTDPVFDPTKSRSFANIP 199
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITF 204
C S C L+ CS C Y VSYGDGSF+ G +TET+T G+ G+ V
Sbjct: 200 CGSPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVV------L 253
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
GCG +N GLF ++GLG G +S SQ+ KFSYCL S++ + IV
Sbjct: 254 GCGHDNEGLFVGAAG-LLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASS---RPSSIVF 309
Query: 265 GPGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDS 305
G +S TPL K TFY + + ISVG R+ G+S ++IDS
Sbjct: 310 GDSAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDS 369
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGAD 363
GT++T L + L A + C+ + ++ VP V +HFRGAD
Sbjct: 370 GTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGAD 429
Query: 364 VKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V L SN+ + V C F G + + I GNI Q F V YD+ V F P C
Sbjct: 430 VPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 119/415 (28%), Positives = 193/415 (46%), Gaps = 58/415 (13%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIPN-------NANYLIRISIGTPPTERLAV 107
R+ L R R +++ + S +A+ A + P + YL+ ++IGTPP +
Sbjct: 70 RELLHRMAARSK--ARSARLLSGRAASARVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLI 127
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------ 161
DTGSDL WTQC PC C+ Q P F+P S T+ LPC C L SC
Sbjct: 128 LDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWG 185
Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTT 219
+G+ C Y+ +Y D S + G+L ++T + S ++P +TFGCG N G+F S T
Sbjct: 186 NGI-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNET 244
Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---------------INFGTNGIVS 264
GI G G +S+ +Q++ FSYC ++ ++ G +G+V
Sbjct: 245 GIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQ 301
Query: 265 GPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
++ ++ K +Y+ ++ ++VG RL + T ++DSGT +T LP+
Sbjct: 302 STALIRYHSSQLKAYYI-SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFRGADVKLSRSNF 371
+ + + + V + T SL +LC+S + VP + +HF GA + L R N+
Sbjct: 361 AVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENY 419
Query: 372 FVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
++ E + C + + GN Q N V YD+ +SF P C K
Sbjct: 420 MFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNK 473
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 139/434 (32%), Positives = 205/434 (47%), Gaps = 57/434 (13%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----------NSSISSSKASQ 81
++HRD+ + ++ T + LR L R R ++ N + S A
Sbjct: 72 RVVHRDA-----FAANATAAELLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVA 126
Query: 82 ADIIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
A ++ A Y +I +GTP T L V DTGSD++W QC PC +CY Q P+FDP
Sbjct: 127 APVVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPC--RRCYDQSGPVFDP 184
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+ SS+Y ++ C++ C L+ C C Y V+YGDGS + G+ ATET+T G
Sbjct: 185 RRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG--GA 242
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
VA + GCG +N GLF + ++GLG G +S +Q+ FSYCLV +S+
Sbjct: 243 RVAR--VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSS 299
Query: 256 NFG---------TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVSTPD-- 300
+ T G S TP+ + +TFY + + ISVG R+ GV+ D
Sbjct: 300 SGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLR 359
Query: 301 ---------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NS 348
+++DSGT++T L + S L + ++ SL + CY
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRK 419
Query: 349 LSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
+ +VP V++HF GA+ L N+ + V S C F G V I GNI Q F V +
Sbjct: 420 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 479
Query: 407 DIEQQTVSFKPTDC 420
D + Q V F P C
Sbjct: 480 DGDGQRVGFAPKGC 493
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 120/413 (29%), Positives = 190/413 (46%), Gaps = 52/413 (12%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
+ LR RS R + +S S D +P+ YL+ ++IGTPP + D
Sbjct: 71 ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 129
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
TGSDL WTQC PC C+ Q P F+P S T+ LPC C L SC +G
Sbjct: 130 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 187
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
+ C Y+ +Y D S + G+L ++T + S ++P +TFGCG N G+F S TGI
Sbjct: 188 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 246
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---------------INFGTNGIVSGP 266
G G +S+ +Q++ FSYC ++ ++ G +G+V
Sbjct: 247 AGFSRGALSMPAQLKVD---NFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQST 303
Query: 267 GVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGY 316
++ ++ K +Y+ ++ ++VG RL + T ++DSGT +T LP+
Sbjct: 304 ALIRYHSSQLKAYYI-SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAV 362
Query: 317 NSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFRGADVKLSRSNFFV 373
+ + + + V + T SL +LC+S + VP + +HF GA + L R N+
Sbjct: 363 YNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMF 421
Query: 374 KVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
++ E + C + + GN Q N V YD+ +SF P C K
Sbjct: 422 EIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNK 473
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 176/344 (51%), Gaps = 28/344 (8%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC- 153
+ +GTP T+ + V DTGS L W QC PC S C+ Q P+F+PK SSTY S+ CS+ QC
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQSGPVFNPKSSSTYASVGCSAQQCS 59
Query: 154 ----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
A+LN +CS N C Y SYGD SFS G L+ +TV+ GST+ LP +GCG
Sbjct: 60 DLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----LPNFYYGCGQ 114
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGV 268
+N GLF ++ G++GL +SL+ Q+ ++ F+YCL S+ + + PG
Sbjct: 115 DNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFTYCL---PSSSSSGYLSLGSYNPGQ 170
Query: 269 VS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQGYNSN 319
S TP+ + + Y + + ++V L VS+ +IDSGT +T LP S
Sbjct: 171 YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSA 230
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFVKVSE 377
L +++ ++ A L+ C+ S P VT+ F GA +KLS N V V +
Sbjct: 231 LSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDD 290
Query: 378 DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
C F S I GN Q F V YD++ + F C+
Sbjct: 291 STTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 131/434 (30%), Positives = 201/434 (46%), Gaps = 40/434 (9%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
V +P +A ++ ++H P SP + P + L R +R++ + + ++
Sbjct: 52 VCTPTKAAPSSSALTVVHGHGPCSPQESRRGAPSHT--EILGRDQDRVDAIRRKVAAVTT 109
Query: 78 KASQADI--IP---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
AS + +P + NY + +GTP T+ L DTGSD W QC+PCP
Sbjct: 110 AASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCP--D 167
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNL 182
CY Q LFDP SSTY + CSS +C L ++ +CS C Y ++Y D S++ GNL
Sbjct: 168 CYEQHEALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNL 227
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
A +T+TL T A+PG FGCG NN G F + G++GLG G SL SQ+
Sbjct: 228 ARDTLTLSPTD----AVPGFVFGCGHNNAGSFG-EIDGLLGLGRGKASLSSQVAARYGAG 282
Query: 243 FSYCL--VPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-- 296
FSYCL P ++ ++F + T + + +FY L + I+V + + V
Sbjct: 283 FSYCLPSSPSATGYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPP 342
Query: 297 ----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLS 350
+ +IDSGT + LP + L S + S + A + + CY +
Sbjct: 343 SVFATAAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETV 402
Query: 351 QVPEVTIHFR-GADVKLSRSNF---FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
++P V + F GA V L S + VS+ + + S+ + GN Q V Y
Sbjct: 403 RIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIY 462
Query: 407 DIEQQTVSFKPTDC 420
D++ Q V F C
Sbjct: 463 DVDNQKVGFGANGC 476
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 120/413 (29%), Positives = 190/413 (46%), Gaps = 52/413 (12%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
+ LR RS R + +S S D +P+ YL+ ++IGTPP + D
Sbjct: 45 ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 103
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
TGSDL WTQC PC C+ Q P F+P S T+ LPC C L SC +G
Sbjct: 104 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 161
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
+ C Y+ +Y D S + G+L ++T + S ++P +TFGCG N G+F S TGI
Sbjct: 162 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 220
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---------------INFGTNGIVSGP 266
G G +S+ +Q++ FSYC ++ ++ G +G+V
Sbjct: 221 AGFSRGALSMPAQLKVD---NFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQST 277
Query: 267 GVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGY 316
++ ++ K +Y+ ++ ++VG RL + T ++DSGT +T LP+
Sbjct: 278 ALIRYHSSQLKAYYI-SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAV 336
Query: 317 NSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFRGADVKLSRSNFFV 373
+ + + + V + T SL +LC+S + VP + +HF GA + L R N+
Sbjct: 337 YNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMF 395
Query: 374 KVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
++ E + C + + GN Q N V YD+ +SF P C K
Sbjct: 396 EIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNK 447
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 126/354 (35%), Positives = 172/354 (48%), Gaps = 30/354 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI +GTPP V DTGSD++W QC PC CY Q P+F+P S ++ +
Sbjct: 126 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKVL 183
Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C + C L C+ C Y VSYGDGS++ G TET+T T + VAL GC
Sbjct: 184 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----GC 238
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNGI 262
G +N GLF ++GLG G +S SQ T KFSYCLV S+ + + FG N
Sbjct: 239 GHDNEGLFVGAAG-LLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFG-NSA 296
Query: 263 VSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL-GVSTPD----------IVIDSGTTL 309
VS + LT + TFY + + ISVG + G++ ++ID GT++
Sbjct: 297 VSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 356
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLS 367
T L + L + + A + CY + + +VP V +HFRGADV L
Sbjct: 357 TRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLP 416
Query: 368 RSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
SN+ + V C F G T+ + I GNI Q F V YD+ V F P C
Sbjct: 417 ASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 129/380 (33%), Positives = 188/380 (49%), Gaps = 48/380 (12%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
SS + QA + Y + IS+GTP VADTGSDLIWTQC PC ++C+ Q +P F
Sbjct: 71 SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128
Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SST+ LPC+SS C L + ++C+ C Y+ YG G ++ G LATET+ +G
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PV 250
+ P + FGC T NG + T+GI GLG G +SLI Q+ G+FSYCL
Sbjct: 186 ---ASFPSVAFGCSTENG--VGNSTSGIAGLGRGALSLIPQLGV---GRFSYCLRSGSAA 237
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTK----AKTFYVLTIDAISVGNQRLGVSTPDI----- 301
++ I FG+ ++ V STP ++Y + + I+VG L V+T
Sbjct: 238 GASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQN 297
Query: 302 ------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---- 350
++DSGTTLT+L + GY + +S + V + T L+LC+
Sbjct: 298 GLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTV-NGTRGLDLCFKSTGGGGGGI 356
Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNF 402
VP + + F G + + +F V D SV +P + GN+MQ +
Sbjct: 357 AVPSLVLRFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDM 415
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
+ YD++ SF P DC K
Sbjct: 416 HLLYDLDGGIFSFAPADCAK 435
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 124/353 (35%), Positives = 173/353 (49%), Gaps = 34/353 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG PP+ V DTGSD+ W QC PC ++CY Q P+F+P S+++ SL
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPIFEPTSSASFTSLS 205
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C C Y VSYGDGS++ G+ TETVTLGST +L I GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
NN GLF + GG +S SQ+ A FSYCLV S++ ++F N ++
Sbjct: 261 HNNEGLFIGAAGLLGLGGGS-LSFPSQLN---ASSFSYCLVDRDSDSTSTLDF--NSPIT 314
Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD----------IVIDSGTTLTF 311
P V+ PL + TF+ L + +SVG L + I++DSGT +T
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSR 368
L + L A + CY +S S +VP V+ HF G ++ L
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433
Query: 369 SNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V SE C F +++ I GN Q VG+D+ V F P C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 135/427 (31%), Positives = 203/427 (47%), Gaps = 53/427 (12%)
Query: 33 LIHRD----SPKSPFYNSSETPYQRLRDALTRSL-NRLNHF---NQNSSISSSKASQADI 84
+ HRD S KS +N L D RSL +R+ N ++ S + +
Sbjct: 1 MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
NY++ + IG + DTGSDL W QC+PC CY Q PLF+P S +Y+
Sbjct: 61 RLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPC--RLCYNQQDPLFNPSGSPSYQ 116
Query: 145 SLPCSSSQCASLNQKSCS----GVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ C+SS C SL + + G N C Y V+YGDGS++ G+L E + LG+T
Sbjct: 117 TILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTT----- 171
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
+ FGCG NN GLF +G++GLG D+SL+SQ G FSYCL +T +
Sbjct: 172 HVSNFIFGCGRNNKGLFGG-ASGLMGLGKSDLSLVSQTSAIFEGVFSYCL---PTTAADA 227
Query: 258 GTNGIVSGPGVV---STPLTKAK--------TFYVLTIDAISVGNQRLGVSTPD-----I 301
+ I+ G V +TP++ + TFY L + IS+G + + P+ I
Sbjct: 228 SGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG--VALQAPNYRQSGI 285
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
+IDSGT +T LP +L + P A P L+ C++ N +V P + + F
Sbjct: 286 LIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQF 345
Query: 360 RG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
G V ++ +FVK VC ++ + +PI GN Q N V Y+ ++ +
Sbjct: 346 EGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLG 405
Query: 415 FKPTDCT 421
F C+
Sbjct: 406 FAAEACS 412
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 137/418 (32%), Positives = 205/418 (49%), Gaps = 35/418 (8%)
Query: 31 VELIHRDSPKSPFY--NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
++++H+ P S + +E Y L+D +R + + +++S +S KA+ A +P
Sbjct: 85 LKVVHKHGPCSDLRQGHKAEAQYILLQDQ-SRVDSIHSKLSKDSGLSDVKATAATTLPAK 143
Query: 87 -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ NY + + +GTP + + DTGSDL WTQCEPC S CY Q +F+P S+
Sbjct: 144 DGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKS-CYNQKEAIFNPSQST 202
Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+Y ++ C S+ C SL N +C+ C Y + YGD SFS G E ++L +T
Sbjct: 203 SYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD--- 259
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
FGCG NN GL G++GLG +SL+SQ FSYCL P SS+
Sbjct: 260 -VFNDFYFGCGQNNKGL-FGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCL-PSSSSSTG 316
Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTT 308
F T G + TPL + +FY L + ISVG ++L + ST +IDSGT
Sbjct: 317 FLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTV 376
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVK 365
+T LP S L S ++ P A L+ C+ F++ VP++ + F G V
Sbjct: 377 ITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVD 436
Query: 366 LSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ ++ F VC F G +++ V I+GN+ Q V YD V F P C+
Sbjct: 437 IDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/295 (38%), Positives = 162/295 (54%), Gaps = 19/295 (6%)
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C S C L+ CS C Y+ YGD S + G LA +T T S TG+ V+L FGC
Sbjct: 21 CDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFGC 80
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCLVPVS-----STKINFGTN 260
G NN G FN G++GLGGG SLISQ+ G KFS CLVP S++++FG
Sbjct: 81 GHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKG 140
Query: 261 GIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRL----GVSTPDIVIDSGTTLTFLP 313
V G GVV+TPL + + T Y +T+ ISV + L + ++++DSGT LP
Sbjct: 141 SQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILP 200
Query: 314 QG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 372
Q Y+ + V +++ DP+ +LCY + + P +T HF GA++ L+ F
Sbjct: 201 QQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLLTPIQTF 260
Query: 373 VKVSED---IVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
+ + + + C TNS +YGN Q+N+L+G+D+++Q VSFK TDCTKQ
Sbjct: 261 IPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKATDCTKQ 315
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 116/363 (31%), Positives = 179/363 (49%), Gaps = 40/363 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTPPT L V DTGSD++W QC+PC CY Q SPL+DP+ SSTY P
Sbjct: 96 SGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPC--VHCYRQLSPLYDPRGSSTYAQTP 153
Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CS QC N ++C G C Y + YGD S ++GNLAT+ + + T ++ +T G
Sbjct: 154 CSPPQCR--NPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDT----SVGNVTLG 207
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTN 260
CG +N GLF S G++G+ G+ S +Q+ + F+YCL SS+ + FG
Sbjct: 208 CGHDNEGLFGS-AAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGRT 266
Query: 261 GIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL------------GVSTPDIVIDS 305
P V TPL + + Y + + SVG + + +V+DS
Sbjct: 267 A-PEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDS 325
Query: 306 GTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEVTIHFR 360
GT++T F Y + + ++ + + V + CY +++ P V +HF
Sbjct: 326 GTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVVLHFA 385
Query: 361 -GADVKLSRSNFFV-KVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
GADV L N+ V + S C + + + + GN++Q F V +D+E + V F+P
Sbjct: 386 GGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERVGFEP 445
Query: 418 TDC 420
C
Sbjct: 446 NGC 448
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 123/378 (32%), Positives = 183/378 (48%), Gaps = 44/378 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
YL+ ++ GTPP E L +ADTGSDLIW QC PP+ C + P F S+T
Sbjct: 52 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 111
Query: 145 SLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+PCS++QC + G V C Y+ Y DGS + G LA +T T+ + T
Sbjct: 112 VVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 171
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
A+ G+ FGCGT N G S T G++GLG G +S +Q + A FSYCL+ + +
Sbjct: 172 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 231
Query: 257 FGTNGIVSG-----PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI------- 301
++ + G TPL A TFY + + AI VGN+ L V +
Sbjct: 232 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 291
Query: 302 ---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQ---- 351
VIDSG+TLT+L G +L+S ++ + + LELCY+ +S S
Sbjct: 292 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSSAPA 351
Query: 352 ---VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVG 405
P +TI F +G ++L N+ V V++D+ C + + + + GN+MQ + V
Sbjct: 352 NGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVE 411
Query: 406 YDIEQQTVSFKPTDCTKQ 423
+D + F T+C
Sbjct: 412 FDRASARIGFARTECVAH 429
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/367 (32%), Positives = 179/367 (48%), Gaps = 41/367 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTPP + + D+GSDL+W QC PC QCY QD+PL+ P SST+ +P
Sbjct: 62 SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPC--LQCYAQDTPLYAPSNSSTFNPVP 119
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C S +C + + C Y Y D S S G A E+ T+ V +
Sbjct: 120 CLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDD-----VRIDK 174
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS-STKIN 256
+ FGCG +N G F + G++GLG G +S SQ+ KF+YCLV P S S+ +
Sbjct: 175 VAFGCGRDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLI 233
Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VI 303
FG I + + TP+ ++ T Y + I+ + VG + L +S +
Sbjct: 234 FGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIF 293
Query: 304 DSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
DSGTT+T+ LP Y N+L+ + P A L+LC + Q P TI
Sbjct: 294 DSGTTVTYWLPPAYR-NILAAFDKNVR-YPRAASVQGLDLCVDVTGVDQPSFPSFTIVLG 351
Query: 361 GADV-KLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVSFK 416
G V + + N+FV V+ ++ C G+ +SV + GN++Q NFLV YD E+ + F
Sbjct: 352 GGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFA 411
Query: 417 PTDCTKQ 423
P C+
Sbjct: 412 PAKCSSH 418
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 173/355 (48%), Gaps = 30/355 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ +++ + GTP + DTGSD+ W QC PC CY Q P+FDP S+TY +
Sbjct: 131 DTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSVV 189
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC QCA+ + CS C Y V YGDGS S G L+ ET++L ST ALPG FGC
Sbjct: 190 PCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTR----ALPGFAFGC 245
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVS 264
G N G F G++GLG G +SL SQ + G FSYCL ++T + G S
Sbjct: 246 GQTNLGDFG-DVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPAS 304
Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPD-IVIDSGTTLTFL-PQG 315
V T + + + +FY + + +I +G L V T D +DSGT LT+L P+
Sbjct: 305 NDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTILTYLPPEA 364
Query: 316 YNS--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF- 372
Y + + + + P DP + CY F S + + F+ +D + +FF
Sbjct: 365 YTALRDRFKFTMTQYKPAPAYDP---FDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFG 421
Query: 373 VKVSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + D I C F +++P I GN+ Q N V YD+ + + F C
Sbjct: 422 ILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 133/394 (33%), Positives = 197/394 (50%), Gaps = 60/394 (15%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
SSS QA + Y + IS+GTPP + + DTGS+LIW QC PC ++C+ + +
Sbjct: 75 SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
P+ P SST+ LPC+ S C L ++C+ C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T+G T P + FGC T NG ++GIVGLG G +SL+SQ+ G+FSYCL
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL---AVGRFSYCL 240
Query: 248 ----VPVSSTKINFGT-NGIVSGPGVVSTPLTK-----AKTFYVLTIDAISVGNQRLGVS 297
++ I FG+ + G V STPL K T Y + + I+V + L V+
Sbjct: 241 RSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVT 300
Query: 298 TPDI-----------VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLEL 342
++DSGTTLT+L + GY S M+++ + P + L+L
Sbjct: 301 GSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDL 360
Query: 343 CYSFNS-----LSQVPEVTIHFR-GADVKLSRSNFFVKVSED------IVCSVFKGITNS 390
CY ++ +VP + + F GA + N+F V D + C + T+
Sbjct: 361 CYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDD 420
Query: 391 VP--IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+P I GN+MQ + + YDI+ SF P DC K
Sbjct: 421 LPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 182/366 (49%), Gaps = 34/366 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S++Y+++
Sbjct: 147 SGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFDQRGPVFDPMASTSYRNVT 204
Query: 148 CSSSQC-------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C ++C A +S C Y YGD S + G+LA E T+ T + +
Sbjct: 205 CGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVD 264
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINF 257
G+ GCG N GLF+ ++GLG G +S SQ+R FSYCLV S +KI F
Sbjct: 265 GVVLGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVF 323
Query: 258 G-TNGIVSGPGVVST---PLTKAKTFYVLTIDAISVGNQRL-------GVSTPD----IV 302
G N ++S P + T P TFY + + I VG + L GVS D +
Sbjct: 324 GDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTI 383
Query: 303 IDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 359
IDSGTTL++ P+ Y + + + M +A P+ L CY+ + + +VPE ++ F
Sbjct: 384 IDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLF 443
Query: 360 -RGADVKLSRSNFFVKV-SEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA N+F+++ +E I+C G S + I GN Q NF V YD+ + F
Sbjct: 444 ADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFA 503
Query: 417 PTDCTK 422
P C +
Sbjct: 504 PRRCAE 509
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 128/425 (30%), Positives = 193/425 (45%), Gaps = 45/425 (10%)
Query: 29 FSVELIHRDS-PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
+++ L+HRD P + N + R+R R L + ++SS +
Sbjct: 59 YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFG 118
Query: 82 ADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+D++ + Y +RI +G+PP ++ V D+GSD++W QC+PC CY Q P+FDP
Sbjct: 119 SDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDP 176
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S +Y + C SS C + C C+Y V YGDGS++ G LA ET+T T + V
Sbjct: 177 AKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNV 236
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTK 254
A+ GCG N G+F ++G+GGG +S + Q+ G F YCLV S+
Sbjct: 237 AM-----GCGHRNRGMFIGAAG-LLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGS 290
Query: 255 INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
+ FG + G V PL +A +FY + + + VG R + PD
Sbjct: 291 LVFGREALPVGASWV--PLVRNPRAPSFYYVGLKGLGVGGVR--IPLPDGVFDLTETGDG 346
Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTI 357
+V+D+GT +T LP G + S P A + CY + +VP V+
Sbjct: 347 GVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 406
Query: 358 HF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+F G + L NF + V + C F + I GNI Q V +D V F
Sbjct: 407 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 466
Query: 416 KPTDC 420
P C
Sbjct: 467 GPNVC 471
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 139/435 (31%), Positives = 205/435 (47%), Gaps = 61/435 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLR-DALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
FS+EL P+ + S Y+ L L R R+ N ++ S ++D++P
Sbjct: 80 FSLEL----HPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPM 135
Query: 88 N---------------------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
+ Y +R+ IG P V DTGSD+ W QC+PC
Sbjct: 136 DTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPC--DD 193
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
CY Q P+FDP SS++ L C + QC +L+ +C +C Y VSYGDGS++ G+ ATET
Sbjct: 194 CYQQVDPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATET 253
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
V+ G++ ++ + GCG +N GLF ++GLGGG +SL SQ++ A FSYC
Sbjct: 254 VSFGNSG----SVDKVAIGCGHDNEGLFVGAAG-LIGLGGGPLSLTSQIK---ASSFSYC 305
Query: 247 LV---PVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL------ 294
LV V S+ + F + V+ P+ +K TFY + I +SVG ++L
Sbjct: 306 LVNRDSVDSSTLEFNS---AKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSI 362
Query: 295 ----GVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
G I++D GT +T L Q YN+ L + + P + CY+ +S
Sbjct: 363 FEVDGSGKGGIIVDCGTAVTRLQTQAYNA-LRDTFVKLTKDLPSTSGFALFDTCYNLSSR 421
Query: 350 S--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
+ +VP V F G + L SN+ + V S C F T S+ I GN+ Q V
Sbjct: 422 TSVRVPTVAFLFDGGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVT 481
Query: 406 YDIEQQTVSFKPTDC 420
YD+ VSF C
Sbjct: 482 YDLANSQVSFSSRKC 496
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 145/424 (34%), Positives = 198/424 (46%), Gaps = 47/424 (11%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R + + S S A+ A
Sbjct: 68 LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
+P NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186
Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SS+Y ++PC CA L +CS C Y VSYGDGS + G +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVS 251
A+ G FGCG GLFN G++GLG SL+ Q T G FSYCL P +
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301
Query: 252 STKINFGTNGIV-SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDI----VI 303
+ + G G + PG +T P A T+YV+ + ISVG Q+L V V+
Sbjct: 302 AGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
D+GT +T LP + L S S + + P A G L+ CY+F V P V + F
Sbjct: 362 DTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421
Query: 360 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L C F G + I GN+ Q +F V I+ +V FK
Sbjct: 422 GSGATVMLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474
Query: 417 PTDC 420
P+ C
Sbjct: 475 PSSC 478
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 124/353 (35%), Positives = 172/353 (48%), Gaps = 34/353 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG PP+ V DTGSD+ W QC PC ++CY Q P F+P S+++ SL
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPXFEPTSSASFTSLS 205
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C C Y VSYGDGS++ G+ TETVTLGST +L I GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
NN GLF + GG +S SQ+ A FSYCLV S++ ++F N ++
Sbjct: 261 HNNEGLFIGAAGLLGLGGGS-LSFPSQLN---ASSFSYCLVDRDSDSTSTLDF--NSPIT 314
Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD----------IVIDSGTTLTF 311
P V+ PL + TF+ L + +SVG L + I++DSGT +T
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSR 368
L + L A + CY +S S +VP V+ HF G ++ L
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433
Query: 369 SNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V SE C F +++ I GN Q VG+D+ V F P C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 126/354 (35%), Positives = 172/354 (48%), Gaps = 30/354 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI +GTPP V DTGSD++W QC PC CY Q P+F+P S ++ +
Sbjct: 39 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKVL 96
Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C + C L C+ C Y VSYGDGS++ G TET+T T + VAL GC
Sbjct: 97 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----GC 151
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNGI 262
G +N GLF ++GLG G +S SQ T KFSYCLV S+ + + FG N
Sbjct: 152 GHDNEGLFVGAAG-LLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFG-NSA 209
Query: 263 VSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL-GVSTPD----------IVIDSGTTL 309
VS + LT + TFY + + ISVG + G++ ++ID GT++
Sbjct: 210 VSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 269
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLS 367
T L + L + + A + CY + + +VP V +HFRGADV L
Sbjct: 270 TRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLP 329
Query: 368 RSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
SN+ + V C F G T+ + I GNI Q F V YD+ V F P C
Sbjct: 330 ASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 121/359 (33%), Positives = 188/359 (52%), Gaps = 33/359 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ +G+P + DTGS W QC+PC C++Q+ P+F+P S TYK++P
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVP 158
Query: 148 C-----SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C SS + A+LN+ +CS + C Y SYGD SFS G L+ + +TL T Q L
Sbjct: 159 CSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQ--TLS 214
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK------ 254
+GCG +N GLF +T GI+GL ++S++SQ+ FSYCL ST
Sbjct: 215 SFVYGCGQDNQGLFG-RTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEG 273
Query: 255 -INFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----VIDSG 306
++ GT+ + TPL K + Y + +++I+V + LGV+ +IDSG
Sbjct: 274 FLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSG 333
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQV-PEVTIHFR-G 361
T +T LP + L + +++ + P S L+ C+ S +S+V P++ I F+ G
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGG 393
Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
AD++L N V++ I C G ++S+ I GN Q V YD+ V F P C
Sbjct: 394 ADLQLKGHNSLVELETGITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 133/394 (33%), Positives = 198/394 (50%), Gaps = 60/394 (15%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
SSS QA + Y + IS+GTPP + + DTGS+LIW QC PC ++C+ + +
Sbjct: 75 SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
P+ P SST+ LPC+ S C L ++C+ C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T+G T P + FGC T NG ++GIVGLG G +SL+SQ+ G+FSYCL
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL---AVGRFSYCL 240
Query: 248 ----VPVSSTKINFGTNGIVSGPGVV-STPLTK-----AKTFYVLTIDAISVGNQRLGVS 297
++ I FG+ ++ VV STPL K T Y + + I+V + L V+
Sbjct: 241 RSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVT 300
Query: 298 TPDI-----------VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLEL 342
++DSGTTLT+L + GY S M+++ + P + L+L
Sbjct: 301 GSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDL 360
Query: 343 CYSFNS-----LSQVPEVTIHFR-GADVKLSRSNFFVKVSED------IVCSVFKGITNS 390
CY ++ +VP + + F GA + N+F V D + C + T+
Sbjct: 361 CYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDD 420
Query: 391 VP--IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+P I GN+MQ + + YDI+ SF P DC K
Sbjct: 421 LPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 129/441 (29%), Positives = 203/441 (46%), Gaps = 64/441 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
+ ++HRD+ P + + R R A H Q S+ S+ A+ AD++
Sbjct: 30 LHIPVVHRDAVFPPRRGAPPGSF-RCRHAAP-------HTAQLESLHSATAA-ADLLRSP 80
Query: 87 -------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
++ Y I +G PPT L V DTGSDLIW QC PC +CY Q +PL+DP+
Sbjct: 81 VMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPC--RRCYRQVTPLYDPRN 138
Query: 140 SSTYKSLPCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
S T++ +PC+S QC L C C Y V YGDGS S+G+LAT+T+ L T
Sbjct: 139 SKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDT--- 195
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL------VPV 250
+ +T GCG +N GL S G++G G G +S +Q+ FSYCL
Sbjct: 196 -RVHNVTLGCGHDNEGLLAS-AAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARN 253
Query: 251 SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL------------G 295
SS+ + FG + P TPL + + Y + + SVG +R+
Sbjct: 254 SSSYLVFGRTPEL--PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPA 311
Query: 296 VSTPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLS-- 350
+V+DSGT ++ F Y + + +S A + + + + CY +
Sbjct: 312 TGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPG 371
Query: 351 ---QVPEVTIHF-RGADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNF 402
+VP + +HF AD+ L ++N+ + V C + + + + GN+ Q F
Sbjct: 372 TGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGF 431
Query: 403 LVGYDIEQQTVSFKPTDCTKQ 423
V +D+E+ + F P C+ +
Sbjct: 432 GVVFDVERGRIGFTPNGCSGE 452
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 177/373 (47%), Gaps = 46/373 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
++ Y I++G PPT L V DTGSDLIW QC PC CY Q +PL+DP+ SST++ +
Sbjct: 84 DSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPC--RHCYRQVTPLYDPRSSSTHRRI 141
Query: 147 PCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
PC+S +C L C C Y V YGDGS S+G+LAT+ + T + +T
Sbjct: 142 PCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDT----HVHNVT 197
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
GCG +N GL S G++G+G G +S +Q+ FSYCL S N G++ +V
Sbjct: 198 LGCGHDNVGLLES-AAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQN-GSSYLV 255
Query: 264 SG-----PGVVSTPLT---KAKTFYVLTIDAISVGNQRL------------GVSTPDIVI 303
G P TPL + + Y + + SVG +R+ IV+
Sbjct: 256 FGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVV 315
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEA----QPVADPTGSLELCYSFN------SLSQVP 353
DSGT ++ + + + S A + +A + CY + +VP
Sbjct: 316 DSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVP 375
Query: 354 EVTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
+ +HF GAD+ L ++N+ + V C + + + + GN+ Q F + +D+
Sbjct: 376 SIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDV 435
Query: 409 EQQTVSFKPTDCT 421
E+ + F P C+
Sbjct: 436 ERGRIGFTPNGCS 448
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 176/351 (50%), Gaps = 26/351 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q PLFDP S+++ +
Sbjct: 40 SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CSS+ C + C+ C+Y VSYGDGS++ G LA ET+T G T + VA+ GCG
Sbjct: 98 CSSAVCDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTFGRTVVRNVAI-----GCG 152
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP- 266
+N G+F ++GLGGG +S + Q+ FSYCLV + F G + P
Sbjct: 153 HSNRGMFVGAAG-LLGLGGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPV 211
Query: 267 GVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLP 313
G PL +A +FY + + + VG+ R+ VS + +V+D+GT +T P
Sbjct: 212 GAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFP 271
Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSN 370
+ + P A + CY+ F LS +VP V+ +F G + + +N
Sbjct: 272 TVAYEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANN 331
Query: 371 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F + V + C F + + I GNI Q + D + V F P C
Sbjct: 332 FLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 168/338 (49%), Gaps = 32/338 (9%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSDLIWTQC PC C Q +P FD K S+TY++LPC SS+CASL+ SC C Y
Sbjct: 2 DTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVY 59
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
YGD + + G LA ET T G+ V I FGCG+ N G + ++G+VG G G
Sbjct: 60 QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGP 118
Query: 229 ISLISQMRTTIAGKFSYCL---VPVSSTKINFG------TNGIVSGPGVVSTPLT---KA 276
+SL+SQ+ + +FSYCL + + +++ FG + SG V STP
Sbjct: 119 LSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPAL 175
Query: 277 KTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
Y L++ AIS+G + L + T ++IDSGT++T+L Q + + S
Sbjct: 176 PNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS 235
Query: 327 MIEAQPVADPTGSLELCYSF----NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCS 382
I + D L+ C+ + N VP++ HF A++ L N+ + S
Sbjct: 236 AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLC 295
Query: 383 VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ T I GN Q N + YDI +SF P C
Sbjct: 296 LVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 132/408 (32%), Positives = 199/408 (48%), Gaps = 53/408 (12%)
Query: 53 RLRDALTRSLNRLNHFNQNSSI------SSSKASQADIIPNNANYLIRISIGTPPTERLA 106
+ +A+ R +R+ + ++ +SS + QA + Y + IS+GTP
Sbjct: 42 KYSEAVRRDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFPV 101
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV 164
VADTGSDLIWTQC PC ++C+ Q +P F P SST+ LPC+SS C L + ++C+
Sbjct: 102 VADTGSDLIWTQCAPC--TKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNAT 159
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y+ YG G ++ G LATET+ +G + P + FGC T NG + T+GI GL
Sbjct: 160 GCVYNYKYGSG-YTAGYLATETLKVGD-----ASFPSVAFGCSTENG--VGNSTSGIAGL 211
Query: 225 GGGDISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSGPGVVSTPLTK----AK 277
G G +SLI Q+ G+FSYCL ++ I FG+ ++ V STP
Sbjct: 212 GRGALSLIPQLGV---GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHP 268
Query: 278 TFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLTFLPQ-GYNSNLLSVMS 325
++Y + + I+VG L V+T ++DSGTTLT+L + GY + +S
Sbjct: 269 SYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 328
Query: 326 SMIEAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCS 382
V + T L+LC+ VP + + F G + + +F V D S
Sbjct: 329 QTANVTTV-NGTRGLDLCFKSTGGGGGIAVPSLVLRFDGG-AEYAVPTYFAGVETDSQGS 386
Query: 383 VFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
V +P + GN+MQ + + YD++ SF P DC K
Sbjct: 387 VTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADCAK 434
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 134/357 (37%), Positives = 185/357 (51%), Gaps = 42/357 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG P E V DTGSD+ W QC PC + CY Q P+F+P SS+Y+ L
Sbjct: 148 SGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 205
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC +L C C Y VSYGDGS++ G+ ATET+T+GST Q VA+ GCG
Sbjct: 206 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 260
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
+N GLF G++GLGGG ++L SQ+ TT FSYCLV S++ + FGT+
Sbjct: 261 HSNEGLF-VGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVEFGTS---L 313
Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTF 311
P V PL + TFY L + ISVG + L + + I+IDSGT +T
Sbjct: 314 PPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTR 373
Query: 312 LPQG-YNS---NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-V 364
L G YNS + L S + +A VA + CY+ ++ + +VP V HF G +
Sbjct: 374 LQTGIYNSLRDSFLKGTSDLEKAAGVA----MFDTCYNLSAKTTIEVPTVAFHFPGGKML 429
Query: 365 KLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N+ + V S C F +S+ I GN+ Q V +D+ + F C
Sbjct: 430 ALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 126/354 (35%), Positives = 179/354 (50%), Gaps = 36/354 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG P E V DTGSD+ W QC PC + CY Q P+F+P SS+Y+ L
Sbjct: 145 SGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 202
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC +L C C Y VSYGDGS++ G+ ATET+T+GST Q VA+ GCG
Sbjct: 203 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 257
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
+N GLF + GG ++L SQ+ TT FSYCLV S++ ++FGT+
Sbjct: 258 HSNEGLFVGAAGLLGLGGGL-LALPSQLNTT---SFSYCLVDRDSDSASTVDFGTS---L 310
Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTF 311
P V PL + TFY L + ISVG + L + + I+IDSGT +T
Sbjct: 311 SPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTR 370
Query: 312 LP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLS 367
L + YNS S + ++ + A + CY+ ++ + +VP V HF G + L
Sbjct: 371 LQTEIYNSLRDSFVKGTLDLEKAAG-VAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALP 429
Query: 368 RSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V S C F +S+ I GN+ Q V +D+ + F C
Sbjct: 430 AKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 116/336 (34%), Positives = 172/336 (51%), Gaps = 28/336 (8%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC 161
+ DTGS L W QC+PC C+ Q PL+DP +S TYK L C+S +C A+LN C
Sbjct: 2 ILDTGSSLSWLQCQPCA-VYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 162 SGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
+ C Y+ SYGD SFS G L+ + +TL S+ LP T+GCG +N GLF +
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQFTYGCGQDNQGLFG-RAA 115
Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIVSGPGVVSTPL---T 274
GI+GL +S+++Q+ T FSYCL S+ F + G +S TP+ +
Sbjct: 116 GIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDS 175
Query: 275 KAKTFYVLTIDAISVGNQRLGVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
K + Y L + AI+V + L ++ +IDSGT +T LP + L ++
Sbjct: 176 KNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMST 235
Query: 331 QPVADPTGS-LELCY--SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKG 386
+ P S L+ C+ S S+S VPE+ + F+ GAD+ L + ++ + I C F G
Sbjct: 236 KYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAG 295
Query: 387 I--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
TN + I GN Q + + YD+ + F P C
Sbjct: 296 SSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 184/367 (50%), Gaps = 40/367 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ YL+ +++GTPP A+ DTGSDLIWTQC PC + C Q P+F P SS+Y+ +
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPC--ASCLPQPDPIFSPGASSSYEPM 157
Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VALPG 201
C+ C + SC + C Y SYGDG+ + G ATE T S++ ++ P
Sbjct: 158 RCAGELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAP- 216
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFG 258
+ FGCGT N G N+ +GIVG G +SL+SQ+ +FSYCL P +S + + FG
Sbjct: 217 LGFGCGTMNKGSLNNG-SGIVGFGRAPLSLVSQLAIR---RFSYCLTPYASGRKSTLLFG 272
Query: 259 T--NGI--VSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS------TPD----I 301
+ G+ + V +T L +++ TFY + ++VG +RL + PD
Sbjct: 273 SLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGA 332
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-------VPE 354
++DSGT LT P + ++ S + A+ + + F + + VP
Sbjct: 333 IVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPR 392
Query: 355 VTIHFRGADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+ H +GAD+ L R N+ + + +C + +S GN +Q + V YD+E T+
Sbjct: 393 MVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTL 452
Query: 414 SFKPTDC 420
SF P C
Sbjct: 453 SFAPAQC 459
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 124/358 (34%), Positives = 175/358 (48%), Gaps = 30/358 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY ++I +GTP + DTGS L W QC+PC C++Q P+F P +S TYK+L
Sbjct: 104 SGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSVSKTYKALS 162
Query: 148 C-----SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C SS + ++LN CS C Y SYGD SFS G L+ + +TL T A
Sbjct: 163 CSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSAAPSS 219
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
G +GCG +N GLF ++ GI+GL +S++ Q+ FSYCL S + N +
Sbjct: 220 GFVYGCGQDNQGLFG-RSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVS 278
Query: 261 GIVSGPGVVS-------TPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSG 306
G +S TPL K + Y L + I+V + LGVS +IDSG
Sbjct: 279 GFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSG 338
Query: 307 TTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GA 362
T +T LP YN+ S + M + A L+ C+ S +S VPE+ I FR GA
Sbjct: 339 TVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGA 398
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++L N V++ + C +N + I GN Q F V YD+ + F P C
Sbjct: 399 GLELKVHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDVANSKIGFAPGGC 456
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 143/430 (33%), Positives = 210/430 (48%), Gaps = 47/430 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-------------RDALTRSLNRLNHFNQNSSI 74
G + L H SP SP ++ P+ + R A T S +R + SS
Sbjct: 40 GLHLTLHHPRSPCSPAPLPADVPFSAVLTHDHARIASLAARLAKTPS-SRPTKLRRGSSS 98
Query: 75 SSSKASQADII--PNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
S S A + P + NY+ R+ +GTP + V DTGS L W QC PC S C+
Sbjct: 99 SPDAESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVS-CH 157
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNL 182
Q P+F+P+ SS+Y S+ CS+ QC A+LN +CS N C Y SYGD SFS G L
Sbjct: 158 RQSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYL 217
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
+ +TV+ GST+ +P +GCG +N GLF ++ G++GL +SL+ Q+ ++
Sbjct: 218 SKDTVSFGSTS-----VPNFYYGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYS 271
Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVST 298
FSYCL +S+ + + PG S TP+ K+ + Y + + I+V + L VS
Sbjct: 272 FSYCL--PTSSSSSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSA 329
Query: 299 PD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV 352
+IDSGT +T LP S L ++ ++ P A L+ C+ S +V
Sbjct: 330 SAYSSLPTIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRV 389
Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
P+V++ F GA +KL +N V V C F S I GN Q F V YD++
Sbjct: 390 PQVSMAFAGGAALKLKATNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNS 448
Query: 412 TVSFKPTDCT 421
+ F C+
Sbjct: 449 KIGFAAGGCS 458
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 130/422 (30%), Positives = 193/422 (45%), Gaps = 54/422 (12%)
Query: 40 KSPFYNSSETPYQRLRDA-LTRSLNRLNHFNQNSSISSSKASQADIIP------------ 86
++ + SS Y+ L A L R +R+ ++ + +++D+ P
Sbjct: 83 RTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALET 142
Query: 87 --------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
+ Y R+ IG+PP V DTGSD+ W QC PC + CY Q P+F+P
Sbjct: 143 PLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPS 200
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS+Y L C + QC SL+ C +C Y VSYGDGS++ G+ ATET+TL + +
Sbjct: 201 FSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGS----AS 256
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
L + GCG +N GLF + GG +S SQ+ A FSYCLV S++ +
Sbjct: 257 LNNVAIGCGHDNEGLFVGAAGLLGLGGGS-LSFPSQIN---ASSFSYCLVNRDTDSASTL 312
Query: 256 NFGT---NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------IV 302
F + + V+ P + + L TFY L + I VG Q L + I+
Sbjct: 313 EFNSPIPSHSVTAPLLRNNQL---DTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGII 369
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF- 359
+DSGT +T L ++L + P + CY +S S +VP V+ HF
Sbjct: 370 VDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFP 429
Query: 360 RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
G + L N+ + V S C F T+++ I GN+ Q V YD+ V F P
Sbjct: 430 DGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPN 489
Query: 419 DC 420
C
Sbjct: 490 GC 491
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 118/432 (27%), Positives = 204/432 (47%), Gaps = 70/432 (16%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSK----ASQADIIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS +RL +SS+ ++A ++ YL+++ +GTP
Sbjct: 42 TDHELLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCF 101
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC +CY Q P+F+P S++Y +PC+S C L+ C+
Sbjct: 102 TAAIDTASDLIWTQCQPC--VKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159
Query: 165 N-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
CQY+ SYG + + G LA + + +G + G+ FGC +++ G +
Sbjct: 160 GDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFR-----GVVFGCSSSSVGGPPPQ 214
Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---VSSTKINFGTNG---IVSGPGVVST 271
+G+VGLG G +SL+SQ+ +F YCL P S+ ++ G + + + V
Sbjct: 215 VSGVVGLGRGALSLVSQLSVR---RFMYCLPPPVSRSAGRLVLGADAAATVRNASERVVV 271
Query: 272 PL---TKAKTFYVLTIDAISVGNQ--------RLGVSTP--------------------- 299
P+ ++ ++Y L +D IS+G++ R+ +TP
Sbjct: 272 PMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSG 331
Query: 300 ------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LS 350
++ID +T+TFL + ++ + I + L+LC+ +S
Sbjct: 332 TGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMS 391
Query: 351 QV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
+V P V++ F G ++L + FV+ + + G T+ V I GN Q N V Y++
Sbjct: 392 RVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCLMVGKTDGVSILGNYQQQNMQVMYNL 451
Query: 409 EQQTVSFKPTDC 420
+ ++F T C
Sbjct: 452 RRGRITFIKTAC 463
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 195/416 (46%), Gaps = 49/416 (11%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH--------FNQNSSISSSKASQAD 83
+LIHRDS SP+Y S++T R + SL RL++ F+ N + S ++
Sbjct: 40 KLIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDINDLWLNLHPSASE 99
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSST 142
+ +L+ S+G PP +LA+ DTGS L+W QC PC C Q P+FDP +SST
Sbjct: 100 PL-----FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPC--KSCSQQIIGPMFDPSISST 152
Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
Y SL C + C C S C Y+ +Y +G S G +ATE + GS+ A+
Sbjct: 153 YDSLSCKNIICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNN 212
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+ FGC NG + + TG+ GLG G S+++QM KFSYC+ ++ ++ N
Sbjct: 213 VLFGCSHRNGNYKDRRFTGVFGLGSGITSVVNQM----GSKFSYCIGNIADP--DYSYNQ 266
Query: 262 IVSGPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTT 308
+V GV STPL Y + ++ ISVG RL + ++IDSGT
Sbjct: 267 LVLSEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTA 326
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN---SLSQVPEVTIHF-RGADV 364
T+L + L + ++++ S LCY L P VT HF GAD+
Sbjct: 327 PTWLAENEYRALEREVRNLLDRFLTPFMRESF-LCYKGKVGQDLVGFPAVTFHFAEGADL 385
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V +E SV+ + G + Q + V YD+ + + F+ DC
Sbjct: 386 --------VVDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/396 (30%), Positives = 187/396 (47%), Gaps = 30/396 (7%)
Query: 48 ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
E RLR R + + + S+ + + + + Y R+ IG+P
Sbjct: 2 ERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLE 61
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
DTGSD+ W QC PC S CY Q P++DP SS+Y+ + C S+ C +L+ +C G+ C
Sbjct: 62 LDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCS 119
Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
Y V YGD S S+G+L E+ LG + + A+ I FGCG +N GLF + ++G+GGG
Sbjct: 120 YRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGCGHSNSGLFRGEAG-LLGMGGG 176
Query: 228 DISLISQMRTTIAGKFSYCLVPV------SSTKINFGTNGIVSGPGVVSTPLTK---AKT 278
+S SQ+ +I FSYCLV S+ + FG I TPL K T
Sbjct: 177 TLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARF--TPLLKNPRIDT 234
Query: 279 FYVLTIDAISVGNQRL----------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
FY + ISVG L G T ++DSGT++T + + L +
Sbjct: 235 FYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAAS 294
Query: 329 EAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVF 384
P A L+ C++F L Q+P + +HF D+ L N + V C F
Sbjct: 295 RNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAF 354
Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + + GN+ Q F +G+D+++ ++ P +C
Sbjct: 355 APSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 130/424 (30%), Positives = 195/424 (45%), Gaps = 44/424 (10%)
Query: 29 FSVELIHRDS-PKSPFYNSSETPYQRLR---DALTRSLNRLNHFNQNSSISSSKASQ--A 82
+++ L+HRD P + N + R+R D ++ L R++ SS S + + +
Sbjct: 59 YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGS 118
Query: 83 DII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
DI+ + Y +RI +G+PP ++ V D+GSD++W QC+PC CY Q P+FDP
Sbjct: 119 DIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDPA 176
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S +Y + C SS C + C C+Y V YGDGS++ G LA ET+T T + VA
Sbjct: 177 KSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVA 236
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
+ GCG N G+F ++G+GGG +S + Q+ G F YCLV S+ +
Sbjct: 237 M-----GCGHRNRGMFIGAAG-LLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSL 290
Query: 256 NFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD------------ 300
FG + G V PL +A +FY + + + VG R + PD
Sbjct: 291 VFGREALPVGASWV--PLVRNPRAPSFYYVGLKGLGVGGVR--IPLPDGVFDLTETGDGG 346
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIH 358
+V+D+GT +T LP S P A + CY + +VP V+ +
Sbjct: 347 VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 406
Query: 359 F-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
F G + L NF + V + C F + I GNI Q V +D V F
Sbjct: 407 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 466
Query: 417 PTDC 420
P C
Sbjct: 467 PNVC 470
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 133/433 (30%), Positives = 197/433 (45%), Gaps = 49/433 (11%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--ISSSKA 79
+E + SV L+HR P + S+ P + L S R N+ +S ++S+
Sbjct: 48 LEPSSATLSVPLVHRYGPCAA-SQYSDMPTPSFSETLRHSRARTNYIKSRASTGMASTPD 106
Query: 80 SQADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A +P ++ Y++ + GTP ++ + DTGSD+ W QC PC ++CY Q
Sbjct: 107 DAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKD 166
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
PLFDP SSTY + C + C L N + G C Y V YGDGS + G + ET+
Sbjct: 167 PLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETI 226
Query: 188 TLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
T PGIT FGCG + G + K G++GLGG SL+ Q + G
Sbjct: 227 TFA---------PGITVKDFHFGCGHDQRGP-SDKFDGLLGLGGAPESLVVQTASVYGGA 276
Query: 243 FSYCLVPVSSTKINFGTNGI-----VSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRL 294
FSYCL P +++ F G+ + V TP L T Y++ + ISVG + L
Sbjct: 277 FSYCL-PALNSEAGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPL 335
Query: 295 GVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
+ ++IDSGT +T LP+ + L + + A P+ + + CY+F S
Sbjct: 336 DIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMV-ASEDFDTCYNFTGYS 394
Query: 351 Q--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
VP V + F GA + L N + +D + G + I GN+ Q V YD
Sbjct: 395 NVTVPRVALTFSGGATIDLDVPNGILV--KDCLAFRESGPDVGLGIIGNVNQRTLEVLYD 452
Query: 408 IEQQTVSFKPTDC 420
V F+ C
Sbjct: 453 AGHGKVGFRAGAC 465
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/357 (33%), Positives = 176/357 (49%), Gaps = 32/357 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG P DTGSD+ W QC PC S CY Q P++DP SS+Y+ +
Sbjct: 9 SGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRVY 66
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S+ C +L+ +C G+ C Y V YGD S S+G+L E+ LG + + A+ I FGCG
Sbjct: 67 CGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGCG 124
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------SSTKINFGTNG 261
+N GLF + ++G+GGG +S SQ+ +I FSYCLV S+ + FG
Sbjct: 125 HSNSGLFRGEAG-LLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTA 183
Query: 262 IVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL----------GVSTPDIVIDSGTT 308
I TPL K TFY + ISVG L G T ++DSGT+
Sbjct: 184 IPF--AARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTS 241
Query: 309 LT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
+T +P Y + L + P A L+ C++F L Q+P + +HF G D+
Sbjct: 242 VTRVVPPAY-AVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDM 300
Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N + V C F + + + GN+ Q F +G+D+++ ++ P +C
Sbjct: 301 VLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/308 (36%), Positives = 153/308 (49%), Gaps = 34/308 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFG 258
FGCG N G+F S TGI G G G +SL SQ++ G FS+C V+ K ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLP 252
Query: 259 TNGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDS 305
+ SG G V STPL + TFY L++ I+VG+ RL V T +IDS
Sbjct: 253 ADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDS 312
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGAD 363
GT +T LP + ++ ++ V+ T C S + VP++ +HF GA
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGAT 372
Query: 364 VKLSRSNF 371
+ L R N+
Sbjct: 373 MDLPRENY 380
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 177/355 (49%), Gaps = 34/355 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ +G+P + V DTGSD+ W QC+PC + CY Q P+FDP +S++Y S+
Sbjct: 159 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASV 216
Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C + +C L+ +C S C Y V+YGDGS++ G+ ATET+TLG + + +
Sbjct: 217 ACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAI 272
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG--T 259
GCG +N GLF ++ LGGG +S SQ+ T FSYCLV SS+ + FG
Sbjct: 273 GCGHDNEGLFVGAAG-LLALGGGPLSFPSQISAT---TFSYCLVDRDSPSSSTLQFGDAA 328
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL----------GVSTPDIVIDSGTTL 309
+ V+ P ++ +P T TFY + + ISVG Q L G +++DSGT +
Sbjct: 329 DAEVTAP-LIRSPRT--STFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTAV 385
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKL 366
T L + L ++ P + CY + + +VP V++ F G +++L
Sbjct: 386 TRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRL 445
Query: 367 SRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V C F +V I GN+ Q V +D + TV F C
Sbjct: 446 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 141/412 (34%), Positives = 200/412 (48%), Gaps = 47/412 (11%)
Query: 34 IHRDSPKSPFYNSSETPYQRLRDALTRS-LNRLNHFNQNSSIS---SSKASQADIIPNNA 89
+HRDS + + T Q + + +++S L L Q +S SS SQ +
Sbjct: 106 LHRDSSR---VQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTPVSSGTSQG-----SG 157
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y R+ +G P V DTGSD+ W QC+PC S CY Q P+F P SS+Y L C
Sbjct: 158 EYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPC--SDCYQQSDPIFTPAASSSYSPLTCD 215
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGT 208
S QC SL SC C+Y V+YGDGSF+ G+ TET++ GS T ++AL GCG
Sbjct: 216 SQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIAL-----GCGH 270
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSG 265
+N GLF ++GLGGG +SL SQ++ T FSYCLV +S+ ++F N G
Sbjct: 271 DNEGLFVGAAG-LLGLGGGPLSLTSQLKAT---SFSYCLVNRDSAASSTLDF--NSAPVG 324
Query: 266 PGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFL- 312
V++ L +K TFY + + +SVG + L + +++D GT +T L
Sbjct: 325 DSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQ 384
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRS 369
+ YNS L SM + CY + S +VP V+ HF G L +
Sbjct: 385 SEAYNS-LRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLPAA 443
Query: 370 NFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V S C F T+S+ I GN+ Q V +D+ V F C
Sbjct: 444 NYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 181/359 (50%), Gaps = 38/359 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
L FGCG NN GLF + ++GLG +SL+SQ T G FSYCL + +S +
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSL 303
Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
+FG + V + V TPL + ++FY+L + S+G L S+ I+IDSGT
Sbjct: 304 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTV 363
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
+T LP + P A L+ C++ S +P + + F+G +
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V ++ +FVK +VC ++ N V I GN Q N V YD Q+ + +C
Sbjct: 424 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 130/436 (29%), Positives = 194/436 (44%), Gaps = 52/436 (11%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL-TRSLNRLNHFNQNSSISSSKASQ 81
+ G +E+ R Q + D L RS+ NH + +S S S
Sbjct: 48 RKEKGAIILEMKDRGECSESERKGDWVEKQLVLDGLHVRSIQ--NHIRKRTSSSQIADSS 105
Query: 82 ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+P NY++ + +G+ + DTGSDL W QCEPC CY Q+ PL
Sbjct: 106 ETQVPLTSGIKFQTLNYIVTMGLGSQNMS--VIVDTGSDLTWVQCEPC--RSCYNQNGPL 161
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTL 189
F P S +Y+ + C+S+ C SL +C + C Y V+YGDGS+++G L E +
Sbjct: 162 FKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGF 221
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
G +++ FGCG NN GLF +G++GLG ++S+ISQ T G FSYCL
Sbjct: 222 G-----GISVSNFVFGCGRNNKGLFGG-ASGLMGLGRSELSMISQTNATFGGVFSYCL-- 273
Query: 250 VSSTKINFGTNGIVSG--PGVVS--TPLTKAK--------TFYVLTIDAISVGNQRLGVS 297
ST + +V G GV TP+ + FY+L + I VG L V
Sbjct: 274 -PSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQ 332
Query: 298 TPD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
+++DSGT ++ L L + P A L+ C++ QV
Sbjct: 333 ASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQV 392
Query: 353 --PEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYGNIMQTNFLVG 405
P ++++F G A++ + + F V ED VC +++ + I GN Q N V
Sbjct: 393 NIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVL 452
Query: 406 YDIEQQTVSFKPTDCT 421
YD + V F CT
Sbjct: 453 YDAKLSQVGFAKEPCT 468
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 181/359 (50%), Gaps = 38/359 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 86 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 141
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 142 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 196
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
L FGCG NN GLF + ++GLG +SL+SQ T G FSYCL + +S +
Sbjct: 197 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSL 255
Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
+FG + V + V TPL + ++FY+L + S+G L S+ I+IDSGT
Sbjct: 256 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTV 315
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
+T LP + P A L+ C++ S +P + + F+G +
Sbjct: 316 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 375
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V ++ +FVK +VC ++ N V I GN Q N V YD Q+ + +C
Sbjct: 376 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 181/359 (50%), Gaps = 38/359 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
L FGCG NN GLF + ++GLG +SL+SQ T G FSYCL + +S +
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSL 303
Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
+FG + V + V TPL + ++FY+L + S+G L S+ I+IDSGT
Sbjct: 304 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTV 363
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
+T LP + P A L+ C++ S +P + + F+G +
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V ++ +FVK +VC ++ N V I GN Q N V YD Q+ + +C
Sbjct: 424 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 132/396 (33%), Positives = 190/396 (47%), Gaps = 41/396 (10%)
Query: 56 DALTRSL-NRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTG 111
D RS+ NR+ + ++ +S+ + I NY++ + +G+ T + DTG
Sbjct: 26 DLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGS--TNMTVIIDTG 83
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN- 165
SDL W QCEPC CY Q P+F P SS+Y+S+ C+SS C SL N +C G N
Sbjct: 84 SDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGAC-GSNP 140
Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
C Y V+YGDGS++NG L E ++ G V++ FGCG NN GLF +G++G
Sbjct: 141 STCNYVVNYGDGSYTNGELGVEQLSFG-----GVSVSDFVFGCGRNNKGLFGG-VSGLMG 194
Query: 224 LGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK------ 277
LG +SL+SQ T G FSYCL S G S TP+T +
Sbjct: 195 LGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSVFKNVTPITYTRMLPNPQ 254
Query: 278 --TFYVLTIDAISVGNQRLGV---STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
FY+L + I V L V ++IDSGT +T LP L ++ P
Sbjct: 255 LSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFP 314
Query: 333 VADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGI 387
A L+ C++ +V P +++HF G A++K+ + F V ED VC +
Sbjct: 315 SAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASL 374
Query: 388 TNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+++ I GN Q N V YD +Q V F C+
Sbjct: 375 SDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 127/407 (31%), Positives = 186/407 (45%), Gaps = 27/407 (6%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA-SQADIIPNN 88
S LIH S SPF + T + + + NRL + S S A + + +
Sbjct: 53 SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSGS 112
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y+I++ GTP + DTGSD+ W C+ C Q +P+FDP SS+YK C
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFAC 169
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S C ++ CQ+ V YGDG+ +G LA++ +TLGS LP +FGC
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCAE 224
Query: 209 N-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIVSG 265
+ + ++S +G G + + G FSYCL SS + G VS
Sbjct: 225 SLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSS 284
Query: 266 PGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTFL-PQG 315
+ T L K TFY +T+ AISVGN R+ V +I +IDSGTT+T+L P
Sbjct: 285 SSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSA 344
Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV 373
Y + + QP P ++ CY +S S VP +T+H R D+ L + N +
Sbjct: 345 YKDLRDAFRQQLSSLQPT--PVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILI 402
Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ C F T+S I GN+ Q N+ + +D+ V F C
Sbjct: 403 TQESGLSCLAFSS-TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 133/440 (30%), Positives = 215/440 (48%), Gaps = 67/440 (15%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYN--SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+A+ +E +HR + +S +S +P + L + + ++
Sbjct: 97 KAEKDAVRIETMHRRAARSGVARMPASSSPRRALSERMVATV------------------ 138
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YLI + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 139 ESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 196
Query: 141 STYKSLPCSSSQCASL----NQKSC---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S+Y+++ C +C + ++C + +C Y YGD S + G+LA E+ T+ T
Sbjct: 197 SSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 256
Query: 194 -GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
G + + G+ FGCG N GLF+ ++GLG G +S SQ+R FSYCLV S
Sbjct: 257 PGASRRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHTFSYCLVEHGS 315
Query: 253 ---TKINFGTNGIV-SGPGVVSTPL----TKAKTFYVLTIDAISVGNQRLGVS--TPDI- 301
+K+ FG + +V + P + T + A TFY + + + VG L +S T D+
Sbjct: 316 DAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVG 375
Query: 302 -------VIDSGTTLT-FLPQGYN------SNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
+IDSGTTL+ F+ Y +L+S + +I PV +P CY+ +
Sbjct: 376 KDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNP------CYNVS 429
Query: 348 SLS--QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNF 402
+ +VPE+++ F GA N+FV++ D I+C +G + + I GN Q NF
Sbjct: 430 GVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIGNFQQQNF 489
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
V YD++ + F P C +
Sbjct: 490 HVVYDLQNNRLGFAPRRCAE 509
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 117/383 (30%), Positives = 181/383 (47%), Gaps = 61/383 (15%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC------PPSQCYMQDSPLFDPKMSS 141
+ Y + I +GTPP L VADTGSDL+W +C C PPS ++ P+ SS
Sbjct: 85 SGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL-------PRHSS 137
Query: 142 TYKSLPCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
++ C C L N C++ SY DGS S+G + ET TL S +G
Sbjct: 138 SFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197
Query: 195 QAVALPGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+ L G++FGCG +G FN G++GLG G IS SQ+ KFSYCL+
Sbjct: 198 SEIHLKGLSFGCGFRISGPSVSGAQFNG-ARGVMGLGRGSISFSSQLGRRFGNKFSYCLM 256
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--------------KTFYVLTIDAISVGNQRL 294
+ + T+ ++ G G+ S PLT A TFY +TI +I++ +L
Sbjct: 257 DYTLSPPP--TSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314
Query: 295 GVSTPDI-----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
++ P + V+DSGTTLT+L + +L + ++ A+ T +LC
Sbjct: 315 PIN-PAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLC 373
Query: 344 YSFNSLSQVPEV-TIHFR---GADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNI 397
+ + S+ P + + FR GA N+F++ E ++C + + N + GN+
Sbjct: 374 VNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNL 433
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
MQ FL+ +D E+ + F C
Sbjct: 434 MQQGFLLEFDKEESRLGFTRRGC 456
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 120/347 (34%), Positives = 161/347 (46%), Gaps = 28/347 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y++ +S+GTP + DTGSD+ W QC+PC C Q LFDP SSTY ++PC
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
+ C+ L + CSG C Y VSYGDGS + G ++T+ L G+T G FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG G+F + G++ LG +SL SQ G FSYCL S G S
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPTSA 314
Query: 266 PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLPQGYNS 318
G +T L A TFY++ + ISVG Q++ V V+D+GT +T LP +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374
Query: 319 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 373
L S I P A G L+ CY F+ V P V + F GA + L
Sbjct: 375 ALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432
Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+S + G I GN+ Q +F V +D TV F P C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 145/460 (31%), Positives = 216/460 (46%), Gaps = 72/460 (15%)
Query: 8 VFILFFLCFYVVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
VF+L LCF + TG G ++L H D + T +R+R A+ S RL
Sbjct: 6 VFLLVLLCFRASLVTSSSTGAGLRMKLTHVDD------KAGYTTEERVRRAVAVSRERLA 59
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPS 125
+ Q + +S A + Y+ IG PP A+ DTGS+LIWTQC C
Sbjct: 60 YTQQQQQLRASGDVSAPVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLK 119
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGN 181
C QD P ++ SST+ ++PC+ S CA+ C G++ C ++ SYG GS G+
Sbjct: 120 ACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLC-GLDGSCTFAASYGAGSV-FGS 177
Query: 182 LATETVTLGSTTGQAVALPGITFGC--------GTNNGGLFNSKTTGIVGLGGGDISLIS 233
L TE T S + + FGC G NG +G++GLG G +SL+S
Sbjct: 178 LGTEAFTFQSGAAK------LGFGCVSLTRITKGALNG------ASGLIGLGRGRLSLVS 225
Query: 234 QMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPG--VVSTPLTKA------KTFY 280
Q T A KFSYCL P +S+ + G + +SG G V S P K+ TFY
Sbjct: 226 Q---TGATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFY 282
Query: 281 VLTIDAISVGNQRL--------------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
L + ISVG +L G + ++ID+G+ +T L + S L ++
Sbjct: 283 YLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVAR 342
Query: 327 MIE---AQPVADPTGSLELCYSFNSLSQ-VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC 381
+ QP AD TG L+LC + + + VP + HF GAD+ +S +++ V + C
Sbjct: 343 QLNRSLVQPPAD-TG-LDLCVARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKSTAC 400
Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ + I GN Q + + YDI + +SF+ DC+
Sbjct: 401 MLIEEGGYETVI-GNFQQQDVHLLYDIGKGELSFQTADCS 439
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 137/450 (30%), Positives = 209/450 (46%), Gaps = 58/450 (12%)
Query: 14 LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
L FY+ + I + T + +LIHR+S P Y+ +ET R + T S+ R +
Sbjct: 17 LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76
Query: 68 FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
S I K+ +++ +IP N + +L+ +SIG+PP +L V DTGS L+W QC P
Sbjct: 77 LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
C C+ Q + FDP S ++K+L C +N C+ N +Y + Y G S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192
Query: 181 NLATETVTLGSTTGQAVALPGITFGCG-----TNNGGLFNSKTTGIVGLGGGDISLISQM 235
LA E++ + + ITFGCG TNN +N G+ GLG I+ M
Sbjct: 193 ILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYN----GVFGLGA--YPHIT-M 245
Query: 236 RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGN 291
T + KFSYC+ +++ + N +V G G STPL Y +T+ +ISVG+
Sbjct: 246 ATQLGNKFSYCIGDINNPL--YTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGS 303
Query: 292 QRLGVS----------TPDIVIDSGTTLTFLPQG----YNSNLLSVMSSMIEAQPVADPT 337
+ L + + ++IDSG T T L G ++ +M ++E P
Sbjct: 304 KTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKF 363
Query: 338 GSLELCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF---KGITNS 390
LC+ L P VT HF GAD+ L + F + D C +
Sbjct: 364 EG--LCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 421
Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + G + Q N+ VG+D+EQ V F+ DC
Sbjct: 422 LSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 26/351 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q PLFDP S+++ +
Sbjct: 40 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CSS+ C ++ C+ C+Y VSYGDGS + G LA ET+TLG T Q VA+ GCG
Sbjct: 98 CSSAVCDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTLGRTVVQNVAI-----GCG 152
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP- 266
N G+F ++GLGGG +S + Q+ FSYCLV + F G + P
Sbjct: 153 HMNQGMFVGAAG-LLGLGGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPV 211
Query: 267 GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLP 313
G PL + + ++Y + + + VG+ ++ +S +V+D+GT +T P
Sbjct: 212 GAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFP 271
Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSN 370
P A + CY+ F LS +VP V+ +F G + L +N
Sbjct: 272 TVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANN 331
Query: 371 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F + V + C F + + I GNI Q + D + V F P C
Sbjct: 332 FLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 133/429 (31%), Positives = 202/429 (47%), Gaps = 63/429 (14%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
+LIH S P Y +ET R+ + S R + S+ S+ +A + P+
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARVSPSLT 97
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
+ ISIG PP +L V DTGSD++W C PC + C LFDP MSST+ L
Sbjct: 98 GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNHLGLLFDPSMSSTFSPLC 155
Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC C+ C + ++V+Y D S ++G +TV +T +P + F
Sbjct: 156 KTPCDFKGCS-----RCDPI--PFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLF 208
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
GCG N G + GI+GL G SL T I KFSYC+ ++ N+ + ++
Sbjct: 209 GCGHNIGQDTDPGHNGILGLNNGPDSLA----TKIGQKFSYCIGDLADPYYNY--HQLIL 262
Query: 265 GPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLT 310
G G STP FY +T++ ISVG +RL ++ T ++ID+G+T+T
Sbjct: 263 GEGADLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTIT 322
Query: 311 FLPQGYN-------SNLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTI 357
FL + NLL S + IE P C+ + S+S+ P VT
Sbjct: 323 FLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQ-------CF-YGSISRDLVGFPVVTF 374
Query: 358 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGIT----NSVP-IYGNIMQTNFLVGYDIEQQ 411
HF GAD+ L +FF ++++++ C ++ S P + G + Q ++ VGYD+ Q
Sbjct: 375 HFADGADLALDSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQ 434
Query: 412 TVSFKPTDC 420
V F+ DC
Sbjct: 435 FVYFQRIDC 443
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 143/432 (33%), Positives = 205/432 (47%), Gaps = 53/432 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
FS+ L R + +P Y T + RL RDA L RSLN HF + SI+ S
Sbjct: 69 FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126
Query: 78 KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
+ P + A YL +I +G P V DTGSD+ W QC+PC CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK SS+Y L C+S QC L++ +C+ C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
G++ ++P + GCG +N GLF I GG ISL SQ++ A FSYCLV
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLFAGGAGLIGLGGGA-ISLSSQLK---ASSFSYCLV 298
Query: 249 PV---SSTKINFGTNGIVSGPGVVSTPLTKAKTFY---VLTIDAISVGNQRLGVSTPD-- 300
+ SS+ + F +N +++PL K F+ + + ISVG + L +S
Sbjct: 299 NLDSDSSSTLEFNSNMPSDS---LTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFE 355
Query: 301 --------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
I++DSGT ++ LP +L + + A + CY+F+ S V
Sbjct: 356 IDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNV 415
Query: 353 PEVTIHF---RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
TI F G ++L N+ + + + C F +S+ I G+ Q V YD+
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475
Query: 409 EQQTVSFKPTDC 420
V F C
Sbjct: 476 TNSLVGFSTNKC 487
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/415 (30%), Positives = 197/415 (47%), Gaps = 49/415 (11%)
Query: 50 PYQRLRDALTRSLNRLNHF----NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
P+ AL+ +RL+ F + S+ S S A + Y + + +GTPP + L
Sbjct: 46 PFTTPSQALSFDSHRLSFFFSALHTPQSLKSPVVSGAST--GSGQYFVDLRLGTPPQKLL 103
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCS 162
VADTGSDL+W +C C + S F + S+T+ C S C + C+
Sbjct: 104 LVADTGSDLVWVKCSACRNCTRHTPGS-AFLARHSTTFSPNHCYDSACQLVPLPKHHRCN 162
Query: 163 GVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN------NGG 212
C+Y SYGDGS ++G + ET TL +++G+ L GI FGC +G
Sbjct: 163 HARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGA 222
Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKINFGTNGIVSG 265
FN G++GLG G ISL SQ+ KFSYCL+ P S I N + G
Sbjct: 223 SFNG-AHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPG 281
Query: 266 PGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLT 310
+ TPL + TFY + I+++SV +L ++ P + ++DSGTTLT
Sbjct: 282 KRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPIN-PSVWALDELGNGGTIVDSGTTLT 340
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADV-KLS 367
FLP+ +L+V+ + A+PT +LC + + + ++P+++ G V
Sbjct: 341 FLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPP 400
Query: 368 RSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+FV ED+ C + + + + GN+MQ FL+ +D ++ + F C
Sbjct: 401 PRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/385 (32%), Positives = 180/385 (46%), Gaps = 63/385 (16%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-PLFDPKMSSTYKSLPC 148
YL+ +S+GTPP DTGSDL+WTQC PC C+ Q + P+ DP SST+ ++ C
Sbjct: 93 EYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPC--LNCFDQGAIPVLDPAASSTHAAVRC 150
Query: 149 SSSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQAVA 198
+ C +L SC +C Y YGD S + G LA++ T G + G V+
Sbjct: 151 DAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVS 210
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
+TFGCG N G+F + TGI G G G SL SQ+ T FSYC + + +
Sbjct: 211 ERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVT---SFSYCFTSMFESTSSLV 267
Query: 259 TNGIVSGP-----GVVSTPLTK---AKTFYVLTIDAISVGNQRLGV-------STPDIVI 303
T G+ V STPL + + Y L++ AI+VG R+ + +I
Sbjct: 268 TLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASAII 327
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGS-LELCYSFNSLS--------- 350
DSG ++T LP+ ++ + + AQ PV+ GS L+LC++ S +
Sbjct: 328 DSGASITTLPE----DVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWR 383
Query: 351 ----------QVPEVTIHF-RGADVKLSRSNF-FVKVSEDIVCSVFKGIT---NSVPIYG 395
+VP + H GAD +L R N+ F ++C V T + + G
Sbjct: 384 WRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIG 443
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
N Q N V YD+E +SF P C
Sbjct: 444 NYQQQNTHVVYDLENDVLSFAPARC 468
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 120/347 (34%), Positives = 161/347 (46%), Gaps = 28/347 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y++ +S+GTP + DTGSD+ W QC+PC C Q LFDP SSTY ++PC
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
+ C+ L + CSG C Y VSYGDGS + G ++T+ L G+T G FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG G+F + G++ LG +SL SQ G FSYCL S G S
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSA 314
Query: 266 PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLPQGYNS 318
G +T L A TFY++ + ISVG Q++ V V+D+GT +T LP +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374
Query: 319 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 373
L S I P A G L+ CY F+ V P V + F GA + L
Sbjct: 375 ALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432
Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+S + G I GN+ Q +F V +D TV F P C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 177/354 (50%), Gaps = 34/354 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +G+P + V DTGSD+ W QC+PC + CY Q P+FDP +S++Y S+
Sbjct: 164 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVA 221
Query: 148 CSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + +C L+ +C S C Y V+YGDGS++ G+ ATET+TLG + + + G
Sbjct: 222 CDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAIG 277
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG--TN 260
CG +N GLF ++ LGGG +S SQ+ T FSYCLV SS+ + FG +
Sbjct: 278 CGHDNEGLFVGAAG-LLALGGGPLSFPSQISAT---TFSYCLVDRDSPSSSTLQFGDAAD 333
Query: 261 GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLT 310
V+ P ++ +P T TFY + + +SVG Q L + +++DSGT +T
Sbjct: 334 AEVTAP-LIRSPRT--STFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVT 390
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLS 367
L + L ++ P + CY + + +VP V++ F G +++L
Sbjct: 391 RLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRLP 450
Query: 368 RSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N+ + V C F +V I GN+ Q V +D + TV F C
Sbjct: 451 AKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/344 (32%), Positives = 167/344 (48%), Gaps = 31/344 (9%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP MS+TY ++PC+S+ CA L
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS CQ+ ++YGDGS + G + + +TLG + G FGC + G
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
F+ G + LGGG SL+ Q T FSYCL P +S+ + F G+ P
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS-LGFLVLGVPPERAQLIPSF 336
Query: 269 VSTPL---TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLL 321
VSTPL + A TFY + + AI V + L V + VIDS T ++ LP L
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALR 396
Query: 322 SVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSED 378
+ S + A P L+ CY F + + P + + F GA V L + +
Sbjct: 397 AAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLG---- 452
Query: 379 IVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F ++ +P + GN+ Q V YD+ + + F+ C
Sbjct: 453 -SCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 138/409 (33%), Positives = 200/409 (48%), Gaps = 31/409 (7%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI--IPN 87
S LIH S SPF + T + + + NRL F + +S SS + + A++
Sbjct: 53 SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRL-RFLKRTSRSSKQDANANVPVRSG 111
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y+I++ GTP + DTGSD+ W C+ C Q +P+FDP SS+YK
Sbjct: 112 SGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFA 168
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S C ++ CQ+ VSYGDG+ +G LA++ +TLGS LP +FGC
Sbjct: 169 CDSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCA 223
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCL--VPVSSTKINFGTNGIV 263
+ S + G++GLGGG +SL++Q T G FSYCL SS + G V
Sbjct: 224 -ESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAV 282
Query: 264 SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTFL-P 313
S + T L K TFY +T+ AISVGN R+ V +I +IDSGTT+T L P
Sbjct: 283 SSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVP 342
Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNF 371
Y + + + QP P ++ CY +S S VP +T+H R D+ L + N
Sbjct: 343 SAYTALRDAFRQQLSSLQPT--PVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENI 400
Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + C F T+S I GN+ Q N+ + +D+ V F C
Sbjct: 401 LITQESGLACLAFSS-TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 136/422 (32%), Positives = 198/422 (46%), Gaps = 38/422 (9%)
Query: 29 FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
S+E++HR P N ++ P + R NR++ + SS QA
Sbjct: 60 LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 117
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+P +Y++ + +GTP E + DTGSD+ WTQCEPC + CY Q P +P
Sbjct: 118 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 176
Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
S++YK++ CSS+ C + +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 177 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 236
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
FGCG N G++GLG ++L SQ T FSYCL SS
Sbjct: 237 N----VFKNFLFGCGQQN-NGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSS 291
Query: 253 TKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
+K G VS V TPL+ + FY L I +SVG ++L + + VIDS
Sbjct: 292 SKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDS 350
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA- 362
GT +T L S L S +++ P + CY F+ ++P+V + F+G
Sbjct: 351 GTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGV 410
Query: 363 DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
++ + S V+ VC F G + I+GN+ Q + V YD + V F P
Sbjct: 411 EMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGG 470
Query: 420 CT 421
C+
Sbjct: 471 CS 472
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 138/422 (32%), Positives = 200/422 (47%), Gaps = 38/422 (9%)
Query: 29 FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
S+E++HR P N ++ P + R NR++ + SS QA
Sbjct: 48 LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 105
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+P +Y++ + +GTP E + DTGSD+ WTQCEPC + CY Q P +P
Sbjct: 106 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 164
Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
S++YK++ CSS+ C + +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 165 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 224
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
FGCG N GL G++GLG ++L SQ T FSYCL SS
Sbjct: 225 N----VFKNFLFGCGQQNNGL-FGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSS 279
Query: 253 TKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
+K G VS V TPL+ + FY L I +SVG ++L + + VIDS
Sbjct: 280 SKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDS 338
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA- 362
GT +T L S L S +++ P + CY F+ ++P+V + F+G
Sbjct: 339 GTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGV 398
Query: 363 DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
++ + S V+ VC F G + I+GN+ Q + V YD + V F P
Sbjct: 399 EMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGG 458
Query: 420 CT 421
C+
Sbjct: 459 CS 460
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 137/425 (32%), Positives = 194/425 (45%), Gaps = 50/425 (11%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL--RDAL-TRSLNRLNHFNQNSSISSSKASQADI 84
G +V L HR P SP ++ E L RD L + + N S + S A
Sbjct: 52 GTTVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAIT 111
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+P + Y+I +SIGTP + + DTGSD+ W C ++ S FDP
Sbjct: 112 LPTTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH----ARAGAGSSLFFDP 167
Query: 138 KMSSTYKSLPCSSSQCASLNQKS--CS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
SSTY CSS+ C L + CS CQY+V YGDGS + G ++T+ L ST
Sbjct: 168 GKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTE- 226
Query: 195 QAVALPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
+ FGC + GL +T G++GLGGG SL+SQ T FSYCL P +
Sbjct: 227 ---KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCL-PAT 282
Query: 252 STKINFGTNGIVSG-PGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----V 302
+ F T G +G G V+TP+ +A TFY + + I+VG + +S P + +
Sbjct: 283 TRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAIS-PTVFAAGSI 341
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
+DSGT +T LP S L + + + P A L+ C+ F V P V + F
Sbjct: 342 MDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVFS 401
Query: 361 GADVKLSRSNFFVKVSEDIV----CSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSF 415
G V V + D + C F T + I GN+ Q F V +D+ Q + F
Sbjct: 402 GGAV--------VDLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGF 453
Query: 416 KPTDC 420
+P C
Sbjct: 454 RPGAC 458
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 166/360 (46%), Gaps = 34/360 (9%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + +GTP + V DTGSD+ W QC PC + CY Q LF+P SS++K L C
Sbjct: 14 GEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPC--TNCYKQKDALFNPSSSSSFKVLDC 71
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
SSS C +L+ C C Y YGDGSF+ G L T+ V L G V L I GCG
Sbjct: 72 SSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCG 131
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
+N G F + GI+GLG G +S + + + FSYCL P + N + +
Sbjct: 132 HDNEGTFGT-AAGILGLGRGPLSFPNNLDASTRNIFSYCL-PDRESDPNHKSTLVFGDAA 189
Query: 268 VVSTPLTKAK-----------TFYVLTIDAISVGNQRLGVSTPDI-----------VIDS 305
+ T K T+Y + I ISVG L + + DS
Sbjct: 190 IPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDS 249
Query: 306 GTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG- 361
GTT+T L + Y + + ++ + AD + CY F ++ VP VT HF+G
Sbjct: 250 GTTITRLEARAYTAVRDAFRAATMHLTSAAD-FKIFDTCYDFTGMNSISVPTVTFHFQGD 308
Query: 362 ADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
D++L SN+ V VS +I C F + + GN+ Q +F V YD + + P C
Sbjct: 309 VDMRLPPSNYIVPVSNNNIFCFAFAA-SMGPSVIGNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 120/360 (33%), Positives = 174/360 (48%), Gaps = 36/360 (10%)
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
I + +Y RI +GTP VADTGSD+ W QC PC +CY Q P+F+P +SS++
Sbjct: 74 IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 131
Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
K L C+SS C L K CS N C Y VSYGDGSF+ G+ +TET++ G ++VA+
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 188
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
GCG NN GLF+ ++GLG G +S SQ T+ A FSYCL P + I +
Sbjct: 189 --GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSYASVFSYCL-PRRESAI---AASL 241
Query: 263 VSGPGVVST--------PLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVID 304
V GP V P + T+Y + + I V + + T +++D
Sbjct: 242 VFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVD 301
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR-G 361
SGT ++ L + L S++ P A + CY +S+ + +P V + F G
Sbjct: 302 SGTAISRLTTPAYTALRDAFRSLVTF-PSAPGISLFDTCYDLSSMKTATLPAVVLDFDGG 360
Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
A + L V V E C F + I GN+ Q F + D +++ + P C
Sbjct: 361 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 136/421 (32%), Positives = 198/421 (47%), Gaps = 38/421 (9%)
Query: 30 SVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADII 85
S+E++HR P N ++ P + R NR++ + SS QA +
Sbjct: 1 SLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATTL 58
Query: 86 P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P +Y++ + +GTP E + DTGSD+ WTQCEPC + CY Q P +P
Sbjct: 59 PVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPS 117
Query: 139 MSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S++YK++ CSS+ C + +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 118 TSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN 177
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
FGCG N G++GLG ++L SQ T FSYCL SS+
Sbjct: 178 ----VFKNFLFGCGQQN-NGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSS 232
Query: 254 KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSG 306
K G VS V TPL+ + FY L I +SVG ++L + + VIDSG
Sbjct: 233 KGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSG 291
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-D 363
T +T L S L S +++ P + CY F+ ++P+V + F+G +
Sbjct: 292 TVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVE 351
Query: 364 VKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + S V+ VC F G + I+GN+ Q + V YD + V F P C
Sbjct: 352 MDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411
Query: 421 T 421
+
Sbjct: 412 S 412
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 120/360 (33%), Positives = 174/360 (48%), Gaps = 36/360 (10%)
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
I + +Y RI +GTP VADTGSD+ W QC PC +CY Q P+F+P +SS++
Sbjct: 7 IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 64
Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
K L C+SS C L K CS N C Y VSYGDGSF+ G+ +TET++ G ++VA+
Sbjct: 65 KPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 121
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
GCG NN GLF+ ++GLG G +S SQ T+ A FSYCL P + I +
Sbjct: 122 --GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSYASVFSYCL-PRRESAI---AASL 174
Query: 263 VSGPGVVST--------PLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVID 304
V GP V P + T+Y + + I V + + T +++D
Sbjct: 175 VFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVD 234
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR-G 361
SGT ++ L + L S++ P A + CY +S+ + +P V + F G
Sbjct: 235 SGTAISRLTTPAYTALRDAFRSLVTF-PSAPGISLFDTCYDLSSMKTATLPAVVLDFDGG 293
Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
A + L V V E C F + I GN+ Q F + D +++ + P C
Sbjct: 294 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 144/432 (33%), Positives = 204/432 (47%), Gaps = 53/432 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
FS+ L R + +P Y T + RL RDA L RSLN HF + SI+ S
Sbjct: 69 FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126
Query: 78 KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
+ P + A YL +I +G P V DTGSD+ W QC+PC CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK SS+Y L C+S QC L++ +C+ C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
G++ ++P + GCG +N GLF I GG ISL SQ++ A FSYCLV
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLFAGGAGLIGLGGGA-ISLSSQLK---ASSFSYCLV 298
Query: 249 PV---SSTKINFGTNGIVSGPGVVSTPLTKAKTFY---VLTIDAISVGNQRLGVSTPD-- 300
+ SS+ + F N + + S PL K F+ + + ISVG + L +S
Sbjct: 299 NLDSDSSSTLEF--NSYMPSDSLTS-PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFE 355
Query: 301 --------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
I++DSGT ++ LP +L + + A + CY+F+ S V
Sbjct: 356 IDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNV 415
Query: 353 PEVTIHF---RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
TI F G ++L N+ + + + C F +S+ I G+ Q V YD+
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475
Query: 409 EQQTVSFKPTDC 420
V F C
Sbjct: 476 TNSIVGFSTNKC 487
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 134/415 (32%), Positives = 197/415 (47%), Gaps = 46/415 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
SV L HR P SP +S + L R R ++ + S S+ A+ D
Sbjct: 34 SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 93
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
+P + Y+I + +G+P + V DTGSD+ W QCEPCP PS C+ LF
Sbjct: 94 SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 153
Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
DP SSTY + CS++ CA L + +G + CQY V YGDGS + G +++ +TL
Sbjct: 154 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 212
Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL-- 247
+G V + G FGC G + KT G++GLGG S +SQ F YCL
Sbjct: 213 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPA 269
Query: 248 VPVSSTKINFGTNGIVSGPGV---VSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI 301
P SS + G G G +TP+ ++K T+Y ++ I+VG ++LG+S P +
Sbjct: 270 TPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS-PSV 328
Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
++DSGT +T LP + L S + + A+P G L+ C++F L +V P
Sbjct: 329 FAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPT 388
Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 407
V + F G V ++ V C F + + GN+ Q F V YD
Sbjct: 389 VALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 126/349 (36%), Positives = 179/349 (51%), Gaps = 26/349 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+LN SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 187 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
+GCG +N GLF ++ G++GL +SL+ Q+ ++ FSYCL P SS+ + +
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 299
Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
PG S TP+ + + Y + + I V + L VS+ +IDSGT +T LP
Sbjct: 300 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 359
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
G S L ++ ++ P A L+ C+ + +VPEVT+ F G + N
Sbjct: 360 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 419
Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V V C F S I GN Q F V YD++ + F C+
Sbjct: 420 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 139/445 (31%), Positives = 207/445 (46%), Gaps = 65/445 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNS------------SISS 76
V L+HRDS + + TP Q L L R R + + +SS
Sbjct: 61 LHVRLLHRDS-----FAVNATPAQLLARRLQRDELRAAWIIKAAAPAAAANDTPVVGLSS 115
Query: 77 SKASQADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A A ++ + Y+ +I++GTP E L DTGSD+ W QC+PC +CY Q
Sbjct: 116 GGAFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPC--RRCYPQSG 173
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQK---SCSGVNCQYSVSYG-DGSFSNGNLATETVT 188
P+FDP+ S++Y+ + + C +L + + C Y+V YG DGS + G+ ET+T
Sbjct: 174 PVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLT 233
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--AGKFSYC 246
V +P ++ GCG +N GLF + GI+GLG G IS SQ+ FSYC
Sbjct: 234 FAG----GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYC 289
Query: 247 LVP---------VSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL 294
L VSST + G P TP + TFY + + +SVG R+
Sbjct: 290 LADFFLSSPGRSVSST-LTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRV 348
Query: 295 GVSTPD------------IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVA--DPTGS 339
T D +++DSGT +T L + Y + + ++ ++ V+ P+G
Sbjct: 349 PGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGF 408
Query: 340 LELCYSFNSLS-QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYG 395
+ CY+ + +VP V++HF G ++ L N+ + V S VC F G + SV I G
Sbjct: 409 FDTCYTMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIG 468
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
NI Q F V Y+I V F P C
Sbjct: 469 NIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 126/349 (36%), Positives = 179/349 (51%), Gaps = 26/349 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+LN SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
+GCG +N GLF ++ G++GL +SL+ Q+ ++ FSYCL P SS+ + +
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 297
Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
PG S TP+ + + Y + + I V + L VS+ +IDSGT +T LP
Sbjct: 298 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 357
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
G S L ++ ++ P A L+ C+ + +VPEVT+ F G + N
Sbjct: 358 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 417
Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V V C F S I GN Q F V YD++ + F C+
Sbjct: 418 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 138/427 (32%), Positives = 203/427 (47%), Gaps = 51/427 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-IP-- 86
S+ L HR P +P SS + L + L R R +H + + S + +D+ IP
Sbjct: 61 SMPLAHRHGPCAPATTSS---WPSLAERLRRDRARRDHITRKAKASGRTTTLSDVSIPTS 117
Query: 87 -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
++ Y++ + IGTP ++ + DTGSDL W QC+PC S CY Q PL+DP SS
Sbjct: 118 LGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASS 177
Query: 142 TYKSLPCSSSQCASL----NQKSC---SGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
TY +PC S C L C SG + CQY + YG+ + G +TET+TL
Sbjct: 178 TYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL---- 233
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
V++ FGCG G F+ + G + SL+SQ T G FSYCL P +ST
Sbjct: 234 SPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPE-SLVSQTAETYGGAFSYCLPPGNST 292
Query: 254 ----KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIV 302
+ TN + G + TPL + TFY++ + +SVG + L + + ++
Sbjct: 293 TGFLALGAPTNNNDTA-GFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMI 351
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--VPEVTIH 358
IDSGT +T LP S L + + + A P+ P L+ CY+F ++ VP V +
Sbjct: 352 IDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVALT 411
Query: 359 FRGADVKLSRSNFFVKVSEDIV---CSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 413
F G + + V ++ C F G + V I GN+ Q F V YD + V
Sbjct: 412 FDGG------ATIDLDVPSGVLIQDCLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGHV 465
Query: 414 SFKPTDC 420
F+P C
Sbjct: 466 GFRPGAC 472
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/421 (31%), Positives = 196/421 (46%), Gaps = 38/421 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-----ISSSKASQADI 84
S+ L++R P +P +++ T + L R R NH + +S + S +
Sbjct: 57 SMPLMYRHGPCAP-ASAAATNRPSPAEMLRRDRARRNHILRKASGRRITLGVSIPTSLGA 115
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
++ Y++ + GTP ++ + DTGSDL W QC+PC S CY Q P+FDP SSTY
Sbjct: 116 FVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYA 175
Query: 145 SLPCSSSQCASLN--------QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+PC S C L+ S SG + CQY + YG+G + G +TET+TL
Sbjct: 176 PVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEA-- 233
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
A + +FGCG G+F+ + G + SL+SQ T G FSYCL +ST
Sbjct: 234 ATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPE-SLVSQTTGTYGGAFSYCLPAGNSTAG 292
Query: 256 NFGTNGIVSG----PGVVSTPLTKAK-TFYVLTIDAISVGNQRLGVS----TPDIVIDSG 306
+G G TPL + TFY++ + ISVG ++L + ++IDSG
Sbjct: 293 FLALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFAGGMIIDSG 352
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSF--NSLSQVPEVTIHFRGA 362
T +T LP+ S L + S + A P+ P L+ CY F N+ VP V + F G
Sbjct: 353 TIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGG 412
Query: 363 ---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
D+ + S + V G T I GN+ Q F V YD + V F+
Sbjct: 413 VTIDLDVP-SGVLLDGCLAFVAGASDGDTG---IIGNVNQRTFEVLYDSARGHVGFRAGA 468
Query: 420 C 420
C
Sbjct: 469 C 469
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 126/349 (36%), Positives = 179/349 (51%), Gaps = 26/349 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+LN SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
+GCG +N GLF ++ G++GL +SL+ Q+ ++ FSYCL P SS+ + +
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 297
Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
PG S TP+ + + Y + + I V + L VS+ +IDSGT +T LP
Sbjct: 298 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 357
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
G S L ++ ++ P A L+ C+ + +VPEVT+ F G + N
Sbjct: 358 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 417
Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V V C F S I GN Q F V YD++ + F C+
Sbjct: 418 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 132/435 (30%), Positives = 197/435 (45%), Gaps = 52/435 (11%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
G +++++HR +S + + L R NR+ ++ + + A+ IP
Sbjct: 59 GNTIQIVHRACLQSGDRKTVPDHHPHYTGILRRDHNRVRSIHRRLTGAGDTAA---TIPA 115
Query: 87 ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ Y++ I IGTP + DTGSDL W QC+PC S CY Q PLFDP S
Sbjct: 116 SLGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDS-CYQQQEPLFDPSKS 174
Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
STY +PC + QC +C G C+YSV YGD S + GNLA E TL + A
Sbjct: 175 STYVDVPCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA- 233
Query: 199 LPGITFGCGTN-----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVSS 252
G+ FGC G G++GLG GD S++SQ R +G FSYCL P S
Sbjct: 234 --GVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGS 291
Query: 253 TKINFGTNGIVSGP--GVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTPDI----V 302
+ + T G + P + TPL ++ + YV+ + ISV L + V
Sbjct: 292 SA-GYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTV 350
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG---SLELCYSF--NSLSQVPEVTI 357
IDSGT +T +P L + + P G SL+ CY + + P V +
Sbjct: 351 IDSGTVITHMPAAAYYVLRDEFRRHMGGYTML-PEGHVESLDTCYDVTGHDVVTAPPVAL 409
Query: 358 HF-RGADVKLSRSNFFVKVSED-------IVCSVFKGITNSVP---IYGNIMQTNFLVGY 406
F GA + + S + + D + C F + ++P I GN+ Q + V +
Sbjct: 410 EFGGGARIDVDASGILLVFAVDASGQSLTLACLAF--VPTNLPGFVIIGNMQQRAYNVVF 467
Query: 407 DIEQQTVSFKPTDCT 421
D+E + + F C+
Sbjct: 468 DVEGRRIGFGANGCS 482
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 182/377 (48%), Gaps = 46/377 (12%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC+PC C+ Q+ + PK SSTY+++ C
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGSHYYPKDSSTYRNISC 226
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVA 198
+C ++ + C N C Y Y DGS + G+ A+ET T+ T +
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SST 253
+ + FGCG N G F +G++GLG G IS SQ+++ FSYCL + S+
Sbjct: 287 VVDVMFGCGHWNKGFFYG-ASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSS 345
Query: 254 KINFG------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-------- 299
K+ FG N ++ +++ T +TFY L I +I VG + L +S
Sbjct: 346 KLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEG 405
Query: 300 -------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQ 351
+IDSG+TLTF P + I+ Q +A + CY+ + ++ Q
Sbjct: 406 AAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQ 465
Query: 352 V--PEVTIHFRGADV-KLSRSNFFVKVSED-IVC-SVFKGITNS-VPIYGNIMQTNFLVG 405
V P+ IHF V N+F + D ++C ++ K +S + I GN++Q NF +
Sbjct: 466 VELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFHIL 525
Query: 406 YDIEQQTVSFKPTDCTK 422
YD+++ + + P C +
Sbjct: 526 YDVKRSRLGYSPRRCAE 542
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 133/425 (31%), Positives = 197/425 (46%), Gaps = 63/425 (14%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
++LIH +S SP YNS +T + + + + S+ S P
Sbjct: 45 IKLIHHESSLSP-YNSKDTIWDHYSHKILKQ-----------TFSNDYISNLVPSPRYVV 92
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
+L+ SIG PP +LAV DTGS L W C PC S C Q P+FDP SSTY +L CS
Sbjct: 93 FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC--SSCSQQSVPIFDPSKSSTYSNLSCSE 150
Query: 151 -SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-- 207
++C +N + C YSV Y S G A E +TL + + +P + FGCG
Sbjct: 151 CNKCDVVNGE------CPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRK 204
Query: 208 ---TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
++NG + G+ GLG G SL+ + KFSYC+ + +T N+ N +V
Sbjct: 205 FSISSNGYPYQG-INGVFGLGSGRFSLLP----SFGKKFSYCIGNLRNT--NYKFNRLVL 257
Query: 265 GPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTL 309
G ST L Y + ++AIS+G ++L + + ++IDSG
Sbjct: 258 GDKANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADH 317
Query: 310 TFLPQGYNSNLLSV-MSSMIEAQPV---ADPTGSLELCYS---FNSLSQVPEVTIHF-RG 361
T+L + Y +LS + +++E V D LCYS LS P VT HF G
Sbjct: 318 TWLTK-YGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEG 376
Query: 362 ADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
A + L ++ F++ +E+ C + F S G + Q N+ VGYD+ + V F
Sbjct: 377 AVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYF 436
Query: 416 KPTDC 420
+ DC
Sbjct: 437 QRIDC 441
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/417 (28%), Positives = 192/417 (46%), Gaps = 61/417 (14%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
T ++ +R A+ RSL+R +N + +A ++P YL+++ IGTP A
Sbjct: 49 TDHELIRRAVQRSLDRPGVAARNRK---AVVGEAPLVPRGGEYLVKLGIGTPQHYFSAAI 105
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
DT SDL+W QC+PC CY Q P+F+P++SS+Y +PCSS C+ L+ C +
Sbjct: 106 DTASDLVWLQCQPC--VSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQA 163
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C+Y+ Y + +NG LA + + +G AV L GC ++ G + +G+VGL
Sbjct: 164 CRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVL-----GCSDSSVGGPPPQASGLVGLA 218
Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGVVSTPL-------TK 275
G +SL+SQ+ +F YCL P S K+ G VS + T+
Sbjct: 219 RGPLSLLSQLSVR---RFMYCLPPPMSRTPGKLVLGAGAGADAVRNVSDRVTVTMSSSTR 275
Query: 276 AKTFYVLTIDAISVGNQ-----RLGVSTP--------------------DIVIDSGTTLT 310
++Y L D ++VG+Q R S P +++D +T++
Sbjct: 276 YPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTIS 335
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNS-----LSQVPEVTIHFRGAD 363
FL L + I P A P+ L+LC+ VP V++ F G
Sbjct: 336 FLEASLYDELADDLEEEIRL-PRATPSTRLGLDLCFILPEGVGIDRVYVPTVSMSFDGRW 394
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++L R F++ ++C + G T+ V I GN Q N V Y++ + ++F C
Sbjct: 395 LELERDRLFLEDGR-MMC-LMIGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 140/442 (31%), Positives = 203/442 (45%), Gaps = 59/442 (13%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR-DA-----LTRSLNRLNHFNQNSSISS 76
A++G +EL H S S + +E + L DA L R + + + S+
Sbjct: 35 RAESGATVLELRHHASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASA 94
Query: 77 SKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
SK +Q + NY+ + IG E + DT S+L W QCEPC C+ Q
Sbjct: 95 SKLAQVPVTSGARLRTLNYVATVGIGG--GEATVIVDTASELTWVQCEPC--DACHDQQE 150
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL------NQKSCSG--VNCQYSVSYGDGSFSNGNLAT 184
PLFDP S +Y ++PC+SS C +L + ++C C Y++SY DGS+S G LA
Sbjct: 151 PLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAH 210
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
+ ++L Q G FGCGT+N G F T+G++GLG +SLISQ G FS
Sbjct: 211 DRLSLAGEDIQ-----GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGGVFS 264
Query: 245 YCLVPV---SSTKINFGTNGIV---SGP----GVVSTPLTKAKTFYVLTIDAISVGNQRL 294
YCL P SS + G + V S P +VS PL FY+ + I+VG +
Sbjct: 265 YCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQ--GPFYLANLTGITVGGED- 321
Query: 295 GVSTP--------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
V +P ++DSGT +T +P Y + +S + E P A P L+ C+
Sbjct: 322 -VQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAE-YPQAAPFSILDTCFD 379
Query: 346 FNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIM 398
L QVP + + F GA+V++ V+ D VC + + PI GN
Sbjct: 380 LTGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQ 439
Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
Q N V +D + F C
Sbjct: 440 QKNLRVIFDTVGSQIGFAQETC 461
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/287 (35%), Positives = 144/287 (50%), Gaps = 26/287 (9%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ +++GTPP DTGSDL+WTQC PC C+ Q PL DP SSTY +LPC
Sbjct: 85 EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFDQGIPLLDPAASSTYAALPCG 142
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-----TGQAVALPGITF 204
+ +C +L SC G +C Y YGD S + G +AT+ T G G A +TF
Sbjct: 143 APRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTF 202
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
GCG N G+F S TGI G G G SL SQ+ T FSYC + +K + T G
Sbjct: 203 GCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNAT---SFSYCFTSMFDSKSSIVTLGGAP 259
Query: 265 GP--------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---VIDSGTTLT 310
V +TPL K + Y L++ ISVG RL V +IDSG ++T
Sbjct: 260 AALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDSGASIT 319
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEV 355
LP+ + + ++ + P +L++C++ ++L + P V
Sbjct: 320 TLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPAV 366
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 125/349 (35%), Positives = 179/349 (51%), Gaps = 26/349 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+L+ SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 187 AQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
+GCG +N GLF ++ G++GL +SL+ Q+ ++ FSYCL P SS+ + +
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 299
Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
PG S TP+ + + Y + + I V + L VS+ +IDSGT +T LP
Sbjct: 300 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 359
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
G S L ++ ++ P A L+ C+ + +VPEVT+ F G + N
Sbjct: 360 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 419
Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V V C F S I GN Q F V YD++ + F C+
Sbjct: 420 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 134/432 (31%), Positives = 190/432 (43%), Gaps = 53/432 (12%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN---------HFNQNSS 73
++ + G +V L HR P SP S + + L R R N H+ +
Sbjct: 52 DSSSSGATVPLNHRHGPCSPV-PSGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGG 110
Query: 74 ISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ S+A+ + N Y+I +SIG+P DTGSD+ W +C+
Sbjct: 111 LQQSEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------- 160
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYSVSYGDGSFSNGNLATETV 187
S L+DP SSTY CS+ CA L ++ SG C YSV YGDGS + G ++T+
Sbjct: 161 -SRLYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTL 219
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
TL T+ ++ G FGC G T G++GLGG S +SQ T FSYCL
Sbjct: 220 TLAGTSEPLIS--GFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCL 277
Query: 248 VPV--SSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRL----GVST 298
P SS + G + +TP+ ++K TFY L + ISVG + L V +
Sbjct: 278 PPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS 337
Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMI--EAQPVADPTGSLELCYSFNSLSQ---- 351
++DSGT +T L P Y + + M + QP A P G L+ C+ F +
Sbjct: 338 AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAA-PRGLLDTCFDFTGHGEGNNF 396
Query: 352 -VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDI 408
VP V + G V N V+ C F + I GN+ Q F V YD+
Sbjct: 397 TVPSVALVLDGGAVVDLHPNGIVQDG----CLAFAATDDDGRTGIIGNVQQRTFEVLYDV 452
Query: 409 EQQTVSFKPTDC 420
Q F+P C
Sbjct: 453 GQSVFGFRPGAC 464
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/333 (33%), Positives = 161/333 (48%), Gaps = 29/333 (8%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-- 164
V DT SD+ W QC PCP QC++Q PL+DP SST+ +PC S C L +G
Sbjct: 172 VVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSP 231
Query: 165 ---NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
C+Y V+YGDG + G T+T+T+ T + + FGC G F+++ GI
Sbjct: 232 TTDECKYIVNYGDGKATTGTYVTDTLTMSPT----IVVKDFRFGCSHAVRGSFSNQNAGI 287
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-TPLTK---AK 277
+ LGGG SL+ Q FSYC +P S+ G V S TPL K A
Sbjct: 288 LALGGGRGSLLEQTADAYGNAFSYC-IPKPSSAGFLSLGGPVEASLKFSYTPLIKNKHAP 346
Query: 278 TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQP 332
TFY++ ++AI V ++L V V+DSG +T L PQ Y + + S+M P
Sbjct: 347 TFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGP 406
Query: 333 VADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-- 387
+A P +L+ CY F +VP+V++ F GA + L ++ + C F
Sbjct: 407 LAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIILD-----GCLAFAATPG 461
Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
SV GN+ Q + V YD+ V F+ C
Sbjct: 462 EESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 133/417 (31%), Positives = 190/417 (45%), Gaps = 45/417 (10%)
Query: 38 SPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NY 91
S K +N L D RS+ N + +S + +ASQ I ++ NY
Sbjct: 8 SEKKIDWNRRLQKQLILDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNY 65
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ + +G+ + DTGSDL W QCEPC CY Q P+F P SS+Y+S+ C+SS
Sbjct: 66 IVTMGLGSK--NMTVIIDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSS 121
Query: 152 QCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C SL N +C N C Y V+YGDGS++NG L E ++ G V++
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFG-----GVSVSDFV 176
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
FGCG NN GLF +G++GLG +SL+SQ T G FSYCL + G
Sbjct: 177 FGCGRNNKGLFGG-VSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNE 235
Query: 264 SGPGVVSTPLTKAK--------TFYVLTIDAISVGNQRLGV----STPDIVIDSGTTLTF 311
S + P+T + FY+L + I VG L I+IDSGT +T
Sbjct: 236 SSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITR 295
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSR 368
LP L + P A L+ C++ +V P +++ F G A + +
Sbjct: 296 LPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDA 355
Query: 369 SNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ F V ED VC ++++ I GN Q N V YD +Q V F C+
Sbjct: 356 TGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 198/425 (46%), Gaps = 59/425 (13%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNR----LNHFNQNSSISSSKASQ-- 81
+ +L HRD+ N +T ++ R + R + R LN N+N+ + +
Sbjct: 58 WKTKLFHRDN-----INLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEA 112
Query: 82 ---ADII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+D++ + Y +RI IG+P + V D+GSD++W QCEPC QCY Q P+
Sbjct: 113 SFGSDVVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPC--DQCYNQTDPI 170
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
F+P S+++ + CSS+ C L+ +C C Y V+YGDGS++ G LA ET+T+G T
Sbjct: 171 FNPATSASFIGVACSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTV 230
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----P 249
Q A+ GCG N G+F G++GLGGG +S + Q+ G F YCLV P
Sbjct: 231 IQDTAI-----GCGHWNEGMF-VGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMP 284
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TP 299
V + + ++ P +FY +++ ++VG R+ +S T
Sbjct: 285 VGAMWVP-----------LIHNPF--YPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTG 331
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTI 357
+V+D+GT +T LP + + P A + CY N +VP V+
Sbjct: 332 GVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSF 391
Query: 358 HFRGADVKLSRSNFFVKVSEDI--VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+F G + + F+ ++D+ C F + + I GNI Q V D V F
Sbjct: 392 YFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGF 451
Query: 416 KPTDC 420
P C
Sbjct: 452 GPNVC 456
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 139/441 (31%), Positives = 207/441 (46%), Gaps = 61/441 (13%)
Query: 23 EAQTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRL-------NHFNQNS 72
+ G +E+ R S + +N D RS+ NR+ N Q+S
Sbjct: 57 RKEKGAIVLEMKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSS 116
Query: 73 SISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
I AS ++ NY++ I +G + DTGSDL W QC+PC CY Q
Sbjct: 117 EIQIPLASGINL--ETLNYIVTIGLGNQ--NMTVIIDTGSDLTWVQCDPCMS--CYSQQG 170
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLAT 184
P+F+P SS+Y SL C+SS C +L N ++C N C ++VSYGDGSF++G L
Sbjct: 171 PVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGV 230
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
E ++ G +++ FGCG NN GLF +GI+GLG ++S+ISQ TT G FS
Sbjct: 231 EHLSFG-----GISVSNFVFGCGRNNKGLFGG-VSGIMGLGRSNLSMISQTNTTFGGVFS 284
Query: 245 YCL----------VPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL 294
YCL + + + F ++ +VS P + FYVL + I VG
Sbjct: 285 YCLPTTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNP--QLSNFYVLNLTGIDVG---- 338
Query: 295 GVSTPD-------IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
GV+ D I+IDSGT +T L P YN+ L + P+A L+ C++
Sbjct: 339 GVAIQDTSFGNGGILIDSGTVITRLAPSLYNA-LKAEFLKQFSGYPIAPALSILDTCFNL 397
Query: 347 NSLSQV--PEVTIHFR-GADVKLSRSN-FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQT 400
+ +V P +++HF D+ + ++ VC ++ N + I GN Q
Sbjct: 398 TGIEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQR 457
Query: 401 NFLVGYDIEQQTVSFKPTDCT 421
N V YD +Q + F DC+
Sbjct: 458 NQRVIYDAKQSKIGFAREDCS 478
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 122/369 (33%), Positives = 179/369 (48%), Gaps = 48/369 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
++ + +SIGTPP R + DTGSDLIWTQC+ Q ++ PL+DP SS++ + PC
Sbjct: 88 HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQ--HREKPLYDPAKSSSFAAAPCD 145
Query: 150 SSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S N K+CS C Y+ +YG + + G LA+ET T G +V+L FGCG
Sbjct: 146 GRLCETGSFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSL---DFGCG 201
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINFG----- 258
G +GI+G+ +SL+SQ++ +FSYCL P +++ I FG
Sbjct: 202 KLTSGSL-PGASGILGISPDRLSLVSQLQIP---RFSYCLTPFLDRNTTSHIFFGAMADL 257
Query: 259 ----TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVI-DSGTTLTFLP 313
T G + +V+ P + +Y + + ISVG +RL V I G+ TF+
Sbjct: 258 SKYRTTGPIQTTSLVTNP-DGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVD 316
Query: 314 QGYNSNLLS--VMSSMIEAQ------PVADPTG---SLELCYSF--------NSLSQVPE 354
G + +L VM ++ EA PV + T ELC+ + QVP
Sbjct: 317 SGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPP 376
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+ HF GA + L R ++ V+VS +C V I GN Q N V +D+E
Sbjct: 377 LVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGA-IIGNYQQQNMHVLFDVENHEF 435
Query: 414 SFKPTDCTK 422
SF PT C +
Sbjct: 436 SFAPTQCNQ 444
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 54/357 (15%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 88 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 145
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
S+ C L S ++ T G ++PG+ FGCG
Sbjct: 146 STLCQGLPVASLP--------------------RSDKFTF---VGAGASVPGVAFGCGLF 182
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVS 264
N G+F S TGI G G G +SL SQ++ G FS+C + S+ ++ + +
Sbjct: 183 NNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADLFSN 239
Query: 265 GPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTF 311
G G V +TPL + TFY L++ I+VG+ RL V T +IDSGT +T
Sbjct: 240 GQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTS 299
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGADVKLSRS 369
LP + ++ ++ V+ T C S + VP++ +HF GA + L R
Sbjct: 300 LPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRE 359
Query: 370 NFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N+ +V + I+C ++ +G V GN Q N V YD++ +SF P C K
Sbjct: 360 NYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDK 414
>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
Length = 739
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 145/240 (60%), Gaps = 17/240 (7%)
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----S 251
+V+ P I GCG NN G F+SK GIVGLGGG +SLIS + +I K+SYCLVP+ S
Sbjct: 55 SVSFPKIPIGCGLNNAGTFDSKCFGIVGLGGGVVSLISHIGLSIDSKYSYCLVPLFEFNS 114
Query: 252 STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTP--------DI 301
++KINFG N +V G G VSTP+ TFY L ++ +SVG++R+ +I
Sbjct: 115 TSKINFGENAVVEGLGTVSTPIIPGSFDTFYYLKLEGMSVGSKRIDFVDASTSNELKGNI 174
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 359
+IDSGTTLT L + + + L + + + I + V L LCY N+ +VP +T HF
Sbjct: 175 IIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILSLCYKSPPNNAIEVPIITTHF 234
Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
G D+ L+ N FV V +D + F + S I+GN+ Q N LVGYD+ ++TVSFKPTD
Sbjct: 235 AGVDIVLNSLNTFVSVFDDAMWFAFAPVA-SGSIFGNLAQMNHLVGYDLLRKTVSFKPTD 293
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 120/379 (31%), Positives = 182/379 (48%), Gaps = 40/379 (10%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + +A YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 136 ESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 193
Query: 141 STYKSLPCSSSQCASLNQKSCSGVN---------CQYSVSYGDGSFSNGNLATETVTLGS 191
S+Y++L C +C + C Y YGD S S G+LA E+ T+
Sbjct: 194 SSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNL 253
Query: 192 TT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP 249
T G + + G+ FGCG N GLF+ ++GLG G +S SQ+R G FSYCLV
Sbjct: 254 TAPGASSRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGGHTFSYCLVD 312
Query: 250 VSS---TKINFGTN---GIVSGPGVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTP 299
S +K+ FG + + + P + T + A TFY + + + VG + L +S+
Sbjct: 313 HGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSD 372
Query: 300 ----------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
+IDSGTTL+ F+ Y + + M + P L CY+ +
Sbjct: 373 TWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSG 432
Query: 349 LS--QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFL 403
+ +VPE+++ F GA N+F+++ D I+C G + + I GN Q NF
Sbjct: 433 VERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFH 492
Query: 404 VGYDIEQQTVSFKPTDCTK 422
V YD+ + F P C +
Sbjct: 493 VAYDLHNNRLGFAPRRCAE 511
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/338 (33%), Positives = 167/338 (49%), Gaps = 37/338 (10%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
V DTGSD+ W QC+PC + CY Q P+FDP +S++Y ++ C S +C L+ +C
Sbjct: 2 VLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATG 59
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y V+YGDGS++ G+ ATET+TLG +T + + GCG +N GLF ++ L
Sbjct: 60 ACLYEVAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAG-LLAL 114
Query: 225 GGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTNGIVSGPGVVSTPLTKA---K 277
GGG +S SQ+ A FSYCLV P +ST + FG + G V+ PL ++
Sbjct: 115 GGGPLSFPSQIS---ASTFSYCLVDRDSPAAST-LQFGDGAAEA--GTVTAPLVRSPRTS 168
Query: 278 TFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
TFY + + ISVG Q L + + +++DSGT +T L + L
Sbjct: 169 TFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQ 228
Query: 327 MIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCS 382
+ P + CY + + +VP V++ F G ++L N+ + V C
Sbjct: 229 GAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCL 288
Query: 383 VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F +V I GN+ Q V +D + V F P C
Sbjct: 289 AFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 139/464 (29%), Positives = 210/464 (45%), Gaps = 73/464 (15%)
Query: 14 LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
L FY+ + I + T + +LIHR+S P Y+ +ET R + T S+ R +
Sbjct: 17 LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76
Query: 68 FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
S I K+ +++ +IP N + +L+ +SIG+PP +L V DTGS L+W QC P
Sbjct: 77 LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
C C+ Q + FDP S ++K+L C +N C+ N +Y + Y G S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192
Query: 181 NLATETVTLG-------------STTGQAVALPGITFGCG-----TNNGGLFNSKTTGIV 222
LA E++ ST + ITFGCG TNN +N G+
Sbjct: 193 ILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYN----GVF 248
Query: 223 GLGGG-DISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV----STPLTKAK 277
GLG I+ M T + KFSYC+ +++ + N +V G G STPL
Sbjct: 249 GLGAYPHIT----MATQLGNKFSYCIGDINNPL--YTHNHLVLGQGSYIEGDSTPLQIHF 302
Query: 278 TFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLPQG----YNSNLLSV 323
Y +T+ +ISVG++ L + + ++IDSG T T L G ++ +
Sbjct: 303 GHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDL 362
Query: 324 MSSMIEAQPVADPTGSLELCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDI 379
M ++E P LC+ L P VT HF GAD+ L + F + D
Sbjct: 363 MKGLLERIPTQRKFEG--LCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDR 420
Query: 380 VCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C ++ + G + Q N+ VG+D+EQ V F+ DC
Sbjct: 421 FCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 464
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 137/423 (32%), Positives = 202/423 (47%), Gaps = 38/423 (8%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS------KASQAD 83
S++++H+ P S + + L + +R+ + S S + K + +
Sbjct: 75 SLKVVHKHGPCSKLSQDEASAAPTHTEILLQDQSRVKSIHSRLSNSKTSGGKDVKVTDST 134
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
IP + NY++ + +GTP + + DTGSD+ WTQC+PC S CY Q +FD
Sbjct: 135 TIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARS-CYKQKEQIFD 193
Query: 137 PKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
P S++Y ++ CSSS C SL N C+ C Y + YGD SFS G TE +TL S
Sbjct: 194 PSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTS 253
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
T A I FGCG NN + G++GLG +S++SQ FSYCL P S
Sbjct: 254 TD----AFNNIYFGCGQNN-QGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYCL-PSS 307
Query: 252 STKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----STPDIVI 303
S+ F T G + TPL + +FY L ISVG ++L + ST +I
Sbjct: 308 SSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFSTAGAII 367
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-R 360
DSGT +T LP S L + +++ P+ L+ CY F+S + VP++ F
Sbjct: 368 DSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSS 427
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
G +V + + S VC F G +++ V I+GN+ Q V YD V F P
Sbjct: 428 GIEVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPG 487
Query: 419 DCT 421
C+
Sbjct: 488 GCS 490
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 141/429 (32%), Positives = 200/429 (46%), Gaps = 50/429 (11%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETP-------YQRLRDA-LTRSLNRLNHFN 69
V S A + G +V L HR P SP S++ P + +LR + R L+ +
Sbjct: 52 VCSVTPASSSGTTVPLNHRYGPCSP-APSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQ 110
Query: 70 Q-NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+ ++ ++ S D + Y+I + IG+P + + DTGSD+ W +C
Sbjct: 111 PLDLTVPTTLGSALDTM----EYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS------- 159
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
LFDP S+TY CSS+ CA L N CS CQY V YGDGS + G +++T
Sbjct: 160 TDGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDT 219
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
+ L ++ + FGC + K G++GLGG SL+SQ T FSYC
Sbjct: 220 LALSASD----TVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYC 275
Query: 247 LVPVSSTK--INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI 301
L P + T + FG SG G V+TP+ KA T Y + + ISVG LG+ P +
Sbjct: 276 LPPTNRTSGFLTFGAPNGTSG-GFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQ-PSV 333
Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVM-SSMIE-AQPVADPTGSLELCYSFNSLSQV-- 352
V+DSGT +T+LP+ S L S SSM A P G L+ CY F L V
Sbjct: 334 LSNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSI 393
Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
P V++ GA V L + ++ C F T+ I GN+ Q F V +D+ Q
Sbjct: 394 PAVSLVLDGGAVVDLDGNGIMIQ-----DCLAFAA-TSGDSIIGNVQQRTFEVLHDVGQG 447
Query: 412 TVSFKPTDC 420
F+ C
Sbjct: 448 VFGFRSGAC 456
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 127/415 (30%), Positives = 187/415 (45%), Gaps = 66/415 (15%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+ RD + F NS Y S N NH + N ++ + N+
Sbjct: 88 EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ ++ GTP TE + DTGS + WTQC+ C C + FD SSTY C S
Sbjct: 129 LVDVAFGTPXTEIXLILDTGSSITWTQCKAC--VNCLQDSNRYFDSSASSTYSFGSCIPS 186
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
V Y+++YGD S S GN +T+TL + FGCG NN
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG--------- 261
G F S G++GLG G +S +SQ + FSYCL S + FG
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKF 291
Query: 262 --IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
+V+GPG + + +Y + + ISVGN+RL + ++P +IDS T +T LPQ
Sbjct: 292 TSLVNGPGTL-----QESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQ 346
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--PEVTIHF-RGADVKLS 367
S L + + P+++ L+ CY+ + V PE+ +HF GADV+L+
Sbjct: 347 RAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLN 406
Query: 368 RSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+N +C F G T+ + I GN Q + V YDI+ + + F C+K
Sbjct: 407 GTNIVWGSDASRLCLAFAG-TSELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCSK 460
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 113/373 (30%), Positives = 178/373 (47%), Gaps = 43/373 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
+ Y + + IGTPP L VADTGSDLIW +C PC C + F + S+TY ++
Sbjct: 83 SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPC--RNCSHRSPGSAFFARHSTTYSAI 140
Query: 147 PCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
C S QC + + N C+Y +Y D S + G + E +TL ++TG+ L
Sbjct: 141 HCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKL 200
Query: 200 PGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----- 248
G++FGCG G F G++GLG IS SQ+ KFSYCL+
Sbjct: 201 NGLSFGCGFRISGPSLTGASFEG-AQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLS 259
Query: 249 --PVSSTKINFGTNGIVSGPGVVS-TPL---TKAKTFYVLTIDAISVGNQRLGV-----S 297
P S I N VS G++S TPL + TFY + I + V +L + S
Sbjct: 260 PPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWS 319
Query: 298 TPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
D+ +IDSGTTLTF+ + + +L ++ A+PT +LC + + +++
Sbjct: 320 IDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVTRP 379
Query: 352 -VPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYD 407
+P ++ + G V N+F++ + I C + ++ + GN+MQ FL+ +D
Sbjct: 380 ALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFD 439
Query: 408 IEQQTVSFKPTDC 420
++ + F C
Sbjct: 440 RDKSRLGFTRRGC 452
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/451 (28%), Positives = 197/451 (43%), Gaps = 47/451 (10%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
T LSC+ L + + ++L HRD+ PK P R+ D +
Sbjct: 26 TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 73
Query: 61 SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
R L +NS++ + I A Y I +GTP + V DTGS+L W
Sbjct: 74 DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 133
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
C + + +F S ++K++ C + C SL C Y
Sbjct: 134 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 190
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
Y DGS + G A ET+T+G T G+ LPG GC ++ G G++GL D S
Sbjct: 191 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 250
Query: 232 ISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTP--LTKAKTFYVLTI 284
S + KFSYCLV S K + FG++ +TP LT+ FY + +
Sbjct: 251 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 310
Query: 285 DAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
IS+G L + S ++DSGT+LT L Y + + ++E + V
Sbjct: 311 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 370
Query: 336 PTGSLELCYSFNS---LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
+E C+SF S +S++P++T H + GA + R ++ V + + C F T +
Sbjct: 371 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 430
Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ GNIMQ N+L +D+ T+SF P+ CT
Sbjct: 431 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/380 (30%), Positives = 178/380 (46%), Gaps = 46/380 (12%)
Query: 66 NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
H + + A N Y I++G+PP + V DTGSDL W +C+PC P
Sbjct: 99 RHLAEEEEVEHDLAQTPVSFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSP- 157
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
C S FD S+TYK+L C+ + + F +G +
Sbjct: 158 DC----SSTFDRLASNTYKALTCADD------------LRLPVLLRLWRRLFHSGRSLRD 201
Query: 186 TVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
T+ + G+ + + PG FGCG+ GL S GI+ L G +S SQ+ KFS
Sbjct: 202 TLKMAGAASDELEEFPGFVFGCGSLLKGLI-SGEVGILALSPGSLSFPSQIGEKYGNKFS 260
Query: 245 YCLV------PVSSTKINFGTNGI-VSGPG------VVSTPLTKAKTFYVLTIDAISVGN 291
YCL+ + + + FG + + PG + TP+ ++ +Y + +D ISVGN
Sbjct: 261 YCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGN 320
Query: 292 QRLGVS--------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
QRL +S + DSGTTLT LP G ++ ++SM+ G L+ C
Sbjct: 321 QRLDLSPSTFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKG-LDAC 379
Query: 344 YSF--NSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
+ +S +P++T HF GAD SN+ + + + C +F TN V I+GN+ Q
Sbjct: 380 FRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFVP-TNEVSIFGNLQQQ 437
Query: 401 NFLVGYDIEQQTVSFKPTDC 420
+F V +D++ + + FK TDC
Sbjct: 438 DFFVLHDMDNRRIGFKETDC 457
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/451 (28%), Positives = 197/451 (43%), Gaps = 47/451 (10%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
T LSC+ L + + ++L HRD+ PK P R+ D +
Sbjct: 4 TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 51
Query: 61 SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
R L +NS++ + I A Y I +GTP + V DTGS+L W
Sbjct: 52 DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 111
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
C + + +F S ++K++ C + C SL C Y
Sbjct: 112 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 168
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
Y DGS + G A ET+T+G T G+ LPG GC ++ G G++GL D S
Sbjct: 169 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 228
Query: 232 ISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTP--LTKAKTFYVLTI 284
S + KFSYCLV S K + FG++ +TP LT+ FY + +
Sbjct: 229 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 288
Query: 285 DAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
IS+G L + S ++DSGT+LT L Y + + ++E + V
Sbjct: 289 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 348
Query: 336 PTGSLELCYSFNS---LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
+E C+SF S +S++P++T H + GA + R ++ V + + C F T +
Sbjct: 349 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 408
Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ GNIMQ N+L +D+ T+SF P+ CT
Sbjct: 409 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/432 (29%), Positives = 193/432 (44%), Gaps = 42/432 (9%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
++ + YV E V L+HR P +P S T + D RS R ++
Sbjct: 1 MILHIYIYVSVKPEQNGSTVYVPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIV 59
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
+ +S ++ + Y++R+S GTP ++ V DTGSD+ W QC+PC QC+
Sbjct: 60 RGKKVSVPAHLGTSVM--SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP 117
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLAT 184
Q PL+DP SSTY ++PC+S C L + SG C +++SY DG+ + G +
Sbjct: 118 QKDPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQ 177
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+ +TL + FGCG GLF+ G++GLG L + G
Sbjct: 178 DKLTL----APGAIVQNFYFGCGHGKHAVRGLFD----GVLGLG----RLRESLGARYGG 225
Query: 242 KFSYCLVPVSSTKINFGTNGIVSGP-GVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS 297
FSYCL P S+K F G P G V TP+ TF +T+ I+VG ++L +
Sbjct: 226 VFSYCL-PSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLR 284
Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-- 351
+ +++DSGT +T L L S +EA + P G L+ CY+
Sbjct: 285 PSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLL-PNGDLDTCYNLTGYKNVV 343
Query: 352 VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDI 408
VP++ + F GA + L N + C F G S + GN+ Q F V +D
Sbjct: 344 VPKIALTFTGGATINLDVPNGILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDT 399
Query: 409 EQQTVSFKPTDC 420
F+ C
Sbjct: 400 STSKFGFRAKAC 411
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/395 (32%), Positives = 193/395 (48%), Gaps = 52/395 (13%)
Query: 61 SLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
S+ RL + ++ I + + IIP +L+ ISIG+PP +L DT SDL+W Q
Sbjct: 55 SVERLEYLKAKATGDIIAHLSPNVPIIPQA--FLVNISIGSPPVTQLLHMDTASDLLWLQ 112
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSGVNCQYSVSYGD 174
C PC CY Q P+FDP S T+++ C +SQ + N K+ S C+YS+ Y D
Sbjct: 113 CRPC--INCYAQSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRS---CEYSMRYMD 167
Query: 175 GSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
G+ S G LA E + + + + AL + FGCG +N G TGI+GLG G+ SL+
Sbjct: 168 GTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLV 226
Query: 233 SQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV-----STPLTKAKTFYVLTIDAI 287
+ T KFSYC + ++ N +V G +TPL FY +TI+AI
Sbjct: 227 HRFGT----KFSYCFGSLDDP--SYPHNVLVLGDDGANILGDTTPLEIYNGFYYVTIEAI 280
Query: 288 SVG-----------NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
SV N+ +ID+G +LT L + L + + E + A
Sbjct: 281 SVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAAD 340
Query: 337 TGSLEL----CYSFN-----SLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK 385
++ CY+ N S P VT HF GA++ L + F+K+S ++ C +V
Sbjct: 341 VNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPNVFCLAVTP 400
Query: 386 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
G NS+ G Q ++ +GYD+E + +SF+ DC
Sbjct: 401 GNMNSI---GATAQQSYNIGYDLEAKKISFERIDC 432
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/387 (31%), Positives = 188/387 (48%), Gaps = 48/387 (12%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 141 ESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 198
Query: 141 STYKSLPCSSSQCASL---------NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVT 188
S+Y+++ C +C + + ++C C Y YGD S + G+LA E+ T
Sbjct: 199 SSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFT 258
Query: 189 LGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
+ T G + + G+ FGCG N GLF+ ++GLG G +S SQ+R FSYCL
Sbjct: 259 VNLTAPGASRRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHTFSYCL 317
Query: 248 VPVSS---TKINFGTN----GIVSGPGVVSTPL-------TKAKTFYVLTIDAISVGNQR 293
V S +K+ FG + + + P + T + A TFY + + + VG +
Sbjct: 318 VDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGEL 377
Query: 294 LGVS--TPDI--------VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
L +S T D+ +IDSGTTL+ F+ Y + M M + P+ L
Sbjct: 378 LNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSP 437
Query: 343 CYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSED---IVCSVFKGITNS-VPIYG 395
CY+ + + +VPE+++ F GA N+F+++ D I+C G + + I G
Sbjct: 438 CYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSIIG 497
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N Q NF V YD++ + F P C +
Sbjct: 498 NFQQQNFHVVYDLQNNRLGFAPRRCAE 524
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 118/349 (33%), Positives = 169/349 (48%), Gaps = 43/349 (12%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
V DTGSD++W QC PC +CY Q P+FDP+ SS+Y ++ C ++ C L+ C
Sbjct: 2 VLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRG 59
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y V+YGDGS + G+ TET+T G VA + GCG +N GLF + ++GL
Sbjct: 60 ACMYQVAYGDGSVTAGDFVTETLTF--AGGARVAR--VALGCGHDNEGLFVAAAG-LLGL 114
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTK------------INFGTNGIVSGPGVVSTP 272
G G +S +Q+ FSYCLV +S+ ++FG G V TP
Sbjct: 115 GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGA-GSVGASSASFTP 173
Query: 273 LT---KAKTFYVLTIDAISVGNQRL-GVSTPD-----------IVIDSGTTLTFLPQGYN 317
+ + +TFY + + ISVG R+ GV+ D +++DSGT++T L +
Sbjct: 174 MVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASY 233
Query: 318 SNLLSVMSSMIEAQPVADPTG--SLELCYSFNS--LSQVPEVTIHFR-GADVKLSRSNFF 372
S L + P G + CY + +VP V++HF GA+ L N+
Sbjct: 234 SALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYL 293
Query: 373 VKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ V S C F G V I GNI Q F V +D + Q V F P C
Sbjct: 294 IPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 125/411 (30%), Positives = 187/411 (45%), Gaps = 42/411 (10%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
V L+HR P +P S T + D RS R ++ + +S ++ +
Sbjct: 56 VPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAHLGTSVM--SLE 112
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R+S GTP ++ V DTGSD+ W QC+PC QC+ Q PL+DP SSTY ++PC+S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172
Query: 151 SQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C L + SG C +++SY DG+ + G + + +TL + FG
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 228
Query: 206 CGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
CG GLF+ G++GLG L + G FSYCL P S+K F G
Sbjct: 229 CGHGKHAVRGLFD----GVLGLG----RLRESLGARYGGVFSYCL-PSVSSKPGFLALGA 279
Query: 263 VSGP-GVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ 314
P G V TP+ TF +T+ I+VG ++L + + +++DSGT +T L
Sbjct: 280 GKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQS 339
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNF 371
L S +EA + P G L+ CY+ VP++ + F GA + L N
Sbjct: 340 TAYRALRSAFRKAMEAYRLL-PNGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNG 398
Query: 372 FVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ C F G S + GN+ Q F V +D F+ C
Sbjct: 399 ILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/367 (31%), Positives = 176/367 (47%), Gaps = 48/367 (13%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y++ +SIGTPP A+ DTGSDL+W +C+ C +F SS+YK LP
Sbjct: 2 EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
C+S+ C+ + S +G+ C+Y YGDGS ++G++ ++ ++ G+
Sbjct: 62 CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSST 253
G FGCG G +N T G++GLG SLI Q+ + KFSYCLV P + +
Sbjct: 119 FDGFLFGCGRKLKGDWNF-TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177
Query: 254 KINFGTNGIVSGPGVVSTPLTKA----KTFYVLTIDAISVG-------NQRLGVSTP--- 299
+ G++ + G VVSTP+ +T Y + + +I+VG ++ G +T
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGP 237
Query: 300 ----DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQ---PVADPTGSLELCY--SFNSL 349
VIDSGTT T L P Y + M IE Q P + L+LC+ S ++
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEA-----MRKSIEEQVILPTLGNSAGLDLCFNSSGDTS 292
Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
P VT +F + L N F S D+VC + I GN+ Q NF + YD+
Sbjct: 293 YGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352
Query: 409 EQQTVSF 415
+SF
Sbjct: 353 VASQISF 359
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 186/375 (49%), Gaps = 36/375 (9%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 199
Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
+Y+++ C +C + ++C + C Y YGD S + G+LA E T+ T
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 194 -GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
G + + + FGCG +N GLF+ ++GLG G +S SQ+R FSYCLV S
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYGHAFSYCLVDHGS 318
Query: 253 ---TKINFGTNGIVSG-PGVVST-----PLTKAKTFYVLTIDAISVGNQRLGV--STPDI 301
+KI FG + + G P + T A TFY + + + VG ++L + ST D+
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378
Query: 302 --------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-- 350
+IDSGTTL++ + Y + + M +A P+ L CY+ + +
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438
Query: 351 QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYD 407
+VPE ++ F GA N+FV++ D I+C G S + I GN Q NF V YD
Sbjct: 439 EVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYD 498
Query: 408 IEQQTVSFKPTDCTK 422
++ + F P C +
Sbjct: 499 LQNNRLGFAPRRCAE 513
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/355 (30%), Positives = 180/355 (50%), Gaps = 34/355 (9%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C S+C+ QD+PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
+ +GIVGLG SL++Q T FSYCL P + K + G++ ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGKNSALFLGSSAKLA 217
Query: 265 GPG-VVSTPLT-------KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLPQ 314
G G STP +Y + ++ + G+ + + S +++D+ + ++FL
Sbjct: 218 GGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVD 277
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFF 372
G + ++ + A P+A P +LC+ + S P++ FR GA + ++ SN+
Sbjct: 278 GAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVAASNYL 337
Query: 373 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ VC S T + + G++ Q N +D++++T+SF+P DCTK
Sbjct: 338 LDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 186/375 (49%), Gaps = 36/375 (9%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPATS 199
Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
+Y+++ C +C + ++C + C Y YGD S + G+LA E T+ T
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 194 -GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
G + + + FGCG +N GLF+ ++GLG G +S SQ+R FSYCLV S
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYGHAFSYCLVDHGS 318
Query: 253 ---TKINFGTNGIVSG-PGVVST-----PLTKAKTFYVLTIDAISVGNQRLGV--STPDI 301
+KI FG + + G P + T A TFY + + + VG ++L + ST D+
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378
Query: 302 --------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-- 350
+IDSGTTL++ + Y + + M +A P+ L CY+ + +
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438
Query: 351 QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYD 407
+VPE ++ F GA N+FV++ D I+C G S + I GN Q NF V YD
Sbjct: 439 EVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYD 498
Query: 408 IEQQTVSFKPTDCTK 422
++ + F P C +
Sbjct: 499 LQNNRLGFAPRRCAE 513
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/421 (29%), Positives = 200/421 (47%), Gaps = 47/421 (11%)
Query: 35 HRDSPKSPFYNSSETPYQRL-------RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
H+DS + ++ +RL R +R N + N + S+ + + I
Sbjct: 3 HKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRLQ 62
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY++ + +G + + DTGSDL W QC+PC ++CY Q P+F+P S +Y+++
Sbjct: 63 SLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--NRCYNQQDPVFNPSKSPSYRTVL 118
Query: 148 CSSSQCASLNQKSC-SGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C+S C SL + SGV C Y V+YGDGS+++G + E + LG+TT +
Sbjct: 119 CNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT-----VN 173
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
FGCG N GLF +G+VGLG D+SLISQ+ G FSYCL +T+ +
Sbjct: 174 NFIFGCGRKNQGLFGG-ASGLVGLGRTDLSLISQISPMFGGVFSYCL---PTTEAEASGS 229
Query: 261 GIVSGPGVV---STPLTKAKT-------FYVLTIDAISVGN---QRLGVSTPDIVIDSGT 307
++ G V +TP++ + FY L + I+VG Q ++IDSGT
Sbjct: 230 LVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGT 289
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADV 364
++ LP L + P A L+ C++ + +V P++ ++F G A++
Sbjct: 290 VISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAEL 349
Query: 365 KLSRSNFFVKVSEDI--VCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + F V D VC + + V I GN Q N + YD + + F C
Sbjct: 350 NVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409
Query: 421 T 421
+
Sbjct: 410 S 410
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 117/424 (27%), Positives = 195/424 (45%), Gaps = 65/424 (15%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-----SQADIIPNNANYLIRISIGTPPTE 103
T + +R A+ RSL+R ++ ++ +A S+A ++P YL+++ GTP
Sbjct: 45 TDQELIRRAVQRSLDRPGIVARSGGGAADEAGKAVASEAPLVPGGGEYLVKLGTGTPQHF 104
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
A DT SDL+W QC+PC CY Q P+F+PK+SS+Y +PC+S CA L+ C
Sbjct: 105 FSAAIDTASDLVWMQCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHE 162
Query: 164 VN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
+ CQY+ Y + G LA + + +G AV FGC ++ G ++ +G
Sbjct: 163 DDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASG 217
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL-- 273
+VGLG G +SL+SQ+ +F YCL P S + G + + + V+ +
Sbjct: 218 LVGLGRGPLSLVSQLSVH---RFMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSS 274
Query: 274 -TKAKTFYVLTIDAISVGNQ-----RLGVSTPD------------------------IVI 303
T+ ++Y L +D ++VG+Q R S P +++
Sbjct: 275 STRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIV 334
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF-----NSLSQVPEVT 356
D +T++FL L + I P A P+ L+LC+ VP V+
Sbjct: 335 DVASTISFLETSLYDELADDLEEEIRL-PRATPSLRLGLDLCFILPEGVGMDRVYVPTVS 393
Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+ F G ++L R F V++ + + G T+ V I GN N V +++ + ++F
Sbjct: 394 LSFDGRWLELDRDRLF--VTDGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFA 451
Query: 417 PTDC 420
C
Sbjct: 452 KASC 455
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 129/420 (30%), Positives = 185/420 (44%), Gaps = 41/420 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN---------QNSSISSSKAS 80
SV L HR+ P SP E P + L R R + Q+++ + S +
Sbjct: 62 SVPLAHRNGPCSPVRGKGELPRAEM---LRRDRERTEYIIRRASRSRRLQDNNDAVSVPT 118
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q ++ Y+ + +GTP + + DTGS L W QC+PC SQCY Q PLFDP S
Sbjct: 119 QLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTS 178
Query: 141 STYKSLPCSSSQC----ASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S+Y +PC S +C A ++ C+ C Y + YG G+ G +T+ +TLG
Sbjct: 179 SSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGP-- 236
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP--V 250
+ FGCG + G++GLG SL Q G FS+CL P V
Sbjct: 237 --GAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV 294
Query: 251 SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL----GVSTPDIVI 303
S+ + G S V TPL FY L AISV Q L V ++
Sbjct: 295 STGFLALGAPHDTS--AFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREGVIT 352
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR- 360
DSGT L+ L + + L + S + P+A P G L+ C++F VP V++ FR
Sbjct: 353 DSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRG 412
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
GA V L S+ V D + + + G++ Q V YD+ + V F+ C
Sbjct: 413 GATVHLDASS---GVLMDGCLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 136/436 (31%), Positives = 205/436 (47%), Gaps = 45/436 (10%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQR---LRDALTRSLNRLNH 67
L + F + P + + F++ L H S K+ E+P + L T + +RL+
Sbjct: 13 LLIILFALTCPKQCTSYRFTLRL-HTKSIKT-----KESPKIKPGYLHSKSTPAPSRLD- 65
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
N ++ + S IPN A +L ISIG PP +L + DTGSDL W QC PC +C
Sbjct: 66 -NLWTTEIADIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC---KC 121
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
Y Q P F P SSTY++ C S+ A + + +G NC+Y + Y D S + G LA E
Sbjct: 122 YPQTIPFFHPSRSSTYRNASCESAPHAMPQIFRDEKTG-NCRYHLRYRDFSNTRGILAKE 180
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
+T ++ ++ P I FGCG +N G ++ +G++GLG G S++++ KFSY
Sbjct: 181 KLTFQTSDEGLISKPNIVFGCGQDNSGF--TQYSGVLGLGPGTFSIVTR---NFGSKFSY 235
Query: 246 CLVPVSSTKINFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGV----- 296
C S + N ++ G G TPL + Y L + AIS+G + L +
Sbjct: 236 CF--GSLIDPTYPHNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIF 293
Query: 297 ----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN--- 347
S VID+G + T L + L + ++ + V D CY N
Sbjct: 294 QRYRSKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKL 353
Query: 348 SLSQVPEVTIHFR-GADVKLSRSNFFV-KVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLV 404
L P VT HF GA++ L + FV S D C T + + + G + Q N+ V
Sbjct: 354 DLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNV 413
Query: 405 GYDIEQQTVSFKPTDC 420
GY++ V F+ TDC
Sbjct: 414 GYNLRTMKVYFQRTDC 429
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 125/352 (35%), Positives = 169/352 (48%), Gaps = 33/352 (9%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CA L +CS C Y VSYGDGS + G +++T+TL +++ A+ G FG
Sbjct: 199 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 254
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV 263
CG GLFN G++GLG SL+ Q T G FSYCL P ++ + G G
Sbjct: 255 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 313
Query: 264 -SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQG 315
+ PG +T P A T+YV+ + ISVG Q+L V + LP
Sbjct: 314 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 373
Query: 316 YNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 370
+ L S S + + P A G L+ CY+F V P V + F GA V L
Sbjct: 374 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 433
Query: 371 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F G + I GN+ Q +F V I+ +V FKP+ C
Sbjct: 434 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 478
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 140/424 (33%), Positives = 191/424 (45%), Gaps = 47/424 (11%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R + + S S A+ A
Sbjct: 68 LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
+P NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186
Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SS+Y ++PC CA L +CS C Y VSYGDGS + G +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVS 251
A+ G FGCG GLFN G++GLG SL+ Q T G FSYCL P +
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301
Query: 252 STKINFGTNGIV-SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGT 307
+ + G G + PG +T P A T+YV+ + ISVG Q+L V +
Sbjct: 302 AGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361
Query: 308 TLTF----LPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
LP + L S S + + P A G L+ CY+F V P V + F
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421
Query: 360 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L C F G + I GN+ Q +F V I+ +V FK
Sbjct: 422 GSGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474
Query: 417 PTDC 420
P+ C
Sbjct: 475 PSSC 478
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 118/348 (33%), Positives = 171/348 (49%), Gaps = 39/348 (11%)
Query: 84 IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
++ N+A Y + +SIGTPP +ADTGS LIWTQC PC ++C + +P F P SST
Sbjct: 82 LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139
Query: 143 YKSLPCSSSQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
+ LPC+SS C L ++C+ C Y YG G F+ G LATET+ +G + P
Sbjct: 140 FSKLPCASSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGAS-----FP 193
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINF 257
G+TFGC T NG + ++GIVGLG +SL+SQ+ +FSYCL + I F
Sbjct: 194 GVTFGCSTENG--VGNSSSGIVGLGRSPLSLVSQVGV---ARFSYCLRSNADAGDSPILF 248
Query: 258 GTNGIVSGPGVVSTPLTK-----AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL 312
G+ V+G V STPL + + ++Y + + I+VG L ++ ++ +GT F
Sbjct: 249 GSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMAMANLTTVNGTRFGF- 307
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 372
+++ + V G E S V EV R A L
Sbjct: 308 DLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECL----LV 363
Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ SE + S+ I GN+MQ + V YD++ SF P DC
Sbjct: 364 LPASEKL----------SISIIGNVMQMDLHVLYDLDGGMFSFAPADC 401
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/370 (32%), Positives = 176/370 (47%), Gaps = 51/370 (13%)
Query: 83 DIIPNN------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
D PNN N+L+ ++ GTPP + + DTGS + WTQC+PC +C FD
Sbjct: 148 DHTPNNKLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPC--VRCLKASRRHFD 205
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P S TY C S V Y+++YGD S S GN +T+TL +
Sbjct: 206 PSASLTYSLGSCIPST-----------VGNTYNMTYGDKSTSVGNYGCDTMTLE----HS 250
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KI 255
P FGCG NN G F S G++GLG G +S +SQ + FSYCL S +
Sbjct: 251 DVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSL 310
Query: 256 NFGTNG-----------IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STP 299
FG +V+GPG ++ L ++ ++V +D ISVGN+RL + ++P
Sbjct: 311 LFGEKATSQSSSLKFTSLVNGPG--TSGLEESGYYFVKLLD-ISVGNKRLNIPSSVFASP 367
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--P 353
+IDSGT +T LPQ S L + + P+++ L+ CY+ + V P
Sbjct: 368 GTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLP 427
Query: 354 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
E+ +HF GADV+L+ +C F G + + I GN Q + V YDI+
Sbjct: 428 EIVLHFGEGADVRLNGKRVIWGNDASRLCLAFAG-NSELTIIGNRQQVSLTVLYDIQGGR 486
Query: 413 VSFKPTDCTK 422
+ F C+K
Sbjct: 487 IGFGGNGCSK 496
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 136/436 (31%), Positives = 199/436 (45%), Gaps = 70/436 (16%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI---- 84
V L+HRDS N+S D L R L R + + I + A+ AD
Sbjct: 66 LQVRLVHRDSFA---VNASAA------DLLARRLQR--DMRRAAWIITKAATPADPENGT 114
Query: 85 ----IPNNANYLIRISIGTPPT-----ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
P + Y+ +I++GTP E L D GSD+ W QC PC +CY Q P++
Sbjct: 115 VVTGAPTSGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPC--FRCYHQPGPVY 172
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLG 190
+ SS+ + C + C +L S G CQY V YGDGS S G+ ET+T
Sbjct: 173 NRLKSSSASDVGCYAPACRALG--SSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFP 230
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
V +PG+ GCG++N GLF + GI+GLG G +S SQ+ FSYCL
Sbjct: 231 P----GVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQ 286
Query: 251 S----STKINFGTNGIVSGPGVVSTP----LTKAK--TFYVLTIDAISVGNQRL-GVSTP 299
S+ + FG+ + LT ++ TFY + + ISVG R+ GV+
Sbjct: 287 GTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTES 346
Query: 300 D-----------IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS---LELCY 344
D +++DSGT +T L Y + + + ++ P G + CY
Sbjct: 347 DLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCY 406
Query: 345 S---FNSLSQVPEVTIHFRGA-DVKLSRSNFFVKV--SEDIVCSVFKGITN-SVPIYGNI 397
S + +VP V++HF G +VKL N+ + V ++ +C F G + V I GNI
Sbjct: 407 SSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNI 466
Query: 398 MQTNFLVGYDIEQQTV 413
F V YD++ Q V
Sbjct: 467 QLQGFRVVYDVDGQRV 482
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 130/430 (30%), Positives = 200/430 (46%), Gaps = 66/430 (15%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
+LIH S P Y +ET R+ + S RL + S+ S+ +A + P+
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQARIEGSLVSNNDYKARVSPSLT 97
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
+ ISIG PP +L V DTGSD++W C PC + C LFDP SST+ L
Sbjct: 98 GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNDLGLLFDPSKSSTFSPLC 155
Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC C C + ++V+Y D S ++G +TV +T + + F
Sbjct: 156 KTPCDFEGC------RCDPI--PFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLF 207
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
GCG N G + GI+GL G SL+ T + KFSYC+ ++ N+ + ++
Sbjct: 208 GCGHNIGHDTDPGHNGILGLNNGPDSLV----TKLGQKFSYCIGNLADPYYNY--HQLIL 261
Query: 265 GPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD-----------IVIDSGTTL 309
G G STP FY +T++ ISVG +RL ++ P+ ++ID+G+T+
Sbjct: 262 GEGADLEGYSTPFEVYNGFYYVTMEGISVGEKRLDIA-PETFEMKENRAGGVIIDTGSTI 320
Query: 310 TFLPQGYNS-------NLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVT 356
TFL + NLL S + IE P C+ + S+S+ P VT
Sbjct: 321 TFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQ-------CF-YGSISRDLVGFPVVT 372
Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQ 410
HF GAD+ L +FF ++++++ C I + + G + Q ++ VGYD+
Sbjct: 373 FHFSDGADLALDSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVN 432
Query: 411 QTVSFKPTDC 420
Q V F+ DC
Sbjct: 433 QFVYFQRIDC 442
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 182/382 (47%), Gaps = 60/382 (15%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + + +GTP E + + DTGSD+ W QC PC C P F+P+ SS++ LPC+
Sbjct: 137 EYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 194
Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
SS C ++ Q S SG C +S+ YGDGS S+G LA ET+ G+T G+ V L
Sbjct: 195 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 253
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---- 256
IT GC + + +G++G+ IS SQ+ + A KFS+C P +N
Sbjct: 254 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-PDKIAHLNSSGL 312
Query: 257 --FGTNGIVSGPGVVSTPLTK-------AKTFYVLTIDAISVGNQRLGVSTPDI------ 301
FG + I+S P + TPL + + +Y + + ISV RL +S +
Sbjct: 313 VFFGESDIIS-PYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVT 371
Query: 302 -----VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
+IDSGT T+L Q L+ S + + D CY+ S +
Sbjct: 372 GSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK----VDDNSGFTPCYNITSGTAA 427
Query: 352 -----VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQ 399
+P +T+HFRG DV L +++ + VS + +C F+ ++ +P I GN Q
Sbjct: 428 LESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ-MSGDIPFNIIGNYQQ 486
Query: 400 TNFLVGYDIEQQTVSFKPTDCT 421
N V YD+E+ + P C
Sbjct: 487 QNLWVEYDLEKLRLGIAPAQCA 508
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 180/355 (50%), Gaps = 34/355 (9%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C S+C+ QD+PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
+ +GIVGLG SL++Q T FSYCL P + + + G++ ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGRNSALFLGSSAKLA 217
Query: 265 GPG-VVSTPLT-------KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLPQ 314
G G STP +Y + ++ + G+ + + S +++D+ + ++FL
Sbjct: 218 GGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVD 277
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFF 372
G + +++ + A P+A P +LC+ + S P++ FR GA + + +N+
Sbjct: 278 GAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYL 337
Query: 373 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ VC S T + + G++ Q N +D++++T+SF+P DCTK
Sbjct: 338 LDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 136/431 (31%), Positives = 195/431 (45%), Gaps = 53/431 (12%)
Query: 34 IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
+HRDS SP+ ++ T + +R+ L R RL + S+ + K+S + + N
Sbjct: 1 MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 88 -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ Y + + +GTPP VADTGSD++W QC PC CY Q
Sbjct: 61 FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
PLF+P SST++S+ C SS C L + C C Y VSYGDGSF+ G +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
S +VA+ GCG NN GLF + G++GLG G +S SQ+ FSYCL
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLF-TGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232
Query: 251 SST---KINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV--------- 296
ST + FG + S +T LT K TFY + + I VG + +
Sbjct: 233 ESTGSVPLIFGNQAVASN-AQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDS 291
Query: 297 --STPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV- 352
+++DSGT +T L YN + + M + + CY + S +
Sbjct: 292 STGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIM 351
Query: 353 -PEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
P V+ F GA + L N V V C F + + I GNI Q +F + +D
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDST 411
Query: 410 QQTVSFKPTDC 420
V C
Sbjct: 412 GNRVGIGANQC 422
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 125/352 (35%), Positives = 169/352 (48%), Gaps = 33/352 (9%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC
Sbjct: 47 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106
Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CA L +CS C Y VSYGDGS + G +++T+TL +++ A+ G FG
Sbjct: 107 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 162
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV 263
CG GLFN G++GLG SL+ Q T G FSYCL P ++ + G G
Sbjct: 163 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 221
Query: 264 -SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQG 315
+ PG +T P A T+YV+ + ISVG Q+L V + LP
Sbjct: 222 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 281
Query: 316 YNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 370
+ L S S + + P A G L+ CY+F V P V + F GA V L
Sbjct: 282 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 341
Query: 371 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F G + I GN+ Q +F V I+ +V FKP+ C
Sbjct: 342 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 386
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 136/431 (31%), Positives = 195/431 (45%), Gaps = 53/431 (12%)
Query: 34 IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
+HRDS SP+ ++ T + +R+ L R RL + S+ + K+S + + N
Sbjct: 1 MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 88 -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ Y + + +GTPP VADTGSD++W QC PC CY Q
Sbjct: 61 FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
PLF+P SST++S+ C SS C L + C C Y VSYGDGSF+ G +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
S +VA+ GCG NN GLF + G++GLG G +S SQ+ FSYCL
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLF-TGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232
Query: 251 SST---KINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV--------- 296
ST + FG + S +T LT K TFY + + I VG + +
Sbjct: 233 ESTGSVPLIFGNQAVASN-AQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDS 291
Query: 297 --STPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV- 352
+++DSGT +T L YN + + M + + CY + S +
Sbjct: 292 STGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIM 351
Query: 353 -PEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
P V+ F GA + L N V V C F + + I GNI Q +F + +D
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDST 411
Query: 410 QQTVSFKPTDC 420
V C
Sbjct: 412 GNRVGIGANQC 422
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 143/435 (32%), Positives = 204/435 (46%), Gaps = 48/435 (11%)
Query: 30 SVELIHRDSPKSPFYNSS-ETPYQRL-RDALT-----RSLNRLNHFNQNSSISSS--KAS 80
+V L HR P SP N T +RL RD L R L+R + + S
Sbjct: 63 TVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDVVVQQS 122
Query: 81 QADIIP-------NNANYLIRISIGTPPTE-RLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A +P + Y+I + +G+PP + + + DTGSD+ W +C+PC QC Q
Sbjct: 123 HAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCW-QQCRPQVD 181
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGV-NCQYSVSYGDGSF-SNGNLATET 186
PLFDP +SSTY CSS+ CA L N CS CQY YGDGS + G +++T
Sbjct: 182 PLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDT 241
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA-GKFSY 245
+ LGS + V + FGC G+ + GG SL+SQ T FSY
Sbjct: 242 LALGSNS-NTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQ-SLVSQTAGTFGTTAFSY 299
Query: 246 CLVPVSSTK--INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVST-- 298
CL P S+ + G G S G V TP+ ++ FY + ++AI VG ++L + T
Sbjct: 300 CLPPTPSSSGFLTLGAAG-TSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTV 358
Query: 299 --PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV- 352
+++DSGT +T LP S+L S + ++ P A + G L+ C+ + S V
Sbjct: 359 FSAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVS 418
Query: 353 -PEVTIHFRGAD---VKLSRSNFFVKV-SEDIVCSVFKGITN--SVPIYGNIMQTNFLVG 405
P V + F GA V L S +++ + I C F ++ S I GN+ Q F V
Sbjct: 419 MPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVL 478
Query: 406 YDIEQQTVSFKPTDC 420
YD+ V FK C
Sbjct: 479 YDVAGGAVGFKAGAC 493
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 142/468 (30%), Positives = 218/468 (46%), Gaps = 69/468 (14%)
Query: 6 SCVFILFFLCFYVVSPI------------EAQTGGFSVELIHRDSPKSPFYNSSETPYQR 53
S +F LF L ++ P+ + + GF LIH SP+SPFY + TP +
Sbjct: 8 SAIFRLFLLILHIPFPLSSSFSLPLKELAKGKAYGFKAPLIHWSSPESPFYEPNLTPGEL 67
Query: 54 LRDALTRSLNRLNHFNQ--NSSISSSK---ASQADIIPNNANYLIRISIGTPPTERLAVA 108
+R ++ S R + + +S IS+S+ S+ II + Y+++ +IG+PP E A+
Sbjct: 68 MRASVRTSRARGDRIRKIRSSGISNSRKYPVSRISII--DKVYVMKFNIGSPPVETYAIP 125
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS--------LNQKS 160
DTGS+++W QC + CY Q PLF+P SSTY C +C L KS
Sbjct: 126 DTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKS 185
Query: 161 CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP-GITFGCGTNN----GGLFN 215
V C+Y +SY D SFS G ++T+ +T + + FGCG NN G N
Sbjct: 186 SVQV-CRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPN 244
Query: 216 SKTT-GIVGLGGGDISLISQMRTTIAGKFSYCL------VPVSSTKINFGTNGIVSGPGV 268
S T G+VGLG SL+ Q+ G+FSYC+ P + +I FG +SG
Sbjct: 245 SFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKPNGTIEIRFGLAASISGH-- 299
Query: 269 VSTPLT-KAKTFYVL-TIDAISVGNQRLGVSTPD------------IVIDSGTTLTFLPQ 314
ST L + +Y+ +D I V + ++ P+ +++DSGTT T L
Sbjct: 300 -STALANNLEGWYIFQNVDGIYVDDTKVK-GYPEWVFQFAEGGIGGLIMDSGTTYTELYF 357
Query: 315 GYNSNLLSVMSSMIEAQP-VADPTGS-LELCYSFNS--LSQVPEVTIHF---RGADVKLS 367
L+ + IE P D + S LCY+ + L+ VP + + F + A +
Sbjct: 358 SALDALIGELKEQIELAPDTQDHSNSNYSLCYNAANFLLTYVPAIELKFTDNKEAYFPFT 417
Query: 368 RSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
N ++ D C G T+ + I G + +GYD++ VSF
Sbjct: 418 LRNAWIDNGNDQYCLAMFG-TSGISIIGIYQHRDIKIGYDLKYNLVSF 464
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 117/367 (31%), Positives = 174/367 (47%), Gaps = 43/367 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + S+GTP + + DTGSDL + QC PC CY QD PL+ P SST+ +P
Sbjct: 31 SGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPC--DLCYEQDGPLYQPSNSSTFTPVP 88
Query: 148 CSSSQ-----------CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
C S++ C+S +S C Y YGD S + G A ET T+G
Sbjct: 89 CDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGGIRVNH 148
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---- 252
VA FGCG N G F S G++GLG G +S SQ KF+YCL S
Sbjct: 149 VA-----FGCGNRNQGSFVS-AGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSV 202
Query: 253 -TKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL----------GVST 298
+ + FG + + + + TPL + Y + I I G + L V
Sbjct: 203 FSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGN 262
Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEV 355
+ DSGTT+T+ PQ Y + + S+ + P G L LC + + + P
Sbjct: 263 GGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQG-LPLCVNVSGIDHPIYPSF 321
Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
TI F +GA + ++ N+F++VS +I C ++ + ++ + GNI+Q N+LV YD E+ +
Sbjct: 322 TIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEHRI 381
Query: 414 SFKPTDC 420
F +C
Sbjct: 382 GFAHANC 388
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 114/368 (30%), Positives = 179/368 (48%), Gaps = 39/368 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + IGTPP + DTGSDL W QC PC C++Q+ P +DPK SS++K++ C
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPC--YDCFVQNGPYYDPKESSSFKNIGC 247
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
+C ++ + C N C Y YGD S + G+ A ET T+ T+ +
Sbjct: 248 HDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKR 307
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ + G +S SQ+++ FSYCLV + S+
Sbjct: 308 VENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 366
Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVS------TPD- 300
K+ FG + +++ P V T L K TFY + I +I VG + L + +P+
Sbjct: 367 KLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEG 426
Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
++DSGTTL++ + + ++ PV L+ CY+ + + ++PE
Sbjct: 427 AGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEF 486
Query: 356 TIHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
I F GA N+F+K+ E+IVC G S + I GN Q NF + YD ++
Sbjct: 487 RILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQQQNFHILYDTKKSR 546
Query: 413 VSFKPTDC 420
+ + P C
Sbjct: 547 LGYAPMKC 554
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 175/367 (47%), Gaps = 48/367 (13%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y++ +SIGTPP A+ DTGSDL+W +C+ C +F SS+YK LP
Sbjct: 2 EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
C+S+ C+ + S +G+ C+Y YGDGS ++G++ ++ ++ G+
Sbjct: 62 CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSST 253
G FGC G +N T G++GLG SLI Q+ + KFSYCLV P + +
Sbjct: 119 FDGFLFGCARKLKGDWNF-TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177
Query: 254 KINFGTNGIVSGPGVVSTPLTKA----KTFYVLTIDAISVG-------NQRLGVSTP--- 299
+ G++ + G VVSTP+ +T Y + + +I++G ++ G +T
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGP 237
Query: 300 ----DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQ---PVADPTGSLELCY--SFNSL 349
VIDSGTT T L P Y + M IE Q P + L+LC+ S ++
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEA-----MRKSIEEQVILPTLGNSAGLDLCFNSSGDTS 292
Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
P VT +F + L N F S D+VC + I GN+ Q NF + YD+
Sbjct: 293 YGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352
Query: 409 EQQTVSF 415
+SF
Sbjct: 353 VASQISF 359
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 180/362 (49%), Gaps = 48/362 (13%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ + +GTPP + D GSDL+WTQC P+ Q P+FD SS++ LPC S
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTA--KQLEPVFDAARSSSFSVLPCDSKL 166
Query: 153 C--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C + K+C+ C Y YG + + G LATET T G+ G + L TFGCG
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYGIMT-ATGVLATETFTFGAHHGVSANL---TFGCGKLA 222
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTN---GIVS 264
G ++ +GI+GL G +S++ Q+ T KFSYCL P + K + FG G
Sbjct: 223 NGTI-AEASGILGLSPGPLSMLKQLAIT---KFSYCLTPFADRKTSPVMFGAMADLGKYK 278
Query: 265 GPGVVST-PLTK---AKTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGTTLT 310
G V T PL K +Y + + +SVG++RL V PD V+DS TTL
Sbjct: 279 TTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLA 338
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLS----QVPEVTIHFRG-AD 363
+L + + L + I+ PVA+ + +C+ +S QVP + +HF G A+
Sbjct: 339 YLVEPAFTELKKAVMEGIKL-PVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAE 397
Query: 364 VKLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ L R N+F + S ++C + F+G N + GN+ Q N V YD+ + S+ PT
Sbjct: 398 MSLPRDNYFQEPSPGMMCLAVMQAPFEGAPN---VIGNVQQQNMHVLYDVGNRKFSYAPT 454
Query: 419 DC 420
C
Sbjct: 455 KC 456
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 120/381 (31%), Positives = 181/381 (47%), Gaps = 60/381 (15%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + + +GTP E + + DTGSD+ W QC PC C P F+P+ SS++ LPC+
Sbjct: 138 EYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 195
Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
SS C ++ Q S SG C +S+ YGDGS S+G LA ET+ G+T G+ V L
Sbjct: 196 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 254
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---- 256
IT GC + + +G++G+ IS SQ+ + A KFS+C P +N
Sbjct: 255 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-PDKIAHLNSSGL 313
Query: 257 --FGTNGIVSGPGVVSTPLTK-------AKTFYVLTIDAISVGNQRLGVSTPDI------ 301
FG + I+S P + TPL + + +Y + + ISV RL +S +
Sbjct: 314 VFFGESDIIS-PYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVT 372
Query: 302 -----VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
+IDSGT T+L Q L+ S + + D CY+ S +
Sbjct: 373 GSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK----VDDNSGFTPCYNITSGTAA 428
Query: 352 -----VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQ 399
+P +T+HFRG DV L +++ + VS + +C F ++ +P I GN Q
Sbjct: 429 LESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL-MSGDIPFNIIGNYQQ 487
Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
N V YD+E+ + P C
Sbjct: 488 QNLWVEYDLEKLRLGIAPAQC 508
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 181/368 (49%), Gaps = 39/368 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q+ P +DPK SS++K++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YACFEQNGPYYDPKDSSSFKNITC 250
Query: 149 SSSQCASLNQ----KSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---- 198
+C ++ + C G +C Y YGD S + G+ A ET T+ TT +
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ ++GLG G +S +Q+++ FSYCLV + S+
Sbjct: 311 VENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNSSVSS 369
Query: 254 KINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI------ 301
K+ FG + ++S P + T K TFY + I +I VG + L +
Sbjct: 370 KLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQG 429
Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
+IDSGTTLT+ + + I+ P+ + L+ CY+ + + ++PE
Sbjct: 430 GGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKMELPEF 489
Query: 356 TIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
I F GA N+F+++ ED+VC G S + I GN Q NF + YD+++
Sbjct: 490 AILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDLKKSR 549
Query: 413 VSFKPTDC 420
+ + P C
Sbjct: 550 LGYAPMKC 557
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 117/388 (30%), Positives = 182/388 (46%), Gaps = 83/388 (21%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFY 280
+VGLG G +SL+SQ+ G ++ ++ST I F
Sbjct: 216 VVGLGRGPLSLVSQLSVRRYGM----IIDIAST-ITF----------------------- 247
Query: 281 VLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
L S D +++ LP+G S+L L
Sbjct: 248 -------------LEASLYDELVNDLEVEIRLPRGTGSSL------------------GL 276
Query: 341 ELCY------SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVP 392
+LC+ +F+ + VP V + F G ++L ++ F + E ++C V + SV
Sbjct: 277 DLCFILPDGVAFDRV-YVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVS 335
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN Q N V Y++ + V+F + C
Sbjct: 336 ILGNFQQQNMQVLYNLRRGRVTFVQSPC 363
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 131/415 (31%), Positives = 205/415 (49%), Gaps = 32/415 (7%)
Query: 30 SVELIHRDSPKSPFYNS-SETPY---QRLR-DALTRSLNRLNHFNQNSSISSSKASQADI 84
S++++H+ P N S + +LR D++ L++++ + + +Q+ I
Sbjct: 69 SLQVLHKYGPCMQVLNDRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEMVTKLPAQSGI 128
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
NY++ + +GTP + V DTGS + WTQC+PC S CY Q FDP S++Y
Sbjct: 129 AIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGS-CYPQKEQKFDPTKSTSYN 187
Query: 145 SLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
++ CSS+ C L +++ CS N C Y + YGD S+S G ATET+T+ S+
Sbjct: 188 NVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSD----VFT 243
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFG 258
FGCG +N GLF + G++GL +SL SQ +FSYCL P S+ +NFG
Sbjct: 244 NFLFGCGQSNNGLFG-QAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNFG 302
Query: 259 TNGIVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
G VS TP++ A +FY + I ISV +L + +T +IDSGT +T L
Sbjct: 303 --GKVSQTAGF-TPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRL 359
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA-DVKLSRS 369
P L + P + L+ CY F++ + V P+V++ F+G +V + S
Sbjct: 360 PPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDAS 419
Query: 370 NFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V+ +VC F + I+GN Q + V YD + + F C+
Sbjct: 420 GILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGACS 474
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 135/444 (30%), Positives = 202/444 (45%), Gaps = 61/444 (13%)
Query: 27 GGFSV-ELIHRDSPKSPFYNSSETPYQRLRDALTR--SLN-RLNHFNQNSSISSSK---- 78
GG +V EL H +P + E L R SL R+ H+ ++ SS++
Sbjct: 65 GGATVLELRHHSFSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVT 124
Query: 79 ASQADI-IPNNA-----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
AS+A + + + A NY+ + +G E + DT S+L W QC PC C+ Q
Sbjct: 125 ASKAQVPVSSGARLRTLNYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQG 180
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-------------CQYSVSYGDGSFSN 179
PLFDP S +Y ++PC S C +L Q+ +G C Y++SY DGS+S
Sbjct: 181 PLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSR 240
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G LA + ++L + G FGCGT+N G T+G++GLG +SL+SQ
Sbjct: 241 GVLAHDRLSLAGEV-----IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQF 295
Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVV--STPLTKAKT-----------FYVLTIDA 286
G FSYCL P+S G+ + P STP+ FY++ +
Sbjct: 296 GGVFSYCL-PLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTG 354
Query: 287 ISVGNQRLGVS--TPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
I+VG Q + + + ++DSGT +T +P YN+ MS + E P A L+ C
Sbjct: 355 ITVGGQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAE-YPQAPGFSILDTC 413
Query: 344 YSFNSLS--QVPEVTIHFR-GADVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGN 396
++ L QVP +T+ F GA+V++ +FV VC + + I GN
Sbjct: 414 FNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGN 473
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
Q N V +D V F C
Sbjct: 474 YQQKNLRVVFDTSASQVGFAQETC 497
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/356 (33%), Positives = 168/356 (47%), Gaps = 53/356 (14%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 180 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 237
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 238 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 292
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS 270
GLF T G++GLG ++SL+SQ G FSYCL +S G +S G S
Sbjct: 293 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGD----AAGSLSLGGDTS 347
Query: 271 -----TPLTKAKT--------FYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-P 313
TP++ + FY + + G+ ++++DSGT +T L P
Sbjct: 348 SYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAP 407
Query: 314 QGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRS 369
Y + E P A P L+ CY+ + VP +T+ GAD+ + +
Sbjct: 408 SVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAA 467
Query: 370 NFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+D VC ++ + PI GN Q N V YD + F DC+
Sbjct: 468 GMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/356 (33%), Positives = 168/356 (47%), Gaps = 53/356 (14%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS 270
GLF T G++GLG ++SL+SQ G FSYCL +S G +S G S
Sbjct: 292 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGD----AAGSLSLGGDTS 346
Query: 271 -----TPLTKAKT--------FYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-P 313
TP++ + FY + + G+ ++++DSGT +T L P
Sbjct: 347 SYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAP 406
Query: 314 QGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRS 369
Y + E P A P L+ CY+ + VP +T+ GAD+ + +
Sbjct: 407 SVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAA 466
Query: 370 NFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+D VC ++ + PI GN Q N V YD + F DC+
Sbjct: 467 GMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 100/283 (35%), Positives = 143/283 (50%), Gaps = 25/283 (8%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP MS+TY ++PC+S+ CA L
Sbjct: 71 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS CQ+ ++YGDGS + G + + +TLG + G FGC + G
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 186
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
F+ G + LGGG SL+ Q T FSYCL P +S+ + F G+ P
Sbjct: 187 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS-LGFLVLGVPPERAQLIPSF 245
Query: 269 VSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNL 320
VSTPL + A TFY + + AI V + L V P + VIDS T ++ LP L
Sbjct: 246 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVP-PAVFSASSVIDSSTIISRLPPTAYQAL 304
Query: 321 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG 361
+ S + A P L+ CY F + + P + + F G
Sbjct: 305 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDG 347
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 113/288 (39%), Gaps = 62/288 (21%)
Query: 156 LNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
+ QK+ G CQ+ ++YGDGS + G + + +TLG LP
Sbjct: 381 VQQKTLEGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDRQGLP----------- 429
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----P 266
L + G V FSYC +P S + + F T G+ P
Sbjct: 430 -LRTATQYGRV--------------------FSYC-IPPSPSSLGFITLGVPPQRAALVP 467
Query: 267 GVVSTPLTKAK----TFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYN 317
VSTPL + TFY + + AI V + L V P + VI S T ++ LP
Sbjct: 468 TFVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVP-PTVFSTSSVIASTTVISRLPPTAY 526
Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVK 374
L + + A P L+ CY F + + P + + F G A V L + ++
Sbjct: 527 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 586
Query: 375 VSEDIVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F T+ +P + GN+ Q V YD+ + + F+ C
Sbjct: 587 G-----CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 135/448 (30%), Positives = 203/448 (45%), Gaps = 80/448 (17%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ G +EL H D+ ++ S+E +R+R A R+ RL + S+ SQ
Sbjct: 20 RAAGLRLELTHVDAKQN---CSTE---ERMRRATERTHRRLASMGEASAPVHWAESQ--- 70
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
Y+ IG PP + A+ DTGS+LIWTQC C P+ C+ Q+ +DP S T +
Sbjct: 71 ------YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTAR 124
Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ C+ + CA ++ C+ N C +YG G G L TE T + + V+L
Sbjct: 125 PVACNDTACALGSETRCARDNKACAVLTAYGAGVI-GGVLGTEAFTFQPQS-ENVSL--- 179
Query: 203 TFGC--------GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
FGC G+ +G +GI+GLG G++SL+SQ+ KFSYCL P S
Sbjct: 180 AFGCIAATRLTPGSLDG------ASGIIGLGRGNLSLVSQLGDN---KFSYCLTPYFSQS 230
Query: 255 INF------GTNGIVSGPG-VVSTPLTKA------KTFYVLTIDAISVGNQRLGVSTPDI 301
N + G+ SG S P K TFY L + I+VG+ +L V
Sbjct: 231 TNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAF 290
Query: 302 -------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF 346
+IDSG+ T L L + + A V P G+ L+LC +
Sbjct: 291 DLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAV 350
Query: 347 ---NSLSQVPEVTIHF--RGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----I 393
+ VP + +HF G DV + N++ V + C V G +++P I
Sbjct: 351 AHGDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTI 410
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
GN MQ + + YD+E+ +SF+P DC+
Sbjct: 411 IGNYMQQDMHLLYDLEKGMLSFQPADCS 438
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/296 (34%), Positives = 148/296 (50%), Gaps = 26/296 (8%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP MS+TY ++PC+S+ CA L
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS CQ+ ++YGDGS + G + + +TLG + G FGC + G
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
F+ G + LGGG SL+ Q T FSYCL P +S+ + F G+ P
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS-LGFLVLGVPPERAQLIPSF 336
Query: 269 VSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNL 320
VSTPL + A TFY + + AI V + L V P + VIDS T ++ LP L
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVP-PAVFSASSVIDSSTIISRLPPTAYQAL 395
Query: 321 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 373
+ S + A P L+ CY F + + P + + F GA V L + +
Sbjct: 396 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL 451
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 113/288 (39%), Gaps = 62/288 (21%)
Query: 156 LNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
+ QK+ G CQ+ ++YGDGS + G + + +TLG LP
Sbjct: 472 VQQKTLEGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDRQGLP----------- 520
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----P 266
L + G V FSYC +P S + + F T G+ P
Sbjct: 521 -LRTATQYGRV--------------------FSYC-IPPSPSSLGFITLGVPPQRAALVP 558
Query: 267 GVVSTPLTKAK----TFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYN 317
VSTPL + TFY + + AI V + L V P + VI S T ++ LP
Sbjct: 559 TFVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVP-PTVFSTSSVIASTTVISRLPPTAY 617
Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVK 374
L + + A P L+ CY F + + P + + F G A V L + ++
Sbjct: 618 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 677
Query: 375 VSEDIVCSVFKGI-TNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F T+ +P + GN+ Q V YD+ + + F+ C
Sbjct: 678 G-----CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/398 (31%), Positives = 185/398 (46%), Gaps = 69/398 (17%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
ADI ++ YLI +SIGTP +R+A+ DTGSDL+WTQC C C+ Q P FD
Sbjct: 93 DADI---DSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-C--HVCFAQPFPTFDALA 146
Query: 140 SSTYKSLPCSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL------ 189
S T ++PCS C S L+ + + C Y Y D S ++G + +T T
Sbjct: 147 SQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGN 206
Query: 190 -GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
GS VA+P + FGCG N G+F S +GI G G +SL SQ++ +FS+C
Sbjct: 207 NGSKAHAGVAVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKV---ARFSHCFT 263
Query: 249 PVSSTKI------------NFGTNGIVSGPGVVSTPLTKAK-TFYVLTIDAISVGNQRLG 295
++ + N G + +GP V STP + + Y LT+ I+VG RL
Sbjct: 264 AIADARTSPVFLGGAPGPDNLGAH--ATGP-VQSTPFANSNGSLYYLTLKGITVGKTRLP 320
Query: 296 VST------------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-- 341
++ +IDSGT + LP +L + + ++ PVA+ + +
Sbjct: 321 LNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKL-PVANESAADAES 379
Query: 342 -LCYS---------FNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDI------VCSVFK 385
LC+ +P+V +H GAD L R ++ + + ED +C V
Sbjct: 380 TLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMN 439
Query: 386 GITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+S + I GN Q N V YD+E+ + F P C K
Sbjct: 440 SAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARCDK 477
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 121/359 (33%), Positives = 172/359 (47%), Gaps = 34/359 (9%)
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
IPN A +L ISIG PP +L + DTGSDL W C PC +CY Q P F P SSTY+
Sbjct: 72 IPNPAAFLANISIGNPPVPQLLLIDTGSDLTWIHCLPC---KCYPQTIPFFHPSRSSTYR 128
Query: 145 SLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ C S+ A + + +G NCQY + Y D S + G LA E +T ++ ++ I
Sbjct: 129 NASCVSAPHAMPQIFRDEKTG-NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNI 187
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
FGCG +N G +K +G++GLG G S++++ KFSYC S T + N +
Sbjct: 188 VFGCGQDNSGF--TKYSGVLGLGPGTFSIVTR---NFGSKFSYCF--GSLTNPTYPHNIL 240
Query: 263 VSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTL 309
+ G G TPL + Y L + AIS G + L + S VID+G +
Sbjct: 241 ILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCSP 300
Query: 310 TFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN---SLSQVPEVTIHFR-GAD 363
T L + L + ++ + V D CY N L P VT HF GA+
Sbjct: 301 TILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAE 360
Query: 364 VKLSRSNFFV-KVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ L + FV S D C T + + + G + Q N+ VGY++ V F+ TDC
Sbjct: 361 LALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 181/368 (49%), Gaps = 39/368 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y I + +GTPP + DTGSDL W QC PC +C+ Q+ P +DP SS+Y+++ C
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YECFEQNGPHYDPGQSSSYRNIGC 236
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
S+C ++ + C N C Y YGD S + G+ A ET +T+ S +
Sbjct: 237 HDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRR 296
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ ++GLG G +S SQ+++ FSYCLV + S+
Sbjct: 297 VENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSS 355
Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI------ 301
K+ FG + ++S P + T L K TFY + I +I VG + + +
Sbjct: 356 KLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDG 415
Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEV 355
+IDSGTTL++ + + + ++ PV LE CY+ + Q +P+
Sbjct: 416 SGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDF 475
Query: 356 TIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQT 412
I F GA N+F+++ ++VC G +++ I GN Q NF + YD ++
Sbjct: 476 GIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSR 535
Query: 413 VSFKPTDC 420
+ F PT C
Sbjct: 536 LGFAPTKC 543
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 130/382 (34%), Positives = 186/382 (48%), Gaps = 37/382 (9%)
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
R +N + N ++ +S ASQ Y RI +G P V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212
Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
+PC CY Q P+FDPK SS+Y L C S QC L++ +C +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G LATET + + ++P + GCG +N GLF G++GLGGG ISL SQ+ T
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF-VGADGLIGLGGGAISLSSQLEAT 327
Query: 239 IAGKFSYCLVPV---SSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQ 292
FSYCLV + SS+ ++F + +++PL K TF + + +SVG +
Sbjct: 328 ---SFSYCLVDLDSESSSTLDFNADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGGK 381
Query: 293 RLGVSTPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
L +S+ I++DSGTT+T +P L + + P A +
Sbjct: 382 PLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDT 441
Query: 343 CYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIM 398
CY +S S +VP + G + ++L N ++V S C F T + I GN+
Sbjct: 442 CYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQ 501
Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
Q V YD+ V F C
Sbjct: 502 QQGIRVSYDLANSLVGFSTDKC 523
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 174/364 (47%), Gaps = 38/364 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y ++ +GTP T L V DTGSD++W QC PC CY Q +FDP+ S +Y ++
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAVD 176
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + C L+ C C Y V+YGDGS + G+ A+ET+T + + + G
Sbjct: 177 CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAIG 232
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---------PVSSTKIN 256
CG +N GLF + + ++GLG G +S SQ+ + FSYCLV S+ +
Sbjct: 233 CGHDNEGLFIAASG-LLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 291
Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
FG + + G TP+ + TFY + + SVG R+ GVS D +
Sbjct: 292 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 351
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 358
++DSGT++T L + + + V+ SL + CY+ + +VP V++H
Sbjct: 352 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 411
Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L N+ + V + C G V I GNI Q F V +D + Q V F
Sbjct: 412 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 471
Query: 417 PTDC 420
P C
Sbjct: 472 PKSC 475
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 91/260 (35%), Positives = 145/260 (55%), Gaps = 26/260 (10%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVS--GPGVVSTPLTK 275
+VGLG G +SL+SQ+ +F+YCL P +S K+ G + + ++ P+ +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272
Query: 276 A---KTFYVLTIDAISVGNQ 292
++Y L +D + +G++
Sbjct: 273 DPRYPSYYYLNLDGLLIGDR 292
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 128/421 (30%), Positives = 194/421 (46%), Gaps = 49/421 (11%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
+LIH S P Y +ET R+ + S RL + S+ + A + P+
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLT 97
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
L+ +SIG P +L V DTGSD++W C PC + C LFDP MSST+ L
Sbjct: 98 GRTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPC--TNCDNHLGLLFDPSMSSTFSPLC 155
Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC C C + +++SY D S ++G + + +T + +
Sbjct: 156 KTPCGFKGC------KCDPI--PFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVII 207
Query: 205 GCGTNNGGLFNSK--TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
GCG N G FNS GI+GL G SL +Q I KFSYC+ ++ N+ +
Sbjct: 208 GCGHNIG--FNSDPGYNGILGLNNGPNSLATQ----IGRKFSYCIGNLADPYYNYNQLRL 261
Query: 263 VSGPGV--VSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLT 310
G + STP FY +T++ ISVG +RL ++ T +++DSGTT+T
Sbjct: 262 GEGADLEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTIT 321
Query: 311 FLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYS---FNSLSQVPEVTIHF-RGADV 364
+L + L + + ++++ + V +LCY L P VT HF GAD+
Sbjct: 322 YLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADL 381
Query: 365 KLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
L +FF + +DI C + T S + G + Q ++ VGYD+ Q V F+ D
Sbjct: 382 ALDTGSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRID 440
Query: 420 C 420
C
Sbjct: 441 C 441
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 176/364 (48%), Gaps = 38/364 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y ++ +GTP T L V DTGSD++W QC PC CY Q +FDP+ S +Y ++
Sbjct: 125 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAVD 182
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + C L+ C C Y V+YGDGS + G+ A+ET+T + + + G
Sbjct: 183 CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAIG 238
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVS--STKIN 256
CG +N GLF + + ++GLG G +S SQ+ + FSYCLV P S S+ +
Sbjct: 239 CGHDNEGLFIAASG-LLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 297
Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
FG + + G TP+ + TFY + + SVG R+ GVS D +
Sbjct: 298 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 357
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 358
++DSGT++T L + + + V+ SL + CY+ + +VP V++H
Sbjct: 358 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 417
Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L N+ + V + C G V I GNI Q F V +D + Q V F
Sbjct: 418 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 477
Query: 417 PTDC 420
P C
Sbjct: 478 PKSC 481
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 136/448 (30%), Positives = 212/448 (47%), Gaps = 54/448 (12%)
Query: 1 MATFL-SCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
MA F S +F L LCF + + + + L+H Y+ +++A
Sbjct: 1 MAIFFTSPLFFLIILCFSISVVHLSASPTLVLNLVH----SYHIYSRKPPHVYHIKEA-- 54
Query: 60 RSLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
S+ RL + ++ I + + IIP +L+ ISIG+PP +L DT SDL+W
Sbjct: 55 -SVERLEYLKAKTTGDIIAHLSPNVPIIPQA--FLVNISIGSPPITQLLHMDTASDLLWI 111
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGS 176
QC PC CY Q P+FDP S T+++ C +SQ + + K + + +C+YS+ Y D +
Sbjct: 112 QCLPC--INCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDT 169
Query: 177 FSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
S G LA E + + + + AL + FGCG +N G TGI+GLG G+ SL+ +
Sbjct: 170 GSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLVHR 228
Query: 235 MRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV-----STPLTKAKTFYVLTIDAISV 289
KFSYC + ++ N +V G +TPL FY +TI+AISV
Sbjct: 229 F----GKKFSYCFGSLDDP--SYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISV 282
Query: 290 G-----------NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
N+ +ID+G +LT L + L + + + E + A
Sbjct: 283 DGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVS 342
Query: 339 SLEL----CYSFN-----SLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGI 387
++ CY+ N S P VT HF GA++ L + F+K+S ++ C +V G
Sbjct: 343 QDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGN 402
Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
NS+ G Q ++ +GYD+E VSF
Sbjct: 403 LNSI---GATAQQSYNIGYDLEAMEVSF 427
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 114/368 (30%), Positives = 183/368 (49%), Gaps = 39/368 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + IGTPP + DTGSDL W QC PC C+ Q+ P +DPK SS+++++ C
Sbjct: 88 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--HDCFEQNGPYYDPKESSSFRNIGC 145
Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQA--VA 198
+C ++ C N C Y YGD S + G+ ATE TV L S TG++
Sbjct: 146 HDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKR 205
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ + + G +S SQ+++ FSYCLV + S+
Sbjct: 206 VENVMFGCGHWNRGLFHGASGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 264
Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI---- 301
K+ FG + +++ P + T L K TFY + I +I VG + L + ST ++
Sbjct: 265 KLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDG 324
Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEV 355
++DSGTTL++ + + ++ P+ L+ CY+ + + ++ P+
Sbjct: 325 VGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKIDLPDF 384
Query: 356 TIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
I F GA N+F+++ E++VC G S + I GN Q NF V YD ++
Sbjct: 385 GILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYDTKKSR 444
Query: 413 VSFKPTDC 420
+ + P +C
Sbjct: 445 LGYAPMNC 452
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 119/375 (31%), Positives = 185/375 (49%), Gaps = 50/375 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMSSTYK 144
++ + + + IGTPP R + DTGSDLIWTQC+ + + P++DP SST+
Sbjct: 87 SDQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFA 146
Query: 145 SLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
LPCS C + K+C+ N C Y YG + + G LA+ET T G+ +AV+L
Sbjct: 147 FLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGAR--RAVSLR- 202
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FG 258
+ FGCG + G TGI+GL +SLI+Q++ +FSYCL P + K + FG
Sbjct: 203 LGFGCGALSAGSLIG-ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFG 258
Query: 259 ---------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST------PD--- 300
T + +VS P+ +Y + + IS+G++RL V PD
Sbjct: 259 AMADLSRHKTTRPIQTTAIVSNPVK--TVYYYVPLVGISLGHKRLAVPAASLAMRPDGGG 316
Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS-------- 350
++DSG+T+ +L + + + ++ PVA+ T ELC+ +
Sbjct: 317 GTIVDSGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAV 375
Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 407
QVP + +HF GA + L R N+F + ++C T+ V I GN+ Q N V +D
Sbjct: 376 QVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 435
Query: 408 IEQQTVSFKPTDCTK 422
++ SF PT C +
Sbjct: 436 VQHHKFSFAPTQCDQ 450
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 119/388 (30%), Positives = 187/388 (48%), Gaps = 58/388 (14%)
Query: 79 ASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-----PSQCYMQDS 132
A+ + P ++ + + + IGTPP R + DTGSDLIWTQC + Q
Sbjct: 71 AADVPVAPLSDQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQRE 130
Query: 133 PLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTL 189
PL++P+ SS++ LPCS C + K+C+ N C Y YG + G LA+ET T
Sbjct: 131 PLYEPRRSSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAE-AGGVLASETFTF 189
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
G V+LP + FGCG + G +G++GL G +SL+SQ+ +FSYCL P
Sbjct: 190 G--VNAKVSLP-LGFGCGALSAGDLVG-ASGLMGLSPGIMSLVSQLSVP---RFSYCLTP 242
Query: 250 VSSTKI------------NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQR---- 293
+ K + T G V ++ P + +YV + +S+G +R
Sbjct: 243 FAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLV-GLSLGTKRLDVP 301
Query: 294 ---LGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGS----L 340
LG+ PD ++DSG+T+++L + + +V +++EA PVA+ T
Sbjct: 302 ATSLGMIKPDGSGGTIVDSGSTMSYLEE---TAFRAVKKAVVEAVRLPVANGTDEDYDDY 358
Query: 341 ELCYSFNS-----LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVP 392
ELC++ + + P + +HF GA + L R N+F + ++C + V
Sbjct: 359 ELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVS 418
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN+ Q N V +D+ Q SF PT C
Sbjct: 419 IIGNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 184/378 (48%), Gaps = 52/378 (13%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
+ Y + + +GTP + + DTGSDL W QC P PP +P +D
Sbjct: 56 SGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 108
Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
SS+Y+ +PC+ +C L SCS + C Y+ Y D S + G LA ET+++
Sbjct: 109 SSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 168
Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGK 242
G+ + + + + GC + G +G++GLG G ISL +Q R T + G
Sbjct: 169 RSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGI 228
Query: 243 FSYCLVPV--SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GV 296
FSYCLV S +F G + TP+ + A++FY + + ++V + + G+
Sbjct: 229 FSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 288
Query: 297 STPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
++ D + DSGTTL++L + S +L +++ I + ELCY+
Sbjct: 289 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 348
Query: 347 NSLSQ-VPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNF 402
+ + +P++ + F+G V +L +N+ V V+E++ C + + TN I GN++Q +
Sbjct: 349 TRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDH 408
Query: 403 LVGYDIEQQTVSFKPTDC 420
+ YD+ + + FK + C
Sbjct: 409 HIEYDLAKARIGFKWSPC 426
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 173/347 (49%), Gaps = 32/347 (9%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
+ +G P V DTGSD+ W QC PC + CY Q +P+FDP++SS+Y + C S QC
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60
Query: 154 ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL 213
L++ C+ +C Y V YGDGSF+ G LATET+T + ++P I+ GCG +N GL
Sbjct: 61 QLLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSN----SIPNISIGCGHDNEGL 116
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS--T 271
F G++GLGGG IS+ SQ++ A FSYCLV + S +F T + P S +
Sbjct: 117 F-VGADGLIGLGGGAISISSQLK---ASSFSYCLVDIDSP--SFSTLDFNTDPPSDSLIS 170
Query: 272 PLTKAKTF----YVLTIDAISVGNQRLGVSTPD----------IVIDSGTTLTFLPQGYN 317
PL K F YV I +SVG + L +S+ I++DSGTT+T LP
Sbjct: 171 PLVKNDRFPSFRYVKVI-GMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVY 229
Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVK 374
L + P A + CY +S S +VP + G + ++L N ++
Sbjct: 230 EVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQ 289
Query: 375 V-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V S C F T + I GN Q V YD+ V F C
Sbjct: 290 VDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 115/364 (31%), Positives = 174/364 (47%), Gaps = 38/364 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y ++ +GTP T L V DTGSD++W QC PC CY Q +FDP+ S +Y ++
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAVD 176
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + C L+ C C Y V+YGDGS + G+ A+ET+T + + + G
Sbjct: 177 CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAIG 232
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---------PVSSTKIN 256
CG +N GLF + + ++GLG G +S +Q+ + FSYCLV S+ +
Sbjct: 233 CGHDNEGLFIAASG-LLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 291
Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
FG + + G TP+ + TFY + + SVG R+ GVS D +
Sbjct: 292 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 351
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 358
++DSGT++T L + + + V+ SL + CY+ + +VP V++H
Sbjct: 352 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 411
Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA V L N+ + V + C G V I GNI Q F V +D + Q V F
Sbjct: 412 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 471
Query: 417 PTDC 420
P C
Sbjct: 472 PKSC 475
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 130/382 (34%), Positives = 185/382 (48%), Gaps = 37/382 (9%)
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
R +N + N ++ +S ASQ Y RI +G P V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212
Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
+PC CY Q P+FDPK SS+Y L C S QC L++ +C +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G LATET + + ++P + GCG +N GLF G++GLGGG ISL SQ+ T
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF-VGAAGLIGLGGGAISLSSQLEAT 327
Query: 239 IAGKFSYCLVPV---SSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQ 292
FSYCLV + SS+ ++F + +++PL K TF + + +SVG +
Sbjct: 328 ---SFSYCLVDLDSESSSTLDFNADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGGK 381
Query: 293 RLGVSTPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
L +S+ I++DSGTT+T +P L + + P A +
Sbjct: 382 PLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDT 441
Query: 343 CYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIM 398
CY +S S +VP + G + ++L N +V S C F T + I GN+
Sbjct: 442 CYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQ 501
Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
Q V YD+ V F C
Sbjct: 502 QQGIRVSYDLANSLVGFSTDKC 523
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 171/370 (46%), Gaps = 56/370 (15%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E V DT S+L W QC+PC C+ Q PLFDP S +Y ++PC+
Sbjct: 119 NYVATVGLGA--AEATVVVDTASELTWVQCQPC--ESCHDQQDPLFDPSSSPSYAAVPCN 174
Query: 150 SSQCASLNQKSCSGVN-----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C +L +G + C Y++SY DGS+S G LA + + L GQ +
Sbjct: 175 SSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL---AGQDIE 231
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTK 254
G FGCGT+N G T+G++GLG +SL+SQ G FSYCL P+ SS
Sbjct: 232 --GFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCL-PMRESGSSGS 288
Query: 255 INFGTN-------------GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL---GVST 298
+ G + +VS G + P FY L + I+VG Q + S
Sbjct: 289 LVLGDDSSAYRNSTPIVYTAMVSDSGPLQGP------FYFLNLTGITVGGQEVESPWFSA 342
Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
++IDSGT +T L P YN+ +S + E P A L+ C++ L QVP +
Sbjct: 343 GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAE-YPQAPAFSILDTCFNLTGLKEVQVPSL 401
Query: 356 TIHFRGA-DVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQ 410
F G+ +V++ VS D VC + + I GN Q N V +D
Sbjct: 402 KFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDTLG 461
Query: 411 QTVSFKPTDC 420
+ F C
Sbjct: 462 SQIGFAQETC 471
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 140/459 (30%), Positives = 199/459 (43%), Gaps = 65/459 (14%)
Query: 17 YVVSPIEAQTGGFSVELIHRDSPKSPFY--NSSETPYQRLRDALTRSLNRLNHFN----- 69
+ VSP + +GG L H SP SP S P + L L +R H
Sbjct: 58 HRVSP--SSSGGSWAPLSHLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSG 115
Query: 70 -------------QNSSISSSKASQADIIPNNANYLIRISI-----------GTPPTERL 105
Q++ ++SS A+ ++ ++ + I P +
Sbjct: 116 NAAPMDDAGEETPQSTQVTSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQS 175
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSG 163
V DT SD+ W QC PCP QCY Q L+DP S PCSS QC SL + + C+G
Sbjct: 176 MVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTG 235
Query: 164 VN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSK 217
CQY V Y DGS ++G ++ +TL + AV+ FGC G FN+K
Sbjct: 236 AGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS--KFQFGCSHALLRPGSFNNK 293
Query: 218 TTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTK--INFGTNGIVSGPGVVSTPL 273
T G + LG G SL SQ + T + FSYCL P S K ++ G + V TP+
Sbjct: 294 TAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAV-TPM 352
Query: 274 TKAK---TFYVLTIDAISVGNQRL----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
K+K Y++ + I V QRL V + +DS T +T LP L + +
Sbjct: 353 LKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRA 412
Query: 327 MIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSV 383
+ A P G L+ CY F + V P+VT+ F R A V+L S + C
Sbjct: 413 QMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLD-----SCLA 467
Query: 384 FKGITNS-VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F N +P I GN+ Q V Y+++ +V F+ C
Sbjct: 468 FAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 173/368 (47%), Gaps = 38/368 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + IGTPP + DTGSDL W QC PC C+ Q P +DPK SS+++++ C
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKESSSFENITC 247
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
+C ++ K C N C Y YGD S + G+ A ET T+ TT +
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ + G +S SQ+++ FSYCLV + S+
Sbjct: 308 VENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFASQLQSIYGHSFSYCLVDRNSDTSVSS 366
Query: 254 KINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI------ 301
K+ FG + ++S P + T + TFY + I +I V + L +
Sbjct: 367 KLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEG 426
Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
+IDSGTTLT+ + + I+ + + L+ CY+ + + ++P+
Sbjct: 427 GGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDF 486
Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 413
I F GA N+F+++ D+VC G S + I GN Q NF + YD+++ +
Sbjct: 487 GILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRL 546
Query: 414 SFKPTDCT 421
+ P CT
Sbjct: 547 GYAPMKCT 554
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 123/355 (34%), Positives = 178/355 (50%), Gaps = 29/355 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
+ +++ + +GTP + DTGSDL W QC+PC S C+ Q PLFDP SSTY +
Sbjct: 140 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 199
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ C QCA+ CS N C Y V YGDGS + G L+ +T+ L S+ AL G
Sbjct: 200 VHCGEPQCAAAGDL-CSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSR----ALTGFP 254
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
FGCGT N G F + G++GLG G++SL SQ + FSYCL P S++ + T G
Sbjct: 255 FGCGTRNLGDFG-RVDGLLGLGRGELSLPSQAAASFGAVFSYCL-PSSNSTTGYLTIGAT 312
Query: 264 ----SGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTF 311
+G + L K + +FY + + +I +G L V P + ++DSGT LT+
Sbjct: 313 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVP-PAVFTRGGTLLDSGTVLTY 371
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 371
LP + L +E A P L+ CY F S+V + FR D + +F
Sbjct: 372 LPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDF 431
Query: 372 F---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F + + E++ C F + T +P I GN Q + V YD+ + + F P C
Sbjct: 432 FGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 121/380 (31%), Positives = 178/380 (46%), Gaps = 64/380 (16%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
SS + QA + Y + IS+GTP VADTGSDLIWTQC PC ++C+ Q +P F
Sbjct: 71 SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128
Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SST+ LPC+SS C L + ++C+ C Y+ YG G ++ G LATET+ +G
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PV 250
+ P + FGC T N GLG D+ + G+FSYCL
Sbjct: 186 ---ASFPSVAFGCSTEN------------GLGQLDLGV---------GRFSYCLRSGSAA 221
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTK----AKTFYVLTIDAISVGNQRLGVSTPDI----- 301
++ I FG+ ++ V STP ++Y + + I+VG L V+T
Sbjct: 222 GASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQN 281
Query: 302 ------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---- 350
++DSGTTLT+L + GY + +S + V + T L+LC+
Sbjct: 282 GLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTV-NGTRGLDLCFKSTGGGGGGI 340
Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNF 402
VP + + F G + + +F V D SV +P + GN+MQ +
Sbjct: 341 AVPSLVLRFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDM 399
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
+ YD++ SF P DC K
Sbjct: 400 HLLYDLDGGIFSFAPADCAK 419
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 136/450 (30%), Positives = 196/450 (43%), Gaps = 89/450 (19%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
G +EL H D+ + Y E R+R A R+ RL + I SQ
Sbjct: 22 GIRLELTHVDAKE--HYTVEE----RVRRATERTHRRLASMGGVTAPIHWGGQSQ----- 70
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y+ IG PP A+ DTGS+LIWTQC C P+ C+ Q+ P +DP S +++
Sbjct: 71 ----YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPT-CFRQNLPYYDPSRSRAARAV 125
Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C+ + CA ++ C N C YG G+ + G LATE +T S T V F
Sbjct: 126 GCNDAACALGSETQCLSDNKTCAVVTGYGAGNIA-GTLATENLTFQSETVSLV------F 178
Query: 205 GC--------GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-----VS 251
GC G+ NG +GI+GLG G +SL SQ+ T +FSYCL P +
Sbjct: 179 GCIVVTKLSPGSLNGA------SGIIGLGRGKLSLPSQLGDT---RFSYCLTPYFEDTIE 229
Query: 252 STKINFGT-----NGIVSGPGVVSTPLTKA------KTFYVLTIDAISVGNQRLGVSTPD 300
+ + G NG S V + P ++ TFY L + I+ G +L V +
Sbjct: 230 PSHMVVGASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAA 289
Query: 301 I-------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCY 344
IDSG LT L L + ++ + A QP+A TG +LC
Sbjct: 290 FDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTG-FDLCV 348
Query: 345 SFNSLSQ-VPEVTIHF-----RGADVKLSRSNFFVKVSEDIVCS-VFKGI------TNSV 391
+ + VP + +HF G D+ + +N++ V C VF + N
Sbjct: 349 ALKDAERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNET 408
Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ GN MQ N V YD+ +SF+P DC+
Sbjct: 409 TVIGNYMQQNMHVLYDLAGGVLSFQPADCS 438
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 184/378 (48%), Gaps = 52/378 (13%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
+ Y + + +GTP + + DTGSDL W QC P PP +P +D
Sbjct: 24 SGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 76
Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
SS+Y+ +PC+ +C L SCS + C Y+ Y D S + G LA ET+++
Sbjct: 77 SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136
Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGK 242
G+ + + + + GC + G +G++GLG G ISL +Q R T + G
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGI 196
Query: 243 FSYCLVPV--SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GV 296
FSYCLV S +F G + TP+ + A++FY + + ++V + + G+
Sbjct: 197 FSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 256
Query: 297 STPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
++ D + DSGTTL++L + S +L +++ I + ELCY+
Sbjct: 257 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 316
Query: 347 NSLSQ-VPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNF 402
+ + +P++ + F+G V +L +N+ V V+E++ C + + TN I GN++Q +
Sbjct: 317 TRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDH 376
Query: 403 LVGYDIEQQTVSFKPTDC 420
+ YD+ + + FK + C
Sbjct: 377 HIEYDLAKARIGFKWSPC 394
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 176/368 (47%), Gaps = 38/368 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q P +DPK SS+++++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISC 250
Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
+C ++ C N C Y YGDGS + G+ A ET T+ TT +
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ + G +S SQM++ FSYCLV + S+
Sbjct: 311 VENVMFGCGHWNRGLFHGAAGLLGLGKGP-LSFASQMQSLYGQSFSYCLVDRNSNASVSS 369
Query: 254 KINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTP-------- 299
K+ FG + ++S P + T K TFY + I+++ V ++ L +
Sbjct: 370 KLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEG 429
Query: 300 --DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
+IDSGTTLT+ + + I+ + + L+ CY+ + + ++P+
Sbjct: 430 AGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDF 489
Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
I F GA N+F+++ D+VC ++ +++ I GN Q NF + YD+++ +
Sbjct: 490 GILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRL 549
Query: 414 SFKPTDCT 421
+ P C
Sbjct: 550 GYAPMKCA 557
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 177/355 (49%), Gaps = 34/355 (9%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C +C+ Q +PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--GRCFEQGTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
+ +GIVGLG SL++Q T FSYCL P + K + G++ ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGKNSALFLGSSAKLA 217
Query: 265 GPG-VVSTPLT-------KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLPQ 314
G G STP +Y + ++ + G+ + + S +++D+ + ++FL
Sbjct: 218 GGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVD 277
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFF 372
G + ++ + A P+A P +LC+ + S P++ FR GA + + +N+
Sbjct: 278 GAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYL 337
Query: 373 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ VC S T + + G++ Q N +D++++T+SF+P DCTK
Sbjct: 338 LDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/369 (29%), Positives = 175/369 (47%), Gaps = 40/369 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y I + +GTPP + DTGSDL W QC+PC C+ Q+ P ++P SS+Y+++ C
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGPHYNPNESSSYRNISCYD 227
Query: 151 SQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
+C ++ + C N C Y Y DGS + G+ A ET T+ T + +
Sbjct: 228 PRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVV 287
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
+ FGCG N G F+ + G +S SQ+++ FSYCL + S+K+
Sbjct: 288 DVMFGCGHWNKGFFHGAGGLLGLGRGP-LSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKL 346
Query: 256 NFGTNG-IVSGPGVVSTPL-----TKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
FG + +++ + T L T TFY L I +I VG + L +
Sbjct: 347 IFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVG 406
Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 357
+IDSG+TLTF P + I+ Q +A + CY+ + QV P+ I
Sbjct: 407 GTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGI 466
Query: 358 HF-RGADVKLSRSNFFVKVSED-IVC-SVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 413
HF GA N+F + D ++C ++ K +S + I GN++Q NF + YD+++ +
Sbjct: 467 HFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSRL 526
Query: 414 SFKPTDCTK 422
+ P C +
Sbjct: 527 GYSPRRCAE 535
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 192/425 (45%), Gaps = 56/425 (13%)
Query: 40 KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
KSPF + ++ R SL R S + S AS + Y + + IG
Sbjct: 39 KSPFPSPTQALALDTRRLHFLSLRRKPIPFVKSPVVSGAAS------GSGQYFVDLRIGQ 92
Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
PP L +ADTGSDL+W +C C C + + +F P+ SST+ C C + +
Sbjct: 93 PPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 150
Query: 159 KSCSGV--------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
+ + C Y Y DGS ++G A ET +L +++G+ L + FGCG
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRI 210
Query: 211 GGLFNSKTT-----GIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKINFG 258
G S T+ G++GLG G IS SQ+ KFSYCL+ P S I G
Sbjct: 211 SGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNG 270
Query: 259 TNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI-----------VID 304
+GI + TPL + TFY + + ++ V +L + P I V+D
Sbjct: 271 GDGISK---LFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRID-PSIWEIDDSGNGGTVVD 326
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP-TGSLELCYSFNSLSQ----VPEVTIHF 359
SGTTL FL + ++++ + ++ P+AD T +LC + + +++ +P + F
Sbjct: 327 SGTTLAFLAEPAYRSVIAAVRRRVKL-PIADALTPGFDLCVNVSGVTKPEKILPRLKFEF 385
Query: 360 RGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTVSFK 416
G V + N+F++ E I C + + V + GN+MQ FL +D ++ + F
Sbjct: 386 SGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFS 445
Query: 417 PTDCT 421
C
Sbjct: 446 RRGCA 450
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/207 (37%), Positives = 118/207 (57%), Gaps = 10/207 (4%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
IL + F + I G F+ L HRDS SP SS + Y RL +A RSL+R
Sbjct: 11 LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
++ + + QA + P + YL+ +SIGTPP + + +ADTGSDL+W QC PC +CY
Sbjct: 70 LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCL--KCY 127
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
Q P+FDP S+++ +PC+S C +++ C C YS +YGD +++ G+L E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T+GS++ ++V GCG +GG F
Sbjct: 188 TIGSSSVKSV------IGCGHESGGGF 208
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 125/436 (28%), Positives = 200/436 (45%), Gaps = 54/436 (12%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A +E +HR + +S + +P R AL+ + ++
Sbjct: 98 ADKDAVRIETMHRRAARSGGDRTPASPSSSPRRALSERM--------------VATVESG 143
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP SS+Y
Sbjct: 144 VAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFDQVGPVFDPAASSSY 201
Query: 144 KSLPCSSSQCASLN----QKSCSGV---NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQ 195
+++ C +C + ++C +C Y YGD S + G+LA E+ T+ T G
Sbjct: 202 RNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 261
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS--- 252
+ + + FGCG N GLF+ ++GLG G +S SQ+R FSYCLV S
Sbjct: 262 SRRVDDVVFGCGHWNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVA 320
Query: 253 TKINFG----TNGIVSGPGVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTP----- 299
+K+ FG + P + T + A TFY + + + VG + L +S+
Sbjct: 321 SKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVG 380
Query: 300 -------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 350
+IDSGTTL+ F+ Y + + M + P+ L CY+ + +
Sbjct: 381 EGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDR 440
Query: 351 -QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGY 406
+VPE+++ F GA N+F+++ D I+C G + + I GN Q NF V Y
Sbjct: 441 PEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVVY 500
Query: 407 DIEQQTVSFKPTDCTK 422
D++ + F P C +
Sbjct: 501 DLKNNRLGFAPRRCAE 516
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 127/419 (30%), Positives = 187/419 (44%), Gaps = 68/419 (16%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+ RD + F NS Y S N NH + N ++ + N+
Sbjct: 87 EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 127
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ ++ GTPP + + DTGS + WTQC+ C C FD SSTY C S
Sbjct: 128 LVDVAFGTPPQKFKLILDTGSSITWTQCKAC--VHCLKDSHRHFDSLASSTYSFGSCIPS 185
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
V Y+++YGD S S GN +T+TL + FGCG NN
Sbjct: 186 T-----------VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNE 230
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG--------- 261
G F S G++GLG G +S +SQ + FSYCL +S + FG
Sbjct: 231 GDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKATSQSSSLKF 290
Query: 262 --IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
+V+GPG ++ L ++ ++V +D ISVGN+RL + ++P +IDSGT +T LPQ
Sbjct: 291 TSLVNGPG--TSGLEESGYYFVKLLD-ISVGNKRLNIPSSVFASPGTIIDSGTVITRLPQ 347
Query: 315 GYNSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLS 367
S L + + P+++ L+ CY+ + V PE +HF GADV+L+
Sbjct: 348 RAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLN 407
Query: 368 RSNFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+C F G + S + I GN Q + V YDI + + F C+
Sbjct: 408 GKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCS 466
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 131/422 (31%), Positives = 195/422 (46%), Gaps = 37/422 (8%)
Query: 30 SVELIHRDSPKSPFYNS-SETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
S+ ++HR P SP + S P + L R +R++ + + SS+K + N
Sbjct: 72 SLTVVHRHGPCSPLRSRGSGAPSHT--EILRRDQDRVDAIRRKVTASSNKPKGGVSLLAN 129
Query: 89 -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
NY+ + +GTP TE + DTGSD W QC+PC + CY Q P+FDP SS
Sbjct: 130 WGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPC--ADCYEQRDPVFDPTASS 187
Query: 142 TYKSLPCSSSQCASLNQKSCSGV-------NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
TY ++PC + +C L S S NC Y VSY D S + G+LA +T+TL +
Sbjct: 188 TYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPS 247
Query: 195 QAVA--LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPV 250
+ A +PG FGCG +N G F + G++GLG G SL SQ+ FSYCL P
Sbjct: 248 PSPADTVPGFVFGCGHSNAGTFG-EVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS 306
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV------STPDIVID 304
++ ++FG + + T Y L + I V + + V + +ID
Sbjct: 307 AAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIID 366
Query: 305 SGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR 360
SGT + L P Y + S S+M + P+ + + CY F + ++P V + F
Sbjct: 367 SGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFA 426
Query: 361 -GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
GA V L S + D+ + + N + I GN Q V YD+ Q + F
Sbjct: 427 DGATVHLHPSGVLYTWN-DVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRIGFGRK 485
Query: 419 DC 420
C
Sbjct: 486 GC 487
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 121/354 (34%), Positives = 175/354 (49%), Gaps = 27/354 (7%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
+ +++ + +GTP + DTGSDL W QC+PC S C+ Q PLFDP SSTY +
Sbjct: 145 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 204
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ C QCA+ CS N C Y V YGDGS + G L+ +T+ L S+ AL G
Sbjct: 205 VHCGEPQCAAAGGL-CSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSR----ALAGFP 259
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
FGCGT N G F + G++GLG G++SL SQ + FSYCL P S++ + T G
Sbjct: 260 FGCGTRNLGDFG-RVDGLLGLGRGELSLPSQAAASFGAVFSYCL-PSSNSTTGYLTIGAT 317
Query: 264 ----SGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
+G + L K + +FY + + +I +G L V + ++DSGT LT+L
Sbjct: 318 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGGTLLDSGTVLTYL 377
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 372
P L +E A P L+ CY F S+V + FR D + +FF
Sbjct: 378 PAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFF 437
Query: 373 ---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + E++ C F + +P I GN Q + V YD+ + + F P C
Sbjct: 438 GVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 127/459 (27%), Positives = 201/459 (43%), Gaps = 50/459 (10%)
Query: 5 LSCVF----ILFFLCFYVVSPIEAQTGGFSVELIHRDSPK--SPFYNSSETPYQRLRDAL 58
+ C F +LF Y V + +++LIHR+S +P TP ++
Sbjct: 1 MECSFQTSLLLFITVSYFVVTESIKPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLT 60
Query: 59 TRSLNRLNHFNQNSSISSSKAS--QADIIP--NNANYLIRISIGTPPTERLAVADTGSDL 114
S R + QNS +S Q D+ + +L+ S+G PP +L + DTGS L
Sbjct: 61 DISSARFKYL-QNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSL 119
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
+W QC+PC P+F+P +SST+ C C C N C Y Y
Sbjct: 120 LWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYI 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G+ S G LA E +T + G V I FGCG NG S TGI+GLG SL
Sbjct: 180 SGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAV 239
Query: 234 QMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP--GVVSTP----LTKAKTFYVLTIDAI 287
Q+ KFSYC+ +++ N+G N +V G ++ P + Y + ++ I
Sbjct: 240 QL----GSKFSYCIGDLANK--NYGYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGI 293
Query: 288 SVGNQRLGVS---------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
SVG+ +L + +++DSGT T+L L + + S+++ P +
Sbjct: 294 SVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILD--PKLERFW 351
Query: 339 SLE-LCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSE----DIVCSVFK---- 385
+ LCY L P VT HF GA++ + ++ F +SE ++ C K
Sbjct: 352 FRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKE 411
Query: 386 --GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
G G + Q + +GYD++++ + + DC +
Sbjct: 412 HGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCVQ 450
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 114/365 (31%), Positives = 179/365 (49%), Gaps = 42/365 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
Y +I +G+PP E DTGSD++W C PCP +C ++ L+D K SST K+
Sbjct: 77 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKASSTSKN 134
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
+ C + C+ + Q G C Y V YGDGS S+G+ + +TL TG P
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKI 255
+ FGCG N G S GI+G G + S+ISQ+ ++ FS+CL ++ I
Sbjct: 195 EVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGI 254
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD--IVIDSGT 307
F G V P V +TPL + Y + + + V + L + D +IDSGT
Sbjct: 255 -FAI-GEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIIDSGT 312
Query: 308 TLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADV 364
TL +LPQ YNS + + + + T + C+SF N+ P V +HF + +
Sbjct: 313 TLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-L 368
Query: 365 KLSR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
KLS ++ + ED+ C ++ G+T V + G+++ +N LV YD+E + + +
Sbjct: 369 KLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWA 428
Query: 417 PTDCT 421
+C+
Sbjct: 429 DHNCS 433
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 166/358 (46%), Gaps = 39/358 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +IGTPP AV D +L+WTQC PC P C+ QD PLFDP SST++ LPC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
C S+ + S C+ C Y G + G T+T +G+ A + FGC
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAIGA------AKETLGFGCVV 167
Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG-TNGIVSG 265
+ +GIVGLG SL++QM T FSYCL SS + G T ++G
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLAG 224
Query: 266 PGVVSTPLT----------KAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFL 312
STP + +Y++ + I G L ++ +++D+ + ++L
Sbjct: 225 GKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYL 284
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNF 371
G L +++ + QPVA P +LC+S PE+ F GA + + +N+
Sbjct: 285 ADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDAPELVFTFDGGAALTVPPANY 344
Query: 372 FVKVSEDIVCSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ VC G I G++ Q N V +D++++T+SFKP DC+
Sbjct: 345 LLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 121/444 (27%), Positives = 205/444 (46%), Gaps = 52/444 (11%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
Q G ++ELIH+DSP+SP Y + P +++ L+H Q S +S++KA +
Sbjct: 10 QLDGLTMELIHKDSPQSPLYPGNLPPGEQILQPAACPFAGLHH--QTSMMSTNKAVMNRM 67
Query: 85 IPNNANY------LIRISIGT--PPTERLAVA------DTGSDLIWTQCEPC--PPSQCY 128
+ +Y L ++ +G+ + R DTG++L W QCE C + C+
Sbjct: 68 MSPLTSYGDPFLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCF 127
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
P + S +YK + C+ NQ C C Y+V+YG GS+++GNLA ET T
Sbjct: 128 PHKDPPYTSSQSKSYKPVSCNQHSFCEPNQ--CKEGLCAYNVTYGPGSYTSGNLANETFT 185
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF------NSKTTGIVGLGGGDISLISQMRTTIAGK 242
S G+ AL I+FGC T++ + + +G++G+G G S ++Q+ + GK
Sbjct: 186 FYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGK 245
Query: 243 FSYCLVP--VSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVST 298
FSYC+ +T + FG + +V + +T + + K Y + + ISV +L ++
Sbjct: 246 FSYCITANNTHNTYLRFGKH-VVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITK 304
Query: 299 PDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA----QPVADPTGSLELCY 344
D+ +ID+GT T L + L + +S+ + + + +LCY
Sbjct: 305 TDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCY 364
Query: 345 ---SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGNIM 398
S +P VT H AD+++ F+ +++ C +S I G
Sbjct: 365 EQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLS-DDSKTIIGAYQ 423
Query: 399 QTNFLVGYDIEQQTVSFKPTDCTK 422
Q YD + + +SF P DC K
Sbjct: 424 QMKQKFVYDTKARVLSFGPEDCEK 447
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 130/413 (31%), Positives = 190/413 (46%), Gaps = 67/413 (16%)
Query: 30 SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
S+E++H+ P S P +S + Q L +R + + +N + S+ KAS+A +
Sbjct: 18 SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 77
Query: 86 PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+A NY++ + +G+P + + DTGSDL WTQCEPC CY Q +FDP
Sbjct: 78 SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPC-VGYCYQQREHIFDPST 136
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S +Y ++ C S C L N CS C Y + YGDGS+S G A E ++L ST
Sbjct: 137 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 195
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
FGCG NN GLF T G++GL +SL+SQ FSYCL P SS+
Sbjct: 196 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCL-PSSSSS 250
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQ 314
+ + G SG G +KA F TP LP
Sbjct: 251 TGYLSFG--SGDGD-----SKAVKF------------------TPR-----------LPP 274
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSN- 370
S++ V ++ P L+ CY + +VP++ ++F GA++ L+
Sbjct: 275 TVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGI 334
Query: 371 -FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ +KVS+ VC F G + + V I GN+ Q V YD + V F P+ C
Sbjct: 335 IYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 118/362 (32%), Positives = 180/362 (49%), Gaps = 37/362 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +GTP + DTGSD++W C CP ++ +P +D SST KS+
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDADASSTAKSVS 143
Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
CS + C+ +NQ+S C SG CQY + YGDGS +NG L + V L TG Q + G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTI 203
Query: 203 TFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINF 257
FGCG+ G + GI+G G + S ISQ+ + + F++CL + I F
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGI-F 262
Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSGTTL 309
+VS P V +TP+ Y + ++AI VGN L +S+ ++IDSGTTL
Sbjct: 263 AIGEVVS-PKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTL 321
Query: 310 TFLPQG-YNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVK 365
+LP YN + +++S E V D S + + L + P VT F + +
Sbjct: 322 VYLPDAVYNPLMNQILASHQELNLHTVQD---SFTCFHYIDRLDRFPTVTFQFDKSVSLA 378
Query: 366 LSRSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ + +V ED C ++ G+ S+ I G++ +N LV YDIE Q + + +
Sbjct: 379 VYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHN 438
Query: 420 CT 421
C+
Sbjct: 439 CS 440
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 178/357 (49%), Gaps = 32/357 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ A Y++ ++IGTPP A+ D G +L+WTQC + C +C+ QD PLFD SST++
Sbjct: 47 SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104
Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
PC ++ C S+ +SC+G SF G + T+ V +G+ A +
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTN 260
FGC + ++G VGLG ++SL +QM T FSYCL P + K + G +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGAS 216
Query: 261 GIVSGP--GVVSTPLTKAKT--------FYVLTIDAISVGNQRLGV--STPDIVIDSGTT 308
++G G +TP K T Y+L ++AI GN + + S I++ + T
Sbjct: 217 AKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIMVSTATP 276
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKL 366
+T L +L ++ + A PV P + +LC+ S S P++ + F+ GA++ +
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTV 336
Query: 367 SRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
S++ D C G V I G++ Q N + +D++++T+SF+P DC+
Sbjct: 337 PVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 181/388 (46%), Gaps = 60/388 (15%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---------LFDPKMS 140
Y +R +GTP L VADTGSDL W +C P + S F P+ S
Sbjct: 94 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKS 153
Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----- 190
T+ +PC+S C+ SL+ G C Y Y DGS + G + TE+ T+
Sbjct: 154 KTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSS 213
Query: 191 ---STTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
+ L G+ GC G+ G F + + G++ LG ++S S + G+FSYC
Sbjct: 214 SSSKNKVKKAKLQGLVLGCTGSYTGPSFEA-SDGVLSLGYSNVSFASHAASRFGGRFSYC 272
Query: 247 LV----PVSSTK-INFGTNGIVS-------GPGVVSTPL---TKAKTFYVLTIDAISVGN 291
LV P ++T + FG N +S GPG TPL ++ + FY ++I AISV
Sbjct: 273 LVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDG 332
Query: 292 QRLGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLE 341
+ L + +++DSGT+LT L + +++ + + P DP E
Sbjct: 333 ELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAMDP---FE 389
Query: 342 LCYSFNSLSQ------VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPI 393
CY++ S S+ +P++ +HF G A ++ ++ + + + C V +G + +
Sbjct: 390 YCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPWPGISV 449
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
GNI+Q L +D++ + + FK + CT
Sbjct: 450 IGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 179/366 (48%), Gaps = 39/366 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + + +G+PP + DTGSDL W QC PC C+ Q+ +DPK S++YK++ C+
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--HDCFQQNGAFYDPKASASYKNITCND 212
Query: 151 SQCASLN----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVALP 200
+C ++ K C N C Y YGD S + G+ A ET T+ TT + +
Sbjct: 213 PRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVE 272
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----STKI 255
+ FGCG N GLF+ + G +S SQ+++ FSYCLV + S+K+
Sbjct: 273 NMMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 331
Query: 256 NFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI------ 301
FG + ++S P + T K TFY + I +I V + L + T +I
Sbjct: 332 IFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAG 391
Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVT 356
+IDSGTTL++ + + + ++ + + PV L+ C++ + + Q+PE+
Sbjct: 392 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPELG 451
Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVS 414
I F GA N F+ ++ED+VC G S I GN Q NF + YD ++ +
Sbjct: 452 IAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNFHILYDTKRSRLG 511
Query: 415 FKPTDC 420
+ PT C
Sbjct: 512 YAPTKC 517
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 108/358 (30%), Positives = 165/358 (46%), Gaps = 39/358 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +IGTPP AV D +L+WTQC PC P C+ QD PLFDP SST++ LPC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
C S+ + S C+ C Y G + G T+T +G+ A + FGC
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167
Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG-TNGIVSG 265
+ +GIVGLG SL++QM T FSYCL SS + G T ++G
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLAG 224
Query: 266 PGVVSTPLT----------KAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFL 312
STP + +Y++ + I G L ++ +++D+ + ++L
Sbjct: 225 GKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYL 284
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNF 371
G L +++ + QPVA P +LC+ PE+ F GA + + +N+
Sbjct: 285 ADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANY 344
Query: 372 FVKVSEDIVCSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ VC G I G++ Q N V +D++++T+SFKP DC+
Sbjct: 345 LLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 124/433 (28%), Positives = 197/433 (45%), Gaps = 56/433 (12%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF----NQNSSI---------SS 76
+ +LIHRDS SP YN +++ R + L S R ++ +NS++ ++
Sbjct: 36 TTKLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAA 95
Query: 77 SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
A +A ++ +L+ SIG PP + AV DTGS L W QCEPC C+ Q PL++
Sbjct: 96 DDAYEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPC--INCHQQKGPLYN 153
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P SSTY S + + G +C YS +Y D + + G A E + +
Sbjct: 154 PSSSSTYVSCSDFDRTDTTFT--ATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGI 211
Query: 197 VALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-- 252
+ + FGCG NN L +G+ GLG S+IS++ FSYC+ +
Sbjct: 212 TIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKL----GFGFSYCIGNIGDPL 267
Query: 253 ---TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS------------ 297
++ G + G STPL +Y+ T+ IS+G +RL +
Sbjct: 268 YGFHRLTLGNKLKIEG---YSTPLVPRGLYYI-TLVGISIGQERLDIDPIVFQRVDLNGI 323
Query: 298 TPDIVIDSGTTLTFLP-QGYN---SNLLSVMSSMIEAQPVADPTGSLELCY--SFN-SLS 350
+ IVIDSG TL+++P Q YN + S++S + L LCY N L
Sbjct: 324 SSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYI--ARHLSLCYIGKLNQDLQ 381
Query: 351 QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYD 407
P+ T H GAD+ F + +++++C + + G + Q + V YD
Sbjct: 382 GFPDATFHLADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYD 441
Query: 408 IEQQTVSFKPTDC 420
++QQ + F+ +C
Sbjct: 442 LKQQKLYFQRIEC 454
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 177/357 (49%), Gaps = 32/357 (8%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ A Y++ ++IGTPP A+ D G +L+WTQC + C +C+ QD PLFD SST++
Sbjct: 47 SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104
Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
PC ++ C S+ +SC+G SF G + T+ V +G+ A +
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTN 260
FGC + ++G VGLG ++SL +QM T FSYCL P + K + G +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGAS 216
Query: 261 GIVSGP--GVVSTPLTKAKT--------FYVLTIDAISVGNQRLGV--STPDIVIDSGTT 308
++G G +TP K T Y+L ++AI GN + + S I + + T
Sbjct: 217 AKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQSGNTITVSTATP 276
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKL 366
+T L +L ++ + A PV P + +LC+ S S P++ + F+ GA++ +
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTV 336
Query: 367 SRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
S++ D C G V I G++ Q N + +D++++T+SF+P DC+
Sbjct: 337 PVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 176/371 (47%), Gaps = 40/371 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y I + IG+PP + DTGSDL W QC PC C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251
Query: 149 SSSQCASLNQ----KSC--SGVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
+ +C ++ + C +C Y YGD S + G+ A ET T+ STTG++
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----S 252
+ + FGCG N GLF+ + G +S SQ+++ FSYCLV S
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRDSDTSVS 370
Query: 253 TKINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI----- 301
+K+ FG + +++ P + T L K TFY L I +I VG ++L + +
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430
Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
+IDSGTTL++ + ++ + + L CY+ + ++ PE
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPE 490
Query: 355 VTIHF-RGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQ 411
I F GA N+F+++ + DIVC G S + I GN Q NF + YD +
Sbjct: 491 FLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNS 550
Query: 412 TVSFKPTDCTK 422
+ + P C +
Sbjct: 551 RLGYAPMRCAE 561
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 175/369 (47%), Gaps = 38/369 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + + +GTPP + DTGSDL W QC PC C+ Q P +DPK SS+++++
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNIS 251
Query: 148 CSSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAV 197
C +C ++ K C N C Y YGDGS + G+ A ET T+ T T +
Sbjct: 252 CHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELK 311
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----S 252
+ + FGCG N GLF+ + G +S SQM++ FSYCLV + S
Sbjct: 312 HVENVMFGCGHWNRGLFHGAAGLLGLGKGP-LSFASQMQSLYGQSFSYCLVDRNSNASVS 370
Query: 253 TKINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI----- 301
+K+ FG + ++S P + T K TFY + I ++ V ++ L +
Sbjct: 371 SKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSE 430
Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 354
+IDSGTTLT+ + + I+ + + L+ CY+ + + ++P+
Sbjct: 431 GAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPD 490
Query: 355 VTIHFRGADV-KLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
I F V N+F+ + ++VC ++ +++ I GN Q NF + YD+++
Sbjct: 491 FGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSR 550
Query: 413 VSFKPTDCT 421
+ + P C
Sbjct: 551 LGYAPMKCA 559
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 121/450 (26%), Positives = 197/450 (43%), Gaps = 62/450 (13%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
+ELIHR SP+ +T QRL++ + R L L H + I KA +
Sbjct: 3 LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59
Query: 82 -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
A +P + Y + +GTP + + VADTGSDL W C+ C
Sbjct: 60 SGRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119
Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
C ++ +F +SS++K++PC + C SL C Y Y
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
DGS + G A ETVT+ G+ + L + GC + G G++GLG S
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239
Query: 234 QMRTTIAGKFSYCLVPVSSTK-----INFGT----NGIVSGPGVVSTPLTKAKTFYVLTI 284
+ GKFSYCLV S K + FG+ +++ L +FY + +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299
Query: 285 DAISVGNQRLGVSTP--DI------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
IS+G L + + D+ ++DSG++LTFL + Y + ++ S+++ + V
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359
Query: 336 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 391
G LE C++ + VP + HF GA+ + ++ + ++ + C F +
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419
Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ GNIMQ N L +D+ + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 44/354 (12%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
+GTPP + G++LIW P P +C+ Q P F+P S + LP +S C S
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSP--ECFEQAFPYFEPLTFS--RGLPFAS--CGS- 53
Query: 157 NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS 216
K C Y+ SYGD S + G L + T G ++PG+ FGCG N G+F S
Sbjct: 54 -PKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVAFGCGLFNNGVFKS 109
Query: 217 KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVV-S 270
TGI G G G +SL SQ++ G FS+C + S+ ++ + +G G V +
Sbjct: 110 NETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQT 166
Query: 271 TPLTK-AK-----TFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTFLPQG 315
TPL + AK T Y L++ I+VG+ RL V T +IDSGT++T LP
Sbjct: 167 TPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQ 226
Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFV 373
+ ++ I+ V C+S S ++ VP++ +HF GA + L R N+
Sbjct: 227 VYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVF 286
Query: 374 KVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+V +D I+C ++ KG + I GN Q N V YD++ +SF C K
Sbjct: 287 EVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDK 338
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 176/371 (47%), Gaps = 40/371 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y I + IG+PP + DTGSDL W QC PC C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251
Query: 149 SSSQCASLNQ----KSC--SGVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
+ +C ++ + C +C Y YGD S + G+ A ET T+ STTG++
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----S 252
+ + FGCG N GLF+ + G +S SQ+++ FSYCLV S
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRDSDTSVS 370
Query: 253 TKINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI----- 301
+K+ FG + +++ P + T L K TFY L I +I VG ++L + +
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430
Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
+IDSGTTL++ + ++ + + L CY+ + ++ PE
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPE 490
Query: 355 VTIHF-RGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQ 411
I F GA N+F+++ + DIVC G S + I GN Q NF + YD +
Sbjct: 491 FLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNS 550
Query: 412 TVSFKPTDCTK 422
+ + P C +
Sbjct: 551 RLGYAPMRCAE 561
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 39/366 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + + +G+PP + DTGSDL W QC PC C+ Q+ +DPK S++YK++ C+
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQNGAFYDPKASASYKNITCND 227
Query: 151 SQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALP 200
+C ++ C N C Y YGD S + G+ A ET T+ TT + +
Sbjct: 228 QRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVE 287
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----STKI 255
+ FGCG N GLF+ + G +S SQ+++ FSYCLV + S+K+
Sbjct: 288 NMMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 346
Query: 256 NFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI------ 301
FG + ++S P + T K TFY + I +I V + L + T +I
Sbjct: 347 IFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAG 406
Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVT 356
+IDSGTTL++ + + + ++ + + PV L+ C++ + + Q+PE+
Sbjct: 407 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELG 466
Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVS 414
I F GA N F+ ++ED+VC G S I GN Q NF + YD ++ +
Sbjct: 467 IAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLG 526
Query: 415 FKPTDC 420
+ PT C
Sbjct: 527 YAPTKC 532
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/360 (32%), Positives = 179/360 (49%), Gaps = 33/360 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +GTP + DTGSD++W C CP ++ +P +D SST KS+
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDVDASSTAKSVS 143
Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
CS + C+ +NQ+S C SG CQY + YGDGS +NG L + V L TG Q + G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTI 203
Query: 203 TFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINF 257
FGCG+ G + GI+G G + S ISQ+ + + F++CL + I F
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGI-F 262
Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSGTTL 309
+VS P V +TP+ Y + ++AI VGN L +S+ ++IDSGTTL
Sbjct: 263 AIGEVVS-PKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTL 321
Query: 310 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVKLS 367
+LP YN L +++S E + S + + L + P VT F + + +
Sbjct: 322 VYLPDAVYNPLLNEILASHPEL-TLHTVQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVY 380
Query: 368 RSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ +V ED C ++ G+ S+ I G++ +N LV YDIE Q + + +C+
Sbjct: 381 PREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 116/366 (31%), Positives = 187/366 (51%), Gaps = 38/366 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + + +G PP L + DTGSDL W QC+PC C+ Q P+FDP S+++K +PC+
Sbjct: 86 EYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPCN 143
Query: 150 SSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPG 201
++ C + C S C+Y YGD S ++G+LA E++++ S ++ +
Sbjct: 144 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 203
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVS-----STKI 255
+ GCG +N GL G++GLG G +S SQ+R++ G+ FSYCLV + S+ I
Sbjct: 204 MVIGCGHSNKGL-FQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAI 262
Query: 256 NFGTNGIVSG--PGVVSTPLTK----AKTFYVLTIDAISVGN-------QRLGVSTP--- 299
+FG +S + TP + +TFY L I I + +R ++T
Sbjct: 263 SFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSG 322
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 357
+IDSGTTLT+L + + S + I + P ADP L +CY+ + V P ++I
Sbjct: 323 GTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRAAVPFPALSI 381
Query: 358 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
F+ GA++ L + N+F++ + T+ + I GN Q N YD++ + F
Sbjct: 382 VFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGF 441
Query: 416 KPTDCT 421
TDC+
Sbjct: 442 ANTDCS 447
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/353 (33%), Positives = 170/353 (48%), Gaps = 31/353 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+++ + GTP + DTGSD+ W QC PC CY Q P+FDP S+TY ++PC
Sbjct: 119 EFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSAVPCG 177
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
QCA+ K S C Y V YGDGS + G L+ ET++L S A ALPG FGCG
Sbjct: 178 HPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTS----ARALPGFAFGCGET 233
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSG-P 266
N G F G++GLG G +SL SQ + FSYCL +++ + GT SG
Sbjct: 234 NLGDFG-DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASGSD 292
Query: 267 GVVSTPLTKAK---TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL-PQGYN 317
GV T + + + +FY + + +I VG L V + ++DSGT LT+L P+ Y
Sbjct: 293 GVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPPEAYT 352
Query: 318 S--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD-VKLSRSNFFVK 374
+ + + + P DP + CY F + + + F+ +D S F V
Sbjct: 353 ALRDRFKFTMTQYKPAPAYDP---FDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFGVL 409
Query: 375 VSEDIV-----CSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ D C F +++P I GN Q N + YD+ + + F C
Sbjct: 410 IFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 171/346 (49%), Gaps = 34/346 (9%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP S+TY ++PCSS+ CA L
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ C + CQ+ ++Y +G+ + G +++ +TLG + G FGC + G
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
F+ G + LGGG S + Q + + FSYC VP S++ F G+ P
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGVPPQRAALVPTF 249
Query: 269 VSTPL----TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSN 319
VSTPL T + TFY + + +I V + L V + VIDS T ++ + P Y +
Sbjct: 250 VSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASSVIDSATVISRIPPTAYQAL 309
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVS 376
+ S+M +P A P L+ CY F+ + + P + + F GA V L + ++
Sbjct: 310 RAAFRSAMTMYRP-APPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILLQ-- 366
Query: 377 EDIVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F ++ +P + GN+ Q V YD+ + + F+ C
Sbjct: 367 ---GCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 115/357 (32%), Positives = 170/357 (47%), Gaps = 38/357 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E + DT S+L W QC PC + C+ Q PLFDP S +Y LPC+
Sbjct: 126 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 181
Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
SS C +L + S +C Y++SY DGS+S G LA + ++L +
Sbjct: 182 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 236
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKIN 256
G FGCGT+N G F T+G++GLG +SLISQ G FSYCL P+ SS +
Sbjct: 237 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGGVFSYCL-PLKESESSGSLV 294
Query: 257 FGTNGIV---SGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT- 310
G + V S P V +T ++ FY + + I++G Q + S +++DSGT +T
Sbjct: 295 LGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITS 354
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLS 367
+P YN+ +S E P A L+ C++ Q+P + F G +V++
Sbjct: 355 LVPSVYNAVKAEFLSQFAE-YPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVD 413
Query: 368 RSN--FFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S +FV VC + + I GN Q N V +D + F C
Sbjct: 414 SSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/357 (32%), Positives = 170/357 (47%), Gaps = 38/357 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E + DT S+L W QC PC + C+ Q PLFDP S +Y LPC+
Sbjct: 125 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 180
Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
SS C +L + S +C Y++SY DGS+S G LA + ++L +
Sbjct: 181 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 235
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKIN 256
G FGCGT+N G F T+G++GLG +SLISQ G FSYCL P+ SS +
Sbjct: 236 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGGVFSYCL-PLKESESSGSLV 293
Query: 257 FGTNGIV---SGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT- 310
G + V S P V +T ++ FY + + I++G Q + S +++DSGT +T
Sbjct: 294 LGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITS 353
Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLS 367
+P YN+ +S E P A L+ C++ Q+P + F G +V++
Sbjct: 354 LVPSVYNAVKAEFLSQFAE-YPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVD 412
Query: 368 RSN--FFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S +FV VC + + I GN Q N V +D + F C
Sbjct: 413 SSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 114/365 (31%), Positives = 178/365 (48%), Gaps = 42/365 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
Y +I +G+PP E DTGSD++W C PCP +C ++ L+D K SST K+
Sbjct: 78 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 135
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
+ C C+ + Q G C Y V YGDGS S+G+ + +TL TG P
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTKI 255
+ FGCG N G +S GI+G G + S+ISQ+ + K FS+CL ++ I
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 255
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD--IVIDSGT 307
F G V P V +TP+ + Y + + + V L + D +IDSGT
Sbjct: 256 -FAV-GEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGT 313
Query: 308 TLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADV 364
TL +LPQ YNS + + + + T + C+SF N+ P V +HF + +
Sbjct: 314 TLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-L 369
Query: 365 KLSR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
KLS ++ + ED+ C ++ G+T V + G+++ +N LV YD+E + + +
Sbjct: 370 KLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWA 429
Query: 417 PTDCT 421
+C+
Sbjct: 430 DHNCS 434
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 114/365 (31%), Positives = 178/365 (48%), Gaps = 42/365 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
Y +I +G+PP E DTGSD++W C PCP +C ++ L+D K SST K+
Sbjct: 74 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 131
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
+ C C+ + Q G C Y V YGDGS S+G+ + +TL TG P
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTKI 255
+ FGCG N G +S GI+G G + S+ISQ+ + K FS+CL ++ I
Sbjct: 192 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 251
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD--IVIDSGT 307
F G V P V +TP+ + Y + + + V L + D +IDSGT
Sbjct: 252 -FAV-GEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGT 309
Query: 308 TLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADV 364
TL +LPQ YNS + + + + T + C+SF N+ P V +HF + +
Sbjct: 310 TLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-L 365
Query: 365 KLSR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
KLS ++ + ED+ C ++ G+T V + G+++ +N LV YD+E + + +
Sbjct: 366 KLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWA 425
Query: 417 PTDCT 421
+C+
Sbjct: 426 DHNCS 430
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/450 (26%), Positives = 197/450 (43%), Gaps = 62/450 (13%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
+ELIHR SP+ +T QRL++ + R L L H + I KA +
Sbjct: 3 LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59
Query: 82 -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
A +P + Y + +GTP + + VADTGSDL W C+ C
Sbjct: 60 SGRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119
Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
C ++ +F +SS++K++PC + C SL C Y Y
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
DGS + G A ETVT+ G+ + L + GC + G G++GLG S
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239
Query: 234 QMRTTIAGKFSYCLVPVSSTK-----INFGT----NGIVSGPGVVSTPLTKAKTFYVLTI 284
+ GKFSYCLV S K + FG+ +++ L +FY + +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299
Query: 285 DAISVGNQRLGVSTP--DI------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
IS+G L + + D+ ++DSG++LTFL + Y + ++ S+++ + V
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359
Query: 336 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 391
G LE C++ + VP + HF GA+ + ++ + ++ + C F +
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419
Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ GNIMQ N L +D+ + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/426 (27%), Positives = 189/426 (44%), Gaps = 44/426 (10%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNN 88
++L HRD+ P R+ D + R + ++ + I
Sbjct: 33 LKLAHRDT-------LWPNPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGIDYGT 85
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
A Y + +GTP + V DTGS+L W C + +++ +F + S ++K++ C
Sbjct: 86 AQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGC 145
Query: 149 SSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
+ C SL+ C Y Y DGS + G A ET+T+G T G+ L G
Sbjct: 146 FTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRG 205
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----IN 256
+ GC ++ G G++GL D S S + K SYCLV S K +
Sbjct: 206 LLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLI 265
Query: 257 FG----TNGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTP--------DIV 302
FG + + PG +TP LT FY + I IS+G+ L + T +
Sbjct: 266 FGYSSSSTSTKTAPG-RTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTGGGTI 324
Query: 303 IDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVTI 357
+DSGT+LT L + Y + + ++E + V +E C+S FN S++P++T
Sbjct: 325 LDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNE-SKLPQLTF 383
Query: 358 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
H + GA + R ++ V + + C F T + + GNIMQ N+L +D+ T+SF
Sbjct: 384 HLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFDLMASTLSF 443
Query: 416 KPTDCT 421
P+ CT
Sbjct: 444 APSTCT 449
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 160/332 (48%), Gaps = 47/332 (14%)
Query: 18 VVSPIEAQTGGFSVELIHRD-------SPKSPFYNSSETPYQRLRDALTRSLN-RLNHFN 69
+ P Q+GG IH +P+ P S + DA ++LN RL
Sbjct: 28 ALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSFSDVLAWD---DARVKTLNSRLTR-- 82
Query: 70 QNSSISSSKASQADI-------IPNN-------ANYLIRISIGTPPTERLAVADTGSDLI 115
+++ S ++ DI +P N NY +++ G+P + DTGS L
Sbjct: 83 KDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLS 142
Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC--SGVNCQY 168
W QC+PC C++Q PLFDP S TYKSL C+SSQC A+LN C S C Y
Sbjct: 143 WLQCKPCV-VYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVY 201
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
+ SYGD S+S G L+ + +TL + LPG +GCG ++ GLF + GI+GLG
Sbjct: 202 TASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVYGCGQDSDGLFG-RAAGILGLGRNK 256
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTID 285
+S++ Q+ + FSYCL ++G TP+T + Y L +
Sbjct: 257 LSMLGQVSSKFGYAFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLT 316
Query: 286 AISVGNQRLGVSTPDI----VIDSGTTLTFLP 313
AI+VG + LGV+ +IDSGT +T LP
Sbjct: 317 AITVGGRALGVAAAQYRVPTIIDSGTVITRLP 348
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 178/366 (48%), Gaps = 46/366 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A+ D +L+WTQC C S+C+ QD PLF P SST++
Sbjct: 43 NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
PC + C S +CSG C Y + D + G + TET +G+ T +
Sbjct: 97 PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTN 260
FGC + T+G +GLG SL++QM+ T KFSYCL P S+++ G++
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSS 207
Query: 261 GIVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF-- 311
++G P + ++P + +Y+L++DAI GN + + ++ T F
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSL 267
Query: 312 -LPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN---SLSQVPEVTIHFRG-ADV 364
+ Y + +V ++ A QP+A P +LC+ S + P++ F+G A +
Sbjct: 268 LVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAAL 327
Query: 365 KLSRSNFFVKVSE--DIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+ + + + V E D C+ + V + G++ Q + YD++++T+SF
Sbjct: 328 TVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSF 387
Query: 416 KPTDCT 421
+P DC+
Sbjct: 388 EPADCS 393
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 115/348 (33%), Positives = 154/348 (44%), Gaps = 31/348 (8%)
Query: 94 RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
R S P +L + DT SD+ W QC PCP SQCY Q L+DP S + +S CSS C
Sbjct: 172 RRSRLRPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTC 231
Query: 154 ASL-------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
L + S S CQY V Y DGS ++G L + ++L T+ +P FGC
Sbjct: 232 RQLGPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS----QVPKFEFGC 287
Query: 207 GTNNGGLF-NSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--V 263
G F SKT GI+ LG G SL+SQ T FSYC P +S K F G+
Sbjct: 288 SHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHK-GFFVLGVPRR 346
Query: 264 SGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSN 319
S TP+ K Y + ++AI+V QRL V +DS T +T LP
Sbjct: 347 SSSRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVFAAGAALDSRTVITRLPPTAYQA 406
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR--GADVKLSRSNFFVKV 375
L S + A G L+ CY F +S + P +++ F GA V+L S
Sbjct: 407 LRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFG- 465
Query: 376 SEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F G + I G + V Y++ +V F+ C
Sbjct: 466 ----SCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 161/331 (48%), Gaps = 28/331 (8%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
V D+ SD+ W QC PCP C+ Q +DP S T + CSS C +L C+
Sbjct: 32 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANN 91
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
CQY V Y DGS ++G + +TL + G AV+ G FGC G F+++ GI+ L
Sbjct: 92 QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 147
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---AKTF 279
GGG SL+SQ + FSYC +P +++ F T G+ + V TP+ + A TF
Sbjct: 148 GGGPESLLSQTASRYGNAFSYC-IPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATF 206
Query: 280 YVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
Y + + I+VG QRLGV+ P + V+DS T +T LP L + S + A
Sbjct: 207 YGVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSA 265
Query: 335 DPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
P G L+ CY F + ++P++++ F R A + L S C F +
Sbjct: 266 PPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDR 320
Query: 391 VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+P + G++ Q V YD+ V F+ C
Sbjct: 321 MPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 181/366 (49%), Gaps = 38/366 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + + +G PP L + DTGSDL W QC+PC C+ Q P+FDP S+++K +PC+
Sbjct: 170 EYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPCN 227
Query: 150 SSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPG 201
++ C + C S C+Y YGD S ++G+LA E++++ S ++ +
Sbjct: 228 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 287
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVS-----STKI 255
+ GCG +N GLF + G +S SQ+R++ G+ FSYCLV + S+ I
Sbjct: 288 MVIGCGHSNKGLFQGAGGLLGLGQGA-LSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAI 346
Query: 256 NFGTNGIVSG--PGVVSTPLTK----AKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
+FG +S + TP + +TFY L I I + + L +
Sbjct: 347 SFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSG 406
Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 357
+IDSGTTLT+L + + S + I + P ADP L +CY+ + V P ++I
Sbjct: 407 GTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRTAVPFPTLSI 465
Query: 358 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
F+ GA++ L + N+F++ + T+ + I GN Q N YD++ + F
Sbjct: 466 VFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGF 525
Query: 416 KPTDCT 421
TDC+
Sbjct: 526 ANTDCS 531
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 150/342 (43%), Gaps = 51/342 (14%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 167 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 226
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G + LGG
Sbjct: 227 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 282
Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKI-------------NFGTNGIVSGPGVVSTPL 273
G SL+SQ T FSYC+ SS+ F +V P ++
Sbjct: 283 GRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII---- 338
Query: 274 TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMI 328
T Y++ + I VG +RL V V+DS +T L P Y + L+ S+M
Sbjct: 339 ---PTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 395
Query: 329 EAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG 386
VA L+ CY F + VP V++ F G V V D + + +G
Sbjct: 396 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEG 445
Query: 387 ITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VP GN+ Q V YD+ +V F+ C
Sbjct: 446 CLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 176/370 (47%), Gaps = 41/370 (11%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q+ +DPK S+++K++ C
Sbjct: 160 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNEAFYDPKTSASFKNITC 217
Query: 149 SSSQCASLN------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---- 198
+ +C+ ++ Q +C Y YGD S + G+ A ET T+ TT + +
Sbjct: 218 NDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYK 277
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ + + G +S SQ+++ FSYCLV + S+
Sbjct: 278 VENMMFGCGHWNRGLFSGASGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 336
Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVS------TPD- 300
K+ FG + +++ + T K TFY + I +I VG + L + +PD
Sbjct: 337 KLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDG 396
Query: 301 ---IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----V 352
+IDSGTTL++ + Y M E V L+ C++ + + + +
Sbjct: 397 AGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHL 456
Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQ 410
PE+ I F GA N F+ +SED+VC G S I GN Q NF + YD +
Sbjct: 457 PELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKM 516
Query: 411 QTVSFKPTDC 420
+ F PT C
Sbjct: 517 SRLGFTPTKC 526
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 173/382 (45%), Gaps = 67/382 (17%)
Query: 43 FYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA-DIIPNNANYLIRISIGTPP 101
+Y+ + T R A RS+ LN+ +S SSS + ++P Y++ +G P
Sbjct: 8 YYDHNMTSTDRSIWAADRSIAXLNYLLSVTSSSSSLGDISSKLVPEYYEYIMMYYLGVPS 67
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
T +ADTGS+LIW QC PC + CY Q P+FDP S TY+++ S C ++ + SC
Sbjct: 68 TLVYGIADTGSELIWLQCLPC--THCYNQTPPIFDPAESYTYETVSSDSPICNAVRRISC 125
Query: 162 S--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
+C Y +YGDG+ + G L+T+ T V + +TFGC +
Sbjct: 126 REGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKGHQA 185
Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTNGIVSGPGVVSTPLTK 275
G+VGL SL+SQ++ KFSYC+V S +++ FG+ ++ G TPL K
Sbjct: 186 GVVGLNRHPNSLVSQLKVK---KFSYCMVIPDDHGSGSRMYFGSRAVILGG---KTPLLK 239
Query: 276 AK-TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
+ Y +T+ ISVG ++ D + +G +TF
Sbjct: 240 GDYSHYFVTLKGISVGEEK---GRSDELASAGPDITF----------------------- 273
Query: 335 DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF--KGITNSVP 392
HF GAD L++ +V+V + + C T +
Sbjct: 274 -----------------------HFYGADFILTKXTTYVEVEKGLWCLAMLSSNSTRKLS 310
Query: 393 IYGNIMQTNFLVGYDIEQQTVS 414
I GNI Q N+ VGYD+E Q V+
Sbjct: 311 ILGNIQQQNYHVGYDLEAQEVA 332
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 53/112 (47%), Gaps = 3/112 (2%)
Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFS-NGN 181
+QC+ Q P+FDP SSTY ++P + C +C +C Y +SYG GS S G
Sbjct: 332 AQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGT 391
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
++ + V + + FGC G F GIVGL +SL+S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 137/426 (32%), Positives = 196/426 (46%), Gaps = 50/426 (11%)
Query: 31 VELIHRDSPKSPFYNSSETPY--------QRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
+ L HR P + S+ P +R + + R ++ +++ +S++
Sbjct: 425 LRLTHRHGPCAGPSRSASAPSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKS 484
Query: 83 DIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
IP N Y++ +S+GTP + DTGSD+ W QC PC CY Q LF
Sbjct: 485 VTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLF 544
Query: 136 DPKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
DP SS+Y ++PC++ C+ L+ +G C Y VSYGDGS + G ++T+TL
Sbjct: 545 DPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTL--- 601
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVS 251
A A+ G FGCG GLF + G++ LG +SL SQ G FSYCL P
Sbjct: 602 -TDADAVTGFLFGCGHAQAGLF-AGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSP 659
Query: 252 STKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRL-GVSTPDI----VI 303
S+ G S G +T L A TFY++ + I VG Q+L GV V+
Sbjct: 660 SSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFAGGTVV 719
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
D+GT +T LP + L + + + P A TG L+ CY+F V P V++ F
Sbjct: 720 DTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTF 779
Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVS 414
GA +KL F C F TNS I GN+ Q +F V +D +V
Sbjct: 780 SGGATLKLDAPGFLSS-----GCLAFA--TNSGDGDPAILGNVQQRSFAVRFD--GSSVG 830
Query: 415 FKPTDC 420
F P C
Sbjct: 831 FMPHSC 836
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 168/368 (45%), Gaps = 43/368 (11%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200
Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C++ C L+ C C Y V+YGDGS + G+ ATET+T S +P +
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---------VSSTKI 255
GCG +N GLF + ++GLG G +S SQ+ FSYCLV S+ +
Sbjct: 257 GCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTV 315
Query: 256 NFGTNGIVS-GPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST---------PD----- 300
FG+ + G V+ + + VL A G+QR + PD
Sbjct: 316 TFGSGARGALGRRVLHPDGEEPQDGDVLLRAAH--GHQRRRRARPGRGRVRPPPDPSTGR 373
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNSLS--QVPE 354
+++DSG + + + S A P G + CY + L +VP
Sbjct: 374 GGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPT 433
Query: 355 VTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
V++HF GA+ L N+ + V S C F G V I GNI Q F V +D + Q
Sbjct: 434 VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQR 493
Query: 413 VSFKPTDC 420
+ F P C
Sbjct: 494 LGFVPKGC 501
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/342 (31%), Positives = 150/342 (43%), Gaps = 51/342 (14%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G + LGG
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 266
Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKI-------------NFGTNGIVSGPGVVSTPL 273
G SL+SQ T FSYC+ SS+ F +V P ++
Sbjct: 267 GRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII---- 322
Query: 274 TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMI 328
T Y++ + I VG +RL V V+DS +T L P Y + L+ S+M
Sbjct: 323 ---PTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 379
Query: 329 EAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG 386
VA L+ CY F + VP V++ F G V V D + + +G
Sbjct: 380 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEG 429
Query: 387 ITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VP GN+ Q V YD+ +V F+ C
Sbjct: 430 CLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 160/331 (48%), Gaps = 28/331 (8%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
V D+ SD+ W QC PCP C+ Q +DP S + CSS C +L C+
Sbjct: 162 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANN 221
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
CQY V Y DGS ++G + +TL + G AV+ G FGC G F+++ GI+ L
Sbjct: 222 QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 277
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---AKTF 279
GGG SL+SQ + FSYC +P +++ F T G+ + V TP+ + A TF
Sbjct: 278 GGGPESLLSQTASRYGNAFSYC-IPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATF 336
Query: 280 YVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
Y + + I+VG QRLGV+ P + V+DS T +T LP L S S + A
Sbjct: 337 YGVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSA 395
Query: 335 DPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
P G L+ CY F + ++P++++ F R A + L S C F +
Sbjct: 396 PPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDR 450
Query: 391 VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+P + G++ Q V YD+ V F+ C
Sbjct: 451 MPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/333 (35%), Positives = 159/333 (47%), Gaps = 33/333 (9%)
Query: 109 DTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN---QKSCSGV 164
DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC CA L +CS
Sbjct: 4 DTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAA 63
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y VSYGDGS + G +++T+TL +++ A+ G FGCG GLFN G++GL
Sbjct: 64 QCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFGCGHAQSGLFNG-VDGLLGL 118
Query: 225 GGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV-SGPGVVST---PLTKAKT 278
G SL+ Q T G FSYCL P ++ + G G + PG +T P A T
Sbjct: 119 GREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPT 178
Query: 279 FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQGYNSNLLSVMSSMIEA--QP 332
+YV+ + ISVG Q+L V + LP + L S S + + P
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYP 238
Query: 333 VADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFK--GI 387
A G L+ CY+F V P V + F GA V L C F G
Sbjct: 239 TAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAPSGS 293
Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ I GN+ Q +F V I+ +V FKP+ C
Sbjct: 294 DGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 324
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 168/363 (46%), Gaps = 41/363 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y +++ +GTP E VADTGSDL W +C PP + +F PK S ++ +PC
Sbjct: 115 QYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR-------VFRPKTSRSWAPIPC 167
Query: 149 SSSQCA-----SLNQKSCSGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGI 202
SS C +L S C Y Y +GS + G + TE+ T+ G+ L +
Sbjct: 168 SSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDV 227
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
GC +++ G G++ LG IS +Q G FSYCL V T +
Sbjct: 228 VLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCL--VDHLAPRNATGYL 285
Query: 263 VSGPGVV-STPLTKAK-------TFYVLTIDAISVGNQRLGV-------STPDIVIDSGT 307
GPG V TP T+ K FY + +DAI V + L + + +++DSG
Sbjct: 286 AFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGN 345
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFNSLSQ-----VPEVTIHFRG 361
TLT L +++ +S ++ P V+ P E CY++ + +P++ + F G
Sbjct: 346 TLTVLAAPAYKAVVAALSKHLDGVPKVSFPP--FEHCYNWTARRPGAPEIIPKLAVQFAG 403
Query: 362 -ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
A ++ ++ + V + C V +G + + GNIMQ L +D++ V FK ++
Sbjct: 404 SARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSN 463
Query: 420 CTK 422
CT+
Sbjct: 464 CTR 466
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 115/428 (26%), Positives = 189/428 (44%), Gaps = 46/428 (10%)
Query: 30 SVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
+++LI R+S +P TP ++ S R + QNS + +S + +
Sbjct: 2 AMKLIRRESVVRHNPDARVPVTPEDHIQHMTDISSARFKYL-QNSIVKELGSSDFQVDVH 60
Query: 88 NAN----YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
A + + S+G PP + + DTGS L+W QC PC P+F+P +SST+
Sbjct: 61 QAIKTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTF 120
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C C CS C Y Y G+ S G LA E +T + G V I
Sbjct: 121 VECSCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIA 180
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
FGCG NG S+ TGI+GLG SL Q+ KFSYC+ +++ N+G N +V
Sbjct: 181 FGCGHENGEQLESEFTGILGLGAKPTSLAVQL----GSKFSYCIGDLANK--NYGYNQLV 234
Query: 264 SGP--GVVSTP----LTKAKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTT 308
G ++ P Y + ++ ISVG+++L + S +++D+GT
Sbjct: 235 LGEDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTL 294
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-LCYS---FNSLSQVPEVTIHFR-GAD 363
T+L L + + S+++ P + + LCY L P VT HF GA+
Sbjct: 295 YTWLADIAYRELYNEIKSILD--PKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAE 352
Query: 364 VKLSRSNFFVKVSE-----DIVCSVFKGITNSVPIY------GNIMQTNFLVGYDIEQQT 412
+ + ++ F ++E ++ C + T Y G + Q + + YD++++
Sbjct: 353 LAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERN 412
Query: 413 VSFKPTDC 420
+ + DC
Sbjct: 413 IYLQRIDC 420
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 138/396 (34%), Positives = 188/396 (47%), Gaps = 41/396 (10%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADI-----IPNNA-NYLIRISIGTPPTERLAV 107
L+D L R + F+ ++ S K QADI IP A NYL+++++GTP
Sbjct: 3 LQDQL-RVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLSLSLA 61
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSG 163
DTGSD+ WTQCEPC S CY Q FDP+ SS+YK++ CSSS C S + C
Sbjct: 62 LDTGSDITWTQCEPCVGS-CYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVS 120
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
C Y V YGDGS+S G ATE +T+ + + FGCG N G F + G++G
Sbjct: 121 STCIYKVQYGDGSYSVGFFATEKLTISPSD----VISNFLFGCGQQNAGRFG-RIAGLLG 175
Query: 224 LGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLT---KAKTFY 280
LG G +SL Q F+YCL SS+ T G V TPL+ K FY
Sbjct: 176 LGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFY 235
Query: 281 VLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
+ I +SVG L + S +IDSGT +T L S L S +++ P D
Sbjct: 236 GIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTD 295
Query: 336 PTGSLELCYSF--NSLSQVPEVTIHFRGA---DVKLSRSNFF----VKVSEDIVCSVF-- 384
L+ CY F N VP ++ F+G D+K FF V + D VC F
Sbjct: 296 GFSILDTCYDFSGNESISVPRISFFFKGGVEVDIK-----FFGILTVINAWDKVCLAFAP 350
Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++GN Q + V +D+ + + F P+ C
Sbjct: 351 NDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 127/442 (28%), Positives = 213/442 (48%), Gaps = 39/442 (8%)
Query: 10 ILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
+L +CF ++ SP + + GFS LIH SP SP+ N + AL +L+R
Sbjct: 7 LLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALESTLSRH 65
Query: 66 NHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
+ Q ++ + +I + + +L +SIG PPT V DTGSDL W QCEPC
Sbjct: 66 AYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPC- 124
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGSFSNGN 181
CY Q P+++ S +Y + C+ C SL ++ CS +C Y +Y DG+ ++G
Sbjct: 125 -DVCYKQKDPIYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGL 183
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRT--T 238
L+ E V S + FGCG N S + G++GLG G +SL+SQ+
Sbjct: 184 LSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGK 243
Query: 239 IAGKFSYCLVPVSSTK----INFGTNGIVSGPGVVSTPLTKAKTFYV-LTIDAISVGNQR 293
++ F+YC +S+ + FG ++G TP+ A+ +YV L + VG R
Sbjct: 244 VSKSFAYCFGNISNPNAGGFLVFGDATYLNGD---MTPMVIAEFYYVNLLGIGLGVGEPR 300
Query: 294 LGVST------PD----IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
L +++ PD ++IDSG+TL+ F P+ Y +V+ + + ++ T S +
Sbjct: 301 LDINSSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD- 359
Query: 343 CYS---FNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQ 399
C+ L P + ++ + R + F++ +++ C F + I G + Q
Sbjct: 360 CFEGKIERDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSIIGTLAQ 418
Query: 400 TNFLVGYDIEQQTVSFKPT-DC 420
++ GY++E T+S + DC
Sbjct: 419 QSYKFGYNLELSTLSIESNPDC 440
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/289 (33%), Positives = 141/289 (48%), Gaps = 31/289 (10%)
Query: 23 EAQTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRLNHFNQNSSISSSKA 79
+ G +E+ R S K ++ L D RS+ NRL + S+ S+
Sbjct: 71 RQEKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQI 130
Query: 80 S---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+ + NY++ + +G + + DTGSDL W QCEPC CY Q P+F
Sbjct: 131 QIPLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSDLTWVQCEPC--MSCYNQQGPVFK 186
Query: 137 PKMSSTYKSLPCSSSQCASL-----NQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTL 189
P SS+Y+S+PC+SS C SL N +C NC Y+V+YGDGS++NG L E ++
Sbjct: 187 PSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF 246
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
G +++ FGCG NN GLF +G++GLG ++SLISQ +T G FSYCL P
Sbjct: 247 G-----GISVSNFVFGCGKNNKGLFGG-VSGLMGLGRSNLSLISQTNSTFGGVFSYCLPP 300
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAK--------TFYVLTIDAISVG 290
+ G S TP+ + FY+L + I VG
Sbjct: 301 TDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 165/362 (45%), Gaps = 39/362 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +++ +GTP E VADTGS+L W +C PP +F P+ S ++ +P
Sbjct: 90 QYFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGL-------VFRPEASKSWAPVP 142
Query: 148 CSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSN-GNLATETVTLGSTTGQAVALPG 201
CSS C SL S S C Y Y +GS G + T++ T+ G+ L
Sbjct: 143 CSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQD 202
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+ GC + + G G++ LG IS S+ G FSYCL V T
Sbjct: 203 VVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCL--VDHLAPRNATGY 260
Query: 262 IVSGPGVV-STPLTKAK-------TFYVLTIDAISVGNQRLGV-------STPDIVIDSG 306
+ GPG V TP T+ K FY + +DA+ V Q L + + +++DSG
Sbjct: 261 LAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDSG 320
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRG- 361
TTLT L +++ ++ ++ P D E CY++ + ++P++ + F G
Sbjct: 321 TTLTVLATPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLAVQFTGC 379
Query: 362 ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
A ++ ++ + V + C + +G V + GNIMQ L +D++ V F P+ C
Sbjct: 380 ARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439
Query: 421 TK 422
T+
Sbjct: 440 TR 441
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 112/370 (30%), Positives = 176/370 (47%), Gaps = 41/370 (11%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q+ +DPK S+++K++ C
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNGMFYDPKTSASFKNITC 215
Query: 149 SSSQCASLN------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VA 198
+ +C+ ++ Q +C Y YGD S + G+ A ET T+ TT +
Sbjct: 216 NDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK 275
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
+ + FGCG N GLF+ + + G +S SQ+++ FSYCLV + S+
Sbjct: 276 VGNMMFGCGHWNRGLFSGASGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSNTNVSS 334
Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI---- 301
K+ FG + +++ + T K TFY + I +I VG + L + T +I
Sbjct: 335 KLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDG 394
Query: 302 ----VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----V 352
+IDSGTTL++ + Y M E P+ L+ C++ + + + +
Sbjct: 395 DGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHL 454
Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQ 410
PE+ I F G N F+ +SED+VC G S I GN Q NF + YD ++
Sbjct: 455 PELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKR 514
Query: 411 QTVSFKPTDC 420
+ F PT C
Sbjct: 515 SRLGFTPTKC 524
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 160/340 (47%), Gaps = 33/340 (9%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCS 162
V DT SD+ W QC PCP C+ Q L+DP SS+ + PCSS C +L N + +
Sbjct: 159 VIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPA 218
Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTG 220
G CQY V Y DGS S G ++ +TL A A+ FGC G F++KT+G
Sbjct: 219 GDQCQYRVQYPDGSASAGTYISDVLTLNPAK-PASAISEFRFGCSHALLQPGSFSNKTSG 277
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIVSGPGVVSTPLTKAKT 278
I+ LG G SL +Q + T FSYCL PV S G + + V TP+ ++K
Sbjct: 278 IMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAV-TPMLRSKA 336
Query: 279 ---FYVLTIDAISVGNQRL----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
Y++ + AI V +RL V V+DS T +T LP L + + + A
Sbjct: 337 APMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAY 396
Query: 332 PVADPTGSLELCYSFNSLS-------QVPEVTIHFRGAD--VKLSRSNFFVKVSEDIVCS 382
A P L+ CY F+ + ++P++T+ F G + V+L S + C
Sbjct: 397 RAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVLLD-----GCL 451
Query: 383 VFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F T+ I GN+ Q V Y+++ TV F+ C
Sbjct: 452 AFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 175/363 (48%), Gaps = 43/363 (11%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A+ D +L+WTQC C S+C+ QD PLF P SST++
Sbjct: 67 NVANF----TIGTPPQPASAIIDVAGELVWTQCSMC--SRCFKQDLPLFVPNASSTFRPE 120
Query: 147 PCSSSQCASLNQKSCSGVNCQY--SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC + C S+ +CS C Y +++ G + G +AT+T +G+ T + F
Sbjct: 121 PCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGF 174
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNG 261
GC +G +G++GLG SL+SQM T KFSYCL P S +++ G++
Sbjct: 175 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGSSA 231
Query: 262 IVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFL 312
++G P V ++P +Y + +D I G+ + + S +++ + ++FL
Sbjct: 232 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPMSFL 291
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR--GADVKLSR 368
L ++ + A P A P +LC+ LS P++ F+ A + +
Sbjct: 292 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPP 351
Query: 369 SNFFVKVSED--IVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ + V E+ VC + ++ I G++ Q N D+E++T+SF+P
Sbjct: 352 PKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPA 411
Query: 419 DCT 421
DC+
Sbjct: 412 DCS 414
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 115/353 (32%), Positives = 170/353 (48%), Gaps = 27/353 (7%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ +++ + GTP + DTGSDL W QC+PC CY Q P FDP SS+Y ++
Sbjct: 133 DTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPC-SGHCYRQHDPDFDPAKSSSYAAV 191
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC + CA+ C+G C Y V YGDGS + G L+ +T+T S++ G TFGC
Sbjct: 192 PCGTPVCAAAGGM-CNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSS----KFTGFTFGC 246
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVS 264
G N G F + G++GLG G +SL SQ + G FSYCL ++T +N G S
Sbjct: 247 GEKNIGDFG-EVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGATKPTS 305
Query: 265 GPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTFLPQG 315
V T + K +FY + + +I++G L V P + ++DSGT LT+LP
Sbjct: 306 TVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVP-PSVFTKTGTLLDSGTILTYLPPP 364
Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF-VK 374
++L ++ A P L+ CY F + + F +D + +F+ +
Sbjct: 365 AYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGIM 424
Query: 375 VSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ D I C F ++P I GN Q V YD+ Q + F P C
Sbjct: 425 IFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 101/288 (35%), Positives = 147/288 (51%), Gaps = 37/288 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
+SVE++HRD+ ++ Y+R R+A L R + R N++
Sbjct: 74 WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133
Query: 78 KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
++ D + + Y RI +GTP E+ V DTGSD+ W QCEPC +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P S+++ ++ C S+ C+ L+ C C Y SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
T G+T+ VA+ GCG N GLF ++GLG G +S +Q+ T FSYCL
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTGHTFSYCL 305
Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISV 289
V SS + FG + G + TPL K TFY L++ AIS+
Sbjct: 306 VDRESDSSGPLQFGPKSVPVGS--IFTPLEKNPHLPTFYYLSVTAISI 351
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 160/366 (43%), Gaps = 48/366 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + IG + + DTGSDL W QC PC CY Q PLF+P SS++ SLPC+
Sbjct: 144 NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 199
Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
S C +L + CS N C Y + YGDGS+S G L E +TLG T +
Sbjct: 200 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 254
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI--- 255
FGCG NN GLF +G++GL ++SL+SQ + FSYCL SS +
Sbjct: 255 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLG 313
Query: 256 -----NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------V 302
NF +S ++ P + FY L + IS+G L V P + +
Sbjct: 314 GADFSNFKNISPISYTRMIQNP--QMSNFYFLNLTGISIGGVNLNV--PRLSSNEGVLSL 369
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
+DSGT +T L + L C++ +V P V F
Sbjct: 370 LDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFE 429
Query: 361 GAD---VKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
G V + +FVK +C F G + I GN Q N V Y+ ++ V F
Sbjct: 430 GNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGF 489
Query: 416 KPTDCT 421
C+
Sbjct: 490 AGEPCS 495
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 129/447 (28%), Positives = 216/447 (48%), Gaps = 39/447 (8%)
Query: 5 LSCVFILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
++ V +L +CF ++ SP + + GFS LIH SP SP+ N + AL
Sbjct: 15 MASVNLLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALES 73
Query: 61 SLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
+L+R + Q ++ + +I + + +L +SIG PPT V DTGSDL W Q
Sbjct: 74 TLSRHAYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQ 133
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGS 176
CEPC CY Q P+++ S +Y + C+ C SL ++ CS +C Y SY DGS
Sbjct: 134 CEPC--DVCYKQKDPIYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADGS 191
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF-NSKTTGIVGLGGGDISLISQM 235
++G L+ E V S + FGCG N +S+ G++GLG G +SL+SQ+
Sbjct: 192 RTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQL 251
Query: 236 RT--TIAGKFSYCLVPVSSTK----INFGTNGIVSGPGVVSTPLTKAKTFYV-LTIDAIS 288
++ F+YC +S+ + FG ++G TP+ A+ +YV L +
Sbjct: 252 SAIGKVSKSFAYCFGNLSNPNAGGFLVFGDATYLNGD---MTPMVIAEFYYVNLLGIGLG 308
Query: 289 VGNQRLGVST------PD----IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPT 337
V RL +++ PD ++IDSG+TL+ F P+ Y +V+ + + ++ T
Sbjct: 309 VEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLT 368
Query: 338 GSLELCYSFN---SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY 394
S + C+ L P + ++ + R + F++ +++ C F + I
Sbjct: 369 SSPD-CFEGKIGRDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSII 426
Query: 395 GNIMQTNFLVGYDIEQQTVSFKPT-DC 420
G + Q ++ GY++E T+S + DC
Sbjct: 427 GTLAQQSYKFGYNLELSTLSIESNPDC 453
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 170/376 (45%), Gaps = 48/376 (12%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
+ Y + + IG PP L +ADTGSDL+W +C C C + + +F P+ SST+
Sbjct: 80 SGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPA 137
Query: 147 PCSSSQCASLNQ----KSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
C C + + C+ C Y Y DGS ++G A ET +L +++G+
Sbjct: 138 HCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAK 197
Query: 199 LPGITFGCGTNNGGLFNSKTT-----GIVGLGGGDISLISQMRTTIAGKFSYCLV----- 248
L + FGCG G S T+ G++GLG G IS SQ+ KFSYCL+
Sbjct: 198 LKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLS 257
Query: 249 --PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI-- 301
P S I G + + + TPL + TFY + + ++ V +L + P I
Sbjct: 258 PPPTSYLIIGDGGDAVSK---LFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRID-PSIWE 313
Query: 302 ---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
V+DSGTTL FL +++ + I+ + T +LC + + +++
Sbjct: 314 IDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKP 373
Query: 352 ---VPEVTIHFRGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVG 405
+P + F G V + N+F++ E I C + + V + GN+MQ FL
Sbjct: 374 EKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFE 433
Query: 406 YDIEQQTVSFKPTDCT 421
+D ++ + F C
Sbjct: 434 FDRDRSRLGFSRRGCA 449
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 160/366 (43%), Gaps = 48/366 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + IG + + DTGSDL W QC PC CY Q PLF+P SS++ SLPC+
Sbjct: 65 NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 120
Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
S C +L + CS N C Y + YGDGS+S G L E +TLG T +
Sbjct: 121 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 175
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI--- 255
FGCG NN GLF +G++GL ++SL+SQ + FSYCL SS +
Sbjct: 176 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLG 234
Query: 256 -----NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------V 302
NF +S ++ P + FY L + IS+G L V P + +
Sbjct: 235 GADFSNFKNISPISYTRMIQNP--QMSNFYFLNLTGISIGGVNLNV--PRLSSNEGVLSL 290
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
+DSGT +T L + L C++ +V P V F
Sbjct: 291 LDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFE 350
Query: 361 GAD---VKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
G V + +FVK +C F G + I GN Q N V Y+ ++ V F
Sbjct: 351 GNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGF 410
Query: 416 KPTDCT 421
C+
Sbjct: 411 AGEPCS 416
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 133/445 (29%), Positives = 192/445 (43%), Gaps = 69/445 (15%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G +EL H D+ ++ T +R+R A R+ RL S + A I N
Sbjct: 32 GLRLELTHVDAKQN------CTTKERMRRATERTHRRLA-----SMAGGGGEASAPIHWN 80
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y+ IG PP + A+ DTGS+LIWTQC C + C+ QD +DP S T K +
Sbjct: 81 ETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVA 140
Query: 148 CSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ + C ++ C+ G C +YG G+ G L TE T G + + FG
Sbjct: 141 CNDTACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFTFGHGQSSENNV-SLAFG 198
Query: 206 CGTNN----GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
C T + G L +GI+GLG G +SL SQ+ KFSYCL P S N T
Sbjct: 199 CITASRLTPGSL--DGASGIIGLGRGKLSLPSQLGDN---KFSYCLTPYFSDAANTSTLF 253
Query: 262 IVSGPG-------VVSTPLTKA------KTFYVLTIDAISVGNQRLGVSTPDI------- 301
+ + G S P K +FY L + I+VG +L V
Sbjct: 254 VGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAP 313
Query: 302 ------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYS----FNSL 349
+IDSG+ T L L + + A V P G+ L+LC ++
Sbjct: 314 AKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAG 373
Query: 350 SQVPEVTIHF-----RGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----IYGN 396
VP + +HF G DV + N++ V + C V G +++P I GN
Sbjct: 374 KLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGN 433
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDCT 421
MQ + + YD+ Q +SF+P DC+
Sbjct: 434 YMQQDMHLLYDLGQGVLSFQPADCS 458
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 185/379 (48%), Gaps = 53/379 (13%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMS 140
+I+ ++ + + + I P R + DTGSDLIWTQC+ + + P++DP S
Sbjct: 8 NILLSDQGHSLTVGIVQP---RKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGES 64
Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
ST+ LPCS C + K+C+ N C Y YG + + G LA+ET T G+ +AV
Sbjct: 65 STFAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGAR--RAV 121
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN- 256
+L + FGCG + G TGI+GL +SLI+Q++ +FSYCL P + K +
Sbjct: 122 SLR-LGFGCGALSAGSLIG-ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSP 176
Query: 257 --FG---------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST------P 299
FG T + +VS P+ +Y + + IS+G++RL V P
Sbjct: 177 LLFGAMADLSRHKTTRPIQTTAIVSNPVE--TVYYYVPLVGISLGHKRLAVPAASLAMRP 234
Query: 300 D----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS---- 350
D ++DSG+T+ +L + + + ++ PVA+ T ELC+ +
Sbjct: 235 DGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAA 293
Query: 351 ----QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFL 403
QVP + +HF GA + L R N+F + ++C T+ V I GN+ Q N
Sbjct: 294 MEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMH 353
Query: 404 VGYDIEQQTVSFKPTDCTK 422
V +D++ SF PT C +
Sbjct: 354 VLFDVQHHKFSFAPTQCDQ 372
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 111/349 (31%), Positives = 168/349 (48%), Gaps = 25/349 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+++ + G+P + DTGSDL W QC+PC CY Q P+FDP SS+Y +PC
Sbjct: 111 EFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCS-GHCYKQHDPVFDPAKSSSYAVVPCG 169
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
+++CA+ + C+G C Y V YGDGS + G LA ET+T S++ G FGCG
Sbjct: 170 TTECAAAGGE-CNGTTCVYGVEYGDGSSTTGVLARETLTFSSSS----EFTGFIFGCGET 224
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSGPG 267
N G F + G++GLG G +SL SQ G FSYCL ++T ++ G +
Sbjct: 225 NLGDFG-EVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPGYLSIGATPVTGQIP 283
Query: 268 VVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSN 319
V T + +FY + + +I++G L V + ++DSGT LT+LP +
Sbjct: 284 VQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGTLLDSGTILTYLPPPAYTA 343
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF--VKVSE 377
L ++ A P L+ CY F S + + F +D + NFF + +
Sbjct: 344 LRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGAVFNLNFFGIMTFPD 403
Query: 378 D----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
D + C F +P + G+ Q + V YD+ Q + F P C
Sbjct: 404 DTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 102/358 (28%), Positives = 171/358 (47%), Gaps = 42/358 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ ++IGTPP A+ + +WTQC PC +C+ QD PLF+ SSTY+ PC +
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFNRSASSTYRPEPCGT 85
Query: 151 SQCASLNQKSCSGVN-CQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ +CSG C Y V +GD S G T+T +G+ T + FGC
Sbjct: 86 ALCESVPASTCSGDGVCSYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCA 136
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNG-I 262
++ +G+VGLG SL+ QM T FSYCL P + + + G + +
Sbjct: 137 MDSNIKQLLGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKL 193
Query: 263 VSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFLPQGY 316
G +TPL + Y++ ++ I G+ + P+ +++D+ ++FL
Sbjct: 194 AGGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVII-APPPNGSVVLVDTIFGVSFLVDAA 252
Query: 317 NSNLLSVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRG-ADVKLSR 368
+ ++ + A P+A PT +LC+ NS +P+V + F+G A + +
Sbjct: 253 FQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPP 312
Query: 369 SNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
S + VC S +T + I G + Q N +D++++T+SF+P DC+
Sbjct: 313 SKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 370
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 164/380 (43%), Gaps = 60/380 (15%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y + +GTP T+ + V DTGSDL+W QC PC +CY Q +FDP+ SSTY+ +
Sbjct: 82 ESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRV 139
Query: 147 PCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PCSS QC +L C +G C+Y V+YGDGS S G+LAT+ + + T +
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVNN 195
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+T GCG +N GLF+S G + + R ++ P SST G
Sbjct: 196 VTLGCGRDNEGLFDSAA--------GLLGRRAAARYPSRRRWPRRTAPSSSTASATGRRA 247
Query: 262 IVSG----------------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP------ 299
+ P T +A T + A S G TP
Sbjct: 248 QRAARTSCSAARRSRRPRRSPPCCRTRGARACTTWTWPGSA-SAARGSPGSRTPASRWTR 306
Query: 300 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--L 349
+V+DSGT ++ + + L + A + G + CY
Sbjct: 307 RRGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPA 366
Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTN 401
+ P + +HF G AD+ L N+F+ + + C F+ + + + GN+ Q
Sbjct: 367 ASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQG 426
Query: 402 FLVGYDIEQQTVSFKPTDCT 421
F V +D+E++ + F P CT
Sbjct: 427 FRVVFDVEKERIGFAPKGCT 446
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/397 (28%), Positives = 187/397 (47%), Gaps = 45/397 (11%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDL 114
+L HF + + S+ + +P + Y +I +G+PP E DTGSD+
Sbjct: 38 KKLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDI 97
Query: 115 IWTQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYS 169
+W C+PCP PS+ + LFD SST K + C C+ ++Q SC V C Y
Sbjct: 98 LWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYH 157
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVG 223
+ Y D S S GN + +TL TG P + FGCG++ G +S G++G
Sbjct: 158 IVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMG 217
Query: 224 LGGGDISLISQMRTTIAGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV 281
G + S++SQ+ T K FS+CL V I F G+V P V +TP+ + Y
Sbjct: 218 FGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI-FAV-GVVDSPKVKTTPMVPNQMHYN 275
Query: 282 LTIDAISVGNQRLGVSTPDI------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA- 334
+ + + V L + P I ++DSGTTL + P+ +L+ +++ QPV
Sbjct: 276 VMLMGMDVDGTALDLP-PSIMRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKL 331
Query: 335 DPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK----- 385
C+SF+ V P V+ F + VKL+ ++ + +++ C ++
Sbjct: 332 HIVEDTFQCFSFSENVDVAFPPVSFEFEDS-VKLTVYPHDYLFTLEKELYCFGWQAGGLT 390
Query: 386 -GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
G V + G+++ +N LV YD+E + + + +C+
Sbjct: 391 TGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNCS 427
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 92/247 (37%), Positives = 127/247 (51%), Gaps = 40/247 (16%)
Query: 90 NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY+ IS+G +P + DTGSDL W QC+PC S CY Q PLFDP S+TY +
Sbjct: 91 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 148
Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ C++S CA S C Y+++YGDGSFS G LAT+TV LG
Sbjct: 149 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALG---- 204
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
+L G FGCG +N GLF T G++GLG ++SL+SQ + G FSYCL P +++
Sbjct: 205 -GASLGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGGVFSYCL-PAATSG 261
Query: 255 INFGTNGIVSGPGVVS-----TPLTKAKT--------FYVLTIDAISVGNQRL---GVST 298
G+ + G S TP+ + FY L + +VG L G+
Sbjct: 262 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGA 321
Query: 299 PDIVIDS 305
+++IDS
Sbjct: 322 SNVLIDS 328
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 173/386 (44%), Gaps = 59/386 (15%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL---------FDPKMS 140
Y +R +GTP L VADTGSDL W +C P+ SP F P+ S
Sbjct: 96 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRR--PASANSSLSPADSGPGPGRAFRPEDS 153
Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTLGSTT 193
T+ + C+S C SL G C Y Y DGS + G + TE T+ L
Sbjct: 154 RTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRE 213
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----P 249
+ L G+ GC ++ G + G++ LG IS S + G+FSYCLV P
Sbjct: 214 ERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSP 273
Query: 250 VSSTK-INFGTNGIVSGP------------GVVSTPL---TKAKTFYVLTIDAISVGNQR 293
++T + FG N VS P TPL + + FY +++ AISV +
Sbjct: 274 RNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEF 333
Query: 294 LGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELC 343
L + +++DSGT+LT L + +++ +S + P DP E C
Sbjct: 334 LKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMDP---FEYC 390
Query: 344 YSFNSLS------QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYG 395
Y++ S S VP++ +HF G A ++ ++ + + + C + +G + + G
Sbjct: 391 YNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIG 450
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
NI+Q L +DI+ + + F+ + CT
Sbjct: 451 NILQQEHLWEFDIKNRRLKFQRSRCT 476
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 165/352 (46%), Gaps = 49/352 (13%)
Query: 100 PPTERLAVADTGSDLI-WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
PP+ + +A+ D I WTQC+PC +C FDP S TY C S
Sbjct: 83 PPSPQEILAEMNPDSITWTQCKPC--VRCLKDSHRHFDPSASLTYSLGSCIPST------ 134
Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT 218
V Y+++YGD S S GN +T+TL + P FGCG NN G F S
Sbjct: 135 -----VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFPKFQFGCGRNNEGDFGSGA 185
Query: 219 TGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG----------IVSGPG 267
G++GLG G +S +SQ + FSYCL S + FG +V+GPG
Sbjct: 186 DGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLVNGPG 245
Query: 268 VVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLS 322
++ L ++ ++V +D ISVGN+RL V ++P +IDSGT +T LPQ S L +
Sbjct: 246 --TSGLEESGYYFVKLLD-ISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTA 302
Query: 323 VMSSMIEAQPVADPTGS----LELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKV 375
+ P+++ L+ CY+ + V PE+ +HF GADV+L+
Sbjct: 303 AFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGN 362
Query: 376 SEDIVCSVFKG-----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+C F G + + + I GN Q + V YDI+ + F C+K
Sbjct: 363 DASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/364 (30%), Positives = 166/364 (45%), Gaps = 38/364 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I +GTPP DTGSD++W CE CP D L+DPK SST +
Sbjct: 86 YYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVM 145
Query: 148 CSSSQCASLNQ----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C + CA+ K + V C+YSV+YGDGS + G+ T+ + T P
Sbjct: 146 CDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANA 205
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+ FGCG GG N GI+G G + S++SQ+ T AGK F++CL +
Sbjct: 206 SVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQL--TTAGKVKKIFAHCLDTIKGG 263
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
I F +V P V +TPL K Y + + I VG L + +IDS
Sbjct: 264 GI-FSIGDVVQ-PKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDS 321
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 365
GTTLT+LP+ ++ + + + D G L Y + P +T HF D+
Sbjct: 322 GTTLTYLPELVFKEVMLAVFNKHQDITFHDVQGFLCFQYPGSVDDGFPTITFHFE-DDLA 380
Query: 366 LSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKP 417
L +F D+ C F+ G + S + + G+++ +N LV YD+E + + +
Sbjct: 381 LHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTD 440
Query: 418 TDCT 421
+C+
Sbjct: 441 YNCS 444
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 170/371 (45%), Gaps = 53/371 (14%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E + DT S+L W QC PC C+ Q PLFDP S +Y ++PC+
Sbjct: 152 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQDPLFDPSSSPSYAAVPCN 207
Query: 150 SSQCASLN---------QKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
SS C +L +C G + C Y++SY DGS+S G LA + ++L
Sbjct: 208 SSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEV-- 265
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----S 251
+ G FGCGT+N G T+G++GLG +SL+SQ G FSYCL P+ S
Sbjct: 266 ---IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL-PLKESDS 321
Query: 252 STKINFGTNGIV---SGP----GVVSTPLTKAKTFYVLTIDAISVGNQRL-------GVS 297
S + G + V S P +VS PL FY + + I+VG Q + G
Sbjct: 322 SGSLVIGDDSSVYRNSTPIVYASMVSDPLQGP--FYFVNLTGITVGGQEVESSGFSSGGG 379
Query: 298 TPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 354
+IDSGT +T +P YN+ +S E P A L+ C++ L QVP
Sbjct: 380 GGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAE-YPQAPGFSILDTCFNMTGLREVQVPS 438
Query: 355 VTIHFRGA-DVKLSRSN--FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIE 409
+ + F G +V++ +FV VC + + I GN Q N V +D
Sbjct: 439 LKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTS 498
Query: 410 QQTVSFKPTDC 420
V F C
Sbjct: 499 GSQVGFAQETC 509
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 155/332 (46%), Gaps = 29/332 (8%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS---- 162
DT D+ W QC PCP QCY Q PLFDP SST ++ C S C SL CS
Sbjct: 153 DTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSA 212
Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
C+Y + Y D + G T+T+T+ TT A+ FGC G F+ T G +
Sbjct: 213 NAECRYLIEYSDDRATAGTYMTDTLTISGTT----AVRNFRFGCSHAVRGRFSDLTAGTM 268
Query: 223 GLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-INFGTNGIVSGPGV-VSTPLTKAK--- 277
LGGG SL++Q ++ FSYC+ S++ ++ G + V +TPL ++
Sbjct: 269 SLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLSIGGPATTNSTTVFATTPLVRSAINP 328
Query: 278 TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
+ Y++ + I V +RLG+ + V+DS +T LP L + + A P
Sbjct: 329 SLYLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPR 388
Query: 334 ADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS 390
+ TG+L+ CY F L+ +VP V++ F GA V L + C F ++
Sbjct: 389 SGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMIG-----GCLAFTATSSD 443
Query: 391 VPI--YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + GN+ Q V YD+ V F+ C
Sbjct: 444 LALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 119/407 (29%), Positives = 172/407 (42%), Gaps = 87/407 (21%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
SV L HR P SP +S + L R R ++ + S S+ A+ D
Sbjct: 32 SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 91
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
+P + Y+I + +G+P + V DTGSD+ W QCEPCP PS C+ LF
Sbjct: 92 SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 151
Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
DP SSTY + CS++ CA L + +G + CQY V YGDGS + G
Sbjct: 152 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGT--------- 202
Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
G FGC G + KT G++GLGG SL+SQ
Sbjct: 203 ----------GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQ--------------- 237
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-----VID 304
+ K T+Y ++ I+VG ++LG+S P + ++D
Sbjct: 238 -------------------TAARSKKVPTYYFAALEDIAVGGKKLGLS-PSVFAAGSLVD 277
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA 362
SGT +T LP + L S + + A+P G L+ C++F L +V P V + F G
Sbjct: 278 SGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGG 337
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 407
V ++ V C F + + GN+ Q F V YD
Sbjct: 338 AVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 170/370 (45%), Gaps = 42/370 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y ++ +GTP + VADTGSDL W +C P + +F P S ++ +
Sbjct: 109 QYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPI 168
Query: 147 PCSSSQCAS---LNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTL---GSTTGQ 195
PCSS C S + +CS C Y Y D S + G + T+ T+ GS + +
Sbjct: 169 PCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDR 228
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS 251
L + GC T+ G + G++ LG +IS S+ G+FSYCLV P +
Sbjct: 229 KAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 288
Query: 252 STK-INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP--DI---- 301
+T + FG G P TPL + FY +T+DA+SV + L + D+
Sbjct: 289 ATSYLTFGPVGAAHSPS--RTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKKNG 346
Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLSQ---VPE 354
++DSGT+LT L +++ +S + P DP E CY++ + + VP
Sbjct: 347 GAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTMDP---FEYCYNWTATRRPPAVPR 403
Query: 355 VTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
+ + F G A ++ ++ + + + C + +G+ V + GNI+Q L +D+ +
Sbjct: 404 LEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDLANRW 463
Query: 413 VSFKPTDCTK 422
+ F+ + C
Sbjct: 464 LRFQESRCAH 473
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 167/369 (45%), Gaps = 36/369 (9%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQC------YMQDSPLFDPKMSS 141
Y + +GTP + + VADTGSDL W C+ C C ++ +F +SS
Sbjct: 10 GQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSS 69
Query: 142 TYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
++K++PC + C SL C Y Y DGS + G A ETVT+ G
Sbjct: 70 SFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEG 129
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
+ + L + GC + G G++GLG S + GKFSYCLV S K
Sbjct: 130 RKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHK 189
Query: 255 -----INFGT----NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI-- 301
+ FG+ +++ L +FY + + IS+G L + + D+
Sbjct: 190 NVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKG 249
Query: 302 ----VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 354
++DSG++LTFL + Y + ++ S+++ + V G LE C++ + VP
Sbjct: 250 AGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPR 309
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 412
+ HF GA+ + ++ + ++ + C F + + GNIMQ N L +D+ +
Sbjct: 310 LVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKK 369
Query: 413 VSFKPTDCT 421
+ F P+ CT
Sbjct: 370 LGFAPSSCT 378
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 158/348 (45%), Gaps = 37/348 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y + +GTPPT L V DTGSD++W QC PC QCY Q +FDP+ S +Y ++
Sbjct: 138 GSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPC--RQCYAQSGRVFDPRRSRSYAAV 195
Query: 147 PCSSSQC-----ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C + C C Y V+YGDGS + G+LATET+ + +P
Sbjct: 196 RCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF----ARGARVPR 251
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+ GCG +N GLF + ++GLG G +SL +Q +FSYC
Sbjct: 252 VAVGCGHDNEGLFVAAAG-LLGLGRGRLSLPTQTARRYGRRFSYCF-------------- 296
Query: 262 IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ--RLGVSTPD--IVIDSGTTLTFLPQGYN 317
G + + + +V VG + RL ST +++DSGT++T L +
Sbjct: 297 --QGSDLDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVY 354
Query: 318 SNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR-GADVKLSRSNFFV 373
+ + +A SL + CY + +VP V++H GA+V L N+ +
Sbjct: 355 VAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLI 414
Query: 374 KV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V + C G V I GNI Q F V +D ++Q V+ P C
Sbjct: 415 PVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 120/428 (28%), Positives = 188/428 (43%), Gaps = 53/428 (12%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G ++L H D+ + T +R+R A+ S N S+ + A +
Sbjct: 33 GIRMKLTHVDA------KGNYTAPERVRRAIALS----RQINLASTRAEGGGVSAPVHWA 82
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y+ +G PP A+ DTGS LIWTQC C C QD P F+ S ++ +P
Sbjct: 83 TRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVP 142
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C CA C+ C + V+YG G G L T+ T S G +A ++F
Sbjct: 143 CQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTFQS-GGATLAFGCVSFTR 200
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNG 261
L + +G++GLG G +SL SQ T A +FSYCL P +S+ + G
Sbjct: 201 FAAPDVLHGA--SGLIGLGRGRLSLASQ---TGAKRFSYCLTPYFHNNGASSHLFVGAAA 255
Query: 262 IVSGPG--VVSTPLTKA------KTFYVLTIDAISVGNQRL--------------GVSTP 299
+SG G V+S ++ TFY L + I+VG +L G
Sbjct: 256 SLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEG 315
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGSLELCYSFNSLSQ-VPEV 355
++IDSG+ T L + L+ ++ + P + G + LC + L + VP +
Sbjct: 316 GVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTL 375
Query: 356 TIHFR-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+HF GAD+ L N++ + + C ++ +G S I GN Q N + +D+ +
Sbjct: 376 VLHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQS--IIGNFQQQNMHILFDVGGGRL 433
Query: 414 SFKPTDCT 421
SF+ DC+
Sbjct: 434 SFQNADCS 441
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 120/427 (28%), Positives = 192/427 (44%), Gaps = 38/427 (8%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN-QNSSISSSKASQADIIP 86
GFS+E++HR S +SPFY + T Y+R+ + S R ++ SS S +A + I
Sbjct: 27 GFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSGFSPEAFRLRISQ 86
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
++ YL+++ IG+P V DTGS L WTQCEPC ++ + Q P+F+ S TY+ L
Sbjct: 87 DDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPC--TRRFRQLPPIFNSTASRTYRDL 144
Query: 147 PCSSSQCA-SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
PC C + N C C Y ++Y GS + G A + + + + +P FG
Sbjct: 145 PCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDIL----QSAENDRIP-FYFG 199
Query: 206 CGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL------VPVSSTK- 254
C +N + K GI+GL +SL+ QM +FSYCL P +T
Sbjct: 200 CSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSL 259
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTF--YVLTIDAISVGNQRLGVS------TPD----IV 302
+ FG + S +STP + Y L + +SV R+ + PD +
Sbjct: 260 LRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTI 319
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSF--NSLSQVPEVTIH 358
IDSGT +T++ Q +++ + + L +CY ++ P + H
Sbjct: 320 IDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQQGHTFHNYPSMAFH 379
Query: 359 FRGADVKLSRSNFFVKVSED-IVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
F+GAD + ++ V + C + I+ I G + Q N YD + + F
Sbjct: 380 FQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFT 439
Query: 417 PTDCTKQ 423
P +C
Sbjct: 440 PENCQDH 446
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 174/379 (45%), Gaps = 49/379 (12%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + I +G+PP L VADTGSDL W +C C + F + S+T+
Sbjct: 80 SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTH 139
Query: 148 CSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C SS C + Q + + N C+Y Y DGS ++G + ET TL +++G+ + L
Sbjct: 140 CFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLK 199
Query: 201 GITFGCGTNNGGL------FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV------ 248
I FGCG + G FN +G++GLG G IS SQ+ FSYCL+
Sbjct: 200 SIAFGCGFHASGPSLIGSSFNG-ASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSP 258
Query: 249 -PVSSTKINFGTNGIVSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI-- 301
P S I + ++S TPL +A TFY ++I + V +L + P +
Sbjct: 259 PPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHID-PSVWS 317
Query: 302 ---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-----LELCYSFN 347
VIDSGTTLTFL + +LS ++ P P G+ +LC +
Sbjct: 318 LDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKL-PSPTPGGASTRSGFDLCVNVT 376
Query: 348 SLS--QVPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI---TNSVPIYGNIMQTN 401
+S + P +++ G + N+F+ +SE I C + + + + GN+MQ
Sbjct: 377 GVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQG 436
Query: 402 FLVGYDIEQQTVSFKPTDC 420
FL+ +D + + F C
Sbjct: 437 FLLEFDRGKSRLGFSRRGC 455
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 154/335 (45%), Gaps = 33/335 (9%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
V DT SD+ W QC PCP CY Q L+DP SS+ C+S C L + C+
Sbjct: 172 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 231
Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
N CQY V Y DG+ + G ++ +T+ T A+ FGC G F S GI
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 287
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---- 275
+ LGGG SL+SQ T FS+C P T+ F T G+ V+ V TP+ K
Sbjct: 288 MALGGGPESLVSQTAATYGRVFSHCFPP--PTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 345
Query: 276 AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEA 330
TFY++ ++AI+V QR+ V +DS T +T LP Y + + M
Sbjct: 346 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 405
Query: 331 QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KG 386
QP A P G L+ CY + +P +T+ F + A V+L S + C F G
Sbjct: 406 QP-APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAG 459
Query: 387 ITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ VP I GNI V Y+I V F+ C
Sbjct: 460 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 154/335 (45%), Gaps = 33/335 (9%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
V DT SD+ W QC PCP CY Q L+DP SS+ C+S C L + C+
Sbjct: 147 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 206
Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
N CQY V Y DG+ + G ++ +T+ T A+ FGC G F S GI
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 262
Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---- 275
+ LGGG SL+SQ T FS+C P T+ F T G+ V+ V TP+ K
Sbjct: 263 MALGGGPESLVSQTAATYGRVFSHCFPP--PTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 320
Query: 276 AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEA 330
TFY++ ++AI+V QR+ V +DS T +T LP Y + + M
Sbjct: 321 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 380
Query: 331 QPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KG 386
QP A P G L+ CY + +P +T+ F + A V+L S + C F G
Sbjct: 381 QP-APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAG 434
Query: 387 ITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ VP I GNI V Y+I V F+ C
Sbjct: 435 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 166/366 (45%), Gaps = 42/366 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I +GTPP DTGSD++W CE CP D +DPK SS+ ++
Sbjct: 84 YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVS 143
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ G V C+YSV YGDGS + G T+ + TG PG
Sbjct: 144 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNA 203
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+TFGCG GG N GI+G G + S++SQ+ AGK F++CL +
Sbjct: 204 TVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAA--AGKVKKIFAHCLDTIKGG 261
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
I F +V P V +TPL Y + + +I VG L + +IDS
Sbjct: 262 GI-FAIGNVVQ-PKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDS 319
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGAD 363
GTTLT+LP+ +++ + + + Q + +C+ + + P +T HF D
Sbjct: 320 GTTLTYLPELVFKEVMAAIFN--KHQDIVFHNVQDFMCFQYPGSVDDGFPTITFHFE-DD 376
Query: 364 VKLS--RSNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSF 415
+ L +F D+ C F+ G S + + G+++ +N LV YD+E Q + +
Sbjct: 377 LALHVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGW 436
Query: 416 KPTDCT 421
+C+
Sbjct: 437 TDYNCS 442
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 187/394 (47%), Gaps = 43/394 (10%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
L HF + + S+ + +P + Y +I +G+PP E DTGSD++W
Sbjct: 40 LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99
Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
C+PCP P++ + LFD SST K + C C+ ++Q SC + C Y +
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
Y D S S+G + +TL TG P + FGCG++ G +S G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219
Query: 226 GGDISLISQMRTTIAGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV-- 281
+ S++SQ+ T K FS+CL V I F G+V P V +TP+ + Y
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI-FAV-GVVDSPKVKTTPMVPNQMHYNVM 277
Query: 282 ---LTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPT 337
+ +D S+ R V ++DSGTTL + P+ +L+ +++ QPV
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIV 334
Query: 338 GSLELCYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT--- 388
C+SF N P V+ F + VKL+ ++ + E++ C ++ G+T
Sbjct: 335 EETFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDE 393
Query: 389 -NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ V + G+++ +N LV YD++ + + + +C+
Sbjct: 394 RSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 167/365 (45%), Gaps = 40/365 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y + +GTPP DTGSD++W C+ CP D L+DPK SST ++
Sbjct: 88 YYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVM 147
Query: 148 CSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA CS V C+YSV+YGDGS + G+ + + TG P
Sbjct: 148 CDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANA 207
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+ FGCG GG S + GI+G G + S++SQ+ T AGK F++CL +
Sbjct: 208 SVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLAT--AGKVKKIFAHCLDTIKGG 265
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
I F +V P V +TPL K Y + + I VG L + DI +ID
Sbjct: 266 GI-FAIGDVVQ-PKVKTTPLVADKPHYNVNLKTIDVGGTTLELPA-DIFKPGEKRGTIID 322
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADV 364
SGTTLT+LP+ ++ + + + D L YS + P +T HF D+
Sbjct: 323 SGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEYSGSVDDGFPTLTFHFE-DDL 381
Query: 365 KLSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFK 416
L +F D+ C F+ G S + + G+++ +N LV YD+E + + +
Sbjct: 382 ALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWT 441
Query: 417 PTDCT 421
+C+
Sbjct: 442 DYNCS 446
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 176/373 (47%), Gaps = 41/373 (10%)
Query: 85 IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMS 140
IP + Y +I IGTP DTGSD++W C+ CP D L+DP S
Sbjct: 82 IPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTAS 141
Query: 141 STYKSLPCSSSQCASLNQK----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
++ K++ C CA+ SC+ + CQYS++YGDGS + G + + +G
Sbjct: 142 ASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGD 201
Query: 196 A---VALPGITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSY 245
+A +TFGCG GG S GI+G G + S++SQ+ T AGK FS+
Sbjct: 202 GQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQL--TSAGKVTKIFSH 259
Query: 246 CLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL---------GV 296
CL V+ I F +V P V +TPL Y + + I VG L G
Sbjct: 260 CLDTVNGGGI-FAIGNVVQ-PKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGG 317
Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVT 356
+ +IDSGTTL +LP+ +LS + S + + L YS + + PEVT
Sbjct: 318 GSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVT 377
Query: 357 IHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDI 408
HF G D+ L ++ + +ED+ C F+ G+ + + G++ +N LV YD+
Sbjct: 378 FHFDG-DLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDL 436
Query: 409 EQQTVSFKPTDCT 421
E Q + + +C+
Sbjct: 437 ENQVIGWTNYNCS 449
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 112/368 (30%), Positives = 167/368 (45%), Gaps = 48/368 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y+ IG PP A+ DTGSDL+WTQC C C Q P ++ SST+ +PC+
Sbjct: 89 QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148
Query: 150 SSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+ CA+ + C C YG G + G L TE S T + + FGC
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAGVVA-GTLGTEAFAFQSGTAE------LAFGC 201
Query: 207 GT----NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINF 257
T G L + +G++GLG G +SL+SQ T A KFSYCL P ++ +
Sbjct: 202 VTFTRIVQGALHGA--SGLIGLGRGRLSLVSQ---TGATKFSYCLTPYFHNNGATGHLFV 256
Query: 258 GTNGIVSGPG-VVSTPLTKAKT---FYVLTIDAISVGNQRL--------------GVSTP 299
G + + G G V++T K FY L + ++VG RL G+ +
Sbjct: 257 GASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSG 316
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLSQ-VPEVT 356
++IDSG+ T L L S +++ + VA P + + LC + + + VP V
Sbjct: 317 GVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRVVPAVV 376
Query: 357 IHFR-GADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
HFR GAD+ + +++ V + + G + GN Q N V YD+
Sbjct: 377 FHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDF 436
Query: 414 SFKPTDCT 421
SF+P DC+
Sbjct: 437 SFQPADCS 444
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/403 (26%), Positives = 188/403 (46%), Gaps = 51/403 (12%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
L R L+R + + +++ ++P + A Y+ +IGTPP + D +L
Sbjct: 26 LRRGLDRQGMRGRILADATAAPPGGAVVPLHWSGACYVANFTIGTPPQAVSGIVDLSGEL 85
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
+WTQC C S C+ Q+ P+FDP S+TY++ C S C S+ ++CSG C Y
Sbjct: 86 VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
+GD + G +T+ + +G+ G+ + FGC + G + +G VGLG
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSGPGVVS--TPL---------- 273
SL+ Q T FSYCL P K + G + ++G G + TPL
Sbjct: 197 WSLVGQSNVT---AFSYCLAPHGPGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSD 253
Query: 274 TKAKTFYVLTIDAISVGNQRLGVSTPD------IVIDSGTTLTFLPQGYNSNLLSVMSSM 327
+ +Y + ++ I G+ + ++ + +++ L++LP L V+++
Sbjct: 254 DGSDPYYTVQLEGIKAGDVAVAAASSGGGAITILQLETFRPLSYLPDAAYQALEKVVTAA 313
Query: 328 IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG--------ADVKLSRSNFFVKVSEDI 379
+ + +A+P +LC+ ++S VP++ F+G + L N V I
Sbjct: 314 LGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSI 373
Query: 380 VCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ S + V I G+++Q N +D+E++T+SF+P DC+
Sbjct: 374 LSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPADCS 416
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/408 (28%), Positives = 179/408 (43%), Gaps = 36/408 (8%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
V L+HR P +P ++ P + + RS RL++ +S + +
Sbjct: 56 VPLLHRHGPCAPSLSTDTPP--SMSEMFRRSHARLSYIVSGKKVSVPAHLGTSV--KSLE 111
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +S GTP ++ V DTGSDL W QC+PC QC Q PLFDP SSTY ++PC+S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171
Query: 151 SQCASLNQKS----CS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
+C L + CS G C +++SY DG+ + G + +TL + FG
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFG 227
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
CG + L + + SL +Q FSYCL P ++K F G
Sbjct: 228 CGHSKSSLPGLFDGLLGLGRLSE-SLGAQYGGGGG--FSYCL-PAVNSKPGFLAFGAGRN 283
Query: 266 P-GVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYN 317
P G V TP+ + TF +T+ I+VG ++L + + +++DSGT +T L
Sbjct: 284 PSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIVDSGTVVTVLQSTVY 343
Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVK 374
L + ++A + G L+ CY VP++ + F GA + L N +
Sbjct: 344 RALRAAFREAMKAYRLVH--GDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILV 401
Query: 375 VSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F G + + GN+ Q F V +D F+ C
Sbjct: 402 NG----CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 191/403 (47%), Gaps = 51/403 (12%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
L R L++ + + +++ ++P + A+Y+ +IGTPP + D +L
Sbjct: 26 LRRGLDQQGMRGRILADATAAPPGGAVVPLHWSGAHYVANFTIGTPPQAVSGIVDLSGEL 85
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
+WTQC C S C+ Q+ P+FDP S+TY++ C S C S+ ++CSG C Y
Sbjct: 86 VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
+GD + G +T+ + +G+ G+ + FGC + G + +G VGLG
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196
Query: 229 ISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSGPGVVS--TPL---------- 273
SL+ Q T FSYCL P + + G + ++G G + TPL
Sbjct: 197 WSLVGQSNVT---AFSYCLALHGPGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSD 253
Query: 274 TKAKTFYVLTIDAISVGNQRLGVSTPD------IVIDSGTTLTFLPQGYNSNLLSVMSSM 327
+ +Y + ++ I G+ + ++ + +++ L++LP L V+++
Sbjct: 254 DGSDPYYTVQLEGIKAGDVAVAAASSGGGAITVLQLETFRPLSYLPDAAYQALEKVVTAA 313
Query: 328 IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFV--------KVSEDI 379
+ + +A+P +LC+ ++S VP++ F+G ++ + ++ V I
Sbjct: 314 LGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSI 373
Query: 380 VCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ S + V I G+++Q N +D+E++T+SF+P DC+
Sbjct: 374 LSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPADCS 416
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 176/367 (47%), Gaps = 47/367 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A+ D +L+WTQC C S+C+ QD PLF P SST++
Sbjct: 43 NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
PC + C S +CSG C Y + D + G + TET +G+ T +
Sbjct: 97 PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTN 260
FGC + T+G +GLG SL++QM+ T KFSYCL P S+++ G++
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSS 207
Query: 261 GIVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF-- 311
++G P + ++P + +Y+L++DAI GN + + ++ T F
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSL 267
Query: 312 -LPQGYNSNLLSVMSSM--IEAQPVADPTGSLELCYSFN---SLSQVPEVTIHFRGADVK 365
+ Y + +V ++ A P+A P +LC+ S + P++ F+G
Sbjct: 268 LVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAA 327
Query: 366 LS--RSNFFVKVSE--DIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVS 414
L+ + + + V E D C+ + V + G++ Q N YD++++T+S
Sbjct: 328 LTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLS 387
Query: 415 FKPTDCT 421
F+P DC+
Sbjct: 388 FEPADCS 394
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 165/372 (44%), Gaps = 46/372 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--LFDPKMSSTYKSLP 147
Y +R +GTP + VADTGSDL W +C + SP +F S ++ +
Sbjct: 100 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIA 159
Query: 148 CSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG-----------S 191
CSS C S L S C Y Y DGS + G + T++ T+ S
Sbjct: 160 CSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDS 219
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
+ G+ L G+ GC G + G++ LG +IS S+ G+FSYCL V
Sbjct: 220 SGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCL--VD 277
Query: 252 STKINFGTNGIVSGPGVVS----TPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI--- 301
T+ + GPG + TPL + FY +T+DA+ V + L + D+
Sbjct: 278 HLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPA-DVWDV 336
Query: 302 ------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS--Q 351
++DSGT+LT L +++ +S + P DP E CY++ +
Sbjct: 337 DRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTMDP---FEYCYNWTDAGALE 393
Query: 352 VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIE 409
+P++ +HF G A ++ ++ + + + C V +G V + GNI+Q L +D+
Sbjct: 394 IPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWEFDLR 453
Query: 410 QQTVSFKPTDCT 421
+ + FK T C
Sbjct: 454 DRWLRFKHTRCA 465
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 184/388 (47%), Gaps = 43/388 (11%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
L HF + + S+ + +P + Y +I +G+PP E DTGSD++W
Sbjct: 40 LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99
Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
C+PCP P++ + LFD SST K + C C+ ++Q SC + C Y +
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
Y D S S+G + +TL TG P + FGCG++ G +S G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219
Query: 226 GGDISLISQMRTTIAGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV-- 281
+ S++SQ+ T K FS+CL V I F G+V P V +TP+ + Y
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI-FAV-GVVDSPKVKTTPMVPNQMHYNVM 277
Query: 282 ---LTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPT 337
+ +D S+ R V ++DSGTTL + P+ +L+ +++ QPV
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIV 334
Query: 338 GSLELCYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT--- 388
C+SF N P V+ F + VKL+ ++ + E++ C ++ G+T
Sbjct: 335 EETFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDE 393
Query: 389 -NSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+ V + G+++ +N LV YD++ + + +
Sbjct: 394 RSEVILLGDLVLSNKLVVYDLDNEVIGW 421
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 158/359 (44%), Gaps = 35/359 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ +++ + G+P DTGSD+ W QC PC CY Q P+FDP S+TY ++
Sbjct: 157 DTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCS-GHCYKQHDPVFDPTKSATYSAV 215
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC QCA+ K + C Y V+YGDGS + G L+ ET++L ST LPG FGC
Sbjct: 216 PCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGC 271
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G N G F + G +SL SQ T FSYCL P T + T G +
Sbjct: 272 GQTNLGEFGGVDGLVGLGRGA-LSLPSQAAATFGATFSYCL-PSYDTTHGYLTMGSTTPA 329
Query: 267 G------VVSTPLTKAKTF---YVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
V T + + + + Y + + +I +G L V + + DSGT LT+L
Sbjct: 330 ASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLFDSGTILTYL 389
Query: 313 -PQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKL 366
P+ Y S + + + P DP + CY F + + P V F GA L
Sbjct: 390 PPEAYASLRDRFKFTMTQYKPAPAYDP---FDTCYDFTGHNAIFMPAVAFKFSDGAVFDL 446
Query: 367 SRSNFFV---KVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S + + C F +++P I GN Q V YD+ + + F C
Sbjct: 447 SPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 158/322 (49%), Gaps = 46/322 (14%)
Query: 139 MSSTYKSLPCSSSQC---ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
MSST+K++ C C + ++ +C+ N C Y SYGD S + G++ +T T S
Sbjct: 1 MSSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPN 60
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
G VA+ + FGCG N GLF S +GI G G G SL SQ++ G+FSYCL V+ +
Sbjct: 61 GVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLK---VGRFSYCLTLVTES 117
Query: 254 KINF----------GTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS--- 297
K + G +GP STP+ TFY L+++ I+VG RL
Sbjct: 118 KSSVVILGTPPDPDGLRAHTTGP-FQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSV 176
Query: 298 -------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGSL--ELCYS 345
+ VIDSGT+LT LP+ + ++ + AQ P D T + LC+
Sbjct: 177 FALKKDGSGGTVIDSGTSLTTLPEA----VFELLQEELVAQFPLPRYDNTPEVGDRLCFR 232
Query: 346 FNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFKGITN-SVPIYGNIMQT 400
+ VP++ +H GAD+ L R N+FV+ + ++C G + ++ + GN Q
Sbjct: 233 RPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQ 292
Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
N V YD+E + F P C K
Sbjct: 293 NMHVVYDVENNKLLFAPAQCDK 314
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 168/359 (46%), Gaps = 37/359 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+NY+I++ GTPP V DTGS++ W C PC S C + P F+P SSTY L C
Sbjct: 122 SNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPC--SGCSSKQQP-FEPSKSSTYNYLTC 178
Query: 149 SSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+S QC L KS + VNC + YGD S + L++ET+++GS + FGC
Sbjct: 179 ASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQ-----QVENFVFGC 233
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN----FGTNGI 262
GL +T +VG G +S +SQ T FSYCL + S+ G +
Sbjct: 234 SNAARGLIQ-RTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLLLGKEAL 292
Query: 263 VSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTL 309
S G+ TPL ++ +FY + ++ ISVG + + + + +IDSGT +
Sbjct: 293 -SAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVI 351
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLS 367
T L + + + S + +A PT + CY+ S + P +T+HF D+ L
Sbjct: 352 TRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDVEFPLITLHFDDNLDLTLP 411
Query: 368 RSNFFVKVSED--IVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N ++D ++C F G + + +GN Q + +D+ + + +C
Sbjct: 412 LDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 158/336 (47%), Gaps = 52/336 (15%)
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLA 183
+C + +P F P SST+ LPC+SS C L +C+ C Y YG G F+ G LA
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLA 145
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
TET+ +G + PG+ FGC T NG + ++GIVGLG +SL+SQ+ G+F
Sbjct: 146 TETLHVGGAS-----FPGVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQVGV---GRF 195
Query: 244 SYCL---VPVSSTKINFGTNGIVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV 296
SYCL + I FG+ V+G P ++ P + ++Y + + I+VG L V
Sbjct: 196 SYCLRSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPV 255
Query: 297 STPDI--------------VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-- 339
++ ++DSGTTLT+L +GY + +S M A G+
Sbjct: 256 TSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRF 315
Query: 340 -LELCYSFNSLSQ-----VPEVTIHFRGADVKLSRSNFFVKVSE-------DIVCSVFKG 386
+LC+ N+ VP + + F G R +V V E + C +
Sbjct: 316 GFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLP 375
Query: 387 ITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ S+ I GN+MQ + V YD++ SF P DC
Sbjct: 376 ASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 109/402 (27%), Positives = 192/402 (47%), Gaps = 51/402 (12%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN---YLIRISIGTPPTERLAVADTG 111
D L R L + + ++ A A ++P + Y+ +IGTPP A+ D
Sbjct: 25 HDDLRRGLEQATRGRLLAD--ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVA 82
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS 171
+L+WTQC C +C+ QD P+F P SST+K PC ++ C S+ +SCSG C Y
Sbjct: 83 GELVWTQCSAC--RRCFKQDLPVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSYK-- 138
Query: 172 YGDGSFSNGN----LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
G + GN AT+T +G+ T + + FGC + +G +GLG
Sbjct: 139 -GPPTQLRGNTSGFAATDTFAIGTATVR------LAFGCVVASDIDTMDGPSGFIGLGRT 191
Query: 228 DISLISQMRTTIAGKFSYCLVPVS---STKINFGTNGIVSG-------PGVVSTPLTKAK 277
SL++QM+ T +FSYCL P + S+++ G++ ++G P + ++P +
Sbjct: 192 PWSLVAQMKLT---RFSYCLSPRNTGKSSRLFLGSSAKLAGGESTSTAPFIKTSPDDDSH 248
Query: 278 TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF---LPQGYNSNLLSVMSSM--IEAQP 332
+Y+L++DAI GN + + ++ T F + Y + +V ++ A P
Sbjct: 249 HYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPP 308
Query: 333 VADPTGSLELCYSFN---SLSQVPEVTIHFRG-ADVKLSRSNFFVKVSE--DIVCSVFKG 386
+A P +LC+ S + P++ F+G A + + + + + V E D C+
Sbjct: 309 MATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS 368
Query: 387 IT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ V + G++ Q + YD++++T+SF+P DC+
Sbjct: 369 MAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 410
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 167/356 (46%), Gaps = 36/356 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A D +L+WTQC C C+ QD P+F P SST+K
Sbjct: 54 NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 107
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC + C S+ C+ C Y G G + G +AT+T +G+ A + FGC
Sbjct: 108 PCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 162
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNGIV 263
+ +G +GLG SL++QM+ T +FSYCL P + +++ G + +
Sbjct: 163 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASAKL 219
Query: 264 SG-----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT----FLPQ 314
+G P V ++P +Y + ++ I G+ + + + T + +
Sbjct: 220 AGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDS 279
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFF 372
Y +VM+S + A P A P G+ E+C+ +S P++ F+ GA + + +N+
Sbjct: 280 VYQEFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYL 338
Query: 373 VKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
V D VC I + + I G+ Q N + +D+++ +SF+P DC+
Sbjct: 339 FDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 394
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 109/369 (29%), Positives = 174/369 (47%), Gaps = 48/369 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IGTP DTGSD++W C+ CP + ++DP+ S + + +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C C + SC+ + C+YS+SYGDGS + G T+ + +G P
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
++FGCG GG S GI+G G + S++SQ+ AGK F++CL V+
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAA--AGKVRKMFAHCLDTVNGG 267
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDS 305
I F +V P V +TPL Y + + I VG LG+ T +IDS
Sbjct: 268 GI-FAIGNVVQ-PKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDS 325
Query: 306 GTTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR 360
GTTL ++P+G L +++ I Q + D + C+ ++ PEVT HF
Sbjct: 326 GTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFE 380
Query: 361 GADVKL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQT 412
G DV L S ++ + +++ C F+ G+ + + G+++ +N LV YD+E Q
Sbjct: 381 G-DVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQA 439
Query: 413 VSFKPTDCT 421
+ + +C+
Sbjct: 440 IGWADYNCS 448
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 177/363 (48%), Gaps = 46/363 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +IGTPP A+ D +L+WTQC C +C+ QD P+F P SST+K PC +
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCSAC--RRCFKQDLPVFVPNASSTFKPEPCGT 102
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGN----LATETVTLGSTTGQAVALPGITFGC 206
+ C S+ +SCSG C Y G + GN AT+T +G+ T + + FGC
Sbjct: 103 AVCESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFAIGTATVR------LAFGC 153
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS---STKINFGTNGIV 263
+ +G +GLG SL++QM+ T +FSYCL P + S+++ G++ +
Sbjct: 154 VVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLFLGSSAKL 210
Query: 264 SG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF---LP 313
+G P + ++P +Y+L++DAI GN + + ++ T F +
Sbjct: 211 AGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSLLVD 270
Query: 314 QGYNSNLLSVMSSM--IEAQPVADPTGSLELCYSFN---SLSQVPEVTIHFRG-ADVKLS 367
Y + +V ++ A P+A P +LC+ S + P++ F+G A + +
Sbjct: 271 SAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVP 330
Query: 368 RSNFFVKVSE--DIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ + + V E D C+ + V + G++ Q + YD++++T+SF+P
Sbjct: 331 PAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPA 390
Query: 419 DCT 421
DC+
Sbjct: 391 DCS 393
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 175/367 (47%), Gaps = 44/367 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP DTGSD++W C+ CP + L+DPK SST +
Sbjct: 89 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 148
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ L + + C+YSV+YGDGS + G ++ + +G P
Sbjct: 149 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 208
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+TFGCG+ GG N GI+G G + S++SQ+ + AGK F++CL ++
Sbjct: 209 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCLDTINGG 266
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
I F +V P V +TPL Y + + +I VG L + +IDS
Sbjct: 267 GI-FAIGNVVQ-PKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 324
Query: 306 GTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA 362
GTTLT+LP+ Y +L+V + + + + LC+ + P++T HF
Sbjct: 325 GTTLTYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN- 380
Query: 363 DVKLSR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L+ ++F + +++ C F+ G+ + + + G+++ +N LV YD+E Q +
Sbjct: 381 DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIG 440
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 441 WTEYNCS 447
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 175/367 (47%), Gaps = 44/367 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP DTGSD++W C+ CP + L+DPK SST +
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ L + + C+YSV+YGDGS + G ++ + +G P
Sbjct: 64 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 123
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+TFGCG+ GG N GI+G G + S++SQ+ + AGK F++CL ++
Sbjct: 124 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCLDTINGG 181
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
I F +V P V +TPL Y + + +I VG L + +IDS
Sbjct: 182 GI-FAIGNVVQ-PKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 239
Query: 306 GTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA 362
GTTLT+LP+ Y +L+V + + + + LC+ + P++T HF
Sbjct: 240 GTTLTYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN- 295
Query: 363 DVKLSR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L+ ++F + +++ C F+ G+ + + + G+++ +N LV YD+E Q +
Sbjct: 296 DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIG 355
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 356 WTEYNCS 362
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 109/369 (29%), Positives = 174/369 (47%), Gaps = 48/369 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IGTP DTGSD++W C+ CP + ++DP+ S + + +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C C + SC+ + C+YS+SYGDGS + G T+ + +G P
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
++FGCG GG S GI+G G + S++SQ+ AGK F++CL V+
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAA--AGKVRKMFAHCLDTVNGG 267
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDS 305
I F +V P V +TPL Y + + I VG LG+ T +IDS
Sbjct: 268 GI-FAIGNVVQ-PKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDS 325
Query: 306 GTTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR 360
GTTL ++P+G L +++ I Q + D + C+ ++ PEVT HF
Sbjct: 326 GTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFE 380
Query: 361 GADVKL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQT 412
G DV L S ++ + +++ C F+ G+ + + G+++ +N LV YD+E Q
Sbjct: 381 G-DVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQA 439
Query: 413 VSFKPTDCT 421
+ + +C+
Sbjct: 440 IGWADYNCS 448
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 162/376 (43%), Gaps = 46/376 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I +GTPP DTGSD++W C CP D +DPK SS+ ++
Sbjct: 87 YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVS 146
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ G V C+YSV YGDGS + G T+ + TG PG
Sbjct: 147 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNA 206
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTKI 255
ITFGCG GG N GI+G G + S++SQ+ K F++CL + I
Sbjct: 207 TITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGI 266
Query: 256 --------------NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----- 296
F +G+++ P + + ++ Y + + +I VG L +
Sbjct: 267 FAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVF 326
Query: 297 ---STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP 353
+IDSGTTLT+LP+ ++ V+ S + L YS + P
Sbjct: 327 ETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFLCFQYSGSVDDGFP 386
Query: 354 EVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVG 405
+T HF D+ L +F DI C F+ G S + + G+++ +N LV
Sbjct: 387 TITFHFE-DDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVV 445
Query: 406 YDIEQQTVSFKPTDCT 421
YD+E Q + + +C+
Sbjct: 446 YDLENQVIGWTDYNCS 461
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 122/441 (27%), Positives = 187/441 (42%), Gaps = 51/441 (11%)
Query: 28 GFSVELIHRDSPK----SPFYNSSETPYQRLR------DALTRSLNRLNHFNQNSSISSS 77
G E+ H SPK S F ++ R +A + ++ L H + + S
Sbjct: 42 GVWFEMFHMHSPKLKSQSKFLGPPKSRLDGTRQLLQSDNARRQMISSLRHGTRRKAFEVS 101
Query: 78 KASQADIIPN----NANYLIRISIGTP-PTERLAVADTGSDLIWTQCE----PCPPSQCY 128
+Q I + Y + I IGTP P + + V DTGSDL W CE CP +
Sbjct: 102 HTAQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPH 161
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGN 181
+F SS+++++PCSS C SL + C + Y +G + G
Sbjct: 162 --PGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGV 219
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
A ETVT+G + + L + GC T + N G++GLG SL ++
Sbjct: 220 FANETVTVGLNDHKKIRLFDVLIGC-TESFNETNGFPDGVMGLGYRKHSLALRLAEIFGN 278
Query: 242 KFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRL 294
KFSYCLV S+ ++FG + P + T L FY + + ISVG L
Sbjct: 279 KFSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSML 338
Query: 295 GVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL---C 343
+S+ +++DSGT+LT L ++ + + + P EL C
Sbjct: 339 SISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFC 398
Query: 344 YSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQ 399
+ + VP + IHF GA K ++ + V+E I C + K I GN+MQ
Sbjct: 399 FEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQ 458
Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
N L YD+ + + F P+ C
Sbjct: 459 QNHLWEYDLGRGKLGFGPSSC 479
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 187/396 (47%), Gaps = 53/396 (13%)
Query: 57 ALTRSLNRLNHFN----QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
A+ RS +RL+ N+ + +++Q + + +Y + IGTP T ADTGS
Sbjct: 54 AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
DLIWT+C C ++C + SP + P SS+ + C C L + CS V
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171
Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
NC Y +YG+ ++ G L TET T G A A PGI FGC + G F + +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVP--VSSTKINFGTNGIVSGPG--------VVS 270
+VGLG G +SL++Q+ F Y L + + I+FG+ V+G +++
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLT 284
Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNS 318
P+ + FY + + ISVG + + + ++ DSGTTLT LP Y
Sbjct: 285 NPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344
Query: 319 NLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIHFR-GADVKLSRSNFFVKV- 375
++S M +P +C++ +S + P + +HF GAD+ LS N+ ++
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404
Query: 376 ---SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
E C + ++ I GNIMQ +F V +D+
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 124/425 (29%), Positives = 205/425 (48%), Gaps = 64/425 (15%)
Query: 52 QRLRDALTR-----SLNRLNHFN-QNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
+++R++L+R N+ NH + + + +S S + + A + +++ IG+
Sbjct: 55 EQVRESLSRIQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLS 114
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-- 163
A+ DTGS+ + QC + P+FDP S +Y+ +PC S C ++ Q++ +G
Sbjct: 115 AIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSS 166
Query: 164 -------VNCQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTN-NGGL 213
C YS+SYGD S G+ + + + L ST +GQAV + FGC + G L
Sbjct: 167 QPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFL 226
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCL-----VPVSSTKINFGTNGI----V 263
+ + GIVG G++SL SQ++ + G KFSYC P ++ I G +G+ V
Sbjct: 227 VDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKV 286
Query: 264 SGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV---------STPD--IVIDSGTTLT- 310
++ P+T A++ Y + + +ISV + L + ST D V+DSGTT T
Sbjct: 287 GYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTR 346
Query: 311 FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFR-GADV 364
+ Y + N + + + V G + CY+ +SL VPEV + + +
Sbjct: 347 VVDDAYTAFRNAFAASNRSGLRKKVGAAAG-FDDCYNISAGSSLPGVPEVRLSLQNNVRL 405
Query: 365 KLSRSNFFVKVS----EDIVC----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+L + FV VS E VC S K + + GN Q+N+LV YD E+ V F+
Sbjct: 406 ELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFE 465
Query: 417 PTDCT 421
DC+
Sbjct: 466 RADCS 470
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 173/365 (47%), Gaps = 38/365 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTPP DTGSD++W C+ CP D L+DPK SS+ ++
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 148 CSSSQCASL---NQK--SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---A 198
C + CA+ +K C +G C+Y YGDGS + G+ ++++ +G A A
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRHA 206
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSST 253
+ FGCG GG N GI+G G + S +SQ+ + + FS+CL +
Sbjct: 207 KANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKGG 266
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
I F +V P V STPL + Y + + +I V L + P I +ID
Sbjct: 267 GI-FAIGEVVQ-PKVKSTPLLPNMSHYNVNLQSIDVAGNALQLP-PHIFETSEKRGTIID 323
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADV 364
SGTTLT+LP+ ++L+ + + G L YS + P++T HF D+
Sbjct: 324 SGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGFLCFEYSESVDDGFPKITFHFE-DDL 382
Query: 365 KLSR--SNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
L+ ++F + +++ C F+ + + G+++ +N +V YD+E+Q + +
Sbjct: 383 GLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLEKQVIGWT 442
Query: 417 PTDCT 421
+C+
Sbjct: 443 DYNCS 447
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 179/393 (45%), Gaps = 41/393 (10%)
Query: 67 HFNQNSSISSSKASQADI------IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQ- 118
H +S+ + AD+ +P + Y I IGTPP + DTGSD++W
Sbjct: 52 HLTHDSNRRGRLLAAADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNC 111
Query: 119 --CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG----VNCQYSVSY 172
C CP D L+DPK SS+ ++ C CA+ G + C+YSV Y
Sbjct: 112 ISCNKCPRKSDLGIDLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMY 171
Query: 173 GDGSFSNGNLATETVTLGSTTGQAV---ALPGITFGCGTNNGGLF---NSKTTGIVGLGG 226
GDGS + G ++++ +G A + FGCG GG N GI+G G
Sbjct: 172 GDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQ 231
Query: 227 GDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTI 284
+ S++SQ+ + FS+CL + I F +V P V STPL Y + +
Sbjct: 232 SNTSMLSQLAAAGEVKKIFSHCLDTIKGGGI-FAIGDVVQ-PKVKSTPLVPDMPHYNVNL 289
Query: 285 DAISVGNQRLGV--------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
++I+VG L + +IDSGTTLT+LP+ ++L+ + +
Sbjct: 290 ESINVGGTTLQLPSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSV 349
Query: 337 TGSLELCYSFNSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT---- 388
L + Y + P++T HF D+ L+ ++F + +++ C F+ G+
Sbjct: 350 QDFLCIQYFQSVDDGFPKITFHFE-DDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDG 408
Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ + G+++ +N +V YD+E Q V + +C+
Sbjct: 409 KDMVLLGDLVLSNKVVVYDLENQVVGWTDYNCS 441
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 173/400 (43%), Gaps = 72/400 (18%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-------------PSQCYMQDSPLFD 136
Y +R +GTP L VADTGSDL W +C P+ F
Sbjct: 86 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFR 145
Query: 137 PKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTL 189
P S T+ +PCSS+ C SL + C Y Y DGS + G + + T+ L
Sbjct: 146 PDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIAL 205
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV- 248
+ L G+ GC T+ G + G++ LG +IS S+ + G+FSYCLV
Sbjct: 206 SGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVD 265
Query: 249 ---PVSSTK-INFGTNGIVS----GPGVVS-------------------TPLT---KAKT 278
P ++T + FG N S G+ S TPL + +
Sbjct: 266 HLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRP 325
Query: 279 FYVLTIDAISVGNQRLGV--STPDI------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
FY +T+ +SV + L + + D+ ++DSGT+LT L + +++ +S +
Sbjct: 326 FYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAG 385
Query: 331 QP--VADPTGSLELCYSFNSLS------QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC 381
P DP + CY++ S S +P + +HF G A ++ ++ + + + C
Sbjct: 386 LPRVTMDP---FDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKC 442
Query: 382 -SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ +G + + GNI+Q L YD++ + + FK + C
Sbjct: 443 IGLQEGPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 187/396 (47%), Gaps = 53/396 (13%)
Query: 57 ALTRSLNRLNHFN----QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
A+ RS +RL+ N+ + +++Q + + +Y + IGTP T ADTGS
Sbjct: 54 AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
DLIWT+C C ++C + SP + P SS+ + C C L + CS V
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171
Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
NC Y +YG+ ++ G L TET T G A A PGI FGC + G F + +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227
Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVP--VSSTKINFGTNGIVSGPG--------VVS 270
+VGLG G +SL++Q+ F Y L + + I+FG+ V+G +++
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLT 284
Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNS 318
P+ + FY + + ISVG + + + ++ DSGTTLT LP Y
Sbjct: 285 NPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344
Query: 319 NLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIHFR-GADVKLSRSNFFVKV- 375
++S M +P +C++ +S + P + +HF GAD+ LS N+ ++
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404
Query: 376 ---SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
E C + ++ I GNIMQ +F V +D+
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 124/423 (29%), Positives = 189/423 (44%), Gaps = 47/423 (11%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN--SSISSSK 78
P+ GF EL H P+ SS + R + S R+ +S
Sbjct: 30 PVAGSDAGFRAELHH------PYAGSSLPVHDMWRRSARASKARVARLEARLTGDMSVPL 83
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A +D Y + I IGTPP +ADT SDL WTQC + Q PLFDP
Sbjct: 84 ARISD-----EGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLF--NDTAKQVEPLFDPA 136
Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
SS++ + CSS C N K CS C+Y Y + G LA E+ TL S Q
Sbjct: 137 KSSSFAFVTCSSKLCTEDNPGTKRCSNKTCRYVYPYVSVE-AAGVLAYESFTL-SDNNQH 194
Query: 197 VALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK- 254
+ + FGCG +G L + +GI+G+ +S++SQ+ KFSYCL P + K
Sbjct: 195 ICM-SFGFGCGALTDGNLLGA--SGILGMSPAILSMVSQLAIP---KFSYCLTPYTDRKS 248
Query: 255 --INFGTNGIVSGPGVVSTPLTKAKTF-YVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF 311
+ FG + G + P+ K+ TF Y + + +S+G +RL V + G T+
Sbjct: 249 SPLFFGAWADL-GRYKTTGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQGGTVVD 307
Query: 312 LPQGYNSNLLSVMSSMIEA------QPVADPT-GSLELCYSFNS-----LSQVPEVTIHF 359
L +++ EA P+ + T ++C++ S Q P + ++F
Sbjct: 308 LGCTVGQLAEPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYF 367
Query: 360 R-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
GAD+ L R N+F + + ++C ++ G + I GN+ Q NF + +D+ F P
Sbjct: 368 DGGADMVLPRDNYFQEPTAGLMCLALVPG--GGMSIIGNVQQQNFHLLFDVHDSKFLFAP 425
Query: 418 TDC 420
T C
Sbjct: 426 TIC 428
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 166/358 (46%), Gaps = 40/358 (11%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A D +L+WTQC C C+ QD P+F P SST+K
Sbjct: 24 NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 77
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC + C S+ C+ C + G G + G +AT+T +G+ A + FGC
Sbjct: 78 PCGTDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 132
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNGIV 263
+ +G +GLG SL++QM+ T +FSYCL P + +++ G + +
Sbjct: 133 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASAKL 189
Query: 264 SG-----PGVVSTPLTKAKTFYVLTIDAISVGNQ-------RLGVSTPDIVIDSGTTLTF 311
+G P V ++P +Y + ++ I G+ R V V+ +
Sbjct: 190 AGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDS 249
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
+ Q + +++ + + A PV +P E+C+ +S P++ F+ GA + + +N
Sbjct: 250 VYQEFKKAVMASVGAAPTATPVGEP---FEVCFPKAGVSGAPDLVFTFQAGAALTVPPAN 306
Query: 371 FFVKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ V D VC I + + I G+ Q N + +D+++ +SF+P DC+
Sbjct: 307 YLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 364
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 168/370 (45%), Gaps = 41/370 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD---SPLFDPKMSSTYKSL 146
Y +R+ +GTP + VADTGSDL W +C S +F P S ++ L
Sbjct: 103 QYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPL 162
Query: 147 PCSSSQCAS---LNQKSCSGVN--CQYSVSYGDGSFSNG--NLATETVTLGSTTG-QAVA 198
PC S C S + +CS C Y Y D S + G L + TV+L G +
Sbjct: 163 PCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAK 222
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTK 254
L + GC T+ G + G++ LG +IS S+ + G+FSYCLV P ++T
Sbjct: 223 LQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATS 282
Query: 255 -INFGTNGIVSGPGVV--STPL-----TKAKTFYVLTIDAISVGNQRLGVSTPDI----- 301
+ FG G TPL + + FY +++DA++V +RL + PD+
Sbjct: 283 FLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEI-LPDVWDFRK 341
Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS-QVPE 354
++DSGT+LT L ++ +S P DP E CY++ +S ++P
Sbjct: 342 NGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDP---FEYCYNWTGVSAEIPR 398
Query: 355 VTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
+ + F G A + ++ + + + C V +G V + GNI+Q L +D+ +
Sbjct: 399 MELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFDLANRW 458
Query: 413 VSFKPTDCTK 422
+ FK + C
Sbjct: 459 LRFKQSRCAH 468
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 163/367 (44%), Gaps = 41/367 (11%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM---QDSPLFDPKMSSTY 143
+ + + IS+GTPP L DTGS L W C+ C S C+ + +FDP S+TY
Sbjct: 71 HEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQIS-CHTTAPEAGSVFDPDKSTTY 129
Query: 144 KSLPCSSSQCASLNQKSCSGVN-------CQYSVSYG---DGSFSNGNLATETVTLGSTT 193
+ + CSS CA + + + C YS+ YG G +S G L T+ +TL S++
Sbjct: 130 ELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSS 189
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSS 252
+ G FGC ++ F +G++G GG + S +Q+ R T FSYC P
Sbjct: 190 S---IIDGFIFGCSGDDS--FKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCF-PGDH 243
Query: 253 TKINFGTNGIVSGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPD-----IVID 304
T F + G +V T P ++ Y L + V RL V + +V+D
Sbjct: 244 TAEGFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQSEYTKRMMVVD 303
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-----PEVTIHF 359
SGT TFL M+S ++A+ T E C+ N V P V + F
Sbjct: 304 SGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFRPNGGDSVDSGDLPTVEMRF 363
Query: 360 RGADVKLSRSNFFVKV--SEDIVCSVFK----GITNSVPIYGNIMQTNFLVGYDIEQQTV 413
G +KL N F + S D +C FK G+ N V I GN +F V YD++
Sbjct: 364 IGTTLKLPPENVFHDLLPSHDKICLAFKPDVAGVRN-VQILGNKATXSFRVVYDLQAMYF 422
Query: 414 SFKPTDC 420
F+ C
Sbjct: 423 GFQAGAC 429
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 128/234 (54%), Gaps = 21/234 (8%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA-----LTRSLNRLNHFNQNSSI 74
SP + T S++L R S S S T + RD+ +T LN+ + ++ S
Sbjct: 61 SPFTSSTSTLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLNQNFNTDKLSGP 120
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
S SQ + Y RI IG PP++ V DTGSD+ W QC PC + CY Q P+
Sbjct: 121 IISGTSQG-----SGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPC--ADCYRQADPI 173
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
F+P S++Y L C ++QC L+Q C NC Y VSYGDGS++ G+ TETVT+G
Sbjct: 174 FEPTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIGVNKV 233
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+ VAL GCG NN GLF G++GLGGG +S +Q+ +T FSYCLV
Sbjct: 234 KNVAL-----GCGHNNEGLF-VGAAGLIGLGGGPLSFPAQLNST---SFSYCLV 278
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/394 (27%), Positives = 172/394 (43%), Gaps = 69/394 (17%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSP------------LFD 136
Y +R +GTP + +ADTGSDL W +C PS SP +F
Sbjct: 109 QYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFR 168
Query: 137 PKMSSTYKSLPCSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG- 190
P S T+ +PCSS C S L S S C Y Y D S + G + T++ T+
Sbjct: 169 PGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVAL 228
Query: 191 -------STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
+ L G+ GC T + G + G++ LG +IS S+ + G+F
Sbjct: 229 SGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISFASRAASRFGGRF 288
Query: 244 SYCLV----PVSSTK-INFGTNGIVSGPGVVS---------TPL---TKAKTFYVLTIDA 286
SYCLV P ++T + FG +GP S TPL + + FY + +D+
Sbjct: 289 SYCLVDHLAPRNATSYLTFG-----AGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDS 343
Query: 287 ISVGNQRLGV--------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADP 336
+SV L + S +IDSGT+LT L +++ +S + P DP
Sbjct: 344 VSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAMDP 403
Query: 337 TGSLELCYSFNSLSQ------VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGIT 388
+ CY++ + VP++ + F G A ++ ++ + + + C V +G
Sbjct: 404 ---FDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAW 460
Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
V + GNI+Q L +D+ + + F+ T CT+
Sbjct: 461 PGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCTQ 494
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 111/349 (31%), Positives = 149/349 (42%), Gaps = 53/349 (15%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
CA L + + + G A+ G FGCG
Sbjct: 199 GGPVCAGL------------------------GIYAASACSAAQCG---AVQGFFFGCGH 231
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV-SG 265
GLFN G++GLG SL+ Q T G FSYCL P ++ + G G +
Sbjct: 232 AQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAA 290
Query: 266 PGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQGYNS 318
PG +T P A T+YV+ + ISVG Q+L V + LP +
Sbjct: 291 PGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYA 350
Query: 319 NLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFV 373
L S S + + P A G L+ CY+F V P V + F GA V L
Sbjct: 351 ALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL- 409
Query: 374 KVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
C F G + I GN+ Q +F V I+ +V FKP+ C
Sbjct: 410 ----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 452
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 176/384 (45%), Gaps = 72/384 (18%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N I ++IG+PP V DTGS+L W C+ P + F+P +SS+Y
Sbjct: 55 HNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 108
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PC+SS C + + SC N C VSY D S + G LA ET +L A
Sbjct: 109 PCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 163
Query: 200 PGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
PG FGC + G ++KTTG++G+ G +SL++QM + KFSYC+ S +
Sbjct: 164 PGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCI----SGED 216
Query: 256 NFGTNGIVSGPGVVS----TPLTKAKT--------FYVLTIDAISVGNQRL----GVSTP 299
FG + GP S TPL A T Y + ++ I V + L V P
Sbjct: 217 AFGVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVP 276
Query: 300 D------IVIDSGTTLTFLPQG-YNS---NLLSVMSSMIEAQPVADPT----GSLELCYS 345
D ++DSGT TFL YNS L ++ + DP G+++LCY
Sbjct: 277 DHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTR--IEDPNFVFEGAMDLCYH 334
Query: 346 F-NSLSQVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFK-----GITNSVPIYGN 396
SL+ VP VT+ F GA++++S +VS+ + C F GI V G+
Sbjct: 335 APASLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYV--IGH 392
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
Q N + +D+ + V F T C
Sbjct: 393 HHQQNVWMEFDLVKSRVGFTETTC 416
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 161/374 (43%), Gaps = 84/374 (22%)
Query: 83 DIIPNN------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
D PNN N+L+ ++ GTPP + DTGS + WTQC+ C
Sbjct: 114 DHTPNNKLFDEDGNFLVDVAFGTPPQNFTLILDTGSSITWTQCKACT------------- 160
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
V Y+++YGD S S GN +T+TL +
Sbjct: 161 ---------------------------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD--- 190
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KI 255
FG G NN G F S G++GLG G +S +SQ + FSYCL S +
Sbjct: 191 -VFQKFQFGRGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSL 249
Query: 256 NFGTNG-----------IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STP 299
FG +V+GPG + + +Y + + ISVGN+RL + ++P
Sbjct: 250 LFGEKATSQSSSLKFTSLVNGPGTL-----QESGYYFVNLSDISVGNERLNIPSSVFASP 304
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--P 353
+IDS T +T LPQ S L + + P+++ L+ CY+ + V P
Sbjct: 305 GTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLP 364
Query: 354 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYD 407
E+ +HF GADV+L+ +N E +C F G + S + I GN Q + V YD
Sbjct: 365 EIVLHFGGGADVRLNGTNIVWGSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYD 424
Query: 408 IEQQTVSFKPTDCT 421
I+ + F+ C+
Sbjct: 425 IQGGRIGFRSNGCS 438
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 126/421 (29%), Positives = 171/421 (40%), Gaps = 67/421 (15%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R + + S S A+ A
Sbjct: 68 LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
+P NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P SS+Y ++PC CA L + + + G
Sbjct: 187 PAQSSSYAAVPCGGPVCAGL------------------------GIYAASACSAAQCG-- 220
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTK 254
A+ G FGCG GLFN G++GLG SL+ Q T G FSYCL P ++
Sbjct: 221 -AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 278
Query: 255 INFGTNGIV-SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT 310
+ G G + PG +T P A T+YV+ + ISVG Q+L V +
Sbjct: 279 LTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG 338
Query: 311 F----LPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RG 361
LP + L S S + + P A G L+ CY+F V P V + F G
Sbjct: 339 TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSG 398
Query: 362 ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
A V L C F G + I GN+ Q +F V I+ +V FKP+
Sbjct: 399 ATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSS 451
Query: 420 C 420
C
Sbjct: 452 C 452
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 170/386 (44%), Gaps = 57/386 (14%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +R +GTP L VADTGSDL W +C S+ F P+ S T+ +
Sbjct: 93 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPIS 152
Query: 148 CSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----STTGQAVA 198
C+S C SL G C Y Y DGS + G + TE+ T+ +
Sbjct: 153 CASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAK 212
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTK 254
L G+ GC ++ G + G++ LG D+S S + AG+FSYCLV P ++T
Sbjct: 213 LKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATS 272
Query: 255 -INFGTN--------------------GIVSGPGVVSTPL---TKAKTFYVLTIDAISVG 290
+ FG N P TPL + + FY + + A+SV
Sbjct: 273 YLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVA 332
Query: 291 NQRLGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSL 340
Q L + +++DSGT+LT L + +++ +S + P DP
Sbjct: 333 GQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTMDP---F 389
Query: 341 ELCYSFNSLS---QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYG 395
E CY++ S S +P++ +HF G A ++ ++ + + + C + +G + + G
Sbjct: 390 EYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIG 449
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
NI+Q L +DI+ + + F+ + CT
Sbjct: 450 NILQQEHLWEFDIKNRRLKFQRSRCT 475
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/198 (42%), Positives = 109/198 (55%), Gaps = 12/198 (6%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQA----DIIPNNANYLIRISIGTPPTERLAVA 108
R +A S++ N +S +K+++ II + NY++ I IGTP + +
Sbjct: 92 RRDEARVESIHSKLSKNIADEVSKAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISLMF 151
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSDL WTQCEPC S CY Q P F+P SS+Y ++ CSS C N +SCS NC Y
Sbjct: 152 DTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSSYHNVSCSSPMCG--NPESCSASNCLY 208
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
+ YGDGS + G LA E TL ++ L I FGCG NN G+F + GI+GLG G
Sbjct: 209 GIGYGDGSVTVGFLAKEKFTLTNSD----VLDDIYFGCGENNKGVFIG-SAGILGLGPGK 263
Query: 229 ISLISQMRTTIAGKFSYC 246
S Q TT FSYC
Sbjct: 264 FSFPLQTTTTYNNIFSYC 281
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 174/379 (45%), Gaps = 54/379 (14%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
A + Y+ IG+PP A+ DTGSDLIWTQC C P C Q P ++ S
Sbjct: 77 AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQS 136
Query: 141 STYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
ST+ +PC+ CA+ C G++ C + SYG G G+L TE+ S T
Sbjct: 137 STFVPVPCADKAGFCAANGVHLC-GLDGSCTFIASYGAGRVI-GSLGTESFAFESGT--- 191
Query: 197 VALPGITFGCGT----NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-- 250
+ FGC + +G L + +G++GLG G +SL+SQ+ T +FSYCL P
Sbjct: 192 ---TSLAFGCVSLTRITSGAL--NDASGLIGLGRGRLSLVSQIGAT---RFSYCLTPYFH 243
Query: 251 --SSTKINFGTNGIVSGPGVVSTPLTKA------KTFYVLTIDAISVGNQRL-------- 294
++ F G G S P K+ TFY L ++ I+VG RL
Sbjct: 244 SSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTF 303
Query: 295 -------GVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS-LELCYS 345
G ++ID+G+ LT L Y + V + + V P S LELC +
Sbjct: 304 QLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVA 363
Query: 346 FNSLSQ-VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNF 402
+ VP + HF GAD+ + ++++ V + C + +G +S I GN Q +
Sbjct: 364 REGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYDS--IIGNFQQQDM 421
Query: 403 LVGYDIEQQTVSFKPTDCT 421
+ YD+ + SF+ DCT
Sbjct: 422 HLLYDLRRGRFSFQTADCT 440
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 175/383 (45%), Gaps = 65/383 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++G+PP V DTGS+L W C+ P +FDP SS+Y +
Sbjct: 52 HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 105
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PC+S C + + SC C +SY D S GNLA++T +G++ A+P
Sbjct: 106 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 160
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
FGC G ++ +SKTTG++G+ G +S ++QM KFSYC+ S+ I
Sbjct: 161 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILL 217
Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
FG + + TPL + T Y + ++ I V N L V PD
Sbjct: 218 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 277
Query: 301 --IVIDSGTTLTFLPQGYNSNLLS--VMSSMIEAQPVADPT----GSLELCYSF----NS 348
++DSGT TFL + L + V + + + DP G+++LCY +
Sbjct: 278 GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRT 337
Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFK-----GITNSVPIYGNI 397
L +P VT+ FRGA++ +S +V S+ + C F G+ + I G+
Sbjct: 338 LPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESY--IIGHH 395
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q N + +D+ + V F C
Sbjct: 396 HQQNVWMEFDLAKSRVGFAEVRC 418
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 171/365 (46%), Gaps = 39/365 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTPP DTGSD++W QC+ CP D L+D K SS+ K +P
Sbjct: 85 YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVP 144
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
C C +N +G ++C Y YGDGS + G + V +G A
Sbjct: 145 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 204
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
I FGCG G +S GI+G G + S+ISQ+ ++ + F++CL V+
Sbjct: 205 SIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 264
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
I F +V P V TPL + Y + + A+ VG+ L +ST +IDSG
Sbjct: 265 I-FAIGHVVQ-PKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGTIIDSG 322
Query: 307 TTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GA 362
TTL +LP+G Y + ++S ++ + + D + YS + P VT +F G
Sbjct: 323 TTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTCFQ--YSESVDDGFPAVTFYFENGL 380
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+K+ ++ S D C ++ + ++ + G+++ +N LV YD+E Q + +
Sbjct: 381 SLKVYPHDYLFP-SGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWT 439
Query: 417 PTDCT 421
+C+
Sbjct: 440 EYNCS 444
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 169/365 (46%), Gaps = 39/365 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTPP DTGSD++W QC+ CP D L+D K SS+ K +P
Sbjct: 83 YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVP 142
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
C C +N +G ++C Y YGDGS + G + V +G A
Sbjct: 143 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 202
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
I FGCG G +S GI+G G + S+ISQ+ ++ + F++CL V+
Sbjct: 203 SIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 262
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
I F +V P V TPL + Y + + A+ VG+ L +ST +IDSG
Sbjct: 263 I-FAIGHVVQ-PKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSG 320
Query: 307 TTLTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GA 362
TTL +LP+G L+ M S ++ Q + D + YS + P VT F G
Sbjct: 321 TTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYTCFQ--YSESVDDGFPAVTFFFENGL 378
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+K+ ++ S + C ++ + ++ + G+++ +N LV YD+E Q + +
Sbjct: 379 SLKVYPHDYLFP-SVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWA 437
Query: 417 PTDCT 421
+C+
Sbjct: 438 EYNCS 442
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 175/383 (45%), Gaps = 65/383 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++G+PP V DTGS+L W C+ P +FDP SS+Y +
Sbjct: 59 HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 112
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PC+S C + + SC C +SY D S GNLA++T +G++ A+P
Sbjct: 113 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 167
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
FGC G ++ +SKTTG++G+ G +S ++QM KFSYC+ S+ I
Sbjct: 168 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILL 224
Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
FG + + TPL + T Y + ++ I V N L V PD
Sbjct: 225 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 284
Query: 301 --IVIDSGTTLTFLPQGYNSNLLS--VMSSMIEAQPVADPT----GSLELCYSF----NS 348
++DSGT TFL + L + V + + + DP G+++LCY +
Sbjct: 285 GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRT 344
Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFK-----GITNSVPIYGNI 397
L +P VT+ FRGA++ +S +V S+ + C F G+ + I G+
Sbjct: 345 LPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESY--IIGHH 402
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q N + +D+ + V F C
Sbjct: 403 HQQNVWMEFDLAKSRVGFAEVRC 425
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 115/419 (27%), Positives = 181/419 (43%), Gaps = 91/419 (21%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSP------------- 133
Y +R +GTP L VADTGSDL W +C + P+ Y +P
Sbjct: 106 QYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAA 165
Query: 134 ---------LFDPKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSN 179
+F P S T+ +PCSS C SL G C Y Y DGS +
Sbjct: 166 AASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAAR 225
Query: 180 GNLATETVTL-----GSTTGQAVA-LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G + T++ T+ G+ Q A L G+ GC T+ G + G++ LG +IS S
Sbjct: 226 GTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISFAS 285
Query: 234 QMRTTIAGKFSYCLV----PVSSTK-INFGTNGIVSG---------------------PG 267
+ G+FSYCLV P ++T + FG N VS G
Sbjct: 286 RAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPGG 345
Query: 268 VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQ 314
TPL + + FY +T++ ISV + L + P +V +DSGT+LT L
Sbjct: 346 ARQTPLLLDHRMRPFYAVTVNGISVDGELLRI--PRLVWDVAKGGGAILDSGTSLTVLVS 403
Query: 315 GYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS-------QVPEVTIHFRG-ADV 364
+++ ++ + P DP + CY++ S S +PE+ +HF G A +
Sbjct: 404 PAYRAVVAALNKKLAGLPRVTMDP---FDYCYNWTSPSTGEDLTVAMPELAVHFAGSARL 460
Query: 365 KLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ ++ + + + C + +G V + GNI+Q L +D++ + + FK + CT+
Sbjct: 461 QPPAKSYVIDAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCTQ 519
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 106/343 (30%), Positives = 149/343 (43%), Gaps = 74/343 (21%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS 270
GLF T G++GLG +G ++G
Sbjct: 292 RGLFGG-TAGLMGLG---------------------------------PDGALAG----- 312
Query: 271 TPLTKAKTFYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-PQGYNSNLLSVMSS 326
P FY + + G+ ++++DSGT +T L P Y +
Sbjct: 313 LPDGAPPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQ 372
Query: 327 M-IEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSED--IV 380
E P A P L+ CY+ +V P +T+ G AD+ + + +D V
Sbjct: 373 FGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQV 432
Query: 381 CSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
C ++ + PI GN Q N V YD + F DC+
Sbjct: 433 CLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 475
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 37/327 (11%)
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSN 179
Q M P FD SST C S+ C L SC C Y+ Y D S +
Sbjct: 168 QQNMHALPYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTT 227
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G L + T G+ ++PG+ FGCG N G+F S TGI G G G +SL SQ++
Sbjct: 228 GLLEVDKFTFGA----GASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV-- 281
Query: 240 AGKFSYCLVPVSSTK-----INFGTNGIVSGPGVV-STPLTKAK---TFYVLTIDAISVG 290
G FS+C V+ K ++ + +G G V STPL + T Y L++ I+VG
Sbjct: 282 -GNFSHCFTAVNGLKQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVG 340
Query: 291 NQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
+ RL V T +IDSGT++T LP + ++ I+ V
Sbjct: 341 STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY 400
Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSED----IVCSVFKGITNSVPIYG 395
C+S S ++ VP++ +HF GA + L R N+ +V +D ++C + + G
Sbjct: 401 TCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIG 460
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N Q N V YD++ +SF C K
Sbjct: 461 NFQQQNMHVLYDLQNNMLSFVAAQCDK 487
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 63/132 (47%), Gaps = 18/132 (13%)
Query: 287 ISVGNQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT 337
I+VG+ RL V T +IDSGT++T LP + ++ I+ V
Sbjct: 42 ITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNA 101
Query: 338 GSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSED----IVC-SVFKGITNS 390
C+S S ++ VP++ +HF GA + L R N+ +V +D I+C ++ KG +
Sbjct: 102 TGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG--DE 159
Query: 391 VPIYGNIMQTNF 402
I GN Q N
Sbjct: 160 TTIIGNFQQQNM 171
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 174/373 (46%), Gaps = 48/373 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
++ IGTPP E L + DT S+L W Q C + C P F+P +SS++ S PC+SS
Sbjct: 1 MQTKIGTPPREVLLLVDTASELTWVQGTSC--TNCSPTKVPPFNPGLSSSFISEPCTSSV 58
Query: 153 CASLN----QKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C + Q +C S +C + V+Y DGS + G +A E +L S G A L + FGC
Sbjct: 59 CLGRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGC 118
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM----RTTIAGKFSYCLVPVSSTKIN------ 256
+ + ++G +GL G S +Q+ ++ ++ +FSYC P + +N
Sbjct: 119 ASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCF-PNRAEHLNSSGVII 177
Query: 257 FGTNGIVSGPGVV-----STPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVI-------- 303
FG +GI + P+ FY + + ISVG + L + I
Sbjct: 178 FGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGT 237
Query: 304 --DSGTTLTFLPQGYNSNLLSVM-SSMIEAQPVADPTGSLELCYSFNS----LSQVPEVT 356
DSGTT++FL + ++ L+ ++ + + ELCY + L P VT
Sbjct: 238 YFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVT 297
Query: 357 IHFR-GADVKLSRSNFFVKVSED----IVCSVFKG----ITNSVPIYGNIMQTNFLVGYD 407
+HF+ D++L ++ +V ++ +C F V + GN Q ++L+ +D
Sbjct: 298 LHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHD 357
Query: 408 IEQQTVSFKPTDC 420
+E+ + F P +C
Sbjct: 358 LERSRIGFAPANC 370
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 185/394 (46%), Gaps = 38/394 (9%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
R L R +RL H SS A D + N Y R+ IG+PP E + DTGS
Sbjct: 52 RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
+ + C C QC P F P++SSTY+ + C++ N GV C Y Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISL 231
+ S S+G LA + ++ G + + FGC T +G L+ + GI+GLG G +S+
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221
Query: 232 ISQM--RTTIAGKFSYCL--VPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDA 286
+ Q+ + ++ FS C + V + G GI S PG+V + +++ +Y + +
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLG--GISSPPGMVFSHSDPSRSPYYNIELKE 279
Query: 287 ISVGNQ--RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS 339
I V + +L T D ++DSGTT + P+ Y + ++M + + ++ P +
Sbjct: 280 IHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPN 339
Query: 340 L-ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGIT 388
++C+S L +V PEV + F G + LS N+ KVS +FK
Sbjct: 340 FKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGN 399
Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ + G I+ N LV Y+ E T+ F T+C++
Sbjct: 400 DQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 182/392 (46%), Gaps = 34/392 (8%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
R L R +RL H SS A D + N Y R+ IG+PP E + DTGS
Sbjct: 52 RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
+ + C C QC P F P++SSTY+ + C++ N GV C Y Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISL 231
+ S S+G LA + ++ G + + FGC T +G L+ + GI+GLG G +S+
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221
Query: 232 ISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAIS 288
+ Q+ + ++ FS C + GI S PG+V + +++ +Y + + I
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIH 281
Query: 289 VGNQ--RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL- 340
V + +L T D ++DSGTT + P+ Y + ++M + + ++ P +
Sbjct: 282 VAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK 341
Query: 341 ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNS 390
++C+S L +V PEV + F G + LS N+ KVS +FK +
Sbjct: 342 DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQ 401
Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ G I+ N LV Y+ E T+ F T+C++
Sbjct: 402 TTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 172/367 (46%), Gaps = 41/367 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTG+D++W QC+ CP D L++ K SS+ K +P
Sbjct: 73 YYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVP 132
Query: 148 CSSSQCASLNQKSCSGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
C C +N +G +C Y YGDGS + G + V +G A A
Sbjct: 133 CDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASA 192
Query: 199 LPGITFGCGTNNGGLF----NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSS 252
+ FGCG G GI+G G + S+ISQ+ ++ + F++CL V+
Sbjct: 193 NGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNGVNG 252
Query: 253 TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVID 304
I F +V P V +TPL + Y + + AI VG+ L +ST +ID
Sbjct: 253 GGI-FAIGHVVQ-PTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSKGTIID 310
Query: 305 SGTTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR- 360
SGTTL +LP G Y + ++S ++ Q + D + YS + P VT +F
Sbjct: 311 SGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQ--YSGSVDDGFPNVTFYFEN 368
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
G +K+ ++ +SE++ C ++ + ++ + G+++ +N LV YD+E Q +
Sbjct: 369 GLSLKVYPHDYLF-LSENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIG 427
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 428 WTEYNCS 434
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 117/437 (26%), Positives = 195/437 (44%), Gaps = 52/437 (11%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A+ GG + IH +P+S + + S + +N + + ++S
Sbjct: 35 ARGGGIGFKAIHVAAPQSRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPA--ALRSSTTT 92
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ Y I +G+P E + + DTGS+L W QC PC C ++D S++Y
Sbjct: 93 LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPC--KVCAPSVDTIYDAARSASY 150
Query: 144 KSLPCSSSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAV 197
+ + C++SQ C++ +Q + G CQ++ YGDGSFS G+L+T+T+ + + G+ V
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN- 256
+ FGC + L + +GI+GL G ++L Q+ KFS+C P S+ +N
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLNS 269
Query: 257 -----FGTNGI----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD---IVID 304
FG + V V T + FY + + +S+ + L V P +++D
Sbjct: 270 TGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHEL-VFLPRGSVVILD 328
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQP------VADPTGSLELCYSFN----------- 347
SG++ + + ++S L + ++ +P D G L C+ +
Sbjct: 329 SGSSFSSFVRPFHSQL---REAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTL 385
Query: 348 -SLSQVPE--VTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNFL 403
SLS V E VTI V L + F V +C F+ G N V + GN Q N
Sbjct: 386 PSLSLVFEDGVTIGIPSIGVLLPVARFQNHVK---MCFAFEDGGPNPVNVIGNYQQQNLW 442
Query: 404 VGYDIEQQTVSFKPTDC 420
V YDI++ V F C
Sbjct: 443 VEYDIQRSRVGFARASC 459
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 165/371 (44%), Gaps = 51/371 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP + DTGSD++W C CP D L++PK SST +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G CQY V YGDGS + G + + L G
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKI 255
I FGCG G S + GI+G G + S+ISQ+ T + F++CL +S I
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS--------TPDIVIDSGT 307
F +V P + +TP+ + Y + ++ + VG+ L + +IDSGT
Sbjct: 253 -FAIGEVVE-PKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGT 310
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTI 357
TL +LP+ S L +M ++ AQP L+L C+ F N P VT
Sbjct: 311 TLAYLPE---SIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTF 361
Query: 358 HFRGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQ 410
F + + + + ++ +D+ C ++ N V + G+++ N LV Y++E
Sbjct: 362 KFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLEN 421
Query: 411 QTVSFKPTDCT 421
QT+ + +C+
Sbjct: 422 QTIGWTEYNCS 432
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 165/371 (44%), Gaps = 51/371 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP + DTGSD++W C CP D L++PK SST +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G CQY V YGDGS + G + + L G
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKI 255
I FGCG G S + GI+G G + S+ISQ+ T + F++CL +S I
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS--------TPDIVIDSGT 307
F +V P + +TP+ + Y + ++ + VG+ L + +IDSGT
Sbjct: 253 -FAIGEVVE-PKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGT 310
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTI 357
TL +LP +S L +M ++ AQP L+L C+ F N P VT
Sbjct: 311 TLAYLP---DSIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTF 361
Query: 358 HFRGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQ 410
F + + + + ++ +D+ C ++ N V + G+++ N LV Y++E
Sbjct: 362 KFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLEN 421
Query: 411 QTVSFKPTDCT 421
QT+ + +C+
Sbjct: 422 QTIGWTEYNCS 432
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 118/384 (30%), Positives = 175/384 (45%), Gaps = 46/384 (11%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
Q ++I S D I N + + IS+GTP L DTGS + W QC+ C CY
Sbjct: 3 QAANIPDSAVIGDDSIRKN-QFFMGISLGTPAVFNLVTIDTGSTISWVQCQYC-IVHCYT 60
Query: 130 QDS---PLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV-----NCQYSVSYGDGSFSN 179
QD P F+ SSTY+ + CS+ C ++ Q SG +C YS+ Y G +S
Sbjct: 61 QDQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSA 120
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTT 238
G L+ + +TL ++ ++ FGCG++N +N + GI+G G S +Q+ + T
Sbjct: 121 GYLSQDRLTLANS----YSIQKFIFGCGSDN--RYNGHSAGIIGFGNKSYSFFNQIAQLT 174
Query: 239 IAGKFSYCLVPVSSTKINFGTNGIVSGPGVV-STPLTKAKTF--------YVLTIDAISV 289
FSYC S + N G I GP V S L + F Y L + V
Sbjct: 175 NYSAFSYCF---PSNQENEGFLSI--GPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMV 229
Query: 290 GNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
RL V P V+DSGT TF+ L ++ + A+ + S E+C+
Sbjct: 230 NGMRLQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICF 289
Query: 345 SFN----SLSQVPEVTIHFRGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP---IYGN 396
N S++P V I F + +KL N F+ + S+ +CS F+ VP I GN
Sbjct: 290 HSNGDSVDWSKLPVVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGN 349
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
+F V +DI+Q+ F+ C
Sbjct: 350 RATRSFRVVFDIQQRNFGFEAGAC 373
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 177/396 (44%), Gaps = 67/396 (16%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
SSSK + + +N ++IGTPP V DTGS+L W +C+ P + +
Sbjct: 51 SSSKTTGKLLFHHNVTLTASLTIGTPPQNITMVLDTGSELSWLRCKKEP------NFTSI 104
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
F+P S TY +PCSS C + +C C + +SY D S G+LA ET
Sbjct: 105 FNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFR 164
Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
GS T P FGC G+++ ++KTTG++G+ G +S ++QM KFSY
Sbjct: 165 FGSLTR-----PATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSY 216
Query: 246 CLVPVSSTKINFGTNGIVSG-------PGV-VSTPLTK-AKTFYVLTIDAISVGNQRL-- 294
C+ + ST S P V +STPL + Y + ++ I V N+ L
Sbjct: 217 CISGLDSTGFLLLGEARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPL 276
Query: 295 --GVSTPD------IVIDSGTTLTFLPQGYNSNLLS--------VMSSMIEAQPVADPTG 338
V PD ++DSGT TFL S L V+ + E Q V G
Sbjct: 277 PKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQ--G 334
Query: 339 SLELCYSFNS----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGIT 388
+++LCY +S L +P V + FRGA++ +S +V + + C F G +
Sbjct: 335 AMDLCYLIDSTSSTLPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTF-GNS 393
Query: 389 NSVPI----YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + I G+ Q N + YD+E + F C
Sbjct: 394 DELGISSFLIGHHQQQNVWMEYDLENSRIGFAELRC 429
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 172/368 (46%), Gaps = 45/368 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +G PP + DTGSD++W C+ CP L+DP+ S++ +
Sbjct: 82 YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIY 141
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C CA+ + Q + CQYSV YGDGS + G + + TG + A
Sbjct: 142 CDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANG 201
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+ FGCG G + + GI+G G + S+ISQ+ AGK F++CL V
Sbjct: 202 SVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQL--AAAGKVKRVFAHCLDNVKGG 259
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
I F +VS P V +TP+ + Y + + I VG L + T DI +ID
Sbjct: 260 GI-FAIGEVVS-PKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPT-DIFDTGDRRGTIID 316
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSF--NSLSQVPEVTIHFR 360
SGTTL +LP+ S+M+ ++ QP E C+ + N P V HF
Sbjct: 317 SGTTLAYLPEVVYE---SMMTKIVSEQPGLKLHTVEEQFTCFQYTGNVNEGFPVVKFHFN 373
Query: 361 GA-DVKLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTV 413
G+ + ++ ++ ++ E++ C ++ G+ + + G+++ +N LV YD+E Q +
Sbjct: 374 GSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAI 433
Query: 414 SFKPTDCT 421
+ +C+
Sbjct: 434 GWTDYNCS 441
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 176/382 (46%), Gaps = 64/382 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP + V DTGS+L W C+ P + +F+P SS+Y +
Sbjct: 36 HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 89
Query: 147 PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSS C + + + V C VSY D S GNLA++ +GS+ ALP
Sbjct: 90 PCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 144
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
G FGC G ++ ++KTTG++G+ G +S ++Q+ KFSYC+ S+ +
Sbjct: 145 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLL 201
Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
FG + + + TPL + T Y + +D I VGN+ L + PD
Sbjct: 202 FGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGA 261
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSF---NSL 349
++DSGT TFL + L + + P+ DP G+++LCY L
Sbjct: 262 GQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKL 321
Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFK-----GITNSVPIYGNIM 398
++P V++ FRGA++ + KV E + C F GI V G+
Sbjct: 322 PELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFV--IGHHH 379
Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
Q N + +D+ + V F T C
Sbjct: 380 QQNVWMEFDLVKSRVGFVETRC 401
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 110/413 (26%), Positives = 176/413 (42%), Gaps = 85/413 (20%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC---------YMQDSP------- 133
Y +R +GTP L VADTGSDL W +C Y +P
Sbjct: 54 QYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSS 113
Query: 134 ----------LFDPKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFS 178
+F P S T+ +PCSS C SL G C Y Y DGS +
Sbjct: 114 VSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAA 173
Query: 179 NGNLATETVTL---GSTTGQA---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
G + T++ T+ G G+ L G+ GC T+ G + G++ LG ++S
Sbjct: 174 RGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFA 233
Query: 233 SQMRTTIAGKFSYCLV----PVSSTK-INFGTN--------------GIVSGPGVVSTPL 273
S+ G+FSYCLV P ++T + FG N G + PG TPL
Sbjct: 234 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPL 293
Query: 274 ---TKAKTFYVLTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFL-PQGYNSN 319
+ + FY + ++ +SV + L + P +V +DSGT+LT L Y +
Sbjct: 294 LLDHRMRPFYAVAVNGVSVDGELLRI--PRLVWDVQKGGGAILDSGTSLTVLVSPAYRAV 351
Query: 320 LLSVMSSMIEAQPVA-DPTGSLELCYSFNS-------LSQVPEVTIHFRG-ADVKLSRSN 370
+ ++ ++ VA DP + CY++ S VP + +HF G A ++ +
Sbjct: 352 VAALGKKLVGLPRVAMDP---FDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKS 408
Query: 371 FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ + + + C + +G V + GNI+Q L +D++ + + FK + C +
Sbjct: 409 YVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCMQ 461
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/167 (41%), Positives = 97/167 (58%), Gaps = 13/167 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ G+P + DTGS L W QC+PC C++Q PLFDP S TYKSL
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSLS 173
Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C+SSQC A+LN C S C Y+ SYGD S+S G L+ + +TL + LP
Sbjct: 174 CTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLP 229
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
G +GCG ++ GLF + GI+GLG +S++ Q+ + FSYCL
Sbjct: 230 GFVYGCGQDSDGLFG-RAAGILGLGRNKLSMLGQVSSKFGYAFSYCL 275
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 183/377 (48%), Gaps = 58/377 (15%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+++ IG+ A+ DTGS+ + QC + P+FDP S +Y+ +PC S
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQL 52
Query: 153 CASLNQKSCSG-----VN----CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPG 201
C ++ Q++ +G VN C YS+SYGD S G+ + + + L ST + QAV
Sbjct: 53 CLAVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRD 112
Query: 202 ITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCL-----VPVSSTK 254
+ FGC + G L + + GIVG G++SL SQ++ + G KFSYC P ++
Sbjct: 113 VAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGV 172
Query: 255 INFGTNGI----VSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV---------STPD 300
I G +G+ VS ++ P+T A++ Y + + +ISV + L + ST D
Sbjct: 173 IFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGD 232
Query: 301 --IVIDSGTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQV 352
V+DSGTT T + Y + N + + + V G + CY+ +SL V
Sbjct: 233 GGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAG-FDDCYNISAGSSLPGV 291
Query: 353 PEVTIHFR-GADVKLSRSNFFVKVS----EDIVC----SVFKGITNSVPIYGNIMQTNFL 403
PEV + + ++L + FV VS E VC S K + + GN Q+N+L
Sbjct: 292 PEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYL 351
Query: 404 VGYDIEQQTVSFKPTDC 420
V YD E+ V F+ DC
Sbjct: 352 VEYDNERSRVGFERADC 368
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 120/413 (29%), Positives = 191/413 (46%), Gaps = 65/413 (15%)
Query: 51 YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
Y+ LR+ R L R+ + + S D Y RI +GTPP + DT
Sbjct: 13 YRTLREHDQRRLRRIL-----PEVVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDT 67
Query: 111 GSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKSLPCSSSQCASLNQKSCS--G 163
GSD+ W C PC + C + +FDP+ S++ S+ C+ +C + CS
Sbjct: 68 GSDVAWVNCVPC--TNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNS 125
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
++C YS YGDGS + G L + ++ +G + A G +TFGCG+N G + T
Sbjct: 126 MSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW--LTD 183
Query: 220 GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSG----PGVVSTPL 273
G+VG G ++SL SQ+ + F++CL N G+ +V G PG+V TP+
Sbjct: 184 GLVGFGQAEVSLPSQLSKQNVSVNIFAHCL-----QGDNKGSGTLVIGHIREPGLVYTPI 238
Query: 274 TKAKTFYVLTIDAISVGNQRLGVSTP---------DIVIDSGTTLTFLPQ-GYNSNLLSV 323
++ Y ++ +++G V+TP +++DSGTTLT+L Q Y+ V
Sbjct: 239 VPKQSHY--NVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKV 296
Query: 324 MSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVK--VSED 378
M +G L + + F + P VT++F GA + LS S++ K ++
Sbjct: 297 RDCM--------RSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTG 348
Query: 379 IVCSVFKGITNSVPIYGNIMQTNF--------LVGYDIEQQTVSFKPTDCTKQ 423
+ F + S +YG + T F LV YD + +K DCTK+
Sbjct: 349 LSAYCFSWL-ESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCTKE 400
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 175/367 (47%), Gaps = 39/367 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI +GTPP DTGSD++W C+P CP + FDP+ SST L
Sbjct: 41 YYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLS 100
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
C S+C S NQ S S C YS YGDGS + G ++ Q V A
Sbjct: 101 CIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASA 160
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKI 255
ITFGC N G + GI G G D+S++SQ+ + +A K FS+CL
Sbjct: 161 KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGG- 219
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
G ++ PG+V TP+ ++ Y L + I+V Q+L + +T +ID GT
Sbjct: 220 GILVLGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGT 279
Query: 308 TLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADV 364
TL +L + Y + ++++++ ++ QP L + +S+ ++ P VT++F GA +
Sbjct: 280 TLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFL--TVHSIDEIFPSVTLYFEGAPM 337
Query: 365 KLSRSNFFVKV----SEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
L ++ ++ S + C ++ ++ + I G+++ + + YD+E Q +
Sbjct: 338 DLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVYDLENQRIG 397
Query: 415 FKPTDCT 421
+ DC+
Sbjct: 398 WTSFDCS 404
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 169/376 (44%), Gaps = 56/376 (14%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP V DTGS+L W C+ P + F+P +SS+Y
Sbjct: 56 HNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 109
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PC+SS C + + SC N C VSY D S + G LA ET +L A
Sbjct: 110 PCNSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 164
Query: 200 PGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
PG FGC + G +SKTTG++G+ G +SL++QM KFSYC+ + +
Sbjct: 165 PGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLP---KFSYCISGEDALGV 221
Query: 256 NFGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD--- 300
+G + + TPL A T Y + ++ I V + L V PD
Sbjct: 222 LLLGDGTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 281
Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSF-NSLS 350
++DSGT TFL S+L + + DP G+++LCY S +
Sbjct: 282 AGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFA 341
Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFKG---ITNSVPIYGNIMQTNFLV 404
VP VT+ F GA++++S +VS+ + C F + + G+ Q N +
Sbjct: 342 AVPAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWM 401
Query: 405 GYDIEQQTVSFKPTDC 420
+D+ + V F T C
Sbjct: 402 EFDLLKSRVGFTQTTC 417
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 171/385 (44%), Gaps = 46/385 (11%)
Query: 60 RSLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
RS+N ++ S D + + +L+ + GTP + + DTGSD W
Sbjct: 96 RSINAKIFGQYSTQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWI 155
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
QC C C+ + + F+P +SS+Y + C S + Y++ Y D S+
Sbjct: 156 QCNSCSLGNCHNKKT--FNPSLSSSYSNRSCIPS------------TDTNYTMKYEDNSY 201
Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD-ISLISQMR 236
S G + VTL + P FGCG + GG F + +G++GL G+ SLISQ
Sbjct: 202 SKGVFVCDEVTL-----KPDVFPKFQFGCGDSGGGEFGT-ASGVLGLAKGEQYSLISQTA 255
Query: 237 TTIAGKFSYCLVPVSST--KINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQ 292
+ KFSYC P T + FG I + P + T L + Y + + ISV +
Sbjct: 256 SKFKKKFSYCFPPKEHTLGSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKK 315
Query: 293 RLGVS-----TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS--LELCY 344
RL VS +P +IDSGT +T LP Y + + M+ ++ P L+ CY
Sbjct: 316 RLNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCY 375
Query: 345 SFNSLS----QVPEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYG 395
+ ++PE+ +HF G DV L S + + D+ C F +N V I G
Sbjct: 376 NLKGCGGRNIKLPEIVLHFVGEVDVSLHPSG-ILWANGDLTQACLAFARKSNPSHVTIIG 434
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
N Q + V YDIE + F DC
Sbjct: 435 NRQQVSLKVVYDIEGGRLGFG-NDC 458
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 117/438 (26%), Positives = 189/438 (43%), Gaps = 53/438 (12%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
ATF +F L F V P Q+ + +I S SPF + + + +T +
Sbjct: 8 ATFF--LFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
S+++ K + I P ANY++R+ +GTP + V DT +D W
Sbjct: 64 SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
C S C S F P S+T SL CS +QC+ + SC C ++ SYG
Sbjct: 124 VPC-----SGCTGCSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
S L + +TL + +PG TFGC +GG + G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231
Query: 233 SQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYVLTI 284
SQ +G FSYCL S K + + + GP + +TPL + + Y + +
Sbjct: 232 SQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNL 288
Query: 285 DAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
+SVG ++ + + +V IDSGT +T Q + + P++
Sbjct: 289 TGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PIS 347
Query: 335 DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNSV-- 391
G+ + C++ + ++ P +T+HF G ++ L N + S + C N+V
Sbjct: 348 S-LGAFDTCFAATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNS 406
Query: 392 --PIYGNIMQTNFLVGYD 407
+ N+ Q N + +D
Sbjct: 407 VLNVIANLQQQNLRIMFD 424
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 110/369 (29%), Positives = 174/369 (47%), Gaps = 48/369 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IGTP DTGSD++W C+ CP + ++DP+ S + + +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C C + SC+ + C+YS+SYGDGS + G T+ + +G P
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
++FGCG GG S GI+G G + S++SQ+ AGK F++CL V+
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCLDTVNGG 267
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDS 305
I F +V P V +TPL Y + + I VG LG+ T +IDS
Sbjct: 268 GI-FAIGNVVQ-PKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDS 325
Query: 306 GTTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR 360
GTTL ++P+G L +++ I Q + D + C+ ++ PEVT HF
Sbjct: 326 GTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFE 380
Query: 361 GADVKL--SRSNFFVKVSEDIVCSVFK---GITNS---VPIYGNIMQTNFLVGYDIEQQT 412
G DV L S ++ + +++ C F+ G T + + G+++ +N LV YD+E Q
Sbjct: 381 G-DVSLIVSPHDYLFQNGKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSNKLVLYDLENQA 439
Query: 413 VSFKPTDCT 421
+ + +C+
Sbjct: 440 IGWADYNCS 448
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 118/427 (27%), Positives = 188/427 (44%), Gaps = 72/427 (16%)
Query: 51 YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
++ + A SL+R H + +++ K + + Y + S+GTPP + V DT
Sbjct: 35 WESINLAALSSLSRARHLKRPPTLTG-KVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDT 93
Query: 111 GSDLIWT---------QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
GS L+WT C+ C S P++ SST +SLPC S +C N
Sbjct: 94 GSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKC---NWVFG 150
Query: 162 SGVNCQ-------YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
S +NC Y + YG GS + G L ++ + L +P FGC +
Sbjct: 151 SDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLN----RIPDFLFGCSL----VS 201
Query: 215 NSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKI--------NFGT 259
N + GI G G G S+ +Q+ T KFSYCLV P S + +
Sbjct: 202 NRQPEGIAGFGRGLASIPAQLGLT---KFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAA 258
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD----IVIDSGTTL 309
NG+ P S L+ +Y +++ I VG + R V + + +++DSG+T
Sbjct: 259 NGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTF 318
Query: 310 TFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GA 362
TF+ + L M+ A+ + D +G L CY+ S+ VP++T F+ GA
Sbjct: 319 TFMERIIFDPVARELEKHMTKYKRAKEIEDSSG-LGPCYNITGQSEVDVPKLTFSFKGGA 377
Query: 363 DVKLSRSNFFVKVSEDIVCSVF-------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
++ L +++F V++ +VC T I GN Q NF + YD+++Q F
Sbjct: 378 NMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDLKKQRFGF 437
Query: 416 KPTDCTK 422
KP C +
Sbjct: 438 KPQQCDR 444
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 171/363 (47%), Gaps = 37/363 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP DTGSD++W C+ CP + L+DP SS+ +
Sbjct: 81 YFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVT 140
Query: 148 CSSSQCASLNQK---SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA---VALP 200
C C + + SC CQYS+SYGDGS + G T+ + +G + +A
Sbjct: 141 CGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANT 200
Query: 201 GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
ITFGCG GG S + GI+G G + S++SQ+ AGK F++CL ++
Sbjct: 201 SITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAA--AGKVRKVFAHCLDTINGG 258
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI------VIDS 305
I F +V P V +TPL Y + ++AI VG +L + T DI +IDS
Sbjct: 259 GI-FAIGDVVQ-PKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKGTIIDS 316
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-DV 364
GTTL +LP + ++S + + P+ + YS + P +T HF G +
Sbjct: 317 GTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDFQCFRYSGSVDDGFPIITFHFEGGLPL 376
Query: 365 KLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ ++ + E + C F+ G+ + + G++ +N LV YD+E Q + +
Sbjct: 377 NIHPHDYLFQNGE-LYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDY 435
Query: 419 DCT 421
+C+
Sbjct: 436 NCS 438
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 110/370 (29%), Positives = 171/370 (46%), Gaps = 48/370 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP DTGSD++W C+ CP + +DP S T ++
Sbjct: 85 YYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVG 142
Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
C C + + SGV CQ+ ++YGDGS + G T+ V +G
Sbjct: 143 CEQEFCVA--NSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQT 200
Query: 199 LP---GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPV 250
P ITFGCG GG S + GI+G G D S++SQ+ + F++CL V
Sbjct: 201 TPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTV 260
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIV 302
I F +V P V +TPL T Y + + ISVG L + T +
Sbjct: 261 RGGGI-FAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKGTI 319
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFN-SL-SQVPEVTIHF 359
IDSGTTL +LP+ LL +++ + P +A +C+ F+ SL + P +T F
Sbjct: 320 IDSGTTLAYLPREVYRTLL---TAVFDKHPDLAVRNYEDFICFQFSGSLDEEFPVITFSF 376
Query: 360 RGADVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQ 411
G D+ L+ ++ + D+ C F G+ + + G+++ +N LV YD+E+Q
Sbjct: 377 EG-DLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQ 435
Query: 412 TVSFKPTDCT 421
+ + +C+
Sbjct: 436 VIGWTDYNCS 445
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 173/359 (48%), Gaps = 43/359 (11%)
Query: 96 SIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
+IGTPP A D G L+WTQC C S C+ Q+ P FDP SSTY+ PC ++ C
Sbjct: 29 TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALCEF 88
Query: 156 L--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGG 212
+ ++CSG C Y S ++G + T+ V +G+ T +VA FGC ++
Sbjct: 89 FPASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAIGTATAASVA-----FGCVMASDIK 143
Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP--------------VSSTKINFG 258
L + +G VGL +SL++QM T FS+CL P ++ G
Sbjct: 144 LMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCLAPHDGGGGKNSRLFLGAAAKLAGGG 200
Query: 259 TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----IVIDSGTTLTFLPQ 314
+ ++ P V S+P +Y++ ++ I G++ + ++ P +++ + + ++FL
Sbjct: 201 KSAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAI-ITVPQSGRTVLLQTFSPVSFLVD 259
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVTIHFRG-ADVKLSRS 369
G +L +++ + P A P + LC+ +S P+V + F+G A + + +
Sbjct: 260 GVYQDLKKAVTAAV-GGPTATPPEQFQSIFDLCFKRGGVSGAPDVVLTFQGAAALTVPPT 318
Query: 370 NFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N+ + V +D VC + I G + Q N YD+E++T+SF+ DC+
Sbjct: 319 NYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEKETLSFEAADCS 377
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 116/440 (26%), Positives = 184/440 (41%), Gaps = 70/440 (15%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--------- 81
+EL+HR + + ++ + R R NQ + S+ S+
Sbjct: 35 LELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTT 94
Query: 82 -ADI-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A++ +P ++ Y + +G+P V DTGS+ W C
Sbjct: 95 PAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------- 141
Query: 133 PLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
S +++++ C+S +C SL+ C Y +SY DGS + G T+
Sbjct: 142 -------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTD 194
Query: 186 TVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
++T+G T G+ L +T GC + NG FN +T GI+GLG S I + KF
Sbjct: 195 SITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKF 254
Query: 244 SYCLVPVSSTK-------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV 296
SYCLV S + I N + G + T L FY + + IS+G Q L +
Sbjct: 255 SYCLVDHLSHRSVSSNLTIGGHHNAKLLGE-IRRTELILFPPFYGVNVVGISIGGQMLKI 313
Query: 297 --------STPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSF 346
+ +IDSGTTLT L Y + ++ S+ + + V + +LE C+
Sbjct: 314 PPQVWDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDA 373
Query: 347 NSL--SQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTN 401
S VP + HF GA + ++ + V+ + C I + GNIMQ N
Sbjct: 374 EGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQN 433
Query: 402 FLVGYDIEQQTVSFKPTDCT 421
L +D+ TV F P+ CT
Sbjct: 434 HLWEFDLSTNTVGFAPSTCT 453
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 109/355 (30%), Positives = 153/355 (43%), Gaps = 71/355 (20%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 125 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 182
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 183 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 237
Query: 211 GGLFNSKTT---------GIVGLGGGDISL---ISQMRTTIAGKFSYCLVPVSSTKINFG 258
GL + G G G +SL S R PVS T+
Sbjct: 238 RGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNA---------TPVSYTR---- 284
Query: 259 TNGIVSGPGVVSTPLTKAKTFYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-PQ 314
+++ P FY + + G+ ++++DSGT +T L P
Sbjct: 285 ---MIADPA--------QPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPS 333
Query: 315 GYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSN 370
Y + E P A P L+ CY+ + VP +T+ GAD+ + +
Sbjct: 334 VYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAG 393
Query: 371 FFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+D VC ++ + PI GN Q N V YD + F DC+
Sbjct: 394 MLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 142/310 (45%), Gaps = 31/310 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +IGTPP AV D +L+WTQC PC P C+ QD PLFDP SST++ LPC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
C S+ + S C+ C Y G + G T+T +G+ A + FGC
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167
Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG-TNGIVSG 265
+ +GIVGLG SL++QM T FSYCL SS + G T ++G
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLAG 224
Query: 266 PGVVSTPLT----------KAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFL 312
STP + +Y++ + I G L ++ +++D+ + ++L
Sbjct: 225 GKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYL 284
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNF 371
G L +++ + QPVA P +LC+ PE+ F GA + + +N+
Sbjct: 285 ADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANY 344
Query: 372 FVKVSEDIVC 381
+ VC
Sbjct: 345 LLASGNGTVC 354
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 148/338 (43%), Gaps = 36/338 (10%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN- 165
DT D+ W QC PC QCY Q + FDP+ SST + C S C +L CS N
Sbjct: 164 DTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNS 223
Query: 166 ---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
C Y + Y D + G T+T+T+ +T FGC G F+++ +G +
Sbjct: 224 TGDCLYRIEYSDHRLTLGTYMTDTLTISPST----TFLNFRFGCSHAVRGKFSAQASGTM 279
Query: 223 GLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP------GVVSTPLTKA 276
LGGG SL+SQ FSYC VP S G V+G +TPL ++
Sbjct: 280 SLGGGPQSLLSQTARAYGNAFSYC-VPGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRS 338
Query: 277 K-----TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
T YV+ + I V +RL V + V+DS +T LP L +
Sbjct: 339 ANVINPTIYVVRLQGIEVAGRRLNVPPVVFSGGTVMDSSAVITQLPPTAYRALRLAFRNA 398
Query: 328 IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF 384
+ A PTG+L+ C+ F +S+ VP V++ F GA ++L + + C F
Sbjct: 399 MRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLLD-----SCLAF 453
Query: 385 KGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ ++ GN+ Q V YD+ V F+ C
Sbjct: 454 APMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 178/384 (46%), Gaps = 35/384 (9%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
RL F + ++S+++ D + N Y R+ IGTPP + + DTGS + + C C
Sbjct: 55 RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
QC P FDP+ SSTYK + C+ C S GV C Y Y + S S+G
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166
Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RT 237
L + ++ G+ Q+ +P FGC G LF+ + GI+GLG GD+SL+ Q+ +
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223
Query: 238 TIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV 296
I FS C + GI ++ T ++ +Y + + I V ++L +
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283
Query: 297 STP------DIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 345
S+ V+DSGTT +LP + +++ ++M + + + P + ++C+S
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343
Query: 346 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 398
LS + P V + F G + L+ N+F KV +F+ + + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403
Query: 399 QTNFLVGYDIEQQTVSFKPTDCTK 422
N LV YD + F T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 178/384 (46%), Gaps = 35/384 (9%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
RL F + ++S+++ D + N Y R+ IGTPP + + DTGS + + C C
Sbjct: 55 RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
QC P FDP+ SSTYK + C+ C S GV C Y Y + S S+G
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166
Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RT 237
L + ++ G+ Q+ +P FGC G LF+ + GI+GLG GD+SL+ Q+ +
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223
Query: 238 TIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV 296
I FS C + GI ++ T ++ +Y + + I V ++L +
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283
Query: 297 STP------DIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 345
S+ V+DSGTT +LP + +++ ++M + + + P + ++C+S
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343
Query: 346 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 398
LS + P V + F G + L+ N+F KV +F+ + + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403
Query: 399 QTNFLVGYDIEQQTVSFKPTDCTK 422
N LV YD + F T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 197/451 (43%), Gaps = 49/451 (10%)
Query: 4 FLSCVFILFFLCFYVVSP----IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
F+ C+ L LCF P ++ GF V L+H S +SPFY + T + + ++
Sbjct: 9 FMICIQTL--LCFSSSLPDHVLLKDNRLGFKVPLLHWLSTESPFYEPNLTLAELTQASIR 66
Query: 60 RSLNR---LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
S R + + SS K + + + Y+++ SIG+P + A+ D+GS L+W
Sbjct: 67 TSGARGDSIRSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVW 126
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK--SCSGVN--CQYSVS 171
QC CY Q PLF+P S TY C++++C +L + C N C+Y
Sbjct: 127 LQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHED 186
Query: 172 YGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
Y D S++ G ++T+ T +G I FGCG NN + G+VGL S
Sbjct: 187 YLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKAS 246
Query: 231 LISQMRTTIAGKFSYCLVPVS------STKINFGTNGIVSGPGVVSTPLTKAKTFYVL-T 283
L+ QM +FSYC+ + S +I FG +SG P + +Y+
Sbjct: 247 LVGQMD---VDQFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLVP--NSDGWYIFKN 301
Query: 284 IDAISVGNQRLGVSTPDIV------------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+D I V N+ P V +D+GTT T L L+ ++ I
Sbjct: 302 VDGIYV-NEFEVEGYPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIV 360
Query: 332 PVADPTGS-LELCYSFNSL--SQVPEVTIHF---RGADVKLSRSNFFVKVSEDIVC-SVF 384
P D + S ELCY + + +P++ + F + + N + +C ++F
Sbjct: 361 PEKDYSNSGFELCYFSDDFLGATLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMF 420
Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
+ TN + I G + +GYD+ VSF
Sbjct: 421 R--TNGMSIIGMHQLRDIKIGYDLHHNIVSF 449
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 183/412 (44%), Gaps = 87/412 (21%)
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
RS N+L HF+ N S++ + +++GTPP V DTGS+L W +C
Sbjct: 72 RSPNKL-HFHHNVSLT-----------------VSLTVGTPPQNVSMVLDTGSELSWLRC 113
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYG 173
Q FDP SS+Y +PCSS C + SC S C +SY
Sbjct: 114 NKTQTFQT------TFDPNRSSSYSPVPCSSLTCTDRTRDFPIPASCDSNQLCHAILSYA 167
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-----GTNNGGLFNSKTTGIVGLGGGD 228
D S S GNLA++T +G++ +PG FGC TN +SK TG++G+ G
Sbjct: 168 DASSSEGNLASDTFYIGNSD-----MPGTIFGCMDSSFSTNTEE--DSKNTGLMGMNRGS 220
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-------PGV-VSTPLTK-AKTF 279
+S +SQM KFSYC+ + + + S P + +STPL +
Sbjct: 221 LSFVSQMDFP---KFSYCISDSDFSGVLLLGDANFSWLMPLNYTPLIQISTPLPYFDRVA 277
Query: 280 YVLTIDAISVGNQRL----GVSTPD------IVIDSGTTLTFLP----QGYNSNLLSVMS 325
Y + ++ I V ++ L V PD ++DSGT TFL + L+ S
Sbjct: 278 YTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTS 337
Query: 326 SMIEAQPVADPT----GSLELCY----SFNSLSQVPEVTIHFRGADVKLSRSNFFVKV-- 375
++ + DP G ++LCY S SL +P V++ FRGA++K+S +V
Sbjct: 338 QILRV--LEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPG 395
Query: 376 ----SEDIVCSVFKG---ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S+ + C F + + G+ Q N + +D+E+ + F C
Sbjct: 396 EVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 447
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 180/381 (47%), Gaps = 43/381 (11%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
QNS + +++ D + +N Y R+ IGTPP E + DTGS + + C C QC
Sbjct: 56 QNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSC--EQCGK 113
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
P F P +SSTY+ + C+ S C ++ G C Y Y + S S+G +A + V+
Sbjct: 114 HQDPRFQPDLSSTYRPVKCNPS-CNCDDE----GKQCTYERRYAEMSSSSGVIAEDVVSF 168
Query: 190 GSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYC 246
G+ + + FGC G L++ + GI+GLG G +S++ Q+ + I FS C
Sbjct: 169 GNES--ELKPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLC 226
Query: 247 LVPVSSTKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPD 300
++ G +V G P +V + ++ +Y + + + V + L + P
Sbjct: 227 Y-----GGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLK-PK 280
Query: 301 I-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----F 346
+ V+DSGTT + P+ +++ ++M + + + P + ++C+S
Sbjct: 281 VFDEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREV 340
Query: 347 NSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTN 401
+ LS+V PEV + F G + LS N+ KVS +F+ + + G I+ N
Sbjct: 341 SHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRN 400
Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
LV YD E + F T+C++
Sbjct: 401 TLVTYDRENDKIGFWKTNCSE 421
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 178/396 (44%), Gaps = 67/396 (16%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
++SK + + +N + ++ GTP V DTGS+L W C+ P + +
Sbjct: 51 TTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEP------NFNSI 104
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
F+P S TY +PCSS C + + SC C + +SY D S GNLA ET
Sbjct: 105 FNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFR 164
Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
+GS TG P FGC G ++ ++KTTG++G+ G +S ++QM KFSY
Sbjct: 165 VGSVTG-----PATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSY 216
Query: 246 CLVPVSSTKINFGTNGIVSG-------PGV-VSTPLTK-AKTFYVLTIDAISVGNQRL-- 294
C+ S+ + S P V +STPL + Y + ++ I V ++ L
Sbjct: 217 CISDRDSSGVLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSL 276
Query: 295 --GVSTPD------IVIDSGTTLTF--------LPQGYNSNLLSVMSSMIEAQPVADPTG 338
V PD ++DSGT TF L Q + V+ + E + V G
Sbjct: 277 PKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQ--G 334
Query: 339 SLELCYSFN----SLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGIT 388
+++LCY +L +P V + FRGA++ +S +V + + C F G +
Sbjct: 335 AMDLCYLIEPTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTF-GNS 393
Query: 389 NSVPI----YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+S+ I G+ Q N + YD+E+ + F C
Sbjct: 394 DSLGIESFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 171/371 (46%), Gaps = 50/371 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y I +G+P E + + DTGS+L W +C PC C ++D S +YK + C+
Sbjct: 99 EYYTSIKLGSPGQEAILIVDTGSELTWLKCLPC--KVCAPSVDTIYDAARSVSYKPVTCN 156
Query: 150 SSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPGIT 203
+SQ C++ +Q + G CQ++ YGDGSFS G+L+T+T+ + + G+ V +
Sbjct: 157 NSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFA 216
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN------F 257
FGC + L + +GI+GL G ++L Q+ KFS+C P S+ +N F
Sbjct: 217 FGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLNSTGVVFF 275
Query: 258 GTNGI----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLT 310
G + V V T + FY + + +S+ + L V P +++DSG++ +
Sbjct: 276 GNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHEL-VLLPRGSVVILDSGSSFS 334
Query: 311 FLPQGYNSNLLSVMSSMIEAQP------VADPTGSLELCYSFN------------SLSQV 352
+ ++S L + ++ +P D G L C+ + SLS V
Sbjct: 335 SFVRPFHSQL---REAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLV 391
Query: 353 PE--VTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNFLVGYDIE 409
E VTI V L + + V +C F+ G N V + GN Q N V YDI+
Sbjct: 392 FEDGVTIGIPSIGVLLPVARYQNHVK---MCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQ 448
Query: 410 QQTVSFKPTDC 420
+ V F C
Sbjct: 449 RSRVGFARASC 459
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 116/438 (26%), Positives = 188/438 (42%), Gaps = 53/438 (12%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
ATF + L F V P Q+ + +I S SPF + + + +T +
Sbjct: 8 ATFF--LVALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
S+++ K + I P ANY++R+ +GTP + V DT +D W
Sbjct: 64 SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
C S C S F P S+T SL CS +QC+ + SC C ++ SYG
Sbjct: 124 VPC-----SGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
S L + +TL + +PG TFGC +GG + G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231
Query: 233 SQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYVLTI 284
SQ +G FSYCL S K + + + GP + +TPL + + Y + +
Sbjct: 232 SQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNL 288
Query: 285 DAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
+SVG ++ + + +V IDSGT +T Q + + P++
Sbjct: 289 TGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PIS 347
Query: 335 DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNSV-- 391
G+ + C++ + ++ P +T+HF G ++ L N + S + C N+V
Sbjct: 348 S-LGAFDTCFAATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNS 406
Query: 392 --PIYGNIMQTNFLVGYD 407
+ N+ Q N + +D
Sbjct: 407 VLNVIANLQQQNLRIMFD 424
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 163/346 (47%), Gaps = 32/346 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSSTYKSLPCS 149
++ I G+P ++ DTGS L WTQC PC S CY Q P + P S TY+ C
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPC--SDCYAQKIYPKYRPAASITYRDAMCE 115
Query: 150 SSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
S S + + C Y Y D + G LA E +T+ + G + G+ FGC
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCN 175
Query: 208 T-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL----VPVSSTKINFGTNGI 262
T ++G F TGI+GLG G S+I + KFS+CL P +S + G
Sbjct: 176 TLSDGSYFTG--TGILGLGVGKYSIIGEF----GSKFSFCLGEISEPKASHNLILGDGAN 229
Query: 263 VSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLL 321
V G P V++ +T+ T + +++I VG + + +D+G+TL+ L +
Sbjct: 230 VQGHPTVIN--ITEGHT--IFQLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKFV 285
Query: 322 SVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVS- 376
+I ++P++ +PT LCY +++ ++ ++ + F+ GA++ ++ N F++
Sbjct: 286 DAFDDLIGSRPLSYEPT----LCYKADTIERLEKMDVGFKFDVGAELSVNIHNIFIQQGP 341
Query: 377 EDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+I C + S I G I + VGYD+ +T DC
Sbjct: 342 PEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDC 387
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 177/378 (46%), Gaps = 37/378 (9%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
NS + ++ D + +N Y R+ IGTPP E + DTGS + + C C QC
Sbjct: 67 HNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTC--EQCGK 124
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
P F P+ SSTYK + C+ S C ++ G C Y Y + S S+G LA + ++
Sbjct: 125 HQDPRFQPESSSTYKPMQCNPS-CNCDDE----GKQCTYERRYAEMSSSSGLLAEDVLSF 179
Query: 190 GSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYC 246
G+ + + FGC T G LF+ + GI+GLG G +S++ Q+ + + FS C
Sbjct: 180 GNES--ELTPQRAIFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLC 237
Query: 247 LVPVSSTKINFGTNGIVSGPGVV---STPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-- 301
+ I P +V S P A +Y + + + V +RL ++ P +
Sbjct: 238 YGGMDVVGGAMVLGNIPPPPDMVFAHSDPYRSA--YYNIELKELHVAGKRLKLN-PRVFD 294
Query: 302 -----VIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSL 349
V+DSGTT +LP + + + +++ + + + P S ++C+S + L
Sbjct: 295 GKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQL 354
Query: 350 SQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLV 404
S++ PEV + F G + LS N+ KVS +F+ + + G I+ N LV
Sbjct: 355 SKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLV 414
Query: 405 GYDIEQQTVSFKPTDCTK 422
YD + + F T+C++
Sbjct: 415 TYDRDNDKIGFWKTNCSE 432
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 172/368 (46%), Gaps = 60/368 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP + V DTGS+L W C+ P + +F+P SS+Y +
Sbjct: 996 HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 1049
Query: 147 PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSS C + + + V C VSY D S GNLA++ +GS+ ALP
Sbjct: 1050 PCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 1104
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
G FGC G ++ ++KTTG++G+ G +S ++Q+ KFSYC+ S+ +
Sbjct: 1105 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLL 1161
Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
FG + + TPL + T Y + +D I VGN+ L + PD
Sbjct: 1162 FGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGA 1221
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSFNS---L 349
++DSGT TFL + L + + P+ DP G+++LCYS + L
Sbjct: 1222 GQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKL 1281
Query: 350 SQVPEVTIHFRGA------DVKLSRSNFFVKVSEDIVCSVFKG---ITNSVPIYGNIMQT 400
+P V++ FRGA +V L R +K +E + C F + + G+ Q
Sbjct: 1282 PTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHHQQ 1341
Query: 401 NFLVGYDI 408
N + +D+
Sbjct: 1342 NVWMEFDL 1349
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 105/337 (31%), Positives = 156/337 (46%), Gaps = 39/337 (11%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--C-SG 163
V DT D+ W +C PC +QC +DP SSTY + PC+SS C L + + C +
Sbjct: 166 VLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANGCDAN 220
Query: 164 VNCQYSV-SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
CQY V + GD ++G +++ +T+ S G V G FGC N G F ++ GI+
Sbjct: 221 GQCQYMVVTAGDSFTTSGTYSSDVLTINS--GDRVE--GFRFGCSQNEQGSFENQADGIM 276
Query: 223 GLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG--VVSTPLTK----- 275
LG G SL++Q +T FSYCL P +TK F G+ G V+TP+ K
Sbjct: 277 ALGRGVQSLMAQTSSTYGDAFSYCLPPTETTK-GFFQIGVPIGASYRFVTTPMLKERGGA 335
Query: 276 ---AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
A T Y + AI+V + L V V+DS T +T LP L + + +
Sbjct: 336 SAAAATLYRALLLAITVDGKELNVPAEVFAAGTVMDSRTIITRLPVTAYGALRAAFRNRM 395
Query: 329 EAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFK 385
+ VA P L+ CY + ++P + + F G A V++ RS + C F
Sbjct: 396 RYR-VAPPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILLN-----GCLAFA 449
Query: 386 GITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ S I GN+ Q V +D+ + F+ C
Sbjct: 450 SNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 100/308 (32%), Positives = 147/308 (47%), Gaps = 40/308 (12%)
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATET 186
P FD SST C S+ C L SC C Y+ Y D S + G + +
Sbjct: 23 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
T G+ ++PG+ FGCG N G+F S TGI G G G +SL SQ++ G FS+C
Sbjct: 83 FTFGA----GASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHC 135
Query: 247 LVPVSSTK-----INFGTNGIVSGPGVV-STPLTKAK---TFYVLTIDAISVGNQRLGV- 296
V+ K ++ + +G G V STPL + TFY L++ I+VG+ RL V
Sbjct: 136 FTAVNGLKQSTVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVP 195
Query: 297 --------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
T +IDSGT++T LP + ++ I+ V C+S S
Sbjct: 196 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPS 255
Query: 349 LSQ--VPEVTIHFRGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTN 401
++ VP++ +HF GA + L R N+ +V +D I+C ++ KG + I GN Q N
Sbjct: 256 QAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQN 313
Query: 402 FLVGYDIE 409
V YD++
Sbjct: 314 MHVLYDLQ 321
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 78/212 (36%), Positives = 112/212 (52%), Gaps = 18/212 (8%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP S+TY ++PCSS+ CA L
Sbjct: 155 GTSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLG 214
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS V CQ+ +Y DG+ + G +++ +TLG + G FGC + G
Sbjct: 215 PYRRGCSANVQCQFGFTYTDGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADRGST 270
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
F+ +G + LGGG S + Q T FSYC +P S + + F T G+ P
Sbjct: 271 FSFDVSGTLALGGGAQSFVQQTATQYGRVFSYC-IPPSPSSLGFITLGVPPQRAALVPTF 329
Query: 269 VSTPLTKAK----TFYVLTIDAISVGNQRLGV 296
VSTPL + TFY + + AI V + L V
Sbjct: 330 VSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPV 361
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 69/161 (42%), Positives = 91/161 (56%), Gaps = 10/161 (6%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG+PP V DTGSD+ W QC PC + CY Q P+F+P SS+Y L
Sbjct: 50 SGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPSFSSSYAPLT 107
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C +C Y VSYGDGS++ G+ ATET+TL + +L + GCG
Sbjct: 108 CETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDG----SASLNNVAIGCG 163
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
+N GLF + GG +S SQ+ A FSYCLV
Sbjct: 164 HDNEGLFVGAAGLLGLGGGS-LSFPSQIN---ASSFSYCLV 200
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 166/362 (45%), Gaps = 35/362 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +GTPP E DTGSD++W + C CP + FD SST + +P
Sbjct: 81 YFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVP 140
Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS C S Q + + C Y+ YGDGS ++G ++T + G+++ +
Sbjct: 141 CSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSS 200
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTK 254
I FGC T G + GI G G G++S+ISQ+ + FS+CL S
Sbjct: 201 AAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGG 260
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
G + PG+V +PL ++ Y L + +I+V Q L + S +ID+G
Sbjct: 261 -GILVLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNRGTIIDTG 319
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLSQV-PEVTIHFRGAD 363
TTL +L + +S +++ + +A PT CY NS+S+V P V+ +F G
Sbjct: 320 TTLAYLVEEAYDPFVSAITAAVSQ--LATPTINKGNQCYLVSNSVSEVFPPVSFNFAGGA 377
Query: 364 VKLSRSNFFVK-----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
L + ++ + C F+ I + I G+++ + + YD+ Q + +
Sbjct: 378 TMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANY 437
Query: 419 DC 420
DC
Sbjct: 438 DC 439
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 171/389 (43%), Gaps = 41/389 (10%)
Query: 65 LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
+HFN + S + D + N Y R+ IGTPP + DTGS + +
Sbjct: 59 FSHFNPRRQLKESDSEHHPNARMRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTY 118
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
C C C P F P+ S TY+ + C + QC N + C Y Y + S
Sbjct: 119 VPCSTC--RHCGSHQDPKFRPEDSETYQPVKC-TWQCNCDNDRK----QCTYERRYAEMS 171
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G+ T ++ FGC + G ++N + GI+GLG GD+S++ Q+
Sbjct: 172 TSSGALGEDVVSFGNQT--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229
Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
+ I+ FS C + GI +V T ++ +Y + + I V +
Sbjct: 230 VEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMVFTRSDPVRSPYYNIDLKEIHVAGK 289
Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
RL ++ P + V+DSGTT +LP+ + + ++M + ++ P ++C
Sbjct: 290 RLHLN-PKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDIC 348
Query: 344 YSFNSL--SQV----PEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPI 393
+S + SQ+ P V + F G + LS N+ KV VF + +
Sbjct: 349 FSGAEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTL 408
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
G I+ N LV YD E + F T+C++
Sbjct: 409 LGGIVVRNTLVMYDREHTKIGFWKTNCSE 437
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 164/364 (45%), Gaps = 50/364 (13%)
Query: 57 ALTRSLNRLNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDL 114
A RS RL+ + +S ++A + + Y+++ SIG PP A DTGSDL
Sbjct: 57 AAERSRRRLSVY------TSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDL 110
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-----KSCSGVN--CQ 167
+W +C PC + C SPL+DP S + LPCSS C +L + CS C
Sbjct: 111 MWVKCSPC--NGCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCG 168
Query: 168 YSVSYGD-GSFS-NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
Y +YG G S G L TET T G ++FG G T G+VGLG
Sbjct: 169 YHYAYGHSGDHSTQGVLGTETFTF----GDGYVANNVSFGRSDTIDGSQFGGTAGLVGLG 224
Query: 226 GGDISLISQMRTTIAGKFSYCLV--PVSSTKINFG-------TNGIVSGPGVVSTPLTKA 276
G +SL+SQ+ AG+F+YCL P + I FG + G VS +V+ P
Sbjct: 225 RGHLSLVSQLG---AGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDR 281
Query: 277 KTFYVLTIDAISVGNQRLGVSTPDIVIDS-GTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
T Y + + ISVG RL + I+S G+ F G L + + Q +
Sbjct: 282 DTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITS 341
Query: 336 PTGSL------ELCY---SFNSLSQVPEVTIHF-RGADVKLSRSNFFVKV----SEDIVC 381
L + C+ + +++Q+P + +HF GAD+ L+ N+ SE +VC
Sbjct: 342 EIQRLGYDAGDDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYLKTSTKGPSEVLVC 401
Query: 382 SVFK 385
K
Sbjct: 402 MAIK 405
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 166/367 (45%), Gaps = 44/367 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP DTGSD++W +C+ CP + +DP S T ++
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVG 141
Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-- 196
C C + S GV CQ+ ++YGDGS + G T+ V +G
Sbjct: 142 CEQEFCVA---NSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQT 198
Query: 197 -VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPV 250
+ ITFGCG GG N GI+G G D S++SQ+ + F++CL V
Sbjct: 199 TTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTV 258
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIV 302
I F +V P V +TPL T Y + + ISVG L + T +
Sbjct: 259 RGGGI-FAIGNVVQ-PKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTI 316
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 362
IDSGTTL +LP+ LL+ + + P+ + + +S + P +T F+G
Sbjct: 317 IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFVCFQFSGSIDDGFPVITFSFKG- 375
Query: 363 DVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L+ ++ + D+ C F G+ + + G+++ +N LV YD+E++ +
Sbjct: 376 DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIG 435
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 436 WTDYNCS 442
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/165 (41%), Positives = 89/165 (53%), Gaps = 12/165 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ +GTP T V DTGSD++W QC PC CY Q +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189
Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S C L+ S C C Y VSYGDGSF+ G+ +TET+T + +
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
GCG +N GLF + GG +S SQ + GKFSYCLV
Sbjct: 245 LGCGHDNEGLFVGAAGLLGLGRGG-LSFPSQTKNRYNGKFSYCLV 288
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 173/389 (44%), Gaps = 41/389 (10%)
Query: 65 LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
L+HFN + S++ D + N Y R+ IGTPP + DTGS + +
Sbjct: 59 LSHFNPRRHLQGSQSEHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTY 118
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
C C C P F P+ S TY+ + C + QC + + C Y Y + S
Sbjct: 119 VPCSTC--KHCGSHQDPKFRPEASETYQPVKC-TWQCNCDDDRK----QCTYERRYAEMS 171
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G+ + ++ FGC + G ++N + GI+GLG GD+S++ Q+
Sbjct: 172 TSSGVLGEDVVSFGNQS--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229
Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
+ I+ FS C + GI +V T ++ +Y + + I V +
Sbjct: 230 VEKKVISDAFSLCYGGMGVGGGAMVLGGISPPADMVFTHSDPVRSPYYNIDLKEIHVAGK 289
Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
RL ++ P + V+DSGTT +LP+ + + ++M + ++ P ++C
Sbjct: 290 RLHLN-PKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDIC 348
Query: 344 YS-----FNSLSQ-VPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPI 393
+S + LS+ P V + F G + LS N+ KV VF + +
Sbjct: 349 FSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTL 408
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
G I+ N LV YD E + F T+C++
Sbjct: 409 LGGIVVRNTLVMYDREHSKIGFWKTNCSE 437
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 172/361 (47%), Gaps = 32/361 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
I FGC T G + GI G G G +S++SQ+ R FS+CL S
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 279
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F I+ PG+V +PL ++ Y L + +I V Q L + +T ++D+GT
Sbjct: 280 VFVLGEILV-PGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGT 338
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFRGADVK 365
TLT+L + L+ +S+ + +Q V + E CY + S+S + P V+++F G
Sbjct: 339 TLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASM 397
Query: 366 LSRS-----NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ R ++ + + C F+ I G+++ + + YD+ +Q + + DC
Sbjct: 398 MLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
Query: 421 T 421
+
Sbjct: 458 S 458
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 172/361 (47%), Gaps = 32/361 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 165 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 224
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
I FGC T G + GI G G G +S++SQ+ R FS+CL S
Sbjct: 225 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 284
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F I+ PG+V +PL ++ Y L + +I V Q L + +T ++D+GT
Sbjct: 285 VFVLGEILV-PGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGT 343
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFRGADVK 365
TLT+L + L+ +S+ + +Q V + E CY + S+S + P V+++F G
Sbjct: 344 TLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASM 402
Query: 366 LSRS-----NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ R ++ + + C F+ I G+++ + + YD+ +Q + + DC
Sbjct: 403 MLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 462
Query: 421 T 421
+
Sbjct: 463 S 463
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 171/360 (47%), Gaps = 32/360 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
I FGC T G + GI G G G +S++SQ+ R FS+CL S
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 279
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F I+ PG+V +PL ++ Y L + +I V Q L + +T ++D+GT
Sbjct: 280 VFVLGEILV-PGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGT 338
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFRGADVK 365
TLT+L + L+ +S+ + +Q V + E CY + S+S + P V+++F G
Sbjct: 339 TLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASM 397
Query: 366 LSRS-----NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ R ++ + + C F+ I G+++ + + YD+ +Q + + DC
Sbjct: 398 MLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 162/371 (43%), Gaps = 52/371 (14%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W C+ CP D L+D K S+T ++
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214
Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C + C+ + C G+ C YSV YGDGS + G + V +G P
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
+ FGCG G S + GI+G G + S++SQ+ ++ + FS+CL V I
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI- 333
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VIDSGTT 308
F +V P V TPL + + Y + + I VG L V + +IDSGTT
Sbjct: 334 FAIGEVVE-PKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTT 392
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE------LCYSF--NSLSQVPEVTIHF- 359
L + PQ V +IE P L C+ + N P VT+HF
Sbjct: 393 LAYFPQ-------EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFD 445
Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQ 410
+ + + + +V E C G NS + + G+++ +N LV YD+E+
Sbjct: 446 KSISLTVYPHEYLFQVKEFEWCI---GWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEK 502
Query: 411 QTVSFKPTDCT 421
Q + + +C+
Sbjct: 503 QGIGWVEYNCS 513
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 117/224 (52%), Gaps = 26/224 (11%)
Query: 38 SPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NY 91
S K +N L D RS+ N + +S + +ASQ I ++ NY
Sbjct: 8 SEKKIDWNRRLQKQLILDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNY 65
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ + +G+ + DT SDL W QCEPC CY Q P+F P SS+Y+S+ C+SS
Sbjct: 66 IVTMGLGSK--NMTVIIDTRSDLTWVQCEPCMS--CYNQQGPIFKPSTSSSYQSVSCNSS 121
Query: 152 QCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C SL N +C N C Y V+YGDGS++NG+L E ++ G V++
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFG-----GVSVSDFV 176
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
FGCG NN GLF +G++GLG +SL+SQ T G FSYCL
Sbjct: 177 FGCGRNNKGLFGG-VSGLMGLGRSYLSLVSQTNATFGGVFSYCL 219
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/413 (25%), Positives = 174/413 (42%), Gaps = 48/413 (11%)
Query: 50 PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPP 101
P QR + RSL+ + + + A +P N Y ++ +G+P
Sbjct: 26 PVQRKFNGPHRSLDAIKAHDDRRR---GRFLAAIDVPLGGNGLPSSTGLYYTKVGLGSPA 82
Query: 102 TERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
E DTGSD++W C CP D L+DP S T ++PC C
Sbjct: 83 KEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYS 142
Query: 159 KSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNG 211
SG ++C YS++YGDGS ++G+ +++T +G P + FGCG
Sbjct: 143 GPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQS 202
Query: 212 GLFNSKT----TGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSG 265
G +S + GI+G G + S++SQ+ + + FS+CL I + G V
Sbjct: 203 GSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIF--SIGQVME 260
Query: 266 PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTTLTFLPQG-Y 316
P +TPL Y + + + V + + + S +IDSGTTL +LP Y
Sbjct: 261 PKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIY 320
Query: 317 NSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVK 374
N L V+ ++ V D YS P V HF G + + ++
Sbjct: 321 NQLLPKVLGRQPGLKLMIVEDQFTCFH--YSDKLDEGFPVVKFHFEGLSLTVHPHDYLFL 378
Query: 375 VSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
EDI C ++ + + + G+++ +N LV YD+E + + +C+
Sbjct: 379 YKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCS 431
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 115/428 (26%), Positives = 179/428 (41%), Gaps = 57/428 (13%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
+ H P SP S R DA L L+ + +SS+ + P+ Y+
Sbjct: 27 VYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS---YV 80
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+R +G+P + L DT +D W C PC C S LF P SS+Y SLPCSSS
Sbjct: 81 VRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSSSW 136
Query: 153 CASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
C ++C C +S + D SF LA++T+ LG A
Sbjct: 137 CPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD-----A 190
Query: 199 LPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
+P TFGC ++ G N G++GLG G ++L+SQ + G FSYCL P +
Sbjct: 191 IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL-PSYRSYYFS 249
Query: 258 GTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVST----------PD 300
G+ + +G G V TP+ + + Y + + +SVG+ + V
Sbjct: 250 GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAG 309
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 358
V+DSGT +T + L + A G+ + C++ + ++ P VT+H
Sbjct: 310 TVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVH 369
Query: 359 FRGA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQT 412
G D+ L N + S + C + + + V + N+ Q N V +D+
Sbjct: 370 MDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSR 429
Query: 413 VSFKPTDC 420
V F C
Sbjct: 430 VGFAKESC 437
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 87/157 (55%), Gaps = 11/157 (7%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
Q SSS S + + Y R+ +GTPP V DTGSD++W QC PC +CY
Sbjct: 155 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 210
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK S ++ S+ C S C L+ C S +C Y V+YGDGSF+ G +TET+T
Sbjct: 211 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 270
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
+ +P + GCG +N GLF G++GLG
Sbjct: 271 F-----RGTRVPKVALGCGHDNEGLFVG-AAGLLGLG 301
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 165/367 (44%), Gaps = 44/367 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP DTGSD++W +C+ CP + +DP S T ++
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVG 141
Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-- 196
C C + S GV CQ+ ++YGDGS + G T+ V +G
Sbjct: 142 CEQEFCVA---NSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQT 198
Query: 197 -VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPV 250
+ ITFGCG GG N GI+G G D S++SQ+ + F++CL V
Sbjct: 199 TTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTV 258
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIV 302
I F +V P V +TPL T Y + + ISVG L + T +
Sbjct: 259 RGGGI-FAIGNVVQ-PKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTI 316
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 362
IDSGTTL +LP+ LL+ + + P+ + + +S + P +T F G
Sbjct: 317 IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFVCFQFSGSIDDGFPVITFSFEG- 375
Query: 363 DVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L+ ++ + D+ C F G+ + + G+++ +N LV YD+E++ +
Sbjct: 376 DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIG 435
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 436 WTDYNCS 442
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 120/444 (27%), Positives = 180/444 (40%), Gaps = 32/444 (7%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHR------DSPKSP--FYNSSETPYQRLRDALT 59
FIL F+ V A FS LIHR S KSP F Y RL ++
Sbjct: 6 AFILLFILSLVSEKSLASL--FSSRLIHRFSDEGRASIKSPGSFPEKRSFEYYRLLTSID 63
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNN---ANYLIRISIGTPPTERLAVADTGSDLIW 116
++N + S+ S+ S+ I P N + I IGTP L D+GSDL+W
Sbjct: 64 SRRQKMNLGAKFQSLVPSEGSKT-ISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLLW 122
Query: 117 TQCE--PCPP------SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
C C P S +D FDP S+T K PCS C S C Y
Sbjct: 123 IPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCPY 182
Query: 169 SVSYG-DGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFNSKTT--GIVGL 224
+V+Y + + S+G L + + L + + ++ + GCG G F G++GL
Sbjct: 183 TVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMGL 242
Query: 225 GGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVL 282
G G+IS+ S + + FS C S +I FG G + P Y +
Sbjct: 243 GPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNEFVAYFV 302
Query: 283 TIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
++ VGN L S+ +IDSG + TFLP+ + + S I A G E
Sbjct: 303 GVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWEY 362
Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
CY + +VP + + F + + FV + + I+ S G ++ N+
Sbjct: 363 CYETSFEPKVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQFCLPISASEEGTGGVIGQNY 422
Query: 403 LVGY----DIEQQTVSFKPTDCTK 422
+ GY D E + + + C +
Sbjct: 423 MAGYRIVFDRENMKLGWSASKCQE 446
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 163/373 (43%), Gaps = 48/373 (12%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKS 145
+N L IG P + DTGSD +W C CP D L+DP +S T K+
Sbjct: 72 SNGLYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKA 131
Query: 146 LPCSSSQCASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
+PC C S S G++C YS++YGDGS ++G+ + +T G +P
Sbjct: 132 VPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 191
Query: 201 --GITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPV 250
+ FGCG+ G +S T GI+G G + S++SQ+ AGK FS+CL +
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAA--AGKVKRIFSHCLDSI 249
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIV 302
S I F +V P V +TPL + Y + + I V + + S +
Sbjct: 250 SGGGI-FAIGEVVQ-PKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTI 307
Query: 303 IDSGTTLTFLPQGYNSNLLS---VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEV--TI 357
IDSGTTL +LP LL S ++ V D C+ ++ V ++ T+
Sbjct: 308 IDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQF----TCFHYSDEESVDDLFPTV 363
Query: 358 HF---RGADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDI 408
F G + ++ ED+ C ++ + + G+++ N LV YD+
Sbjct: 364 KFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVYDL 423
Query: 409 EQQTVSFKPTDCT 421
+ + + +C+
Sbjct: 424 DNMAIGWADYNCS 436
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 116/432 (26%), Positives = 180/432 (41%), Gaps = 61/432 (14%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H P SP S R DA L L+ + +SS+ + P+
Sbjct: 27 LSVYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS--- 80
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +G+P + L DT +D W C PC C S LF P SS+Y SLPCSS
Sbjct: 81 YVVRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSS 136
Query: 151 SQCASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
S C ++C C +S + D SF LA++T+ LG
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD---- 191
Query: 197 VALPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
A+P TFGC ++ G N G++GLG G ++L+SQ + G FSYCL P +
Sbjct: 192 -AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL-PSYRSYY 249
Query: 256 NFGTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP--------- 299
G+ + +G G V TP+ + + Y + + +SVG R V P
Sbjct: 250 FSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVG--RAWVKVPAGSFAFDAA 307
Query: 300 ---DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 354
V+DSGT +T + L + A G+ + C++ + ++ P
Sbjct: 308 TGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPA 367
Query: 355 VTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDI 408
VT+H G D+ L N + S + C + + + V + N+ Q N V +D+
Sbjct: 368 VTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDV 427
Query: 409 EQQTVSFKPTDC 420
+ F C
Sbjct: 428 ANSRIGFAKESC 439
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 161/361 (44%), Gaps = 45/361 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
Y ++ +GTPP DTGSDL+W C PC + P+ +D K S++ +P
Sbjct: 36 YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
CS C + Q S SG N C YS YGDGS + G L + + A +
Sbjct: 96 CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
FGCG G ++ GI+G G D+S SQ+ GK F++CL
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DGGERGGG 207
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----TPDI----VIDSGTT 308
G V P + TPL + Y + + +ISV N L + + D+ + DSGTT
Sbjct: 208 ILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTT 267
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
L +LP +S ++ + D S + F P V ++F GA + L+
Sbjct: 268 LAYLPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTP 321
Query: 369 SNFFVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ + ++ + I C ++ + ++ I+G+++ N LV YD+E+ + ++P D
Sbjct: 322 AEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFD 381
Query: 420 C 420
C
Sbjct: 382 C 382
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 169/376 (44%), Gaps = 51/376 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--LFDPKMSSTYKSLP 147
Y +R +GTP + VADTGSDL W +C D+P +F S ++ +
Sbjct: 111 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDG---TGDAPRRVFRAAASRSWAPIA 167
Query: 148 CSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL---GSTT----GQ 195
CSS C S L S C Y Y DGS + G + T++ T+ GS + G+
Sbjct: 168 CSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGR 227
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS 251
L G+ GC + G + G++ LG +IS S+ G+FSYCLV P +
Sbjct: 228 RAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 287
Query: 252 STK-INFGTNGIVSG--------PGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP 299
+T + FG G G TPL + FY + +DA+ V + L +
Sbjct: 288 ATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPA- 346
Query: 300 DI---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNS 348
D+ ++DSGT+LT L +++ +S + P DP E CY++ +
Sbjct: 347 DVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSMDP---FEYCYNWTA 403
Query: 349 LS-QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVG 405
+ ++P + + F G A ++ ++ V + + C V +G V + GNI+Q + L
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQEGAWPGVSVIGNILQQDHLWE 463
Query: 406 YDIEQQTVSFKPTDCT 421
+D+ + + FK T C
Sbjct: 464 FDLRDRWLRFKHTRCA 479
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 165/361 (45%), Gaps = 35/361 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + L+D K S T K +
Sbjct: 98 YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157
Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C ++N + ++C Y+ Y DGS S G + V +G A
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217
Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
+ FGC G +S+ GI+G G + S+ISQ+ ++ + F++CL ++ I
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGI- 276
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI------VIDSGTT 308
F IV P V +TPL +T Y + + A+ VG L + T D+ +IDSGTT
Sbjct: 277 FAIGHIVQ-PKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTT 335
Query: 309 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 365
L +LP+ LLS + S ++ + D + YS + P VT HF +
Sbjct: 336 LAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYL 393
Query: 366 LSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ ++ + + C ++ G+ ++ + G++ +N LV YD+E Q + + +
Sbjct: 394 KVHPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYN 453
Query: 420 C 420
C
Sbjct: 454 C 454
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 161/371 (43%), Gaps = 52/371 (14%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W C+ CP D L+D K S+T ++
Sbjct: 74 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 133
Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C + C+ + C G+ C YSV YGDGS + G + V +G P
Sbjct: 134 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 193
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
+ FGCG G S + GI+G G + S++SQ+ ++ + FS+CL V I
Sbjct: 194 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI- 252
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTT 308
F +V P V TPL + + Y + + I VG L V +IDSGTT
Sbjct: 253 FAIGEVVE-PKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTT 311
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE------LCYSF--NSLSQVPEVTIHF- 359
L + PQ V +IE P L C+ + N P VT+HF
Sbjct: 312 LAYFPQ-------EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFD 364
Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQ 410
+ + + + +V E C G NS + + G+++ +N LV YD+E+
Sbjct: 365 KSISLTVYPHEYLFQVKEFEWCI---GWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEK 421
Query: 411 QTVSFKPTDCT 421
Q + + +C+
Sbjct: 422 QGIGWVEYNCS 432
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 162/364 (44%), Gaps = 37/364 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + D L++ S T K +P
Sbjct: 78 YYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVP 137
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C +N G ++C Y YGDGS + G + V +G A
Sbjct: 138 CDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANG 197
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
+ FGCG G S GI+G G + S+ISQ+ T + F++CL +
Sbjct: 198 SVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGG 257
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
I G V P V TPL + Y + + A+ VG++ L + T +IDSG
Sbjct: 258 IF--VIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSG 315
Query: 307 TTLTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 363
TTL +LP+ L+S + S ++ V D + YS + P VT HF +
Sbjct: 316 TTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYTCFQ--YSDSLDDGFPNVTFHFENSV 373
Query: 364 VKLSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+ + ++ E + C ++ G+ ++ + G+++ +N LV YD+E Q + +
Sbjct: 374 ILKVYPHEYLFPFEGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTE 433
Query: 418 TDCT 421
+C+
Sbjct: 434 YNCS 437
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 166/362 (45%), Gaps = 35/362 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + L+D K S T K +
Sbjct: 98 YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157
Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C ++N + ++C Y+ Y DGS S G + V +G A
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217
Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
+ FGC G +S+ GI+G G + S+ISQ+ ++ + F++CL ++ I
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGI- 276
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI------VIDSGTT 308
F IV P V +TPL +T Y + + A+ VG L + T D+ +IDSGTT
Sbjct: 277 FAIGHIVQ-PKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTT 335
Query: 309 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 365
L +LP+ LLS + S ++ + D + YS + P VT HF +
Sbjct: 336 LAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYL 393
Query: 366 LSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ ++ + + C ++ G+ ++ + G++ +N LV YD+E Q + + +
Sbjct: 394 KVHPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYN 453
Query: 420 CT 421
C+
Sbjct: 454 CS 455
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 156/350 (44%), Gaps = 60/350 (17%)
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGD 174
QC+PC CY Q P+F+PK+SS+Y +PC+S CA L+ C + CQY+ Y
Sbjct: 2 QCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSG 59
Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
+ G LA + + +G AV FGC ++ G ++ +G+VGLG G +SL+SQ
Sbjct: 60 HGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQ 114
Query: 235 MRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDA 286
+ +F YCL P S + G + + + V+ + T+ ++Y L +D
Sbjct: 115 LSVH---RFMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDG 171
Query: 287 ISVGNQ-----RLGVSTPD------------------------IVIDSGTTLTFLPQGYN 317
++VG+Q R S P +++D +T++FL
Sbjct: 172 LAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLY 231
Query: 318 SNLLSVMSSMIEAQPVADPTGS--LELCYSF-----NSLSQVPEVTIHFRGADVKLSRSN 370
L + I P A P+ L+LC+ VP V++ F G ++L R
Sbjct: 232 DELADDLEEEIRL-PRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDR 290
Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F V++ + + G T+ V I GN N V +++ + ++F C
Sbjct: 291 LF--VTDGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 111/205 (54%), Gaps = 18/205 (8%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP S+TY ++PCSS+ CA L
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ C + CQ+ ++Y +G+ + G +++ +TLG + G FGC + G
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
F+ G + LGGG S + Q + + FSYC VP S++ F G+ P
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGVPPQRAALVPTF 249
Query: 269 VSTPL----TKAKTFYVLTIDAISV 289
VSTPL T + TFY +T+ +I++
Sbjct: 250 VSTPLLSSSTMSPTFYSITLPSIAL 274
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 180/389 (46%), Gaps = 36/389 (9%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L+ S L +S+ ++ D+IP Y RI IGTPP + DTGS L +
Sbjct: 60 LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
C C QC P F P SSTY+ L C S +C ++ ++C Y Y + S
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171
Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G Q+ P T FGC G +++ + GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228
Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
+ I FS C + GI G+V T A++ +Y + + I + +
Sbjct: 229 VEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288
Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
+L ++ P + ++DSGTT +LP+ + + ++M + + + P + ++C
Sbjct: 289 QLPIN-PMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDIC 347
Query: 344 YS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFVKVSE---DIVCSVFKGITNSVPI 393
+S + LS+ P V + F G + LS N+ + S+ +F+ + +
Sbjct: 348 FSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTL 407
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
G I+ N LV YD E + F T+C++
Sbjct: 408 LGGIIVRNTLVMYDREHLKIGFWKTNCSE 436
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 180/389 (46%), Gaps = 36/389 (9%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L+ S L +S+ ++ D+IP Y RI IGTPP + DTGS L +
Sbjct: 60 LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
C C QC P F P SSTY+ L C S +C ++ ++C Y Y + S
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171
Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G Q+ P T FGC G +++ + GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228
Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
+ I FS C + GI G+V T A++ +Y + + I + +
Sbjct: 229 VEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288
Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
+L ++ P + ++DSGTT +LP+ + + ++M + + + P + ++C
Sbjct: 289 QLPIN-PMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDIC 347
Query: 344 YS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFVKVSE---DIVCSVFKGITNSVPI 393
+S + LS+ P V + F G + LS N+ + S+ +F+ + +
Sbjct: 348 FSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTL 407
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
G I+ N LV YD E + F T+C++
Sbjct: 408 LGGIIVRNTLVMYDREHLKIGFWKTNCSE 436
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 120/495 (24%), Positives = 193/495 (38%), Gaps = 86/495 (17%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
IL + +++ P+ + +EL+HR + + ++ + R R N
Sbjct: 16 ILITITLHLILPVAVNS--MRLELVHRHHERFSGGGGDVDQVEAVKGFVNRDGLRRQRMN 73
Query: 70 QNSSISSSKASQADI---------IPNNA-------NYLIRISIGTPPTERLAVADTGSD 113
Q +S+ + + +P A Y + +G+P ADTGS+
Sbjct: 74 QRWGVSNYDRRRKGLETTTTTEVEMPMRAGRDDALGEYFTEVKVGSPGQRFWLAADTGSE 133
Query: 114 LIWTQC---------------------------------EPCPPSQCYMQDSP---LFDP 137
W C + + +P +F P
Sbjct: 134 FTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCP 193
Query: 138 KMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
S +++++ C+S +C SL+ C Y +SY DGS + G T+T+T+
Sbjct: 194 HRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFGTDTITVD 253
Query: 191 STTGQAVALPGITFGC--GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
G+ L +T GC NG FN T GI+GLG S I + KFSYCLV
Sbjct: 254 LKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLV 313
Query: 249 PVSSTK-------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----- 296
S + I N + G + T L FY + + IS+G Q L +
Sbjct: 314 DHLSHRNVSSYLTIGGHHNAKLLGE-IKRTELILFPPFYGVNVVGISIGGQMLKIPPQVW 372
Query: 297 ---STPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSL-- 349
S +IDSGTTLT L Y +++ S+ + + V + G+L+ C+
Sbjct: 373 DFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCFDAEGFDD 432
Query: 350 SQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGY 406
S VP + HF GA + ++ + V+ + C I + GNIMQ N L +
Sbjct: 433 SVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEF 492
Query: 407 DIEQQTVSFKPTDCT 421
D+ T+ F P+ CT
Sbjct: 493 DLSTNTIGFAPSICT 507
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 162/360 (45%), Gaps = 63/360 (17%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + + +G+PP + DTGSDL W QC PC C+ Q+
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQND------------------ 209
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALPGITFGC 206
NQ +C Y YGD S + G+ A ET T+ TT + + + FGC
Sbjct: 210 ------NQ------SCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGC 257
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----STKINFGTN- 260
G N GLF+ + G +S SQ+++ FSYCLV + S+K+ FG +
Sbjct: 258 GHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDK 316
Query: 261 GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI--------VIDS 305
++S P + T K TFY + I +I V + L + T +I +IDS
Sbjct: 317 DLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDS 376
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTIHF-RG 361
GTTL++ + + + ++ + + PV L+ C++ + + Q+PE+ I F G
Sbjct: 377 GTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADG 436
Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
A N F+ ++ED+VC G S I GN Q NF + YD ++ + + PT C
Sbjct: 437 AVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 496
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 160/361 (44%), Gaps = 45/361 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
Y ++ +GTPP DTGSDL+W C PC + P+ +D K S++ +P
Sbjct: 36 YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
CS C + Q S SG N C YS YGDGS + G L + + A +
Sbjct: 96 CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
FGCG G ++ GI+G G D+S SQ+ GK F++CL
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DGGERGGG 207
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----TPDI----VIDSGTT 308
G V P + TPL Y + + +ISV N L + + D+ + DSGTT
Sbjct: 208 ILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTT 267
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
L +LP +S ++ + D S + F P V ++F GA + L+
Sbjct: 268 LAYLPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTP 321
Query: 369 SNFFVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ + ++ + I C ++ + ++ I+G+++ N LV YD+E+ + ++P D
Sbjct: 322 AEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFD 381
Query: 420 C 420
C
Sbjct: 382 C 382
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 166/363 (45%), Gaps = 35/363 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + + +D + S+T K +
Sbjct: 87 YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
C C +N SG ++C Y YGDGS + G + V +G + A G
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANG 206
Query: 202 -ITFGCGTNNGGLFNS----KTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
I FGCG G S GI+G G + S+ISQ+ +T + F++CL +
Sbjct: 207 SIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGG 266
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
I F +V P V TPL + Y + + + VG+ L +S +IDSG
Sbjct: 267 I-FAMGHVVQ-PKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSG 324
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 364
TTL +LP+ L++ + S V G + C+ ++ P V HF + +
Sbjct: 325 TTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERVDDGFPPVIFHFENSLL 383
Query: 365 KLSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ ++ E++ C ++ G+ +V ++G+++ +N LV YD+E QT+ +
Sbjct: 384 LKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEY 443
Query: 419 DCT 421
+C+
Sbjct: 444 NCS 446
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/277 (32%), Positives = 130/277 (46%), Gaps = 30/277 (10%)
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C Y+++YGDGSF+ G L E + G+ + + FGCG NN GLF +G++GLG
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGT-----ILVKDFIFGCGRNNKGLFGG-VSGLMGLG 186
Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV---STPLTKAK----- 277
D+SLISQ G FSYCL ST+ + I+ G V S+P++ AK
Sbjct: 187 RSDLSLISQTSGIFGGVFSYCL---PSTERKGSGSLILGGNSSVYRNSSPISYAKMIENP 243
Query: 278 ---TFYVLTIDAISVGN---QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
FY + + IS+G Q V I++DSGT +T LP L +
Sbjct: 244 QLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGF 303
Query: 332 PVADPTGSLELCYSFNSLSQV--PEVTIHFRG---ADVKLSRSNFFVKVSEDIVCSVFKG 386
P A L+ C++ ++ +V P + +HF G V ++ +FVK VC
Sbjct: 304 PPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 363
Query: 387 IT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ + V I GN Q N V YD ++ V F C+
Sbjct: 364 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 164/372 (44%), Gaps = 53/372 (14%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ IGTP + DTGSD++W QC CP + + L++ K S + K +P
Sbjct: 86 YYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVP 145
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
C C +N SG ++C Y YGDGS + G + V +G Q + G
Sbjct: 146 CDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNG 205
Query: 202 -ITFGCGTNNGGLF----NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTK 254
+ FGCG G GI+G G + S+ISQ+ T K F++CL ++
Sbjct: 206 SVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGG 265
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VIDSG 306
I F +V P V TPL + Y + + A+ VG L + T + +IDSG
Sbjct: 266 I-FAIGHVVQ-PKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSG 323
Query: 307 TTLTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 363
TTL +LP+ L+S + S ++ V D + YS + P VT HF
Sbjct: 324 TTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTCFQ--YSGSVDDGFPNVTFHF---- 377
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI--------------TNSVPIYGNIMQTNFLVGYDIE 409
++ F+KV F+G+ ++ + G+++ +N LV YD+E
Sbjct: 378 ----ENSVFLKVHPHEYLFPFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDLE 433
Query: 410 QQTVSFKPTDCT 421
Q + + +C+
Sbjct: 434 NQAIGWTEYNCS 445
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 172/361 (47%), Gaps = 32/361 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVT 159
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
I FGC T G + GI G G G +S++SQ+ R FS+CL S
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 279
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F I+ PG+V +PL ++ Y L + +I V Q L + +T ++D+GT
Sbjct: 280 VFVLGEILV-PGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGT 338
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFR-GADV 364
TLT+L + L+ +S+ + +Q V + E CY + S+S + P V+++F GA +
Sbjct: 339 TLTYLVKEAYDPFLNAISNSV-SQLVTLIISNGEQCYLVSTSISDMFPPVSLNFAGGASM 397
Query: 365 KLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L ++ + C F+ I G+++ + + YD+ +Q + + DC
Sbjct: 398 MLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDC 457
Query: 421 T 421
+
Sbjct: 458 S 458
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 118/443 (26%), Positives = 191/443 (43%), Gaps = 61/443 (13%)
Query: 8 VFILFFLCFYVVSPIEA---------QTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDA 57
+F F VVS +A ++ G + +IH SPF + + +
Sbjct: 3 IFTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINM 62
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADI-----IPNNANYLIRISIGTPPTERLAVADTGS 112
++ R+ + + S ++S KA+ I + N NY++R+ +GTP V DT
Sbjct: 63 ASKDPARVTYLS--SLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSR 120
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYS 169
D W C C + C SP F P SSTY SL CS QC + SC C ++
Sbjct: 121 DAAWVPCADC--AGC---SSPTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFN 175
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
+YG S + L+ +++ L T LP +FGC G G++GLG G +
Sbjct: 176 QTYGGDSSFSAMLSQDSLGLAVDT-----LPSYSFGCVNAVSG-STLPPQGLLGLGRGPM 229
Query: 230 SLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYV 281
SL+SQ + +G FSYC S K + + + GP + +TPL + T Y
Sbjct: 230 SLLSQSGSLYSGVFSYCF---PSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYY 286
Query: 282 LTIDAISVGNQRLGVSTPDI-----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
+ + +SVG + V+ P++ +IDSGT +T + + + ++
Sbjct: 287 VNLTGVSVGRVLVPVA-PELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG 345
Query: 331 QPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITN 389
P A G+ + C++ + P VT HF G D+KL N + S + C N
Sbjct: 346 -PFAT-IGAFDTCFAATNEDIAPPVTFHFTGMDLKLPLENTLIHSSAGSLACLAMAAAPN 403
Query: 390 SV----PIYGNIMQTNFLVGYDI 408
+V + N+ Q N + +D+
Sbjct: 404 NVNSVLNVIANLQQQNLRIMFDV 426
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/356 (29%), Positives = 160/356 (44%), Gaps = 45/356 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD------SPLFDPKMSSTY 143
Y ++ +GTP T L V DTGSD++W PP ++ +P P+ +
Sbjct: 121 EYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN--- 177
Query: 144 KSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C + C L+ C C Y V+YGDGS + G+ A+ET+T + +
Sbjct: 178 ----CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQR 229
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+ GCG +N GLF + + ++GLG G +S SQ+ + FSYCLV +S++ +
Sbjct: 230 VAIGCGHDNEGLFIAASG-LLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRR 288
Query: 262 IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL-GVSTPD-----------IVIDSGTTL 309
P + TFY + + SVG R+ GVS D +++DSGT++
Sbjct: 289 WGGTP--------RMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSV 340
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR-GADVK 365
T L + + + V+ SL + CY+ + +VP V++H GA V
Sbjct: 341 TRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVA 400
Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
L N+ + V + C G V I GNI Q F V +D + Q V F P C
Sbjct: 401 LPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 172/385 (44%), Gaps = 44/385 (11%)
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYM 129
+SSI++ D+ P+ Y + ++IG PP D+GSDL W QC+ PC C
Sbjct: 38 SSSIAAVFPLYGDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNE 94
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNL 182
PL+ P S K +PC CASL+ + C + C Y + Y D S G L
Sbjct: 95 VPHPLYRPTKS---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVL 151
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
++ L T G +VA P + FGCG + G +S T G++GLG G +SL+SQ++
Sbjct: 152 INDSFALRLTNG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG 210
Query: 240 AGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLG 295
K +CL + FG + +V TP+ ++ + +Y ++ G++ LG
Sbjct: 211 VTKNVVGHCLSLRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLG 269
Query: 296 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQ 351
V +V DSG++ T+ L++ + + +P SL LC+ F S+
Sbjct: 270 VRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLD 329
Query: 352 VPE----VTIHFRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNI 397
V + + ++F +++ N+ + C GI N + I G+I
Sbjct: 330 VRKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDI 386
Query: 398 MQTNFLVGYDIEQQTVSFKPTDCTK 422
+ +V YD E+ + + C +
Sbjct: 387 TMQDHMVIYDNEKGKIGWIRAPCDR 411
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 170/365 (46%), Gaps = 40/365 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IG+P DTGSD++W +C+ CP + + +DP S T ++
Sbjct: 85 YYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVG 142
Query: 148 CSSSQCASLNQK----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
C C + + +C + CQ+ ++YGDGS + G +++V +G P
Sbjct: 143 CDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPS 202
Query: 201 --GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSST 253
ITFGCG GG S + GI+G G D S++SQ+ + F++CL V
Sbjct: 203 NASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGG 262
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--STPD------IVIDS 305
I F +V P V +TPL + T Y + + ISVG L + ST D +IDS
Sbjct: 263 GI-FAIGNVVQ-PKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDS 320
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA- 362
GTTL +LP+ LL+ + + Q +A +C+ F+ P VT F G
Sbjct: 321 GTTLAYLPREVYRTLLTAV--FDKYQDLALHNYQDFVCFQFSGSIDDGFPVVTFSFEGEI 378
Query: 363 DVKLSRSNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+ + ++ + D+ C F G+ + + G+++ +N LV YD+E+Q + +
Sbjct: 379 TLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWA 438
Query: 417 PTDCT 421
+C+
Sbjct: 439 DYNCS 443
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 173/379 (45%), Gaps = 64/379 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G PP V DTGS+L W C+ P +F+P SSTY +
Sbjct: 61 HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PCSS C + + SC C ++SY D + GNLA ET +GS T
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVT-----R 169
Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
PG FGC G ++ ++K+TG++G+ G +S ++Q+ + KFSYC+ S+
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGFL 226
Query: 257 FGTNGIVSGPGVV--------STPLTK-AKTFYVLTIDAISVGNQRL----GVSTPD--- 300
+ S G + STPL + Y + ++ I VG++ L V PD
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286
Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADP----TGSLELCYSFNS--- 348
++DSGT TFL + L + + ++ + V DP G+++LCY S
Sbjct: 287 AGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTR 346
Query: 349 --LSQVPEVTIHFRGADVKLSRSNFFVKVS-------EDIVCSVFKG---ITNSVPIYGN 396
S +P V++ FRGA++ +S +V+ E++ C F + + G+
Sbjct: 347 PNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGH 406
Query: 397 IMQTNFLVGYDIEQQTVSF 415
Q N + +D+ + V F
Sbjct: 407 HHQQNVWMEFDLAKSRVGF 425
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 172/375 (45%), Gaps = 43/375 (11%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
S++ D + N Y R+ IGTPP E + D+GS + + C C QC P F
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P +SSTY + C+ C + K+ C Y Y + S S+G L + V+ G+ +
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183
Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
+ FGC + G LF+ GI+GLG G +S++ Q+ + I FS C
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----- 238
Query: 253 TKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI----- 301
++ G +V G PG++ T ++ +Y + + + V + L V P I
Sbjct: 239 GGMDIGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVD-PRIFDGKH 297
Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSL-ELCYS-----FNSLSQV 352
V+DSGTT +LP+ +SS + + + P + ++C++ + LS+V
Sbjct: 298 GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEV 357
Query: 353 -PEVTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
P+V + F G + LS N+ + S E C VF+ + + G I+ N LV YD
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417
Query: 408 IEQQTVSFKPTDCTK 422
+ + F T+C++
Sbjct: 418 RHNEKIGFWKTNCSE 432
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 172/381 (45%), Gaps = 68/381 (17%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G PP V DTGS+L W C+ P +F+P SSTY +
Sbjct: 61 HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PCSS C + + SC C ++SY D + GNLA ET +GS T
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVT-----R 169
Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
PG FGC G ++ ++K+TG++G+ G +S ++Q+ + KFSYC+ S+
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSVFL 226
Query: 257 FGTNGIVSGPGVV--------STPLTK-AKTFYVLTIDAISVGNQRL----GVSTPD--- 300
+ S G + STPL + Y + ++ I VG++ L V PD
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286
Query: 301 ---IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCYSFNS- 348
++DSGT TFL + ++ S++ V DP G+++LCY S
Sbjct: 287 AGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRL--VDDPDFVFQGTMDLCYKVGST 344
Query: 349 ----LSQVPEVTIHFRGADVKLSRSNFFVKVS-------EDIVCSVFKG---ITNSVPIY 394
S +P V++ FRGA++ +S +V+ E++ C F + +
Sbjct: 345 TRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVI 404
Query: 395 GNIMQTNFLVGYDIEQQTVSF 415
G+ Q N + +D+ + V F
Sbjct: 405 GHHHQQNVWMEFDLAKSRVGF 425
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 166/387 (42%), Gaps = 23/387 (5%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
LR L R RL NQ S+S ++ + Y + +GTP T L DTGSD
Sbjct: 63 LRSDLQRQKRRLAGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSD 122
Query: 114 LIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
L W C+ C P Y +D ++ P S+T + LPCS C + + C
Sbjct: 123 LFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCT 182
Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTGIVGL 224
Y++ Y + + S+G L +++ L S G A + GCG G L G++GL
Sbjct: 183 YNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNASVIIGCGRKQSGDYLDGIAPDGLLGL 242
Query: 225 GGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVL 282
G DIS+ S + + FS C SS +I FG G+ S PL Y +
Sbjct: 243 GMADISVPSFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAV 302
Query: 283 TIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
+D +G++ L S+ ++DSGT+ T LP + I A V + +
Sbjct: 303 NVDKSCIGHKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKY 362
Query: 343 CYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFKGITNSVPIYGNI 397
CYS + L VP + + F A+ N + +++ + + ++ PI I
Sbjct: 363 CYSASPLEMPDVPTIILAF-AANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPI--GI 419
Query: 398 MQTNFLVGY----DIEQQTVSFKPTDC 420
+ NFLVGY D E + + ++C
Sbjct: 420 IGQNFLVGYHVVFDRESMKLGWYRSEC 446
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 116/425 (27%), Positives = 182/425 (42%), Gaps = 70/425 (16%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADI------IP-------NNANYLIRISIG 98
+R RD R H S ++S + AD+ +P Y +R +G
Sbjct: 59 ERARDDARR------HAYIRSQLASRRRRAADVGASAFAMPLSSGAYTGTGQYFVRFRVG 112
Query: 99 TPPTERLAVADTGSDLIWTQCEPC--PPSQCYMQDSPL--FDPKMSSTYKSLPCSSSQCA 154
TP + VADTGSDL W +C PP+ D P F S ++ L CSS C
Sbjct: 113 TPAQPFVLVADTGSDLTWVKCRGAAGPPAS----DPPAREFRASESRSWAPLACSSDTCT 168
Query: 155 S-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----------STTGQAVAL 199
S L S C Y Y DGS + G + T+ T+ G+ L
Sbjct: 169 SYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKL 228
Query: 200 PGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SST 253
G+ GC T +G F S + G++ LG +IS S+ G+FSYCLV +S+
Sbjct: 229 QGVVLGCTATYDGQSFQS-SDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASS 287
Query: 254 KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI--------- 301
+ FG G TPL + FY + +DA+ V + L + D+
Sbjct: 288 YLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPA-DVWDVGRGGGA 346
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNS-LSQVPEVTIH 358
++DSGT+LT L +++ + + A P DP E CY++ + ++P++ +
Sbjct: 347 ILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FEYCYNWTAGAPEIPKLEVS 403
Query: 359 FRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
F G A ++ ++ + + + C V +G V + GNI+Q L +D+ + + FK
Sbjct: 404 FAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWLRFK 463
Query: 417 PTDCT 421
T C
Sbjct: 464 HTRCA 468
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 23/221 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN----HFNQNSSISSSKASQADII 85
S+E+IH+ P S R + L + +R+N +N + +
Sbjct: 67 SLEVIHKHGPCSKLSQDKGRSPSRTQ-MLDQDESRVNSIRSRLAKNPADGGKLKGSKVTL 125
Query: 86 PNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P+ + NY++ + +GTP + + DTGSDL WTQCEPC CY Q P+F+P
Sbjct: 126 PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCYHQQEPIFNPS 184
Query: 139 MSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S++Y ++ CSS C L N SCS C Y + YGD S+S G A + + L ST
Sbjct: 185 KSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD 244
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
FGCG NN GLF G++GLG +SL+S+
Sbjct: 245 ----VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLMSK 280
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 9/111 (8%)
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 371
G N LS+MS P A P L+ CY F+ VP++ ++F GA++ L S
Sbjct: 269 GLGRNALSLMSKY----PKAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGI 324
Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F ++ VC F G +++ + I GN+ Q F V YD+ + F P C
Sbjct: 325 FYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 177/383 (46%), Gaps = 65/383 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + ++ +++GTPP V DTGS+L W C + FDP S++Y+++
Sbjct: 27 HNVSLIVSLTVGTPPQNVSMVIDTGSELSWLHCNKT------LSYPTTFDPTRSTSYQTI 80
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSS C + Q SC N C ++SY D S S+GNLA++ +GS+ +
Sbjct: 81 PCSSPTCTNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD-----IS 135
Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-STKIN 256
G+ FGC ++ +SK+TG++G+ G +S +SQ+ KFSYC+ S +
Sbjct: 136 GLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP---KFSYCISGTDFSGLLL 192
Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRLGVST----PD---- 300
G + + + TPL + T Y + ++ I V ++ L + PD
Sbjct: 193 LGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGA 252
Query: 301 --IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCY----SF 346
++DSGT TFL S L+ SS++ + DP G+++LCY S
Sbjct: 253 GQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRV--LEDPDFVFQGAMDLCYLVPLSQ 310
Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNI 397
L +P VT+ FRGA++ +S +V ++ + C F + + G+
Sbjct: 311 RVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHH 370
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q N + +D+E+ + C
Sbjct: 371 HQQNVWMEFDLEKSRIGLAQVRC 393
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 134/288 (46%), Gaps = 40/288 (13%)
Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT 218
+ CSG +C Y V YGDGS++ G A +T+TL S A+ G FGCG N GLF +
Sbjct: 14 RGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHD----AIKGFRFGCGERNEGLFG-EA 68
Query: 219 TGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK- 277
G++GLG G SL Q G F++C SS GT + GPG S+P AK
Sbjct: 69 AGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSS-----GTGYLEFGPG--SSPAVSAKL 121
Query: 278 -----------TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLL 321
TFY + + I VG + L + + ++DSGT +T LP S+L
Sbjct: 122 STTPMLIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLR 181
Query: 322 SVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSNFFVK 374
S ++ + A+ A L+ CY S+V P V++ F+G DV S +
Sbjct: 182 SAFAASMAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAAS 241
Query: 375 VSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VS+ C F G + V I GN F V YDI + V F P C
Sbjct: 242 VSQ--ACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 165/375 (44%), Gaps = 45/375 (12%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP D+GSDL W QC+ PC C PL+ P S
Sbjct: 56 GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 112
Query: 141 STYKSLPCSSSQCASLNQKSCSGVN--------CQYSVSYGDGSFSNGNLATETVTLGST 192
K +PC CASL+ G + C Y + Y D S G L ++ L T
Sbjct: 113 ---KLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLT 169
Query: 193 TGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCL 247
G +VA P + FGCG + G +S T G++GLG G +SL+SQ++ K +CL
Sbjct: 170 NG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL 228
Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDS 305
+ FG + +V TP+ ++ + +Y ++ G++ LGV +V DS
Sbjct: 229 SLRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDS 287
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTI 357
G++ T+ L++ + + +P SL LC+ F S+ V + + +
Sbjct: 288 GSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVL 347
Query: 358 HFRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYD 407
+F +++ N+ + C GI N + I G+I + +V YD
Sbjct: 348 NFASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYD 404
Query: 408 IEQQTVSFKPTDCTK 422
E+ + + C +
Sbjct: 405 NEKGKIGWIRAPCDR 419
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 114/428 (26%), Positives = 172/428 (40%), Gaps = 76/428 (17%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
++ GF ++LIHRDSP+SPFY T +R+ + S R ++F+ S SS+A +
Sbjct: 27 SKPNGFRLQLIHRDSPESPFYPGKLTNSERISRLVEFSKIRAHNFD---SGFSSEAFRPP 83
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ + YL+++ IG P V DTGS LIWT
Sbjct: 84 VFQDFTCYLVKVRIGNPGIPLYLVPDTGSALIWT-------------------------- 117
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ N C C Y+ Y DGS + G A + L S + +
Sbjct: 118 ---------VNNQNIFQCRNNKCSYTRRYDDGSITTGVAAQDI--LQSEGSERIPF---Y 163
Query: 204 FGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-------S 252
FGC +N K+ G++GL +SL+ Q+ +FSYCL P S
Sbjct: 164 FGCSRDNQNFSVFEHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPPPS 223
Query: 253 TKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGV----------STPD 300
+ + FG + STPL + + Y L + ++V QRL + T
Sbjct: 224 SLLRFGNDIRKGRRRFQSTPLMSSPDRPNYFLNLLDMTVAGQRLHLPPGTFALRQDGTGG 283
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSF---NSLSQVPE 354
+IDSGT LTF+ Q L+S + + Q V P +LCYSF ++
Sbjct: 284 TIIDSGTGLTFITQTAYPRLISAFQNYFDHRGFQRVHIP--EFDLCYSFRGNHTFHDHAS 341
Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQT 412
+T HF AD + ++ + +D V T + G I Q N YD
Sbjct: 342 MTFHFERADFTVQADYVYLPMEDDNAFCVALQPTPPQQRTVIGAINQGNTRFIYDAAAHQ 401
Query: 413 VSFKPTDC 420
+ F +C
Sbjct: 402 LLFIAENC 409
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 165/377 (43%), Gaps = 50/377 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C P + + F P+ S T+ S+
Sbjct: 62 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 121
Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PC S+QC S + S C G + C+ S+SY DGS S+G LATE T+G A
Sbjct: 122 PCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 181
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+ T+ G+ T G++G+ G +S +SQ T +FSYC+ + +
Sbjct: 182 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHS 235
Query: 262 IVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD------IVI 303
+ + TPL + + Y + + I VG + L V PD ++
Sbjct: 236 DLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMV 295
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPT----GSLELCYSFNS----LSQVP 353
DSGT TFL S L + S + A DP + + C+ +++P
Sbjct: 296 DSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLP 355
Query: 354 EVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVPI----YGNIMQTNFL 403
VT+ F GA + ++ KV + + C F G + VPI G+ Q N
Sbjct: 356 AVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF-GNADMVPITAYVIGHHHQMNVW 414
Query: 404 VGYDIEQQTVSFKPTDC 420
V YD+E+ V P C
Sbjct: 415 VEYDLERGRVGLAPIRC 431
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 158/349 (45%), Gaps = 47/349 (13%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
++ RL + S+++ K + I P ANY++R+ +GTP + V DT +D
Sbjct: 11 SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
W C S C S F P S+T SL CS +QC+ + SC C ++
Sbjct: 68 AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122
Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
SYG S L + +TL + +PG TFGC +GG + G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175
Query: 230 SLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYV 281
SLISQ +G FSYCL S K + + + GP + +TPL + + Y
Sbjct: 176 SLISQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYY 232
Query: 282 LTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+ + +SVG ++ + + +V IDSGT +T Q + +
Sbjct: 233 VNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG- 291
Query: 332 PVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIV 380
P++ G+ + C++ + ++ P VT+HF G ++ L N + S V
Sbjct: 292 PISS-LGAFDTCFAATNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 165/377 (43%), Gaps = 50/377 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C P + + F P+ S T+ S+
Sbjct: 61 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 120
Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PC S+QC S + S C G + C+ S+SY DGS S+G LATE T+G A
Sbjct: 121 PCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 180
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
+ T+ G+ T G++G+ G +S +SQ T +FSYC+ + +
Sbjct: 181 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHS 234
Query: 262 IVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD------IVI 303
+ + TPL + + Y + + I VG + L V PD ++
Sbjct: 235 DLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMV 294
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPT----GSLELCYSFNS----LSQVP 353
DSGT TFL S L + S + A DP + + C+ +++P
Sbjct: 295 DSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLP 354
Query: 354 EVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVPI----YGNIMQTNFL 403
VT+ F GA + ++ KV + + C F G + VPI G+ Q N
Sbjct: 355 AVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF-GNADMVPITAYVIGHHHQMNVW 413
Query: 404 VGYDIEQQTVSFKPTDC 420
V YD+E+ V P C
Sbjct: 414 VEYDLERGRVGLAPIRC 430
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 163/373 (43%), Gaps = 57/373 (15%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W C+ CP D L+D K S+T ++
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214
Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C + C+ + C G+ C YSV YGDGS + G + V +G P
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
+ FGCG G S + GI+G G + S++SQ+ ++ + FS+CL V I
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI- 333
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VIDSGTT 308
F +V P V TPL + + Y + + I VG L V + +IDSGTT
Sbjct: 334 FAIGEVVE-PKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTT 392
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE------LCYSF--NSLSQVPEVTIHFR 360
L + PQ V +IE P L C+ + N P VT+HF
Sbjct: 393 LAYFPQ-------EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHF- 444
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFK---GITNS---------VPIYGNIMQTNFLVGYDI 408
D +S + V E + F+ G NS + + G+++ +N LV YD+
Sbjct: 445 --DKSISLT---VYPHEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 499
Query: 409 EQQTVSFKPTDCT 421
E+Q + + +C+
Sbjct: 500 EKQGIGWVEYNCS 512
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 159/368 (43%), Gaps = 39/368 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G P + DTGSD++W C P CP ++DP+ SST +
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61
Query: 148 CSSSQCA---SLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
CS C + CS NC+Y SYGDGS S G + + S+ G A
Sbjct: 62 CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121
Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
+ FGC G ++ GI+G G ++S+ +Q+ + I FS+CL
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGG 180
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGT 307
G ++ PG+ TPL Y + + ISV + RL + D +++DSGT
Sbjct: 181 GILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGT 240
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKL 366
TL + P G + + + A PV + LS + P VT++F G ++L
Sbjct: 241 TLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMEL 300
Query: 367 SRSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQT 412
N+ + + D+ C ++ ++S + I G+I+ + LV YD++
Sbjct: 301 QPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSR 360
Query: 413 VSFKPTDC 420
+ + +C
Sbjct: 361 IGWMSYNC 368
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 179/407 (43%), Gaps = 40/407 (9%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNAN-YLIRISIGTPPTERLAV 107
R+ A ++ +R H ++ Q PN+ Y ++ +GTPP E
Sbjct: 35 HRVEVAALKARDRARHARMLRGVAGGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQ 94
Query: 108 ADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---C 161
DTGSD++W C CP S + FD SST +PCS C S Q + C
Sbjct: 95 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAEC 154
Query: 162 S-GVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PGITFGCGTNNGGLF-- 214
S VN C Y+ YGDGS ++G ++ + GQ A+ I FGC + G
Sbjct: 155 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTK 214
Query: 215 -NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVST 271
+ GI G G G +S++SQ+ R FS+CL I+ P +V +
Sbjct: 215 TDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGGGVLVLGEILE-PSIVYS 273
Query: 272 PLTKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTLTFLPQGYNSNLL 321
PL ++ Y L + +I+V Q L ++ P + ++D GTTL +L Q L+
Sbjct: 274 PLVPSQPHYNLNLQSIAVNGQLLPIN-PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLV 332
Query: 322 SVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFR-GADVKLSRSNFFVK---- 374
+ +++ + +Q CY + S+ + P V+++F GA + L + +
Sbjct: 333 TAINTAV-SQSARQTNSKGNQCYLVSTSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYL 391
Query: 375 VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
++ C F+ I G+++ + +V YDI QQ + + DC+
Sbjct: 392 DGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCS 438
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 158/349 (45%), Gaps = 47/349 (13%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
++ RL + S+++ K + I P ANY++R+ +GTP + V DT +D
Sbjct: 11 SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
W C S C S F P S+T SL CS +QC+ + SC C ++
Sbjct: 68 AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122
Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
SYG S L + +TL + +PG TFGC +GG + G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175
Query: 230 SLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYV 281
SLISQ +G FSYCL S K + + + GP + +TPL + + Y
Sbjct: 176 SLISQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYY 232
Query: 282 LTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+ + +SVG ++ + + +V IDSGT +T Q + +
Sbjct: 233 VNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG- 291
Query: 332 PVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIV 380
P++ G+ + C++ + ++ P VT+HF G ++ L N + S V
Sbjct: 292 PISS-LGAFDTCFAETNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 118/448 (26%), Positives = 190/448 (42%), Gaps = 55/448 (12%)
Query: 8 VFILFFLCFYVVSPIEA------QTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTR 60
+F L FL F + + Q G ++++ H SP SPF+ S ++ + +
Sbjct: 5 LFSLAFLFFTLAQGMHLNPKCGIQDQGSNLQVFHVYSPCSPFWPSKPLKWEESVLQMQAK 64
Query: 61 SLNRLNHFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
RL SS+ + K AS I+ + Y++R IGTP L DT +D
Sbjct: 65 DQARLQFL---SSLVARKSVVPIASGRQIV-QSPTYIVRAKIGTPAQTMLLAMDTSNDAA 120
Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG 175
W C S C S +F+ S+T+K++ C + QC + C G C ++++YG
Sbjct: 121 WIPC-----SGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSS 175
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S + NL+ + VTL + + +P TFGC T G + G++GLG G +SL+SQ
Sbjct: 176 SIA-ANLSQDVVTLATDS-----IPSYTFGCLTEATG-SSIPPQGLLGLGRGPMSLLSQT 228
Query: 236 RTTIAGKFSYCLVPVSSTKINFGTN---GIVSGPG-VVSTPLTK---AKTFYVLTIDAIS 288
+ FSYCL S +NF + G V P + +TPL K + Y + + AI
Sbjct: 229 QNLYQSTFSYCLPSFRS--LNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIR 286
Query: 289 VGNQRLGVSTPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
VG + + + + + DSGT T L + + + V G
Sbjct: 287 VGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTS-LG 345
Query: 339 SLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSV----PI 393
+ CY+ S P +T F G +V L N + + I C ++V +
Sbjct: 346 GFDTCYT--SPIVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNV 403
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N+ Q N + +D+ + CT
Sbjct: 404 IANMQQQNHRILFDVPNSRLGVAREPCT 431
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 121/415 (29%), Positives = 189/415 (45%), Gaps = 44/415 (10%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
P + E R RD L R Q+SS + Q P Y ++ +GTP
Sbjct: 33 PTNHGVELSQLRARDEL-----RHRRMLQSSSGVVDFSVQGTFDPFQVGLYYTKVQLGTP 87
Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
P E DTGSD++W C CP + FDP SST + CS +C +
Sbjct: 88 PVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGK 147
Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
Q S CS N C Y+ YGDGS ++G ++ + L GS T + A P + FGC
Sbjct: 148 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTA-P-VVFGCS 205
Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKINFGTNGI 262
G + GI G G ++S+ISQ+ + IA + FS+CL SS I
Sbjct: 206 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEI 265
Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ 314
V P +V T L A+ Y L + +ISV Q L + ++ ++DSGTTL +L +
Sbjct: 266 VE-PNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAE 324
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNF 371
+S +++ I Q V CY +S++ V P+V+++F GA + L ++
Sbjct: 325 EAYDPFVSAITAAIP-QSVRTVVSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDY 383
Query: 372 FVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
++ + + C F+ I + I G+++ + +V YD+ Q + + DC+
Sbjct: 384 LIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 438
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 172/375 (45%), Gaps = 43/375 (11%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
S++ D + N Y R+ IGTPP E + D+GS + + C C QC P F
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P +SSTY + C+ C + K+ C Y Y + S S+G L + V+ G+ +
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183
Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
+ FGC + G LF+ GI+GLG G +S++ Q+ + I FS C
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----- 238
Query: 253 TKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI----- 301
++ G +V G PG++ T ++ +Y + + + V + L V P I
Sbjct: 239 GGMDIGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVD-PRIFDGKH 297
Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSL-ELCYS-----FNSLSQV 352
V+DSGTT +LP+ +SS + + + P + ++C++ + LS+V
Sbjct: 298 GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEV 357
Query: 353 -PEVTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
P+V + F G + LS N+ + S E C VF+ + + G I+ N LV YD
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417
Query: 408 IEQQTVSFKPTDCTK 422
+ + F T+C++
Sbjct: 418 RHNEKIGFWKTNCSE 432
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 171/374 (45%), Gaps = 41/374 (10%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
S++ D + N Y R+ IGTPP E + D+GS + + C C QC P F
Sbjct: 70 SARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 127
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P +SSTY + CS+ C + KS C Y Y + S S+G L + V+ G T
Sbjct: 128 QPDLSSTYSPVKCSAD-CTCDSDKS----QCTYERQYAEMSSSSGVLGEDIVSFG--TES 180
Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
+ FGC + G LF+ GI+GLG G +S++ Q+ + I FS C
Sbjct: 181 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----- 235
Query: 253 TKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPD 300
++ G +V G P +V + ++ +Y + + I V + L + S
Sbjct: 236 GGMDIGGGAMVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHG 295
Query: 301 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV- 352
V+DSGTT +LP Q + + +V S + + + P + ++C++ + LSQ
Sbjct: 296 TVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAF 355
Query: 353 PEVTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDI 408
P+V + F G + LS N+ + S E C VF+ + + G I+ N LV YD
Sbjct: 356 PDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 415
Query: 409 EQQTVSFKPTDCTK 422
+ + F T+C++
Sbjct: 416 HNEKIGFWKTNCSE 429
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 192/415 (46%), Gaps = 44/415 (10%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
P ++ E R RDAL R Q+S+ + Q P Y ++ +GTP
Sbjct: 30 PTNHTVELSQLRARDAL-----RHRRMLQSSNGVVDFSVQGTFDPFQVGLYYTKVQLGTP 84
Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
P E DTGSD++W C CP + FDP SST + CS +C +
Sbjct: 85 PVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGI 144
Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
Q S CS N C Y+ YGDGS ++G ++ + L GS T + A P + FGC
Sbjct: 145 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTA-P-VVFGCS 202
Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKINFGTNGI 262
G + GI G G ++S+ISQ+ + IA + FS+CL SS I
Sbjct: 203 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEI 262
Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ 314
V P +V T L A+ Y L + +I+V Q L + ++ ++DSGTTL +L +
Sbjct: 263 VE-PNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAE 321
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNF 371
+S +++ I Q V CY +S+++V P+V+++F GA + L ++
Sbjct: 322 EAYDPFVSAITASIP-QSVHTVVSRGNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDY 380
Query: 372 FVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
++ + + C F+ I + I G+++ + +V YD+ Q + + DC+
Sbjct: 381 LIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 435
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 172/368 (46%), Gaps = 43/368 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W C CP S + FD SST +P
Sbjct: 84 YTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVP 143
Query: 148 CSSSQCASLNQKS---CS-GVN-CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
CS CAS Q + CS VN C Y+ Y DGS ++G ++ + LG +T VA
Sbjct: 144 CSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVA 203
Query: 199 LPG-ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
I FGC T G + GI+G G G++S++SQ+ R FS+CL
Sbjct: 204 SSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL----- 258
Query: 253 TKINFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPD-- 300
K + GI + P +V +PL ++ Y L + +I+V Q L + +T D
Sbjct: 259 -KGDGNGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKR 317
Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-VPEVTIH 358
+IDSGTTL++L Q L++ + + + + + + S+ P V+ +
Sbjct: 318 GTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQCYLVLTSIDDSFPTVSFN 377
Query: 359 FR-GADVKLSRSNFFV----KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
F GA + L S + + + + C F+ + V I G+++ + +V YD+ +Q +
Sbjct: 378 FEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQQI 437
Query: 414 SFKPTDCT 421
+ DC+
Sbjct: 438 GWTNYDCS 445
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 175/386 (45%), Gaps = 68/386 (17%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C P + S F P+ SST+ ++
Sbjct: 81 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMS--FRPRASSTFAAV 138
Query: 147 PCSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PC+S+QC S + S C G C S+SY DGS S+G LAT+ +GS A
Sbjct: 139 PCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAA--- 195
Query: 202 ITFGCGTNNGGLFNS-----KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI- 255
FGC ++ F+S + G++G+ G +S +SQ T +FSYC+ +
Sbjct: 196 --FGCMSSA---FDSSPDGVASAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVL 247
Query: 256 NFGTNGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD--- 300
G + + + + TP+ + + Y + + I VG + L V PD
Sbjct: 248 LLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTG 307
Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA----DPT----GSLELCYSFNS- 348
++DSGT TFL S L + + +A+P+ DP+ + + C+
Sbjct: 308 AGQTMVDSGTQFTFLLGDAYSALKAEFTR--QARPLLPALDDPSFAFQEAFDTCFRVPQG 365
Query: 349 ----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVPIY---- 394
+++P VT+ F GA++ ++ KV + + C F G + VPI
Sbjct: 366 RSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTF-GNADMVPIMAYVI 424
Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
G+ Q N V YD+E+ V P C
Sbjct: 425 GHHHQMNVWVEYDLERGRVGLAPVRC 450
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 153/355 (43%), Gaps = 32/355 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
Y + IS+GTPP L DTGS L W QC+ C +CY Q + +F+P SSTY +
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 64
Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CS+ C ++ + C + C YS+ YG G +S G L + +TL S ++
Sbjct: 65 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 120
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSSTKINFGT 259
FGCG +N L+N GI+G G S +Q+ + T FSYC + +
Sbjct: 121 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI 178
Query: 260 NGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFL 312
++ T L K Y + + V RL + + ++DSGT T++
Sbjct: 179 GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTADTYI 238
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRGADVKLSR 368
L M+ ++A+ +C+ NS + P V + + +KL
Sbjct: 239 LSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTLKLPV 298
Query: 369 SNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N F + S +++CS F V + GN +F + +DI+ FK C
Sbjct: 299 ENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 353
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 110/414 (26%), Positives = 181/414 (43%), Gaps = 45/414 (10%)
Query: 36 RDSPKSPFYNSSETPY---QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
R +P P + Y RL +L R L H N ++ D + N Y
Sbjct: 37 RPAPGPPLFLPLTRSYPNASRLAASLRRGLGDGVHPN-------ARMRLHDDLLTNGYYT 89
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
R+ IGTPP E + D+GS + + C C QC P F P +SS+Y + C+
Sbjct: 90 TRLYIGTPPQEFALIVDSGSTVTYVPCSSC--EQCGNHQDPRFQPDLSSSYSPVKCNVDC 147
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
++K C+ Y Y + S S+G L + V+ G + + FGC + G
Sbjct: 148 TCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQHAIFGCENSETG 200
Query: 212 GLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV 269
LF+ GI+GLG G +S++ Q+ + I+ FS C + G+++ P ++
Sbjct: 201 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMI 260
Query: 270 ---STPLTKAKTFYVLTIDAISVGNQRLGV------STPDIVIDSGTTLTFLP-QGYNSN 319
S PL +Y + + I V + L V S V+DSGTT +LP Q + +
Sbjct: 261 FSNSDPLRSP--YYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQAFVAF 318
Query: 320 LLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNF 371
+V S + + + P S ++C++ + L +V P+V + F G + L+ N+
Sbjct: 319 KEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENY 378
Query: 372 FV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
KV VF+ + + G I+ N LV YD + + F T+C++
Sbjct: 379 LFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCSE 432
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 166/374 (44%), Gaps = 44/374 (11%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP D+GSDL W QC+ PC C PL+ P S
Sbjct: 58 GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CASL+ + C + C Y + Y D S G L ++ L T
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171
Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
G +VA P + FGCG + G +S T G++GLG G +SL+SQ++ K +CL
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + +V TP+ ++ + +Y ++ G++ LGV +V DSG
Sbjct: 231 LRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 289
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
++ T+ L++ + + +P SL LC+ F S+ V + + ++
Sbjct: 290 SSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLN 349
Query: 359 FRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
F +++ N+ + C GI N + I G+I + +V YD
Sbjct: 350 FASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYDN 406
Query: 409 EQQTVSFKPTDCTK 422
E+ + + C +
Sbjct: 407 EKGKIGWIRAPCDR 420
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 124/444 (27%), Positives = 186/444 (41%), Gaps = 50/444 (11%)
Query: 22 IEAQTG-GFSVELIHR--DSPKS--------------PFYNSSETPYQRLRDALTRSLNR 64
EA G FS +LIHR D KS P S E L + L R +
Sbjct: 20 FEASIGLTFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDLKRQRMK 79
Query: 65 LNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE-- 120
L +N + S+ SQA N ++L I IGTP L D GSDL+W C+
Sbjct: 80 LGS-QKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCI 138
Query: 121 PCPP-SQCYM-----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGD 174
C P S Y +D + P +SST + L C C + C Y +Y D
Sbjct: 139 QCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDD 198
Query: 175 --GSFSNGNLATETVTL---GSTTGQAVALPGITFGCGTNNGGLF--NSKTTGIVGLGGG 227
+ S G L + + L G T + + + GCG GG F + G++GLG G
Sbjct: 199 FENTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPG 258
Query: 228 DISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTID 285
DIS+ S + I FS C S +I FG G S P+ Y + ++
Sbjct: 259 DISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVE 318
Query: 286 AISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
+ VGN L S ++DSG++ T+LP + L+S + A+ ++ G + CY+
Sbjct: 319 SYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYN 378
Query: 346 FNS--LSQVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVFKGITNSVPIYGNIMQT 400
+S L +P + + F R + + + + + + C + S YG I Q
Sbjct: 379 ASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGS---YGIIGQ- 434
Query: 401 NFLVGY----DIEQQTVSFKPTDC 420
NF++GY DIE + + + C
Sbjct: 435 NFMIGYRMVFDIENLKLGWSNSSC 458
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 164/364 (45%), Gaps = 35/364 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ S G A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANS 210
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
I FGC + G + GI G G +S++SQ+ + ++ K FS+CL S
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 269
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
G + PG+V TPL ++ Y L +++I V Q+L + +T ++DS
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 329
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
GTTL +L G ++ +++ + V C+ +S S P V+++F G
Sbjct: 330 GTTLAYLADGAYDPFVNAITAAVSPS-VRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGV 388
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+ ++ I +V I + I G+++ + + YD+ + +
Sbjct: 389 AMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTD 448
Query: 418 TDCT 421
DC+
Sbjct: 449 YDCS 452
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 174/379 (45%), Gaps = 64/379 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP V DTGS+L W C+ P +F+P SSTY +
Sbjct: 57 HNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 110
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PCSS C + + SC C ++SY D + GNLA +T +GS T
Sbjct: 111 PCSSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVT-----R 165
Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
PG FGC G ++ ++K+TG++G+ G +S ++Q+ + KFSYC+ S+ I
Sbjct: 166 PGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGIL 222
Query: 257 FGTNGIVSGPGVVS-TPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD--- 300
+ S G + TPL T Y + ++ I VG++ L V PD
Sbjct: 223 LLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 282
Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSFNS--- 348
++DSGT TFL + L + + ++ + V DP G+++LCY S
Sbjct: 283 AGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTR 342
Query: 349 --LSQVPEVTIHFRGADVKLSRSNFFVKVS-------EDIVCSVFKG---ITNSVPIYGN 396
+ +P +++ FRGA++ +S +V+ E++ C F + + G+
Sbjct: 343 PNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGH 402
Query: 397 IMQTNFLVGYDIEQQTVSF 415
Q N + +D+ + V F
Sbjct: 403 HHQQNVWMEFDLAKSRVGF 421
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 153/355 (43%), Gaps = 32/355 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
Y + IS+GTPP L DTGS L W QC+ C +CY Q + +F+P SSTY +
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 83
Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CS+ C ++ + C + C YS+ YG G +S G L + +TL S ++
Sbjct: 84 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 139
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSSTKINFGT 259
FGCG +N L+N GI+G G S +Q+ + T FSYC + +
Sbjct: 140 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI 197
Query: 260 NGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFL 312
++ T L K Y + + V RL + + ++DSGT T++
Sbjct: 198 GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTADTYI 257
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRGADVKLSR 368
L M+ ++A+ +C+ NS + P V + + +KL
Sbjct: 258 LSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTLKLPV 317
Query: 369 SNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N F + S +++CS F V + GN +F + +DI+ FK C
Sbjct: 318 ENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 178/387 (45%), Gaps = 60/387 (15%)
Query: 83 DIIP--NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
D +P +N + + +++GTPP V DTGS+L W C SQ S F+P S
Sbjct: 63 DKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCN---TSQNSSSSSSTFNPVWS 119
Query: 141 STYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S+Y +PCSSS C + SC S C ++SY D S S GNLAT+T +GS+
Sbjct: 120 SSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSS-- 177
Query: 195 QAVALPGITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
+P + FGC ++ +SK TG++G+ G +S +SQM KFSYC+
Sbjct: 178 ---GIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCISEYD 231
Query: 252 STKI------NFGTNGIVSGPGVV--STPLTK-AKTFYVLTIDAISVGNQRL----GVST 298
+ + NF ++ ++ STPL + Y + ++ I V ++ L V
Sbjct: 232 FSGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFE 291
Query: 299 PD------IVIDSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPT-----GSLELCYSF 346
PD ++DSGT TF L Y + ++ + V + + G+++LCY
Sbjct: 292 PDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRV 351
Query: 347 ----NSLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPI 393
L +P VT+ FRGA++ ++ +V ++ I C F + +
Sbjct: 352 PTNQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFV 411
Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
G++ Q N + +D+++ + C
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLAEIRC 438
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 153/369 (41%), Gaps = 51/369 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+Y++R +G+P L DT +D W C PC C S LF P S++Y LPCS
Sbjct: 76 SYVVRAGLGSPAQPILLALDTSADATWAHCSPC--GTCPSSGS-LFAPANSTSYAPLPCS 132
Query: 150 SSQCASLNQKSCSGVN----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
S+ C L + C + C ++ + D SF +LA++ + LG A+
Sbjct: 133 STMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKD-----AI 186
Query: 200 PGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS----STK 254
P FGC +G N G++GLG G ++L+SQ+ G FSYCL S
Sbjct: 187 PNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGS 246
Query: 255 INFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP------------ 299
+ G G GV TP+ K + Y + + +SVG R V P
Sbjct: 247 LRLGAAGQPR--GVRYTPMLKNPNRSSLYYVNVTGLSVG--RAPVKVPAGSFAFDPATGA 302
Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 357
V+DSGT +T + L + A G+ + C++ + ++ P VT+
Sbjct: 303 GTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTV 362
Query: 358 HFRGA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQ 411
H G D+ L N + S + C + + V + N+ Q N V +D+
Sbjct: 363 HMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANS 422
Query: 412 TVSFKPTDC 420
V F C
Sbjct: 423 RVGFARESC 431
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 164/364 (45%), Gaps = 35/364 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ + G A +
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
I FGC + G + GI G G +S++SQ+ + ++ K FS+CL S
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 295
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
G + PG+V TPL ++ Y L +++I V Q+L + +T ++DS
Sbjct: 296 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 355
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
GTTL +L G ++ +++ + V C+ +S S P V+++F G
Sbjct: 356 GTTLAYLADGAYDPFVNAITAAVSPS-VRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGV 414
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+ ++ I +V I + I G+++ + + YD+ + +
Sbjct: 415 AMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTD 474
Query: 418 TDCT 421
DC+
Sbjct: 475 YDCS 478
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 48/370 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PP + DTGSD++W +C CP D L+DPK S T +
Sbjct: 70 YFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVS 129
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G + C YS++YGDGS + G + +T G P
Sbjct: 130 CDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNS 189
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
I FGCG G S + GI+G G + S++SQ+ + + FS+CL V
Sbjct: 190 SIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGG 249
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL--------GVSTPDIVIDSG 306
I F +V P V +TPL Y + + +I V L V+ VIDSG
Sbjct: 250 I-FAIGEVVE-PKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSG 307
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC--------YSFNSLSQVPEVTIH 358
TTL +LP V +I+ P L L Y+ N P V +H
Sbjct: 308 TTLAYLPD-------IVYDELIQKVLARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVKLH 360
Query: 359 FRGA-DVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQ 411
F+ + + + ++ + + I C ++ + + G+++ +N LV YD+E
Sbjct: 361 FKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENM 420
Query: 412 TVSFKPTDCT 421
+ + +C+
Sbjct: 421 VIGWTDYNCS 430
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 164/364 (45%), Gaps = 35/364 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ + G A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
I FGC + G + GI G G +S++SQ+ + ++ K FS+CL S
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 269
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
G + PG+V TPL ++ Y L +++I V Q+L + +T ++DS
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 329
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
GTTL +L G ++ +++ + V C+ +S S P V+++F G
Sbjct: 330 GTTLAYLADGAYDPFVNAITAAVSPS-VRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGV 388
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+ ++ I +V I + I G+++ + + YD+ + +
Sbjct: 389 AMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTD 448
Query: 418 TDCT 421
DC+
Sbjct: 449 YDCS 452
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 159/368 (43%), Gaps = 39/368 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G P + DTGSD++W C P CP ++DP+ SST +
Sbjct: 29 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 88
Query: 148 CSSSQCA---SLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
CS C + CS NC+Y SYGDGS S G + + S+ G A
Sbjct: 89 CSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 148
Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
+ FGC G ++ GI+G G ++S+ +Q+ + I FS+CL
Sbjct: 149 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGG 207
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGT 307
G ++ PG+ TPL Y + + ISV + RL + D +++DSGT
Sbjct: 208 GILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGT 267
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKL 366
TL + P G + + + A PV + LS + P VT++F G ++L
Sbjct: 268 TLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMEL 327
Query: 367 SRSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQT 412
N+ + + D+ C ++ ++S + I G+I+ + LV YD++
Sbjct: 328 QPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSR 387
Query: 413 VSFKPTDC 420
+ + +C
Sbjct: 388 IGWMSYNC 395
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 125/261 (47%), Gaps = 30/261 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP DTGSD++W C+ CP + L+DPK SST +
Sbjct: 33 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 92
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ L + + C+YSV+YGDGS + G ++ + +G P
Sbjct: 93 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 152
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+TFGCG+ GG N GI+G G + S++SQ+ + AGK F++CL ++
Sbjct: 153 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCLDTINGG 210
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
I F +V P V +TPL Y + + +I VG L + +IDS
Sbjct: 211 GI-FAIGNVVQ-PKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 268
Query: 306 GTTLTFLPQ-GYNSNLLSVMS 325
GTTLT+LP+ Y +L+V +
Sbjct: 269 GTTLTYLPEIVYKEIMLAVFA 289
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 166/374 (44%), Gaps = 51/374 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--PPSQCYMQDSPL--FDPKMSSTYKS 145
Y +R +GTP + VADTGSDL W +C PP+ D P F S ++
Sbjct: 13 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPAS----DPPAREFRASESRSWAP 68
Query: 146 LPCSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG---------- 190
L CSS C S L S C Y Y DGS + G + T+ T+
Sbjct: 69 LACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGS 128
Query: 191 STTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
G+ L G+ GC T +G F S + G++ LG +IS S+ G+FSYCLV
Sbjct: 129 GGGGRRAKLQGVVLGCTATYDGQSFQS-SDGVLSLGNSNISFASRAAARFGGRFSYCLVD 187
Query: 250 V-----SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI 301
+S+ + FG G TPL + FY + +DA+ V + L + D+
Sbjct: 188 HLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPA-DV 246
Query: 302 ---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNS-L 349
++DSGT+LT L +++ + + A P DP E CY++ +
Sbjct: 247 WDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FEYCYNWTAGA 303
Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
++P++ + F G A ++ ++ + + + C V +G V + GNI+Q L +D
Sbjct: 304 PEIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFD 363
Query: 408 IEQQTVSFKPTDCT 421
+ + + FK T C
Sbjct: 364 LRDRWLRFKHTRCA 377
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 177/388 (45%), Gaps = 50/388 (12%)
Query: 65 LNHFNQNSSISSSKASQA--DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+ +S S+KA Q D + Y+I + +GTP ++ DTGS W CE C
Sbjct: 54 FRYITNKTSRLSTKAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C 112
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSF 177
C+ + S+T + C +S C Q S + +C + VSY DGS
Sbjct: 113 --DGCHTNPRTFLQSR-STTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSA 169
Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN-SKTTGIVGLGGGDISLISQMR 236
S G L +T+T +PG +FGC ++ G G++G+G G +S++ Q
Sbjct: 170 SYGILYQDTLTFSDVQ----KIPGFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSS 225
Query: 237 TTIAGKFSYCLVPVSSTKINF--GTNGIVSGPGVVST----------PLTKAKTFYVLTI 284
T FSYCL P+ ++ F T G S G V+T K + + +
Sbjct: 226 PTFDC-FSYCL-PLQKSERGFFSKTTGYFS-LGKVATRTDVRYTKMVARKKNTELFFVDL 282
Query: 285 DAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADP 336
AISV +RLG+ S +V DSG+ L+++P LSV+S I + A
Sbjct: 283 TAISVDGERLGLSPSVFSRKGVVFDSGSELSYIPD----RALSVLSQRIRELLLKRGAAE 338
Query: 337 TGSLELCYSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVS---EDIVCSVFKGITNS 390
S CY S+ + +P +++HF GA L FV+ S +D+ C F T S
Sbjct: 339 EESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF-APTES 397
Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
V I G++MQT+ V YD+++Q + P+
Sbjct: 398 VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)
Query: 78 KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
K QA + N +L++++IG P A+ DTGSDL WTQC PC S CY Q +P++DP
Sbjct: 8 KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPC--SDCYKQPTPIYDP 65
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+SSTY ++ C SS C +L +C C+Y +YGD S + G L+ ET TL S +
Sbjct: 66 SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121
Query: 198 ALPGITFGCGTNNGG 212
+P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 165/358 (46%), Gaps = 33/358 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C QC P F P +SSTY+S+
Sbjct: 10 NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSC--EQCGRHQDPKFQPDLSSTYQSVK 67
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC- 206
C+ C ++K C Y Y + S S+G L + ++ G+ + A+A FGC
Sbjct: 68 CNID-CNCDDEKQ----QCVYERQYAEMSTSSGVLGEDIISFGNLS--ALAPQRAVFGCE 120
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
G L++ GI+G+G GD+S++ + + I FS C + GI
Sbjct: 121 NMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISP 180
Query: 265 GPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQ-G 315
+V + ++ +Y + + I V + L ++ P + ++DSGTT +LP+
Sbjct: 181 PSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLN-PTVFDGKHGTILDSGTTYAYLPEAA 239
Query: 316 YNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----PEVTIHF-RGADVKLS 367
+ S ++M + +P+ P + ++C+S + +SQ+ P V + F G + LS
Sbjct: 240 FVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLLS 299
Query: 368 RSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
N+ KV +F+ + + G I+ N LV YD E + F T+C++
Sbjct: 300 PENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKIGFWKTNCSE 357
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 166/360 (46%), Gaps = 48/360 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+I + +GTP ++ DTGS W CE C C+ + S+T + C +
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 137
Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
S C Q S + +C + VSY DGS S G L +T+T +P TFG
Sbjct: 138 SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPSFTFG 193
Query: 206 CGTNNGGLFN-SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF--GTNGI 262
C ++ G G++G+G G +S++ Q G FSYCL P+ ++ F T G
Sbjct: 194 CNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCL-PLQKSERGFFSKTTGY 251
Query: 263 VSGPGVVST----------PLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGT 307
S G V+T K + + + AISV +RLG+ S +V DSG+
Sbjct: 252 FS-LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 310
Query: 308 TLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-RG 361
L+++P LSV+S I + A S CY S+ + +P +++HF G
Sbjct: 311 ELSYIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDG 366
Query: 362 ADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
A L FV+ S +D+ C F T SV I G++MQT+ V YD+++Q + P+
Sbjct: 367 ARFDLGSHGVFVERSVQEQDVWCLAF-APTESVSIIGSLMQTSKEVVYDLKRQLIGIGPS 425
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/363 (25%), Positives = 170/363 (46%), Gaps = 33/363 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G P E DTGSD++W C P CP S F+P SST +P
Sbjct: 89 YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148
Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
CS +C + Q C + C Y+ +YGDGS ++G ++T+ + G A
Sbjct: 149 CSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTA 208
Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
+ + FGC + G + GI G G +S++SQ+ + ++ K FS+CL S
Sbjct: 209 NSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL-KGS 267
Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
G + PG+V TPL ++ Y L +++I+V Q+L + +T ++
Sbjct: 268 DNGGGILVLGEIVEPGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNTQGTIV 327
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-SQVPEVTIHFRGA 362
DSGTTL +L G ++ +++ + + + ++ + +S+ S P T++F+G
Sbjct: 328 DSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPTATLYFKGG 387
Query: 363 -DVKLSRSNFFVK---VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ + N+ ++ V +++ + + + I G+++ + + YD+ + +
Sbjct: 388 VSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVYDLANMRMGWADY 447
Query: 419 DCT 421
DC+
Sbjct: 448 DCS 450
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 164/380 (43%), Gaps = 73/380 (19%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+I + IGTPP + V DTGS L W QC + PP + FDP +SS++ +LPCS
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127
Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C +L S C YS Y DG+F+ GNL E +T +T P +
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG-- 261
GC T +S GI+G+ G +S +SQ + + KFSYC +P S + F G
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAKIS---KFSYC-IPPKSNRPGFTPTGSF 234
Query: 262 -IVSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----TPD--- 300
+ P TF Y + + I G ++L +S PD
Sbjct: 235 YLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGG 294
Query: 301 ---IVIDSGTTLTFL-PQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVP 353
++DSG+ T L Y+ +M+ + ++ V G+ ++C+ N ++ +P
Sbjct: 295 SGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYG--GTADMCFDGN-VAMIP 351
Query: 354 E-----VTIHFRGADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNF 402
V + RG ++ + + V V I C S+ +N I GN+ Q N
Sbjct: 352 RLIGDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNL 408
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
V +D+ + V F DC++
Sbjct: 409 WVEFDVTNRRVGFAKADCSR 428
>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
Length = 304
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 159/351 (45%), Gaps = 69/351 (19%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP-----LFDPKMSSTYKSLP 147
+ +++GTPP A+ SDL W +C PC S C +P L+D SS++ P
Sbjct: 1 MELAVGTPPVTVQALFGI-SDLCWVECTPC--SGCNNNAAPPAGARLYDRANSSSFS--P 55
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ ++C G Y + D ++ G L TET+ GS A + TFGC
Sbjct: 56 LADTEC---------GYRYVYGATDTDRNYVKGILGTETIKFGSN--DAATVQSFTFGC- 103
Query: 208 TN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV--PVSSTKINFGTNGI 262
TN LF+ T G+VGLG +SL+ Q+ +FSYCL P ++ + FG+
Sbjct: 104 TNTVYRNDLFDGNT-GVVGLGRSKLSLVGQLGLD---RFSYCLASNPNVASPVLFGSTAS 159
Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 322
+ G GV STPL Y + + ISV RL + N +
Sbjct: 160 MDGNGVSSTPLLPDDANYYVNLLGISVDGTRLAIP---------------------NDTA 198
Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFRGADVKLSRSNFFVKVSE- 377
MS EA GS LC+ + S+ VP +T+HF G D++L N+F +
Sbjct: 199 RMSRTYEAV-----NGSGLLCFLVDDASKNVVTVPTMTMHFDGMDMELLFGNYFAYTGKQ 253
Query: 378 ------DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
D++C + G +++ GN +Q +F V Y+++ +S +P DC K
Sbjct: 254 SGGGGGDVLC-LMIGKSSTGSRIGNYLQMDFHVLYELKNSVLSVQPADCGK 303
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 123/515 (23%), Positives = 214/515 (41%), Gaps = 125/515 (24%)
Query: 1 MATFLSC-VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETP-YQRLRDAL 58
MAT SC F+ F LCF +S ++ + L H S N+ T + L+
Sbjct: 1 MAT--SCYAFLCFILCFSCISVSISEI--LYLPLTHSLS------NTQFTSTHHLLKSTS 50
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV-ADTGSDLIWT 117
+RS +R H +Q + + + P ++Y + ++ + P + +++ DTGSDL+W
Sbjct: 51 SRSASRFQHQHQKRHLRNRHQVSLPLSPG-SDYTLSFTLNSNPPQHVSLYLDTGSDLVWF 109
Query: 118 QCEPCPPSQCYMQDSPLFD-------PKMSSTYKSLPCSSSQCA---------------- 154
PC P +C + + + P++SST +S+ C SS C+
Sbjct: 110 ---PCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCAIAD 166
Query: 155 ----SLNQKSCSGVNC-QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
S+ C +C + +YGDGS L +++ L T +++L TFGC
Sbjct: 167 CPLESIETSDCHSFSCPSFYYAYGDGSLV-ARLYHDSIKLPLAT-PSLSLHNFTFGCAHT 224
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRT---TIAGKFSYCLVP----------------- 249
++ G+ G G G +SL +Q+ + + +FSYCLV
Sbjct: 225 A----LAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILG 280
Query: 250 --------VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD- 300
V+ + F ++ P K FY + ++ IS+G ++ + P+
Sbjct: 281 HSDDKEKRVNKDDVQFVYTSMLDNP--------KHPYFYCVGLEGISIGKKK--IPAPEF 330
Query: 301 -----------IVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCY 344
+V+DSGTT T LP +++++ + + A+ V D TG L CY
Sbjct: 331 LKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTG-LGPCY 389
Query: 345 SFNSLSQVPEVTIHFRGAD--VKLSRSNFF---------VKVSEDIVCSVFKGITNSVPI 393
++++ +P + +HF G + V L + N+F V+ + C + +
Sbjct: 390 YYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAEL 449
Query: 394 -------YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
GN Q F V YD+EQ+ V F C
Sbjct: 450 TGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCA 484
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 156/367 (42%), Gaps = 42/367 (11%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
L IG P + DTGSD +W C CP + L+DP S T K +PC
Sbjct: 76 LYYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPC 135
Query: 149 SSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C S SG ++C YS++YGDGS ++G+ + +T G +P
Sbjct: 136 DDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 195
Query: 202 ITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
+ FGCG+ G +S T GI+G G + S++SQ+ AGK FS+CL V+
Sbjct: 196 VIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAA--AGKVKRVFSHCLDTVNGG 253
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
I F +V P V +TPL Y + + I V + + T DI +ID
Sbjct: 254 GI-FAIGEVVQ-PKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPT-DIFDSTSGRGTIID 310
Query: 305 SGTTLTFLPQGYNSNLLS---VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-R 360
SGTTL +LP LL S +E V D + P V F
Sbjct: 311 SGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKFTFEE 370
Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVS 414
G + ++ ED+ C ++ T + + G+++ TN L YD++ ++
Sbjct: 371 GLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIYDLDNMSIG 430
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 431 WTDYNCS 437
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 171/397 (43%), Gaps = 43/397 (10%)
Query: 42 PFYNSSETPYQRLRDAL-------TRSLNRLNHFNQNS-SISSSKASQADIIPNNANYLI 93
PF+N E P + SL +H ++N S+ + + I +N+L+
Sbjct: 130 PFHNQEEFPQTFSSSSSFKLKLYPAASLYNTHHQHKNYYSLDLNASLNPGITTGTSNFLV 189
Query: 94 RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
+I +G PP + + D +D W QC+PC +CY Q +FDP SS+Y L C + C
Sbjct: 190 QIGVGGPPQKFYMIFDLQTDFTWLQCQPCI--KCYDQPDSIFDPSQSSSYTLLSCETKHC 247
Query: 154 ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
L SCS C+Y+++Y DG+ + G L ETV+ S+ + ++ GC N G
Sbjct: 248 NLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG----WVDRVSLGCSNKNQG 303
Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP----VSSTKINFGT---NGIVSG 265
F + G GLG G +S S++ A SYCLV SS+ + F + +G V
Sbjct: 304 PF-VGSDGTFGLGRGSLSFPSRIN---ASSMSYCLVESKDGYSSSTLEFNSPPCSGSVKA 359
Query: 266 PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVID----------SGTTLTFLPQG 315
++ P KA+ Y + + I VG +++ V ID S + +T L
Sbjct: 360 K-LLQNP--KAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITMLEND 416
Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK---LSRSNFF 372
+ + + + + CY+ +S + V + F D K L + ++
Sbjct: 417 TYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVNDGKSWLLPKESYL 476
Query: 373 VKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
V ++ C F S I G + Q V +D+
Sbjct: 477 YAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDL 513
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 164/380 (43%), Gaps = 73/380 (19%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+I + IGTPP + V DTGS L W QC + PP + FDP +SS++ +LPCS
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127
Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C +L S C YS Y DG+F+ GNL E +T +T P +
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG-- 261
GC T +S GI+G+ G +S +SQ + + KFSYC +P S + F G
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAKIS---KFSYC-IPPKSNRPGFTPTGSF 234
Query: 262 -IVSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----TPD--- 300
+ P TF Y + + I G ++L +S PD
Sbjct: 235 YLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGG 294
Query: 301 ---IVIDSGTTLTFL-PQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVP 353
++DSG+ T L Y+ +M+ + ++ V G+ ++C+ N ++ +P
Sbjct: 295 SGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYG--GTADMCFDGN-VAMIP 351
Query: 354 E-----VTIHFRGADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNF 402
V + RG ++ + + V V I C S+ +N I GN+ Q N
Sbjct: 352 RLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNL 408
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
V +D+ + V F DC++
Sbjct: 409 WVEFDVTNRRVGFAKADCSR 428
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)
Query: 78 KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
K QA + N +L++++IG P A+ DTGSDL WTQC PC S CY Q +P++DP
Sbjct: 8 KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPC--SDCYKQPTPIYDP 65
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+SSTY ++ C SS C +L +C C+Y +YGD S + G L+ ET TL S +
Sbjct: 66 SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121
Query: 198 ALPGITFGCGTNNGG 212
+P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 118/446 (26%), Positives = 196/446 (43%), Gaps = 49/446 (10%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELI---HRDSPKSPFYNSSETPYQRLRDALTR 60
+S + IL F+ Y S + G +I + SPKS + R A+
Sbjct: 8 LISAIVILSFVTIYSSSASQIPNRGVRRPMIFPLYFASPKSSGH----------RQAIEG 57
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S R + + +++ D + +N Y R+ IGTPP E + DTGS + + C
Sbjct: 58 SYWRRHLKSDPYHHPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCS 117
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCASLNQKSCSGVNCQYSVSYGDGSFSN 179
C C P F P SSTY + C+ C GVNC Y Y + S S+
Sbjct: 118 DC--EHCGKHQDPRFQPDESSTYHPVKCNMDCNCDH------DGVNCVYERRYAEMSSSS 169
Query: 180 GNLATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM-- 235
G L + ++ G+ Q+ +P FGC G L++ + GI+GLG G +S++ Q+
Sbjct: 170 GVLGEDIISFGN---QSEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVD 226
Query: 236 RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ-- 292
+ I FS C + GI P +V + ++ +Y + + I V +
Sbjct: 227 KNVINDSFSLCYGGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPL 286
Query: 293 RLGVSTPD----IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS- 345
+L ST D V+DSGTT +LP + + + +++ + + P + ++C+S
Sbjct: 287 KLSPSTFDRKHGTVLDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSG 346
Query: 346 ----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGN 396
+ LS+ PEV + F G + L+ N+ KV +F+ +S + G
Sbjct: 347 AGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRN-GDSTTLLGG 405
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDCTK 422
I+ N LV YD E + + F T+C++
Sbjct: 406 IIVRNTLVTYDRENEKIGFWKTNCSE 431
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 171/388 (44%), Gaps = 69/388 (17%)
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ +N + + +++G+PP V DTGS+L W C+ + +S +F+P S TY
Sbjct: 62 LFHHNVSLTVSLTVGSPPQNVTMVLDTGSELSWLHCK-----KTQFLNS-VFNPLSSKTY 115
Query: 144 KSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+PC S C + + SC C VSY D + GNLA ET LGS T
Sbjct: 116 SKVPCLSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK--- 172
Query: 198 ALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
P FGC G ++ +SKTTG++G+ G +S ++QM KFSYC+ S
Sbjct: 173 --PATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP---KFSYCISGFDSAG 227
Query: 255 INFGTNGIVSGPGV----------VSTPLTK-AKTFYVLTIDAISVGNQRL----GVSTP 299
+ N S P + +STPL + Y + ++ I V N+ L V P
Sbjct: 228 VLLLGNA--SFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVP 285
Query: 300 D------IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYS 345
D ++DSGT TFL + LS +++ + D G+++LCY
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKV--LNDDNFVFQGAMDLCYL 343
Query: 346 FNS----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVP 392
+S L +P V++ F+GA++ +S +V + + C F +
Sbjct: 344 LDSSRPNLQNLPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAF 403
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ G+ Q N + +D+E+ + C
Sbjct: 404 VIGHHHQQNVWMEFDLEKSRIGLADVRC 431
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 132/329 (40%), Gaps = 72/329 (21%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 169 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 228
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G +
Sbjct: 229 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTM---- 280
Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA 286
F +V P ++ T Y++ +
Sbjct: 281 ------------------------------FARTPLVRNPSII-------PTLYLVRLRG 303
Query: 287 ISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
I VG +RL V V+DS +T L P Y + L+ S+M VA L+
Sbjct: 304 IEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLD 363
Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------- 392
CY F + VP V++ F G V V D + + +G VP
Sbjct: 364 TCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCLAFVPTPGDFAL 413
Query: 393 -IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN+ Q V YD+ +V F+ C
Sbjct: 414 GFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 151/347 (43%), Gaps = 63/347 (18%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+ RD + F NS Y S N NH + N ++ + N+
Sbjct: 88 EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ ++ GTPP + + DTGS + WTQC+ C C F+ SSTY S C
Sbjct: 129 LVDVAFGTPPQNFMLILDTGSSITWTQCKAC--VNCLQDSHRYFNWSASSTYSSGSCIPG 186
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
V Y+++YGD S S GN +T+TL + FGCG NN
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG--------- 261
G F S G++GLG G +S +SQ + FSYCL S + FG
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKF 291
Query: 262 --IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
+V+GPG + + +Y + + ISVGN+RL + ++P +IDS T +T LPQ
Sbjct: 292 TSLVNGPGTL-----QESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQ 346
Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQVPEVTI 357
S L + + P+++ L+ CY+ PE+TI
Sbjct: 347 RAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYN-XXXXXXPELTI 392
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 132/329 (40%), Gaps = 72/329 (21%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G +
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTM---- 262
Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA 286
F +V P ++ T Y++ +
Sbjct: 263 ------------------------------FARTPLVRNPSII-------PTLYLVRLRG 285
Query: 287 ISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
I VG +RL V V+DS +T L P Y + L+ S+M VA L+
Sbjct: 286 IEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLD 345
Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------- 392
CY F + VP V++ F G V V D + + +G VP
Sbjct: 346 TCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCLAFVPTPGDFAL 395
Query: 393 -IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN+ Q V YD+ +V F+ C
Sbjct: 396 GFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 115/420 (27%), Positives = 171/420 (40%), Gaps = 87/420 (20%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNA----NY----------LIRISIGTPPTERLAVADT 110
L+ ++NS SSS ASQ PN NY ++ + IGTPP + V DT
Sbjct: 38 LSSHSKNSLFSSSLASQFKQNPNTKTTSYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDT 97
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGV 164
GS L W QC+ PP FDP +SS++ LPC+ S C +L
Sbjct: 98 GSQLSWIQCK-VPPK----TPPTAFDPLLSSSFSVLPCNHSLCKPRVPDYTLPTSCDQNR 152
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C YS Y DG+++ GNL E T S+ P + GC T+ +S T GI+G+
Sbjct: 153 LCHYSYFYADGTYAEGNLVREKFTFSSSQ----TTPPLILGCATD-----SSDTQGILGM 203
Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTF----- 279
G +S S + + KFSYC+ P S + T GP S
Sbjct: 204 NLGRLSFSSLAKIS---KFSYCVPPRRSQSGSSPTGSFYLGPNPSSAGFKYVNLMTYRQS 260
Query: 280 ----------YVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSN 319
Y L + I + ++L +ST +IDSGT TFL S
Sbjct: 261 QRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTWFTFLVDEAYSK 320
Query: 320 LLSVMSSMIEAQPVADPT--------GSLELCYSFNSL---SQVPEVTIHFR-GADVKLS 367
+ E +A P GSL++C+ +++ + + F G ++ +
Sbjct: 321 VKE------EIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGVEIVVE 374
Query: 368 RSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
R V + C S G+ ++ I GN Q + V +D+ + V F TDC++
Sbjct: 375 REKMLADVGGGVQCLGIGRSDLLGVASN--IIGNFHQQDLWVEFDLVGRRVGFGRTDCSR 432
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 132/329 (40%), Gaps = 72/329 (21%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G +
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTM---- 262
Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA 286
F +V P ++ T Y++ +
Sbjct: 263 ------------------------------FARTPLVRNPSII-------PTLYLVRLRG 285
Query: 287 ISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
I VG +RL V V+DS +T L P Y + L+ S+M VA L+
Sbjct: 286 IEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLD 345
Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------- 392
CY F + VP V++ F G V V D + + +G VP
Sbjct: 346 TCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCLAFVPTPGDFAL 395
Query: 393 -IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
GN+ Q V YD+ +V F+ C
Sbjct: 396 GFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 112/425 (26%), Positives = 185/425 (43%), Gaps = 44/425 (10%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQR--LRDALTRSLNRLNHFNQNSSISSSKA----S 80
G ++++ H P SP + P L D +R +RL + + ++ ++A +
Sbjct: 40 AGNTLQVSHAFGPCSPLGPGTTAPSWAGFLADQASRDASRLLYLDSLAARGKARAYAPIA 99
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ Y++R +GTPP + L DT +D W C C + C +P FDP S
Sbjct: 100 SGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGC--AGCPTSSAPPFDPAAS 157
Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
++Y+S+PC S CA +C G C +S++Y D S L+ +++ + G AV
Sbjct: 158 TSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVA---GDAVK 213
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---- 254
TFGC G + G++GLG G +S +SQ R G FSYCL S
Sbjct: 214 T--YTFGCLQKATGT-AAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGT 270
Query: 255 INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI---------- 301
+ G NG P + +TPL + Y + + I VG + + + P +
Sbjct: 271 LRLGRNG--QPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGT 328
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 361
V+DSGT T L + + + A PV+ G + C++ +++ P VT+ F G
Sbjct: 329 VLDSGTMFTRLVAPAYVAVRDEVRRRVGA-PVSS-LGGFDTCFNTTAVAW-PPVTLLFDG 385
Query: 362 ADVKLSRSNFFVK-----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
V L N + +S + + G+ + + ++ Q N V +D+ V F
Sbjct: 386 MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFA 445
Query: 417 PTDCT 421
CT
Sbjct: 446 RERCT 450
>gi|297794561|ref|XP_002865165.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297795163|ref|XP_002865466.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311000|gb|EFH41424.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311301|gb|EFH41725.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 134
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 77/138 (55%), Gaps = 18/138 (13%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+ + C F+ FF +VELIH DSP SP YN T L A RS+
Sbjct: 7 SLVDCDFLFFF----------NDWENLTVELIHSDSPHSPLYNPHHTVSDGLNAAFLRSI 56
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R FN + + Q+ +I N Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 57 SRSRRFNTKTDL------QSGLISNGGEYFMSISIGTPPSKVLAIADTGSDLTWVQCKPC 110
Query: 123 PPSQCYMQDSPLFDPKMS 140
QCY Q+SPLFD K+S
Sbjct: 111 --QQCYKQNSPLFDKKIS 126
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 151/351 (43%), Gaps = 32/351 (9%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLPCSSS 151
IS+GTPP L DTGS L W QC+ C +CY Q + +F+P SSTY + CS+
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVGCSTE 61
Query: 152 QCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C ++ + C + C YS+ YG G +S G L + +TL S ++ F
Sbjct: 62 ACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SIDNFIF 117
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSSTKINFGTNGIV 263
GCG +N L+N GI+G G S +Q+ + T FSYC + +
Sbjct: 118 GCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIGPYA 175
Query: 264 SGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFLPQGY 316
++ T L K Y + + V RL + + ++DSGT T++
Sbjct: 176 RDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPV 235
Query: 317 NSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRGADVKLSRSNFF 372
L M+ ++A+ +C+ NS + P V + + +KL N F
Sbjct: 236 FDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTLKLPVENAF 295
Query: 373 VKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ S +++CS F V + GN +F + +DI+ FK C
Sbjct: 296 YESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 171/366 (46%), Gaps = 40/366 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +GTPP + DTGSD++W C CP + FDP S T +
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLIS 111
Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS +C+ Q S CS N C Y+ YGDGS ++G ++ + + G +V +
Sbjct: 112 CSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSS 171
Query: 200 PGITFGC-GTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
I FGC G L S GI G G D+S++SQ+ + I+ + FS+CL S
Sbjct: 172 APIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGG 231
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
IV P +V TPL ++ Y L + +ISV Q L + S+ +IDSG
Sbjct: 232 GILVLGEIVE-PNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSG 290
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY----SFNSLSQVPEVTIHFR- 360
TTL +L + +S ++S++ P P S CY S N + P+V+++F
Sbjct: 291 TTLAYLAEAAYDPFISAITSIVS--PSVRPYLSKGNHCYLISSSINDI--FPQVSLNFAG 346
Query: 361 GADVKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSF 415
GA + L ++ ++ S + C F+ I + I G+++ + + YDI Q + +
Sbjct: 347 GASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGW 406
Query: 416 KPTDCT 421
DC+
Sbjct: 407 ANYDCS 412
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 171/367 (46%), Gaps = 41/367 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W C CP S FD SS+ +
Sbjct: 79 YFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVS 138
Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS C S Q + + C Y+ YGDGS ++G +E++ GQ++ +
Sbjct: 139 CSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIANSS 198
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTK 254
+ FGC T G + GI G G GD+S+ISQ+ R FS+CL +
Sbjct: 199 ASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL----KGE 254
Query: 255 INFG---TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------V 302
N G G V PG+V +PL ++ Y L + +ISV Q L + P + +
Sbjct: 255 GNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPID-PSVFATSINRGTI 313
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFR 360
IDSGTTL +L + + +S +++ + +Q V CY + S+ ++ P V+++F
Sbjct: 314 IDSGTTLAYLVEEAYTPFVSAITAAV-SQSVTPTISKGNQCYLVSTSVGEIFPLVSLNFA 372
Query: 361 G-ADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
G A + L + + + + C F+ + V I G+++ + + YD+ +Q + +
Sbjct: 373 GSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQRIGW 432
Query: 416 KPTDCTK 422
DC++
Sbjct: 433 ASYDCSQ 439
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 131/278 (47%), Gaps = 23/278 (8%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP D+GSDL W QC+ PC C PL+ P S
Sbjct: 58 GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CASL+ + C + C Y + Y D S G L ++ L T
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171
Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
G +VA P + FGCG + G +S T G++GLG G +SL+SQ++ K +CL
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + +V TP+ ++ + +Y ++ G++ LGV +V DSG
Sbjct: 231 LRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 289
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
++ T+ L++ + + +P SL LC+
Sbjct: 290 SSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCW 327
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 161/383 (42%), Gaps = 55/383 (14%)
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
I + YL + IG ++ + DTGS L+WTQC+ CP C++ D P + S T++
Sbjct: 76 IYEDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECP--HCHIGDVPPYGRSQSRTFQ 133
Query: 145 SLPCSSSQCASLNQK--------------SCSGVNCQYSVSY---GDGSFSNGNLATETV 187
+ C + C C + Y G G G ++ +T
Sbjct: 134 EVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTF 193
Query: 188 T-LGSTTGQAVALPGITFGCGTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFS 244
+ A + FGC + + + TGI+GLG GD S + Q T KFS
Sbjct: 194 HFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTGIT---KFS 250
Query: 245 YCLVP-------VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
YC+ P + + FG++ +SG V PL Y L + AI+ L
Sbjct: 251 YCVPPRMPGYSYRRHSWLRFGSHAQISGKKV---PLVMRWGKYYLPLTAITYTYNELMSP 307
Query: 298 TP-----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD-PTGSLELCYS 345
P +++D+GT+L LP + +L+ M ++I+++ + + T + CY
Sbjct: 308 VPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWPKHCYK 367
Query: 346 FNSLSQVPEVTIHFR---GADVKLSRSNFFVKVSED---IVCSVFKGITN-SVPIYGNIM 398
++ +V ++T+ G D++L S F+K VC + + S I G
Sbjct: 368 -RTMDEVKDITVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMFA 426
Query: 399 QTNFLVGYDIEQQTVSFKPTDCT 421
QTN VGYD+ + ++ P C
Sbjct: 427 QTNINVGYDLLSREIAMDPIRCA 449
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 126/266 (47%), Gaps = 30/266 (11%)
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C Y+++YGDGSF+ G L E + G+ + + FGCG NN GLF +G++GLG
Sbjct: 76 CNYAINYGDGSFTRGELGHEKLKFGT-----ILVKDFIFGCGRNNKGLFGG-VSGLMGLG 129
Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV---STPLTKAK----- 277
D+SLISQ G FSYCL ST+ + I+ G V S+P++ AK
Sbjct: 130 RSDLSLISQTSGIFGGVFSYCL---PSTERKGSGSLILGGNSSVYRNSSPISYAKMIENP 186
Query: 278 ---TFYVLTIDAISVGNQRL---GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
FY + + IS+G L V I++DSGT +T LP L +
Sbjct: 187 QLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGF 246
Query: 332 PVADPTGSLELCYSFNSLSQV--PEVTIHFRG---ADVKLSRSNFFVKVSEDIVCSVFKG 386
P A L+ C++ ++ +V P + +HF G V ++ +FVK VC
Sbjct: 247 PPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 306
Query: 387 IT--NSVPIYGNIMQTNFLVGYDIEQ 410
+ + V I GN Q N V YD ++
Sbjct: 307 LEYQDEVAILGNYQQKNLRVIYDTKE 332
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 155/363 (42%), Gaps = 57/363 (15%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +G P + DTGSD++W C+ CP L+DP S + +
Sbjct: 27 YFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRVS 86
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C S L + CQY+V YGDGS + G ++ V TG ++
Sbjct: 87 CDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSNG 146
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
+TFGCG G + + G I G F++CL V+ I F
Sbjct: 147 TVTFGCGAQQSGGLGTSGEALDG---------------ILGAFAHCLDNVNGGGI-FAIG 190
Query: 261 GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--------DIVIDSGTTLTFL 312
+VS P V +TP+ + Y + + I VG L + T +IDSGTTL +L
Sbjct: 191 ELVS-PKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDSGTTLAYL 249
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLE------LC--YSFNSLSQVPEVTIHFRGA-D 363
P+ V SM+ P SL +C YS N P++ HF+ +
Sbjct: 250 PE-------VVYDSMMNEIRSQQPGLSLHTVEEQFICFKYSGNVDDGFPDIKFHFKDSLT 302
Query: 364 VKLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+ + ++ ++SEDI C ++ G+ + + G+++ +N LV YDIE Q + +
Sbjct: 303 LTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTE 362
Query: 418 TDC 420
+C
Sbjct: 363 YNC 365
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 120/459 (26%), Positives = 189/459 (41%), Gaps = 91/459 (19%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA 89
++ L H + + PF + YQ+L +T SL R H + ++ + +
Sbjct: 10 TIPLQHPQTNQIPF----QDQYQKLNHLVTTSLARARHLKNPQTTPATTTTAPLFSHSYG 65
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPL------FDPKMS 140
Y + +S GTPP + DTGSD++W C C C S F PK S
Sbjct: 66 GYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLC--KHCSFSSSSPSSRIQPFIPKES 123
Query: 141 STYKSLPCSSSQCASLNQ-----------KSCSGVNC-QYSVSYGDGSFSNGNLATETVT 188
S+ K L C + +C+ ++ KSC C Y + YG G+ + G +ET+
Sbjct: 124 SSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGT-TGGVALSETLH 182
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
L +++ P GC +F+S + GI G G G SL SQ+ GKFSYCL
Sbjct: 183 L-----HSLSKPNFLVGC-----SVFSSHQPAGIAGFGRGLSSLPSQLGL---GKFSYCL 229
Query: 248 ----------------VPVSSTKINFGTNGIVSGPGVVSTPLTKAKTF---YVLTIDAIS 288
+ + + TN +V P V + + +F Y L + I+
Sbjct: 230 LSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRIT 289
Query: 289 VGNQRLGV----------STPDIVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVA 334
VG + V ++IDSGTT TF+ + + + + + +
Sbjct: 290 VGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIE 349
Query: 335 DPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSV 391
D G L C++ + V PE+ ++F+ GADV L N+F V ++ C +T+ V
Sbjct: 350 DAIG-LRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTV--VTDGV 406
Query: 392 P----------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN NF V YD+ + + FK C
Sbjct: 407 AGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 176/385 (45%), Gaps = 66/385 (17%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + I I++GTPP V DTGS+L W C + P F+P +SS+Y +
Sbjct: 62 HNVSLTISITVGTPPQNMSMVIDTGSELSWLHCN---TNTTATIPYPFFNPNISSSYTPI 118
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CSS C + + SC N C ++SY D S S GNLA++T GS+ P
Sbjct: 119 SCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFN-----P 173
Query: 201 GITFGC-----GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
GI FGC TN+ +S TTG++G+ G +SL+SQ++ KFSYC+ + I
Sbjct: 174 GIVFGCMNSSYSTNSES--DSNTTGLMGMNLGSLSLVSQLKIP---KFSYCISGSDFSGI 228
Query: 256 ------NFGTNGIVSGPGVV--STPLTK-AKTFYVLTIDAISVGNQRLGVS----TPD-- 300
NF G ++ +V STPL ++ Y + ++ I + ++ L +S PD
Sbjct: 229 LLLGESNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHT 288
Query: 301 ----IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTG----SLELCYSF-- 346
+ D GT ++L L+ + + A + DP +++LCY
Sbjct: 289 GAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRA--LDDPNFVFQIAMDLCYRVPV 346
Query: 347 --NSLSQVPEVTIHFRGADVK------LSRSNFFVKVSEDIVCSVFKG---ITNSVPIYG 395
+ L ++P V++ F GA+++ L R FV ++ + C F + I G
Sbjct: 347 NQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIG 406
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
+ Q + + +D+ + V C
Sbjct: 407 HHHQQSMWMEFDLVEHRVGLAHARC 431
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 176/391 (45%), Gaps = 49/391 (12%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
R H +++ +++ D + N Y R+ IGTPP + DTGS + + C C
Sbjct: 53 RRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC 112
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QC P F P +SSTY+ + C+ C N + + C Y Y + S S+G L
Sbjct: 113 --EQCGRHQDPKFQPDLSSTYQPVKCTLD-CNCDNDR----MQCVYERQYAEMSTSSGVL 165
Query: 183 ATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTI 239
+ V+ G+ + +A FGC G L++ GI+GLG GD+S++ Q+ + +
Sbjct: 166 GEDVVSFGNQS--ELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVV 223
Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT------FYVLTIDAISVGNQR 293
+ FS C ++ G +V G + + A++ +Y + + I V +R
Sbjct: 224 SDSFSLCY-----GGMDVGGGAMVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKR 278
Query: 294 LGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVMSSMI-EAQPVADPTGSL----E 341
L ++ P + V+DSGTT +LP+ L+ +++ E Q + +G +
Sbjct: 279 LPLN-PSVFDGKHGSVLDSGTTYAYLPE---EAFLAFKEAIVKELQSFSQISGPDPNYND 334
Query: 342 LCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSV 391
LC+S + LS+ P V + F G LS N+ KV +F+ +
Sbjct: 335 LCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPT 394
Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ G I+ N LV YD EQ + F T+C +
Sbjct: 395 TLLGGIVVRNTLVLYDREQTKIGFWKTNCAE 425
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 120/479 (25%), Positives = 195/479 (40%), Gaps = 98/479 (20%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A +EL H D+ N T +R+R A R+ +R + +++ ++ +
Sbjct: 18 AGGAALRLELAHVDA------NEHCTMEERVRRATERTHHR-RLLHASTAAAAGGVAAPL 70
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--------PPSQCYMQDSPLF 135
Y+ IG PP AV DTGSDL+WTQC C C+ Q+ P +
Sbjct: 71 RWSGKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYY 130
Query: 136 DPKMSSTYKSLPCS---------SSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATE 185
+ +S T +++PC + + A + SG + C + SYG G + G L T+
Sbjct: 131 NFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTD 189
Query: 186 TVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
T S++ +A FGC + + G N +GI+GLG G +SL+SQ+ T +
Sbjct: 190 AFTFPSSSSVTLA-----FGCVSQTRISPGALNG-ASGIIGLGRGALSLVSQLNAT---E 240
Query: 243 FSYCLVP-----VSSTKINFGTNGIVSGPG-----------VVSTPLTKA------KTFY 280
FSYCL P VS + + G + V + P K TFY
Sbjct: 241 FSYCLTPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFY 300
Query: 281 VLTIDAISVGNQRLGV---------STPDI-----VIDSGTTLTFLPQGYNSNLLSVMSS 326
L + ++ GN + + + P + +IDSG+ T L + L ++
Sbjct: 301 YLPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELAR 360
Query: 327 MIEAQ-----PVADPTGSLELCYSFN------SLSQVPEVTIHFR-----GADVKLSRSN 370
+ P A G+LELC + + VP + + F G ++ +
Sbjct: 361 QLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEK 420
Query: 371 FFVKVSEDIVCSVFKG--------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
++ +V C TN I GN MQ + V YD+ +SF+P +C+
Sbjct: 421 YWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 150/311 (48%), Gaps = 43/311 (13%)
Query: 146 LPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI-- 202
+ C+ + C+ + SC + C Y +YGDG+ + G ATE T S+ G + +
Sbjct: 1 MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-------- 254
FGCG+ N G N+ +GIVG G +SL+SQ+ +FSYCL +S +
Sbjct: 61 GFGCGSVNVGSLNNG-SGIVGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQSTLLFGS 116
Query: 255 INFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS------TPD----I 301
++ G G +G V +TPL ++ TFY + ++VG +RL + PD +
Sbjct: 117 LSDGVYGDATGR-VQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGV 175
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCY---------SFNSLS 350
++DSGT LT LP + ++ + P A+ G+ E +C+ S S
Sbjct: 176 IVDSGTALTLLPAAVLAEVVRAFRQQLRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQM 233
Query: 351 QVPEVTIHFRGADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
VP + +HF+GAD+ L R N+ + +C + + GN++Q + V YD+E
Sbjct: 234 PVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLE 293
Query: 410 QQTVSFKPTDC 420
+T+S P C
Sbjct: 294 AETLSIAPARC 304
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 128/263 (48%), Gaps = 26/263 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ + G A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
I FGC + G + GI G G +S++SQ+ + ++ K FS+CL S
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 269
Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
G + PG+V TPL ++ Y L +++I V Q+L + +T ++DS
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 329
Query: 306 GTTLTFLPQGYNSNLLSVMSSMI 328
GTTL +L G ++ +++ +
Sbjct: 330 GTTLAYLADGAYDPFVNAITAAV 352
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 170/383 (44%), Gaps = 63/383 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++GTPP V DTGS+L W C+ + +F+P +SS+Y +
Sbjct: 66 HNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKK------QQNINSVFNPHLSSSYTPI 119
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PC S C + + SC N C +VSY D + GNLA++T + S +GQ P
Sbjct: 120 PCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAI-SGSGQ----P 174
Query: 201 GITFG---CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
GI FG G ++ +SKTTG++G+ G +S ++QM KFSYC+ ++ +
Sbjct: 175 GIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFP---KFSYCISGKDASGVLL 231
Query: 258 GTNGIVSGPGVVS-TPLTKAKT--------FYVLTIDAISVGNQRLGVS----TPD---- 300
+ G + TPL K T Y + + I VG++ L V PD
Sbjct: 232 FGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGA 291
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSFNS---L 349
++DSGT TFL + L + + + DP G+++LC+ +
Sbjct: 292 GQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVV 351
Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKV---------SEDIVCSVFKG---ITNSVPIYGNI 397
VP VT+ F GA++ +S +V + D+ C F + + G+
Sbjct: 352 PAVPAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHH 411
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q N + +D+ V F T C
Sbjct: 412 HQQNVWMEFDLVNSRVGFADTKC 434
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 111/398 (27%), Positives = 174/398 (43%), Gaps = 74/398 (18%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQD--------------- 131
YLI +SIGTPP DTGSDL W C C Y +
Sbjct: 80 YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSH 139
Query: 132 -----SPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
SP SS PC+ + C ++L + +CS ++ +YG G G L
Sbjct: 140 RDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTR 199
Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
+T+ + G G +P FGC ++ + GI G G G +SL SQ+ G F
Sbjct: 200 DTLRVHGRNLGVTQEIPRFCFGCVASS----YREPIGIAGFGRGALSLPSQLGFLRKG-F 254
Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGN-- 291
S+C + P S+ + G + S + TP+ K+ +Y + ++AI+VGN
Sbjct: 255 SHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVS 314
Query: 292 ---------QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD---PTGS 339
+ + +++DSGTT T LP+ + S +LSV+ S+I D TG
Sbjct: 315 ATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMRTG- 373
Query: 340 LELCYSF----NSL---SQVPEVTIHF-RGADVKLSRSNFFVKVSED-----IVCSVFKG 386
+LCY NS+ +P +T HF A + LSR + F +S + C +F+
Sbjct: 374 FDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLFQS 433
Query: 387 ITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + + G+ Q + V YD+E++ + F+P DC
Sbjct: 434 MDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/414 (26%), Positives = 186/414 (44%), Gaps = 63/414 (15%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
P SS P R+ D R L++ S + ++ D + +N Y R+ IGTPP
Sbjct: 34 PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
E + DTGS + + C C QC P F P++S++Y++L C+ C ++
Sbjct: 87 QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140
Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
G C Y Y + S S+G L+ + ++ G+ + ++ FGC G LF+ + G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197
Query: 221 IVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGVV---S 270
I+GLG G +S++ Q+ + I FS C + G +V G PG+V S
Sbjct: 198 IMGLGRGKLSVVDQLVDKGVIEDVFSLCY-----GGMEVGGGAMVLGKISPPPGMVFSHS 252
Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSV 323
P +Y + + + V + L ++ P + V+DSGTT + P+ +++
Sbjct: 253 DPFRSP--YYNIDLKQMHVAGKSLKLN-PKVFNGKHGTVLDSGTTYAYFPK---EAFIAI 306
Query: 324 MSSMIEAQPV------ADPTGSLELCYS--FNSLSQV----PEVTIHF-RGADVKLSRSN 370
++I+ P DP ++C+S ++++ PE+ + F G + LS N
Sbjct: 307 KDAVIKEIPSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPEN 365
Query: 371 FF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ KV +F +S + G I+ N LV YD E + F T+C+
Sbjct: 366 YLFRHTKVRGAYCLGIFPD-RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/414 (26%), Positives = 186/414 (44%), Gaps = 63/414 (15%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
P SS P R+ D R L++ S + ++ D + +N Y R+ IGTPP
Sbjct: 34 PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
E + DTGS + + C C QC P F P++S++Y++L C+ C ++
Sbjct: 87 QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140
Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
G C Y Y + S S+G L+ + ++ G+ + ++ FGC G LF+ + G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197
Query: 221 IVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGVV---S 270
I+GLG G +S++ Q+ + I FS C + G +V G PG+V S
Sbjct: 198 IMGLGRGKLSVVDQLVDKGVIEDVFSLCY-----GGMEVGGGAMVLGKISPPPGMVFSHS 252
Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSV 323
P +Y + + + V + L ++ P + V+DSGTT + P+ +++
Sbjct: 253 DPFRSP--YYNIDLKQMHVAGKSLKLN-PKVFNGKHGTVLDSGTTYAYFPK---EAFIAI 306
Query: 324 MSSMIEAQPV------ADPTGSLELCYS--FNSLSQV----PEVTIHF-RGADVKLSRSN 370
++I+ P DP ++C+S ++++ PE+ + F G + LS N
Sbjct: 307 KDAVIKEIPSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPEN 365
Query: 371 FF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ KV +F +S + G I+ N LV YD E + F T+C+
Sbjct: 366 YLFRHTKVRGAYCLGIFPD-RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 155/356 (43%), Gaps = 45/356 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N Y+ IGTPP + D SDL+WT C P F+P S+T +
Sbjct: 96 NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145
Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGIT 203
PC+ C ++C C Y+ YG G+ + G L TE T G T + G+
Sbjct: 146 PCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----IDGVV 200
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFGT 259
FGCG N G F S +G++GLG G++SL+SQ++ +FSY P S I FG
Sbjct: 201 FGCGLKNVGDF-SGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSFILFGD 256
Query: 260 NGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV--STPDIVIDSGT------- 307
+ +ST L + + Y + + I V + L + T D+ G+
Sbjct: 257 DATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSI 316
Query: 308 --TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
+T L + L ++S I V L+LCY+ SL ++VP + + F G
Sbjct: 317 TDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGA 376
Query: 364 V-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
V +L N F++ + + C ++ + G+++Q + YDI + F+
Sbjct: 377 VMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVFE 432
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 167/362 (46%), Gaps = 31/362 (8%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
D + N Y R+ IGTP E + D+GS + + C C QC P F P +SST
Sbjct: 83 DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQDPRFQPDLSST 140
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
Y + C+ C N++S C Y Y + S S+G L + ++ G + +
Sbjct: 141 YSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKES--ELKPQRA 193
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGT 259
FGC T G LF+ GI+GLG G +S++ Q+ + I+ FS C +
Sbjct: 194 VFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVL 253
Query: 260 NGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPDIVIDSGTTLTFL 312
G+ + P +V + ++ +Y + + I V + L + S V+DSGTT +L
Sbjct: 254 GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYL 313
Query: 313 P-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGAD 363
P Q + + +V + + + + P + ++C++ + LS+V P+V + F G
Sbjct: 314 PEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQK 373
Query: 364 VKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ LS N+ + S E C VF+ + + G I+ N LV YD + + F T+C
Sbjct: 374 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 433
Query: 421 TK 422
++
Sbjct: 434 SE 435
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 177/394 (44%), Gaps = 53/394 (13%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 91 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G ++S G + T+T+ +G + + FGC + ++ GI G G S
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 261
Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
Q+ ++ K FSYCL P TK + G + TPL ++ + Y LT++
Sbjct: 262 QLAGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 320
Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
+ QRL S+ ++++DSG +T L + GY+ + S I
Sbjct: 321 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 380
Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
D +G F++ S +P + I F GA + LS N F +C F +
Sbjct: 381 CYLSEHDYSGWNGTITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQN 440
Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN + +F +DI+ + FK C
Sbjct: 441 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 474
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 167/364 (45%), Gaps = 45/364 (12%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C QC P F P+ SSTY+ +
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 166
Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ C +C G + C Y Y + S S+G L + ++ G+ + +A FG
Sbjct: 167 CTID-C------NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQS--ELAPQRAVFG 217
Query: 206 C-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGI 262
C G L++ GI+GLG GD+S++ Q+ + I+ FS C ++ G +
Sbjct: 218 CENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY-----GGMDVGGGAM 272
Query: 263 VSGPGVVSTPLTKAKT------FYVLTIDAISVGNQRLGVST------PDIVIDSGTTLT 310
V G + +T A + +Y + + + V +RL ++ V+DSGTT
Sbjct: 273 VLGGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYA 332
Query: 311 FLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----PEVTIHF-RG 361
+LP+ + + +++ + + ++ P + ++C+S N +SQ+ P V + F G
Sbjct: 333 YLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNG 392
Query: 362 ADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
LS N+ KV +F+ + + G I+ N LV YD EQ + F T
Sbjct: 393 HKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKT 452
Query: 419 DCTK 422
+C +
Sbjct: 453 NCAE 456
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 164/363 (45%), Gaps = 39/363 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+Y + ++IG PP DTGSDL W QC+ P C L+ PK + +PC
Sbjct: 66 GHYSVILNIGNPPKAFDLDIDTGSDLTWVQCD-APCKGCTKPLDKLYKPKNN----RVPC 120
Query: 149 SSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+SS C ++ +C C Y V Y D S G L ++ L G + P I FGC
Sbjct: 121 ASSLCQAIQNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQ-PRIAFGC 179
Query: 207 GTNN---GGLFNSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTNG 261
G + G T GI+GLG G S++SQ+RT +C V+ + FG +
Sbjct: 180 GYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDH- 238
Query: 262 IVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
++ G+ TP+ + + T Y + G + G+ ++ DSG++ T+ +
Sbjct: 239 LLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQS 298
Query: 320 LLSVMSSMIEAQPVAD--PTGSLELCYS--------FNSLSQVPEVTIHF---RGADVKL 366
+L+++ + P+ D +L +C+ + S +TI+F + ++L
Sbjct: 299 ILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTINFIKAKNVQLQL 358
Query: 367 SRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ ++ + + VC GI N ++ + G+I + +V YD E+Q + + PT+
Sbjct: 359 APEDYLIITKDGNVCL---GILNGGEQGLGNLNVIGDIFMQDRVVVYDNERQQIGWFPTN 415
Query: 420 CTK 422
C +
Sbjct: 416 CNR 418
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/351 (31%), Positives = 153/351 (43%), Gaps = 30/351 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPL----FDPKMSSTYK 144
Y +SIGTP L DTGSDL W CE CP + + SST
Sbjct: 104 YYANVSIGTPGLYFLVALDTGSDLFWLPCECTKCPTYLTKRDNGKFWLNHYSSNASSTSI 163
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALP-GI 202
+PCSSS C NQ S + +C Y Y + S S G L + + + + Q + +
Sbjct: 164 RVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQLKPVDVKV 223
Query: 203 TFGCGTNNGGLFNSKTT--GIVGLGGGDIS----LISQMRTTIAGKFSYCLVPVSSTKIN 256
T GCG G F++ T G++GLG G +S L SQ TT FS C +I+
Sbjct: 224 TLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTT--DSFSMCFGYYGYGRID 281
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGY 316
FG G V G TP A Y +TI I V N+ V I IDSG + T+L +
Sbjct: 282 FGDIGPV---GQRETPFNPASLSYNVTILQIIVTNRPTNVHLTAI-IDSGASFTYLTDPF 337
Query: 317 NSNLLSVMSSMIEAQPV-ADPTGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFV 373
S + M + +E + + +D E CY S ++ Q P + G K +V
Sbjct: 338 YSIITENMDAAMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTMEGGR-KFDVITSYV 396
Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI----EQQTVSFKPTDC 420
V D ++ I S I N++ NF GY + E+ T+ +K DC
Sbjct: 397 SVDTDDGPALCLAIVKSTDI--NVIGHNFFGGYRVVFNREKMTLGWKEVDC 445
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/423 (28%), Positives = 204/423 (48%), Gaps = 73/423 (17%)
Query: 44 YNSSETPY--QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
Y++ P+ + + L + L + Q + AS A +I I++GTP
Sbjct: 44 YSAKSRPWVSKLVAGFLKKQLRNRGNKQQQQQLGGEAASGA-----APPLVINITVGTPV 98
Query: 102 TERLA-VADTGSDLIWTQCEPC--------PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ ++ + D S +W QC PC PP+ F P S+T+ LPCSS
Sbjct: 99 AQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATA-------FRPNGSATFSPLPCSSDM 151
Query: 153 CASLNQKSC----------SGVNCQ-YSVSYGDGSFSN--GNLATETVTLGSTTGQAVAL 199
C + +++C +G C YS++YG GS +N G LAT+T T G+T A+
Sbjct: 152 CLPVLRETCGRAGAAANATAGARCDSYSLTYG-GSAANTSGYLATDTFTFGAT-----AV 205
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
PG+ FGC + G F + +G++G+G G++SLISQ++ GKFSY L+ +T
Sbjct: 206 PGVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSAD 261
Query: 255 --INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----------ST 298
I FG + + STPL T FY + + + V RL T
Sbjct: 262 SVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGT 321
Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLS--QVPE 354
+++ S T +T+L Q + + ++S I P + + +LE LCY+ +S++ +VP+
Sbjct: 322 GGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLPAVNGSAALELDLCYNASSMAKVKVPK 380
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+T+ F GAD+ LS +N+F ++ + + + + G ++QT + YD++ +
Sbjct: 381 LTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRL 440
Query: 414 SFK 416
+F+
Sbjct: 441 TFE 443
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 111/415 (26%), Positives = 177/415 (42%), Gaps = 44/415 (10%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H +SP SPF + ++ L + RL + + + S + I +
Sbjct: 34 LRVFHVNSPCSPFKQPNTVSWE---STLLKDKARLQYLSSLAKKPSVPIASGRAIVQSPT 90
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +IGTP L DT +D W C C S LFDP SS+ ++L C +
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSGC----VGCASSVLFDPSKSSSSRNLQCDA 146
Query: 151 SQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
QC +C +G +C ++++YG GS +L +T+TL + + TFGC +
Sbjct: 147 PQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLAND-----VIKSYTFGCISK 200
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG-- 267
G + G++GLG G +SLISQ + FSYCL +S NF + + GP
Sbjct: 201 ATGT-SLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCL--PNSKSSNF-SGSLRLGPKYQ 256
Query: 268 ---VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTLTF 311
+ +TPL K + Y + + I VGN+ + + T + + DSGT T
Sbjct: 257 PVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTR 316
Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 371
L + + + I+ A G + CYS + + P VT F G +V L N
Sbjct: 317 LVEPAYVAVRNEFRRRIK-NANATSLGGFDTCYSGSVV--YPSVTFMFAGMNVTLPPDNL 373
Query: 372 FVKVSE-DIVCSVFKGITNSV----PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ S C N+V + ++ Q N V D+ + CT
Sbjct: 374 LIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETCT 428
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 119/423 (28%), Positives = 204/423 (48%), Gaps = 73/423 (17%)
Query: 44 YNSSETPY--QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
Y++ P+ + + L + L + Q + AS A +I I++GTP
Sbjct: 44 YSAKSRPWVSKLVAGFLKKQLRNRGNKQQQQQLGGEAASGA-----APPLVINITVGTPV 98
Query: 102 TERLA-VADTGSDLIWTQCEPC--------PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ ++ + D S +W QC PC PP+ F P S+T+ LPCSS
Sbjct: 99 AQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATA-------FRPNGSATFSPLPCSSDM 151
Query: 153 CASLNQKSC----------SGVNCQ-YSVSYGDGSFSN--GNLATETVTLGSTTGQAVAL 199
C + +++C +G C YS++YG GS +N G LAT+T T G+T A+
Sbjct: 152 CLPVLRETCGRAGAAANATAGARCDSYSLTYG-GSAANTSGYLATDTFTFGAT-----AV 205
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
PG+ FGC + G F + +G++G+G G++SLISQ++ GKFSY L+ +T
Sbjct: 206 PGVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSAD 261
Query: 255 --INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----------ST 298
I FG + + STPL T FY + + + V RL T
Sbjct: 262 SVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGT 321
Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLS--QVPE 354
+++ S T +T+L Q + + ++S I P + + +LE LCY+ +S++ +VP+
Sbjct: 322 GGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLPAVNGSAALELDLCYNASSMAKVKVPK 380
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
+T+ F GAD+ LS +N+F ++ + + + + G ++QT + YD++ +
Sbjct: 381 LTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRL 440
Query: 414 SFK 416
+F+
Sbjct: 441 TFE 443
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 150/350 (42%), Gaps = 23/350 (6%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
Y + +GTP T L DTGSDL W C+ C P Y +D ++ P S+T +
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 155
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
LPCS C S+ + C Y++ Y + + S+G L +T+ L +
Sbjct: 156 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 215
Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG G L G++GLG DIS+ S + + FS C SS +I FG
Sbjct: 216 IGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
G+ S PL Y + +D +G++ L ++ ++DSGT+ T LP
Sbjct: 276 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVYKA 335
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
+ A V + + CYS + L VP +T+ F AD L N + ++
Sbjct: 336 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 394
Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
+ + ++ PI I+ NFLVGY D E + + ++C
Sbjct: 395 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 442
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 170/369 (46%), Gaps = 37/369 (10%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P +
Sbjct: 44 GDVYPT-GHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCD-APCQSCNKVPHPLYKPTKN- 100
Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
K +PC++S C +L N+K C Y + Y D + S G L T+ TL
Sbjct: 101 --KLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSS 158
Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVP 249
+V P TFGCG + G+ + T G++GLG G +SL+SQ++ K +CL
Sbjct: 159 SVR-PSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLST 217
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSGT 307
+ FG N +V P+ ++ + +Y + + LGV ++V DSG+
Sbjct: 218 NGGGFLFFGDN-VVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276
Query: 308 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTI 357
T T F Q Y + + ++ + + ++ Q V+DP SL LC+ F S+S V + +
Sbjct: 277 TYTYFAAQPYQATVSALKAGLSKSLQQVSDP--SLPLCWKGQKVFKSVSDVKNDFKSLFL 334
Query: 358 HF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 413
F + + +++ N+ + C + G + I G+I + L+ YD E+ +
Sbjct: 335 SFVKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNIIGDITMQDQLIIYDNERGQL 394
Query: 414 SFKPTDCTK 422
+ C++
Sbjct: 395 GWIRGSCSR 403
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 158/371 (42%), Gaps = 50/371 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PP + DTGSD++W +C CP D L+DPK S T + +
Sbjct: 70 YFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELIS 129
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G + C YS++YGDGS + G + +T P
Sbjct: 130 CDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNS 189
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
I FGCG G +S + GI+G G + S++SQ+ + + FS+CL +
Sbjct: 190 SIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGG 249
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VIDS 305
I F +V P V +TPL Y + + +I V L + + DI +IDS
Sbjct: 250 I-FAIGEVVE-PKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPS-DIFDSGNGKGTIIDS 306
Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC--------YSFNSLSQVPEVTI 357
GTTL +LP V +I P L L Y+ N P V +
Sbjct: 307 GTTLAYLPA-------IVYDELIPKVMARQPRLKLYLVEQQFSCFQYTGNVDRGFPVVKL 359
Query: 358 HFRGA-DVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQ 410
HF + + + ++ + + I C ++ + + G+++ +N LV YD+E
Sbjct: 360 HFEDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLEN 419
Query: 411 QTVSFKPTDCT 421
+ + +C+
Sbjct: 420 MAIGWTDYNCS 430
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 102/353 (28%), Positives = 162/353 (45%), Gaps = 65/353 (18%)
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
R + DTGSDLIWTQC K+SS+ + S S + +G
Sbjct: 53 RKLIVDTGSDLIWTQC------------------KLSSSTAAAARHGSPPLSRTAPARTG 94
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
++ + + + G LA+ET T G+ +AV+L + FGCG + G TGI+G
Sbjct: 95 A---FTRTCTASAAAVGVLASETFTFGAR--RAVSLR-LGFGCGALSAGSLIG-ATGILG 147
Query: 224 LGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FG---------TNGIVSGPGVVST 271
L +SLI+Q++ +FSYCL P + K + FG T + +VS
Sbjct: 148 LSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSN 204
Query: 272 PLTKAKTFYVLTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLL 321
P+ +Y + + IS+G++RL V PD ++DSG+T+ +L + +
Sbjct: 205 PVET--VYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVK 262
Query: 322 SVMSSMIEAQPVADPT-GSLELCYSFNSLS--------QVPEVTIHFRG-ADVKLSRSNF 371
+ ++ PVA+ T ELC+ + QVP + +HF G A + L R N+
Sbjct: 263 EAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNY 321
Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
F + ++C T+ V I GN+ Q N V +D++ SF PT C +
Sbjct: 322 FQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQ 374
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 176/398 (44%), Gaps = 61/398 (15%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G ++S G + T+T+ +G + + FGC + ++ GI G G S
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259
Query: 234 QMRTTIAG--------KFSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYV 281
Q+ AG FSYCL P TK + G + TPL ++ + Y
Sbjct: 260 QL----AGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYS 314
Query: 282 LTIDAISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVM 324
LT++ + QRL S+ ++++DSG +T L + GY+ +
Sbjct: 315 LTMEMLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ 374
Query: 325 SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSV 383
S I D +G F++ S +P + I F GA + LS N F +C
Sbjct: 375 ESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMT 434
Query: 384 F-KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F + I GN + +F +DI+ + FK C
Sbjct: 435 FAQNPALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 150/350 (42%), Gaps = 23/350 (6%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
Y + +GTP T L DTGSDL W C+ C P Y +D ++ P S+T +
Sbjct: 66 YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 125
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
LPCS C S+ + C Y++ Y + + S+G L +T+ L +
Sbjct: 126 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 185
Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG G L G++GLG DIS+ S + + FS C SS +I FG
Sbjct: 186 IGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 245
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
G+ S PL Y + +D +G++ L ++ ++DSGT+ T LP
Sbjct: 246 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPLDVYKA 305
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
+ A V + + CYS + L VP +T+ F AD L N + ++
Sbjct: 306 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 364
Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
+ + ++ PI I+ NFLVGY D E + + ++C
Sbjct: 365 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 412
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 161/371 (43%), Gaps = 49/371 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +G+P + DTGSD++W +C CP L+DPK S T + +
Sbjct: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C + C+S + C N C YS+SYGDGS + G + +T G A
Sbjct: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNS 188
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
I FGCG G F S + GI+G G + S++SQ+ + + FS+CL T
Sbjct: 189 SIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL----DTN 244
Query: 255 INFG--TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VID 304
+ G + G V P V +TPL Y + + I V L + + VID
Sbjct: 245 VGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVID 304
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQP------VADPTGSLELCYSFNSLSQVPEVTIH 358
SGTTL +LP+ L MS ++ QP V + + Y+ N S P V +H
Sbjct: 305 SGTTLAYLPRIVYDQL---MSKVLAKQPRLKVYLVEEQYSCFQ--YTGNVDSGFPIVKLH 359
Query: 359 FRGA-DVKLSRSNFFVKVSEDIVCSVFKGITNS-------VPIYGNIMQTNFLVGYDIEQ 410
F + + + ++ D + + S + + G+ + +N LV YD+E
Sbjct: 360 FEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLEN 419
Query: 411 QTVSFKPTDCT 421
T+ + +C+
Sbjct: 420 MTIGWTDYNCS 430
>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
Length = 555
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 157/393 (39%), Gaps = 68/393 (17%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSP---------------- 133
YL+ + GTP V DT +DL W C + Y + S
Sbjct: 140 YLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKTMSVGGDDDVVAALA 199
Query: 134 -------LFDPKMSSTYKSLPCSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNL 182
+ P SS+++ + CS QCA L +C +C Y DG+ + G
Sbjct: 200 KKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSYYQKTQDGTVTIGIY 259
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
E T+ + G+ LPG+ GC G G++ LG G +S G+
Sbjct: 260 GNEKATVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSFAIHAVLRFGGR 319
Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
FS+CL+ +S++ + FG N V GPG + T + K Y + A+ VG +RL
Sbjct: 320 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVGGERL 379
Query: 295 GVSTPD------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
+ PD +++D+ T++T +P+ Y L++ + + P G E
Sbjct: 380 DI--PDDVWNIDKGLGSGVILDTSTSVTSLVPEAYEP-LVAALDRHLAHLPRESFAG-FE 435
Query: 342 LCYSFNSLSQ---------VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGI-T 388
CY + +P+VT+ G +L ++S +V + C F+ +
Sbjct: 436 YCYRWTFTGDGVDPAHNVTIPKVTVEMTGG-ARLEPEAKSVVMPEVGHGVACLAFRKLPW 494
Query: 389 NSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
P I GN++ ++ D + T F+ C
Sbjct: 495 GGGPCIIGNVLMQEYIWEIDHSKATFRFRKDKC 527
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 155/360 (43%), Gaps = 49/360 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N Y+ IGTPP + D SDL+WT C P F+P S+T +
Sbjct: 96 NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145
Query: 147 PCSSSQCASLNQKSCSG------VNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVAL 199
PC+ C ++C C Y+ YG G+ + G L TE T G T +
Sbjct: 146 PCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----I 200
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----I 255
G+ FGCG N G F S +G++GLG G++SL+SQ++ +FSY P S I
Sbjct: 201 DGVVFGCGLQNVGDF-SGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSFI 256
Query: 256 NFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV--STPDIVIDSGT--- 307
FG + +ST L + + Y + + I V + L + T D+ G+
Sbjct: 257 LFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGV 316
Query: 308 ------TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHF 359
+T L + L ++S I V L+LCY+ SL ++VP + + F
Sbjct: 317 FLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVF 376
Query: 360 RGADV-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
G V +L N F++ + + C ++ + G+++Q + YDI + F+
Sbjct: 377 AGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVFE 436
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 150/350 (42%), Gaps = 23/350 (6%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
Y + +GTP T L DTGSDL W C+ C P Y +D ++ P S+T +
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 155
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
LPCS C S+ + C Y++ Y + + S+G L +T+ L +
Sbjct: 156 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 215
Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG G L G++GLG DIS+ S + + FS C SS +I FG
Sbjct: 216 IGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
G+ S PL Y + +D +G++ L ++ ++DSGT+ T LP
Sbjct: 276 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVYKA 335
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
+ A V + + CYS + L VP +T+ F AD L N + ++
Sbjct: 336 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 394
Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
+ + ++ PI I+ NFLVGY D E + + ++C
Sbjct: 395 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 442
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 171/364 (46%), Gaps = 36/364 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G PP + DTGSD++W C CP + FDP S+T +
Sbjct: 83 YYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVS 142
Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLG---STTGQAVAL 199
CS CA Q S S C Y YGDGS ++G + + L ++ + +
Sbjct: 143 CSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSS 202
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
+ FGC T+ G + GI G G D+S+ISQ+ + IA K FS+CL S
Sbjct: 203 ASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGG 262
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
IV P VV TPL ++ Y L + +ISV Q L + S+ +IDSG
Sbjct: 263 GILVLGEIVE-PNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSG 321
Query: 307 TTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCY-SFNSLSQV-PEVTIHFR-GA 362
TTL +L + YN+ +++V + I +Q CY + +S+S + P+V+++F GA
Sbjct: 322 TTLAYLAEEAYNAFVVAVTN--IVSQSTQSVVLKGNRCYVTSSSVSDIFPQVSLNFAGGA 379
Query: 363 DVKLSRSNFFVKVSE----DIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+ L ++ ++ + + C F+ I + I G+++ + + YD+ Q + +
Sbjct: 380 SLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWTN 439
Query: 418 TDCT 421
DC+
Sbjct: 440 YDCS 443
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 165/363 (45%), Gaps = 35/363 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C CP S FDP SST +
Sbjct: 68 YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 127
Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
CS +C+ Q S CS G C Y+ YGDGS ++G ++ + + G +V +
Sbjct: 128 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 187
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKI 255
I FGC + G + GI G G D+S+ISQM + I K FS+CL
Sbjct: 188 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGG 247
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VIDSG 306
IV +V +PL ++ Y L + +ISV + L + P++ ++DSG
Sbjct: 248 ILVLGEIVE-EDIVYSPLVPSQPHYNLNLQSISVNGKSLAID-PEVFATSTNRGTIVDSG 305
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGA-D 363
TTL +L + +S ++ + +Q V CY S + P V+++F G
Sbjct: 306 TTLAYLAEEAYDPFVSAITEAV-SQSVRPLLSKGTQCYLITSSVKGIFPTVSLNFAGGVS 364
Query: 364 VKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ L ++ ++ + + C F+ I + I G+++ + + YD+ Q + +
Sbjct: 365 MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANY 424
Query: 419 DCT 421
DC+
Sbjct: 425 DCS 427
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 110/416 (26%), Positives = 181/416 (43%), Gaps = 58/416 (13%)
Query: 41 SPFYN-SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-----IPNNANYLIR 94
SPF SE+ + D ++ R+ + SS+++ K A I + N NY++R
Sbjct: 42 SPFTAPKSESWMNTVIDMASKDPARIRYL---SSLTAQKTVAAPIASGQQVLNVGNYVVR 98
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA 154
+ +GTP V DT +D W C C + F + SST+ +L CS +C
Sbjct: 99 VQLGTPGQTMYMVLDTSNDAAWAPCSGC----IGCSSTTTFSAQNSSTFATLDCSKPECT 154
Query: 155 SLNQKSC---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
SC V+C ++ +YG S + L +++ LG +P +FGC ++
Sbjct: 155 QARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGPNV-----IPNFSFGCISSAS 209
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP----- 266
G + G++GLG G +SLISQ + +G FSYCL S K + + + GP
Sbjct: 210 G-SSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCL---PSFKSYYFSGSLKLGPVGQPK 265
Query: 267 GVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLT-F 311
+ +TPL + Y + + ISVG + +S P++ +IDSGT +T F
Sbjct: 266 AIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPIS-PELLAFDPNTGAGTIIDSGTVITRF 324
Query: 312 LPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
+P Y + + Q P G+ + C++ N+ P +T+H G D+KL
Sbjct: 325 VPAIY-----TAVRDEFRKQVGGSFSPLGAFDTCFATNNEVSAPAITLHLSGLDLKLPME 379
Query: 370 NFFVKVSE-DIVCSVFKGI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N + S + C + V + N+ Q N + +DI + C
Sbjct: 380 NSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELC 435
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 121/445 (27%), Positives = 183/445 (41%), Gaps = 81/445 (18%)
Query: 44 YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTE 103
++S P+ L+ A + SL R +H ++ S S A+ + Y I +++GTPP
Sbjct: 45 HSSDSDPFHSLKFAASASLTRAHHLKHRNNNSPSVATTPAYPKSYGGYSIDLNLGTPPQT 104
Query: 104 RLAVADTGSDLIWTQCEP---CPPSQCYMQD-----SPLFDPKMSSTYKSLPCSSSQCAS 155
V DTGS L+W C C S C + P F PK SST K L C + +C
Sbjct: 105 SPFVLDTGSSLVWFPCTSRYLC--SHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGY 162
Query: 156 L--------------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
+ ++CS Y + YG GS + G L + + T +P
Sbjct: 163 IFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFPGKT-----VPQ 216
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTK 254
GC L + +GI G G G SL SQM +FSYCLV P SS
Sbjct: 217 FLVGCSI----LSIRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFDDTPQSSDL 269
Query: 255 I-------NFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGV-------- 296
+ + TNG+ P S P T K +Y LT+ + VG + + +
Sbjct: 270 VLQISSTGDTKTNGLSYTP-FRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPG 328
Query: 297 --STPDIVIDSGTTLTFLPQG-YN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
++DSG+T TF+ + YN + + A+ L C++ + +
Sbjct: 329 SDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVK 388
Query: 351 QV--PEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNSVP--------IYGNIM 398
V PE+T F+ GA + N+F V + ++VC + P I GN
Sbjct: 389 TVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQ 448
Query: 399 QTNFLVGYDIEQQTVSFKPTDCTKQ 423
Q NF + YD+E + F P C ++
Sbjct: 449 QQNFYIEYDLENERFGFGPRSCRRK 473
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 165/363 (45%), Gaps = 35/363 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C CP S FDP SST +
Sbjct: 83 YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 142
Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
CS +C+ Q S CS G C Y+ YGDGS ++G ++ + + G +V +
Sbjct: 143 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 202
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKI 255
I FGC + G + GI G G D+S+ISQM + I K FS+CL
Sbjct: 203 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGG 262
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VIDSG 306
IV +V +PL ++ Y L + +ISV + L + P++ ++DSG
Sbjct: 263 ILVLGEIVE-EDIVYSPLVPSQPHYNLNLQSISVNGKSLAID-PEVFATSTNRGTIVDSG 320
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGA-D 363
TTL +L + +S ++ + +Q V CY S + P V+++F G
Sbjct: 321 TTLAYLAEEAYDPFVSAITEAV-SQSVRPLLSKGTQCYLITSSVKGIFPTVSLNFAGGVS 379
Query: 364 VKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ L ++ ++ + + C F+ I + I G+++ + + YD+ Q + +
Sbjct: 380 MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANY 439
Query: 419 DCT 421
DC+
Sbjct: 440 DCS 442
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 119/441 (26%), Positives = 189/441 (42%), Gaps = 88/441 (19%)
Query: 50 PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTERLAVA 108
P++ + L+ SLNR H S S++ + P + Y + ++ GTPP +
Sbjct: 90 PFKTINLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIF 149
Query: 109 DTGSDLIWTQCEP---CPPSQC---YMQDSPL--FDPKMSSTYKSLPCSSSQCASL---- 156
DTGS L+W C C S+C Y+ + + F PK+SS+ K + C + +CA +
Sbjct: 150 DTGSSLVWFPCTAGYRC--SRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPN 207
Query: 157 --------NQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
N KS CS Y + YG G+ + G L +ET+ L + +P GC
Sbjct: 208 LKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAGILLSETLDL-----ENKRVPDFLVGC 261
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKI---- 255
+ + GI G G G SL SQMR +FS+CLV PVSS +
Sbjct: 262 SV----MSVHQPAGIAGFGRGPESLPSQMRLK---RFSHCLVSRGFDDSPVSSPLVLDSG 314
Query: 256 ----NFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVS----TPD----- 300
T + P + ++ A + +Y L++ I +G + + PD
Sbjct: 315 SESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNG 374
Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-------LELCYSF---NSL 349
+IDSG+T TFL + + ++ +E Q V P L C++
Sbjct: 375 GAIIDSGSTFTFL----DKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEES 430
Query: 350 SQVPEVTIHFR-GADVKLSRSNFFVKVS-EDIVC-------SVFKGITNSVPIYGNIMQT 400
++ P+V + F+ G + L+ N+ V+ E +VC +V G I G Q
Sbjct: 431 AEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILGAFQQQ 490
Query: 401 NFLVGYDIEQQTVSFKPTDCT 421
N LV YD+ +Q + F+ CT
Sbjct: 491 NVLVEYDLAKQRIGFRKQKCT 511
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 167/363 (46%), Gaps = 36/363 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP + DTGSD++W + C CP S FDP S T +
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS +C+ Q S C+ N C Y+ YGDGS ++G ++ + + G +V +
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTK 254
I FGC T G + GI G G D+S+ISQ+ + FS+CL S
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGG 269
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
IV P +V TPL ++ Y L + +I V Q L + S +IDSG
Sbjct: 270 GILVLGEIVE-PNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSG 328
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY-SFNSLSQV-PEVTIHFRGA- 362
TTL +L + +S ++S + P P S CY + +S++ V P+V+++F G
Sbjct: 329 TTLAYLTEAAYDPFISAITSTVS--PSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGT 386
Query: 363 DVKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+ L ++ ++ S + C F+ I + I G+++ + + YDI Q + +
Sbjct: 387 SMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYDIAGQRIGWAN 446
Query: 418 TDC 420
DC
Sbjct: 447 YDC 449
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 161/367 (43%), Gaps = 43/367 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +GTP + DTGSD++W C CP + L+ P SST +
Sbjct: 74 YFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRVT 133
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
C+ C S G C+Y V+YGDGS + G + V L TG Q + G
Sbjct: 134 CNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNG 193
Query: 202 -ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKI 255
I FGCG G + + GI+G G + S+ISQ+ ++ + F++CL ++ I
Sbjct: 194 SIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGI 253
Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSGT 307
F +V P V +TPL + Y + + AI V N+ L + T +IDSGT
Sbjct: 254 -FAIGEVVQ-PKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGTIIDSGT 311
Query: 308 TLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-D 363
TL + P L+S + S ++ V + E Y N P VT HF +
Sbjct: 312 TLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFE--YDGNVDDGFPTVTFHFEDSLS 369
Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQQTVS 414
+ + + + + C G NS + + G+++ N LV YD+E QT+
Sbjct: 370 LTVYPHEYLFDIDSNKWCV---GWQNSGAQSRDGKDMILLGDLVLQNRLVMYDLENQTIG 426
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 427 WTEYNCS 433
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 154/367 (41%), Gaps = 44/367 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM-QDSPLFDPKMSSTYKSLPC 148
+Y+ R +GTPP L D +D W C C C SP FDP SSTY+ + C
Sbjct: 99 SYVARARLGTPPQTLLVAIDPSNDAAWVPCSAC--LGCAPGASSPSFDPTQSSTYRPVRC 156
Query: 149 SSSQCASLNQKSCS-----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QCA + + S G +C +++SY + + L + ++L + G AV T
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALSLSDSNGAAVPDDHYT 215
Query: 204 FGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
FGC T +GG + G+VG G G +S +SQ + T FSYCL S+ NF +
Sbjct: 216 FGCLRVVTGSGG--SVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS--NF-SG 270
Query: 261 GIVSGPG-----VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------- 301
+ GP + +TPL + Y + + + V + + + +
Sbjct: 271 TLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGT 330
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR- 360
++D+GT T L + L + + A P A G + CY N VP V F
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGVSA-PAAPALGGFDTCYYVNGTKSVPAVAFVFAG 389
Query: 361 GADVKLSRSNFFV-KVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
GA V L N + S + C G+ + + ++ Q N V +D+ V
Sbjct: 390 GARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVG 449
Query: 415 FKPTDCT 421
F CT
Sbjct: 450 FSRELCT 456
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 106/389 (27%), Positives = 160/389 (41%), Gaps = 27/389 (6%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----YLIRISIGTPPTERLAVAD 109
+R L R RL ++ +S SK IIP + Y + +GTP T + D
Sbjct: 170 VRSDLQRQKRRLGG-GKHQLLSFSK--DGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALD 226
Query: 110 TGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
TGSDL W C+ C P Y +D ++ P S+T + LPCS C + +
Sbjct: 227 TGSDLFWIPCDCIECAPLSGYHGSLDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQK 286
Query: 164 VNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTG 220
C Y+ Y + + S+G L + + L S A + GCG G L G
Sbjct: 287 QPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPVKASVIIGCGRKQSGSYLDGIAPDG 346
Query: 221 IVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT 278
++GLG DIS+ S + + FS C S +I FG G+ + PL
Sbjct: 347 LLGLGMADISVPSFLARAGLVRNSFSMCFT-KDSGRIFFGDQGVSTQQSTPFVPLYGKLQ 405
Query: 279 FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
Y + +D VG++ ++ ++DSGT+ T LP + + A +
Sbjct: 406 TYTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEAT 465
Query: 339 SLELCYSFNSL--SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYG 395
S + CYS + L VP VT+ F G + F + E V + S G
Sbjct: 466 SFDYCYSASPLVMPDVPTVTLTFAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIG 525
Query: 396 NIMQTNFLVGY----DIEQQTVSFKPTDC 420
I Q NFL+GY D E + + ++C
Sbjct: 526 IIAQ-NFLLGYHVVFDRENMKLGWYRSEC 553
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 165/370 (44%), Gaps = 47/370 (12%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
D + N Y R+ IGTPP E + D+GS + + C C QC P F P +SS+
Sbjct: 81 DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRFQPDLSSS 138
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
Y + C+ ++K C+ Y Y + S S+G L + V+ G + +
Sbjct: 139 YSPVKCNVDCTCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQRA 191
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGT 259
FGC + G LF+ GI+GLG G +S++ Q+ + I+ FS C ++ G
Sbjct: 192 VFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY-----GGMDIGG 246
Query: 260 NGIVSGPGV---------VSTPLTKAKTFYVLTIDAISVGNQRLGV------STPDIVID 304
+V G GV S PL +Y + + I V + L V S V+D
Sbjct: 247 GAMVLG-GVPAPSDMVFSHSDPLRSP--YYNIELKEIHVAGKALRVDSRVFNSKHGTVLD 303
Query: 305 SGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVT 356
SGTT +LP Q + + +V S + + + P + ++C++ + L +V P+V
Sbjct: 304 SGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVD 363
Query: 357 IHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
+ F G + L+ N+ KV VF+ + + G I+ N LV YD +
Sbjct: 364 MVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEK 423
Query: 413 VSFKPTDCTK 422
+ F T+C++
Sbjct: 424 IGFWKTNCSE 433
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 160/376 (42%), Gaps = 64/376 (17%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ + IGTPP + + DTGS L W QC P + S +FDP +SS++ LPC+
Sbjct: 83 LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRK--PPPSSVFDPSLSSSFSVLPCNHP 140
Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C +L C YS Y DG+ + GNL E +T ++ + P + G
Sbjct: 141 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITF----SRSQSTPPLILG 196
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
C +S GI+G+ G +S SQ + T KFSYC VP + F G +
Sbjct: 197 CAEE-----SSDAKGILGMNLGRLSFASQAKLT---KFSYC-VPTRQVRPGFTPTGSFYL 247
Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----TPD----- 300
P TF Y + + I +GNQ+L + PD
Sbjct: 248 GENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAG 307
Query: 301 -IVIDSGTTLTFL-PQGYN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QV 352
+IDSG+ T+L + YN ++ ++ + ++ V G ++C++ N++ +
Sbjct: 308 QTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYG--GVSDMCFNGNAIEIGRLI 365
Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGY 406
+ F +G ++ + + V + C S G ++ I GN Q N V +
Sbjct: 366 GNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNIWVEF 423
Query: 407 DIEQQTVSFKPTDCTK 422
D+ + V F DC++
Sbjct: 424 DLANRRVGFGKADCSR 439
>gi|297818124|ref|XP_002876945.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322783|gb|EFH53204.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 206
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 76/134 (56%), Gaps = 9/134 (6%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+F + L+ F S I A +VELIHRDSP SP YN T L RS+
Sbjct: 70 SFFEVILHLYTAIFCFSSTI-ANRENLTVELIHRDSPHSPLYNPHHTVSDGLNATFLRSI 128
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R FN + + Q+ +I N YL+ ISIGTPP++ LA+ADTGSDL W QC+P
Sbjct: 129 SRSRRFNTKTDL------QSGLISNGGEYLMSISIGTPPSKVLAIADTGSDLTWVQCKPY 182
Query: 123 PPSQCYMQDSPLFD 136
QCY Q+SPLFD
Sbjct: 183 --QQCYKQNSPLFD 194
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 169/372 (45%), Gaps = 40/372 (10%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P
Sbjct: 44 QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP--- 98
Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ + +PC+++ C +L N K S C Y + Y D + S G L ++ +L +
Sbjct: 99 TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158
Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
PG+TFGCG + G + G++GLG G +SL+SQ++ K +CL
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + +V V P+ + + +Y + + LGV ++V DSG
Sbjct: 217 TNGGGFLFFGDD-VVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275
Query: 307 TTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVT 356
+T T F Q Y + + ++ + ++ + V+DPT L LC+ F+ ++ +
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMF 333
Query: 357 IHF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQ 410
+ F + A +++ N+ + VC + G S + G+I + +V YD E+
Sbjct: 334 LSFASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEK 393
Query: 411 QTVSFKPTDCTK 422
+ + CT+
Sbjct: 394 SQLGWARGACTR 405
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 169/372 (45%), Gaps = 40/372 (10%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P
Sbjct: 44 QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP--- 98
Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ + +PC+++ C +L N K S C Y + Y D + S G L ++ +L +
Sbjct: 99 TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158
Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
PG+TFGCG + G + G++GLG G +SL+SQ++ K +CL
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + +V V P+ + + +Y + + LGV ++V DSG
Sbjct: 217 TNGGGFLFFGDD-VVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275
Query: 307 TTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVT 356
+T T F Q Y + + ++ + ++ + V+DPT L LC+ F+ ++ +
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMF 333
Query: 357 IHF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQ 410
+ F + A +++ N+ + VC + G S + G+I + +V YD E+
Sbjct: 334 LSFSSAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEK 393
Query: 411 QTVSFKPTDCTK 422
+ + CT+
Sbjct: 394 SQLGWARGACTR 405
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 171/388 (44%), Gaps = 67/388 (17%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQC------EPCPPSQCYMQDSPLFDPKMS 140
+N + + +++GTPP V DTGS+L W C + M +S F P+ S
Sbjct: 59 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGES--FRPRAS 116
Query: 141 STYKSLPCSSSQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+T+ ++PC S+QC+S + SC G + C S+SY DGS S+G LAT+ +G
Sbjct: 117 ATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPL 176
Query: 196 AVALPGITFGCGTN--NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
A FGC + + T G++G+ G +S ++Q T +FSYC+
Sbjct: 177 RSA-----FGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTR---RFSYCISDRDDA 228
Query: 254 KINFGTNGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD- 300
+ + + + TPL + + Y + + I VG + L V PD
Sbjct: 229 GVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDH 288
Query: 301 -----IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFN 347
++DSGT TFL + L ++ A + DP+ +L+ C+
Sbjct: 289 TGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRA--LDDPSFAFQEALDTCFRVP 346
Query: 348 S-----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVP---- 392
+ +++P VT+ F GA++ ++ KV ++ + C F G + VP
Sbjct: 347 AGRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF-GNADMVPLTAY 405
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ G+ Q N V YD+E+ V P C
Sbjct: 406 VIGHHHQMNLWVEYDLERGRVGLAPVKC 433
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 161/351 (45%), Gaps = 30/351 (8%)
Query: 95 ISIGTPPTERLAVADTGSDLIWT--QCEPCPPSQCYMQD---SPL--FDPKMSSTYKSLP 147
I IGTP + L V DTGSDL+W +CE C P +D S L + P +SST K +
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT--LGSTTGQAVALPGITFG 205
CS C + C Y ++Y + S E + + G V LP + G
Sbjct: 175 CSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNPVKLP-VYLG 233
Query: 206 CGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNG 261
CG G L + G++GLG DIS+ +++ +T +A FS C+ P S + FG G
Sbjct: 234 CGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEG 293
Query: 262 IVSGPGVVSTPLTKAKT----FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYN 317
+ +TP+ Y++ ID+I+VGN L +++ + D+GT+ T+L +
Sbjct: 294 PAAQ---RTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMAS-HALFDTGTSFTYLSKTVY 349
Query: 318 SNLLSVMSSMIEAQPVADPTGS-LELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKV 375
+ + + DP S +LCY + N+ QVP V++ G + L + +
Sbjct: 350 PQFVQAYDAQMSLPKWNDPRFSKWDLCYQTSNTNFQVPVVSLALSGGN-SLDVVSGLKSI 408
Query: 376 SED-----IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+D VC + I G TN+ + Y+ + T+ + P+DC+
Sbjct: 409 VDDNNAMIAVCVTVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGWTPSDCS 459
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 117/441 (26%), Positives = 190/441 (43%), Gaps = 97/441 (21%)
Query: 45 NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTER 104
N S+ Q+L ++ SL R +H + S Y I +S GTPP
Sbjct: 38 NPSQDHLQKLNYLVSTSLARAHHLKNPQTTPVFSHSYG-------GYSISLSFGTPPQTL 90
Query: 105 LAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--- 158
V DTGS +W C C + SP F PK SS+ K + C + +C+ ++Q
Sbjct: 91 SFVMDTGSSFVWFPCTLRYLCNNCSFTSRISP-FLPKHSSSSKIIGCKNPKCSWIHQTDL 149
Query: 159 ---------KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
++CS + Y + YG G+ + G +ET+ L + +P GC
Sbjct: 150 RCTDCDNNSRNCSQICPPYLILYGSGT-TGGVALSETLHL-----HGLIVPNFLVGC--- 200
Query: 210 NGGLFNSKT-TGIVGLGGGDISLISQMRTTIAGKFSYCLV--------PVSSTKINFGTN 260
+F+S+ GI G G G SL SQ+ T KFSYCL+ SS ++ ++
Sbjct: 201 --SVFSSRQPAGIAGFGRGPSSLPSQLGLT---KFSYCLLSHKFDDTQESSSLVLDSQSD 255
Query: 261 GIVSGPGVVSTPLTKA---------KTFYVLTIDAISVGNQRLGVS----TPD------I 301
++ TPL K +Y +++ IS+G + + + +PD
Sbjct: 256 SDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGT 315
Query: 302 VIDSGTTLT-------------FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
+IDSGTT T F+ Q N ++ ++ +P + +G+ EL
Sbjct: 316 IIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKEL------ 369
Query: 349 LSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVP-------IYGNIMQ 399
++P++ +HF+ GADV+L N+F + S ++ C F +T+ I GN
Sbjct: 370 --ELPQLRLHFKGGADVELPLENYFAFLGSREVAC--FTVVTDGAEKASGPGMILGNFQM 425
Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
NF V YD++ + + FK C
Sbjct: 426 QNFYVEYDLQNERLGFKKESC 446
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 162/370 (43%), Gaps = 50/370 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G P E DTGSD++W C P CP S + LFD SS+ + LP
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 148 CSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C+ CA++ +Q +C YS Y D S ++G T+++ G+ A +
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSA 203
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
I FGC G T GI G G G+ S+ISQ+ R FS+CL
Sbjct: 204 TIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL-------- 255
Query: 256 NFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-------DI 301
G NG +V G P +V +PL ++ Y L + +I++ Q T +
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGET 315
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQVPEVTIH 358
+IDSGTTL +L + ++SV++S + A PT GS S + P + +
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQS--ATPTISRGSQCFRVSMSVADIFPVLRFN 373
Query: 359 FRGADVKLSRSNFFVKVSEDIVCSVFKGI--------TNSVPIYGNIMQTNFLVGYDIEQ 410
F G + +++ + C F + + + I G+++ + ++ YD+ Q
Sbjct: 374 FEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKIIVYDLAQ 433
Query: 411 QTVSFKPTDC 420
Q + + DC
Sbjct: 434 QRIGWANYDC 443
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 170/383 (44%), Gaps = 32/383 (8%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R H + + S+ S+ D + N Y R+ IGTPP + D+GS + + C C
Sbjct: 66 HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QC P F P++SSTY+ + C + C + K C Y Y + S S G L
Sbjct: 126 --EQCGKHQDPKFQPELSSTYQPVKC-NMDCNCDDDKE----QCVYEREYAEHSSSKGVL 178
Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTI 239
+ ++ G+ + + FGC T G L++ + GI+GLG GD+SL+ Q+ + I
Sbjct: 179 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLI 236
Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVST 298
+ F C + + G ++ T ++ +Y + + I V ++L +++
Sbjct: 237 SNSFGLCYGGMDVGGGSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNS 296
Query: 299 ------PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFN 347
V+DSGTT +LP + + +VM + + + P + + L + N
Sbjct: 297 RVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASN 356
Query: 348 SLSQV----PEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQ 399
+S++ P V + F+ G LS N+ KV VF + + G I+
Sbjct: 357 DVSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVV 416
Query: 400 TNFLVGYDIEQQTVSFKPTDCTK 422
N LV YD E V F T+C++
Sbjct: 417 RNTLVVYDRENSKVGFWRTNCSE 439
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 163/365 (44%), Gaps = 47/365 (12%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C QC P F P+ SSTY+ +
Sbjct: 81 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 138
Query: 148 CS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+ C S + C Y Y + S S+G L + ++ G+ + +A FGC
Sbjct: 139 CTIDCNCDS------DRMQCVYERQYAEMSTSSGVLGEDLISFGNQS--ELAPQRAVFGC 190
Query: 207 -GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIV 263
G L++ GI+GLG GD+S++ Q+ + I+ FS C ++ G +V
Sbjct: 191 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCY-----GGMDVGGGAMV 245
Query: 264 SGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRLGVST------PDIVIDSGTTL 309
G +S P A +Y + + I V +RL ++ V+DSGTT
Sbjct: 246 LGG--ISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTY 303
Query: 310 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQ-VPEVTIHFR- 360
+LP+ + + +++ + + ++ P + ++C+S + LS+ P V + F
Sbjct: 304 AYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFEN 363
Query: 361 GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
G LS N+ KV VF+ + + G I+ N LV YD EQ + F
Sbjct: 364 GQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWK 423
Query: 418 TDCTK 422
T+C +
Sbjct: 424 TNCAE 428
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 113/415 (27%), Positives = 178/415 (42%), Gaps = 101/415 (24%)
Query: 89 ANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQD---------------- 131
++Y + ++G+ P + + + DTGSDL+W PC P +C + +
Sbjct: 73 SDYTLSFNLGSNPPQLITLYMDTGSDLVWF---PCSPFECILCEGKPQTTKPANITKQTH 129
Query: 132 -----SPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNC-QYSVSYGDGSFSNGNLA 183
SP +S S C+ S+C + CS +C + +YGDGSF NL
Sbjct: 130 SVSCQSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFV-ANLY 188
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT---TIA 240
+T++L S + L TFGC ++ TG+ G G G +SL +Q+ T +
Sbjct: 189 QQTLSLSS-----LHLQNFTFGCAHTA----LAEPTGVAGFGRGILSLPAQLSTLSPHLG 239
Query: 241 GKFSYCLVPVS---------STKINFGTNGIVSGPG-----------VVSTPLTKAKTFY 280
+FSYCLV S S I N ++G G ++S P K +Y
Sbjct: 240 NRFSYCLVSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNP--KHPYYY 297
Query: 281 VLTIDAISVGNQRLGVSTPDI------------VIDSGTTLTFLPQGYNSNLLSVMSSMI 328
+ + ISVG + V P+I V+DSGTT T LP+ + + +++ +
Sbjct: 298 CVGLAGISVGKRT--VPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRV 355
Query: 329 -----EAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG--ADVKLSRSNFF--------- 372
A + TG L CY N LSQ+P + +HF G +DV L R N+F
Sbjct: 356 NRFHKRASEIETKTG-LGPCYYLNGLSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDG 414
Query: 373 VKVSEDIVCSVFKGITNSVPI-------YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
++ + C + + + GN Q F V YD+E++ V F +C
Sbjct: 415 IRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 159/357 (44%), Gaps = 38/357 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++R +GTPP V DT +D +W C C S C S F+ SSTY ++ CS
Sbjct: 104 NYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGC--SGC-SNASTSFNTNSSSTYSTVSCS 160
Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
++QC +C C ++ SYG S + NL +T+TL +P +F
Sbjct: 161 TTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPD-----VIPNFSF 215
Query: 205 GCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN-GI 262
GC + G NS G++GLG G +SL+SQ + +G FSYCL S + G+
Sbjct: 216 GCINSASG--NSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 273
Query: 263 VSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTT 308
+ P + TPL + + Y + + +SVG+ ++ V S +IDSGT
Sbjct: 274 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTV 333
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
+T Q + + G+ + C+S ++ + P++T+H D+KL
Sbjct: 334 ITRFAQPVYEAIRDEFRKQVNGS--FSTLGAFDTCFSADNENVTPKITLHMTSLDLKLPM 391
Query: 369 SNFFVKVSED-IVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N + S + C GI + + + N+ Q N + +D+ + P C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 176/394 (44%), Gaps = 53/394 (13%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G ++S G + T+T+ +G + + FGC + ++ GI G G S
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259
Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
Q+ ++ K FSYCL P TK + G + TPL ++ + Y LT++
Sbjct: 260 QLAGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 318
Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
+ QRL S+ ++++DSG +T L + GY+ + S I
Sbjct: 319 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378
Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
D +G F++ S +P + I F GA + L N F +C F +
Sbjct: 379 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 438
Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN + +F +DI+ + FK C
Sbjct: 439 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 163/367 (44%), Gaps = 50/367 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP + DTGS W C+ CP ++ +DP+ S + K +
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 142
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
C + C S + C+ + C Y Y DG + G L T+ + G P +T
Sbjct: 143 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 200
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
FGCG G N+ GI+G G + + +SQ+ AGK FS+CL + I
Sbjct: 201 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 257
Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F +V P V +TP+ K + ++++ + +I+V L + T IDSG+
Sbjct: 258 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-------QVPEVTIHFR 360
TL +LP+ + S +I A P ++ Y+F + P++T HF
Sbjct: 317 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 369
Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L ++ ++ + C F+ GI + I G+++ +N +V YD+E+Q +
Sbjct: 370 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 428
Query: 415 FKPTDCT 421
+ +C+
Sbjct: 429 WTEHNCS 435
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 168/368 (45%), Gaps = 46/368 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSL 146
Y +I +GTPP DTGSD+ W C PC Q + +DP SST +L
Sbjct: 37 YYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGAL 96
Query: 147 PCSSSQCASL---NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALP 200
C S C + N+ SC+ C YS +YGDGS + G + +T Q
Sbjct: 97 SCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGTA 156
Query: 201 GITFGCGTNNGG--LFNSKT-TGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKI 255
+ FGCGT G L +S+ G++G G +S+ SQ+ + + +F++CL
Sbjct: 157 SVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCL---QGDNQ 213
Query: 256 NFGT--NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-----------DIV 302
GT G VS P + TP+ ++ Y + + I+V + V+TP ++
Sbjct: 214 GGGTIVIGSVSEPNISYTPIV-SRNHYAVGMQNIAVNGRN--VTTPASFDTTSTSAGGVI 270
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-RG 361
+DSGTTL +L + ++ +S+ E+ + + L+L + + + P V + F G
Sbjct: 271 MDSGTTLAYLVDPAYTQFVNAVST-FESSMFSSHSQCLQLAWC-SLQADFPTVKLFFDAG 328
Query: 362 ADVKLSRSNFF----VKVSEDIVCSVFKGITN-----SVPIYGNIMQTNFLVGYDIEQQT 412
A + L+ N+ ++ + C ++ T S I G+I+ + LV YD + +
Sbjct: 329 AVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRV 388
Query: 413 VSFKPTDC 420
V +K DC
Sbjct: 389 VGWKSFDC 396
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 176/394 (44%), Gaps = 53/394 (13%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G ++S G + T+T+ +G + + FGC + ++ GI G G S
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259
Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
Q+ ++ K FSYCL P TK + G + TPL ++ + Y LT++
Sbjct: 260 QLAGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 318
Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
+ QRL S+ ++++DSG +T L + GY+ + S I
Sbjct: 319 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378
Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
D +G F++ S +P + I F GA + L N F +C F +
Sbjct: 379 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 438
Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN + +F +DI+ + FK C
Sbjct: 439 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 149/350 (42%), Gaps = 23/350 (6%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
Y + +GTP T L DTGSDL W C+ C P Y +D ++ P S+T +
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 155
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
LPCS C S+ + C Y++ Y + + S+G L +T+ L +
Sbjct: 156 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 215
Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG G L G++ LG DIS+ S + + FS C SS +I FG
Sbjct: 216 IGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
G+ S PL Y + +D +G++ L ++ ++DSGT+ T LP
Sbjct: 276 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVYKA 335
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
+ A V + + CYS + L VP +T+ F AD L N + ++
Sbjct: 336 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 394
Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
+ + ++ PI I+ NFLVGY D E + + ++C
Sbjct: 395 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 442
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 169/383 (44%), Gaps = 32/383 (8%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R H + + S+ S+ D + N Y R+ IGTPP + D+GS + + C C
Sbjct: 65 HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 124
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QC P F P+MSSTY+ + C+ C + + C Y Y + S S G L
Sbjct: 125 --EQCGKHQDPKFQPEMSSTYQPVKCNMD-CNCDDDRE----QCVYEREYAEHSSSKGVL 177
Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTI 239
+ ++ G+ + + FGC T G L++ + GI+GLG GD+SL+ Q+ + I
Sbjct: 178 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLI 235
Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVST 298
+ F C + + G +V T ++ +Y + + I V ++L + +
Sbjct: 236 SNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHS 295
Query: 299 ------PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS----- 345
V+DSGTT +LP + + +VM + + + P + + C+
Sbjct: 296 RVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASN 355
Query: 346 -FNSLSQV-PEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQ 399
+ LS++ P V + F+ G LS N+ KV VF + + G I+
Sbjct: 356 YVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVV 415
Query: 400 TNFLVGYDIEQQTVSFKPTDCTK 422
N LV YD E V F T+C++
Sbjct: 416 RNTLVVYDRENSKVGFWRTNCSE 438
>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
Length = 392
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 102/190 (53%), Gaps = 18/190 (9%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C S+C+ QD+PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
+ +GIVGLG SL++Q T FSYCL P + + + G++ ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGRNSALFLGSSAKLA 217
Query: 265 GPG-VVSTPL 273
G G STP
Sbjct: 218 GGGKAASTPF 227
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/427 (25%), Positives = 184/427 (43%), Gaps = 43/427 (10%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-NQNSSISSSKA 79
P T + ++HR+ P +P +S+ P +R AL R+ N+ SS + +A
Sbjct: 52 PNSPSTSTIRLTILHREHPCAP---ASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEA 108
Query: 80 SQADIIPNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+ + +I N +Y+ ++ +GTP + DT S L W CEPC + C + P
Sbjct: 109 TASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPC-INACLI---P 164
Query: 134 LFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATET 186
F+P SSTYK + C S+ C A++ +KSC C Y SY D S S G ++++T
Sbjct: 165 TFNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDT 224
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF--- 243
+T G + + + FGC G+ + +GI+G+ SL SQM T+ ++
Sbjct: 225 LTYGLGSQKFI------FGCCNLFRGV-GGRYSGILGMSVNKFSLFSQM--TVGHRYRAM 275
Query: 244 SYCL-VPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV----LTIDAISVGNQRLGVST 298
SYC P + + FG + ++V + ++ +S+ Q G T
Sbjct: 276 SYCFPHPRNQGFLQFGRYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSLDVQSSGNQT 335
Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYSFNSLS---QVPE 354
D+GT T LPQ +L + +++E V TG N + +P
Sbjct: 336 MRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGYYRVGASTGQTCFQADGNWIEGDLYMPT 395
Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
V I F+ GA + L+ + ++ C FK + G+ D+E T+
Sbjct: 396 VKIEFQNGARITLNSEDLMFMEEPNVFCLAFKMNDGGDIVLGSRHLMGVHTVVDLEMMTM 455
Query: 414 SFKPTDC 420
+ C
Sbjct: 456 GLRGQGC 462
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 161/357 (45%), Gaps = 37/357 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++R +GTPP V DT +D +W C C S C S F+ SSTY ++ CS
Sbjct: 103 NYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGC--SGCS-NASTSFNTNSSSTYSTVSCS 159
Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
++QC +C + C ++ SYG S + +L +T+TL +P +F
Sbjct: 160 TAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDV-----IPNFSF 214
Query: 205 GCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN-GI 262
GC + G NS G++GLG G +SL+SQ + +G FSYCL S + G+
Sbjct: 215 GCINSASG--NSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 272
Query: 263 VSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTT 308
+ P + TPL + + Y + + +SVG+ ++ V S +IDSGT
Sbjct: 273 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTV 332
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
+T Q + + + G+ + C+S ++ + P++T+H D+KL
Sbjct: 333 ITRFAQPVYEAIRDEFRKQVNVSSFST-LGAFDTCFSADNENVAPKITLHMTSLDLKLPM 391
Query: 369 SNFFVKVSE-DIVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N + S + C GI + + + N+ Q N + +D+ + P C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 154/392 (39%), Gaps = 60/392 (15%)
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSP---------- 133
I + YL+ + IGTP V DT +DL W C + Y + S
Sbjct: 119 IAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEG 178
Query: 134 -------LFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFSNGNL 182
+ P SS+++ + CS +CA L +C +C Y DG+ + G
Sbjct: 179 AKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIY 238
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
E T+ + G+ LPG+ GC G G++ LG GD+S +
Sbjct: 239 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQR 298
Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
FS+CL+ +S++ + FG N V GPG + T + K Y + + VG +RL
Sbjct: 299 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERL 358
Query: 295 GVSTPD------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
+ PD +++D+ T++T +P+ Y + + + + + P E
Sbjct: 359 DI--PDEVWDAERFVGGGVILDTSTSVTSLVPEAY-APVTAALDRHLSHLPRVYELEGFE 415
Query: 342 LCYSFNSLSQ---------VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGITN 389
CY + +P T+ G +L ++S +V + C F+ +
Sbjct: 416 YCYKWTFTGDGVDPAHNVTIPSFTVEMAGG-ARLEPEAKSVVMPEVEPGVACLAFRKLLR 474
Query: 390 SVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
P I GN+ ++ D + F+ C
Sbjct: 475 GGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 506
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 161/358 (44%), Gaps = 37/358 (10%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
NY++R +GTPP V DT +D +W C C S C S F+ SSTY ++ C
Sbjct: 28 GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGC--SGC-SNASTSFNTNSSSTYSTVSC 84
Query: 149 SSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S++QC +C + C ++ SYG S + +L +T+TL +P +
Sbjct: 85 STAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD-----VIPNFS 139
Query: 204 FGCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN-G 261
FGC + G NS G++GLG G +SL+SQ + +G FSYCL S + G
Sbjct: 140 FGCINSASG--NSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 197
Query: 262 IVSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGT 307
++ P + TPL + + Y + + +SVG+ ++ V S +IDSGT
Sbjct: 198 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGT 257
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 367
+T Q + + + G+ + C+S ++ + P++T+H D+KL
Sbjct: 258 VITRFAQPVYEAIRDEFRKQVNVSSFST-LGAFDTCFSADNENVAPKITLHMTSLDLKLP 316
Query: 368 RSNFFVKVSED-IVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
N + S + C GI + + + N+ Q N + +D+ + P C
Sbjct: 317 MENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 374
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 161/365 (44%), Gaps = 35/365 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G P E DTGSD++W C P CP S F+P SST +
Sbjct: 91 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150
Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
CS +C + Q C N C Y+ +YGDGS ++G ++T+ + G A
Sbjct: 151 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 210
Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
+ I FGC + G + GI G G +S+ISQ+ + ++ K FS+CL S
Sbjct: 211 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-KGS 269
Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
G + PG+V TPL ++ Y L +++I+V Q+L + +T ++
Sbjct: 270 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIV 329
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGA 362
DSGTTL +L G +S +++ + + GS S + S P VT++F G
Sbjct: 330 DSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG 389
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+ ++ + SV I + I G+++ + + YD+ + +
Sbjct: 390 VAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWA 449
Query: 417 PTDCT 421
DC+
Sbjct: 450 DYDCS 454
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 161/365 (44%), Gaps = 35/365 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G P E DTGSD++W C P CP S F+P SST +
Sbjct: 89 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148
Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
CS +C + Q C N C Y+ +YGDGS ++G ++T+ + G A
Sbjct: 149 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 208
Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
+ I FGC + G + GI G G +S+ISQ+ + ++ K FS+CL S
Sbjct: 209 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-KGS 267
Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
G + PG+V TPL ++ Y L +++I+V Q+L + +T ++
Sbjct: 268 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIV 327
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGA 362
DSGTTL +L G +S +++ + + GS S + S P VT++F G
Sbjct: 328 DSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG 387
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+ ++ + SV I + I G+++ + + YD+ + +
Sbjct: 388 VAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWA 447
Query: 417 PTDCT 421
DC+
Sbjct: 448 DYDCS 452
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/332 (28%), Positives = 136/332 (40%), Gaps = 16/332 (4%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ----DSPLFDPKMSSTYK 144
Y + +GTP T + DTGSDL W C+ C P Y + D ++ P S+T +
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYKPAESTTSR 202
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
LPCS C + S C YS Y + + S+G L + + L S A +
Sbjct: 203 HLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKASVV 262
Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG G L G++GLG DIS+ S + + FS C S +I FG
Sbjct: 263 IGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFK-EDSGRIFFGD 321
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
G+ PL Y + +D VG++ ++ + ++DSGT+ T LP
Sbjct: 322 QGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSGTSFTALPLNVYKA 381
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKVS 376
+ + A + S E CYS + L VP VT+ F + +K
Sbjct: 382 VAVEFDKQVHAPRITQEDASFEYCYSASPLKMPDVPTVTLTFAANKSFQAVNPTIVLKDG 441
Query: 377 EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
E V + S G I Q NFL GY I
Sbjct: 442 EGSVAGFCLALQKSPEPIGIIGQ-NFLTGYHI 472
>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
Length = 299
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 104/215 (48%), Gaps = 44/215 (20%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ GF V L H DS + T ++RL+ A+ R RL + ++ S + +A +
Sbjct: 38 EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
N +L+ ++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDP+ SS++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
LPCSS L S GV LATET T G + + I F
Sbjct: 149 KLPCSS----DLYHSSTQGV-----------------LATETFTFGDAS-----VSKIGF 182
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
GCG +N G S+ G+ ISQM+ +
Sbjct: 183 GCGEDNRGRAYSQGAGL---------FISQMKLDV 208
Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 5/74 (6%)
Query: 335 DPTGS--LELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN 389
D +GS LELC++ S VP++ HF G D+KL + N+ ++ S V + G ++
Sbjct: 209 DASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKENYIIEDSALRVICLTMGSSS 268
Query: 390 SVPIYGNIMQTNFL 403
+ I+GN Q N +
Sbjct: 269 GMSIFGNFQQQNIV 282
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 158/363 (43%), Gaps = 35/363 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP DTGSD++W QC+ CP + L++ S + K +
Sbjct: 80 YYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVS 139
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C ++ SG ++C Y YGDGS + G + V S G A
Sbjct: 140 CDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANG 199
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
+ FGCG G +S GI+G G + S+ISQ+ ++ + F++CL +
Sbjct: 200 SVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGG 259
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
I F +V P V TPL + Y + + A+ VG + L + +IDSG
Sbjct: 260 I-FAIGRVVQ-PKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSG 317
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 364
TTL +LP+ L+ ++S A V + C+ ++ P VT HF +
Sbjct: 318 TTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNVTFHFENSVF 376
Query: 365 KLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ ++ E + C ++ ++ + G+++ +N LV YD+E Q + +
Sbjct: 377 LRVYPHDYLFPYEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEY 436
Query: 419 DCT 421
+C+
Sbjct: 437 NCS 439
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/317 (25%), Positives = 146/317 (46%), Gaps = 29/317 (9%)
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
C+ QD P+F P SST+K PC + C S+ C+ C Y G G + G +AT+
Sbjct: 60 HCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATD 119
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
T +G+ G ++ + + +G +GLG SL++QM+ T +FSY
Sbjct: 120 TFAIGTAAPARPPASGASWRATSTPW----AGPSGFIGLGRTPWSLVAQMKLT---RFSY 172
Query: 246 CLVPVSS---TKINFGTNGIVSG-----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
CL P + +++ G + ++G P V ++P +Y + ++ I G+ + +
Sbjct: 173 CLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMP 232
Query: 298 TPDIVIDSGTTLT----FLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLSQV 352
+ T + + Y +VM+S + A P A P G+ E+C+ +S
Sbjct: 233 RGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGA 291
Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLV 404
P++ F+ GA + + +N+ V D VC I + + I G+ Q N +
Sbjct: 292 PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHL 351
Query: 405 GYDIEQQTVSFKPTDCT 421
+D+++ +SF+P DC+
Sbjct: 352 LFDLDKDMLSFEPADCS 368
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 154/360 (42%), Gaps = 44/360 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R +GTP L D +D W C C + C SP F P SSTY+++PC
Sbjct: 82 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGC-AASSPSFSPTQSSTYRTVPCG 138
Query: 150 SSQCASLNQKSCS---GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
S QCA + SC G +C ++++Y +F L +++ L + + TFGC
Sbjct: 139 SPQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNV-----VVSYTFGC 192
Query: 207 GTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN---GI 262
G NS G++G G G +S +SQ + T FSYCL S+ NF G
Sbjct: 193 LRVVSG--NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSS--NFSGTLKLGP 248
Query: 263 VSGPG-VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTT 308
+ P + +TPL + Y + + I VG++ + V + +ID+GT
Sbjct: 249 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 308
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-DVKLS 367
T L + + + PVA P G + CY N VP VT F GA V L
Sbjct: 309 FTRLAAPVYAAVRDAFRGRVRT-PVAPPLGGFDTCY--NVTVSVPTVTFMFAGAVAVTLP 365
Query: 368 RSNFFVKVSE-DIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N + S + C G+ ++ + ++ Q N V +D+ V F CT
Sbjct: 366 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 425
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 160/375 (42%), Gaps = 61/375 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMSSTYK 144
NN +L+ I +GTPP L DTG+ L + QCEPC +C+ Q +FDP S ++
Sbjct: 202 NNFLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPC-TLRCHKQTDAGEIFDPSKSESFS 260
Query: 145 SLPCSSSQCAS------LNQKSC--SGVNCQYSVSYGD-GSFSNGNLATETVTLGSTTGQ 195
+ CS ++C + L K+C +C YS+++G S+S G L + + +G +
Sbjct: 261 RVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGK-YAK 319
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVSSTK 254
+ P FGC + ++ G+VG S Q+ + K FSYC P K
Sbjct: 320 GYSFPDFLFGCSLDTE--YHQYEAGLVGFADEPFSFFEQVAPLVNYKAFSYCF-PSDRRK 376
Query: 255 INFGTNGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL 312
+ + G + TP L + ++ Y L +D + V L + ++++DSG+ T L
Sbjct: 377 TGYLSIGDYTRVNSTYTPLFLARQQSRYALKLDEVLVNGMALVTTPSEMIVDSGSRWTIL 436
Query: 313 -----------------PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS------FNSL 349
P GYN N GS +C+ F+
Sbjct: 437 LSDTFTQLDAAITEAMRPLGYNRNYYR---------------GSDYICFEDAHFQQFSDW 481
Query: 350 SQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVG 405
+ +P V + F G + L + F ++ +C+ F + + V + GN M + +
Sbjct: 482 AALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGSGVQLLGNTMTRSVGIT 541
Query: 406 YDIEQQTVSFKPTDC 420
+DI+ F+ DC
Sbjct: 542 FDIQGGQFGFRKGDC 556
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 177/369 (47%), Gaps = 37/369 (10%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P +
Sbjct: 49 GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105
Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
K +PC++S C +L N+K + C Y + Y D + S G L T++ +L +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSL-PLRNK 162
Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVP 249
+ P ++FGCG + G + T G++GLG G +SL+SQ++ K +CL
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DIVIDSGT 307
+ FG + +V V P+ ++ + + + ++ R +ST ++V DSG+
Sbjct: 223 SGGGFLFFGDD-MVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGS 281
Query: 308 TLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF 359
T T+ Q Y + + ++ S+ ++ + V+DP SL LC+ +F S+S V + ++ F
Sbjct: 282 TYTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQF 339
Query: 360 ---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 413
+ A +++ N+ + VC + G S I G+I + +V YD E+ +
Sbjct: 340 IFGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQL 399
Query: 414 SFKPTDCTK 422
+ C++
Sbjct: 400 GWIRGSCSR 408
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 164/367 (44%), Gaps = 47/367 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G P E DTGSD++W C P CP S + LFD SS+ + LP
Sbjct: 84 YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143
Query: 148 CSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C+ CA++ +Q +C YS Y D S ++G T+++ G+ A +
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSA 203
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
I FGC G T GI G G G+ S+ISQ+ R FS+CL
Sbjct: 204 TIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL-------- 255
Query: 256 NFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-------DI 301
G NG +V G P +V +PL ++ Y L + +I++ Q T +
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGET 315
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQVPEVTIH 358
+IDSGTTL +L + ++SV++S + A PT GS S + P + +
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQS--ATPTISRGSQCFRVSMSVADIFPVLRFN 373
Query: 359 FRG-ADVKLSRSNFF----VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
F G A + ++ + + + C F+ + + I G+++ + ++ YD+ +Q +
Sbjct: 374 FEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDLARQRI 433
Query: 414 SFKPTDC 420
+ DC
Sbjct: 434 GWANYDC 440
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 155/358 (43%), Gaps = 73/358 (20%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ ++IGTPP A+ + +WTQC PC +C+ QD PLF+
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFN-------------- 71
Query: 151 SQCASLNQKSCSGVNCQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
+Y V +GD S G T+T +G+ T + FGC
Sbjct: 72 ----------------RYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCAM 106
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNG-IV 263
++ +G+VGLG SL+ QM T FSYCL P + + + G + +
Sbjct: 107 DSNIKQLLGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKLA 163
Query: 264 SGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD----IVIDSGTTLTFLPQGY 316
G +TPL + Y++ ++ I G+ + + P +++D+ ++FL
Sbjct: 164 GGKSAATTPLVNTSDDSSDYMIHLEGIKFGD--VIIEPPPNGSVVLVDTIFGVSFLVDAA 221
Query: 317 NSNLLSVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRGAD-VKLSR 368
+ ++ + A P+A PT +LC+ NS +P+V + F+GA + +
Sbjct: 222 FHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPP 281
Query: 369 SNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
S + VC S +T + I G + Q N +D++++T+SF+P DC+
Sbjct: 282 SKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 339
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 165/363 (45%), Gaps = 64/363 (17%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ + IGTP V DT SDL+WTQC+PC C Q ++DP + TY +L SS
Sbjct: 90 VFLGIGTPAMNVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSSS-- 145
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
Y+ +Y SF++G ATET LG+ T + ITFGCGT N G
Sbjct: 146 ---------------YNYTYSKQSFTSGYFATETFALGNVT-----VANITFGCGTRNQG 185
Query: 213 LFN--SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------------SSTKINFG 258
++ + G+ G G +SL++Q+ +FSYC S
Sbjct: 186 YYDNVAGVFGVGRGGRGGVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELATNA 242
Query: 259 TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGTTLT 310
T + +V+ P+ K+ F L ++VG + V+ +VIDS + +T
Sbjct: 243 TTTPAASTPMVADPVLKSGYFVKLV--GVTVGATLVDVAGASSAEGGGRALVIDSTSPVT 300
Query: 311 FLPQG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP-----EVTIHFRG 361
L + L++ ++ + EA A L+LC+ + P +T+HF G
Sbjct: 301 VLDEATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDG 360
Query: 362 --ADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
AD+ L +++ K S ++C ++ +N VP+ G+ + LV YD+ + VSF+P
Sbjct: 361 GAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQP 420
Query: 418 TDC 420
DC
Sbjct: 421 LDC 423
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 154/360 (42%), Gaps = 44/360 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R +GTP L D +D W C C + C SP F P SSTY+++PC
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGC-AASSPSFSPTQSSTYRTVPCG 157
Query: 150 SSQCASLNQKSCS---GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
S QCA + SC G +C ++++Y +F L +++ L + + TFGC
Sbjct: 158 SPQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNV-----VVSYTFGC 211
Query: 207 GTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN---GI 262
G NS G++G G G +S +SQ + T FSYCL S+ NF G
Sbjct: 212 LRVVSG--NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSS--NFSGTLKLGP 267
Query: 263 VSGPG-VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTT 308
+ P + +TPL + Y + + I VG++ + V + +ID+GT
Sbjct: 268 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 327
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-DVKLS 367
T L + + + PVA P G + CY N VP VT F GA V L
Sbjct: 328 FTRLAAPVYAAVRDAFRGRVRT-PVAPPLGGFDTCY--NVTVSVPTVTFMFAGAVAVTLP 384
Query: 368 RSNFFVKVSE-DIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N + S + C G+ ++ + ++ Q N V +D+ V F CT
Sbjct: 385 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 444
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 107/401 (26%), Positives = 182/401 (45%), Gaps = 59/401 (14%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
R+ D R L++ S + ++ D + +N Y R+ IGTPP E + DTGS
Sbjct: 49 RVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGS 101
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
+ + C C QC P F P++SS+YK+L C+ C ++ G C Y Y
Sbjct: 102 TVTYVPCSTC--KQCGKHQDPKFQPELSSSYKALKCNPD-CNCDDE----GKLCVYERRY 154
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISL 231
+ S S+G L+ + ++ G+ + + FGC G LF+ + GI+GLG G +S+
Sbjct: 155 AEMSSSSGVLSEDLISFGNES--QLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSV 212
Query: 232 ISQM--RTTIAGKFSYCLVPVSSTKINFGTN--GIVSGP-GVV---STPLTKAKTFYVLT 283
+ Q+ + I FS C ++ G G +S P G+V S P +Y +
Sbjct: 213 VDQLVDKGVIEDVFSLC---YGGMEVGGGAMVLGKISPPAGMVFSHSDPFRSP--YYNID 267
Query: 284 IDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--- 333
+ + V + L ++ P + V+DSGTT + P+ +++ ++I+ P
Sbjct: 268 LKQMHVAGKSLKLN-PKVFNGKHGTVLDSGTTYAYFPK---EAFIAIKDAIIKEIPSLKR 323
Query: 334 ---ADPTGSLELCYS--FNSLSQV----PEVTIHF-RGADVKLSRSNFF---VKVSEDIV 380
DP ++C+S ++++ PE+ + F G + LS N+ KV
Sbjct: 324 IHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVRGAYC 382
Query: 381 CSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+F +S + G I+ N LV YD E + F T+C+
Sbjct: 383 LGIFPD-RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 422
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 170/383 (44%), Gaps = 62/383 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C + F+ S +Y+ +
Sbjct: 27 HNISLTVSLTVGTPPQNVSMVIDTGSELSWLYCN---KTTTTTSYPTTFNQTRSISYRPI 83
Query: 147 PCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSSS C + + SC S C ++SY D S S GNLA++T +G A +P
Sbjct: 84 PCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMG-----ASDIP 138
Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-STKIN 256
G+ FGC ++ +SK TG++G+ G +S +SQM KFSYC+ S +
Sbjct: 139 GMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCISGTDFSGMLL 195
Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
G + + TPL + T Y + ++ I V ++ L V PD
Sbjct: 196 LGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGA 255
Query: 301 --IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCY----SF 346
++DSGT TFL S L+ + + + DP G+++LCY S
Sbjct: 256 GQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRV--LEDPDFVFQGAMDLCYRVPISQ 313
Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNI 397
L ++P V++ F GA++ ++ +V ++ + C F + + G+
Sbjct: 314 RVLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHH 373
Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
Q N + +D+E+ + C
Sbjct: 374 HQQNVWMEFDLERSRIGLAQVRC 396
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 161/365 (44%), Gaps = 35/365 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G P E DTGSD++W C P CP S F+P SST +
Sbjct: 5 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64
Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
CS +C + Q C N C Y+ +YGDGS ++G ++T+ + G A
Sbjct: 65 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 124
Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
+ I FGC + G + GI G G +S+ISQ+ + ++ K FS+CL S
Sbjct: 125 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-KGS 183
Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
G + PG+V TPL ++ Y L +++I+V Q+L + +T ++
Sbjct: 184 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIV 243
Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGA 362
DSGTTL +L G +S +++ + + GS S + S P VT++F G
Sbjct: 244 DSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG 303
Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
+ ++ + SV I + I G+++ + + YD+ + +
Sbjct: 304 VAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWA 363
Query: 417 PTDCT 421
DC+
Sbjct: 364 DYDCS 368
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 158/363 (43%), Gaps = 35/363 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP DTGSD++W QC+ CP + L++ S + K +
Sbjct: 80 YYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVS 139
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C ++ SG ++C Y YGDGS + G + V S G A
Sbjct: 140 CDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANG 199
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
+ FGCG G +S GI+G G + S+ISQ+ ++ + F++CL +
Sbjct: 200 SVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGG 259
Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
I F +V P V TPL + Y + + A+ VG + L + +IDSG
Sbjct: 260 I-FAIGRVVQ-PKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSG 317
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 364
TTL +LP+ L+ ++S A V + C+ ++ P VT HF +
Sbjct: 318 TTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNVTFHFENSVF 376
Query: 365 KLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ ++ E + C ++ ++ + G+++ +N LV YD+E Q + +
Sbjct: 377 LRVYPHDYLFPHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEY 436
Query: 419 DCT 421
+C+
Sbjct: 437 NCS 439
>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 336
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 154/356 (43%), Gaps = 51/356 (14%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
N Y +SIG PP +L + DT SD++W C LFDP SST+ L
Sbjct: 6 NKPYWSILSIGQPPIPQLVIMDTSSDILWIMCN---------HVGLLFDPSKSSTFSPLC 56
Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC C C + +++SY D S ++G ++TV +T + +
Sbjct: 57 KTPCGFKGC------KCDPI--PFNISYVDKSSTSGTFGSDTVVFETTDEGHSQIFDVLV 108
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
CG N G + GI GL G SL T I KFSYC+ ++ N+ +
Sbjct: 109 RCGHNIGFNTDPGYNGIRGLNNGPNSL----ATKIGQKFSYCVGNLADPYYNYNQLILCE 164
Query: 265 GPGV--VSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFL 312
G + STP FY +T+ I VG +RL ++ T ++ DSGTT+T+L
Sbjct: 165 GADLEGYSTPFEVHHGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYL 224
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS---FNSLSQVPEVTIHF-RGADVKLSR 368
+ L + + +++ +LC+ L P VT HF GAD+ L
Sbjct: 225 VDSVHKLLYNEVRNLLSWS-------FRQLCHYGIISRDLVGFPVVTFHFADGADLALDT 277
Query: 369 SNFFVKVSEDIVCSV----FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+FF +++ + +V T S + + Q ++ VGYD+ V F+ DC
Sbjct: 278 GSFFNQLNSILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDC 333
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 172/364 (47%), Gaps = 35/364 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W T C CP + FDP +SS+ +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 148 CSSSQCAS--LNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG--- 201
CS +C S + CS N C YS YGDGS ++G ++ ++ + +A+
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAP 203
Query: 202 ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKIN 256
FGC G GI GLG G +S+ISQ+ +A + FS+CL S
Sbjct: 204 FVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGG-G 262
Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL-------GVSTPD-IVIDSGTT 308
G + P V TPL ++ Y + + +I+V Q L ++T D +ID+GTT
Sbjct: 263 IMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTT 322
Query: 309 LTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADV 364
L +LP S + +++ + +P+ T C+ + + PEV++ F G
Sbjct: 323 LAYLPDEAYSPFIQAIANAVSQYGRPI---TYESYQCFEITAGDVDVFPEVSLSFAGGAS 379
Query: 365 KLSRSNFFVKV----SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
+ R + ++++ I C F+ +++ + I G+++ + +V YD+ +Q + + D
Sbjct: 380 MVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYD 439
Query: 420 CTKQ 423
C+ +
Sbjct: 440 CSLE 443
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/390 (22%), Positives = 154/390 (39%), Gaps = 56/390 (14%)
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY---------------- 128
I + YL+ + GTP V DT +DL W C +
Sbjct: 121 IAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAA 180
Query: 129 --MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFSNGNL 182
+ + P SS+++ + CS +CA L +C +C Y DG+ + G
Sbjct: 181 KEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIY 240
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
E T+ + G+ LPG+ GC G G++ LG G++S +
Sbjct: 241 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQR 300
Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
FS+CL+ +S++ + FG N V GPG + T + K Y + I VG +RL
Sbjct: 301 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERL 360
Query: 295 GVSTP----------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
+ +++D+ T++T +P+ Y + + S + + P E C
Sbjct: 361 DIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY-AAVTSALDRHLSHLPRVYELDGFEYC 419
Query: 344 YSFN------SLSQ---VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGITNSV 391
Y + L+ VP +T+ G +L ++S +V + C F+ +
Sbjct: 420 YRWTFAGDGVDLAHNVTVPRLTVEMAGG-ARLEPEAKSVVMPEVVPGVACLAFRKLPRGG 478
Query: 392 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
P I GN++ ++ D + + F+ C
Sbjct: 479 PGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 160/361 (44%), Gaps = 45/361 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDPKMSS 141
N Y++ S+GTPP V D SD +W QC C + C +P F +SS
Sbjct: 93 NTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYAFLSS 150
Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSN--GNLATETVTLGSTTGQAV 197
T + + C++ C L ++CS + C YS YG G+ + G LA + + V
Sbjct: 151 TIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-----V 205
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN- 256
G+ FGC G G++GLG G++S +SQ++ G+FSY L P + +
Sbjct: 206 RADGVIFGCAVATEG----DIGGVIGLGRGELSPVSQLQI---GRFSYYLAPDDAVDVGS 258
Query: 257 ---FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV--STPDIVID--SG 306
F + VSTPL +++ Y + + I V + L + T D+ D G
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGG 318
Query: 307 TTL------TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIH 358
L TFL G + M+S IE + L+LCY+ SL ++VP + +
Sbjct: 319 VVLSITIPVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALV 378
Query: 359 FRGADV-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
F G V +L N F++ + + C ++ + G+++Q + YDI + F
Sbjct: 379 FAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVF 438
Query: 416 K 416
+
Sbjct: 439 E 439
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 124/475 (26%), Positives = 197/475 (41%), Gaps = 65/475 (13%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
A L + + + CFY S + Q G E R+ +S P Y +
Sbjct: 83 ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140
Query: 48 -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
+ +R+ D ++ NR+ ++ ++S A + ++ P+ Y I IG PP
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199
Query: 104 RLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKS 160
DTGSDL W QC+ PC + C PL+ P + K +P C L NQ
Sbjct: 200 YFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNY 254
Query: 161 CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--- 216
C C Y + Y D S S G LA + + + +T G L FGC + G S
Sbjct: 255 CETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPA 313
Query: 217 KTTGIVGLGGGDISLISQMRT--TIAGKFSYCLV-PVSSTKINFGTNGIVSGPGVVSTPL 273
KT GI+GL IS SQ+ + IA F +C+ F + V GV T +
Sbjct: 314 KTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSI 373
Query: 274 TKA-KTFYVLTIDAISVGNQRL-----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVM--S 325
Y + G+Q+L ST ++ DSG++ T+LP NL++ + +
Sbjct: 374 RSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYA 433
Query: 326 SMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKVS-----EDI 379
S Q +D T L LC+ + + + +V F ++ + F+ + ED
Sbjct: 434 SPGFVQDTSDRT--LPLCWKADFPVRYLEDVKQFFEPLNLHFGKKWLFMSKTFTISPEDY 491
Query: 380 VC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ +V G+ N S I G++ LV YD +++ + + +DCTK
Sbjct: 492 LIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQIGWADSDCTK 546
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 153/338 (45%), Gaps = 48/338 (14%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+I + +GTP ++ DTGS W CE C C+ + S+T + C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 56
Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
S C Q S + +C + VSY DGS S G L +T+T +PG TFG
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPGFTFG 112
Query: 206 CGTNNGGLFN-SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF--GTNGI 262
C ++ G G++G+G G +S++ Q T G FSYCL P+ ++ F T G
Sbjct: 113 CNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCL-PLQMSERGFFSKTTGY 170
Query: 263 VSGPGVVSTPLTKAK-----------TFYVLTIDAISVGNQRLGV-----STPDIVIDSG 306
S G ++ T + + + + AISV +RLG+ S +V DSG
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 230
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-R 360
+ L+++P LSV+S I + A S CY S+ + +P +++HF
Sbjct: 231 SELSYIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 286
Query: 361 GADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYG 395
GA L R FV+ S +D+ C F T SV I G
Sbjct: 287 GARFDLGRHGVFVERSVQEQDVWCLAF-APTESVSIIG 323
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 177/369 (47%), Gaps = 45/369 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W C CP + FDP SST +
Sbjct: 77 YYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLIS 136
Query: 148 CSSSQCASLNQ---KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGS------TTGQA 196
C +C S Q SCSG N C Y+ YGDGS ++G ++ + S TT +
Sbjct: 137 CLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSS 196
Query: 197 VALPGITFGCGT-NNGGLFNSKTT--GIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
+ + FGC G L S+ GI G G +S+ISQ+ + IA + FS+CL +
Sbjct: 197 AS---VVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDN 253
Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------V 302
S IV P +V +PL ++ Y L + +ISV Q + ++ P + +
Sbjct: 254 SGGGVLVLGEIVE-PNIVYSPLVPSQPHYNLNLQSISVNGQIVRIA-PSVFATSNNRGTI 311
Query: 303 IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV---PEVTIH 358
+DSGTTL +L + YN ++++ + + Q V CY + S V P+V+++
Sbjct: 312 VDSGTTLAYLAEEAYNPFVIAI--AAVIPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLN 369
Query: 359 FR-GADVKLSRSNFFVK---VSE-DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQT 412
F GA + L ++ ++ + E + C F+ I+ S+ I G+++ + + YD+ Q
Sbjct: 370 FAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYDLAGQR 429
Query: 413 VSFKPTDCT 421
+ + DC+
Sbjct: 430 IGWANYDCS 438
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/390 (22%), Positives = 154/390 (39%), Gaps = 56/390 (14%)
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY---------------- 128
I + YL+ + GTP V DT +DL W C +
Sbjct: 121 IAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAA 180
Query: 129 --MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFSNGNL 182
+ + P SS+++ + CS +CA L +C +C Y DG+ + G
Sbjct: 181 KEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIY 240
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
E T+ + G+ LPG+ GC G G++ LG G++S +
Sbjct: 241 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQR 300
Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
FS+CL+ +S++ + FG N V GPG + T + K Y + I VG +RL
Sbjct: 301 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERL 360
Query: 295 GVSTP----------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
+ +++D+ T++T +P+ Y + + S + + P E C
Sbjct: 361 DIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY-AAVTSALDRHLSHLPRVYELDGFEYC 419
Query: 344 YSFN------SLSQ---VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGITNSV 391
Y + L+ VP +T+ G +L ++S +V + C F+ +
Sbjct: 420 YRWTFAGDGVDLTHNVTVPRLTVEMAGG-ARLEPEAKSVVMPEVVPGVACLAFRKLPRGG 478
Query: 392 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
P I GN++ ++ D + + F+ C
Sbjct: 479 PGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 90/396 (22%), Positives = 154/396 (38%), Gaps = 64/396 (16%)
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDS----------- 132
I + YL+ + IGTP V DT +DL W C + Y + S
Sbjct: 118 IAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEG 177
Query: 133 ----------PLFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFS 178
+ P SS+++ + CS +CA L +C +C Y DG+ +
Sbjct: 178 ATAAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVT 237
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G E T+ + G+ LPG+ GC G G++ LG GD+S
Sbjct: 238 IGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKR 297
Query: 239 IAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVG 290
+FS+CL+ +S++ + FG N V GPG + T + K Y + + VG
Sbjct: 298 FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVG 357
Query: 291 NQRLGVSTPD------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPT 337
+RL + PD +++D+ T++T +P+ Y + + + + + P
Sbjct: 358 GERLDI--PDEVWDAERFVGGGVILDTSTSVTSLVPEAY-APVTAALDRHLSHLPRVYEL 414
Query: 338 GSLELCYSFNSLSQ---------VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFK 385
E CY + +P T+ G +L ++S +V + C F+
Sbjct: 415 EGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGG-ARLEPEAKSVVMPEVEPGVACLAFR 473
Query: 386 GITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ P I GN+ ++ D + F+ C
Sbjct: 474 KLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 509
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 114/411 (27%), Positives = 177/411 (43%), Gaps = 91/411 (22%)
Query: 89 ANYLIRISIGTPPTERLAVA---DTGSDLIWTQCEPCPPSQCYM----------QDSPL- 134
++Y + +S+G PP+ +V+ DTGSDL+W PC P C + SPL
Sbjct: 86 SDYTLSLSVG-PPSTASSVSLFLDTGSDLVWF---PCAPFTCMLCEGKATPGGNHSSPLP 141
Query: 135 ----------FDPKMSSTYKSLP----CSSSQCA--SLNQKSCSGVNCQ-YSVSYGDGSF 177
P S+ + S P C++++C ++ SC+ C +YGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSL 201
Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
NL V L ++ +A+ TF C ++ G+ G G G +SL +Q+
Sbjct: 202 V-ANLRRGRVGLAAS----MAVENFTFACAHTA----LAEPVGVAGFGRGPLSLPAQLAP 252
Query: 238 TIAGKFSYCLVP--------VSSTKINFGTNGIVSGPGV-----VSTPL---TKAKTFYV 281
+++G+FSYCLV + S+ + G + + G V TPL K FY
Sbjct: 253 SLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYS 312
Query: 282 LTIDAISVGNQR------LGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+ ++A+SVG +R LG D +V+DSGTT T LP + + + + A
Sbjct: 313 VALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAA 372
Query: 332 PVADPTGS-----LELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKVSED----IV 380
G+ L CY ++ S VP V +HFRG A V L R N+F+ + +
Sbjct: 373 RFTRAEGAEAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432
Query: 381 CSVFKGITNS----------VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
C + + + GN Q F V YD++ V F CT
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483
>gi|125575539|gb|EAZ16823.1| hypothetical protein OsJ_32295 [Oryza sativa Japonica Group]
Length = 383
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 168/360 (46%), Gaps = 44/360 (12%)
Query: 96 SIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP-KMSSTYKSLPCSSSQCA 154
+IGTPP A D G L+WTQC C S C+ Q +P P ++ PC ++ C
Sbjct: 29 TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQGAPAVRPDQVVPPTGPEPCGTALCE 88
Query: 155 --SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
+ ++CSG C Y S ++G + T+ V +G+ T +VA FGC ++
Sbjct: 89 FFPASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAIGTATAASVA-----FGCVMASDI 143
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP--------------VSSTKINF 257
L + +G VGL +SL++QM T FS+CL P ++
Sbjct: 144 KLMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCLAPHDGGGGKNSRLFLGAAAKLAGG 200
Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----IVIDSGTTLTFLP 313
G + ++ P V S+P +Y++ ++ I G++ + ++ P +++ + + ++FL
Sbjct: 201 GKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAI-ITVPQSGRTVLLQTFSPVSFLV 259
Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVTIHFRG-ADVKLSR 368
G +L +++ + P A P + LC+ +S P+V + F+G A + +
Sbjct: 260 DGVYQDLKKAVTAAV-GGPTATPPEQFQSIFDLCFKRGGVSGAPDVVLTFQGAAALTVPP 318
Query: 369 SNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+N+ + V +D VC + I G + Q N YD+E++T+SF+ DC+
Sbjct: 319 TNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEKETLSFEAADCS 378
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 158/378 (41%), Gaps = 65/378 (17%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ + IGTP + V DTGS L W QC P + + FDP +SS++ LPCS
Sbjct: 82 ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141
Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C +L S C YS Y DG+F+ GNL E T ++ P + G
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILG 197
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
C ++ GI+G+ G +S ISQ + + KFSYC +P S + + G +
Sbjct: 198 CAKE-----STDVKGILGMNLGRLSFISQAKIS---KFSYC-IPTRSNRPGLASTGSFYL 248
Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVST----PD----- 300
P TF Y + + I +G +RL + + PD
Sbjct: 249 GENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSG 308
Query: 301 -IVIDSGTTLTFLPQ-GYN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSL----SQ 351
++DSG+ T L Y+ ++ ++ S ++ V T ++C+ N
Sbjct: 309 QTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA--DMCFDGNHQMVIGRL 366
Query: 352 VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLV 404
+ ++ F RG ++ + + V V I C S+ +N I GN+ Q N V
Sbjct: 367 IGDLVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNLWV 423
Query: 405 GYDIEQQTVSFKPTDCTK 422
+D+ + V F +C++
Sbjct: 424 EFDVANRRVGFSKAECSR 441
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 159/374 (42%), Gaps = 44/374 (11%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
++ PG+ FGCG + S T G++GLG G +SL+SQ++ K +CL
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + IV P+ + ++ +Y + G + LGV ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
++ T+ L+ + + P SL LC+ F S+ V + V +
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLS 341
Query: 359 F---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
F + A +++ N+ + C GI N + I G+I + +V YD
Sbjct: 342 FSNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398
Query: 409 EQQTVSFKPTDCTK 422
E+ + + C +
Sbjct: 399 ERGQIGWIRAPCDR 412
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 166/374 (44%), Gaps = 45/374 (12%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 58 GDVYPHGL-YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTKN 114
Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CASL+ + C C Y + Y D S G L ++ L
Sbjct: 115 ---KLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLAN 171
Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCL 247
G +V P + FGCG + +G + S T G++GLG G +SL+SQ + K +CL
Sbjct: 172 G-SVVRPSLAFGCGYDQQVSSGEM--SPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCL 228
Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDS 305
+ FG + +V V TP+ ++ + +Y ++ G+Q L V ++V DS
Sbjct: 229 SLRGGGFLFFGDD-LVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFDS 287
Query: 306 GTTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYSFNS-LSQVPEVTIHFRGA 362
G++ T F Q Y + + ++ + + V+DP SL LC+ V +V F+
Sbjct: 288 GSSFTYFAAQPYQALVTALKGDLSRTLKEVSDP--SLPLCWKGKKPFKSVLDVKKEFKSL 345
Query: 363 DVKLSRSN-FFVKVSEDIVCSVFK------GITN-------SVPIYGNIMQTNFLVGYDI 408
+ N F+++ V K GI N + I G+I + +V YD
Sbjct: 346 VLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDITMQDQMVIYDN 405
Query: 409 EQQTVSFKPTDCTK 422
E+ + + C +
Sbjct: 406 EKGQIGWIRAPCDR 419
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 114/412 (27%), Positives = 177/412 (42%), Gaps = 91/412 (22%)
Query: 89 ANYLIRISIGTPPTERLAVA---DTGSDLIWTQCEPCPPSQCYM----------QDSPL- 134
++Y + +S+G PP+ +V+ DTGSDL+W PC P C + SPL
Sbjct: 86 SDYTLSLSVG-PPSTASSVSLFLDTGSDLVWF---PCAPFTCMLCEGKATPGGNHSSPLP 141
Query: 135 ----------FDPKMSSTYKSLP----CSSSQCA--SLNQKSCSGVNCQ-YSVSYGDGSF 177
P S+ + S P C++++C ++ SC+ C +YGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSL 201
Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
NL V L ++ +A+ TF C ++ G+ G G G +SL +Q+
Sbjct: 202 V-ANLRRGRVGLAAS----MAVENFTFACAHTA----LAEPVGVAGFGRGPLSLPAQLAP 252
Query: 238 TIAGKFSYCLVP--------VSSTKINFGTNGIVSGPGV-----VSTPL---TKAKTFYV 281
+++G+FSYCLV + S+ + G + + G V TPL K FY
Sbjct: 253 SLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYS 312
Query: 282 LTIDAISVGNQR------LGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
+ ++A+SVG +R LG D +V+DSGTT T LP + + + + A
Sbjct: 313 VALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAA 372
Query: 332 PVADPTGS-----LELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKVSED----IV 380
G+ L CY ++ S VP V +HFRG A V L R N+F+ + +
Sbjct: 373 RFTRAEGAEAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432
Query: 381 CSVFKGITNS----------VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
C + + + GN Q F V YD++ V F CT
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 161/365 (44%), Gaps = 45/365 (12%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDP 137
D N Y++ S+GTPP V D SD +W QC C + C +P F
Sbjct: 89 DPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYA 146
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSN--GNLATETVTLGSTT 193
+SST + + C++ C L ++CS + C YS YG G+ + G LA + +
Sbjct: 147 FLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-- 204
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
V G+ FGC G G++GLG G++SL+SQ++ G+FSY L P +
Sbjct: 205 ---VRADGVIFGCAVATEG----DIGGVIGLGRGELSLVSQLQI---GRFSYYLAPDDAV 254
Query: 254 KIN----FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV--STPDIVID 304
+ F + VSTPL +++ Y + + I V + L + T D+ D
Sbjct: 255 DVGSFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQAD 314
Query: 305 --SGTTL------TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPE 354
G L TFL G + M+S I + L+LCY+ SL ++VP
Sbjct: 315 GSGGVVLSITIPVTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAKVPS 374
Query: 355 VTIHFRGADV-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
+ + F G V +L N F++ + + C ++ + G+++Q + YDI
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGS 434
Query: 412 TVSFK 416
+ F+
Sbjct: 435 RLVFE 439
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 157/353 (44%), Gaps = 43/353 (12%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
IGTPP E + DTGS + + C C QC P F P +S TY + C+ C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59
Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
N + C Y Y + S S+G L + V+ G+ + + FGC G LF
Sbjct: 60 ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111
Query: 215 NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTN--GIVSGPG--V 268
+ GI+GLG GD+S++ Q+ + I FS C ++ G G +S P V
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSDMV 168
Query: 269 VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLL 321
S +Y + + + V ++L ++ P + ++DSGTT +LP+ +
Sbjct: 169 FSHSDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFI 227
Query: 322 SVMSSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNF 371
++S + + + P + ++C+S + S++PE+ F D+ LS N+
Sbjct: 228 QAITSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENY 286
Query: 372 FVKVSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
K S+ VF+ + + G I+ N LV YD E V F T+C+
Sbjct: 287 LFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 159/374 (42%), Gaps = 44/374 (11%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
++ PG+ FGCG + S T G++GLG G +SL+SQ++ K +CL
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + IV P+ + ++ +Y + G + LGV ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
++ T+ L+ + + P SL LC+ F S+ V + V +
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLS 341
Query: 359 F---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
F + A +++ N+ + C GI N + I G+I + +V YD
Sbjct: 342 FSNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398
Query: 409 EQQTVSFKPTDCTK 422
E+ + + C +
Sbjct: 399 ERGQIGWIRAPCDR 412
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 164/362 (45%), Gaps = 41/362 (11%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C C P F P +S TY+ +
Sbjct: 86 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTC--EHCGRHQDPKFQPDLSETYQPVK 143
Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ C +C G C Y Y + S S+G L + V+ G+ + +A FG
Sbjct: 144 CTPD-C------NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLS--ELAPQRAVFG 194
Query: 206 CGTNN-GGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCL--VPVSSTKINFGTN 260
C + G L++ + GI+GLG GD+S++ Q+ + I+ FS C + V + G
Sbjct: 195 CENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILG-- 252
Query: 261 GIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFL 312
GI +V T ++ +Y + + + V ++L ++ P + V+DSGTT +L
Sbjct: 253 GISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLN-PKVFDGKHGTVLDSGTTYAYL 311
Query: 313 PQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSL--SQV----PEVTIHFR-GAD 363
P+ + + ++M + + P + ++C++ + SQ+ P V + F G
Sbjct: 312 PETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHK 371
Query: 364 VKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ LS N+ KV VF + + G I N LV YD E + F T+C
Sbjct: 372 LSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431
Query: 421 TK 422
++
Sbjct: 432 SE 433
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 111/398 (27%), Positives = 172/398 (43%), Gaps = 61/398 (15%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G ++S G + T+T+ +G + + FGC + ++ GI G G S
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259
Query: 234 QMRTTIAG--------KFSYCLVPVSSTKINFGTNGIVSGP----GVVSTPLTKAKTFYV 281
Q+ AG FSYCL P TK + G G S + + Y
Sbjct: 260 QL----AGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYS 314
Query: 282 LTIDAISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVM 324
LT++ + QRL S+ ++++DSG +T L + GY+ +
Sbjct: 315 LTMEMLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ 374
Query: 325 SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSV 383
S I D +G F++ S +P + I F GA + L N F +C
Sbjct: 375 ESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMT 434
Query: 384 F-KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
F + I GN + +F +DI+ + FK C
Sbjct: 435 FAQNPALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
Length = 507
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 140/330 (42%), Gaps = 34/330 (10%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN---QKSCSG 163
V DT SD+ W QC P S S +DP SSTY +L C+S+ C L + +C
Sbjct: 127 VLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGACVN 186
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG-------GLFNS 216
CQY V S+ + T L T ++F G ++G G ++
Sbjct: 187 NQCQYRVPIPSSPASSSSSGTYGSDLLKLTADPADGASMSFKFGCSHGEAKQGGEGSIDN 246
Query: 217 KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVST 271
T GI+ LGGG SL+SQ FSYC+ S + + G + G T
Sbjct: 247 ATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGDLSGAGGYAVT 306
Query: 272 PL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSV 323
P+ + T Y + + AI+V Q+L V TP + V+DS T +T LP L
Sbjct: 307 PMLRYARVPTLYRVRLLAIAVDGQQLNV-TPSVFASGSVLDSRTAITRLPPTAYQALREA 365
Query: 324 MSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIV 380
S + A P G+L+ CY F L VP V + G A V L R
Sbjct: 366 FRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVALLLDGNAVVALDRQGILFH-----D 420
Query: 381 CSVFKGITNS-VP-IYGNIMQTNFLVGYDI 408
C VF T+ +P I GN+ Q V Y++
Sbjct: 421 CLVFTSNTDDRMPGILGNVQQQTMEVLYNV 450
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 159/374 (42%), Gaps = 44/374 (11%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
++ PG+ FGCG + S T G++GLG G +SL+SQ++ K +CL
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + IV P+ + ++ +Y + G + LGV ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
++ T+ L+ + + P SL LC+ F S+ V + V +
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTVVLS 341
Query: 359 F---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
F + A +++ N+ + C GI N + I G+I + +V YD
Sbjct: 342 FSNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398
Query: 409 EQQTVSFKPTDCTK 422
E+ + + C +
Sbjct: 399 ERGQIGWIRAPCDR 412
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 132/340 (38%), Gaps = 66/340 (19%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L +
Sbjct: 167 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGR---------- 216
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
YG + + + G F++ T+G + LGGG
Sbjct: 217 ---YGRWLLQQPVPVLRRLRRRQGQPRGRTCHAVR--------GNFSASTSGTMSLGGGR 265
Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKI-------------NFGTNGIVSGPGVVSTPLTK 275
SL+SQ T FSYC+ SS+ F +V P ++
Sbjct: 266 QSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII------ 319
Query: 276 AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEA 330
T Y++ + I VG +RL V V+DS +T L P Y + L+ S+M
Sbjct: 320 -PTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY 378
Query: 331 QPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT 388
VA L+ CY F + VP V++ F G V V D + + +G
Sbjct: 379 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCL 428
Query: 389 NSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
VP GN+ Q V YD+ +V F+ C
Sbjct: 429 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 160/361 (44%), Gaps = 50/361 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP + DTGS W C+ CP ++ +DP+ S + K +
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 118
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
C + C S + C+ + C Y Y DG + G L T+ + G P +T
Sbjct: 119 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 176
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
FGCG G N+ GI+G G + + +SQ+ AGK FS+CL + I
Sbjct: 177 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 233
Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F +V P V +TP+ K + ++++ + +I+V L + T IDSG+
Sbjct: 234 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-------SQVPEVTIHFR 360
TL +LP+ + S +I A P ++ Y+F + P++T HF
Sbjct: 293 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 345
Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L ++ ++ + C F+ GI + I G+++ +N +V YD+E+Q +
Sbjct: 346 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 404
Query: 415 F 415
+
Sbjct: 405 W 405
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 160/375 (42%), Gaps = 62/375 (16%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ + IGTPP + V DTGS L W QC P++ S FDP +SST+ +LPC+
Sbjct: 98 IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTAS--FDPSLSSTFSTLPCTHP 155
Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C +L C YS Y DG+++ GNL E T +++ P + G
Sbjct: 156 VCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSLFTPPLILG 211
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
C T ++ GI+G+ G +S SQ + T KFSYC VP T+ + G +
Sbjct: 212 CATE-----STDPRGILGMNRGRLSFASQSKIT---KFSYC-VPTRVTRPGYTPTGSFYL 262
Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----------TP 299
P + + TF Y + + I +G ++L +S +
Sbjct: 263 GHNPNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSG 322
Query: 300 DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLS---QVPE 354
++DSG+ T+L + Y+ V+ ++ G + ++C+ N++ + +
Sbjct: 323 QTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGD 382
Query: 355 VTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------IYGNIMQTNFLVGYD 407
+ F +G + + + V + C GI NS I GN Q N V +D
Sbjct: 383 MVFEFEKGVQIVVPKERVLATVEGGVHCI---GIANSDKLGAASNIIGNFHQQNLWVEFD 439
Query: 408 IEQQTVSFKPTDCTK 422
+ + + F DC++
Sbjct: 440 LVNRRMGFGTADCSR 454
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 157/353 (44%), Gaps = 43/353 (12%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
IGTPP E + DTGS + + C C QC P F P +S TY + C+ C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59
Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
N + C Y Y + S S+G L + V+ G+ + + FGC G LF
Sbjct: 60 ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111
Query: 215 NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTN--GIVSGPG--V 268
+ GI+GLG GD+S++ Q+ + I FS C ++ G G +S P V
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSDMV 168
Query: 269 VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLL 321
S +Y + + + V ++L ++ P + ++DSGTT +LP+ +
Sbjct: 169 FSHSDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFI 227
Query: 322 SVMSSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNF 371
++S + + + P + ++C+S + S++PE+ F D+ LS N+
Sbjct: 228 QAITSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENY 286
Query: 372 FVKVSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
K S+ VF+ + + G I+ N LV YD E V F T+C+
Sbjct: 287 LFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 168/365 (46%), Gaps = 40/365 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PP E DTGSD++W C CP + FD SST +
Sbjct: 66 YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVR 125
Query: 148 CSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV----- 197
CS C S Q + CS C Y+ YGDGS ++G ++T+ + GQ++
Sbjct: 126 CSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSS 185
Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
AL I FGC G + GI G G G++S+ISQ+ R FS+CL S
Sbjct: 186 AL--IVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGS 243
Query: 253 TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVID 304
G + PG+V +PL ++ Y L + +I+V Q L + ++ ++D
Sbjct: 244 GG-GILVLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQGTIVD 302
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP-TGSLELCYSFN-SLSQV-PEVTIHFR- 360
SGTTL +L +S +++++ P P T CY + S+SQ+ P + +F
Sbjct: 303 SGTTLAYLVAEAYDPFVSAVNAIVS--PSVTPITSKGNQCYLVSTSVSQMFPLASFNFAG 360
Query: 361 GADVKLSRSNFFVKVSED----IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
GA + L ++ + + C F+ + V I G+++ + + YD+ +Q + +
Sbjct: 361 GASMVLKPEDYLIPFGSSGGSAMWCIGFQKV-QGVTILGDLVLKDKIFVYDLVRQRIGWA 419
Query: 417 PTDCT 421
DC+
Sbjct: 420 NYDCS 424
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 164/365 (44%), Gaps = 45/365 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + I+IG D+GSDL W QC+ P + C L+ P ++ L C
Sbjct: 55 YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C SL+ C + CQY + Y D S G L + V L T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168
Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTN 260
CG ++ + T G++GLG G++S ISQ+ + + +CL F +
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL--SDEGGFLFFGD 226
Query: 261 GIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL-PQGYN 317
V GV T ++ ++Y + G + G+ +V DSG++ T+ Q YN
Sbjct: 227 EFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAYN 286
Query: 318 SNLLSVMSSMIEAQPVAD--PTGSLELCYS----FNSLSQVPE----VTIHF---RGADV 364
S +L+++ + + +P+ D SL +C+ F SL V + + + F + A +
Sbjct: 287 S-ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALRFTKTKNAQI 345
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+L N+ + VC GI N + I G+I + +V YD E++ + + P
Sbjct: 346 QLPPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFP 402
Query: 418 TDCTK 422
T+C K
Sbjct: 403 TNCNK 407
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 169/372 (45%), Gaps = 52/372 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+P E DTGSD++W C CP S + FD SST +
Sbjct: 83 YFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVS 142
Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG 201
C C+ Q + S + C Y+ YGDGS + G ++T+ + GQ+V
Sbjct: 143 CGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANS 202
Query: 202 ---ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSST 253
I FGC T G + GI G G G +S+ISQ+ R FS+CL
Sbjct: 203 SSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL------ 256
Query: 254 KINFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST-------- 298
G NG +V G P +V +PL ++ Y L + +I+V Q L + +
Sbjct: 257 --KGGENGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNN 314
Query: 299 PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF-NSLSQV-PE 354
++DSGTTL +L Q YN + ++ +++ + ++P+ CY NS+ + P+
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ---CYLVSNSVGDIFPQ 371
Query: 355 VTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
V+++F GA + L+ ++ + + C F+ + I G+++ + + YD+
Sbjct: 372 VSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLA 431
Query: 410 QQTVSFKPTDCT 421
Q + + DC+
Sbjct: 432 NQRIGWADYDCS 443
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 166/384 (43%), Gaps = 42/384 (10%)
Query: 60 RSLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
RS+N L ++ S + + +L+ + G P + DTGSD W
Sbjct: 96 RSINARILGQYSTEESKDGGSPESMHSLNEDGFFLVNVGFGKPQQNLNLIIDTGSDTTWI 155
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
+C C C+ + P F+P +SS+Y + C S Y+++Y D S+
Sbjct: 156 RCNSCSLGNCHNKKIPTFNPSLSSSYSNRSCIPS------------TKTNYTMNYEDNSY 203
Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD-ISLISQMR 236
S G + VTL + P FG ++GG +G++GL G+ SLISQ
Sbjct: 204 SKGVFVCDEVTL-----KPDVFPKFQFG-CGDSGGGDFGSASGVLGLAQGEQYSLISQTA 257
Query: 237 TTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQ 292
+ KFSYC +T+ + FG I + P + T L + + Y + + ISV +
Sbjct: 258 SKFKKKFSYCFPHNENTRGSLLFGEKAISASPSLKFTRLLNPSSGSVYFVELIGISVAKK 317
Query: 293 RLGVS-----TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADP--TGSLELCY 344
RL VS +P +IDSGT +T LP Y + + M+ V+ P L+ CY
Sbjct: 318 RLNVSSSLFASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCY 377
Query: 345 SFNSLS----QVPEVTIHFRG-ADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGN 396
+ ++PE+ +HF G DV L S +++ + K + V I GN
Sbjct: 378 NLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGN 437
Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
Q + V YDIE + F DC
Sbjct: 438 RQQVSLKVVYDIEGGRLGFG-NDC 460
>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
Length = 278
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 95/191 (49%), Gaps = 37/191 (19%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
F V L H DS + T ++RL+ A+ R RL + ++ S + +A + N
Sbjct: 35 FRVSLRHVDS------GGNYTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGN 87
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+L++++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDPK SS++ LPC
Sbjct: 88 GEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPC 145
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
SS S Q G LATET G ++ I FGCG
Sbjct: 146 SSDLYYSSTQ---------------------GVLATETFAFGD-----ASVSKIGFGCGE 179
Query: 209 NNGGLFNSKTT 219
+N G NS TT
Sbjct: 180 DNDG--NSGTT 188
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 108/398 (27%), Positives = 169/398 (42%), Gaps = 76/398 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--CPPSQCYMQDSPLFDPKMSSTYK 144
+N + + +++GTPP V DTGS+L W C PP +P F+ SS+Y
Sbjct: 51 HNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPL------TPAFNASGSSSYG 104
Query: 145 SLPCSSSQCASLNQ--------KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
++PC S+ C + + C+ S+SY D S ++G LAT+T L T G
Sbjct: 105 AVPCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLL--TGGAP 162
Query: 197 VALPGITFGC--------GTNNGGL---FNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
G FGC TN+ G + TG++G+ G +S ++Q T +F+Y
Sbjct: 163 PVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTR---RFAY 219
Query: 246 CLVPVSSTKI-NFGTNGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL-- 294
C+ P + G +G V+ P + TPL + + Y + ++ I VG L
Sbjct: 220 CIAPGEGPGVLLLGDDGGVA-PPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPI 278
Query: 295 --GVSTPD------IVIDSGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADP----TGSL 340
V TPD ++DSGT TFL + L + +S P+ +P G+
Sbjct: 279 PKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAF 338
Query: 341 ELCYS------FNSLSQVPEVTIHFRGADVKLSRSNFFVKV---------SEDIVCSVFK 385
+ C+ + +PEV + RGA+V +S V +E + C F
Sbjct: 339 DACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFG 398
Query: 386 G---ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S + G+ Q N V YD++ V F P C
Sbjct: 399 NSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 106/416 (25%), Positives = 170/416 (40%), Gaps = 77/416 (18%)
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIR------------ISIGTPPTERLAVADTG 111
RL +SS +S S+ + P ++ Y R + IGTP + V DTG
Sbjct: 41 RLTPTTNSSSFKTSLLSRRNPSPPSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTG 100
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGVN 165
S L W QC P + + FDP +SS++ LPCS C +L S
Sbjct: 101 SQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRL 160
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C YS Y DG+F+ GNL E T ++ P + GC ++ GI+G+
Sbjct: 161 CHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILGCAKE-----STDEKGILGMN 211
Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---IVSGPGVVSTPLTKAKTF--- 279
G +S ISQ + + KFSYC +P S + + G + P TF
Sbjct: 212 LGRLSFISQAKIS---KFSYC-IPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQS 267
Query: 280 ----------YVLTIDAISVGNQRLGVS----TPD------IVIDSGTTLTFLPQ-GYN- 317
Y + + I +G +RL + PD ++DSG+ T L Y+
Sbjct: 268 QRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDK 327
Query: 318 --SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHF-RGADVKLSRSN 370
++ ++ S ++ V T ++C+ N + + ++ F RG ++ + + +
Sbjct: 328 VKEEIVRLVGSRLKKGYVYGSTA--DMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQS 385
Query: 371 FFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
V V I C S+ +N I GN+ Q N V +D+ + V F +C
Sbjct: 386 LLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNLWVEFDVTNRRVGFSKAEC 438
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 119/441 (26%), Positives = 186/441 (42%), Gaps = 85/441 (19%)
Query: 45 NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTE 103
+SS P+ L+ A++ S+ R +H + +K+ + + P Y I + GTP
Sbjct: 42 SSSSHPFHTLKLAVSTSITRAHHLKNHKP---NKSLETPVHPKTYGGYSIDLEFGTPSQT 98
Query: 104 RLAVADTGSDLIWTQCEP---CPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASL--- 156
V DTGS L+W C C S+C ++P F PK SS+ K + C++ +CA +
Sbjct: 99 FPFVLDTGSTLVWLPCSSHYLC--SKCNSFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGP 156
Query: 157 -------NQKSCSGVNCQ-----YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
Q + NC Y+V YG GS + G L +E + + L
Sbjct: 157 DVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFPTKKYSDFLL----- 210
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---------------VP 249
GC + + GI G G G+ SL SQM T +FSYCL V
Sbjct: 211 GCSV----VSVYQPAGIAGFGRGEESLPSQMNLT---RFSYCLLSHQFDDSATITSNLVL 263
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAK----TFYVLTIDAISVGNQRLGVS----TPDI 301
+++ + TNG+ P + P TK +Y +T+ I VG +R+ V P++
Sbjct: 264 ETASSRDGKTNGVSYTP-FLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNV 322
Query: 302 ------VIDSGTTLTFLPQ---GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
++DSG+T TF+ + + + S A+ G L C+ ++
Sbjct: 323 DGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFG-LSPCFVLAGGAET 381
Query: 353 ---PEVTIHFRG-ADVKLSRSNFFVKVSE-DIVCSVF--------KGITNSVPIYGNIMQ 399
PE+ FRG A ++L +N+F V + D+ C G I GN Q
Sbjct: 382 ASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVILGNYQQ 441
Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
NF V YD+E + F+ C
Sbjct: 442 QNFYVEYDLENERFGFRSQSC 462
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 175/394 (44%), Gaps = 53/394 (13%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G ++S G + T+T+ +G + + FGC + ++ GI G G S
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259
Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
Q+ ++ K SYCL P TK + G + TPL ++ + Y LT++
Sbjct: 260 QLAGYPDILSYKALSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 318
Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
+ QRL S+ ++++DSG +T L + GY+ + S I
Sbjct: 319 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378
Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
D +G F++ S +P + I F GA + L N F +C F +
Sbjct: 379 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 438
Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN + +F +DI+ + FK C
Sbjct: 439 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 472
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 175/394 (44%), Gaps = 53/394 (13%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 91 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
G ++S G + T+T+ +G + + FGC + ++ GI G G S
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 261
Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
Q+ ++ K SYCL P TK + G + TPL ++ + Y LT++
Sbjct: 262 QLAGYPDILSYKALSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 320
Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
+ QRL S+ ++++DSG +T L + GY+ + S I
Sbjct: 321 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 380
Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
D +G F++ S +P + I F GA + L N F +C F +
Sbjct: 381 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 440
Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I GN + +F +DI+ + FK C
Sbjct: 441 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 474
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 160/361 (44%), Gaps = 50/361 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP + DTGS W C+ CP ++ +DP+ S + K +
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 142
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
C + C S + C+ + C Y Y DG + G L T+ + G P +T
Sbjct: 143 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 200
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
FGCG G N+ GI+G G + + +SQ+ AGK FS+CL + I
Sbjct: 201 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 257
Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F +V P V +TP+ K + ++++ + +I+V L + T IDSG+
Sbjct: 258 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-------SQVPEVTIHFR 360
TL +LP+ + S +I A P ++ Y+F + P++T HF
Sbjct: 317 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 369
Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L ++ ++ + C F+ GI + I G+++ +N +V YD+E+Q +
Sbjct: 370 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 428
Query: 415 F 415
+
Sbjct: 429 W 429
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 72/378 (19%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
+I + IGTPP + V DTGS L W QC + PP+ FDP +SST+ LPC+
Sbjct: 76 IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS-------FDPSLSSTFSILPCTH 128
Query: 151 SQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C +L C YS Y DG+++ GNL E T ++V+ P +
Sbjct: 129 PLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSVSTPPLIL 184
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG--- 261
GC T ++ GI+G+ G +S Q + T KFSYC VP T+ F G
Sbjct: 185 GCATE-----STDPRGILGMNLGRLSFAKQSKIT---KFSYC-VPPRQTRPGFTPTGSFY 235
Query: 262 IVSGP--------GVVSTPLTKAKTF----YVLTIDAISVGNQRLGVS----------TP 299
+ + P G++++ + F Y + + I + ++L +S +
Sbjct: 236 LGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSG 295
Query: 300 DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLSQVP---- 353
+IDSG+ T+L + Y+ V+ ++ G + ++C F+S+ V
Sbjct: 296 QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMC--FDSVKAVEIGRL 353
Query: 354 --EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------IYGNIMQTNFLV 404
E+ F RG +V + + V + C GI +S I GN Q N V
Sbjct: 354 IGEMVFEFERGVEVVIPKERVLADVGGGVHCV---GIGSSDKLGAASNIIGNFHQQNLWV 410
Query: 405 GYDIEQQTVSFKPTDCTK 422
+D+ ++ V F DC++
Sbjct: 411 EFDLVRRRVGFGKADCSR 428
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 188/451 (41%), Gaps = 97/451 (21%)
Query: 46 SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTER 104
S+ P+ L+ A++ S+ R +H +++ SS K + P Y I + GTPP
Sbjct: 173 SNSHPFHTLQLAVSTSITRAHHLKNHNNPSSLKTL---VHPKTYGGYSIDLKFGTPPQTF 229
Query: 105 LAVADTGSDLIWTQCEP---CPPSQCYM---QDSPLFDPKMSSTYKSLPCSSSQCASL-- 156
V DTGS L+W C C S+C ++P F PK S + K + C + +CA +
Sbjct: 230 PFVLDTGSSLVWLPCYSHYLC--SKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFG 287
Query: 157 ----------------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
N +CS Y+V YG GS + G L +E + A +
Sbjct: 288 SDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGS-TAGFLLSENLNF-----PAKNVS 341
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSST 253
GC + + GI G G G+ SL +QM T +FSYCL+ P +S
Sbjct: 342 DFLVGCSV----VSVYQPGGIAGFGRGEESLPAQMNLT---RFSYCLLSHQFDESPENSD 394
Query: 254 KINFGTN-------GIVSGPGVVSTPLTKAKTF---YVLTIDAISVGNQRLGVS----TP 299
+ TN VS + P TK F Y +T+ I VG +R+ V P
Sbjct: 395 LVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEP 454
Query: 300 DI------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN------ 347
D+ ++DSG+TLTF+ + + +++ Q + T + EL F
Sbjct: 455 DVNGDGGFIVDSGSTLTFMERP----IFDLVAEEFVKQ--VNYTRARELEKQFGLSPCFV 508
Query: 348 -----SLSQVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVF--------KGITNSVP 392
+ PE+ FR GA ++L +N+F +V + D+ C G
Sbjct: 509 LAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAV 568
Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
I GN Q NF V D+E + F+ C K+
Sbjct: 569 ILGNYQQQNFYVECDLENERFGFRSQSCQKR 599
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 163/369 (44%), Gaps = 50/369 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP + DTGS W C+ CP ++ +DP+ S + K +
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 118
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
C + C S + C+ + C Y Y DG + G L T+ + G P +T
Sbjct: 119 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 176
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
FGCG G N+ GI+G G + + +SQ+ AGK FS+CL + I
Sbjct: 177 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 233
Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
F +V P V +TP+ K + ++++ + +I+V L + T IDSG+
Sbjct: 234 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292
Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-------SQVPEVTIHFR 360
TL +LP+ + S +I A P ++ Y+F + P++T HF
Sbjct: 293 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 345
Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
D+ L ++ ++ + C F+ GI + I G+++ +N +V YD+E+Q +
Sbjct: 346 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 404
Query: 415 FKPTDCTKQ 423
+ + ++
Sbjct: 405 WTEHNSVEE 413
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 158/388 (40%), Gaps = 25/388 (6%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----YLIRISIGTPPTERLAVAD 109
+R L R R+ Q S+S + I P+ + Y + +GTP T L D
Sbjct: 65 VRSDLQRQKRRVGGKYQLLSLSQGGS----IFPSGNDLGWLYYTWVDVGTPNTSFLVALD 120
Query: 110 TGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
TGSDL W C+ C P Y +D ++ P S+T + LPCS C+ + +
Sbjct: 121 TGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPK 180
Query: 164 VNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTG 220
C Y++ Y + + S+G L + + L S G A + GCG G L G
Sbjct: 181 QPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVNASVIIGCGKKQSGSYLEGIAPDG 240
Query: 221 IVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT 278
++GLG DIS+ S + + FS C S +I FG G+ + P+
Sbjct: 241 LLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQ 300
Query: 279 FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
Y + +D +G++ + ++D+GT+ T LP ++ I A +
Sbjct: 301 TYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDY 360
Query: 339 SLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 396
S E CYS L VP +T+ F + + +VF P
Sbjct: 361 SFEYCYSTGPLEMPDVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVG 420
Query: 397 IMQTNFLVGY----DIEQQTVSFKPTDC 420
I+ NF+VGY D E + + ++C
Sbjct: 421 IIGQNFMVGYHVVFDRENMKLGWYRSEC 448
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 147/350 (42%), Gaps = 23/350 (6%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
Y + +GTP T L DTGSDL W C+ C P Y +D ++ P S+T +
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSR 161
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
LPCS C+ + + C Y++ Y + + S+G L + + L S G A +
Sbjct: 162 HLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVNASVI 221
Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG G L G++GLG DIS+ S + + FS C S +I FG
Sbjct: 222 IGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGD 281
Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
G+ + P+ Y + +D +G++ + ++D+GT+ T LP +
Sbjct: 282 QGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKS 341
Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
+ I A + S E CYS L VP +T+ F + N + ++
Sbjct: 342 ITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLTF-AENKSFQAVNPILPFND 400
Query: 378 ---DIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
+ + + P+ I+ NF+VGY D E + + ++C
Sbjct: 401 RQGEFAVFCLAVLPSPEPV--GIIGQNFMVGYHVVFDRENMKLGWYRSEC 448
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 164/381 (43%), Gaps = 42/381 (11%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A+ + + P + N Y + I+IG PP DTGSDL W QC+ P C
Sbjct: 37 TRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVHCLEA 95
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L N + + C Y V Y DG S G L +
Sbjct: 96 PHPLYQP----SNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDV 151
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
+L T G + P + GCG + G + G++GLG G +S++SQ+ + +
Sbjct: 152 FSLNYTKGLRLT-PRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 210
Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
+CL + + FG N + V TP+ + +K + + G + G+
Sbjct: 211 VGHCLSSLGGGILFFG-NDLYDSSRVSWTPMARENSKHYSPAMGGELLFGGRTTGLKNLL 269
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
V DSG++ T+ + ++ + +P+ A +L LC+ + EV
Sbjct: 270 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 329
Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
+F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 330 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 389
Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
++ YD E+Q++ + P DC +
Sbjct: 390 QMIIYDNEKQSIGWIPADCDE 410
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 135/280 (48%), Gaps = 27/280 (9%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
Q ++ P +Y + ++IG P DTGSDL W QC+ PC C PL+ P
Sbjct: 45 QGNVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTA 101
Query: 140 SSTYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
+S +PC+++ C +L N K S C Y + Y D + S G L + +L +
Sbjct: 102 NSL---VPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRS 158
Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCL 247
PG+TFGCG + G + T G++GLG G +SL+SQ++ K +CL
Sbjct: 159 SN--IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216
Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + IV V P+ K + +Y + + LGV ++V DSG
Sbjct: 217 STNGGGFLFFGDD-IVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275
Query: 307 TTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY 344
+T T F Q Y + + ++ S + ++ + V+DP SL LC+
Sbjct: 276 STYTYFTAQPYQAVVSALKSGLSKSLKQVSDP--SLPLCW 313
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 131/462 (28%), Positives = 188/462 (40%), Gaps = 94/462 (20%)
Query: 39 PKSPFYNSSETP---YQRLRDALTRSLNRLNHFNQNSSI-------SSSKASQADII--P 86
P SPF +S ++P Y LR S+ R + +SI SS+ + A ++ P
Sbjct: 22 PLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVVKSP 81
Query: 87 NNAN----YLIRISIGTPPTERLAVADTGSDLIWTQCEP------CPPSQCYMQDSPLFD 136
+A Y + +S GTP V DTGS L+W C C S P F
Sbjct: 82 LSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFI 141
Query: 137 PKMSSTYKSLPCSSSQCASL------------NQKSCSGVNC-QYSVSYGDGSFSNGNLA 183
PK SS+ K + C S +C L N ++C+ V C Y + YG GS + G L
Sbjct: 142 PKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCT-VGCPPYILQYGLGS-TAGVLI 199
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
TE + T +P GC + + GI G G G +SL SQM +F
Sbjct: 200 TEKLDFPDLT-----VPDFVVGCSI----ISTRQPAGIAGFGRGPVSLPSQMNLK---RF 247
Query: 244 SYCLVPVSSTKINFGTN-------GIVSG---PGVVSTPLTKAKT--------FYVLTID 285
S+CLV N T+ G SG PG+ TP K +Y L +
Sbjct: 248 SHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLR 307
Query: 286 AISVGNQRLGVSTPDI----------VIDSGTTLTFLPQG----YNSNLLSVMSSMIEAQ 331
I VG + + + + ++DSG+T TF+ + S MS+ +
Sbjct: 308 RIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREK 367
Query: 332 PVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVF--- 384
+ TG L C++ + VPE+ F+ GA ++L SN+F V D VC
Sbjct: 368 DLEKETG-LGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSD 426
Query: 385 -----KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
G T I G+ Q N+LV YD+E F C+
Sbjct: 427 KTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 166/381 (43%), Gaps = 42/381 (11%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A + + P + N Y + I+IG PP DTGSDL W QC+ P +C
Sbjct: 40 TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L NQ+ + C Y V Y DG S G L +
Sbjct: 99 PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
++ T G + P + GCG + G + G++GLG G +S++SQ+ + +
Sbjct: 155 FSMNYTKGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213
Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
+CL + + FG + + V TP+++ +K + + G + G+
Sbjct: 214 IGHCLSSLGGGILFFGDD-LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLL 272
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
V DSG++ T+ + ++ + +P+ A +L LC+ + EV
Sbjct: 273 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 332
Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
+F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 333 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 392
Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
++ YD E+Q++ + P DC +
Sbjct: 393 QMIIYDNEKQSIGWMPADCDE 413
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 108/426 (25%), Positives = 172/426 (40%), Gaps = 105/426 (24%)
Query: 77 SKASQADIIPNNANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQD---- 131
S + + I ++Y + ++G+ P++ + + DTGSDL+W PC P +C + +
Sbjct: 5 SPSRRQPISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWF---PCAPFECILCEGKFN 61
Query: 132 ----------------SPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQ-YSVSY 172
SP SS C+ ++C ++ CS C + +Y
Sbjct: 62 ATKPLNITRSHRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAY 121
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
GDGSF +L +T+++ + L TFGC ++ TG+ G G G +SL
Sbjct: 122 GDGSFI-AHLHRDTLSMSQ-----LFLKNFTFGCAHTA----LAEPTGVAGFGRGLLSLP 171
Query: 233 SQMRT---TIAGKFSYCLV---------------------PVSSTKINFGTNGIVSGPGV 268
+Q+ T + +FSYCLV SS ++ F ++ P
Sbjct: 172 AQLATLSPNLGNRFSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNP-- 229
Query: 269 VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLTFLPQGY 316
K FY + + ISVG + + P+ +V+DSGTT T LP
Sbjct: 230 ------KHSYFYCVGLTGISVGKRT--ILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASL 281
Query: 317 NSNLLSVMSSMI-----EAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG--ADVKLSRS 369
+++++ + A V + TG L CY L +VP VT HF G ++V L R
Sbjct: 282 YNSVVAEFDRRVGRVHKRASEVEEKTG-LGPCYFLEGLVEVPTVTWHFLGNNSNVMLPRM 340
Query: 370 NFFVK-------VSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSF 415
N+F + + C + + I GN Q F V YD+E Q V F
Sbjct: 341 NYFYEFLDGEDEARRKVGCLMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGF 400
Query: 416 KPTDCT 421
C
Sbjct: 401 AKRQCA 406
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 166/381 (43%), Gaps = 42/381 (11%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A + + P + N Y + I+IG PP DTGSDL W QC+ P +C
Sbjct: 28 TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 86
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L NQ+ + C Y V Y DG S G L +
Sbjct: 87 PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 142
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
++ T G + P + GCG + G + G++GLG G +S++SQ+ + +
Sbjct: 143 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 201
Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
+CL + + FG + + V TP+++ +K + + G + G+
Sbjct: 202 IGHCLSSLGGGILFFGDD-LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLL 260
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
V DSG++ T+ + ++ + +P+ A +L LC+ + EV
Sbjct: 261 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 320
Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
+F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 321 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 380
Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
++ YD E+Q++ + P DC +
Sbjct: 381 QMIIYDNEKQSIGWMPVDCDE 401
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 120/446 (26%), Positives = 195/446 (43%), Gaps = 103/446 (23%)
Query: 62 LNRLNHFNQNSSISSSKASQADI---IPNNANYLIRISIGTPPTERLAVA---DTGSDLI 115
N +H +++S S+K + + + ++Y + ++G P + + DTGSDL+
Sbjct: 16 FNNTHHLLKSTSTLSAKRFRRQLSLPLSPGSDYTLSFNLG-PRAQAQPITLYMDTGSDLV 74
Query: 116 WTQCEPCPPSQCYM-QDSPLFDPKMSSTY------KSLPCSSSQ--------CA------ 154
W PC P +C + + P P +++T KS CS++ CA
Sbjct: 75 WF---PCAPFKCILCEGKPNASPPVNTTRSVAVSCKSPACSAAHNLASPSDLCAAARCPL 131
Query: 155 -SLNQKSCSGVNCQ-YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
S+ C+ C + +YGDGS L +T++L S + L TFGC
Sbjct: 132 ESIETSDCANFKCPPFYYAYGDGSLI-ARLYRDTLSLSS-----LFLRNFTFGCAYTT-- 183
Query: 213 LFNSKTTGIVGLGGGDISLISQMRT---TIAGKFSYCLVPVS--STKINFGTNGIVS--- 264
++ TG+ G G G +SL +Q+ T + +FSYCLV S S ++ + I+
Sbjct: 184 --LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVRKPSPLILGRYE 241
Query: 265 --------GPGV---VSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD---------- 300
G GV V TP+ K FY + + ISVG +R+ V P+
Sbjct: 242 EEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVG-KRI-VPAPEMLRRVNNRGD 299
Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCYSFNSLSQVP 353
+V+DSGTT T LP G+ ++++ + A+ + + TG L CY NS+++VP
Sbjct: 300 GGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTG-LAPCYYLNSVAEVP 358
Query: 354 EVTIHFRGAD--VKLSRSNFF---------VKVSEDIVCSVFKGITNSVPI-------YG 395
+T+ F G + V L R N+F K + C + + + G
Sbjct: 359 VLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDEAELSGGPGATLG 418
Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
N Q F V YD+E++ V F C
Sbjct: 419 NYQQQGFEVEYDLEEKRVGFARRQCA 444
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 168/372 (45%), Gaps = 41/372 (11%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
D + N Y R+ IGTP E + D+GS + + C C QC S
Sbjct: 84 DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 141
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
P F P +SSTY + C+ C N++S C Y Y + S S+G L + ++ G
Sbjct: 142 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 196
Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVP 249
+ + FGC T G LF+ GI+GLG G +S++ Q+ + I+ FS C
Sbjct: 197 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 254
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPDIV 302
+ G+ + P +V + ++ +Y + + I V + L + S V
Sbjct: 255 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 314
Query: 303 IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PE 354
+DSGTT +LP Q + + +V + + + + P + ++C++ + LS+V P+
Sbjct: 315 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 374
Query: 355 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
V + F G + LS N+ + S E C VF+ + + G I+ N LV YD
Sbjct: 375 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 434
Query: 411 QTVSFKPTDCTK 422
+ + F T+C++
Sbjct: 435 EKIGFWKTNCSE 446
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 168/372 (45%), Gaps = 41/372 (11%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
D + N Y R+ IGTP E + D+GS + + C C QC S
Sbjct: 83 DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 140
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
P F P +SSTY + C+ C N++S C Y Y + S S+G L + ++ G
Sbjct: 141 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 195
Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVP 249
+ + FGC T G LF+ GI+GLG G +S++ Q+ + I+ FS C
Sbjct: 196 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 253
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPDIV 302
+ G+ + P +V + ++ +Y + + I V + L + S V
Sbjct: 254 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 313
Query: 303 IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PE 354
+DSGTT +LP Q + + +V + + + + P + ++C++ + LS+V P+
Sbjct: 314 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 373
Query: 355 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
V + F G + LS N+ + S E C VF+ + + G I+ N LV YD
Sbjct: 374 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 433
Query: 411 QTVSFKPTDCTK 422
+ + F T+C++
Sbjct: 434 EKIGFWKTNCSE 445
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 108/398 (27%), Positives = 170/398 (42%), Gaps = 73/398 (18%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQDS-------------- 132
YLI ++IGTPP + DTGSDL W C C Y +
Sbjct: 82 YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141
Query: 133 ------PLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
P SS C+ + C ++L + +CS ++ +YG G G L
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTR 201
Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
+T+ + GS+ G A +P FGC G + GI G G G +S++SQ+ G F
Sbjct: 202 DTLRVNGSSPGVAKEIPKFCFGC----VGSAYREPIGIAGFGRGTLSMVSQLGFLQKG-F 256
Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGN-- 291
S+C + P S+ + G + S + TP+ + FY + ++AI+VGN
Sbjct: 257 SHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNVS 316
Query: 292 ---------QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSL 340
+ + + IDSGTT T LP+ + S +LS++ S I +
Sbjct: 317 ATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQTGF 376
Query: 341 ELCYSF-----NSLSQ---VPEVTIHF-RGADVKLSRSNFFVKVSED-----IVCSVFK- 385
+LCY N+L+ +P +T HF + L + N F VS + C +F+
Sbjct: 377 DLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMFQS 436
Query: 386 ---GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
G ++G+ Q N V YD+E++ + F+P DC
Sbjct: 437 TDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 166/381 (43%), Gaps = 42/381 (11%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A + + P + N Y + I+IG PP DTGSDL W QC+ P +C
Sbjct: 40 TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L NQ+ + C Y V Y DG S G L +
Sbjct: 99 PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
++ T G + P + GCG + G + G++GLG G +S++SQ+ + +
Sbjct: 155 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213
Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
+CL + + FG + + V TP+++ +K + + G + G+
Sbjct: 214 IGHCLSSLGGGILFFGDD-LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLL 272
Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
V DSG++ T+ + ++ + +P+ A +L LC+ + EV
Sbjct: 273 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 332
Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
+F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 333 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 392
Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
++ YD E+Q++ + P DC +
Sbjct: 393 QMIIYDNEKQSIGWMPVDCDE 413
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 170/376 (45%), Gaps = 51/376 (13%)
Query: 86 PNNANYLIRISIGTP--PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
P Y + ++IGT + V DT S L W +C C P Q Q SP+FDP SS+Y
Sbjct: 69 PLEYTYGVAVTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQ--RQRSPVFDPSDSSSY 126
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ L +S C + N +G C + + G+ ++G + T+T+ LG+ T + + +
Sbjct: 127 RPLHPTSPLCRAPNPVLPAGDKCSFHLP-GE---AHGYVGTDTIILGNPT---LPIHSVA 179
Query: 204 FGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKIN 256
FGC + G F++K T G +G+G SLI Q++ + +FSYCL+ P + I
Sbjct: 180 FGCAQSTEG-FDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIR 238
Query: 257 FGT----------NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGN-----------QRLG 295
FG + I P P A + Y + + IS+ +R
Sbjct: 239 FGADIPDPTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRS 298
Query: 296 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQV 352
+ +D+GT +T L + + ++ M++ + V DP SL S +
Sbjct: 299 DGSGGCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDPNFSLCFREHPGIWSHI 358
Query: 353 PEVTIHFRG------ADVKLSRSNFFVKV-SEDIVC-SVFKGITNSVPIYGNIMQTNFLV 404
P++T+ F G A +++ N F+KV ++ +VC V++ S + G + Q +
Sbjct: 359 PKLTLDFEGPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRF 418
Query: 405 GYDIEQQTVSFKPTDC 420
+D+ T++F C
Sbjct: 419 IFDLHANTITFHRESC 434
>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
Length = 330
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 161/347 (46%), Gaps = 63/347 (18%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
V DT SDL+WTQC+PC C Q ++DP + TY +L S+
Sbjct: 6 VFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSSN---------------- 47
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
Y+ +Y SF++G ATET LG+ T + ITFGCGT N G +++ + G+G
Sbjct: 48 -YNYTYSKQSFTSGYFATETFALGNVT-----VANITFGCGTRNQGYYDNVAG-VFGVGR 100
Query: 227 GDISLISQMRTTIAGKFSYCLVPV------------SSTKINFGTNGIVSGPGVVSTPLT 274
G +SL++Q+ +FSYC S T + +V+ P+
Sbjct: 101 GGVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVL 157
Query: 275 KAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGTTLTFLPQG----YNSNLLS 322
K+ F L ++VG R+ V+ +VIDS + +T L + L++
Sbjct: 158 KSGYFVKLV--GVTVGATRVDVAGASSAEGGGRALVIDSTSPVTVLDEATYGPVRRALVA 215
Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQVP-----EVTIHFRG--ADVKLSRSNFFVKV 375
++ + EA A L+LC+ + P +T+HF G AD+ L +N+ K
Sbjct: 216 QLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAKD 275
Query: 376 SE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
S ++C ++ +N VP+ G+ + LV YD+ + VSF+P DC
Sbjct: 276 SAGGLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDC 322
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 157/376 (41%), Gaps = 64/376 (17%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ + IGTPP + + DTGS L W QC P + S +FDP +SS++ LPC+
Sbjct: 78 LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRK--PPPSTVFDPSLSSSFSVLPCNHP 135
Query: 152 QCA----SLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + +N C YS Y DG+ + GNL E +T +T Q+ P + G
Sbjct: 136 LCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITF--STSQST--PPLILG 191
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
C + S GI+G+ G +S SQ + T KFSYC VP + F G +
Sbjct: 192 CAED-----ASDDKGILGMNLGRLSFASQAKIT---KFSYC-VPTRQVRPGFTPTGSFYL 242
Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVSTPDI-------- 301
P TF + + + I +GN++L +
Sbjct: 243 GENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAG 302
Query: 302 --VIDSGTTLTFLPQ-GYN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QV 352
+IDSG+ T+L YN ++ + ++ V +G ++C+ N++ +
Sbjct: 303 QSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVY--SGVSDMCFDGNAMEIGRLI 360
Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGY 406
+ F +G ++ + + V + C S G ++ I GN Q N V +
Sbjct: 361 GNMVFEFDKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNLWVEF 418
Query: 407 DIEQQTVSFKPTDCTK 422
DI + V F DC++
Sbjct: 419 DIANRRVGFGKADCSR 434
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 122/447 (27%), Positives = 189/447 (42%), Gaps = 44/447 (9%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDS-------PKSPFYNSSETPYQRL---RDAL 58
IL + +V+ E G F E HR S P N + Y R+ RD L
Sbjct: 14 LILMLVSSWVLDRCEG-LGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL 72
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIW 116
R RL +++ S+ + I N +L +++GTP L DTGSDL W
Sbjct: 73 IRG-RRLA--SEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFW 129
Query: 117 TQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
C+ C + C + D ++ P SST +PC+S+ C +++ + +C
Sbjct: 130 LPCD-CS-TNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCP 187
Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFN--SKTTGIVG 223
Y + Y +G+ S G L + + L S + + IT GCG G+F+ + G+ G
Sbjct: 188 YQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFG 247
Query: 224 LGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLT--KAKTF 279
LG DIS+ S + A FS C + +I+FG G V TPL +
Sbjct: 248 LGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQR---ETPLNIRQPHPT 304
Query: 280 YVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTG 338
Y +T+ ISVG G D V D+GT+ T+L Y S S ++ + D
Sbjct: 305 YNVTVTQISVGGN-TGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSEL 363
Query: 339 SLELCYSF--NSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIY 394
E CY+ N S + P+V + +G V ED V + + + I
Sbjct: 364 PFEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSEDISII 423
Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDCT 421
G T + V +D E+ + +K +DC+
Sbjct: 424 GQNFMTGYRVVFDREKLILGWKESDCS 450
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 117/419 (27%), Positives = 182/419 (43%), Gaps = 52/419 (12%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRD-ALTRSLNRLNHFNQNS-SISSSKASQADIIPNN 88
+ + H +S SPF S L+D A L+ L ++S I+S +A I +
Sbjct: 31 LRVFHINSQCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRA-----IVQS 85
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y++R +IGTP L DT +D W C C S LFDP SS+ ++L C
Sbjct: 86 PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141
Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ QC SC+ +C ++++YG GS L +T+TL S +P TFGC
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
N + G++GLG G +SLISQ + FSYCL +S NF + + GP
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--PNSKSSNF-SGSLRLGPK 251
Query: 268 -----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTL 309
+ +TPL K + Y + + I VGN+ + + T + + DSGT
Sbjct: 252 NQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQVPEVTIHFRGADVKLS 367
T L + ++V + A+ T G + CYS + + P VT F G +V L
Sbjct: 312 TRLVE---PAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVV--FPSVTFMFAGMNVTLP 366
Query: 368 RSNFFVKVSE-DIVCSVFKG----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N + S ++ C + + + + ++ Q N V D+ + CT
Sbjct: 367 PDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 160/364 (43%), Gaps = 59/364 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A+ D P+ C P SST++
Sbjct: 67 NVANF----TIGTPPQPASAIIDVAG-----------PAPCSF-------PNASSTFRPE 104
Query: 147 PCSSSQCASLNQKSCSGVNCQY--SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC + C S+ +CS C Y +++ G + G +AT+T +G+ T + F
Sbjct: 105 PCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGF 158
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNG 261
GC +G +G++GLG SL+SQM T KFSYCL P S +++ G++
Sbjct: 159 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGSSA 215
Query: 262 IVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFL 312
++G P V ++P +Y + +D I G+ + + S +++ + ++FL
Sbjct: 216 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPMSFL 275
Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR--GADVKLSR 368
L ++ + A P A P +LC+ LS P++ F+ A + +
Sbjct: 276 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPP 335
Query: 369 SNFFVKVSED--IVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
+ + V E+ VC + ++ I G++ Q N D+E++T+SF+P
Sbjct: 336 PKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPA 395
Query: 419 DCTK 422
DC
Sbjct: 396 DCAH 399
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 171/372 (45%), Gaps = 52/372 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+P + DTGSD++W C CP S + FD SST +
Sbjct: 83 YFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVS 142
Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG 201
C+ C+ Q + SG + C Y+ YGDGS + G ++T+ + GQ++
Sbjct: 143 CADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANS 202
Query: 202 ---ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSST 253
I FGC T G + GI G G G +S+ISQ+ R FS+CL
Sbjct: 203 SSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL------ 256
Query: 254 KINFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST-------- 298
G NG +V G P +V +PL + Y L + +I+V Q L + +
Sbjct: 257 --KGGENGGGVLVLGEILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNN 314
Query: 299 PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF-NSLSQV-PE 354
++DSGTTL +L Q YN + ++ +++ + ++P+ CY NS+ + P+
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ---CYLVSNSVGDIFPQ 371
Query: 355 VTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
V+++F GA + L+ ++ + S + C F+ + I G+++ + + YD+
Sbjct: 372 VSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDLA 431
Query: 410 QQTVSFKPTDCT 421
Q + + +C+
Sbjct: 432 NQRIGWADYNCS 443
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/403 (27%), Positives = 171/403 (42%), Gaps = 36/403 (8%)
Query: 45 NSSETPYQRL---RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
N + Y R+ RD L R RL + +Q+ S + + +++GTP
Sbjct: 56 NRDSSKYYRVMAHRDRLIRG-RRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPS 114
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQ 152
+ DTGSDL W PC + C + D ++ P SST +PC+S+
Sbjct: 115 DWFMVALDTGSDLFWL---PCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTL 171
Query: 153 CASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNN 210
C ++ + +C Y + Y +G+ S G L + + L S + A+P +TFGCG
Sbjct: 172 CTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQ 231
Query: 211 GGLFN--SKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
G+F+ + G+ GLG DIS+ S + A FS C + +I+FG G V
Sbjct: 232 TGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQR 291
Query: 267 GVVSTPLT--KAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVM 324
TPL + Y +T+ ISVG G D V DSGT+ T+L + +
Sbjct: 292 ---ETPLNIRQPHPTYNITVTKISVGGN-TGDLEFDAVFDSGTSFTYLTDAAYTLISESF 347
Query: 325 SSMI--EAQPVADPTGSLELCYSF--NSLS-QVPEVTIHFRGADVKLSRSNFFV--KVSE 377
+S+ + D E CY+ N S Q P V + +G V
Sbjct: 348 NSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDT 407
Query: 378 DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
D+ C I + + I G T + V +D E+ + +K +DC
Sbjct: 408 DVYCLAIMKIED-ISIIGQNFMTGYRVVFDREKLILGWKESDC 449
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 172/369 (46%), Gaps = 46/369 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP + DTGSD++W C CP + FDP S T +
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS +C+ Q S SG + C Y+ YGDGS ++G ++ + G ++ +
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNST 200
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
+ FGC T+ G + GI G G +S+ISQ+ + IA + FS+CL K
Sbjct: 201 APVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL------K 254
Query: 255 INFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
G GI + P +V TPL ++ Y + + +ISV Q L ++ P +
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPIN-PSVFSTSNGQG 313
Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIH 358
+ID+GTTL +L + + +++ + +Q V CY S+ + P V+++
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSKGNQCYVITTSVGDIFPPVSLN 372
Query: 359 FR-GADVKLSRSNFFVKVSE----DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
F GA + L+ ++ ++ + + C F+ I N + I G+++ + + YD+ Q
Sbjct: 373 FAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQR 432
Query: 413 VSFKPTDCT 421
+ + DC+
Sbjct: 433 IGWANYDCS 441
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 156/380 (41%), Gaps = 55/380 (14%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 98 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 156
Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 157 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 216
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------- 250
+ FGC + +TGI+ LG G S ++Q+ +FSYC +P
Sbjct: 217 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 272
Query: 251 -------SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA----------------I 287
S++ + FG++ ++G P + + Y + + + +
Sbjct: 273 DDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 329
Query: 288 SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
V + + P +++DSGTTL +LP L + I D T CY N
Sbjct: 330 YVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGN 388
Query: 348 SLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
+ VT+ F GAD++L ++ F ++ED VC + I G Q N
Sbjct: 389 MTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQRNI 446
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
VGYD+ ++F C +
Sbjct: 447 NVGYDLSTMEIAFDRDQCDR 466
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 156/380 (41%), Gaps = 55/380 (14%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 119 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 177
Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 178 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 237
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------- 250
+ FGC + +TGI+ LG G S ++Q+ +FSYC +P
Sbjct: 238 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 293
Query: 251 -------SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA----------------I 287
S++ + FG++ ++G P + + Y + + + +
Sbjct: 294 DDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 350
Query: 288 SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
V + + P +++DSGTTL +LP L + I D T CY N
Sbjct: 351 YVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGN 409
Query: 348 SLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
+ VT+ F GAD++L ++ F ++ED VC + I G Q N
Sbjct: 410 MTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQRNI 467
Query: 403 LVGYDIEQQTVSFKPTDCTK 422
VGYD+ ++F C +
Sbjct: 468 NVGYDLSTMEIAFDRDQCDR 487
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 26/351 (7%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYKS 145
Y +++GTP L DTGSDL W C+ C Q + ++ P SST K
Sbjct: 130 YYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKE 189
Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-IT 203
+ CSSS C+ L+Q S C Y VSY D + S G L + + L + Q+ + IT
Sbjct: 190 VQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARIT 249
Query: 204 FGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG + G F S G+ GLG ++S+ S + I+ FS C P +I FG
Sbjct: 250 LGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGD 309
Query: 260 NGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTPD--IVIDSGTTLTFLPQG 315
G PG TP L + Y ++I I VG +S D ++ DSGT+ T+L
Sbjct: 310 KG---SPGQNETPFNLGRRHPTYNVSITQIGVGGH---ISDLDVAVIFDSGTSFTYLNDP 363
Query: 316 YNSNLLSVMSSMI-EAQPVADPTGSLELCYSFN---SLSQVPEVTIHFR-GADVKLSRSN 370
S +SM+ E Q + E CY + + P + + + G ++
Sbjct: 364 AYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVINHPI 423
Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ + + ++S+ I G T + + +D E+ + +K ++CT
Sbjct: 424 VLISTESKRLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCT 474
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 117/419 (27%), Positives = 182/419 (43%), Gaps = 52/419 (12%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRD-ALTRSLNRLNHFNQNS-SISSSKASQADIIPNN 88
+ + H +S SPF S L+D A L+ L ++S I+S +A I +
Sbjct: 31 LRVFHINSLCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRA-----IVQS 85
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y++R +IGTP L DT +D W C C S LFDP SS+ ++L C
Sbjct: 86 PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141
Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ QC SC+ +C ++++YG GS L +T+TL S +P TFGC
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
N + G++GLG G +SLISQ + FSYCL +S NF + + GP
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--PNSKSSNF-SGSLRLGPK 251
Query: 268 -----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTL 309
+ +TPL K + Y + + I VGN+ + + T + + DSGT
Sbjct: 252 NQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311
Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQVPEVTIHFRGADVKLS 367
T L + ++V + A+ T G + CYS + + P VT F G +V L
Sbjct: 312 TRLVE---PAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVV--FPSVTFMFAGMNVTLP 366
Query: 368 RSNFFVKVSE-DIVCSVFKG----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N + S ++ C + + + + ++ Q N V D+ + CT
Sbjct: 367 PDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 113/411 (27%), Positives = 167/411 (40%), Gaps = 92/411 (22%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP----------- 137
++Y + ++G DTGSDL+W C P C ++ DP
Sbjct: 73 SDYTLSFNLGPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTP 132
Query: 138 ----------KMSSTYKSLPCSSSQCA--SLNQKSCSGVNC-QYSVSYGDGSFSNGNLAT 184
SST S C+ + C S+ K C +C + +YGDGS +L
Sbjct: 133 ISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLI-ASLYR 191
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT---TIAG 241
+T++L + + L TFGC S+ TG+ G G G +SL +Q+ T +
Sbjct: 192 DTLSLST-----LQLTNFTFGCAHTT----FSEPTGVAGFGRGLLSLPAQLATHSPQLGN 242
Query: 242 KFSYCLVPVS--STKI---------NFGTNGIVSGPGVVSTPLT------KAKTFYVLTI 284
+FSYCLV S S +I + +G VV T K FY + +
Sbjct: 243 RFSYCLVSHSFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGL 302
Query: 285 DAISVGNQRLGVSTPDI------------VIDSGTTLTFLPQGYNSNLLS-----VMSSM 327
ISVG + V P I V+DSGTT T LP+ + ++++ S
Sbjct: 303 KGISVGKKT--VPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSN 360
Query: 328 IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD--VKLSRSNFF---------VKVS 376
A + TG L CY N+ + VP VT+ F G + V L R N+F V+
Sbjct: 361 RRAPEIEQKTG-LSPCYYLNTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRK 419
Query: 377 EDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
E + C +F + + GN Q F V YD+E++ V F C
Sbjct: 420 ERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 26/351 (7%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYKS 145
Y +++GTP L DTGSDL W C+ C Q + ++ P SST K
Sbjct: 107 YYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKE 166
Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-IT 203
+ CSSS C+ L+Q S C Y VSY D + S G L + + L + Q+ + IT
Sbjct: 167 VQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARIT 226
Query: 204 FGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
GCG + G F S G+ GLG ++S+ S + I+ FS C P +I FG
Sbjct: 227 LGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGD 286
Query: 260 NGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTPD--IVIDSGTTLTFLPQG 315
G PG TP L + Y ++I I VG +S D ++ DSGT+ T+L
Sbjct: 287 KG---SPGQNETPFNLGRRHPTYNVSITQIGVGGH---ISDLDVAVIFDSGTSFTYLNDP 340
Query: 316 YNSNLLSVMSSMI-EAQPVADPTGSLELCYSFN---SLSQVPEVTIHFR-GADVKLSRSN 370
S +SM+ E Q + E CY + + P + + + G ++
Sbjct: 341 AYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVINHPI 400
Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
+ + + ++S+ I G T + + +D E+ + +K ++CT
Sbjct: 401 VLISTESKRLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCT 451
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 179/429 (41%), Gaps = 37/429 (8%)
Query: 29 FSVELIHR--DSPKSPFYN------SSETPYQR----LRDALTRSLNR--LNHFNQNSSI 74
FS +LIHR D K+ F + + P +R R L+ L R L + +
Sbjct: 25 FSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLGAEYQLL 84
Query: 75 SSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE--PCPP-SQCYM 129
S+ S A + N +L I IGTP L D GSDL+W C+ C P S Y
Sbjct: 85 FPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAPLSASYY 144
Query: 130 ----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLAT 184
+D + P +SST K L C+ C + S C Y SY + + S+G L
Sbjct: 145 DRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIE 204
Query: 185 ETVTLGSTTGQAV---ALPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQMRTT- 238
+ + L + A + GCG G F+ + G++GLG GD+S+ S +
Sbjct: 205 DRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAG 264
Query: 239 -IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
+ FS C S I FG G+V+ PL Y++ ++ VG+ L +
Sbjct: 265 LVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTA 324
Query: 298 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEV 355
++DSGT+ TFLP ++ + A + + CY+ +S L +P V
Sbjct: 325 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTV 384
Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQ 411
T+ F + + +SE+ +VF + I+ NF+ GY D E
Sbjct: 385 TLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENL 444
Query: 412 TVSFKPTDC 420
+ + ++C
Sbjct: 445 KLGWSTSNC 453
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 152/338 (44%), Gaps = 48/338 (14%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+I + +GTP ++ DTGS W CE C C+ + S+T + C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 56
Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
S C Q S + +C + VSY DGS S G L +T+T +PG TFG
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPGFTFG 112
Query: 206 CGTNNGGLFN-SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF--GTNGI 262
C ++ G G++G+G G +S++ Q T G FSYCL P+ ++ F T G
Sbjct: 113 CNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCL-PLQMSERGFFSKTTGY 170
Query: 263 VSGPGVVSTPLTKAK-----------TFYVLTIDAISVGNQRLGV-----STPDIVIDSG 306
S G ++ T + + + + AISV +RLG+ S +V DSG
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 230
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-R 360
+ L+++P LSV+S I + A S CY S+ + +P +++HF
Sbjct: 231 SELSYIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 286
Query: 361 GADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYG 395
GA L FV+ S +D+ C F T SV I G
Sbjct: 287 GARFDLGSHGVFVERSVQEQDVWCLAF-APTESVSIIG 323
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 172/369 (46%), Gaps = 46/369 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP + DTGSD++W C CP + FDP S T +
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS +C+ Q S SG + C Y+ YGDGS ++G ++ + G ++ +
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNST 200
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
+ FGC T+ G + GI G G +S+ISQ+ + IA + FS+CL K
Sbjct: 201 APVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL------K 254
Query: 255 INFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
G GI + P +V TPL ++ Y + + +ISV Q L ++ P +
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPIN-PSVFSTSNGQG 313
Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIH 358
+ID+GTTL +L + + +++ + +Q V CY S+ + P V+++
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSKGNQCYVITTSVGDIFPPVSLN 372
Query: 359 FR-GADVKLSRSNFFVKVSE----DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
F GA + L+ ++ ++ + + C F+ I N + I G+++ + + YD+ Q
Sbjct: 373 FAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQR 432
Query: 413 VSFKPTDCT 421
+ + DC+
Sbjct: 433 IGWANYDCS 441
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 111/418 (26%), Positives = 178/418 (42%), Gaps = 50/418 (11%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK---ASQADIIPN 87
+ + H +S SPF S D L + R + + + ++ S AS I+
Sbjct: 31 LRVFHINSQCSPFKTSVS-----WADTLLQDKARFLYLSSLAGVTKSSVPIASGRGIV-Q 84
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y++R +IGTP L DT +D W C C S LFDP SS+ ++L
Sbjct: 85 SPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQ 140
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C + QC SC+ +C ++++YG GS L +T+TL + +P TFGC
Sbjct: 141 CEAPQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLATDV-----IPNYTFGC 194
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
N + G++GLG G +SLISQ + FSYCL +S NF + + GP
Sbjct: 195 -INKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--PNSKSSNF-SGSLRLGP 250
Query: 267 G-----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTT 308
+ +TPL K + Y + + I VGN+ + + T + + DSGT
Sbjct: 251 KNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTV 310
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
T L + + + ++ A G + CYS + + P VT F G +V L
Sbjct: 311 YTRLVEPAYVAMRNEFRRRVK-NANATSLGGFDTCYSGSVV--FPSVTFMFAGMNVTLPP 367
Query: 369 SNFFVKVSE-DIVCSVFKG----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
N + S ++ C + + + + ++ Q N V D+ + CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 164/382 (42%), Gaps = 62/382 (16%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C + + F P+ S+T+ ++
Sbjct: 57 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCA---TGRAAAAAADSFRPRASATFAAV 113
Query: 147 PCSSSQCASLN---QKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PC S++C+S + SC C+ S+SY DGS S+G LAT+ +G A
Sbjct: 114 PCGSARCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRSA--- 170
Query: 202 ITFGC--GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGT 259
FGC + T G++G+ G +S ++Q T +FSYC+ +
Sbjct: 171 --FGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTR---RFSYCISDRDDAGVLLLG 225
Query: 260 NGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD------I 301
+ + + TPL + + Y + + I VG + L V PD
Sbjct: 226 HSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQT 285
Query: 302 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFNS----- 348
++DSGT TFL + L ++ A + DP+ + + C+
Sbjct: 286 MVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPA--LEDPSFAFQEAFDTCFRVPKGRPPP 343
Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVP----IYGNIM 398
+++P VT+ F GA + ++ KV ++ + C F G + VP + G+
Sbjct: 344 SARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTF-GNADMVPLTAYVIGHHH 402
Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
Q N V YD+E+ V P C
Sbjct: 403 QMNLWVEYDLERGRVGLAPVKC 424
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 179/429 (41%), Gaps = 37/429 (8%)
Query: 29 FSVELIHR--DSPKSPFYN------SSETPYQR----LRDALTRSLNR--LNHFNQNSSI 74
FS +LIHR D K+ F + + P +R R L+ L R L + +
Sbjct: 15 FSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLGAEYQLL 74
Query: 75 SSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE--PCPP-SQCYM 129
S+ S A + N +L I IGTP L D GSDL+W C+ C P S Y
Sbjct: 75 FPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAPLSASYY 134
Query: 130 ----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLAT 184
+D + P +SST K L C+ C + S C Y SY + + S+G L
Sbjct: 135 DRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIE 194
Query: 185 ETVTLGSTTGQAV---ALPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQMRTT- 238
+ + L + A + GCG G F+ + G++GLG GD+S+ S +
Sbjct: 195 DRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAG 254
Query: 239 -IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
+ FS C S I FG G+V+ PL Y++ ++ VG+ L +
Sbjct: 255 LVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTA 314
Query: 298 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEV 355
++DSGT+ TFLP ++ + A + + CY+ +S L +P V
Sbjct: 315 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTV 374
Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQ 411
T+ F + + +SE+ +VF + I+ NF+ GY D E
Sbjct: 375 TLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENL 434
Query: 412 TVSFKPTDC 420
+ + ++C
Sbjct: 435 KLGWSTSNC 443
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 115/426 (26%), Positives = 169/426 (39%), Gaps = 55/426 (12%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H P SP S R DA R L + + ++S+ + P+
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGVTSAPVASGQTPPS--- 78
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +GTP + L DT +D W+ C PC C F P SS+Y SLPC+S
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134
Query: 151 SQCASLNQKSCSGVN--------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
C + C C +S + D SF +L ++T+ LG A+ G
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS----STKINF 257
FGC G G N G++GLG G +SL+SQ +T G FSYCL S +
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRL 248
Query: 258 GTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP------------DIV 302
G G V TPL + Y + + +SVG R V P V
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVG--RTWVKVPAGSFAFDPATGAGTV 304
Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
IDSGT +T + L + A G+ + C++ + ++ P VT+H
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMD 364
Query: 361 GA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
G D+ L N + S + C + + V + N+ Q N V D+ V
Sbjct: 365 GGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVG 424
Query: 415 FKPTDC 420
F C
Sbjct: 425 FAREPC 430
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 144/306 (47%), Gaps = 32/306 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W C CP + FDP SST +
Sbjct: 25 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84
Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAV 197
CS +C + Q S CS N C Y+ YGDGS ++G ++ + L GS T +
Sbjct: 85 CSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 144
Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSS 252
A + FGC G + GI G G ++S+ISQ+ + IA + FS+CL SS
Sbjct: 145 AP--VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSS 202
Query: 253 TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--------DIVID 304
IV P +V T L A+ Y L + +I+V Q L + + ++D
Sbjct: 203 GGGILVLGEIVE-PNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVD 261
Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFRGA 362
SGTTL +L + +S +++ I Q V CY +S+++V P+V+++F G
Sbjct: 262 SGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVSRGNQCYLITSSVTEVFPQVSLNFAGG 320
Query: 363 DVKLSR 368
+ R
Sbjct: 321 ASMILR 326
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 156/382 (40%), Gaps = 57/382 (14%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 100 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 158
Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 159 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 218
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS------ 251
+ FGC + +TGI+ LG G S ++Q+ +FSYC +P S
Sbjct: 219 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 274
Query: 252 ----------STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI-------------- 287
++ + FG++ ++G P + + Y + + ++
Sbjct: 275 DDDDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPV 331
Query: 288 --SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
V + + P +++DSGTTL +LP L + I D T CY
Sbjct: 332 PVYVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYL 390
Query: 346 FNSLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQT 400
N + VT+ F GAD++L ++ F ++ED VC + I G Q
Sbjct: 391 GNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQR 448
Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
N VGYD+ ++F C +
Sbjct: 449 NINVGYDLSTMEIAFDRDQCDR 470
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 156/382 (40%), Gaps = 57/382 (14%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 118 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 176
Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 177 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 236
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS------ 251
+ FGC + +TGI+ LG G S ++Q+ +FSYC +P S
Sbjct: 237 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 292
Query: 252 ----------STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI-------------- 287
++ + FG++ ++G P + + Y + + ++
Sbjct: 293 DDDDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPV 349
Query: 288 --SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
V + + P +++DSGTTL +LP L + I D T CY
Sbjct: 350 PVYVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYL 408
Query: 346 FNSLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQT 400
N + VT+ F GAD++L ++ F ++ED VC + I G Q
Sbjct: 409 GNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQR 466
Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
N VGYD+ ++F C +
Sbjct: 467 NINVGYDLSTMEIAFDRDQCDR 488
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 156/382 (40%), Gaps = 57/382 (14%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 97 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 155
Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 156 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 215
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS------ 251
+ FGC + +TGI+ LG G S ++Q+ +FSYC +P S
Sbjct: 216 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 271
Query: 252 ----------STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI-------------- 287
++ + FG++ ++G P + + Y + + ++
Sbjct: 272 DDDDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPV 328
Query: 288 --SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
V + + P +++DSGTTL +LP L + I D T CY
Sbjct: 329 PVYVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYL 387
Query: 346 FNSLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQT 400
N + VT+ F GAD++L ++ F ++ED VC + I G Q
Sbjct: 388 GNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQR 445
Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
N VGYD+ ++F C +
Sbjct: 446 NINVGYDLSTMEIAFDRDQCDR 467
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 164/381 (43%), Gaps = 80/381 (20%)
Query: 100 PPTERLAVADTGSDLIWTQC----EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
PP V DTGS+L W +C P P + FDP SS+Y +PCSS C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--------FDPTRSSSYSPIPCSSPTCRT 133
Query: 156 LNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
+ SC S C ++SY D S S GNLA E G++T + + FGC +
Sbjct: 134 RTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGS 189
Query: 210 NGG---LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNG 261
G ++KTTG++G+ G +S ISQM KFSYC +S T + G +
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP---KFSYC---ISGTDDFPGFLLLGDSN 243
Query: 262 IVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD------IVI 303
+ TPL + T Y + + I V + L V PD ++
Sbjct: 244 FTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMV 303
Query: 304 DSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCYSFNS------- 348
DSGT TFL S+ L+ + ++ DP G+++LCY +
Sbjct: 304 DSGTQFTFLLGPVYTALRSHFLNRTNGILTV--YEDPDFVFQGTMDLCYRISPVRIRSGI 361
Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNIMQ 399
L ++P V++ F GA++ +S +V ++ + C F + + G+ Q
Sbjct: 362 LHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQ 421
Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
N + +D+++ + P +C
Sbjct: 422 QNMWIEFDLQRSRIGLAPVEC 442
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 163/365 (44%), Gaps = 45/365 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + I+IG D+GSDL W QC+ P + C L+ P ++ L C
Sbjct: 55 YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C SL+ C + CQY + Y D S G L + V L T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168
Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTN 260
CG ++ + T G++GLG G++S ISQ+ + + +CL F +
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL--SDEGGFLFFGD 226
Query: 261 GIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL-PQGYN 317
V GV T ++ ++Y + + G+ +V DSG++ T+ Q YN
Sbjct: 227 EFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYN 286
Query: 318 SNLLSVMSSMIEAQPVAD--PTGSLELCYS----FNSLSQVPE----VTIHF---RGADV 364
S +L+++ + + +P+ D SL +C+ F SL V + + + F + A +
Sbjct: 287 S-ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALRFTKTKNAQI 345
Query: 365 KLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
+L N+ + VC GI N + I G+I + +V YD E++ + + P
Sbjct: 346 QLPPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFP 402
Query: 418 TDCTK 422
T+C K
Sbjct: 403 TNCNK 407
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 133/305 (43%), Gaps = 26/305 (8%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
++ PG+ FGCG + S T G++GLG G +SL+SQ++ K +CL
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG + IV P+ + ++ +Y + G + LGV ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVK 365
++ T+ L+ + + P SL LC+ V +V FR V
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFR--TVV 339
Query: 366 LSRSN 370
LS SN
Sbjct: 340 LSFSN 344
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 153/360 (42%), Gaps = 34/360 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y IS+G+PP DTGS W QC+ P + C PL+ P + T +LP S
Sbjct: 160 YYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRP--ARTADALPASD 217
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C ++ + C Y +SY DGS S G +++ G+ I FGCG +
Sbjct: 218 PLCEGAQHENPN--QCDYEISYADGSSSMGVYVRDSMQFVGEDGERENA-DIVFGCGYDQ 274
Query: 211 GG-LFNS--KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLV--PVSSTKINFGTNGIV 263
G L N+ T G++GL +SL +Q+ R I+ F +C+ P + F + +
Sbjct: 275 QGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYI 334
Query: 264 SGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTP--DIVIDSGTTLTFLPQGYNSN 319
G+ P+ A + I+ G+Q+L +V D+G+T T+ P +
Sbjct: 335 PRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEALTR 394
Query: 320 LLSVMSSMIEAQPVADPTG-SLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKV-- 375
L+S + + V D + +L C + + V +V F+ ++ + FF +
Sbjct: 395 LISSLKEAASPRFVQDDSDKTLPFCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRTFN 454
Query: 376 -----------SEDIVCSVFKGIT---NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
++ V G T +SV I G++ LV YD ++ V + DCT
Sbjct: 455 IRPEHYLVISDKGNVCLGVLNGTTIGYDSVVIVGDVSLRGKLVAYDNDKNEVGWVDFDCT 514
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 114/412 (27%), Positives = 183/412 (44%), Gaps = 44/412 (10%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
+ Q G ++++IH SP SPF S ++ +++ T L L+ SI
Sbjct: 23 DVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVP-I 81
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
AS II + Y++R IGTPP L DT +D W C C C S LF P+
Sbjct: 82 ASGRQII-QSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTAC--DGC---ASTLFAPE 135
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S+T+K++ C++ +C + C + ++++YG S + NL +T+TL +
Sbjct: 136 KSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIA-ANLVQDTITLATD-----P 189
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
+P TFGC + G ++ G++GLG G +SL+SQ + FSYCL S +NF
Sbjct: 190 VPSYTFGCVSKTTGT-SAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKS--LNFS 246
Query: 259 TN---GIVSGPGVVS-TPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---------- 301
+ G V+ P + TPL K + Y + ++AI VG + + + +
Sbjct: 247 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 306
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 361
+ DSGT T L + + + G + CY N VP +T F G
Sbjct: 307 IFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCY--NVPIVVPTITFIFTG 364
Query: 362 ADVKLSRSNFFVK-VSEDIVCSVFKGITNSV----PIYGNIMQTNFLVGYDI 408
+V L + N + + C G ++V + N+ Q N V YD+
Sbjct: 365 MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDV 416
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 159/360 (44%), Gaps = 35/360 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + ++IG PP DTGSDL W QC+ P C L+ PK + +PCS+
Sbjct: 54 YSVILNIGNPPKAFDFDIDTGSDLTWVQCD-APCKGCTKPRDKLYKPKNN----LVPCSN 108
Query: 151 SQCASL---NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
S C ++ C + C Y + Y D S G L +++ L + G + P + FG
Sbjct: 109 SLCQAVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQ-PKMAFG 167
Query: 206 CGTNNGGLFNS---KTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTN 260
CG + L T GI+GLG G +S++SQ+RT +C + FG +
Sbjct: 168 CGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDH 227
Query: 261 GIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNS 318
+ + TP+ + + T Y + G + G+ ++ DSG++ T+
Sbjct: 228 -LFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQ 286
Query: 319 NLLSVMSSMIEAQPVAD-PTGSLELCYS--------FNSLSQVPEVTIHFRGA---DVKL 366
++L+++ + +P+ D P L +C+ + S +TI F A ++L
Sbjct: 287 SILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSILDIKSYFKPLTISFMNAKNVQLQL 346
Query: 367 SRSNFFVKVSEDIVC-SVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ ++ + + VC + G + + G+I + +V YD E+Q + + P +C +
Sbjct: 347 APEDYLIITKDGNVCLGILNGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGWFPANCDR 406
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 104/356 (29%), Positives = 154/356 (43%), Gaps = 38/356 (10%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM--------QDSP--LFDPKMSSTYK 144
+S+GTP T L DTGSDL W C C S C Q P L+ P SST
Sbjct: 106 VSVGTPATWFLVALDTGSDLFWLPCN-C-GSTCIRDLKEVGLSQSRPLNLYSPNTSSTSS 163
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTL-GSTTGQAVALPGI 202
S+ CS +C ++ S +C Y + Y +F+ G L + + L G I
Sbjct: 164 SIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANI 223
Query: 203 TFGCGTNNGGLFNSKTT--GIVGLGGGDI---SLISQMRTTIAGKFSYCLVPVSST--KI 255
T GCG N G S G++GLG D S++++ + T A FS C + +I
Sbjct: 224 TLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKIT-ANSFSMCFGNIIDVVGRI 282
Query: 256 NFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLP 313
+FG G + TPL T+ Y +++ +SVG +GV + D+GT+ T L
Sbjct: 283 SFGDKGYTDQ---METPLLPTEPSPTYAVSVTEVSVGGDAVGVQLL-ALFDTGTSFTHLL 338
Query: 314 QGYNSNLLSVMSSMI--EAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSR 368
+ + + + +P+ DP E CY + P V + F G R
Sbjct: 339 EPEYGLITKAFDDHVTDKRRPI-DPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLR 397
Query: 369 SNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
+ F+ +ED GI SV NI+ NF+ GY D E+ + +K +DC
Sbjct: 398 NPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDC 453
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 137/286 (47%), Gaps = 33/286 (11%)
Query: 51 YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
Y LR R L R+ +S + DI Y RIS+GTPP + DT
Sbjct: 6 YHTLRKHDQRRLRRM----LPEVVSFPISGDNDIFAMGL-YYTRISLGTPPQQFYVDVDT 60
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSLPCSSSQCASLNQK-SCS--G 163
GS++ W +C PC + + D P+ FDP+ S+T S+ C+ ++C LN+K CS
Sbjct: 61 GSNVAWVKCAPCTGCE-HSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPER 119
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
++C YS+ YGDGS + G + T + + A G + FGCG G ++
Sbjct: 120 LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWS--VD 177
Query: 220 GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK 277
G++G G +SL +Q+ + F++CL S + + G + P +V TP+ +
Sbjct: 178 GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSL-VIGTIREPDLVYTPMVFGE 236
Query: 278 TFYVLTIDAISVGNQRLGVSTP---------DIVIDSGTTLTFLPQ 314
Y + +++G V+TP ++IDSGTTLT+L Q
Sbjct: 237 DHY--NVQLLNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQ 280
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 114/449 (25%), Positives = 185/449 (41%), Gaps = 39/449 (8%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHR--DSPKSPFYN-----SSET-----PYQRLRDA 57
+LF +CF +S + FS +LIHR + KS + SS+T +Q L+
Sbjct: 6 LLFVICFCFLSN-HSIGLTFSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLL 64
Query: 58 LTRSLNR--LNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSD 113
L L R + QN + S S N+ ++L I IGTP L D GSD
Sbjct: 65 LDNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSD 124
Query: 114 LIWTQCE--PCPPSQCYM-----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
L W C+ C P + +D + P +S+T + L C+ C + C
Sbjct: 125 LSWVPCDCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPC 184
Query: 167 QYSVSYGDGSFSNGNLATE------TVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKT 218
Y Y D + S+ E +V+ S + Q + GCG G L +
Sbjct: 185 PYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAP 244
Query: 219 TGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKA 276
G++GLG G IS+ S + I FS C S I FG G S P
Sbjct: 245 DGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGN 304
Query: 277 KTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
Y++ +++ VGN L S ++DSG + T+LP + ++ + AQ ++
Sbjct: 305 YDAYLIEVESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQ 364
Query: 337 TGSLELCYSFNS--LSQVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVFKGITNSV 391
G CY+ +S L VP + + F + + S ++V +++ + C + +
Sbjct: 365 GGPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNY 424
Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
I G T + V +D+E + + ++C
Sbjct: 425 GIIGQNYMTGYRVVFDMENLKLGWSSSNC 453
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 164/373 (43%), Gaps = 40/373 (10%)
Query: 74 ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+ S++ D + Y R+ IGTPP E + DTGS + + C C + C P
Sbjct: 18 LGSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSC--THCGNHQDP 75
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
F P +SS+YK L C S+C++ C G +Y Y + S S+G L + + G +
Sbjct: 76 RFSPALSSSYKPLEC-GSECST---GFCDGSR-KYQRQYAEKSTSSGVLGKDVI--GFSN 128
Query: 194 GQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPV 250
+ + FGC T G L++ GI+GLG G +S+I Q+ + + FS C +
Sbjct: 129 SSDLGGQRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGM 188
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI-------V 302
G +V T ++ +Y L + I VG L + P++ V
Sbjct: 189 DEGGGAMILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLK-PEVFDGKYGTV 247
Query: 303 IDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQ- 351
+DSGTT + P Q + S + + S+ E V P ++CY+ ++LSQ
Sbjct: 248 LDSGTTYAYFPGAAFQAFKSAVKEQVGSLKE---VPGPDEKFKDICYAGAGTNVSNLSQF 304
Query: 352 VPEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
P V F G V LS N+ K+S VF+ + + G I+ N LV Y+
Sbjct: 305 FPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFEN-GDPTTLLGGIIVRNMLVTYN 363
Query: 408 IEQQTVSFKPTDC 420
+ ++ F T C
Sbjct: 364 RGKASIGFLKTKC 376
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 170/368 (46%), Gaps = 35/368 (9%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P +
Sbjct: 49 GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105
Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
K +PC++S C +L N+K + C Y + Y D + S G L ++ +L +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSL-PLRNK 162
Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVP 249
+ P ++FGCG + G + T G++GLG G +SL+SQ++ K +CL
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222
Query: 250 VSSTKINFGTNGI-VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTT 308
+ FG + + S VS + + +Y + + L ++V DSG+T
Sbjct: 223 SGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGST 282
Query: 309 LTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF- 359
T+ Q Y + + ++ S+ ++ + V+DP SL LC+ +F S+S V + ++ F
Sbjct: 283 YTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQFI 340
Query: 360 --RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 414
+ A + + N+ + VC + G S I G+I + +V YD E+ +
Sbjct: 341 FGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQLG 400
Query: 415 FKPTDCTK 422
+ C++
Sbjct: 401 WIRGSCSR 408
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 116/445 (26%), Positives = 183/445 (41%), Gaps = 81/445 (18%)
Query: 44 YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTE 103
++S P+ ++ A + SL R +H ++ S S A+ + Y I +++GTPP
Sbjct: 41 HSSDSDPFHSVKLAASSSLTRAHHLKHRNNNSPSVATTPAYPKSYGGYSIDLNLGTPPQT 100
Query: 104 RLAVADTGSDLIWTQCEP---CPPSQCYMQD-----SPLFDPKMSSTYKSLPCSSSQCAS 155
V DTGS L+W C C S C + P F PK SST K L C + +C
Sbjct: 101 SPFVLDTGSSLVWFPCTSHYLC--SHCNFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGY 158
Query: 156 L---------------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
L ++CS Y + YG G+ + G L + + T +P
Sbjct: 159 LFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDNLNFPGKT-----VP 212
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSST 253
GC L + +GI G G G SL SQM +FSYCLV P SS
Sbjct: 213 QFLVGCSI----LSIRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFDDTPQSSD 265
Query: 254 KI-NFGTNGIVSGPGVVSTPLTKA-------KTFYVLTIDAISVGNQRLGVSTPDI---- 301
+ + G G+ TP + +Y +T+ + VG + + +
Sbjct: 266 LVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGS 325
Query: 302 ------VIDSGTTLTFLPQG-YN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ 351
++DSG+T TF+ + YN L + + + L C++ + +
Sbjct: 326 DGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNISGVKT 385
Query: 352 V--PEVTIHFRGADVKLSRS--NFFVKVSE-DIVC-SVFKGITNSVP-------IYGNIM 398
+ PE T F+G K+S+ N+F V + +++C +V P I GN
Sbjct: 386 ISFPEFTFQFKGG-AKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAIILGNYQ 444
Query: 399 QTNFLVGYDIEQQTVSFKPTDCTKQ 423
Q NF V YD+E + F P +C ++
Sbjct: 445 QQNFYVEYDLENERFGFGPRNCKRK 469
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 148/326 (45%), Gaps = 48/326 (14%)
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV--------NCQYSVSYGDG----SFSNG 180
PL P SS+ + C C L + CS V NC Y +YG+ ++ G
Sbjct: 13 PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
L TET T G A A PGI FGC + G F + + G+VGLG G +SL++Q+
Sbjct: 73 ILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSLVTQLNVE-- 126
Query: 241 GKFSYCLVPVSS--TKINFGTNGIVSGPG--------VVSTPLTKAKTFYVLTIDAISVG 290
F Y L S + I+FG+ V+G +++ P+ + FY + + ISVG
Sbjct: 127 -AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVG 185
Query: 291 NQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTG 338
+ + + ++ DSGTTLT LP Y ++S M +P
Sbjct: 186 GKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAAND 245
Query: 339 SLELCYS-FNSLSQVPEVTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVP 392
+C++ +S + P + +HF GAD+ LS N+ ++ E C + ++
Sbjct: 246 DDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALT 305
Query: 393 IYGNIMQTNFLVGYDIE-QQTVSFKP 417
I GNIMQ +F V +D+ + F+P
Sbjct: 306 IIGNIMQMDFHVVFDLSGNARMLFQP 331
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 125/441 (28%), Positives = 191/441 (43%), Gaps = 87/441 (19%)
Query: 46 SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTER 104
SS+ P+ L + SL+R +H S ++ + + P + Y I ++ GTPP
Sbjct: 39 SSKKPWGSLNHLASLSLSRAHHIK--SPKTNFSLIKTPLFPRSYGGYSISLNFGTPPQTT 96
Query: 105 LAVADTGSDLIWTQCEP---CPPSQCYMQD-----SPLFDPKMSSTYKSLPCSSSQCASL 156
V DTGS L+W C C S+C + P F PK+SS+ K + C + +C+ +
Sbjct: 97 KFVMDTGSSLVWFPCTSRYLC--SECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMI 154
Query: 157 N----QKSC-----SGVNCQ-----YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
Q C + NC Y + YG GS + G L +ET+ + +P
Sbjct: 155 FGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLDFPNKK----TIPDF 209
Query: 203 TFGCGTNNGGLFNSKT-TGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTK 254
GC +F+ K GI G G SL SQ+ KFSYCLV P SS
Sbjct: 210 LVGC-----SIFSIKQPEGIAGFGRSPESLPSQLGLK---KFSYCLVSHAFDDTPTSSDL 261
Query: 255 I-NFGT-NGIVSGPGVVSTPLTKAKT-----FYVLTIDAISVGNQRLGVSTPDIV----- 302
+ + G+ +G+ G+ TP K T +Y + + I +G+ + V +V
Sbjct: 262 VLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDG 321
Query: 303 -----IDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLS 350
+DSGTT TF+ + M+ A + + TG L CY+ SLS
Sbjct: 322 NGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTG-LRPCYNISGEKSLS 380
Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVP----------IYGNIMQ 399
VP++ F+ GA + L SN+F V ++C ++++V I GN Q
Sbjct: 381 -VPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTI--VSDNVAGPGLGGGPAIILGNYQQ 437
Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
NF V +D+E + FK C
Sbjct: 438 RNFYVEFDLENEKFGFKQQSC 458
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 173/369 (46%), Gaps = 46/369 (12%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +G+PP + DTGSD++W C CP + FDP S T +
Sbjct: 81 YYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVS 140
Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS +C+ Q S SG + C Y+ YGDGS ++G ++ + G ++ +
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNST 200
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
+ FGC T+ G + GI G G +S+ISQ+ + +A + FS+CL K
Sbjct: 201 APVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL------K 254
Query: 255 INFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
G GI + P +V TPL ++ Y + + +ISV Q L ++ P +
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPIN-PSVFSTSNGQG 313
Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIH 358
+ID+GTTL +L + + +++ + +Q V CY S++ + P V+++
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSKGNQCYVIATSVADIFPPVSLN 372
Query: 359 FR-GADVKLSRSNFFVKVSE----DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
F GA + L+ ++ ++ + + C F+ I N + I G+++ + + YD+ Q
Sbjct: 373 FAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQR 432
Query: 413 VSFKPTDCT 421
+ + DC+
Sbjct: 433 IGWANYDCS 441
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 157/360 (43%), Gaps = 37/360 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y ++ +IG PP DTGSDL W QC+ PC QC PL+ P T + C
Sbjct: 67 YHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPC--IQCTPAPHPLYQP----TNDLVVCK 120
Query: 150 SSQCASL---NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CASL N + C Y V Y DG S G L + + T+G A P +T GC
Sbjct: 121 DPICASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMR-ARPRLTIGC 179
Query: 207 GTNN-GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIV 263
G + G+ G++GLG G S+++Q+ + + +C + FG + I
Sbjct: 180 GYDQLPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDD-IY 238
Query: 264 SGPGVVSTPLTKAK-TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 322
V+ TP+++ Y + + + G+ +V DSG++ T+ LLS
Sbjct: 239 DSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLS 298
Query: 323 VMSSMIEAQPVADPT--GSLELCYSFNS-LSQVPEVTIHFR------GADVKLSRSNFFV 373
+ + +P+ + +L +C+ + + +F+ G+ K ++S F +
Sbjct: 299 FIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWK-TKSQFEI 357
Query: 374 KVSEDIVC----SVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ ++ SV GI N I G+I LV YD E+Q + ++P++C +
Sbjct: 358 QQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCDR 417
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 169/400 (42%), Gaps = 77/400 (19%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQD--------------- 131
YLI +++GTPP DTGSDL W C C Y +
Sbjct: 29 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 88
Query: 132 -----SPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
SPL SS PC+ + C ++L + +C ++ +YG G G L
Sbjct: 89 RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 148
Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
+T+T GS+ +P FGC G + GI G G G +SL SQ+ G F
Sbjct: 149 DTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG-F 203
Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQR 293
S+C + P S+ + G I S + T L K +Y + ++AI+VGN
Sbjct: 204 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 263
Query: 294 LGVSTPD------------IVIDSGTTLTFLPQGYNSNLLSVMSSMI---EAQPVADPTG 338
+ P ++IDSGTT T LP + + LLS++ S+I AQ TG
Sbjct: 264 -AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTG 322
Query: 339 SLELCYSFNSLSQV--------PEVTIHF-RGADVKLSRSNFFVKV-----SEDIVCSVF 384
+LCY + V P ++ HF + L + N F + S + C +
Sbjct: 323 -FDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLL 381
Query: 385 KGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + +S ++G+ Q N V YD+E++ + F+P DC
Sbjct: 382 QNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 119/417 (28%), Positives = 173/417 (41%), Gaps = 93/417 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT-------QCEPCPPSQCYMQDSPL--FDPKMSS 141
YL+ +SIGTPP DTGSDL W C+ C Q + L F P SS
Sbjct: 21 YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 80
Query: 142 TYKSLPCSSSQC--------------------ASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
T C SS C ASL + +C ++ +YG G+
Sbjct: 81 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140
Query: 182 LATETV-TLGSTTGQAVA---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
L + + T G+ +P FGC G + GI G G G +SL Q+
Sbjct: 141 LTRDVLFTHGNYNNNNNNNKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPFQLGF 196
Query: 238 TIAGKFSYCLVPVS-STKINFGTNGIVSGPGVVS-------TPLTKA---KTFYVLTIDA 286
+ G FS+C +P S NF + I+ + S TPL K+ +Y + +++
Sbjct: 197 SHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLES 255
Query: 287 ISVGNQ----RLGVS----------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI---E 329
I++GN R GVS ++IDSGTT T LP+ S L+S + +I
Sbjct: 256 ITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPR 315
Query: 330 AQPVADPTGSLELCY---------SFNSLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDI 379
A+ V TG +LCY SF +Q+P +T HF V L + N F ++ I
Sbjct: 316 AKQVELNTG-FDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPI 374
Query: 380 VCSVFKGI----------------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+V K + I+G+ Q N V YD+E++ + F+P DC
Sbjct: 375 NSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 431
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 106/409 (25%), Positives = 172/409 (42%), Gaps = 92/409 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCY------MQDSPLFDPKMS 140
YLI ++IGTPP DTGSDL W C C +CY ++ +F P S
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDC--IECYDLKNNDLKSPSVFSPLHS 140
Query: 141 STYKSLPCSSSQCASLNQKS-----CSGVNC---------------QYSVSYGDGSFSNG 180
ST C+SS C ++ C+ C ++ +YG+G +G
Sbjct: 141 STSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISG 200
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
L + + + +P +FGC T+ + GI G G G +SL SQ+
Sbjct: 201 ILTRDIL-----KARTRDVPRFSFGCVTST----YREPIGIAGFGRGLLSLPSQLGFLEK 251
Query: 241 GKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTF------------YVLTIDAIS 288
G FS+C +P ++ ++ G +S LT + F Y + +++I+
Sbjct: 252 G-FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESIT 310
Query: 289 VGNQRLGVSTP------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
+G P +++DSGTT T LP+ + S LL+ + S I P A
Sbjct: 311 IGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTI-TYPRATE 369
Query: 337 TGS---LELCYSF----NSLSQV--------PEVTIHF-RGADVKLSRSNFFVKVSED-- 378
T S +LCY N+L+ + P +T HF A + L + N F +S
Sbjct: 370 TESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSD 429
Query: 379 ---IVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ C +F+ + + ++G+ Q N V YD+E++ + F+ DC
Sbjct: 430 GSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 169/400 (42%), Gaps = 77/400 (19%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQD--------------- 131
YLI +++GTPP DTGSDL W C C Y +
Sbjct: 12 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 71
Query: 132 -----SPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
SPL SS PC+ + C ++L + +C ++ +YG G G L
Sbjct: 72 RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 131
Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
+T+T GS+ +P FGC G + GI G G G +SL SQ+ G F
Sbjct: 132 DTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG-F 186
Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQR 293
S+C + P S+ + G I S + T L K +Y + ++AI+VGN
Sbjct: 187 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 246
Query: 294 LGVSTPD------------IVIDSGTTLTFLPQGYNSNLLSVMSSMI---EAQPVADPTG 338
+ P ++IDSGTT T LP + + LLS++ S+I AQ TG
Sbjct: 247 -AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTG 305
Query: 339 SLELCYSFNSLSQV--------PEVTIHF-RGADVKLSRSNFFVKV-----SEDIVCSVF 384
+LCY + V P ++ HF + L + N F + S + C +
Sbjct: 306 -FDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLL 364
Query: 385 KGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
+ + +S ++G+ Q N V YD+E++ + F+P DC
Sbjct: 365 QNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 170/367 (46%), Gaps = 37/367 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP + DTGSD++W + C CP + FDP S+T +
Sbjct: 84 YFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVS 143
Query: 148 CSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGS---TTGQAVAL 199
CS +C + Q S CS C Y+ YGDGS ++G + + L + ++G+ +
Sbjct: 144 CSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQI 203
Query: 200 -----PGITFGCGT-NNGGLFNSKTT--GIVGLGGGDISLISQMRT--TIAGKFSYCLVP 249
++F C T G L S GI G G ++S+ISQ+ + FS+CL
Sbjct: 204 CQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKG 263
Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDI 301
S IV P +V TPL ++ Y L + +ISV Q L + S
Sbjct: 264 DDSGGGVLVLGEIVE-PNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSNQGT 322
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFR 360
++DSGTTL +L +G +S ++S++ + + +S++ V P+V+++F
Sbjct: 323 IVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQCYLVTSSVNDVFPQVSLNFA 382
Query: 361 -GADVKLSRSNFFVKVSE----DIVCSVF-KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
GA + L+ ++ ++ + + C F K + I G+++ + + YDI Q V
Sbjct: 383 GGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVG 442
Query: 415 FKPTDCT 421
+ DC+
Sbjct: 443 WTNYDCS 449
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 120/430 (27%), Positives = 186/430 (43%), Gaps = 48/430 (11%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
+ Q G ++E+ H SP SPF S + +L+ L L SI
Sbjct: 27 DTQDHGSTLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASMVAGRSIVP-I 85
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
AS II + Y++R IGTPP L DT +D W C C C S LF P+
Sbjct: 86 ASGRQII-QSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTAC--DGC---TSTLFAPE 139
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S+T+K++ C S +C + SC C ++++YG S + N+ +TVTL +
Sbjct: 140 KSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSIA-ANVVQDTVTLATD-----P 193
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
+PG TFGC G ++ G++GLG G +SL+SQ + FSYCL S +NF
Sbjct: 194 IPGYTFGCVAKTTGP-STPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKS--LNFS 250
Query: 259 TN---GIVSGP-GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---------- 301
+ G V+ P + TPL K + Y + + AI VG + + + +
Sbjct: 251 GSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGT 310
Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFNSLSQVPEVTI 357
V DSGT T L + + + A+ T G + CY+ ++ P +T
Sbjct: 311 VFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIVA--PTITF 368
Query: 358 HFRGADVKLSRSNFFVKVSED-----IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
F G +V L + N + + + S + + + + N+ Q N V YD+
Sbjct: 369 MFSGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 428
Query: 413 VSFKPTDCTK 422
+ CTK
Sbjct: 429 LGVARELCTK 438
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 164/371 (44%), Gaps = 41/371 (11%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
+ ++ P+ Y I +G PP DTGSDL W QC+ PC + C PL+ P
Sbjct: 182 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 236
Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+ K +P S C L +Q C C Y + Y D S S G LA + + L +T G
Sbjct: 237 -AKEKIVPPRDSLCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNGGR 295
Query: 197 VALPGITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLV-PV 250
L FGC + G S KT GI+GL ISL SQ+ + I+ F +C+
Sbjct: 296 EKL-DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354
Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGV-STPDIVIDSGTT 308
+ F + V G+ P+ Y ++ G+Q L ++ ++ DSG++
Sbjct: 355 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414
Query: 309 LTFLPQGYNSNLLSVMSSMIEAQP--VADPTG-SLELCYS--FNSLSQVPEVTIHFRGAD 363
T+LP+ NL+ + E P V D + +L LC+ F+ S + +HF
Sbjct: 415 YTYLPEEMYKNLIDAIK---EDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRW 471
Query: 364 VKLSRSNFFVKVSEDIVC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQ 411
+ ++ F V +D + +V G+ N S I G++ LV YD E++
Sbjct: 472 FVVPKT--FTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERR 529
Query: 412 TVSFKPTDCTK 422
+ + ++CTK
Sbjct: 530 QIGWANSECTK 540
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 168/376 (44%), Gaps = 48/376 (12%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCNKVPHPLYRP--- 103
Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
+ K +PC C+SL+ + C C Y + Y D S G L T++ +
Sbjct: 104 TKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAV-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
++ P + FGCG + + T G++GLG G ISL+SQ++ K +CL
Sbjct: 163 NSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLS 222
Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
+ FG N +V P+ ++ K +Y ++ G + LGV ++V+DSG
Sbjct: 223 IRGGGFLFFGDN-LVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281
Query: 307 TTLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VT 356
++ T+ Q Y + + ++ S + + + V DP SL LC+ F S+ V + +
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLSKTLKEVFDP--SLPLCWKGKKPFKSVLDVKKEFKSLV 339
Query: 357 IHF---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGY 406
+ F + A +++ N+ + C GI N + I G+I + +V Y
Sbjct: 340 LSFSNGKKALMEIPPENYLIVTKFGNAC---LGILNGSEIGLKDLNIVGDITMQDQMVIY 396
Query: 407 DIEQQTVSFKPTDCTK 422
D E+ + + C +
Sbjct: 397 DNERGQIGWIRAPCDR 412
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 112/425 (26%), Positives = 171/425 (40%), Gaps = 107/425 (25%)
Query: 88 NANYLIRISIGTPPTERLAVA---DTGSDLIWTQCEPCPPSQCYM--------QDSPLFD 136
++Y + +S+G P + V+ DTGSDL+W PC P C + + PL
Sbjct: 87 GSDYTLSLSVG-PASAAAPVSLFLDTGSDLVWF---PCAPFTCMLCEGKPTPGRSGPLPP 142
Query: 137 PKMSSTYKSLPCSSSQCASLNQKS-----CSGVNCQYS-----------------VSYGD 174
P S + +PC+S C++ + + C+ C +YGD
Sbjct: 143 PPDS---RRIPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGD 199
Query: 175 GSFSNGNLATETVTLGS--TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
GS +L V LG+ AVA+ TF C G + G+ G G G +SL
Sbjct: 200 GSLV-AHLRRGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLP 254
Query: 233 SQMRTTIAGKFSYCLV--------------------PVSSTKINFGTNGIVSGPGVVSTP 272
Q+ ++G+FSYCLV P + T+G V P ++ P
Sbjct: 255 GQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTP-LLHNP 313
Query: 273 LTKAKTFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLTFLPQGYNSNLL 321
K FY + ++A+SVG R+ + P++ V+DSGTT T LP + +
Sbjct: 314 --KHPYFYSVALEAVSVGAARI-QARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVA 370
Query: 322 SVMSSMIEAQPV-----ADPTGSLELCYSFNSLSQ-VPEVTIHFRG-ADVKLSRSNFFVK 374
+ + A A+ L CY + + + VP + +HFRG A V L R N+F+
Sbjct: 371 EAFARAMAAAGFARAERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPRRNYFMG 430
Query: 375 V----------SEDIVCSVF------KGITNSVPI--YGNIMQTNFLVGYDIEQQTVSFK 416
+D+ C + G P GN Q F V YD++ V F
Sbjct: 431 FKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFA 490
Query: 417 PTDCT 421
CT
Sbjct: 491 RRRCT 495
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 123/475 (25%), Positives = 196/475 (41%), Gaps = 65/475 (13%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
A L + + + CFY S + Q G E R+ +S P Y +
Sbjct: 83 ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140
Query: 48 -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
+ +R+ D ++ NR+ ++ ++S A + ++ P+ Y I IG PP
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199
Query: 104 RLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKS 160
DTGSDL W QC+ PC + PL+ P + K +P C L NQ
Sbjct: 200 YFLDVDTGSDLTWIQCDAPC--TNFAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNY 254
Query: 161 CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--- 216
C C Y + Y D S S G LA + + + +T G L FGC + G S
Sbjct: 255 CETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPA 313
Query: 217 KTTGIVGLGGGDISLISQMRT--TIAGKFSYCLV-PVSSTKINFGTNGIVSGPGVVSTPL 273
KT GI+GL IS SQ+ + IA F +C+ F + V GV T +
Sbjct: 314 KTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSI 373
Query: 274 TKA-KTFYVLTIDAISVGNQRL-----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVM--S 325
Y + G+Q+L ST ++ DSG++ T+LP NL++ + +
Sbjct: 374 RSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYA 433
Query: 326 SMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKVS-----EDI 379
S Q +D T L LC+ + + + +V F ++ + F+ + ED
Sbjct: 434 SPGFVQDTSDRT--LPLCWKADFPVRYLEDVKQFFEPLNLHFGKKWLFMSKTFTISPEDY 491
Query: 380 VC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
+ +V G+ N S I G++ LV YD +++ + + +DCTK
Sbjct: 492 LIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQIGWADSDCTK 546
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.132 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,634,089,967
Number of Sequences: 23463169
Number of extensions: 284146421
Number of successful extensions: 745117
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1191
Number of HSP's successfully gapped in prelim test: 3638
Number of HSP's that attempted gapping in prelim test: 734244
Number of HSP's gapped (non-prelim): 5903
length of query: 423
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 278
effective length of database: 8,957,035,862
effective search space: 2490055969636
effective search space used: 2490055969636
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)