BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021054
(318 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255541750|ref|XP_002511939.1| conserved hypothetical protein [Ricinus communis]
gi|223549119|gb|EEF50608.1| conserved hypothetical protein [Ricinus communis]
Length = 356
Score = 489 bits (1260), Expect = e-136, Method: Compositional matrix adjust.
Identities = 236/291 (81%), Positives = 258/291 (88%), Gaps = 4/291 (1%)
Query: 1 MSQWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWT 60
M+ +LHQRPLHN L +PYPLSPRNS Q Q + + +RT L+VL SLL++LGV +PWT
Sbjct: 1 MTSFLHQRPLHNSLSDPYPLSPRNSANSQRQ---ISIFSRTGLIVLFSLLLILGVFVPWT 57
Query: 61 GTPGFMFP-NATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQK 119
P +F SS+AKWR YTL QAASFVA+NGT+IVCAVSQPYLPFLNNWLISI+RQK
Sbjct: 58 ELPNGIFSATKQSSVAKWRQYTLPQAASFVAQNGTVIVCAVSQPYLPFLNNWLISITRQK 117
Query: 120 HQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHI 179
HQD+VLVIAEDYATLYKVN +WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRP HLLH+
Sbjct: 118 HQDKVLVIAEDYATLYKVNEKWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPRHLLHL 177
Query: 180 LELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 239
LELGYNVMYNDVDMVWL DPF YL+G HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC
Sbjct: 178 LELGYNVMYNDVDMVWLGDPFIYLEGKHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 237
Query: 240 SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
SCMI+L PT GAKLVMKKWI+ELQA+PWSKAKKANDQPAFNWALNKTAGQV
Sbjct: 238 SCMIFLHPTVGAKLVMKKWIKELQAQPWSKAKKANDQPAFNWALNKTAGQV 288
>gi|225454397|ref|XP_002279469.1| PREDICTED: UDP-galactose:fucoside alpha-3-galactosyltransferase
[Vitis vinifera]
gi|297745375|emb|CBI40455.3| unnamed protein product [Vitis vinifera]
Length = 361
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 218/280 (77%), Positives = 250/280 (89%), Gaps = 9/280 (3%)
Query: 14 LPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPN---A 70
L NP P+SPRN + ++RP + + RT LL+LL+L+VV+GV+LP T P+ +
Sbjct: 20 LSNPNPISPRNPL--DWRRP-ISIFGRTGLLILLTLMVVVGVLLP---TMKMRMPDGLLS 73
Query: 71 TSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED 130
+S++KWRDYTL+QAA FVAKNGT+IVCAVSQPYLPFLNNWLISI+RQKHQD+VLVIAED
Sbjct: 74 RASVSKWRDYTLAQAAEFVAKNGTVIVCAVSQPYLPFLNNWLISIARQKHQDKVLVIAED 133
Query: 131 YATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYND 190
YATLY VN +WPGHAVLVPPAPD+QTAHKFGS GFFNFTSRRP HLL+ILELGYNVMYND
Sbjct: 134 YATLYTVNQKWPGHAVLVPPAPDAQTAHKFGSMGFFNFTSRRPRHLLNILELGYNVMYND 193
Query: 191 VDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDG 250
VDMVWL DPFPYLQG HDVYFTDDMAAVKPL+HSHDLPPPGKKGRTYICSCMI++RPT+G
Sbjct: 194 VDMVWLADPFPYLQGKHDVYFTDDMAAVKPLNHSHDLPPPGKKGRTYICSCMIFMRPTNG 253
Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
AKLVMKKWIEELQA+PWS+AKK+NDQPAFNWALN+TAG+V
Sbjct: 254 AKLVMKKWIEELQAQPWSRAKKSNDQPAFNWALNRTAGEV 293
>gi|449441404|ref|XP_004138472.1| PREDICTED: UDP-galactose:fucoside
alpha-3-galactosyltransferase-like [Cucumis sativus]
gi|449495244|ref|XP_004159776.1| PREDICTED: UDP-galactose:fucoside
alpha-3-galactosyltransferase-like [Cucumis sativus]
Length = 355
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 223/286 (77%), Positives = 251/286 (87%), Gaps = 3/286 (1%)
Query: 5 LHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPG 64
LHQR H+ L N YPLSPR+S + + + +N LL LLSL+V+LGV LPW
Sbjct: 5 LHQRSTHSSLANSYPLSPRSSSSSERSFSIFSPIN---LLALLSLIVILGVFLPWMNIQE 61
Query: 65 FMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQV 124
+F ++ S +KWR+Y+L++AASFVA+NGT+IVCAVSQPYLPFLNNWLIS+SRQKH ++V
Sbjct: 62 SIFSSSKVSNSKWREYSLAEAASFVARNGTVIVCAVSQPYLPFLNNWLISLSRQKHHEKV 121
Query: 125 LVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGY 184
LVIAEDYATLYKVN RWPGHAVLVPPAPD+QTAHKFGSQGFFNFTSRRP HLLHILELGY
Sbjct: 122 LVIAEDYATLYKVNERWPGHAVLVPPAPDAQTAHKFGSQGFFNFTSRRPRHLLHILELGY 181
Query: 185 NVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
NVMYNDVDMVWL DPFPYLQG+HDVYFTDDMAAVKPL HSHDLPPPGKKGRTYICSCMI+
Sbjct: 182 NVMYNDVDMVWLADPFPYLQGNHDVYFTDDMAAVKPLHHSHDLPPPGKKGRTYICSCMIF 241
Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
LRPT GAKLVM+KWIEEL+A+PWSKAKKANDQPAFNWALNKTAG+V
Sbjct: 242 LRPTSGAKLVMRKWIEELKAQPWSKAKKANDQPAFNWALNKTAGEV 287
>gi|296085981|emb|CBI31422.3| unnamed protein product [Vitis vinifera]
Length = 1331
Score = 449 bits (1154), Expect = e-124, Method: Composition-based stats.
Identities = 215/277 (77%), Positives = 239/277 (86%), Gaps = 3/277 (1%)
Query: 14 LPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSS 73
L + YP+S R+S + + V RT LLVLL+L+VVLGV+LP P +F + S
Sbjct: 990 LSDSYPISARSSPNWGRS---ISVFGRTGLLVLLTLVVVLGVVLPGMRAPDGLFGGSKVS 1046
Query: 74 LAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYAT 133
++KWR+YTL +A F AKNGT+IVCAVSQPYLPFLNNWLISISRQKHQD+VLVIAEDYAT
Sbjct: 1047 VSKWREYTLEEAVPFAAKNGTLIVCAVSQPYLPFLNNWLISISRQKHQDKVLVIAEDYAT 1106
Query: 134 LYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
LY VN RWPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL+ILELGYNVMYNDVDM
Sbjct: 1107 LYAVNDRWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLYILELGYNVMYNDVDM 1166
Query: 194 VWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKL 253
VWL DPFPYLQGDHDVYFTDDM AVKPL+HSHDLPPPGKKGRTYICSCMI++RPTDGAKL
Sbjct: 1167 VWLADPFPYLQGDHDVYFTDDMTAVKPLNHSHDLPPPGKKGRTYICSCMIFMRPTDGAKL 1226
Query: 254 VMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
VMK WIEELQA+PWS AKK+NDQPAFNWALN+TA QV
Sbjct: 1227 VMKDWIEELQAQPWSNAKKSNDQPAFNWALNRTAAQV 1263
>gi|224127804|ref|XP_002320168.1| predicted protein [Populus trichocarpa]
gi|222860941|gb|EEE98483.1| predicted protein [Populus trichocarpa]
Length = 284
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 225/287 (78%), Positives = 250/287 (87%), Gaps = 3/287 (1%)
Query: 4 WLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTP 63
+LHQRPLH L +PYPLS + + + +RT L+ +LSLL++LGVILPWTGTP
Sbjct: 1 FLHQRPLHGTLSDPYPLS---PRQSSSSQRQISLFSRTGLIAILSLLLILGVILPWTGTP 57
Query: 64 GFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQ 123
+SLAKW+ YTL QA +FVAKN T+IVCAVSQPYLPFL+NWLISISRQKHQD+
Sbjct: 58 SIFSATKPASLAKWQQYTLPQAVAFVAKNKTVIVCAVSQPYLPFLSNWLISISRQKHQDK 117
Query: 124 VLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELG 183
VLVIAEDYATLY VN RWPGHAVLVPPAPDSQ+AHKFGSQGFFNFTSRRP HLLHILELG
Sbjct: 118 VLVIAEDYATLYNVNERWPGHAVLVPPAPDSQSAHKFGSQGFFNFTSRRPRHLLHILELG 177
Query: 184 YNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI 243
Y+VMYNDVDMVWL DPF YL+G+HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI
Sbjct: 178 YDVMYNDVDMVWLGDPFRYLEGNHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI 237
Query: 244 YLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
++RPTDGAKLVMKKWIEEL+A+PWSK +KANDQPAFNWALNKTAGQV
Sbjct: 238 FMRPTDGAKLVMKKWIEELKAQPWSKTRKANDQPAFNWALNKTAGQV 284
>gi|18411573|ref|NP_567211.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
thaliana]
gi|57222150|gb|AAW38982.1| At4g01220 [Arabidopsis thaliana]
gi|332656596|gb|AEE81996.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
thaliana]
Length = 299
Score = 433 bits (1113), Expect = e-119, Method: Compositional matrix adjust.
Identities = 222/295 (75%), Positives = 250/295 (84%), Gaps = 3/295 (1%)
Query: 3 QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
++LHQRP+ NP NP+ S S + RP + +L+R LL+LL+LLV+LGV LPW G+
Sbjct: 5 KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 62
Query: 63 PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
P F PN S S +KWRDY+L QA FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 63 PLFPSPNKLSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 122
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 123 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 182
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 183 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 242
Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVCSVCLC 296
MI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA QVCS L
Sbjct: 243 MIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTANQVCSFFLV 297
>gi|13877593|gb|AAK43874.1|AF370497_1 unknown protein [Arabidopsis thaliana]
Length = 299
Score = 433 bits (1113), Expect = e-119, Method: Compositional matrix adjust.
Identities = 222/295 (75%), Positives = 250/295 (84%), Gaps = 3/295 (1%)
Query: 3 QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
++LHQRP+ NP NP+ S S + RP + +L+R LL+LL+LLV+LGV LPW G+
Sbjct: 5 KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 62
Query: 63 PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
P F PN S S +KWRDY+L QA FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 63 PLFPSPNKHSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 122
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 123 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 182
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 183 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 242
Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVCSVCLC 296
MI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA QVCS L
Sbjct: 243 MIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTANQVCSFFLV 297
>gi|297810037|ref|XP_002872902.1| hypothetical protein ARALYDRAFT_912111 [Arabidopsis lyrata subsp.
lyrata]
gi|297318739|gb|EFH49161.1| hypothetical protein ARALYDRAFT_912111 [Arabidopsis lyrata subsp.
lyrata]
Length = 356
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 220/289 (76%), Positives = 246/289 (85%), Gaps = 7/289 (2%)
Query: 3 QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
++LHQRP+ NP S S + RP+ L L+R LL+LL+LLV+LGV LPW G+
Sbjct: 5 KFLHQRPIQNPF-----SSSPLSNSSISNRPISL-LSRNGLLILLALLVILGVFLPWAGS 58
Query: 63 PGFMFPNA-TSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
P F FPN +SS +KWRDY+L QA FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 59 PLFPFPNRLSSSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 118
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 119 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 178
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 179 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 238
Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
MI+LRPT+GAKL+MKKWIEELQ +PWS+AKKANDQP FNWALNKTA QV
Sbjct: 239 MIFLRPTNGAKLLMKKWIEELQTQPWSRAKKANDQPGFNWALNKTAHQV 287
>gi|30678839|ref|NP_849279.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
thaliana]
gi|7267619|emb|CAB80931.1| hypothetical protein [Arabidopsis thaliana]
gi|193885155|gb|ACF28391.1| At4g01220 [Arabidopsis thaliana]
gi|332656595|gb|AEE81995.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
thaliana]
Length = 360
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 219/289 (75%), Positives = 247/289 (85%), Gaps = 3/289 (1%)
Query: 3 QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
++LHQRP+ NP NP+ S S + RP + +L+R LL+LL+LLV+LGV LPW G+
Sbjct: 5 KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 62
Query: 63 PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
P F PN S S +KWRDY+L QA FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 63 PLFPSPNKLSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 122
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 123 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 182
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 183 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 242
Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
MI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA QV
Sbjct: 243 MIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTANQV 291
>gi|147777556|emb|CAN69309.1| hypothetical protein VITISV_003084 [Vitis vinifera]
Length = 309
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 201/244 (82%), Positives = 225/244 (92%), Gaps = 6/244 (2%)
Query: 50 LVVLGVILPWTGTPGFMFPN---ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP 106
+VV+GV+LP T P+ + +S++KWRDYTL+QAA FVAKNGT+IVCAVSQPYLP
Sbjct: 1 MVVVGVLLP---TMKMRMPDGLLSRASVSKWRDYTLAQAAEFVAKNGTVIVCAVSQPYLP 57
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FLNNWLISI+RQKHQD+VLVIAEDYATLY VN +WPGHAVLVPPAPD+QTAHKFGS GFF
Sbjct: 58 FLNNWLISIARQKHQDKVLVIAEDYATLYTVNQKWPGHAVLVPPAPDAQTAHKFGSMGFF 117
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
NFTSRRP HLL+ILELGYNVMYNDVDMVWL DPFPYLQG HDVYFTDDMAAVKPL+HSHD
Sbjct: 118 NFTSRRPRHLLNILELGYNVMYNDVDMVWLADPFPYLQGKHDVYFTDDMAAVKPLNHSHD 177
Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKT 286
LPPPGKKGRTYICSCMI++RPT+GAKLVMKKWIEELQA+PWS+AKK+NDQPAFNWALN+T
Sbjct: 178 LPPPGKKGRTYICSCMIFMRPTNGAKLVMKKWIEELQAQPWSRAKKSNDQPAFNWALNRT 237
Query: 287 AGQV 290
AG+V
Sbjct: 238 AGEV 241
>gi|74027164|gb|AAZ94713.1| putative alpha 1,3-xylosyltransferase [Linum usitatissimum]
Length = 357
Score = 422 bits (1085), Expect = e-116, Method: Compositional matrix adjust.
Identities = 221/292 (75%), Positives = 243/292 (83%), Gaps = 5/292 (1%)
Query: 1 MSQWLHQRPLHNPLPNPY-PLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPW 59
M+ LHQRP + + P S S Q RP+ L +R LL LL+L+++LGVI+PW
Sbjct: 1 MTTSLHQRPFQPAFSHRFLPASLHYSNLPQ--RPISL-FSRAGLLSLLALMLILGVIVPW 57
Query: 60 TGTPGFMFP-NATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQ 118
PG +F N SS+ +WR YTL QAASFVAKNGT+IVCAVSQ YLPFLNNWLISISRQ
Sbjct: 58 ADMPGGIFSVNKASSVNQWRHYTLPQAASFVAKNGTLIVCAVSQAYLPFLNNWLISISRQ 117
Query: 119 KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLH 178
K QD VLVIAEDYATL KVN RWPGHAVL+PPA DSQ AHKFGSQGFFNFT+RRP HLL+
Sbjct: 118 KRQDMVLVIAEDYATLDKVNERWPGHAVLIPPALDSQAAHKFGSQGFFNFTARRPQHLLN 177
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELGY+VMYNDVDMVWL DPF YL+G HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI
Sbjct: 178 ILELGYSVMYNDVDMVWLGDPFTYLRGLHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 237
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
CSCMI+LRPTDGAKLVMKKWIEELQA+PWS+AKKANDQPAFNWAL KT GQV
Sbjct: 238 CSCMIFLRPTDGAKLVMKKWIEELQAQPWSRAKKANDQPAFNWALMKTTGQV 289
>gi|2191131|gb|AAB61018.1| A_IG002N01.8 gene product [Arabidopsis thaliana]
Length = 444
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 220/302 (72%), Positives = 247/302 (81%), Gaps = 16/302 (5%)
Query: 3 QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
++LHQRP+ NP NP+ S S + RP + +L+R LL+LL+LLV+LGV LPW G+
Sbjct: 76 KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 133
Query: 63 PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
P F PN S S +KWRDY+L QA FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 134 PLFPSPNKLSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 193
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 194 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 253
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAV-------------KPLDHSHDLP 228
LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM AV KPLDHSHDLP
Sbjct: 254 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAVCIFFSQTSSSLMIKPLDHSHDLP 313
Query: 229 PPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG 288
PPGKKGRTYICSCMI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA
Sbjct: 314 PPGKKGRTYICSCMIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTAN 373
Query: 289 QV 290
QV
Sbjct: 374 QV 375
>gi|225448972|ref|XP_002273155.1| PREDICTED: UDP-galactose:fucoside alpha-3-galactosyltransferase
[Vitis vinifera]
Length = 360
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 214/275 (77%), Positives = 238/275 (86%), Gaps = 3/275 (1%)
Query: 16 NPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLA 75
+ YP+S R+S + + V RT LLVLL+L+VVLGV+LP P +F + S++
Sbjct: 21 DSYPISARSSPNWGRS---ISVFGRTGLLVLLTLVVVLGVVLPGMRAPDGLFGGSKVSVS 77
Query: 76 KWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLY 135
KWR+YTL +A F AKNGT+IVCAVSQPYLPFLNNWLISISRQKHQD+VLVIAEDYATLY
Sbjct: 78 KWREYTLEEAVPFAAKNGTLIVCAVSQPYLPFLNNWLISISRQKHQDKVLVIAEDYATLY 137
Query: 136 KVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
VN RWPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL+ILELGYNVMYNDVDMVW
Sbjct: 138 AVNDRWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLYILELGYNVMYNDVDMVW 197
Query: 196 LKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVM 255
L DPFPYLQGDHDVYFTDDM AVKPL+HSHDLPPPGKKGRTYICSCMI++RPTDGAKLVM
Sbjct: 198 LADPFPYLQGDHDVYFTDDMTAVKPLNHSHDLPPPGKKGRTYICSCMIFMRPTDGAKLVM 257
Query: 256 KKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
K WIEELQA+PWS AKK+NDQPAFNWALN+TA QV
Sbjct: 258 KDWIEELQAQPWSNAKKSNDQPAFNWALNRTAAQV 292
>gi|356524992|ref|XP_003531111.1| PREDICTED: UDP-galactose:fucoside
alpha-3-galactosyltransferase-like [Glycine max]
Length = 356
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/292 (68%), Positives = 233/292 (79%), Gaps = 6/292 (2%)
Query: 1 MSQWLHQRP-LHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPW 59
MS +LHQR L NP NP+P S + + + L ++ TTLL L+SL+V+LGV PW
Sbjct: 1 MSSFLHQRSSLQNPFSNPFPAS---TPSSSNSKKSLSIMGPTTLLALISLIVILGVFCPW 57
Query: 60 TGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGT-IIVCAVSQPYLPFLNNWLISISRQ 118
G P FP + +KW YTL QA SFVAKNG+ +IVC VSQPYLPFLNNWLISIS Q
Sbjct: 58 VGFPQ-GFPFTPTPTSKWAHYTLEQALSFVAKNGSSVIVCIVSQPYLPFLNNWLISISMQ 116
Query: 119 KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLH 178
K QD VLVIAEDYA+L +VN WPGHAVL+PP D++ AHKFGSQGFFNFT+RRP HLL
Sbjct: 117 KRQDMVLVIAEDYASLDRVNLLWPGHAVLIPPVLDAEAAHKFGSQGFFNFTARRPSHLLK 176
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELGY+VMYNDVDMVWL DPFPYLQG+HDVYFTDDM A+KPL+HSHDLPPPGKKGR YI
Sbjct: 177 ILELGYSVMYNDVDMVWLADPFPYLQGNHDVYFTDDMTAIKPLNHSHDLPPPGKKGRPYI 236
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
CSCMI+LRPT+GAKL+++KWIEELQ +PWSK K+NDQPAFNWAL K A +V
Sbjct: 237 CSCMIFLRPTNGAKLILRKWIEELQIQPWSKTVKSNDQPAFNWALMKNAKEV 288
>gi|358348526|ref|XP_003638296.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
truncatula]
gi|355504231|gb|AES85434.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
truncatula]
Length = 357
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 198/292 (67%), Positives = 229/292 (78%), Gaps = 5/292 (1%)
Query: 1 MSQWLHQRPLHNPL--PNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILP 58
MS +LHQR LHNP NP+P S +S + L + + LL LLSL++VLGV P
Sbjct: 1 MSGFLHQRSLHNPFQFSNPFPSSQPSSTN---PKKPLSIFSPAILLSLLSLIIVLGVFSP 57
Query: 59 WTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQ 118
W G P + + ++KW YTL +A +FVAKNGT+IVC VSQPYLPFLNNWLISI+
Sbjct: 58 WVGMPQKLIFTSNPEVSKWGRYTLDEALTFVAKNGTVIVCIVSQPYLPFLNNWLISIAMH 117
Query: 119 KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLH 178
K D VLVIAEDY +LYKVN WPGHAVL+PP D + +HKFGSQGFFNFT+RRP HLL
Sbjct: 118 KRHDMVLVIAEDYPSLYKVNQLWPGHAVLIPPVLDLEASHKFGSQGFFNFTARRPSHLLK 177
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELGY+VMYNDVDMVWL DPFPYLQG+HDVYFTDDM A+KPL+HSHDLPPPGKKGR YI
Sbjct: 178 ILELGYSVMYNDVDMVWLGDPFPYLQGNHDVYFTDDMTAIKPLNHSHDLPPPGKKGRPYI 237
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
CSCMI+LRPTDGAKL++KKW+EELQ EPWS+ KK+NDQPAFNWAL K A V
Sbjct: 238 CSCMIFLRPTDGAKLILKKWMEELQIEPWSRTKKSNDQPAFNWALMKNAKGV 289
>gi|15234316|ref|NP_192086.1| rhamnogalacturonan xylosyltransferase 1 [Arabidopsis thaliana]
gi|4558546|gb|AAD22639.1|AC007138_3 hypothetical protein [Arabidopsis thaliana]
gi|3859601|gb|AAC72867.1| T15B16.8 gene product [Arabidopsis thaliana]
gi|7268220|emb|CAB77747.1| hypothetical protein [Arabidopsis thaliana]
gi|60547843|gb|AAX23885.1| hypothetical protein At4g01770 [Arabidopsis thaliana]
gi|117018033|tpg|DAA05811.1| TPA_exp: 1,3-alpha-D-xylosyltransferase [Arabidopsis thaliana]
gi|332656675|gb|AEE82075.1| rhamnogalacturonan xylosyltransferase 1 [Arabidopsis thaliana]
Length = 361
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/241 (75%), Positives = 205/241 (85%), Gaps = 8/241 (3%)
Query: 58 PWTGTPGFMFPN------ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNW 111
PW G+P F+FPN + S +KWRDYTL+QAA FVAKNGT+IVCAVS P+LPFLNNW
Sbjct: 52 PWPGSPLFLFPNRLSSSLSPSPQSKWRDYTLAQAARFVAKNGTVIVCAVSSPFLPFLNNW 111
Query: 112 LISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSR 171
LIS+SRQKHQD+VLVIAEDY TLYKVN +WPGHAVL+PPA DS+TA FGSQGFFNFT+R
Sbjct: 112 LISVSRQKHQDKVLVIAEDYITLYKVNEKWPGHAVLIPPALDSKTAFSFGSQGFFNFTAR 171
Query: 172 RPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPG 231
RP HLL ILELGYNVMYNDVDMVWL+DPF YL+G HD YFTDDM +KPL+HSHDLP P
Sbjct: 172 RPQHLLQILELGYNVMYNDVDMVWLQDPFLYLEGSHDAYFTDDMPQIKPLNHSHDLPHPD 231
Query: 232 KKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALNKTAGQ 289
+ G TYICSCMIYLRPT+GAKL+MKKW EELQ++ WS++ KANDQPAFN ALNKTA Q
Sbjct: 232 RNGETYICSCMIYLRPTNGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALNKTAHQ 291
Query: 290 V 290
V
Sbjct: 292 V 292
>gi|297814153|ref|XP_002874960.1| hypothetical protein ARALYDRAFT_490401 [Arabidopsis lyrata subsp.
lyrata]
gi|297320797|gb|EFH51219.1| hypothetical protein ARALYDRAFT_490401 [Arabidopsis lyrata subsp.
lyrata]
Length = 367
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/246 (72%), Positives = 204/246 (82%), Gaps = 13/246 (5%)
Query: 58 PWTGTPGFMFPNATSS-----------LAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP 106
PW G+P + PN SS ++WRDYTL+QAA FVAKNGT+IVCAVS P+LP
Sbjct: 53 PWPGSPLLLLPNKVSSPSNATSLSSQAKSEWRDYTLAQAAKFVAKNGTVIVCAVSSPFLP 112
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FLNNWLIS+SRQKHQ++VLVIAEDY TLYKVN +WPGHAVL+PPA DS+TA+ FGSQGFF
Sbjct: 113 FLNNWLISVSRQKHQEKVLVIAEDYITLYKVNEKWPGHAVLIPPALDSRTAYSFGSQGFF 172
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
NFT+RRP HLL ILELGYNVMYNDVDMVWL+DPF YL+G HD YFTDDM +KPL+HSHD
Sbjct: 173 NFTARRPQHLLQILELGYNVMYNDVDMVWLQDPFQYLEGSHDAYFTDDMPQIKPLNHSHD 232
Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALN 284
LP P + G TYICSCMIYLRPTDGAKL+MKKW EELQ++ WS++ KANDQPAFN ALN
Sbjct: 233 LPDPDQNGETYICSCMIYLRPTDGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALN 292
Query: 285 KTAGQV 290
KTA QV
Sbjct: 293 KTAHQV 298
>gi|297814147|ref|XP_002874957.1| hypothetical protein ARALYDRAFT_352664 [Arabidopsis lyrata subsp.
lyrata]
gi|297320794|gb|EFH51216.1| hypothetical protein ARALYDRAFT_352664 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/243 (74%), Positives = 206/243 (84%), Gaps = 11/243 (4%)
Query: 58 PWTGTPGFMFPNATSSLA---------KWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFL 108
PW G+P F FPN SS + KW DYTL+QAA FVAKNGT+IVCAVS P+LPFL
Sbjct: 53 PWPGSPLFQFPNRLSSSSSSLSPSRQSKWHDYTLAQAAKFVAKNGTVIVCAVSSPFLPFL 112
Query: 109 NNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNF 168
NNWLIS+SRQKHQ++VLVIAEDYATLYKVN +WPGHAVL+PPA DS+TA+ FGSQGFFNF
Sbjct: 113 NNWLISVSRQKHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDSKTAYSFGSQGFFNF 172
Query: 169 TSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLP 228
T+RRP HLL +LELGYNVMYNDVDMVWL+DPF YL+G HDVYFTDD+ +KPL+HSHDLP
Sbjct: 173 TARRPQHLLQVLELGYNVMYNDVDMVWLQDPFQYLEGSHDVYFTDDLPQIKPLNHSHDLP 232
Query: 229 PPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALNKT 286
PG+ G TYICSCMIYLRPT+GAKL+MKKW EELQ++ WS++ KANDQPAFN ALNKT
Sbjct: 233 HPGRNGETYICSCMIYLRPTNGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALNKT 292
Query: 287 AGQ 289
A Q
Sbjct: 293 AHQ 295
>gi|49617789|gb|AAT67590.1| hypothetical protein At4G01770 [Arabidopsis thaliana]
Length = 361
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/241 (75%), Positives = 205/241 (85%), Gaps = 8/241 (3%)
Query: 58 PWTGTPGFMFPN------ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNW 111
PW G+P F+FPN + S +KWRDYTL+QAA FVAKNGT+IVCAVS P+LPFLNNW
Sbjct: 52 PWPGSPLFLFPNRLSSSLSPSPQSKWRDYTLAQAARFVAKNGTVIVCAVSSPFLPFLNNW 111
Query: 112 LISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSR 171
LIS+SRQKHQD+VLVIAEDY TLYKVN +WPGHAVL+PPA DS+TA FGSQGFFNFT+R
Sbjct: 112 LISVSRQKHQDKVLVIAEDYITLYKVNEKWPGHAVLIPPALDSKTAFSFGSQGFFNFTAR 171
Query: 172 RPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPG 231
RP HLL ILELGYNVMYNDVDMVWL+DPF YL+G HD YFTDDM +KPL+HSHDLP P
Sbjct: 172 RPQHLLQILELGYNVMYNDVDMVWLQDPFLYLEGSHDAYFTDDMPQIKPLNHSHDLPHPD 231
Query: 232 KKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALNKTAGQ 289
+ G TYICSCMIYLRPT+GAKL+MKKW EELQ++ WS++ KANDQPAFN ALNKTA Q
Sbjct: 232 RNGETYICSCMIYLRPTNGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALNKTAHQ 291
Query: 290 V 290
V
Sbjct: 292 V 292
>gi|60651751|gb|AAX32892.1| putative xylosyltransferase [Arabidopsis thaliana]
Length = 367
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/272 (69%), Positives = 223/272 (81%), Gaps = 13/272 (4%)
Query: 32 RPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSS-----------LAKWRDY 80
RP+ L+ LLVLL+L ++LGV LPW G+P +FPN SS ++WR+Y
Sbjct: 27 RPIFLLSRNGLLLVLLALFLLLGVFLPWPGSPLLLFPNKVSSPSYASCLSPHAKSEWRNY 86
Query: 81 TLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGR 140
TL+QAA FVA NGT+IVCAVS P+LPFLNNWLIS+SRQKHQ++VLVIAEDY TLYKVN +
Sbjct: 87 TLAQAAKFVATNGTVIVCAVSSPFLPFLNNWLISVSRQKHQEKVLVIAEDYITLYKVNEK 146
Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
WPGHAVL+PPA DS+TA+ FGSQGFFNFT+RRP HLL ILELGYNVMYNDVDMVWL+DPF
Sbjct: 147 WPGHAVLIPPALDSKTAYSFGSQGFFNFTARRPQHLLQILELGYNVMYNDVDMVWLQDPF 206
Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
YL+G HD YFTDDM +KPL+HSHDLP P + G TYICSCMIYLRPT+GAKL+MKKW E
Sbjct: 207 QYLEGSHDAYFTDDMPQIKPLNHSHDLPAPDQNGETYICSCMIYLRPTNGAKLLMKKWSE 266
Query: 261 ELQAEPWSKAK--KANDQPAFNWALNKTAGQV 290
ELQ++ WS++ KANDQPAFN+ALNKTA QV
Sbjct: 267 ELQSQAWSESIRFKANDQPAFNFALNKTAHQV 298
>gi|15234287|ref|NP_192084.1| rhamnogalacturonan xylosyltransferase 2 [Arabidopsis thaliana]
gi|4558544|gb|AAD22637.1|AC007138_1 hypothetical protein [Arabidopsis thaliana]
gi|3859593|gb|AAC72859.1| T15B16.9 gene product [Arabidopsis thaliana]
gi|7268218|emb|CAB77745.1| hypothetical protein [Arabidopsis thaliana]
gi|17473527|gb|AAL38247.1| unknown protein [Arabidopsis thaliana]
gi|27311905|gb|AAO00918.1| unknown protein [Arabidopsis thaliana]
gi|117018082|tpg|DAA05812.1| TPA_exp: 1,3-alpha-D-xylosyltransferase [Arabidopsis thaliana]
gi|332656673|gb|AEE82073.1| rhamnogalacturonan xylosyltransferase 2 [Arabidopsis thaliana]
Length = 367
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/272 (69%), Positives = 222/272 (81%), Gaps = 13/272 (4%)
Query: 32 RPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATS-----------SLAKWRDY 80
RP+ L+ LLVLL+L ++LGV LPW G+P +FPN S + ++WR+Y
Sbjct: 27 RPIFLLSRNGLLLVLLALFLLLGVFLPWPGSPLLLFPNKVSSPSYASSLSPHAKSEWRNY 86
Query: 81 TLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGR 140
TL+QAA FVA NGT+IVCAVS P+LPFLNNWLIS+SRQKHQ++VLVIAEDY TLYKVN +
Sbjct: 87 TLAQAAKFVATNGTVIVCAVSSPFLPFLNNWLISVSRQKHQEKVLVIAEDYITLYKVNEK 146
Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
WPGHAVL+PPA DS+TA+ FGSQGFFNFT+RRP HLL ILELGYNVMYNDVDMVWL+DPF
Sbjct: 147 WPGHAVLIPPALDSKTAYSFGSQGFFNFTARRPQHLLQILELGYNVMYNDVDMVWLQDPF 206
Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
YL+G HD YFTDDM +KPL+HSHDLP P + G TYICSCMIYLRPT+GAKL+MKKW E
Sbjct: 207 QYLEGSHDAYFTDDMPQIKPLNHSHDLPAPDQNGETYICSCMIYLRPTNGAKLLMKKWSE 266
Query: 261 ELQAEPWSKAK--KANDQPAFNWALNKTAGQV 290
ELQ++ WS++ KANDQPAFN ALNKTA QV
Sbjct: 267 ELQSQAWSESIRFKANDQPAFNLALNKTAHQV 298
>gi|15223560|ref|NP_176048.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
thaliana]
gi|9954757|gb|AAG09108.1|AC009323_19 Hypothetical protein [Arabidopsis thaliana]
gi|26449884|dbj|BAC42064.1| unknown protein [Arabidopsis thaliana]
gi|28827600|gb|AAO50644.1| unknown protein [Arabidopsis thaliana]
gi|332195285|gb|AEE33406.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
thaliana]
Length = 383
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/248 (71%), Positives = 204/248 (82%), Gaps = 11/248 (4%)
Query: 54 GVILPWTGTPGFMFPNATSSLA---------KWRDYTLSQAASFVAKNGTIIVCAVSQPY 104
GV LP T + FMFPN TSS WRDY+L+QA FVAKN T+IVCAVS P+
Sbjct: 35 GVFLPLTKSSLFMFPNTTSSSLSPSSSLSVSDWRDYSLAQAVKFVAKNETVIVCAVSYPF 94
Query: 105 LPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQG 164
LPFLNNWLISISRQKHQ++VLVIAEDYATLYKVN +WPGHAVL+PPA D Q+AHKFGSQG
Sbjct: 95 LPFLNNWLISISRQKHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDPQSAHKFGSQG 154
Query: 165 FFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHS 224
FFN TSRRP HLL+ILELGYNVMYNDVDMVWL+DPF YLQG +D YF DDM A+KPL+HS
Sbjct: 155 FFNLTSRRPQHLLNILELGYNVMYNDVDMVWLQDPFDYLQGSYDAYFMDDMIAIKPLNHS 214
Query: 225 HDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW--SKAKKANDQPAFNWA 282
HDLPP + G TY+CSCMI+LR TDG KL+MK W+EE+QA+PW ++AKK +DQPAFN A
Sbjct: 215 HDLPPLSRSGVTYVCSCMIFLRSTDGGKLLMKTWVEEIQAQPWNNTQAKKPHDQPAFNRA 274
Query: 283 LNKTAGQV 290
L+KTA QV
Sbjct: 275 LHKTANQV 282
>gi|297853550|ref|XP_002894656.1| hypothetical protein ARALYDRAFT_474813 [Arabidopsis lyrata subsp.
lyrata]
gi|297340498|gb|EFH70915.1| hypothetical protein ARALYDRAFT_474813 [Arabidopsis lyrata subsp.
lyrata]
Length = 380
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 176/252 (69%), Positives = 203/252 (80%), Gaps = 11/252 (4%)
Query: 50 LVVLGVILPWTGTPGFMFPNATSSLAKWR---------DYTLSQAASFVAKNGTIIVCAV 100
V+LGV LP TG+ F+ P+ TSS DY+L+QA FVAKN T+IVCAV
Sbjct: 31 FVILGVFLPLTGSSLFLSPDTTSSSLSPSSSLSVSDWSDYSLAQAVKFVAKNETVIVCAV 90
Query: 101 SQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKF 160
S P+LPFLNNWLISISRQ HQ++VLVIAEDYATLYKVN +WPGHAVL+PPA D Q AHKF
Sbjct: 91 SYPFLPFLNNWLISISRQNHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDPQAAHKF 150
Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
GSQGFFN TSRRP HLL+ILELGYNVMYNDVDMVWL+DPF YLQG HD YF DDM A+KP
Sbjct: 151 GSQGFFNLTSRRPQHLLNILELGYNVMYNDVDMVWLQDPFQYLQGSHDAYFMDDMIAIKP 210
Query: 221 LDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW--SKAKKANDQPA 278
L+HSH LPP + G TY+CSCMI+LR TDGAKL+MK W+EE+QA+PW ++AKK +DQPA
Sbjct: 211 LNHSHGLPPRSRSGVTYVCSCMIFLRSTDGAKLLMKTWVEEIQAQPWNNTQAKKPHDQPA 270
Query: 279 FNWALNKTAGQV 290
FN AL+KTA QV
Sbjct: 271 FNRALHKTANQV 282
>gi|242087783|ref|XP_002439724.1| hypothetical protein SORBIDRAFT_09g019070 [Sorghum bicolor]
gi|241945009|gb|EES18154.1| hypothetical protein SORBIDRAFT_09g019070 [Sorghum bicolor]
Length = 352
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 171/225 (76%), Positives = 192/225 (85%), Gaps = 2/225 (0%)
Query: 68 PNATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL 125
P A + WR YTL +AA+F A NGT+++ AVS PYLPFL+NWLIS+ R DQVL
Sbjct: 60 PAARPPSSPWRSYTLQEAAAFAAAAGNGTVLLAAVSGPYLPFLSNWLISVRRAGRADQVL 119
Query: 126 VIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN 185
VIAEDY TL ++N WPGHAVLVPPAPD+QTAHKFGSQGFFNFTSRRP HLL ILELGY+
Sbjct: 120 VIAEDYETLDRINAAWPGHAVLVPPAPDAQTAHKFGSQGFFNFTSRRPRHLLQILELGYS 179
Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL 245
VMYNDVDMVWL DPFPYL DHDVYF DDM VKPLDHSH+LPPPGKKGRTYICSCMI+L
Sbjct: 180 VMYNDVDMVWLADPFPYLVEDHDVYFMDDMTPVKPLDHSHELPPPGKKGRTYICSCMIFL 239
Query: 246 RPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
RPT+GAKL+++KWIEEL+ +PWSK +KANDQPAFNWALNKTAGQV
Sbjct: 240 RPTEGAKLLLRKWIEELKEQPWSKQRKANDQPAFNWALNKTAGQV 284
>gi|115463643|ref|NP_001055421.1| Os05g0386900 [Oryza sativa Japonica Group]
gi|54287590|gb|AAV31334.1| unknown protein [Oryza sativa Japonica Group]
gi|113578972|dbj|BAF17335.1| Os05g0386900 [Oryza sativa Japonica Group]
gi|215701069|dbj|BAG92493.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215707102|dbj|BAG93562.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631445|gb|EEE63577.1| hypothetical protein OsJ_18394 [Oryza sativa Japonica Group]
Length = 352
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 171/236 (72%), Positives = 199/236 (84%), Gaps = 7/236 (2%)
Query: 57 LPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLIS 114
PW G P ++++ + WR YTL +AA+F A+ NGTI++ AVS PYLPFL+NWLI+
Sbjct: 54 FPW-GPPS----HSSAPASPWRGYTLQEAAAFAARAGNGTIVLAAVSGPYLPFLSNWLIT 108
Query: 115 ISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPC 174
+ R DQVLV+AEDY TL ++N WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP
Sbjct: 109 VRRAGRADQVLVVAEDYDTLERINAAWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPR 168
Query: 175 HLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG 234
HLL ILELGY+VMYNDVDMVWL DPFPY+ GDHDVYF DDM VKPLDHSH+LPPPGKKG
Sbjct: 169 HLLQILELGYSVMYNDVDMVWLADPFPYIVGDHDVYFMDDMTPVKPLDHSHELPPPGKKG 228
Query: 235 RTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
RTYICSCMI+LRPT+GAKL+++KWIEEL+ +PWSK +KANDQPAFNWALNKTAGQV
Sbjct: 229 RTYICSCMIFLRPTEGAKLLLRKWIEELKEQPWSKKQKANDQPAFNWALNKTAGQV 284
>gi|125552172|gb|EAY97881.1| hypothetical protein OsI_19800 [Oryza sativa Indica Group]
Length = 352
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 167/224 (74%), Positives = 195/224 (87%), Gaps = 2/224 (0%)
Query: 69 NATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV 126
++++ + WR YTL +AA+F A+ NGTI++ AVS PYLPFL+NWLI++ R DQVLV
Sbjct: 61 HSSAPASPWRGYTLQEAAAFAARAGNGTIVLAAVSGPYLPFLSNWLITVRRAGRADQVLV 120
Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
+AEDY TL ++N WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+V
Sbjct: 121 VAEDYDTLERINAAWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLQILELGYSV 180
Query: 187 MYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLR 246
MYNDVDMVWL DPFPY+ GDHDVYF DDM VKPLDHSH+LPPPGKKGRTYICSCMI+LR
Sbjct: 181 MYNDVDMVWLADPFPYIVGDHDVYFMDDMTPVKPLDHSHELPPPGKKGRTYICSCMIFLR 240
Query: 247 PTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
PT+GAKL+++KWIEEL+ +PWSK +KANDQPAFNWALNKTAGQV
Sbjct: 241 PTEGAKLLLRKWIEELKEQPWSKKQKANDQPAFNWALNKTAGQV 284
>gi|413945194|gb|AFW77843.1| hypothetical protein ZEAMMB73_274644 [Zea mays]
Length = 396
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 156/199 (78%), Positives = 178/199 (89%)
Query: 92 NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
NGT+++ AVS PYLPFL+NWLIS+ R DQVLVIAEDY TL ++N WPGHAVLVPPA
Sbjct: 130 NGTVLLAAVSGPYLPFLSNWLISVRRAGRADQVLVIAEDYETLDRINAAWPGHAVLVPPA 189
Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
PD+QTAHKFGSQGFFNFTSRRP HLL ILELGY+VMYNDVDMVWL DPF Y+ +HDVYF
Sbjct: 190 PDAQTAHKFGSQGFFNFTSRRPRHLLQILELGYSVMYNDVDMVWLADPFAYIVENHDVYF 249
Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
DDMA VKPLDHSH+LPPPGKKGRTYICSCM++LRPT+GAKL+++KWIEEL+ +PWSK +
Sbjct: 250 MDDMAPVKPLDHSHELPPPGKKGRTYICSCMLFLRPTEGAKLLLRKWIEELKEQPWSKQR 309
Query: 272 KANDQPAFNWALNKTAGQV 290
K+NDQPAFNWALNKTAGQV
Sbjct: 310 KSNDQPAFNWALNKTAGQV 328
>gi|326504918|dbj|BAK06750.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 280
Score = 344 bits (882), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 181/288 (62%), Positives = 214/288 (74%), Gaps = 15/288 (5%)
Query: 5 LHQRPLHNPLPN---PYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTG 61
LHQRP PL P P P N+ + L +L L LL+LL++L ++ PW
Sbjct: 3 LHQRPHQKPLGADSLPVPTGPTNAAPPR----PLPLLTLPYLFSLLALLLLLALLFPWGP 58
Query: 62 TPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
P ++ + WR YTL AA+ + NGTII+ AVS PYLPFL+NWLIS+ R
Sbjct: 59 ------PRHSAPASPWRSYTLQDAAA--SGNGTIILAAVSGPYLPFLSNWLISVRRAGRA 110
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
+QVLVIA+DY TL ++N WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL ILE
Sbjct: 111 NQVLVIAKDYETLERINAAWPGHAVLVPPAPDAQAAHKFGSQGFFNFTSRRPRHLLQILE 170
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
LGY+VMYNDVDMVWL DPFPYL G+HDVYF DDM VKPL+HSH LPPPGKKGR YICSC
Sbjct: 171 LGYSVMYNDVDMVWLADPFPYLVGNHDVYFMDDMTQVKPLNHSHALPPPGKKGRPYICSC 230
Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQ 289
MI+L+PT+G+KL+M+KWIEEL+ +PWSK K+NDQPAFNWAL KT GQ
Sbjct: 231 MIFLQPTEGSKLLMRKWIEELKEQPWSKKVKSNDQPAFNWALLKTTGQ 278
>gi|357133882|ref|XP_003568551.1| PREDICTED: uncharacterized protein LOC100839509 [Brachypodium
distachyon]
Length = 351
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 164/225 (72%), Positives = 187/225 (83%), Gaps = 2/225 (0%)
Query: 68 PNATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL 125
P +S + WR YTL AA+F A NGTI++ AVS PYLPFL+NWLIS+ R DQVL
Sbjct: 59 PRHSSPASPWRAYTLQDAAAFAAAAGNGTIVLAAVSGPYLPFLSNWLISVRRAGRADQVL 118
Query: 126 VIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN 185
VIAEDY TL ++N WPGHAVL+PPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+
Sbjct: 119 VIAEDYETLERINAAWPGHAVLIPPAPDAQAAHKFGSQGFFNFTSRRPRHLLQILELGYS 178
Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL 245
VMYNDVDMVWL DPFPYL G+HDVYF DDM VK L+HSH+LPPPGKKGR YICSCMI+L
Sbjct: 179 VMYNDVDMVWLADPFPYLVGNHDVYFMDDMTDVKALNHSHELPPPGKKGRPYICSCMIFL 238
Query: 246 RPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
+PT+GAKL+M+KWIEEL+ +PWSK K+NDQPAFNWAL KT GQV
Sbjct: 239 QPTEGAKLLMRKWIEELKEQPWSKKVKSNDQPAFNWALLKTVGQV 283
>gi|326533736|dbj|BAK05399.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 163/225 (72%), Positives = 187/225 (83%), Gaps = 2/225 (0%)
Query: 68 PNATSSLAKWRDYTLSQAASFV--AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL 125
P ++ + WR YTL AA+F + NGTII+ AVS PYLPFL+NWLIS+ R +QVL
Sbjct: 59 PRHSAPSSPWRSYTLQDAAAFAAASGNGTIILAAVSGPYLPFLSNWLISVRRAGRANQVL 118
Query: 126 VIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN 185
VIAEDY TL ++N WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+
Sbjct: 119 VIAEDYETLERINAAWPGHAVLVPPAPDAQAAHKFGSQGFFNFTSRRPRHLLQILELGYS 178
Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL 245
VMYNDVDMVWL DPFPYL G+HDVYF DDM VKPL+HSH LPPPGKKGR YICSCMI+L
Sbjct: 179 VMYNDVDMVWLADPFPYLVGNHDVYFMDDMTEVKPLNHSHALPPPGKKGRPYICSCMIFL 238
Query: 246 RPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
+PT+G+KL+M+KWIEEL+ +PWSK K+NDQPAFNWAL KT GQV
Sbjct: 239 QPTEGSKLLMRKWIEELKEQPWSKKVKSNDQPAFNWALLKTTGQV 283
>gi|226501196|ref|NP_001143398.1| uncharacterized protein LOC100276040 [Zea mays]
gi|195619764|gb|ACG31712.1| hypothetical protein [Zea mays]
Length = 353
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 155/199 (77%), Positives = 176/199 (88%)
Query: 92 NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
NGT+++ AVS PYLPFL+NWLIS+ R DQVLVIAEDY TL ++N WPGHAVLVPPA
Sbjct: 87 NGTVLLAAVSGPYLPFLSNWLISVRRAGRADQVLVIAEDYETLDRINAAWPGHAVLVPPA 146
Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
PD+QTAHKFGSQGFFNFTSRRP HLL ILELGY+VMYNDVDMVWL DPF Y+ +HDVYF
Sbjct: 147 PDAQTAHKFGSQGFFNFTSRRPRHLLQILELGYSVMYNDVDMVWLADPFAYIVENHDVYF 206
Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
DDM VKPLDHSH+LPPPGKKGRTYICSCM++LRPT+GAKL+++KWIEEL+ +PWSK
Sbjct: 207 MDDMTPVKPLDHSHELPPPGKKGRTYICSCMLFLRPTEGAKLLLRKWIEELKEQPWSKQM 266
Query: 272 KANDQPAFNWALNKTAGQV 290
K+NDQPAFNWALNKTAGQV
Sbjct: 267 KSNDQPAFNWALNKTAGQV 285
>gi|302799954|ref|XP_002981735.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
moellendorffii]
gi|300150567|gb|EFJ17217.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
moellendorffii]
Length = 350
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 169/253 (66%), Positives = 197/253 (77%), Gaps = 1/253 (0%)
Query: 38 LNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIV 97
++R L+ L+ V+L IL G+ +TSS ++W DYTL AA+ VA++G I+V
Sbjct: 17 ISRRRLVTLVLGAVLLAYILVSLQDRGYR-AGSTSSSSRWADYTLEAAAAAVARDGRIVV 75
Query: 98 CAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTA 157
CAVS PYL FL NWLISI+ H D+VLVIAEDY L VN WPGHAVLVPPA TA
Sbjct: 76 CAVSFPYLAFLVNWLISIASHGHHDKVLVIAEDYEMLNYVNEFWPGHAVLVPPALPLATA 135
Query: 158 HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAA 217
+FGSQGFFNFTSRRP HLL +LELGY+V+YNDVDMVW+ DPFP GDHD+YFTDDM A
Sbjct: 136 QRFGSQGFFNFTSRRPQHLLKLLELGYSVLYNDVDMVWMSDPFPLFTGDHDIYFTDDMTA 195
Query: 218 VKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQP 277
+KPLDHSH LPPPGKKGRTYICSCMI+LRPT GAKLV++KWIEELQ +PWS KANDQP
Sbjct: 196 IKPLDHSHSLPPPGKKGRTYICSCMIFLRPTPGAKLVLQKWIEELQQQPWSPKAKANDQP 255
Query: 278 AFNWALNKTAGQV 290
AFNWALNKT+ +V
Sbjct: 256 AFNWALNKTSNKV 268
>gi|302768381|ref|XP_002967610.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
moellendorffii]
gi|300164348|gb|EFJ30957.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
moellendorffii]
Length = 350
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 169/253 (66%), Positives = 197/253 (77%), Gaps = 1/253 (0%)
Query: 38 LNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIV 97
++R L+ L+ V+L IL G+ +TSS ++W DYTL AA+ VA++G I+V
Sbjct: 17 ISRRRLVTLVLGAVLLAYILVSLQDRGYR-AGSTSSSSRWADYTLEAAAAAVARDGRIVV 75
Query: 98 CAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTA 157
CAVS PYL FL NWLISI+ H D+VLVIAEDY L VN WPGHAVLVPPA TA
Sbjct: 76 CAVSFPYLAFLVNWLISIASHGHHDKVLVIAEDYEMLNYVNEFWPGHAVLVPPALPLATA 135
Query: 158 HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAA 217
+FGSQGFFNFTSRRP HLL +LELGY+V+YNDVDMVW+ DPFP GDHD+YFTDDM A
Sbjct: 136 QRFGSQGFFNFTSRRPQHLLKLLELGYSVLYNDVDMVWMSDPFPLFTGDHDIYFTDDMTA 195
Query: 218 VKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQP 277
+KPLDHSH LPPPGKKGRTYICSCMI+LRPT GAKLV++KWIEELQ +PWS KANDQP
Sbjct: 196 IKPLDHSHSLPPPGKKGRTYICSCMIFLRPTPGAKLVLQKWIEELQQQPWSPKAKANDQP 255
Query: 278 AFNWALNKTAGQV 290
AFNWALNKT+ +V
Sbjct: 256 AFNWALNKTSNKV 268
>gi|168006283|ref|XP_001755839.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693158|gb|EDQ79512.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 297
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 144/234 (61%), Positives = 173/234 (73%), Gaps = 7/234 (2%)
Query: 57 LPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISIS 116
+ WT PN WR YTL QAA +A +G +IVC VS PY+PFLNNWLIS++
Sbjct: 1 MSWTANLHLGQPN-------WRSYTLRQAAQALAVDGILIVCTVSHPYMPFLNNWLISLA 53
Query: 117 RQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHL 176
+ VL+IAEDY TL VN RWPGH+VL+PPA T+ +FGSQGFFN T+RRP +L
Sbjct: 54 KYNRHQAVLIIAEDYTTLDFVNSRWPGHSVLIPPASSETTSLRFGSQGFFNLTARRPKYL 113
Query: 177 LHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRT 236
L ILELGY+V+YNDVDMVWL DPF Y + + +VY DDMA +K HSH LPPPGKKGRT
Sbjct: 114 LEILELGYSVLYNDVDMVWLADPFSYFKNNREVYIIDDMALLKTEYHSHALPPPGKKGRT 173
Query: 237 YICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
YICSCM++L+ T+GAKL+M+ WIEEL+ WS + K NDQPAFNWALNKTAGQV
Sbjct: 174 YICSCMLFLKSTEGAKLLMRTWIEELKERQWSPSVKTNDQPAFNWALNKTAGQV 227
>gi|388505822|gb|AFK40977.1| unknown [Medicago truncatula]
Length = 240
Score = 296 bits (758), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 154/236 (65%), Positives = 180/236 (76%), Gaps = 5/236 (2%)
Query: 1 MSQWLHQRPLHNPL--PNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILP 58
MS +LHQR LHNP NP+P S +S + L + + LL LLSL++VLGV P
Sbjct: 1 MSGFLHQRSLHNPFQFSNPFPSSQPSSTN---PKKPLSIFSPAILLSLLSLIIVLGVFSP 57
Query: 59 WTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQ 118
W G P + + ++KW YTL +A +FVAKNGT+IVC VSQPYLPFLNNWLISI+
Sbjct: 58 WVGMPQKLIFTSNPEVSKWGRYTLDEALTFVAKNGTVIVCIVSQPYLPFLNNWLISIAMH 117
Query: 119 KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLH 178
K D VLVIAEDY +LYKVN WPGHAVL+PP D + +HKFGSQGFFNFT+RRP HLL
Sbjct: 118 KRHDMVLVIAEDYPSLYKVNQLWPGHAVLIPPVLDLEASHKFGSQGFFNFTARRPSHLLK 177
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG 234
ILELGY+VMYNDVDMVWL DPFPYLQG+HDVYFTDDM A+KPL+HSHDLPPPGK+G
Sbjct: 178 ILELGYSVMYNDVDMVWLGDPFPYLQGNHDVYFTDDMTAIKPLNHSHDLPPPGKRG 233
>gi|297853548|ref|XP_002894655.1| hypothetical protein ARALYDRAFT_892838 [Arabidopsis lyrata subsp.
lyrata]
gi|297340497|gb|EFH70914.1| hypothetical protein ARALYDRAFT_892838 [Arabidopsis lyrata subsp.
lyrata]
Length = 189
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 103/146 (70%), Positives = 115/146 (78%), Gaps = 10/146 (6%)
Query: 111 WLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS 170
WLISISRQ H+++VLVIAEDYATLYKVN RWPGHAVL+P A D Q AHKFGSQ +
Sbjct: 5 WLISISRQNHEEKVLVIAEDYATLYKVNERWPGHAVLIPSALDPQAAHKFGSQATTS--- 61
Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP 230
L LELGYNVMYNDVDMVWL+DPF YLQG HD YF DDM A+KPL+HSH LPP
Sbjct: 62 ------LEHLELGYNVMYNDVDMVWLQDPFQYLQGSHDAYFMDDMIAIKPLNHSHGLPPR 115
Query: 231 GKKGRTYICSCMIYLRPTDGAKLVMK 256
+ G TY+CSCMI+L PTDGAKL+MK
Sbjct: 116 SRGGVTYVCSCMIFL-PTDGAKLLMK 140
>gi|224064098|ref|XP_002301385.1| predicted protein [Populus trichocarpa]
gi|222843111|gb|EEE80658.1| predicted protein [Populus trichocarpa]
Length = 172
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 93/104 (89%), Positives = 101/104 (97%)
Query: 187 MYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLR 246
MYNDVDMVWL+DPFPYL+G+HDVYFTDDMAAVKPL HSHDLPPPGKKGRTYICSCMI++
Sbjct: 1 MYNDVDMVWLQDPFPYLEGNHDVYFTDDMAAVKPLGHSHDLPPPGKKGRTYICSCMIFMH 60
Query: 247 PTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
PTDGAKLV+KKWIEELQA+PWSK KK+NDQPAFNWALNKTAGQV
Sbjct: 61 PTDGAKLVLKKWIEELQAQPWSKTKKSNDQPAFNWALNKTAGQV 104
>gi|28466905|gb|AAO44061.1| At1g56550 [Arabidopsis thaliana]
Length = 169
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/128 (67%), Positives = 97/128 (75%), Gaps = 9/128 (7%)
Query: 54 GVILPWTGTPGFMFPNATSSLA---------KWRDYTLSQAASFVAKNGTIIVCAVSQPY 104
GV LP T + FMFPN TSS WRDY+L+QA FVAKN T+IVCAVS P+
Sbjct: 35 GVFLPLTKSSLFMFPNTTSSSLSPSSSLSVSDWRDYSLAQAVKFVAKNETVIVCAVSYPF 94
Query: 105 LPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQG 164
LPFLNNWLISISRQKHQ++VLVIAEDYATLYKVN +WPGHAVL+PPA D Q+AHKFGSQ
Sbjct: 95 LPFLNNWLISISRQKHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDPQSAHKFGSQV 154
Query: 165 FFNFTSRR 172
F+ R
Sbjct: 155 QREFSPFR 162
>gi|390350619|ref|XP_003727460.1| PREDICTED: UDP-galactose:fucoside
alpha-3-galactosyltransferase-like [Strongylocentrotus
purpuratus]
Length = 364
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 91/176 (51%), Gaps = 19/176 (10%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA--- 151
II+ ++ +L F NW+ S+ R +D V +IAED +T + R + L+ +
Sbjct: 127 IILATTNKAFLDFTENWIESLKRCNVRDHVTIIAEDPSTYEILAKRNDINLELLLTSKTN 186
Query: 152 -PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
PDS A FGSQ + ++RP ++L L+ G +V+++DVD VWLK+P P+ + +D+Y
Sbjct: 187 LPDSDLA--FGSQDYLRLVNKRPNYILRYLQRGTDVLFSDVDTVWLKNPLPFFEDGYDLY 244
Query: 211 FTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
F D+ D P +C+ +Y R T ++ KWI+ + A P
Sbjct: 245 FGRDIY--------DDQTKPD-----LVCAGFVYYRATKATIDLIVKWIQRIHARP 287
>gi|390348517|ref|XP_003727022.1| PREDICTED: UDP-galactose:fucoside
alpha-3-galactosyltransferase-like [Strongylocentrotus
purpuratus]
Length = 293
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 97/204 (47%), Gaps = 24/204 (11%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG-HAVL 147
++ +G +++ ++ ++ F NWL S+ R + + + IAED Y+ R+P V+
Sbjct: 57 ISNDGPLVLITSNKAFMDFAENWLESVHRLESRLNIFAIAEDMVA-YRTFLRYPDVTTVM 115
Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
A Q + S + ++RP ++ +L G +V+++DVD VWLKDP P+L GD+
Sbjct: 116 TQRAVSPQKRLAYLSHDYNVLINKRPVYIYRLLAKGRDVLFSDVDTVWLKDPLPHLDGDY 175
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
DV D+ K + C+ I+ R T+ ++ + +WI+ +
Sbjct: 176 DVVLQVDLRVPKVV----------------YCAGFIFFRATNASRAFVWEWIDRIH---- 215
Query: 268 SKAK-KANDQPAFNWALNKTAGQV 290
KA+ DQ N L + G +
Sbjct: 216 -KARDNIPDQKILNELLEENFGDL 238
>gi|326429312|gb|EGD74882.1| hypothetical protein PTSG_07110 [Salpingoeca sp. ATCC 50818]
Length = 1319
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 89/190 (46%), Gaps = 34/190 (17%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVL-- 147
A++GTII+ S ++ F NW S +R+ +V+AED + ++ PG AVL
Sbjct: 1043 ARDGTIILLTTSSGFMDFFLNWRES-ARRVGITNYMVLAEDLSCYEQLEAIDPGKAVLSS 1101
Query: 148 --VPPAPDS-------QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
+ A D+ +T + S+ + SRRP ++ +L +GYNV+Y D D VWL++
Sbjct: 1102 VRIVKASDTDAQIGKDKTGFSYASKQYNEIVSRRPTYIGRLLRMGYNVLYTDTDTVWLEN 1161
Query: 199 PFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTY-----ICSCMIYLRPTDGAKL 253
PF + D+D+Y D K+ T+ +C+ +++R G
Sbjct: 1162 PFQHFPPDYDMYIQSD-----------------KEDETFDPWHMLCTGFMFMRAGTGMIQ 1204
Query: 254 VMKKWIEELQ 263
M W LQ
Sbjct: 1205 FMDDWRTALQ 1214
>gi|167521059|ref|XP_001744868.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776482|gb|EDQ90101.1| predicted protein [Monosiga brevicollis MX1]
Length = 1062
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 92/195 (47%), Gaps = 38/195 (19%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
V++N + V V+ ++ F N L+S++R + ++IAEDY + ++N R+P H V++
Sbjct: 800 VSENNAVTVMTVTSGFVDFATNLLMSMTRVG-VNNFIIIAEDYTSYQRLNARYP-HRVVL 857
Query: 149 P------------PAPD--SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMV 194
P D +T + S+ + RRP +LL IL +G++V+Y D D V
Sbjct: 858 PNLRTMMQGMSGRSGADVADRTGFSYASKQYNEIVGRRPRYLLGILRMGFDVLYTDTDTV 917
Query: 195 WLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTY-----ICSCMIYLRPTD 249
WL++P+ Q +D+ + D K+ T+ +C+ ++LR
Sbjct: 918 WLENPYHQFQAGYDMQISSD-----------------KEDETFDPWHMLCTGFMFLRSKR 960
Query: 250 GAKLVMKKWIEELQA 264
+ +W L+A
Sbjct: 961 PVMAFLDEWRRALEA 975
>gi|384244551|gb|EIE18052.1| hypothetical protein COCSUDRAFT_49426 [Coccomyxa subellipsoidea
C-169]
Length = 557
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 95/202 (47%), Gaps = 29/202 (14%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQV-----LVIAEDYATLYKVNGRWPG 143
VA + T+I+ S Y+ F NW+ KH + + L + +D + +N R+PG
Sbjct: 286 VAVDKTVILTQTSCGYIDFAENWI------KHAEGLGIKNYLTVVDDEISFEYLNLRYPG 339
Query: 144 HAVLV----PPAPD-SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
H V V P A + Q FGS+ F + R + +LE G+ +++ D+D V +D
Sbjct: 340 HIVSVDVFSPGATNFDQPLLDFGSKLFNKMSCDRLTYQRKVLERGFTMLWIDMDTVLYQD 399
Query: 199 PFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
P + G D TDD+ + L+ S K G IC C+++ RPT AK +++W
Sbjct: 400 PVAVMPGGLDFIATDDILDEEGLNQSEQ-----KTGN--ICGCLMFFRPTRNAKDFLRQW 452
Query: 259 IEELQAEPWSKAKKANDQPAFN 280
++ + + DQ A N
Sbjct: 453 YDKCTTQ------ELPDQTALN 468
>gi|390348519|ref|XP_003727023.1| PREDICTED: uncharacterized protein LOC100888971 [Strongylocentrotus
purpuratus]
Length = 977
Score = 75.9 bits (185), Expect = 2e-11, Method: Composition-based stats.
Identities = 49/198 (24%), Positives = 90/198 (45%), Gaps = 25/198 (12%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
+I+ + ++ +L F +NWL S+ R + V ++AED +N R +V
Sbjct: 732 VILVSTNKAFLNFTDNWLESVKRSGIRSGVTLVAEDREAFNYLNNRTDIELNVVLNDVSE 791
Query: 155 QTAHK--FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD-HDVYF 211
+ F S + ++RP ++L +L G++V+++DVD+VWLK+P PY D +D++
Sbjct: 792 SPGERLLFDSPAYKQLVNKRPSYILQLLSSGHDVLFSDVDIVWLKNPLPYFTNDTNDIWL 851
Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
+D+ H+ T C+ + R + ++ +W++ L P
Sbjct: 852 QEDL---------HE--------PTVYCAGFTFYRSSPATIALVTEWVQTLALHP----- 889
Query: 272 KANDQPAFNWALNKTAGQ 289
DQ N L K Q
Sbjct: 890 TYPDQRVLNGLLKKKRWQ 907
>gi|384249697|gb|EIE23178.1| hypothetical protein COCSUDRAFT_63537 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 82/182 (45%), Gaps = 18/182 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L + + VA + +I+ S YL F NW+ + + L IAED L + R+
Sbjct: 33 LQERVAAVAVDNKVILTQTSCGYLEFAVNWITHVEALGLTNW-LTIAEDETALKFLEERY 91
Query: 142 PGHAV----LVPPAPDSQTA-HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
PGHA+ A S A +++GS F RP +L +L+LGY V+++D+D VWL
Sbjct: 92 PGHALPASAFTNEALSSGNALYEWGSAAFTKVACARPSYLQMVLDLGYEVLWSDMDAVWL 151
Query: 197 KDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMK 256
K+ F D DD ++ C+ ++Y RPT A+ ++
Sbjct: 152 KNFFALAPQGLDYVGVDDSETEN------------EQETENACTGLMYFRPTARAQQLLA 199
Query: 257 KW 258
W
Sbjct: 200 DW 201
>gi|384253349|gb|EIE26824.1| hypothetical protein COCSUDRAFT_59332 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 80/176 (45%), Gaps = 18/176 (10%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
+A + T+I+ S PYL F NW++ + R + ++++ +D A Y ++ ++PGH V
Sbjct: 59 IAVDKTVILTQASCPYLDFAENWILHVERLGITNYLILMDDDVAFKY-LDSKFPGHTVH- 116
Query: 149 PPAPDSQTAH------KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
P + H ++ S F N R +L+LG++ ++ D+D VWL+D
Sbjct: 117 PSVISKGSKHMPKPLMRYDSLDFNNMMCDRLGIQRRVLDLGFSFLWTDMDTVWLQDASKI 176
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
+ D T D HD K IC CM + PT A+ ++ W
Sbjct: 177 IPRGFDFVGTGDR-----FRFGHDEEEENK-----ICGCMTFWTPTVPARQALRDW 222
>gi|390342689|ref|XP_003725713.1| PREDICTED: uncharacterized protein LOC100892399 [Strongylocentrotus
purpuratus]
Length = 317
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 84/200 (42%), Gaps = 30/200 (15%)
Query: 96 IVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA---- 151
I+ + + NWL S+ R V +IAEDY + R ++V
Sbjct: 62 ILTTTNTAFSDLTENWLESLRRLDRPYNVTLIAEDYGAFQYLKDR---KDIMVSSRLQIN 118
Query: 152 ---PDSQTAHK----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
D +H F ++ + +RP ++L IL G V++ D D VWL+DP P +
Sbjct: 119 LRFTDMNVSHSAALGFYTKEYIQLVGKRPHYILDILRSGVGVLFADADAVWLRDPIPLIA 178
Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
DVY D+ L +P C C +YL+PT A ++K W ++
Sbjct: 179 ---DVYSRYDVWVA--LGEGGQIP----------CPCFMYLKPTPRAVEMVKAWQSRIE- 222
Query: 265 EPWSKAKKANDQPAFNWALN 284
+ + NDQ A +W ++
Sbjct: 223 DQLNTTSPENDQGALDWVMH 242
>gi|440804176|gb|ELR25053.1| fucosylgalactoside 3-alpha-galactosyltransferase [Acanthamoeba
castellanii str. Neff]
Length = 806
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 94/196 (47%), Gaps = 26/196 (13%)
Query: 76 KWRDY--TLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV-----IA 128
K+RDY L A VA+NGT+I+ + YL FL NW + +Q + V++ +A
Sbjct: 401 KYRDYPFLLHNALPRVAQNGTVIMVIANSGYLEFLLNWKSYVDKQGITNYVIIPSDVQMA 460
Query: 129 EDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS---RRPCHLLHILELGYN 185
+ + L P + S T +K G++ + N+ ++ + I+E+GY+
Sbjct: 461 QQLSYLGVDWAYDPEIGMDALSQSVSYTMNK-GTRAWANWNKVVHKKSHYFKKIVEMGYS 519
Query: 186 VMYNDVDMVWLKDPFPYL-QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
VM +D+DMVWLK+PF + D D++FT+D A H D + IC + Y
Sbjct: 520 VMVSDIDMVWLKNPFSRMNDSDVDIFFTNDGGA-----HGRD---------SSICGGLFY 565
Query: 245 LRPTDGAKLVMKKWIE 260
+ + + + IE
Sbjct: 566 GKNREKSLAFINHVIE 581
>gi|405974682|gb|EKC39308.1| hypothetical protein CGI_10024476 [Crassostrea gigas]
Length = 370
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 99/234 (42%), Gaps = 22/234 (9%)
Query: 35 LLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSS--LAKWRDYT--LSQAASFVA 90
L+ L T++++LSL IL + F+F A S L R+ T + AA ++
Sbjct: 41 LITLGVLTVILMLSLFTYNTKILKHSQREDFVFQPAKFSDLLRDGRNDTNHVLTAAMRIS 100
Query: 91 KNG--TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
K G ++V V+ +LPF +WL + +QVL I D + K+N +WP +
Sbjct: 101 KGGQKALLVTLVNDAFLPFTFSWLCNTQGMGIHNQVLFITGDNESAMKINQKWPEVTAIQ 160
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
S ++ G+ RR LL ILE + +VD +W+ +P +Q +
Sbjct: 161 IDGVHSGN-QEYSHVGYVELMVRRSEILLEILEKNIPIFLFEVDCIWITNPLNNIQSYSN 219
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
V + + + I +YL PTD +K + + ++L
Sbjct: 220 VDVVVNAVSSRS---------------NVIAGGFLYLHPTDASKKMWRALTKQL 258
>gi|297827043|ref|XP_002881404.1| hypothetical protein ARALYDRAFT_902666 [Arabidopsis lyrata subsp.
lyrata]
gi|297327243|gb|EFH57663.1| hypothetical protein ARALYDRAFT_902666 [Arabidopsis lyrata subsp.
lyrata]
Length = 640
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 87/206 (42%), Gaps = 26/206 (12%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
K+ IIV + ++ F+ W+ ++ + +LV A D L + W G V
Sbjct: 128 VKDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAMDTKLLEAL--YWKGVPVFDM 184
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ S +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 185 GSHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDMVWLKNPMPYLARFPDA 244
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC-----MIYLRPTDGAKLVMKKWIEEL 262
DV + D +D S D IC C + + RPT+ AK + K+W E L
Sbjct: 245 DVLTSSDQVVPTVIDDSLD-----------ICWCCLQHRIFHWRPTESAKKLAKEWKEIL 293
Query: 263 QAEPWSKAKKANDQPAFNWALNKTAG 288
A+ K DQ FN + + G
Sbjct: 294 LAD-----DKVWDQNGFNEIVRRQLG 314
>gi|299117548|emb|CBN75392.1| Xylosyltransferase, family GT77 [Ectocarpus siliculosus]
Length = 548
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 83/194 (42%), Gaps = 24/194 (12%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
+ G +IV +V+ YL +N+L S+ + +VL +A D ++ PG V P
Sbjct: 276 QQGVVIVSSVNCGYLDMASNFLQSVGQAAGDVKVLFVARDEVAFDFLDALSPGCTVFFPE 335
Query: 151 APDSQTAH-----KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
S+ AH ++G F T RP LL IL GY +Y DVD+ WL + L
Sbjct: 336 V-GSERAHAIQAGQWGDNIFKQQTVARPDILLPILRQGYKALYTDVDIFWLGNVLSLLP- 393
Query: 206 DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAE 265
+ V T + + D C+C +YL T A +++ W +E+
Sbjct: 394 NSRVTQTPPVEVILQADAKQK------------CTCFMYLDSTPNAIRLLELWKQEI--- 438
Query: 266 PWSKAKKANDQPAF 279
++ DQ AF
Sbjct: 439 --AEKSSFQDQRAF 450
>gi|30686478|ref|NP_850250.1| xyloglucanase 113 [Arabidopsis thaliana]
gi|18377747|gb|AAL67023.1| unknown protein [Arabidopsis thaliana]
gi|330254034|gb|AEC09128.1| xyloglucanase 113 [Arabidopsis thaliana]
Length = 644
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 88/200 (44%), Gaps = 13/200 (6%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K+ IIV + ++ F+ W+ ++ + +LV A D L + W G V
Sbjct: 130 KDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAMDTKLLEAL--YWKGVPVFDMG 186
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
+ S +GS F + + +L GY ++ D DMVWLK+P PYL D D
Sbjct: 187 SHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDMVWLKNPMPYLARFPDAD 246
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
V + D +D S D+ + G Y + + RPT+ AK + K+W E L A+
Sbjct: 247 VLTSSDQVVPTVIDDSLDIWQ--QVGAAYNIG-IFHWRPTESAKKLAKEWKEILLAD--- 300
Query: 269 KAKKANDQPAFNWALNKTAG 288
K DQ FN + + G
Sbjct: 301 --DKVWDQNGFNEIVRRQLG 318
>gi|222423247|dbj|BAH19600.1| AT2G35610 [Arabidopsis thaliana]
Length = 644
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 88/200 (44%), Gaps = 13/200 (6%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K+ IIV + ++ F+ W+ ++ + +LV A D L + W G V
Sbjct: 130 KDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAVDTKLLEAL--YWKGVPVFDMG 186
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
+ S +GS F + + +L GY ++ D DMVWLK+P PYL D D
Sbjct: 187 SHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDMVWLKNPMPYLARFPDAD 246
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
V + D +D S D+ + G Y + + RPT+ AK + K+W E L A+
Sbjct: 247 VLTSSDQVVPTVIDDSLDIWQ--QVGAAYNIG-IFHWRPTESAKKLAKEWKEILLAD--- 300
Query: 269 KAKKANDQPAFNWALNKTAG 288
K DQ FN + + G
Sbjct: 301 --DKVWDQNGFNEIVRRQLG 318
>gi|359489312|ref|XP_002270057.2| PREDICTED: uncharacterized protein At1g28695-like [Vitis vinifera]
gi|297734642|emb|CBI16693.3| unnamed protein product [Vitis vinifera]
Length = 346
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 71/146 (48%), Gaps = 13/146 (8%)
Query: 92 NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
N T+I+ V++ Y+ FL ++ I ++ D +LV+A D + R
Sbjct: 61 NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMADHLLVVAGDQTAYDRCIFRRL 120
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+V D + + S+ F RR LLH+LE GY+ ++ D D+ WL++PFP
Sbjct: 121 HCYKMVGEDGDMEGEKLYMSEDFIEMMWRRTLLLLHVLERGYSFIFTDTDVSWLRNPFPR 180
Query: 203 LQGD--HDVYFTDD--MAAVKPLDHS 224
L + D+ + D +++ +P D+S
Sbjct: 181 LTTNLTADLQISTDKFLSSHRPEDNS 206
>gi|326433586|gb|EGD79156.1| hypothetical protein PTSG_09887 [Salpingoeca sp. ATCC 50818]
Length = 300
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 87/205 (42%), Gaps = 24/205 (11%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
V+ T++V S+ Y+ L W+ I +D V+V A+D TL PG +
Sbjct: 55 VSSASTVVVTMASKDYIGLLQPWVRRIKAVGIEDFVIV-AQDKETLAAAEELAPGRVRIG 113
Query: 149 PPAPDSQTAH-------KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
TA +G++ + + ++ H++ +Y+D+D+++L +P
Sbjct: 114 YNEDKGTTASGTAGKKSAYGNKAWKEAVENKARYVWHVIARNQTALYSDIDVIFLHNPLK 173
Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
YL+ H V + + +D+ P S IY +PT + VM+KW +
Sbjct: 174 YLES-HVVSIG--------MPYGNDIDEPN------FNSGFIYAKPTPQTQEVMRKW-KS 217
Query: 262 LQAEPWSKAKKANDQPAFNWALNKT 286
L E W DQ A N A+ +T
Sbjct: 218 LSKEIWRYKSPPYDQRALNEAIVQT 242
>gi|15235236|ref|NP_193730.1| uncharacterized protein [Arabidopsis thaliana]
gi|2827651|emb|CAA16605.1| putative protein [Arabidopsis thaliana]
gi|7268792|emb|CAB78997.1| putative protein [Arabidopsis thaliana]
gi|332658853|gb|AEE84253.1| uncharacterized protein [Arabidopsis thaliana]
Length = 715
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 84/203 (41%), Gaps = 26/203 (12%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+I+ A++Q + F ++ + I ++ V+ + D + P
Sbjct: 112 EDNTVIITALNQAWAEPNSTFDVFRESFKVGIETERLLKHVIAVCLDIKAYDQCLKVHP- 170
Query: 144 HAVLVPPAPDSQTA--HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
H L+ Q + ++F + G+ RR L ++ LGYN ++ D D++WL+DPFP
Sbjct: 171 HCYLINATDSDQLSGPNRFMTPGYLKLIWRRMDLLRQVIGLGYNFIFTDADILWLRDPFP 230
Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
D D T D +P D + ++ S Y++ + K WI
Sbjct: 231 RFFPDADFQITCDDYNGRPSDK-----------KNHVNSGFTYVKANNKTSKFYKYWIRS 279
Query: 262 LQAEPWSKAKKANDQPAFNWALN 284
+ P +DQ FN+ N
Sbjct: 280 SRKFP-----GKHDQDVFNFIKN 297
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 66/158 (41%), Gaps = 11/158 (6%)
Query: 90 AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
+N T+IV ++Q + FL ++ I +K V+V+ D + + P
Sbjct: 451 TENRTVIVTTLNQAWAEPNSLFDLFLESFRIGQGTKKLLQHVVVVCLDSKAFARCSQLHP 510
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
L D F + + RR L +LE+GYN ++ D D++WL+DPFP
Sbjct: 511 NCYYLKTTGTDFSGEKLFATPDYLKMMWRRIELLTQVLEMGYNFIFTDADIMWLRDPFPR 570
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
L D D MA + HD G TY+ S
Sbjct: 571 LYPDGDF----QMACDRFFGDPHDSDNWVNGGFTYVKS 604
>gi|390347469|ref|XP_003726790.1| PREDICTED: uncharacterized protein LOC100893515 [Strongylocentrotus
purpuratus]
Length = 384
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 82/192 (42%), Gaps = 23/192 (11%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYAT---LYKVNGRWPGHAVLVPPA 151
+I+ A + ++ NW SI R VL++AED + ++ + A
Sbjct: 149 VILTATDKNFIDLAENWRESIRRLGLSYDVLLMAEDEESHKYFFERTTERFRTVLSTQFA 208
Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
+ A + + RR ++L IL+ GY+V+ D+D VW KDP + ++D Y
Sbjct: 209 LPGKLAKNIST--YQQLIRRRTVYILSILQSGYDVLLVDIDAVWFKDPVKLVLDEYDKY- 265
Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
D+ + GK+ R + C C Y++ T ++ WIE L + K
Sbjct: 266 --DIWLAQ-----------GKQAR-FPCPCFFYMKSTSVVIKMVYDWIERL---AYYGGK 308
Query: 272 KANDQPAFNWAL 283
DQ A ++ L
Sbjct: 309 VETDQVALSFIL 320
>gi|452824988|gb|EME31987.1| hypothetical protein Gasu_07340 [Galdieria sulphuraria]
Length = 658
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 61/121 (50%), Gaps = 5/121 (4%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIA---EDYATLYKVNGR--WPG 143
VA N T+I+ A++ Y FL N++ ++ + L++A ED G +
Sbjct: 385 VAINNTVILVAMNYGYRSFLMNFVCNLRQLNLFPGNLIVAALDEDMYRFAFTRGLPVYFE 444
Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+ V S A +GS F T + +L IL+LGY+V++ D D+VW ++P PYL
Sbjct: 445 NTVYSKEDATSVVAASYGSDSFKKLTKMKSRVVLRILKLGYDVIWTDCDIVWFRNPIPYL 504
Query: 204 Q 204
Q
Sbjct: 505 Q 505
>gi|326427878|gb|EGD73448.1| hypothetical protein PTSG_05152 [Salpingoeca sp. ATCC 50818]
Length = 749
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 64/128 (50%), Gaps = 6/128 (4%)
Query: 80 YTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
+TL VA+N +I+ VS Y + N++ ++ R DQ+++ A D +Y+
Sbjct: 442 FTLKDLLPIVARNKHVILSGVSYIYRDVVMNFVCNLRRLGIYDQLILAAFD-EEMYRFGF 500
Query: 140 RWPGHAVLVPPAPD----SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
R G + + D S ++GS F T + +L IL++GY+V + D D+VW
Sbjct: 501 RM-GLPIFYYQSDDLAGLSSRDLEYGSDAFKKVTKLKSQVVLQILQMGYDVTWTDTDIVW 559
Query: 196 LKDPFPYL 203
+DP P L
Sbjct: 560 FEDPIPKL 567
>gi|357131567|ref|XP_003567408.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 362
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVP-PAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
D VLV+A D L PGH L+ + + +A++F S+G+ + +L
Sbjct: 127 DHVLVVAVDTGALAHCEAVHPGHCYLLEVKSANISSANRFMSKGYLELVWAKLQLQHRVL 186
Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
+LGYN ++ DVD++WL+DPF ++ D+ + D
Sbjct: 187 QLGYNYLFTDVDIMWLRDPFRHISLYADMAVSTD 220
>gi|255089627|ref|XP_002506735.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
gi|226522008|gb|ACO67993.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
Length = 746
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 25/234 (10%)
Query: 61 GTPG-FMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQK 119
G PG ++ P S LA+ T + A++ ++V + Y F+ NW+ ++ R+
Sbjct: 199 GKPGEYVAPK--SKLAEDNQLTAELVGRY-AEDNIVMVTWANHHYHDFVRNWVRNV-RKC 254
Query: 120 HQDQVLVIAEDYATLYK-VNGRWPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLL 177
+V A D L K ++ P A+ T FG F+ R+ L+
Sbjct: 255 GMRNYMVGAMDNELLEKLIDDEVPTFAM-----QSGLTTKDFGWGTANFHKMGRKKIELI 309
Query: 178 HIL-ELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKG 234
H+ E+G++++ +DVD VWL++P PY+ DV + D A + P +K
Sbjct: 310 HLFTEMGFDILVSDVDTVWLRNPLPYMAKYPHADVLTSSDHLANTAEGEGLEDP---RKA 366
Query: 235 RTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG 288
+ ++ LR D AK + K+W++ L+ K K DQ FN + G
Sbjct: 367 HSAANIGIMLLR--DTAKELAKEWVDVLE-----KDDKVWDQNVFNDLYRRGGG 413
>gi|356577001|ref|XP_003556618.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 397
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 65/138 (47%), Gaps = 8/138 (5%)
Query: 79 DYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED 130
DY+L + A ++ T+I+ +++ + FL ++ I ++ + +++IA D
Sbjct: 100 DYSLENILNEAAMQDRTVILTTLNEAWAATNSIIDLFLESFRIGDRTRRLLNHLVIIALD 159
Query: 131 YATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYND 190
+ +LV A D F + + RR L +LE+GYN ++ D
Sbjct: 160 QKAFMRCQAIHTYCYLLVNEATDFHKEAYFMTPSYLKMMWRRIDFLRSVLEMGYNFVFTD 219
Query: 191 VDMVWLKDPFPYLQGDHD 208
VD++W +DPFP+ D D
Sbjct: 220 VDIMWFRDPFPWFHRDAD 237
>gi|125588648|gb|EAZ29312.1| hypothetical protein OsJ_13373 [Oryza sativa Japonica Group]
Length = 404
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 83/196 (42%), Gaps = 25/196 (12%)
Query: 94 TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
T+I+ V+ + FL + + + VLV+A D L + P +
Sbjct: 145 TVIITCVNHAFAAPDSLLDIFLQGFRVGDGTPELLRHVLVVAMDPTALTRCRAVHPHCYL 204
Query: 147 LVPPAPDSQ-TAHKF-GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
P D T+ KF S+ + + IL+LGYN ++ DVD+VWL++PF ++
Sbjct: 205 YTMPGLDVDFTSEKFFASKDYLELVWSKLKLQRRILQLGYNFLFTDVDIVWLRNPFKHVA 264
Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
D+ + D+ P D+ + P G Y++P+ + K+W E +
Sbjct: 265 VYADMAISSDVFFGDP-DNIDNFPNTG----------FFYVKPSARTIAMTKEWHEARSS 313
Query: 265 EPWSKAKKANDQPAFN 280
P N+QP FN
Sbjct: 314 HP-----GLNEQPVFN 324
>gi|125545615|gb|EAY91754.1| hypothetical protein OsI_13395 [Oryza sativa Indica Group]
Length = 343
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 86/202 (42%), Gaps = 16/202 (7%)
Query: 37 VLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVA-KNGTI 95
L R +V + LLV++G L + G D L A A NGT+
Sbjct: 12 ALGRAIAVVAICLLVIVGTYLLSSPAAGDGDTEEEFFSVDAGDDDLEAAVRGAAYANGTL 71
Query: 96 IVCAVSQPYLP----FLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNGRWPGHAVL 147
IV +++ Y L+ +L S+ + +Q VL++A D +
Sbjct: 72 IVSVLNRAYADEDGGLLDLFLRSMREGEGTEQLIAHVLLVAMDRPAFLRCRRLGGVRCYQ 131
Query: 148 VPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+P A D + + S GF RR L +L+LGY+ ++ D+D++WL++P P L
Sbjct: 132 LPAAQDGADDLSSEQLYMSDGFIRMMWRRIRLLGDVLKLGYSFIFTDLDVMWLRNPLPRL 191
Query: 204 Q---GDHDVYFTDDMAAVKPLD 222
+ + D+ + D +P D
Sbjct: 192 EYRAEEEDLLISSDQFNGRPGD 213
>gi|115456601|ref|NP_001051901.1| Os03g0849900 [Oryza sativa Japonica Group]
gi|28269394|gb|AAO37937.1| putative regulatory protein [Oryza sativa Japonica Group]
gi|108712117|gb|ABF99912.1| regulatory protein, putative, expressed [Oryza sativa Japonica
Group]
gi|113550372|dbj|BAF13815.1| Os03g0849900 [Oryza sativa Japonica Group]
Length = 408
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 76/176 (43%), Gaps = 18/176 (10%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQ-TAHKF-GSQG 164
FL + + + VLV+A D L + P + P D T+ KF S+
Sbjct: 165 FLQGFRVGDGTPELLRHVLVVAMDPTALTRCRAVHPHCYLYTMPGLDVDFTSEKFFASKD 224
Query: 165 FFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHS 224
+ + IL+LGYN ++ DVD+VWL++PF ++ D+ + D+ P D+
Sbjct: 225 YLELVWSKLKLQRRILQLGYNFLFTDVDIVWLRNPFKHVAVYADMAISSDVFFGDP-DNI 283
Query: 225 HDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+ P G Y++P+ + K+W E + P N+QP FN
Sbjct: 284 DNFPNTG----------FFYVKPSARTIAMTKEWHEARSSHP-----GLNEQPVFN 324
>gi|357517471|ref|XP_003629024.1| hypothetical protein MTR_8g072310 [Medicago truncatula]
gi|355523046|gb|AET03500.1| hypothetical protein MTR_8g072310 [Medicago truncatula]
Length = 655
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 87/201 (43%), Gaps = 13/201 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
K+ +IV + ++ F+ W+ ++ + + LV A D L + W G V
Sbjct: 122 VKDNVVIVTFGNYAFMDFILTWVKKLTDLEVSN-FLVGAMDTKLLEAL--YWKGVPVFDM 178
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL--QGDH 207
+ S +G+ F + L IL G+ V+ D DMVWLK+P PYL
Sbjct: 179 GSHMSTVDVGWGTPTFHKMGREKVILLDSILPFGFEVLMCDTDMVWLKNPLPYLARHPGA 238
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
D+ + D +D S ++ ++ + + RPT+ AK++ K+W E L A+
Sbjct: 239 DILTSSDQVVPTVVDDSLEI---WQEVSGAYNIGIFHWRPTESAKILAKQWKEMLLAD-- 293
Query: 268 SKAKKANDQPAFNWALNKTAG 288
K DQ FN L+ G
Sbjct: 294 ---DKIWDQNGFNDILHTQLG 311
>gi|224129610|ref|XP_002328759.1| predicted protein [Populus trichocarpa]
gi|222839057|gb|EEE77408.1| predicted protein [Populus trichocarpa]
Length = 638
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 88/200 (44%), Gaps = 13/200 (6%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K+ IIV + ++ F+ +W+ ++ + +LV A D L + W G V
Sbjct: 124 KDNVIIVTFGNYAFMDFILSWVKHLTDLGLSN-LLVGAMDTKLLEAL--YWKGIPVFDMG 180
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
+ S +GS F + + IL G ++ D DMVWLKDP PYL + D
Sbjct: 181 SHMSTADVGWGSPTFHKMGREKVILIDAILPYGVELLMCDTDMVWLKDPLPYLARYPEAD 240
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
V + D +D S DL + G Y + + RPT+ AK + ++W + L A+
Sbjct: 241 VLTSSDQVVPTVVDDSLDL--WQQVGAAYNIG-IFHWRPTESAKKLAREWKDMLLAD--- 294
Query: 269 KAKKANDQPAFNWALNKTAG 288
K DQ FN + K G
Sbjct: 295 --DKIWDQNGFNDIVRKQLG 312
>gi|323457174|gb|EGB13040.1| hypothetical protein AURANDRAFT_60713 [Aureococcus anophagefferens]
Length = 878
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 82/203 (40%), Gaps = 28/203 (13%)
Query: 93 GTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYAT--LYKVNGRWPGHAVLVPP 150
GT ++ VS + NW ++ +VL++AED T + + W +
Sbjct: 28 GTAVLVTVSHGFEDMFRNWWYWYAKLDLNMEVLLLAEDEPTRSAFAASRNWTVITTTMGA 87
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL-QGDHDV 209
P H + + + SRRP ++L L+ ++Y DVD VWL DP P+ G +D
Sbjct: 88 LPTQD--HTYETPNYKKLVSRRPHNILRGLDKYPRLIYTDVDTVWLSDPRPHFAAGAYDF 145
Query: 210 YFT-DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
+ DD P Y C+ + + T + ++ KW L A+P
Sbjct: 146 WGQLDDGTPQSP----------------YYCTGFLAFKNTAPTRALLSKWAVALDAKP-- 187
Query: 269 KAKKANDQPAFNWALNKTAGQVC 291
+QP FN L + + C
Sbjct: 188 ----NLNQPLFNDLLKREPAKHC 206
>gi|312282939|dbj|BAJ34335.1| unnamed protein product [Thellungiella halophila]
Length = 374
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 83/204 (40%), Gaps = 26/204 (12%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+I+ A++Q + F ++ + ++ V+ + D + P
Sbjct: 109 EDNTVIITALNQAWAEPNSTFDVFRESFKAGLGTERLLKHVIAVCLDNKAYDRCVEVHP- 167
Query: 144 HAVLVPPAPDSQTA--HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
H L+ Q + ++F + G+ RR +L LGYN ++ D D++WL+DPFP
Sbjct: 168 HCYLINATDSDQLSGPNRFMTPGYLKLIWRRMDLHRQVLGLGYNFIFTDADILWLRDPFP 227
Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
D D T D KP D ++ ++ S Y++ + K WI
Sbjct: 228 RFFPDADFQITCDDYNGKPSDKNN-----------HVNSGFTYVKANNKTLNFYKYWIRS 276
Query: 262 LQAEPWSKAKKANDQPAFNWALNK 285
+ P +DQ FN NK
Sbjct: 277 SRKFP-----GKHDQDVFNLIKNK 295
>gi|147856952|emb|CAN79199.1| hypothetical protein VITISV_040773 [Vitis vinifera]
Length = 715
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 86/201 (42%), Gaps = 13/201 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
K+ IIV + ++ F+ W+ ++ + +LV A D L + W G V
Sbjct: 156 VKDNVIIVTFGNYAFMDFILTWVKHLTDLGISN-LLVGAMDTKLLEAL--YWRGVPVFDM 212
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ S +G+ F + + +L GY ++ D DMVWLK+P PYL
Sbjct: 213 GSHMSTIDVGWGTPTFHKMGREKVILIDALLPFGYELLMCDTDMVWLKNPLPYLARFPGA 272
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
DV + D +D D+ + G Y + + RPTD AK + ++W E L A+
Sbjct: 273 DVLTSSDQVVPTVVDDRLDIWQ--QVGAAYNIG-IFHWRPTDSAKKLAREWKEMLLAD-- 327
Query: 268 SKAKKANDQPAFNWALNKTAG 288
K DQ FN + + G
Sbjct: 328 ---DKIWDQNGFNELVRRQLG 345
>gi|357142580|ref|XP_003572620.1| PREDICTED: uncharacterized protein LOC100829311 [Brachypodium
distachyon]
Length = 614
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 89/204 (43%), Gaps = 17/204 (8%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
A++G I+V + +L F+ W+ ++ D +LV A D L ++ R G V
Sbjct: 98 ARDGVIVVTFGNYAFLDFILTWVHHLTGLG-VDNLLVGAMDTKLLRELYLR--GVPVFDM 154
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ + +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 155 GSRMATEDAGWGSPAFHKMGREKVLLINTLLPFGYELLMCDTDMVWLKNPLPYLARYPDA 214
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA--E 265
D+ + D D S + ++ + + RPT+ AK + K W + + + E
Sbjct: 215 DILTSSDQVIPTVTDDSLE---KWREVSGAFNIGIFHWRPTEPAKRLAKDWKDLVMSNDE 271
Query: 266 PWSKAKKANDQPAFNWALNKTAGQ 289
W DQ AFN + K GQ
Sbjct: 272 IW-------DQNAFNDLVRKVFGQ 288
>gi|42563115|ref|NP_177220.2| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
gi|332196971|gb|AEE35092.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
Length = 537
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 4/117 (3%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN T+++ Y L +W+ + R K + ++ +D + + P P
Sbjct: 271 KNRTVVLSVAGYSYKDMLMSWVCRLRRLKVPNFLVCALDDETYQFSILQGLP--VFFDPY 328
Query: 151 APDSQTAHK--FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
AP + + + FGS+ F T + +L IL+LGYNV+ +DVD+ W ++P P LQ
Sbjct: 329 APKNISFNDCHFGSKCFQRVTKVKSRTVLKILKLGYNVLLSDVDVYWFRNPLPLLQS 385
>gi|12324743|gb|AAG52325.1|AC011663_4 hypothetical protein; 72471-70598 [Arabidopsis thaliana]
gi|12325048|gb|AAG52475.1|AC010796_14 hypothetical protein; 82031-83904 [Arabidopsis thaliana]
Length = 535
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 4/117 (3%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN T+++ Y L +W+ + R K + ++ +D + + P P
Sbjct: 269 KNRTVVLSVAGYSYKDMLMSWVCRLRRLKVPNFLVCALDDETYQFSILQGLP--VFFDPY 326
Query: 151 APDSQTAHK--FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
AP + + + FGS+ F T + +L IL+LGYNV+ +DVD+ W ++P P LQ
Sbjct: 327 APKNISFNDCHFGSKCFQRVTKVKSRTVLKILKLGYNVLLSDVDVYWFRNPLPLLQS 383
>gi|242033773|ref|XP_002464281.1| hypothetical protein SORBIDRAFT_01g015590 [Sorghum bicolor]
gi|241918135|gb|EER91279.1| hypothetical protein SORBIDRAFT_01g015590 [Sorghum bicolor]
Length = 615
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 89/202 (44%), Gaps = 13/202 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
A++G I+V + +L F+ W+ ++ D +LV A D L ++ R G V
Sbjct: 100 ARDGVIVVTFGNYAFLDFILTWVRHLT-DLGVDNLLVGAMDTKLLRELYFR--GVPVFDM 156
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ + +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 157 GSRMATEDAGWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 216
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
D+ + D D S + ++ + + RPT+ AK + K W + + ++
Sbjct: 217 DLLTSSDQVIPTVTDDSLE---NWREVTGAFNIGIFHWRPTEPAKRLAKDWKDLVISD-- 271
Query: 268 SKAKKANDQPAFNWALNKTAGQ 289
K DQ AFN + K GQ
Sbjct: 272 ---DKLWDQNAFNDLVRKNFGQ 290
>gi|302813272|ref|XP_002988322.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300144054|gb|EFJ10741.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 557
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 91/216 (42%), Gaps = 15/216 (6%)
Query: 68 PNATSSLAKWRDYTLSQA-ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV 126
P S L + LS+ F AK I+V + + F+ W+ ++ +L+
Sbjct: 14 PPHGSELPPREAFVLSKEMVEFRAKKNVIMVTFANHAFEDFVLTWVRHLT-DVGVTNLLI 72
Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
A D L ++ W G V + + +G+ F + + I+ +G+ V
Sbjct: 73 GAMDRKILEELF--WKGVPVFDMGSEMNPADVGWGTPVFHKMGREKVFLVNAIMAMGFEV 130
Query: 187 MYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
++ D DMVW+K+P PY++ D DV + D D S + ++ + + +
Sbjct: 131 LFCDTDMVWMKNPLPYMERYPDADVLVSSDAVIATVTDESLE---DWRRSYAALNIGIFH 187
Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
RPT+ AK + W +L+ E K DQ FN
Sbjct: 188 WRPTEAAKEFARAWQIQLEDE------KIWDQNGFN 217
>gi|297838873|ref|XP_002887318.1| hypothetical protein ARALYDRAFT_476193 [Arabidopsis lyrata subsp.
lyrata]
gi|297333159|gb|EFH63577.1| hypothetical protein ARALYDRAFT_476193 [Arabidopsis lyrata subsp.
lyrata]
Length = 537
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 4/117 (3%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN T+++ Y L +W+ + R K + ++ +D + + P P
Sbjct: 271 KNKTVVLSIAGYSYKDMLMSWVCRLRRLKVPNFLVCALDDETYQFSILQGLP--VFFDPY 328
Query: 151 APDSQTAHK--FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
AP + + + FGS+ F T + +L IL+LGYNV+ +DVD+ W ++P P LQ
Sbjct: 329 APKNISFNDCHFGSKCFQRVTKVKSRTVLKILKLGYNVLLSDVDVYWFRNPLPLLQS 385
>gi|356523378|ref|XP_003530317.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 436
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 63/138 (45%), Gaps = 8/138 (5%)
Query: 79 DYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED 130
+Y+L S A ++ T+I+ +++ + FL ++ I ++ + +++IA D
Sbjct: 111 EYSLENILSEAAMQDRTVILTTLNEAWAAPNSIIDLFLESFRIGDHTRRLLNHLVIIALD 170
Query: 131 YATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYND 190
+ +LV A D F + + RR L +LE+GYN ++ D
Sbjct: 171 QKAFIRCQAIHTYCYLLVSEATDFHEEAYFMTPSYLKMMWRRIDFLRSVLEMGYNFVFTD 230
Query: 191 VDMVWLKDPFPYLQGDHD 208
D++W +DPFP D D
Sbjct: 231 ADIMWFRDPFPRFHRDAD 248
>gi|356496487|ref|XP_003517099.1| PREDICTED: uncharacterized protein LOC100776463 [Glycine max]
Length = 627
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 87/201 (43%), Gaps = 13/201 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
K+ IIV + ++ F+ W+ ++ + LV A D L + W G V
Sbjct: 112 VKDNIIIVTFGNYAFMDFILTWVEQLNDLGVSN-YLVGAMDTKLLEAL--YWKGIPVFDM 168
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ S +GS F + + IL GY ++ D DMVWLK+P PYL
Sbjct: 169 GSHMSTDDVGWGSPTFHKMGREKVILINLILPFGYELLMCDTDMVWLKNPLPYLASYPKA 228
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
DV + D +D S ++ + G Y + + RPT+ AK + K+W E L A+
Sbjct: 229 DVLTSSDQVVPTVVDDSLEI--WQEVGAAYNIG-IFHWRPTESAKKLAKEWKELLLAD-- 283
Query: 268 SKAKKANDQPAFNWALNKTAG 288
K DQ FN +++ G
Sbjct: 284 ---DKIWDQNGFNDIVHRQLG 301
>gi|428172448|gb|EKX41357.1| hypothetical protein GUITHDRAFT_142056 [Guillardia theta CCMP2712]
Length = 528
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 90/201 (44%), Gaps = 23/201 (11%)
Query: 94 TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPD 153
+II+ ++ Y F NW++S+ + ++ +LV ++ A L+ + D
Sbjct: 62 SIILGFANKGYSKFAFNWVLSLRHAEVENFLLVALDEEAHLHFTRHHVTSYYNASMGTTD 121
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
+++ H GS+ F N R +++ +LE ++V DVD V+ DPF +L D
Sbjct: 122 AKSQHH-GSKTFRNIMEIRLRYVVELLEQDFDVWLTDVDSVFNTDPFVFLDAD------- 173
Query: 214 DMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLR-----PTDGAKLVMKKWIEELQAEPWS 268
+ L + P GK + + Y+R P + A ++K+ I+ + P
Sbjct: 174 ---SAAELAYDTPFLPKGKDSPLMVMAGFFYMRRCSKFPENCA--LLKETIKYIDDHP-- 226
Query: 269 KAKKANDQPAFNWALNKTAGQ 289
+ +DQ AFN L++ +
Sbjct: 227 ---EKHDQFAFNAVLSRKEAE 244
>gi|255557809|ref|XP_002519934.1| reticulon3-A3, putative [Ricinus communis]
gi|223540980|gb|EEF42538.1| reticulon3-A3, putative [Ricinus communis]
Length = 639
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 97/230 (42%), Gaps = 14/230 (6%)
Query: 62 TPGFMFPNATSSLAKWRDYTLS-QAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKH 120
TP + P S + + + L+ Q K+ IIV + ++ F+ W+ ++
Sbjct: 96 TPIWEIPPPNSKMLPIKKFRLTKQLVEKRVKDNVIIVTFGNFAFMDFILTWVKHLTDLGL 155
Query: 121 QDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
+ +LV A D L + W G V + S +GS F + + L
Sbjct: 156 SN-LLVGAMDTKLLEAL--YWKGVPVFDMGSHMSTADVGWGSPTFHKMGREKVILIDAFL 212
Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
G+ ++ D DMVWLK+P PYL D DV + D +D D+ + G Y
Sbjct: 213 PFGFELLMCDTDMVWLKNPLPYLARYPDADVLTSSDQVVPTVVDDRLDIWQ--EVGAAYN 270
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG 288
+ + RPT+ +K + K+W E L A+ K DQ FN + + G
Sbjct: 271 IG-IFHWRPTESSKKLAKEWKEILLAD-----DKIWDQNGFNDIVRRQLG 314
>gi|218189628|gb|EEC72055.1| hypothetical protein OsI_04965 [Oryza sativa Indica Group]
Length = 385
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 24/176 (13%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVP---------PAPDSQTAHKFGSQGFFNFTSRR 172
D VLV+A D + P H L+ A + +A++F S+G+ +
Sbjct: 139 DHVLVVAVDAGGFRRCKAVHP-HCYLLDVFVVSSGAGDAANLSSANRFMSRGYLELVWAK 197
Query: 173 PCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGK 232
+LELGY+ ++ DVD++WL+DPF ++ DV + D D + + P G
Sbjct: 198 LSLQQRVLELGYSFLFTDVDVMWLRDPFRHITLYADVTISSDHFHGDAGDVAGNSPNTG- 256
Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG 288
++RPT+ ++++W P ++DQ F+ + AG
Sbjct: 257 ---------FYHVRPTNRTVEMLRRWRAARSRFP----PASHDQNVFDGIKRELAG 299
>gi|356540910|ref|XP_003538927.1| PREDICTED: uncharacterized protein LOC100789338 [Glycine max]
Length = 632
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 88/201 (43%), Gaps = 13/201 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
K+ II+ + ++ F+ W+ ++ + LV A D L + W G V
Sbjct: 117 VKDNIIILTFGNYAFMDFILTWVEQLNDLGVSN-YLVGAMDIKLLEAL--YWKGIPVFDM 173
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ S +GS F + + IL GY ++ D DMVWLK+P PYL +
Sbjct: 174 GSHMSTDDVGWGSPTFHKMGREKVLLINLILPFGYELLMCDTDMVWLKNPLPYLARYPEA 233
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
DV + D +D S ++ + G Y + + RPT+ AK + K+W E L A+
Sbjct: 234 DVLTSSDQVVPTVVDDSLEI--WQEVGAAYNIG-IFHWRPTESAKKLAKEWKELLLAD-- 288
Query: 268 SKAKKANDQPAFNWALNKTAG 288
K DQ FN +++ G
Sbjct: 289 ---DKIWDQNGFNDIVHRQLG 306
>gi|296085465|emb|CBI29197.3| unnamed protein product [Vitis vinifera]
Length = 642
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 86/200 (43%), Gaps = 13/200 (6%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K+ IIV + ++ F+ W+ ++ + +LV A D L + W G V
Sbjct: 129 KDNVIIVTFGNYAFMDFILTWVKHLTDLGISN-LLVGAMDTKLLEAL--YWRGVPVFDMG 185
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
+ S +G+ F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 186 SHMSTIDVGWGTPTFHKMGREKVILIDALLPFGYELLMCDTDMVWLKNPLPYLARFPGAD 245
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
V + D +D D+ + G Y + + RPTD AK + ++W E L A+
Sbjct: 246 VLTSSDQVVPTVVDDRLDIWQ--QVGAAYNIG-IFHWRPTDSAKKLAREWKEMLLAD--- 299
Query: 269 KAKKANDQPAFNWALNKTAG 288
K DQ FN + + G
Sbjct: 300 --DKIWDQNGFNDLVRRQLG 317
>gi|226493177|ref|NP_001143144.1| uncharacterized protein LOC100275624 [Zea mays]
gi|195614994|gb|ACG29327.1| hypothetical protein [Zea mays]
gi|223947567|gb|ACN27867.1| unknown [Zea mays]
gi|238009756|gb|ACR35913.1| unknown [Zea mays]
gi|414871583|tpg|DAA50140.1| TPA: hypothetical protein ZEAMMB73_624371 [Zea mays]
Length = 615
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 88/202 (43%), Gaps = 13/202 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
A++G I+V + +L F+ W+ ++ D +LV A D L ++ R G V
Sbjct: 100 ARDGVIVVTFGNYAFLDFILTWVRHLT-DLGVDNLLVGAMDTKLLRELYFR--GVPVFDM 156
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 157 GSRMVTEDAGWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 216
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
D+ + D D S + ++ + + RPT+ AK + K W + + ++
Sbjct: 217 DLLTSSDQVIPTVTDDSLE---NWREVTGAFNIGIFHWRPTEPAKRLAKDWKDLVISD-- 271
Query: 268 SKAKKANDQPAFNWALNKTAGQ 289
K DQ AFN + K GQ
Sbjct: 272 ---DKLWDQNAFNDLVRKKFGQ 290
>gi|125587812|gb|EAZ28476.1| hypothetical protein OsJ_12458 [Oryza sativa Japonica Group]
Length = 327
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 16/202 (7%)
Query: 37 VLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVA-KNGTI 95
L R +V + LLV++G L + G D L A A NGT+
Sbjct: 12 ALGRAIAVVAICLLVIVGTYLLSSPAAGDGDTEEEFFSVDAGDDDLEAAVRGAAYANGTL 71
Query: 96 IVCAVSQPYLP----FLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNGRWPGHAVL 147
IV +++ Y L+ +L S+ + +Q VL++A D +
Sbjct: 72 IVSVLNRAYADEDGGLLDLFLRSMREGEGTEQLIAHVLLVAMDRPAFLRCRRLGGVRCYQ 131
Query: 148 VPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+P A D + + S GF RR L +L+LGY+ ++ D+D++WL++P L
Sbjct: 132 LPAAQDGADDLSSEQLYMSDGFIRMMWRRIRLLGDVLKLGYSFIFTDLDVMWLRNPLSRL 191
Query: 204 Q---GDHDVYFTDDMAAVKPLD 222
+ + D+ + D +P D
Sbjct: 192 EYRAEEEDLLISSDQFNGRPGD 213
>gi|359474749|ref|XP_002269181.2| PREDICTED: uncharacterized protein LOC100263017 [Vitis vinifera]
Length = 631
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 86/200 (43%), Gaps = 13/200 (6%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K+ IIV + ++ F+ W+ ++ + +LV A D L + W G V
Sbjct: 118 KDNVIIVTFGNYAFMDFILTWVKHLTDLGISN-LLVGAMDTKLLEAL--YWRGVPVFDMG 174
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
+ S +G+ F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 175 SHMSTIDVGWGTPTFHKMGREKVILIDALLPFGYELLMCDTDMVWLKNPLPYLARFPGAD 234
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
V + D +D D+ + G Y + + RPTD AK + ++W E L A+
Sbjct: 235 VLTSSDQVVPTVVDDRLDIWQ--QVGAAYNIG-IFHWRPTDSAKKLAREWKEMLLAD--- 288
Query: 269 KAKKANDQPAFNWALNKTAG 288
K DQ FN + + G
Sbjct: 289 --DKIWDQNGFNDLVRRQLG 306
>gi|168006999|ref|XP_001756196.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692706|gb|EDQ79062.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 526
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 66/133 (49%), Gaps = 18/133 (13%)
Query: 77 WRDYTLSQAASFVAKNGTIIVCAVS-QPYLPFLNNWLISISRQKHQDQVLVIAEDYATLY 135
W ++L AS VA +I+ +V+ Y L +W+ S+ R + ++ +D LY
Sbjct: 238 WLPFSLEALASRVASPDKVIILSVAGDSYRTMLMSWVCSLRRLNISNYLVYALDD--ELY 295
Query: 136 KVNGRWPGHAVL--VPPAPDSQTAH------KFGSQGFFNFTSRRPCHLLHILELGYNVM 187
+ HAV VP SQT FG++ F T + +LH+L+LG+ V+
Sbjct: 296 Q-------HAVSQGVPVVKSSQTMRVSRDDCHFGTKCFQEVTKMKSRTVLHLLQLGFKVL 348
Query: 188 YNDVDMVWLKDPF 200
++DVD+ W ++P
Sbjct: 349 FSDVDVYWFQNPI 361
>gi|330800041|ref|XP_003288048.1| hypothetical protein DICPUDRAFT_152231 [Dictyostelium purpureum]
gi|325081936|gb|EGC35435.1| hypothetical protein DICPUDRAFT_152231 [Dictyostelium purpureum]
Length = 593
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 49/94 (52%), Gaps = 8/94 (8%)
Query: 155 QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
+ A +G+ GF + +P +L +L+ GYNV++ D D+VW KDPF ++F ++
Sbjct: 87 EEAESYGNIGFRAICNEKPLVVLEVLKQGYNVLWTDTDIVWQKDPF--------IHFYNE 138
Query: 215 MAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPT 248
+ +++ D+ ++ IC+ ++R
Sbjct: 139 INKSNGFENNDDIDLYVQQDDDDICAGFYFIRSN 172
>gi|219130296|ref|XP_002185304.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403219|gb|EEC43173.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 555
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 62/129 (48%), Gaps = 10/129 (7%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
++VC Q L N + SR + VLV A D T Y + HA+ V A
Sbjct: 290 VLVCNHGQSEL-LWNFVCAARSRSLNLAHVLVFATDSVT-YDLAVAMGLHAMDVQNAFGD 347
Query: 155 Q---TAHKFGSQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKDPFPYL---QGDH 207
A ++G F + +++H+L LGYNV++ DVD+VW +DP Y Q D
Sbjct: 348 MPTVAARRYGDAAFTGMMMSK-VYVMHLLITLGYNVLFQDVDVVWYQDPLAYFNNDQSDF 406
Query: 208 DVYFTDDMA 216
D++F DD A
Sbjct: 407 DLFFQDDGA 415
>gi|219130298|ref|XP_002185305.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403220|gb|EEC43174.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 553
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 62/129 (48%), Gaps = 10/129 (7%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
++VC Q L N + SR + VLV A D T Y + HA+ V A
Sbjct: 288 VLVCNHGQSEL-LWNFVCAARSRSLNLAHVLVFATDSVT-YDLAVAMGLHAMDVQNAFGD 345
Query: 155 Q---TAHKFGSQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKDPFPYL---QGDH 207
A ++G F + +++H+L LGYNV++ DVD+VW +DP Y Q D
Sbjct: 346 MPTVAARRYGDAAFTGMMMSK-VYVMHLLITLGYNVLFQDVDVVWYQDPLAYFNNDQSDF 404
Query: 208 DVYFTDDMA 216
D++F DD A
Sbjct: 405 DLFFQDDGA 413
>gi|357438789|ref|XP_003589671.1| hypothetical protein MTR_1g034810 [Medicago truncatula]
gi|355478719|gb|AES59922.1| hypothetical protein MTR_1g034810 [Medicago truncatula]
Length = 306
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 69/157 (43%), Gaps = 11/157 (7%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
L++AA ++ T+I+ +++ + FL+++ I + ++ + +++IA D
Sbjct: 14 LNEAA---MEDRTVILTTLNEAWAAPNSVIDLFLDSFRIGVRTRRLLNHLVIIALDQKAF 70
Query: 135 YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMV 194
+ +LV D F + + RR L +LELGYN ++ D D++
Sbjct: 71 VRCQAIHTYCFLLVSEGNDFHEEAFFMTPLYLKMMWRRIDFLRSVLELGYNFVFTDADIM 130
Query: 195 WLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPG 231
W +DPFP D D D D H+ P G
Sbjct: 131 WFRDPFPRFYDDADFQIACD-HFTGFFDDVHNRPNGG 166
>gi|66811206|ref|XP_639311.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
gi|74996971|sp|Q54RP0.1|AGTA_DICDI RecName: Full=UDP-galactose:fucoside alpha-3-galactosyltransferase;
AltName: Full=Alpha-GalT1; AltName:
Full=Fucosylgalactoside 3-alpha-galactosyltransferase;
AltName: Full=Skp1 alpha-3-galactosyltransferase
gi|60467933|gb|EAL65946.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
gi|84873369|gb|ABC67744.1| UDP-Gal:fucoside alpha3-galactosyltransferase [Dictyostelium
discoideum]
Length = 648
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/176 (22%), Positives = 73/176 (41%), Gaps = 19/176 (10%)
Query: 77 WRDYTLSQAASF----VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYA 132
+RD TL+ F + K+ I+ C + Y F I R +D ++ +
Sbjct: 22 FRDMTLNLLKCFEKLSIDKSRYILYCMDDKAYQFFAEFKGIECQRFS-RDDIINSSTSST 80
Query: 133 TLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
L+ N + S+ A +G GF + +P +L +L+ GYNV++ D D
Sbjct: 81 QLFHDNNTNDNKGIY------SENAESYGDIGFRAICNEKPLVVLDVLKKGYNVLWTDTD 134
Query: 193 MVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPT 248
+VW +DPF ++F D+ + D+ ++ IC+ ++R
Sbjct: 135 IVWKRDPF--------IHFYQDINQENQFTNDDDIDLYVQQDDDDICAGFYFIRSN 182
>gi|219127152|ref|XP_002183805.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404528|gb|EEC44474.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 650
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 65/146 (44%), Gaps = 13/146 (8%)
Query: 86 ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ-DQVLVIAEDYATLYKVNGRWPGH 144
A A+N TI+V +Q L N++ S +R+ VLV A D T YK+
Sbjct: 313 ADQAARNQTIVVMTCNQGQSELLVNFVCSCTRRGLPISHVLVFATDTET-YKLAKSLGLR 371
Query: 145 AVLVPPAPD--------SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
A V P ++ A +G F + + +L LGYNV++ DVD++W
Sbjct: 372 AWDVTSLPGAFGVRSFPTKAADAYGDLTFAALMMAKVYCVHVVLLLGYNVLFQDVDVIWY 431
Query: 197 KDPFPYLQ---GDHDVYFTDDMAAVK 219
+DP PY + DV DD A K
Sbjct: 432 QDPVPYFETHWTTMDVIMQDDGARTK 457
>gi|428175509|gb|EKX44399.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
[Guillardia theta CCMP2712]
Length = 1433
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 80/196 (40%), Gaps = 26/196 (13%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
I + + YL + NWL + R D ++ A D + G V P D
Sbjct: 50 IAITFSNSGYLNYCMNWLHHV-RSVGVDNYVIFALDAEAYSSLKGEAN---VFYDPRLDE 105
Query: 155 ----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF--PYLQGDHD 208
+ A FGS F +P L +LELG++++ +D D+VW KDPF P + G H
Sbjct: 106 GKIDKRATDFGSDPFKKIVHLKPTLTLRVLELGFHLLLSDADVVWFKDPFSVPEVVGSHL 165
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
+D D++ P ++ S Y+ P M++ + L + P
Sbjct: 166 NLMSD-----AHFDYAMGNTP------YFVNSGFAYMSPHPTTIAFMREVVRLLASRP-- 212
Query: 269 KAKKANDQPAFNWALN 284
DQ A+N A++
Sbjct: 213 ---DKMDQDAYNTAIS 225
>gi|89070444|ref|ZP_01157741.1| Acetyltransferase (isoleucine patch superfamily protein)
[Oceanicola granulosus HTCC2516]
gi|89043930|gb|EAR50116.1| Acetyltransferase (isoleucine patch superfamily protein)
[Oceanicola granulosus HTCC2516]
Length = 544
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 71/186 (38%), Gaps = 29/186 (15%)
Query: 104 YLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQ 163
Y+P NWL ++ +Q V+ D +P VL PAP+
Sbjct: 16 YVPVTRNWLAHLATLGLAEQATVVTLDSGARTA----FPVEQVLHRPAPEP--------- 62
Query: 164 GFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-DHDVYFTDDMAAVKPLD 222
G R ILE G ++++D D +WL DP P + ++ F+ V P D
Sbjct: 63 GLAGLWKHRMAVCQEILEAGEALIHSDADAIWLDDPRPRIAACGSEMVFS--QGTVWPYD 120
Query: 223 HSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWA 282
L R +C YL PT + +M + ++ L + DQ A N
Sbjct: 121 IHERL-------RLVLCCGFFYLAPTSRVRTLMDQVLQRLDTD------GGEDQEAVNRV 167
Query: 283 LNKTAG 288
+ +T G
Sbjct: 168 VAETIG 173
>gi|115455131|ref|NP_001051166.1| Os03g0731800 [Oryza sativa Japonica Group]
gi|50540739|gb|AAT77895.1| expressed protein [Oryza sativa Japonica Group]
gi|108710906|gb|ABF98701.1| regulatory protein, putative, expressed [Oryza sativa Japonica
Group]
gi|113549637|dbj|BAF13080.1| Os03g0731800 [Oryza sativa Japonica Group]
gi|215740630|dbj|BAG97286.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 351
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 52/210 (24%), Positives = 87/210 (41%), Gaps = 24/210 (11%)
Query: 37 VLNRTTLLVLLSLLVVLGVILPWTGTPG--------FMFPNATSSLAKWRDYTLSQAASF 88
L R +V + LLV++G L + G F S D L A
Sbjct: 12 ALGRAIAVVAICLLVIVGTYLLSSPAAGDGDTEEEFFSVRRRRRSSVDAGDDDLEAAVRG 71
Query: 89 VA-KNGTIIVCAVSQPYLP----FLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNG 139
A NGT+IV +++ Y L+ +L S+ + +Q VL++A D +
Sbjct: 72 AAYANGTLIVSVLNRAYADEDGGLLDLFLRSMREGEGTEQLIAHVLLVAMDRPAFLRCRR 131
Query: 140 RWPGHAVLVPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
+P A D + + S GF RR L +L+LGY+ ++ D+D++W
Sbjct: 132 LGGVRCYQLPAAQDGADDLSSEQLYMSDGFIRMMWRRIRLLGDVLKLGYSFIFTDLDVMW 191
Query: 196 LKDPFPYLQ---GDHDVYFTDDMAAVKPLD 222
L++P L+ + D+ + D +P D
Sbjct: 192 LRNPLSRLEYRAEEEDLLISSDQFNGRPGD 221
>gi|413946045|gb|AFW78694.1| hypothetical protein ZEAMMB73_007904 [Zea mays]
Length = 551
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 61/126 (48%), Gaps = 3/126 (2%)
Query: 80 YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL-YKV 137
Y+LS VA +NG++++ + Y L +W + R + + LV A D+ T + +
Sbjct: 259 YSLSMLLDLVADENGSVVLGVAGKSYRDMLMSWACRLRRLRVTN-FLVCALDHETYEFSI 317
Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
P + P S FG++ F T + +L IL LGYNV+ +DVD+ W
Sbjct: 318 LQGLPVFRDPLSPTNVSFDDCHFGTKCFQQVTKVKSRIVLEILRLGYNVLLSDVDVYWFG 377
Query: 198 DPFPYL 203
+P P L
Sbjct: 378 NPMPLL 383
>gi|115453933|ref|NP_001050567.1| Os03g0586300 [Oryza sativa Japonica Group]
gi|50399937|gb|AAT76325.1| expressed protein [Oryza sativa Japonica Group]
gi|113549038|dbj|BAF12481.1| Os03g0586300 [Oryza sativa Japonica Group]
Length = 356
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 83/199 (41%), Gaps = 9/199 (4%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
A+ G I+V + +L F+ W+ ++ D +LV A D L ++ R G V
Sbjct: 102 ARGGAIVVTFGNHAFLDFILTWVRHLT-DLGVDSLLVGAMDTKLLRELYLR--GVPVFDM 158
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
+ + +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 159 GSRMATEDAVWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 218
Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
V P L + Y + + RPT+ AK + K W + L ++
Sbjct: 219 DLLTSSDQVIPTVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSD---- 273
Query: 270 AKKANDQPAFNWALNKTAG 288
DQ AFN ++K G
Sbjct: 274 -DTLWDQNAFNDLIHKKFG 291
>gi|397604899|gb|EJK58811.1| hypothetical protein THAOC_21024, partial [Thalassiosira oceanica]
Length = 708
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 68/152 (44%), Gaps = 23/152 (15%)
Query: 83 SQAASFVAKNGTIIVCAVSQPY-----LPFLNNWLI------SISRQKHQDQVLVIAEDY 131
S+ + +AK GT V ++ + LPF+ + L+ S S+ + V+V D
Sbjct: 330 SRLKALLAKMGTKTVVVLTSNFGQLRSLPFMKSELLINFACNSRSKGFSLNNVIVFPTDL 389
Query: 132 ATL-----YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
T +N + H + P + A +G Q F + + + ELGY+V
Sbjct: 390 ETKELSEGMGLNTFYDEHLMASVP---KREARYYGDQIFTGVMFSKVVCVQLVNELGYDV 446
Query: 187 MYNDVDMVWLKDPFPYLQGDH----DVYFTDD 214
++ DVD+VW KDP Y + D+YF DD
Sbjct: 447 LFQDVDLVWFKDPLTYFHNESLPQFDMYFQDD 478
>gi|356560983|ref|XP_003548765.1| PREDICTED: uncharacterized protein LOC100819032 [Glycine max]
Length = 639
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 84/200 (42%), Gaps = 13/200 (6%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K+ +IV + ++ F+ W+ + R LV A D + + W G V
Sbjct: 122 KDNAVIVTFGNYAFMDFILTWVKQL-RDLGVSNFLVGAMDTKLVEAL--YWKGIPVFDMG 178
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
+ S +GS F + + IL G+ ++ D DMVWLK+P PYL + D
Sbjct: 179 SHMSTVDVGWGSPTFHKMGREKVILIDSILPFGFELLMCDTDMVWLKNPLPYLARYPEAD 238
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
V + D +D S + P + + RPT+ AK + K+W E L A+
Sbjct: 239 VLTSSDQVIPTVVDDSLENWPEVSGAYNI---GIFHWRPTESAKKLAKQWKEMLLAD--- 292
Query: 269 KAKKANDQPAFNWALNKTAG 288
+ DQ FN +++ G
Sbjct: 293 --DQIWDQNGFNDIVHRQLG 310
>gi|145343482|ref|XP_001416351.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576576|gb|ABO94644.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 500
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 63/136 (46%), Gaps = 8/136 (5%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH-DVYF 211
D A S+ + F + L +L+ G++V+ +DVD+VWL+D PY + + DV
Sbjct: 50 DLNDAADHASESWKKFCALMVKELRAVLDAGFDVILSDVDVVWLRDAAPYFKCESGDVDG 109
Query: 212 TDDMAAVKPLDHSHDLPPP-----GKK--GRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
+++ + S +L P G + R + M+++R T K + W LQA
Sbjct: 110 CEEIRGADVMISSDNLSPTMDWELGARYAMRGIFNTGMMFIRNTRAGKDFLSDWARNLQA 169
Query: 265 EPWSKAKKANDQPAFN 280
+ + +K Q FN
Sbjct: 170 KDGAYSKLTTHQQVFN 185
>gi|342318978|gb|EGU10930.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 969
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/162 (22%), Positives = 73/162 (45%), Gaps = 11/162 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL-YKVNGRWPGHAVLV 148
+K +++ + YL + N+ + R+ D ++IA D T+ + + P ++ +
Sbjct: 353 SKTNNVMLAFGTSNYLDLVRNF-VHFVREAGIDNFVLIAMDADTVAWAEEEKVPYYSYID 411
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY-LQGDH 207
+ + S GF +RR + L G+N++ +D+D++W+K+PFPY GD+
Sbjct: 412 EEVATLGGSDSYKSDGFRRVVNRRCSVISTALRGGFNILQSDLDVIWVKNPFPYFFNGDY 471
Query: 208 DVYF-TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPT 248
+ +D D P R ++ S + Y R T
Sbjct: 472 EYEIQSDGRRGFTERD-------PAAPFRDFVNSGLFYARGT 506
>gi|356558894|ref|XP_003547737.1| PREDICTED: uncharacterized protein LOC100780346 [Glycine max]
Length = 638
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 85/200 (42%), Gaps = 13/200 (6%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K+ IIV + ++ F+ W+ ++ + LV A D L + W G V
Sbjct: 122 KDNAIIVTFGNYAFMDFILTWVKQLTDLGVSN-FLVGAMDTKLLEAL--YWKGIPVFDMG 178
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
+ S +GS F + + IL G+ ++ D DMVWLK+P PYL + D
Sbjct: 179 SHMSTVDVGWGSPTFHKMGREKVILIDSILPFGFELLMCDTDMVWLKNPLPYLARYPEAD 238
Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
V + D +D S + P + + RPT+ AK + K+W E L A+
Sbjct: 239 VLTSSDQVIPTVVDDSLENWPEVSGAYNI---GIFHWRPTESAKKLAKQWKEMLLAD--- 292
Query: 269 KAKKANDQPAFNWALNKTAG 288
+ DQ FN +++ G
Sbjct: 293 --DQIWDQNGFNDIVHRQLG 310
>gi|108709554|gb|ABF97349.1| expressed protein [Oryza sativa Japonica Group]
Length = 622
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 83/199 (41%), Gaps = 9/199 (4%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
A+ G I+V + +L F+ W+ ++ D +LV A D L ++ R G V
Sbjct: 102 ARGGAIVVTFGNHAFLDFILTWVRHLT-DLGVDSLLVGAMDTKLLRELYLR--GVPVFDM 158
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
+ + +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 159 GSRMATEDAVWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 218
Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
V P L + Y + + RPT+ AK + K W + L ++
Sbjct: 219 DLLTSSDQVIPTVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSD---- 273
Query: 270 AKKANDQPAFNWALNKTAG 288
DQ AFN ++K G
Sbjct: 274 -DTLWDQNAFNDLIHKKFG 291
>gi|218193222|gb|EEC75649.1| hypothetical protein OsI_12401 [Oryza sativa Indica Group]
Length = 622
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 83/199 (41%), Gaps = 9/199 (4%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
A+ G I+V + +L F+ W+ ++ D +LV A D L ++ R G V
Sbjct: 102 ARGGAIVVTFGNHAFLDFILTWVRHLT-DLGVDSLLVGAMDTKLLRELYLR--GVPVFDM 158
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
+ + +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 159 GSRMATEDAVWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 218
Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
V P L + Y + + RPT+ AK + K W + L ++
Sbjct: 219 DLLTSSDQVIPTVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSD---- 273
Query: 270 AKKANDQPAFNWALNKTAG 288
DQ AFN ++K G
Sbjct: 274 -DTLWDQNAFNDLIHKKFG 291
>gi|302840365|ref|XP_002951738.1| hypothetical protein VOLCADRAFT_92385 [Volvox carteri f.
nagariensis]
gi|300262986|gb|EFJ47189.1| hypothetical protein VOLCADRAFT_92385 [Volvox carteri f.
nagariensis]
Length = 783
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 83/183 (45%), Gaps = 13/183 (7%)
Query: 86 ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAE-DYATLYKVNGRWPGH 144
A + N ++ S + F WL + + + VL +A+ D A L + G H
Sbjct: 187 AQWAVNNTVMVAFTNSIMFRNFGPTWLHHVRKAGIKYWVLAVADNDTAKLVRNYG--ADH 244
Query: 145 AVLVPPAPDSQTAHKF--GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP- 201
LV TA F GS+ + T ++ + H+ LG++V+ +D+D+VWL++P
Sbjct: 245 CFLVHENEIDDTAADFKWGSRSWQLHTWQKVLTVRHVHMLGFHVINSDMDVVWLRNPLEH 304
Query: 202 ----YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKK 257
Y + D+ V D + PL D P G Y+ + + +LR T G + ++ K
Sbjct: 305 FLVKYTEPDYWVSM-DPITTANPL--GDDGPELGVSTHHYMNTGVYFLRQTPGGRALIDK 361
Query: 258 WIE 260
W E
Sbjct: 362 WYE 364
>gi|449463499|ref|XP_004149471.1| PREDICTED: uncharacterized protein At4g15970-like [Cucumis sativus]
Length = 350
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 62/137 (45%), Gaps = 10/137 (7%)
Query: 79 DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
D L AA+ ++ T+I+ +++ + FL ++ I + D +++IA D
Sbjct: 74 DKVLKDAAT---EDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHLVIIALDK 130
Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
+ LV D ++ F S + RR L +LE+GYN ++ D
Sbjct: 131 KAFMRCLDIHIHCVSLVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEMGYNFVFTDA 190
Query: 192 DMVWLKDPFPYLQGDHD 208
D++W +DPFP+ + D
Sbjct: 191 DVMWFRDPFPFFDINAD 207
>gi|449433169|ref|XP_004134370.1| PREDICTED: uncharacterized protein At1g28695-like [Cucumis sativus]
Length = 305
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 61/140 (43%), Gaps = 15/140 (10%)
Query: 80 YTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLVI 127
+ L +AA N T+++ +++ Y FL+ + + +K D +L++
Sbjct: 22 FALEKAA---MANKTVVITVINKAYADQGVRDDTTMLDVFLSGFWLGEDTRKLLDHLLLV 78
Query: 128 AEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVM 187
A D + + L D + S+ F +R LL +L+ GY+ +
Sbjct: 79 AVDQTAYDRCRFQRLNCFKLETEGVDFGGEKLYMSEEFIKMMWKRTLFLLEVLKRGYSFI 138
Query: 188 YNDVDMVWLKDPFPYLQGDH 207
+ D D++WL+DPFP L D
Sbjct: 139 FTDTDVMWLRDPFPKLSKDE 158
>gi|449478754|ref|XP_004155410.1| PREDICTED: uncharacterized LOC101214056 [Cucumis sativus]
Length = 1456
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 55/113 (48%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN TI++ Y L +W+ + R + + ++ + + V P + +PP
Sbjct: 1182 KNKTIVLAIAGYSYKDMLMSWVCRLRRLQISNYLVCALDSDTYKFSVLQGLPVYRDPLPP 1241
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
S FG++ F T + +L IL+LGYNV+ +DVD+ W +P P++
Sbjct: 1242 TNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFMNPLPFI 1294
>gi|414878979|tpg|DAA56110.1| TPA: hypothetical protein ZEAMMB73_844418 [Zea mays]
Length = 350
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 62/131 (47%), Gaps = 11/131 (8%)
Query: 94 TIIVCAVSQPYLPFLNNWLISISRQKHQ---------DQVLVIAEDYATLYKVNGRWPGH 144
T+I+ +V++ + + L+ + R+ Q + VLV+A D P
Sbjct: 66 TVIITSVNEAFAR--PDSLLGLFRESFQAGEGIGHLLNNVLVVAVDAKAFSHCRAVHPHC 123
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
+L D +A+ + S+ + + L ILELGYN ++ DVD+VW ++PF ++
Sbjct: 124 YLLEVKTIDLSSANNYMSEAYIELVWTKLSLLQRILELGYNFLFTDVDIVWFRNPFRHIS 183
Query: 205 GDHDVYFTDDM 215
D+ + D+
Sbjct: 184 VFADMTTSSDV 194
>gi|449449787|ref|XP_004142646.1| PREDICTED: uncharacterized protein LOC101203732 [Cucumis sativus]
Length = 640
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 86/201 (42%), Gaps = 13/201 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
AK+ IIV + ++ F+ +W+ ++ + +LV A D L + W G V
Sbjct: 125 AKDNIIIVTFGNYAFMDFILSWVKHLTDLGLTN-LLVGAMDTKLLEAL--YWKGIPVFDM 181
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ S +GS F + + IL G ++ D DMVWLK+P PYL
Sbjct: 182 GSHMSTVDVGWGSPTFHKMGREKVILIDSILPYGVELLMCDTDMVWLKNPLPYLARYPAA 241
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
DV + D +D D + + + + RPT+ +K + K+W E L A+
Sbjct: 242 DVLTSSDQVVPTVVDDRLD---KWNEVTGALNIGIFHWRPTEASKKLAKEWKEMLLAD-- 296
Query: 268 SKAKKANDQPAFNWALNKTAG 288
K DQ FN ++K G
Sbjct: 297 ---DKIWDQNGFNELVHKKYG 314
>gi|390356493|ref|XP_003728806.1| PREDICTED: uncharacterized protein LOC100893660 [Strongylocentrotus
purpuratus]
Length = 318
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/185 (21%), Positives = 77/185 (41%), Gaps = 39/185 (21%)
Query: 92 NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
N +I+ V+ ++ NWL S+ R + + ++AED T + R
Sbjct: 79 NKFVILTTVNAEFVDIAENWLESLRRLGIRYNITMVAEDQDTFEYFSIR----------- 127
Query: 152 PDSQTAHKFGSQGFFNFTSR-------------RPCHLLHILELGYNVMYNDVDMVWLKD 198
+ + + Q FNFT R R ++ +LE G +V+ DVD VW+K+
Sbjct: 128 ANREFRVLYQKQFAFNFTRRYSPAKLYEEVIRCRTVYIRTLLEQGNDVLLVDVDTVWIKN 187
Query: 199 PFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
P + ++ +Y + H+ P C C +Y++ T ++++W
Sbjct: 188 PVDLILKEYALY---------DIWHAQGYP------EGMPCPCFMYMKSTSEVVAMVREW 232
Query: 259 IEELQ 263
++ L+
Sbjct: 233 VKRLE 237
>gi|42568325|ref|NP_199295.2| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
gi|109946407|gb|ABG48382.1| At5g44820 [Arabidopsis thaliana]
gi|332007782|gb|AED95165.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
Length = 367
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/143 (23%), Positives = 58/143 (40%), Gaps = 7/143 (4%)
Query: 90 AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
KN T+I+ ++Q + FL ++ I Q+ V+V+ D + +
Sbjct: 101 TKNNTVIITTLNQAWAEPNSLFDLFLESFRIGQGTQQLLKHVVVVCLDIKAFERCSQLHT 160
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+ D + + + R L +LE+G+N ++ D D++WL+DPFP
Sbjct: 161 NCYHIETSETDFSGEKVYNTPDYLKMMWARIDLLTQVLEMGFNFIFTDADIMWLRDPFPR 220
Query: 203 LQGDHDVYFTDDMAAVKPLDHSH 225
L D D D P D +
Sbjct: 221 LYPDGDFQMACDRFFGNPYDSDN 243
>gi|323449936|gb|EGB05820.1| hypothetical protein AURANDRAFT_66031 [Aureococcus anophagefferens]
Length = 619
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 66/146 (45%), Gaps = 13/146 (8%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQD----QVLVIAEDYATLYKV 137
L ++ A++ T++V A + L + N++ S++ + VLV A D T V
Sbjct: 311 LIKSRKIAARHDTVVVMATNLGTLDLVANFVCSVASVPELEPTLASVLVFASDGGTRDAV 370
Query: 138 NGRWPGHAVLVPPAPD---SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMV 194
G A PA S A +G + F + + +L L +NV++ D D+V
Sbjct: 371 EKL--GIAAFSDPALGDLPSDAAKVYGDRSFVRMMWLKVTSVYLVLSLQHNVLFQDADVV 428
Query: 195 WLKDPFPY---LQGDH-DVYFTDDMA 216
W +DP PY + D D ++ DD A
Sbjct: 429 WFRDPLPYFAEIADDQVDTFWMDDGA 454
>gi|224111368|ref|XP_002315829.1| predicted protein [Populus trichocarpa]
gi|222864869|gb|EEF02000.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 74/171 (43%), Gaps = 14/171 (8%)
Query: 79 DYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED 130
DY L + A ++ T+I+ +++ + FL ++ I ++ + ++++A D
Sbjct: 105 DYKLEKVLKEAAMEDKTVIIATLNEAWAAPNTIIDLFLESFRIGQGTRRLLNHLVIVALD 164
Query: 131 YATLYKVNGRWPGHA-VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYN 189
YK + H LV D F + + RR L +L++GYN ++
Sbjct: 165 RKA-YKRCMEFHAHCFALVTQGLDFHDEAYFMTPAYLEMMWRRIDFLRAVLQMGYNFVFT 223
Query: 190 DVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
D D++W +DPFP D D +A L +S D+ G Y+ S
Sbjct: 224 DADIMWFRDPFPRFYLDADF----QIACDHFLGNSSDIQNRPNGGFNYVKS 270
>gi|384251936|gb|EIE25413.1| hypothetical protein COCSUDRAFT_61632 [Coccomyxa subellipsoidea
C-169]
Length = 178
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 73/166 (43%), Gaps = 29/166 (17%)
Query: 94 TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPD 153
TI+ ++ LP L +SR + +VLV+A D L + HA +P
Sbjct: 3 TIVDAEAAEILLPVFLESLQKVSR-RLVPRVLVLASDAQGLQICRTK---HAHCLP---- 54
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
+F S L +L +NV+ DVDMVWLKDPF L+ + +D
Sbjct: 55 -----------WFQRLSAAIGALQQLLAKNFNVLAADVDMVWLKDPFAALEPN----LSD 99
Query: 214 DMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWI 259
+ V + + D P +K C+ +I+ R T+ + ++K W+
Sbjct: 100 FLMTVDSTNSTDD--SPAQKP----CAGLIFARATEPTRALVKSWV 139
>gi|308802167|ref|XP_003078397.1| unnamed protein product [Ostreococcus tauri]
gi|116056849|emb|CAL53138.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 556
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 92/217 (42%), Gaps = 31/217 (14%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL--VIAEDYATLYKVNGR-WPGHA 145
VAK+G +IV + Y F NWL + ++ ++ + E YA L K+ W
Sbjct: 44 VAKDGAVIVTWANSHYYDFALNWLRHLDALGVENYLIGAMDEEMYAKLRKIGVPCW---- 99
Query: 146 VLVPPAPDSQTAHK---FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
++ D + + +GS+ F + + + G +V+ +D+D+ WL+DP P+
Sbjct: 100 LMGSQGIDKEAVTRDFGWGSKNFHKMGRDKIRLIRDFTKTGTDVLISDIDVAWLRDPIPF 159
Query: 203 LQG--DHDVYFTDD----------MAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDG 250
+ D+ + D K +D L PP G I M++ R T G
Sbjct: 160 FRRYPKADILVSTDNLRNMTHESPEQIAKTVD-GEGLEPPC-AGTANI--GMMWFRSTSG 215
Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA 287
++ + +W+ L+ K K DQ FN ++K
Sbjct: 216 SQAITDEWVNNLE-----KDDKIWDQAEFNTLMHKGG 247
>gi|357517311|ref|XP_003628944.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
truncatula]
gi|355522966|gb|AET03420.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
truncatula]
Length = 1906
Score = 50.4 bits (119), Expect = 0.001, Method: Composition-based stats.
Identities = 30/125 (24%), Positives = 59/125 (47%), Gaps = 1/125 (0%)
Query: 80 YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVN 138
++L S A +N T+++ Y L +W+ + + ++ ++ + + +
Sbjct: 1405 FSLESLLSITADRNKTVVLTVAGYSYKDMLMSWVCRLRKLSIENFIVSALDQETYQFSIL 1464
Query: 139 GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
P + P+ S FG++ F T + +L IL+LGYNV+ +DVD W ++
Sbjct: 1465 QGIPVFKDPIAPSDISFDECHFGTKCFQRVTKVKSRIVLKILKLGYNVLLSDVDTYWFRN 1524
Query: 199 PFPYL 203
P P+L
Sbjct: 1525 PIPFL 1529
>gi|390356485|ref|XP_003728803.1| PREDICTED: uncharacterized protein LOC100893403 isoform 1
[Strongylocentrotus purpuratus]
gi|390356487|ref|XP_003728804.1| PREDICTED: uncharacterized protein LOC100893403 isoform 2
[Strongylocentrotus purpuratus]
Length = 359
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/176 (21%), Positives = 80/176 (45%), Gaps = 17/176 (9%)
Query: 92 NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH-AVLVPP 150
N +I+ +V+ ++ NWL S+ R + + ++AED T + R VL
Sbjct: 120 NKFVILTSVNAEFVDIAENWLESLRRLGIRYNITMVAEDQDTFKYFSIRANREFRVLYQK 179
Query: 151 APDSQTAHKFGSQGFF-NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
++G + RR ++ +LE G +V+ DVD VW+K+P + ++ +
Sbjct: 180 QYAFNFTRRYGPVMMYQELIRRRTVYIRTLLEQGNDVLLVDVDTVWIKNPVDLILKEYAL 239
Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAE 265
Y D+ + + + +P C C +Y++ T ++++W++ L+ +
Sbjct: 240 Y---DIWLAQ--GYENHIP----------CPCFMYMKSTSEVVAMVREWVKRLENQ 280
>gi|440803911|gb|ELR24794.1| hypothetical protein ACA1_174550 [Acanthamoeba castellanii str.
Neff]
Length = 458
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 71/167 (42%), Gaps = 15/167 (8%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV----IAEDYATLYKV 137
L Q VAK+G +I+ + YL L NW S+ R + V+V A + +
Sbjct: 35 LEQETRRVAKDGVVILALCNAGYLDLLLNWKASVDRLNITNYVIVPNDIKAAQELSFLGL 94
Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQG-----FFNFTSRRPCHLLHILELGYNVMYNDVD 192
+ + L A ++ F + + ++ ++ +L LG +V+ DVD
Sbjct: 95 DWAYDPTIGLGDLASSEAASYPFNKEDPMYKLWNGVVHKKAAYVWKLLSLGLDVLVTDVD 154
Query: 193 MVWLKDPFPYLQGDH-DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
+V+LKDP P D++F DD + + D PP G + +
Sbjct: 155 IVFLKDPLPLFSNKTIDLFFIDDTKS-----KTDDGQPPSLCGASVL 196
>gi|412993976|emb|CCO14487.1| glycosyltransferase family 77 protein [Bathycoccus prasinos]
Length = 790
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 20/139 (14%)
Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH-------------D 208
S+ + NF R H +LE G++V+ +D D+VWLK+P YL+ + D
Sbjct: 268 SKAWKNFAKMRISHATSLLEFGFDVVMSDADVVWLKNPEEYLKCEKVSEDGKVVENLSFD 327
Query: 209 VYFTDDMAAVKPLDHSHDLPPPG--KKGRTY-----ICSCMIYLRPTDGAKLVMKKWIEE 261
+ +++ A + S +L P + G Y + +++LR T G K+W
Sbjct: 328 IDGCEELKAADVIVSSDNLSPTSDERDGGNYAKGGVFNTGIVFLRHTKGGIQWAKQWNLH 387
Query: 262 LQAEPWSKAKKANDQPAFN 280
L A + +DQ FN
Sbjct: 388 LSATDGRFHRLTSDQQVFN 406
>gi|357484405|ref|XP_003612490.1| hypothetical protein MTR_5g025590 [Medicago truncatula]
gi|355513825|gb|AES95448.1| hypothetical protein MTR_5g025590 [Medicago truncatula]
Length = 628
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 85/201 (42%), Gaps = 13/201 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
K+ IIV + ++ F+ W+ ++ + LV A D L + W G V
Sbjct: 113 VKDNIIIVTFGNYAFMDFILTWVKQLNDLGVSN-YLVGAMDTKLLEAL--YWKGVPVFDM 169
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ S +GS F + + IL G ++ D DMVWLK+P PYL
Sbjct: 170 NSHMSTMDVGWGSPTFHKMGREKVILINSILPFGVELLMCDTDMVWLKNPLPYLARYPGA 229
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
DV + D +D S L + G Y + + RPT+ A + K+WIE L A+
Sbjct: 230 DVLTSSDQVIPTVVDDS--LEVWKEVGAAYNIG-IFHWRPTESAIKLAKEWIELLVAD-- 284
Query: 268 SKAKKANDQPAFNWALNKTAG 288
K DQ FN +++ G
Sbjct: 285 ---DKIWDQNGFNDIVHRELG 302
>gi|449528663|ref|XP_004171323.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
At4g15970-like [Cucumis sativus]
Length = 350
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 62/137 (45%), Gaps = 10/137 (7%)
Query: 79 DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
D L AA+ ++ T+I+ +++ + FL ++ I + D +++IA D
Sbjct: 74 DKVLKDAAT---EDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHLVIIALDX 130
Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
+ LV D ++ + S + RR L +LE+GYN ++ D
Sbjct: 131 KAFMRCLDIHIHCVSLVTEGVDFRSEAYYMSPDYLKMMWRRIDFLRTVLEMGYNFVFTDA 190
Query: 192 DMVWLKDPFPYLQGDHD 208
D++W +DPFP+ + D
Sbjct: 191 DVMWFRDPFPFFDINAD 207
>gi|357114730|ref|XP_003559148.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 393
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 49/102 (48%), Gaps = 16/102 (15%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELGY+ ++ DVD++W +DPF ++ D+ + D+ P D+ + P G
Sbjct: 224 ILELGYSFLFTDVDVMWFRDPFKHVTAYADMTVSSDVFLGDP-DNIGNFPNTG------- 275
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+++P + + K W E P AN+QP FN
Sbjct: 276 ---FFHVKPNNRTIAMTKVWHESRGKYP-----GANEQPVFN 309
>gi|357121249|ref|XP_003562333.1| PREDICTED: uncharacterized protein LOC100845970 [Brachypodium
distachyon]
Length = 614
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 87/202 (43%), Gaps = 13/202 (6%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
A++G I+V + +L F+ W ++ D +LV A D L ++ R G V
Sbjct: 98 ARDGVIVVTFGNYAFLDFILTWAHHLTALG-VDNLLVGAMDTKLLRELYLR--GVPVFDM 154
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ +GS F + + +L GY ++ D DMVWLK+P PYL D
Sbjct: 155 GSRMVTEDAGWGSPTFHKMGREKVLLINTLLPFGYELLMCDTDMVWLKNPLPYLARYPDA 214
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
D+ + D D S + ++ + + RPT+ AK + K W + + ++
Sbjct: 215 DLLTSSDQVIPTVTDDSLE---NWREVTGAFNIGIFHWRPTEPAKKLAKDWKDLVLSD-- 269
Query: 268 SKAKKANDQPAFNWALNKTAGQ 289
+ DQ AFN +K GQ
Sbjct: 270 ---DEIWDQNAFNDLAHKVFGQ 288
>gi|108705996|gb|ABF93791.1| regulatory protein, putative, expressed [Oryza sativa Japonica
Group]
Length = 339
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 21/184 (11%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
NGT+I+ ++ + FL ++ I + D +++++ D T ++ + H
Sbjct: 3 NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 61
Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
A+ D F + G+ RR L H+L GY+ ++ D D+VW ++P P+
Sbjct: 62 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 121
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
L D D D P D S+ P G Y+R T + W
Sbjct: 122 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 170
Query: 263 QAEP 266
+ P
Sbjct: 171 ERHP 174
>gi|18396892|ref|NP_564315.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
gi|334182926|ref|NP_001185108.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
gi|10764852|gb|AAG22832.1|AC007508_5 F1K23.9 [Arabidopsis thaliana]
gi|15027991|gb|AAK76526.1| unknown protein [Arabidopsis thaliana]
gi|20259205|gb|AAM14318.1| unknown protein [Arabidopsis thaliana]
gi|332192896|gb|AEE31017.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
gi|332192898|gb|AEE31019.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
Length = 340
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/236 (20%), Positives = 92/236 (38%), Gaps = 38/236 (16%)
Query: 51 VVLGVILPWTGTPGFMFPNATSS----------LAKWRDYTLSQAASFVAK-----NGTI 95
+ + V L + G F F + T S +W +Y + + + + K N T+
Sbjct: 10 LAVAVALLFAGALYFYFSSITVSDPMSDLLHNVETRWTEYPVDELEAVLDKAAMGNNKTV 69
Query: 96 IVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
I+ V++ Y+ FL ++ + D ++++A D + + R
Sbjct: 70 IIAMVNKAYVEEVEGGRTMLDLFLESFWEGEGTRPLLDHLMLVAADQTSYDRCLFRRLHC 129
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
+ D + + S+ F RR LL +L GYN+ + D D++WL+ PFP L
Sbjct: 130 YKMDTDGVDLEGEKVYMSKDFIEMMWRRTHLLLDVLSRGYNLTFTDTDVMWLRSPFPRLS 189
Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
+ + LD + G G I + ++R + + +KW +
Sbjct: 190 YN------------ESLDMQISVDSIGLVGGHLINTGFYHVRSNNKTISLFQKWYD 233
>gi|307107253|gb|EFN55496.1| hypothetical protein CHLNCDRAFT_57843 [Chlorella variabilis]
Length = 394
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 94/209 (44%), Gaps = 21/209 (10%)
Query: 78 RDYTLSQAASF---VAKNGTIIVCAVSQPYL---PFLNNWLISISRQKHQDQVLVIAEDY 131
RD T + A F VA N ++V ++ Y L W ++ R ++ +++ +D
Sbjct: 102 RDKTYPELAEFLKKVAVNNEVLVSVSNKNYAWPGGMLQLWAENVKRTGVKNAMVIALDDD 161
Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
+ P + V PDSQ GS + R L + ++LGY+V +DV
Sbjct: 162 TKSNAESFGLPAFRMDVK-IPDSQK--DVGSNHAVSALKFRI--LQNFMKLGYSVFLSDV 216
Query: 192 DMVWLKDPFPYLQGDHDVY-FTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCM 242
D+V+L++PF +L D DV TD D + D P G R ++ S +
Sbjct: 217 DIVFLQNPFEHLARDSDVEGMTDGWDHGTAYGYNDVADDPSMGWARYAHSMRIFVFNSGL 276
Query: 243 IYLRPTDGAKLVMKKWIEELQAE-PWSKA 270
YLRPT+ + ++ K I ++ E W +A
Sbjct: 277 FYLRPTNATQELLDKLIYRVETENGWDQA 305
>gi|242055411|ref|XP_002456851.1| hypothetical protein SORBIDRAFT_03g044050 [Sorghum bicolor]
gi|241928826|gb|EES01971.1| hypothetical protein SORBIDRAFT_03g044050 [Sorghum bicolor]
Length = 348
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 67/146 (45%), Gaps = 26/146 (17%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVP----PAPDSQTAHKFGSQGFFNFTSRRPCHLL 177
+ VLVIA D + P H L+ A + +A++F ++ F +
Sbjct: 111 NHVLVIAVDAGGFSRCKAVHP-HCYLLEVNKSTAANLSSANRFMTKEFLELVWLKLSFQQ 169
Query: 178 HILELGYNVMYNDVDMVWLKDPFPYLQGDHDV-----YFTDDMAAVKPLDHSHDLPPPGK 232
ILELGY+ ++ D DM+WL++PF ++ D+ YF D A PL ++
Sbjct: 170 RILELGYSFLFTDADMIWLRNPFRHISVYADMSLSTDYFRDTFA---PLSNT-------- 218
Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKW 258
+ + + Y+R T+ + V++ W
Sbjct: 219 -----LNTGLYYMRSTNRSIEVLRYW 239
>gi|159468740|ref|XP_001692532.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278245|gb|EDP04010.1| predicted protein [Chlamydomonas reinhardtii]
Length = 595
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 69/144 (47%), Gaps = 11/144 (7%)
Query: 124 VLVIAEDYATLYKVNGRWPGHAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
VL +A+D AT V + H +V D+ K+GS + T ++ + H+ +
Sbjct: 86 VLAVADD-ATAALVRSQGVDHCFVVHENEIDDTNATFKWGSNSWKLHTWQKVLVVRHVHQ 144
Query: 182 LGYNVMYNDVDMVWLKDPF-----PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRT 236
LG+NV+ +D+D+VW +DP Y + D+ V D + P+ D P G
Sbjct: 145 LGFNVINSDLDVVWRRDPLHHFLVKYPEPDYWVSM-DPITTRNPI--GDDGPEAGVTVHH 201
Query: 237 YICSCMIYLRPTDGAKLVMKKWIE 260
Y+ + + +LR T G ++ KW E
Sbjct: 202 YMNTGVYFLRQTPGGTALIDKWYE 225
>gi|449435412|ref|XP_004135489.1| PREDICTED: uncharacterized protein LOC101214056 [Cucumis sativus]
Length = 1693
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 55/113 (48%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN TI++ Y L +W+ + R + + ++ + + V P + +PP
Sbjct: 1419 KNKTIVLAIAGYSYKDMLMSWVCRLRRLQISNYLVCALDSDTYKFSVLQGLPVYRDPLPP 1478
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
S FG++ F T + +L IL+LGYNV+ +DVD+ W +P P++
Sbjct: 1479 TNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFMNPLPFI 1531
>gi|356503224|ref|XP_003520411.1| PREDICTED: uncharacterized protein At1g28695-like [Glycine max]
Length = 353
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 66/161 (40%), Gaps = 17/161 (10%)
Query: 79 DYTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLV 126
DY L++A+ N T+I+ V++ Y+ FL ++ + + D +L+
Sbjct: 62 DYALAKAS---MGNKTVIIAIVNKAYVEQDVESDTTMLDIFLGSFWLGEGTRSLIDHLLI 118
Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
+A D + L + + SQ F RR LL +L+ GYN
Sbjct: 119 VAVDQTAYNRCQFLRLNCFRLETDGVGFEGEKIYMSQDFIKMMWRRTQFLLEVLKRGYNF 178
Query: 187 MYNDVDMVWLKDPFPYLQGD--HDVYFTDDMAAVKPLDHSH 225
++ D D++WL++PF L + D + D P H
Sbjct: 179 VFTDTDVMWLRNPFIRLSKNETEDFQISTDSYLGNPWSEKH 219
>gi|21536907|gb|AAM61239.1| regulatory protein [Arabidopsis thaliana]
Length = 367
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/143 (22%), Positives = 58/143 (40%), Gaps = 7/143 (4%)
Query: 90 AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
KN T+I+ ++Q + FL ++ I Q+ V+V+ D + +
Sbjct: 101 TKNNTVIITTLNQAWAEPNSLFDLFLESFRIGRGTQQLLKHVVVVCLDIKAFQRCSQLHT 160
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+ D + + + R L +L++G+N ++ D D++WL+DPFP
Sbjct: 161 NCYHIETSETDFSGEKVYNTPDYLKMMWARIELLTQVLQMGFNFIFTDADIMWLRDPFPR 220
Query: 203 LQGDHDVYFTDDMAAVKPLDHSH 225
L D D D P D +
Sbjct: 221 LYPDGDFQMACDRFFGNPYDSDN 243
>gi|384244588|gb|EIE18088.1| hypothetical protein COCSUDRAFT_34634 [Coccomyxa subellipsoidea
C-169]
Length = 193
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 52/122 (42%), Gaps = 15/122 (12%)
Query: 159 KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAV 218
++GS GF R +LE G+ +++D+D VW ++P + D DD
Sbjct: 2 EYGSAGFDEMMCERLTFQRKVLEHGFTFLWSDMDTVWYQNPLDIMPKGFDFVGVDD---- 57
Query: 219 KPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPA 278
S+ P ++ +C C ++ RPT +K +K W + + +DQ A
Sbjct: 58 -----SYHGPKHLEQNTGNLCGCFMFWRPTQRSKDFLKDWYDNCAHQA------GDDQQA 106
Query: 279 FN 280
N
Sbjct: 107 LN 108
>gi|281203601|gb|EFA77798.1| WD40 repeat-containing protein [Polysphondylium pallidum PN500]
Length = 647
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/96 (25%), Positives = 50/96 (52%), Gaps = 1/96 (1%)
Query: 109 NNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNF 168
NN + SI+++ + + + D +++ + G+ + + A +G+ GF
Sbjct: 118 NNAMNSIAKEAVE-RPMTSVNDMMSMFGGSAGGSGNTIASNDTVYGEEAESYGNVGFRAI 176
Query: 169 TSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
+ +P +L +L+ GYNV++ D D+VWL +PF +
Sbjct: 177 CNEKPLVVLDVLKRGYNVLWTDTDIVWLGEPFAAFK 212
>gi|357439649|ref|XP_003590102.1| hypothetical protein MTR_1g044340 [Medicago truncatula]
gi|355479150|gb|AES60353.1| hypothetical protein MTR_1g044340 [Medicago truncatula]
Length = 360
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 71/173 (41%), Gaps = 17/173 (9%)
Query: 68 PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP-------------FLNNWLIS 114
N S D L++A+ + T+I+ +++ Y+ FL+++ +
Sbjct: 54 TNINESYRDGLDKALAKAS--MRNYKTVIIAIINKAYVEQDVKGDAITMFDLFLSSFWLG 111
Query: 115 ISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPC 174
+ D +L++A D + L D F S+ F N RR
Sbjct: 112 EGTRSLIDNLLIVAVDQTAYDRCQFLRLNCYKLETDGVDFGGEKLFMSKDFINMMWRRTF 171
Query: 175 HLLHILELGYNVMYNDVDMVWLKDPFPYLQGD--HDVYFTDDMAAVKPLDHSH 225
LL +L+ GYN ++ D D++WL++PF L + D+ + D+ P H
Sbjct: 172 FLLEVLKRGYNFIFTDTDVMWLRNPFEKLSNNETEDLQISTDLYLDDPWSEKH 224
>gi|6063544|dbj|BAA85404.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 21/184 (11%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
NGT+I+ ++ + FL ++ I + D +++++ D T ++ + H
Sbjct: 3 NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 61
Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
A+ D F + G+ RR L H+L GY+ ++ D D+VW ++P P+
Sbjct: 62 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 121
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
L D D D P D S+ P G Y+R T + W
Sbjct: 122 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 170
Query: 263 QAEP 266
+ P
Sbjct: 171 ERHP 174
>gi|424513677|emb|CCO66299.1| predicted protein [Bathycoccus prasinos]
Length = 750
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/210 (21%), Positives = 88/210 (41%), Gaps = 25/210 (11%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG--HAV 146
+AK+ TIIV + YL F NW+ + + ++ A D + + G H
Sbjct: 126 IAKDNTIIVTWANHHYLDFARNWINHVQNRLGLSNFIIGAMDEKMYESLKEEFSGGVHTW 185
Query: 147 LVPPAPDSQTAHK----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
L+ S+ A K +G++ F + + G NV+ +D+D+VWL++P P+
Sbjct: 186 LMGSQGISKEAVKNDFGWGTKNFHQMGRDKIRLIRDFTRSGVNVLVSDIDVVWLRNPLPF 245
Query: 203 LQG--DHDVYFTDDMAAVKPLDHS---HDLPPPGKKGRTYICSC-----MIYLRPTDGAK 252
+ DV + D + + S G+ +IC +++ + G +
Sbjct: 246 FKRYPVADVLVSSDQLRSETMIESLKQKKFMVDGEGLEFHICHAASNIGIMWFLASRGNQ 305
Query: 253 LVMKKWIEELQAEP--WSKAKKANDQPAFN 280
+ +W++ ++ + W DQ AFN
Sbjct: 306 ELTTEWVDRIEKDDNLW-------DQSAFN 328
>gi|21554540|gb|AAM63606.1| unknown [Arabidopsis thaliana]
Length = 428
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 85/205 (41%), Gaps = 21/205 (10%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA N IIV + P L + S+ R Q+ ++V +D + VL
Sbjct: 153 VAVNKEIIVVLANSNVKPMLELQIASVKRVGIQNYLIVALDDSMESFC----ESKEVVLY 208
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
PD S G + + L L+LGY+V+ +DVD+V+L++PF +L D D
Sbjct: 209 KRDPDKAVDMVGKSGGNHAVSGLKFRVLREFLQLGYSVLLSDVDIVFLQNPFSHLHRDSD 268
Query: 209 VY-FTD--DMAAVKPLDHSHDLPPPG--KKGRT----YICSCMIYLRPTDGAKLVMKKWI 259
V +D D + D P G + T S YLRPT + ++ +
Sbjct: 269 VESMSDGHDNNTAYGFNDVFDEPSMGWARYAHTMRIWVFNSGFFYLRPTLPSIDLLDRVA 328
Query: 260 EEL-QAEPWSKAKKANDQPAFNWAL 283
+ L ++E W DQ FN L
Sbjct: 329 DTLSKSEAW-------DQAVFNEQL 346
>gi|357137060|ref|XP_003570119.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 389
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 48/102 (47%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
F++++ I +++IA D + P LV D +F + G+
Sbjct: 143 FIDSFRRGIRTNSLLKHLVIIAFDRTAYRRCTEIHPYCFALVTDDVDFSQEKRFLTAGYL 202
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
+R L +LE GY+ +++D D++W ++PFPYL D D
Sbjct: 203 ELMWKRLDFLRLVLEKGYSFIFSDADVMWFRNPFPYLYPDGD 244
>gi|218192017|gb|EEC74444.1| hypothetical protein OsI_09837 [Oryza sativa Indica Group]
Length = 273
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 21/184 (11%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
NGT+I+ ++ + FL ++ I + D +++++ D T ++ + H
Sbjct: 3 NGTVILTTLNSAWAEPGSVVDVFLESFRIGDDTRWLLDHLVMVSLDL-TAHRRCLQIHRH 61
Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
A+ D F + G+ RR L H+L GY+ ++ D D+VW ++P P+
Sbjct: 62 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 121
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
L D D D P D S+ P G Y+R T + W
Sbjct: 122 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 170
Query: 263 QAEP 266
+ P
Sbjct: 171 ERHP 174
>gi|23306127|gb|AAN17394.1| Putataive InsB from Escherichia coli [Oryza sativa Japonica Group]
Length = 517
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 21/184 (11%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
NGT+I+ ++ + FL ++ I + D +++++ D T ++ + H
Sbjct: 181 NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 239
Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
A+ D F + G+ RR L H+L GY+ ++ D D+VW ++P P+
Sbjct: 240 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 299
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
L D D D P D S+ P G Y+R T + W
Sbjct: 300 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 348
Query: 263 QAEP 266
+ P
Sbjct: 349 ERHP 352
>gi|356524014|ref|XP_003530628.1| PREDICTED: uncharacterized protein LOC100788000 [Glycine max]
Length = 770
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 54/114 (47%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
K T+I+ Y L +W+ + + ++ V+ + + + + P + P
Sbjct: 285 KTKTVILTVAGYSYKDMLMSWVCRLRKLSIENFVVCALDKETSQFSILQGIPVFTDPIAP 344
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
+ S FG++ F T + +L IL+LGYNV+ +DVD+ W K+P P L
Sbjct: 345 SNISFDDCHFGTKCFQRVTKVKSRIVLKILKLGYNVLLSDVDVYWFKNPVPLLH 398
>gi|326498413|dbj|BAJ98634.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 375
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/198 (22%), Positives = 83/198 (41%), Gaps = 26/198 (13%)
Query: 92 NGTIIVCAVSQPYLPFLNNWLISISRQKHQD---------QVLVIAEDYATLYKVNGRWP 142
+GT+I+ +V++ + L+ + R D L++A D L P
Sbjct: 102 DGTVIITSVNEAFAR--PGSLLDLFRGSFHDGEGIAHLLNHTLIVAADPGALALCKAVHP 159
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+L A +A+ F ++ + + H+L+LGYN +Y D+D++WL++PF +
Sbjct: 160 HCYLLQVMAAGVSSANGFLTRSYLELVWSKLTFQHHVLQLGYNYLYTDLDVLWLRNPFRH 219
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
+ D+ + D + + DL G Y+R T+ ++ +W
Sbjct: 220 ISIYADMAISTD----RFNGGAEDLKNAPNTG-------FYYVRSTNRTVEMLSRW---- 264
Query: 263 QAEPWSKAKKANDQPAFN 280
+A KA+DQ F
Sbjct: 265 RAARSRFRPKAHDQEVFE 282
>gi|302822869|ref|XP_002993090.1| RRA1, glycosyltransferase CAZy family GT77-like protein
[Selaginella moellendorffii]
gi|300139090|gb|EFJ05838.1| RRA1, glycosyltransferase CAZy family GT77-like protein
[Selaginella moellendorffii]
Length = 431
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 73/177 (41%), Gaps = 16/177 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L+Q +A N +IV ++ P L W SI + + ++V +D + +
Sbjct: 149 LAQLLEKIAINREVIVGVSNKNVAPMLQVWFESIKQSGITNYLVVALDDETAKFCKDHDV 208
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
P + D+ G + S H+L L LGY+V+ +DVD+V+L++PF
Sbjct: 209 PAYR------KDATIPKSLAGTGDNHAISGTKFHILREFLVLGYSVLLSDVDIVYLQNPF 262
Query: 201 PYLQGDHDVY-FTDDMAAVKPLDHSHDLPPPGKKGRTY--------ICSCMIYLRPT 248
+LQ D D+ TD ++ P Y S + YLRPT
Sbjct: 263 KFLQRDCDIEAMTDGHTNATAYGYNDVFDDPKMGWSRYAHTMRIWVFNSGLFYLRPT 319
>gi|224136390|ref|XP_002326848.1| predicted protein [Populus trichocarpa]
gi|222835163|gb|EEE73598.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 12/124 (9%)
Query: 92 NGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
N T+I+ V+Q Y+ FL+++ + + D +LV+A D
Sbjct: 22 NKTVIITVVNQAYVEQSVDAETTMLDLFLDSFWLGEDTRPLLDHLLVVAVDQIAYEMCFF 81
Query: 140 RWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
+ L D + SQ F N RR LL +L+ GYN ++ D D++WL++P
Sbjct: 82 KGLNCYKLETEGVDFGGEKIYMSQDFINMMWRRTLFLLDVLKRGYNFIFTDTDVMWLRNP 141
Query: 200 FPYL 203
L
Sbjct: 142 LSRL 145
>gi|296088016|emb|CBI35299.3| unnamed protein product [Vitis vinifera]
Length = 547
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 59/133 (44%), Gaps = 4/133 (3%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ I + ++ + +++IA D + + P L D + + +
Sbjct: 405 FLESFHIGNNTKRLLNHLVIIALDQKSYARCLALHPLCYALKTEGVDFSGEAYYSTPNYL 464
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
RR L IL +GY+ ++ D D++W +DPF + D D T D P D ++
Sbjct: 465 EMMWRRIDFLRSILTMGYSFIFTDADIMWFRDPFQHFFQDADFQITCDSYIGNPYDVNN- 523
Query: 227 LPPPGKKGRTYIC 239
P G G TY+C
Sbjct: 524 -RPNG--GFTYMC 533
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/183 (22%), Positives = 73/183 (39%), Gaps = 17/183 (9%)
Query: 41 TTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSL------AKWRD-YTLSQAASFVA-KN 92
TTLL + + V + P F F N+ S AK ++ Y L + A +
Sbjct: 15 TTLLRRIVKITVFTA--AFVVVPCFFFYNSAFSFQFLPISAKGKEGYKLDKILKNAAMGD 72
Query: 93 GTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
T+I+ V++ + FL ++ I + Q+ + +++I D + P
Sbjct: 73 KTVILTTVNEAWAANNSLLDLFLESFRIGNNTQRLLNHLVIITLDPKAYARCTTLHPHCY 132
Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
L D F S + RR L +L++ YN ++ D D++W +DPF
Sbjct: 133 ALKTKEMDFSKEAFFMSHDYLEMMWRRIDFLRSVLKMRYNFIFTDADIMWFRDPFQRFDS 192
Query: 206 DHD 208
D
Sbjct: 193 KAD 195
>gi|297737271|emb|CBI26472.3| unnamed protein product [Vitis vinifera]
Length = 272
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/120 (24%), Positives = 56/120 (46%), Gaps = 8/120 (6%)
Query: 92 NGTIIVCAVSQPYLP--------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
+ T+I+ +++ Y+ FL+ + + + D +L++A D +L +
Sbjct: 3 DKTVILAMINKAYVDGDKTMLDLFLDGFWLGEGTRGLLDHLLLVAVDQTSLERCKFLHLH 62
Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
L D + S+ F RR L +L+ GYN ++ D+D++WL++PFP L
Sbjct: 63 CYKLETEGVDFSGEKMYLSEDFMKMMWRRTLFLGEVLKKGYNFIFTDIDIMWLRNPFPRL 122
>gi|359489978|ref|XP_002270380.2| PREDICTED: uncharacterized protein At1g28695 [Vitis vinifera]
Length = 341
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 83/193 (43%), Gaps = 23/193 (11%)
Query: 56 ILPWT---GTPGFMFPN-----ATSSLAKWRDYTLSQAASFVAK----NGTIIVCAVSQP 103
+ WT P F F N +TS + + D ++ ++K + T+I+ +++
Sbjct: 24 VFVWTPSLNKPFFPFQNYQCRSSTSKMMESGDSHTNELERALSKASMADKTVILAMINKA 83
Query: 104 YLP--------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQ 155
Y+ FL+ + + + D +L++A D +L + L D
Sbjct: 84 YVDGDKTMLDLFLDGFWLGEGTRGLLDHLLLVAVDQTSLERCKFLHLHCYKLETEGVDFS 143
Query: 156 TAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF---T 212
+ S+ F RR L +L+ GYN ++ D+D++WL++PFP L + V T
Sbjct: 144 GEKMYLSEDFMKMMWRRTLFLGEVLKKGYNFIFTDIDIMWLRNPFPRLTLNQSVDLQIST 203
Query: 213 DDMAAVKPLDHSH 225
DD + + +H
Sbjct: 204 DDFNGDEWSESNH 216
>gi|242091071|ref|XP_002441368.1| hypothetical protein SORBIDRAFT_09g025390 [Sorghum bicolor]
gi|241946653|gb|EES19798.1| hypothetical protein SORBIDRAFT_09g025390 [Sorghum bicolor]
Length = 740
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 58/126 (46%), Gaps = 3/126 (2%)
Query: 80 YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL-YKV 137
Y+L VA KN ++++ Y L +W+ + R +V A D+ T + V
Sbjct: 439 YSLGMLLELVADKNRSVVLGVAGAGYRDMLMSWVCRL-RHLRVTNFIVCAVDHETYEFSV 497
Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
P + P S FG++ F T + +L IL LGYNV+ +DVD+ W
Sbjct: 498 LQGLPVFIDPLSPKNVSIDDCHFGTKCFQQVTKVKSRIVLEILRLGYNVLLSDVDVYWFD 557
Query: 198 DPFPYL 203
+P P+L
Sbjct: 558 NPMPFL 563
>gi|296080952|emb|CBI18645.3| unnamed protein product [Vitis vinifera]
Length = 677
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 1/125 (0%)
Query: 80 YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVN 138
++L S +A KN TI++ Y L +W+ + + V+ + + +
Sbjct: 329 FSLESLLSVIADKNKTIVLAVAGYSYKDMLMSWVCRLRSLLITNFVVCALDHDVYQFSLL 388
Query: 139 GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
P + P+ S FG++ F T + +L IL+LGYNV+ +DVD+ W K+
Sbjct: 389 QGLPVFEDPLAPSDISFDDCHFGTKCFQRVTKSKSRLVLQILKLGYNVLMSDVDVYWFKN 448
Query: 199 PFPYL 203
P P L
Sbjct: 449 PLPLL 453
>gi|224099643|ref|XP_002311563.1| predicted protein [Populus trichocarpa]
gi|222851383|gb|EEE88930.1| predicted protein [Populus trichocarpa]
Length = 277
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 59/135 (43%), Gaps = 10/135 (7%)
Query: 81 TLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYAT 133
L +AA ++ T+I+ +++ + FL ++ I ++ D ++++A D
Sbjct: 5 VLKEAA---MEDKTVIIATLNEAWAAPNTVIDLFLESFRIGQGTRRLLDHLVIVALDEKA 61
Query: 134 LYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
+ LV D F + + RR L +LE+GYN ++ D D+
Sbjct: 62 YRRCMELHTHCFALVTQGLDFHDEAYFMTHVYLEMMWRRIDFLRSLLEMGYNFVFTDADI 121
Query: 194 VWLKDPFPYLQGDHD 208
+W +DPFP D D
Sbjct: 122 MWFRDPFPRFFLDAD 136
>gi|302787072|ref|XP_002975306.1| RRA1, glycosyltransferase CAZy family GT77-like protein
[Selaginella moellendorffii]
gi|300156880|gb|EFJ23507.1| RRA1, glycosyltransferase CAZy family GT77-like protein
[Selaginella moellendorffii]
Length = 431
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 73/177 (41%), Gaps = 16/177 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L+Q +A N +IV ++ P L W SI + + ++V +D + +
Sbjct: 149 LAQLLEKIAINRELIVGVSNKNVAPMLQVWFESIKQSGITNYLVVALDDETAKFCKDHDV 208
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
P + D+ G + S H+L L LGY+V+ +DVD+V+L++PF
Sbjct: 209 PAYR------KDATIPKSLAGTGDNHAISGTKFHILREFLVLGYSVLLSDVDIVYLQNPF 262
Query: 201 PYLQGDHDVY-FTDDMAAVKPLDHSHDLPPPGKKGRTY--------ICSCMIYLRPT 248
+LQ D DV TD ++ P Y S + YLRPT
Sbjct: 263 KFLQRDCDVEAMTDGHTNATAYGYNDVFDDPKMGWSRYAHTMRIWVFNSGLFYLRPT 319
>gi|224120464|ref|XP_002318336.1| predicted protein [Populus trichocarpa]
gi|222859009|gb|EEE96556.1| predicted protein [Populus trichocarpa]
Length = 329
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 70/158 (44%), Gaps = 11/158 (6%)
Query: 90 AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
K+ T+I+ ++Q + FL ++ + + QK + +++I+ D + P
Sbjct: 63 TKHKTVILTTLNQAWAEPGSIFDLFLESFQVGDNTQKLVNNLIIISMDQKAHARCLAIHP 122
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
L + + F S+ + RR L +LE+GY+ ++ D D+VWL++PFP
Sbjct: 123 HCYALRTEGLNFSSEAYFMSEEYLKMMWRRIEFLGTVLEMGYSFVFTDADIVWLRNPFPR 182
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
D D P D ++ P G G TY+ S
Sbjct: 183 FYPKVDFQIACDNYYGNPEDKNN--RPNG--GFTYVRS 216
>gi|359496284|ref|XP_002271910.2| PREDICTED: uncharacterized protein LOC100242526 [Vitis vinifera]
Length = 874
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 1/125 (0%)
Query: 80 YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVN 138
++L S +A KN TI++ Y L +W+ + + V+ + + +
Sbjct: 432 FSLESLLSVIADKNKTIVLAVAGYSYKDMLMSWVCRLRSLLITNFVVCALDHDVYQFSLL 491
Query: 139 GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
P + P+ S FG++ F T + +L IL+LGYNV+ +DVD+ W K+
Sbjct: 492 QGLPVFEDPLAPSDISFDDCHFGTKCFQRVTKSKSRLVLQILKLGYNVLMSDVDVYWFKN 551
Query: 199 PFPYL 203
P P L
Sbjct: 552 PLPLL 556
>gi|297721775|ref|NP_001173251.1| Os03g0129350 [Oryza sativa Japonica Group]
gi|255674179|dbj|BAH91979.1| Os03g0129350 [Oryza sativa Japonica Group]
Length = 383
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 21/184 (11%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
NGT+I+ ++ + FL ++ I + D +++++ D T ++ + H
Sbjct: 90 NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 148
Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
A+ D F + G+ RR L H+L GY+ ++ D D+VW ++P P+
Sbjct: 149 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 208
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
L D D D P D S+ P G Y+R T + W
Sbjct: 209 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 257
Query: 263 QAEP 266
+ P
Sbjct: 258 ERHP 261
>gi|222624136|gb|EEE58268.1| hypothetical protein OsJ_09274 [Oryza sativa Japonica Group]
Length = 360
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 21/184 (11%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
NGT+I+ ++ + FL ++ I + D +++++ D T ++ + H
Sbjct: 90 NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 148
Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
A+ D F + G+ RR L H+L GY+ ++ D D+VW ++P P+
Sbjct: 149 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 208
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
L D D D P D S+ P G Y+R T + W
Sbjct: 209 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 257
Query: 263 QAEP 266
+ P
Sbjct: 258 ERHP 261
>gi|255561745|ref|XP_002521882.1| pentatricopeptide repeat-containing protein, putative [Ricinus
communis]
gi|223538920|gb|EEF40518.1| pentatricopeptide repeat-containing protein, putative [Ricinus
communis]
Length = 394
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 71/158 (44%), Gaps = 18/158 (11%)
Query: 65 FMFPNA--TSSLAKWRDYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLIS 114
++FP+A T SL + L + A K+ T+I+ +++ + FL ++ +
Sbjct: 81 YIFPSAVDTDSLPVSNEQKLEKVLKEAAMKDKTVILTTLNEAWAAPNSVIDLFLASFRLG 140
Query: 115 ISRQKHQDQVLVIAEDYATLYKVNGRW---PGHA-VLVPPAPDSQTAHKFGSQGFFNFTS 170
+K + +++IA D K N R H L+ D F + +
Sbjct: 141 EHTRKLLNHLVIIALDQ----KANARCIQVHDHCFALLTDGIDFSNEAYFMTPAYLKMMW 196
Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
RR L +LE+GYN ++ D D++W +DPFP D D
Sbjct: 197 RRIDFLRSVLEMGYNFVFTDADIMWFRDPFPRFYSDAD 234
>gi|356503226|ref|XP_003520412.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
At1g28695-like [Glycine max]
Length = 339
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/141 (24%), Positives = 61/141 (43%), Gaps = 15/141 (10%)
Query: 79 DYTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLV 126
D L++A+ N T+I+ V++ Y+ FL ++ + + D +L+
Sbjct: 48 DTALAKAS---MGNKTVIIAVVNKAYVDQDVESDTTMLDIFLGSFWLGEGTRSLIDHLLL 104
Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
+A D + L D + + SQ F RR LL +L+ GYN
Sbjct: 105 VAVDQTAYDRCQFLKLNCFKLETDGVDFKGEKIYMSQDFIKMMWRRTFFLLEVLKRGYNF 164
Query: 187 MYNDVDMVWLKDPFPYLQGDH 207
++ D D++WL++PF L +
Sbjct: 165 VFTDTDVMWLRNPFTRLSKNE 185
>gi|300176044|emb|CBK23355.2| unnamed protein product [Blastocystis hominis]
Length = 367
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 55/131 (41%), Gaps = 24/131 (18%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D+ TA +GS+ F N + L + V+Y D D + KDPFP L Y
Sbjct: 41 DTSTASSYGSRAFRNKVHWKLIMLQQAVNQNVRVLYMDSDNILFKDPFPVLNS----YNG 96
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
D A + +D IC+ IYL PT K ++ K IE P K
Sbjct: 97 YDFIAQRDVD---------------ICTGFIYLMPTLMTKQLLAKTIE---IRP--KLLN 136
Query: 273 ANDQPAFNWAL 283
A+DQ AFN +
Sbjct: 137 ADDQKAFNMVV 147
>gi|443697523|gb|ELT97963.1| hypothetical protein CAPTEDRAFT_218172 [Capitella teleta]
Length = 509
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 86/196 (43%), Gaps = 30/196 (15%)
Query: 90 AKNGTIIVCAVSQPYLPF-LNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
A N T++V V Y F +N + SI +Q ++ L I D +++ R A++
Sbjct: 97 AINSTVVVSIVDSDYFSFAVNFYQFSIVKQDIRN-FLAICLDDVVSQQLSARGIPCALVN 155
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ-GDH 207
A +G++ ++ T+ + +L +L Y+V+ D+D+ +DP+P+ +
Sbjct: 156 VSLNIGSGASDYGAKSYYQKTNLKTYIMLELLRHKYSVLLTDLDVTLFRDPWPHFTCTEC 215
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGA-KLVMKKWIEELQAEP 266
D++F D R + S ++ RPT G+ +L K W +
Sbjct: 216 DLHFQMD--------------------RVLLNSGFVFARPTPGSIQLYSKAW------QY 249
Query: 267 WSKAKKANDQPAFNWA 282
+ + KA+DQ N A
Sbjct: 250 YVQYNKAHDQAYINMA 265
>gi|356522974|ref|XP_003530117.1| PREDICTED: uncharacterized protein At1g28695-like [Glycine max]
Length = 371
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/170 (21%), Positives = 68/170 (40%), Gaps = 14/170 (8%)
Query: 70 ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISR 117
T+++ + D + A +N T+I+ V++ Y+ FL ++ +
Sbjct: 50 TTTNVEAYGDGLDTALAKTSMENKTVIIAIVNKAYVEQDVESDTTMLDIFLGSFWLGEGT 109
Query: 118 QKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL 177
+ D +L++ D + L D + + SQ F RR LL
Sbjct: 110 RSLIDHLLIVTVDRTAYDRCQFLRLNCFRLETDGVDFEGEKIYMSQDFIKMMWRRTRFLL 169
Query: 178 HILELGYNVMYNDVDMVWLKDPFPYLQGD--HDVYFTDDMAAVKPLDHSH 225
+L+ GYN ++ D D++WL++PF L + D + D P H
Sbjct: 170 EVLKRGYNFVFTDTDVMWLRNPFTRLSKNETEDFQISTDTYLGDPWSEKH 219
>gi|326514932|dbj|BAJ99827.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 407
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 16/102 (15%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELGY ++ DVD+VWL+DPF ++ D+ + D+ P D+ + P G
Sbjct: 233 ILELGYGFLFTDVDIVWLRDPFKHVTAYADMTVSSDVYFGDP-DNLGNFPNTG------- 284
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+++P + K W P AN+QP FN
Sbjct: 285 ---FFHVKPNARTIAMTKLWHGGRGKYP-----GANEQPVFN 318
>gi|255539963|ref|XP_002511046.1| pentatricopeptide repeat-containing protein, putative [Ricinus
communis]
gi|223550161|gb|EEF51648.1| pentatricopeptide repeat-containing protein, putative [Ricinus
communis]
Length = 360
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 67/151 (44%), Gaps = 10/151 (6%)
Query: 70 ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP--------FLNNWLISISRQKHQ 121
TS + RD S A N T+I+ V++ Y+ FL+++ + +
Sbjct: 67 TTSPVTSHRDALESGLAEASTANKTLIIAMVNKAYVEGDKPMLDMFLDSFWLGEDTRDLI 126
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHK-FGSQGFFNFTSRRPCHLLHIL 180
+ +L++A D T Y+ H + + K + S F RR L IL
Sbjct: 127 NHLLLVAVD-QTAYERCKFLRLHCYKLETDGVAFDGEKVYMSDDFIKMMWRRTLLLGDIL 185
Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
+ GYN ++ D D++WL++PFP L D V F
Sbjct: 186 KRGYNFIFTDTDVMWLRNPFPKLVLDGSVDF 216
>gi|449015632|dbj|BAM79034.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 791
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 61/121 (50%), Gaps = 10/121 (8%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQK-HQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA-P 152
+I+ A + Y L N++ ++ R + H+ V + ED LY+ G AV + PA
Sbjct: 519 VIMTAATYEYRFLLMNFVCNLRRLRIHKLLVAALDED---LYRY-AYARGLAVYLEPALI 574
Query: 153 DSQTAHK----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
DSQ H FGS F + + + H+ IL G++V+++DVD+ W +D P L D
Sbjct: 575 DSQYRHSLQCAFGSACFRHRSKLKSRHVYEILRRGHDVLWSDVDITWFRDVRPELLRARD 634
Query: 209 V 209
Sbjct: 635 A 635
>gi|326492838|dbj|BAJ90275.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326494142|dbj|BAJ85533.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 416
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 16/102 (15%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELGY ++ DVD+VWL+DPF ++ D+ + D+ P D+ + P G
Sbjct: 242 ILELGYGFLFTDVDIVWLRDPFKHVTAYADMTVSSDVYFGDP-DNLGNFPNTG------- 293
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+++P + K W P AN+QP FN
Sbjct: 294 ---FFHVKPNARTIAMTKLWHGAKGKYP-----GANEQPVFN 327
>gi|18410771|ref|NP_565102.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
gi|12323891|gb|AAG51917.1|AC013258_11 unknown protein; 7482-9019 [Arabidopsis thaliana]
gi|14517393|gb|AAK62587.1| At1g75110/F9E10_4 [Arabidopsis thaliana]
gi|20857366|gb|AAM26715.1| At1g75110/F9E10_4 [Arabidopsis thaliana]
gi|332197552|gb|AEE35673.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
Length = 428
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 84/205 (40%), Gaps = 21/205 (10%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA N IIV + P L + S+ R Q+ ++V +D + V
Sbjct: 153 VAVNKEIIVVLANSNVKPMLELQIASVKRVGIQNYLIVALDDSMESFC----ESKEVVFY 208
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
PD S G + + L L+LGY+V+ +DVD+V+L++PF +L D D
Sbjct: 209 KRDPDKAVDMVGKSGGNHAVSGLKFRVLREFLQLGYSVLLSDVDIVFLQNPFSHLHRDSD 268
Query: 209 VY-FTD--DMAAVKPLDHSHDLPPPG--KKGRT----YICSCMIYLRPTDGAKLVMKKWI 259
V +D D + D P G + T S YLRPT + ++ +
Sbjct: 269 VESMSDGHDNNTAYGFNDVFDEPSMGWARYAHTMRIWVFNSGFFYLRPTIPSIDLLDRVA 328
Query: 260 EEL-QAEPWSKAKKANDQPAFNWAL 283
+ L ++E W DQ FN L
Sbjct: 329 DTLSKSEAW-------DQAVFNEQL 346
>gi|255073807|ref|XP_002500578.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
gi|226515841|gb|ACO61836.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
Length = 659
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 6/129 (4%)
Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD-MAAVK 219
G + F + + + +LELG + D D+VWLKDP Y + Y T D + +
Sbjct: 123 GDKSFKKMGTVKTKFIQDLLELGIAPILTDADVVWLKDPRSYFK--RGTYITADVLVSTD 180
Query: 220 PLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAF 279
+D D + +++ RPTD AK ++ W ++ S DQPAF
Sbjct: 181 CIDVPADRKDNNGCSHVNFNTGVLHFRPTDAAKAFVQTWKTKVAT---STIAWMRDQPAF 237
Query: 280 NWALNKTAG 288
N ++ G
Sbjct: 238 NLITHEGVG 246
>gi|357131573|ref|XP_003567411.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 357
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 86/185 (46%), Gaps = 28/185 (15%)
Query: 86 ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQD---------QVLVIAEDYATLYK 136
A ++ T+I+ +V++ + N L+ + RQ ++ VLV+A D A
Sbjct: 79 ARVAMEDRTVIITSVNEAWAR--NGSLLDLYRQSFKNGEDTEHLLNHVLVVALDPAGFRH 136
Query: 137 VNGRWPGHAVLVPPAPDSQT-AHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
N P H L+ D+ T A +F S+ + + + +LELGYN ++ D DM+
Sbjct: 137 CNIVHP-HCYLLGATNDNFTSAAQFMSKEYLDLVWTKLSLQRRVLELGYNFLFTDTDMIV 195
Query: 196 LKDPFPYLQGDHDVYFT-DDMAAVK-PLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKL 253
L++PF ++ D+ + D +A + PLD+ + + Y++ T+ +
Sbjct: 196 LRNPFRHITVHADMSVSCDSFSATRAPLDNR-------------VNTGFYYMKATNRSME 242
Query: 254 VMKKW 258
+++ W
Sbjct: 243 LLRYW 247
>gi|242038241|ref|XP_002466515.1| hypothetical protein SORBIDRAFT_01g009150 [Sorghum bicolor]
gi|241920369|gb|EER93513.1| hypothetical protein SORBIDRAFT_01g009150 [Sorghum bicolor]
Length = 348
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 69/154 (44%), Gaps = 14/154 (9%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
N T+I+ +++ Y FL + ++ D VL +A D + G
Sbjct: 77 NKTLILTMLNEAYAEEGGLLDLFLESLREGDGTEQLIDHVLFVAMDQQAFRRCRSLGGGV 136
Query: 145 AVLVPPAPDSQ-----TAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
+ A DSQ + + S GF RR L +L+ GY+ ++ D+D++WL++P
Sbjct: 137 KCYLLRAVDSQQGDLSSEQLYMSDGFIRMMWRRIRFLGDVLKHGYSFIFTDMDVMWLRNP 196
Query: 200 FPYL-QGD-HDVYFTDDMAAVKPLDHSHDLPPPG 231
FP L +G+ D+ + D P D++ + G
Sbjct: 197 FPKLDRGEGEDLLISSDKFNGVPHDYAGNELNTG 230
>gi|51451344|gb|AAU03099.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 765
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 51/114 (44%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN ++++ Y L +W+ + R + + V+ + + V P + P
Sbjct: 477 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 536
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
S FG+Q F T + +L IL LGYNV+ +DVD+ W +P +L
Sbjct: 537 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVYWFHNPVSFLH 590
>gi|308813097|ref|XP_003083855.1| unnamed protein product [Ostreococcus tauri]
gi|116055737|emb|CAL57822.1| unnamed protein product [Ostreococcus tauri]
Length = 588
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 12/108 (11%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQG---DHDVYFTDDMAAVKPLDHSHDLPPPGK 232
L+ ILE G++V+ +D D+ WL+DP +++ D DV + D V D
Sbjct: 113 LVDILERGHDVLVSDADVAWLRDPEDWVRTALRDVDVAASTDCLDVSADDVGGCWGAAAN 172
Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
G ++Y RPTD AK + WI + ++ DQ FN
Sbjct: 173 TG-------ILYFRPTDAAKTFLGNWIAAMDKATDDMTER--DQEIFN 211
>gi|148906908|gb|ABR16599.1| unknown [Picea sitchensis]
Length = 422
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 69/162 (42%), Gaps = 11/162 (6%)
Query: 79 DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
D L++ A+ N T+I+ A++ + FL ++ + + + +L++A D
Sbjct: 143 DRVLAKTAN---SNRTVIITALNVAWAEPNTMIDLFLESFRVGEGTPELLNNLLIVALDA 199
Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
+ P L D + S + RR L IL+ GY+ +++D
Sbjct: 200 KAYDRCLEIHPHCYTLKTRGVDFSAEKLYMSDDYLKMMWRRLGFLADILKRGYSFVFSDA 259
Query: 192 DMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKK 233
D++WL++PF D D+ D P D H++P G K
Sbjct: 260 DIMWLRNPFTRFSPDADIQIASDQYNGSPYD-VHNMPNGGYK 300
>gi|222632211|gb|EEE64343.1| hypothetical protein OsJ_19183 [Oryza sativa Japonica Group]
Length = 691
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 51/114 (44%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN ++++ Y L +W+ + R + + V+ + + V P + P
Sbjct: 403 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 462
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
S FG+Q F T + +L IL LGYNV+ +DVD+ W +P +L
Sbjct: 463 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVYWFHNPVSFLH 516
>gi|297724249|ref|NP_001174488.1| Os05g0513700 [Oryza sativa Japonica Group]
gi|255676488|dbj|BAH93216.1| Os05g0513700 [Oryza sativa Japonica Group]
Length = 689
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 51/114 (44%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN ++++ Y L +W+ + R + + V+ + + V P + P
Sbjct: 401 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 460
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
S FG+Q F T + +L IL LGYNV+ +DVD+ W +P +L
Sbjct: 461 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVYWFHNPVSFLH 514
>gi|359473691|ref|XP_002273214.2| PREDICTED: uncharacterized protein At4g15970 [Vitis vinifera]
Length = 415
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 48/102 (47%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ I +++ + +++IA D + LV D F + +
Sbjct: 152 FLESFRIGDHTRRYLNHLVIIALDQKAFARCLILHNHCFTLVTEGVDFSGEAYFMTSDYL 211
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
RR L +LE+GYN +++D D++W +DPFP+ + D
Sbjct: 212 KMMWRRIDFLRSVLEMGYNFIFSDADIMWFRDPFPHFLPNAD 253
>gi|384251205|gb|EIE24683.1| hypothetical protein COCSUDRAFT_62105 [Coccomyxa subellipsoidea
C-169]
Length = 857
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/197 (21%), Positives = 79/197 (40%), Gaps = 7/197 (3%)
Query: 86 ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
A+ +++G ++V S YL FL NW+ ++ + ++ ++ ++ + Y P
Sbjct: 346 ANSRSRDGIVLVTWASAAYLDFLRNWIHHLTILEVENFLIGAMDNEVSEYLRQQNIPYFD 405
Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
+ D Q G++ + + G + + D D+VWL+DP Y G
Sbjct: 406 MQAGMYSDMQGHLMKGTKAARMLAFNKIGVAQTLNTFGLDALLCDTDVVWLRDPSDYFSG 465
Query: 206 --DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQ 263
+ D+ D V L P R + + +++LR GA+ M W L+
Sbjct: 466 LEEADILVATDGLGVSNTKDDDGLESPEAALRHQMSTGIVFLRHGRGARNFMDAWDSALR 525
Query: 264 AEPWSKAKKANDQPAFN 280
+ +Q AFN
Sbjct: 526 -----RNVSRTEQQAFN 537
>gi|297738316|emb|CBI27517.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 46/97 (47%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ I +++ + +++IA D + LV D F + +
Sbjct: 25 FLESFRIGDHTRRYLNHLVIIALDQKAFARCLILHNHCFTLVTEGVDFSGEAYFMTSDYL 84
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
RR L +LE+GYN +++D D++W +DPFP+
Sbjct: 85 KMMWRRIDFLRSVLEMGYNFIFSDADIMWFRDPFPHF 121
>gi|356575476|ref|XP_003555866.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 364
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 77/184 (41%), Gaps = 20/184 (10%)
Query: 45 VLLSLLVVLGVI---------LPWTGTPGF--MFPNATSS-LAKWRDYTLSQAASFVA-K 91
V LS LV+LG + GF +FP+ + +A +Y L + + A K
Sbjct: 23 VSLSCLVLLGDVDSHRFLSSFHSSYSLSGFTRIFPSVYNDPVATSNEYPLEKILNEAAMK 82
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
+ T+I+ +++ + FL ++ I D +++IA D +
Sbjct: 83 DRTVILTTLNEAWAAPNSVIDLFLESFRIGDRTSTFLDHLVIIALDQKAFARCQVIHTYC 142
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
LV D F + + +R L +LE+GYN ++ D D++W +DPFP
Sbjct: 143 FSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLRTVLEMGYNFVFTDADIMWFRDPFPLFH 202
Query: 205 GDHD 208
D D
Sbjct: 203 LDAD 206
>gi|357111002|ref|XP_003557304.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 375
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 37/172 (21%), Positives = 73/172 (42%), Gaps = 18/172 (10%)
Query: 94 TIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
T+++ A+++ + FL+ +L S ++ +LV+A D + N P
Sbjct: 112 TVLITALNEAWAAPGSFLDLFLESFQHGENTAYLVKHLLVVAMDKKAFDRCNAVHPFCYW 171
Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD 206
D + K+ + +R ILELGY ++ DVD++W +DPFP++
Sbjct: 172 FRVEGMDFASEQKYMKGDYLEMMWKRNRFQQTILELGYTFLFTDVDILWFRDPFPHISPT 231
Query: 207 HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
+ + D P + + + P G ++Y++ +G+ K W
Sbjct: 232 AQLVMSSDFFVGDP-NSAGNYPNGG----------LLYVKSCEGSIGFYKHW 272
>gi|412992379|emb|CCO20092.1| predicted protein [Bathycoccus prasinos]
Length = 980
Score = 47.4 bits (111), Expect = 0.009, Method: Composition-based stats.
Identities = 51/215 (23%), Positives = 102/215 (47%), Gaps = 19/215 (8%)
Query: 79 DYTLSQA-ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYK- 136
D L++A A + ++V + Y F+ NW+ I R + LV A D L +
Sbjct: 470 DNQLNEALAKRYSHENIVMVTWANNHYYDFVKNWVKHI-RDCGMNNFLVGAMDNELLVRL 528
Query: 137 VNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVW 195
++ + P A+ + + +GS+ F R+ L+H+ ++G++++ +DVD W
Sbjct: 529 IDDKVPTFAM---QSGLTTADFGWGSKNFHQM-GRKKIELIHLFTKMGFDILVSDVDTAW 584
Query: 196 LKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC-MIYLRPTDGAKLV 254
+K+P P+++ +V D + + L + ++ T + + ++ LR + GA +
Sbjct: 585 MKNPIPFIRKFPEV---DVLTSSDSLSDFAETEWSLERTTTGMANIGIMLLRKSAGA--L 639
Query: 255 MKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQ 289
K+W+ L+ K + DQ AFN + K G+
Sbjct: 640 AKEWVHVLE-----KDENIWDQNAFNDLMRKGRGK 669
>gi|125528893|gb|EAY77007.1| hypothetical protein OsI_04964 [Oryza sativa Indica Group]
Length = 369
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 41/82 (50%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
D VLV+A D A + P L + +A+ F S+ + + +LE
Sbjct: 126 DHVLVVAVDPAAFHHCRAVHPHCYHLKVDTMNLSSANNFMSEAYVELVWTKLSLQQRVLE 185
Query: 182 LGYNVMYNDVDMVWLKDPFPYL 203
LGYN ++ DVD++W +DPF ++
Sbjct: 186 LGYNFLFTDVDILWFRDPFRHI 207
>gi|449439235|ref|XP_004137392.1| PREDICTED: uncharacterized protein At4g15970-like [Cucumis sativus]
Length = 380
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/183 (20%), Positives = 75/183 (40%), Gaps = 18/183 (9%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++GTII+ ++ + FL ++ I Q+ ++++ D + P
Sbjct: 89 EDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVAVHPH 148
Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
L + + F + + RR L+++LE+G++ ++ D D++WL+DPF +
Sbjct: 149 CYQLDTQGTNFSSEAYFMTADYLKMMWRRIEFLIYVLEMGHSFVFTDTDIMWLQDPFNHF 208
Query: 204 QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQ 263
D D D+ P ++ +++P G +Y+R K W E
Sbjct: 209 YKDADFQIASDLYLGNP-ENLNNVPNGG----------FVYVRANHRTVKFYKFWYESRT 257
Query: 264 AEP 266
P
Sbjct: 258 IYP 260
>gi|115441885|ref|NP_001045222.1| Os01g0920700 [Oryza sativa Japonica Group]
gi|19386738|dbj|BAB86120.1| regulatory protein-like [Oryza sativa Japonica Group]
gi|57899427|dbj|BAD88365.1| regulatory protein-like [Oryza sativa Japonica Group]
gi|113534753|dbj|BAF07136.1| Os01g0920700 [Oryza sativa Japonica Group]
gi|125573134|gb|EAZ14649.1| hypothetical protein OsJ_04573 [Oryza sativa Japonica Group]
Length = 369
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 41/82 (50%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
D VLV+A D A + P L + +A+ F S+ + + +LE
Sbjct: 126 DHVLVVAVDPAAFHHCRAVHPHCYHLKVDTMNLSSANNFMSEAYVELVWTKLSLQQRVLE 185
Query: 182 LGYNVMYNDVDMVWLKDPFPYL 203
LGYN ++ DVD++W +DPF ++
Sbjct: 186 LGYNFLFTDVDILWFRDPFRHI 207
>gi|242055415|ref|XP_002456853.1| hypothetical protein SORBIDRAFT_03g044070 [Sorghum bicolor]
gi|241928828|gb|EES01973.1| hypothetical protein SORBIDRAFT_03g044070 [Sorghum bicolor]
Length = 376
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 46/99 (46%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
D VL++ D A+ P +L + D A FGS + + +LE
Sbjct: 142 DHVLIVTVDAASFSGCKAAHPHCYLLEVKSMDMNRAKSFGSPEYVELNWLKLSVQQRVLE 201
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
LGYN ++ D D++WL++PF + D+ + D + + P
Sbjct: 202 LGYNFLFTDADILWLRNPFQRISVYADMSCSLDNSKMAP 240
>gi|357118762|ref|XP_003561118.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 339
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 11/82 (13%)
Query: 177 LHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRT 236
L +L+LGYN ++ D+D++W +DPFP++ + + D+ P D P R
Sbjct: 171 LTVLQLGYNFLFTDMDILWFRDPFPHIPPTAQLVMSSDIFVGDP-----DSP------RN 219
Query: 237 YICSCMIYLRPTDGAKLVMKKW 258
Y ++Y R DGA + W
Sbjct: 220 YPNGGLLYARSCDGAIGFYEHW 241
>gi|115441887|ref|NP_001045223.1| Os01g0921000 [Oryza sativa Japonica Group]
gi|19386741|dbj|BAB86123.1| regulatory protein-like [Oryza sativa Japonica Group]
gi|19386794|dbj|BAB86173.1| OJ1485_B09.2 [Oryza sativa Japonica Group]
gi|57899430|dbj|BAD88368.1| regulatory protein-like [Oryza sativa Japonica Group]
gi|113534754|dbj|BAF07137.1| Os01g0921000 [Oryza sativa Japonica Group]
gi|215701207|dbj|BAG92631.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619773|gb|EEE55905.1| hypothetical protein OsJ_04575 [Oryza sativa Japonica Group]
Length = 372
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 49/104 (47%), Gaps = 2/104 (1%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
D VLV+A D A + P +L + + +A +F S+ + + +LE
Sbjct: 134 DHVLVVAVDPAGFRRCKAVHPHCYLLHVKSINLTSATRFMSREYLELVWTKLSLQQRVLE 193
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQ--GDHDVYFTDDMAAVKPLDH 223
LGYN ++ D DMV +DPF ++ D D AA PLD+
Sbjct: 194 LGYNFLFTDCDMVLFRDPFRHIAVYADMSTSSDDYSAARAPLDN 237
>gi|308081654|ref|NP_001183634.1| uncharacterized protein LOC100502228 [Zea mays]
gi|238013584|gb|ACR37827.1| unknown [Zea mays]
gi|414878981|tpg|DAA56112.1| TPA: hypothetical protein ZEAMMB73_163166 [Zea mays]
gi|414878982|tpg|DAA56113.1| TPA: hypothetical protein ZEAMMB73_163166 [Zea mays]
Length = 357
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 70/161 (43%), Gaps = 16/161 (9%)
Query: 68 PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKH 120
+A + A+ D + ++ T+I+ +V++ + FL ++
Sbjct: 67 ADAKGAAAEQEDELQRLLRAVADEDRTVIMTSVNEAWAAQDSLLDLFLESFRSGERIAHF 126
Query: 121 QDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHK--FGSQGFFNFTSRRPCHLLH 178
D +LV+A D L + P H L+P A + + F S+ + + +
Sbjct: 127 VDHLLVVALDGGALERCRAVHP-HCYLLPTAAARNLSGEKVFMSKDYIDLVWSKVRLQQR 185
Query: 179 ILELGYNVMYNDVDMVWLKDPFP------YLQGDHDVYFTD 213
ILELGYN ++ DVD++W ++PF ++ D YF D
Sbjct: 186 ILELGYNFLFTDVDILWFRNPFERMSVAAHMVTSSDFYFGD 226
>gi|218189629|gb|EEC72056.1| hypothetical protein OsI_04966 [Oryza sativa Indica Group]
Length = 372
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 49/104 (47%), Gaps = 2/104 (1%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
D VLV+A D A + P +L + + +A +F S+ + + +LE
Sbjct: 134 DHVLVVAVDPAGFRRCKAVHPHCYLLHVKSINLTSATRFMSREYLELVWTKLSLQQRVLE 193
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQ--GDHDVYFTDDMAAVKPLDH 223
LGYN ++ D DMV +DPF ++ D D AA PLD+
Sbjct: 194 LGYNFLFTDCDMVLFRDPFRHIAVYADMSTSSDDYSAARAPLDN 237
>gi|42407878|dbj|BAD09019.1| regulatory protein-like [Oryza sativa Japonica Group]
gi|42407980|dbj|BAD09118.1| regulatory protein-like [Oryza sativa Japonica Group]
Length = 335
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 9/130 (6%)
Query: 94 TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
T+I+ ++++ Y FL ++ + + VL++A D A L + P +
Sbjct: 69 TVIMTSINRAYAAPGSLLDLFLESFRLGEGTEPLLKHVLIVAMDPAALARCRQVHPHCYL 128
Query: 147 LVPP--APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
L P A D +F S+ + + R IL+LG+N ++ D+D++W ++P ++
Sbjct: 129 LRRPEGAVDYSDEKRFMSKDYLDMMWGRNLFQQTILQLGFNFLFTDIDIMWFRNPLRHIA 188
Query: 205 GDHDVYFTDD 214
D+ +D
Sbjct: 189 ITSDIAVAND 198
>gi|218201503|gb|EEC83930.1| hypothetical protein OsI_30008 [Oryza sativa Indica Group]
Length = 343
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 9/130 (6%)
Query: 94 TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
T+I+ ++++ Y FL ++ + + VL++A D A L + P +
Sbjct: 77 TVIMTSINRAYAAPGSLLDLFLESFRLGEGTEPLLKHVLIVAMDPAALARCRQVHPHCYL 136
Query: 147 LVPP--APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
L P A D +F S+ + + R IL+LG+N ++ D+D++W ++P ++
Sbjct: 137 LQRPEGAVDYSDEKRFMSKDYLDMMWGRNLFQQTILQLGFNFLFTDIDIMWFRNPLRHIA 196
Query: 205 GDHDVYFTDD 214
D+ +D
Sbjct: 197 ITSDIAVAND 206
>gi|218189630|gb|EEC72057.1| hypothetical protein OsI_04967 [Oryza sativa Indica Group]
Length = 371
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/141 (20%), Positives = 65/141 (46%), Gaps = 9/141 (6%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+I+ +V++ + FL ++ + + + ++V+A D L + P
Sbjct: 96 EDRTVIMTSVNEAWAAPGSLMDSFLESFSVGENISHFVEHIVVVAMDEGALRRCRAIHPH 155
Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
+L+P D A + ++ + + + +LELGYN+++ DVD+ W ++P
Sbjct: 156 CYLLLPEVAGLDLSGAKSYMTKDYLDLVWSKLKLQQRVLELGYNLLFTDVDLAWFRNPMV 215
Query: 202 YLQGDHDVYFTDDMAAVKPLD 222
++ D+ + D P D
Sbjct: 216 HITAAADITTSSDFYFGDPDD 236
>gi|359491817|ref|XP_002270758.2| PREDICTED: uncharacterized protein At4g15970-like [Vitis vinifera]
Length = 373
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 47/209 (22%), Positives = 92/209 (44%), Gaps = 19/209 (9%)
Query: 43 LLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWR----DYTLSQAASFVAKNGTIIVC 98
+++ ++++V+ G++L + P A S ++ + D L +AA + T+I+
Sbjct: 43 IMLFMAVVVLPGMVLHKSAYSSQFLP-ALISRSELKESPLDIVLEKAA---MGDKTVILT 98
Query: 99 AVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
V+ + FL ++ I + ++ + +++IA D + + P L
Sbjct: 99 TVNGAWAANNSLLDLFLESFHIGNNTKRLLNHLVIIALDQKSYARCLALHPLCYALKTEG 158
Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
D + + + RR L IL +GY+ ++ D D++W +DPF + D D
Sbjct: 159 VDFSGEAYYSTPNYLEMMWRRIDFLRSILTMGYSFIFTDADIMWFRDPFQHFFQDADFQI 218
Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
T D P D ++ P G G TY+ S
Sbjct: 219 TCDSYIGNPYDVNN--RPNG--GFTYVKS 243
>gi|357131571|ref|XP_003567410.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 371
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 69/160 (43%), Gaps = 20/160 (12%)
Query: 123 QVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILEL 182
VLVIA D A ++ N P +L + +A +F S+ + + ILEL
Sbjct: 132 HVLVIALDPAGFHRCNVVHPHCYLLEVKTVNLTSATRFMSKEYLEIVWTKLSLQQRILEL 191
Query: 183 GYNVMYNDVDMVWLKDPFPY--LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
GYN ++ D DM+ ++PF L D V D A PLD+ I +
Sbjct: 192 GYNFLFTDADMLLFRNPFRRITLHADMSVSSDDFSIARAPLDNP-------------INT 238
Query: 241 CMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+ Y++ T+ +++ W QA S+ A+DQ F
Sbjct: 239 GLYYVKSTNRTVEMLRYW----QAAR-SRTPGAHDQTVFG 273
>gi|242055407|ref|XP_002456849.1| hypothetical protein SORBIDRAFT_03g044030 [Sorghum bicolor]
gi|241928824|gb|EES01969.1| hypothetical protein SORBIDRAFT_03g044030 [Sorghum bicolor]
Length = 300
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/131 (22%), Positives = 61/131 (46%), Gaps = 11/131 (8%)
Query: 94 TIIVCAVSQPYLPFLNNWLISISRQKHQ---------DQVLVIAEDYATLYKVNGRWPGH 144
T+I+ +V++ + N L+ + R+ Q + VLV+A D P
Sbjct: 30 TVIITSVNEAFAR--PNSLLGLFRESFQVGEGIGHLLNNVLVVAVDAKAFRYCKAVHPHC 87
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
+L + +A+ + ++ + + +LELGYN ++ DVD+VW ++PF ++
Sbjct: 88 YLLEVKTMNLSSANNYMTESYIELVWTKLSLQQRVLELGYNFLFTDVDIVWFRNPFRHIS 147
Query: 205 GDHDVYFTDDM 215
D+ + D+
Sbjct: 148 AFADMTTSSDV 158
>gi|356561839|ref|XP_003549184.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 353
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 77/191 (40%), Gaps = 40/191 (20%)
Query: 45 VLLSLLVVLGVILPWTGTPGFMFP------NATSSLAKWRDYTLSQAASFVA-KNGTIIV 97
+ L+ L VL + L T +P +FP NA SS D L + K+ T+I+
Sbjct: 34 MFLAGLAVLWMFLYNTASPFGIFPAFSHSINAQSSKQANYDPKLESVLRDASMKDKTVII 93
Query: 98 CAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL----------YKVN-- 138
++ + FL + + Q + ++VI D TL Y+V
Sbjct: 94 TTLNDAWAEPGSMFDLFLESLQLGNGTQWLSNHLVVITWDQKTLARCLVVHKHCYQVETK 153
Query: 139 -GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
G + G + P + + RR L ILE+GYN ++ D D++WL+
Sbjct: 154 GGNYTGEVFYMTP-------------NYLHMMWRRTEFLGSILEMGYNFVFTDTDIMWLR 200
Query: 198 DPFPYLQGDHD 208
DPF D D
Sbjct: 201 DPFKQFYKDTD 211
>gi|242055409|ref|XP_002456850.1| hypothetical protein SORBIDRAFT_03g044040 [Sorghum bicolor]
gi|241928825|gb|EES01970.1| hypothetical protein SORBIDRAFT_03g044040 [Sorghum bicolor]
Length = 355
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 18/172 (10%)
Query: 94 TIIVCAVSQPYL---PFLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
T+I+ +V++ + L+ +L S + D +LV+A D ++ P +
Sbjct: 83 TVIITSVNEAWAQPGSLLDLYLDSFKNGEDTAHLLDHLLVVALDARGFHRCQAVHPYCYL 142
Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD 206
L + D +A F S + + +LELGYN ++ D DMVW ++PF +
Sbjct: 143 LNATSVDMSSAKPFMSPDYLELVWTKLVFQQRVLELGYNFLFTDCDMVWFRNPFRHFP-- 200
Query: 207 HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
VY DM+ S D P + + + Y++ T+ +MK W
Sbjct: 201 --VYA--DMSC-----SSDDFKPSRAPLDNPLNTGLYYMKTTNRTIEMMKYW 243
>gi|255571059|ref|XP_002526480.1| pentatricopeptide repeat-containing protein, putative [Ricinus
communis]
gi|223534155|gb|EEF35871.1| pentatricopeptide repeat-containing protein, putative [Ricinus
communis]
Length = 357
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 56/130 (43%), Gaps = 12/130 (9%)
Query: 92 NGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
N T+I+ +++ Y FL ++ + + D +L++A D +
Sbjct: 76 NKTVIIVILNKAYAEPTVKSETTMLDLFLESFWVGEDTRPLLDHLLLVAADQTAYERCMF 135
Query: 140 RWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
+ + D F S+ F RR LL +L+ GY+ ++ D D++WL++P
Sbjct: 136 KRLNCYKMETEGVDFGGEKLFMSKDFIKMMWRRTLLLLDVLKHGYSFIFTDADVMWLRNP 195
Query: 200 FPYLQGDHDV 209
FP L + V
Sbjct: 196 FPRLSKNESV 205
>gi|218197096|gb|EEC79523.1| hypothetical protein OsI_20607 [Oryza sativa Indica Group]
Length = 830
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 51/114 (44%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
KN ++++ Y L +W+ + R + + V+ + + V P + P
Sbjct: 542 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 601
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
S FG+Q F T + +L IL LGYNV+ +DVD+ W +P +L
Sbjct: 602 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVHWFHNPVSFLH 655
>gi|340369123|ref|XP_003383098.1| PREDICTED: hypothetical protein LOC100635634 [Amphimedon
queenslandica]
Length = 393
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/214 (21%), Positives = 87/214 (40%), Gaps = 35/214 (16%)
Query: 78 RDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKV 137
+ Y + A K+G +V V+ ++ NW+ S + + +L+I +
Sbjct: 102 KHYPCNFATPIPIKDGVSVVTFVNAGWINLTKNWICSARKVGLGEHILLITVE------- 154
Query: 138 NGRWPGHAVLVPPAP------DSQTAHKFGSQGFFNFTSRRPCHLLHILELGY-NVMYND 190
P P P + ++ KFG G+ F R +L +L G ++ D
Sbjct: 155 ----PNVCSNFPDTPCHYEKGAAISSTKFGQPGYQKFMIERTKIILRLLSCGIKKLLLAD 210
Query: 191 VDMVWLKDPFPYLQGD---HDVYFTDDMAAVKPLDH-SHDLPPPGKKGRTYICSCMIYLR 246
D+V+L++P L + D+ D ++ +D ++++ P YIC +YL
Sbjct: 211 ADIVFLQNPLKRLDTELEYRDIVLQRDSTGLQVIDSLAYNVFP-------YICGGFMYLN 263
Query: 247 PTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+ KL+ + ++ + + W NDQ N
Sbjct: 264 VNNKTKLLYQSVLQFQRNQSW------NDQAGLN 291
>gi|326496953|dbj|BAJ98503.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326526459|dbj|BAJ97246.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 369
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 59/137 (43%), Gaps = 18/137 (13%)
Query: 94 TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
T+I+ +V++ + FL ++ D +LV+A D L + P H
Sbjct: 103 TVIMTSVNEAWAAEGSLLDLFLESFRAGERIAHFVDNLLVVALDAGALERCRAVHP-HCY 161
Query: 147 LVPPAPDSQT----AHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP- 201
L+PP F S+ + + + ILELGYN ++ DVD++W ++PF
Sbjct: 162 LLPPVAGGNKNLSDEKVFMSKDYLDLVWSKVRLQQRILELGYNFLFTDVDIMWFRNPFER 221
Query: 202 -----YLQGDHDVYFTD 213
++ D YF D
Sbjct: 222 MSVAAHMVTSSDFYFGD 238
>gi|238478665|ref|NP_001154379.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
gi|332192897|gb|AEE31018.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
Length = 278
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 38/183 (20%), Positives = 73/183 (39%), Gaps = 23/183 (12%)
Query: 89 VAKNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAEDYATLYKV 137
+ N T+I+ V++ Y+ FL ++ + D ++++A D + +
Sbjct: 1 MGNNKTVIIAMVNKAYVEEVEGGRTMLDLFLESFWEGEGTRPLLDHLMLVAADQTSYDRC 60
Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
R + D + + S+ F RR LL +L GYN+ + D D++WL+
Sbjct: 61 LFRRLHCYKMDTDGVDLEGEKVYMSKDFIEMMWRRTHLLLDVLSRGYNLTFTDTDVMWLR 120
Query: 198 DPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKK 257
PFP L + + LD + G G I + ++R + + +K
Sbjct: 121 SPFPRLSYN------------ESLDMQISVDSIGLVGGHLINTGFYHVRSNNKTISLFQK 168
Query: 258 WIE 260
W +
Sbjct: 169 WYD 171
>gi|326529655|dbj|BAK04774.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 369
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/116 (23%), Positives = 48/116 (41%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ + ++ D +++++ D A + + D F + G+
Sbjct: 125 FLESFRVGEHTRELVDHLVIVSLDLAAHRRCKQIHAHCLAVATEGVDFSGQKNFMTDGYL 184
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLD 222
RR L +LE G++ ++ D D+VWL+ P P L D D D P D
Sbjct: 185 RMMWRRIDFLRQVLEKGFSFIFTDTDIVWLRSPLPRLYADGDFQIACDHFTGDPDD 240
>gi|326497757|dbj|BAK05968.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 65/157 (41%), Gaps = 11/157 (7%)
Query: 92 NGTIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGH 144
N T+++ A+++ + FL+ +L S ++ +L++A D + N P
Sbjct: 147 NRTVLITALNEAWAAPGSFLDLFLESFKHGENTANLVKHLLIVAMDKKAFDRCNAVHPLC 206
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
D K+ + +R ILELGY ++ DVD++W +DPFP +
Sbjct: 207 YWFRVEGMDFAAEQKYMKGDYLEMMWKRNRFQQTILELGYTFLFTDVDILWFRDPFPRIP 266
Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
V + D P D + P G Y+ SC
Sbjct: 267 EAAQVVMSSDFFVGDP-DSPGNYP---NGGLLYVRSC 299
>gi|167523934|ref|XP_001746303.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775065|gb|EDQ88690.1| predicted protein [Monosiga brevicollis MX1]
Length = 319
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 41/203 (20%), Positives = 82/203 (40%), Gaps = 35/203 (17%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQ-KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA-- 151
II+ A+S + F+ N L SI +VIAED ++ +++ + PG +L
Sbjct: 44 IILAAISSGFSDFMYNLLASIQATVGTTRHAVVIAEDVSSYFQMEEKLPGQVLLATEDLL 103
Query: 152 -----------PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
+ +GS + RP +L + + G ++++ D D+ + ++P+
Sbjct: 104 TFGMHNAREKLEGDKRGQSYGSTLYLLLMRARPKYLTRMHDTGVHMLFVDADIAFFENPY 163
Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
++ + L S+DL +C+ +Y T ++ W +
Sbjct: 164 AWMDAN--------------LKQSYDLLASNDHA-NLLCAGFVYYANTSVITRFLEVW-D 207
Query: 261 ELQAEPWSKAKKANDQPAFNWAL 283
+L A+ +Q +FN AL
Sbjct: 208 QLVAQG-----DGRNQASFNKAL 225
>gi|357126544|ref|XP_003564947.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 370
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 70/158 (44%), Gaps = 16/158 (10%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
+ VL++A D + N P +L + +A +F S+ + + +LE
Sbjct: 129 NHVLIVALDPTGFGRCNVVHPYCYLLEVKTANLTSATRFMSKEYLELVWSKLSFQQRVLE 188
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
LGYN +Y D DM+ +++P ++ VY ++ LD L P G
Sbjct: 189 LGYNFLYTDTDMIMMRNPLRHIP----VYADMSVSTDNFLDARVPLTNPLNTG------- 237
Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAF 279
+ Y++ T+ + +++ W E A P + + NDQP F
Sbjct: 238 LYYMKATNRSISMLRYWQE---ARP--RFPRLNDQPVF 270
>gi|357126546|ref|XP_003564948.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 392
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 45/229 (19%), Positives = 88/229 (38%), Gaps = 33/229 (14%)
Query: 69 NATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQ 121
NA S D ++GT+I+ +V++ + FL ++ + +
Sbjct: 92 NARGSRGVEDDEFARMVRRAAMEDGTVIMTSVNEAWAAPGSLLDTFLESFRVGENISHFV 151
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPP----------APDSQTAHKFGSQGFFNFTSR 171
+ ++V+A D L + + H L+ P +PD A + ++ + +
Sbjct: 152 EHIIVVAMDEGALLRCHALKHPHCHLLLPDPENNSPNGTSPDLSGAKSYMTKDYLSLVWS 211
Query: 172 RPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPG 231
+ +L+LG+N+++ DVD+ W +DP ++ D+ + D P D +
Sbjct: 212 KLRLQQRVLDLGHNLLFTDVDVAWFRDPRVHITLAADITTSSDFYFGDPDDLGN------ 265
Query: 232 KKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
Y + IY + T M W P + +DQ FN
Sbjct: 266 -----YPNTGFIYFKATARNGRAMAHWHAARSRFP-----REHDQFVFN 304
>gi|242076940|ref|XP_002448406.1| hypothetical protein SORBIDRAFT_06g026610 [Sorghum bicolor]
gi|241939589|gb|EES12734.1| hypothetical protein SORBIDRAFT_06g026610 [Sorghum bicolor]
Length = 373
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 4/88 (4%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D +F + G+ RR L +LE GYN +++D D++W ++PFP+ D D
Sbjct: 170 DFSEERRFLTSGYLEMMWRRLDFLRLVLEKGYNFIFSDADIMWFRNPFPHFYPDVDF--- 226
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICS 240
+A + ++ DL G +Y+ S
Sbjct: 227 -QIACDHYVRNATDLRNIANGGFSYVKS 253
>gi|328873065|gb|EGG21432.1| WD40 repeat-containing protein [Dictyostelium fasciculatum]
Length = 684
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 30/50 (60%)
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+ + +G GF + +P +L +L+ GYNV++ D D+VW+ DP Y+
Sbjct: 152 GEESESYGGVGFRAICNEKPLVVLDVLKRGYNVLWTDTDIVWMADPLKYI 201
>gi|159486479|ref|XP_001701267.1| hypothetical protein CHLREDRAFT_179139 [Chlamydomonas reinhardtii]
gi|158271849|gb|EDO97660.1| predicted protein [Chlamydomonas reinhardtii]
Length = 695
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 5/109 (4%)
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG----DHDV 209
S +K+G + T R+ + I+ G+NV+++DVD+VW +DP Y G + D+
Sbjct: 247 STDQYKYGDSHYNAATWRKVVVVSRIVHWGFNVIHSDVDVVWFRDPLAYFLGPAVNNVDM 306
Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
+ D+ + L G T + + ++R T G K +M W
Sbjct: 307 ALSSDLISTGNPKGDEGL-EVGMHQHTNFNTGVYFVRATPGGKSLMAGW 354
>gi|443691598|gb|ELT93412.1| hypothetical protein CAPTEDRAFT_215597 [Capitella teleta]
Length = 523
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 50/208 (24%), Positives = 80/208 (38%), Gaps = 35/208 (16%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L +A G I++ V Y+ N + + D L + D+ + R
Sbjct: 77 LKEAIRARVIGGAIVLVVVDSGYIEMAINLHRTSFEKLQIDNYLFVGIDHQVCSGL--RL 134
Query: 142 PGHAVLVPPA--PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
G + + + +GS F T + +L L+LG+ V+ DVD+V+ K+P
Sbjct: 135 HGVVCVTHEGFMGEKNSDSNWGSTEFMQKTHFKTRVVLQGLQLGFQVLITDVDVVFFKNP 194
Query: 200 FPYLQ-GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL-RPTDGAKLVMKK 257
FPY D D+ ++D++ G G YL RPT A+
Sbjct: 195 FPYFTCSDCDIEISNDIS-------------EGNSG--------FYLARPTSPAR----- 228
Query: 258 WIEELQAEPWSKAKKANDQPAFNWALNK 285
L A W K A D+ + ALN+
Sbjct: 229 ---TLHASAWEIGKVAGDRISNQKALNR 253
>gi|449522244|ref|XP_004168137.1| PREDICTED: uncharacterized protein At4g15970-like [Cucumis sativus]
Length = 380
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 29/137 (21%), Positives = 60/137 (43%), Gaps = 7/137 (5%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++GTII+ ++ + FL ++ I Q+ ++++ D + P
Sbjct: 89 EDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVAVHPH 148
Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
L + + F + + RR L+++LE+G++ ++ D D++WL+DPF +
Sbjct: 149 CYQLDTQGTNFSSEAYFMTADYLKMMWRRIEFLIYVLEMGHSFVFTDTDIMWLQDPFNHF 208
Query: 204 QGDHDVYFTDDMAAVKP 220
D D D+ P
Sbjct: 209 YKDADFQIASDLYLGNP 225
>gi|222640915|gb|EEE69047.1| hypothetical protein OsJ_28045 [Oryza sativa Japonica Group]
Length = 352
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 9/130 (6%)
Query: 94 TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
T+I+ ++++ Y FL ++ + + VL++A D A L + P +
Sbjct: 77 TVIMTSINRAYAAPGSLLDLFLESFRLGEGTEPLLKHVLIVAMDPAALARCRQVHPHCYL 136
Query: 147 LVPP--APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
L P A D +F S+ + + R IL+LG+N ++ D+D++W ++P ++
Sbjct: 137 LRRPEGAVDYSDEKRFMSKDYLDMMWGRNLFQQTILQLGFNFLFTDIDIMWFRNPLRHIA 196
Query: 205 GDHDVYFTDD 214
D+ +D
Sbjct: 197 ITSDIAVAND 206
>gi|145353802|ref|XP_001421190.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581427|gb|ABO99483.1| predicted protein, partial [Ostreococcus lucimarinus CCE9901]
Length = 451
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 40/213 (18%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQV-----LVIAED---YATLYK--VNGR 140
++G + V ++ F+ NW +H D++ LV A D Y L K VN
Sbjct: 54 RDGGVAVTFANEGMYDFVVNWC------EHMDEIGITNYLVGAMDESLYGRLRKIGVNAW 107
Query: 141 WPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKD 198
G + D + FG F+ R L+H L + G++V+ DVD VWL+D
Sbjct: 108 LMGSKNI----DDDEVKKDFGWGTRTFHKMGRDKIRLVHELTKTGFDVIVTDVDAVWLRD 163
Query: 199 PFPYL----QGDHDVYFTD--DMAAVKPLDHSHDLPPPGKK-----GRTYICSCMIYLRP 247
PFP+L + D V + + +V +H + G + G I +++ R
Sbjct: 164 PFPFLRRYPKADALVSIDNLRNHTSVVATQANHAVDGEGLEHSACGGNKNI--GIMWFRS 221
Query: 248 TDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
T+G++ ++W+ +L++ K DQ FN
Sbjct: 222 TEGSQSFTQEWLNKLESN-----DKDWDQVVFN 249
>gi|226492551|ref|NP_001141233.1| uncharacterized protein LOC100273320 [Zea mays]
gi|194703416|gb|ACF85792.1| unknown [Zea mays]
gi|413933156|gb|AFW67707.1| regulatory protein [Zea mays]
Length = 343
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 55/123 (44%), Gaps = 10/123 (8%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
N T+I+ +++ Y FL + ++ D VL +A D +
Sbjct: 74 NRTLILTMLNKAYSEEGGLLDLFLESLREGDGTEQLIDHVLFVAMDQQAFRRCRSLGGLK 133
Query: 145 AVLVPP---APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
L+ P A D + + S GF RR L +L+ GY+ ++ D+D++WL++PFP
Sbjct: 134 CYLLRPTDSADDLSSEQIYMSDGFIRMMWRRIRFLGDVLKHGYSFVFTDMDVMWLRNPFP 193
Query: 202 YLQ 204
L
Sbjct: 194 KLD 196
>gi|20197540|gb|AAD15452.2| unknown protein [Arabidopsis thaliana]
Length = 528
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 84/201 (41%), Gaps = 17/201 (8%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
K+ IIV + ++ F+ W+ ++ + +LV A D L + W G V
Sbjct: 18 VKDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAMDTKLLEAL--YWKGVPVFDM 74
Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
+ S +GS F + + +L GY ++ D DM +P PYL D
Sbjct: 75 GSHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDM----NPMPYLARFPDA 130
Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
DV + D +D S D+ + G Y + + RPT+ AK + K+W E L A+
Sbjct: 131 DVLTSSDQVVPTVIDDSLDIWQ--QVGAAYNIG-IFHWRPTESAKKLAKEWKEILLAD-- 185
Query: 268 SKAKKANDQPAFNWALNKTAG 288
K DQ FN + + G
Sbjct: 186 ---DKVWDQNGFNEIVRRQLG 203
>gi|414585717|tpg|DAA36288.1| TPA: hypothetical protein ZEAMMB73_697462 [Zea mays]
Length = 374
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 53/118 (44%), Gaps = 4/118 (3%)
Query: 123 QVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILEL 182
++++A D+ + P L D +F + G+ RR L +LE
Sbjct: 141 HLVIVAFDWKAYEECVKIHPYCFALGTDGVDFSEEKRFLTSGYLEMMWRRLDFLRLVLEK 200
Query: 183 GYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
GYN +++D D++W ++PFP+ D D +A + ++ DL G +Y+ S
Sbjct: 201 GYNFIFSDADIMWFRNPFPHFYPDVDF----QIACDHYVRNATDLRNIANGGFSYVKS 254
>gi|326522376|dbj|BAK07650.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 390
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/134 (23%), Positives = 58/134 (43%), Gaps = 4/134 (2%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
F++++ + +++IA D+ + LV D +F + G+
Sbjct: 141 FIDSFRHGVRTSSLLKHLVIIAFDWKAYKRCTEVHTYCYALVTDNVDFSQEKRFLTAGYL 200
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
+R L +LE GY+ +++D D++W ++PFPY D D +A + +S D
Sbjct: 201 EMMWKRLDFLRLVLEKGYSFIFSDADIMWFRNPFPYFYPDGDF----QVACDHYVGNSTD 256
Query: 227 LPPPGKKGRTYICS 240
L G Y+ S
Sbjct: 257 LRNIANGGFNYVKS 270
>gi|195640292|gb|ACG39614.1| regulatory protein [Zea mays]
Length = 344
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 55/124 (44%), Gaps = 11/124 (8%)
Query: 92 NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
N T+I+ +++ Y FL + ++ D VL +A D +
Sbjct: 74 NRTLILTMLNKAYSEEGGLLDLFLESLREGDGTEQLIDHVLFVAMDQQAFRRCRSLGGLK 133
Query: 145 AVLVPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
L+ P DS + + S GF RR L +L+ GY+ ++ D+D++WL++PF
Sbjct: 134 CYLLRPPTDSADDLSSEQLYMSDGFIRMMWRRIRFLGDVLKHGYSFVFTDMDVMWLRNPF 193
Query: 201 PYLQ 204
P L
Sbjct: 194 PKLD 197
>gi|242055405|ref|XP_002456848.1| hypothetical protein SORBIDRAFT_03g044020 [Sorghum bicolor]
gi|241928823|gb|EES01968.1| hypothetical protein SORBIDRAFT_03g044020 [Sorghum bicolor]
Length = 366
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 62/138 (44%), Gaps = 16/138 (11%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+I+ +V++ + FL ++ D +LV+A D L P
Sbjct: 99 EDRTVIMTSVNEAWAAQDSLLDLFLESFRSGERIAHFVDHLLVVALDGGALEHCRAVHP- 157
Query: 144 HAVLVPPAPDSQTAHK--FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
H L+P A + + F S+ + + + ILELGYN ++ DVD++W ++PF
Sbjct: 158 HCYLLPAAAARNLSGEKVFMSKDYIDLVWSKVRLQQRILELGYNFLFTDVDILWFRNPFE 217
Query: 202 ------YLQGDHDVYFTD 213
++ D YF D
Sbjct: 218 RMSVAAHMVTSSDFYFGD 235
>gi|242037075|ref|XP_002465932.1| hypothetical protein SORBIDRAFT_01g048460 [Sorghum bicolor]
gi|241919786|gb|EER92930.1| hypothetical protein SORBIDRAFT_01g048460 [Sorghum bicolor]
Length = 380
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 51/119 (42%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ S ++ D +++++ D + LV D F + G+
Sbjct: 131 FLESFRSGESTRELLDHLVIVSLDTTAHARCRQVHRHCFALVTDGVDFSGQKNFMTDGYL 190
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSH 225
RR L +LE G++ ++ D D+VW ++P P+ D D+ D P D S+
Sbjct: 191 KMMWRRIDFLRKVLEKGFSFVFTDTDIVWFRNPLPHFYPDGDLQIACDHFTGDPSDLSN 249
>gi|326493850|dbj|BAJ85387.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 35/162 (21%), Positives = 69/162 (42%), Gaps = 18/162 (11%)
Query: 60 TGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPY------LPFLNNWLI 113
TGT +FP L + +GT+I+ +V++ + L
Sbjct: 70 TGTEERLFPGLAELLPR-----------VATDDGTVIITSVNEAWSRPGSLLDLFREGFK 118
Query: 114 SISRQKHQ-DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRR 172
+ H + L++A D L P +L A D +A++F ++ + +
Sbjct: 119 NGEGIAHLLNHTLIVAVDAGALAHCEAVHPHCYLLEVTAADVSSANRFMTKSYLELVWAK 178
Query: 173 PCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
+L+LGY+ ++ DVD++WL++PF ++ D+ + D
Sbjct: 179 LELQQRVLQLGYSYLFTDVDIMWLRNPFRHISLYADMAVSTD 220
>gi|443703204|gb|ELU00880.1| hypothetical protein CAPTEDRAFT_192549 [Capitella teleta]
Length = 541
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 32/51 (62%)
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
S+T FGS+ F + R +LH L+ GYNV+++D D+ + +PFP ++
Sbjct: 185 SETGASFGSKAFKEKMNIRTFMVLHALKEGYNVLHSDCDIYYFANPFPVIK 235
>gi|255081748|ref|XP_002508096.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
gi|226523372|gb|ACO69354.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
Length = 994
Score = 45.4 bits (106), Expect = 0.034, Method: Composition-based stats.
Identities = 39/142 (27%), Positives = 65/142 (45%), Gaps = 24/142 (16%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D+Q +HK +Q F + +E G +V+ D D+ W+++PFP+L D DV
Sbjct: 237 DAQGSHKISAQKFRI--------IQEFVERGCSVLLTDTDVAWMRNPFPFLYRDADVESM 288
Query: 213 ----DDMAAVKPLDHSHD--LPPPG-KKGRTY----ICSCMIYLRPTDGAKLVMKKWIEE 261
D+ +A LD D + P G K+ R + + S M Y+ T+ ++ +M
Sbjct: 289 SDGWDNSSAHGFLDRVDDPSMGPDGRKRARAFRVAALNSGMWYVSATEASRRLMA----- 343
Query: 262 LQAEPWSKAKKANDQPAFNWAL 283
+ A + K DQ +N L
Sbjct: 344 IMAHRMATEDKLWDQAGYNLEL 365
>gi|293333989|ref|NP_001170602.1| uncharacterized protein LOC100384639 [Zea mays]
gi|238006282|gb|ACR34176.1| unknown [Zea mays]
Length = 303
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 55/119 (46%), Gaps = 7/119 (5%)
Query: 92 NGTIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGH 144
+ T+++ A+++ + FL+ +L S + + +LV+A D L + N P
Sbjct: 41 DKTVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGLPRHLLVVAMDGRALERCNAVHPFC 100
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+ D K+ + RR +LELGY+ ++ DVD++W + PFP L
Sbjct: 101 YLFRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVLELGYSFLFTDVDILWFRSPFPRL 159
>gi|218194109|gb|EEC76536.1| hypothetical protein OsI_14328 [Oryza sativa Indica Group]
Length = 388
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 35/68 (51%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R ILELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 191 DYRSEKTFMSKDYLEMMWGRNKFQQTILELGYNFLFTDVDVMWFRDPFRHISMGADIAIS 250
Query: 213 DDMAAVKP 220
D+ P
Sbjct: 251 SDVFIGDP 258
>gi|255574023|ref|XP_002527929.1| conserved hypothetical protein [Ricinus communis]
gi|223532704|gb|EEF34486.1| conserved hypothetical protein [Ricinus communis]
Length = 498
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 50/117 (42%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
V N T+++ Y L +W+ + R + + ++ + + V P
Sbjct: 357 VDANKTVVLAVAGYSYKDMLMSWVCRLRRLQVTNFLICALDQETYQFAVLQGLPVFQDPS 416
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
P S FG+ F T + +L IL+LGYNV+ +DVD+ W +P P L
Sbjct: 417 APRNISFDDCHFGTDCFQRVTKVKSRIVLKILKLGYNVLLSDVDVYWFANPLPILSS 473
>gi|308801050|ref|XP_003075306.1| unnamed protein product [Ostreococcus tauri]
gi|116061860|emb|CAL52578.1| unnamed protein product [Ostreococcus tauri]
Length = 868
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 51/113 (45%), Gaps = 7/113 (6%)
Query: 175 HLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG 234
L+ ++E G++V +DVD+ W +DP Y + DV +++ + S +L P G
Sbjct: 451 ELVRVVEAGFHVALSDVDVAWTRDPTAYFLCERDVDGCEEIKDADVMISSDNLSPTTDWG 510
Query: 235 R-------TYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
R + ++Y++ + +++W E L A A + Q FN
Sbjct: 511 RGARYARGGVFNTGVVYVKSSARGAAFLREWREHLLATTGPYAALTSHQQVFN 563
>gi|307103589|gb|EFN51848.1| hypothetical protein CHLNCDRAFT_37217 [Chlorella variabilis]
Length = 631
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 73/164 (44%), Gaps = 10/164 (6%)
Query: 56 ILPWTGTPGFMFPNATSSLAKWRDYTLSQA-ASFVAKNGTIIVCAVSQPYLPFLNNWLIS 114
+ P + P A + A R + L++ VAK+ TI+V + Y F+ NW+
Sbjct: 75 VQPSSDKPAEQQGTADAVAADSRPHALTRDLVGKVAKDNTIMVTWANWHYHDFVMNWVEH 134
Query: 115 ISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRP 173
+ ++ +D + V+ P ++ T FG F+ R
Sbjct: 135 LQAAGCDAFIVGAMDDKLLEFLVSKNIPAFSM-----SSGLTLGDFGWGTPTFHKMGREK 189
Query: 174 CHLLH-ILELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDD 214
+L++ ++GY+V+ +DVD VWL++P PY+ D D+ + D
Sbjct: 190 INLIYSFTKMGYDVLISDVDTVWLRNPLPYINAYRDADILTSSD 233
>gi|356529430|ref|XP_003533295.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 357
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
RR L +LE+GYN ++ D D++WL+DPF D D
Sbjct: 178 RRTEFLTSVLEMGYNFVFTDTDIMWLRDPFKQFYEDAD 215
>gi|28269397|gb|AAO37940.1| putative regulatory protein [Oryza sativa Japonica Group]
Length = 379
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 35/68 (51%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R ILELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 182 DYRSEKMFMSKDYLEMMWGRNKFQQTILELGYNFLFTDVDVMWFRDPFRHISMGADIAIS 241
Query: 213 DDMAAVKP 220
D+ P
Sbjct: 242 SDVFIGDP 249
>gi|212274615|ref|NP_001130717.1| uncharacterized protein LOC100191821 [Zea mays]
gi|194689930|gb|ACF79049.1| unknown [Zea mays]
gi|223949661|gb|ACN28914.1| unknown [Zea mays]
gi|413932429|gb|AFW66980.1| putative NAC domain transcription factor superfamily protein [Zea
mays]
Length = 366
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R +LELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 169 DYRSEKVFMSKDYLEMMWGRNRFQQTVLELGYNFLFTDVDVMWFRDPFRHISMAADIAIS 228
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
D+ P + P G +++R +D + W Q W K
Sbjct: 229 SDVYMGDPYS-LRNFPNGG----------FLFVRASDKTVAFYRAW----QQGRWRFLGK 273
Query: 273 ANDQPAFN 280
++Q FN
Sbjct: 274 -HEQDVFN 280
>gi|413951493|gb|AFW84142.1| hypothetical protein ZEAMMB73_205487 [Zea mays]
Length = 383
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 45/99 (45%)
Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
D VL++ D + P +L + D A FG+ + + +LE
Sbjct: 147 DHVLIVTVDAGSFSGCKAVHPHCYLLEVKSMDMNRAKTFGTPEYVEMIWLKLSIQQRVLE 206
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
LGYN ++ D D++WL++PF + D+ + D + + P
Sbjct: 207 LGYNFLFTDADILWLRNPFQRISVYADMSCSVDNSKMAP 245
>gi|357131569|ref|XP_003567409.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 360
Score = 44.7 bits (104), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 49/198 (24%), Positives = 85/198 (42%), Gaps = 27/198 (13%)
Query: 91 KNGTIIVCAVSQPYL---PFLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNGRWPG 143
++ T+I+ V++ + L+ +L S + + VL++A D A + P
Sbjct: 82 EDKTVIITTVNEAWARPGSLLDIYLESFKNGEDTEHLLAHVLIVALDPAGFRRCTVVHPH 141
Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+L + +A F S+ + + ILELGYN + D DM+ L+DPF +
Sbjct: 142 CHLLEVKIANLTSATPFMSKEYLELVWTKLYLQQCILELGYNFLCTDTDMILLRDPFRRI 201
Query: 204 --QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
D V D +A PLD+ PP + + Y++ T+ + +++ W
Sbjct: 202 PVYADMSVSSDDFSSARAPLDN-----PPN--------TGLYYMKATNRSIEMLRYW--- 245
Query: 262 LQAEPWSKAKKANDQPAF 279
A P + NDQP F
Sbjct: 246 QAARP--RFPGVNDQPVF 261
>gi|356561837|ref|XP_003549183.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 351
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
RR L +LE+GYN ++ D D++WL+DPF D D
Sbjct: 172 RRTEFLTSVLEMGYNFVFTDTDIMWLRDPFKQFYEDAD 209
>gi|108712116|gb|ABF99911.1| regulatory protein, putative, expressed [Oryza sativa Japonica
Group]
gi|222626172|gb|EEE60304.1| hypothetical protein OsJ_13372 [Oryza sativa Japonica Group]
Length = 391
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 35/68 (51%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R ILELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 194 DYRSEKMFMSKDYLEMMWGRNKFQQTILELGYNFLFTDVDVMWFRDPFRHISMGADIAIS 253
Query: 213 DDMAAVKP 220
D+ P
Sbjct: 254 SDVFIGDP 261
>gi|226494464|ref|NP_001150863.1| regulatory protein [Zea mays]
gi|195642450|gb|ACG40693.1| regulatory protein [Zea mays]
Length = 412
Score = 44.7 bits (104), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 7/117 (5%)
Query: 94 TIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
T+++ A+++ + FL+ +L S + + +LV+A D L + N P +
Sbjct: 152 TVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGLPRHLLVVAMDGRALERCNAVHPFCYL 211
Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
D K+ + RR +LELGY+ ++ DVD++W + PFP L
Sbjct: 212 FRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVLELGYSFLFTDVDILWFRSPFPRL 268
>gi|413932427|gb|AFW66978.1| putative NAC domain transcription factor superfamily protein [Zea
mays]
Length = 263
Score = 44.7 bits (104), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R +LELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 66 DYRSEKVFMSKDYLEMMWGRNRFQQTVLELGYNFLFTDVDVMWFRDPFRHISMAADIAIS 125
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
D+ P + P G +++R +D + W Q W K
Sbjct: 126 SDVYMGDPYS-LRNFPNGG----------FLFVRASDKTVAFYRAW----QQGRWRFLGK 170
Query: 273 ANDQPAFN 280
++Q FN
Sbjct: 171 -HEQDVFN 177
>gi|255556103|ref|XP_002519086.1| conserved hypothetical protein [Ricinus communis]
gi|223541749|gb|EEF43297.1| conserved hypothetical protein [Ricinus communis]
Length = 436
Score = 44.7 bits (104), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 83/212 (39%), Gaps = 21/212 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L+Q VA IIV + L W SI + + ++V +D + +
Sbjct: 153 LAQILEEVAVQREIIVALANSNVKEMLEIWFNSIKKVGIPNYLVVALDDEIADFCESNDV 212
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
P + PD + G G + S H+L L+LGY+V+ +DVD+V+L++PF
Sbjct: 213 P----VYKRDPD-EGIDSIGKTGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPF 267
Query: 201 PYLQGDHDVY-FTDDMAAVKPLDHSHDLPPPGKKGRTY--------ICSCMIYLRPTDGA 251
YL D DV TD + + P Y S Y+RPT +
Sbjct: 268 DYLYRDSDVESMTDGHNNLTAYGYDDVFDEPAMGWARYAHTMRIWVFNSGFFYIRPTIPS 327
Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
++ + L E K DQ FN L
Sbjct: 328 IELLDRVANRLAHE------KVWDQAVFNEEL 353
>gi|357142041|ref|XP_003572439.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 364
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 31/140 (22%), Positives = 59/140 (42%), Gaps = 11/140 (7%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+IV ++++ Y FL ++ + D VL++A D +
Sbjct: 95 EDKTVIVTSINRAYAAPGSLLDLFLESFRLGEGTAALLDHVLIVAVDPGAFRRCRAVHR- 153
Query: 144 HAVLV---PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
H L+ P A D F ++ + + R ILELG+N ++ D+D++W ++P
Sbjct: 154 HCYLLRQSPSAVDYSGEKHFMTKDYLDMMWSRNRFQQTILELGFNFLFTDIDIMWFRNPL 213
Query: 201 PYLQGDHDVYFTDDMAAVKP 220
+ D+ D P
Sbjct: 214 RRIAITSDIAIASDFFDGDP 233
>gi|255073125|ref|XP_002500237.1| hypothetical protein MICPUN_107680 [Micromonas sp. RCC299]
gi|226515499|gb|ACO61495.1| hypothetical protein MICPUN_107680 [Micromonas sp. RCC299]
Length = 439
Score = 44.7 bits (104), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 92/222 (41%), Gaps = 31/222 (13%)
Query: 68 PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYL-------PFLNNWLISISRQKH 120
P AT A+ D TL +A +G ++ AVS L L WL + R K
Sbjct: 134 PAATG--AEKGDETLKAILEEIAPDGEVL-AAVSNKALINEKGDYGMLRTWLDGVQRSKV 190
Query: 121 QDQ-VLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-H 178
++ V+ + E A K G H A +T H +Q F H+L
Sbjct: 191 KNYLVICLDETVAGTMKKLGVPYWHRERKALADGDETNHGISAQKF---------HILRE 241
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL---DHSHDLPPPG---- 231
L LGY+V+ +DVD+V L +PF +L D DV D + + D P G
Sbjct: 242 FLVLGYSVLLSDVDIVTLDNPFDHLYRDSDVEGLSDGYDERTAYGWNDGIDDPKMGWARY 301
Query: 232 -KKGRTY-ICSCMIYLRPTDGAKLVMKKWIEELQ-AEPWSKA 270
+ R + + S + YL+P+D M L+ A+ W +A
Sbjct: 302 AQTMRVFAMNSGLFYLKPSDRTVQFMDGITARLERAKEWDQA 343
>gi|356529763|ref|XP_003533457.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 349
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 78/201 (38%), Gaps = 31/201 (15%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGR--- 140
K+ T+I+ ++ + FL ++ + +K + ++VI D K + R
Sbjct: 85 KDKTVIITTLNDAWAEPGSIFDLFLESFRLGNQTKKFLNHLVVITWDQ----KAHARCLA 140
Query: 141 WPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
H V D+ T F + + + RR L +L++GYN ++ D D++WL+DP
Sbjct: 141 LHKHCYQVETKGDNFTGEAFFMTADYLHMMWRRIEFLGTVLDMGYNFVFTDTDIMWLRDP 200
Query: 200 FPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWI 259
F D D D +S+DL G Y+ + L K W
Sbjct: 201 FKQFYKDTDFQIACDFFN----GNSYDLNNHPNGGFNYV-------KSNKRTILFYKFWF 249
Query: 260 EELQAEPWSKAKKANDQPAFN 280
A P K +DQ N
Sbjct: 250 NSRNAYP-----KLHDQDVLN 265
>gi|223975905|gb|ACN32140.1| unknown [Zea mays]
gi|414884273|tpg|DAA60287.1| TPA: regulatory protein [Zea mays]
Length = 443
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 7/117 (5%)
Query: 94 TIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
T+++ A+++ + FL+ +L S + + +LV+A D L + N P +
Sbjct: 183 TVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGLPRHLLVVAMDGRALERCNAVHPFCYL 242
Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
D K+ + RR +LELGY+ ++ DVD++W + PFP L
Sbjct: 243 FRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVLELGYSFLFTDVDILWFRSPFPRL 299
>gi|452824018|gb|EME31024.1| hypothetical protein Gasu_17840 [Galdieria sulphuraria]
Length = 376
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 46/235 (19%), Positives = 106/235 (45%), Gaps = 33/235 (14%)
Query: 72 SSLAKWRDYTLSQAASFVAKNGT---IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIA 128
SS K++++ Q SF NG+ ++ ++ Y+P +N ++ +S + ++ ++
Sbjct: 31 SSTDKFQEFL--QRHSFATLNGSRKWVVATLANEAYIPLVNLFVGRLSTLQLENLIVFCI 88
Query: 129 EDYATLYKVNGRWPGHAV--LVPPA--PDSQTAHKF------------GSQGFFNFTSRR 172
+ Y Y V P V L+PP+ P S + F G+ F + T +
Sbjct: 89 DPYIYEYCVFHHIPAWKVTDLIPPSCLPFSFWQNIFQNINHRRAYPAGGNIEFISLTQLK 148
Query: 173 PCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGK 232
++ +++++D D+VW+++P PYLQ ++ D+ +SH +
Sbjct: 149 YLVFYSVISYNVDILFSDPDVVWIQNPIPYLQQKRSLHV--DIFIQTDRKYSH------Q 200
Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA 287
+Y+ + +Y+ +L+++ ++++ + + + Q +FN L +T
Sbjct: 201 SLFSYMNTGFVYIHSHCATQLLLRIMMQQM----YQQYPIISQQRSFNRVLCRTG 251
>gi|255083186|ref|XP_002504579.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
gi|226519847|gb|ACO65837.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
Length = 779
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 61/143 (42%), Gaps = 24/143 (16%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW-PGHAVLV 148
A T+ C + L FL NWL SR + ++ VLVIA D K RW + V
Sbjct: 165 AVGDTVSACFATIEMLDFLVNWLEHASRLEMRN-VLVIAMD-----KHTARWCDENGVAR 218
Query: 149 PPAPDSQTAHKFG-------------SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
A D+ + ++GF + + +L++G +V +DVD+VW
Sbjct: 219 MDASDAIDKSEMNDPRVEVADVGYRMTRGFNLLGEAKTASIAKLLDMGLDVFLSDVDVVW 278
Query: 196 LKDPFPYLQGDH----DVYFTDD 214
L++P Y + DV T D
Sbjct: 279 LRNPSDYFESGQLALADVAVTSD 301
>gi|229914876|gb|ACQ90601.1| putative Myb DNA binding protein [Eutrema halophilum]
Length = 397
Score = 44.3 bits (103), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 36/167 (21%), Positives = 69/167 (41%), Gaps = 14/167 (8%)
Query: 81 TLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYAT 133
L +AA+ ++GT+I+ +++ + F ++ I + +++IA D
Sbjct: 101 VLRRAAT---RDGTVILTTLNEAWAAPGSVIDLFFESFGIGEGTSMLLNHLVIIALDAKA 157
Query: 134 LYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
+ L D F ++ + RR L +LE+GYN ++ D D+
Sbjct: 158 YSRCRELHKHCFSLETEGVDFSGEAYFMTRSYLKMMWRRIDFLRSVLEMGYNFVFTDADV 217
Query: 194 VWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
+W ++PFP D +A L S+DL G +++ S
Sbjct: 218 MWFRNPFPRFYRSADF----QIACDHYLGRSNDLQNRPNGGFSFVRS 260
>gi|147790757|emb|CAN65937.1| hypothetical protein VITISV_008966 [Vitis vinifera]
Length = 546
Score = 44.3 bits (103), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 29/44 (65%)
Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
FG++ F T + +L IL+LGYNV+ +DVD+ W K+P P L
Sbjct: 337 FGTKCFQRVTKSKSRLVLQILKLGYNVLMSDVDVYWFKNPLPLL 380
>gi|413936404|gb|AFW70955.1| hypothetical protein ZEAMMB73_889446 [Zea mays]
Length = 334
Score = 44.3 bits (103), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R +LELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 137 DYRSEKVFMSKDYLEMMWGRNRFQQTVLELGYNSLFTDVDVMWFRDPFRHISMAADIAIS 196
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
D+ P + P G +++R +D + W Q W K
Sbjct: 197 SDVYMGDPYS-LRNFPNGG----------FLFVRASDKTVAFYRAW----QQGRWRFLGK 241
Query: 273 ANDQPAFN 280
++Q FN
Sbjct: 242 -HEQDVFN 248
>gi|326490027|dbj|BAJ94087.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 379
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
ILELGYN ++ DVD++W +DPF ++ D+ + D+ P
Sbjct: 208 ILELGYNFLFTDVDVMWFRDPFKHISMGADIAISSDVFIGDP 249
>gi|219362841|ref|NP_001136698.1| uncharacterized protein LOC100216832 [Zea mays]
gi|194690508|gb|ACF79338.1| unknown [Zea mays]
gi|194696684|gb|ACF82426.1| unknown [Zea mays]
gi|414864584|tpg|DAA43141.1| TPA: hypothetical protein ZEAMMB73_234362 [Zea mays]
Length = 368
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 27/116 (23%), Positives = 48/116 (41%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ S ++ +++++ D A + LV D F + G+
Sbjct: 118 FLESFRSGESTRELLQHLVIVSLDTAAHARCGQVHRHCFALVTDGVDFSGQKNFMTDGYL 177
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLD 222
RR L +LE G++ ++ D D+VW ++P P+ D D D P D
Sbjct: 178 RMMWRRVDFLREVLEKGFSFVFTDTDIVWFRNPLPHFYPDGDFQIACDHFTGDPSD 233
>gi|255081588|ref|XP_002508016.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
gi|226523292|gb|ACO69274.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
Length = 843
Score = 44.3 bits (103), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
D +AH S+ + F R + +L +G++V+ +DVD+VW KDP P+LQ +D
Sbjct: 252 DGSSAHA--SKSWKAFARLRISQVSALLRMGFDVLMSDVDVVWTKDPRPFLQCGYD 305
>gi|223947351|gb|ACN27759.1| unknown [Zea mays]
Length = 407
Score = 44.3 bits (103), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 7/117 (5%)
Query: 94 TIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
T+++ A+++ + FL+ +L S + + +LV+A D L + N P +
Sbjct: 147 TVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGLPRHLLVVAMDGRALERCNAVHPFCYL 206
Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
D K+ + RR +LELGY+ ++ DVD++W + PFP L
Sbjct: 207 FRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVLELGYSFLFTDVDILWFRSPFPRL 263
>gi|413932428|gb|AFW66979.1| putative NAC domain transcription factor superfamily protein [Zea
mays]
Length = 306
Score = 44.3 bits (103), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R +LELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 109 DYRSEKVFMSKDYLEMMWGRNRFQQTVLELGYNFLFTDVDVMWFRDPFRHISMAADIAIS 168
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
D+ P + P G +++R +D + W Q W K
Sbjct: 169 SDVYMGDPYS-LRNFPNGG----------FLFVRASDKTVAFYRAW----QQGRWRFLGK 213
Query: 273 ANDQPAFN 280
++Q FN
Sbjct: 214 -HEQDVFN 220
>gi|219886249|gb|ACL53499.1| unknown [Zea mays]
Length = 277
Score = 44.3 bits (103), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 41/199 (20%), Positives = 79/199 (39%), Gaps = 25/199 (12%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+I+ +V++ + FL ++ + + V+V+A D + P
Sbjct: 8 EDRTVIMTSVNEAWAAPGSLLDSFLESFQVGENVSHFAKHVVVVAMDDGAFRRCQAVHPH 67
Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
+L P D A + ++ + + + +LELGYN+++ DVD+ W ++P
Sbjct: 68 CHLLRPEKAGLDLSGAKSYMTKDYLDLVWSKLKLQQRVLELGYNLLFTDVDVAWFRNPLV 127
Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
++ D+ + D P D + P G IY + T M W
Sbjct: 128 HITMAADITTSSDFYFGNP-DDLGNFPNTG----------FIYFKATPRNARAMAYWHAA 176
Query: 262 LQAEPWSKAKKANDQPAFN 280
+ P + +DQ FN
Sbjct: 177 RRRFP-----ENHDQFVFN 190
>gi|357114228|ref|XP_003558902.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 376
Score = 44.3 bits (103), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 50/113 (44%), Gaps = 5/113 (4%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ I + + D +++++ D A + + D F + G+
Sbjct: 129 FLESFRIGDNTRHLLDHLVIVSLDNAAHRRCKQIHAHCLAVATDGVDFSGQKNFMTDGYL 188
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP--YLQGDHDV---YFTDD 214
RR L +LE G++ ++ D D+VW + P P Y +GD + +FT D
Sbjct: 189 KMMWRRIDFLRQVLEKGFSFIFTDTDIVWFRSPIPRLYAEGDFQIACDHFTGD 241
>gi|218189627|gb|EEC72054.1| hypothetical protein OsI_04963 [Oryza sativa Indica Group]
Length = 367
Score = 44.3 bits (103), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 71/175 (40%), Gaps = 26/175 (14%)
Query: 95 IIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVL 147
II+ +V++ + FL + + D +L++A D + P H
Sbjct: 105 IIMTSVNEAWAAPGSLLDLFLEGFRAGEGIARFVDHLLIVALDDGAFRRCRDVHP-HCYR 163
Query: 148 VPPAPDSQTAHK-FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD 206
+ A + T K F S+ + + + ILELGYN ++ DVD++W +DPF +
Sbjct: 164 LAVAGRNFTDEKVFMSEDYLDLVWSKVKLQQRILELGYNFLFTDVDILWFRDPFEQMSMA 223
Query: 207 HDVYFTDDM---AAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
+ + D A P + P G +Y+R + A VM+ W
Sbjct: 224 AHMVTSSDFFVGGAYNPA----NFPNTG----------FLYVRSSRRAVGVMEAW 264
>gi|388506682|gb|AFK41407.1| unknown [Lotus japonicus]
Length = 431
Score = 44.3 bits (103), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 90/212 (42%), Gaps = 21/212 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ VA N +IV + L W +I + + ++V +D + + +
Sbjct: 153 LAKILEKVAVNRELIVALANTNVKEMLEVWFTNIKKVGIPNYLVVALDDNIAEFCESNQV 212
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
P + DS G +G + S H+L L+LGY+V+ +DVD+V+L++PF
Sbjct: 213 PVYKRDPDDGIDS-----IGKEGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPF 267
Query: 201 PYLQGDHDVY-FTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
+L D DV +D D + D P G R ++ S Y+RPT +
Sbjct: 268 DHLYRDSDVESMSDGHDNNTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPS 327
Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
++ + L E KA DQ FN L
Sbjct: 328 IELLDRVATRLSKE------KAWDQAVFNEEL 353
>gi|356534414|ref|XP_003535750.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 364
Score = 44.3 bits (103), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 47/199 (23%), Positives = 82/199 (41%), Gaps = 22/199 (11%)
Query: 23 RNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVI---------LPWTGTPGF--MFPNAT 71
R S T +R + T V LS LV+LG + GF +FP+
Sbjct: 3 RESKTIHLRRAVAASFFFAT--VSLSCLVLLGDVDSHRFLSSFHSSYSLSGFTRIFPSVY 60
Query: 72 SS-LAKWRDYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQD 122
+ +A +Y L + + A K+ T+I+ +++ + FL ++ I +
Sbjct: 61 NDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRTSTFLN 120
Query: 123 QVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILEL 182
+++IA D + LV D F + + +R L +LE+
Sbjct: 121 HLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLRTVLEM 180
Query: 183 GYNVMYNDVDMVWLKDPFP 201
GYN ++ D D++W +DPFP
Sbjct: 181 GYNFVFTDADIMWFRDPFP 199
>gi|302755907|ref|XP_002961377.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300170036|gb|EFJ36637.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 553
Score = 44.3 bits (103), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 4/110 (3%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
+++ VS Y L +W+ + R + LV D +Y+ G G V + S
Sbjct: 280 VVLTVVSHSYRDMLMSWVCRL-RHLNVTNYLVATID-KEMYQF-GILQGLPVFRTESGRS 336
Query: 155 QTAH-KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+ FGS F T + +L ILELGY+V+++DVD+ W P L
Sbjct: 337 DSKDCTFGSSCFKTVTKSKSRTVLRILELGYSVLFSDVDVYWFSSPIQEL 386
>gi|242037413|ref|XP_002466101.1| hypothetical protein SORBIDRAFT_01g001260 [Sorghum bicolor]
gi|241919955|gb|EER93099.1| hypothetical protein SORBIDRAFT_01g001260 [Sorghum bicolor]
Length = 371
Score = 44.3 bits (103), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 35/68 (51%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R +LELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 174 DYRSEKMFMSKDYLEMMWGRNRFQQTVLELGYNFLFTDVDVMWFRDPFRHISMAADIAIS 233
Query: 213 DDMAAVKP 220
D+ P
Sbjct: 234 SDVYIGDP 241
>gi|242037411|ref|XP_002466100.1| hypothetical protein SORBIDRAFT_01g001250 [Sorghum bicolor]
gi|241919954|gb|EER93098.1| hypothetical protein SORBIDRAFT_01g001250 [Sorghum bicolor]
Length = 423
Score = 44.3 bits (103), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 45/208 (21%), Positives = 86/208 (41%), Gaps = 28/208 (13%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
+S+AA+ A + T+IV V++ + FL ++ + VL++A D A +
Sbjct: 151 VSRAAT--ADDRTVIVTCVNEAWAAPGSLLDLFLESFRVGDGTAPLLRHVLIVAMDPAAM 208
Query: 135 YKVNGRWPGHAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
+ H P P D +A F S+ + + +L+LGY ++ DVD
Sbjct: 209 ARCR-TLHRHCYHYAPLPGVDFASAKFFLSKDYLELVWSKLKLQRRVLQLGYTFVFTDVD 267
Query: 193 MVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAK 252
++W ++P ++ D+ + D+ D+ + P G +++P +
Sbjct: 268 VLWFRNPLKHVTAYADMSVSSDV-FFGDADNVDNFPNTG----------FFHVKPNNRTV 316
Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFN 280
+ W E + P N+QP FN
Sbjct: 317 AMTAAWHEARERFPGK-----NEQPVFN 339
>gi|148906537|gb|ABR16421.1| unknown [Picea sitchensis]
Length = 432
Score = 44.3 bits (103), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 91/219 (41%), Gaps = 34/219 (15%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED-YATLYKVNG- 139
L++ +A +IV + L W SI R + ++V +D A K G
Sbjct: 148 LAELLQKIAVKKELIVGLANYNVKEMLEVWSDSIKRAGITNYLVVALDDSVAEFCKSRGV 207
Query: 140 ----RWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMV 194
R P A+ S+T K G + S HLL L+LGY+V+ +DVD+V
Sbjct: 208 PVYRRDPADAI-------SKTVGKTGDN---HAISGLKFHLLREFLQLGYSVLLSDVDIV 257
Query: 195 WLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPGKKGRTYIC------SCMIY 244
+L++PF +L D DV D M A D S D P G + S Y
Sbjct: 258 YLQNPFNFLYRDCDVESMTDGYSNMTAYGYDDVSDD-PSMGWSRYAHTMRIWVFNSGFFY 316
Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
+RPT + ++ + ++ L E A DQ FN L
Sbjct: 317 IRPTIPSIELLDRVVDRLSKE------NAWDQAVFNELL 349
>gi|356561843|ref|XP_003549186.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 357
Score = 44.3 bits (103), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 43/177 (24%), Positives = 74/177 (41%), Gaps = 14/177 (7%)
Query: 45 VLLSLLVVLGVILPWTGTPGFMFP------NATSSLAKWRDYTLSQAASFVAKNGTIIVC 98
+ L+ L VL + L + +P F FP +A SS A + S A+ K+ T+I+
Sbjct: 37 MFLAGLAVLWMFLYNSASP-FEFPTSSDYFSAESSKADYDPKLESVLANASMKDKTVIIT 95
Query: 99 AVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
++ + FL ++ + Q + ++ I D T + L
Sbjct: 96 ILNDAWAEPGSMFDLFLESFRLGNETQWLLNHLVAITWDQKTYARCLAMHKHCYQLGTKG 155
Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
+ F + + RR L +LE+GYN ++ D D++WL+DPF D D
Sbjct: 156 GNFTGEVFFMAPNYLQMMWRRTEFLGSVLEMGYNFVFTDTDIMWLRDPFKIFYKDAD 212
>gi|242055413|ref|XP_002456852.1| hypothetical protein SORBIDRAFT_03g044060 [Sorghum bicolor]
gi|241928827|gb|EES01972.1| hypothetical protein SORBIDRAFT_03g044060 [Sorghum bicolor]
Length = 378
Score = 43.9 bits (102), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 35/172 (20%), Positives = 71/172 (41%), Gaps = 13/172 (7%)
Query: 64 GFMFPNATSSLAKWRDYTLSQAASFVAK----NGTIIVCAVSQPYLP-------FLNNWL 112
G P A + A D + A + + + T+I+ +V++ + FL ++
Sbjct: 79 GVRAPPANEAAASESDDDADEFARMLRRAAMEDRTVIMTSVNEAWAAPGSLLDSFLESFR 138
Query: 113 ISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAP--DSQTAHKFGSQGFFNFTS 170
+ + ++V+A D + P +L P D A + ++ + +
Sbjct: 139 VGENVSHFVKHIVVVAMDDGAFRRCQAVHPHCHLLRPEKEGLDLSGAKSYMTKDYLDLVW 198
Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLD 222
+ ILELGYN+++ DVD+ W ++P ++ D+ + D P D
Sbjct: 199 SKLRLQQRILELGYNLLFTDVDLAWFRNPLVHITMAADITTSSDFYFGNPDD 250
>gi|302798270|ref|XP_002980895.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300151434|gb|EFJ18080.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 553
Score = 43.9 bits (102), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 4/110 (3%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
+++ VS Y L +W+ + R + LV D +Y+ G G V + S
Sbjct: 280 VVLTVVSHSYRDMLMSWVCRL-RHLNVTNYLVATID-KEMYQF-GILQGLPVFRTESGRS 336
Query: 155 QTAH-KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+ FGS F T + +L ILELGY+V+++DVD+ W P L
Sbjct: 337 DSKDCTFGSSCFKTVTKSKSRTVLRILELGYSVLFSDVDVYWFSSPIREL 386
>gi|42571685|ref|NP_973933.1| nucleotide-diphospho-sugar transferase-like protein [Arabidopsis
thaliana]
gi|122213725|sp|Q3E6Y3.1|Y1869_ARATH RecName: Full=Uncharacterized protein At1g28695; Flags: Precursor
gi|332192894|gb|AEE31015.1| nucleotide-diphospho-sugar transferase-like protein [Arabidopsis
thaliana]
Length = 329
Score = 43.9 bits (102), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 62/150 (41%), Gaps = 15/150 (10%)
Query: 84 QAASFVA---KNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAE 129
+AA + A N T+I+ V++ Y+ FL ++ D ++V+A
Sbjct: 46 EAALYTAAAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAV 105
Query: 130 DYATLYKVN-GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMY 188
D + R + + D + F S+ F RR +L +L GYNV++
Sbjct: 106 DQTAYDRCRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIF 165
Query: 189 NDVDMVWLKDPFPYLQGDHDVYFTDDMAAV 218
D D++WL+ P L D+ + D V
Sbjct: 166 TDTDVMWLRSPLSRLNMSLDMQISVDRINV 195
>gi|168052834|ref|XP_001778844.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162669713|gb|EDQ56294.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 43.9 bits (102), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 85/212 (40%), Gaps = 22/212 (10%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
LS VA NG +IV + + W SI R + ++V +D + +
Sbjct: 17 LSALLKKVAVNGELIVGISNNNVRDMVQIWFESIKRVGVTNYLVVALDDEIASFCQDHDV 76
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
P + D+ + G + S HLL L LGY+++ +DVD+V+L++PF
Sbjct: 77 PVYR------RDATISKSQAGTGANHAISGLKFHLLREFLVLGYSILLSDVDIVYLQNPF 130
Query: 201 PYLQGDHDVY-FTDDMAAVKPLDHSHDLPPPGKKGRTY--------ICSCMIYLRPTDGA 251
+L D DV +D + + P Y S + Y+RPT +
Sbjct: 131 NHLHRDCDVESMSDGFDNTTAYGYDDVMTDPSMGWSRYAHTMRIWVFNSGLFYIRPTVPS 190
Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
++ + +L E KA DQ FN L
Sbjct: 191 IELLDRVTAKLTKE------KAWDQAVFNEEL 216
>gi|357121351|ref|XP_003562384.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 378
Score = 43.9 bits (102), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 35/68 (51%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R I+ELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 181 DYRSEKMFMSKDYLEMMWGRNKFQQTIVELGYNFLFTDVDVMWFRDPFKHISMGADIAIS 240
Query: 213 DDMAAVKP 220
D+ P
Sbjct: 241 SDVFIGDP 248
>gi|413936405|gb|AFW70956.1| hypothetical protein ZEAMMB73_889446 [Zea mays]
Length = 306
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 31/128 (24%), Positives = 54/128 (42%), Gaps = 16/128 (12%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D ++ F S+ + R +LELGYN ++ DVD++W +DPF ++ D+ +
Sbjct: 109 DYRSEKVFMSKDYLEMMWGRNRFQQTVLELGYNSLFTDVDVMWFRDPFRHISMAADIAIS 168
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
D+ P + P G +++R +D + W Q W K
Sbjct: 169 SDVYMGDPYS-LRNFPNGG----------FLFVRASDKTVAFYRAW----QQGRWRFLGK 213
Query: 273 ANDQPAFN 280
++Q FN
Sbjct: 214 -HEQDVFN 220
>gi|449460119|ref|XP_004147793.1| PREDICTED: uncharacterized protein LOC101205922 [Cucumis sativus]
gi|449476774|ref|XP_004154830.1| PREDICTED: uncharacterized LOC101205922 [Cucumis sativus]
Length = 427
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 61/134 (45%), Gaps = 6/134 (4%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ VA +IV + L W +I + Q+ ++V ++ + ++
Sbjct: 145 LAKLLEKVAIQKELIVTLANSNVKSMLEVWFTTIQKVGIQNYLVVALDNQTEEFCIS--- 201
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
H V V K G +G + S +L L+LGY+V+ +DVD+V+L++PF
Sbjct: 202 --HEVPVYKRDPDNNIDKVGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPF 259
Query: 201 PYLQGDHDVYFTDD 214
+L D DV D
Sbjct: 260 DHLYRDSDVESMSD 273
>gi|115471611|ref|NP_001059404.1| Os07g0294800 [Oryza sativa Japonica Group]
gi|28564589|dbj|BAC57756.1| putative regulatory protein [Oryza sativa Japonica Group]
gi|50509709|dbj|BAD31747.1| putative regulatory protein [Oryza sativa Japonica Group]
gi|113610940|dbj|BAF21318.1| Os07g0294800 [Oryza sativa Japonica Group]
gi|215686845|dbj|BAG89695.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 406
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 11/88 (12%)
Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP 230
RR ILELG++ ++ DVD++W + PFP+L D V + D P P
Sbjct: 227 RRNRFQQTILELGFSFLFTDVDILWFRSPFPHLSPDAQVVMSSDFFVGDPTS-------P 279
Query: 231 GKKGRTYICSCMIYLRPTDGAKLVMKKW 258
G Y ++Y+R + + W
Sbjct: 280 G----NYPNGGLLYVRSSASTVRFYEHW 303
>gi|443696457|gb|ELT97152.1| hypothetical protein CAPTEDRAFT_193488 [Capitella teleta]
Length = 530
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 59/122 (48%), Gaps = 17/122 (13%)
Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL 221
S+ F N + R +L L+LGY+V+++DVD+V+ KDP + D+ K
Sbjct: 174 SKDFINKMNIRTYMILEALQLGYHVIHSDVDVVFFKDPTERIF---------DLCHFKDP 224
Query: 222 DHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNW 281
+ + D+ P G + +++R ++ + + KK E +K K +DQ A N
Sbjct: 225 EKACDVAPLWDSGAHN--AGFLFIRNSNASISLYKKM------EHTAKTTKIDDQKALNR 276
Query: 282 AL 283
A+
Sbjct: 277 AM 278
>gi|413923500|gb|AFW63432.1| hypothetical protein ZEAMMB73_690950 [Zea mays]
Length = 390
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 46/102 (45%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
F++++ + ++++A D+ P L D +F + G+
Sbjct: 141 FIDSFRRGVGTSSLLRHLVIVAFDFKAYEHCVKIHPYCYALPTKDVDFSEEKRFQTTGYL 200
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
+R L +LE GY+ +++D D+VW ++PFP+ D D
Sbjct: 201 EMMWKRLDFLRLVLEKGYSFVFSDADIVWFRNPFPHFYTDGD 242
>gi|224125690|ref|XP_002329694.1| predicted protein [Populus trichocarpa]
gi|222870602|gb|EEF07733.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 58/134 (43%), Gaps = 4/134 (2%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ I ++ + +++I+ D + P L + F S+ +
Sbjct: 113 FLESFQIGEKTKELLNNLVIISLDQKAHARCLAIHPHCYALKTEGLNFTREASFMSEDYL 172
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
RR L +LE+GY+ ++ D D++WL++PFP D D P D S+
Sbjct: 173 KMMWRRIEFLNTVLEMGYSFVFTDADIMWLRNPFPRFYPRVDFQIACDKYYGNPKDKSNR 232
Query: 227 LPPPGKKGRTYICS 240
P G G TY+ S
Sbjct: 233 --PNG--GFTYVRS 242
>gi|50198979|gb|AAT70491.1| At1g14590 [Arabidopsis thaliana]
Length = 354
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 33/139 (23%), Positives = 63/139 (45%), Gaps = 14/139 (10%)
Query: 81 TLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYAT 133
LS+AA+ ++ T+++ ++ + F ++ I + D ++++A D A
Sbjct: 56 VLSKAAT---RDRTVVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLVIVALD-AK 111
Query: 134 LYKVNGRWPGHAV-LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
Y H LV D F ++ + RR L +LE+GYN ++ D D
Sbjct: 112 AYSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYNFVFTDAD 171
Query: 193 MVWLKDPFP--YLQGDHDV 209
++W ++PFP Y+ D +
Sbjct: 172 VMWFRNPFPRFYMYADFQI 190
>gi|224067546|ref|XP_002302503.1| predicted protein [Populus trichocarpa]
gi|222844229|gb|EEE81776.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 27/44 (61%)
Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
F SQ F RR LL +L+ GYN ++ D D++WL++PF L
Sbjct: 104 FMSQDFLKMMWRRTLLLLDVLKHGYNFIFTDTDVMWLRNPFSRL 147
>gi|255536925|ref|XP_002509529.1| conserved hypothetical protein [Ricinus communis]
gi|223549428|gb|EEF50916.1| conserved hypothetical protein [Ricinus communis]
Length = 254
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 5/64 (7%)
Query: 147 LVPPAPDSQTAHKFGSQG--FFNFTS---RRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
L+ A D+ T H S F+F S RR L +L+LGYN++Y DVD++WL++P
Sbjct: 43 LIIIALDTHTLHYCQSIRPHCFHFKSTGQRRKELFLQVLQLGYNLVYTDVDVMWLRNPMS 102
Query: 202 YLQG 205
G
Sbjct: 103 LFDG 106
>gi|384247083|gb|EIE20571.1| hypothetical protein COCSUDRAFT_37807 [Coccomyxa subellipsoidea
C-169]
Length = 279
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 17/111 (15%)
Query: 180 LELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG---- 231
LELG+NV+ +DVD++ ++DPF +L DHD+ DD A ++ D P G
Sbjct: 88 LELGWNVLLSDVDIIVVQDPFQHLHRDHDIEGMSDGFDDATAYGNINGLDD-PAMGWSRY 146
Query: 232 KKGRTYI--CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+G T++ S + Y++ +MK+ L E KA DQ FN
Sbjct: 147 AQGTTHLNMNSGLFYIQANVRTVDLMKRVAARLAKE------KAWDQSVFN 191
>gi|326525413|dbj|BAJ93085.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 357
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/141 (20%), Positives = 64/141 (45%), Gaps = 9/141 (6%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+I+ +V++ + FL ++ + ++ ++V+A D L + P
Sbjct: 87 EDRTVIMTSVNEAWAAEGSLLDSFLESFRVGLNISHLVKHIVVVAMDEGALRRCRAVHPH 146
Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
+L+P D A + ++ + + + +L LGYN+++ DVD+ W +DP
Sbjct: 147 CHLLLPDVDGLDLSGAKSYMTKDYLDLVWSKLRLQHRVLLLGYNLLFTDVDVAWFRDPRV 206
Query: 202 YLQGDHDVYFTDDMAAVKPLD 222
++ D+ + D P D
Sbjct: 207 HITAAADITTSSDFYFGDPDD 227
>gi|223942815|gb|ACN25491.1| unknown [Zea mays]
gi|413951495|gb|AFW84144.1| hypothetical protein ZEAMMB73_902211 [Zea mays]
Length = 378
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 41/199 (20%), Positives = 79/199 (39%), Gaps = 25/199 (12%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
++ T+I+ +V++ + FL ++ + + V+V+A D + P
Sbjct: 109 EDRTVIMTSVNEAWAAPGSLLDSFLESFQVGENVSHFAKHVVVVAMDDGAFRRCQAVHPH 168
Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
+L P D A + ++ + + + +LELGYN+++ DVD+ W ++P
Sbjct: 169 CHLLRPEKAGLDLSGAKSYMTKDYLDLVWSKLKLQQRVLELGYNLLFTDVDVAWFRNPLV 228
Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
++ D+ + D P D + P G IY + T M W
Sbjct: 229 HITMAADITTSSDFYFGNP-DDLGNFPNTG----------FIYFKATPRNARAMAYWHAA 277
Query: 262 LQAEPWSKAKKANDQPAFN 280
+ P + +DQ FN
Sbjct: 278 RRRFP-----ENHDQFVFN 291
>gi|224136554|ref|XP_002326889.1| predicted protein [Populus trichocarpa]
gi|222835204|gb|EEE73639.1| predicted protein [Populus trichocarpa]
Length = 419
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 89/212 (41%), Gaps = 20/212 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ VA +IV + L W +I + ++ ++V +D+ + +
Sbjct: 135 LAKLLEEVAVRKELIVALANSNVKTMLEVWFANIKKAGIRNYLVVALDDHIVDFCKSNDV 194
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
P + PDS + G + + L L+LGY+V+ +DVD+++L++PF
Sbjct: 195 P----VYKRDPDSGIDSVARTGGNHAVSGLKFRILREFLQLGYSVLLSDVDIIYLQNPFD 250
Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
+L D DV D+M A D + P G R ++ S Y+RPT +
Sbjct: 251 HLYRDSDVESMSDGHDNMTAYG-FDDVFNEPAMGWARYAHTMRIWVYNSGFFYIRPTLPS 309
Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
++ + L EP S DQ FN L
Sbjct: 310 IELLDRVAGRLSREPNSW-----DQAVFNEEL 336
>gi|242082105|ref|XP_002445821.1| hypothetical protein SORBIDRAFT_07g026310 [Sorghum bicolor]
gi|241942171|gb|EES15316.1| hypothetical protein SORBIDRAFT_07g026310 [Sorghum bicolor]
Length = 375
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 11/88 (12%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELG++ ++ DVD++W ++P ++ DV D P D H+LP G
Sbjct: 203 ILELGFSFLFTDVDILWFRNPMRHIAVTSDVAIASDYFNGNP-DSLHNLPNGG------- 254
Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEP 266
+Y+R + ++W E P
Sbjct: 255 ---FLYVRSMNRTVEFYRRWREARAGFP 279
>gi|116309821|emb|CAH66858.1| H0307D04.3 [Oryza sativa Indica Group]
Length = 362
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 47/102 (46%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
F++++ + + +++IA D+ + P L D +F + G+
Sbjct: 113 FIDSFRSGVRTSSLLNHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
+R L +LE G++ +++D D++W + PFPY D D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGD 214
>gi|6691205|gb|AAF24543.1|AC007508_6 F1K23.11 [Arabidopsis thaliana]
Length = 841
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/146 (23%), Positives = 61/146 (41%), Gaps = 15/146 (10%)
Query: 84 QAASFVA---KNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAE 129
+AA + A N T+I+ V++ Y+ FL ++ D ++V+A
Sbjct: 46 EAALYTAAAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAV 105
Query: 130 DYATLYKVN-GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMY 188
D + R + + D + F S+ F RR +L +L GYNV++
Sbjct: 106 DQTAYDRCRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIF 165
Query: 189 NDVDMVWLKDPFPYLQGDHDVYFTDD 214
D D++WL+ P L D+ + D
Sbjct: 166 TDTDVMWLRSPLSRLNMSLDMQISVD 191
>gi|115460156|ref|NP_001053678.1| Os04g0585400 [Oryza sativa Japonica Group]
gi|38344278|emb|CAE03761.2| OSJNBa0013K16.10 [Oryza sativa Japonica Group]
gi|113565249|dbj|BAF15592.1| Os04g0585400 [Oryza sativa Japonica Group]
Length = 362
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 47/102 (46%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
F++++ + + +++IA D+ + P L D +F + G+
Sbjct: 113 FIDSFRSGVRTSSLINHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
+R L +LE G++ +++D D++W + PFPY D D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGD 214
>gi|443704245|gb|ELU01390.1| hypothetical protein CAPTEDRAFT_211846 [Capitella teleta]
Length = 528
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 56/258 (21%), Positives = 112/258 (43%), Gaps = 37/258 (14%)
Query: 35 LLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGT 94
+ V+N ++ + V VI P T S + + SQA ++
Sbjct: 50 IAVMNTEAKAEVMKTVEVTAVISVTAEVNAEKIPKQTFSFTSYSEALSSQAD---GESNV 106
Query: 95 IIVCAVSQPYLPF-LNNWLISISRQKHQDQVLVIA-EDYATLYKVNGRWPGHAVLVPPAP 152
+ + P +N +L SI + ++ + V + E++ + ++ + + A
Sbjct: 107 VFLAFADMGGFPMAMNFYLTSIQKYDIRNVLFVSSSEEFCSRFQA-IQVACFVYMNESAH 165
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV-YF 211
D + + S+ F N + R +L L+LGY+V+++DVD+V+ +DP + D+ YF
Sbjct: 166 DKTSV--YLSEDFINKMNIRTYMILEALQLGYHVIHSDVDVVFFRDPTERIL---DLCYF 220
Query: 212 TD-----DMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
D D+AA+ D +H+ + +++R ++ + + KK +
Sbjct: 221 KDPQKVCDVAALWDFD-AHN-------------AGFLFIRKSNASISMYKKM------KH 260
Query: 267 WSKAKKANDQPAFNWALN 284
+K K +DQ A N A+N
Sbjct: 261 TAKTTKIDDQMALNGAMN 278
>gi|125549492|gb|EAY95314.1| hypothetical protein OsI_17140 [Oryza sativa Indica Group]
Length = 357
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 47/102 (46%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
F++++ + + +++IA D+ + P L D +F + G+
Sbjct: 113 FIDSFRSGVRTSSLLNHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
+R L +LE G++ +++D D++W + PFPY D D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGD 214
>gi|125591429|gb|EAZ31779.1| hypothetical protein OsJ_15931 [Oryza sativa Japonica Group]
Length = 357
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 47/102 (46%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
F++++ + + +++IA D+ + P L D +F + G+
Sbjct: 113 FIDSFRSGVRTSSLINHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172
Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
+R L +LE G++ +++D D++W + PFPY D D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGD 214
>gi|224005136|ref|XP_002296219.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586251|gb|ACI64936.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 499
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 63/151 (41%), Gaps = 20/151 (13%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISI-SRQKHQDQVLVIAEDYATLYKVNGR 140
L Q F + TIIV V+Q L N+L + +R+ +VLV D + +
Sbjct: 307 LHQTTQF--EKETIIVMTVNQGQSHLLVNFLCAARARKLDVRRVLVFVTDEESKQLIEDL 364
Query: 141 WPGHAVLV---------PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
V V P K+G F + + +L++ LGY+V++ D
Sbjct: 365 SNDDEVGVMVFYDKWNMEELPKGGEGVKYGDSTFTSMMFAKILCVLYVSLLGYDVLFQDA 424
Query: 192 DMVWLKDPFPY--------LQGDHDVYFTDD 214
D+VW +DP P+ L ++D+ F D
Sbjct: 425 DIVWYQDPLPFFEMKDNSSLNQNYDIIFQHD 455
>gi|428168198|gb|EKX37146.1| hypothetical protein GUITHDRAFT_116723 [Guillardia theta CCMP2712]
Length = 1532
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/121 (23%), Positives = 58/121 (47%), Gaps = 9/121 (7%)
Query: 87 SFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKH-QDQVLVIAEDYATLYKVNGRWPGH- 144
+F A+ ++V ++ Y+PF +WL + ++ + L+IA D ++ R H
Sbjct: 1219 NFPAQEQFLLVAFGNREYVPFFQSWLCNTAKMDGVHSRTLIIASDDDAYQQL--RRSSHN 1276
Query: 145 -----AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
+VL + +A +F S GFF T R + ++ G +++ + D +WL++P
Sbjct: 1277 ATIVRSVLKVDGEELPSAFEFQSLGFFLLTQHRNRLMERLIVAGIDMLLFEADAIWLRNP 1336
Query: 200 F 200
Sbjct: 1337 L 1337
>gi|449442485|ref|XP_004139012.1| PREDICTED: uncharacterized protein At1g28695-like [Cucumis sativus]
gi|449505304|ref|XP_004162430.1| PREDICTED: uncharacterized protein At1g28695-like [Cucumis sativus]
Length = 358
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 30/58 (51%)
Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
LV D + S+ F RR LL +L+ GYN ++ D D++WL++PF L
Sbjct: 145 LVTDGVDFGGEKLYMSEDFIKMMWRRTQFLLEVLKRGYNFIFTDTDVMWLRNPFTKLS 202
>gi|15223826|ref|NP_172911.1| putative myb DNA binding protein [Arabidopsis thaliana]
gi|7527728|gb|AAF63177.1|AC010657_13 T5E21.9 [Arabidopsis thaliana]
gi|332191067|gb|AEE29188.1| putative myb DNA binding protein [Arabidopsis thaliana]
Length = 386
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 14/138 (10%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
LS+AA+ ++ T+++ ++ + F ++ I + D ++++A D A
Sbjct: 89 LSKAAT---RDRTVVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLVIVALD-AKA 144
Query: 135 YKVNGRWPGHAV-LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
Y H LV D F ++ + RR L +LE+GYN ++ D D+
Sbjct: 145 YSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYNFVFTDADV 204
Query: 194 VWLKDPFP--YLQGDHDV 209
+W ++PFP Y+ D +
Sbjct: 205 MWFRNPFPRFYMYADFQI 222
>gi|412990871|emb|CCO18243.1| predicted protein [Bathycoccus prasinos]
Length = 711
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 59/133 (44%), Gaps = 16/133 (12%)
Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL 221
+ F + + LL +++ G+ V+ +D D W++DP + + T++MA +
Sbjct: 262 GKAFQQIGALKASFLLRLMQKGFRVLVSDCDTAWMRDPREW-------FVTNEMAKYVDM 314
Query: 222 DHSHDLPPPGKKGRTYIC------SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKAND 275
S D + + C + +++L PT+ K +K W L+ + K +D
Sbjct: 315 AVSTDCLSYKNEEKVRGCWHDQFNTGILFLNPTEKTKGFLKDWQVALET---TTHKFEHD 371
Query: 276 QPAFNWALNKTAG 288
Q FN L + AG
Sbjct: 372 QDIFNRLLREGAG 384
>gi|159469263|ref|XP_001692787.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278040|gb|EDP03806.1| predicted protein [Chlamydomonas reinhardtii]
Length = 703
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 84/196 (42%), Gaps = 19/196 (9%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
+A+NG ++V + Y F+ W+ + R ++ +D+ + + ++
Sbjct: 168 IAQNGYLMVTWANFHYFDFVKTWVKHVQRVGVTGYIVGAMDDHLLREMIKLEYNCFSM-- 225
Query: 149 PPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHI-LELGYNVMYNDVDMVWLKDPFPYLQG- 205
T FG F R L+ I L+L +V+ DVD++WL++P PY +
Sbjct: 226 ---KSGLTLGDFGWGSATFAKMGREKIRLISIFLKLDVHVVIADVDVLWLRNPLPYFERY 282
Query: 206 -DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
+ D+ + D A D S + P G M++ + + ++KWIE ++A
Sbjct: 283 PEADILTSSDNMANTVNDESLEKWP--DAGAAANIGIMLFRKKSLD---FVEKWIEIIEA 337
Query: 265 EPWSKAKKANDQPAFN 280
+ K DQ AFN
Sbjct: 338 D-----DKVWDQNAFN 348
>gi|125558021|gb|EAZ03557.1| hypothetical protein OsI_25693 [Oryza sativa Indica Group]
Length = 261
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 11/80 (13%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELG++ ++ DVD++W + PFP+L D V + D P PG Y
Sbjct: 90 ILELGFSFLFTDVDILWFRSPFPHLSPDAQVVMSSDFFVGDP-------TSPG----NYP 138
Query: 239 CSCMIYLRPTDGAKLVMKKW 258
++Y+R + + W
Sbjct: 139 NGGLLYVRSSASTVRFYEHW 158
>gi|356552128|ref|XP_003544422.1| PREDICTED: uncharacterized protein LOC100804935 [Glycine max]
Length = 425
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 14/117 (11%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG- 234
L L+LGY+V+ +DVD+V+L++PF YL D DV D K +D+ G
Sbjct: 232 LREFLQLGYSVLLSDVDIVYLQNPFDYLYRDSDVESMSDGHNNKTAYGYNDVFDEPSMGW 291
Query: 235 -------RTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT + ++ + L +P K+ DQ FN L
Sbjct: 292 ARYAHTMRIWVYNSGFFYIRPTLPSIELLDRVATRLSNDP-----KSWDQAVFNEEL 343
>gi|356561841|ref|XP_003549185.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 527
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 67/161 (41%), Gaps = 19/161 (11%)
Query: 91 KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGR--- 140
K+ T+I+ ++ + FL ++ + +K + ++VI D K + R
Sbjct: 85 KDKTVIITTLNDAWAKPGSIFDLFLESFRLGNQTKKFLNHLVVITLDQ----KAHARCLA 140
Query: 141 WPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
H + D+ T F + + RR L +L++GYN ++ D D++WL+DP
Sbjct: 141 LHKHCYQLETKGDNFTGEAFFMTADYLQMMWRRIEFLGTVLDMGYNFVFTDTDVMWLRDP 200
Query: 200 FPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
F D D D +SHDL G Y+ S
Sbjct: 201 FKLFYKDVDFQIACDFFN----GNSHDLNNFPNGGFNYVKS 237
>gi|307108280|gb|EFN56520.1| hypothetical protein CHLNCDRAFT_144132 [Chlorella variabilis]
Length = 690
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 4/125 (3%)
Query: 86 ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNG-RWPGH 144
AS GT++V ++ F NWL + + D +V A D AT ++ P
Sbjct: 103 ASRAGTGGTVMVAVMNSAQWDFGLNWLHHV-KAAGIDFYVVAAADQATSERLAAASEPCF 161
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
AP A +G +G+ T + L +++ G+N++ +DVD+VW +DP +
Sbjct: 162 EWFDDEAPKLGLA--WGQEGWRRMTWSKVFVLAAVVDYGFNLVVSDVDVVWFRDPRTLMA 219
Query: 205 GDHDV 209
DV
Sbjct: 220 AHPDV 224
>gi|255638110|gb|ACU19369.1| unknown [Glycine max]
Length = 425
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 14/117 (11%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG- 234
L L+LGY+V+ +DVD+V+L++PF YL D DV D K +D+ G
Sbjct: 232 LREFLQLGYSVLLSDVDIVYLQNPFDYLYRDSDVESMSDGHNNKTAYGYNDVFDEPSMGW 291
Query: 235 -------RTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT + ++ + L +P K+ DQ FN L
Sbjct: 292 ARYAHTMRIWVYNSGFFYIRPTLPSIELLDRVATRLSNDP-----KSWDQAVFNEEL 343
>gi|443714935|gb|ELU07133.1| hypothetical protein CAPTEDRAFT_189031 [Capitella teleta]
Length = 568
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 31/52 (59%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
+S T FGS F + R +LH L+ GYNV+++D D+ + +PFP ++
Sbjct: 213 NSGTGASFGSTAFKEKMNIRTFMVLHALKEGYNVLHSDCDVYYFANPFPVIK 264
>gi|384253564|gb|EIE27038.1| hypothetical protein COCSUDRAFT_11236, partial [Coccomyxa
subellipsoidea C-169]
Length = 274
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 43/195 (22%), Positives = 81/195 (41%), Gaps = 23/195 (11%)
Query: 89 VAKNGTIIVCAVSQPYLP---FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
VAK+ ++V + Y L+ W+ + R + ++V +D + + + +
Sbjct: 2 VAKDNEVMVAISNANYAQPGGMLDLWMAGVRRANVTNALIVALDDATQQHAESLGFTAYQ 61
Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
+ + S A F RP +L+LGY V+ +DVD++ L++PF +L
Sbjct: 62 MSLQKGVGSNHA-----VSGLKFRVLRP-----MLDLGYAVLLSDVDIITLQNPFQFLHR 111
Query: 206 DHDVYFTDDMAAVKPLDHSHDLPPPGKKG--------RTYIC-SCMIYLRPTDGAKLVMK 256
D DV D +D+ G R ++ S + Y+RPT + ++
Sbjct: 112 DSDVEGMSDGWDNATAYGYNDVRDDAAMGWARYAHSMRVFVMNSGLFYMRPTRASVDLLD 171
Query: 257 KWIEELQAEP-WSKA 270
+ + L E W +A
Sbjct: 172 RIVHRLNTEDGWDQA 186
>gi|125599894|gb|EAZ39470.1| hypothetical protein OsJ_23899 [Oryza sativa Japonica Group]
Length = 261
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 11/80 (13%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
ILELG++ ++ DVD++W + PFP+L D V + D P PG Y
Sbjct: 90 ILELGFSFLFTDVDILWFRSPFPHLSPDAQVVMSSDFFVGDP-------TSPG----NYP 138
Query: 239 CSCMIYLRPTDGAKLVMKKW 258
++Y+R + + W
Sbjct: 139 NGGLLYVRSSASTVRFYEHW 158
>gi|255084147|ref|XP_002508648.1| hypothetical protein MICPUN_66770 [Micromonas sp. RCC299]
gi|226523925|gb|ACO69906.1| hypothetical protein MICPUN_66770 [Micromonas sp. RCC299]
Length = 424
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 56/110 (50%), Gaps = 19/110 (17%)
Query: 183 GYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDM-----AAVKPLDHSHDLPPPGKKGR 235
G +V+ +D+D+ WL+DP P+ + D+ + D+ A PL H + G +
Sbjct: 106 GISVLISDIDVAWLRDPTPFFKRYPSADILVSTDLLRSEIALDPPLQTPHLVDGEGLE-- 163
Query: 236 TYICSC-----MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
++C +++ RPT G++ + ++W+ ++A+ K DQ AFN
Sbjct: 164 FHVCHAASNIGIMWFRPTRGSQQLTEEWVRRIEAD-----DKLWDQNAFN 208
>gi|302849316|ref|XP_002956188.1| hypothetical protein VOLCADRAFT_107128 [Volvox carteri f.
nagariensis]
gi|300258491|gb|EFJ42727.1| hypothetical protein VOLCADRAFT_107128 [Volvox carteri f.
nagariensis]
Length = 324
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 15/114 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY-FTD--DMAAVKPLDHSHDLPPPG- 231
+ L LGY+V+ +DVD+V L++PF +L D DV +D D D HD P G
Sbjct: 131 IREFLSLGYSVLLSDVDIVTLQNPFDHLYRDEDVEALSDGYDEQTAYGWDDVHDDPKMGW 190
Query: 232 ----KKGRTY-ICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
R + + S + Y+RP + +M + + L +K DQ FN
Sbjct: 191 SRWAHTIRVFTLNSGLFYIRPNERTIGLMDRITDRLN------KRKEWDQAVFN 238
>gi|357121335|ref|XP_003562376.1| PREDICTED: uncharacterized protein LOC100833150 [Brachypodium
distachyon]
Length = 425
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
L L+LGY+++ +D+D+++L++PF +L D DV D M A + D P G
Sbjct: 232 LREFLQLGYSILLSDIDIIFLRNPFDHLYRDSDVESMSDGHNNMTAYG-FNDVFDEPSMG 290
Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT A ++ + L EP K+ DQ FN L
Sbjct: 291 WARYAHTMRIWVYNSGFFYIRPTIPAIELLDRVAGRLSREP-----KSWDQAVFNEEL 343
>gi|115447979|ref|NP_001047769.1| Os02g0686300 [Oryza sativa Japonica Group]
gi|113537300|dbj|BAF09683.1| Os02g0686300 [Oryza sativa Japonica Group]
Length = 393
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 4/88 (4%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D +F + G+ + +R L +LE GY+ +++D D+ W ++PFP+ D D
Sbjct: 190 DFSQEKRFLTAGYLDMMWKRLDFLRLVLEKGYSFIFSDADITWFRNPFPHFYPDGDF--- 246
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICS 240
+A + ++ DL G Y+ S
Sbjct: 247 -QIACDHYVGNATDLGNIANGGFNYVRS 273
>gi|300123542|emb|CBK24814.2| unnamed protein product [Blastocystis hominis]
Length = 243
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 62/132 (46%), Gaps = 4/132 (3%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA +I+ +LP ++ S H + A D T YK W +VL+
Sbjct: 62 VAIKKNVIISVFDFAFLPQFYSFYHSSLLPLHITNFIAFAMDKRT-YKTLQDWGIPSVLL 120
Query: 149 P--PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL-QG 205
+ + ++ +GS F + + +LH+L+ YN + +DVD+V+ K+PF YL G
Sbjct: 121 ELDISISTSSSSDYGSYAFATKANMKTLAVLHVLQCNYNPLLSDVDVVYFKNPFEYLGTG 180
Query: 206 DHDVYFTDDMAA 217
+ D+ + +A
Sbjct: 181 NWDIQIQREESA 192
>gi|357126542|ref|XP_003564946.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 365
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 6/41 (14%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFP------YLQGDHDVYFTD 213
ILELGYN ++ DVD++W +DPF ++ D YF D
Sbjct: 194 ILELGYNFLFTDVDIMWFRDPFERMSVAAHMVTSSDFYFGD 234
>gi|224067389|ref|XP_002302478.1| predicted protein [Populus trichocarpa]
gi|222844204|gb|EEE81751.1| predicted protein [Populus trichocarpa]
Length = 429
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 91/212 (42%), Gaps = 20/212 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ + VA + +IV + L W +I + ++ ++V +D+ + +
Sbjct: 145 LAKLLAEVAVHKELIVALANSNVKAMLEVWFTNIKKAGIRNYLVVALDDHIVDFCKSNDV 204
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
P + PDS + G + + L L+LGY+V+ +D+D+++L++PF
Sbjct: 205 P----VYKRDPDSGIDSVARTGGNHAVSGLKFHILREFLQLGYSVLLSDIDIIYLQNPFD 260
Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
+L D DV D+M A + D P G R ++ S Y+RPT +
Sbjct: 261 HLYRDSDVESMSDGHDNMTAYG-FNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPS 319
Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
++ + L EP S DQ FN L
Sbjct: 320 IELLDRVAGRLSREPNSW-----DQAVFNEEL 346
>gi|186498694|ref|NP_001118253.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
gi|330250447|gb|AEC05541.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
Length = 408
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 46/213 (21%), Positives = 84/213 (39%), Gaps = 37/213 (17%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
L +AA+ K+GT+I+ +++ + F ++ I ++ +++IA D A
Sbjct: 112 LRRAAT---KDGTVILTTLNEAWAAPGSVIDLFFESFRIGKGTRRLLKHLVIIALD-AKA 167
Query: 135 YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS------RRPCHLLHILELGYNVMY 188
Y H + +++ G + +F S RR L +LE GYN ++
Sbjct: 168 YSRCQELHKHCFRL----ETEGVDFSGGEAYFMTPSYLTMMWRRISFLRSVLEKGYNFVF 223
Query: 189 NDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPT 248
D D++W ++PF D D D +P D + P G ++R
Sbjct: 224 TDADVMWFRNPFRRFYEDGDFQIACDHYIGRPNDF-RNRPNGG----------FTFVRAN 272
Query: 249 DGAKLVMKKWIEELQAEPWSKAKKANDQPAFNW 281
+ + K W + P K +DQ N+
Sbjct: 273 NRSIGFYKFWYDSRTKYP-----KNHDQDVLNF 300
>gi|222625288|gb|EEE59420.1| hypothetical protein OsJ_11574 [Oryza sativa Japonica Group]
Length = 683
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 10/131 (7%)
Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVK 219
+GS F + + +L GY ++ D DMVWLK+P PYL D V
Sbjct: 105 WGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDADLLTSSDQVI 164
Query: 220 PLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP--WSKAKKANDQP 277
P L + Y + + RPT+ AK + K W + L ++ W DQ
Sbjct: 165 PTVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSDDTLW-------DQN 216
Query: 278 AFNWALNKTAG 288
AFN ++K G
Sbjct: 217 AFNDLIHKKFG 227
>gi|194696198|gb|ACF82183.1| unknown [Zea mays]
Length = 426
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
L L+LGY+++ +D+D+++L++PF +L D DV D M A + D P G
Sbjct: 233 LREFLQLGYSILLSDIDIIFLRNPFDHLHRDSDVESMSDGHNNMTAYG-FNDVFDEPSMG 291
Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT + ++ + L EP K+ DQ FN L
Sbjct: 292 WARYAHTMRIWVYNSGFFYIRPTVPSIELLDRVAGRLSREP-----KSWDQAVFNEEL 344
>gi|414867357|tpg|DAA45914.1| TPA: hypothetical protein ZEAMMB73_843284 [Zea mays]
Length = 426
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
L L+LGY+++ +D+D+++L++PF +L D DV D M A + D P G
Sbjct: 233 LREFLQLGYSILLSDIDIIFLRNPFDHLHRDSDVESMSDGHNNMTAYG-FNDVFDEPSMG 291
Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT + ++ + L EP K+ DQ FN L
Sbjct: 292 WARYAHTMRIWVYNSGFFYIRPTVPSIELLDRVAGRLSREP-----KSWDQAVFNEEL 344
>gi|356538805|ref|XP_003537891.1| PREDICTED: uncharacterized protein LOC100802215 [Glycine max]
Length = 437
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 59/133 (44%), Gaps = 4/133 (3%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ VA +IVC + L W +I R + ++ +D + + +
Sbjct: 155 LAKILEKVAVKQELIVCLANTNVKEMLEVWFTNIKRVGITNYLVAALDDETAKFCESNQV 214
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
P + P T + GS + R L L+LGY+V+ +DVD+V+L++PF
Sbjct: 215 PVYKR--DPDDGVDTIGRTGSNHAVSGLKFRI--LREFLQLGYSVLLSDVDIVYLQNPFD 270
Query: 202 YLQGDHDVYFTDD 214
+L D DV D
Sbjct: 271 HLYRDSDVESMSD 283
>gi|412993241|emb|CCO16774.1| predicted protein [Bathycoccus prasinos]
Length = 997
Score = 42.4 bits (98), Expect = 0.29, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 8/62 (12%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D++ +HK +Q F +++G +V+ D D+V+L++PFPYL DHD+
Sbjct: 279 DARGSHKISAQKFGIIKD--------FVKVGCSVLLTDTDVVYLQNPFPYLYRDHDIESM 330
Query: 213 DD 214
D
Sbjct: 331 SD 332
>gi|222619772|gb|EEE55904.1| hypothetical protein OsJ_04572 [Oryza sativa Japonica Group]
Length = 355
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 17/84 (20%)
Query: 178 HILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM---AAVKPLDHSHDLPPPGKKG 234
ILELGYN ++ DVD++W +DPF + + + D A P + P G
Sbjct: 183 RILELGYNFLFTDVDILWFRDPFEQMSMAAHMVTSSDFFVGGAYNPA----NFPNTG--- 235
Query: 235 RTYICSCMIYLRPTDGAKLVMKKW 258
+Y+R + A VM+ W
Sbjct: 236 -------FLYVRSSRRAVGVMEAW 252
>gi|359491833|ref|XP_002270793.2| PREDICTED: uncharacterized protein At4g15970-like [Vitis vinifera]
Length = 346
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 83/215 (38%), Gaps = 21/215 (9%)
Query: 41 TTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSL------AKWRD-YTLSQAASFVA-KN 92
TTLL + + V + P F F N+ S AK ++ Y L + A +
Sbjct: 15 TTLLRRIVKITVFTA--AFVVVPCFFFYNSAFSFQFLPISAKGKEGYKLDKILKNAAMGD 72
Query: 93 GTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
T+I+ V++ + FL ++ I + Q+ + +++I D + P
Sbjct: 73 KTVILTTVNEAWAANNSLLDLFLESFRIGNNTQRLLNHLVIITLDPKAYARCTTLHPHCY 132
Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
L D F S + RR L +L++ YN ++ D D++W +DPF
Sbjct: 133 ALKTKEMDFSKEAFFMSHDYLEMMWRRIDFLRSVLKMRYNFIFTDADIMWFRDPFQRFDS 192
Query: 206 DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
D D +S D+ G TY+ S
Sbjct: 193 KADFQIACDYFN----GNSSDVNNSPNGGFTYVKS 223
>gi|242062902|ref|XP_002452740.1| hypothetical protein SORBIDRAFT_04g031550 [Sorghum bicolor]
gi|241932571|gb|EES05716.1| hypothetical protein SORBIDRAFT_04g031550 [Sorghum bicolor]
Length = 387
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 31/56 (55%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
D +F + G+ +R L +LE GY+ +++D D++W ++PFP+ D D
Sbjct: 187 DFSQEKRFQTTGYLEMMWKRLDFLRLVLEKGYSFVFSDADIMWFRNPFPHFYTDGD 242
>gi|168062158|ref|XP_001783049.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665433|gb|EDQ52118.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 420
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 87/206 (42%), Gaps = 24/206 (11%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
+A N +IV + L W SI R + ++V +D + P V
Sbjct: 145 IAINEELIVSISNNNVRDMLQIWFESIQRVGVTNYLVVALDDEIASFCNEHNVP---VYR 201
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
A S++ GS + S HLL L LGY+V+ +DVD+V+L++PF +L D
Sbjct: 202 RDATISKSQAGTGSN---HAISGLKFHLLREFLVLGYSVLLSDVDIVFLQNPFNHLHRDC 258
Query: 208 DVYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKK 257
DV D + A D S D P G R ++ S + Y+RPT + ++ +
Sbjct: 259 DVESMSDGFNNITAYGYDDVSED-PTMGWSRFAHTMRIWVFNSGLFYIRPTVPSIELLDR 317
Query: 258 WIEELQAEPWSKAKKANDQPAFNWAL 283
L E KA DQ FN L
Sbjct: 318 VTARLSKE------KAWDQAVFNEEL 337
>gi|222623466|gb|EEE57598.1| hypothetical protein OsJ_07965 [Oryza sativa Japonica Group]
Length = 370
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 4/88 (4%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
D +F + G+ + +R L +LE GY+ +++D D+ W ++PFP+ D D
Sbjct: 190 DFSQEKRFLTAGYLDMMWKRLDFLRLVLEKGYSFIFSDADITWFRNPFPHFYPDGDF--- 246
Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICS 240
+A + ++ DL G Y+ S
Sbjct: 247 -QIACDHYVGNATDLGNIANGGFNYVRS 273
>gi|145345111|ref|XP_001417066.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577292|gb|ABO95359.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 486
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 45/208 (21%), Positives = 92/208 (44%), Gaps = 26/208 (12%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED--YATLYKVN-GRWPGHA 145
VA +G +IV + Y F N+L + + + ++ ++ YA L K+ W
Sbjct: 86 VAVDGAVIVTWANMHYYDFALNFLSHLDALEVTNYLIGAMDEELYAALRKIGVNTW---- 141
Query: 146 VLVPPAPDSQTAHK---FGSQGFFNFTSRRPCHLLH-ILELGYNVMYNDVDMVWLKDPFP 201
++ + D+ K +GS+ F R L+H + G +V+ +D+D+ WL++P P
Sbjct: 142 LMGSKSIDADAVKKDFGWGSKNFHKM-GRDKIRLIHDFTKTGVDVLISDIDVAWLRNPIP 200
Query: 202 YLQG--DHDVYFTDDMAAVKPLDHSHDLPP--PGKKGRTYICS-----CMIYLRPTDGAK 252
+ + D+ + D + + + G+ + C+ M++ R T+ ++
Sbjct: 201 FFRRYPKADILVSTDNLQNRTHQDARQISHMVDGEGLESTPCAGTVNIGMMWFRATEASQ 260
Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFN 280
+ +W+ L+ K +K DQ FN
Sbjct: 261 QLTGEWVRNLE-----KDEKIWDQAEFN 283
>gi|145355476|ref|XP_001421987.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582226|gb|ABP00281.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 591
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 52/226 (23%), Positives = 92/226 (40%), Gaps = 36/226 (15%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQK------HQDQVLVIAEDYAT--------- 133
+A++G + V VS + F NWL + + K + ++++A D AT
Sbjct: 2 LARDGLVFVSYVSDGFHEFALNWLKLLRKAKGAQPNEKDENIVMLALDEATERFCERHSM 61
Query: 134 --LYKVNGRWPGHAVLVPPAP--DSQTAHKFGSQG-----FFNFTSRRPCHLLHILELGY 184
N R+ G + D+ A + S T+ R LL +L+ G+
Sbjct: 62 PCFGGANYRYKGGVMATGGTALGDASGARQAASVAEAAKAMREMTTLRVKLLLDLLDRGH 121
Query: 185 NVMYNDVDMVWLKDPFPYLQ---GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
+V+ +D D+ WL+DP +++ D DV + D + D P G
Sbjct: 122 DVLVSDADVAWLRDPREWMREAMTDVDVAASTDCLNARDDDEGKCWGAPTNTG------- 174
Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA 287
++Y T+ AK + W++ + E ++ DQ FN L K +
Sbjct: 175 ILYFNATEPAKKFIADWVDGM--EKATEDTTERDQEIFNKLLIKRS 218
>gi|413951496|gb|AFW84145.1| regulatory protein [Zea mays]
Length = 383
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 55/120 (45%), Gaps = 12/120 (10%)
Query: 94 TIIVCAVSQPYLPFLNNWLISISRQKHQ---------DQVLVIAEDYATLYKVNGRWPG- 143
T+I+ +V++ + N L+ + R+ + D VLV+A D P
Sbjct: 107 TVILTSVNEAFAR--PNSLLGLFRESFRAGEGTEHLLDHVLVVAVDAMAFVHCKAVHPHC 164
Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
+ + V A + F S + + +LELGYN ++ DVD+VWL++PF ++
Sbjct: 165 YRLEVDSATYLSSESSFLSAAYVELVWAKLSLQQRVLELGYNFLFTDVDVVWLRNPFRHI 224
>gi|219130309|ref|XP_002185310.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403225|gb|EEC43179.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 594
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 12/45 (26%)
Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGD------------HDVYFTDD 214
LGY++++ DVD+VWL++P Y D +DVYF DD
Sbjct: 363 LGYDLLFQDVDIVWLRNPLEYFHNDTSSANDEVSPDYYDVYFQDD 407
>gi|168017116|ref|XP_001761094.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687780|gb|EDQ74161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 526
Score = 42.0 bits (97), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 92/229 (40%), Gaps = 18/229 (7%)
Query: 83 SQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
+ F AK I V + ++ F+ NW+ ++ + + +LV A D L + W
Sbjct: 11 KEMIEFRAKKNVIAVTFANFAFMDFVLNWVRHLTDVEVTN-ILVGAMDTKILEAL--FWK 67
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
G V + +G+ F + + L GY ++ D D+ DPFPY
Sbjct: 68 GVPVFDMRSGMETVDVGWGTPKFHKMGREKVILINAFLAEGYEILMCDTDV----DPFPY 123
Query: 203 LQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
+ D D+ + D V +D L G+ Y + + RPT AK + K+W++
Sbjct: 124 FERFPDADILTSSD-EVVNSVDDDR-LEFYGQSWGAYNIG-IFFWRPTTIAKELAKEWLQ 180
Query: 261 ELQAEPWSKAKKANDQPAFN-WALNKTAGQVCSVCLCCHCQNEIMMLVL 308
L ++ K DQ FN A N T V L + + M +L
Sbjct: 181 LLLSD-----DKIWDQNGFNDLARNVTGPSVGHDSLFWAYRGNLKMGIL 224
>gi|15222124|ref|NP_177648.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
gi|12323904|gb|AAG51930.1|AC013258_24 unknown protein; 4711-6345 [Arabidopsis thaliana]
gi|111074332|gb|ABH04539.1| At1g75120 [Arabidopsis thaliana]
gi|332197553|gb|AEE35674.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
Length = 402
Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 70/170 (41%), Gaps = 15/170 (8%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLY-KVNGRWPGHAVL 147
VA N IIV + P L + S+ R Q+ ++V +D + K N
Sbjct: 128 VAVNKEIIVVLANNNVKPMLEVQIASVKRVGIQNYLVVPLDDSLESFCKSN-----EVAY 182
Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
PD+ S+ + + + L L+LGY V+ +DVD+V+L++PF +L D
Sbjct: 183 YKRDPDNAIDVVGKSRRSSDVSGLKFRVLREFLQLGYGVLLSDVDIVFLQNPFGHLYRDS 242
Query: 208 DVYFTDDMAAVKPLDHSHDL--PPPGKKGRTYIC-------SCMIYLRPT 248
DV D +D+ P + RT S YLRPT
Sbjct: 243 DVESMSDGHDNNTAYGFNDVFDDPTMTRSRTVYTNRIWVFNSGFFYLRPT 292
>gi|407408299|gb|EKF31795.1| hypothetical protein MOQ_004365 [Trypanosoma cruzi marinkellei]
Length = 710
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 56/123 (45%), Gaps = 12/123 (9%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISIS-RQKHQDQVLVIAEDYATLYKVNGRWP--GHAV 146
+NG I V + + L+N++ S + R K ++ ++ +D A ++ R P +
Sbjct: 395 GENGFITVVIFNSFWRDHLHNFVYSFAKRAKMRNLIVASVDDTALSLCLSFRLPCLNATL 454
Query: 147 LVPPAPDSQTAHKFGSQGFF------NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
V P + A S+G F F+ +P + IL GY M D+D+ W + P
Sbjct: 455 FVEPEKEGDNA---SSKGGFTRKVTEEFSWVKPRLAIAILRRGYTFMLADLDITWNRSPM 511
Query: 201 PYL 203
PYL
Sbjct: 512 PYL 514
>gi|218191377|gb|EEC73804.1| hypothetical protein OsI_08508 [Oryza sativa Indica Group]
Length = 496
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 31/56 (55%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
D +F + G+ + +R L +LE GY+ +++D D+ W ++PFP+ D D
Sbjct: 190 DFSQEKRFLTAGYLDMMWKRLDFLRLVLEKGYSFIFSDADITWFRNPFPHFYPDGD 245
>gi|303287152|ref|XP_003062865.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
gi|226455501|gb|EEH52804.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
Length = 525
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 60/124 (48%), Gaps = 14/124 (11%)
Query: 160 FGSQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDMA 216
+G++ F R+ L+ + +G++++ +DVD VW+K+P PY+ D DV + D
Sbjct: 62 WGTENFHKM-GRKKIDLIKVFTNMGFDILVSDVDTVWMKNPMPYVMKYPDADVLTSSDHL 120
Query: 217 AVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQ 276
A + P + ++ ++ +R T AK + ++W+ L K K DQ
Sbjct: 121 ASTATGDGLEDP---LRAQSAANIGIMLIRHT--AKELAEEWVNVLD-----KDAKVWDQ 170
Query: 277 PAFN 280
AFN
Sbjct: 171 NAFN 174
>gi|226497410|ref|NP_001143689.1| uncharacterized protein LOC100276419 [Zea mays]
gi|195624720|gb|ACG34190.1| hypothetical protein [Zea mays]
Length = 423
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
L L+LGY+++ +D+D+++L++PF +L D DV D M A + D P G
Sbjct: 230 LREFLQLGYSILLSDIDIIFLRNPFDHLYRDSDVESMSDGHNNMTAYG-FNDVFDEPSMG 288
Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT + ++ + L EP K+ DQ FN L
Sbjct: 289 WARYAHTMRIWVYNSGFFYIRPTVPSIELLDRVAGRLSREP-----KSWDQAVFNEEL 341
>gi|226496894|ref|NP_001150786.1| regulatory protein [Zea mays]
gi|195641824|gb|ACG40380.1| regulatory protein [Zea mays]
Length = 332
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 27/37 (72%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM 215
+LELGYN ++ DVD+VWL++PF ++ D+ + D+
Sbjct: 200 VLELGYNFLFTDVDVVWLRNPFRHISVYPDMTTSSDI 236
>gi|42562383|ref|NP_174191.2| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
gi|10764853|gb|AAG22833.1|AC007508_6 F1K23.10 [Arabidopsis thaliana]
gi|60547599|gb|AAX23763.1| hypothetical protein At1g28700 [Arabidopsis thaliana]
gi|71905439|gb|AAZ52697.1| expressed protein [Arabidopsis thaliana]
gi|332192895|gb|AEE31016.1| nucleotide-diphospho-sugar transferase domain-containing protein
[Arabidopsis thaliana]
Length = 338
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 30/142 (21%), Positives = 59/142 (41%), Gaps = 11/142 (7%)
Query: 89 VAKNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAEDYATLYKV 137
N T+I+ V++ Y+ FL ++ + + ++V+A D +
Sbjct: 64 TGNNKTVIIAMVNKAYVEEDGGGRTMLDLFLESFWEGEGTRPLLNHLMVVAADQTAYDRC 123
Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
R + D + + S+ F RR LL +L GY++++ D D++WL+
Sbjct: 124 LFRRLHCYKMDTEGVDLEGEKVYMSKDFIEMMWRRTRLLLDVLSRGYHIIFTDTDVMWLR 183
Query: 198 DPFPYLQGDHDVYFTDDMAAVK 219
P L D++ + D V+
Sbjct: 184 SPLSRLNVSLDMHISVDRNNVR 205
>gi|224134266|ref|XP_002321777.1| predicted protein [Populus trichocarpa]
gi|222868773|gb|EEF05904.1| predicted protein [Populus trichocarpa]
Length = 284
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 30/137 (21%), Positives = 61/137 (44%), Gaps = 12/137 (8%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYA 132
L++A++ +N T+I+ V++ Y+ FLN + + + + +L++ D A
Sbjct: 7 LAEAST---ENRTVIIAMVNKAYVEGDDKSMLDLFLNGFWFGENTRDLVNHLLLVNVDQA 63
Query: 133 TLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
+ + L + + S F RR L +L GYN ++ D D
Sbjct: 64 SYERCKFLRLHCYKLETDGVEFDREEVYMSNEFIKMMWRRTFFLGEVLVRGYNFIFTDTD 123
Query: 193 MVWLKDPFPYLQGDHDV 209
++WL++PF L + ++
Sbjct: 124 VLWLRNPFQRLSFNENI 140
>gi|356560053|ref|XP_003548310.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
Length = 379
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 43/98 (43%), Gaps = 5/98 (5%)
Query: 144 HAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
H V D+ T F + + + RR L +L++GYN ++ D D++WL+DPF
Sbjct: 148 HCYQVETKGDNFTGEAFFMTADYLHMMWRRIEFLGTVLDMGYNFVFTDTDIMWLRDPFKL 207
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
D D D +++DL G Y+ S
Sbjct: 208 FYKDADFQIACDFFN----GNTYDLNNSPNGGFNYVKS 241
>gi|118484315|gb|ABK94035.1| unknown [Populus trichocarpa]
Length = 435
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 161 GSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
G G + S H+L L+LGY+V+ +DVD+V+L++PF YL D DV D
Sbjct: 227 GKTGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPFHYLYRDSDVESMSD 281
>gi|224103449|ref|XP_002313061.1| predicted protein [Populus trichocarpa]
gi|222849469|gb|EEE87016.1| predicted protein [Populus trichocarpa]
Length = 435
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 161 GSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
G G + S H+L L+LGY+V+ +DVD+V+L++PF YL D DV D
Sbjct: 227 GKTGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPFHYLYRDSDVESMSD 281
>gi|255538576|ref|XP_002510353.1| conserved hypothetical protein [Ricinus communis]
gi|223551054|gb|EEF52540.1| conserved hypothetical protein [Ricinus communis]
Length = 430
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 14/117 (11%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY-FTDDMAAVKPLDHSHDLPPPGKKG 234
L L+LGY+V+ +DVD+V+L++PF +L D DV TD + ++ P
Sbjct: 236 LREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMTDGHNNMTAYGYNDVFDEPAMGW 295
Query: 235 RTYICSCMI--------YLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
Y + I Y+RPT + ++ + + L +P S DQ FN L
Sbjct: 296 ARYAHTMRIWVYNSGFFYIRPTIPSIELLDRVADRLSRQPNSW-----DQAVFNEEL 347
>gi|115456599|ref|NP_001051900.1| Os03g0849800 [Oryza sativa Japonica Group]
gi|113550371|dbj|BAF13814.1| Os03g0849800, partial [Oryza sativa Japonica Group]
Length = 247
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 29/52 (55%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
D ++ F S+ + R ILELGYN ++ DVD++W +DPF ++
Sbjct: 194 DYRSEKMFMSKDYLEMMWGRNKFQQTILELGYNFLFTDVDVMWFRDPFRHIS 245
>gi|226529638|ref|NP_001142135.1| hypothetical protein [Zea mays]
gi|194689294|gb|ACF78731.1| unknown [Zea mays]
gi|194707310|gb|ACF87739.1| unknown [Zea mays]
gi|414873942|tpg|DAA52499.1| TPA: hypothetical protein ZEAMMB73_904107 [Zea mays]
Length = 374
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 43/196 (21%), Positives = 77/196 (39%), Gaps = 26/196 (13%)
Query: 94 TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
T+I+ V+Q + FL ++ + +L++A D A + + P H
Sbjct: 112 TVIITCVNQAWAAPGSLLDLFLESFRVGDGTAPLLRHLLIVAMDAAAMARCRALHP-HCY 170
Query: 147 LVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
L PA D A F S+ + + +L LGY++++ D D++WL++P ++
Sbjct: 171 LYSPARGVDFAPAKPFLSRDYLELVWSKLRLQRRVLRLGYSLLFTDADVLWLRNPLKHVT 230
Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
D+ + D+ P D + P G ++RP + W
Sbjct: 231 AYADMTVSCDVFFGDP-DGVDNFPNTG----------FFHVRPNSRTIAMAAAWHRARDR 279
Query: 265 EPWSKAKKANDQPAFN 280
P N+QP FN
Sbjct: 280 FP-----GKNEQPVFN 290
>gi|4406788|gb|AAD20098.1| unknown protein [Arabidopsis thaliana]
gi|20197486|gb|AAM15094.1| unknown protein [Arabidopsis thaliana]
Length = 626
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 46/213 (21%), Positives = 84/213 (39%), Gaps = 37/213 (17%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
L +AA+ K+GT+I+ +++ + F ++ I ++ +++IA D A
Sbjct: 330 LRRAAT---KDGTVILTTLNEAWAAPGSVIDLFFESFRIGKGTRRLLKHLVIIALD-AKA 385
Query: 135 YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS------RRPCHLLHILELGYNVMY 188
Y H + +++ G + +F S RR L +LE GYN ++
Sbjct: 386 YSRCQELHKHCFRL----ETEGVDFSGGEAYFMTPSYLTMMWRRISFLRSVLEKGYNFVF 441
Query: 189 NDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPT 248
D D++W ++PF D D D +P D + P G ++R
Sbjct: 442 TDADVMWFRNPFRRFYEDGDFQIACDHYIGRPNDF-RNRPNGG----------FTFVRAN 490
Query: 249 DGAKLVMKKWIEELQAEPWSKAKKANDQPAFNW 281
+ + K W + P K +DQ N+
Sbjct: 491 NRSIGFYKFWYDSRTKYP-----KNHDQDVLNF 518
>gi|303276088|ref|XP_003057338.1| hypothetical protein MICPUCDRAFT_62376 [Micromonas pusilla
CCMP1545]
gi|226461690|gb|EEH58983.1| hypothetical protein MICPUCDRAFT_62376 [Micromonas pusilla
CCMP1545]
Length = 727
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 4/120 (3%)
Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
G + F S + + +LEL + D D+ WL+DP PY D + +
Sbjct: 185 GDEHFKKMGSVKTKFIQDLLELNIAPILTDADVTWLRDPRPYFN-KGTYALADALVSTDC 243
Query: 221 LDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
+D D + +++ RP++ +K ++ W ++ + S DQPAFN
Sbjct: 244 IDIPGDARDENACAHVNFNTGVLHFRPSNASKAFVETWKNKVAS---STIAWMRDQPAFN 300
>gi|29824405|gb|AAP04162.1| unknown protein [Arabidopsis thaliana]
gi|110737123|dbj|BAF00513.1| hypothetical protein [Arabidopsis thaliana]
Length = 402
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 70/170 (41%), Gaps = 15/170 (8%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV-IAEDYATLYKVNGRWPGHAVL 147
VA N IIV + P L + S+ R Q+ ++V + + + K N
Sbjct: 128 VAVNKEIIVVLANNNVKPMLEVQIASVKRVGIQNYLVVPLYDSLESFCKSN-----EVAY 182
Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
PD+ S+ + + + L L+LGY V+ +DVD+V+L++PF +L D
Sbjct: 183 YKRDPDNAIDVVGKSRRSSDVSGLKFRVLREFLQLGYGVLLSDVDIVFLQNPFGHLYRDS 242
Query: 208 DVYFTDDMAAVKPLDHSHDL--PPPGKKGRTYIC-------SCMIYLRPT 248
DV D +D+ P + RT S YLRPT
Sbjct: 243 DVESMSDGHDNNTAYGFNDVFDDPTMTRSRTVYTNRIWVFNSGFFYLRPT 292
>gi|242035367|ref|XP_002465078.1| hypothetical protein SORBIDRAFT_01g031700 [Sorghum bicolor]
gi|241918932|gb|EER92076.1| hypothetical protein SORBIDRAFT_01g031700 [Sorghum bicolor]
Length = 425
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
L L+LGY+++ +D+D+++L++PF +L D DV D M A + D P G
Sbjct: 232 LREFLQLGYSILLSDIDIIFLRNPFDHLYRDSDVESMSDGHNNMTAYG-FNDVFDEPSMG 290
Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT + ++ + L EP K+ DQ FN L
Sbjct: 291 WARYAHTMRIWVHNSGFFYIRPTIPSIELLDRVAGHLSREP-----KSWDQAVFNEEL 343
>gi|312190393|gb|ADQ43193.1| unknown [Eutrema parvulum]
Length = 395
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 4/81 (4%)
Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVK 219
F ++ + RR L +LE+GYN ++ D D++W ++PFP D +A
Sbjct: 183 FMTRSYLKMMWRRIDFLRSVLEMGYNFVFTDADVMWFRNPFPRFYRYADF----QIACDH 238
Query: 220 PLDHSHDLPPPGKKGRTYICS 240
L S+DL G +++ S
Sbjct: 239 YLGRSNDLENRPNGGFSFVRS 259
>gi|297842255|ref|XP_002889009.1| hypothetical protein ARALYDRAFT_476651 [Arabidopsis lyrata subsp.
lyrata]
gi|297334850|gb|EFH65268.1| hypothetical protein ARALYDRAFT_476651 [Arabidopsis lyrata subsp.
lyrata]
Length = 399
Score = 41.2 bits (95), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 71/175 (40%), Gaps = 25/175 (14%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLY-KVN-----GRWP 142
VA N IIV + P L S+ R Q+ +++ +D + + N R P
Sbjct: 125 VAVNKEIIVVLANNNVKPMLEVQTASVKRVGIQNYLVIPLDDSIESFCRSNEVAYFKRDP 184
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+A+ V S +A + + L L+LGY V+ DVD+V+L++PF +
Sbjct: 185 DNAIDVVGKSGSGSA----------VSGLKFRVLREFLQLGYGVLLADVDLVFLQNPFGH 234
Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDL--PPPGKKGRTYIC-------SCMIYLRPT 248
L D DV D +D+ P + RT S YLRPT
Sbjct: 235 LYRDSDVESMSDGHDNNTAYGFNDVFDDPSMTRSRTVYTTRIWVFNSGFFYLRPT 289
>gi|359492122|ref|XP_003634366.1| PREDICTED: uncharacterized protein LOC100255856 isoform 2 [Vitis
vinifera]
Length = 428
Score = 41.2 bits (95), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
L L+LGY+V+ +D+D+V+L++PF YL D DV
Sbjct: 235 LREFLQLGYSVLLSDIDIVYLQNPFDYLYRDSDV 268
>gi|225458348|ref|XP_002283221.1| PREDICTED: uncharacterized protein LOC100255856 isoform 1 [Vitis
vinifera]
Length = 434
Score = 41.2 bits (95), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
L L+LGY+V+ +D+D+V+L++PF YL D DV
Sbjct: 241 LREFLQLGYSVLLSDIDIVYLQNPFDYLYRDSDV 274
>gi|302142458|emb|CBI19661.3| unnamed protein product [Vitis vinifera]
Length = 446
Score = 41.2 bits (95), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
L L+LGY+V+ +D+D+V+L++PF YL D DV
Sbjct: 241 LREFLQLGYSVLLSDIDIVYLQNPFDYLYRDSDV 274
>gi|212275955|ref|NP_001130696.1| hypothetical protein [Zea mays]
gi|194689860|gb|ACF79014.1| unknown [Zea mays]
gi|194707158|gb|ACF87663.1| unknown [Zea mays]
gi|224030421|gb|ACN34286.1| unknown [Zea mays]
gi|414887364|tpg|DAA63378.1| TPA: hypothetical protein ZEAMMB73_518373 [Zea mays]
Length = 423
Score = 41.2 bits (95), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 83/205 (40%), Gaps = 20/205 (9%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA +IV + L W +I R + ++V +D + + P +
Sbjct: 147 VAVKKELIVALANSNVKEMLEMWFTNIKRAGISNYLVVALDDSIENFCKSNDVP----VY 202
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
PD + + G + + L L+LGY+V+ +D+D+++ ++PF YL D D
Sbjct: 203 RRDPDDGIDNIGKTGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPFDYLYRDSD 262
Query: 209 VYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
V D M A + D P G R ++ S ++RPT + ++ +
Sbjct: 263 VESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 321
Query: 259 IEELQAEPWSKAKKANDQPAFNWAL 283
L EP K+ DQ FN L
Sbjct: 322 AYRLSHEP-----KSWDQAVFNEEL 341
>gi|224102433|ref|XP_002312675.1| predicted protein [Populus trichocarpa]
gi|222852495|gb|EEE90042.1| predicted protein [Populus trichocarpa]
Length = 552
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 49/113 (43%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
+N T+++ Y L +W+ + + + + ++ + + V P P
Sbjct: 284 ENKTLVLAVAGYSYKDMLMSWVCRLHQLRVTNFIICALDQETYQFSVLQGLPVFHDPSAP 343
Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
S FG+ F T + + IL+LGYNV+ +DVD+ W +P P L
Sbjct: 344 RNISFNDCHFGTTCFQRVTKVKSRMVWKILKLGYNVLLSDVDVYWFGNPLPLL 396
>gi|224114682|ref|XP_002339511.1| predicted protein [Populus trichocarpa]
gi|222832587|gb|EEE71064.1| predicted protein [Populus trichocarpa]
Length = 275
Score = 41.2 bits (95), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 27/48 (56%)
Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
S F RR L +L GYN ++ D D++WL++PFP L + ++
Sbjct: 84 SDEFIKMMWRRTLFLGQVLRRGYNFIFTDADVLWLRNPFPRLSFNKNI 131
>gi|302819510|ref|XP_002991425.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300140818|gb|EFJ07537.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 545
Score = 41.2 bits (95), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 50/216 (23%), Positives = 87/216 (40%), Gaps = 19/216 (8%)
Query: 68 PNATSSLAKWRDYTLSQA-ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV 126
P S L + LS+ F AK I+V + + F+ W+ ++ +LV
Sbjct: 14 PPHGSELPPREAFVLSKEMVEFRAKKNVIMVTFANHAFEDFVLTWVRHLT-DVGVTNLLV 72
Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
A D L ++ W G V + + +G+ F + + I+ +G+ V
Sbjct: 73 GAMDRKILEELF--WKGVPVFDMGSEMNPADVGWGTPVFHKMGREKVFLVNAIMAMGFEV 130
Query: 187 MYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
++ D DM +P PY++ D DV + D D S + ++ + + +
Sbjct: 131 LFCDTDM----NPLPYMERYPDADVLVSSDAVIATVTDESLE---DWRRSYAALNIGIFH 183
Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
RPT+ AK + W +L+ E K DQ FN
Sbjct: 184 WRPTEAAKKFARAWQIQLEDE------KIWDQNGFN 213
>gi|356545279|ref|XP_003541071.1| PREDICTED: uncharacterized protein LOC100804189 [Glycine max]
Length = 438
Score = 41.2 bits (95), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 87/213 (40%), Gaps = 23/213 (10%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ VA +IVC + L W +I R + ++ +D + + +
Sbjct: 155 LAKILEKVAVKRELIVCLANTNVKEMLEVWFTNIKRVGITNYLVAALDDETAKFCESNQV 214
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
P + PD G G + S +L L+LGY+V+ +DVD+V L++PF
Sbjct: 215 P----VYKRDPDD-GVDIIGRTGSNHAVSGLKFRILREFLQLGYSVLLSDVDIVHLQNPF 269
Query: 201 PYLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDG 250
+L D DV D+M A D D P G R ++ S Y+RPT
Sbjct: 270 DHLYRDSDVESMSDGHDNMTAYGYND-VFDEPTMGWARYAHTMRIWVYNSGFFYIRPTIP 328
Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
+ ++ + L E KA DQ FN L
Sbjct: 329 SIELLDRVATRLSKE------KAWDQAVFNEEL 355
>gi|384248111|gb|EIE21596.1| hypothetical protein COCSUDRAFT_17381, partial [Coccomyxa
subellipsoidea C-169]
Length = 529
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 80/194 (41%), Gaps = 18/194 (9%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA G ++V + YL F+ NWL ++ + LV A D L + R G
Sbjct: 12 VAPQGPVMVTWANFHYLDFVLNWLAHVNALGIK--PLVGAMDDKILQALVDR--GVHTFA 67
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--D 206
+ S+ +GS F + + ++G++++ DVD VW +PFPY+ D
Sbjct: 68 MRSGLSEDDFGWGSASFHKMGREKIQLIYTFTKMGFDILVADVDTVW--NPFPYMARYPD 125
Query: 207 HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
D+ + D D + P G M+ + GA + K+W E L A+
Sbjct: 126 ADILTSSDHLRNSTADDGLERFP--DAGSAANIGIMLVRK---GALSLAKEWNEVLLAD- 179
Query: 267 WSKAKKANDQPAFN 280
+ DQ AFN
Sbjct: 180 ----DQVWDQNAFN 189
>gi|303278666|ref|XP_003058626.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
gi|226459786|gb|EEH57081.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
Length = 924
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 58/140 (41%), Gaps = 19/140 (13%)
Query: 168 FTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH-----------DVYFTDDMA 216
F R +L +G++V+ +DVD+VW +DP P+ D + +A
Sbjct: 223 FARVRIGEARALLSMGFDVLMSDVDVVWRRDPRPFFSRRRDDDGDDGDGGDDGDYDASLA 282
Query: 217 AVKPLDHSHDLPPPG--KKGRTYIC-----SCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
+ + S +L P ++G TY + +++ R T + W E L
Sbjct: 283 SADVMVSSDNLSPRRDFEQGATYAARGTFNTGIVFARRTPAGERFAAAWYERLARPTGRF 342
Query: 270 AKKANDQPAFNWALNKTAGQ 289
A +DQ FN A+ + G+
Sbjct: 343 ASLTSDQQVFN-AMVRAEGR 361
>gi|449461134|ref|XP_004148298.1| PREDICTED: uncharacterized protein LOC101214097 [Cucumis sativus]
Length = 435
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 82/204 (40%), Gaps = 18/204 (8%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA + +IV + L W SI + + ++V +D + P +
Sbjct: 158 VAIDRELIVAVANSNVKAMLELWFTSIKKAGIPNYLVVALDDEIVQFCKKNDVP----VY 213
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
PD + + G + + L L+LGY V+ +DVD+V+L++PF +L D D
Sbjct: 214 MRDPDEKVDSIGRTGGNHAVSGTKFRILREFLQLGYAVLLSDVDIVYLQNPFNHLYRDSD 273
Query: 209 VY-FTDDMAAVKPLDHSHDLPPP-------GKKGRTYI-CSCMIYLRPTDGAKLVMKKWI 259
V TD V ++ P R ++ S Y+RPT A ++ +
Sbjct: 274 VESMTDGHDNVTAYGYNDVFEEPAMGWARFAHTMRIWVYNSGFFYIRPTIPAIELLDRVA 333
Query: 260 EELQAEPWSKAKKANDQPAFNWAL 283
L S+ + + DQ FN L
Sbjct: 334 NRL-----SRERNSWDQAVFNEEL 352
>gi|449492975|ref|XP_004159157.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101227563 [Cucumis sativus]
Length = 435
Score = 40.8 bits (94), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 54/117 (46%), Gaps = 14/117 (11%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY-FTDDMAAVKPLDHSHDLPPP---- 230
L L+LGY V+ +DVD+V+L++PF +L D DV TD V ++ P
Sbjct: 241 LREFLQLGYAVLLSDVDIVYLQNPFNHLYRDSDVESMTDGHDNVTAYGYNDVFEEPAMGW 300
Query: 231 ---GKKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT A ++ + L S+ + + DQ FN L
Sbjct: 301 ARFAHTMRIWVYNSGFFYIRPTIPAIELLDRVANRL-----SRERNSWDQAVFNEEL 352
>gi|297842253|ref|XP_002889008.1| hypothetical protein ARALYDRAFT_476650 [Arabidopsis lyrata subsp.
lyrata]
gi|297334849|gb|EFH65267.1| hypothetical protein ARALYDRAFT_476650 [Arabidopsis lyrata subsp.
lyrata]
Length = 429
Score = 40.8 bits (94), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 84/206 (40%), Gaps = 23/206 (11%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA N IIV + P L + SI R + +++ +D + + +
Sbjct: 154 VAVNKEIIVVLANSNVKPMLELQIASIKRVGILNYLIIALDDSVESFCESKEVSYY---- 209
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
PD S G + + L L+LGY+V+ +DVD+V+L++PF +L D D
Sbjct: 210 KRDPDKAVDMVGKSGGNHAVSGLKFRVLREFLQLGYSVLLSDVDIVFLQNPFGHLYRDSD 269
Query: 209 VYFT----DDMAAVKPLDHSHDLPPPG--KKGRT----YICSCMIYLRPTDGAKLVMKKW 258
V D+M A + D P G + T S YLRPT + ++ +
Sbjct: 270 VESMSDGHDNMTAY-GFNDVFDEPSMGWARYAHTMRIWVFNSGFFYLRPTLPSIELLDRV 328
Query: 259 IEEL-QAEPWSKAKKANDQPAFNWAL 283
L ++E W DQ FN L
Sbjct: 329 AYTLSKSEAW-------DQAVFNEQL 347
>gi|328769745|gb|EGF79788.1| hypothetical protein BATDEDRAFT_89199 [Batrachochytrium
dendrobatidis JAM81]
Length = 885
Score = 40.8 bits (94), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 46/200 (23%), Positives = 83/200 (41%), Gaps = 24/200 (12%)
Query: 75 AKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQV-----LVIAE 129
A R L + S +A N ++V V++ +L N S+ + + L + E
Sbjct: 557 ASVRSKCLKEFGSKIAHNNELVVVPVNRAFLVQAANLRCSLLHAGIDNLIYWSLDLEVHE 616
Query: 130 DYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYN 189
+ K+ ++ + P + G+ F +P L +L+ G+NV Y
Sbjct: 617 SLISKGKL-------SIFLAGFPSTSERQDPGTPYFIKMMRYKPKVLAMLLDAGFNVWYM 669
Query: 190 DVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHS----HDLPPPGKKGRTYICSCMIYL 245
D D + L+D P + D V+ + K LD S H +PP G ++YL
Sbjct: 670 DADTIALQDFRPEIVADTSVHIHIALDNDKRLDMSTFLKHSVPPLASAG-------IMYL 722
Query: 246 RPTDGAKLVMKKWIEELQAE 265
+ +KL +K + ++QA+
Sbjct: 723 QNHPSSKLFIKS-VLDMQAQ 741
>gi|326435846|gb|EGD81416.1| hypothetical protein PTSG_02139 [Salpingoeca sp. ATCC 50818]
Length = 422
Score = 40.8 bits (94), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 5/76 (6%)
Query: 180 LELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 239
L G++V ND D VWL+DP+P + + V+ D +A GK +C
Sbjct: 174 LHHGFDVWMNDADAVWLRDPWPLIHRELTVHKADIVAQRGGFPFE-----IGKAWGATMC 228
Query: 240 SCMIYLRPTDGAKLVM 255
+Y+R T KLV+
Sbjct: 229 FGFVYMRATAATKLVV 244
>gi|302844548|ref|XP_002953814.1| hypothetical protein VOLCADRAFT_106094 [Volvox carteri f.
nagariensis]
gi|300260922|gb|EFJ45138.1| hypothetical protein VOLCADRAFT_106094 [Volvox carteri f.
nagariensis]
Length = 1007
Score = 40.8 bits (94), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 32/136 (23%), Positives = 60/136 (44%), Gaps = 19/136 (13%)
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLH-ILELGYNVMYNDVDMVWLKDPFPYLQG-----DH 207
S T +++G ++ T+ H++ + E+G +V+++D+D+VW DP PY + H
Sbjct: 150 SDTKYQWGGH-HWSMTTWNKVHIMKSVYEMGVHVVHSDMDVVWFNDPLPYFKSLLEKPVH 208
Query: 208 DVYFTDDMAAVKPLDHS--HDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAE 265
V TD + + P + L P I + + ++R G W+
Sbjct: 209 IVIATDAVTSENPKGDTGLEALTSP----HANINTGIYFMRQWPGGLEFFNIWLS----- 259
Query: 266 PWSKAKKANDQPAFNW 281
W + +DQ FN+
Sbjct: 260 -WQDKRIGHDQDGFNF 274
>gi|145352275|ref|XP_001420477.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580711|gb|ABO98770.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 634
Score = 40.8 bits (94), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 55/119 (46%), Gaps = 4/119 (3%)
Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL 221
S+ F + + + +LELG + +D D+VW++DP Y D + + +
Sbjct: 112 SKTFKKMGTVKTKFIQDLLELGIAPILSDADVVWMRDPRELFNNGTYAY-ADVLISSDCI 170
Query: 222 DHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
D +D + ++++RPT+ AK ++KW +++ S+ DQPA N
Sbjct: 171 DTVNDRADNANCRNVNFNTGIVHIRPTEPAKAFVEKWKQKVAT---SEIAWMRDQPALN 226
>gi|359489310|ref|XP_002270120.2| PREDICTED: uncharacterized protein At1g28695-like [Vitis vinifera]
Length = 346
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 9/121 (7%)
Query: 92 NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
N T+I+ V++ Y+ FL ++ I ++ D +LV+A D + R
Sbjct: 61 NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMADHLLVVALDQTAFDRCIFRRL 120
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+V D + S+ F RR LL +L+ GY+ ++ D D+ WL++PFP
Sbjct: 121 HCYKMVSEDGDMDGEKLYMSKDFIKMMWRRTLLLLRVLDRGYSFIFTDTDVSWLRNPFPR 180
Query: 203 L 203
L
Sbjct: 181 L 181
>gi|255084750|ref|XP_002504806.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
gi|226520075|gb|ACO66064.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
Length = 750
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 46/98 (46%), Gaps = 4/98 (4%)
Query: 183 GYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCM 242
G+ V+ +DVD VWL DP + + D TD + L H + G G + + +
Sbjct: 272 GHRVLVSDVDTVWLDDPREWFERDDLPTRTDVSVSTDCLSHEEERRSRGCWGPGF-NTGI 330
Query: 243 IYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
++LRPT+ +M W + L + K +DQ FN
Sbjct: 331 LWLRPTEATINLMATWRDALLT---TSDKFEHDQDIFN 365
>gi|71659866|ref|XP_821653.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70887037|gb|EAN99802.1| hypothetical protein Tc00.1047053508461.84 [Trypanosoma cruzi]
Length = 714
Score = 40.4 bits (93), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 57/124 (45%), Gaps = 10/124 (8%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISIS-RQKHQDQVLVIAEDYATLYKVNGRWP--GHAV 146
+NG + V + + L+N++ S + R K ++ ++ +D A L ++ R P +
Sbjct: 395 GENGFVTVVIFNSFWRDQLHNFVYSFAKRAKMRNLIVASVDDTALLLCLSFRLPCLNATL 454
Query: 147 LVPPAPDSQTA-HKFGSQGFF------NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
V P ++ S+G F F+ +P + +L GY M D+D+ W + P
Sbjct: 455 FVEPEKGTEKGGDNASSKGGFTRKVTEEFSWVKPRLAIAVLRRGYTFMLADLDITWNRSP 514
Query: 200 FPYL 203
PYL
Sbjct: 515 MPYL 518
>gi|297734641|emb|CBI16692.3| unnamed protein product [Vitis vinifera]
Length = 313
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 9/121 (7%)
Query: 92 NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
N T+I+ V++ Y+ FL ++ I ++ D +LV+A D + R
Sbjct: 61 NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMADHLLVVALDQTAFDRCIFRRL 120
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+V D + S+ F RR LL +L+ GY+ ++ D D+ WL++PFP
Sbjct: 121 HCYKMVSEDGDMDGEKLYMSKDFIKMMWRRTLLLLRVLDRGYSFIFTDTDVSWLRNPFPR 180
Query: 203 L 203
L
Sbjct: 181 L 181
>gi|18394730|ref|NP_564082.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
gi|8778445|gb|AAF79453.1|AC025808_35 F18O14.8 [Arabidopsis thaliana]
gi|21593695|gb|AAM65662.1| unknown [Arabidopsis thaliana]
gi|109134163|gb|ABG25079.1| At1g19360 [Arabidopsis thaliana]
gi|110737961|dbj|BAF00917.1| hypothetical protein [Arabidopsis thaliana]
gi|332191718|gb|AEE29839.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
thaliana]
Length = 428
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 88/213 (41%), Gaps = 23/213 (10%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDY-ATLYKVNGR 140
L++ +A + +IV + L + SI R + ++V +DY L K N
Sbjct: 145 LAKILEEIAVDKEVIVALANANVKAMLEVQIASIKRVGITNYLVVALDDYIENLCKENDV 204
Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
P D T K G G + + L L+LGY V+ +DVD+V+L++PF
Sbjct: 205 AYYKR---DPDKDVDTVGKTG--GNHAVSGLKFRVLREFLQLGYGVLLSDVDIVFLQNPF 259
Query: 201 PYLQGDHDVY-FTD--DMAAVKPLDHSHDLPPPG--KKGRT----YICSCMIYLRPTDGA 251
+L D DV +D D + D P G + T S YLRPT +
Sbjct: 260 SHLYRDSDVESMSDGHDNHTAYGFNDVFDEPAMGWARYAHTMRIWVFNSGFFYLRPTIPS 319
Query: 252 KLVMKKWIEEL-QAEPWSKAKKANDQPAFNWAL 283
++ + + L +A+ W DQ FN L
Sbjct: 320 IELLDRVADRLSKAKVW-------DQAVFNEEL 345
>gi|357168283|ref|XP_003581573.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
distachyon]
Length = 389
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/176 (19%), Positives = 76/176 (43%), Gaps = 21/176 (11%)
Query: 79 DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
D L +A++ ++ T+I+ ++ + F++++ +S + +++IA D+
Sbjct: 89 DRVLQKAST---RDNTVILTTLNAAWASPGSVIDLFIDSFRSGVSTSSLLNHLVIIAFDW 145
Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYN-- 189
+ VL D +F + G+ +R L +LE GY+ +++
Sbjct: 146 NAYKQCLKIHHYCFVLATKGVDFSEEKRFLTSGYLEMMWKRLDFLRTVLEKGYSFIFSLF 205
Query: 190 -----DVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
D D++W ++PFP+ D D+ +A + ++ DL G Y+ S
Sbjct: 206 FYAFHDADIMWFRNPFPHFYPDGDI----QIACDHYVGNATDLKSIANGGFNYVKS 257
>gi|297850324|ref|XP_002893043.1| hypothetical protein ARALYDRAFT_472163 [Arabidopsis lyrata subsp.
lyrata]
gi|297338885|gb|EFH69302.1| hypothetical protein ARALYDRAFT_472163 [Arabidopsis lyrata subsp.
lyrata]
Length = 428
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 53/118 (44%), Gaps = 17/118 (14%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD---MAAVKPLDHSHDLPPPG- 231
L L+LGY V+ +DVD+V+L++PF +L D DV D + D P G
Sbjct: 235 LREFLQLGYGVLLSDVDIVFLQNPFSHLYRDSDVESMSDGHNNHTAYGFNDVFDEPAMGW 294
Query: 232 -KKGRT----YICSCMIYLRPTDGAKLVMKKWIEEL-QAEPWSKAKKANDQPAFNWAL 283
+ T S YLRPT + ++ + + L +A+ W DQ FN L
Sbjct: 295 ARYAHTMRIWVFNSGFFYLRPTIPSIELLDRVADRLSKAKVW-------DQAVFNEEL 345
>gi|359488580|ref|XP_002273650.2| PREDICTED: uncharacterized protein At4g15970-like [Vitis vinifera]
Length = 308
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 3/104 (2%)
Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
FL ++ + + +K + ++++ D P L P P+ A K
Sbjct: 67 FLESFKVGVGTKKLLNHLVIVTTDDQAFQYCKAMHPHCFPL--PTPEDFVARKPLMHPDR 124
Query: 167 NFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
+ RR LL + ELGYN ++ D D++WLK+PF Y+ D+
Sbjct: 125 SKFGRRTIRLLGEVDELGYNFVFTDADVMWLKNPFLYVDPIQDL 168
>gi|414869642|tpg|DAA48199.1| TPA: hypothetical protein ZEAMMB73_139999 [Zea mays]
Length = 573
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 84/201 (41%), Gaps = 36/201 (17%)
Query: 49 LLVVLGVILPWTGTPGFMFPNATSSL----AKW----RDYTLSQAASF-----VAKNG-- 93
LL V+ L T F+F AT+ + A W R +A SF V KN
Sbjct: 16 LLPVISFFLGAALTAAFVFLGATTDVSWRFAAWGSGARPGAGDEAKSFAELAEVLKNASM 75
Query: 94 ---TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
T+IV +++Q Y FL+++ D +L++A D A L
Sbjct: 76 EDKTVIVTSINQAYAAPGSLLDLFLDSFRAGEGTAGLLDHLLIVAVDPAALETCRS-VHR 134
Query: 144 HAVLVPPAPDSQTAHK-----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
H L+ PA + A F ++ + + R ILELG++ ++ DVD++W ++
Sbjct: 135 HCYLLRPAAGAADADLGAAKFFMTKDYLDMMWARNRLQQTILELGFSFLFTDVDILWFRN 194
Query: 199 PFPYLQGDHDV-----YFTDD 214
P ++ DV YF D
Sbjct: 195 PMRHIAVTSDVAIACDYFNGD 215
>gi|388519423|gb|AFK47773.1| unknown [Medicago truncatula]
Length = 429
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 15/117 (12%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY-FTDDMAAVKPLDHSHDLPPPGK-- 232
L L+LGY+V+ +D+D+V+L++PF +L D DV +D + ++ PG
Sbjct: 236 LREFLQLGYSVLLSDIDIVYLQNPFDHLYRDSDVESMSDGHNNMTAYGYNDVFNDPGMGW 295
Query: 233 -----KGRTYICSC-MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ + Y+RPT + ++ + L E KA DQ FN L
Sbjct: 296 SSGVFTRRIFVYNAGFFYIRPTIPSIELLDRVAARLLKE------KAWDQVVFNEEL 346
>gi|357500079|ref|XP_003620328.1| hypothetical protein MTR_6g081060 [Medicago truncatula]
gi|355495343|gb|AES76546.1| hypothetical protein MTR_6g081060 [Medicago truncatula]
Length = 379
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 1/61 (1%)
Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP 230
RR L +L LGY+ ++ D D++WL+DPF D D D D H+ P
Sbjct: 183 RRIEFLGAVLHLGYSFVFTDTDIMWLRDPFKQFYKDADFQIACDYFNGNSYD-MHNYPNG 241
Query: 231 G 231
G
Sbjct: 242 G 242
>gi|357115371|ref|XP_003559462.1| PREDICTED: uncharacterized protein At1g28695-like [Brachypodium
distachyon]
Length = 371
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ--GDHDVY 210
D + + S GF +R L +++ GY+ ++ D+D++WL++PF L G+ D+
Sbjct: 174 DLSSEQLYMSDGFIRMMWQRIRLLGDVVKHGYSFIFTDLDVMWLRNPFQSLNRTGEEDLL 233
Query: 211 FTDDMAAVKPLDH 223
+ D +P D+
Sbjct: 234 ISSDRFNGRPHDY 246
>gi|407847536|gb|EKG03219.1| hypothetical protein TCSYLVIO_005741 [Trypanosoma cruzi]
Length = 390
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 56/124 (45%), Gaps = 10/124 (8%)
Query: 90 AKNGTIIVCAVSQPYLPFLNNWLISIS-RQKHQDQVLVIAEDYATLYKVNGRWP--GHAV 146
+NG + V + + L+N++ S + R K ++ ++ +D A L + R P +
Sbjct: 71 GENGFVTVVIFNSFWRDHLHNFVYSFAKRAKMRNLIVASVDDTALLLCFSFRLPCLNATL 130
Query: 147 LVPPAPDSQTAH-KFGSQGFF------NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
V P ++ S+G F F+ +P + +L GY M D+D+ W + P
Sbjct: 131 FVEPEKGTEKGGANASSKGGFTRKVTEEFSWVKPPLAIAVLRRGYTFMLADLDITWNRSP 190
Query: 200 FPYL 203
PYL
Sbjct: 191 MPYL 194
>gi|225450476|ref|XP_002276802.1| PREDICTED: uncharacterized protein LOC100240902 [Vitis vinifera]
Length = 285
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
L L+LGY V+++D+D+V+L++PF YL D DV
Sbjct: 159 LGEFLQLGYIVLFSDIDIVYLQNPFDYLYRDSDV 192
>gi|302835477|ref|XP_002949300.1| hypothetical protein VOLCADRAFT_117113 [Volvox carteri f.
nagariensis]
gi|300265602|gb|EFJ49793.1| hypothetical protein VOLCADRAFT_117113 [Volvox carteri f.
nagariensis]
Length = 336
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/121 (23%), Positives = 56/121 (46%), Gaps = 15/121 (12%)
Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY-LQGDHDVY-------- 210
F GF + + ++L+ L LG++V++ D D+V ++DP PY L D D++
Sbjct: 99 FHDTGFNSLGFAKVKYILNGLSLGHDVVFLDTDIVVMRDPLPYLLVHDADLFGSMEKCMV 158
Query: 211 FTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA 270
+ D ++ P S PP I M+Y + + + W E+++E ++
Sbjct: 159 YNDSLSFNSPEFKSLRKRPPA------INIGMLYFKASASVTRCVYNWAWEMRSEVQTRP 212
Query: 271 K 271
+
Sbjct: 213 R 213
>gi|298704797|emb|CBJ48945.1| Acetyltransferase (isoleucine patch superfamily protein)
[Ectocarpus siliculosus]
Length = 423
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 48/109 (44%), Gaps = 16/109 (14%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP--GKKGRT 236
+L G +V+ +D D +WL+DP L D + ++ A + + +P P K G T
Sbjct: 237 LLNAGQDVLMSDADAIWLRDPTDDLAS--DAFLDSNVVASRAV-----MPFPLYHKWGAT 289
Query: 237 YICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNK 285
IC + R A +EE+ E NDQ A N ALNK
Sbjct: 290 -ICMGFAFFRAGGSAMQAFMTVVEEIANE------LGNDQIAVNEALNK 331
>gi|296089833|emb|CBI39652.3| unnamed protein product [Vitis vinifera]
Length = 247
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
L L+LGY V+++D+D+V+L++PF YL D DV
Sbjct: 115 LGEFLQLGYIVLFSDIDIVYLQNPFDYLYRDSDV 148
>gi|159484574|ref|XP_001700329.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272370|gb|EDO98171.1| predicted protein [Chlamydomonas reinhardtii]
Length = 528
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 46/114 (40%), Gaps = 27/114 (23%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGD-----HDVYFTDDMAAVKPLDHSHDLPPPGKK 233
+ E G NV+++D D+VW DP P+ H + TD +A P+ G
Sbjct: 48 VYEFGVNVIHSDTDVVWFGDPLPFFHERLSGPVHVIMATDAVATGNPV---------GDM 98
Query: 234 GR-------TYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
G T I + + +++ G + K W++ W +DQ FN
Sbjct: 99 GLEVTTNPFTNINTGIYFIKQYAGGLDMFKAWLD------WQDKNIGHDQDGFN 146
>gi|397604903|gb|EJK58815.1| hypothetical protein THAOC_21028 [Thalassiosira oceanica]
Length = 645
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 4/35 (11%)
Query: 184 YNVMYNDVDMVWLKDPFPYLQ----GDHDVYFTDD 214
+ + +DVDMVWLKDP Y + GD DV+F DD
Sbjct: 411 HEMAASDVDMVWLKDPRLYFESGSAGDFDVFFQDD 445
>gi|307111693|gb|EFN59927.1| hypothetical protein CHLNCDRAFT_49368 [Chlorella variabilis]
Length = 617
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 159 KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL--QGDHDVYFTDD 214
K+G +G+ T + L + + G+N++ +DVD+VW +DP P D+ F++D
Sbjct: 163 KWGEEGWRRMTWAKVFVLDAVADWGFNLVISDVDVVWFRDPLPLFAKHAHADLIFSED 220
>gi|219130406|ref|XP_002185357.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403272|gb|EEC43226.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 647
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH---DVY 210
S+ A ++G + F + + + LGY++++ DVD+VW +P Y D++
Sbjct: 391 SEAARRYGDRRFTAMMMAKVICVQLVSMLGYDLLFQDVDIVWFSNPLEYFAHADPGMDMF 450
Query: 211 FTDDMA 216
F DD A
Sbjct: 451 FQDDGA 456
>gi|118485729|gb|ABK94714.1| unknown [Populus trichocarpa]
Length = 260
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG 231
L L+LGY+V+ +DVD+++L++PF +L D DV D+M A D + P G
Sbjct: 66 LREFLQLGYSVLLSDVDIIYLQNPFDHLYRDSDVESMSDGHDNMTAYG-FDDVFNEPAMG 124
Query: 232 -----KKGRTYIC-SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S Y+RPT + ++ + L EP S DQ FN L
Sbjct: 125 WARYAHTMRIWVYNSGFFYIRPTLPSIELLDRVAGRLSREPNSW-----DQAVFNEEL 177
>gi|303283027|ref|XP_003060805.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
gi|226458276|gb|EEH55574.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
Length = 366
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 50/204 (24%), Positives = 80/204 (39%), Gaps = 34/204 (16%)
Query: 100 VSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL----------YKVNGRWPGHAVLV- 148
VS + F+ NW + R ++ V+V A D AT Y H V+
Sbjct: 153 VSDGFHEFMLNWHAHVKRLGIRN-VVVAALDEATYATCARHAIACYSHRSLRYTHGVVAT 211
Query: 149 --PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD 206
P D+ + + F + + LL +L+ G V+ +DVD+VWL+DP
Sbjct: 212 GGSPLHDANASVTLNATAFQQIGALKTQFLLTLLKRGLRVLVSDVDVVWLRDPA------ 265
Query: 207 HDVYFTDDMAAVKPLDH----SHDLPPPGKKGRTYIC------SCMIYLRPTDGAKLVMK 256
+ YF A S D + +T C + ++Y+ PT+ A +
Sbjct: 266 -ESYFDATDGAATAAAADIAVSTDCLSAIDEAKTRGCWHMQFNTGIMYVNPTETAMAFVT 324
Query: 257 KWIEELQAEPWSKAKKANDQPAFN 280
W E L+A + +DQ FN
Sbjct: 325 AWGEALRA---TTHAFEHDQDVFN 345
>gi|219130412|ref|XP_002185360.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403275|gb|EEC43229.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 483
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH---DVY 210
S+ A ++G + F + + + LGY++++ DVD+VW +P Y D++
Sbjct: 391 SEAARRYGDRRFTAMMMAKVICVQLVSMLGYDLLFQDVDIVWFSNPLEYFAHADPGMDMF 450
Query: 211 FTDDMA 216
F DD A
Sbjct: 451 FQDDGA 456
>gi|159479202|ref|XP_001697686.1| hypothetical protein CHLREDRAFT_195312 [Chlamydomonas reinhardtii]
gi|158269942|gb|EDO96019.1| predicted protein [Chlamydomonas reinhardtii]
Length = 579
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 156 TAHKFGSQGFFNFTSRRPCHLLH-ILELGYNVMYNDVDMVWLKDPFPYLQ 204
TA+ +GS + T+ H++ + E G NV+++D D+VW DP P+ Q
Sbjct: 27 TAYHWGSH-HWKLTTWNKVHIIKAVYEFGVNVIHSDTDVVWFGDPLPFFQ 75
>gi|168040047|ref|XP_001772507.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676197|gb|EDQ62683.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 552
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 26/36 (72%)
Query: 168 FTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
T R+ +L IL+ GY+V+++DVD++W ++P P L
Sbjct: 338 LTQRKLQVVLQILQKGYHVVWSDVDVIWFQNPLPRL 373
>gi|397580444|gb|EJK51588.1| hypothetical protein THAOC_29231 [Thalassiosira oceanica]
Length = 619
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 71/156 (45%), Gaps = 26/156 (16%)
Query: 79 DYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQD--QVLV---------- 126
D L++ + ++ II V++ L N+L + +R K+ D ++LV
Sbjct: 224 DERLAKLTAISNRHQPIIAMTVNKGQSQLLANFLCA-ARSKNLDVSRILVFVTDAESEAI 282
Query: 127 ----IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILEL 182
I++D+++ R+ +V P D++T +G F + +L+I L
Sbjct: 283 VQNLISDDHSSPMVYFDRYNFESV--PLGGDNET---YGDATFTAMMWVKILSVLYISLL 337
Query: 183 GYNVMYNDVDMVWLKDPFPYLQGD----HDVYFTDD 214
G+++M+ DVD+VW DP P D +D F D
Sbjct: 338 GHDIMFQDVDIVWEDDPLPLFYCDDLVRYDAIFQHD 373
>gi|167516356|ref|XP_001742519.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779143|gb|EDQ92757.1| predicted protein [Monosiga brevicollis MX1]
Length = 1057
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 6/114 (5%)
Query: 80 YTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
+TL VA++ ++V S Y + N++ + + DQ+++ A D +Y+
Sbjct: 530 FTLRDLLPVVARDNHVLVTGASYAYRDVVMNFVCQLRKLGIYDQLVIAAFD-EDMYRFGF 588
Query: 140 RWPGHAVLVPPAPD--SQTAH--KFGSQGFFNFTSRRPCHLLHILELGYNVMYN 189
R G V A D T+H ++GSQ F T + +L IL+LGY+V +
Sbjct: 589 RM-GLPVFFYQASDISGLTSHDLEYGSQHFKKVTKLKSQVVLQILKLGYDVTWT 641
>gi|242050722|ref|XP_002463105.1| hypothetical protein SORBIDRAFT_02g037900 [Sorghum bicolor]
gi|241926482|gb|EER99626.1| hypothetical protein SORBIDRAFT_02g037900 [Sorghum bicolor]
Length = 423
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 83/205 (40%), Gaps = 20/205 (9%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA +IV + L W +I R + ++V +D + + P +
Sbjct: 147 VAVKKELIVALANSNVKEMLEMWFTNIKRAGISNYLVVALDDNIENFCKSNDVP----VY 202
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
PD + + G + + L L+LGY+V+ +D+D+++ ++PF +L D D
Sbjct: 203 RRDPDDGIDNIGKTGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPFDHLYRDSD 262
Query: 209 VYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
V D M A + D P G R ++ S ++RPT + ++ +
Sbjct: 263 VESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 321
Query: 259 IEELQAEPWSKAKKANDQPAFNWAL 283
L EP K+ DQ FN L
Sbjct: 322 AYRLSHEP-----KSWDQAVFNEEL 341
>gi|147832387|emb|CAN64421.1| hypothetical protein VITISV_031574 [Vitis vinifera]
Length = 397
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 9/121 (7%)
Query: 92 NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
N T+I+ V++ Y+ FL ++ I ++ + +LV+A D + R
Sbjct: 112 NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMAEHLLVVALDQTAFDRCIFRRL 171
Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
+V D + S+ F RR LL +LE GY+ ++ D D+ WL++PFP
Sbjct: 172 HCYKMVSEDGDMDGEKLYMSKDFIKMMWRRTLLLLRVLERGYSFIFTDTDVSWLRNPFPR 231
Query: 203 L 203
L
Sbjct: 232 L 232
>gi|326497825|dbj|BAJ94775.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499500|dbj|BAJ86061.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512430|dbj|BAJ99570.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 424
Score = 38.9 bits (89), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 87/212 (41%), Gaps = 20/212 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ VA IIV + L W +I R + ++V +D + +
Sbjct: 141 LAKILQHVAVKKEIIVAVANSNVKETLGMWFTNIKRVGITNYLVVALDDSIENFCKSKDV 200
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
P + PD + G ++ + L L+LGY+V+ +D+D+++ ++PF
Sbjct: 201 P----VYRRDPDEGIDSIGKTGGNHAVSALKFRILREFLQLGYSVLLSDIDIMFFQNPFD 256
Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
+L D D+ D+M A + D P G R ++ S ++RPT +
Sbjct: 257 HLYRDSDIESMSDGHDNMTAYG-FNDVFDEPSMGWARYAHTMRIWVFNSGFFFIRPTIPS 315
Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
++ + L EP K+ DQ FN L
Sbjct: 316 IELLDRVAGRLSREP-----KSWDQAVFNEEL 342
>gi|357128785|ref|XP_003566050.1| PREDICTED: uncharacterized protein LOC100833479 [Brachypodium
distachyon]
Length = 711
Score = 38.9 bits (89), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 26/44 (59%)
Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
FG+ F T + +L IL LGYNV+ +DVD+ W +P +L
Sbjct: 491 FGTPCFQRVTKVKSRVVLEILRLGYNVLLSDVDVYWFDNPVQFL 534
>gi|422293879|gb|EKU21179.1| hypothetical protein NGA_0133501 [Nannochloropsis gaditana CCMP526]
Length = 421
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 85/203 (41%), Gaps = 22/203 (10%)
Query: 15 PNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSL 74
P + L PR ++ + RP + R+ LL L L + G +F A S
Sbjct: 140 PPSWRLKPR--ISLEGSRPPKMTHMRSQEGFLLELFSNLPDLEEKAGA---LFARAADSQ 194
Query: 75 AKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL 134
+ L++ + N VC+ Q + N L+ IS K V+ IAE L
Sbjct: 195 KRLLIMALNEGDVDLLVN---FVCSARQASISVEN--LVVISADK---SVVDIAEAL-NL 245
Query: 135 YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMV 194
+ + PG L S+ + ++G + F + + + LGY+V++ D D+V
Sbjct: 246 HAFS--HPGFGTL-----PSERSGRYGDENFQVMMWLKVVSVWIAIRLGYHVLFQDADLV 298
Query: 195 WLKDPFP-YLQGDHDVYFTDDMA 216
WLK P+ + D +F DD A
Sbjct: 299 WLKSPWEAFADTSIDGFFMDDGA 321
>gi|407928253|gb|EKG21116.1| Heat shock protein Hsp70 [Macrophomina phaseolina MS6]
Length = 251
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 8/82 (9%)
Query: 125 LVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGF------FNFTSRRPCHLLH 178
L++A DY T Y VN WPG+A V P S+ A+ + G FN S C+
Sbjct: 8 LIVAVDYGTTYTVN-EWPGNAESVFKVP-SKIAYAAENPGLDDDQWGFNVKSSMGCYTWT 65
Query: 179 ILELGYNVMYNDVDMVWLKDPF 200
L L N+ D D L+D F
Sbjct: 66 KLLLDKNIRLTDFDDSKLEDVF 87
>gi|326527715|dbj|BAK08132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 424
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 87/212 (41%), Gaps = 20/212 (9%)
Query: 82 LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
L++ VA IIV + L W +I R + ++V +D + +
Sbjct: 141 LAKILQHVAVKKEIIVAVANSNVKETLGMWFTNIKRVGITNYLVVALDDSIENFCKSKDV 200
Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
P + PD + G ++ + L L+LGY+V+ +D+D+++ ++PF
Sbjct: 201 P----VYRRDPDEGIDSIGKTGGNHAVSALKFRILREFLQLGYSVLLSDIDIMFFQNPFD 256
Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
+L D D+ D+M A + D P G R ++ S ++RPT +
Sbjct: 257 HLYRDSDIESMSDGHDNMTAYG-FNDVFDEPSMGWARYAHTMRIWVFNSGFFFIRPTIPS 315
Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
++ + L EP K+ DQ FN L
Sbjct: 316 IELLDRVAGRLSREP-----KSWDQAVFNEEL 342
>gi|226504952|ref|NP_001140234.1| uncharacterized protein LOC100272274 [Zea mays]
gi|194698620|gb|ACF83394.1| unknown [Zea mays]
gi|414590698|tpg|DAA41269.1| TPA: hypothetical protein ZEAMMB73_319089 [Zea mays]
Length = 423
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 55/118 (46%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
L L+LGY+V+ +D+D+++ ++PF +L D DV D M A + D P G
Sbjct: 230 LREFLQLGYSVLLSDIDIIFFRNPFDHLYRDSDVESMSDGHNNMTAYG-FNDVFDEPSMG 288
Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S ++RPT + ++ + L EP K+ DQ FN L
Sbjct: 289 WARYAHTMRIWVYNSGFFFIRPTIPSIELLDRVAYRLSHEP-----KSWDQAVFNEEL 341
>gi|357122173|ref|XP_003562790.1| PREDICTED: uncharacterized protein LOC100830494 [Brachypodium
distachyon]
Length = 424
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 16/118 (13%)
Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
L L+LGY+V+ +D+D+++ ++PF +L D D+ D M A + D P G
Sbjct: 231 LREFLQLGYSVLLSDIDIMFFQNPFDHLHRDSDIESMSDGHNNMTAYG-FNDVFDEPSMG 289
Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
R ++ S ++RPT + ++ + L EP K+ DQ FN L
Sbjct: 290 WARYAHTMRIWVYNSGFFFIRPTIPSIELLDRVAGRLSREP-----KSWDQAVFNEEL 342
>gi|224002126|ref|XP_002290735.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974157|gb|EED92487.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 561
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 67/164 (40%), Gaps = 14/164 (8%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISI-SRQKHQDQVLVIAEDYATLYKVNGRWPGH--- 144
+ K T++V V++ L N++ + SR VLV D + G +
Sbjct: 239 IVKENTVVVMTVNKGQSLLLTNFVCNAHSRGFDISNVLVFPTDEESRKLAEGLGLAYYYD 298
Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
+ + P+ + + +G F + + +I LGY+V++ DVD+ WL+DP +
Sbjct: 299 EINLGHMPEKEATY-YGDDTFAAMMFAKILCVYYINLLGYDVLFQDVDITWLRDPLEFFH 357
Query: 205 GDHDVYFTDDMAAVKPLD--HSHDLPPPGKKGRTYICSCMIYLR 246
+ AAV+ D HD P + S Y+R
Sbjct: 358 NKTN-------AAVQSYDIAFQHDGSPQPRFAPYSANSGFYYVR 394
>gi|302838823|ref|XP_002950969.1| hypothetical protein VOLCADRAFT_91495 [Volvox carteri f.
nagariensis]
gi|300263664|gb|EFJ47863.1| hypothetical protein VOLCADRAFT_91495 [Volvox carteri f.
nagariensis]
Length = 279
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 62/155 (40%), Gaps = 22/155 (14%)
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY-LQGDH 207
P P Q SQG F TS +L L LG +V++ D D V+ ++P PY L +
Sbjct: 50 PDGPMGQMKSVDRSQGSFTKTST----ILDGLTLGVDVLFLDADQVFFRNPLPYILAREA 105
Query: 208 DVYFTDDMAAVKPLDHSH-DLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW------IE 260
D+ + D H+H D P + +Y RPT + W I
Sbjct: 106 DILVSGDC-------HNHGDATPMERFPPINNNIGFVYFRPTAMVTRAIYNWALWLRNIA 158
Query: 261 ELQAEPWSKAKKANDQPAFNWALNKTAGQVCSVCL 295
+PW ++ A PA W + + + S+ +
Sbjct: 159 LTGNKPWDQSTFA---PAVEWVSSDVSVRHLSMSM 190
>gi|302834667|ref|XP_002948896.1| hypothetical protein VOLCADRAFT_89227 [Volvox carteri f.
nagariensis]
gi|300266087|gb|EFJ50276.1| hypothetical protein VOLCADRAFT_89227 [Volvox carteri f.
nagariensis]
Length = 284
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 30/120 (25%), Positives = 54/120 (45%), Gaps = 16/120 (13%)
Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL-QGDHDVYFTDDMAAV 218
F S GF + ++L+ L G++V++ D D+ L+DP PY+ Q D D++ + V
Sbjct: 149 FHSDGFHALGFAKVKYILNSLSAGHDVLFLDADVQLLRDPVPYIYQRDGDMFVMMEKCLV 208
Query: 219 ----KPLDHSHDLP----PPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA 270
+P+ P PP G +Y + T G + WI ++ + ++A
Sbjct: 209 YNVSRPITFRRSEPDVWIPPLNIGS-------LYFKATAGVTRCVYNWIWDMYGQVPNRA 261
>gi|412988791|emb|CCO15382.1| glycosyltransferase family 77 protein [Bathycoccus prasinos]
Length = 841
Score = 38.1 bits (87), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 80/206 (38%), Gaps = 40/206 (19%)
Query: 94 TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW-------PGHAV 146
I V + F NW+ + R K + V A D L + W G++V
Sbjct: 209 AIFVTFATHSVRDFAKNWVNAARRLKLEPH-FVGALDEKMLEDLKS-WNVPSMLLTGNSV 266
Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ-- 204
L TA GS F + + + +L++ N + +D D+ W++DP Y
Sbjct: 267 LANRGVQFITA---GSAAFKKMGTVKTKFVQDLLDMNLNPILSDADVAWMRDPRAYFNKG 323
Query: 205 --GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC------SCMIYLRPTDGAKLVMK 256
G D+ + D V PG K + C + ++++R T+ +K ++
Sbjct: 324 TYGKADILVSTDCIDV-----------PGDKDDSNKCAHVNFNTGILHIRATEESKNFLQ 372
Query: 257 KWIEELQAE--PWSKAKKANDQPAFN 280
W ++ W + DQPAFN
Sbjct: 373 AWKTKVATSTIAWMR-----DQPAFN 393
>gi|302850830|ref|XP_002956941.1| hypothetical protein VOLCADRAFT_83868 [Volvox carteri f.
nagariensis]
gi|300257822|gb|EFJ42066.1| hypothetical protein VOLCADRAFT_83868 [Volvox carteri f.
nagariensis]
Length = 647
Score = 38.1 bits (87), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 84/195 (43%), Gaps = 22/195 (11%)
Query: 91 KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED--YATLYKVN-GRWPGHAVL 147
++G I++ + YL F W+ ++ + ++ +D L ++N W + +
Sbjct: 21 QDGLIMITWANHHYLDFAKTWVYNLKKSGVSGYMVGAMDDDMLKDLVELNINTWRMNTGI 80
Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-- 205
++ +GSQ F + + +L L V+ +D+D WLK+P PY
Sbjct: 81 ------TKRDLGWGSQNFHLMGRFKIKLIRDVLALDVTVVVSDIDTAWLKNPIPYFHRYP 134
Query: 206 DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAE 265
+ D+ + D + D S + P G + M++ RP +K + +W++ L +
Sbjct: 135 EADILTSTDQLSPTVKDDSLERFPDA--GSAFNIGIMLF-RPN--SKEFVDEWVKALD-D 188
Query: 266 PWSKAKKANDQPAFN 280
P K DQ AFN
Sbjct: 189 P-----KMWDQTAFN 198
>gi|323449948|gb|EGB05832.1| hypothetical protein AURANDRAFT_66047 [Aureococcus anophagefferens]
Length = 584
Score = 38.1 bits (87), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-DHDVYFT 212
+ AHK+G F + + + + G +V++ D D+VW DP P L D +
Sbjct: 360 ERPAHKYGDSVFVSMMWLKVVAVYLVNRCGRDVLFMDADVVWQADPLPLLAAYGEDTLWM 419
Query: 213 DDMA 216
DD+A
Sbjct: 420 DDVA 423
>gi|308809339|ref|XP_003081979.1| unnamed protein product [Ostreococcus tauri]
gi|116060446|emb|CAL55782.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 579
Score = 38.1 bits (87), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 53/119 (44%), Gaps = 4/119 (3%)
Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL 221
S+ F + + + +L+LG + +D D+VW++DP + + D + + +
Sbjct: 93 SKAFKKMGTVKTKFVQDLLDLGIAPILSDADVVWMRDPREVFN-NGTFKYADILISSDCI 151
Query: 222 DHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
D D + ++Y+RPT AK ++KW ++ S+ DQPA N
Sbjct: 152 DTVGDRKDDKSCLHVNFNTGVLYIRPTTRAKEFVEKWKHKVAT---SEIAWMRDQPALN 207
>gi|224002501|ref|XP_002290922.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972698|gb|EED91029.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 604
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQG----DHDVYFTDD 214
+ ELG ++++ DVD+VW ++P Y + D+YF DD
Sbjct: 430 VNELGKDLLFQDVDVVWYQNPLEYFHDSSLTEFDIYFQDD 469
>gi|297607539|ref|NP_001060135.2| Os07g0587100 [Oryza sativa Japonica Group]
gi|34393596|dbj|BAC83249.1| hypothetical protein [Oryza sativa Japonica Group]
gi|255677929|dbj|BAF22049.2| Os07g0587100 [Oryza sativa Japonica Group]
Length = 426
Score = 37.7 bits (86), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 83/205 (40%), Gaps = 20/205 (9%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA +IV + L W +I R + ++V +D + + P
Sbjct: 150 VAIKKELIVALANSNVREMLEMWFTNIKRVGISNYLIVALDDSTETFCKSKGVP----FY 205
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
PD + G G + S +L L+LGY+V+ +D+D+++ ++PF +L D
Sbjct: 206 RRDPD-EGVDNIGKVGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPFDHLYRDA 264
Query: 208 DVY-FTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
DV +D D + D P G R ++ S ++RPT + ++ +
Sbjct: 265 DVESMSDGHDNRTAYGFNDVFDEPLMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 324
Query: 259 IEELQAEPWSKAKKANDQPAFNWAL 283
L EP K+ DQ FN L
Sbjct: 325 AGRLSREP-----KSWDQAVFNEEL 344
>gi|384249480|gb|EIE22961.1| hypothetical protein COCSUDRAFT_41969 [Coccomyxa subellipsoidea
C-169]
Length = 396
Score = 37.7 bits (86), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 35/67 (52%)
Query: 137 VNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
+NGR+ H V P T ++ G + +L+IL LGY+V+Y DVD V+
Sbjct: 142 LNGRYSHHCVQSTNWPGHDTEYETGDAWHVAAELHKLELILNILTLGYSVLYVDVDHVFY 201
Query: 197 KDPFPYL 203
++P +L
Sbjct: 202 RNPMHHL 208
>gi|218199927|gb|EEC82354.1| hypothetical protein OsI_26669 [Oryza sativa Indica Group]
Length = 426
Score = 37.7 bits (86), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 83/205 (40%), Gaps = 20/205 (9%)
Query: 89 VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
VA +IV + L W +I R + ++V +D + + P
Sbjct: 150 VAIKKELIVALANSNVREMLEMWFTNIKRVGISNYLIVALDDSTETFCKSKGVP----FY 205
Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
PD + G G + S +L L+LGY+V+ +D+D+++ ++PF +L D
Sbjct: 206 RRDPD-EGVDNIGKVGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPFDHLYRDA 264
Query: 208 DVY-FTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
DV +D D + D P G R ++ S ++RPT + ++ +
Sbjct: 265 DVESMSDGHDNRTAYGFNDVFDEPLMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 324
Query: 259 IEELQAEPWSKAKKANDQPAFNWAL 283
L EP K+ DQ FN L
Sbjct: 325 AGRLSREP-----KSWDQAVFNEEL 344
>gi|159484632|ref|XP_001700358.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272399|gb|EDO98200.1| predicted protein [Chlamydomonas reinhardtii]
Length = 629
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 53/133 (39%), Gaps = 10/133 (7%)
Query: 76 KWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLY 135
+W + L VAK T++V + + L + R + L+ A D T
Sbjct: 16 RWEELVLG-----VAKENTVLVTVIDKIVWAQLGPSYVDNVRAANISYWLIAALDPETSL 70
Query: 136 KVNGRWP-GHAVLVPPA----PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYND 190
+ GH P S +K+GS + T + + + E G +++++D
Sbjct: 71 GLGSMGAVGHCFNAPADRLVYKGSDAQYKWGSNHWTQTTWNKVHMMKAVYEFGVHIIHSD 130
Query: 191 VDMVWLKDPFPYL 203
D+VW DP PY
Sbjct: 131 TDVVWFSDPLPYF 143
>gi|397640988|gb|EJK74420.1| hypothetical protein THAOC_03902, partial [Thalassiosira oceanica]
Length = 1388
Score = 37.4 bits (85), Expect = 8.3, Method: Composition-based stats.
Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 5/106 (4%)
Query: 95 IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
+IV + + Y L NW ++H + +V++ D ++ P ++L +
Sbjct: 899 VIVTSSNYEYRDLLYNW--ECHARRHGLKWVVVSMDQLIHEELG---PERSILAEGINIT 953
Query: 155 QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
A KF S+ F T + +L ILE G+ V+++D D V L DP
Sbjct: 954 DHAGKFRSKSFNKITCGKFRAVLDILESGHEVLFSDPDNVLLMDPL 999
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.139 0.458
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,542,265,028
Number of Sequences: 23463169
Number of extensions: 234982419
Number of successful extensions: 702851
Number of sequences better than 100.0: 425
Number of HSP's better than 100.0 without gapping: 313
Number of HSP's successfully gapped in prelim test: 112
Number of HSP's that attempted gapping in prelim test: 702360
Number of HSP's gapped (non-prelim): 451
length of query: 318
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 176
effective length of database: 9,027,425,369
effective search space: 1588826864944
effective search space used: 1588826864944
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)