BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 018302
         (358 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255541750|ref|XP_002511939.1| conserved hypothetical protein [Ricinus communis]
 gi|223549119|gb|EEF50608.1| conserved hypothetical protein [Ricinus communis]
          Length = 356

 Score =  626 bits (1615), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 302/359 (84%), Positives = 325/359 (90%), Gaps = 4/359 (1%)

Query: 1   MSQWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWT 60
           M+ +LHQRPLHN L +PYPLSPRNS   Q Q   + + +RT L+VL SLL++LGV +PWT
Sbjct: 1   MTSFLHQRPLHNSLSDPYPLSPRNSANSQRQ---ISIFSRTGLIVLFSLLLILGVFVPWT 57

Query: 61  GTPGFMF-PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQK 119
             P  +F     SS+AKWR YTL QAASFVA+NGT+IVCAVSQPYLPFLNNWLISI+RQK
Sbjct: 58  ELPNGIFSATKQSSVAKWRQYTLPQAASFVAQNGTVIVCAVSQPYLPFLNNWLISITRQK 117

Query: 120 HQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHI 179
           HQD+VLVIAEDYATLYKVN +WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRP HLLH+
Sbjct: 118 HQDKVLVIAEDYATLYKVNEKWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPRHLLHL 177

Query: 180 LELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 239
           LELGYNVMYNDVDMVWL DPF YL+G HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC
Sbjct: 178 LELGYNVMYNDVDMVWLGDPFIYLEGKHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 237

Query: 240 SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSA 299
           SCMI+L PT GAKLVMKKWI+ELQA+PWSKAKKANDQPAFNWALNKTAGQVDLYLLPQ+A
Sbjct: 238 SCMIFLHPTVGAKLVMKKWIKELQAQPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQAA 297

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           FPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHA ESPLGKL
Sbjct: 298 FPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAQESPLGKL 356


>gi|225454397|ref|XP_002279469.1| PREDICTED: UDP-galactose:fucoside alpha-3-galactosyltransferase
           [Vitis vinifera]
 gi|297745375|emb|CBI40455.3| unnamed protein product [Vitis vinifera]
          Length = 361

 Score =  580 bits (1495), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 278/348 (79%), Positives = 316/348 (90%), Gaps = 9/348 (2%)

Query: 14  LPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPN---A 70
           L NP P+SPRN +   ++RP + +  RT LL+LL+L+VV+GV+LP   T     P+   +
Sbjct: 20  LSNPNPISPRNPL--DWRRP-ISIFGRTGLLILLTLMVVVGVLLP---TMKMRMPDGLLS 73

Query: 71  TSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED 130
            +S++KWRDYTL+QAA FVAKNGT+IVCAVSQPYLPFLNNWLISI+RQKHQD+VLVIAED
Sbjct: 74  RASVSKWRDYTLAQAAEFVAKNGTVIVCAVSQPYLPFLNNWLISIARQKHQDKVLVIAED 133

Query: 131 YATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYND 190
           YATLY VN +WPGHAVLVPPAPD+QTAHKFGS GFFNFTSRRP HLL+ILELGYNVMYND
Sbjct: 134 YATLYTVNQKWPGHAVLVPPAPDAQTAHKFGSMGFFNFTSRRPRHLLNILELGYNVMYND 193

Query: 191 VDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDG 250
           VDMVWL DPFPYLQG HDVYFTDDMAAVKPL+HSHDLPPPGKKGRTYICSCMI++RPT+G
Sbjct: 194 VDMVWLADPFPYLQGKHDVYFTDDMAAVKPLNHSHDLPPPGKKGRTYICSCMIFMRPTNG 253

Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQ 310
           AKLVMKKWIEELQA+PWS+AKK+NDQPAFNWALN+TAG+VDLYLLPQ+AFPTGGLYFKN+
Sbjct: 254 AKLVMKKWIEELQAQPWSRAKKSNDQPAFNWALNRTAGEVDLYLLPQAAFPTGGLYFKNK 313

Query: 311 TWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           TWV+ETKG +VIIHNNYITGFEKKIKRF+D+GLWLVDDHA ESPLGKL
Sbjct: 314 TWVQETKGMNVIIHNNYITGFEKKIKRFQDYGLWLVDDHAQESPLGKL 361


>gi|449441404|ref|XP_004138472.1| PREDICTED: UDP-galactose:fucoside
           alpha-3-galactosyltransferase-like [Cucumis sativus]
 gi|449495244|ref|XP_004159776.1| PREDICTED: UDP-galactose:fucoside
           alpha-3-galactosyltransferase-like [Cucumis sativus]
          Length = 355

 Score =  575 bits (1483), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 281/358 (78%), Positives = 316/358 (88%), Gaps = 3/358 (0%)

Query: 1   MSQWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWT 60
           M   LHQR  H+ L N YPLSPR+S + +    +   +N   LL LLSL+V+LGV LPW 
Sbjct: 1   MPSVLHQRSTHSSLANSYPLSPRSSSSSERSFSIFSPIN---LLALLSLIVILGVFLPWM 57

Query: 61  GTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKH 120
                +F ++  S +KWR+Y+L++AASFVA+NGT+IVCAVSQPYLPFLNNWLIS+SRQKH
Sbjct: 58  NIQESIFSSSKVSNSKWREYSLAEAASFVARNGTVIVCAVSQPYLPFLNNWLISLSRQKH 117

Query: 121 QDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
            ++VLVIAEDYATLYKVN RWPGHAVLVPPAPD+QTAHKFGSQGFFNFTSRRP HLLHIL
Sbjct: 118 HEKVLVIAEDYATLYKVNERWPGHAVLVPPAPDAQTAHKFGSQGFFNFTSRRPRHLLHIL 177

Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           ELGYNVMYNDVDMVWL DPFPYLQG+HDVYFTDDMAAVKPL HSHDLPPPGKKGRTYICS
Sbjct: 178 ELGYNVMYNDVDMVWLADPFPYLQGNHDVYFTDDMAAVKPLHHSHDLPPPGKKGRTYICS 237

Query: 241 CMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAF 300
           CMI+LRPT GAKLVM+KWIEEL+A+PWSKAKKANDQPAFNWALNKTAG+VDLYLLPQSAF
Sbjct: 238 CMIFLRPTSGAKLVMRKWIEELKAQPWSKAKKANDQPAFNWALNKTAGEVDLYLLPQSAF 297

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           PTGGLYFKN++WV+ETKG HVIIHNNYITGFEKKIKRFR+F LW VDDH +ESPLG++
Sbjct: 298 PTGGLYFKNESWVQETKGMHVIIHNNYITGFEKKIKRFREFNLWYVDDHTLESPLGRI 355


>gi|296085981|emb|CBI31422.3| unnamed protein product [Vitis vinifera]
          Length = 1331

 Score =  573 bits (1478), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 276/345 (80%), Positives = 305/345 (88%), Gaps = 3/345 (0%)

Query: 14   LPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSS 73
            L + YP+S R+S  +      + V  RT LLVLL+L+VVLGV+LP    P  +F  +  S
Sbjct: 990  LSDSYPISARSSPNWGRS---ISVFGRTGLLVLLTLVVVLGVVLPGMRAPDGLFGGSKVS 1046

Query: 74   LAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYAT 133
            ++KWR+YTL +A  F AKNGT+IVCAVSQPYLPFLNNWLISISRQKHQD+VLVIAEDYAT
Sbjct: 1047 VSKWREYTLEEAVPFAAKNGTLIVCAVSQPYLPFLNNWLISISRQKHQDKVLVIAEDYAT 1106

Query: 134  LYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
            LY VN RWPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL+ILELGYNVMYNDVDM
Sbjct: 1107 LYAVNDRWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLYILELGYNVMYNDVDM 1166

Query: 194  VWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKL 253
            VWL DPFPYLQGDHDVYFTDDM AVKPL+HSHDLPPPGKKGRTYICSCMI++RPTDGAKL
Sbjct: 1167 VWLADPFPYLQGDHDVYFTDDMTAVKPLNHSHDLPPPGKKGRTYICSCMIFMRPTDGAKL 1226

Query: 254  VMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWV 313
            VMK WIEELQA+PWS AKK+NDQPAFNWALN+TA QVDLYLLPQ AFPTGGLYFKNQTWV
Sbjct: 1227 VMKDWIEELQAQPWSNAKKSNDQPAFNWALNRTAAQVDLYLLPQVAFPTGGLYFKNQTWV 1286

Query: 314  EETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
            +ETKG HVIIHNNYITGFEKKIKRFRDFGLWLVDD+A+ESPLG++
Sbjct: 1287 QETKGLHVIIHNNYITGFEKKIKRFRDFGLWLVDDYALESPLGRI 1331


>gi|147777556|emb|CAN69309.1| hypothetical protein VITISV_003084 [Vitis vinifera]
          Length = 309

 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 261/312 (83%), Positives = 291/312 (93%), Gaps = 6/312 (1%)

Query: 50  LVVLGVILPWTGTPGFMFPN---ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP 106
           +VV+GV+LP   T     P+   + +S++KWRDYTL+QAA FVAKNGT+IVCAVSQPYLP
Sbjct: 1   MVVVGVLLP---TMKMRMPDGLLSRASVSKWRDYTLAQAAEFVAKNGTVIVCAVSQPYLP 57

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FLNNWLISI+RQKHQD+VLVIAEDYATLY VN +WPGHAVLVPPAPD+QTAHKFGS GFF
Sbjct: 58  FLNNWLISIARQKHQDKVLVIAEDYATLYTVNQKWPGHAVLVPPAPDAQTAHKFGSMGFF 117

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
           NFTSRRP HLL+ILELGYNVMYNDVDMVWL DPFPYLQG HDVYFTDDMAAVKPL+HSHD
Sbjct: 118 NFTSRRPRHLLNILELGYNVMYNDVDMVWLADPFPYLQGKHDVYFTDDMAAVKPLNHSHD 177

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKT 286
           LPPPGKKGRTYICSCMI++RPT+GAKLVMKKWIEELQA+PWS+AKK+NDQPAFNWALN+T
Sbjct: 178 LPPPGKKGRTYICSCMIFMRPTNGAKLVMKKWIEELQAQPWSRAKKSNDQPAFNWALNRT 237

Query: 287 AGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLV 346
           AG+VDLYLLPQ+AFPTGGLYFKN+TWV+ETKG +VIIHNNYITGFEKKIKRF+D+GLWLV
Sbjct: 238 AGEVDLYLLPQAAFPTGGLYFKNKTWVQETKGMNVIIHNNYITGFEKKIKRFQDYGLWLV 297

Query: 347 DDHAVESPLGKL 358
           DDHA ESPLGKL
Sbjct: 298 DDHAQESPLGKL 309


>gi|297810037|ref|XP_002872902.1| hypothetical protein ARALYDRAFT_912111 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318739|gb|EFH49161.1| hypothetical protein ARALYDRAFT_912111 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 356

 Score =  553 bits (1425), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 279/357 (78%), Positives = 310/357 (86%), Gaps = 7/357 (1%)

Query: 3   QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
           ++LHQRP+ NP       S   S +    RP+ L L+R  LL+LL+LLV+LGV LPW G+
Sbjct: 5   KFLHQRPIQNPF-----SSSPLSNSSISNRPISL-LSRNGLLILLALLVILGVFLPWAGS 58

Query: 63  PGFMFPNA-TSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
           P F FPN  +SS +KWRDY+L QA  FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 59  PLFPFPNRLSSSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 118

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 119 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 178

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 179 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 238

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFP 301
           MI+LRPT+GAKL+MKKWIEELQ +PWS+AKKANDQP FNWALNKTA QVDLY+L Q+AFP
Sbjct: 239 MIFLRPTNGAKLLMKKWIEELQTQPWSRAKKANDQPGFNWALNKTAHQVDLYMLSQAAFP 298

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           TGGLYFKN+TWV+ETKGKHVIIHNNYI GFEKKIKRFRDF LWLVDDHA ESPLGK+
Sbjct: 299 TGGLYFKNKTWVKETKGKHVIIHNNYIVGFEKKIKRFRDFNLWLVDDHASESPLGKV 355


>gi|30678839|ref|NP_849279.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
           thaliana]
 gi|7267619|emb|CAB80931.1| hypothetical protein [Arabidopsis thaliana]
 gi|193885155|gb|ACF28391.1| At4g01220 [Arabidopsis thaliana]
 gi|332656595|gb|AEE81995.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
           thaliana]
          Length = 360

 Score =  552 bits (1423), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 278/357 (77%), Positives = 310/357 (86%), Gaps = 3/357 (0%)

Query: 3   QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
           ++LHQRP+ NP  NP+  S   S +    RP + +L+R  LL+LL+LLV+LGV LPW G+
Sbjct: 5   KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 62

Query: 63  PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
           P F  PN  S S +KWRDY+L QA  FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 63  PLFPSPNKLSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 122

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 123 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 182

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 183 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 242

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFP 301
           MI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA QVD+YLL Q+AFP
Sbjct: 243 MIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTANQVDMYLLSQAAFP 302

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           TGGLYFKN+TWV+ETKGKH IIHNNYI GFEKKIKRFRDF LWLVDDHA ESPLGKL
Sbjct: 303 TGGLYFKNKTWVKETKGKHAIIHNNYIVGFEKKIKRFRDFNLWLVDDHASESPLGKL 359


>gi|74027164|gb|AAZ94713.1| putative alpha 1,3-xylosyltransferase [Linum usitatissimum]
          Length = 357

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 280/359 (77%), Positives = 307/359 (85%), Gaps = 3/359 (0%)

Query: 1   MSQWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWT 60
           M+  LHQRP      + + L      +   QRP+ L  +R  LL LL+L+++LGVI+PW 
Sbjct: 1   MTTSLHQRPFQPAFSHRF-LPASLHYSNLPQRPISL-FSRAGLLSLLALMLILGVIVPWA 58

Query: 61  GTPGFMFP-NATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQK 119
             PG +F  N  SS+ +WR YTL QAASFVAKNGT+IVCAVSQ YLPFLNNWLISISRQK
Sbjct: 59  DMPGGIFSVNKASSVNQWRHYTLPQAASFVAKNGTLIVCAVSQAYLPFLNNWLISISRQK 118

Query: 120 HQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHI 179
            QD VLVIAEDYATL KVN RWPGHAVL+PPA DSQ AHKFGSQGFFNFT+RRP HLL+I
Sbjct: 119 RQDMVLVIAEDYATLDKVNERWPGHAVLIPPALDSQAAHKFGSQGFFNFTARRPQHLLNI 178

Query: 180 LELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 239
           LELGY+VMYNDVDMVWL DPF YL+G HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC
Sbjct: 179 LELGYSVMYNDVDMVWLGDPFTYLRGLHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 238

Query: 240 SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSA 299
           SCMI+LRPTDGAKLVMKKWIEELQA+PWS+AKKANDQPAFNWAL KT GQVD+YLLPQSA
Sbjct: 239 SCMIFLRPTDGAKLVMKKWIEELQAQPWSRAKKANDQPAFNWALMKTTGQVDVYLLPQSA 298

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           FPTGGLYFKN+TWV+ TKGKHVIIHNNYI GFEKKIKRFRD+GLWLVDDHA ESPLGKL
Sbjct: 299 FPTGGLYFKNKTWVQGTKGKHVIIHNNYIVGFEKKIKRFRDYGLWLVDDHAKESPLGKL 357


>gi|225448972|ref|XP_002273155.1| PREDICTED: UDP-galactose:fucoside alpha-3-galactosyltransferase
           [Vitis vinifera]
          Length = 360

 Score =  548 bits (1413), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 275/343 (80%), Positives = 304/343 (88%), Gaps = 3/343 (0%)

Query: 16  NPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLA 75
           + YP+S R+S  +      + V  RT LLVLL+L+VVLGV+LP    P  +F  +  S++
Sbjct: 21  DSYPISARSSPNWGRS---ISVFGRTGLLVLLTLVVVLGVVLPGMRAPDGLFGGSKVSVS 77

Query: 76  KWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLY 135
           KWR+YTL +A  F AKNGT+IVCAVSQPYLPFLNNWLISISRQKHQD+VLVIAEDYATLY
Sbjct: 78  KWREYTLEEAVPFAAKNGTLIVCAVSQPYLPFLNNWLISISRQKHQDKVLVIAEDYATLY 137

Query: 136 KVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
            VN RWPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL+ILELGYNVMYNDVDMVW
Sbjct: 138 AVNDRWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLYILELGYNVMYNDVDMVW 197

Query: 196 LKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVM 255
           L DPFPYLQGDHDVYFTDDM AVKPL+HSHDLPPPGKKGRTYICSCMI++RPTDGAKLVM
Sbjct: 198 LADPFPYLQGDHDVYFTDDMTAVKPLNHSHDLPPPGKKGRTYICSCMIFMRPTDGAKLVM 257

Query: 256 KKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEE 315
           K WIEELQA+PWS AKK+NDQPAFNWALN+TA QVDLYLLPQ AFPTGGLYFKNQTWV+E
Sbjct: 258 KDWIEELQAQPWSNAKKSNDQPAFNWALNRTAAQVDLYLLPQVAFPTGGLYFKNQTWVQE 317

Query: 316 TKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           TKG HVIIHNNYITGFEKKIKRFRDFGLWLVDD+A+ESPLG++
Sbjct: 318 TKGLHVIIHNNYITGFEKKIKRFRDFGLWLVDDYALESPLGRI 360


>gi|2191131|gb|AAB61018.1| A_IG002N01.8 gene product [Arabidopsis thaliana]
          Length = 444

 Score =  545 bits (1405), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 279/370 (75%), Positives = 310/370 (83%), Gaps = 16/370 (4%)

Query: 3   QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
           ++LHQRP+ NP  NP+  S   S +    RP + +L+R  LL+LL+LLV+LGV LPW G+
Sbjct: 76  KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 133

Query: 63  PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
           P F  PN  S S +KWRDY+L QA  FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 134 PLFPSPNKLSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 193

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 194 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 253

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAV-------------KPLDHSHDLP 228
           LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM AV             KPLDHSHDLP
Sbjct: 254 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAVCIFFSQTSSSLMIKPLDHSHDLP 313

Query: 229 PPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG 288
           PPGKKGRTYICSCMI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA 
Sbjct: 314 PPGKKGRTYICSCMIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTAN 373

Query: 289 QVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
           QVD+YLL Q+AFPTGGLYFKN+TWV+ETKGKH IIHNNYI GFEKKIKRFRDF LWLVDD
Sbjct: 374 QVDMYLLSQAAFPTGGLYFKNKTWVKETKGKHAIIHNNYIVGFEKKIKRFRDFNLWLVDD 433

Query: 349 HAVESPLGKL 358
           HA ESPLGKL
Sbjct: 434 HASESPLGKL 443


>gi|356524992|ref|XP_003531111.1| PREDICTED: UDP-galactose:fucoside
           alpha-3-galactosyltransferase-like [Glycine max]
          Length = 356

 Score =  528 bits (1361), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 259/360 (71%), Positives = 296/360 (82%), Gaps = 6/360 (1%)

Query: 1   MSQWLHQRP-LHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPW 59
           MS +LHQR  L NP  NP+P    ++ +    +  L ++  TTLL L+SL+V+LGV  PW
Sbjct: 1   MSSFLHQRSSLQNPFSNPFP---ASTPSSSNSKKSLSIMGPTTLLALISLIVILGVFCPW 57

Query: 60  TGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGT-IIVCAVSQPYLPFLNNWLISISRQ 118
            G P   FP   +  +KW  YTL QA SFVAKNG+ +IVC VSQPYLPFLNNWLISIS Q
Sbjct: 58  VGFPQ-GFPFTPTPTSKWAHYTLEQALSFVAKNGSSVIVCIVSQPYLPFLNNWLISISMQ 116

Query: 119 KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLH 178
           K QD VLVIAEDYA+L +VN  WPGHAVL+PP  D++ AHKFGSQGFFNFT+RRP HLL 
Sbjct: 117 KRQDMVLVIAEDYASLDRVNLLWPGHAVLIPPVLDAEAAHKFGSQGFFNFTARRPSHLLK 176

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           ILELGY+VMYNDVDMVWL DPFPYLQG+HDVYFTDDM A+KPL+HSHDLPPPGKKGR YI
Sbjct: 177 ILELGYSVMYNDVDMVWLADPFPYLQGNHDVYFTDDMTAIKPLNHSHDLPPPGKKGRPYI 236

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQS 298
           CSCMI+LRPT+GAKL+++KWIEELQ +PWSK  K+NDQPAFNWAL K A +VDLYLLPQ+
Sbjct: 237 CSCMIFLRPTNGAKLILRKWIEELQIQPWSKTVKSNDQPAFNWALMKNAKEVDLYLLPQA 296

Query: 299 AFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           AFPTGGLYFKN+ WV+ETKG HVIIHNNYI GFEKKIKRFRD+GLWLVDDHA ESPLG L
Sbjct: 297 AFPTGGLYFKNKAWVKETKGMHVIIHNNYIVGFEKKIKRFRDYGLWLVDDHAHESPLGGL 356


>gi|358348526|ref|XP_003638296.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
           truncatula]
 gi|355504231|gb|AES85434.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
           truncatula]
          Length = 357

 Score =  518 bits (1333), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 257/360 (71%), Positives = 294/360 (81%), Gaps = 5/360 (1%)

Query: 1   MSQWLHQRPLHNPL--PNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILP 58
           MS +LHQR LHNP    NP+P S  +S      +  L + +   LL LLSL++VLGV  P
Sbjct: 1   MSGFLHQRSLHNPFQFSNPFPSSQPSSTN---PKKPLSIFSPAILLSLLSLIIVLGVFSP 57

Query: 59  WTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQ 118
           W G P  +   +   ++KW  YTL +A +FVAKNGT+IVC VSQPYLPFLNNWLISI+  
Sbjct: 58  WVGMPQKLIFTSNPEVSKWGRYTLDEALTFVAKNGTVIVCIVSQPYLPFLNNWLISIAMH 117

Query: 119 KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLH 178
           K  D VLVIAEDY +LYKVN  WPGHAVL+PP  D + +HKFGSQGFFNFT+RRP HLL 
Sbjct: 118 KRHDMVLVIAEDYPSLYKVNQLWPGHAVLIPPVLDLEASHKFGSQGFFNFTARRPSHLLK 177

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           ILELGY+VMYNDVDMVWL DPFPYLQG+HDVYFTDDM A+KPL+HSHDLPPPGKKGR YI
Sbjct: 178 ILELGYSVMYNDVDMVWLGDPFPYLQGNHDVYFTDDMTAIKPLNHSHDLPPPGKKGRPYI 237

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQS 298
           CSCMI+LRPTDGAKL++KKW+EELQ EPWS+ KK+NDQPAFNWAL K A  VD+YLLPQS
Sbjct: 238 CSCMIFLRPTDGAKLILKKWMEELQIEPWSRTKKSNDQPAFNWALMKNAKGVDMYLLPQS 297

Query: 299 AFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           AFPTGGLYFKN+TWV+ETKGKHVIIHNNYI GFEKKIKRFRD+GLWLVD+H  ESPLG+L
Sbjct: 298 AFPTGGLYFKNKTWVKETKGKHVIIHNNYIVGFEKKIKRFRDYGLWLVDEHKQESPLGRL 357


>gi|15234316|ref|NP_192086.1| rhamnogalacturonan xylosyltransferase 1 [Arabidopsis thaliana]
 gi|4558546|gb|AAD22639.1|AC007138_3 hypothetical protein [Arabidopsis thaliana]
 gi|3859601|gb|AAC72867.1| T15B16.8 gene product [Arabidopsis thaliana]
 gi|7268220|emb|CAB77747.1| hypothetical protein [Arabidopsis thaliana]
 gi|60547843|gb|AAX23885.1| hypothetical protein At4g01770 [Arabidopsis thaliana]
 gi|117018033|tpg|DAA05811.1| TPA_exp: 1,3-alpha-D-xylosyltransferase [Arabidopsis thaliana]
 gi|332656675|gb|AEE82075.1| rhamnogalacturonan xylosyltransferase 1 [Arabidopsis thaliana]
          Length = 361

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 235/309 (76%), Positives = 269/309 (87%), Gaps = 8/309 (2%)

Query: 58  PWTGTPGFMFPN------ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNW 111
           PW G+P F+FPN      + S  +KWRDYTL+QAA FVAKNGT+IVCAVS P+LPFLNNW
Sbjct: 52  PWPGSPLFLFPNRLSSSLSPSPQSKWRDYTLAQAARFVAKNGTVIVCAVSSPFLPFLNNW 111

Query: 112 LISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSR 171
           LIS+SRQKHQD+VLVIAEDY TLYKVN +WPGHAVL+PPA DS+TA  FGSQGFFNFT+R
Sbjct: 112 LISVSRQKHQDKVLVIAEDYITLYKVNEKWPGHAVLIPPALDSKTAFSFGSQGFFNFTAR 171

Query: 172 RPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPG 231
           RP HLL ILELGYNVMYNDVDMVWL+DPF YL+G HD YFTDDM  +KPL+HSHDLP P 
Sbjct: 172 RPQHLLQILELGYNVMYNDVDMVWLQDPFLYLEGSHDAYFTDDMPQIKPLNHSHDLPHPD 231

Query: 232 KKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALNKTAGQ 289
           + G TYICSCMIYLRPT+GAKL+MKKW EELQ++ WS++   KANDQPAFN ALNKTA Q
Sbjct: 232 RNGETYICSCMIYLRPTNGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALNKTAHQ 291

Query: 290 VDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDH 349
           VDLYLL Q AFPTGGLYFKN+ WV+ETKGKHVI+HNNYI G+++K+KRF+D+GLWLVDDH
Sbjct: 292 VDLYLLSQVAFPTGGLYFKNEAWVQETKGKHVIVHNNYIIGYDRKMKRFQDYGLWLVDDH 351

Query: 350 AVESPLGKL 358
           A+ESPLGKL
Sbjct: 352 ALESPLGKL 360


>gi|297814153|ref|XP_002874960.1| hypothetical protein ARALYDRAFT_490401 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320797|gb|EFH51219.1| hypothetical protein ARALYDRAFT_490401 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 367

 Score =  494 bits (1271), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 232/314 (73%), Positives = 265/314 (84%), Gaps = 13/314 (4%)

Query: 58  PWTGTPGFMFPNATSS-----------LAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP 106
           PW G+P  + PN  SS            ++WRDYTL+QAA FVAKNGT+IVCAVS P+LP
Sbjct: 53  PWPGSPLLLLPNKVSSPSNATSLSSQAKSEWRDYTLAQAAKFVAKNGTVIVCAVSSPFLP 112

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FLNNWLIS+SRQKHQ++VLVIAEDY TLYKVN +WPGHAVL+PPA DS+TA+ FGSQGFF
Sbjct: 113 FLNNWLISVSRQKHQEKVLVIAEDYITLYKVNEKWPGHAVLIPPALDSRTAYSFGSQGFF 172

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
           NFT+RRP HLL ILELGYNVMYNDVDMVWL+DPF YL+G HD YFTDDM  +KPL+HSHD
Sbjct: 173 NFTARRPQHLLQILELGYNVMYNDVDMVWLQDPFQYLEGSHDAYFTDDMPQIKPLNHSHD 232

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALN 284
           LP P + G TYICSCMIYLRPTDGAKL+MKKW EELQ++ WS++   KANDQPAFN ALN
Sbjct: 233 LPDPDQNGETYICSCMIYLRPTDGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALN 292

Query: 285 KTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLW 344
           KTA QVDLYLL Q AFPTGGLYF N+ WV+ETKGKHVIIHNNYI G++ K++RF D+GLW
Sbjct: 293 KTAHQVDLYLLSQVAFPTGGLYFSNEAWVKETKGKHVIIHNNYIIGYDNKMRRFHDYGLW 352

Query: 345 LVDDHAVESPLGKL 358
           LVDDHA+ESPLGKL
Sbjct: 353 LVDDHALESPLGKL 366


>gi|49617789|gb|AAT67590.1| hypothetical protein At4G01770 [Arabidopsis thaliana]
          Length = 361

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 234/309 (75%), Positives = 267/309 (86%), Gaps = 8/309 (2%)

Query: 58  PWTGTPGFMFPN------ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNW 111
           PW G+P F+FPN      + S  +KWRDYTL+QAA FVAKNGT+IVCAVS P+LPFLNNW
Sbjct: 52  PWPGSPLFLFPNRLSSSLSPSPQSKWRDYTLAQAARFVAKNGTVIVCAVSSPFLPFLNNW 111

Query: 112 LISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSR 171
           LIS+SRQKHQD+VLVIAEDY TLYKVN +WPGHAVL+PPA DS+TA  FGSQGFFNFT+R
Sbjct: 112 LISVSRQKHQDKVLVIAEDYITLYKVNEKWPGHAVLIPPALDSKTAFSFGSQGFFNFTAR 171

Query: 172 RPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPG 231
           RP HLL ILELGYNVMYNDVDMVWL+DPF YL+G HD YFTDDM  +KPL+HSHDLP P 
Sbjct: 172 RPQHLLQILELGYNVMYNDVDMVWLQDPFLYLEGSHDAYFTDDMPQIKPLNHSHDLPHPD 231

Query: 232 KKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALNKTAGQ 289
           + G TYICSCMIYLRPT+GAKL+MKKW EELQ++ WS++   KANDQPAFN ALNKTA Q
Sbjct: 232 RNGETYICSCMIYLRPTNGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALNKTAHQ 291

Query: 290 VDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDH 349
           VDLYLL Q AFPTGGLYFKN+ WV+ETKGK VI+HNNYI G+++K+KRF+D+GLWLVDDH
Sbjct: 292 VDLYLLSQVAFPTGGLYFKNEAWVQETKGKXVIVHNNYIIGYDRKMKRFQDYGLWLVDDH 351

Query: 350 AVESPLGKL 358
           A ESPLGKL
Sbjct: 352 APESPLGKL 360


>gi|15223560|ref|NP_176048.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
           thaliana]
 gi|9954757|gb|AAG09108.1|AC009323_19 Hypothetical protein [Arabidopsis thaliana]
 gi|26449884|dbj|BAC42064.1| unknown protein [Arabidopsis thaliana]
 gi|28827600|gb|AAO50644.1| unknown protein [Arabidopsis thaliana]
 gi|332195285|gb|AEE33406.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
           thaliana]
          Length = 383

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 231/316 (73%), Positives = 268/316 (84%), Gaps = 11/316 (3%)

Query: 54  GVILPWTGTPGFMFPNATSSLA---------KWRDYTLSQAASFVAKNGTIIVCAVSQPY 104
           GV LP T +  FMFPN TSS            WRDY+L+QA  FVAKN T+IVCAVS P+
Sbjct: 35  GVFLPLTKSSLFMFPNTTSSSLSPSSSLSVSDWRDYSLAQAVKFVAKNETVIVCAVSYPF 94

Query: 105 LPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQG 164
           LPFLNNWLISISRQKHQ++VLVIAEDYATLYKVN +WPGHAVL+PPA D Q+AHKFGSQG
Sbjct: 95  LPFLNNWLISISRQKHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDPQSAHKFGSQG 154

Query: 165 FFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHS 224
           FFN TSRRP HLL+ILELGYNVMYNDVDMVWL+DPF YLQG +D YF DDM A+KPL+HS
Sbjct: 155 FFNLTSRRPQHLLNILELGYNVMYNDVDMVWLQDPFDYLQGSYDAYFMDDMIAIKPLNHS 214

Query: 225 HDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW--SKAKKANDQPAFNWA 282
           HDLPP  + G TY+CSCMI+LR TDG KL+MK W+EE+QA+PW  ++AKK +DQPAFN A
Sbjct: 215 HDLPPLSRSGVTYVCSCMIFLRSTDGGKLLMKTWVEEIQAQPWNNTQAKKPHDQPAFNRA 274

Query: 283 LNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFG 342
           L+KTA QV +YLLPQSAFP+GGLYF+N+TWV ET+GKHVI+HNNYI G++KK+KRF+DF 
Sbjct: 275 LHKTANQVKVYLLPQSAFPSGGLYFRNETWVNETRGKHVIVHNNYIIGYDKKMKRFQDFS 334

Query: 343 LWLVDDHAVESPLGKL 358
           LWLVDDHA+ESPLGKL
Sbjct: 335 LWLVDDHALESPLGKL 350


>gi|60651751|gb|AAX32892.1| putative xylosyltransferase [Arabidopsis thaliana]
          Length = 367

 Score =  489 bits (1260), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 241/340 (70%), Positives = 285/340 (83%), Gaps = 13/340 (3%)

Query: 32  RPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSS-----------LAKWRDY 80
           RP+ L+     LLVLL+L ++LGV LPW G+P  +FPN  SS            ++WR+Y
Sbjct: 27  RPIFLLSRNGLLLVLLALFLLLGVFLPWPGSPLLLFPNKVSSPSYASCLSPHAKSEWRNY 86

Query: 81  TLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGR 140
           TL+QAA FVA NGT+IVCAVS P+LPFLNNWLIS+SRQKHQ++VLVIAEDY TLYKVN +
Sbjct: 87  TLAQAAKFVATNGTVIVCAVSSPFLPFLNNWLISVSRQKHQEKVLVIAEDYITLYKVNEK 146

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
           WPGHAVL+PPA DS+TA+ FGSQGFFNFT+RRP HLL ILELGYNVMYNDVDMVWL+DPF
Sbjct: 147 WPGHAVLIPPALDSKTAYSFGSQGFFNFTARRPQHLLQILELGYNVMYNDVDMVWLQDPF 206

Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
            YL+G HD YFTDDM  +KPL+HSHDLP P + G TYICSCMIYLRPT+GAKL+MKKW E
Sbjct: 207 QYLEGSHDAYFTDDMPQIKPLNHSHDLPAPDQNGETYICSCMIYLRPTNGAKLLMKKWSE 266

Query: 261 ELQAEPWSKAK--KANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           ELQ++ WS++   KANDQPAFN+ALNKTA QVDLYLL Q AFPTGGLYF +  WV+ETKG
Sbjct: 267 ELQSQAWSESIRFKANDQPAFNFALNKTAHQVDLYLLSQVAFPTGGLYFNDAAWVKETKG 326

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           KHVI+HNNYI G+++K++RF+D+GLWLVDDHA+ESPLGKL
Sbjct: 327 KHVIVHNNYIIGYDRKMRRFQDYGLWLVDDHALESPLGKL 366


>gi|297853550|ref|XP_002894656.1| hypothetical protein ARALYDRAFT_474813 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297340498|gb|EFH70915.1| hypothetical protein ARALYDRAFT_474813 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 380

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 231/320 (72%), Positives = 267/320 (83%), Gaps = 11/320 (3%)

Query: 50  LVVLGVILPWTGTPGFMFPNATSSLAKWR---------DYTLSQAASFVAKNGTIIVCAV 100
            V+LGV LP TG+  F+ P+ TSS              DY+L+QA  FVAKN T+IVCAV
Sbjct: 31  FVILGVFLPLTGSSLFLSPDTTSSSLSPSSSLSVSDWSDYSLAQAVKFVAKNETVIVCAV 90

Query: 101 SQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKF 160
           S P+LPFLNNWLISISRQ HQ++VLVIAEDYATLYKVN +WPGHAVL+PPA D Q AHKF
Sbjct: 91  SYPFLPFLNNWLISISRQNHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDPQAAHKF 150

Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           GSQGFFN TSRRP HLL+ILELGYNVMYNDVDMVWL+DPF YLQG HD YF DDM A+KP
Sbjct: 151 GSQGFFNLTSRRPQHLLNILELGYNVMYNDVDMVWLQDPFQYLQGSHDAYFMDDMIAIKP 210

Query: 221 LDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW--SKAKKANDQPA 278
           L+HSH LPP  + G TY+CSCMI+LR TDGAKL+MK W+EE+QA+PW  ++AKK +DQPA
Sbjct: 211 LNHSHGLPPRSRSGVTYVCSCMIFLRSTDGAKLLMKTWVEEIQAQPWNNTQAKKPHDQPA 270

Query: 279 FNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRF 338
           FN AL+KTA QVD+YLLPQSAFP+GGLYFKN+TWV ETKGKHVI+HNNYI G+E+K+KRF
Sbjct: 271 FNRALHKTANQVDVYLLPQSAFPSGGLYFKNETWVNETKGKHVIVHNNYIIGYERKMKRF 330

Query: 339 RDFGLWLVDDHAVESPLGKL 358
           +DF LWLVDDHA+ESPLG L
Sbjct: 331 QDFNLWLVDDHALESPLGIL 350


>gi|15234287|ref|NP_192084.1| rhamnogalacturonan xylosyltransferase 2 [Arabidopsis thaliana]
 gi|4558544|gb|AAD22637.1|AC007138_1 hypothetical protein [Arabidopsis thaliana]
 gi|3859593|gb|AAC72859.1| T15B16.9 gene product [Arabidopsis thaliana]
 gi|7268218|emb|CAB77745.1| hypothetical protein [Arabidopsis thaliana]
 gi|17473527|gb|AAL38247.1| unknown protein [Arabidopsis thaliana]
 gi|27311905|gb|AAO00918.1| unknown protein [Arabidopsis thaliana]
 gi|117018082|tpg|DAA05812.1| TPA_exp: 1,3-alpha-D-xylosyltransferase [Arabidopsis thaliana]
 gi|332656673|gb|AEE82073.1| rhamnogalacturonan xylosyltransferase 2 [Arabidopsis thaliana]
          Length = 367

 Score =  486 bits (1250), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 240/340 (70%), Positives = 284/340 (83%), Gaps = 13/340 (3%)

Query: 32  RPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATS-----------SLAKWRDY 80
           RP+ L+     LLVLL+L ++LGV LPW G+P  +FPN  S           + ++WR+Y
Sbjct: 27  RPIFLLSRNGLLLVLLALFLLLGVFLPWPGSPLLLFPNKVSSPSYASSLSPHAKSEWRNY 86

Query: 81  TLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGR 140
           TL+QAA FVA NGT+IVCAVS P+LPFLNNWLIS+SRQKHQ++VLVIAEDY TLYKVN +
Sbjct: 87  TLAQAAKFVATNGTVIVCAVSSPFLPFLNNWLISVSRQKHQEKVLVIAEDYITLYKVNEK 146

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
           WPGHAVL+PPA DS+TA+ FGSQGFFNFT+RRP HLL ILELGYNVMYNDVDMVWL+DPF
Sbjct: 147 WPGHAVLIPPALDSKTAYSFGSQGFFNFTARRPQHLLQILELGYNVMYNDVDMVWLQDPF 206

Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
            YL+G HD YFTDDM  +KPL+HSHDLP P + G TYICSCMIYLRPT+GAKL+MKKW E
Sbjct: 207 QYLEGSHDAYFTDDMPQIKPLNHSHDLPAPDQNGETYICSCMIYLRPTNGAKLLMKKWSE 266

Query: 261 ELQAEPWSKAK--KANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           ELQ++ WS++   KANDQPAFN ALNKTA QVDLYLL Q AFPTGGLYF +  WV+ETKG
Sbjct: 267 ELQSQAWSESIRFKANDQPAFNLALNKTAHQVDLYLLSQVAFPTGGLYFNDAAWVKETKG 326

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           KHVI+HNNYI G+++K++RF+D+GLWLVDDHA+ESPLGKL
Sbjct: 327 KHVIVHNNYIIGYDRKMRRFQDYGLWLVDDHALESPLGKL 366


>gi|242087783|ref|XP_002439724.1| hypothetical protein SORBIDRAFT_09g019070 [Sorghum bicolor]
 gi|241945009|gb|EES18154.1| hypothetical protein SORBIDRAFT_09g019070 [Sorghum bicolor]
          Length = 352

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 229/293 (78%), Positives = 258/293 (88%), Gaps = 2/293 (0%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL 125
           P A    + WR YTL +AA+F A   NGT+++ AVS PYLPFL+NWLIS+ R    DQVL
Sbjct: 60  PAARPPSSPWRSYTLQEAAAFAAAAGNGTVLLAAVSGPYLPFLSNWLISVRRAGRADQVL 119

Query: 126 VIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN 185
           VIAEDY TL ++N  WPGHAVLVPPAPD+QTAHKFGSQGFFNFTSRRP HLL ILELGY+
Sbjct: 120 VIAEDYETLDRINAAWPGHAVLVPPAPDAQTAHKFGSQGFFNFTSRRPRHLLQILELGYS 179

Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL 245
           VMYNDVDMVWL DPFPYL  DHDVYF DDM  VKPLDHSH+LPPPGKKGRTYICSCMI+L
Sbjct: 180 VMYNDVDMVWLADPFPYLVEDHDVYFMDDMTPVKPLDHSHELPPPGKKGRTYICSCMIFL 239

Query: 246 RPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGL 305
           RPT+GAKL+++KWIEEL+ +PWSK +KANDQPAFNWALNKTAGQVD+YLLPQSAFPTGGL
Sbjct: 240 RPTEGAKLLLRKWIEELKEQPWSKQRKANDQPAFNWALNKTAGQVDVYLLPQSAFPTGGL 299

Query: 306 YFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           YFKN+TWV++TKGKHVIIHNNYITGFEKKIKRFRD GLWLVD+H+ ESPLG++
Sbjct: 300 YFKNKTWVKQTKGKHVIIHNNYITGFEKKIKRFRDHGLWLVDEHSDESPLGRI 352


>gi|115463643|ref|NP_001055421.1| Os05g0386900 [Oryza sativa Japonica Group]
 gi|54287590|gb|AAV31334.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578972|dbj|BAF17335.1| Os05g0386900 [Oryza sativa Japonica Group]
 gi|215701069|dbj|BAG92493.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215707102|dbj|BAG93562.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631445|gb|EEE63577.1| hypothetical protein OsJ_18394 [Oryza sativa Japonica Group]
          Length = 352

 Score =  479 bits (1234), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 226/292 (77%), Positives = 261/292 (89%), Gaps = 2/292 (0%)

Query: 69  NATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV 126
           ++++  + WR YTL +AA+F A+  NGTI++ AVS PYLPFL+NWLI++ R    DQVLV
Sbjct: 61  HSSAPASPWRGYTLQEAAAFAARAGNGTIVLAAVSGPYLPFLSNWLITVRRAGRADQVLV 120

Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
           +AEDY TL ++N  WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+V
Sbjct: 121 VAEDYDTLERINAAWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLQILELGYSV 180

Query: 187 MYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLR 246
           MYNDVDMVWL DPFPY+ GDHDVYF DDM  VKPLDHSH+LPPPGKKGRTYICSCMI+LR
Sbjct: 181 MYNDVDMVWLADPFPYIVGDHDVYFMDDMTPVKPLDHSHELPPPGKKGRTYICSCMIFLR 240

Query: 247 PTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLY 306
           PT+GAKL+++KWIEEL+ +PWSK +KANDQPAFNWALNKTAGQVD+YLLPQSAFPTGGLY
Sbjct: 241 PTEGAKLLLRKWIEELKEQPWSKKQKANDQPAFNWALNKTAGQVDVYLLPQSAFPTGGLY 300

Query: 307 FKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           FKN+TWV+ETKGKHVIIHNNYITGFEKKIKRFRD GLWLVD+H+ ESPLG++
Sbjct: 301 FKNKTWVKETKGKHVIIHNNYITGFEKKIKRFRDHGLWLVDEHSDESPLGRI 352


>gi|125552172|gb|EAY97881.1| hypothetical protein OsI_19800 [Oryza sativa Indica Group]
          Length = 352

 Score =  479 bits (1234), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 226/292 (77%), Positives = 261/292 (89%), Gaps = 2/292 (0%)

Query: 69  NATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV 126
           ++++  + WR YTL +AA+F A+  NGTI++ AVS PYLPFL+NWLI++ R    DQVLV
Sbjct: 61  HSSAPASPWRGYTLQEAAAFAARAGNGTIVLAAVSGPYLPFLSNWLITVRRAGRADQVLV 120

Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
           +AEDY TL ++N  WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+V
Sbjct: 121 VAEDYDTLERINAAWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLQILELGYSV 180

Query: 187 MYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLR 246
           MYNDVDMVWL DPFPY+ GDHDVYF DDM  VKPLDHSH+LPPPGKKGRTYICSCMI+LR
Sbjct: 181 MYNDVDMVWLADPFPYIVGDHDVYFMDDMTPVKPLDHSHELPPPGKKGRTYICSCMIFLR 240

Query: 247 PTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLY 306
           PT+GAKL+++KWIEEL+ +PWSK +KANDQPAFNWALNKTAGQVD+YLLPQSAFPTGGLY
Sbjct: 241 PTEGAKLLLRKWIEELKEQPWSKKQKANDQPAFNWALNKTAGQVDVYLLPQSAFPTGGLY 300

Query: 307 FKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           FKN+TWV+ETKGKHVIIHNNYITGFEKKIKRFRD GLWLVD+H+ ESPLG++
Sbjct: 301 FKNKTWVKETKGKHVIIHNNYITGFEKKIKRFRDHGLWLVDEHSDESPLGRI 352


>gi|413945194|gb|AFW77843.1| hypothetical protein ZEAMMB73_274644 [Zea mays]
          Length = 396

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 214/267 (80%), Positives = 243/267 (91%)

Query: 92  NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
           NGT+++ AVS PYLPFL+NWLIS+ R    DQVLVIAEDY TL ++N  WPGHAVLVPPA
Sbjct: 130 NGTVLLAAVSGPYLPFLSNWLISVRRAGRADQVLVIAEDYETLDRINAAWPGHAVLVPPA 189

Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
           PD+QTAHKFGSQGFFNFTSRRP HLL ILELGY+VMYNDVDMVWL DPF Y+  +HDVYF
Sbjct: 190 PDAQTAHKFGSQGFFNFTSRRPRHLLQILELGYSVMYNDVDMVWLADPFAYIVENHDVYF 249

Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
            DDMA VKPLDHSH+LPPPGKKGRTYICSCM++LRPT+GAKL+++KWIEEL+ +PWSK +
Sbjct: 250 MDDMAPVKPLDHSHELPPPGKKGRTYICSCMLFLRPTEGAKLLLRKWIEELKEQPWSKQR 309

Query: 272 KANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGF 331
           K+NDQPAFNWALNKTAGQVD+YLLPQSAFPTGGLYFKN+ WV+ETKGKHVIIHNNYITGF
Sbjct: 310 KSNDQPAFNWALNKTAGQVDVYLLPQSAFPTGGLYFKNKKWVKETKGKHVIIHNNYITGF 369

Query: 332 EKKIKRFRDFGLWLVDDHAVESPLGKL 358
           EKKIKRFRD GLWLVD+H+ ESPLG++
Sbjct: 370 EKKIKRFRDHGLWLVDEHSDESPLGRI 396


>gi|357133882|ref|XP_003568551.1| PREDICTED: uncharacterized protein LOC100839509 [Brachypodium
           distachyon]
          Length = 351

 Score =  466 bits (1199), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 224/293 (76%), Positives = 252/293 (86%), Gaps = 2/293 (0%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAK--NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL 125
           P  +S  + WR YTL  AA+F A   NGTI++ AVS PYLPFL+NWLIS+ R    DQVL
Sbjct: 59  PRHSSPASPWRAYTLQDAAAFAAAAGNGTIVLAAVSGPYLPFLSNWLISVRRAGRADQVL 118

Query: 126 VIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN 185
           VIAEDY TL ++N  WPGHAVL+PPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+
Sbjct: 119 VIAEDYETLERINAAWPGHAVLIPPAPDAQAAHKFGSQGFFNFTSRRPRHLLQILELGYS 178

Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL 245
           VMYNDVDMVWL DPFPYL G+HDVYF DDM  VK L+HSH+LPPPGKKGR YICSCMI+L
Sbjct: 179 VMYNDVDMVWLADPFPYLVGNHDVYFMDDMTDVKALNHSHELPPPGKKGRPYICSCMIFL 238

Query: 246 RPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGL 305
           +PT+GAKL+M+KWIEEL+ +PWSK  K+NDQPAFNWAL KT GQVD+YLLPQSAFPTGGL
Sbjct: 239 QPTEGAKLLMRKWIEELKEQPWSKKVKSNDQPAFNWALLKTVGQVDVYLLPQSAFPTGGL 298

Query: 306 YFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           YFKN+TWV+ETKGKHVIIHNNYITGFEKKIKRFRD  LWLVD+H+ ESPLGKL
Sbjct: 299 YFKNKTWVKETKGKHVIIHNNYITGFEKKIKRFRDHKLWLVDEHSDESPLGKL 351


>gi|226501196|ref|NP_001143398.1| uncharacterized protein LOC100276040 [Zea mays]
 gi|195619764|gb|ACG31712.1| hypothetical protein [Zea mays]
          Length = 353

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 213/267 (79%), Positives = 241/267 (90%)

Query: 92  NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
           NGT+++ AVS PYLPFL+NWLIS+ R    DQVLVIAEDY TL ++N  WPGHAVLVPPA
Sbjct: 87  NGTVLLAAVSGPYLPFLSNWLISVRRAGRADQVLVIAEDYETLDRINAAWPGHAVLVPPA 146

Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
           PD+QTAHKFGSQGFFNFTSRRP HLL ILELGY+VMYNDVDMVWL DPF Y+  +HDVYF
Sbjct: 147 PDAQTAHKFGSQGFFNFTSRRPRHLLQILELGYSVMYNDVDMVWLADPFAYIVENHDVYF 206

Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
            DDM  VKPLDHSH+LPPPGKKGRTYICSCM++LRPT+GAKL+++KWIEEL+ +PWSK  
Sbjct: 207 MDDMTPVKPLDHSHELPPPGKKGRTYICSCMLFLRPTEGAKLLLRKWIEELKEQPWSKQM 266

Query: 272 KANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGF 331
           K+NDQPAFNWALNKTAGQVD+YLLPQSAFPTGGLYFKN+ WV+ETKGKHVIIHNNYITGF
Sbjct: 267 KSNDQPAFNWALNKTAGQVDVYLLPQSAFPTGGLYFKNKKWVKETKGKHVIIHNNYITGF 326

Query: 332 EKKIKRFRDFGLWLVDDHAVESPLGKL 358
           EKKIKRFRD GLWLVD+H+ ESPLG++
Sbjct: 327 EKKIKRFRDHGLWLVDEHSDESPLGRI 353


>gi|326533736|dbj|BAK05399.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  464 bits (1193), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 221/293 (75%), Positives = 252/293 (86%), Gaps = 2/293 (0%)

Query: 68  PNATSSLAKWRDYTLSQAASFV--AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL 125
           P  ++  + WR YTL  AA+F   + NGTII+ AVS PYLPFL+NWLIS+ R    +QVL
Sbjct: 59  PRHSAPSSPWRSYTLQDAAAFAAASGNGTIILAAVSGPYLPFLSNWLISVRRAGRANQVL 118

Query: 126 VIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN 185
           VIAEDY TL ++N  WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+
Sbjct: 119 VIAEDYETLERINAAWPGHAVLVPPAPDAQAAHKFGSQGFFNFTSRRPRHLLQILELGYS 178

Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL 245
           VMYNDVDMVWL DPFPYL G+HDVYF DDM  VKPL+HSH LPPPGKKGR YICSCMI+L
Sbjct: 179 VMYNDVDMVWLADPFPYLVGNHDVYFMDDMTEVKPLNHSHALPPPGKKGRPYICSCMIFL 238

Query: 246 RPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGL 305
           +PT+G+KL+M+KWIEEL+ +PWSK  K+NDQPAFNWAL KT GQVD+YLLPQSAFPTGGL
Sbjct: 239 QPTEGSKLLMRKWIEELKEQPWSKKVKSNDQPAFNWALLKTTGQVDVYLLPQSAFPTGGL 298

Query: 306 YFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           YFKN+TWV+ETKGKHV+IHNNYITGFEKKIKRFRD  LWLVD+H+ ESPLGK+
Sbjct: 299 YFKNKTWVKETKGKHVLIHNNYITGFEKKIKRFRDHKLWLVDEHSDESPLGKI 351


>gi|224127804|ref|XP_002320168.1| predicted protein [Populus trichocarpa]
 gi|222860941|gb|EEE98483.1| predicted protein [Populus trichocarpa]
          Length = 284

 Score =  436 bits (1122), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 225/287 (78%), Positives = 250/287 (87%), Gaps = 3/287 (1%)

Query: 4   WLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTP 63
           +LHQRPLH  L +PYPLS          +  + + +RT L+ +LSLL++LGVILPWTGTP
Sbjct: 1   FLHQRPLHGTLSDPYPLS---PRQSSSSQRQISLFSRTGLIAILSLLLILGVILPWTGTP 57

Query: 64  GFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQ 123
                   +SLAKW+ YTL QA +FVAKN T+IVCAVSQPYLPFL+NWLISISRQKHQD+
Sbjct: 58  SIFSATKPASLAKWQQYTLPQAVAFVAKNKTVIVCAVSQPYLPFLSNWLISISRQKHQDK 117

Query: 124 VLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELG 183
           VLVIAEDYATLY VN RWPGHAVLVPPAPDSQ+AHKFGSQGFFNFTSRRP HLLHILELG
Sbjct: 118 VLVIAEDYATLYNVNERWPGHAVLVPPAPDSQSAHKFGSQGFFNFTSRRPRHLLHILELG 177

Query: 184 YNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI 243
           Y+VMYNDVDMVWL DPF YL+G+HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI
Sbjct: 178 YDVMYNDVDMVWLGDPFRYLEGNHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI 237

Query: 244 YLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV 290
           ++RPTDGAKLVMKKWIEEL+A+PWSK +KANDQPAFNWALNKTAGQV
Sbjct: 238 FMRPTDGAKLVMKKWIEELKAQPWSKTRKANDQPAFNWALNKTAGQV 284


>gi|302799954|ref|XP_002981735.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
           moellendorffii]
 gi|300150567|gb|EFJ17217.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
           moellendorffii]
          Length = 350

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 211/318 (66%), Positives = 250/318 (78%), Gaps = 3/318 (0%)

Query: 38  LNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIV 97
           ++R  L+ L+   V+L  IL      G+    +TSS ++W DYTL  AA+ VA++G I+V
Sbjct: 17  ISRRRLVTLVLGAVLLAYILVSLQDRGYR-AGSTSSSSRWADYTLEAAAAAVARDGRIVV 75

Query: 98  CAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTA 157
           CAVS PYL FL NWLISI+   H D+VLVIAEDY  L  VN  WPGHAVLVPPA    TA
Sbjct: 76  CAVSFPYLAFLVNWLISIASHGHHDKVLVIAEDYEMLNYVNEFWPGHAVLVPPALPLATA 135

Query: 158 HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAA 217
            +FGSQGFFNFTSRRP HLL +LELGY+V+YNDVDMVW+ DPFP   GDHD+YFTDDM A
Sbjct: 136 QRFGSQGFFNFTSRRPQHLLKLLELGYSVLYNDVDMVWMSDPFPLFTGDHDIYFTDDMTA 195

Query: 218 VKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQP 277
           +KPLDHSH LPPPGKKGRTYICSCMI+LRPT GAKLV++KWIEELQ +PWS   KANDQP
Sbjct: 196 IKPLDHSHSLPPPGKKGRTYICSCMIFLRPTPGAKLVLQKWIEELQQQPWSPKAKANDQP 255

Query: 278 AFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTW--VEETKGKHVIIHNNYITGFEKKI 335
           AFNWALNKT+ +VD+YLLPQ++FP+GGLYFKN+TW  + + + K  IIHNNYITGF+KKI
Sbjct: 256 AFNWALNKTSNKVDMYLLPQASFPSGGLYFKNETWRRLPQNQNKLTIIHNNYITGFDKKI 315

Query: 336 KRFRDFGLWLVDDHAVES 353
           KRF D GLWL+DD  V +
Sbjct: 316 KRFHDTGLWLIDDAHVNT 333


>gi|302768381|ref|XP_002967610.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
           moellendorffii]
 gi|300164348|gb|EFJ30957.1| Rhamnogalacturonan II Xylosyl transferase-like protein [Selaginella
           moellendorffii]
          Length = 350

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 211/318 (66%), Positives = 250/318 (78%), Gaps = 3/318 (0%)

Query: 38  LNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIV 97
           ++R  L+ L+   V+L  IL      G+    +TSS ++W DYTL  AA+ VA++G I+V
Sbjct: 17  ISRRRLVTLVLGAVLLAYILVSLQDRGYR-AGSTSSSSRWADYTLEAAAAAVARDGRIVV 75

Query: 98  CAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTA 157
           CAVS PYL FL NWLISI+   H D+VLVIAEDY  L  VN  WPGHAVLVPPA    TA
Sbjct: 76  CAVSFPYLAFLVNWLISIASHGHHDKVLVIAEDYEMLNYVNEFWPGHAVLVPPALPLATA 135

Query: 158 HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAA 217
            +FGSQGFFNFTSRRP HLL +LELGY+V+YNDVDMVW+ DPFP   GDHD+YFTDDM A
Sbjct: 136 QRFGSQGFFNFTSRRPQHLLKLLELGYSVLYNDVDMVWMSDPFPLFTGDHDIYFTDDMTA 195

Query: 218 VKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQP 277
           +KPLDHSH LPPPGKKGRTYICSCMI+LRPT GAKLV++KWIEELQ +PWS   KANDQP
Sbjct: 196 IKPLDHSHSLPPPGKKGRTYICSCMIFLRPTPGAKLVLQKWIEELQQQPWSPKAKANDQP 255

Query: 278 AFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTW--VEETKGKHVIIHNNYITGFEKKI 335
           AFNWALNKT+ +VD+YLLPQ++FP+GGLYFKN+TW  + + + K  IIHNNYITGF+KKI
Sbjct: 256 AFNWALNKTSNKVDMYLLPQASFPSGGLYFKNETWRRLPQNQNKLTIIHNNYITGFDKKI 315

Query: 336 KRFRDFGLWLVDDHAVES 353
           KRF D GLWL+DD  V +
Sbjct: 316 KRFHDTGLWLIDDAHVNT 333


>gi|18411573|ref|NP_567211.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
           thaliana]
 gi|57222150|gb|AAW38982.1| At4g01220 [Arabidopsis thaliana]
 gi|332656596|gb|AEE81996.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
           thaliana]
          Length = 299

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 222/297 (74%), Positives = 252/297 (84%), Gaps = 4/297 (1%)

Query: 3   QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
           ++LHQRP+ NP  NP+  S   S +    RP + +L+R  LL+LL+LLV+LGV LPW G+
Sbjct: 5   KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 62

Query: 63  PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
           P F  PN  S S +KWRDY+L QA  FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 63  PLFPSPNKLSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 122

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 123 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 182

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 183 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 242

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV-DLYLLPQ 297
           MI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA QV   +L+PQ
Sbjct: 243 MIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTANQVCSFFLVPQ 299


>gi|13877593|gb|AAK43874.1|AF370497_1 unknown protein [Arabidopsis thaliana]
          Length = 299

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 222/297 (74%), Positives = 252/297 (84%), Gaps = 4/297 (1%)

Query: 3   QWLHQRPLHNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGT 62
           ++LHQRP+ NP  NP+  S   S +    RP + +L+R  LL+LL+LLV+LGV LPW G+
Sbjct: 5   KFLHQRPIQNPFTNPFSSS-PLSTSSISNRP-ISLLSRNGLLLLLALLVILGVFLPWAGS 62

Query: 63  PGFMFPNATS-SLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ 121
           P F  PN  S S +KWRDY+L QA  FVAKNGT+IVCAVS PYLPFLNNWLIS+SRQKHQ
Sbjct: 63  PLFPSPNKHSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQ 122

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           DQVLVIAEDYATLYKVN +WPGHAVL+PPA DSQTAHKFGSQGFFNFT+RRP HLL ILE
Sbjct: 123 DQVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILE 182

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYNVMYNDVDMVWL+DPF YL+G HD YF DDM A+KPLDHSHDLPPPGKKGRTYICSC
Sbjct: 183 LGYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSC 242

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQV-DLYLLPQ 297
           MI+LRPT+GAKL+MKKWIEEL+ +PWS+AKKANDQP FNWALNKTA QV   +L+PQ
Sbjct: 243 MIFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTANQVCSFFLVPQ 299


>gi|168006283|ref|XP_001755839.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693158|gb|EDQ79512.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 297

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 186/301 (61%), Positives = 226/301 (75%), Gaps = 7/301 (2%)

Query: 57  LPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISIS 116
           + WT       PN       WR YTL QAA  +A +G +IVC VS PY+PFLNNWLIS++
Sbjct: 1   MSWTANLHLGQPN-------WRSYTLRQAAQALAVDGILIVCTVSHPYMPFLNNWLISLA 53

Query: 117 RQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHL 176
           +      VL+IAEDY TL  VN RWPGH+VL+PPA    T+ +FGSQGFFN T+RRP +L
Sbjct: 54  KYNRHQAVLIIAEDYTTLDFVNSRWPGHSVLIPPASSETTSLRFGSQGFFNLTARRPKYL 113

Query: 177 LHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRT 236
           L ILELGY+V+YNDVDMVWL DPF Y + + +VY  DDMA +K   HSH LPPPGKKGRT
Sbjct: 114 LEILELGYSVLYNDVDMVWLADPFSYFKNNREVYIIDDMALLKTEYHSHALPPPGKKGRT 173

Query: 237 YICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLP 296
           YICSCM++L+ T+GAKL+M+ WIEEL+   WS + K NDQPAFNWALNKTAGQVD+YLLP
Sbjct: 174 YICSCMLFLKSTEGAKLLMRTWIEELKERQWSPSVKTNDQPAFNWALNKTAGQVDVYLLP 233

Query: 297 QSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLG 356
           Q AFP+GGLYFKN +W +ET+ KHVI+HNNY+ GF++KIKRFR   LW V+  A  SPL 
Sbjct: 234 QVAFPSGGLYFKNDSWRKETENKHVIVHNNYVVGFDQKIKRFRAHNLWFVELDAGLSPLA 293

Query: 357 K 357
           +
Sbjct: 294 E 294


>gi|297814147|ref|XP_002874957.1| hypothetical protein ARALYDRAFT_352664 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320794|gb|EFH51216.1| hypothetical protein ARALYDRAFT_352664 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 304

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/246 (73%), Positives = 207/246 (84%), Gaps = 11/246 (4%)

Query: 58  PWTGTPGFMFPNATSSLA---------KWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFL 108
           PW G+P F FPN  SS +         KW DYTL+QAA FVAKNGT+IVCAVS P+LPFL
Sbjct: 53  PWPGSPLFQFPNRLSSSSSSLSPSRQSKWHDYTLAQAAKFVAKNGTVIVCAVSSPFLPFL 112

Query: 109 NNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNF 168
           NNWLIS+SRQKHQ++VLVIAEDYATLYKVN +WPGHAVL+PPA DS+TA+ FGSQGFFNF
Sbjct: 113 NNWLISVSRQKHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDSKTAYSFGSQGFFNF 172

Query: 169 TSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLP 228
           T+RRP HLL +LELGYNVMYNDVDMVWL+DPF YL+G HDVYFTDD+  +KPL+HSHDLP
Sbjct: 173 TARRPQHLLQVLELGYNVMYNDVDMVWLQDPFQYLEGSHDVYFTDDLPQIKPLNHSHDLP 232

Query: 229 PPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK--KANDQPAFNWALNKT 286
            PG+ G TYICSCMIYLRPT+GAKL+MKKW EELQ++ WS++   KANDQPAFN ALNKT
Sbjct: 233 HPGRNGETYICSCMIYLRPTNGAKLLMKKWSEELQSQAWSESIRFKANDQPAFNLALNKT 292

Query: 287 AGQVDL 292
           A Q  L
Sbjct: 293 AHQSSL 298


>gi|326504918|dbj|BAK06750.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 280

 Score =  343 bits (879), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 160/222 (72%), Positives = 185/222 (83%), Gaps = 2/222 (0%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVI 127
           P  ++  + WR YTL  AA+  + NGTII+ AVS PYLPFL+NWLIS+ R    +QVLVI
Sbjct: 59  PRHSAPASPWRSYTLQDAAA--SGNGTIILAAVSGPYLPFLSNWLISVRRAGRANQVLVI 116

Query: 128 AEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVM 187
           A+DY TL ++N  WPGHAVLVPPAPD+Q AHKFGSQGFFNFTSRRP HLL ILELGY+VM
Sbjct: 117 AKDYETLERINAAWPGHAVLVPPAPDAQAAHKFGSQGFFNFTSRRPRHLLQILELGYSVM 176

Query: 188 YNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRP 247
           YNDVDMVWL DPFPYL G+HDVYF DDM  VKPL+HSH LPPPGKKGR YICSCMI+L+P
Sbjct: 177 YNDVDMVWLADPFPYLVGNHDVYFMDDMTQVKPLNHSHALPPPGKKGRPYICSCMIFLQP 236

Query: 248 TDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQ 289
           T+G+KL+M+KWIEEL+ +PWSK  K+NDQPAFNWAL KT GQ
Sbjct: 237 TEGSKLLMRKWIEELKEQPWSKKVKSNDQPAFNWALLKTTGQ 278


>gi|224064098|ref|XP_002301385.1| predicted protein [Populus trichocarpa]
 gi|222843111|gb|EEE80658.1| predicted protein [Populus trichocarpa]
          Length = 172

 Score =  334 bits (857), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 154/172 (89%), Positives = 165/172 (95%)

Query: 187 MYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLR 246
           MYNDVDMVWL+DPFPYL+G+HDVYFTDDMAAVKPL HSHDLPPPGKKGRTYICSCMI++ 
Sbjct: 1   MYNDVDMVWLQDPFPYLEGNHDVYFTDDMAAVKPLGHSHDLPPPGKKGRTYICSCMIFMH 60

Query: 247 PTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLY 306
           PTDGAKLV+KKWIEELQA+PWSK KK+NDQPAFNWALNKTAGQVDLYLLPQ+AFPTGGLY
Sbjct: 61  PTDGAKLVLKKWIEELQAQPWSKTKKSNDQPAFNWALNKTAGQVDLYLLPQTAFPTGGLY 120

Query: 307 FKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           FKNQTWV+ETKGKH IIHNNYITGFEKKIKRF D+GLWLVD HA ESPLGKL
Sbjct: 121 FKNQTWVQETKGKHAIIHNNYITGFEKKIKRFHDYGLWLVDGHASESPLGKL 172


>gi|388505822|gb|AFK40977.1| unknown [Medicago truncatula]
          Length = 240

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 154/236 (65%), Positives = 180/236 (76%), Gaps = 5/236 (2%)

Query: 1   MSQWLHQRPLHNPL--PNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILP 58
           MS +LHQR LHNP    NP+P S  +S      +  L + +   LL LLSL++VLGV  P
Sbjct: 1   MSGFLHQRSLHNPFQFSNPFPSSQPSSTN---PKKPLSIFSPAILLSLLSLIIVLGVFSP 57

Query: 59  WTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQ 118
           W G P  +   +   ++KW  YTL +A +FVAKNGT+IVC VSQPYLPFLNNWLISI+  
Sbjct: 58  WVGMPQKLIFTSNPEVSKWGRYTLDEALTFVAKNGTVIVCIVSQPYLPFLNNWLISIAMH 117

Query: 119 KHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLH 178
           K  D VLVIAEDY +LYKVN  WPGHAVL+PP  D + +HKFGSQGFFNFT+RRP HLL 
Sbjct: 118 KRHDMVLVIAEDYPSLYKVNQLWPGHAVLIPPVLDLEASHKFGSQGFFNFTARRPSHLLK 177

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG 234
           ILELGY+VMYNDVDMVWL DPFPYLQG+HDVYFTDDM A+KPL+HSHDLPPPGK+G
Sbjct: 178 ILELGYSVMYNDVDMVWLGDPFPYLQGNHDVYFTDDMTAIKPLNHSHDLPPPGKRG 233


>gi|297853548|ref|XP_002894655.1| hypothetical protein ARALYDRAFT_892838 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297340497|gb|EFH70914.1| hypothetical protein ARALYDRAFT_892838 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 189

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 103/146 (70%), Positives = 115/146 (78%), Gaps = 10/146 (6%)

Query: 111 WLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS 170
           WLISISRQ H+++VLVIAEDYATLYKVN RWPGHAVL+P A D Q AHKFGSQ   +   
Sbjct: 5   WLISISRQNHEEKVLVIAEDYATLYKVNERWPGHAVLIPSALDPQAAHKFGSQATTS--- 61

Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP 230
                 L  LELGYNVMYNDVDMVWL+DPF YLQG HD YF DDM A+KPL+HSH LPP 
Sbjct: 62  ------LEHLELGYNVMYNDVDMVWLQDPFQYLQGSHDAYFMDDMIAIKPLNHSHGLPPR 115

Query: 231 GKKGRTYICSCMIYLRPTDGAKLVMK 256
            + G TY+CSCMI+L PTDGAKL+MK
Sbjct: 116 SRGGVTYVCSCMIFL-PTDGAKLLMK 140


>gi|28466905|gb|AAO44061.1| At1g56550 [Arabidopsis thaliana]
          Length = 169

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 86/128 (67%), Positives = 97/128 (75%), Gaps = 9/128 (7%)

Query: 54  GVILPWTGTPGFMFPNATSSLAK---------WRDYTLSQAASFVAKNGTIIVCAVSQPY 104
           GV LP T +  FMFPN TSS            WRDY+L+QA  FVAKN T+IVCAVS P+
Sbjct: 35  GVFLPLTKSSLFMFPNTTSSSLSPSSSLSVSDWRDYSLAQAVKFVAKNETVIVCAVSYPF 94

Query: 105 LPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQG 164
           LPFLNNWLISISRQKHQ++VLVIAEDYATLYKVN +WPGHAVL+PPA D Q+AHKFGSQ 
Sbjct: 95  LPFLNNWLISISRQKHQEKVLVIAEDYATLYKVNEKWPGHAVLIPPALDPQSAHKFGSQV 154

Query: 165 FFNFTSRR 172
              F+  R
Sbjct: 155 QREFSPFR 162


>gi|388512037|gb|AFK44080.1| unknown [Lotus japonicus]
          Length = 75

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 63/74 (85%), Positives = 68/74 (91%)

Query: 285 KTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLW 344
           K A +VDLYLLPQ AFPTGGLYFKN+TWV+ETKGKHV+IHNNYI GFEKKIKRFRD+GLW
Sbjct: 2   KNAKEVDLYLLPQPAFPTGGLYFKNKTWVKETKGKHVVIHNNYIVGFEKKIKRFRDYGLW 61

Query: 345 LVDDHAVESPLGKL 358
           LVDDHA ESPLG L
Sbjct: 62  LVDDHAKESPLGTL 75


>gi|326429312|gb|EGD74882.1| hypothetical protein PTSG_07110 [Salpingoeca sp. ATCC 50818]
          Length = 1319

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 83/286 (29%), Positives = 139/286 (48%), Gaps = 38/286 (13%)

Query: 79   DYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVN 138
            + T++      A++GTII+   S  ++ F  NW  S +R+      +V+AED +   ++ 
Sbjct: 1032 EATVAAVVKSNARDGTIILLTTSSGFMDFFLNWRES-ARRVGITNYMVLAEDLSCYEQLE 1090

Query: 139  GRWPGHAVL----VPPAPDS-------QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVM 187
               PG AVL    +  A D+       +T   + S+ +    SRRP ++  +L +GYNV+
Sbjct: 1091 AIDPGKAVLSSVRIVKASDTDAQIGKDKTGFSYASKQYNEIVSRRPTYIGRLLRMGYNVL 1150

Query: 188  YNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTY-----ICSCM 242
            Y D D VWL++PF +   D+D+Y   D                 K+  T+     +C+  
Sbjct: 1151 YTDTDTVWLENPFQHFPPDYDMYIQSD-----------------KEDETFDPWHMLCTGF 1193

Query: 243  IYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPT 302
            +++R   G    M  W   LQ    ++ K  N     +   ++   ++ +Y+LP   FP+
Sbjct: 1194 MFMRAGTGMIQFMDDWRTALQE---AQGKFVNQYIFNDLFQSRYRAKLRVYVLPDMKFPS 1250

Query: 303  GGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
            G L+F N+ W E+      ++HNN+I G + K +RFR  GLW VDD
Sbjct: 1251 GALFF-NRAWREKQPSPPAVVHNNFIVGPDSKRRRFRARGLWFVDD 1295


>gi|224127808|ref|XP_002320169.1| predicted protein [Populus trichocarpa]
 gi|222860942|gb|EEE98484.1| predicted protein [Populus trichocarpa]
          Length = 66

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 58/62 (93%), Positives = 61/62 (98%)

Query: 289 QVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
           QVDLYLLPQ+AFPTGGLYFKNQTWV+ETKGKHVIIHNNYITGFEKKIKRFRD+ LWLVDD
Sbjct: 5   QVDLYLLPQAAFPTGGLYFKNQTWVQETKGKHVIIHNNYITGFEKKIKRFRDYSLWLVDD 64

Query: 349 HA 350
           HA
Sbjct: 65  HA 66


>gi|390350619|ref|XP_003727460.1| PREDICTED: UDP-galactose:fucoside
           alpha-3-galactosyltransferase-like [Strongylocentrotus
           purpuratus]
          Length = 364

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/257 (31%), Positives = 132/257 (51%), Gaps = 28/257 (10%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA--- 151
           II+   ++ +L F  NW+ S+ R   +D V +IAED +T   +  R   +  L+  +   
Sbjct: 127 IILATTNKAFLDFTENWIESLKRCNVRDHVTIIAEDPSTYEILAKRNDINLELLLTSKTN 186

Query: 152 -PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
            PDS  A  FGSQ +    ++RP ++L  L+ G +V+++DVD VWLK+P P+ +  +D+Y
Sbjct: 187 LPDSDLA--FGSQDYLRLVNKRPNYILRYLQRGTDVLFSDVDTVWLKNPLPFFEDGYDLY 244

Query: 211 FTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA 270
           F  D+          D   P       +C+  +Y R T     ++ KWI+ + A P    
Sbjct: 245 FGRDIY--------DDQTKPD-----LVCAGFVYYRATKATIDLIVKWIQRIHARP---- 287

Query: 271 KKANDQPAFNWAL-NKT-AGQVDLYLLPQSAFPTGGLYFKNQTWVEETKG-KHVIIHNNY 327
            +  DQ   N  L N+T    + L  L Q  FP G  YF N  W E+    + +++HNN+
Sbjct: 288 -EIPDQQLLNHLLRNRTIRNTLKLKYLDQRQFPNGNDYF-NVEWREKHANIEPIVVHNNW 345

Query: 328 ITGFEKKIKRFRDFGLW 344
           I G + KI+RF++  +W
Sbjct: 346 IKGHDIKIERFKNASMW 362


>gi|390348517|ref|XP_003727022.1| PREDICTED: UDP-galactose:fucoside
           alpha-3-galactosyltransferase-like [Strongylocentrotus
           purpuratus]
          Length = 293

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 73/260 (28%), Positives = 128/260 (49%), Gaps = 25/260 (9%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG-HAVL 147
           ++ +G +++   ++ ++ F  NWL S+ R + +  +  IAED    Y+   R+P    V+
Sbjct: 57  ISNDGPLVLITSNKAFMDFAENWLESVHRLESRLNIFAIAEDMVA-YRTFLRYPDVTTVM 115

Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
              A   Q    + S  +    ++RP ++  +L  G +V+++DVD VWLKDP P+L GD+
Sbjct: 116 TQRAVSPQKRLAYLSHDYNVLINKRPVYIYRLLAKGRDVLFSDVDTVWLKDPLPHLDGDY 175

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV    D+   K                   C+  I+ R T+ ++  + +WI+ +     
Sbjct: 176 DVVLQVDLRVPK----------------VVYCAGFIFFRATNASRAFVWEWIDRIH---- 215

Query: 268 SKAK-KANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNN 326
            KA+    DQ   N  L +  G + + +L  + FP G LYF ++ W      K VI+HNN
Sbjct: 216 -KARDNIPDQKILNELLEENFGDLRVKVLDSALFPNGALYFDDK-WRRTQTVKPVIVHNN 273

Query: 327 YITGFEKKIKRFRDFGLWLV 346
           +I   ++K+ RF+  G W +
Sbjct: 274 WIEDHDEKVLRFKSHGFWFI 293


>gi|167521059|ref|XP_001744868.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776482|gb|EDQ90101.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1062

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 133/278 (47%), Gaps = 44/278 (15%)

Query: 89   VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
            V++N  + V  V+  ++ F  N L+S++R    +  ++IAEDY +  ++N R+P H V++
Sbjct: 800  VSENNAVTVMTVTSGFVDFATNLLMSMTRVG-VNNFIIIAEDYTSYQRLNARYP-HRVVL 857

Query: 149  P------------PAPD--SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMV 194
            P               D   +T   + S+ +     RRP +LL IL +G++V+Y D D V
Sbjct: 858  PNLRTMMQGMSGRSGADVADRTGFSYASKQYNEIVGRRPRYLLGILRMGFDVLYTDTDTV 917

Query: 195  WLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTY-----ICSCMIYLRPTD 249
            WL++P+   Q  +D+  + D                 K+  T+     +C+  ++LR   
Sbjct: 918  WLENPYHQFQAGYDMQISSD-----------------KEDETFDPWHMLCTGFMFLRSKR 960

Query: 250  GAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALN-KTAGQVDLYLLPQSAFPTGGLYFK 308
                 + +W   L+A   ++    N Q  FN   N K   Q+    LP   FP+G LYF 
Sbjct: 961  PVMAFLDEWRRALEA---AQGVTVN-QYVFNDIFNRKYREQIPTRPLPDRKFPSGALYF- 1015

Query: 309  NQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLV 346
            ++ W      +  ++HNN+I G + K +RF+  GLW V
Sbjct: 1016 DKYWRSAQPEQPTVVHNNFIVGADAKRQRFQKLGLWYV 1053


>gi|390348519|ref|XP_003727023.1| PREDICTED: uncharacterized protein LOC100888971 [Strongylocentrotus
           purpuratus]
          Length = 977

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 74/261 (28%), Positives = 129/261 (49%), Gaps = 30/261 (11%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
           +I+ + ++ +L F +NWL S+ R   +  V ++AED      +N R      +V      
Sbjct: 732 VILVSTNKAFLNFTDNWLESVKRSGIRSGVTLVAEDREAFNYLNNRTDIELNVVLNDVSE 791

Query: 155 QTAHK--FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD-HDVYF 211
               +  F S  +    ++RP ++L +L  G++V+++DVD+VWLK+P PY   D +D++ 
Sbjct: 792 SPGERLLFDSPAYKQLVNKRPSYILQLLSSGHDVLFSDVDIVWLKNPLPYFTNDTNDIWL 851

Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
            +D+         H+         T  C+   + R +     ++ +W++ L   P     
Sbjct: 852 QEDL---------HEP--------TVYCAGFTFYRSSPATIALVTEWVQTLALHPTYP-- 892

Query: 272 KANDQPAFNWALNKTAGQVDLY---LLPQSAFPTGGLYFKNQTWVEETKG-KHVIIHNNY 327
              DQ   N  L K   Q D     ++    FP+G LYF +  W E  K  + V++HNN+
Sbjct: 893 ---DQRVLNGLLKKKRWQGDYIKRAVMDSRLFPSGRLYF-DPDWREANKDTEQVMVHNNW 948

Query: 328 ITGFEKKIKRFRDFGLWLVDD 348
           I G ++K++RFR+ GLW +DD
Sbjct: 949 IKGHDRKVERFRNEGLWYLDD 969


>gi|440804176|gb|ELR25053.1| fucosylgalactoside 3-alpha-galactosyltransferase [Acanthamoeba
           castellanii str. Neff]
          Length = 806

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 145/301 (48%), Gaps = 49/301 (16%)

Query: 76  KWRDY--TLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV-----IA 128
           K+RDY   L  A   VA+NGT+I+   +  YL FL NW   + +Q   + V++     +A
Sbjct: 401 KYRDYPFLLHNALPRVAQNGTVIMVIANSGYLEFLLNWKSYVDKQGITNYVIIPSDVQMA 460

Query: 129 EDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS---RRPCHLLHILELGYN 185
           +  + L       P   +       S T +K G++ + N+     ++  +   I+E+GY+
Sbjct: 461 QQLSYLGVDWAYDPEIGMDALSQSVSYTMNK-GTRAWANWNKVVHKKSHYFKKIVEMGYS 519

Query: 186 VMYNDVDMVWLKDPFPYLQ-GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
           VM +D+DMVWLK+PF  +   D D++FT+D  A     H  D         + IC  + Y
Sbjct: 520 VMVSDIDMVWLKNPFSRMNDSDVDIFFTNDGGA-----HGRD---------SSICGGLFY 565

Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYL-LPQS----- 298
            +  + +   +   IE          ++  +Q   N  L   +G+ +  + +P++     
Sbjct: 566 GKNREKSLAFINHVIE------CEVTQEKKEQYCMNKWLGAVSGKNNANVPVPETPNYKI 619

Query: 299 ----AFPTGGLYFKNQTWVEETKGKHV--IIHNNYITGFEKKIKRFRDFGLWLVDDHAVE 352
                FP+GG +F ++ W     GK +  ++HNN+I G   K+KRF+ FG W VDD ++ 
Sbjct: 620 MSIDDFPSGGYFFGDK-WA----GKELPYVVHNNWIVGKANKLKRFKVFGYWRVDDSSLS 674

Query: 353 S 353
           +
Sbjct: 675 N 675


>gi|299117548|emb|CBN75392.1| Xylosyltransferase, family GT77 [Ectocarpus siliculosus]
          Length = 548

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 80/268 (29%), Positives = 122/268 (45%), Gaps = 30/268 (11%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           + G +IV +V+  YL   +N+L S+ +     +VL +A D      ++   PG  V  P 
Sbjct: 276 QQGVVIVSSVNCGYLDMASNFLQSVGQAAGDVKVLFVARDEVAFDFLDALSPGCTVFFPE 335

Query: 151 APDSQTAH-----KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
              S+ AH     ++G   F   T  RP  LL IL  GY  +Y DVD+ WL +    L  
Sbjct: 336 V-GSERAHAIQAGQWGDNIFKQQTVARPDILLPILRQGYKALYTDVDIFWLGNVLSLLPN 394

Query: 206 DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAE 265
              V  T  +  +   D                C+C +YL  T  A  +++ W +E+   
Sbjct: 395 SR-VTQTPPVEVILQADAKQK------------CTCFMYLDSTPNAIRLLELWKQEI--- 438

Query: 266 PWSKAKKANDQPAFNWALNK-TAGQVDLYLLPQSAFPTGGLYFKNQTWVEET----KGKH 320
             ++     DQ AF   L +     + L LLP+ A   G   F ++T+V+      + K 
Sbjct: 439 --AEKSSFQDQRAFQSPLAQMQEAGLALQLLPKEAMLPGNRGF-DETYVKNNYALLQDKL 495

Query: 321 VIIHNNYITGFEKKIKRFRDFGLWLVDD 348
           +I+HNN+I G + K +RFR  GLW VDD
Sbjct: 496 LIVHNNWIVGHDPKRERFRMAGLWEVDD 523


>gi|384244551|gb|EIE18052.1| hypothetical protein COCSUDRAFT_49426 [Coccomyxa subellipsoidea
           C-169]
          Length = 557

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 128/278 (46%), Gaps = 38/278 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQV-----LVIAEDYATLYKVNGRWPG 143
           VA + T+I+   S  Y+ F  NW+      KH + +     L + +D  +   +N R+PG
Sbjct: 286 VAVDKTVILTQTSCGYIDFAENWI------KHAEGLGIKNYLTVVDDEISFEYLNLRYPG 339

Query: 144 HAVLV----PPAPD-SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
           H V V    P A +  Q    FGS+ F   +  R  +   +LE G+ +++ D+D V  +D
Sbjct: 340 HIVSVDVFSPGATNFDQPLLDFGSKLFNKMSCDRLTYQRKVLERGFTMLWIDMDTVLYQD 399

Query: 199 PFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
           P   + G  D   TDD+   + L+ S       K G   IC C+++ RPT  AK  +++W
Sbjct: 400 PVAVMPGGLDFIATDDILDEEGLNQSEQ-----KTGN--ICGCLMFFRPTRNAKDFLRQW 452

Query: 259 IEELQAEPWSKAKKANDQPAFN--WALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEET 316
            ++   +      +  DQ A N  W  +    ++  Y++P+  FP+G L  K  + V E 
Sbjct: 453 YDKCTTQ------ELPDQTALNALWRESDWKQRLHWYIMPRQLFPSGRLEPKIDS-VPEA 505

Query: 317 KGKH------VIIHNNYITGFEKKIKRFRDFGLWLVDD 348
           K +       V IH NY  G   K    ++   W++ D
Sbjct: 506 KHRDKSKMFPVFIHANYRIGKNDKRLFLKERSAWIIPD 543


>gi|413948937|gb|AFW81586.1| hypothetical protein ZEAMMB73_866796, partial [Zea mays]
          Length = 54

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 41/54 (75%), Positives = 47/54 (87%)

Query: 305 LYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           L FKN+TWV++TK KHV IHN YITGFEKK KRFRD GLWLVDDH+ ESPLG++
Sbjct: 1   LSFKNKTWVKQTKVKHVTIHNKYITGFEKKAKRFRDHGLWLVDDHSDESPLGRI 54


>gi|384249697|gb|EIE23178.1| hypothetical protein COCSUDRAFT_63537 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 74/285 (25%), Positives = 123/285 (43%), Gaps = 31/285 (10%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L +  + VA +  +I+   S  YL F  NW+  +      +  L IAED   L  +  R+
Sbjct: 33  LQERVAAVAVDNKVILTQTSCGYLEFAVNWITHVEALGLTNW-LTIAEDETALKFLEERY 91

Query: 142 PGHAV----LVPPAPDSQTA-HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
           PGHA+        A  S  A +++GS  F      RP +L  +L+LGY V+++D+D VWL
Sbjct: 92  PGHALPASAFTNEALSSGNALYEWGSAAFTKVACARPSYLQMVLDLGYEVLWSDMDAVWL 151

Query: 197 KDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMK 256
           K+ F       D    DD                 ++     C+ ++Y RPT  A+ ++ 
Sbjct: 152 KNFFALAPQGLDYVGVDDSETEN------------EQETENACTGLMYFRPTARAQQLLA 199

Query: 257 KWIEELQAEPWSKAKKANDQPAFNWALNKTA--GQVDLYLLPQSAFPTGGL-----YFKN 309
            W  ++  E        N+Q AFN   + +     +D +++P+  +P G L     Y ++
Sbjct: 200 DW-HDMCIE-----LNNNNQGAFNRVFSGSGLHRTIDYFIMPKQLYPHGALIELVDYKRS 253

Query: 310 QTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESP 354
           +     +      +H NY  G   K +  RD  +W  D  +   P
Sbjct: 254 EAAASASGMDPAWVHANYRVGHSAKREFLRDRLVWKADQASASFP 298


>gi|390347469|ref|XP_003726790.1| PREDICTED: uncharacterized protein LOC100893515 [Strongylocentrotus
           purpuratus]
          Length = 384

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 65/257 (25%), Positives = 113/257 (43%), Gaps = 24/257 (9%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYAT---LYKVNGRWPGHAVLVPPA 151
           +I+ A  + ++    NW  SI R      VL++AED  +    ++         +    A
Sbjct: 149 VILTATDKNFIDLAENWRESIRRLGLSYDVLLMAEDEESHKYFFERTTERFRTVLSTQFA 208

Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
              + A    +  +     RR  ++L IL+ GY+V+  D+D VW KDP   +  ++D Y 
Sbjct: 209 LPGKLAKNIST--YQQLIRRRTVYILSILQSGYDVLLVDIDAVWFKDPVKLVLDEYDKY- 265

Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
             D+   +           GK+ R + C C  Y++ T     ++  WIE L    +   K
Sbjct: 266 --DIWLAQ-----------GKQAR-FPCPCFFYMKSTSVVIKMVYDWIERL---AYYGGK 308

Query: 272 KANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGF 331
              DQ A ++ L      + +       FP G  +F N TW ++   +  I H N++   
Sbjct: 309 VETDQVALSFILAANKRNLGIKYFDYYEFPIGKQFF-NWTWHDQFADRVYIAHGNHLGRS 367

Query: 332 EKKIKRFRDFGLWLVDD 348
             K+ +F++F +WL+DD
Sbjct: 368 VGKVSKFKEFKVWLIDD 384


>gi|390342689|ref|XP_003725713.1| PREDICTED: uncharacterized protein LOC100892399 [Strongylocentrotus
           purpuratus]
          Length = 317

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 114/265 (43%), Gaps = 32/265 (12%)

Query: 96  IVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA---- 151
           I+   +  +     NWL S+ R      V +IAEDY     +  R     ++V       
Sbjct: 62  ILTTTNTAFSDLTENWLESLRRLDRPYNVTLIAEDYGAFQYLKDR---KDIMVSSRLQIN 118

Query: 152 ---PDSQTAHK----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
               D   +H     F ++ +     +RP ++L IL  G  V++ D D VWL+DP P + 
Sbjct: 119 LRFTDMNVSHSAALGFYTKEYIQLVGKRPHYILDILRSGVGVLFADADAVWLRDPIPLIA 178

Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
              DVY   D+     L     +P          C C +YL+PT  A  ++K W   ++ 
Sbjct: 179 ---DVYSRYDVWVA--LGEGGQIP----------CPCFMYLKPTPRAVEMVKAWQSRIE- 222

Query: 265 EPWSKAKKANDQPAFNWALNK--TAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVI 322
           +  +     NDQ A +W ++      ++ +  L  +AFPTG   F  + W    K +  +
Sbjct: 223 DQLNTTSPENDQGALDWVMHNHIRYKRLRVKKLDMTAFPTGQSMFLKKGWYAAHKNEIYV 282

Query: 323 IHNNYITGFEKKIKRFRDFGLWLVD 347
           +H N + G+ KK   F+   LW ++
Sbjct: 283 VHANKVGGYAKKKFVFKWKKLWYLE 307


>gi|384253349|gb|EIE26824.1| hypothetical protein COCSUDRAFT_59332 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/272 (24%), Positives = 119/272 (43%), Gaps = 32/272 (11%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           +A + T+I+   S PYL F  NW++ + R    + ++++ +D A  Y ++ ++PGH V  
Sbjct: 59  IAVDKTVILTQASCPYLDFAENWILHVERLGITNYLILMDDDVAFKY-LDSKFPGHTVH- 116

Query: 149 PPAPDSQTAH------KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
           P      + H      ++ S  F N    R      +L+LG++ ++ D+D VWL+D    
Sbjct: 117 PSVISKGSKHMPKPLMRYDSLDFNNMMCDRLGIQRRVLDLGFSFLWTDMDTVWLQDASKI 176

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           +    D   T D          HD     K     IC CM +  PT  A+  ++ W ++ 
Sbjct: 177 IPRGFDFVGTGDR-----FRFGHDEEEENK-----ICGCMTFWTPTVPARQALRDWHQK- 225

Query: 263 QAEPWSKAKKANDQPAFN--WALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKH 320
                      +DQ      WA  +   +V  Y++P   FP+G L  + +     ++ ++
Sbjct: 226 -----CIDSTVDDQRTLQEMWASGELKKKVFWYIMPWQLFPSGALLDQVKVDFSRSQAQN 280

Query: 321 ------VIIHNNYITGFEKKIKRFRDFGLWLV 346
                  +IH NY TG + K K  ++   W +
Sbjct: 281 PGAVLPAVIHANYRTGDDAKRKFLQERFAWKI 312


>gi|281203601|gb|EFA77798.1| WD40 repeat-containing protein [Polysphondylium pallidum PN500]
          Length = 647

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/256 (24%), Positives = 119/256 (46%), Gaps = 31/256 (12%)

Query: 109 NNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNF 168
           NN + SI+++   ++ +    D  +++  +    G+ +        + A  +G+ GF   
Sbjct: 118 NNAMNSIAKEA-VERPMTSVNDMMSMFGGSAGGSGNTIASNDTVYGEEAESYGNVGFRAI 176

Query: 169 TSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-------DHDVYF------TDDM 215
            + +P  +L +L+ GYNV++ D D+VWL +PF   +        D+D          DD+
Sbjct: 177 CNEKPLVVLDVLKRGYNVLWTDTDIVWLGEPFAAFKHATQESGIDYDSLDLIVQQDDDDI 236

Query: 216 AAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE--LQAEPWSKAKKA 273
            A      S+++         Y+ + + +L P    ++ M+K+++E  +Q    +     
Sbjct: 237 CAGFYYIRSNEVTIK------YMETVIAFLNPIVDDQISMRKFLKEHAIQITKNNNNSSG 290

Query: 274 NDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEK 333
            D P+ N        ++  + LP++ FP G  YF N    +    K VI+HNN I G   
Sbjct: 291 VDTPSGN--------KLRYFRLPRTTFPNGTAYF-NLKIPQRNNVKPVIVHNNCIIGHRS 341

Query: 334 KIKRFRDFGLWLVDDH 349
           K +RF ++ LW +DD+
Sbjct: 342 KRERFIEYNLWFIDDN 357


>gi|300176044|emb|CBK23355.2| unnamed protein product [Blastocystis hominis]
          Length = 367

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 90/196 (45%), Gaps = 26/196 (13%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D+ TA  +GS+ F N    +   L   +     V+Y D D +  KDPFP L      Y  
Sbjct: 41  DTSTASSYGSRAFRNKVHWKLIMLQQAVNQNVRVLYMDSDNILFKDPFPVLNS----YNG 96

Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
            D  A + +D               IC+  IYL PT   K ++ K IE     P  K   
Sbjct: 97  YDFIAQRDVD---------------ICTGFIYLMPTLMTKQLLAKTIE---IRP--KLLN 136

Query: 273 ANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWV-EETKGKHVIIHNNYITGF 331
           A+DQ AFN  +      V L  LP   F +G ++FK  ++  ++ +   +++H+N++ G 
Sbjct: 137 ADDQKAFNMVVQNNTS-VKLLFLPDHLFSSGAVFFKKHSYYWDKIRETQIMMHDNFVIGI 195

Query: 332 EKKIKRFRDFGLWLVD 347
           + KI R ++  L+ +D
Sbjct: 196 KNKIYRLKELKLYKLD 211


>gi|405974682|gb|EKC39308.1| hypothetical protein CGI_10024476 [Crassostrea gigas]
          Length = 370

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/332 (23%), Positives = 141/332 (42%), Gaps = 38/332 (11%)

Query: 35  LLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSS--LAKWRDYT--LSQAASFVA 90
           L+ L   T++++LSL      IL  +    F+F  A  S  L   R+ T  +  AA  ++
Sbjct: 41  LITLGVLTVILMLSLFTYNTKILKHSQREDFVFQPAKFSDLLRDGRNDTNHVLTAAMRIS 100

Query: 91  KNG--TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           K G   ++V  V+  +LPF  +WL +       +QVL I  D  +  K+N +WP    + 
Sbjct: 101 KGGQKALLVTLVNDAFLPFTFSWLCNTQGMGIHNQVLFITGDNESAMKINQKWPEVTAIQ 160

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
                S    ++   G+     RR   LL ILE    +   +VD +W+ +P   +Q   +
Sbjct: 161 IDGVHSGN-QEYSHVGYVELMVRRSEILLEILEKNIPIFLFEVDCIWITNPLNNIQSYSN 219

Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL------ 262
           V    +  + +                  I    +YL PTD +K + +   ++L      
Sbjct: 220 VDVVVNAVSSRS---------------NVIAGGFLYLHPTDASKKMWRALTKQLVQLGMK 264

Query: 263 ---QAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGK 319
               A+    ++  NDQ   +  +    G +   +L    +  G  Y    ++ +E + +
Sbjct: 265 IKNSADEKLVSEGDNDQQYLSKLVFSKYGGLRYKILSTDDYADGKWY----SFSKEERNR 320

Query: 320 H---VIIHNNYITGFEKKIKRFRDFGLWLVDD 348
               +I++NN+I+G   KIKR + +G W + +
Sbjct: 321 RKMPLILNNNWISGNLAKIKRAKAWGHWFLTE 352


>gi|330800041|ref|XP_003288048.1| hypothetical protein DICPUDRAFT_152231 [Dictyostelium purpureum]
 gi|325081936|gb|EGC35435.1| hypothetical protein DICPUDRAFT_152231 [Dictyostelium purpureum]
          Length = 593

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 95/205 (46%), Gaps = 17/205 (8%)

Query: 155 QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
           + A  +G+ GF    + +P  +L +L+ GYNV++ D D+VW KDPF        ++F ++
Sbjct: 87  EEAESYGNIGFRAICNEKPLVVLEVLKQGYNVLWTDTDIVWQKDPF--------IHFYNE 138

Query: 215 MAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPT--------DGAKLVMKKWIEELQAEP 266
           +      +++ D+    ++    IC+   ++R          D    +     +++    
Sbjct: 139 INKSNGFENNDDIDLYVQQDDDDICAGFYFIRSNPKTIKYIHDTIAFLNPMIDDQIAMRL 198

Query: 267 WSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNN 326
           + K++  N         N+ + ++   LL +  FP G  YF N    +       I+HNN
Sbjct: 199 FLKSQGVNILSKEILLKNQNSDKIKYVLLDRKLFPNGTAYF-NLKLTQRANITPYIVHNN 257

Query: 327 YITGFEKKIKRFRDFGLWLVDDHAV 351
            I G   K +RF ++GLW VD+ ++
Sbjct: 258 CIIGHRSKKERFIEYGLWSVDEKSL 282


>gi|440803911|gb|ELR24794.1| hypothetical protein ACA1_174550 [Acanthamoeba castellanii str.
           Neff]
          Length = 458

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 73/296 (24%), Positives = 119/296 (40%), Gaps = 54/296 (18%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV----IAEDYATLYKV 137
           L Q    VAK+G +I+   +  YL  L NW  S+ R    + V+V     A    +   +
Sbjct: 35  LEQETRRVAKDGVVILALCNAGYLDLLLNWKASVDRLNITNYVIVPNDIKAAQELSFLGL 94

Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQG-----FFNFTSRRPCHLLHILELGYNVMYNDVD 192
           +  +     L   A     ++ F  +      +     ++  ++  +L LG +V+  DVD
Sbjct: 95  DWAYDPTIGLGDLASSEAASYPFNKEDPMYKLWNGVVHKKAAYVWKLLSLGLDVLVTDVD 154

Query: 193 MVWLKDPFPYLQGDH-DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGA 251
           +V+LKDP P       D++F DD  +      + D  PP   G + +  C          
Sbjct: 155 IVFLKDPLPLFSNKTIDLFFIDDTKS-----KTDDGQPPSLCGAS-VLEC---------- 198

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFN-W----ALNKTAGQVDLYLLPQSAFPTGGLY 306
                            + K + +QP FN W    A   +   V + ++ ++ FP+G  Y
Sbjct: 199 -----------------EQKGSKEQPCFNKWYQDIAKRGSQSSVRVQVMTRADFPSGYNY 241

Query: 307 F------KNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLG 356
           F            E+ K    I+H+N+I G E KI R R  GL LVD   +E  + 
Sbjct: 242 FHYFWKTNGDMIREKVKPDPYIVHDNWIVGHENKITRLRQHGLLLVDIKELEEKMA 297


>gi|66811206|ref|XP_639311.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
 gi|74996971|sp|Q54RP0.1|AGTA_DICDI RecName: Full=UDP-galactose:fucoside alpha-3-galactosyltransferase;
           AltName: Full=Alpha-GalT1; AltName:
           Full=Fucosylgalactoside 3-alpha-galactosyltransferase;
           AltName: Full=Skp1 alpha-3-galactosyltransferase
 gi|60467933|gb|EAL65946.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
 gi|84873369|gb|ABC67744.1| UDP-Gal:fucoside alpha3-galactosyltransferase [Dictyostelium
           discoideum]
          Length = 648

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 66/291 (22%), Positives = 120/291 (41%), Gaps = 31/291 (10%)

Query: 77  WRDYTLSQAASF----VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYA 132
           +RD TL+    F    + K+  I+ C   + Y  F     I   R   +D ++  +    
Sbjct: 22  FRDMTLNLLKCFEKLSIDKSRYILYCMDDKAYQFFAEFKGIECQRFS-RDDIINSSTSST 80

Query: 133 TLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
            L+  N       +       S+ A  +G  GF    + +P  +L +L+ GYNV++ D D
Sbjct: 81  QLFHDNNTNDNKGIY------SENAESYGDIGFRAICNEKPLVVLDVLKKGYNVLWTDTD 134

Query: 193 MVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAK 252
           +VW +DPF        ++F  D+       +  D+    ++    IC+   ++R      
Sbjct: 135 IVWKRDPF--------IHFYQDINQENQFTNDDDIDLYVQQDDDDICAGFYFIRSNQRTI 186

Query: 253 LVMKKWI--------EELQAEPWSKAKKANDQPA---FNWALNKTAGQVDLYLLPQSAFP 301
             ++  I        +++    + K++  N +      + + N    ++   LL +  FP
Sbjct: 187 KFIQDSINFLNPCIDDQIAMRLFLKSQGINIKSKNILLSLSENDKKDKIRYRLLDKKLFP 246

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVE 352
            G  YF N    +       IIHNN I G   K  RF ++GLW ++D  ++
Sbjct: 247 NGTNYF-NLKITQRDNITPFIIHNNCIIGHRSKKDRFIEYGLWYINDDEID 296


>gi|428172448|gb|EKX41357.1| hypothetical protein GUITHDRAFT_142056 [Guillardia theta CCMP2712]
          Length = 528

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/260 (25%), Positives = 118/260 (45%), Gaps = 25/260 (9%)

Query: 94  TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPD 153
           +II+   ++ Y  F  NW++S+   + ++ +LV  ++ A L+        +        D
Sbjct: 62  SIILGFANKGYSKFAFNWVLSLRHAEVENFLLVALDEEAHLHFTRHHVTSYYNASMGTTD 121

Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
           +++ H  GS+ F N    R  +++ +LE  ++V   DVD V+  DPF +L  D       
Sbjct: 122 AKSQHH-GSKTFRNIMEIRLRYVVELLEQDFDVWLTDVDSVFNTDPFVFLDAD------- 173

Query: 214 DMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLR-----PTDGAKLVMKKWIEELQAEPWS 268
              +   L +     P GK     + +   Y+R     P + A  ++K+ I+ +   P  
Sbjct: 174 ---SAAELAYDTPFLPKGKDSPLMVMAGFFYMRRCSKFPENCA--LLKETIKYIDDHP-- 226

Query: 269 KAKKANDQPAFNWALNKTAGQVDLY-LLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY 327
              + +DQ AFN  L++   +   Y L+    F  G LYF  +   +    K  ++ NN+
Sbjct: 227 ---EKHDQFAFNAVLSRKEAEGYKYKLMDPLLFCNGALYFSERA-PQMLGMKSAVVQNNH 282

Query: 328 ITGFEKKIKRFRDFGLWLVD 347
           ITG   K  RFR+  LW +D
Sbjct: 283 ITGVLSKRHRFREHLLWYLD 302


>gi|440803910|gb|ELR24793.1| fucosylgalactoside 3-alpha-galactosyltransferase [Acanthamoeba
           castellanii str. Neff]
          Length = 359

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 42/279 (15%)

Query: 86  ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYAT--LYKVNGRWPG 143
           A  VAK+G +++   +  YL  L NW  S+ R    + V+V  +  A   L  +   W  
Sbjct: 4   AKRVAKDGVVVLALCNAGYLDLLLNWKASVDRLNITNYVIVPNDIKAAQQLSFLGIDWAY 63

Query: 144 HAVLVPPAPDSQTAHKFGS-------QGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
              +      S  A ++ +       Q +     ++  ++  I+  G NV+  DVD+V++
Sbjct: 64  DPAIGLGDLASSEAARYTTDKKDPMHQSWNQVVHKKAANVRAIIATGLNVLVTDVDIVFM 123

Query: 197 KDPFP-YLQGDHDVYF-TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLV 254
           KDP P ++    D++F  DDM                K G+  +C      R  +     
Sbjct: 124 KDPLPLFVNKTVDLFFINDDMR---------------KGGQQALCGGFWLGRSNEHTM-- 166

Query: 255 MKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
              +I+ +Q     + K   +QP FN   +K   Q +  ++P   FP+G  YF ++ W  
Sbjct: 167 --AFIDSVQR---CEQKGIKEQPCFNKWHDKIRRQRN-EVMPMDDFPSGYYYF-HEFWKT 219

Query: 315 ETKGKHV-------IIHNNYITGFEKKIKRFRDFGLWLV 346
            + G  V       I+HNN+I G E KI RF   GL +V
Sbjct: 220 TSDGVRVKVRPDPYIVHNNWIVGHENKIARFLQHGLLMV 258


>gi|42563115|ref|NP_177220.2| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
 gi|332196971|gb|AEE35092.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
          Length = 537

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 68/270 (25%), Positives = 109/270 (40%), Gaps = 23/270 (8%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           KN T+++      Y   L +W+  + R K  + ++   +D    + +    P       P
Sbjct: 271 KNRTVVLSVAGYSYKDMLMSWVCRLRRLKVPNFLVCALDDETYQFSILQGLPVFFDPYAP 330

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
              S     FGS+ F   T  +   +L IL+LGYNV+ +DVD+ W ++P P LQ      
Sbjct: 331 KNISFNDCHFGSKCFQRVTKVKSRTVLKILKLGYNVLLSDVDVYWFRNPLPLLQS----- 385

Query: 211 FTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK- 269
           F   + A +  D  +   P  +  R  + S   + R        M+K ++       S+ 
Sbjct: 386 FGPSVLAAQS-DEYNTTAPINRPRR--LNSGFYFARSDSPTIAAMEKVVKHAATSGLSEQ 442

Query: 270 ------------AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETK 317
                       A +  D        N T   +D  L P  A+  G L+ K     E  K
Sbjct: 443 PSFYDTLCGEGGAYRLGDDRCVEPETNLTVQFLDRELFPNGAY--GDLWLKEDVRAECEK 500

Query: 318 GKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
               ++HNN+I+G  KK++R    GLW  D
Sbjct: 501 KHCFVLHNNWISGRLKKLERQMMKGLWEYD 530


>gi|12324743|gb|AAG52325.1|AC011663_4 hypothetical protein; 72471-70598 [Arabidopsis thaliana]
 gi|12325048|gb|AAG52475.1|AC010796_14 hypothetical protein; 82031-83904 [Arabidopsis thaliana]
          Length = 535

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 68/270 (25%), Positives = 109/270 (40%), Gaps = 23/270 (8%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           KN T+++      Y   L +W+  + R K  + ++   +D    + +    P       P
Sbjct: 269 KNRTVVLSVAGYSYKDMLMSWVCRLRRLKVPNFLVCALDDETYQFSILQGLPVFFDPYAP 328

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
              S     FGS+ F   T  +   +L IL+LGYNV+ +DVD+ W ++P P LQ      
Sbjct: 329 KNISFNDCHFGSKCFQRVTKVKSRTVLKILKLGYNVLLSDVDVYWFRNPLPLLQS----- 383

Query: 211 FTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK- 269
           F   + A +  D  +   P  +  R  + S   + R        M+K ++       S+ 
Sbjct: 384 FGPSVLAAQS-DEYNTTAPINRPRR--LNSGFYFARSDSPTIAAMEKVVKHAATSGLSEQ 440

Query: 270 ------------AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETK 317
                       A +  D        N T   +D  L P  A+  G L+ K     E  K
Sbjct: 441 PSFYDTLCGEGGAYRLGDDRCVEPETNLTVQFLDRELFPNGAY--GDLWLKEDVRAECEK 498

Query: 318 GKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
               ++HNN+I+G  KK++R    GLW  D
Sbjct: 499 KHCFVLHNNWISGRLKKLERQMMKGLWEYD 528


>gi|167523934|ref|XP_001746303.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775065|gb|EDQ88690.1| predicted protein [Monosiga brevicollis MX1]
          Length = 319

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 59/265 (22%), Positives = 111/265 (41%), Gaps = 37/265 (13%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQ-VLVIAEDYATLYKVNGRWPGHAVLV----- 148
           II+ A+S  +  F+ N L SI       +  +VIAED ++ +++  + PG  +L      
Sbjct: 44  IILAAISSGFSDFMYNLLASIQATVGTTRHAVVIAEDVSSYFQMEEKLPGQVLLATEDLL 103

Query: 149 --------PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                         +    +GS  +      RP +L  + + G ++++ D D+ + ++P+
Sbjct: 104 TFGMHNAREKLEGDKRGQSYGSTLYLLLMRARPKYLTRMHDTGVHMLFVDADIAFFENPY 163

Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
            ++  +              L  S+DL          +C+  +Y   T      ++ W +
Sbjct: 164 AWMDAN--------------LKQSYDLLASNDHA-NLLCAGFVYYANTSVITRFLEVW-D 207

Query: 261 ELQAEPWSKAKKANDQPAFNWALNK-TAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGK 319
           +L A+         +Q +FN AL       +  Y LP S FP G  YFK   + E     
Sbjct: 208 QLVAQG-----DGRNQASFNKALKHPDVTGIQFYELPSSKFPHGRYYFKKH-FAEMQANP 261

Query: 320 HVIIHNNYITGFEKKIKRFRDFGLW 344
            +++H N ++G ++K    R  G+W
Sbjct: 262 PIMVHANGMSGGDRKRNALRRIGMW 286


>gi|342318978|gb|EGU10930.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
          Length = 969

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 65/272 (23%), Positives = 114/272 (41%), Gaps = 19/272 (6%)

Query: 81  TLSQAASFVA--KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL-YKV 137
           TL  A   VA  K   +++   +  YL  + N+ +   R+   D  ++IA D  T+ +  
Sbjct: 342 TLLPALKLVANSKTNNVMLAFGTSNYLDLVRNF-VHFVREAGIDNFVLIAMDADTVAWAE 400

Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
             + P ++ +         +  + S GF    +RR   +   L  G+N++ +D+D++W+K
Sbjct: 401 EEKVPYYSYIDEEVATLGGSDSYKSDGFRRVVNRRCSVISTALRGGFNILQSDLDVIWVK 460

Query: 198 DPFPY-LQGDHDVYF-TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVM 255
           +PFPY   GD++    +D        D       P    R ++ S + Y R T       
Sbjct: 461 NPFPYFFNGDYEYEIQSDGRRGFTERD-------PAAPFRDFVNSGLFYARGTPRMADFY 513

Query: 256 KKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEE 315
              I  +   P  +     +Q   N  L +   ++   +L  + FP G  YF  +     
Sbjct: 514 DILIRTVAENPHRR-----EQHLLNTILQENVLRIHYRILDPTLFPNGFQYFA-RALPTR 567

Query: 316 TKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
              +   IHNN++ G   K  RFR+ G+W  D
Sbjct: 568 AGVEPFCIHNNWVDGKYTKEYRFREIGMWTQD 599


>gi|443696457|gb|ELT97152.1| hypothetical protein CAPTEDRAFT_193488 [Capitella teleta]
          Length = 530

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 115/246 (46%), Gaps = 28/246 (11%)

Query: 108 LNNWLISISRQKHQDQVLVIA-EDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           +N +L SI R   ++ + V + E++ + ++          +   A D   A  + S+ F 
Sbjct: 122 MNFYLTSIQRYDVRNVLFVSSSEEFCSRFQA-IEVACFVYMNESAHDK--ASVYLSKDFI 178

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
           N  + R   +L  L+LGY+V+++DVD+V+ KDP   +          D+   K  + + D
Sbjct: 179 NKMNIRTYMILEALQLGYHVIHSDVDVVFFKDPTERIF---------DLCHFKDPEKACD 229

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKT 286
           + P    G     +  +++R ++ +  + KK       E  +K  K +DQ A N A+   
Sbjct: 230 VAPLWDSGAHN--AGFLFIRNSNASISLYKKM------EHTAKTTKIDDQKALNRAMGSL 281

Query: 287 AGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKH-----VIIHNNYITGFEKKIKRFRDF 341
             ++ +  LP + F +G  +F       +  G H     +++HNN+I   E K+ RF++ 
Sbjct: 282 KKKLRVTSLPGAEFQSGNKFF--DISHRQFAGDHPCTNCIVMHNNWIVSMEAKVYRFKEM 339

Query: 342 GLWLVD 347
            +WL D
Sbjct: 340 HMWLYD 345


>gi|443691598|gb|ELT93412.1| hypothetical protein CAPTEDRAFT_215597 [Capitella teleta]
          Length = 523

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 70/281 (24%), Positives = 116/281 (41%), Gaps = 44/281 (15%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L +A       G I++  V   Y+    N   +   +   D  L +  D+     +  R 
Sbjct: 77  LKEAIRARVIGGAIVLVVVDSGYIEMAINLHRTSFEKLQIDNYLFVGIDHQVCSGL--RL 134

Query: 142 PGHAVLVPPA--PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
            G   +       +  +   +GS  F   T  +   +L  L+LG+ V+  DVD+V+ K+P
Sbjct: 135 HGVVCVTHEGFMGEKNSDSNWGSTEFMQKTHFKTRVVLQGLQLGFQVLITDVDVVFFKNP 194

Query: 200 FPYLQ-GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL-RPTDGAKLVMKK 257
           FPY    D D+  ++D++              G  G         YL RPT  A+     
Sbjct: 195 FPYFTCSDCDIEISNDIS-------------EGNSG--------FYLARPTSPAR----- 228

Query: 258 WIEELQAEPWSKAKKANDQPAFNWALNK------TAGQVDLYLLPQSAFPTGGLYFKN-Q 310
               L A  W   K A D+ +   ALN+         ++ +  L +  FP G  YF++  
Sbjct: 229 ---TLHASAWEIGKVAGDRISNQKALNRMLENMQMKNEIKIKFLSKYLFPNGVDYFESGH 285

Query: 311 TWVEETKG--KHVIIHNNYITGFEKKIKRFRDFGLWLVDDH 349
            +  ++    + V++HNN+I     K+ RF++ GLWL D H
Sbjct: 286 RYFADSPACSQCVMVHNNWILTKAAKVYRFKETGLWLNDRH 326


>gi|390356485|ref|XP_003728803.1| PREDICTED: uncharacterized protein LOC100893403 isoform 1
           [Strongylocentrotus purpuratus]
 gi|390356487|ref|XP_003728804.1| PREDICTED: uncharacterized protein LOC100893403 isoform 2
           [Strongylocentrotus purpuratus]
          Length = 359

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 57/258 (22%), Positives = 118/258 (45%), Gaps = 21/258 (8%)

Query: 92  NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH-AVLVPP 150
           N  +I+ +V+  ++    NWL S+ R   +  + ++AED  T    + R      VL   
Sbjct: 120 NKFVILTSVNAEFVDIAENWLESLRRLGIRYNITMVAEDQDTFKYFSIRANREFRVLYQK 179

Query: 151 APDSQTAHKFGSQGFF-NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
                   ++G    +     RR  ++  +LE G +V+  DVD VW+K+P   +  ++ +
Sbjct: 180 QYAFNFTRRYGPVMMYQELIRRRTVYIRTLLEQGNDVLLVDVDTVWIKNPVDLILKEYAL 239

Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
           Y   D+   +   + + +P          C C +Y++ T     ++++W++ L+ +    
Sbjct: 240 Y---DIWLAQ--GYENHIP----------CPCFMYMKSTSEVVAMVREWVKRLENQRGRS 284

Query: 270 AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYIT 329
            +  +DQ A    L      + +  +    FPTG  YF ++ W+     +  + H N++ 
Sbjct: 285 LE--SDQVALTKVL-PLFPDIPIQKMSFEKFPTGKHYF-DRAWLALHTKEVYVAHGNHLG 340

Query: 330 GFEKKIKRFRDFGLWLVD 347
             + K K+ ++FGLW+++
Sbjct: 341 RHDGKKKKLKEFGLWMLN 358


>gi|443697523|gb|ELT97963.1| hypothetical protein CAPTEDRAFT_218172 [Capitella teleta]
          Length = 509

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 66/274 (24%), Positives = 121/274 (44%), Gaps = 36/274 (13%)

Query: 90  AKNGTIIVCAVSQPYLPF-LNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           A N T++V  V   Y  F +N +  SI +Q  ++  L I  D     +++ R    A++ 
Sbjct: 97  AINSTVVVSIVDSDYFSFAVNFYQFSIVKQDIRN-FLAICLDDVVSQQLSARGIPCALVN 155

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-DH 207
                   A  +G++ ++  T+ +   +L +L   Y+V+  D+D+   +DP+P+    + 
Sbjct: 156 VSLNIGSGASDYGAKSYYQKTNLKTYIMLELLRHKYSVLLTDLDVTLFRDPWPHFTCTEC 215

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGA-KLVMKKWIEELQAEP 266
           D++F  D                    R  + S  ++ RPT G+ +L  K W      + 
Sbjct: 216 DLHFQMD--------------------RVLLNSGFVFARPTPGSIQLYSKAW------QY 249

Query: 267 WSKAKKANDQPAFNWALNKTAGQ--VDLYLLPQSAFPTGGLYFKNQTWVEETK---GKHV 321
           + +  KA+DQ   N A  +   +  V ++ LP+  F  G  YF+    +        + +
Sbjct: 250 YVQYNKAHDQAYINMAARELTQKKLVRIHELPRKTFACGVYYFQQDGRMFYNHPPCEQCI 309

Query: 322 IIHNNYITGFEKKIKRFRDFGLWLV-DDHAVESP 354
           + HNNYI     KI R R+  LW+  ++H  + P
Sbjct: 310 MAHNNYIGSVSAKIYRLRENLLWVENENHYYDDP 343


>gi|297838873|ref|XP_002887318.1| hypothetical protein ARALYDRAFT_476193 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333159|gb|EFH63577.1| hypothetical protein ARALYDRAFT_476193 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 537

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 66/273 (24%), Positives = 105/273 (38%), Gaps = 29/273 (10%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           KN T+++      Y   L +W+  + R K  + ++   +D    + +    P       P
Sbjct: 271 KNKTVVLSIAGYSYKDMLMSWVCRLRRLKVPNFLVCALDDETYQFSILQGLPVFFDPYAP 330

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
              S     FGS+ F   T  +   +L IL+LGYNV+ +DVD+ W ++P P LQ      
Sbjct: 331 KNISFNDCHFGSKCFQRVTKVKSRTVLKILKLGYNVLLSDVDVYWFRNPLPLLQSFGPSV 390

Query: 211 FT---DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
            T   D+     P++    L            S   +    D     M+K ++       
Sbjct: 391 LTAQSDEYNTTVPINRPRRLN-----------SGFYFAHSDDPTIAAMEKVVKHAATSGL 439

Query: 268 SKA-------------KKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
           S+               +  D        N T   +D  L P  A+  G L+ K     E
Sbjct: 440 SEQPSFYDTLCGEGGIYRVGDDKCLEPETNLTVHFLDRELFPNGAY--GDLWLKEDVRGE 497

Query: 315 ETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
             K    ++HNN+I+G  KK++R    GLW  D
Sbjct: 498 CEKKHCFVLHNNWISGRLKKLERQMMEGLWEYD 530


>gi|443693448|gb|ELT94805.1| hypothetical protein CAPTEDRAFT_212986 [Capitella teleta]
          Length = 466

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/246 (23%), Positives = 112/246 (45%), Gaps = 28/246 (11%)

Query: 108 LNNWLISISRQKHQDQVLVIA-EDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           +N +L SI +   ++ + V + E++ + ++   R      +   A D  +   + S+ F 
Sbjct: 99  MNFYLTSIQKYDIRNVLFVSSSEEFCSRFRA-IRVACFVYMNESAHDKTSV--YLSKDFI 155

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
           N  + R   +L  L+LGY+V+++DVD+V+ +DP   +          D+   K      D
Sbjct: 156 NKMNIRTYMILEALQLGYHVIHSDVDVVFFRDPTERIL---------DLCHFKDTKKVCD 206

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKT 286
           +      G     +  +++R +  +  + KK       E  +K    +DQ A N A+   
Sbjct: 207 VAALWDSGAHN--AGFLFIRNSSASISMYKKM------EHTAKTTNIDDQKALNGAMRSL 258

Query: 287 AGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKH-----VIIHNNYITGFEKKIKRFRDF 341
             ++ +  LP + F +G  +F N     +  G H     +++HNN+I   E K+ RF++ 
Sbjct: 259 KKELRITSLPGAEFQSGLQFFDNSH--RQFAGDHPCTNCIVMHNNWIVSMEAKVYRFKEM 316

Query: 342 GLWLVD 347
            +WL D
Sbjct: 317 HMWLYD 322


>gi|300120973|emb|CBK21215.2| unnamed protein product [Blastocystis hominis]
          Length = 421

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 70/273 (25%), Positives = 118/273 (43%), Gaps = 31/273 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED--YATLYKVNGRWPGHAVL 147
           A N TII+      Y+    N   +    K+++  +   +   Y  LYK+N       V 
Sbjct: 31  AMNKTIILYCTDNSYIDLFLNGYYASQLWKYENLAVTCFDRLCYQKLYKLNIPVALLNVE 90

Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
              + D   A    ++ F N    +      ++ +   ++Y D D++ L++P  YL    
Sbjct: 91  SDTSVDITKAAICHTKAFHNKVHYKLVLWQIVINMNIQILYVDSDVILLQNPLFYLNS-- 148

Query: 208 DVYFTD-DMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
              FT  D+ A +  +               +CS  +Y+ PT   KL +++ I   +  P
Sbjct: 149 ---FTGYDIIAQRDGE---------------LCSGFMYMYPTRNTKLAVERSI---KIRP 187

Query: 267 WSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWV-EETKGKHVIIHN 325
             K   ANDQ A   A  +T     L LLP    P+G ++FK+  +  +    K + IHN
Sbjct: 188 --KLPDANDQDALIAAF-RTIRDFRLKLLPFDTVPSGEVFFKSHHYYWDPISPKQLTIHN 244

Query: 326 NYITGFEKKIKRFRDFGLWLVD-DHAVESPLGK 357
           NYI G E K+ RF++  ++ +D +    +PL K
Sbjct: 245 NYIHGLENKLYRFKEIKMYKLDRNREYSNPLAK 277


>gi|443691108|gb|ELT93068.1| hypothetical protein CAPTEDRAFT_218142 [Capitella teleta]
          Length = 550

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/246 (23%), Positives = 112/246 (45%), Gaps = 28/246 (11%)

Query: 108 LNNWLISISRQKHQDQVLVIA-EDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           +N +L SI +   ++ + V + E++ + ++   R      +   A D  +   + S+ F 
Sbjct: 142 MNFYLTSIQKYDIRNVLFVSSSEEFCSRFRA-IRVACFVYMNESAHDKTSV--YLSKDFI 198

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
           N  + R   +L  L+LGY+V+++DVD+V+ +DP   +          D+   K      D
Sbjct: 199 NKMNIRTYMILEALQLGYHVIHSDVDVVFFRDPTERIL---------DLCHFKDTKKVCD 249

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKT 286
           +      G     +  +++R +  +  + KK       E  +K    +DQ A N A+   
Sbjct: 250 VAALWDSGAHN--AGFLFIRNSSASISMYKKM------EHTAKTTNIDDQKALNGAMRSL 301

Query: 287 AGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKH-----VIIHNNYITGFEKKIKRFRDF 341
             ++ +  LP + F +G  +F N     +  G H     +++HNN+I   E K+ RF++ 
Sbjct: 302 KKELRITSLPGAEFQSGLQFFDNSH--RQFAGDHPCTNCIVMHNNWIVSMEAKVYRFKEM 359

Query: 342 GLWLVD 347
            +WL D
Sbjct: 360 HMWLYD 365


>gi|449478754|ref|XP_004155410.1| PREDICTED: uncharacterized LOC101214056 [Cucumis sativus]
          Length = 1456

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 63/268 (23%), Positives = 108/268 (40%), Gaps = 19/268 (7%)

Query: 91   KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
            KN TI++      Y   L +W+  + R +  + ++   +     + V    P +   +PP
Sbjct: 1182 KNKTIVLAIAGYSYKDMLMSWVCRLRRLQISNYLVCALDSDTYKFSVLQGLPVYRDPLPP 1241

Query: 151  APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
               S     FG++ F   T  +   +L IL+LGYNV+ +DVD+ W  +P P+      +Y
Sbjct: 1242 TNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFMNPLPF------IY 1295

Query: 211  FTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA 270
                   V   D      P     R  + S   + R  +     M+K ++       S+ 
Sbjct: 1296 TFGSGVLVAQSDEYKKTGPINLPRR--LNSGFYFARSDESTIAAMEKVVKHATTSEQSEQ 1353

Query: 271  KKANDQPAFNWALNKTAG--------QVDLYLLPQSAFPTG---GLYFKNQTWVEETKGK 319
                D       +N+            + ++ L ++ FP G   GL+ K        K  
Sbjct: 1354 PSFYDTLCGEGGINRVGSNKCLEPETNLTIHFLDRNLFPNGAYQGLWNKKNVKSACRKKG 1413

Query: 320  HVIIHNNYITGFEKKIKRFRDFGLWLVD 347
              ++HNN+I+G  KK++R    GLW  D
Sbjct: 1414 CFVLHNNWISGRLKKLERQMFSGLWDYD 1441


>gi|443704245|gb|ELU01390.1| hypothetical protein CAPTEDRAFT_211846 [Capitella teleta]
          Length = 528

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 144/323 (44%), Gaps = 39/323 (12%)

Query: 35  LLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGT 94
           + V+N      ++  + V  VI           P  T S   + +   SQA     ++  
Sbjct: 50  IAVMNTEAKAEVMKTVEVTAVISVTAEVNAEKIPKQTFSFTSYSEALSSQAD---GESNV 106

Query: 95  IIVCAVSQPYLPF-LNNWLISISRQKHQDQVLVIA-EDYATLYKVNGRWPGHAVLVPPAP 152
           + +        P  +N +L SI +   ++ + V + E++ + ++   +      +   A 
Sbjct: 107 VFLAFADMGGFPMAMNFYLTSIQKYDIRNVLFVSSSEEFCSRFQA-IQVACFVYMNESAH 165

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D  +   + S+ F N  + R   +L  L+LGY+V+++DVD+V+ +DP   +      YF 
Sbjct: 166 DKTSV--YLSEDFINKMNIRTYMILEALQLGYHVIHSDVDVVFFRDPTERILD--LCYFK 221

Query: 213 D-----DMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           D     D+AA+   D +H+             +  +++R ++ +  + KK       +  
Sbjct: 222 DPQKVCDVAALWDFD-AHN-------------AGFLFIRKSNASISMYKKM------KHT 261

Query: 268 SKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKN--QTWVEETKGKHV-IIH 324
           +K  K +DQ A N A+N +  ++ +  LP   F  G  YF N  + +V +    +  ++H
Sbjct: 262 AKTTKIDDQMALNGAMN-SMKELRITSLPVEEFQCGLKYFDNSHRQFVGDNPCTNCTVMH 320

Query: 325 NNYITGFEKKIKRFRDFGLWLVD 347
           NN+I   E KI RF++  +WL D
Sbjct: 321 NNWIVSMEAKIYRFKEMHMWLYD 343


>gi|297827043|ref|XP_002881404.1| hypothetical protein ARALYDRAFT_902666 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327243|gb|EFH57663.1| hypothetical protein ARALYDRAFT_902666 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 640

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/290 (25%), Positives = 118/290 (40%), Gaps = 43/290 (14%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 128 VKDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAMDTKLLEAL--YWKGVPVFDM 184

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 185 GSHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDMVWLKNPMPYLARFPDA 244

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC-----MIYLRPTDGAKLVMKKWIEEL 262
           DV  + D      +D S D           IC C     + + RPT+ AK + K+W E L
Sbjct: 245 DVLTSSDQVVPTVIDDSLD-----------ICWCCLQHRIFHWRPTESAKKLAKEWKEIL 293

Query: 263 QAEPWSKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYF 307
            A+      K  DQ  FN  + +                 G + + +LP S F +G  YF
Sbjct: 294 LAD-----DKVWDQNGFNEIVRRQLGPSVDGDSGLFYAYDGNLKVGILPASIFCSGHTYF 348

Query: 308 KNQTWVEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLG 356
             Q   ++ + +   +H  +   G E K  R R+  ++       +SP G
Sbjct: 349 V-QAMYQQLRLEPYAVHTTFQYAGTEGKRHRLREGMVFYDPPEYYDSPGG 397


>gi|328873065|gb|EGG21432.1| WD40 repeat-containing protein [Dictyostelium fasciculatum]
          Length = 684

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 87/206 (42%), Gaps = 25/206 (12%)

Query: 155 QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY-LQGD---HDVY 210
           + +  +G  GF    + +P  +L +L+ GYNV++ D D+VW+ DP  Y LQ D   + + 
Sbjct: 153 EESESYGGVGFRAICNEKPLVVLDVLKRGYNVLWTDTDIVWMADPLKYILQLDLSFNGLS 212

Query: 211 FTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE----ELQAEP 266
                     +           K   Y+ + + +L P    ++ M+ +++     LQ  P
Sbjct: 213 DLIIQQDDDDVCAGFYYIKSNPKTIKYMETVISFLNPVVDDQIAMRLFLKGNAINLQQHP 272

Query: 267 WSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNN 326
            +K K                 Q+    L ++ FP G  YF N           +IIHNN
Sbjct: 273 DTKDKD----------------QIKYLKLDRALFPNGTAYF-NLKLSHRRNVSPIIIHNN 315

Query: 327 YITGFEKKIKRFRDFGLWLVDDHAVE 352
            I G   K  RF D+ LW V D+ V+
Sbjct: 316 CIIGHRSKKDRFIDYNLWFVKDYQVD 341


>gi|443703204|gb|ELU00880.1| hypothetical protein CAPTEDRAFT_192549 [Capitella teleta]
          Length = 541

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/200 (29%), Positives = 95/200 (47%), Gaps = 30/200 (15%)

Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
           S+T   FGS+ F    + R   +LH L+ GYNV+++D D+ +  +PFP ++         
Sbjct: 185 SETGASFGSKAFKEKMNIRTFMVLHALKEGYNVLHSDCDIYYFANPFPVIKE-----LCG 239

Query: 214 DMAAVKPL-DH-SHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAK 271
               V PL D+ SH+             +  +Y R T   K+ +  + E +    ++   
Sbjct: 240 SECDVAPLWDYRSHN-------------AGFLYTRST---KMGIAMY-EHMNKTAYTTT- 281

Query: 272 KANDQPAFNWALNK-TAGQVDLYLLPQSAFPTGGLYFK--NQTWVEETKGKHVII-HNNY 327
            A+DQ A N A+   T   + L  LP   F +G L+F   N+T+ E+      I+ HNN+
Sbjct: 282 -ADDQTALNRAIGHCTKIGLKLVRLPTEQFQSGKLFFGDGNRTFAEDNPCTTCIVAHNNW 340

Query: 328 ITGFEKKIKRFRDFGLWLVD 347
           I G   K  RF++  +W+ D
Sbjct: 341 IVGIAVKEYRFKEMHMWVYD 360


>gi|323457174|gb|EGB13040.1| hypothetical protein AURANDRAFT_60713 [Aureococcus anophagefferens]
          Length = 878

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 69/258 (26%), Positives = 111/258 (43%), Gaps = 35/258 (13%)

Query: 93  GTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYAT--LYKVNGRWPGHAVLVPP 150
           GT ++  VS  +     NW    ++     +VL++AED  T   +  +  W      +  
Sbjct: 28  GTAVLVTVSHGFEDMFRNWWYWYAKLDLNMEVLLLAEDEPTRSAFAASRNWTVITTTMGA 87

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP-YLQGDHDV 209
            P     H + +  +    SRRP ++L  L+    ++Y DVD VWL DP P +  G +D 
Sbjct: 88  LPTQD--HTYETPNYKKLVSRRPHNILRGLDKYPRLIYTDVDTVWLSDPRPHFAAGAYDF 145

Query: 210 YFT-DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
           +   DD     P                Y C+  +  + T   + ++ KW   L A+P  
Sbjct: 146 WGQLDDGTPQSP----------------YYCTGFLAFKNTAPTRALLSKWAVALDAKP-- 187

Query: 269 KAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYI 328
                 +QP FN  L +   +     L ++ FP+G LYF  +      +   V++HNN++
Sbjct: 188 ----NLNQPLFNDLLKREPAK--HCPLSKALFPSGNLYFDGK-----KRDGVVVVHNNFV 236

Query: 329 TGFEKKIKRFRDFGLWLV 346
            G  KKI RF+  G W V
Sbjct: 237 VGKAKKIARFKQVGFWAV 254


>gi|449435412|ref|XP_004135489.1| PREDICTED: uncharacterized protein LOC101214056 [Cucumis sativus]
          Length = 1693

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 63/271 (23%), Positives = 112/271 (41%), Gaps = 25/271 (9%)

Query: 91   KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
            KN TI++      Y   L +W+  + R +  + ++   +     + V    P +   +PP
Sbjct: 1419 KNKTIVLAIAGYSYKDMLMSWVCRLRRLQISNYLVCALDSDTYKFSVLQGLPVYRDPLPP 1478

Query: 151  APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD-- 208
               S     FG++ F   T  +   +L IL+LGYNV+ +DVD+ W  +P P++       
Sbjct: 1479 TNISFNDCHFGTECFQRVTKVKSRMVLRILKLGYNVLLSDVDVYWFMNPLPFIYTFGSGV 1538

Query: 209  -VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
             V  +D+     P+    +LP         + S   + R  +     M+K ++       
Sbjct: 1539 LVAQSDEYKKTGPI----NLP-------RRLNSGFYFARSDESTIAAMEKVVKHATTSEQ 1587

Query: 268  SKAKKANDQPAFNWALNKTAG--------QVDLYLLPQSAFPTG---GLYFKNQTWVEET 316
            S+     D       +N+            + ++ L ++ FP G   GL+ K        
Sbjct: 1588 SEQPSFYDTLCGEGGINRVGSNKCLEPETNLTIHFLDRNLFPNGAYQGLWNKKNVKSACR 1647

Query: 317  KGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
            K    ++HNN+I+G  KK++R    GLW  D
Sbjct: 1648 KKGCFVLHNNWISGRLKKLERQMFSGLWDYD 1678


>gi|428175509|gb|EKX44399.1| UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase
           [Guillardia theta CCMP2712]
          Length = 1433

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 65/265 (24%), Positives = 109/265 (41%), Gaps = 29/265 (10%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
           I +   +  YL +  NWL  + R    D  ++ A D      + G      V   P  D 
Sbjct: 50  IAITFSNSGYLNYCMNWLHHV-RSVGVDNYVIFALDAEAYSSLKGEAN---VFYDPRLDE 105

Query: 155 ----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF--PYLQGDHD 208
               + A  FGS  F      +P   L +LELG++++ +D D+VW KDPF  P + G H 
Sbjct: 106 GKIDKRATDFGSDPFKKIVHLKPTLTLRVLELGFHLLLSDADVVWFKDPFSVPEVVGSHL 165

Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
              +D        D++    P       ++ S   Y+ P       M++ +  L + P  
Sbjct: 166 NLMSD-----AHFDYAMGNTP------YFVNSGFAYMSPHPTTIAFMREVVRLLASRPDK 214

Query: 269 KAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYI 328
             + A +    NW   +TA  +   ++  +    G +YF  +   + +    V +HNN+ 
Sbjct: 215 MDQDAYNTAISNWE-RRTAESLTFSIMDPARVSNGWVYFM-RMLGQRSGADLVAVHNNWA 272

Query: 329 TG------FEKKIKRFRDFGLWLVD 347
            G        +K+ RFR+  LW+ D
Sbjct: 273 DGQGDGNTHVQKVHRFREHLLWMSD 297


>gi|452824988|gb|EME31987.1| hypothetical protein Gasu_07340 [Galdieria sulphuraria]
          Length = 658

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 70/284 (24%), Positives = 119/284 (41%), Gaps = 39/284 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIA---EDYATLYKVNGR--WPG 143
           VA N T+I+ A++  Y  FL N++ ++ +       L++A   ED        G   +  
Sbjct: 385 VAINNTVILVAMNYGYRSFLMNFVCNLRQLNLFPGNLIVAALDEDMYRFAFTRGLPVYFE 444

Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
           + V       S  A  +GS  F   T  +   +L IL+LGY+V++ D D+VW ++P PYL
Sbjct: 445 NTVYSKEDATSVVAASYGSDSFKKLTKMKSRVVLRILKLGYDVIWTDCDIVWFRNPIPYL 504

Query: 204 QG-DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           Q  D D+    +       ++S+D        R  I S     R         +  I+  
Sbjct: 505 QSQDADLIIQSN---APDNENSND--------RRRINSGFYLARSNPHTIEAFEDVIQ-- 551

Query: 263 QAEPWSKAKKANDQPAFN--WALNKTA-----------GQVDLYLLPQSAFPTG---GLY 306
               ++   +  +QP F   W   +             G+  + LL +  FP G   G++
Sbjct: 552 ----FAAKSRMTEQPCFYDLWCGKEGENAQGKERCIYKGKFHVLLLDRKLFPNGITEGIW 607

Query: 307 FKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHA 350
                 ++E      I+HNN++ G E K++R+   G    D+H 
Sbjct: 608 DSPAGRIQELFPHLYILHNNWVKGNEGKMERYYRHGYIFYDNHT 651


>gi|443713062|gb|ELU06069.1| hypothetical protein CAPTEDRAFT_202855 [Capitella teleta]
          Length = 526

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 73/293 (24%), Positives = 132/293 (45%), Gaps = 44/293 (15%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPF-LNNWLISISRQKHQDQVLV 126
           P  T S   + +   SQA S   ++  + +        P  +N +L SI R   ++ + V
Sbjct: 80  PQQTFSFTSYSEALSSQADS---ESKVVFLAFADMGGFPMAMNFYLTSIQRYDIRNVLFV 136

Query: 127 IA-EDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN 185
            + E++ + ++          +   A D   A  + S+ F N  + R   +L  L+LGY+
Sbjct: 137 SSSEEFCSRFQA-IEVACFVYMNESAHDK--ASVYLSKDFINKMNIRTYMILEALQLGYH 193

Query: 186 VMYNDVDMVWLKDPFPYLQGDHDV-YFTD-----DMAAVKPLDHSHDLPPPGKKGRTYIC 239
           V+++DVD+V+ K+P   +    DV +F D     D+AA         L   G     ++ 
Sbjct: 194 VIHSDVDVVFFKNPTERI---FDVCHFRDPEKVCDVAA---------LWDSGAHNAGFL- 240

Query: 240 SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSA 299
               ++R ++ A + M K +E       +K    +DQ A N A+     ++ +  LP + 
Sbjct: 241 ----FIRNSN-ASISMYKSMEHT-----AKTTNIDDQKALNGAMRSLKKELRITSLPGAE 290

Query: 300 FPTGGLYFKNQTWVEETKGKH-----VIIHNNYITGFEKKIKRFRDFGLWLVD 347
           F +G  +F N     +  G H     +++HNN+I   E K+ RF++  +WL D
Sbjct: 291 FQSGLQFFDNSQ--RQFAGDHPCTHCIVMHNNWIVSMEAKVYRFKEMHMWLYD 341


>gi|390356493|ref|XP_003728806.1| PREDICTED: uncharacterized protein LOC100893660 [Strongylocentrotus
           purpuratus]
          Length = 318

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 114/270 (42%), Gaps = 43/270 (15%)

Query: 92  NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
           N  +I+  V+  ++    NWL S+ R   +  + ++AED  T    + R           
Sbjct: 79  NKFVILTTVNAEFVDIAENWLESLRRLGIRYNITMVAEDQDTFEYFSIR----------- 127

Query: 152 PDSQTAHKFGSQGFFNFTSR-------------RPCHLLHILELGYNVMYNDVDMVWLKD 198
            + +    +  Q  FNFT R             R  ++  +LE G +V+  DVD VW+K+
Sbjct: 128 ANREFRVLYQKQFAFNFTRRYSPAKLYEEVIRCRTVYIRTLLEQGNDVLLVDVDTVWIKN 187

Query: 199 PFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
           P   +  ++ +Y          + H+   P          C C +Y++ T     ++++W
Sbjct: 188 PVDLILKEYALY---------DIWHAQGYP------EGMPCPCFMYMKSTSEVVAMVREW 232

Query: 259 IEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           ++ L+    S  +  +DQ A    L      + +  +    FP G  YF N  W    K 
Sbjct: 233 VKRLENHRGSSLE--SDQVALINVL-PLFPAIPIKKMSIEKFPPGFHYF-NWAWFALHKN 288

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
           +  + H N +   + K K+ ++FGLW++++
Sbjct: 289 EVYVAHANNLGRHDGKKKKLKEFGLWMLNN 318


>gi|30686478|ref|NP_850250.1| xyloglucanase 113 [Arabidopsis thaliana]
 gi|18377747|gb|AAL67023.1| unknown protein [Arabidopsis thaliana]
 gi|330254034|gb|AEC09128.1| xyloglucanase 113 [Arabidopsis thaliana]
          Length = 644

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 113/269 (42%), Gaps = 30/269 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 129 VKDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAMDTKLLEAL--YWKGVPVFDM 185

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 186 GSHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDMVWLKNPMPYLARFPDA 245

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S D+    + G  Y    + + RPT+ AK + K+W E L A+  
Sbjct: 246 DVLTSSDQVVPTVIDDSLDIWQ--QVGAAYNIG-IFHWRPTESAKKLAKEWKEILLAD-- 300

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  + +                 G + + +LP S F +G  YF  Q  
Sbjct: 301 ---DKVWDQNGFNEIVRRQLGPSVEGDSGLFYAYDGNLKVGILPASIFCSGHTYFV-QAM 356

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
            ++ + +   +H  +   G E K  R R+
Sbjct: 357 YQQLRLEPYAVHTTFQYAGTEGKRHRLRE 385


>gi|222423247|dbj|BAH19600.1| AT2G35610 [Arabidopsis thaliana]
          Length = 644

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 113/269 (42%), Gaps = 30/269 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 129 VKDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAVDTKLLEAL--YWKGVPVFDM 185

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 186 GSHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDMVWLKNPMPYLARFPDA 245

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S D+    + G  Y    + + RPT+ AK + K+W E L A+  
Sbjct: 246 DVLTSSDQVVPTVIDDSLDIWQ--QVGAAYNIG-IFHWRPTESAKKLAKEWKEILLAD-- 300

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  + +                 G + + +LP S F +G  YF  Q  
Sbjct: 301 ---DKVWDQNGFNEIVRRQLGPSVEGDSGLFYAYDGNLKVGILPASIFCSGHTYFV-QAM 356

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
            ++ + +   +H  +   G E K  R R+
Sbjct: 357 YQQLRLEPYAVHTTFQYAGTEGKRHRLRE 385


>gi|413946045|gb|AFW78694.1| hypothetical protein ZEAMMB73_007904 [Zea mays]
          Length = 551

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 76/306 (24%), Positives = 124/306 (40%), Gaps = 40/306 (13%)

Query: 64  GFMFPNATSSLAKWRDYTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQD 122
           G++   ++ + A    Y+LS     VA +NG++++    + Y   L +W   + R +  +
Sbjct: 243 GYLHGCSSETSAPDLPYSLSMLLDLVADENGSVVLGVAGKSYRDMLMSWACRLRRLRVTN 302

Query: 123 QVLVIAEDYATL-YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
             LV A D+ T  + +    P     + P   S     FG++ F   T  +   +L IL 
Sbjct: 303 -FLVCALDHETYEFSILQGLPVFRDPLSPTNVSFDDCHFGTKCFQQVTKVKSRIVLEILR 361

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYF---TDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           LGYNV+ +DVD+ W  +P P L       F   +D+     P+    +LP     G    
Sbjct: 362 LGYNVLLSDVDVYWFGNPMPLLYSLGLATFGAQSDEYNETGPI----NLPRRLNSG---- 413

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG---------- 288
                Y   +D A ++  + I +  A     +   ++QP+F   L    G          
Sbjct: 414 ----FYFARSDNATIIAMEMIVKHAA-----SSGLSEQPSFYDVLCGENGTNRIGEDRCL 464

Query: 289 ----QVDLYLLPQSAFPTG---GLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDF 341
                + +  L +  FP G   GL+ K        +    IIHNN+I    KK+ R    
Sbjct: 465 EPNTNLTVVFLDRDLFPNGAYKGLWEKRDVRATCKELGCFIIHNNWINKRRKKVHRQMVS 524

Query: 342 GLWLVD 347
           GLW  D
Sbjct: 525 GLWDYD 530


>gi|359496284|ref|XP_002271910.2| PREDICTED: uncharacterized protein LOC100242526 [Vitis vinifera]
          Length = 874

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/286 (24%), Positives = 117/286 (40%), Gaps = 32/286 (11%)

Query: 80  YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVN 138
           ++L    S +A KN TI++      Y   L +W+  +      + V+   +     + + 
Sbjct: 432 FSLESLLSVIADKNKTIVLAVAGYSYKDMLMSWVCRLRSLLITNFVVCALDHDVYQFSLL 491

Query: 139 GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
              P     + P+  S     FG++ F   T  +   +L IL+LGYNV+ +DVD+ W K+
Sbjct: 492 QGLPVFEDPLAPSDISFDDCHFGTKCFQRVTKSKSRLVLQILKLGYNVLMSDVDVYWFKN 551

Query: 199 PFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
           P P L      Y       V   D   +  P     R  + S   + R  D     M+K 
Sbjct: 552 PLPLL------YSFGPAILVAQSDEYKETGPINLPRR--LNSGFYFARSDDTTIAGMEKV 603

Query: 259 IEELQAEPWSKAKKANDQPAFNWALNKTAGQVDL--------------YLLPQSAFPTGG 304
           ++       + +   ++QP+F  +L    G   L              + L ++ FP G 
Sbjct: 604 VKH------AASSNLSEQPSFYDSLCGEGGSYRLGDNRCLEPETNLTVHFLDRNLFPNGA 657

Query: 305 ---LYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
              L+ K +     +K    I+HNN+I+G  KK++R    GLW  D
Sbjct: 658 YQNLWEKKKWKSVCSKKDCFILHNNWISGRRKKLQRQVLSGLWGYD 703


>gi|296080952|emb|CBI18645.3| unnamed protein product [Vitis vinifera]
          Length = 677

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/286 (24%), Positives = 117/286 (40%), Gaps = 32/286 (11%)

Query: 80  YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVN 138
           ++L    S +A KN TI++      Y   L +W+  +      + V+   +     + + 
Sbjct: 329 FSLESLLSVIADKNKTIVLAVAGYSYKDMLMSWVCRLRSLLITNFVVCALDHDVYQFSLL 388

Query: 139 GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
              P     + P+  S     FG++ F   T  +   +L IL+LGYNV+ +DVD+ W K+
Sbjct: 389 QGLPVFEDPLAPSDISFDDCHFGTKCFQRVTKSKSRLVLQILKLGYNVLMSDVDVYWFKN 448

Query: 199 PFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
           P P L      Y       V   D   +  P     R  + S   + R  D     M+K 
Sbjct: 449 PLPLL------YSFGPAILVAQSDEYKETGPINLPRR--LNSGFYFARSDDTTIAGMEKV 500

Query: 259 IEELQAEPWSKAKKANDQPAFNWALNKTAGQVDL--------------YLLPQSAFPTGG 304
           ++       + +   ++QP+F  +L    G   L              + L ++ FP G 
Sbjct: 501 VKH------AASSNLSEQPSFYDSLCGEGGSYRLGDNRCLEPETNLTVHFLDRNLFPNGA 554

Query: 305 ---LYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
              L+ K +     +K    I+HNN+I+G  KK++R    GLW  D
Sbjct: 555 YQNLWEKKKWKSVCSKKDCFILHNNWISGRRKKLQRQVLSGLWGYD 600


>gi|357131567|ref|XP_003567408.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 362

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/224 (24%), Positives = 101/224 (45%), Gaps = 28/224 (12%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVP-PAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
           D VLV+A D   L       PGH  L+   + +  +A++F S+G+      +      +L
Sbjct: 127 DHVLVVAVDTGALAHCEAVHPGHCYLLEVKSANISSANRFMSKGYLELVWAKLQLQHRVL 186

Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM--AAVKPLDHSHDLPPPGKKGRTYI 238
           +LGYN ++ DVD++WL+DPF ++    D+  + D      + L+++   P  G       
Sbjct: 187 QLGYNYLFTDVDIMWLRDPFRHISLYADMAVSTDRFNGDAEALNNA---PNTG------- 236

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA-GQVDL-YLLP 296
                Y++ T+    ++++W      +   +   A+DQ  F+      A G++ L ++  
Sbjct: 237 ---FYYVKSTNRTVEMVQRW-----RDARHRFTGAHDQAVFDEIKADLAHGELRLRFVFL 288

Query: 297 QSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRD 340
            +A   G   F+     +E  G+   +H N   G E K+   R+
Sbjct: 289 DTALFGGFCQFR-----DEIDGRVCAMHANCCIGLENKVHDLRN 327


>gi|340369123|ref|XP_003383098.1| PREDICTED: hypothetical protein LOC100635634 [Amphimedon
           queenslandica]
          Length = 393

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 63/284 (22%), Positives = 118/284 (41%), Gaps = 39/284 (13%)

Query: 78  RDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKV 137
           + Y  + A     K+G  +V  V+  ++    NW+ S  +    + +L+I  +       
Sbjct: 102 KHYPCNFATPIPIKDGVSVVTFVNAGWINLTKNWICSARKVGLGEHILLITVE------- 154

Query: 138 NGRWPGHAVLVPPAP------DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYN-VMYND 190
               P      P  P       + ++ KFG  G+  F   R   +L +L  G   ++  D
Sbjct: 155 ----PNVCSNFPDTPCHYEKGAAISSTKFGQPGYQKFMIERTKIILRLLSCGIKKLLLAD 210

Query: 191 VDMVWLKDPFPYLQGD---HDVYFTDDMAAVKPLDH-SHDLPPPGKKGRTYICSCMIYLR 246
            D+V+L++P   L  +    D+    D   ++ +D  ++++ P       YIC   +YL 
Sbjct: 211 ADIVFLQNPLKRLDTELEYRDIVLQRDSTGLQVIDSLAYNVFP-------YICGGFMYLN 263

Query: 247 PTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLY 306
             +  KL+ +  ++  + + W      NDQ   N  +   +  ++  LLP S FP G  Y
Sbjct: 264 VNNKTKLLYQSVLQFQRNQSW------NDQAGLNICIRHHSLHINWTLLPLSLFPNGKEY 317

Query: 307 FKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHA 350
           F    W    + + +I+H N+ +G  +KI       +W  +  A
Sbjct: 318 F--DFWA--NREQPLIVHANFKSGSMEKITSMIIRDIWCYEKIA 357


>gi|326427878|gb|EGD73448.1| hypothetical protein PTSG_05152 [Salpingoeca sp. ATCC 50818]
          Length = 749

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 121/302 (40%), Gaps = 53/302 (17%)

Query: 80  YTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
           +TL      VA+N  +I+  VS  Y   + N++ ++ R    DQ+++ A D   +Y+   
Sbjct: 442 FTLKDLLPIVARNKHVILSGVSYIYRDVVMNFVCNLRRLGIYDQLILAAFD-EEMYRFGF 500

Query: 140 RWPGHAVLVPPAPD----SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
           R  G  +    + D    S    ++GS  F   T  +   +L IL++GY+V + D D+VW
Sbjct: 501 RM-GLPIFYYQSDDLAGLSSRDLEYGSDAFKKVTKLKSQVVLQILQMGYDVTWTDTDIVW 559

Query: 196 LKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKK---GRTYICSCMIYLRPTDGAK 252
            +DP P L           MA         + P P ++   G   I S    +R T    
Sbjct: 560 FEDPIPKL-----------MAMESDFVVQSNAPFPDERVANGPLRINSGFYRVRSTPVTI 608

Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFNWAL------NKTAGQ----------------- 289
             M++ +        + +    +QP+F   L        T G                  
Sbjct: 609 AAMQQIVAH------AASSTMTEQPSFYIVLCGGKEGTTTVGDNKCHYRPPPDLIPHDQG 662

Query: 290 ---VDLYLLPQSAFPTGGL-YFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWL 345
              +++  L +  +P G +  + N   +     + VI+HNN+I G   KI+R    GLW 
Sbjct: 663 NALLEVEFLDRRQYPNGNVGGYWNSDNIHVRHPQIVILHNNWIKGLRAKIERLVLRGLWY 722

Query: 346 VD 347
            D
Sbjct: 723 YD 724


>gi|255089627|ref|XP_002506735.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
 gi|226522008|gb|ACO67993.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
          Length = 746

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 78/306 (25%), Positives = 133/306 (43%), Gaps = 42/306 (13%)

Query: 61  GTPG-FMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQK 119
           G PG ++ P   S LA+    T      + A++  ++V   +  Y  F+ NW+ ++ R+ 
Sbjct: 199 GKPGEYVAPK--SKLAEDNQLTAELVGRY-AEDNIVMVTWANHHYHDFVRNWVRNV-RKC 254

Query: 120 HQDQVLVIAEDYATLYK-VNGRWPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLL 177
                +V A D   L K ++   P  A+         T   FG     F+   R+   L+
Sbjct: 255 GMRNYMVGAMDNELLEKLIDDEVPTFAM-----QSGLTTKDFGWGTANFHKMGRKKIELI 309

Query: 178 HIL-ELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKG 234
           H+  E+G++++ +DVD VWL++P PY+      DV  + D  A        + P   +K 
Sbjct: 310 HLFTEMGFDILVSDVDTVWLRNPLPYMAKYPHADVLTSSDHLANTAEGEGLEDP---RKA 366

Query: 235 RTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA------- 287
            +     ++ LR  D AK + K+W++ L+     K  K  DQ  FN    +         
Sbjct: 367 HSAANIGIMLLR--DTAKELAKEWVDVLE-----KDDKVWDQNVFNDLYRRGGGPSVKDD 419

Query: 288 --------GQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY-ITGFEKKIKRF 338
                   G++ + +LP S F +G  YF  Q   E+   +  ++H  +  +G E K  R 
Sbjct: 420 KNVVTGYDGKLKVGILPVSMFASGHTYFV-QRMHEKVGVEPYVVHATFQYSGTEGKRHRM 478

Query: 339 RDFGLW 344
           R+  LW
Sbjct: 479 REALLW 484


>gi|51451344|gb|AAU03099.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 765

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 103/274 (37%), Gaps = 30/274 (10%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           KN ++++      Y   L +W+  + R +  + V+   +     + V    P     + P
Sbjct: 477 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 536

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
              S     FG+Q F   T  +   +L IL LGYNV+ +DVD+ W  +P  +L       
Sbjct: 537 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVYWFHNPVSFLHSLGPGT 596

Query: 211 F---TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           F   +D+     P++    L            S   Y R  D     M+  ++       
Sbjct: 597 FAAQSDEFNQTGPINMPRRLN-----------SGFYYARSDDATITAMEMIVKHATNSGL 645

Query: 268 SK-------------AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
           S+             A +  D      + N T   +   + P  A+  GGL+ K      
Sbjct: 646 SEQPSFYDILCGKDGANRIGDDRCLEPSTNLTIVFLSRDMFPNGAY--GGLWEKKHGVSS 703

Query: 315 ETKG-KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
             +    VIIHNN++ G  KK+ R    GLW  D
Sbjct: 704 ACRELGCVIIHNNWVNGRRKKLHRQMASGLWDYD 737


>gi|300120627|emb|CBK20181.2| unnamed protein product [Blastocystis hominis]
          Length = 370

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 63/280 (22%), Positives = 121/280 (43%), Gaps = 48/280 (17%)

Query: 81  TLSQAASFVAKNGTIIVCAVSQPYL-PFLNNWLISISRQKHQDQVLVIAED---YATLYK 136
           TL+     VA N  +++      YL  F  ++++S  +       +V+A D   Y TL K
Sbjct: 100 TLADLVPKVAINRVVMLTFTDSSYLESFYASYVVS--KLDQYSNFIVVAVDMNAYITLSK 157

Query: 137 VNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
             G    +   + P   + +   FGS  F    + +   +  ++ L ++V+  D D+V  
Sbjct: 158 -QGYPVAYFESLLPENLTCSESSFGSNQFHLKMANKMQIIRQVILLDHSVLLFDSDVVLF 216

Query: 197 KDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPT-------D 249
           +DP P +    +                +DL     +G   IC+  I+ RPT       D
Sbjct: 217 RDPIPTILAYQN----------------YDLIAQKDEG---ICAGFIFFRPTLQSLQFID 257

Query: 250 GAKLVMKKWIEELQAEPWSKA-KKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFK 308
                 ++W+  +  +P  +  ++ + +P+  W L           LPQ+ F +G ++F+
Sbjct: 258 LVLQYQQRWV--MSDQPVMEILRQMHVRPSLRWTL-----------LPQNTFSSGSVFFE 304

Query: 309 N-QTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
           + Q + +      VI+HNNY+ G+  KI R ++  ++ +D
Sbjct: 305 SHQFYWDAIDANQVIMHNNYVIGYTNKIYRLKEMKMYPLD 344


>gi|443714935|gb|ELU07133.1| hypothetical protein CAPTEDRAFT_189031 [Capitella teleta]
          Length = 568

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 92/211 (43%), Gaps = 30/211 (14%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           +S T   FGS  F    + R   +LH L+ GYNV+++D D+ +  +PFP ++        
Sbjct: 213 NSGTGASFGSTAFKEKMNIRTFMVLHALKEGYNVLHSDCDVYYFANPFPVIKE-----LC 267

Query: 213 DDMAAVKPL-DH-SHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA 270
                V PL D+ +H+             +  +Y R T     + K        E  +  
Sbjct: 268 GSECDVAPLWDYVTHN-------------AGFLYTRSTTMGIALYKNM------EHTALK 308

Query: 271 KKANDQPAFNWALNK-TAGQVDLYLLPQSAFPTGGLYFKN--QTWVEETKGKHVII-HNN 326
              +DQ A   A+   T   + L  LP   F +G L+F +  +T+ E+      I+ HNN
Sbjct: 309 TGRDDQSALKTAVEDCTKNGLRLVSLPTEQFQSGRLFFGDGKRTFAEDNPCSTCIVAHNN 368

Query: 327 YITGFEKKIKRFRDFGLWLVDDHAVESPLGK 357
           +I G E K  RF++  +W+ D     S  G+
Sbjct: 369 WIKGIEAKEYRFKEMHMWVYDGDEYYSSTGR 399


>gi|222632211|gb|EEE64343.1| hypothetical protein OsJ_19183 [Oryza sativa Japonica Group]
          Length = 691

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 103/274 (37%), Gaps = 30/274 (10%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           KN ++++      Y   L +W+  + R +  + V+   +     + V    P     + P
Sbjct: 403 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 462

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
              S     FG+Q F   T  +   +L IL LGYNV+ +DVD+ W  +P  +L       
Sbjct: 463 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVYWFHNPVSFLHSLGPGT 522

Query: 211 F---TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           F   +D+     P++    L            S   Y R  D     M+  ++       
Sbjct: 523 FAAQSDEFNQTGPINMPRRLN-----------SGFYYARSDDATITAMEMIVKHATNSGL 571

Query: 268 SK-------------AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
           S+             A +  D      + N T   +   + P  A+  GGL+ K      
Sbjct: 572 SEQPSFYDILCGKDGANRIGDDRCLEPSTNLTIVFLSRDMFPNGAY--GGLWEKKHGVSS 629

Query: 315 ETKG-KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
             +    VIIHNN++ G  KK+ R    GLW  D
Sbjct: 630 ACRELGCVIIHNNWVNGRRKKLHRQMASGLWDYD 663


>gi|297724249|ref|NP_001174488.1| Os05g0513700 [Oryza sativa Japonica Group]
 gi|255676488|dbj|BAH93216.1| Os05g0513700 [Oryza sativa Japonica Group]
          Length = 689

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 103/274 (37%), Gaps = 30/274 (10%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           KN ++++      Y   L +W+  + R +  + V+   +     + V    P     + P
Sbjct: 401 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 460

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
              S     FG+Q F   T  +   +L IL LGYNV+ +DVD+ W  +P  +L       
Sbjct: 461 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVYWFHNPVSFLHSLGPGT 520

Query: 211 F---TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           F   +D+     P++    L            S   Y R  D     M+  ++       
Sbjct: 521 FAAQSDEFNQTGPINMPRRLN-----------SGFYYARSDDATITAMEMIVKHATNSGL 569

Query: 268 SK-------------AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
           S+             A +  D      + N T   +   + P  A+  GGL+ K      
Sbjct: 570 SEQPSFYDILCGKDGANRIGDDRCLEPSTNLTIVFLSRDMFPNGAY--GGLWEKKHGVSS 627

Query: 315 ETKG-KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
             +    VIIHNN++ G  KK+ R    GLW  D
Sbjct: 628 ACRELGCVIIHNNWVNGRRKKLHRQMASGLWDYD 661


>gi|218197096|gb|EEC79523.1| hypothetical protein OsI_20607 [Oryza sativa Indica Group]
          Length = 830

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 103/274 (37%), Gaps = 30/274 (10%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           KN ++++      Y   L +W+  + R +  + V+   +     + V    P     + P
Sbjct: 542 KNRSVVLAVAGASYRDMLMSWVCRLRRLRVTNFVVCALDQETYEFSVLQGMPVSRDTLSP 601

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
              S     FG+Q F   T  +   +L IL LGYNV+ +DVD+ W  +P  +L       
Sbjct: 602 NNVSFDDCHFGTQCFQQVTKVKSRIVLKILRLGYNVLLSDVDVHWFHNPVSFLHSLGPGT 661

Query: 211 F---TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           F   +D+     P++    L            S   Y R  D     M+  ++       
Sbjct: 662 FAAQSDEFNQTGPINMPRRLN-----------SGFYYARSDDATITAMEMIVKHATNSGL 710

Query: 268 SK-------------AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
           S+             A +  D      + N T   +   + P  A+  GGL+ K      
Sbjct: 711 SEQPSFYDILCGKDGANRIGDDRCLEPSTNLTVVFLSRDMFPNGAY--GGLWEKKHGVSS 768

Query: 315 ETKG-KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
             +    VIIHNN++ G  KK+ R    GLW  D
Sbjct: 769 ACRELGCVIIHNNWVNGRRKKLHRQMASGLWDYD 802


>gi|308813097|ref|XP_003083855.1| unnamed protein product [Ostreococcus tauri]
 gi|116055737|emb|CAL57822.1| unnamed protein product [Ostreococcus tauri]
          Length = 588

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 82/197 (41%), Gaps = 33/197 (16%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQG---DHDVYFTDDMAAVKPLDHSHDLPPPGK 232
           L+ ILE G++V+ +D D+ WL+DP  +++    D DV  + D   V   D          
Sbjct: 113 LVDILERGHDVLVSDADVAWLRDPEDWVRTALRDVDVAASTDCLDVSADDVGGCWGAAAN 172

Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA----- 287
            G       ++Y RPTD AK  +  WI  +        ++  DQ  FN  L   A     
Sbjct: 173 TG-------ILYFRPTDAAKTFLGNWIAAMDKATDDMTER--DQEIFNNLLVADASGDTK 223

Query: 288 --------------GQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKH-VIIHNNY-ITGF 331
                         G V L  LP   F +G  +F  Q    E + +    +H  +  +  
Sbjct: 224 VTEGGSSVNVRVVKGGVRLGSLPMRYFASGHTFFVEQLHKHEGESQQPFCVHATFQFSQV 283

Query: 332 EKKIKRFRDFGLWLVDD 348
             K +RFR+ GLWL+++
Sbjct: 284 HGKRQRFREHGLWLIEN 300


>gi|224129610|ref|XP_002328759.1| predicted protein [Populus trichocarpa]
 gi|222839057|gb|EEE77408.1| predicted protein [Populus trichocarpa]
          Length = 638

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/269 (26%), Positives = 113/269 (42%), Gaps = 30/269 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+ +W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 123 VKDNVIIVTFGNYAFMDFILSWVKHLTDLGLSN-LLVGAMDTKLLEAL--YWKGIPVFDM 179

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  IL  G  ++  D DMVWLKDP PYL    + 
Sbjct: 180 GSHMSTADVGWGSPTFHKMGREKVILIDAILPYGVELLMCDTDMVWLKDPLPYLARYPEA 239

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S DL    + G  Y    + + RPT+ AK + ++W + L A+  
Sbjct: 240 DVLTSSDQVVPTVVDDSLDL--WQQVGAAYNIG-IFHWRPTESAKKLAREWKDMLLAD-- 294

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  + K                 G + L +LP S F +G  YF  Q  
Sbjct: 295 ---DKIWDQNGFNDIVRKQLGPSVDGDSGLAYAFDGNLKLGVLPASIFCSGHTYFV-QAM 350

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
            ++ + +   +H  +   G E K  R R+
Sbjct: 351 YQQLRLEPYAVHTTFQYAGTEGKRHRLRE 379


>gi|242091071|ref|XP_002441368.1| hypothetical protein SORBIDRAFT_09g025390 [Sorghum bicolor]
 gi|241946653|gb|EES19798.1| hypothetical protein SORBIDRAFT_09g025390 [Sorghum bicolor]
          Length = 740

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/290 (24%), Positives = 114/290 (39%), Gaps = 40/290 (13%)

Query: 80  YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL-YKV 137
           Y+L      VA KN ++++      Y   L +W+  + R       +V A D+ T  + V
Sbjct: 439 YSLGMLLELVADKNRSVVLGVAGAGYRDMLMSWVCRL-RHLRVTNFIVCAVDHETYEFSV 497

Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
               P     + P   S     FG++ F   T  +   +L IL LGYNV+ +DVD+ W  
Sbjct: 498 LQGLPVFIDPLSPKNVSIDDCHFGTKCFQQVTKVKSRIVLEILRLGYNVLLSDVDVYWFD 557

Query: 198 DPFPYLQGDHDVYF---TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLV 254
           +P P+L       F   +D+     P+    +LP     G         Y   +D A + 
Sbjct: 558 NPMPFLYSLGPAIFGAQSDEYNETGPI----NLPRRLNSG--------FYFARSDNATIT 605

Query: 255 MKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG--------------QVDLYLLPQSAF 300
             + I        +     ++QP+F   L    G               +++  L +  F
Sbjct: 606 AMEMI-----VGHATNSGLSEQPSFYDVLCGENGINRIGDDKCLEPNTNLNVVFLNRDLF 660

Query: 301 PTG---GLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
           P G   GL+ K+       +    IIHNN+I   +KK+ R    GLW  D
Sbjct: 661 PNGAYKGLWEKHDVRAACKELGCFIIHNNWINKRKKKLHRQMSSGLWDYD 710


>gi|242033773|ref|XP_002464281.1| hypothetical protein SORBIDRAFT_01g015590 [Sorghum bicolor]
 gi|241918135|gb|EER91279.1| hypothetical protein SORBIDRAFT_01g015590 [Sorghum bicolor]
          Length = 615

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 117/285 (41%), Gaps = 27/285 (9%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           A++G I+V   +  +L F+  W+  ++     D +LV A D   L ++  R  G  V   
Sbjct: 100 ARDGVIVVTFGNYAFLDFILTWVRHLT-DLGVDNLLVGAMDTKLLRELYFR--GVPVFDM 156

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  +     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 157 GSRMATEDAGWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 216

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           D+  + D       D S +     ++        + + RPT+ AK + K W + + ++  
Sbjct: 217 DLLTSSDQVIPTVTDDSLE---NWREVTGAFNIGIFHWRPTEPAKRLAKDWKDLVISD-- 271

Query: 268 SKAKKANDQPAFNWALNKTAGQ--------------VDLYLLPQSAFPTGGLYFKNQTWV 313
               K  DQ AFN  + K  GQ              + L +LP S F +G  YF    + 
Sbjct: 272 ---DKLWDQNAFNDLVRKNFGQPVQGGDLVYSYDGKLKLGVLPASIFCSGHTYFVQGMYK 328

Query: 314 EETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           +     + +       G E K  R R+  L+       +SP G L
Sbjct: 329 QLHLEPYAVHTTFQYAGTEGKRHRLREAMLFFDQPSYYDSPGGFL 373


>gi|384244588|gb|EIE18088.1| hypothetical protein COCSUDRAFT_34634 [Coccomyxa subellipsoidea
           C-169]
          Length = 193

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/202 (24%), Positives = 83/202 (41%), Gaps = 25/202 (12%)

Query: 159 KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAV 218
           ++GS GF      R      +LE G+  +++D+D VW ++P   +    D    DD    
Sbjct: 2   EYGSAGFDEMMCERLTFQRKVLEHGFTFLWSDMDTVWYQNPLDIMPKGFDFVGVDD---- 57

Query: 219 KPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPA 278
                S+  P   ++    +C C ++ RPT  +K  +K W +    +        +DQ A
Sbjct: 58  -----SYHGPKHLEQNTGNLCGCFMFWRPTQRSKDFLKDWYDNCAHQ------AGDDQQA 106

Query: 279 FN--WALNKTAGQVDLYLLPQSAFPTGGLYFKNQT--WV--EETKGKHVI----IHNNYI 328
            N  W       ++  Y++P+  FP+G     N    W   E+    H +    IH N  
Sbjct: 107 LNRMWNSADMKQKLHWYIMPRQLFPSGTPALSNLKIDWSPNEDPARPHTLFPAWIHANCR 166

Query: 329 TGFEKKIKRFRDFGLWLVDDHA 350
           TG E K    ++   W + D +
Sbjct: 167 TGHEAKRGFLKERLAWNITDDS 188


>gi|296085465|emb|CBI29197.3| unnamed protein product [Vitis vinifera]
          Length = 642

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 119/287 (41%), Gaps = 30/287 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 128 VKDNVIIVTFGNYAFMDFILTWVKHLTDLGISN-LLVGAMDTKLLEAL--YWRGVPVFDM 184

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +G+  F      +   +  +L  GY ++  D DMVWLK+P PYL      
Sbjct: 185 GSHMSTIDVGWGTPTFHKMGREKVILIDALLPFGYELLMCDTDMVWLKNPLPYLARFPGA 244

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D   D+    + G  Y    + + RPTD AK + ++W E L A+  
Sbjct: 245 DVLTSSDQVVPTVVDDRLDIWQ--QVGAAYNIG-IFHWRPTDSAKKLAREWKEMLLAD-- 299

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  + +                 G + L LLP S F +G  YF  Q+ 
Sbjct: 300 ---DKIWDQNGFNDLVRRQLGPSVDEDSGLAYAYDGNLKLGLLPASIFCSGHTYFV-QSM 355

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
            ++ + +   +H  +   G + K  R R+  ++       +SP G L
Sbjct: 356 YQQLRLEPYAVHTTFQYAGTDGKRHRLREAKVFYDPPEYYDSPGGFL 402


>gi|359474749|ref|XP_002269181.2| PREDICTED: uncharacterized protein LOC100263017 [Vitis vinifera]
          Length = 631

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 119/287 (41%), Gaps = 30/287 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 117 VKDNVIIVTFGNYAFMDFILTWVKHLTDLGISN-LLVGAMDTKLLEAL--YWRGVPVFDM 173

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +G+  F      +   +  +L  GY ++  D DMVWLK+P PYL      
Sbjct: 174 GSHMSTIDVGWGTPTFHKMGREKVILIDALLPFGYELLMCDTDMVWLKNPLPYLARFPGA 233

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D   D+    + G  Y    + + RPTD AK + ++W E L A+  
Sbjct: 234 DVLTSSDQVVPTVVDDRLDIWQ--QVGAAYNIG-IFHWRPTDSAKKLAREWKEMLLAD-- 288

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  + +                 G + L LLP S F +G  YF  Q+ 
Sbjct: 289 ---DKIWDQNGFNDLVRRQLGPSVDEDSGLAYAYDGNLKLGLLPASIFCSGHTYFV-QSM 344

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
            ++ + +   +H  +   G + K  R R+  ++       +SP G L
Sbjct: 345 YQQLRLEPYAVHTTFQYAGTDGKRHRLREAKVFYDPPEYYDSPGGFL 391


>gi|356496487|ref|XP_003517099.1| PREDICTED: uncharacterized protein LOC100776463 [Glycine max]
          Length = 627

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 112/269 (41%), Gaps = 30/269 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     +  LV A D   L  +   W G  V   
Sbjct: 112 VKDNIIIVTFGNYAFMDFILTWVEQLNDLGVSN-YLVGAMDTKLLEAL--YWKGIPVFDM 168

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  IL  GY ++  D DMVWLK+P PYL      
Sbjct: 169 GSHMSTDDVGWGSPTFHKMGREKVILINLILPFGYELLMCDTDMVWLKNPLPYLASYPKA 228

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S ++    + G  Y    + + RPT+ AK + K+W E L A+  
Sbjct: 229 DVLTSSDQVVPTVVDDSLEI--WQEVGAAYNIG-IFHWRPTESAKKLAKEWKELLLAD-- 283

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  +++                 G + L +LP S F +G  YF  Q  
Sbjct: 284 ---DKIWDQNGFNDIVHRQLGPSVDEDSGLVYAYDGNLKLGILPSSIFCSGHTYFV-QAM 339

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
            ++ + +   +H  +   G E K  R R+
Sbjct: 340 YQQLRLEPYAVHTTFQYAGTEGKRHRLRE 368


>gi|356560983|ref|XP_003548765.1| PREDICTED: uncharacterized protein LOC100819032 [Glycine max]
          Length = 639

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 113/272 (41%), Gaps = 26/272 (9%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  +IV   +  ++ F+  W+  + R       LV A D   +  +   W G  V   
Sbjct: 121 VKDNAVIVTFGNYAFMDFILTWVKQL-RDLGVSNFLVGAMDTKLVEAL--YWKGIPVFDM 177

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  IL  G+ ++  D DMVWLK+P PYL    + 
Sbjct: 178 GSHMSTVDVGWGSPTFHKMGREKVILIDSILPFGFELLMCDTDMVWLKNPLPYLARYPEA 237

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP- 266
           DV  + D      +D S +  P            + + RPT+ AK + K+W E L A+  
Sbjct: 238 DVLTSSDQVIPTVVDDSLENWPEVSGAYNI---GIFHWRPTESAKKLAKQWKEMLLADDQ 294

Query: 267 -WSK-----------AKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
            W +               +D+    +A +   G++ L +LP S F +G  YF  Q   +
Sbjct: 295 IWDQNGFNDIVHRQLGPSVDDESGLVFAFD---GKLKLGILPASIFCSGHTYFV-QAMYQ 350

Query: 315 ETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWL 345
           + + +   +H  +   G E K  R R+  L+L
Sbjct: 351 QLRLEPYAVHTTFQYGGTEGKRHRLREAMLFL 382


>gi|255557809|ref|XP_002519934.1| reticulon3-A3, putative [Ricinus communis]
 gi|223540980|gb|EEF42538.1| reticulon3-A3, putative [Ricinus communis]
          Length = 639

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 75/298 (25%), Positives = 122/298 (40%), Gaps = 31/298 (10%)

Query: 62  TPGFMFPNATSSLAKWRDYTLS-QAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKH 120
           TP +  P   S +   + + L+ Q      K+  IIV   +  ++ F+  W+  ++    
Sbjct: 96  TPIWEIPPPNSKMLPIKKFRLTKQLVEKRVKDNVIIVTFGNFAFMDFILTWVKHLTDLGL 155

Query: 121 QDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
            + +LV A D   L  +   W G  V    +  S     +GS  F      +   +   L
Sbjct: 156 SN-LLVGAMDTKLLEAL--YWKGVPVFDMGSHMSTADVGWGSPTFHKMGREKVILIDAFL 212

Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
             G+ ++  D DMVWLK+P PYL    D DV  + D      +D   D+    + G  Y 
Sbjct: 213 PFGFELLMCDTDMVWLKNPLPYLARYPDADVLTSSDQVVPTVVDDRLDIWQ--EVGAAYN 270

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA----------- 287
              + + RPT+ +K + K+W E L A+      K  DQ  FN  + +             
Sbjct: 271 IG-IFHWRPTESSKKLAKEWKEILLAD-----DKIWDQNGFNDIVRRQLGPSVDDDSGLV 324

Query: 288 ----GQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
               G + L +LP S F +G  YF  Q   ++ + +   +H  +   G E K  R R+
Sbjct: 325 YAFDGNLKLGILPASIFCSGHTYFV-QAMYQQLRLEPYAVHTTFQYAGTEGKRHRLRE 381


>gi|226493177|ref|NP_001143144.1| uncharacterized protein LOC100275624 [Zea mays]
 gi|195614994|gb|ACG29327.1| hypothetical protein [Zea mays]
 gi|223947567|gb|ACN27867.1| unknown [Zea mays]
 gi|238009756|gb|ACR35913.1| unknown [Zea mays]
 gi|414871583|tpg|DAA50140.1| TPA: hypothetical protein ZEAMMB73_624371 [Zea mays]
          Length = 615

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 116/285 (40%), Gaps = 27/285 (9%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           A++G I+V   +  +L F+  W+  ++     D +LV A D   L ++  R  G  V   
Sbjct: 100 ARDGVIVVTFGNYAFLDFILTWVRHLT-DLGVDNLLVGAMDTKLLRELYFR--GVPVFDM 156

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +        +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 157 GSRMVTEDAGWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 216

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           D+  + D       D S +     ++        + + RPT+ AK + K W + + ++  
Sbjct: 217 DLLTSSDQVIPTVTDDSLE---NWREVTGAFNIGIFHWRPTEPAKRLAKDWKDLVISD-- 271

Query: 268 SKAKKANDQPAFNWALNKTAGQ--------------VDLYLLPQSAFPTGGLYFKNQTWV 313
               K  DQ AFN  + K  GQ              + L +LP S F +G  YF    + 
Sbjct: 272 ---DKLWDQNAFNDLVRKKFGQPVQGDDLVYSYDGKLKLGVLPASIFCSGHTYFVQGMYK 328

Query: 314 EETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           +     + +       G E K  R R+  L+       +SP G L
Sbjct: 329 QLHLEPYAVHTTFQYAGTEGKRHRLREAMLFFDQPSYYDSPGGFL 373


>gi|412992379|emb|CCO20092.1| predicted protein [Bathycoccus prasinos]
          Length = 980

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 130/285 (45%), Gaps = 35/285 (12%)

Query: 78  RDYTLSQA-ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYK 136
            D  L++A A   +    ++V   +  Y  F+ NW+  I R    +  LV A D   L +
Sbjct: 469 EDNQLNEALAKRYSHENIVMVTWANNHYYDFVKNWVKHI-RDCGMNNFLVGAMDNELLVR 527

Query: 137 -VNGRWPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDM 193
            ++ + P  A+         T   FG     F+   R+   L+H+  ++G++++ +DVD 
Sbjct: 528 LIDDKVPTFAM-----QSGLTTADFGWGSKNFHQMGRKKIELIHLFTKMGFDILVSDVDT 582

Query: 194 VWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC-MIYLRPTDGAK 252
            W+K+P P+++   +V   D + +   L    +     ++  T + +  ++ LR + GA 
Sbjct: 583 AWMKNPIPFIRKFPEV---DVLTSSDSLSDFAETEWSLERTTTGMANIGIMLLRKSAGA- 638

Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQ------------VDLYLLPQSAF 300
            + K+W+  L+     K +   DQ AFN  + K  G+            +   +LP + F
Sbjct: 639 -LAKEWVHVLE-----KDENIWDQNAFNDLMRKGRGKSFPDHSFEGYDGLRFGILPVATF 692

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLW 344
            +G  +F  + + E+ K +  ++H  +  +G E K  R R+F LW
Sbjct: 693 ASGHTFFVQRMY-EKKKLEPYVVHATFQFSGTEGKRHRLREFKLW 736


>gi|145343482|ref|XP_001416351.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576576|gb|ABO94644.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 500

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 95/244 (38%), Gaps = 46/244 (18%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH-DVYF 211
           D   A    S+ +  F +     L  +L+ G++V+ +DVD+VWL+D  PY + +  DV  
Sbjct: 50  DLNDAADHASESWKKFCALMVKELRAVLDAGFDVILSDVDVVWLRDAAPYFKCESGDVDG 109

Query: 212 TDDMAAVKPLDHSHDLPPP-----GKK--GRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
            +++     +  S +L P      G +   R    + M+++R T   K  +  W   LQA
Sbjct: 110 CEEIRGADVMISSDNLSPTMDWELGARYAMRGIFNTGMMFIRNTRAGKDFLSDWARNLQA 169

Query: 265 EPWSKAKKANDQPAFNWALNK----------------------------TAGQVDLYLLP 296
           +  + +K    Q  FN  + +                            T     +  LP
Sbjct: 170 KDGAYSKLTTHQQVFNKMVREENAWPGLDVAPGASAKTRVLQSGSPLPSTGSPFAIGALP 229

Query: 297 QSAFPTGGLYFKNQTWVEETKG-----KHVIIHNNYI---TGFEKKIKRFRDFGLWLVDD 348
              F  G  YF    WV    G     K   +H  Y    +G E K  RF++ GLW + D
Sbjct: 230 LRLFVNGHGYF--MQWVNRKDGVWDDVKPYAVHATYTFDGSGGEAKRYRFQEVGLWALHD 287

Query: 349 HAVE 352
            A E
Sbjct: 288 AAGE 291


>gi|147856952|emb|CAN79199.1| hypothetical protein VITISV_040773 [Vitis vinifera]
          Length = 715

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 111/269 (41%), Gaps = 30/269 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 156 VKDNVIIVTFGNYAFMDFILTWVKHLTDLGISN-LLVGAMDTKLLEAL--YWRGVPVFDM 212

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +G+  F      +   +  +L  GY ++  D DMVWLK+P PYL      
Sbjct: 213 GSHMSTIDVGWGTPTFHKMGREKVILIDALLPFGYELLMCDTDMVWLKNPLPYLARFPGA 272

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D   D+    + G  Y    + + RPTD AK + ++W E L A+  
Sbjct: 273 DVLTSSDQVVPTVVDDRLDIWQ--QVGAAYNIG-IFHWRPTDSAKKLAREWKEMLLAD-- 327

Query: 268 SKAKKANDQPAFNWALNKTA---------------GQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  + +                 G + L LLP S F +G  YF  Q+ 
Sbjct: 328 ---DKIWDQNGFNELVRRQLGPSVDEDSGLAYAYDGNLKLGLLPASIFCSGHTYFV-QSM 383

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
            ++ +     +H  +   G + K  R R+
Sbjct: 384 YQQLRLXPYAVHTTFQYAGTDGKRHRLRE 412


>gi|356540910|ref|XP_003538927.1| PREDICTED: uncharacterized protein LOC100789338 [Glycine max]
          Length = 632

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 70/269 (26%), Positives = 113/269 (42%), Gaps = 30/269 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  II+   +  ++ F+  W+  ++     +  LV A D   L  +   W G  V   
Sbjct: 117 VKDNIIILTFGNYAFMDFILTWVEQLNDLGVSN-YLVGAMDIKLLEAL--YWKGIPVFDM 173

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  IL  GY ++  D DMVWLK+P PYL    + 
Sbjct: 174 GSHMSTDDVGWGSPTFHKMGREKVLLINLILPFGYELLMCDTDMVWLKNPLPYLARYPEA 233

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S ++    + G  Y    + + RPT+ AK + K+W E L A+  
Sbjct: 234 DVLTSSDQVVPTVVDDSLEI--WQEVGAAYNIG-IFHWRPTESAKKLAKEWKELLLAD-- 288

Query: 268 SKAKKANDQPAFNWALNKTA---------------GQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  +++                 G + L +LP S F +G  YF  Q  
Sbjct: 289 ---DKIWDQNGFNDIVHRQLGPSVDEDSGLVYAYDGNLKLGILPSSIFCSGHTYFV-QAM 344

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
            ++ + +   +H  +   G E K  R R+
Sbjct: 345 YQQLRLEPYAVHTTFQYAGTEGKRHRLRE 373


>gi|357142580|ref|XP_003572620.1| PREDICTED: uncharacterized protein LOC100829311 [Brachypodium
           distachyon]
          Length = 614

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 74/289 (25%), Positives = 120/289 (41%), Gaps = 34/289 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           A++G I+V   +  +L F+  W+  ++     D +LV A D   L ++  R  G  V   
Sbjct: 98  ARDGVIVVTFGNYAFLDFILTWVHHLTGLG-VDNLLVGAMDTKLLRELYLR--GVPVFDM 154

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  +     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 155 GSRMATEDAGWGSPAFHKMGREKVLLINTLLPFGYELLMCDTDMVWLKNPLPYLARYPDA 214

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA--E 265
           D+  + D       D S +     ++        + + RPT+ AK + K W + + +  E
Sbjct: 215 DILTSSDQVIPTVTDDSLE---KWREVSGAFNIGIFHWRPTEPAKRLAKDWKDLVMSNDE 271

Query: 266 PWSKAKKANDQPAFNWALNKTAGQ---------------VDLYLLPQSAFPTGGLYFKNQ 310
            W       DQ AFN  + K  GQ               + L +LP + F  G  YF  Q
Sbjct: 272 IW-------DQNAFNDLVRKVFGQPVQGQDELVYSYDGKLKLGVLPATIFCNGHTYFV-Q 323

Query: 311 TWVEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
              ++ + +   +H  +   G E K  R R+  L+       +SP G L
Sbjct: 324 GMHQQLRLEPYAVHTTFQYAGTEGKRHRLREAMLFFDPPPYYDSPGGFL 372


>gi|356558894|ref|XP_003547737.1| PREDICTED: uncharacterized protein LOC100780346 [Glycine max]
          Length = 638

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 118/285 (41%), Gaps = 31/285 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     +  LV A D   L  +   W G  V   
Sbjct: 121 VKDNAIIVTFGNYAFMDFILTWVKQLTDLGVSN-FLVGAMDTKLLEAL--YWKGIPVFDM 177

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  IL  G+ ++  D DMVWLK+P PYL    + 
Sbjct: 178 GSHMSTVDVGWGSPTFHKMGREKVILIDSILPFGFELLMCDTDMVWLKNPLPYLARYPEA 237

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S +  P            + + RPT+ AK + K+W E L A+  
Sbjct: 238 DVLTSSDQVIPTVVDDSLENWPEVSGAYNI---GIFHWRPTESAKKLAKQWKEMLLAD-- 292

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               +  DQ  FN  +++                 G++ L +LP S F +G  YF  Q  
Sbjct: 293 ---DQIWDQNGFNDIVHRQLGPSVDDESGLVFVFDGKLKLGILPASIFCSGHTYFI-QAM 348

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLG 356
            ++ + +   +H  +   G E K  R R+  +  +D     +PLG
Sbjct: 349 YQQLRLEPYAVHTTFQYGGTEGKRHRLRE-AMHFLDPPEYYNPLG 392


>gi|147790757|emb|CAN65937.1| hypothetical protein VITISV_008966 [Vitis vinifera]
          Length = 546

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 91/217 (41%), Gaps = 31/217 (14%)

Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
           + P+  S     FG++ F   T  +   +L IL+LGYNV+ +DVD+ W K+P P L    
Sbjct: 325 LAPSDISFDDCHFGTKCFQRVTKSKSRLVLQILKLGYNVLMSDVDVYWFKNPLPLL---- 380

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
             Y       V   D   +  P     R  + S   + R  D     M+K ++       
Sbjct: 381 --YSFGPAILVAQSDEYKETGPINLPRR--LNSGFYFARSDDTTIAGMEKVVKH------ 430

Query: 268 SKAKKANDQPAFNWALNKTAGQVDL--------------YLLPQSAFPTGGL--YFKNQT 311
           + +   ++QP+F  +L    G   L              + L ++ FP G     ++ + 
Sbjct: 431 AASSNLSEQPSFYDSLCGEGGSYRLGDNRCLEPETNLTVHFLDRNLFPNGAYQNLWEKKK 490

Query: 312 WVEETKGKHV-IIHNNYITGFEKKIKRFRDFGLWLVD 347
           W      K   I+HNN+I+G  KK++R    GLW  D
Sbjct: 491 WKSVCXKKDCFILHNNWISGRRKKLQRQVLSGLWGYD 527


>gi|302813272|ref|XP_002988322.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300144054|gb|EFJ10741.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 557

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 69/298 (23%), Positives = 118/298 (39%), Gaps = 32/298 (10%)

Query: 68  PNATSSLAKWRDYTLS-QAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV 126
           P   S L     + LS +   F AK   I+V   +  +  F+  W+  ++       +L+
Sbjct: 14  PPHGSELPPREAFVLSKEMVEFRAKKNVIMVTFANHAFEDFVLTWVRHLT-DVGVTNLLI 72

Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
            A D   L ++   W G  V    +  +     +G+  F      +   +  I+ +G+ V
Sbjct: 73  GAMDRKILEELF--WKGVPVFDMGSEMNPADVGWGTPVFHKMGREKVFLVNAIMAMGFEV 130

Query: 187 MYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
           ++ D DMVW+K+P PY++   D DV  + D       D S +     ++    +   + +
Sbjct: 131 LFCDTDMVWMKNPLPYMERYPDADVLVSSDAVIATVTDESLE---DWRRSYAALNIGIFH 187

Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL-NKTAGQVD------------ 291
            RPT+ AK   + W  +L+ E      K  DQ  FN  + N T   VD            
Sbjct: 188 WRPTEAAKEFARAWQIQLEDE------KIWDQNGFNELIQNGTREAVDPDNDRGLFYAFD 241

Query: 292 ----LYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWL 345
               + +LP S F +G  YF    + +     + +       G E K  R R+  L+ 
Sbjct: 242 RTLKVGILPVSMFCSGHTYFVQHLYKQLGLDVYAVHTTFQFAGTEGKRHRLREAKLFF 299


>gi|357517471|ref|XP_003629024.1| hypothetical protein MTR_8g072310 [Medicago truncatula]
 gi|355523046|gb|AET03500.1| hypothetical protein MTR_8g072310 [Medicago truncatula]
          Length = 655

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 99/235 (42%), Gaps = 28/235 (11%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  +IV   +  ++ F+  W+  ++  +  +  LV A D   L  +   W G  V   
Sbjct: 122 VKDNVVIVTFGNYAFMDFILTWVKKLTDLEVSN-FLVGAMDTKLLEAL--YWKGVPVFDM 178

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL--QGDH 207
            +  S     +G+  F      +   L  IL  G+ V+  D DMVWLK+P PYL      
Sbjct: 179 GSHMSTVDVGWGTPTFHKMGREKVILLDSILPFGFEVLMCDTDMVWLKNPLPYLARHPGA 238

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           D+  + D      +D S ++    ++        + + RPT+ AK++ K+W E L A+  
Sbjct: 239 DILTSSDQVVPTVVDDSLEI---WQEVSGAYNIGIFHWRPTESAKILAKQWKEMLLAD-- 293

Query: 268 SKAKKANDQPAFNWALNKTA---------------GQVDLYLLPQSAFPTGGLYF 307
               K  DQ  FN  L+                  G++ L +LP S F +G  YF
Sbjct: 294 ---DKIWDQNGFNDILHTQLGPSVDDDSGLVYTFDGKLKLGILPASIFCSGHTYF 345


>gi|359489312|ref|XP_002270057.2| PREDICTED: uncharacterized protein At1g28695-like [Vitis vinifera]
 gi|297734642|emb|CBI16693.3| unnamed protein product [Vitis vinifera]
          Length = 346

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/146 (27%), Positives = 71/146 (48%), Gaps = 13/146 (8%)

Query: 92  NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
           N T+I+  V++ Y+          FL ++ I    ++  D +LV+A D     +   R  
Sbjct: 61  NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMADHLLVVAGDQTAYDRCIFRRL 120

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               +V    D +    + S+ F     RR   LLH+LE GY+ ++ D D+ WL++PFP 
Sbjct: 121 HCYKMVGEDGDMEGEKLYMSEDFIEMMWRRTLLLLHVLERGYSFIFTDTDVSWLRNPFPR 180

Query: 203 LQGD--HDVYFTDD--MAAVKPLDHS 224
           L  +   D+  + D  +++ +P D+S
Sbjct: 181 LTTNLTADLQISTDKFLSSHRPEDNS 206


>gi|218189628|gb|EEC72055.1| hypothetical protein OsI_04965 [Oryza sativa Indica Group]
          Length = 385

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 53/226 (23%), Positives = 92/226 (40%), Gaps = 34/226 (15%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVP---------PAPDSQTAHKFGSQGFFNFTSRR 172
           D VLV+A D     +     P H  L+           A +  +A++F S+G+      +
Sbjct: 139 DHVLVVAVDAGGFRRCKAVHP-HCYLLDVFVVSSGAGDAANLSSANRFMSRGYLELVWAK 197

Query: 173 PCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGK 232
                 +LELGY+ ++ DVD++WL+DPF ++    DV  + D       D + + P  G 
Sbjct: 198 LSLQQRVLELGYSFLFTDVDVMWLRDPFRHITLYADVTISSDHFHGDAGDVAGNSPNTG- 256

Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG---Q 289
                      ++RPT+    ++++W       P      ++DQ  F+    + AG   +
Sbjct: 257 ---------FYHVRPTNRTVEMLRRWRAARSRFP----PASHDQNVFDGIKRELAGGELR 303

Query: 290 VDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKI 335
           V +  L  + F     Y           G+   +H N   G E K+
Sbjct: 304 VRIAFLDTAVFAGFCEY-------RPDAGRVCTVHANCCVGLENKV 342


>gi|356524014|ref|XP_003530628.1| PREDICTED: uncharacterized protein LOC100788000 [Glycine max]
          Length = 770

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 62/274 (22%), Positives = 106/274 (38%), Gaps = 31/274 (11%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           K  T+I+      Y   L +W+  + +   ++ V+   +   + + +    P     + P
Sbjct: 285 KTKTVILTVAGYSYKDMLMSWVCRLRKLSIENFVVCALDKETSQFSILQGIPVFTDPIAP 344

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY 210
           +  S     FG++ F   T  +   +L IL+LGYNV+ +DVD+ W K+P P L       
Sbjct: 345 SNISFDDCHFGTKCFQRVTKVKSRIVLKILKLGYNVLLSDVDVYWFKNPVPLLHSLGPAV 404

Query: 211 F---TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
               +D+     P+    +LP         + S   Y R        ++K +   +    
Sbjct: 405 LAAQSDEYQNQGPI----NLP-------RRLNSGFYYARSDSQTIAAIEKVVRHAETSGL 453

Query: 268 SKAKKANDQPAFNWALNKTAG--------QVDLYLLPQSAFPTGGLYFKNQTWVEET--- 316
           S+     D    N   N+            + ++ L +  F  G        W E+    
Sbjct: 454 SEQPSFYDTLCGNGGSNRVGDDKCVEPETNLTVHFLDRDLFANGAY---QDLWREKNVKE 510

Query: 317 ---KGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
              K    IIHNN+I+G  KK++R    GLW  D
Sbjct: 511 ACLKKGSYIIHNNWISGRLKKLERQVLSGLWEYD 544


>gi|168006999|ref|XP_001756196.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692706|gb|EDQ79062.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 526

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/294 (24%), Positives = 121/294 (41%), Gaps = 41/294 (13%)

Query: 77  WRDYTLSQAASFVAKNGTIIVCAVS-QPYLPFLNNWLISISRQKHQDQVLVIAEDYATLY 135
           W  ++L   AS VA    +I+ +V+   Y   L +W+ S+ R    + ++   +D   LY
Sbjct: 238 WLPFSLEALASRVASPDKVIILSVAGDSYRTMLMSWVCSLRRLNISNYLVYALDD--ELY 295

Query: 136 KVNGRWPGHAVL--VPPAPDSQTAH------KFGSQGFFNFTSRRPCHLLHILELGYNVM 187
           +       HAV   VP    SQT         FG++ F   T  +   +LH+L+LG+ V+
Sbjct: 296 Q-------HAVSQGVPVVKSSQTMRVSRDDCHFGTKCFQEVTKMKSRTVLHLLQLGFKVL 348

Query: 188 YNDVDMVWLKDPFPYLQG---DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
           ++DVD+ W ++P   +        V  TD     +    + +LP     G  +  S    
Sbjct: 349 FSDVDVYWFQNPIQEMMAYGPGTLVAQTDQYNETE----AANLPRRLNSGFYFAWS---- 400

Query: 245 LRPTDGA--KLVMKKWIEELQAEP--------WSKAKKANDQPAFNWALNKTAGQVDLYL 294
            R T  A  K+V       +  +P             +  D        N TA  +D   
Sbjct: 401 DRATVAAFVKIVKHAMTSNMSEQPSFYDTMCGLDGVYRVGDDKCVEPDTNVTAIFLDRRT 460

Query: 295 LPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
            P  A  +G  + +        +    ++HNN+++G E+K+KR    GLW  D+
Sbjct: 461 YPNGA--SGNHWEQRDVRKSCEQQGCRVLHNNWVSGRERKLKRQIAAGLWDYDE 512


>gi|218193222|gb|EEC75649.1| hypothetical protein OsI_12401 [Oryza sativa Indica Group]
          Length = 622

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 115/285 (40%), Gaps = 26/285 (9%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           A+ G I+V   +  +L F+  W+  ++     D +LV A D   L ++  R  G  V   
Sbjct: 102 ARGGAIVVTFGNHAFLDFILTWVRHLT-DLGVDSLLVGAMDTKLLRELYLR--GVPVFDM 158

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
            +  +     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 159 GSRMATEDAVWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 218

Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
                   V P      L    +    Y    + + RPT+ AK + K W + L ++    
Sbjct: 219 DLLTSSDQVIPTVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSD---- 273

Query: 270 AKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
                DQ AFN  ++K                 G++ L +LP S F +G  YF  Q   +
Sbjct: 274 -DTLWDQNAFNDLIHKKFGYPVVGEDELVYSYDGKLKLGVLPASIFCSGHTYFV-QGMYQ 331

Query: 315 ETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           + + +   +H  +   G   K  R R+  L+       +SP G L
Sbjct: 332 QLRLEPYAVHTTFQYAGTAGKRHRLREAMLFFDQPSYYDSPGGFL 376


>gi|443734106|gb|ELU18210.1| hypothetical protein CAPTEDRAFT_213446 [Capitella teleta]
          Length = 490

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 92/202 (45%), Gaps = 33/202 (16%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL---QGDHDV 209
           DS  A  + S+ F    + R   +   L LG++V++ D+DM++LK+P P L   +GD  V
Sbjct: 128 DSAKASTYNSKAFKRKMNIRTFMISDALALGFSVVHTDLDMMFLKNPMPTLTSTKGDL-V 186

Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
              DD          H+             +  + +RPT+  K + KK  E  +  P   
Sbjct: 187 SLWDDFV--------HN-------------AGFLLVRPTEYGKQIYKKMDELTKKTP--- 222

Query: 270 AKKANDQPAFNWALNKTAGQVDL--YLLPQSAFPTGGLYFKNQTWVEETKGKH-VIIHNN 326
               +DQ A N A+    G+       L ++ F  G  YF+    +  +  K  +++HNN
Sbjct: 223 --SMDDQTALNRAVKGLKGKKGFKAVALNKNQFLCGLGYFEKGHRLFPSPCKECIVVHNN 280

Query: 327 YITGFEKKIKRFRDFGLWLVDD 348
           +I   E KI RF++  +W VD+
Sbjct: 281 WIVSREAKIYRFKEHFMWAVDE 302


>gi|326433586|gb|EGD79156.1| hypothetical protein PTSG_09887 [Salpingoeca sp. ATCC 50818]
          Length = 300

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/220 (24%), Positives = 93/220 (42%), Gaps = 25/220 (11%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           V+   T++V   S+ Y+  L  W+  I     +D V+V A+D  TL       PG   + 
Sbjct: 55  VSSASTVVVTMASKDYIGLLQPWVRRIKAVGIEDFVIV-AQDKETLAAAEELAPGRVRIG 113

Query: 149 PPAPDSQTAH-------KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
                  TA         +G++ +      +  ++ H++      +Y+D+D+++L +P  
Sbjct: 114 YNEDKGTTASGTAGKKSAYGNKAWKEAVENKARYVWHVIARNQTALYSDIDVIFLHNPLK 173

Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
           YL+  H V           + + +D+  P         S  IY +PT   + VM+KW + 
Sbjct: 174 YLES-HVVSIG--------MPYGNDIDEPN------FNSGFIYAKPTPQTQEVMRKW-KS 217

Query: 262 LQAEPWSKAKKANDQPAFNWALNKTAGQV-DLYLLPQSAF 300
           L  E W       DQ A N A+ +T  +  D+  LP   F
Sbjct: 218 LSKEIWRYKSPPYDQRALNEAIVQTNTKFPDIVELPLRLF 257


>gi|15235236|ref|NP_193730.1| uncharacterized protein [Arabidopsis thaliana]
 gi|2827651|emb|CAA16605.1| putative protein [Arabidopsis thaliana]
 gi|7268792|emb|CAB78997.1| putative protein [Arabidopsis thaliana]
 gi|332658853|gb|AEE84253.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 715

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 66/158 (41%), Gaps = 11/158 (6%)

Query: 90  AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
            +N T+IV  ++Q +         FL ++ I    +K    V+V+  D     + +   P
Sbjct: 451 TENRTVIVTTLNQAWAEPNSLFDLFLESFRIGQGTKKLLQHVVVVCLDSKAFARCSQLHP 510

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               L     D      F +  +     RR   L  +LE+GYN ++ D D++WL+DPFP 
Sbjct: 511 NCYYLKTTGTDFSGEKLFATPDYLKMMWRRIELLTQVLEMGYNFIFTDADIMWLRDPFPR 570

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           L  D D      MA  +     HD       G TY+ S
Sbjct: 571 LYPDGDF----QMACDRFFGDPHDSDNWVNGGFTYVKS 604



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 56/260 (21%), Positives = 107/260 (41%), Gaps = 34/260 (13%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+I+ A++Q +         F  ++ + I  ++    V+ +  D     +     P 
Sbjct: 112 EDNTVIITALNQAWAEPNSTFDVFRESFKVGIETERLLKHVIAVCLDIKAYDQCLKVHP- 170

Query: 144 HAVLVPPAPDSQTA--HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           H  L+      Q +  ++F + G+     RR   L  ++ LGYN ++ D D++WL+DPFP
Sbjct: 171 HCYLINATDSDQLSGPNRFMTPGYLKLIWRRMDLLRQVIGLGYNFIFTDADILWLRDPFP 230

Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
               D D   T D    +P D            + ++ S   Y++  +      K WI  
Sbjct: 231 RFFPDADFQITCDDYNGRPSDK-----------KNHVNSGFTYVKANNKTSKFYKYWIRS 279

Query: 262 LQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHV 321
            +  P       +DQ  FN+  N     V+   +    F T  +YF    + + ++  +V
Sbjct: 280 SRKFP-----GKHDQDVFNFIKNDL--HVEKLGIKMRFFDT--VYFGG--FCQPSRDINV 328

Query: 322 I--IHNNYITGFEKKIKRFR 339
           +  +H N   G + K+   +
Sbjct: 329 VNTMHANCCIGLDNKVNNLK 348


>gi|357517311|ref|XP_003628944.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
            truncatula]
 gi|355522966|gb|AET03420.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Medicago
            truncatula]
          Length = 1906

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/287 (25%), Positives = 121/287 (42%), Gaps = 34/287 (11%)

Query: 80   YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL-YKV 137
            ++L    S  A +N T+++      Y   L +W+  + R+   +  +V A D  T  + +
Sbjct: 1405 FSLESLLSITADRNKTVVLTVAGYSYKDMLMSWVCRL-RKLSIENFIVSALDQETYQFSI 1463

Query: 138  NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
                P     + P+  S     FG++ F   T  +   +L IL+LGYNV+ +DVD  W +
Sbjct: 1464 LQGIPVFKDPIAPSDISFDECHFGTKCFQRVTKVKSRIVLKILKLGYNVLLSDVDTYWFR 1523

Query: 198  DPFPYLQG-DHDVYF--TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLV 254
            +P P+L    H V    +D+     P+    +LP     G  Y  S    +   D  K+V
Sbjct: 1524 NPIPFLNSFGHAVLVAQSDEYQEQGPI----NLPRRLNSGFYYAHSDNQTIAAID--KVV 1577

Query: 255  MKKWIEELQAEPW--------SKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLY 306
                   L  +P           + + +D        N     V ++ L ++ FP G   
Sbjct: 1578 KHAETSGLSEQPSFYDILCGKGGSNRVSDNKCVEPETN-----VTVHFLDRNLFPNGA-- 1630

Query: 307  FKNQTWVEET------KGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
            ++N  W ++T      K    IIHNN+I+G  KK+ R    GLW  D
Sbjct: 1631 YQN-LWQDKTVKATCLKKGCYIIHNNWISGRLKKLTRQVLSGLWEYD 1676


>gi|108709554|gb|ABF97349.1| expressed protein [Oryza sativa Japonica Group]
          Length = 622

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 115/285 (40%), Gaps = 26/285 (9%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           A+ G I+V   +  +L F+  W+  ++     D +LV A D   L ++  R  G  V   
Sbjct: 102 ARGGAIVVTFGNHAFLDFILTWVRHLT-DLGVDSLLVGAMDTKLLRELYLR--GVPVFDM 158

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
            +  +     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 159 GSRMATEDAVWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 218

Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
                   V P      L    +    Y    + + RPT+ AK + K W + L ++    
Sbjct: 219 DLLTSSDQVIPTVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSD---- 273

Query: 270 AKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTWVE 314
                DQ AFN  ++K                 G++ L +LP S F +G  YF  Q   +
Sbjct: 274 -DTLWDQNAFNDLIHKKFGYPVVGEDELVYSYDGKLKLGVLPASIFCSGHTYFV-QGMYQ 331

Query: 315 ETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           + + +   +H  +   G   K  R R+  L+       +SP G L
Sbjct: 332 QLRLEPYAVHTTFQYAGTVGKRHRLREAMLFFDQPSYYDSPGGFL 376


>gi|449449787|ref|XP_004142646.1| PREDICTED: uncharacterized protein LOC101203732 [Cucumis sativus]
          Length = 640

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 72/287 (25%), Positives = 118/287 (41%), Gaps = 30/287 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           AK+  IIV   +  ++ F+ +W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 125 AKDNIIIVTFGNYAFMDFILSWVKHLTDLGLTN-LLVGAMDTKLLEAL--YWKGIPVFDM 181

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  IL  G  ++  D DMVWLK+P PYL      
Sbjct: 182 GSHMSTVDVGWGSPTFHKMGREKVILIDSILPYGVELLMCDTDMVWLKNPLPYLARYPAA 241

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D   D      +    +   + + RPT+ +K + K+W E L A+  
Sbjct: 242 DVLTSSDQVVPTVVDDRLD---KWNEVTGALNIGIFHWRPTEASKKLAKEWKEMLLAD-- 296

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  ++K                 G + L +LP S F +G  YF  Q  
Sbjct: 297 ---DKIWDQNGFNELVHKKYGPSVDEDSELVYAYDGNLKLGVLPASIFCSGHTYFV-QAM 352

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
            ++ + +   +H  +   G E K  R R+  ++       ++P G L
Sbjct: 353 YQQFRLEPYAVHTTFQYAGTEGKRHRLREAMVFFDPPEYFDAPGGYL 399


>gi|384251205|gb|EIE24683.1| hypothetical protein COCSUDRAFT_62105 [Coccomyxa subellipsoidea
           C-169]
          Length = 857

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 61/277 (22%), Positives = 109/277 (39%), Gaps = 16/277 (5%)

Query: 86  ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
           A+  +++G ++V   S  YL FL NW+  ++  + ++ ++   ++  + Y      P   
Sbjct: 346 ANSRSRDGIVLVTWASAAYLDFLRNWIHHLTILEVENFLIGAMDNEVSEYLRQQNIPYFD 405

Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
           +      D Q     G++        +      +   G + +  D D+VWL+DP  Y  G
Sbjct: 406 MQAGMYSDMQGHLMKGTKAARMLAFNKIGVAQTLNTFGLDALLCDTDVVWLRDPSDYFSG 465

Query: 206 --DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQ 263
             + D+    D   V        L  P    R  + + +++LR   GA+  M  W   L+
Sbjct: 466 LEEADILVATDGLGVSNTKDDDGLESPEAALRHQMSTGIVFLRHGRGARNFMDAWDSALR 525

Query: 264 AEPWSKAKKANDQPAF-----------NWALNKTAGQ-VDLYLLPQSAFPTGGLYFKNQT 311
                  ++A +  A            NW +    G+ V   +LP S F  G  YF  Q 
Sbjct: 526 RNVSRTEQQAFNAVARVGVKPLKTHPDNWRVFYGMGRTVTFGILPASGFANGHSYFV-QR 584

Query: 312 WVEETKGKHVIIHNNYI-TGFEKKIKRFRDFGLWLVD 347
             +  +     +H +Y+  G   K  R R+  +WL D
Sbjct: 585 LHDVMRVSPYAVHMSYVFGGVAAKRNRLREEQIWLDD 621


>gi|326492838|dbj|BAJ90275.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326494142|dbj|BAJ85533.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 416

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 65/263 (24%), Positives = 109/263 (41%), Gaps = 35/263 (13%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED---YATLYKVNGRWPG 143
           T+I+  V+Q +         FL ++ I     +    VLV+A D   +A    V+     
Sbjct: 148 TVIITCVNQAWAAPGSLLDLFLESFRIGDGTARLLPHVLVVAMDPGAHARCLAVHRHC-- 205

Query: 144 HAVLVPPAPDSQTAHK-FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
           +   +P       AHK F S+ +      +      ILELGY  ++ DVD+VWL+DPF +
Sbjct: 206 YHYTIPGLNIDFAAHKYFLSKDYLELVWSKLKLQRRILELGYGFLFTDVDIVWLRDPFKH 265

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           +    D+  + D+    P D+  + P  G            +++P      + K W    
Sbjct: 266 VTAYADMTVSSDVYFGDP-DNLGNFPNTG----------FFHVKPNARTIAMTKLWHGAK 314

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVI 322
              P      AN+QP FN    +   ++ L +   +    GG      ++ ++ + K V 
Sbjct: 315 GKYP-----GANEQPVFNMMKKRMVAELGLRVQYMNPAYVGGFC----SYGKDLR-KIVT 364

Query: 323 IHNNYITGFEKKIKRFRD-FGLW 344
           +H N   G E KIK  ++  G W
Sbjct: 365 MHANCCVGIENKIKDLKNVLGDW 387


>gi|357484405|ref|XP_003612490.1| hypothetical protein MTR_5g025590 [Medicago truncatula]
 gi|355513825|gb|AES95448.1| hypothetical protein MTR_5g025590 [Medicago truncatula]
          Length = 628

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 72/273 (26%), Positives = 112/273 (41%), Gaps = 30/273 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     +  LV A D   L  +   W G  V   
Sbjct: 113 VKDNIIIVTFGNYAFMDFILTWVKQLNDLGVSN-YLVGAMDTKLLEAL--YWKGVPVFDM 169

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  IL  G  ++  D DMVWLK+P PYL      
Sbjct: 170 NSHMSTMDVGWGSPTFHKMGREKVILINSILPFGVELLMCDTDMVWLKNPLPYLARYPGA 229

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S  L    + G  Y    + + RPT+ A  + K+WIE L A+  
Sbjct: 230 DVLTSSDQVIPTVVDDS--LEVWKEVGAAYNIG-IFHWRPTESAIKLAKEWIELLVAD-- 284

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  +++                 G + L +LP S F +G  YF  Q  
Sbjct: 285 ---DKIWDQNGFNDIVHRELGPSVDEDSGLVYAYDGNLKLGILPSSIFCSGHTYFV-QGM 340

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLW 344
            ++ + +   +H  +   G E K  R R+  L+
Sbjct: 341 YQQLRLEPYAVHTTFQYAGTEGKRHRLREAMLF 373


>gi|145355476|ref|XP_001421987.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582226|gb|ABP00281.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 591

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 73/312 (23%), Positives = 122/312 (39%), Gaps = 63/312 (20%)

Query: 88  FVAKNGTIIVCAVSQPYLPFLNNWLISISRQK------HQDQVLVIAEDYAT-------- 133
            +A++G + V  VS  +  F  NWL  + + K        + ++++A D AT        
Sbjct: 1   MLARDGLVFVSYVSDGFHEFALNWLKLLRKAKGAQPNEKDENIVMLALDEATERFCERHS 60

Query: 134 ---LYKVNGRWPGHAVLVPPAP--DSQTAHKFGSQG-----FFNFTSRRPCHLLHILELG 183
                  N R+ G  +        D+  A +  S           T+ R   LL +L+ G
Sbjct: 61  MPCFGGANYRYKGGVMATGGTALGDASGARQAASVAEAAKAMREMTTLRVKLLLDLLDRG 120

Query: 184 YNVMYNDVDMVWLKDPFPYLQ---GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           ++V+ +D D+ WL+DP  +++    D DV  + D    +  D       P   G      
Sbjct: 121 HDVLVSDADVAWLRDPREWMREAMTDVDVAASTDCLNARDDDEGKCWGAPTNTG------ 174

Query: 241 CMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA------------- 287
            ++Y   T+ AK  +  W++ +  E  ++     DQ  FN  L K +             
Sbjct: 175 -ILYFNATEPAKKFIADWVDGM--EKATEDTTERDQEIFNKLLIKRSSTSESREIKRRVR 231

Query: 288 -----GQVDLYLLPQSAFPTGGLYFKNQTWVEETK------GKHVIIHNNYITGFEKKIK 336
                G V   LLP   F +G  YF  +    ET+        H     + + G   K +
Sbjct: 232 VKRLEGGVQFALLPMRLFASGHTYFVQRLHERETRLDEQPLCAHATFQFSQVHG---KRQ 288

Query: 337 RFRDFGLWLVDD 348
           RFR+ GLW V++
Sbjct: 289 RFREHGLWDVEE 300


>gi|449015632|dbj|BAM79034.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 791

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 111/272 (40%), Gaps = 29/272 (10%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQK-HQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA-P 152
           +I+ A +  Y   L N++ ++ R + H+  V  + ED   LY+      G AV + PA  
Sbjct: 519 VIMTAATYEYRFLLMNFVCNLRRLRIHKLLVAALDED---LYRY-AYARGLAVYLEPALI 574

Query: 153 DSQTAHK----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
           DSQ  H     FGS  F + +  +  H+  IL  G++V+++DVD+ W +D  P L    D
Sbjct: 575 DSQYRHSLQCAFGSACFRHRSKLKSRHVYEILRRGHDVLWSDVDITWFRDVRPELLRARD 634

Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
               +          +   P     G   + S   Y   +      + + +E       +
Sbjct: 635 AAPPNISGRSVWFQSNEPDPDQAWNGIRRLNSGFYYAVSSASTVTGLSRILEH------A 688

Query: 269 KAKKANDQPAFNWAL----------NKTAGQVDLY--LLPQSAFPTGGLY-FKNQTWVEE 315
            + + ++QP+F   L          NKT    DLY   L    F  G  + +  Q   ++
Sbjct: 689 ASSQLSEQPSFYDVLCGEHGEYRLDNKTCFNGDLYTHFLDPRVFRNGANWTYWVQNRADQ 748

Query: 316 TKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
                 I+HNN+I G   K +R    GL   D
Sbjct: 749 GAPAAAILHNNWIKGLAAKQERLEAAGLAYFD 780


>gi|356577001|ref|XP_003556618.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 397

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/144 (25%), Positives = 66/144 (45%), Gaps = 8/144 (5%)

Query: 79  DYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED 130
           DY+L    +  A ++ T+I+  +++ +         FL ++ I    ++  + +++IA D
Sbjct: 100 DYSLENILNEAAMQDRTVILTTLNEAWAATNSIIDLFLESFRIGDRTRRLLNHLVIIALD 159

Query: 131 YATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYND 190
                +         +LV  A D      F +  +     RR   L  +LE+GYN ++ D
Sbjct: 160 QKAFMRCQAIHTYCYLLVNEATDFHKEAYFMTPSYLKMMWRRIDFLRSVLEMGYNFVFTD 219

Query: 191 VDMVWLKDPFPYLQGDHDVYFTDD 214
           VD++W +DPFP+   D D     D
Sbjct: 220 VDIMWFRDPFPWFHRDADFQIACD 243


>gi|303287152|ref|XP_003062865.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
 gi|226455501|gb|EEH52804.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
          Length = 525

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/210 (24%), Positives = 91/210 (43%), Gaps = 28/210 (13%)

Query: 166 FNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLD 222
           F+   R+   L+ +   +G++++ +DVD VW+K+P PY+    D DV  + D  A     
Sbjct: 67  FHKMGRKKIDLIKVFTNMGFDILVSDVDTVWMKNPMPYVMKYPDADVLTSSDHLASTATG 126

Query: 223 HSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWA 282
              + P    + ++     ++ +R T  AK + ++W+  L      K  K  DQ AFN  
Sbjct: 127 DGLEDP---LRAQSAANIGIMLIRHT--AKELAEEWVNVLD-----KDAKVWDQNAFNDL 176

Query: 283 LNKTA--------------GQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYI 328
           + +                G++   +LP S F +G  +F  +   +     +V+      
Sbjct: 177 MRRGRAAAGGDDKLFLGYDGKLKFGILPVSTFASGHTFFVQRMHEKHDADPYVVHATFQF 236

Query: 329 TGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           +G E K  R R+  LW VDD +   P   L
Sbjct: 237 SGTEGKRHRMREAKLW-VDDASYYDPTEGL 265


>gi|443719977|gb|ELU09871.1| hypothetical protein CAPTEDRAFT_207654 [Capitella teleta]
          Length = 523

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 89/201 (44%), Gaps = 30/201 (14%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           +S T   +GS+ F    + R   +LH L+  YN++++D D+ +  +PFP ++        
Sbjct: 170 NSGTGAHYGSKVFKEKMNIRTFMVLHALKESYNLLHSDSDVYYFANPFPVIKE-----LC 224

Query: 213 DDMAAVKPL-DH-SHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA 270
                V PL D+ SH+             +  +Y R T    + M + +           
Sbjct: 225 GSECDVAPLWDYMSHN-------------AGFLYTRNTTMG-IAMYEHMNNTAC-----T 265

Query: 271 KKANDQPAFNWALNK-TAGQVDLYLLPQSAFPTGGLYFK--NQTWVEETKGKHVII-HNN 326
              +DQ A N A+   T   + L  LP   F +G L+F   N+T+ E+      I+ HNN
Sbjct: 266 TTDDDQVALNRAIEHCTKIGLKLVRLPTEQFQSGKLFFDDGNRTFAEDNPCTTCIVAHNN 325

Query: 327 YITGFEKKIKRFRDFGLWLVD 347
           +I G   K  RF++  +W+ D
Sbjct: 326 WIVGIAAKEYRFKEMHMWVYD 346


>gi|125545615|gb|EAY91754.1| hypothetical protein OsI_13395 [Oryza sativa Indica Group]
          Length = 343

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 86/202 (42%), Gaps = 16/202 (7%)

Query: 37  VLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVA-KNGTI 95
            L R   +V + LLV++G  L  +   G              D  L  A    A  NGT+
Sbjct: 12  ALGRAIAVVAICLLVIVGTYLLSSPAAGDGDTEEEFFSVDAGDDDLEAAVRGAAYANGTL 71

Query: 96  IVCAVSQPYLP----FLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNGRWPGHAVL 147
           IV  +++ Y       L+ +L S+   +  +Q    VL++A D     +           
Sbjct: 72  IVSVLNRAYADEDGGLLDLFLRSMREGEGTEQLIAHVLLVAMDRPAFLRCRRLGGVRCYQ 131

Query: 148 VPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
           +P A D      +   + S GF     RR   L  +L+LGY+ ++ D+D++WL++P P L
Sbjct: 132 LPAAQDGADDLSSEQLYMSDGFIRMMWRRIRLLGDVLKLGYSFIFTDLDVMWLRNPLPRL 191

Query: 204 Q---GDHDVYFTDDMAAVKPLD 222
           +    + D+  + D    +P D
Sbjct: 192 EYRAEEEDLLISSDQFNGRPGD 213


>gi|115456601|ref|NP_001051901.1| Os03g0849900 [Oryza sativa Japonica Group]
 gi|28269394|gb|AAO37937.1| putative regulatory protein [Oryza sativa Japonica Group]
 gi|108712117|gb|ABF99912.1| regulatory protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113550372|dbj|BAF13815.1| Os03g0849900 [Oryza sativa Japonica Group]
          Length = 408

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/196 (25%), Positives = 83/196 (42%), Gaps = 25/196 (12%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
           T+I+  V+  +         FL  + +     +    VLV+A D   L +     P   +
Sbjct: 145 TVIITCVNHAFAAPDSLLDIFLQGFRVGDGTPELLRHVLVVAMDPTALTRCRAVHPHCYL 204

Query: 147 LVPPAPDSQ-TAHKF-GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
              P  D   T+ KF  S+ +      +      IL+LGYN ++ DVD+VWL++PF ++ 
Sbjct: 205 YTMPGLDVDFTSEKFFASKDYLELVWSKLKLQRRILQLGYNFLFTDVDIVWLRNPFKHVA 264

Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
              D+  + D+    P D+  + P  G            Y++P+     + K+W E   +
Sbjct: 265 VYADMAISSDVFFGDP-DNIDNFPNTG----------FFYVKPSARTIAMTKEWHEARSS 313

Query: 265 EPWSKAKKANDQPAFN 280
            P       N+QP FN
Sbjct: 314 HP-----GLNEQPVFN 324


>gi|125588648|gb|EAZ29312.1| hypothetical protein OsJ_13373 [Oryza sativa Japonica Group]
          Length = 404

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/196 (25%), Positives = 83/196 (42%), Gaps = 25/196 (12%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
           T+I+  V+  +         FL  + +     +    VLV+A D   L +     P   +
Sbjct: 145 TVIITCVNHAFAAPDSLLDIFLQGFRVGDGTPELLRHVLVVAMDPTALTRCRAVHPHCYL 204

Query: 147 LVPPAPDSQ-TAHKF-GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
              P  D   T+ KF  S+ +      +      IL+LGYN ++ DVD+VWL++PF ++ 
Sbjct: 205 YTMPGLDVDFTSEKFFASKDYLELVWSKLKLQRRILQLGYNFLFTDVDIVWLRNPFKHVA 264

Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
              D+  + D+    P D+  + P  G            Y++P+     + K+W E   +
Sbjct: 265 VYADMAISSDVFFGDP-DNIDNFPNTG----------FFYVKPSARTIAMTKEWHEARSS 313

Query: 265 EPWSKAKKANDQPAFN 280
            P       N+QP FN
Sbjct: 314 HP-----GLNEQPVFN 324


>gi|443688778|gb|ELT91377.1| hypothetical protein CAPTEDRAFT_201955 [Capitella teleta]
          Length = 580

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/259 (22%), Positives = 96/259 (37%), Gaps = 28/259 (10%)

Query: 96  IVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQ 155
           +   +   YLP   N  I+  R+      L +  D+        R     V +      +
Sbjct: 152 LAAVIDHGYLPIAENIYITSFRRHFMQNFLYVCVDFEACEA--ARLQCMPVFLYMNASHK 209

Query: 156 TAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM 215
            +    SQ F   +  +       +  GY V   D+D+ + ++P P L            
Sbjct: 210 DSGDMNSQSFREKSMLKLQLAYEAMAAGYTVFLTDLDVFFFRNPLPKL------------ 257

Query: 216 AAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKAND 275
             +     S DL      G+  I +  + LRPT+       ++  E+   P  + +  +D
Sbjct: 258 --LDLCQESCDLVGQRDVGQV-INTGFMLLRPTNTTI----RFYHEMLTSP-KRDEFMHD 309

Query: 276 QPAFNWAL---NKTAGQVDLYLLPQSAFPTGGLYFKNQT---WVEETKGKHVIIHNNYIT 329
           Q  FN+             + LL +  FP G  YF+N     +      K   +HNN+I 
Sbjct: 310 QTFFNYMFPNFKSRHRSTKVILLSEEEFPEGRNYFRNGRRFFYDSNPCEKCFEVHNNWIV 369

Query: 330 GFEKKIKRFRDFGLWLVDD 348
           G   K  RF++  +W+VDD
Sbjct: 370 GTRAKTLRFQEHLMWMVDD 388


>gi|326514932|dbj|BAJ99827.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 407

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 65/263 (24%), Positives = 103/263 (39%), Gaps = 35/263 (13%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED---YATLYKVNGRWPG 143
           T+I+  V+Q +         FL ++ I     +    VLV+A D   +A    V+     
Sbjct: 139 TVIITCVNQAWAAPGSLLDLFLESFRIGDGTARLLPHVLVVAMDAGAHARCLAVHRHC-- 196

Query: 144 HAVLVPPAPDSQTAHK-FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
           +   +P       AHK F S+ +      +      ILELGY  ++ DVD+VWL+DPF +
Sbjct: 197 YHYTIPGLNIDFAAHKYFLSKDYLELVWSKLKLQRRILELGYGFLFTDVDIVWLRDPFKH 256

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           +    D+  + D+    P D+  + P  G            +++P      + K W    
Sbjct: 257 VTAYADMTVSSDVYFGDP-DNLGNFPNTG----------FFHVKPNARTIAMTKLWHGGR 305

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVI 322
              P      AN+QP FN    +   ++ L +        GG     +       GK V 
Sbjct: 306 GKYP-----GANEQPVFNMMKKQMVAELGLRVQYLDPAYVGGFCSYGKDL-----GKIVT 355

Query: 323 IHNNYITGFEKKIKRFRD-FGLW 344
           +H N   G   KIK  +   G W
Sbjct: 356 MHANCCVGIGNKIKDLKGVLGDW 378


>gi|357121249|ref|XP_003562333.1| PREDICTED: uncharacterized protein LOC100845970 [Brachypodium
           distachyon]
          Length = 614

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 72/287 (25%), Positives = 118/287 (41%), Gaps = 30/287 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           A++G I+V   +  +L F+  W   ++     D +LV A D   L ++  R  G  V   
Sbjct: 98  ARDGVIVVTFGNYAFLDFILTWAHHLTALG-VDNLLVGAMDTKLLRELYLR--GVPVFDM 154

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +        +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 155 GSRMVTEDAGWGSPTFHKMGREKVLLINTLLPFGYELLMCDTDMVWLKNPLPYLARYPDA 214

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           D+  + D       D S +     ++        + + RPT+ AK + K W + + ++  
Sbjct: 215 DLLTSSDQVIPTVTDDSLE---NWREVTGAFNIGIFHWRPTEPAKKLAKDWKDLVLSD-- 269

Query: 268 SKAKKANDQPAFNWALNKTAGQ---------------VDLYLLPQSAFPTGGLYFKNQTW 312
               +  DQ AFN   +K  GQ               + L +LP + F +G  YF  Q  
Sbjct: 270 ---DEIWDQNAFNDLAHKVFGQPVQGQDELVYSYDGKLKLGVLPAAIFCSGHTYFV-QGM 325

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
            ++   +   +H  +   G E K  R R+  L+       +SP G L
Sbjct: 326 HQQLHLEPYAVHTTFQYAGTEGKRHRLREAMLFFDPPPYYDSPGGFL 372


>gi|115453933|ref|NP_001050567.1| Os03g0586300 [Oryza sativa Japonica Group]
 gi|50399937|gb|AAT76325.1| expressed protein [Oryza sativa Japonica Group]
 gi|113549038|dbj|BAF12481.1| Os03g0586300 [Oryza sativa Japonica Group]
          Length = 356

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 95/233 (40%), Gaps = 24/233 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
           A+ G I+V   +  +L F+  W+  ++     D +LV A D   L ++  R  G  V   
Sbjct: 102 ARGGAIVVTFGNHAFLDFILTWVRHLT-DLGVDSLLVGAMDTKLLRELYLR--GVPVFDM 158

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
            +  +     +GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D 
Sbjct: 159 GSRMATEDAVWGSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDA 218

Query: 210 YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
                   V P      L    +    Y    + + RPT+ AK + K W + L ++    
Sbjct: 219 DLLTSSDQVIPTVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSD---- 273

Query: 270 AKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYF 307
                DQ AFN  ++K                 G++ L +LP S F +G  YF
Sbjct: 274 -DTLWDQNAFNDLIHKKFGYPVVGEDELVYSYDGKLKLGVLPASIFCSGHTYF 325


>gi|224102433|ref|XP_002312675.1| predicted protein [Populus trichocarpa]
 gi|222852495|gb|EEE90042.1| predicted protein [Populus trichocarpa]
          Length = 552

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 115/290 (39%), Gaps = 40/290 (13%)

Query: 80  YTLSQAASFVA-KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVN 138
           ++L    S  A +N T+++      Y   L +W+  + + +  + ++   +     + V 
Sbjct: 272 FSLVSLLSITADENKTLVLAVAGYSYKDMLMSWVCRLHQLRVTNFIICALDQETYQFSVL 331

Query: 139 GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
              P       P   S     FG+  F   T  +   +  IL+LGYNV+ +DVD+ W  +
Sbjct: 332 QGLPVFHDPSAPRNISFNDCHFGTTCFQRVTKVKSRMVWKILKLGYNVLLSDVDVYWFGN 391

Query: 199 PFPYLQG---DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKL-V 254
           P P L        V  +D+     P+    +LP     G         Y   +D + +  
Sbjct: 392 PLPLLYSFGPGVLVAQSDEYNYTGPV----NLPRRLNSG--------FYFARSDASSVAA 439

Query: 255 MKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG--------------QVDLYLLPQSAF 300
           M+K ++       +     ++QP+F   L    G               + ++ L ++ F
Sbjct: 440 MEKVVKH------AARSNLSEQPSFYDTLCGEGGSYRISDNRCVEPETNLTIHFLDRNLF 493

Query: 301 PTGG---LYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
           P G    L+ K        K   +I+HNN+I+G  KK++R    GLW  D
Sbjct: 494 PNGAYLNLWQKKNVKKACMKKGCLILHNNWISGRVKKLERQVVSGLWEYD 543


>gi|218189629|gb|EEC72056.1| hypothetical protein OsI_04966 [Oryza sativa Indica Group]
          Length = 372

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 25/223 (11%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D VLV+A D A   +     P   +L   + +  +A +F S+ +      +      +LE
Sbjct: 134 DHVLVVAVDPAGFRRCKAVHPHCYLLHVKSINLTSATRFMSREYLELVWTKLSLQQRVLE 193

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQ--GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 239
           LGYN ++ D DMV  +DPF ++    D      D  AA  PLD+      P   G     
Sbjct: 194 LGYNFLFTDCDMVLFRDPFRHIAVYADMSTSSDDYSAARAPLDN------PLNTG----- 242

Query: 240 SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSA 299
             + Y++ T  +  +++ W     A P  +   A+DQ  F    ++   +     L    
Sbjct: 243 --LYYVKATSQSVEMLRYW---QAARP--RFPGAHDQAVFGHIKHELVAK-----LRARI 290

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFG 342
            P   LYF       +   + V +H +   G + K+    D  
Sbjct: 291 EPLDTLYFGGFCEYHDDLARAVTMHADCCVGLDTKVHDLTDIA 333


>gi|302798270|ref|XP_002980895.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300151434|gb|EFJ18080.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 553

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 69/278 (24%), Positives = 114/278 (41%), Gaps = 49/278 (17%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
           +++  VS  Y   L +W+  + R  +    LV   D   +Y+  G   G  V    +  S
Sbjct: 280 VVLTVVSHSYRDMLMSWVCRL-RHLNVTNYLVATID-KEMYQF-GILQGLPVFRTESGRS 336

Query: 155 QTAH-KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
            +    FGS  F   T  +   +L ILELGY+V+++DVD+ W   P   L       F  
Sbjct: 337 DSKDCTFGSSCFKTVTKSKSRTVLRILELGYSVLFSDVDVYWFSSPIRELMA-----FGP 391

Query: 214 DMAAVKPLDHSH----DLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
            + A +  +++     +LP     G         Y   +D A +V  + I +  A+    
Sbjct: 392 GVLAAQTDEYNEKEAVNLPRRLNSG--------FYFAWSDNATIVSFRKIVKHAAK---- 439

Query: 270 AKKANDQPAFNWALNKTAGQ--------------VDLYLLPQSAFPTGGLYFKNQTW-VE 314
             K ++QP+F   L    G+              + +  L +  +P G        W  +
Sbjct: 440 -SKLSEQPSFYDVLCGEDGKHRKGNSSCVEPETNLTVEFLDRWRYPNGAYKL---IWDSK 495

Query: 315 ETKGK-----HVIIHNNYITGFEKKIKRFRDFGLWLVD 347
           + +G       VI+HNN+ TG ++KI+R    GLW  D
Sbjct: 496 DVRGTCQRLGCVILHNNWKTGHQQKIERQTRAGLWEYD 533


>gi|115441887|ref|NP_001045223.1| Os01g0921000 [Oryza sativa Japonica Group]
 gi|19386741|dbj|BAB86123.1| regulatory protein-like [Oryza sativa Japonica Group]
 gi|19386794|dbj|BAB86173.1| OJ1485_B09.2 [Oryza sativa Japonica Group]
 gi|57899430|dbj|BAD88368.1| regulatory protein-like [Oryza sativa Japonica Group]
 gi|113534754|dbj|BAF07137.1| Os01g0921000 [Oryza sativa Japonica Group]
 gi|215701207|dbj|BAG92631.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222619773|gb|EEE55905.1| hypothetical protein OsJ_04575 [Oryza sativa Japonica Group]
          Length = 372

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 88/223 (39%), Gaps = 25/223 (11%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D VLV+A D A   +     P   +L   + +  +A +F S+ +      +      +LE
Sbjct: 134 DHVLVVAVDPAGFRRCKAVHPHCYLLHVKSINLTSATRFMSREYLELVWTKLSLQQRVLE 193

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQ--GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 239
           LGYN ++ D DMV  +DPF ++    D      D  AA  PLD+      P   G     
Sbjct: 194 LGYNFLFTDCDMVLFRDPFRHIAVYADMSTSSDDYSAARAPLDN------PLNTG----- 242

Query: 240 SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSA 299
             + Y++ T  +  +++ W     A P  +   A+DQ  F    ++   +     L    
Sbjct: 243 --LYYVKATSQSVEMLRYW---QAARP--RFPGAHDQAVFGHIKHELVAK-----LRARI 290

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFG 342
            P   LYF       +   + V +H +   G + K+    D  
Sbjct: 291 EPLDTLYFGGFCEYHDDLARAVTMHADCCVGLDTKVHDLTDIA 333


>gi|424513677|emb|CCO66299.1| predicted protein [Bathycoccus prasinos]
          Length = 750

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 63/300 (21%), Positives = 115/300 (38%), Gaps = 46/300 (15%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG--HAV 146
           +AK+ TIIV   +  YL F  NW+  +  +      ++ A D      +   + G  H  
Sbjct: 126 IAKDNTIIVTWANHHYLDFARNWINHVQNRLGLSNFIIGAMDEKMYESLKEEFSGGVHTW 185

Query: 147 LVPPAPDSQTAHK----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
           L+     S+ A K    +G++ F      +   +      G NV+ +D+D+VWL++P P+
Sbjct: 186 LMGSQGISKEAVKNDFGWGTKNFHQMGRDKIRLIRDFTRSGVNVLVSDIDVVWLRNPLPF 245

Query: 203 LQG--DHDVYFTDDMAAVKPLDHS---HDLPPPGKKGRTYICSC-----MIYLRPTDGAK 252
            +     DV  + D    + +  S         G+    +IC       +++   + G +
Sbjct: 246 FKRYPVADVLVSSDQLRSETMIESLKQKKFMVDGEGLEFHICHAASNIGIMWFLASRGNQ 305

Query: 253 LVMKKWIEELQAEP--WSKAKKANDQPAFNWALNKTAG---QVD---------------- 291
            +  +W++ ++ +   W       DQ AFN   +   G   QVD                
Sbjct: 306 ELTTEWVDRIEKDDNLW-------DQSAFNDLKSLNGGCQTQVDGSGVQEAYGDENKKMR 358

Query: 292 --LYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDH 349
             +  LP S F  G  Y+  +    E K  + +       G   K  R R+   W+ ++ 
Sbjct: 359 VKMGALPVSLFANGHTYYVQRLHERERKNAYAVHATFQYGGTPGKRNRMREANAWMGEEE 418


>gi|302755907|ref|XP_002961377.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300170036|gb|EFJ36637.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 553

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 69/278 (24%), Positives = 114/278 (41%), Gaps = 49/278 (17%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
           +++  VS  Y   L +W+  + R  +    LV   D   +Y+  G   G  V    +  S
Sbjct: 280 VVLTVVSHSYRDMLMSWVCRL-RHLNVTNYLVATID-KEMYQF-GILQGLPVFRTESGRS 336

Query: 155 QTAH-KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
            +    FGS  F   T  +   +L ILELGY+V+++DVD+ W   P   L       F  
Sbjct: 337 DSKDCTFGSSCFKTVTKSKSRTVLRILELGYSVLFSDVDVYWFSSPIQELMA-----FGL 391

Query: 214 DMAAVKPLDHSH----DLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSK 269
            + A +  +++     +LP     G         Y   +D A +V  + I +  A+    
Sbjct: 392 GVLAAQTDEYNEKEAVNLPRRLNSG--------FYFAWSDNATIVSFRKIVKHAAKS--- 440

Query: 270 AKKANDQPAFNWALNKTAGQ--------------VDLYLLPQSAFPTGGLYFKNQTW-VE 314
             K ++QP+F   L    G+              + +  L +  +P G        W  +
Sbjct: 441 --KLSEQPSFYDVLCGEDGKHRKGNSSCVEPETNLTVEFLDRWRYPNGAYKL---IWDSK 495

Query: 315 ETKGK-----HVIIHNNYITGFEKKIKRFRDFGLWLVD 347
           + +G       VI+HNN+ TG ++KI+R    GLW  D
Sbjct: 496 DVRGTCQRLGCVILHNNWKTGHQQKIERQTRAGLWEYD 533


>gi|356523378|ref|XP_003530317.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 436

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 35/144 (24%), Positives = 64/144 (44%), Gaps = 8/144 (5%)

Query: 79  DYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED 130
           +Y+L    S  A ++ T+I+  +++ +         FL ++ I    ++  + +++IA D
Sbjct: 111 EYSLENILSEAAMQDRTVILTTLNEAWAAPNSIIDLFLESFRIGDHTRRLLNHLVIIALD 170

Query: 131 YATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYND 190
                +         +LV  A D      F +  +     RR   L  +LE+GYN ++ D
Sbjct: 171 QKAFIRCQAIHTYCYLLVSEATDFHEEAYFMTPSYLKMMWRRIDFLRSVLEMGYNFVFTD 230

Query: 191 VDMVWLKDPFPYLQGDHDVYFTDD 214
            D++W +DPFP    D D     D
Sbjct: 231 ADIMWFRDPFPRFHRDADFQIACD 254


>gi|219127152|ref|XP_002183805.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404528|gb|EEC44474.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 650

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 65/146 (44%), Gaps = 13/146 (8%)

Query: 86  ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQ-DQVLVIAEDYATLYKVNGRWPGH 144
           A   A+N TI+V   +Q     L N++ S +R+      VLV A D  T YK+       
Sbjct: 313 ADQAARNQTIVVMTCNQGQSELLVNFVCSCTRRGLPISHVLVFATDTET-YKLAKSLGLR 371

Query: 145 AVLVPPAPD--------SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
           A  V   P         ++ A  +G   F      +   +  +L LGYNV++ DVD++W 
Sbjct: 372 AWDVTSLPGAFGVRSFPTKAADAYGDLTFAALMMAKVYCVHVVLLLGYNVLFQDVDVIWY 431

Query: 197 KDPFPYLQ---GDHDVYFTDDMAAVK 219
           +DP PY +      DV   DD A  K
Sbjct: 432 QDPVPYFETHWTTMDVIMQDDGARTK 457


>gi|219130296|ref|XP_002185304.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403219|gb|EEC43173.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 555

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 62/129 (48%), Gaps = 10/129 (7%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
           ++VC   Q  L   N    + SR  +   VLV A D  T Y +      HA+ V  A   
Sbjct: 290 VLVCNHGQSEL-LWNFVCAARSRSLNLAHVLVFATDSVT-YDLAVAMGLHAMDVQNAFGD 347

Query: 155 Q---TAHKFGSQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKDPFPYL---QGDH 207
                A ++G   F      +  +++H+L  LGYNV++ DVD+VW +DP  Y    Q D 
Sbjct: 348 MPTVAARRYGDAAFTGMMMSK-VYVMHLLITLGYNVLFQDVDVVWYQDPLAYFNNDQSDF 406

Query: 208 DVYFTDDMA 216
           D++F DD A
Sbjct: 407 DLFFQDDGA 415


>gi|219130298|ref|XP_002185305.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403220|gb|EEC43174.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 553

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 62/129 (48%), Gaps = 10/129 (7%)

Query: 95  IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDS 154
           ++VC   Q  L   N    + SR  +   VLV A D  T Y +      HA+ V  A   
Sbjct: 288 VLVCNHGQSEL-LWNFVCAARSRSLNLAHVLVFATDSVT-YDLAVAMGLHAMDVQNAFGD 345

Query: 155 Q---TAHKFGSQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKDPFPYL---QGDH 207
                A ++G   F      +  +++H+L  LGYNV++ DVD+VW +DP  Y    Q D 
Sbjct: 346 MPTVAARRYGDAAFTGMMMSK-VYVMHLLITLGYNVLFQDVDVVWYQDPLAYFNNDQSDF 404

Query: 208 DVYFTDDMA 216
           D++F DD A
Sbjct: 405 DLFFQDDGA 413


>gi|89070444|ref|ZP_01157741.1| Acetyltransferase (isoleucine patch superfamily protein)
           [Oceanicola granulosus HTCC2516]
 gi|89043930|gb|EAR50116.1| Acetyltransferase (isoleucine patch superfamily protein)
           [Oceanicola granulosus HTCC2516]
          Length = 544

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/190 (24%), Positives = 73/190 (38%), Gaps = 29/190 (15%)

Query: 104 YLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQ 163
           Y+P   NWL  ++     +Q  V+  D          +P   VL  PAP+          
Sbjct: 16  YVPVTRNWLAHLATLGLAEQATVVTLDSGARTA----FPVEQVLHRPAPEP--------- 62

Query: 164 GFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-DHDVYFTDDMAAVKPLD 222
           G       R      ILE G  ++++D D +WL DP P +     ++ F+     V P D
Sbjct: 63  GLAGLWKHRMAVCQEILEAGEALIHSDADAIWLDDPRPRIAACGSEMVFS--QGTVWPYD 120

Query: 223 HSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWA 282
               L       R  +C    YL PT   + +M + ++ L  +         DQ A N  
Sbjct: 121 IHERL-------RLVLCCGFFYLAPTSRVRTLMDQVLQRLDTD------GGEDQEAVNRV 167

Query: 283 LNKTAGQVDL 292
           + +T G  D+
Sbjct: 168 VAETIGGWDV 177


>gi|357114730|ref|XP_003559148.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 393

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 66/161 (40%), Gaps = 21/161 (13%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           ILELGY+ ++ DVD++W +DPF ++    D+  + D+    P D+  + P  G       
Sbjct: 224 ILELGYSFLFTDVDVMWFRDPFKHVTAYADMTVSSDVFLGDP-DNIGNFPNTG------- 275

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQS 298
                +++P +    + K W E     P      AN+QP FN        ++ L +    
Sbjct: 276 ---FFHVKPNNRTIAMTKVWHESRGKYP-----GANEQPVFNMIKKNLVKELGLKVRYLD 327

Query: 299 AFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
               GG     +       GK   +H N   G   K++  R
Sbjct: 328 TAYIGGFCGYGKDL-----GKICTMHANCCVGLNAKLRDLR 363


>gi|242055409|ref|XP_002456850.1| hypothetical protein SORBIDRAFT_03g044040 [Sorghum bicolor]
 gi|241928825|gb|EES01970.1| hypothetical protein SORBIDRAFT_03g044040 [Sorghum bicolor]
          Length = 355

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/256 (21%), Positives = 101/256 (39%), Gaps = 28/256 (10%)

Query: 94  TIIVCAVSQPYL---PFLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
           T+I+ +V++ +      L+ +L S    +      D +LV+A D    ++     P   +
Sbjct: 83  TVIITSVNEAWAQPGSLLDLYLDSFKNGEDTAHLLDHLLVVALDARGFHRCQAVHPYCYL 142

Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD 206
           L   + D  +A  F S  +      +      +LELGYN ++ D DMVW ++PF +    
Sbjct: 143 LNATSVDMSSAKPFMSPDYLELVWTKLVFQQRVLELGYNFLFTDCDMVWFRNPFRHFP-- 200

Query: 207 HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
             VY   DM+       S D  P        + + + Y++ T+    +MK W    +  P
Sbjct: 201 --VY--ADMSC-----SSDDFKPSRAPLDNPLNTGLYYMKTTNRTIEMMKYWRAARERFP 251

Query: 267 WSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNN 326
                  +DQ  F    ++  G++ + + P      GG+         +   K   IH +
Sbjct: 252 -----GQHDQAVFVNIRHELVGKLQVRIEPLDTVYYGGI-----CEYHDDPEKVCTIHAD 301

Query: 327 YITGFEKKIKRFRDFG 342
              G + K+     F 
Sbjct: 302 CCVGLDTKVHDLMAFA 317


>gi|308801050|ref|XP_003075306.1| unnamed protein product [Ostreococcus tauri]
 gi|116061860|emb|CAL52578.1| unnamed protein product [Ostreococcus tauri]
          Length = 868

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/216 (21%), Positives = 84/216 (38%), Gaps = 42/216 (19%)

Query: 175 HLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG 234
            L+ ++E G++V  +DVD+ W +DP  Y   + DV   +++     +  S +L P    G
Sbjct: 451 ELVRVVEAGFHVALSDVDVAWTRDPTAYFLCERDVDGCEEIKDADVMISSDNLSPTTDWG 510

Query: 235 R-------TYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNK-- 285
           R           + ++Y++ +      +++W E L A     A   + Q  FN  + +  
Sbjct: 511 RGARYARGGVFNTGVVYVKSSARGAAFLREWREHLLATTGPYAALTSHQQVFNKMVREHN 570

Query: 286 -------------------------TAGQVDLYLLPQSAFPTGGLYFKNQTWV-----EE 315
                                    T  ++ +  LP   F  G  YF           E+
Sbjct: 571 AWPGIDVAEGAPERTRVLESGAPLSTGLKIKIGALPLKLFANGHGYFVQGANARGGDRED 630

Query: 316 TKGKHVIIHNNYI---TGFEKKIKRFRDFGLWLVDD 348
            + +   +H  Y    +G + K  RF++ GLW+ DD
Sbjct: 631 DRLRPYAVHATYTFDGSGNDAKRYRFKEAGLWMGDD 666


>gi|125587812|gb|EAZ28476.1| hypothetical protein OsJ_12458 [Oryza sativa Japonica Group]
          Length = 327

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 16/202 (7%)

Query: 37  VLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVA-KNGTI 95
            L R   +V + LLV++G  L  +   G              D  L  A    A  NGT+
Sbjct: 12  ALGRAIAVVAICLLVIVGTYLLSSPAAGDGDTEEEFFSVDAGDDDLEAAVRGAAYANGTL 71

Query: 96  IVCAVSQPYLP----FLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNGRWPGHAVL 147
           IV  +++ Y       L+ +L S+   +  +Q    VL++A D     +           
Sbjct: 72  IVSVLNRAYADEDGGLLDLFLRSMREGEGTEQLIAHVLLVAMDRPAFLRCRRLGGVRCYQ 131

Query: 148 VPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
           +P A D      +   + S GF     RR   L  +L+LGY+ ++ D+D++WL++P   L
Sbjct: 132 LPAAQDGADDLSSEQLYMSDGFIRMMWRRIRLLGDVLKLGYSFIFTDLDVMWLRNPLSRL 191

Query: 204 Q---GDHDVYFTDDMAAVKPLD 222
           +    + D+  + D    +P D
Sbjct: 192 EYRAEEEDLLISSDQFNGRPGD 213


>gi|312282939|dbj|BAJ34335.1| unnamed protein product [Thellungiella halophila]
          Length = 374

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/204 (23%), Positives = 83/204 (40%), Gaps = 26/204 (12%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+I+ A++Q +         F  ++   +  ++    V+ +  D     +     P 
Sbjct: 109 EDNTVIITALNQAWAEPNSTFDVFRESFKAGLGTERLLKHVIAVCLDNKAYDRCVEVHP- 167

Query: 144 HAVLVPPAPDSQTA--HKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           H  L+      Q +  ++F + G+     RR      +L LGYN ++ D D++WL+DPFP
Sbjct: 168 HCYLINATDSDQLSGPNRFMTPGYLKLIWRRMDLHRQVLGLGYNFIFTDADILWLRDPFP 227

Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
               D D   T D    KP D ++           ++ S   Y++  +      K WI  
Sbjct: 228 RFFPDADFQITCDDYNGKPSDKNN-----------HVNSGFTYVKANNKTLNFYKYWIRS 276

Query: 262 LQAEPWSKAKKANDQPAFNWALNK 285
            +  P       +DQ  FN   NK
Sbjct: 277 SRKFP-----GKHDQDVFNLIKNK 295


>gi|443697956|gb|ELT98190.1| hypothetical protein CAPTEDRAFT_212399 [Capitella teleta]
          Length = 580

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 70/268 (26%), Positives = 112/268 (41%), Gaps = 45/268 (16%)

Query: 92  NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED--YATLYKVNGRWPGHAVLV- 148
           N  I+VC  S      +N +  S  +    + + +  +D   ATL K      G A    
Sbjct: 152 NIIILVCVDSGYSRMAMNLYYTSFHKLSINNYLFMGTDDQICATLQK-----QGFACYTY 206

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH- 207
             +P   +   +G+  F   T  +    L  L LGY V+  DVD+++  +PFPYL     
Sbjct: 207 QESPIHDSVSNWGTVEFSRKTHHKTKVTLDALLLGYTVLLVDVDIIFFHNPFPYLICKRC 266

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL-RPTDGA-KLVMKKWIEELQAE 265
           D+   +DM               G  G         YL RPT  +  L  K +   L   
Sbjct: 267 DIQIQNDMT-------------EGNSG--------FYLARPTTASITLHQKAYNASLLPG 305

Query: 266 PWSKAKKANDQPAFNWALNKTAGQVDLYL--LPQSAFPTGGLYFKNQTWV---EETKGKH 320
             S      +Q   +  + + + + +L +  LP+  FP G +YF+    +   +      
Sbjct: 306 ALS------NQKVLDRIMERMSQEHELVMQTLPKKQFPNGEVYFEEGRRMFANDNPCNDC 359

Query: 321 VIIHNNY-ITGFEKKIKRFRDFGLWLVD 347
           VI+HNN+ +TG  K++ RF++ GLW VD
Sbjct: 360 VIVHNNWMLTGAAKEL-RFKESGLWQVD 386


>gi|357438789|ref|XP_003589671.1| hypothetical protein MTR_1g034810 [Medicago truncatula]
 gi|355478719|gb|AES59922.1| hypothetical protein MTR_1g034810 [Medicago truncatula]
          Length = 306

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 54/125 (43%), Gaps = 1/125 (0%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL+++ I +  ++  + +++IA D     +         +LV    D      F +  + 
Sbjct: 43  FLDSFRIGVRTRRLLNHLVIIALDQKAFVRCQAIHTYCFLLVSEGNDFHEEAFFMTPLYL 102

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
               RR   L  +LELGYN ++ D D++W +DPFP    D D     D       D  H+
Sbjct: 103 KMMWRRIDFLRSVLELGYNFVFTDADIMWFRDPFPRFYDDADFQIACD-HFTGFFDDVHN 161

Query: 227 LPPPG 231
            P  G
Sbjct: 162 RPNGG 166


>gi|20197540|gb|AAD15452.2| unknown protein [Arabidopsis thaliana]
          Length = 528

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/269 (24%), Positives = 109/269 (40%), Gaps = 34/269 (12%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP 149
            K+  IIV   +  ++ F+  W+  ++     + +LV A D   L  +   W G  V   
Sbjct: 18  VKDNVIIVTFGNYAFMDFILTWVKHLTDLDLSN-ILVGAMDTKLLEAL--YWKGVPVFDM 74

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DH 207
            +  S     +GS  F      +   +  +L  GY ++  D DM    +P PYL    D 
Sbjct: 75  GSHMSTVDVGWGSPTFHKMGREKVILIDSVLPFGYELLMCDTDM----NPMPYLARFPDA 130

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPW 267
           DV  + D      +D S D+    + G  Y    + + RPT+ AK + K+W E L A+  
Sbjct: 131 DVLTSSDQVVPTVIDDSLDIWQ--QVGAAYNIG-IFHWRPTESAKKLAKEWKEILLAD-- 185

Query: 268 SKAKKANDQPAFNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTW 312
               K  DQ  FN  + +                 G + + +LP S F +G  YF  Q  
Sbjct: 186 ---DKVWDQNGFNEIVRRQLGPSVEGDSGLFYAYDGNLKVGILPASIFCSGHTYFV-QAM 241

Query: 313 VEETKGKHVIIHNNY-ITGFEKKIKRFRD 340
            ++ + +   +H  +   G E K  R R+
Sbjct: 242 YQQLRLEPYAVHTTFQYAGTEGKRHRLRE 270


>gi|115455131|ref|NP_001051166.1| Os03g0731800 [Oryza sativa Japonica Group]
 gi|50540739|gb|AAT77895.1| expressed protein [Oryza sativa Japonica Group]
 gi|108710906|gb|ABF98701.1| regulatory protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549637|dbj|BAF13080.1| Os03g0731800 [Oryza sativa Japonica Group]
 gi|215740630|dbj|BAG97286.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 351

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 52/210 (24%), Positives = 87/210 (41%), Gaps = 24/210 (11%)

Query: 37  VLNRTTLLVLLSLLVVLGVILPWTGTPG--------FMFPNATSSLAKWRDYTLSQAASF 88
            L R   +V + LLV++G  L  +   G        F       S     D  L  A   
Sbjct: 12  ALGRAIAVVAICLLVIVGTYLLSSPAAGDGDTEEEFFSVRRRRRSSVDAGDDDLEAAVRG 71

Query: 89  VA-KNGTIIVCAVSQPYLP----FLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNG 139
            A  NGT+IV  +++ Y       L+ +L S+   +  +Q    VL++A D     +   
Sbjct: 72  AAYANGTLIVSVLNRAYADEDGGLLDLFLRSMREGEGTEQLIAHVLLVAMDRPAFLRCRR 131

Query: 140 RWPGHAVLVPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
                   +P A D      +   + S GF     RR   L  +L+LGY+ ++ D+D++W
Sbjct: 132 LGGVRCYQLPAAQDGADDLSSEQLYMSDGFIRMMWRRIRLLGDVLKLGYSFIFTDLDVMW 191

Query: 196 LKDPFPYLQ---GDHDVYFTDDMAAVKPLD 222
           L++P   L+    + D+  + D    +P D
Sbjct: 192 LRNPLSRLEYRAEEEDLLISSDQFNGRPGD 221


>gi|21554540|gb|AAM63606.1| unknown [Arabidopsis thaliana]
          Length = 428

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 103/256 (40%), Gaps = 28/256 (10%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA N  IIV   +    P L   + S+ R   Q+ ++V  +D    +          VL 
Sbjct: 153 VAVNKEIIVVLANSNVKPMLELQIASVKRVGIQNYLIVALDDSMESFC----ESKEVVLY 208

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD        S G    +  +   L   L+LGY+V+ +DVD+V+L++PF +L  D D
Sbjct: 209 KRDPDKAVDMVGKSGGNHAVSGLKFRVLREFLQLGYSVLLSDVDIVFLQNPFSHLHRDSD 268

Query: 209 V-YFTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKWI 259
           V   +D  D       +   D P  G        R ++  S   YLRPT  +  ++ +  
Sbjct: 269 VESMSDGHDNNTAYGFNDVFDEPSMGWARYAHTMRIWVFNSGFFYLRPTLPSIDLLDRVA 328

Query: 260 EEL-QAEPWSKAKKANDQPAFNWAL------NKTAGQVDLYLLPQSAFPTGGLYFKNQTW 312
           + L ++E W       DQ  FN  L        T       ++    F    + FK    
Sbjct: 329 DTLSKSEAW-------DQAVFNEQLFYPSHPGYTGLHASKRVMDMYEFMNSKVLFKTVRK 381

Query: 313 VEETKG-KHVIIHNNY 327
            +E K  K VI+H NY
Sbjct: 382 NQELKKLKPVIVHLNY 397


>gi|412993976|emb|CCO14487.1| glycosyltransferase family 77 protein [Bathycoccus prasinos]
          Length = 790

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 92/247 (37%), Gaps = 53/247 (21%)

Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDH------------- 207
            S+ + NF   R  H   +LE G++V+ +D D+VWLK+P  YL+ +              
Sbjct: 267 ASKAWKNFAKMRISHATSLLEFGFDVVMSDADVVWLKNPEEYLKCEKVSEDGKVVENLSF 326

Query: 208 DVYFTDDMAAVKPLDHSHDLPPPG--KKGRTY-----ICSCMIYLRPTDGAKLVMKKWIE 260
           D+   +++ A   +  S +L P    + G  Y       + +++LR T G     K+W  
Sbjct: 327 DIDGCEELKAADVIVSSDNLSPTSDERDGGNYAKGGVFNTGIVFLRHTKGGIQWAKQWNL 386

Query: 261 ELQAEPWSKAKKANDQPAFNWALNK-----------------------------TAGQVD 291
            L A      +  +DQ  FN    K                               G   
Sbjct: 387 HLSATDGRFHRLTSDQQVFNAMSRKENAWPGLEVMRMDGTRTLGPKENKRVLVAAEGDTL 446

Query: 292 LYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYI---TGFEKKIKRFRDFGLWLVDD 348
           L + P + F  G +Y   Q + E+ K     +H  Y    +  + K  RF + GLW V +
Sbjct: 447 LGVFPVAKFNPGHVYMV-QKFHEKEKKTPFAVHATYTFDGSSGDAKKWRFMEAGLWRVPE 505

Query: 349 HAVESPL 355
              E   
Sbjct: 506 EEREEKF 512


>gi|357126544|ref|XP_003564947.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 370

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/221 (21%), Positives = 91/221 (41%), Gaps = 21/221 (9%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           + VL++A D     + N   P   +L     +  +A +F S+ +      +      +LE
Sbjct: 129 NHVLIVALDPTGFGRCNVVHPYCYLLEVKTANLTSATRFMSKEYLELVWSKLSFQQRVLE 188

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYN +Y D DM+ +++P  ++     VY    ++    LD    L  P   G       
Sbjct: 189 LGYNFLYTDTDMIMMRNPLRHIP----VYADMSVSTDNFLDARVPLTNPLNTG------- 237

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFP 301
           + Y++ T+ +  +++ W E   A P  +  + NDQP F    ++   ++ + + P     
Sbjct: 238 LYYMKATNRSISMLRYWQE---ARP--RFPRLNDQPVFARIKHELVEKLQVRIEPLRT-- 290

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFG 342
              +YF       +   K  I+H +   G + K+    D  
Sbjct: 291 ---IYFGGFCQYHDDFDKISIMHADCCIGVDNKVHDLMDVA 328


>gi|242055411|ref|XP_002456851.1| hypothetical protein SORBIDRAFT_03g044050 [Sorghum bicolor]
 gi|241928826|gb|EES01971.1| hypothetical protein SORBIDRAFT_03g044050 [Sorghum bicolor]
          Length = 348

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 52/228 (22%), Positives = 92/228 (40%), Gaps = 36/228 (15%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLV----PPAPDSQTAHKFGSQGFFNFTSRRPCHLL 177
           + VLVIA D     +     P H  L+      A +  +A++F ++ F      +     
Sbjct: 111 NHVLVIAVDAGGFSRCKAVHP-HCYLLEVNKSTAANLSSANRFMTKEFLELVWLKLSFQQ 169

Query: 178 HILELGYNVMYNDVDMVWLKDPFPYLQGDHDV-----YFTDDMAAVKPLDHSHDLPPPGK 232
            ILELGY+ ++ D DM+WL++PF ++    D+     YF D  A   PL ++        
Sbjct: 170 RILELGYSFLFTDADMIWLRNPFRHISVYADMSLSTDYFRDTFA---PLSNT-------- 218

Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDL 292
                + + + Y+R T+ +  V++ W       P       ++Q  FN   ++   ++  
Sbjct: 219 -----LNTGLYYMRSTNRSIEVLRYWRAARARFP-----GGSEQGVFNEIKHEVVTKLQA 268

Query: 293 YLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRD 340
            +    A  T  +YF           +   +H N   G   K+   RD
Sbjct: 269 RI---EAVET--VYFSGFCEYHGELNRVCTMHANCCIGLANKVLDLRD 311


>gi|397604899|gb|EJK58811.1| hypothetical protein THAOC_21024, partial [Thalassiosira oceanica]
          Length = 708

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 68/152 (44%), Gaps = 23/152 (15%)

Query: 83  SQAASFVAKNGTIIVCAVSQPY-----LPFLNNWLI------SISRQKHQDQVLVIAEDY 131
           S+  + +AK GT  V  ++  +     LPF+ + L+      S S+    + V+V   D 
Sbjct: 330 SRLKALLAKMGTKTVVVLTSNFGQLRSLPFMKSELLINFACNSRSKGFSLNNVIVFPTDL 389

Query: 132 ATL-----YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
            T        +N  +  H +   P    + A  +G Q F      +   +  + ELGY+V
Sbjct: 390 ETKELSEGMGLNTFYDEHLMASVP---KREARYYGDQIFTGVMFSKVVCVQLVNELGYDV 446

Query: 187 MYNDVDMVWLKDPFPYLQGDH----DVYFTDD 214
           ++ DVD+VW KDP  Y   +     D+YF DD
Sbjct: 447 LFQDVDLVWFKDPLTYFHNESLPQFDMYFQDD 478


>gi|323449936|gb|EGB05820.1| hypothetical protein AURANDRAFT_66031 [Aureococcus anophagefferens]
          Length = 619

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 111/282 (39%), Gaps = 34/282 (12%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQD----QVLVIAEDYATLYKV 137
           L ++    A++ T++V A +   L  + N++ S++     +     VLV A D  T   V
Sbjct: 311 LIKSRKIAARHDTVVVMATNLGTLDLVANFVCSVASVPELEPTLASVLVFASDGGTRDAV 370

Query: 138 NGRWPGHAVLVPPAPD---SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMV 194
                G A    PA     S  A  +G + F      +   +  +L L +NV++ D D+V
Sbjct: 371 EKL--GIAAFSDPALGDLPSDAAKVYGDRSFVRMMWLKVTSVYLVLSLQHNVLFQDADVV 428

Query: 195 WLKDPFPY---LQGDH-DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDG 250
           W +DP PY   +  D  D ++ DD A       S    P       Y  S   +LR  + 
Sbjct: 429 WFRDPLPYFAEIADDQVDTFWMDDGA------RSSRYTP------WYANSGFYFLRANER 476

Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDL--YLLPQSAFPTGGLYFK 308
               M + +         ++     Q A    L     +  L   LLP + FP G ++  
Sbjct: 477 VVFFMHRLLMSYDVILAVRSH----QHALIMLLTDLMAKHGLTAQLLPNTDFPQGQVFHH 532

Query: 309 NQTWVEE---TKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
            +  +++    + K  + H  +      K++  ++  LW +D
Sbjct: 533 KKEIMKDFVAGRRKPYVFHMCWTASRVDKLRYLKNIALWFLD 574


>gi|414878979|tpg|DAA56110.1| TPA: hypothetical protein ZEAMMB73_844418 [Zea mays]
          Length = 350

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 62/131 (47%), Gaps = 11/131 (8%)

Query: 94  TIIVCAVSQPYLPFLNNWLISISRQKHQ---------DQVLVIAEDYATLYKVNGRWPGH 144
           T+I+ +V++ +     + L+ + R+  Q         + VLV+A D           P  
Sbjct: 66  TVIITSVNEAFAR--PDSLLGLFRESFQAGEGIGHLLNNVLVVAVDAKAFSHCRAVHPHC 123

Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
            +L     D  +A+ + S+ +      +   L  ILELGYN ++ DVD+VW ++PF ++ 
Sbjct: 124 YLLEVKTIDLSSANNYMSEAYIELVWTKLSLLQRILELGYNFLFTDVDIVWFRNPFRHIS 183

Query: 205 GDHDVYFTDDM 215
              D+  + D+
Sbjct: 184 VFADMTTSSDV 194


>gi|18410771|ref|NP_565102.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
 gi|12323891|gb|AAG51917.1|AC013258_11 unknown protein; 7482-9019 [Arabidopsis thaliana]
 gi|14517393|gb|AAK62587.1| At1g75110/F9E10_4 [Arabidopsis thaliana]
 gi|20857366|gb|AAM26715.1| At1g75110/F9E10_4 [Arabidopsis thaliana]
 gi|332197552|gb|AEE35673.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
          Length = 428

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 102/256 (39%), Gaps = 28/256 (10%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA N  IIV   +    P L   + S+ R   Q+ ++V  +D    +          V  
Sbjct: 153 VAVNKEIIVVLANSNVKPMLELQIASVKRVGIQNYLIVALDDSMESFC----ESKEVVFY 208

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD        S G    +  +   L   L+LGY+V+ +DVD+V+L++PF +L  D D
Sbjct: 209 KRDPDKAVDMVGKSGGNHAVSGLKFRVLREFLQLGYSVLLSDVDIVFLQNPFSHLHRDSD 268

Query: 209 V-YFTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKWI 259
           V   +D  D       +   D P  G        R ++  S   YLRPT  +  ++ +  
Sbjct: 269 VESMSDGHDNNTAYGFNDVFDEPSMGWARYAHTMRIWVFNSGFFYLRPTIPSIDLLDRVA 328

Query: 260 EEL-QAEPWSKAKKANDQPAFNWAL------NKTAGQVDLYLLPQSAFPTGGLYFKNQTW 312
           + L ++E W       DQ  FN  L        T       ++    F    + FK    
Sbjct: 329 DTLSKSEAW-------DQAVFNEQLFYPSHPGYTGLHASKRVMDMYEFMNSKVLFKTVRK 381

Query: 313 VEETKG-KHVIIHNNY 327
            +E K  K VI+H NY
Sbjct: 382 NQELKKLKPVIVHLNY 397


>gi|449433169|ref|XP_004134370.1| PREDICTED: uncharacterized protein At1g28695-like [Cucumis sativus]
          Length = 305

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/140 (23%), Positives = 61/140 (43%), Gaps = 15/140 (10%)

Query: 80  YTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLVI 127
           + L +AA     N T+++  +++ Y              FL+ + +    +K  D +L++
Sbjct: 22  FALEKAA---MANKTVVITVINKAYADQGVRDDTTMLDVFLSGFWLGEDTRKLLDHLLLV 78

Query: 128 AEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVM 187
           A D     +   +      L     D      + S+ F     +R   LL +L+ GY+ +
Sbjct: 79  AVDQTAYDRCRFQRLNCFKLETEGVDFGGEKLYMSEEFIKMMWKRTLFLLEVLKRGYSFI 138

Query: 188 YNDVDMVWLKDPFPYLQGDH 207
           + D D++WL+DPFP L  D 
Sbjct: 139 FTDTDVMWLRDPFPKLSKDE 158


>gi|357128785|ref|XP_003566050.1| PREDICTED: uncharacterized protein LOC100833479 [Brachypodium
           distachyon]
          Length = 711

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 105/278 (37%), Gaps = 39/278 (14%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL-YKVNGRWPGHAVLVP 149
           +N ++++      Y   L +W   + R       +V A D+ T  + V    P       
Sbjct: 422 ENRSVVLGVAGASYRDMLMSWACRL-RHLGVTNFVVCALDHETYEFSVLQGLPVFRDPSS 480

Query: 150 PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
           P   S     FG+  F   T  +   +L IL LGYNV+ +DVD+ W  +P  +L      
Sbjct: 481 PKNVSFDDCHFGTPCFQRVTKVKSRVVLEILRLGYNVLLSDVDVYWFDNPVQFLYSLGSA 540

Query: 210 YF---TDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
            F   +D+     P+    +LP     G         Y R        M+  ++      
Sbjct: 541 TFAAQSDEYNETGPI----NLPRRLNSG-------FYYARSDHATITAMEMVVKH----- 584

Query: 267 WSKAKKANDQPAFNWALNKTAG--------------QVDLYLLPQSAFPTGG---LYFKN 309
            +    +++QP+F   L    G               + +  L ++ FP G    L+ + 
Sbjct: 585 -ANKSSSSEQPSFYDILCGQEGANRHGNNICLEPSTNLTVMFLDRNLFPNGAYRRLWERR 643

Query: 310 QTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
                  +    ++HNN++ G +KK++R    GLW  D
Sbjct: 644 DVRSACRELGCFVLHNNWVNGRKKKLRRQMASGLWDYD 681


>gi|384251936|gb|EIE25413.1| hypothetical protein COCSUDRAFT_61632 [Coccomyxa subellipsoidea
           C-169]
          Length = 178

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 73/166 (43%), Gaps = 29/166 (17%)

Query: 94  TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPD 153
           TI+    ++  LP     L  +SR +   +VLV+A D   L     +   HA  +P    
Sbjct: 3   TIVDAEAAEILLPVFLESLQKVSR-RLVPRVLVLASDAQGLQICRTK---HAHCLP---- 54

Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
                      +F   S     L  +L   +NV+  DVDMVWLKDPF  L+ +     +D
Sbjct: 55  -----------WFQRLSAAIGALQQLLAKNFNVLAADVDMVWLKDPFAALEPN----LSD 99

Query: 214 DMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWI 259
            +  V   + + D   P +K     C+ +I+ R T+  + ++K W+
Sbjct: 100 FLMTVDSTNSTDD--SPAQKP----CAGLIFARATEPTRALVKSWV 139


>gi|357131573|ref|XP_003567411.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 357

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 38/269 (14%)

Query: 86  ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQD---------QVLVIAEDYATLYK 136
           A    ++ T+I+ +V++ +    N  L+ + RQ  ++          VLV+A D A    
Sbjct: 79  ARVAMEDRTVIITSVNEAWAR--NGSLLDLYRQSFKNGEDTEHLLNHVLVVALDPAGFRH 136

Query: 137 VNGRWPGHAVLVPPAPDSQT-AHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
            N   P H  L+    D+ T A +F S+ + +    +      +LELGYN ++ D DM+ 
Sbjct: 137 CNIVHP-HCYLLGATNDNFTSAAQFMSKEYLDLVWTKLSLQRRVLELGYNFLFTDTDMIV 195

Query: 196 LKDPFPYLQGDHDVYFTDD--MAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKL 253
           L++PF ++    D+  + D   A   PLD+              + +   Y++ T+ +  
Sbjct: 196 LRNPFRHITVHADMSVSCDSFSATRAPLDNR-------------VNTGFYYMKATNRSME 242

Query: 254 VMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWV 313
           +++ W       P       +DQ  F    ++   ++ + + P        +YF N    
Sbjct: 243 LLRYWQAARTRFPGD-----HDQGVFYNIKHELVEKLKVRIEPLDT-----VYFSNFCEY 292

Query: 314 EETKGKHVIIHNNYITGFEKKIKRFRDFG 342
               G    +H     G + K+    D  
Sbjct: 293 HNDLGSACTMHAACCKGLDNKVHDLMDMA 321


>gi|449463499|ref|XP_004149471.1| PREDICTED: uncharacterized protein At4g15970-like [Cucumis sativus]
          Length = 350

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 62/137 (45%), Gaps = 10/137 (7%)

Query: 79  DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
           D  L  AA+   ++ T+I+  +++ +         FL ++ I     +  D +++IA D 
Sbjct: 74  DKVLKDAAT---EDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHLVIIALDK 130

Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
               +          LV    D ++   F S  +     RR   L  +LE+GYN ++ D 
Sbjct: 131 KAFMRCLDIHIHCVSLVTEGVDFRSEAYFMSPDYLKMMWRRIDFLRTVLEMGYNFVFTDA 190

Query: 192 DMVWLKDPFPYLQGDHD 208
           D++W +DPFP+   + D
Sbjct: 191 DVMWFRDPFPFFDINAD 207


>gi|302787072|ref|XP_002975306.1| RRA1, glycosyltransferase CAZy family GT77-like protein
           [Selaginella moellendorffii]
 gi|300156880|gb|EFJ23507.1| RRA1, glycosyltransferase CAZy family GT77-like protein
           [Selaginella moellendorffii]
          Length = 431

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/212 (26%), Positives = 88/212 (41%), Gaps = 22/212 (10%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L+Q    +A N  +IV   ++   P L  W  SI +    + ++V  +D    +  +   
Sbjct: 149 LAQLLEKIAINRELIVGVSNKNVAPMLQVWFESIKQSGITNYLVVALDDETAKFCKDHDV 208

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P +        D+         G  +  S    H+L   L LGY+V+ +DVD+V+L++PF
Sbjct: 209 PAYR------KDATIPKSLAGTGDNHAISGTKFHILREFLVLGYSVLLSDVDIVYLQNPF 262

Query: 201 PYLQGDHDVYFTDD---MAAVKPLDHSHDLPPPG-----KKGRTYIC-SCMIYLRPTDGA 251
            +LQ D DV    D    A     +   D P  G        R ++  S + YLRPT  +
Sbjct: 263 KFLQRDCDVEAMTDGHTNATAYGYNDVFDDPKMGWSRYAHTMRIWVFNSGLFYLRPTVPS 322

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
             ++ +    L  E      KA DQ   N  L
Sbjct: 323 IELLDRVATRLARE------KAWDQAVINEEL 348


>gi|42568325|ref|NP_199295.2| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
 gi|109946407|gb|ABG48382.1| At5g44820 [Arabidopsis thaliana]
 gi|332007782|gb|AED95165.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
          Length = 367

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/143 (23%), Positives = 58/143 (40%), Gaps = 7/143 (4%)

Query: 90  AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
            KN T+I+  ++Q +         FL ++ I    Q+    V+V+  D     + +    
Sbjct: 101 TKNNTVIITTLNQAWAEPNSLFDLFLESFRIGQGTQQLLKHVVVVCLDIKAFERCSQLHT 160

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               +     D      + +  +      R   L  +LE+G+N ++ D D++WL+DPFP 
Sbjct: 161 NCYHIETSETDFSGEKVYNTPDYLKMMWARIDLLTQVLEMGFNFIFTDADIMWLRDPFPR 220

Query: 203 LQGDHDVYFTDDMAAVKPLDHSH 225
           L  D D     D     P D  +
Sbjct: 221 LYPDGDFQMACDRFFGNPYDSDN 243


>gi|302822869|ref|XP_002993090.1| RRA1, glycosyltransferase CAZy family GT77-like protein
           [Selaginella moellendorffii]
 gi|300139090|gb|EFJ05838.1| RRA1, glycosyltransferase CAZy family GT77-like protein
           [Selaginella moellendorffii]
          Length = 431

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 88/212 (41%), Gaps = 22/212 (10%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L+Q    +A N  +IV   ++   P L  W  SI +    + ++V  +D    +  +   
Sbjct: 149 LAQLLEKIAINREVIVGVSNKNVAPMLQVWFESIKQSGITNYLVVALDDETAKFCKDHDV 208

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P +        D+         G  +  S    H+L   L LGY+V+ +DVD+V+L++PF
Sbjct: 209 PAYR------KDATIPKSLAGTGDNHAISGTKFHILREFLVLGYSVLLSDVDIVYLQNPF 262

Query: 201 PYLQGDHDVYFTDD---MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
            +LQ D D+    D    A     +   D P  G        R ++  S + YLRPT  +
Sbjct: 263 KFLQRDCDIEAMTDGHTNATAYGYNDVFDDPKMGWSRYAHTMRIWVFNSGLFYLRPTVPS 322

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
             ++ +    L  E      KA DQ   N  L
Sbjct: 323 IELLDRVATRLARE------KAWDQAVINEEL 348


>gi|302840365|ref|XP_002951738.1| hypothetical protein VOLCADRAFT_92385 [Volvox carteri f.
           nagariensis]
 gi|300262986|gb|EFJ47189.1| hypothetical protein VOLCADRAFT_92385 [Volvox carteri f.
           nagariensis]
          Length = 783

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/183 (26%), Positives = 83/183 (45%), Gaps = 13/183 (7%)

Query: 86  ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAE-DYATLYKVNGRWPGH 144
           A +   N  ++    S  +  F   WL  + +   +  VL +A+ D A L +  G    H
Sbjct: 187 AQWAVNNTVMVAFTNSIMFRNFGPTWLHHVRKAGIKYWVLAVADNDTAKLVRNYG--ADH 244

Query: 145 AVLVPPAPDSQTAHKF--GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP- 201
             LV       TA  F  GS+ +   T ++   + H+  LG++V+ +D+D+VWL++P   
Sbjct: 245 CFLVHENEIDDTAADFKWGSRSWQLHTWQKVLTVRHVHMLGFHVINSDMDVVWLRNPLEH 304

Query: 202 ----YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKK 257
               Y + D+ V   D +    PL    D P  G     Y+ + + +LR T G + ++ K
Sbjct: 305 FLVKYTEPDYWVSM-DPITTANPL--GDDGPELGVSTHHYMNTGVYFLRQTPGGRALIDK 361

Query: 258 WIE 260
           W E
Sbjct: 362 WYE 364


>gi|412990871|emb|CCO18243.1| predicted protein [Bathycoccus prasinos]
          Length = 711

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/210 (22%), Positives = 87/210 (41%), Gaps = 49/210 (23%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGR 235
           LL +++ G+ V+ +D D  W++DP  +       + T++MA    +  S D      + +
Sbjct: 276 LLRLMQKGFRVLVSDCDTAWMRDPREW-------FVTNEMAKYVDMAVSTDCLSYKNEEK 328

Query: 236 TYIC------SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG- 288
              C      + +++L PT+  K  +K W   L+    +  K  +DQ  FN  L + AG 
Sbjct: 329 VRGCWHDQFNTGILFLNPTEKTKGFLKDWQVALET---TTHKFEHDQDIFNRLLREGAGL 385

Query: 289 -------QVD-----------------------LYLLPQSAFPTGGLYFKNQTWVEETKG 318
                  ++D                       L  LP + F +G ++F  QT +     
Sbjct: 386 RPPQRLDEIDVENRKEGAERHELHVALASKDLKLGALPLTLFTSGHVFFIQQTHL-HLNV 444

Query: 319 KHVIIHNNY-ITGFEKKIKRFRDFGLWLVD 347
           +  ++H  Y  +    K +R R+  LWL+D
Sbjct: 445 EPFVVHTTYQFSQARGKRQRLREHQLWLID 474


>gi|255084147|ref|XP_002508648.1| hypothetical protein MICPUN_66770 [Micromonas sp. RCC299]
 gi|226523925|gb|ACO69906.1| hypothetical protein MICPUN_66770 [Micromonas sp. RCC299]
          Length = 424

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 84/196 (42%), Gaps = 36/196 (18%)

Query: 183 GYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDDM-----AAVKPLDHSHDLPPPGKKGR 235
           G +V+ +D+D+ WL+DP P+ +     D+  + D+     A   PL   H +   G +  
Sbjct: 106 GISVLISDIDVAWLRDPTPFFKRYPSADILVSTDLLRSEIALDPPLQTPHLVDGEGLE-- 163

Query: 236 TYICSC-----MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN---------- 280
            ++C       +++ RPT G++ + ++W+  ++A+      K  DQ AFN          
Sbjct: 164 FHVCHAASNIGIMWFRPTRGSQQLTEEWVRRIEAD-----DKLWDQNAFNDLKALAGACA 218

Query: 281 -----WALNKTA--GQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEK 333
                  L  TA  G+V +  LP S F  G  ++  +   +     + + +     G   
Sbjct: 219 YRPDGTGLTDTAFGGRVTMGTLPVSQFSNGHTFYAQRLHTQVGLEPYAVHNTFQFGGTPG 278

Query: 334 KIKRFRDFGLWLVDDH 349
           K  R R+   WL D+ 
Sbjct: 279 KRHRAREANAWLGDEE 294


>gi|224111368|ref|XP_002315829.1| predicted protein [Populus trichocarpa]
 gi|222864869|gb|EEF02000.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 74/171 (43%), Gaps = 14/171 (8%)

Query: 79  DYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAED 130
           DY L +     A ++ T+I+  +++ +         FL ++ I    ++  + ++++A D
Sbjct: 105 DYKLEKVLKEAAMEDKTVIIATLNEAWAAPNTIIDLFLESFRIGQGTRRLLNHLVIVALD 164

Query: 131 YATLYKVNGRWPGHA-VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYN 189
               YK    +  H   LV    D      F +  +     RR   L  +L++GYN ++ 
Sbjct: 165 RKA-YKRCMEFHAHCFALVTQGLDFHDEAYFMTPAYLEMMWRRIDFLRAVLQMGYNFVFT 223

Query: 190 DVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           D D++W +DPFP    D D      +A    L +S D+      G  Y+ S
Sbjct: 224 DADIMWFRDPFPRFYLDADF----QIACDHFLGNSSDIQNRPNGGFNYVKS 270


>gi|308802167|ref|XP_003078397.1| unnamed protein product [Ostreococcus tauri]
 gi|116056849|emb|CAL53138.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 556

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/221 (23%), Positives = 94/221 (42%), Gaps = 31/221 (14%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVL--VIAEDYATLYKVNG-RWPGHA 145
           VAK+G +IV   +  Y  F  NWL  +     ++ ++  +  E YA L K+    W    
Sbjct: 44  VAKDGAVIVTWANSHYYDFALNWLRHLDALGVENYLIGAMDEEMYAKLRKIGVPCW---- 99

Query: 146 VLVPPAPDSQTAHK---FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
           ++     D +   +   +GS+ F      +   +    + G +V+ +D+D+ WL+DP P+
Sbjct: 100 LMGSQGIDKEAVTRDFGWGSKNFHKMGRDKIRLIRDFTKTGTDVLISDIDVAWLRDPIPF 159

Query: 203 LQG--DHDVYFTDD----------MAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDG 250
            +     D+  + D              K +D    L PP   G   I   M++ R T G
Sbjct: 160 FRRYPKADILVSTDNLRNMTHESPEQIAKTVD-GEGLEPPC-AGTANI--GMMWFRSTSG 215

Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVD 291
           ++ +  +W+  L+     K  K  DQ  FN  ++K    +D
Sbjct: 216 SQAITDEWVNNLE-----KDDKIWDQAEFNTLMHKGGCGMD 251


>gi|384248111|gb|EIE21596.1| hypothetical protein COCSUDRAFT_17381, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 529

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 72/290 (24%), Positives = 114/290 (39%), Gaps = 38/290 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA  G ++V   +  YL F+ NWL  ++    +   LV A D   L  +  R  G     
Sbjct: 12  VAPQGPVMVTWANFHYLDFVLNWLAHVNALGIKP--LVGAMDDKILQALVDR--GVHTFA 67

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--D 206
             +  S+    +GS  F      +   +    ++G++++  DVD VW  +PFPY+    D
Sbjct: 68  MRSGLSEDDFGWGSASFHKMGREKIQLIYTFTKMGFDILVADVDTVW--NPFPYMARYPD 125

Query: 207 HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
            D+  + D       D   +  P    G       M+  +   GA  + K+W E L A+ 
Sbjct: 126 ADILTSSDHLRNSTADDGLERFP--DAGSAANIGIMLVRK---GALSLAKEWNEVLLAD- 179

Query: 267 WSKAKKANDQPAFNWALNKTA----------------GQVDLYLLPQSAFPTGGLYFKNQ 310
                +  DQ AFN    +                  G + + +LP + F +G  YF  +
Sbjct: 180 ----DQVWDQNAFNDLFRRDMKFDGPESANRIFRGYDGNLRVGILPVALFASGHTYFVQR 235

Query: 311 TWVEETKGKHV-IIHNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
             + E  G  V  +H  +  +G   K  R R+  LWL D      P G L
Sbjct: 236 --LHEQMGLDVYAVHATFQYSGTPGKRHRMRERLLWLADPPEYYDPPGGL 283


>gi|255084750|ref|XP_002504806.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
 gi|226520075|gb|ACO66064.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
          Length = 750

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 81/211 (38%), Gaps = 48/211 (22%)

Query: 183 GYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCM 242
           G+ V+ +DVD VWL DP  + + D     TD   +   L H  +    G  G  +  + +
Sbjct: 272 GHRVLVSDVDTVWLDDPREWFERDDLPTRTDVSVSTDCLSHEEERRSRGCWGPGF-NTGI 330

Query: 243 IYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL------------------- 283
           ++LRPT+    +M  W + L     +  K  +DQ  FN  L                   
Sbjct: 331 LWLRPTEATINLMATWRDALLT---TSDKFEHDQDIFNKLLRVEQDGSPASFAAVDPPLG 387

Query: 284 -----------------------NKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKH 320
                                   + A  + L  LP + F +G  YF  Q   +    K 
Sbjct: 388 VRSTDSSFAEADGDGRVGEALHMRRVARGIVLGALPLARFCSGHTYFV-QRLPQRLGVKP 446

Query: 321 VIIHNNY-ITGFEKKIKRFRDFGLWLVDDHA 350
           +++H  Y  +    K +R R+ GLWL+DD A
Sbjct: 447 LVVHTTYQFSQARGKRQRLREAGLWLLDDDA 477


>gi|449528663|ref|XP_004171323.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           At4g15970-like [Cucumis sativus]
          Length = 350

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/137 (24%), Positives = 62/137 (45%), Gaps = 10/137 (7%)

Query: 79  DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
           D  L  AA+   ++ T+I+  +++ +         FL ++ I     +  D +++IA D 
Sbjct: 74  DKVLKDAAT---EDKTVILTTLNEAWASPNAVIDLFLQSFRIGNRTHQLLDHLVIIALDX 130

Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
               +          LV    D ++   + S  +     RR   L  +LE+GYN ++ D 
Sbjct: 131 KAFMRCLDIHIHCVSLVTEGVDFRSEAYYMSPDYLKMMWRRIDFLRTVLEMGYNFVFTDA 190

Query: 192 DMVWLKDPFPYLQGDHD 208
           D++W +DPFP+   + D
Sbjct: 191 DVMWFRDPFPFFDINAD 207


>gi|222625288|gb|EEE59420.1| hypothetical protein OsJ_11574 [Oryza sativa Japonica Group]
          Length = 683

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 86/216 (39%), Gaps = 27/216 (12%)

Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           GS  F      +   +  +L  GY ++  D DMVWLK+P PYL    D         V P
Sbjct: 106 GSPTFHKMGREKVLLINALLPFGYELLMCDTDMVWLKNPLPYLARYPDADLLTSSDQVIP 165

Query: 221 LDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP--WSKAKKANDQPA 278
                 L    +    Y    + + RPT+ AK + K W + L ++   W       DQ A
Sbjct: 166 TVTDDSLENWREVTGAYNIG-IFHWRPTEPAKRLAKDWKDLLLSDDTLW-------DQNA 217

Query: 279 FNWALNKT---------------AGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVII 323
           FN  ++K                 G++ L +LP S F +G  YF  Q   ++ + +   +
Sbjct: 218 FNDLIHKKFGYPVVGEDELVYSYDGKLKLGVLPASIFCSGHTYFV-QGMYQQLRLEPYAV 276

Query: 324 HNNY-ITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           H  +   G   K  R R+  L+       +SP G L
Sbjct: 277 HTTFQYAGTVGKRHRLREAMLFFDQPSYYDSPGGFL 312


>gi|307107253|gb|EFN55496.1| hypothetical protein CHLNCDRAFT_57843 [Chlorella variabilis]
          Length = 394

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 94/209 (44%), Gaps = 21/209 (10%)

Query: 78  RDYTLSQAASF---VAKNGTIIVCAVSQPYL---PFLNNWLISISRQKHQDQVLVIAEDY 131
           RD T  + A F   VA N  ++V   ++ Y      L  W  ++ R   ++ +++  +D 
Sbjct: 102 RDKTYPELAEFLKKVAVNNEVLVSVSNKNYAWPGGMLQLWAENVKRTGVKNAMVIALDDD 161

Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
                 +   P   + V   PDSQ     GS    +    R   L + ++LGY+V  +DV
Sbjct: 162 TKSNAESFGLPAFRMDVK-IPDSQK--DVGSNHAVSALKFRI--LQNFMKLGYSVFLSDV 216

Query: 192 DMVWLKDPFPYLQGDHDVY-FTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCM 242
           D+V+L++PF +L  D DV   TD  D       +   D P  G        R ++  S +
Sbjct: 217 DIVFLQNPFEHLARDSDVEGMTDGWDHGTAYGYNDVADDPSMGWARYAHSMRIFVFNSGL 276

Query: 243 IYLRPTDGAKLVMKKWIEELQAE-PWSKA 270
            YLRPT+  + ++ K I  ++ E  W +A
Sbjct: 277 FYLRPTNATQELLDKLIYRVETENGWDQA 305


>gi|255081748|ref|XP_002508096.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
 gi|226523372|gb|ACO69354.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
          Length = 994

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 88/206 (42%), Gaps = 36/206 (17%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D+Q +HK  +Q F          +   +E G +V+  D D+ W+++PFP+L  D DV   
Sbjct: 237 DAQGSHKISAQKFR--------IIQEFVERGCSVLLTDTDVAWMRNPFPFLYRDADVESM 288

Query: 213 ----DDMAAVKPLDHSHD--LPPPG-KKGRTY----ICSCMIYLRPTDGAKLVMKKWIEE 261
               D+ +A   LD   D  + P G K+ R +    + S M Y+  T+ ++ +M      
Sbjct: 289 SDGWDNSSAHGFLDRVDDPSMGPDGRKRARAFRVAALNSGMWYVSATEASRRLMAIMAHR 348

Query: 262 LQAEPWSKAKKANDQPAFNWAL-------NKTAGQVDLYLLPQSAFPTGGLYFK----NQ 310
           +  E      K  DQ  +N  L       + TAG     + P   F    + F+    NQ
Sbjct: 349 MATE-----DKLWDQAGYNLELWFASRDAHGTAGATVRVMDPL-CFVNSKVMFRFIRHNQ 402

Query: 311 TWVEETKGKHVIIHNNYITGFEKKIK 336
             + +   + V +H NY T    K+K
Sbjct: 403 PALGKENHRPVAMHANYHTDKAHKMK 428


>gi|303286932|ref|XP_003062755.1| hypothetical protein MICPUCDRAFT_36030 [Micromonas pusilla
           CCMP1545]
 gi|226455391|gb|EEH52694.1| hypothetical protein MICPUCDRAFT_36030 [Micromonas pusilla
           CCMP1545]
          Length = 599

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/196 (22%), Positives = 79/196 (40%), Gaps = 35/196 (17%)

Query: 183 GYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM-----AAVKPLDHSHDLPPPGKKGR-- 235
           G +V+ +D+D+ WL+DP PY +     Y T DM       +K ++ + D           
Sbjct: 138 GVSVLISDIDVAWLRDPLPYFK----RYPTADMLVSSDTTLKTIESAADAAADAAAVADD 193

Query: 236 --------TYICSC-----MIYLRPTDGAKLVMKKWIEELQAEP--W---------SKAK 271
                   ++ C+      +++ RPT G++ + ++W++ ++A+   W         SK  
Sbjct: 194 ADDDDGLDSHPCNAASNIGIMFFRPTPGSRALTEEWVKTIEADESVWDQNAFNDLKSKGG 253

Query: 272 KANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGF 331
                P  +  L    G V L  LP S F  G  +   +T  +  K    + +     G 
Sbjct: 254 ACQSSPDAHGLLKAYNGTVTLGALPVSQFGNGHTFHAQRTHSQRKKAPFAVHNTFQFGGT 313

Query: 332 EKKIKRFRDFGLWLVD 347
             K  R R+   WL D
Sbjct: 314 PGKRHRMREANAWLGD 329


>gi|326498413|dbj|BAJ98634.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 375

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 47/214 (21%), Positives = 91/214 (42%), Gaps = 27/214 (12%)

Query: 92  NGTIIVCAVSQPYLPFLNNWLISISRQKHQD---------QVLVIAEDYATLYKVNGRWP 142
           +GT+I+ +V++ +       L+ + R    D           L++A D   L       P
Sbjct: 102 DGTVIITSVNEAFAR--PGSLLDLFRGSFHDGEGIAHLLNHTLIVAADPGALALCKAVHP 159

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
              +L   A    +A+ F ++ +      +     H+L+LGYN +Y D+D++WL++PF +
Sbjct: 160 HCYLLQVMAAGVSSANGFLTRSYLELVWSKLTFQHHVLQLGYNYLYTDLDVLWLRNPFRH 219

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           +    D+  + D    +    + DL      G         Y+R T+    ++ +W    
Sbjct: 220 ISIYADMAISTD----RFNGGAEDLKNAPNTG-------FYYVRSTNRTVEMLSRW---- 264

Query: 263 QAEPWSKAKKANDQPAFNWALNK-TAGQVDLYLL 295
           +A       KA+DQ  F     +  AG++ + L+
Sbjct: 265 RAARSRFRPKAHDQEVFEAIKGEFVAGELQIKLV 298


>gi|224136390|ref|XP_002326848.1| predicted protein [Populus trichocarpa]
 gi|222835163|gb|EEE73598.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 54/125 (43%), Gaps = 12/125 (9%)

Query: 92  NGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
           N T+I+  V+Q Y+             FL+++ +    +   D +LV+A D         
Sbjct: 22  NKTVIITVVNQAYVEQSVDAETTMLDLFLDSFWLGEDTRPLLDHLLVVAVDQIAYEMCFF 81

Query: 140 RWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
           +      L     D      + SQ F N   RR   LL +L+ GYN ++ D D++WL++P
Sbjct: 82  KGLNCYKLETEGVDFGGEKIYMSQDFINMMWRRTLFLLDVLKRGYNFIFTDTDVMWLRNP 141

Query: 200 FPYLQ 204
              L 
Sbjct: 142 LSRLS 146


>gi|242055415|ref|XP_002456853.1| hypothetical protein SORBIDRAFT_03g044070 [Sorghum bicolor]
 gi|241928828|gb|EES01973.1| hypothetical protein SORBIDRAFT_03g044070 [Sorghum bicolor]
          Length = 376

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/217 (20%), Positives = 89/217 (41%), Gaps = 26/217 (11%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D VL++  D A+        P   +L   + D   A  FGS  +      +      +LE
Sbjct: 142 DHVLIVTVDAASFSGCKAAHPHCYLLEVKSMDMNRAKSFGSPEYVELNWLKLSVQQRVLE 201

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP--LDHSHDLPPPGKKGRTYIC 239
           LGYN ++ D D++WL++PF  +    D+  + D + + P  LD  +++            
Sbjct: 202 LGYNFLFTDADILWLRNPFQRISVYADMSCSLDNSKMAPTLLDCENNIG----------- 250

Query: 240 SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLP-QS 298
               Y++ T+ +  +++ W    +A         N+Q  F+    +   ++ +   P ++
Sbjct: 251 --FYYMKATNRSVELVRYW----RAARARFDGNPNEQVVFSNIKRELISKLGVRFQPLET 304

Query: 299 AFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKI 335
            + +G   F++         K   +H N   G E K+
Sbjct: 305 EYISGFCDFQDHL------DKVCTVHANCCMGLENKV 335


>gi|356503224|ref|XP_003520411.1| PREDICTED: uncharacterized protein At1g28695-like [Glycine max]
          Length = 353

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 66/161 (40%), Gaps = 17/161 (10%)

Query: 79  DYTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLV 126
           DY L++A+     N T+I+  V++ Y+             FL ++ +    +   D +L+
Sbjct: 62  DYALAKAS---MGNKTVIIAIVNKAYVEQDVESDTTMLDIFLGSFWLGEGTRSLIDHLLI 118

Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
           +A D     +          L       +    + SQ F     RR   LL +L+ GYN 
Sbjct: 119 VAVDQTAYNRCQFLRLNCFRLETDGVGFEGEKIYMSQDFIKMMWRRTQFLLEVLKRGYNF 178

Query: 187 MYNDVDMVWLKDPFPYLQGD--HDVYFTDDMAAVKPLDHSH 225
           ++ D D++WL++PF  L  +   D   + D     P    H
Sbjct: 179 VFTDTDVMWLRNPFIRLSKNETEDFQISTDSYLGNPWSEKH 219


>gi|18396892|ref|NP_564315.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
 gi|334182926|ref|NP_001185108.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
 gi|10764852|gb|AAG22832.1|AC007508_5 F1K23.9 [Arabidopsis thaliana]
 gi|15027991|gb|AAK76526.1| unknown protein [Arabidopsis thaliana]
 gi|20259205|gb|AAM14318.1| unknown protein [Arabidopsis thaliana]
 gi|332192896|gb|AEE31017.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
 gi|332192898|gb|AEE31019.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
          Length = 340

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 48/236 (20%), Positives = 92/236 (38%), Gaps = 38/236 (16%)

Query: 51  VVLGVILPWTGTPGFMFPNATSS----------LAKWRDYTLSQAASFVAK-----NGTI 95
           + + V L + G   F F + T S            +W +Y + +  + + K     N T+
Sbjct: 10  LAVAVALLFAGALYFYFSSITVSDPMSDLLHNVETRWTEYPVDELEAVLDKAAMGNNKTV 69

Query: 96  IVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           I+  V++ Y+            FL ++      +   D ++++A D  +  +   R    
Sbjct: 70  IIAMVNKAYVEEVEGGRTMLDLFLESFWEGEGTRPLLDHLMLVAADQTSYDRCLFRRLHC 129

Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
             +     D +    + S+ F     RR   LL +L  GYN+ + D D++WL+ PFP L 
Sbjct: 130 YKMDTDGVDLEGEKVYMSKDFIEMMWRRTHLLLDVLSRGYNLTFTDTDVMWLRSPFPRLS 189

Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIE 260
            +            + LD    +   G  G   I +   ++R  +    + +KW +
Sbjct: 190 YN------------ESLDMQISVDSIGLVGGHLINTGFYHVRSNNKTISLFQKWYD 233


>gi|108705996|gb|ABF93791.1| regulatory protein, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 339

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/269 (22%), Positives = 101/269 (37%), Gaps = 41/269 (15%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           NGT+I+  ++  +         FL ++ I    +   D +++++ D  T ++   +   H
Sbjct: 3   NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 61

Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
             A+      D      F + G+     RR   L H+L  GY+ ++ D D+VW ++P P+
Sbjct: 62  CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 121

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           L  D D     D     P D S+   P G            Y+R T       + W    
Sbjct: 122 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 170

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAG----QVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           +  P       +DQ   N  + + A      V +  L    F     + +N + V     
Sbjct: 171 ERHP-----GLHDQDVLN-LIKRDAYVARLGVRIRFLSTDLFAGLCEHGRNLSTV----- 219

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
               +H N   G  +K+    D GL L D
Sbjct: 220 --CTMHANCCVGLRRKVD---DLGLMLQD 243


>gi|23306127|gb|AAN17394.1| Putataive InsB from Escherichia coli [Oryza sativa Japonica Group]
          Length = 517

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/269 (22%), Positives = 101/269 (37%), Gaps = 41/269 (15%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           NGT+I+  ++  +         FL ++ I    +   D +++++ D  T ++   +   H
Sbjct: 181 NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 239

Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
             A+      D      F + G+     RR   L H+L  GY+ ++ D D+VW ++P P+
Sbjct: 240 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 299

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           L  D D     D     P D S+   P G            Y+R T       + W    
Sbjct: 300 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 348

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAG----QVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           +  P       +DQ   N  + + A      V +  L    F     + +N + V     
Sbjct: 349 ERHP-----GLHDQDVLN-LIKRDAYVARLGVRIRFLSTDLFAGLCEHGRNLSTV----- 397

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
               +H N   G  +K+    D GL L D
Sbjct: 398 --CTMHANCCVGLRRKVD---DLGLMLQD 421


>gi|21536907|gb|AAM61239.1| regulatory protein [Arabidopsis thaliana]
          Length = 367

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/143 (22%), Positives = 58/143 (40%), Gaps = 7/143 (4%)

Query: 90  AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
            KN T+I+  ++Q +         FL ++ I    Q+    V+V+  D     + +    
Sbjct: 101 TKNNTVIITTLNQAWAEPNSLFDLFLESFRIGRGTQQLLKHVVVVCLDIKAFQRCSQLHT 160

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               +     D      + +  +      R   L  +L++G+N ++ D D++WL+DPFP 
Sbjct: 161 NCYHIETSETDFSGEKVYNTPDYLKMMWARIELLTQVLQMGFNFIFTDADIMWLRDPFPR 220

Query: 203 LQGDHDVYFTDDMAAVKPLDHSH 225
           L  D D     D     P D  +
Sbjct: 221 LYPDGDFQMACDRFFGNPYDSDN 243


>gi|357439649|ref|XP_003590102.1| hypothetical protein MTR_1g044340 [Medicago truncatula]
 gi|355479150|gb|AES60353.1| hypothetical protein MTR_1g044340 [Medicago truncatula]
          Length = 360

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/147 (23%), Positives = 62/147 (42%), Gaps = 15/147 (10%)

Query: 94  TIIVCAVSQPYLP-------------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGR 140
           T+I+  +++ Y+              FL+++ +    +   D +L++A D     +    
Sbjct: 78  TVIIAIINKAYVEQDVKGDAITMFDLFLSSFWLGEGTRSLIDNLLIVAVDQTAYDRCQFL 137

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                 L     D      F S+ F N   RR   LL +L+ GYN ++ D D++WL++PF
Sbjct: 138 RLNCYKLETDGVDFGGEKLFMSKDFINMMWRRTFFLLEVLKRGYNFIFTDTDVMWLRNPF 197

Query: 201 PYLQGD--HDVYFTDDMAAVKPLDHSH 225
             L  +   D+  + D+    P    H
Sbjct: 198 EKLSNNETEDLQISTDLYLDDPWSEKH 224


>gi|145353802|ref|XP_001421190.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581427|gb|ABO99483.1| predicted protein, partial [Ostreococcus lucimarinus CCE9901]
          Length = 451

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 71/294 (24%), Positives = 123/294 (41%), Gaps = 49/294 (16%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQV-----LVIAED---YATLYK--VNGR 140
           ++G + V   ++    F+ NW       +H D++     LV A D   Y  L K  VN  
Sbjct: 54  RDGGVAVTFANEGMYDFVVNWC------EHMDEIGITNYLVGAMDESLYGRLRKIGVNAW 107

Query: 141 WPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHIL-ELGYNVMYNDVDMVWLKD 198
             G   +     D +    FG     F+   R    L+H L + G++V+  DVD VWL+D
Sbjct: 108 LMGSKNI----DDDEVKKDFGWGTRTFHKMGRDKIRLVHELTKTGFDVIVTDVDAVWLRD 163

Query: 199 PFPYL----QGDHDVYFTD--DMAAVKPLDHSHDLPPPGKKGRTYICS-----CMIYLRP 247
           PFP+L    + D  V   +  +  +V     +H +   G +     C       +++ R 
Sbjct: 164 PFPFLRRYPKADALVSIDNLRNHTSVVATQANHAVDGEGLEHSA--CGGNKNIGIMWFRS 221

Query: 248 TDGAKLVMKKWIEELQA--EPWSKA--KKANDQPAFNWALNKT------AGQVDLYLLPQ 297
           T+G++   ++W+ +L++  + W +    K  +Q     A + +       G + L +LP 
Sbjct: 222 TEGSQSFTQEWLNKLESNDKDWDQVVFNKLVEQGGCETARDGSGVAPAYGGGLMLGILPV 281

Query: 298 SAFPTGGLYFKNQTWVEETKG-KHVIIHNNY-ITGFEKKIKRFRDFGLWLVDDH 349
           + F  G  YF  +  + E  G K   +H  +   G   K  R R+   W  D +
Sbjct: 282 AFFANGYTYFTER--LHEMFGLKPYAVHTTFGYAGTVGKRHRLREANQWYGDKY 333


>gi|357137060|ref|XP_003570119.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 389

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 48/102 (47%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           F++++   I        +++IA D     +     P    LV    D     +F + G+ 
Sbjct: 143 FIDSFRRGIRTNSLLKHLVIIAFDRTAYRRCTEIHPYCFALVTDDVDFSQEKRFLTAGYL 202

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
               +R   L  +LE GY+ +++D D++W ++PFPYL  D D
Sbjct: 203 ELMWKRLDFLRLVLEKGYSFIFSDADVMWFRNPFPYLYPDGD 244


>gi|255561745|ref|XP_002521882.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis]
 gi|223538920|gb|EEF40518.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis]
          Length = 394

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 67/294 (22%), Positives = 116/294 (39%), Gaps = 48/294 (16%)

Query: 65  FMFPNA--TSSLAKWRDYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLIS 114
           ++FP+A  T SL    +  L +     A K+ T+I+  +++ +         FL ++ + 
Sbjct: 81  YIFPSAVDTDSLPVSNEQKLEKVLKEAAMKDKTVILTTLNEAWAAPNSVIDLFLASFRLG 140

Query: 115 ISRQKHQDQVLVIAEDYATLYKVNGRW---PGHA-VLVPPAPDSQTAHKFGSQGFFNFTS 170
              +K  + +++IA D     K N R      H   L+    D      F +  +     
Sbjct: 141 EHTRKLLNHLVIIALDQ----KANARCIQVHDHCFALLTDGIDFSNEAYFMTPAYLKMMW 196

Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP 230
           RR   L  +LE+GYN ++ D D++W +DPFP    D D     D      ++  H+ P  
Sbjct: 197 RRIDFLRSVLEMGYNFVFTDADIMWFRDPFPRFYSDADFQIACDHFTGSSIN-IHNKPNG 255

Query: 231 GKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQ- 289
           G            Y+R  + +    K W    +  P       +DQ   N   N +  + 
Sbjct: 256 GFN----------YVRSNNRSIEFYKFWYSSRETYP-----GIHDQDVLNKIKNASFIED 300

Query: 290 --VDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVI--IHNNYITGFEKKIKRFR 339
             + +  L  + F  GGL        E +K  +++  +H N   G E K+   R
Sbjct: 301 LGLKMRFLDTAFF--GGL-------CEPSKDLNLVCTMHANCCYGLESKLHDLR 345


>gi|6063544|dbj|BAA85404.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 60/269 (22%), Positives = 101/269 (37%), Gaps = 41/269 (15%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           NGT+I+  ++  +         FL ++ I    +   D +++++ D  T ++   +   H
Sbjct: 3   NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 61

Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
             A+      D      F + G+     RR   L H+L  GY+ ++ D D+VW ++P P+
Sbjct: 62  CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 121

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           L  D D     D     P D S+   P G            Y+R T       + W    
Sbjct: 122 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 170

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAG----QVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           +  P       +DQ   N  + + A      V +  L    F     + +N + V     
Sbjct: 171 ERHP-----GLHDQDVLN-LIKRDAYVARLGVRIRFLSTDLFAGLCEHGRNLSTV----- 219

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
               +H N   G  +K+    D GL L D
Sbjct: 220 --CTMHANCCVGLRRKVD---DLGLMLQD 243


>gi|297737271|emb|CBI26472.3| unnamed protein product [Vitis vinifera]
          Length = 272

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/118 (24%), Positives = 55/118 (46%), Gaps = 8/118 (6%)

Query: 94  TIIVCAVSQPYLP--------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
           T+I+  +++ Y+         FL+ + +    +   D +L++A D  +L +         
Sbjct: 5   TVILAMINKAYVDGDKTMLDLFLDGFWLGEGTRGLLDHLLLVAVDQTSLERCKFLHLHCY 64

Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
            L     D      + S+ F     RR   L  +L+ GYN ++ D+D++WL++PFP L
Sbjct: 65  KLETEGVDFSGEKMYLSEDFMKMMWRRTLFLGEVLKKGYNFIFTDIDIMWLRNPFPRL 122


>gi|255073807|ref|XP_002500578.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
 gi|226515841|gb|ACO61836.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
          Length = 659

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 6/129 (4%)

Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD-MAAVK 219
           G + F    + +   +  +LELG   +  D D+VWLKDP  Y +     Y T D + +  
Sbjct: 123 GDKSFKKMGTVKTKFIQDLLELGIAPILTDADVVWLKDPRSYFK--RGTYITADVLVSTD 180

Query: 220 PLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAF 279
            +D   D             + +++ RPTD AK  ++ W  ++     S      DQPAF
Sbjct: 181 CIDVPADRKDNNGCSHVNFNTGVLHFRPTDAAKAFVQTWKTKVAT---STIAWMRDQPAF 237

Query: 280 NWALNKTAG 288
           N   ++  G
Sbjct: 238 NLITHEGVG 246


>gi|159468740|ref|XP_001692532.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278245|gb|EDP04010.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 595

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 69/144 (47%), Gaps = 11/144 (7%)

Query: 124 VLVIAEDYATLYKVNGRWPGHAVLV--PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           VL +A+D AT   V  +   H  +V      D+    K+GS  +   T ++   + H+ +
Sbjct: 86  VLAVADD-ATAALVRSQGVDHCFVVHENEIDDTNATFKWGSNSWKLHTWQKVLVVRHVHQ 144

Query: 182 LGYNVMYNDVDMVWLKDPF-----PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRT 236
           LG+NV+ +D+D+VW +DP       Y + D+ V   D +    P+    D P  G     
Sbjct: 145 LGFNVINSDLDVVWRRDPLHHFLVKYPEPDYWVSM-DPITTRNPI--GDDGPEAGVTVHH 201

Query: 237 YICSCMIYLRPTDGAKLVMKKWIE 260
           Y+ + + +LR T G   ++ KW E
Sbjct: 202 YMNTGVYFLRQTPGGTALIDKWYE 225


>gi|224099643|ref|XP_002311563.1| predicted protein [Populus trichocarpa]
 gi|222851383|gb|EEE88930.1| predicted protein [Populus trichocarpa]
          Length = 277

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/135 (23%), Positives = 59/135 (43%), Gaps = 10/135 (7%)

Query: 81  TLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYAT 133
            L +AA    ++ T+I+  +++ +         FL ++ I    ++  D ++++A D   
Sbjct: 5   VLKEAA---MEDKTVIIATLNEAWAAPNTVIDLFLESFRIGQGTRRLLDHLVIVALDEKA 61

Query: 134 LYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
             +          LV    D      F +  +     RR   L  +LE+GYN ++ D D+
Sbjct: 62  YRRCMELHTHCFALVTQGLDFHDEAYFMTHVYLEMMWRRIDFLRSLLEMGYNFVFTDADI 121

Query: 194 VWLKDPFPYLQGDHD 208
           +W +DPFP    D D
Sbjct: 122 MWFRDPFPRFFLDAD 136


>gi|296088016|emb|CBI35299.3| unnamed protein product [Vitis vinifera]
          Length = 547

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/133 (25%), Positives = 59/133 (44%), Gaps = 4/133 (3%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++ I  + ++  + +++IA D  +  +     P    L     D      + +  + 
Sbjct: 405 FLESFHIGNNTKRLLNHLVIIALDQKSYARCLALHPLCYALKTEGVDFSGEAYYSTPNYL 464

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
               RR   L  IL +GY+ ++ D D++W +DPF +   D D   T D     P D ++ 
Sbjct: 465 EMMWRRIDFLRSILTMGYSFIFTDADIMWFRDPFQHFFQDADFQITCDSYIGNPYDVNN- 523

Query: 227 LPPPGKKGRTYIC 239
             P G  G TY+C
Sbjct: 524 -RPNG--GFTYMC 533



 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 43/189 (22%), Positives = 74/189 (39%), Gaps = 17/189 (8%)

Query: 41  TTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSL------AKWRD-YTLSQAASFVAK-N 92
           TTLL  +  + V      +   P F F N+  S       AK ++ Y L +     A  +
Sbjct: 15  TTLLRRIVKITVFTA--AFVVVPCFFFYNSAFSFQFLPISAKGKEGYKLDKILKNAAMGD 72

Query: 93  GTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
            T+I+  V++ +         FL ++ I  + Q+  + +++I  D     +     P   
Sbjct: 73  KTVILTTVNEAWAANNSLLDLFLESFRIGNNTQRLLNHLVIITLDPKAYARCTTLHPHCY 132

Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
            L     D      F S  +     RR   L  +L++ YN ++ D D++W +DPF     
Sbjct: 133 ALKTKEMDFSKEAFFMSHDYLEMMWRRIDFLRSVLKMRYNFIFTDADIMWFRDPFQRFDS 192

Query: 206 DHDVYFTDD 214
             D     D
Sbjct: 193 KADFQIACD 201


>gi|359489978|ref|XP_002270380.2| PREDICTED: uncharacterized protein At1g28695 [Vitis vinifera]
          Length = 341

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/124 (24%), Positives = 57/124 (45%), Gaps = 8/124 (6%)

Query: 94  TIIVCAVSQPYLP--------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
           T+I+  +++ Y+         FL+ + +    +   D +L++A D  +L +         
Sbjct: 74  TVILAMINKAYVDGDKTMLDLFLDGFWLGEGTRGLLDHLLLVAVDQTSLERCKFLHLHCY 133

Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
            L     D      + S+ F     RR   L  +L+ GYN ++ D+D++WL++PFP L  
Sbjct: 134 KLETEGVDFSGEKMYLSEDFMKMMWRRTLFLGEVLKKGYNFIFTDIDIMWLRNPFPRLTL 193

Query: 206 DHDV 209
           +  V
Sbjct: 194 NQSV 197


>gi|222624136|gb|EEE58268.1| hypothetical protein OsJ_09274 [Oryza sativa Japonica Group]
          Length = 360

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 98/268 (36%), Gaps = 39/268 (14%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           NGT+I+  ++  +         FL ++ I    +   D +++++ D  T ++   +   H
Sbjct: 90  NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 148

Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
             A+      D      F + G+     RR   L H+L  GY+ ++ D D+VW ++P P+
Sbjct: 149 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 208

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           L  D D     D     P D S+   P G            Y+R T       + W    
Sbjct: 209 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 257

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAGQ---VDLYLLPQSAFPTGGLYFKNQTWVEETKGK 319
           +  P       +DQ   N            V +  L    F     + +N + V      
Sbjct: 258 ERHP-----GLHDQDVLNLIKRDAYVARLGVRIRFLSTDLFAGLCEHGRNLSTV------ 306

Query: 320 HVIIHNNYITGFEKKIKRFRDFGLWLVD 347
              +H N   G  +K+    D GL L D
Sbjct: 307 -CTMHANCCVGLRRKVD---DLGLMLQD 330


>gi|218192017|gb|EEC74444.1| hypothetical protein OsI_09837 [Oryza sativa Indica Group]
          Length = 273

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 60/269 (22%), Positives = 101/269 (37%), Gaps = 41/269 (15%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           NGT+I+  ++  +         FL ++ I    +   D +++++ D  T ++   +   H
Sbjct: 3   NGTVILTTLNSAWAEPGSVVDVFLESFRIGDDTRWLLDHLVMVSLDL-TAHRRCLQIHRH 61

Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
             A+      D      F + G+     RR   L H+L  GY+ ++ D D+VW ++P P+
Sbjct: 62  CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 121

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           L  D D     D     P D S+   P G            Y+R T       + W    
Sbjct: 122 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 170

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAG----QVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           +  P       +DQ   N  + + A      V +  L    F     + +N + V     
Sbjct: 171 ERHP-----GLHDQDVLN-LIKRDAYVARLGVRIRFLSTDLFAGLCEHGRNLSTV----- 219

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
               +H N   G  +K+    D GL L D
Sbjct: 220 --CTMHANCCVGLRRKVD---DLGLMLQD 243


>gi|356503226|ref|XP_003520412.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           At1g28695-like [Glycine max]
          Length = 339

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/141 (24%), Positives = 61/141 (43%), Gaps = 15/141 (10%)

Query: 79  DYTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLV 126
           D  L++A+     N T+I+  V++ Y+             FL ++ +    +   D +L+
Sbjct: 48  DTALAKAS---MGNKTVIIAVVNKAYVDQDVESDTTMLDIFLGSFWLGEGTRSLIDHLLL 104

Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
           +A D     +          L     D +    + SQ F     RR   LL +L+ GYN 
Sbjct: 105 VAVDQTAYDRCQFLKLNCFKLETDGVDFKGEKIYMSQDFIKMMWRRTFFLLEVLKRGYNF 164

Query: 187 MYNDVDMVWLKDPFPYLQGDH 207
           ++ D D++WL++PF  L  + 
Sbjct: 165 VFTDTDVMWLRNPFTRLSKNE 185


>gi|356522974|ref|XP_003530117.1| PREDICTED: uncharacterized protein At1g28695-like [Glycine max]
          Length = 371

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/150 (22%), Positives = 63/150 (42%), Gaps = 12/150 (8%)

Query: 70  ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP------------FLNNWLISISR 117
            T+++  + D   +  A    +N T+I+  V++ Y+             FL ++ +    
Sbjct: 50  TTTNVEAYGDGLDTALAKTSMENKTVIIAIVNKAYVEQDVESDTTMLDIFLGSFWLGEGT 109

Query: 118 QKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL 177
           +   D +L++  D     +          L     D +    + SQ F     RR   LL
Sbjct: 110 RSLIDHLLIVTVDRTAYDRCQFLRLNCFRLETDGVDFEGEKIYMSQDFIKMMWRRTRFLL 169

Query: 178 HILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
            +L+ GYN ++ D D++WL++PF  L  + 
Sbjct: 170 EVLKRGYNFVFTDTDVMWLRNPFTRLSKNE 199


>gi|297721775|ref|NP_001173251.1| Os03g0129350 [Oryza sativa Japonica Group]
 gi|255674179|dbj|BAH91979.1| Os03g0129350 [Oryza sativa Japonica Group]
          Length = 383

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 61/269 (22%), Positives = 100/269 (37%), Gaps = 41/269 (15%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           NGT+I+  ++  +         FL ++ I    +   D +++++ D  T ++   +   H
Sbjct: 90  NGTVILTTLNSAWAEPGSVVDVFLESFRIGDETRWLLDHLVMVSLDL-TAHRRCLQIHRH 148

Query: 145 --AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
             A+      D      F + G+     RR   L H+L  GY+ ++ D D+VW ++P P+
Sbjct: 149 CFALTTDDGFDFSGEKNFMTDGYLKMMWRRIDFLGHVLAKGYSFIFTDTDIVWFRNPLPH 208

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           L  D D     D     P D S+   P G            Y+R T       + W    
Sbjct: 209 LHHDGDFQIACDHFTGDPDDLSN--SPNGG---------FAYVRSTSATAAFYRYWYAAR 257

Query: 263 QAEPWSKAKKANDQPAFNW----ALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
           +  P       +DQ   N     A     G V +  L    F     + +N + V     
Sbjct: 258 ERHP-----GLHDQDVLNLIKRDAYVARLG-VRIRFLSTDLFAGLCEHGRNLSTV----- 306

Query: 319 KHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
               +H N   G  +K+    D GL L D
Sbjct: 307 --CTMHANCCVGLRRKVD---DLGLMLQD 330


>gi|224120464|ref|XP_002318336.1| predicted protein [Populus trichocarpa]
 gi|222859009|gb|EEE96556.1| predicted protein [Populus trichocarpa]
          Length = 329

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/158 (24%), Positives = 70/158 (44%), Gaps = 11/158 (6%)

Query: 90  AKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
            K+ T+I+  ++Q +         FL ++ +  + QK  + +++I+ D     +     P
Sbjct: 63  TKHKTVILTTLNQAWAEPGSIFDLFLESFQVGDNTQKLVNNLIIISMDQKAHARCLAIHP 122

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               L     +  +   F S+ +     RR   L  +LE+GY+ ++ D D+VWL++PFP 
Sbjct: 123 HCYALRTEGLNFSSEAYFMSEEYLKMMWRRIEFLGTVLEMGYSFVFTDADIVWLRNPFPR 182

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
                D     D     P D ++   P G  G TY+ S
Sbjct: 183 FYPKVDFQIACDNYYGNPEDKNN--RPNG--GFTYVRS 216


>gi|159469263|ref|XP_001692787.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278040|gb|EDP03806.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 703

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 61/276 (22%), Positives = 112/276 (40%), Gaps = 36/276 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           +A+NG ++V   +  Y  F+  W+  + R      ++   +D+     +   +   ++  
Sbjct: 168 IAQNGYLMVTWANFHYFDFVKTWVKHVQRVGVTGYIVGAMDDHLLREMIKLEYNCFSM-- 225

Query: 149 PPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHI-LELGYNVMYNDVDMVWLKDPFPYLQG- 205
                  T   FG     F    R    L+ I L+L  +V+  DVD++WL++P PY +  
Sbjct: 226 ---KSGLTLGDFGWGSATFAKMGREKIRLISIFLKLDVHVVIADVDVLWLRNPLPYFERY 282

Query: 206 -DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
            + D+  + D  A    D S +  P    G       M++ + +      ++KWIE ++A
Sbjct: 283 PEADILTSSDNMANTVNDESLEKWP--DAGAAANIGIMLFRKKSLD---FVEKWIEIIEA 337

Query: 265 EPWSKAKKANDQPAFNWALNKTA---------------GQVDLYLLPQSAFPTGGLYFKN 309
           +      K  DQ AFN    +                 G + + +LP S F +G   +  
Sbjct: 338 D-----DKVWDQNAFNDLFRRGVKPLEPPNKNLFLGYDGSLTMGILPVSIFCSGHTMYV- 391

Query: 310 QTWVEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLW 344
           Q   +  K +   +H  +  +G   K  R R+F L+
Sbjct: 392 QRMAQRLKLEPYAVHGTFQFSGTPGKRHRMREFMLY 427


>gi|242038241|ref|XP_002466515.1| hypothetical protein SORBIDRAFT_01g009150 [Sorghum bicolor]
 gi|241920369|gb|EER93513.1| hypothetical protein SORBIDRAFT_01g009150 [Sorghum bicolor]
          Length = 348

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 56/124 (45%), Gaps = 12/124 (9%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           N T+I+  +++ Y         FL +       ++  D VL +A D     +      G 
Sbjct: 77  NKTLILTMLNEAYAEEGGLLDLFLESLREGDGTEQLIDHVLFVAMDQQAFRRCRSLGGGV 136

Query: 145 AVLVPPAPDSQ-----TAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
              +  A DSQ     +   + S GF     RR   L  +L+ GY+ ++ D+D++WL++P
Sbjct: 137 KCYLLRAVDSQQGDLSSEQLYMSDGFIRMMWRRIRFLGDVLKHGYSFIFTDMDVMWLRNP 196

Query: 200 FPYL 203
           FP L
Sbjct: 197 FPKL 200


>gi|359473691|ref|XP_002273214.2| PREDICTED: uncharacterized protein At4g15970 [Vitis vinifera]
          Length = 415

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/108 (25%), Positives = 49/108 (45%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++ I    +++ + +++IA D     +          LV    D      F +  + 
Sbjct: 152 FLESFRIGDHTRRYLNHLVIIALDQKAFARCLILHNHCFTLVTEGVDFSGEAYFMTSDYL 211

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
               RR   L  +LE+GYN +++D D++W +DPFP+   + D     D
Sbjct: 212 KMMWRRIDFLRSVLEMGYNFIFSDADIMWFRDPFPHFLPNADFQIACD 259


>gi|255539963|ref|XP_002511046.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis]
 gi|223550161|gb|EEF51648.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis]
          Length = 360

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 67/151 (44%), Gaps = 10/151 (6%)

Query: 70  ATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP--------FLNNWLISISRQKHQ 121
            TS +   RD   S  A     N T+I+  V++ Y+         FL+++ +    +   
Sbjct: 67  TTSPVTSHRDALESGLAEASTANKTLIIAMVNKAYVEGDKPMLDMFLDSFWLGEDTRDLI 126

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHK-FGSQGFFNFTSRRPCHLLHIL 180
           + +L++A D  T Y+       H   +     +    K + S  F     RR   L  IL
Sbjct: 127 NHLLLVAVD-QTAYERCKFLRLHCYKLETDGVAFDGEKVYMSDDFIKMMWRRTLLLGDIL 185

Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
           + GYN ++ D D++WL++PFP L  D  V F
Sbjct: 186 KRGYNFIFTDTDVMWLRNPFPKLVLDGSVDF 216


>gi|357131571|ref|XP_003567410.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 371

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 52/222 (23%), Positives = 88/222 (39%), Gaps = 25/222 (11%)

Query: 123 QVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILEL 182
            VLVIA D A  ++ N   P   +L     +  +A +F S+ +      +      ILEL
Sbjct: 132 HVLVIALDPAGFHRCNVVHPHCYLLEVKTVNLTSATRFMSKEYLEIVWTKLSLQQRILEL 191

Query: 183 GYNVMYNDVDMVWLKDPFPY--LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           GYN ++ D DM+  ++PF    L  D  V   D   A  PLD+              I +
Sbjct: 192 GYNFLFTDADMLLFRNPFRRITLHADMSVSSDDFSIARAPLDNP-------------INT 238

Query: 241 CMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAF 300
            + Y++ T+    +++ W    QA   S+   A+DQ  F    ++   ++ + + P    
Sbjct: 239 GLYYVKSTNRTVEMLRYW----QAAR-SRTPGAHDQTVFGNIKHELVEKLKVRIEPLDTS 293

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFG 342
                YF       +   K   +H +   G + K+    D  
Sbjct: 294 -----YFGGFCEYHDDFEKISTMHADCCIGVDNKVHDLMDVA 330


>gi|242037411|ref|XP_002466100.1| hypothetical protein SORBIDRAFT_01g001250 [Sorghum bicolor]
 gi|241919954|gb|EER93098.1| hypothetical protein SORBIDRAFT_01g001250 [Sorghum bicolor]
          Length = 423

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 57/268 (21%), Positives = 102/268 (38%), Gaps = 35/268 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
           +S+AA+  A + T+IV  V++ +         FL ++ +          VL++A D A +
Sbjct: 151 VSRAAT--ADDRTVIVTCVNEAWAAPGSLLDLFLESFRVGDGTAPLLRHVLIVAMDPAAM 208

Query: 135 YKVNGRWPGHAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
            +       H     P P  D  +A  F S+ +      +      +L+LGY  ++ DVD
Sbjct: 209 ARCR-TLHRHCYHYAPLPGVDFASAKFFLSKDYLELVWSKLKLQRRVLQLGYTFVFTDVD 267

Query: 193 MVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAK 252
           ++W ++P  ++    D+  + D+      D+  + P  G            +++P +   
Sbjct: 268 VLWFRNPLKHVTAYADMSVSSDV-FFGDADNVDNFPNTG----------FFHVKPNNRTV 316

Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYL-LPQSAFPTGGLYFKNQT 311
            +   W E  +  P       N+QP FN         + L L     AF  G   +    
Sbjct: 317 AMTAAWHEARERFPGK-----NEQPVFNAIKKGLVADLGLRLQYIDPAFVAGFCSYGKDL 371

Query: 312 WVEETKGKHVIIHNNYITGFEKKIKRFR 339
                 GK   +H N   G   K+   R
Sbjct: 372 ------GKVCTMHANCCVGLRNKLADLR 393


>gi|449439235|ref|XP_004137392.1| PREDICTED: uncharacterized protein At4g15970-like [Cucumis sativus]
          Length = 380

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 50/256 (19%), Positives = 98/256 (38%), Gaps = 27/256 (10%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++GTII+  ++  +         FL ++ I    Q+    ++++  D     +     P 
Sbjct: 89  EDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVAVHPH 148

Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
              L     +  +   F +  +     RR   L+++LE+G++ ++ D D++WL+DPF + 
Sbjct: 149 CYQLDTQGTNFSSEAYFMTADYLKMMWRRIEFLIYVLEMGHSFVFTDTDIMWLQDPFNHF 208

Query: 204 QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQ 263
             D D     D+    P ++ +++P  G           +Y+R         K W E   
Sbjct: 209 YKDADFQIASDLYLGNP-ENLNNVPNGG----------FVYVRANHRTVKFYKFWYESRT 257

Query: 264 AEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVII 323
             P    +   ++   +  + K    + L  L  + F  GG        +     K   +
Sbjct: 258 IYPGQHDQDVLNKIKHSPLIPKIG--MKLRFLDTANF--GGF-----CQMGRDMSKMATM 308

Query: 324 HNNYITGFEKKIKRFR 339
           H N   G E K+   R
Sbjct: 309 HANCCVGLENKVHDLR 324


>gi|15222124|ref|NP_177648.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
 gi|12323904|gb|AAG51930.1|AC013258_24 unknown protein; 4711-6345 [Arabidopsis thaliana]
 gi|111074332|gb|ABH04539.1| At1g75120 [Arabidopsis thaliana]
 gi|332197553|gb|AEE35674.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
          Length = 402

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 115/292 (39%), Gaps = 45/292 (15%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED-YATLYKVNGR 140
           L++    VA N  IIV   +    P L   + S+ R   Q+ ++V  +D   +  K N  
Sbjct: 121 LAKLLEKVAVNKEIIVVLANNNVKPMLEVQIASVKRVGIQNYLVVPLDDSLESFCKSN-- 178

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                      PD+       S+   + +  +   L   L+LGY V+ +DVD+V+L++PF
Sbjct: 179 ---EVAYYKRDPDNAIDVVGKSRRSSDVSGLKFRVLREFLQLGYGVLLSDVDIVFLQNPF 235

Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDL--PPPGKKGRTYIC-------SCMIYLRPTDGA 251
            +L  D DV    D          +D+   P   + RT          S   YLRPT  +
Sbjct: 236 GHLYRDSDVESMSDGHDNNTAYGFNDVFDDPTMTRSRTVYTNRIWVFNSGFFYLRPTLPS 295

Query: 252 KLVMKKWIEEL-QAEPWSKAKKANDQPAFNWALNKTA--GQVDLY----LLPQSAFPTGG 304
             ++ +  + L ++  W       DQ  FN  L   +  G   LY    ++    F    
Sbjct: 296 IELLDRVTDTLSKSGGW-------DQAVFNQHLFYPSHPGYTGLYASKRVMDVYEFMNSR 348

Query: 305 LYFKNQTWVEETKG-KHVIIHNN---------------YITGFEKKIKRFRD 340
           + FK     EE K  K VIIH N               Y+ G +  + RFRD
Sbjct: 349 VLFKTVRKDEEMKKLKPVIIHMNYHSDKLERMQAAVEFYVNGKQDALDRFRD 400


>gi|357118762|ref|XP_003561118.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 339

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 40/159 (25%), Positives = 63/159 (39%), Gaps = 20/159 (12%)

Query: 177 LHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRT 236
           L +L+LGYN ++ D+D++W +DPFP++     +  + D+    P     D P      R 
Sbjct: 171 LTVLQLGYNFLFTDMDILWFRDPFPHIPPTAQLVMSSDIFVGDP-----DSP------RN 219

Query: 237 YICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLP 296
           Y    ++Y R  DGA    + W       P +  +   D+        +   +V    L 
Sbjct: 220 YPNGGLLYARSCDGAIGFYEHWRSSRARFPGTHEQYVFDKIVKEGVPPRLGARVQ--FLD 277

Query: 297 QSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKI 335
              F  GG             GK   +H N   G EKK+
Sbjct: 278 TDRF--GGFCRHGNDL-----GKVCSMHANCCVGMEKKM 309


>gi|357111002|ref|XP_003557304.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 375

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/172 (21%), Positives = 73/172 (42%), Gaps = 18/172 (10%)

Query: 94  TIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
           T+++ A+++ +     FL+ +L S    ++       +LV+A D     + N   P    
Sbjct: 112 TVLITALNEAWAAPGSFLDLFLESFQHGENTAYLVKHLLVVAMDKKAFDRCNAVHPFCYW 171

Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD 206
                 D  +  K+    +     +R      ILELGY  ++ DVD++W +DPFP++   
Sbjct: 172 FRVEGMDFASEQKYMKGDYLEMMWKRNRFQQTILELGYTFLFTDVDILWFRDPFPHISPT 231

Query: 207 HDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
             +  + D     P + + + P  G          ++Y++  +G+    K W
Sbjct: 232 AQLVMSSDFFVGDP-NSAGNYPNGG----------LLYVKSCEGSIGFYKHW 272


>gi|115441885|ref|NP_001045222.1| Os01g0920700 [Oryza sativa Japonica Group]
 gi|19386738|dbj|BAB86120.1| regulatory protein-like [Oryza sativa Japonica Group]
 gi|57899427|dbj|BAD88365.1| regulatory protein-like [Oryza sativa Japonica Group]
 gi|113534753|dbj|BAF07136.1| Os01g0920700 [Oryza sativa Japonica Group]
 gi|125573134|gb|EAZ14649.1| hypothetical protein OsJ_04573 [Oryza sativa Japonica Group]
          Length = 369

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 41/82 (50%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D VLV+A D A  +      P    L     +  +A+ F S+ +      +      +LE
Sbjct: 126 DHVLVVAVDPAAFHHCRAVHPHCYHLKVDTMNLSSANNFMSEAYVELVWTKLSLQQRVLE 185

Query: 182 LGYNVMYNDVDMVWLKDPFPYL 203
           LGYN ++ DVD++W +DPF ++
Sbjct: 186 LGYNFLFTDVDILWFRDPFRHI 207


>gi|384498860|gb|EIE89351.1| hypothetical protein RO3G_14062 [Rhizopus delemar RA 99-880]
          Length = 299

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 66/271 (24%), Positives = 112/271 (41%), Gaps = 36/271 (13%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           K+  +IV   +      + NW+ S+ R +    ++   +D   LY+ +    G+   V  
Sbjct: 29  KDRILIVATANYGMRNHVYNWIESLKRTEVTKFIIFCLDD--KLYE-HLVLAGYDRQVAK 85

Query: 151 APDS----QTAHKFG---SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
            PD+    Q    F    S+ +   T  +   +  +L L  +V ++D+D+VW++      
Sbjct: 86  IPDTWFHQQVEASFSLYFSETYRIITHAKTLVVQQLLYLDISVFFSDIDIVWMRPQI--- 142

Query: 204 QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQ 263
                V + + M  ++P   +  L       +  + S    +RPT   K ++ + I    
Sbjct: 143 -----VEYMNTMVKMRP--QTSVLFQQEGVDQQEVNSGFYLMRPTAITKRLLAETI---- 191

Query: 264 AEPWSKAKKANDQPAFNWALNK------TAGQVDLYLLPQSAFPTGGLYFKNQTWVEETK 317
                  +K   Q A N ALNK      T G V L LL    FP G +YF N     +  
Sbjct: 192 -VIQDTNEKLTQQGAMNAALNKLDLDIRTTGIVLLDLL---YFPNGHVYF-NLDLPRQHS 246

Query: 318 GKHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
            K  I+H NY+ G +KK K      +W ++D
Sbjct: 247 IKPFIVHANYLIGEDKKTKLMEQ-NMWYLND 276


>gi|125528893|gb|EAY77007.1| hypothetical protein OsI_04964 [Oryza sativa Indica Group]
          Length = 369

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 41/82 (50%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D VLV+A D A  +      P    L     +  +A+ F S+ +      +      +LE
Sbjct: 126 DHVLVVAVDPAAFHHCRAVHPHCYHLKVDTMNLSSANNFMSEAYVELVWTKLSLQQRVLE 185

Query: 182 LGYNVMYNDVDMVWLKDPFPYL 203
           LGYN ++ DVD++W +DPF ++
Sbjct: 186 LGYNFLFTDVDILWFRDPFRHI 207


>gi|388506682|gb|AFK41407.1| unknown [Lotus japonicus]
          Length = 431

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 109/267 (40%), Gaps = 36/267 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA N  +IV   +      L  W  +I +    + ++V  +D    +  + + 
Sbjct: 153 LAKILEKVAVNRELIVALANTNVKEMLEVWFTNIKKVGIPNYLVVALDDNIAEFCESNQV 212

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P +        DS      G +G  +  S    H+L   L+LGY+V+ +DVD+V+L++PF
Sbjct: 213 PVYKRDPDDGIDS-----IGKEGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPF 267

Query: 201 PYLQGDHDV-YFTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
            +L  D DV   +D  D       +   D P  G        R ++  S   Y+RPT  +
Sbjct: 268 DHLYRDSDVESMSDGHDNNTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPS 327

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNW-----------ALNKTAGQVDLYLLPQSAF 300
             ++ +    L  E      KA DQ  FN             L+     +D+YL   S  
Sbjct: 328 IELLDRVATRLSKE------KAWDQAVFNEELFYPSHPGYDGLHAARRTMDMYLFMNSKV 381

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNY 327
               L+   +     +K K VIIH NY
Sbjct: 382 ----LFKTVRNDANLSKLKPVIIHVNY 404


>gi|357131569|ref|XP_003567409.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 360

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 57/259 (22%), Positives = 104/259 (40%), Gaps = 32/259 (12%)

Query: 91  KNGTIIVCAVSQPYL---PFLNNWLISISRQKHQDQ----VLVIAEDYATLYKVNGRWPG 143
           ++ T+I+  V++ +      L+ +L S    +  +     VL++A D A   +     P 
Sbjct: 82  EDKTVIITTVNEAWARPGSLLDIYLESFKNGEDTEHLLAHVLIVALDPAGFRRCTVVHPH 141

Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
             +L     +  +A  F S+ +      +      ILELGYN +  D DM+ L+DPF  +
Sbjct: 142 CHLLEVKIANLTSATPFMSKEYLELVWTKLYLQQCILELGYNFLCTDTDMILLRDPFRRI 201

Query: 204 --QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
               D  V   D  +A  PLD+     PP         + + Y++ T+ +  +++ W   
Sbjct: 202 PVYADMSVSSDDFSSARAPLDN-----PPN--------TGLYYMKATNRSIEMLRYW--- 245

Query: 262 LQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHV 321
             A P  +    NDQP F     +   ++ + + P        +YF       +   K  
Sbjct: 246 QAARP--RFPGVNDQPVFVKIKTELIEKLQVRIEPLDT-----VYFGGFCEYHDDFDKIC 298

Query: 322 IIHNNYITGFEKKIKRFRD 340
            +H +   G + K+    D
Sbjct: 299 TMHADCCIGVDNKVHDLMD 317


>gi|242055407|ref|XP_002456849.1| hypothetical protein SORBIDRAFT_03g044030 [Sorghum bicolor]
 gi|241928824|gb|EES01969.1| hypothetical protein SORBIDRAFT_03g044030 [Sorghum bicolor]
          Length = 300

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/131 (22%), Positives = 61/131 (46%), Gaps = 11/131 (8%)

Query: 94  TIIVCAVSQPYLPFLNNWLISISRQKHQ---------DQVLVIAEDYATLYKVNGRWPGH 144
           T+I+ +V++ +     N L+ + R+  Q         + VLV+A D           P  
Sbjct: 30  TVIITSVNEAFAR--PNSLLGLFRESFQVGEGIGHLLNNVLVVAVDAKAFRYCKAVHPHC 87

Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
            +L     +  +A+ + ++ +      +      +LELGYN ++ DVD+VW ++PF ++ 
Sbjct: 88  YLLEVKTMNLSSANNYMTESYIELVWTKLSLQQRVLELGYNFLFTDVDIVWFRNPFRHIS 147

Query: 205 GDHDVYFTDDM 215
              D+  + D+
Sbjct: 148 AFADMTTSSDV 158


>gi|297738316|emb|CBI27517.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/97 (25%), Positives = 46/97 (47%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++ I    +++ + +++IA D     +          LV    D      F +  + 
Sbjct: 25  FLESFRIGDHTRRYLNHLVIIALDQKAFARCLILHNHCFTLVTEGVDFSGEAYFMTSDYL 84

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
               RR   L  +LE+GYN +++D D++W +DPFP+ 
Sbjct: 85  KMMWRRIDFLRSVLEMGYNFIFSDADIMWFRDPFPHF 121


>gi|357121335|ref|XP_003562376.1| PREDICTED: uncharacterized protein LOC100833150 [Brachypodium
           distachyon]
          Length = 425

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 107/267 (40%), Gaps = 35/267 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 142 LAKILEEVAVKKELIVALANSNVREMLEVWFTNIKRVGVPNYLVVALDDNIESFCKSNDV 201

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PD        + G    +  +   L   L+LGY+++ +D+D+++L++PF 
Sbjct: 202 P----VYRRDPDEGVDSIAKTGGNHAVSGLKFRILREFLQLGYSILLSDIDIIFLRNPFD 257

Query: 202 YLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
           +L  D DV    D    M A    +   D P  G        R ++  S   Y+RPT  A
Sbjct: 258 HLYRDSDVESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFYIRPTIPA 316

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAF 300
             ++ +    L  EP     K+ DQ  FN             L+ +   +D+YL   S  
Sbjct: 317 IELLDRVAGRLSREP-----KSWDQAVFNEELFFPSHPGYEGLHASRRTMDIYLFMNSKV 371

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNY 327
               L+   +   +  K K VI+H+NY
Sbjct: 372 ----LFKTVRKDAQLRKLKPVIVHSNY 394


>gi|356575476|ref|XP_003555866.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 364

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 77/184 (41%), Gaps = 20/184 (10%)

Query: 45  VLLSLLVVLGVI---------LPWTGTPGF--MFPNATSS-LAKWRDYTLSQAASFVA-K 91
           V LS LV+LG +                GF  +FP+  +  +A   +Y L +  +  A K
Sbjct: 23  VSLSCLVLLGDVDSHRFLSSFHSSYSLSGFTRIFPSVYNDPVATSNEYPLEKILNEAAMK 82

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           + T+I+  +++ +         FL ++ I        D +++IA D     +        
Sbjct: 83  DRTVILTTLNEAWAAPNSVIDLFLESFRIGDRTSTFLDHLVIIALDQKAFARCQVIHTYC 142

Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
             LV    D      F +  +     +R   L  +LE+GYN ++ D D++W +DPFP   
Sbjct: 143 FSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLRTVLEMGYNFVFTDADIMWFRDPFPLFH 202

Query: 205 GDHD 208
            D D
Sbjct: 203 LDAD 206


>gi|218189630|gb|EEC72057.1| hypothetical protein OsI_04967 [Oryza sativa Indica Group]
          Length = 371

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/141 (20%), Positives = 65/141 (46%), Gaps = 9/141 (6%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+I+ +V++ +         FL ++ +  +     + ++V+A D   L +     P 
Sbjct: 96  EDRTVIMTSVNEAWAAPGSLMDSFLESFSVGENISHFVEHIVVVAMDEGALRRCRAIHPH 155

Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
             +L+P     D   A  + ++ + +    +      +LELGYN+++ DVD+ W ++P  
Sbjct: 156 CYLLLPEVAGLDLSGAKSYMTKDYLDLVWSKLKLQQRVLELGYNLLFTDVDLAWFRNPMV 215

Query: 202 YLQGDHDVYFTDDMAAVKPLD 222
           ++    D+  + D     P D
Sbjct: 216 HITAAADITTSSDFYFGDPDD 236


>gi|29824405|gb|AAP04162.1| unknown protein [Arabidopsis thaliana]
 gi|110737123|dbj|BAF00513.1| hypothetical protein [Arabidopsis thaliana]
          Length = 402

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 115/292 (39%), Gaps = 45/292 (15%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV-IAEDYATLYKVNGR 140
           L++    VA N  IIV   +    P L   + S+ R   Q+ ++V + +   +  K N  
Sbjct: 121 LAKLLEKVAVNKEIIVVLANNNVKPMLEVQIASVKRVGIQNYLVVPLYDSLESFCKSN-- 178

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                      PD+       S+   + +  +   L   L+LGY V+ +DVD+V+L++PF
Sbjct: 179 ---EVAYYKRDPDNAIDVVGKSRRSSDVSGLKFRVLREFLQLGYGVLLSDVDIVFLQNPF 235

Query: 201 PYLQGDHDVYFTDDMAAVKPLDHSHDL--PPPGKKGRTYIC-------SCMIYLRPTDGA 251
            +L  D DV    D          +D+   P   + RT          S   YLRPT  +
Sbjct: 236 GHLYRDSDVESMSDGHDNNTAYGFNDVFDDPTMTRSRTVYTNRIWVFNSGFFYLRPTLPS 295

Query: 252 KLVMKKWIEEL-QAEPWSKAKKANDQPAFNWALNKTA--GQVDLY----LLPQSAFPTGG 304
             ++ +  + L ++  W       DQ  FN  L   +  G   LY    ++    F    
Sbjct: 296 IELLDRVTDTLSKSGGW-------DQAVFNQHLFYPSHPGYTGLYASKRVMDVYEFMNSR 348

Query: 305 LYFKNQTWVEETKG-KHVIIHNN---------------YITGFEKKIKRFRD 340
           + FK     EE K  K VIIH N               Y+ G +  + RFRD
Sbjct: 349 VLFKTVRKDEEMKKLKPVIIHMNYHSDKLERMQAAVEFYVNGKQDALDRFRD 400


>gi|42407878|dbj|BAD09019.1| regulatory protein-like [Oryza sativa Japonica Group]
 gi|42407980|dbj|BAD09118.1| regulatory protein-like [Oryza sativa Japonica Group]
          Length = 335

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 9/130 (6%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
           T+I+ ++++ Y         FL ++ +    +     VL++A D A L +     P   +
Sbjct: 69  TVIMTSINRAYAAPGSLLDLFLESFRLGEGTEPLLKHVLIVAMDPAALARCRQVHPHCYL 128

Query: 147 LVPP--APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
           L  P  A D     +F S+ + +    R      IL+LG+N ++ D+D++W ++P  ++ 
Sbjct: 129 LRRPEGAVDYSDEKRFMSKDYLDMMWGRNLFQQTILQLGFNFLFTDIDIMWFRNPLRHIA 188

Query: 205 GDHDVYFTDD 214
              D+   +D
Sbjct: 189 ITSDIAVAND 198


>gi|308081654|ref|NP_001183634.1| uncharacterized protein LOC100502228 [Zea mays]
 gi|238013584|gb|ACR37827.1| unknown [Zea mays]
 gi|414878981|tpg|DAA56112.1| TPA: hypothetical protein ZEAMMB73_163166 [Zea mays]
 gi|414878982|tpg|DAA56113.1| TPA: hypothetical protein ZEAMMB73_163166 [Zea mays]
          Length = 357

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 70/161 (43%), Gaps = 16/161 (9%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKH 120
            +A  + A+  D       +   ++ T+I+ +V++ +         FL ++         
Sbjct: 67  ADAKGAAAEQEDELQRLLRAVADEDRTVIMTSVNEAWAAQDSLLDLFLESFRSGERIAHF 126

Query: 121 QDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHK--FGSQGFFNFTSRRPCHLLH 178
            D +LV+A D   L +     P H  L+P A     + +  F S+ + +    +      
Sbjct: 127 VDHLLVVALDGGALERCRAVHP-HCYLLPTAAARNLSGEKVFMSKDYIDLVWSKVRLQQR 185

Query: 179 ILELGYNVMYNDVDMVWLKDPF------PYLQGDHDVYFTD 213
           ILELGYN ++ DVD++W ++PF       ++    D YF D
Sbjct: 186 ILELGYNFLFTDVDILWFRNPFERMSVAAHMVTSSDFYFGD 226


>gi|148906908|gb|ABR16599.1| unknown [Picea sitchensis]
          Length = 422

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 38/162 (23%), Positives = 69/162 (42%), Gaps = 11/162 (6%)

Query: 79  DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
           D  L++ A+    N T+I+ A++  +         FL ++ +     +  + +L++A D 
Sbjct: 143 DRVLAKTAN---SNRTVIITALNVAWAEPNTMIDLFLESFRVGEGTPELLNNLLIVALDA 199

Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
               +     P    L     D      + S  +     RR   L  IL+ GY+ +++D 
Sbjct: 200 KAYDRCLEIHPHCYTLKTRGVDFSAEKLYMSDDYLKMMWRRLGFLADILKRGYSFVFSDA 259

Query: 192 DMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKK 233
           D++WL++PF     D D+    D     P D  H++P  G K
Sbjct: 260 DIMWLRNPFTRFSPDADIQIASDQYNGSPYD-VHNMPNGGYK 300


>gi|407408299|gb|EKF31795.1| hypothetical protein MOQ_004365 [Trypanosoma cruzi marinkellei]
          Length = 710

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 54/267 (20%), Positives = 107/267 (40%), Gaps = 18/267 (6%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISIS-RQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
            +NG I V   +  +   L+N++ S + R K ++ ++   +D A    ++ R P     +
Sbjct: 395 GENGFITVVIFNSFWRDHLHNFVYSFAKRAKMRNLIVASVDDTALSLCLSFRLPCLNATL 454

Query: 149 PPAPDSQTAHKFGSQGFF-----NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
              P+ +  +     GF       F+  +P   + IL  GY  M  D+D+ W + P PYL
Sbjct: 455 FVEPEKEGDNASSKGGFTRKVTEEFSWVKPRLAIAILRRGYTFMLADLDITWNRSPMPYL 514

Query: 204 QGDH-DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
             +  D+    D  +   ++    +  P  +   Y    M +       +  M+ +++  
Sbjct: 515 LKNRLDLVHQCDSGSRLSINSGLYMARPNSRNLRYFQDLMSFRTDESADQNAMRLFMKYD 574

Query: 263 QAEPWSKAKKANDQPAFNWALN-KTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHV 321
                S  ++   +  FN   N K  G V L          G   F+ +++      K +
Sbjct: 575 HIHGVS--QRCLPKWDFNMKCNYKVEGSVKL--------EDGRQTFRWRSYPLNENPKWL 624

Query: 322 IIHNNYITGFEKKIKRFRDFGLWLVDD 348
            +H   ++G + KI+ F+    W +D+
Sbjct: 625 AMHATCLSGAKDKIRYFKAIKAWFLDE 651


>gi|218201503|gb|EEC83930.1| hypothetical protein OsI_30008 [Oryza sativa Indica Group]
          Length = 343

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 9/130 (6%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
           T+I+ ++++ Y         FL ++ +    +     VL++A D A L +     P   +
Sbjct: 77  TVIMTSINRAYAAPGSLLDLFLESFRLGEGTEPLLKHVLIVAMDPAALARCRQVHPHCYL 136

Query: 147 LVPP--APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
           L  P  A D     +F S+ + +    R      IL+LG+N ++ D+D++W ++P  ++ 
Sbjct: 137 LQRPEGAVDYSDEKRFMSKDYLDMMWGRNLFQQTILQLGFNFLFTDIDIMWFRNPLRHIA 196

Query: 205 GDHDVYFTDD 214
              D+   +D
Sbjct: 197 ITSDIAVAND 206


>gi|168017116|ref|XP_001761094.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687780|gb|EDQ74161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 526

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 66/277 (23%), Positives = 110/277 (39%), Gaps = 31/277 (11%)

Query: 84  QAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           +   F AK   I V   +  ++ F+ NW+  ++  +  + +LV A D   L  +   W G
Sbjct: 12  EMIEFRAKKNVIAVTFANFAFMDFVLNWVRHLTDVEVTN-ILVGAMDTKILEAL--FWKG 68

Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
             V    +        +G+  F      +   +   L  GY ++  D D+    DPFPY 
Sbjct: 69  VPVFDMRSGMETVDVGWGTPKFHKMGREKVILINAFLAEGYEILMCDTDV----DPFPYF 124

Query: 204 QG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
           +   D D+  + D   V  +D    L   G+    Y    + + RPT  AK + K+W++ 
Sbjct: 125 ERFPDADILTSSD-EVVNSVDDDR-LEFYGQSWGAYNIG-IFFWRPTTIAKELAKEWLQL 181

Query: 262 LQAEPWSKAKKANDQPAFN-WALNKTA-------------GQVDLYLLPQSAFPTGGLYF 307
           L ++      K  DQ  FN  A N T              G + + +LP S F +G  YF
Sbjct: 182 LLSD-----DKIWDQNGFNDLARNVTGPSVGHDSLFWAYRGNLKMGILPVSLFCSGHTYF 236

Query: 308 KNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLW 344
             + + +     + +       G E K  R R+  L+
Sbjct: 237 IQELYKKLDLVPYAVHTTFQFGGTEGKRHRLREAKLF 273


>gi|326493850|dbj|BAJ85387.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 362

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 56/294 (19%), Positives = 118/294 (40%), Gaps = 50/294 (17%)

Query: 60  TGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQK 119
           TGT   +FP     L +               +GT+I+ +V++ +       L+ + R+ 
Sbjct: 70  TGTEERLFPGLAELLPR-----------VATDDGTVIITSVNEAWSR--PGSLLDLFREG 116

Query: 120 HQD---------QVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS 170
            ++           L++A D   L       P   +L   A D  +A++F ++ +     
Sbjct: 117 FKNGEGIAHLLNHTLIVAVDAGALAHCEAVHPHCYLLEVTAADVSSANRFMTKSYLELVW 176

Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM--AAVKPLDHSHDLP 228
            +      +L+LGY+ ++ DVD++WL++PF ++    D+  + D      + L ++   P
Sbjct: 177 AKLELQQRVLQLGYSYLFTDVDIMWLRNPFRHISLYADMAVSTDRFNGDAEALTNA---P 233

Query: 229 PPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNK-TA 287
             G            Y+R T+    ++++W         S+    +DQ  F+    +  A
Sbjct: 234 NTG----------FYYVRSTNRTVEMLRRW-----RAARSRFPPTHDQAVFDEIKGELAA 278

Query: 288 GQVDL-YLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRD 340
           G++ + ++  ++A       F     +     +   +H N   G E K+   R+
Sbjct: 279 GELRIRFVFLETAL------FDGFCQLHGEMDRVCTMHANCCIGLENKVHDLRN 326


>gi|226529638|ref|NP_001142135.1| hypothetical protein [Zea mays]
 gi|194689294|gb|ACF78731.1| unknown [Zea mays]
 gi|194707310|gb|ACF87739.1| unknown [Zea mays]
 gi|414873942|tpg|DAA52499.1| TPA: hypothetical protein ZEAMMB73_904107 [Zea mays]
          Length = 374

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 56/256 (21%), Positives = 93/256 (36%), Gaps = 33/256 (12%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
           T+I+  V+Q +         FL ++ +          +L++A D A + +     P H  
Sbjct: 112 TVIITCVNQAWAAPGSLLDLFLESFRVGDGTAPLLRHLLIVAMDAAAMARCRALHP-HCY 170

Query: 147 LVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
           L  PA   D   A  F S+ +      +      +L LGY++++ D D++WL++P  ++ 
Sbjct: 171 LYSPARGVDFAPAKPFLSRDYLELVWSKLRLQRRVLRLGYSLLFTDADVLWLRNPLKHVT 230

Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQA 264
              D+  + D+    P D   + P  G            ++RP      +   W      
Sbjct: 231 AYADMTVSCDVFFGDP-DGVDNFPNTG----------FFHVRPNSRTIAMAAAWHRARDR 279

Query: 265 EPWSKAKKANDQPAFNWALNKTAGQVDLYL-LPQSAFPTGGLYFKNQTWVEETKGKHVII 323
            P       N+QP FN         + L L     AF  G   +          GK   +
Sbjct: 280 FP-----GKNEQPVFNAIKKGLVADLGLRLQYIDPAFVAGFCSYGRDL------GKVCTM 328

Query: 324 HNNYITGFEKKIKRFR 339
           H N   G   KI   R
Sbjct: 329 HANCCVGLRAKITDLR 344


>gi|212274615|ref|NP_001130717.1| uncharacterized protein LOC100191821 [Zea mays]
 gi|194689930|gb|ACF79049.1| unknown [Zea mays]
 gi|223949661|gb|ACN28914.1| unknown [Zea mays]
 gi|413932429|gb|AFW66980.1| putative NAC domain transcription factor superfamily protein [Zea
           mays]
          Length = 366

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 37/162 (22%), Positives = 67/162 (41%), Gaps = 23/162 (14%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           +LELGYN ++ DVD++W +DPF ++    D+  + D+    P     + P  G       
Sbjct: 195 VLELGYNFLFTDVDVMWFRDPFRHISMAADIAISSDVYMGDPYS-LRNFPNGG------- 246

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYL-LPQ 297
               +++R +D      + W    Q   W    K ++Q  FN   ++ A ++DL +    
Sbjct: 247 ---FLFVRASDKTVAFYRAW----QQGRWRFLGK-HEQDVFNLIKHEEAPRLDLAIQFLD 298

Query: 298 SAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
           +A+ +G               +   +H N   G   K+   R
Sbjct: 299 TAYISGFCQLSKDL------NRICTLHANCCVGLGAKLHDLR 334


>gi|222640915|gb|EEE69047.1| hypothetical protein OsJ_28045 [Oryza sativa Japonica Group]
          Length = 352

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 30/130 (23%), Positives = 61/130 (46%), Gaps = 9/130 (6%)

Query: 94  TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAV 146
           T+I+ ++++ Y         FL ++ +    +     VL++A D A L +     P   +
Sbjct: 77  TVIMTSINRAYAAPGSLLDLFLESFRLGEGTEPLLKHVLIVAMDPAALARCRQVHPHCYL 136

Query: 147 LVPP--APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
           L  P  A D     +F S+ + +    R      IL+LG+N ++ D+D++W ++P  ++ 
Sbjct: 137 LRRPEGAVDYSDEKRFMSKDYLDMMWGRNLFQQTILQLGFNFLFTDIDIMWFRNPLRHIA 196

Query: 205 GDHDVYFTDD 214
              D+   +D
Sbjct: 197 ITSDIAVAND 206


>gi|148906537|gb|ABR16421.1| unknown [Picea sitchensis]
          Length = 432

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 76/274 (27%), Positives = 112/274 (40%), Gaps = 49/274 (17%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED-YATLYKVNG- 139
           L++    +A    +IV   +      L  W  SI R    + ++V  +D  A   K  G 
Sbjct: 148 LAELLQKIAVKKELIVGLANYNVKEMLEVWSDSIKRAGITNYLVVALDDSVAEFCKSRGV 207

Query: 140 ----RWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMV 194
               R P  A+       S+T  K G     +  S    HLL   L+LGY+V+ +DVD+V
Sbjct: 208 PVYRRDPADAI-------SKTVGKTGDN---HAISGLKFHLLREFLQLGYSVLLSDVDIV 257

Query: 195 WLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYIC-SCMIY 244
           +L++PF +L  D DV    D    M A    D S D P  G        R ++  S   Y
Sbjct: 258 YLQNPFNFLYRDCDVESMTDGYSNMTAYGYDDVSDD-PSMGWSRYAHTMRIWVFNSGFFY 316

Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLY 293
           +RPT  +  ++ + ++ L  E       A DQ  FN             L+ +   +D Y
Sbjct: 317 IRPTIPSIELLDRVVDRLSKE------NAWDQAVFNELLFFPSRPGYEGLHASRRAMDYY 370

Query: 294 LLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY 327
           L   S      L+ K +   +    K VIIH NY
Sbjct: 371 LFLNSKV----LFTKFRKEPKMPSYKPVIIHINY 400


>gi|413932427|gb|AFW66978.1| putative NAC domain transcription factor superfamily protein [Zea
           mays]
          Length = 263

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 37/162 (22%), Positives = 69/162 (42%), Gaps = 23/162 (14%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           +LELGYN ++ DVD++W +DPF ++    D+  + D+    P     + P  G       
Sbjct: 92  VLELGYNFLFTDVDVMWFRDPFRHISMAADIAISSDVYMGDPYS-LRNFPNGG------- 143

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYL-LPQ 297
               +++R +D      + W    Q   W    K ++Q  FN   ++ A ++DL +    
Sbjct: 144 ---FLFVRASDKTVAFYRAW----QQGRWRFLGK-HEQDVFNLIKHEEAPRLDLAIQFLD 195

Query: 298 SAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
           +A+ +G         + +   +   +H N   G   K+   R
Sbjct: 196 TAYISGFCQ------LSKDLNRICTLHANCCVGLGAKLHDLR 231


>gi|326496953|dbj|BAJ98503.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326526459|dbj|BAJ97246.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 369

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 11/102 (10%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQT----AHKFGSQGFFNFTSRRPCHLL 177
           D +LV+A D   L +     P H  L+PP             F S+ + +    +     
Sbjct: 138 DNLLVVALDAGALERCRAVHP-HCYLLPPVAGGNKNLSDEKVFMSKDYLDLVWSKVRLQQ 196

Query: 178 HILELGYNVMYNDVDMVWLKDPF------PYLQGDHDVYFTD 213
            ILELGYN ++ DVD++W ++PF       ++    D YF D
Sbjct: 197 RILELGYNFLFTDVDIMWFRNPFERMSVAAHMVTSSDFYFGD 238


>gi|302819510|ref|XP_002991425.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300140818|gb|EFJ07537.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 545

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 114/298 (38%), Gaps = 36/298 (12%)

Query: 68  PNATSSLAKWRDYTLS-QAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV 126
           P   S L     + LS +   F AK   I+V   +  +  F+  W+  ++       +LV
Sbjct: 14  PPHGSELPPREAFVLSKEMVEFRAKKNVIMVTFANHAFEDFVLTWVRHLT-DVGVTNLLV 72

Query: 127 IAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNV 186
            A D   L ++   W G  V    +  +     +G+  F      +   +  I+ +G+ V
Sbjct: 73  GAMDRKILEELF--WKGVPVFDMGSEMNPADVGWGTPVFHKMGREKVFLVNAIMAMGFEV 130

Query: 187 MYNDVDMVWLKDPFPYLQG--DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIY 244
           ++ D DM    +P PY++   D DV  + D       D S +     ++    +   + +
Sbjct: 131 LFCDTDM----NPLPYMERYPDADVLVSSDAVIATVTDESLE---DWRRSYAALNIGIFH 183

Query: 245 LRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL-NKTAGQVD------------ 291
            RPT+ AK   + W  +L+ E      K  DQ  FN  + N T   VD            
Sbjct: 184 WRPTEAAKKFARAWQIQLEDE------KIWDQNGFNELIQNGTREAVDPDNDRGLFYAFD 237

Query: 292 ----LYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWL 345
               + +LP S F +G  YF    + +     + +       G E K  R R+  L+ 
Sbjct: 238 RTLKVGILPVSMFCSGHTYFVQHLYKQLGLDVYAVHTTFQFAGTEGKRHRLREAQLFF 295


>gi|255571059|ref|XP_002526480.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis]
 gi|223534155|gb|EEF35871.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis]
          Length = 357

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 30/130 (23%), Positives = 56/130 (43%), Gaps = 12/130 (9%)

Query: 92  NGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
           N T+I+  +++ Y              FL ++ +    +   D +L++A D     +   
Sbjct: 76  NKTVIIVILNKAYAEPTVKSETTMLDLFLESFWVGEDTRPLLDHLLLVAADQTAYERCMF 135

Query: 140 RWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
           +      +     D      F S+ F     RR   LL +L+ GY+ ++ D D++WL++P
Sbjct: 136 KRLNCYKMETEGVDFGGEKLFMSKDFIKMMWRRTLLLLDVLKHGYSFIFTDADVMWLRNP 195

Query: 200 FPYLQGDHDV 209
           FP L  +  V
Sbjct: 196 FPRLSKNESV 205


>gi|145345111|ref|XP_001417066.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577292|gb|ABO95359.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 486

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 63/292 (21%), Positives = 118/292 (40%), Gaps = 44/292 (15%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED--YATLYKVN-GRWPGHA 145
           VA +G +IV   +  Y  F  N+L  +   +  + ++   ++  YA L K+    W    
Sbjct: 86  VAVDGAVIVTWANMHYYDFALNFLSHLDALEVTNYLIGAMDEELYAALRKIGVNTW---- 141

Query: 146 VLVPPAPDSQTAHK---FGSQGFFNFTSRRPCHLLH-ILELGYNVMYNDVDMVWLKDPFP 201
           ++   + D+    K   +GS+ F     R    L+H   + G +V+ +D+D+ WL++P P
Sbjct: 142 LMGSKSIDADAVKKDFGWGSKNFHKM-GRDKIRLIHDFTKTGVDVLISDIDVAWLRNPIP 200

Query: 202 YLQG--DHDVYFTDDMAAVKPLDHSHDLP--PPGKKGRTYICS-----CMIYLRPTDGAK 252
           + +     D+  + D    +    +  +     G+   +  C+      M++ R T+ ++
Sbjct: 201 FFRRYPKADILVSTDNLQNRTHQDARQISHMVDGEGLESTPCAGTVNIGMMWFRATEASQ 260

Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA----------------GQVDLYLLP 296
            +  +W+  L+     K +K  DQ  FN  + +                  G+V +  LP
Sbjct: 261 QLTGEWVRNLE-----KDEKIWDQAEFNTLVQRGGCGTPSPDGSGVGSAYEGKVQMGTLP 315

Query: 297 QSAFPTGGLYFKNQTWVEETKGKHVIIHNNY-ITGFEKKIKRFRDFGLWLVD 347
            + F  G  YF  Q   E        +H  +   G   K  R R+   WL D
Sbjct: 316 VALFNNGHTYF-TQRLPELINVNPYAMHATFQYDGTPGKRNRMREANQWLGD 366


>gi|449461134|ref|XP_004148298.1| PREDICTED: uncharacterized protein LOC101214097 [Cucumis sativus]
          Length = 435

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 64/260 (24%), Positives = 101/260 (38%), Gaps = 35/260 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA +  +IV   +      L  W  SI +    + ++V  +D    +      P +    
Sbjct: 158 VAIDRELIVAVANSNVKAMLELWFTSIKKAGIPNYLVVALDDEIVQFCKKNDVPVYM--- 214

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD +      + G    +  +   L   L+LGY V+ +DVD+V+L++PF +L  D D
Sbjct: 215 -RDPDEKVDSIGRTGGNHAVSGTKFRILREFLQLGYAVLLSDVDIVYLQNPFNHLYRDSD 273

Query: 209 V-YFTDDMAAVKPLDHSHDLPPP-------GKKGRTYI-CSCMIYLRPTDGAKLVMKKWI 259
           V   TD    V    ++     P           R ++  S   Y+RPT  A  ++ +  
Sbjct: 274 VESMTDGHDNVTAYGYNDVFEEPAMGWARFAHTMRIWVYNSGFFYIRPTIPAIELLDRVA 333

Query: 260 EELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFPTGGLYFK 308
             L     S+ + + DQ  FN             LN +   +D YL     F    + FK
Sbjct: 334 NRL-----SRERNSWDQAVFNEELFFPSHSNYEGLNASRRTMDFYL-----FMNSKVLFK 383

Query: 309 NQTWVEETKG-KHVIIHNNY 327
                +  K  K VIIH NY
Sbjct: 384 TVRKDDNLKKLKPVIIHVNY 403


>gi|413936404|gb|AFW70955.1| hypothetical protein ZEAMMB73_889446 [Zea mays]
          Length = 334

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 37/162 (22%), Positives = 67/162 (41%), Gaps = 23/162 (14%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           +LELGYN ++ DVD++W +DPF ++    D+  + D+    P     + P  G       
Sbjct: 163 VLELGYNSLFTDVDVMWFRDPFRHISMAADIAISSDVYMGDPYS-LRNFPNGG------- 214

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYL-LPQ 297
               +++R +D      + W    Q   W    K ++Q  FN   ++ A ++DL +    
Sbjct: 215 ---FLFVRASDKTVAFYRAW----QQGRWRFLGK-HEQDVFNLIKHEEAPRLDLAIQFLD 266

Query: 298 SAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
           +A+ +G               +   +H N   G   K+   R
Sbjct: 267 TAYISGFCQLSKDL------NRICTLHANCCVGLGAKLHDLR 302


>gi|443710358|gb|ELU04612.1| hypothetical protein CAPTEDRAFT_207413 [Capitella teleta]
          Length = 403

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/200 (22%), Positives = 80/200 (40%), Gaps = 34/200 (17%)

Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVK 219
           F S  F   +  +   +  +L+LGY ++  D+D+V+LK+P               +  VK
Sbjct: 236 FASAAFNEKSKVKLAMVYAVLQLGYKILIADLDVVFLKNP---------------LDVVK 280

Query: 220 PLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAF 279
                 D+       +  + +  +Y +PT  +    KK  E++         K +DQ  F
Sbjct: 281 SCGKDCDIAVQNNTNKQ-LNTGFLYSKPTPKSIAFYKKITEKMVDS------KGHDQSVF 333

Query: 280 N--WALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKR 337
           N  +  N   G +++++LP      G         +     K  + H N+  GF  KI+ 
Sbjct: 334 NMVYKRNMVPG-INIHVLPVDVACVG---------ISHDPEKCHVFHANFKKGFSPKIRM 383

Query: 338 FRDFGLWLVDDHAVESPLGK 357
            +  GLW  +   V S L +
Sbjct: 384 LKQHGLWKAEPEDVNSYLCR 403


>gi|449522244|ref|XP_004168137.1| PREDICTED: uncharacterized protein At4g15970-like [Cucumis sativus]
          Length = 380

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 29/137 (21%), Positives = 60/137 (43%), Gaps = 7/137 (5%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++GTII+  ++  +         FL ++ I    Q+    ++++  D     +     P 
Sbjct: 89  EDGTIILTTLNDAWAEPDSLLDLFLKSFHIGNGTQRLLKHLVIVTLDQKAYSRCVAVHPH 148

Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
              L     +  +   F +  +     RR   L+++LE+G++ ++ D D++WL+DPF + 
Sbjct: 149 CYQLDTQGTNFSSEAYFMTADYLKMMWRRIEFLIYVLEMGHSFVFTDTDIMWLQDPFNHF 208

Query: 204 QGDHDVYFTDDMAAVKP 220
             D D     D+    P
Sbjct: 209 YKDADFQIASDLYLGNP 225


>gi|326497757|dbj|BAK05968.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 412

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 38/157 (24%), Positives = 65/157 (41%), Gaps = 11/157 (7%)

Query: 92  NGTIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGH 144
           N T+++ A+++ +     FL+ +L S    ++       +L++A D     + N   P  
Sbjct: 147 NRTVLITALNEAWAAPGSFLDLFLESFKHGENTANLVKHLLIVAMDKKAFDRCNAVHPLC 206

Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
                   D     K+    +     +R      ILELGY  ++ DVD++W +DPFP + 
Sbjct: 207 YWFRVEGMDFAAEQKYMKGDYLEMMWKRNRFQQTILELGYTFLFTDVDILWFRDPFPRIP 266

Query: 205 GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
               V  + D     P D   + P     G  Y+ SC
Sbjct: 267 EAAQVVMSSDFFVGDP-DSPGNYP---NGGLLYVRSC 299


>gi|326529655|dbj|BAK04774.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 369

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 27/116 (23%), Positives = 48/116 (41%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++ +    ++  D +++++ D A   +          +     D      F + G+ 
Sbjct: 125 FLESFRVGEHTRELVDHLVIVSLDLAAHRRCKQIHAHCLAVATEGVDFSGQKNFMTDGYL 184

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLD 222
               RR   L  +LE G++ ++ D D+VWL+ P P L  D D     D     P D
Sbjct: 185 RMMWRRIDFLRQVLEKGFSFIFTDTDIVWLRSPLPRLYADGDFQIACDHFTGDPDD 240


>gi|413932428|gb|AFW66979.1| putative NAC domain transcription factor superfamily protein [Zea
           mays]
          Length = 306

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 41/188 (21%), Positives = 75/188 (39%), Gaps = 23/188 (12%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D ++   F S+ +      R      +LELGYN ++ DVD++W +DPF ++    D+  +
Sbjct: 109 DYRSEKVFMSKDYLEMMWGRNRFQQTVLELGYNFLFTDVDVMWFRDPFRHISMAADIAIS 168

Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
            D+    P     + P  G           +++R +D      + W    Q   W    K
Sbjct: 169 SDVYMGDPYS-LRNFPNGG----------FLFVRASDKTVAFYRAW----QQGRWRFLGK 213

Query: 273 ANDQPAFNWALNKTAGQVDLYL-LPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGF 331
            ++Q  FN   ++ A ++DL +    +A+ +G               +   +H N   G 
Sbjct: 214 -HEQDVFNLIKHEEAPRLDLAIQFLDTAYISGFCQLSKDL------NRICTLHANCCVGL 266

Query: 332 EKKIKRFR 339
             K+   R
Sbjct: 267 GAKLHDLR 274


>gi|255574023|ref|XP_002527929.1| conserved hypothetical protein [Ricinus communis]
 gi|223532704|gb|EEF34486.1| conserved hypothetical protein [Ricinus communis]
          Length = 498

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 50/117 (42%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           V  N T+++      Y   L +W+  + R +  + ++   +     + V    P      
Sbjct: 357 VDANKTVVLAVAGYSYKDMLMSWVCRLRRLQVTNFLICALDQETYQFAVLQGLPVFQDPS 416

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
            P   S     FG+  F   T  +   +L IL+LGYNV+ +DVD+ W  +P P L  
Sbjct: 417 APRNISFDDCHFGTDCFQRVTKVKSRIVLKILKLGYNVLLSDVDVYWFANPLPILSS 473


>gi|359491817|ref|XP_002270758.2| PREDICTED: uncharacterized protein At4g15970-like [Vitis vinifera]
          Length = 373

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 47/209 (22%), Positives = 92/209 (44%), Gaps = 19/209 (9%)

Query: 43  LLVLLSLLVVLGVILPWTGTPGFMFPNATSSLAKWR----DYTLSQAASFVAKNGTIIVC 98
           +++ ++++V+ G++L  +       P A  S ++ +    D  L +AA     + T+I+ 
Sbjct: 43  IMLFMAVVVLPGMVLHKSAYSSQFLP-ALISRSELKESPLDIVLEKAA---MGDKTVILT 98

Query: 99  AVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
            V+  +         FL ++ I  + ++  + +++IA D  +  +     P    L    
Sbjct: 99  TVNGAWAANNSLLDLFLESFHIGNNTKRLLNHLVIIALDQKSYARCLALHPLCYALKTEG 158

Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
            D      + +  +     RR   L  IL +GY+ ++ D D++W +DPF +   D D   
Sbjct: 159 VDFSGEAYYSTPNYLEMMWRRIDFLRSILTMGYSFIFTDADIMWFRDPFQHFFQDADFQI 218

Query: 212 TDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           T D     P D ++   P G  G TY+ S
Sbjct: 219 TCDSYIGNPYDVNN--RPNG--GFTYVKS 243


>gi|300123077|emb|CBK24084.2| unnamed protein product [Blastocystis hominis]
          Length = 414

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 62/274 (22%), Positives = 107/274 (39%), Gaps = 36/274 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQ---VLVIAEDYATLYKVNGRWPGHA 145
           +A N T++       YL    N   +  R  H +Q     V   D+ +   V  R     
Sbjct: 86  IAVNNTVMTSFTDFGYL----NIFYTFYRLSHLEQYPNFFVTVIDHKSYEDVKKRGIPVF 141

Query: 146 VLVPPAPDSQTAHK---FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
              P   D +   K     S+ F      +   +  +L LG+  +Y D D++  ++P+P 
Sbjct: 142 YYRPVGVDDEMMSKGSIIMSKDFQKKVVNKLDFIRLVLSLGFVTLYMDCDLILFQNPWPI 201

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
           L      + + D   V   D S +             S  + L PT   +L++       
Sbjct: 202 LS----TFSSRDYDLVTQRDESLN-------------SGFMLLFPTTQTRLLLS------ 238

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKG-KHV 321
            A    K     DQ +  + L + +G + L+LLP   F +G  + ++  +  +  G    
Sbjct: 239 CATLHMKQANELDQESILFCLPRLSG-LRLHLLPLEQFSSGRYFAESHQFYWDAIGTNQY 297

Query: 322 IIHNNYITGFEKKIKRFRDFGLWLVD-DHAVESP 354
           ++HNN+I G   K+ R+R+  L+  D D    SP
Sbjct: 298 MMHNNWIIGTNNKLYRWREMRLYTEDSDEYYSSP 331


>gi|255556103|ref|XP_002519086.1| conserved hypothetical protein [Ricinus communis]
 gi|223541749|gb|EEF43297.1| conserved hypothetical protein [Ricinus communis]
          Length = 436

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 102/267 (38%), Gaps = 36/267 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L+Q    VA    IIV   +      L  W  SI +    + ++V  +D    +  +   
Sbjct: 153 LAQILEEVAVQREIIVALANSNVKEMLEIWFNSIKKVGIPNYLVVALDDEIADFCESNDV 212

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P    +    PD +     G  G  +  S    H+L   L+LGY+V+ +DVD+V+L++PF
Sbjct: 213 P----VYKRDPD-EGIDSIGKTGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPF 267

Query: 201 PYLQGDHDV-YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI--------YLRPTDGA 251
            YL  D DV   TD    +    +      P      Y  +  I        Y+RPT  +
Sbjct: 268 DYLYRDSDVESMTDGHNNLTAYGYDDVFDEPAMGWARYAHTMRIWVFNSGFFYIRPTIPS 327

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNW-----------ALNKTAGQVDLYLLPQSAF 300
             ++ +    L  E      K  DQ  FN             L  +   +D YL   S  
Sbjct: 328 IELLDRVANRLAHE------KVWDQAVFNEELFYPSHPGYDGLYASRRTMDFYLFMNSKV 381

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNY 327
               L+   +     +K K VIIH NY
Sbjct: 382 ----LFKTVRKDANLSKLKPVIIHVNY 404


>gi|443717821|gb|ELU08709.1| hypothetical protein CAPTEDRAFT_227405 [Capitella teleta]
          Length = 492

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 48/206 (23%), Positives = 87/206 (42%), Gaps = 36/206 (17%)

Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
            DS+    + S+ F    + R   +L  L+LG+NV++ DVD+V+  +P   ++ +     
Sbjct: 130 ADSEKESVYMSKDFVRKMNIRTYMILEALKLGFNVLHTDVDVVYFTNPLTEVEKE----- 184

Query: 212 TDDMAAVKPLDHSHDLPP-----PGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEP 266
                     +   DL P        +G  +I S    +R  +  K++         AE 
Sbjct: 185 --------CPEKDCDLAPLWDSIVYNEGFVFIRSSPAGVRAFEDMKVI---------AET 227

Query: 267 WSKAKKANDQPAFNWALNKTAG-QVDLYLLPQSAFPTGGLYFKNQTWV--EETKGKH--V 321
            +K    +DQ A N  + +     +    LP + + +G  ++++   V      G H   
Sbjct: 228 TNK----DDQVALNTVIKRGHKYGLRFKKLPVTQYLSGKAFYEDTERVFGNSAPGCHNCK 283

Query: 322 IIHNNYITGFEKKIKRFRDFGLWLVD 347
           + HNN+I   E K+ RFR+   W+ D
Sbjct: 284 VAHNNWIVSIEAKVYRFREMLQWIHD 309


>gi|356561839|ref|XP_003549184.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 353

 Score = 46.6 bits (109), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 77/185 (41%), Gaps = 16/185 (8%)

Query: 45  VLLSLLVVLGVILPWTGTPGFMFP------NATSSLAKWRDYTLSQAASFVA-KNGTIIV 97
           + L+ L VL + L  T +P  +FP      NA SS     D  L       + K+ T+I+
Sbjct: 34  MFLAGLAVLWMFLYNTASPFGIFPAFSHSINAQSSKQANYDPKLESVLRDASMKDKTVII 93

Query: 98  CAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
             ++  +         FL +  +    Q   + ++VI  D  TL +       H   V  
Sbjct: 94  TTLNDAWAEPGSMFDLFLESLQLGNGTQWLSNHLVVITWDQKTLARCLVV-HKHCYQVET 152

Query: 151 APDSQTAHKF-GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
              + T   F  +  + +   RR   L  ILE+GYN ++ D D++WL+DPF     D D 
Sbjct: 153 KGGNYTGEVFYMTPNYLHMMWRRTEFLGSILEMGYNFVFTDTDIMWLRDPFKQFYKDTDF 212

Query: 210 YFTDD 214
               D
Sbjct: 213 QIACD 217


>gi|412993241|emb|CCO16774.1| predicted protein [Bathycoccus prasinos]
          Length = 997

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 36/205 (17%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D++ +HK  +Q F          +   +++G +V+  D D+V+L++PFPYL  DHD+   
Sbjct: 279 DARGSHKISAQKF--------GIIKDFVKVGCSVLLTDTDVVYLQNPFPYLYRDHDIESM 330

Query: 213 DDMAAVKPLDHSH---DLPPPGKKGRTYICSCMI--------YLRPTDGAKLVMKKWIEE 261
            D    +  +  H   D    G+ GR  + +  +        Y+  T+ +  +M      
Sbjct: 331 SDGWDNQTANGFHQVIDDAAMGRSGRARVKAFRVSALNSGLWYVAATEASYRLMSIMAHR 390

Query: 262 LQAEP-WSKAKKANDQPAFNWAL-------NKTAGQVDLYLLPQSAFPTGGLYFKNQTWV 313
           +  E  W       DQ  +N  L       ++T+G     + P     +  ++   +   
Sbjct: 391 MATEDLW-------DQAGYNLELWFASRDWHQTSGATVRVMNPYCFLNSKVMFRIVRHKK 443

Query: 314 EETKGKH--VIIHNNYITGFEKKIK 336
           E  K +H  V +H NY T  E+K++
Sbjct: 444 ELAKDRHRPVAMHANYHTDKERKMQ 468


>gi|413936405|gb|AFW70956.1| hypothetical protein ZEAMMB73_889446 [Zea mays]
          Length = 306

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 41/188 (21%), Positives = 75/188 (39%), Gaps = 23/188 (12%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D ++   F S+ +      R      +LELGYN ++ DVD++W +DPF ++    D+  +
Sbjct: 109 DYRSEKVFMSKDYLEMMWGRNRFQQTVLELGYNSLFTDVDVMWFRDPFRHISMAADIAIS 168

Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
            D+    P     + P  G           +++R +D      + W    Q   W    K
Sbjct: 169 SDVYMGDPYS-LRNFPNGG----------FLFVRASDKTVAFYRAW----QQGRWRFLGK 213

Query: 273 ANDQPAFNWALNKTAGQVDLYL-LPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGF 331
            ++Q  FN   ++ A ++DL +    +A+ +G               +   +H N   G 
Sbjct: 214 -HEQDVFNLIKHEEAPRLDLAIQFLDTAYISGFCQLSKDL------NRICTLHANCCVGL 266

Query: 332 EKKIKRFR 339
             K+   R
Sbjct: 267 GAKLHDLR 274


>gi|297850324|ref|XP_002893043.1| hypothetical protein ARALYDRAFT_472163 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338885|gb|EFH69302.1| hypothetical protein ARALYDRAFT_472163 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 428

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 104/264 (39%), Gaps = 29/264 (10%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    +A +  +IV   +      L   + SI R    + ++V  +DY   +      
Sbjct: 145 LAKILEEIAVDKEVIVALANANVKAMLEVQIASIKRVGITNYLVVALDDYIENFCKEKDV 204

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
             +     P  D  T  K G  G    +  +   L   L+LGY V+ +DVD+V+L++PF 
Sbjct: 205 AYYKR--DPDKDVDTVGKTG--GNHAVSGLKFRVLREFLQLGYGVLLSDVDIVFLQNPFS 260

Query: 202 YLQGDHDVYFTDD---MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAK 252
           +L  D DV    D          +   D P  G        R ++  S   YLRPT  + 
Sbjct: 261 HLYRDSDVESMSDGHNNHTAYGFNDVFDEPAMGWARYAHTMRIWVFNSGFFYLRPTIPSI 320

Query: 253 LVMKKWIEEL-QAEPWSKAKKANDQPAFNWAL------NKTAGQVDLYLLPQSAFPTGGL 305
            ++ +  + L +A+ W       DQ  FN  L      N TA      ++    F    +
Sbjct: 321 ELLDRVADRLSKAKVW-------DQAVFNEELFYPSHPNYTALHASKRVMDMYEFMNSKV 373

Query: 306 YFKNQTWVEETKGKH--VIIHNNY 327
            FK      E K K   VI+H NY
Sbjct: 374 LFKTVRKNHELKKKVKPVIVHVNY 397


>gi|242055405|ref|XP_002456848.1| hypothetical protein SORBIDRAFT_03g044020 [Sorghum bicolor]
 gi|241928823|gb|EES01968.1| hypothetical protein SORBIDRAFT_03g044020 [Sorghum bicolor]
          Length = 366

 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 9/100 (9%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHK--FGSQGFFNFTSRRPCHLLHI 179
           D +LV+A D   L       P H  L+P A     + +  F S+ + +    +      I
Sbjct: 137 DHLLVVALDGGALEHCRAVHP-HCYLLPAAAARNLSGEKVFMSKDYIDLVWSKVRLQQRI 195

Query: 180 LELGYNVMYNDVDMVWLKDPF------PYLQGDHDVYFTD 213
           LELGYN ++ DVD++W ++PF       ++    D YF D
Sbjct: 196 LELGYNFLFTDVDILWFRNPFERMSVAAHMVTSSDFYFGD 235


>gi|238478665|ref|NP_001154379.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
 gi|332192897|gb|AEE31018.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
          Length = 278

 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 38/183 (20%), Positives = 73/183 (39%), Gaps = 23/183 (12%)

Query: 89  VAKNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAEDYATLYKV 137
           +  N T+I+  V++ Y+            FL ++      +   D ++++A D  +  + 
Sbjct: 1   MGNNKTVIIAMVNKAYVEEVEGGRTMLDLFLESFWEGEGTRPLLDHLMLVAADQTSYDRC 60

Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
             R      +     D +    + S+ F     RR   LL +L  GYN+ + D D++WL+
Sbjct: 61  LFRRLHCYKMDTDGVDLEGEKVYMSKDFIEMMWRRTHLLLDVLSRGYNLTFTDTDVMWLR 120

Query: 198 DPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKK 257
            PFP L  +            + LD    +   G  G   I +   ++R  +    + +K
Sbjct: 121 SPFPRLSYN------------ESLDMQISVDSIGLVGGHLINTGFYHVRSNNKTISLFQK 168

Query: 258 WIE 260
           W +
Sbjct: 169 WYD 171


>gi|414585717|tpg|DAA36288.1| TPA: hypothetical protein ZEAMMB73_697462 [Zea mays]
          Length = 374

 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 29/118 (24%), Positives = 53/118 (44%), Gaps = 4/118 (3%)

Query: 123 QVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILEL 182
            ++++A D+    +     P    L     D     +F + G+     RR   L  +LE 
Sbjct: 141 HLVIVAFDWKAYEECVKIHPYCFALGTDGVDFSEEKRFLTSGYLEMMWRRLDFLRLVLEK 200

Query: 183 GYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           GYN +++D D++W ++PFP+   D D      +A    + ++ DL      G +Y+ S
Sbjct: 201 GYNFIFSDADIMWFRNPFPHFYPDVDF----QIACDHYVRNATDLRNIANGGFSYVKS 254


>gi|413951493|gb|AFW84142.1| hypothetical protein ZEAMMB73_205487 [Zea mays]
          Length = 383

 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 45/99 (45%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D VL++  D  +        P   +L   + D   A  FG+  +      +      +LE
Sbjct: 147 DHVLIVTVDAGSFSGCKAVHPHCYLLEVKSMDMNRAKTFGTPEYVEMIWLKLSIQQRVLE 206

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           LGYN ++ D D++WL++PF  +    D+  + D + + P
Sbjct: 207 LGYNFLFTDADILWLRNPFQRISVYADMSCSVDNSKMAP 245


>gi|293333989|ref|NP_001170602.1| uncharacterized protein LOC100384639 [Zea mays]
 gi|238006282|gb|ACR34176.1| unknown [Zea mays]
          Length = 303

 Score = 46.2 bits (108), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 14/143 (9%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP---FLNNWLISISRQKHQ--- 121
           P A+  LA      L +AA+    + T+++ A+++ +     FL+ +L S  + +     
Sbjct: 24  PPASQDLAD----LLRRAAT---ADKTVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGL 76

Query: 122 -DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
              +LV+A D   L + N   P   +      D     K+    +     RR      +L
Sbjct: 77  PRHLLVVAMDGRALERCNAVHPFCYLFRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVL 136

Query: 181 ELGYNVMYNDVDMVWLKDPFPYL 203
           ELGY+ ++ DVD++W + PFP L
Sbjct: 137 ELGYSFLFTDVDILWFRSPFPRL 159


>gi|226492551|ref|NP_001141233.1| uncharacterized protein LOC100273320 [Zea mays]
 gi|194703416|gb|ACF85792.1| unknown [Zea mays]
 gi|413933156|gb|AFW67707.1| regulatory protein [Zea mays]
          Length = 343

 Score = 45.8 bits (107), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 55/122 (45%), Gaps = 10/122 (8%)

Query: 92  NGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGH 144
           N T+I+  +++ Y         FL +       ++  D VL +A D     +        
Sbjct: 74  NRTLILTMLNKAYSEEGGLLDLFLESLREGDGTEQLIDHVLFVAMDQQAFRRCRSLGGLK 133

Query: 145 AVLVPP---APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
             L+ P   A D  +   + S GF     RR   L  +L+ GY+ ++ D+D++WL++PFP
Sbjct: 134 CYLLRPTDSADDLSSEQIYMSDGFIRMMWRRIRFLGDVLKHGYSFVFTDMDVMWLRNPFP 193

Query: 202 YL 203
            L
Sbjct: 194 KL 195


>gi|145352275|ref|XP_001420477.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580711|gb|ABO98770.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 634

 Score = 45.8 bits (107), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 49/223 (21%), Positives = 85/223 (38%), Gaps = 43/223 (19%)

Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL 221
           S+ F    + +   +  +LELG   + +D D+VW++DP          Y  D + +   +
Sbjct: 112 SKTFKKMGTVKTKFIQDLLELGIAPILSDADVVWMRDPRELFNNGTYAY-ADVLISSDCI 170

Query: 222 DHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN- 280
           D  +D             + ++++RPT+ AK  ++KW +++     S+     DQPA N 
Sbjct: 171 DTVNDRADNANCRNVNFNTGIVHIRPTEPAKAFVEKWKQKVAT---SEIAWMRDQPALNL 227

Query: 281 -----------------------------WALNKTAGQVDLYLLPQSAFPTGGLYF-KNQ 310
                                        +A N T   + + +LP + F  G  +F +  
Sbjct: 228 LVREGSPALAPAVAVPDDKRGLPGYRSIVFAANST---IRMGVLPIAQFSNGHTFFVQEH 284

Query: 311 TWVEETKGKHVIIHNNYITGFEK-----KIKRFRDFGLWLVDD 348
                  G+   +H  Y  G        K +R R  GLW  DD
Sbjct: 285 HLYHPEDGEPYAVHTTYQYGDSARYAYGKRQRLRQHGLWYADD 327


>gi|195640292|gb|ACG39614.1| regulatory protein [Zea mays]
          Length = 344

 Score = 45.8 bits (107), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 81/190 (42%), Gaps = 19/190 (10%)

Query: 32  RPMLLVLNRTTLLVLLSLLVVLGVIL---PWTGTPG----FMFPNATSSLAKWRDYTLSQ 84
           RP    L+R   L+ + +++V+   +   P  G  G    F   +  SS     D  ++ 
Sbjct: 8   RPEQCTLSRVVSLITVGVVIVVCTYVTSSPAGGDAGKEEFFSIWHRRSSDFVPDDLEVAL 67

Query: 85  AASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKV 137
             +  A N T+I+  +++ Y         FL +       ++  D VL +A D     + 
Sbjct: 68  RGAAYA-NRTLILTMLNKAYSEEGGLLDLFLESLREGDGTEQLIDHVLFVAMDQQAFRRC 126

Query: 138 NGRWPGHAVLVPPAPDS----QTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
                    L+ P  DS     +   + S GF     RR   L  +L+ GY+ ++ D+D+
Sbjct: 127 RSLGGLKCYLLRPPTDSADDLSSEQLYMSDGFIRMMWRRIRFLGDVLKHGYSFVFTDMDV 186

Query: 194 VWLKDPFPYL 203
           +WL++PFP L
Sbjct: 187 MWLRNPFPKL 196


>gi|242037075|ref|XP_002465932.1| hypothetical protein SORBIDRAFT_01g048460 [Sorghum bicolor]
 gi|241919786|gb|EER92930.1| hypothetical protein SORBIDRAFT_01g048460 [Sorghum bicolor]
          Length = 380

 Score = 45.8 bits (107), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 28/119 (23%), Positives = 51/119 (42%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++    S ++  D +++++ D     +          LV    D      F + G+ 
Sbjct: 131 FLESFRSGESTRELLDHLVIVSLDTTAHARCRQVHRHCFALVTDGVDFSGQKNFMTDGYL 190

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSH 225
               RR   L  +LE G++ ++ D D+VW ++P P+   D D+    D     P D S+
Sbjct: 191 KMMWRRIDFLRKVLEKGFSFVFTDTDIVWFRNPLPHFYPDGDLQIACDHFTGDPSDLSN 249


>gi|168052834|ref|XP_001778844.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162669713|gb|EDQ56294.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 85/212 (40%), Gaps = 22/212 (10%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           LS     VA NG +IV   +      +  W  SI R    + ++V  +D    +  +   
Sbjct: 17  LSALLKKVAVNGELIVGISNNNVRDMVQIWFESIKRVGVTNYLVVALDDEIASFCQDHDV 76

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P +        D+  +      G  +  S    HLL   L LGY+++ +DVD+V+L++PF
Sbjct: 77  PVYR------RDATISKSQAGTGANHAISGLKFHLLREFLVLGYSILLSDVDIVYLQNPF 130

Query: 201 PYLQGDHDV-YFTDDMAAVKPLDHSHDLPPPGKKGRTY--------ICSCMIYLRPTDGA 251
            +L  D DV   +D         +   +  P      Y          S + Y+RPT  +
Sbjct: 131 NHLHRDCDVESMSDGFDNTTAYGYDDVMTDPSMGWSRYAHTMRIWVFNSGLFYIRPTVPS 190

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
             ++ +   +L  E      KA DQ  FN  L
Sbjct: 191 IELLDRVTAKLTKE------KAWDQAVFNEEL 216


>gi|242076940|ref|XP_002448406.1| hypothetical protein SORBIDRAFT_06g026610 [Sorghum bicolor]
 gi|241939589|gb|EES12734.1| hypothetical protein SORBIDRAFT_06g026610 [Sorghum bicolor]
          Length = 373

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 4/88 (4%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D     +F + G+     RR   L  +LE GYN +++D D++W ++PFP+   D D    
Sbjct: 170 DFSEERRFLTSGYLEMMWRRLDFLRLVLEKGYNFIFSDADIMWFRNPFPHFYPDVDF--- 226

Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICS 240
             +A    + ++ DL      G +Y+ S
Sbjct: 227 -QIACDHYVRNATDLRNIANGGFSYVKS 253


>gi|226494464|ref|NP_001150863.1| regulatory protein [Zea mays]
 gi|195642450|gb|ACG40693.1| regulatory protein [Zea mays]
          Length = 412

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 14/143 (9%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLP---FLNNWLISISRQKHQ--- 121
           P A+  LA      L +AA+    + T+++ A+++ +     FL+ +L S  + +     
Sbjct: 133 PAASQDLADL----LRRAAT---ADKTVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGL 185

Query: 122 -DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
              +LV+A D   L + N   P   +      D     K+    +     RR      +L
Sbjct: 186 PRHLLVVAMDGRALERCNAVHPFCYLFRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVL 245

Query: 181 ELGYNVMYNDVDMVWLKDPFPYL 203
           ELGY+ ++ DVD++W + PFP L
Sbjct: 246 ELGYSFLFTDVDILWFRSPFPRL 268


>gi|452845748|gb|EME47681.1| hypothetical protein DOTSEDRAFT_77920 [Dothistroma septosporum
           NZE10]
          Length = 362

 Score = 45.8 bits (107), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 80/366 (21%), Positives = 143/366 (39%), Gaps = 65/366 (17%)

Query: 36  LVLNRTTLLVLLSLLVV--LGVILPWTGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNG 93
           ++  + T LVL+S L +  + V L +  T  F      S L    D    +  +  A NG
Sbjct: 6   ILRKKFTTLVLISALCIGLVYVSLDYAWTAPFFRVERQSGL----DEDDIRMFAKHAING 61

Query: 94  TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLV---IAEDYATLYKVNGRWPGHAVLVPP 150
           T+++   +   L ++ N L S+         ++   +  D  +    N R   H     P
Sbjct: 62  TLVIVPSNHGMLHWVENLLCSLEFTSFDTSKIIFWALDSDTKSYMTSNNRTAYHN----P 117

Query: 151 A--PDSQTAHKFGSQGFFN-FTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL---- 203
           A    S   ++ G+   +N     RP   L +L +G +++  D D+V+ + P   +    
Sbjct: 118 AFFATSNNENRKGNTAAYNRMMLERPKFYLDVLSVGLHILMLDADIVFWQSPLSMIPSTP 177

Query: 204 QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI---CSCMIYLRPTDGAKLVMKKWIE 260
           Q D D+ ++ D    +     H+     ++   YI   C+ + ++R +  AK +   W E
Sbjct: 178 QEDVDIIYSTD---AREFYQDHNAFDDERRRGPYIPPICNGIFWMRSS--AKTI-ALWSE 231

Query: 261 ELQA--EPWSKAKKA--NDQPAFNWALNKTAGQV-------------------------D 291
            L+   +PW    +   +DQ   +  LN   GQV                         +
Sbjct: 232 MLEIFHKPWLWRPRGFQDDQRGMDVLLNDGRGQVMPPYPDGIEEGIVPTSPDAKSELGVN 291

Query: 292 LYLLPQSAFPTGGL-------YFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLW 344
           + LL Q+    G L       Y KN   + E+    +  H N+ T    K +  +  GL+
Sbjct: 292 VQLLDQTEVVNGHLLMNRNVQYMKNLAKLRESGRDRIAAHFNWWTEELTKEEGAKRLGLY 351

Query: 345 LVDDHA 350
           L+DD+ 
Sbjct: 352 LLDDNG 357


>gi|218194109|gb|EEC76536.1| hypothetical protein OsI_14328 [Oryza sativa Indica Group]
          Length = 388

 Score = 45.8 bits (107), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 49/218 (22%), Positives = 79/218 (36%), Gaps = 21/218 (9%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D ++++  D A              L     D ++   F S+ +      R      ILE
Sbjct: 160 DHLVIVTMDPAAYEGCQAVHRHCYFLRTTGVDYRSEKTFMSKDYLEMMWGRNKFQQTILE 219

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYN ++ DVD++W +DPF ++    D+  + D+    P     + P  G          
Sbjct: 220 LGYNFLFTDVDVMWFRDPFRHISMGADIAISSDVFIGDPYSLG-NFPNGG---------- 268

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFP 301
            +++R  D      + W    Q   W    K ++Q  FN  L K   Q  L +  Q    
Sbjct: 269 FLFVRSNDKTLDFYRSW----QQGRWRFFGK-HEQDVFN--LIKHEQQAKLGIAIQFLDT 321

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
           T   Y      + +   K   +H N   G   K+   R
Sbjct: 322 T---YISGFCQLSKDLNKICTLHANCCVGLGAKMHDLR 356


>gi|255083186|ref|XP_002504579.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
 gi|226519847|gb|ACO65837.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
          Length = 779

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 61/143 (42%), Gaps = 24/143 (16%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW-PGHAVLV 148
           A   T+  C  +   L FL NWL   SR + ++ VLVIA D     K   RW   + V  
Sbjct: 165 AVGDTVSACFATIEMLDFLVNWLEHASRLEMRN-VLVIAMD-----KHTARWCDENGVAR 218

Query: 149 PPAPDSQTAHKFG-------------SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVW 195
             A D+    +               ++GF      +   +  +L++G +V  +DVD+VW
Sbjct: 219 MDASDAIDKSEMNDPRVEVADVGYRMTRGFNLLGEAKTASIAKLLDMGLDVFLSDVDVVW 278

Query: 196 LKDPFPYLQGDH----DVYFTDD 214
           L++P  Y +       DV  T D
Sbjct: 279 LRNPSDYFESGQLALADVAVTSD 301


>gi|356545279|ref|XP_003541071.1| PREDICTED: uncharacterized protein LOC100804189 [Glycine max]
          Length = 438

 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 69/268 (25%), Positives = 106/268 (39%), Gaps = 38/268 (14%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IVC  +      L  W  +I R    + ++   +D    +  + + 
Sbjct: 155 LAKILEKVAVKRELIVCLANTNVKEMLEVWFTNIKRVGITNYLVAALDDETAKFCESNQV 214

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P    +    PD       G  G  +  S     +L   L+LGY+V+ +DVD+V L++PF
Sbjct: 215 P----VYKRDPDD-GVDIIGRTGSNHAVSGLKFRILREFLQLGYSVLLSDVDIVHLQNPF 269

Query: 201 PYLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDG 250
            +L  D DV       D+M A    D   D P  G        R ++  S   Y+RPT  
Sbjct: 270 DHLYRDSDVESMSDGHDNMTAYGYND-VFDEPTMGWARYAHTMRIWVYNSGFFYIRPTIP 328

Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFNW-----------ALNKTAGQVDLYLLPQSA 299
           +  ++ +    L  E      KA DQ  FN             L+     +D+YL   S 
Sbjct: 329 SIELLDRVATRLSKE------KAWDQAVFNEELFYPSFPGYDGLHAARRTMDMYLFMNSK 382

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                L+   +     +K K VIIH NY
Sbjct: 383 V----LFKTVRNDANLSKLKPVIIHVNY 406


>gi|307109515|gb|EFN57753.1| hypothetical protein CHLNCDRAFT_143033 [Chlorella variabilis]
          Length = 701

 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 75/196 (38%), Gaps = 37/196 (18%)

Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYF---TDDMAAVKPLD--HSHDLPP----PGKKGRT 236
           V+ +D D VWL+ P+ Y Q      F   +D ++    ++    H  P     PG     
Sbjct: 183 VVVSDTDAVWLRPPWTYFQQRPAAEFFSSSDCLSHQVEVEWRRDHGQPRCGHVPGNGDGH 242

Query: 237 YICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA-----KKANDQPAFNWALNKTAGQVD 291
            + + +  +R T+GAK  M+ W   L      +A     ++ +DQ A N       G V 
Sbjct: 243 ALNTGLFAVRNTEGAKAAMRAWAGMLVDPAQERADDPMHREMDDQLALNTMFEGQGGSVS 302

Query: 292 -----------------LYLLPQSAFPTGGLYFKNQT-WVEETK--GKHVIIHNNYITGF 331
                            L +LP   F  G + F  +T W         H      +I G 
Sbjct: 303 LSAEDPRTVLVWDNKLRLQVLPVLLFSNGHVAFVQRTPWKMGVAPIAIHTTFQRYHIPG- 361

Query: 332 EKKIKRFRDFGLWLVD 347
             K  R R+FGLWL+D
Sbjct: 362 --KRSRLREFGLWLMD 375


>gi|168040047|ref|XP_001772507.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676197|gb|EDQ62683.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 552

 Score = 45.4 bits (106), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 49/202 (24%), Positives = 82/202 (40%), Gaps = 38/202 (18%)

Query: 168 FTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDL 227
            T R+   +L IL+ GY+V+++DVD++W ++P P L       F      V   + + +L
Sbjct: 338 LTQRKLQVVLQILQKGYHVVWSDVDVIWFQNPLPRLTA-----FPTGTFLVVSDEPNMNL 392

Query: 228 PPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQP---AFNWALN 284
           P     GR  + S   Y +         K  I        S A +A +QP    F   L 
Sbjct: 393 PA---NGRGRVDSGFFYAQSEKATIKAFKDLI--------SYASEAPEQPEQLVFGHVLC 441

Query: 285 KTAGQ--------------VDLYLLPQSAFPTGGL----YFKNQTWVEETKGKHVIIHNN 326
               Q              +    L +  F  G +    + +N T    + G +V +HNN
Sbjct: 442 GKGAQHRVGVTECKAASSGLRTSFLNRRTFANGIVNDYWWHENVTEAALSSGVYV-LHNN 500

Query: 327 YITGFEKKIKRFRDFGLWLVDD 348
           +I G  +K++R +   +W  +D
Sbjct: 501 WIHGAAEKVRRQKAKHVWFYND 522


>gi|223947351|gb|ACN27759.1| unknown [Zea mays]
          Length = 407

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 7/117 (5%)

Query: 94  TIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
           T+++ A+++ +     FL+ +L S  + +        +LV+A D   L + N   P   +
Sbjct: 147 TVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGLPRHLLVVAMDGRALERCNAVHPFCYL 206

Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
                 D     K+    +     RR      +LELGY+ ++ DVD++W + PFP L
Sbjct: 207 FRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVLELGYSFLFTDVDILWFRSPFPRL 263


>gi|357126546|ref|XP_003564948.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 392

 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 41/207 (19%), Positives = 85/207 (41%), Gaps = 33/207 (15%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++GT+I+ +V++ +         FL ++ +  +     + ++V+A D   L + +     
Sbjct: 114 EDGTVIMTSVNEAWAAPGSLLDTFLESFRVGENISHFVEHIIVVAMDEGALLRCHALKHP 173

Query: 144 HAVLVPP----------APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
           H  L+ P          +PD   A  + ++ + +    +      +L+LG+N+++ DVD+
Sbjct: 174 HCHLLLPDPENNSPNGTSPDLSGAKSYMTKDYLSLVWSKLRLQQRVLDLGHNLLFTDVDV 233

Query: 194 VWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKL 253
            W +DP  ++    D+  + D     P D  +           Y  +  IY + T     
Sbjct: 234 AWFRDPRVHITLAADITTSSDFYFGDPDDLGN-----------YPNTGFIYFKATARNGR 282

Query: 254 VMKKWIEELQAEPWSKAKKANDQPAFN 280
            M  W         S+  + +DQ  FN
Sbjct: 283 AMAHW-----HAARSRFPREHDQFVFN 304


>gi|159486479|ref|XP_001701267.1| hypothetical protein CHLREDRAFT_179139 [Chlamydomonas reinhardtii]
 gi|158271849|gb|EDO97660.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 695

 Score = 45.4 bits (106), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 55/124 (44%), Gaps = 5/124 (4%)

Query: 139 GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
           G+ P        +  S   +K+G   +   T R+   +  I+  G+NV+++DVD+VW +D
Sbjct: 232 GKHPCFKFYEDGSHSSTDQYKYGDSHYNAATWRKVVVVSRIVHWGFNVIHSDVDVVWFRD 291

Query: 199 PFPYLQG----DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLV 254
           P  Y  G    + D+  + D+ +         L   G    T   + + ++R T G K +
Sbjct: 292 PLAYFLGPAVNNVDMALSSDLISTGNPKGDEGL-EVGMHQHTNFNTGVYFVRATPGGKSL 350

Query: 255 MKKW 258
           M  W
Sbjct: 351 MAGW 354


>gi|28269397|gb|AAO37940.1| putative regulatory protein [Oryza sativa Japonica Group]
          Length = 379

 Score = 45.4 bits (106), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 49/218 (22%), Positives = 79/218 (36%), Gaps = 21/218 (9%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D ++++  D A              L     D ++   F S+ +      R      ILE
Sbjct: 151 DHLVIVTMDPAAYEGCQAVHRHCYFLRTTGVDYRSEKMFMSKDYLEMMWGRNKFQQTILE 210

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYN ++ DVD++W +DPF ++    D+  + D+    P     + P  G          
Sbjct: 211 LGYNFLFTDVDVMWFRDPFRHISMGADIAISSDVFIGDPYSLG-NFPNGG---------- 259

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFP 301
            +++R  D      + W    Q   W    K ++Q  FN  L K   Q  L +  Q    
Sbjct: 260 FLFVRSNDKTLDFYRSW----QQGRWRFFGK-HEQDVFN--LIKHEQQAKLGIAIQFLDT 312

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
           T   Y      + +   K   +H N   G   K+   R
Sbjct: 313 T---YISGFCQLSKDLNKICTLHANCCVGLGAKMHDLR 347


>gi|356529763|ref|XP_003533457.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 349

 Score = 45.4 bits (106), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 58/260 (22%), Positives = 94/260 (36%), Gaps = 35/260 (13%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           K+ T+I+  ++  +         FL ++ +    +K  + ++VI  D     K + R   
Sbjct: 85  KDKTVIITTLNDAWAEPGSIFDLFLESFRLGNQTKKFLNHLVVITWDQ----KAHARCLA 140

Query: 144 ---HAVLVPPAPDSQTAHKF-GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
              H   V    D+ T   F  +  + +   RR   L  +L++GYN ++ D D++WL+DP
Sbjct: 141 LHKHCYQVETKGDNFTGEAFFMTADYLHMMWRRIEFLGTVLDMGYNFVFTDTDIMWLRDP 200

Query: 200 FPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWI 259
           F     D D     D        +S+DL      G  Y+       +      L  K W 
Sbjct: 201 FKQFYKDTDFQIACDFFN----GNSYDLNNHPNGGFNYV-------KSNKRTILFYKFWF 249

Query: 260 EELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGK 319
               A P     K +DQ      LNK      +  +          YF       E   K
Sbjct: 250 NSRNAYP-----KLHDQD----VLNKIKKDSFVSNMKLKVRFLSTSYFGGFCQHAEDFNK 300

Query: 320 HVIIHNNYITGFEKKIKRFR 339
              +H N   G E K+   +
Sbjct: 301 VSTMHANCCVGLENKVNDLK 320


>gi|326522376|dbj|BAK07650.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 390

 Score = 45.4 bits (106), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 31/134 (23%), Positives = 58/134 (43%), Gaps = 4/134 (2%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           F++++   +        +++IA D+    +          LV    D     +F + G+ 
Sbjct: 141 FIDSFRHGVRTSSLLKHLVIIAFDWKAYKRCTEVHTYCYALVTDNVDFSQEKRFLTAGYL 200

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
               +R   L  +LE GY+ +++D D++W ++PFPY   D D      +A    + +S D
Sbjct: 201 EMMWKRLDFLRLVLEKGYSFIFSDADIMWFRNPFPYFYPDGDF----QVACDHYVGNSTD 256

Query: 227 LPPPGKKGRTYICS 240
           L      G  Y+ S
Sbjct: 257 LRNIANGGFNYVKS 270


>gi|108712116|gb|ABF99911.1| regulatory protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222626172|gb|EEE60304.1| hypothetical protein OsJ_13372 [Oryza sativa Japonica Group]
          Length = 391

 Score = 45.4 bits (106), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 49/218 (22%), Positives = 79/218 (36%), Gaps = 21/218 (9%)

Query: 122 DQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILE 181
           D ++++  D A              L     D ++   F S+ +      R      ILE
Sbjct: 163 DHLVIVTMDPAAYEGCQAVHRHCYFLRTTGVDYRSEKMFMSKDYLEMMWGRNKFQQTILE 222

Query: 182 LGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 241
           LGYN ++ DVD++W +DPF ++    D+  + D+    P     + P  G          
Sbjct: 223 LGYNFLFTDVDVMWFRDPFRHISMGADIAISSDVFIGDPYSLG-NFPNGG---------- 271

Query: 242 MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFP 301
            +++R  D      + W    Q   W    K ++Q  FN  L K   Q  L +  Q    
Sbjct: 272 FLFVRSNDKTLDFYRSW----QQGRWRFFGK-HEQDVFN--LIKHEQQAKLGIAIQFLDT 324

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
           T   Y      + +   K   +H N   G   K+   R
Sbjct: 325 T---YISGFCQLSKDLNKICTLHANCCVGLGAKMHDLR 359


>gi|307103589|gb|EFN51848.1| hypothetical protein CHLNCDRAFT_37217 [Chlorella variabilis]
          Length = 631

 Score = 45.4 bits (106), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 40/164 (24%), Positives = 73/164 (44%), Gaps = 10/164 (6%)

Query: 56  ILPWTGTPGFMFPNATSSLAKWRDYTLSQA-ASFVAKNGTIIVCAVSQPYLPFLNNWLIS 114
           + P +  P      A +  A  R + L++     VAK+ TI+V   +  Y  F+ NW+  
Sbjct: 75  VQPSSDKPAEQQGTADAVAADSRPHALTRDLVGKVAKDNTIMVTWANWHYHDFVMNWVEH 134

Query: 115 ISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFG-SQGFFNFTSRRP 173
           +        ++   +D    + V+   P  ++         T   FG     F+   R  
Sbjct: 135 LQAAGCDAFIVGAMDDKLLEFLVSKNIPAFSM-----SSGLTLGDFGWGTPTFHKMGREK 189

Query: 174 CHLLH-ILELGYNVMYNDVDMVWLKDPFPYLQG--DHDVYFTDD 214
            +L++   ++GY+V+ +DVD VWL++P PY+    D D+  + D
Sbjct: 190 INLIYSFTKMGYDVLISDVDTVWLRNPLPYINAYRDADILTSSD 233


>gi|223975905|gb|ACN32140.1| unknown [Zea mays]
 gi|414884273|tpg|DAA60287.1| TPA: regulatory protein [Zea mays]
          Length = 443

 Score = 45.4 bits (106), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 7/117 (5%)

Query: 94  TIIVCAVSQPYLP---FLNNWLISISRQKHQ----DQVLVIAEDYATLYKVNGRWPGHAV 146
           T+++ A+++ +     FL+ +L S  + +        +LV+A D   L + N   P   +
Sbjct: 183 TVLMTAINEAWAAPGSFLDLFLESFRQGEDTAGLPRHLLVVAMDGRALERCNAVHPFCYL 242

Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
                 D     K+    +     RR      +LELGY+ ++ DVD++W + PFP L
Sbjct: 243 FRVDGMDFAAEQKYMEGDYLEMMWRRNRFQQSVLELGYSFLFTDVDILWFRSPFPRL 299


>gi|428179625|gb|EKX48495.1| hypothetical protein GUITHDRAFT_105641 [Guillardia theta CCMP2712]
          Length = 432

 Score = 45.1 bits (105), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 56/227 (24%), Positives = 90/227 (39%), Gaps = 48/227 (21%)

Query: 143 GHAVLVPPAPDSQ----------TAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
           G  VL P  P  Q           A  +G+  F      +   LL I++ G+  +  D D
Sbjct: 110 GMRVLNPAVPCVQYPRHYGTSKGAAISWGTARFAQVVQVKTDVLLGIVQEGFTGLLMDAD 169

Query: 193 MVWLKDPFPYL-----QGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRP 247
           +   ++P P L     +   D+   DDM                + GR    S  +YLRP
Sbjct: 170 IHLFRNPMPELAQLAEEERADMVIQDDM----------------EGGRN---SGFMYLRP 210

Query: 248 TD-GAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNK-TAGQVDLYLLPQSAFPTGGL 305
           ++ GA   +   +  +Q     +++    Q A N AL         ++ LPQS++P G  
Sbjct: 211 SEEGAAFAVH--VVGMQ----RRSRTLRQQEAVNRALRSFHMRHFRVFPLPQSSWPCGEA 264

Query: 306 YFKNQT-----WVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
           +F         W        +++HNN+I G + K  R ++F  W VD
Sbjct: 265 FFNKFARRIFAWSSACS-DCIMVHNNWILGDDAKEYRAKEFLQWTVD 310


>gi|449460119|ref|XP_004147793.1| PREDICTED: uncharacterized protein LOC101205922 [Cucumis sativus]
 gi|449476774|ref|XP_004154830.1| PREDICTED: uncharacterized LOC101205922 [Cucumis sativus]
          Length = 427

 Score = 45.1 bits (105), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 61/134 (45%), Gaps = 6/134 (4%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IV   +      L  W  +I +   Q+ ++V  ++    + ++   
Sbjct: 145 LAKLLEKVAIQKELIVTLANSNVKSMLEVWFTTIQKVGIQNYLVVALDNQTEEFCIS--- 201

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
             H V V          K G +G  +  S     +L   L+LGY+V+ +DVD+V+L++PF
Sbjct: 202 --HEVPVYKRDPDNNIDKVGKEGGNHQVSALKFRILREFLQLGYSVLLSDVDIVYLQNPF 259

Query: 201 PYLQGDHDVYFTDD 214
            +L  D DV    D
Sbjct: 260 DHLYRDSDVESMSD 273


>gi|159464751|ref|XP_001690605.1| hypothetical protein CHLREDRAFT_114491 [Chlamydomonas reinhardtii]
 gi|158280105|gb|EDP05864.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 501

 Score = 45.1 bits (105), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 58/274 (21%), Positives = 110/274 (40%), Gaps = 34/274 (12%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPP 150
           ++G I++   +  YL F   W+ ++ +   +  ++   +D      V+   P   +    
Sbjct: 15  QDGLIMITWANHHYLDFAKTWVHNLQKSGVKGYMVGAMDDDMLKELVDLDIPAWRM---N 71

Query: 151 APDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG--DHD 208
              ++    +GS  F      +   +  +L L  +V+ +D+D  W+K+P PY     + D
Sbjct: 72  TGITKKDLGWGSTNFHLMGRFKIKLIRDVLALEVSVVVSDIDTAWVKNPIPYFHRYPEAD 131

Query: 209 VYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWS 268
           +  + D       D S +  P    G  +    M++ RPT  +K  +  W++ L  +P  
Sbjct: 132 ILTSSDQLGPTVNDDSLENFPAA--GSAFNIGIMLF-RPT--SKAFLDDWVKSLD-DP-- 183

Query: 269 KAKKANDQPAFNWALNKTA------------------GQVDLYLLPQSAFPTGGLYFKNQ 310
              K  DQ AFN    K                    G++ + +LP + F +G  +F  +
Sbjct: 184 ---KMWDQTAFNDLARKVGAATAASSEPPKNLWKGYDGKLTVGVLPCALFASGHTFFVQR 240

Query: 311 TWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLW 344
            + E     +V       +G   K  RFR+F L+
Sbjct: 241 KYAELGLEPYVAHATFQYSGTPGKRHRFREFMLF 274


>gi|242044184|ref|XP_002459963.1| hypothetical protein SORBIDRAFT_02g019390 [Sorghum bicolor]
 gi|241923340|gb|EER96484.1| hypothetical protein SORBIDRAFT_02g019390 [Sorghum bicolor]
          Length = 256

 Score = 45.1 bits (105), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 47/235 (20%), Positives = 88/235 (37%), Gaps = 21/235 (8%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++   +  +     +L++  D     +     P    L     D  +  ++ ++ + 
Sbjct: 18  FLESFRTGVKTEALLKHLLIVTVDAKARERCARAHPLCYALDVGGADFTSEQRYMAEDYL 77

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
           +   RR      +LELGY+ +  DVD+VW ++P   +    D+    D       D+ +D
Sbjct: 78  DMMWRRVRFQGRVLELGYSFLLTDVDIVWFRNPLLRVPVAADMAMACDRF---RGDNPYD 134

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKT 286
           L      G       ++Y R +       + W E     P +K      Q  F    ++ 
Sbjct: 135 LDKGANAG-------LVYARASARTAEFYRVWYEARTLFPGNKT-----QDVFEKVKHQL 182

Query: 287 AGQ--VDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
             +  V L  +  + F T    F ++   E+   K    H N I G + K+ + R
Sbjct: 183 TARLGVTLQFVDTAYFGT----FCDRQRKEKDFNKLCTFHANCILGLKNKVHKLR 233


>gi|242035367|ref|XP_002465078.1| hypothetical protein SORBIDRAFT_01g031700 [Sorghum bicolor]
 gi|241918932|gb|EER92076.1| hypothetical protein SORBIDRAFT_01g031700 [Sorghum bicolor]
          Length = 425

 Score = 45.1 bits (105), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 64/268 (23%), Positives = 108/268 (40%), Gaps = 37/268 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED-YATLYKVNGR 140
           L++    VA    +IV   +      L  W  +I R    + ++V  +D   +L K  G 
Sbjct: 142 LAKILEEVAVKKELIVALANTNVREMLEVWFTNIKRVGIPNYLVVALDDNIESLCKSKG- 200

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                 +    PD    +   + G    +  +   L   L+LGY+++ +D+D+++L++PF
Sbjct: 201 ----VPVYRRDPDEGIDNIAKTGGNHAVSGLKFRVLREFLQLGYSILLSDIDIIFLRNPF 256

Query: 201 PYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDG 250
            +L  D DV    D    M A    +   D P  G        R ++  S   Y+RPT  
Sbjct: 257 DHLYRDSDVESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVHNSGFFYIRPTIP 315

Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSA 299
           +  ++ +    L  EP     K+ DQ  FN             L+ +   +D+YL   S 
Sbjct: 316 SIELLDRVAGHLSREP-----KSWDQAVFNEELFFPSHPGYEGLHASRRTMDIYLFMNSK 370

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                L+   +   +  K K VI+H NY
Sbjct: 371 V----LFKTVRKDAQLKKLKPVIVHLNY 394


>gi|452824018|gb|EME31024.1| hypothetical protein Gasu_17840 [Galdieria sulphuraria]
          Length = 376

 Score = 45.1 bits (105), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 46/235 (19%), Positives = 106/235 (45%), Gaps = 33/235 (14%)

Query: 72  SSLAKWRDYTLSQAASFVAKNGT---IIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIA 128
           SS  K++++   Q  SF   NG+   ++    ++ Y+P +N ++  +S  + ++ ++   
Sbjct: 31  SSTDKFQEFL--QRHSFATLNGSRKWVVATLANEAYIPLVNLFVGRLSTLQLENLIVFCI 88

Query: 129 EDYATLYKVNGRWPGHAV--LVPPA--PDSQTAHKF------------GSQGFFNFTSRR 172
           + Y   Y V    P   V  L+PP+  P S   + F            G+  F + T  +
Sbjct: 89  DPYIYEYCVFHHIPAWKVTDLIPPSCLPFSFWQNIFQNINHRRAYPAGGNIEFISLTQLK 148

Query: 173 PCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGK 232
                 ++    +++++D D+VW+++P PYLQ    ++   D+       +SH      +
Sbjct: 149 YLVFYSVISYNVDILFSDPDVVWIQNPIPYLQQKRSLHV--DIFIQTDRKYSH------Q 200

Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTA 287
              +Y+ +  +Y+      +L+++  ++++    + +    + Q +FN  L +T 
Sbjct: 201 SLFSYMNTGFVYIHSHCATQLLLRIMMQQM----YQQYPIISQQRSFNRVLCRTG 251


>gi|356552128|ref|XP_003544422.1| PREDICTED: uncharacterized protein LOC100804935 [Glycine max]
          Length = 425

 Score = 45.1 bits (105), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 74/172 (43%), Gaps = 29/172 (16%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG- 234
           L   L+LGY+V+ +DVD+V+L++PF YL  D DV    D    K     +D+      G 
Sbjct: 232 LREFLQLGYSVLLSDVDIVYLQNPFDYLYRDSDVESMSDGHNNKTAYGYNDVFDEPSMGW 291

Query: 235 -------RTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNW----- 281
                  R ++  S   Y+RPT  +  ++ +    L  +P     K+ DQ  FN      
Sbjct: 292 ARYAHTMRIWVYNSGFFYIRPTLPSIELLDRVATRLSNDP-----KSWDQAVFNEELFFP 346

Query: 282 ------ALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                  L+     +D+YL   S      L+   +   +  K K VI+H NY
Sbjct: 347 SHPGYDGLHAAKRTMDMYLFMNSKV----LFKTVRKDAKLKKLKPVIVHVNY 394


>gi|255081588|ref|XP_002508016.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
 gi|226523292|gb|ACO69274.1| glycosyltransferase family 77 protein [Micromonas sp. RCC299]
          Length = 843

 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
           D  +AH   S+ +  F   R   +  +L +G++V+ +DVD+VW KDP P+LQ  +D
Sbjct: 252 DGSSAHA--SKSWKAFARLRISQVSALLRMGFDVLMSDVDVVWTKDPRPFLQCGYD 305


>gi|255638110|gb|ACU19369.1| unknown [Glycine max]
          Length = 425

 Score = 45.1 bits (105), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 74/172 (43%), Gaps = 29/172 (16%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKG- 234
           L   L+LGY+V+ +DVD+V+L++PF YL  D DV    D    K     +D+      G 
Sbjct: 232 LREFLQLGYSVLLSDVDIVYLQNPFDYLYRDSDVESMSDGHNNKTAYGYNDVFDEPSMGW 291

Query: 235 -------RTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNW----- 281
                  R ++  S   Y+RPT  +  ++ +    L  +P     K+ DQ  FN      
Sbjct: 292 ARYAHTMRIWVYNSGFFYIRPTLPSIELLDRVATRLSNDP-----KSWDQAVFNEELFFP 346

Query: 282 ------ALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                  L+     +D+YL   S      L+   +   +  K K VI+H NY
Sbjct: 347 SHPGYDGLHAAKRTMDMYLFMNSKV----LFKTVRKDAKLKKLKPVIVHVNY 394


>gi|428168198|gb|EKX37146.1| hypothetical protein GUITHDRAFT_116723 [Guillardia theta CCMP2712]
          Length = 1532

 Score = 45.1 bits (105), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 59/287 (20%), Positives = 116/287 (40%), Gaps = 45/287 (15%)

Query: 87   SFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKH-QDQVLVIAEDYATLYKVNGRWPGH- 144
            +F A+   ++V   ++ Y+PF  +WL + ++      + L+IA D     ++  R   H 
Sbjct: 1219 NFPAQEQFLLVAFGNREYVPFFQSWLCNTAKMDGVHSRTLIIASDDDAYQQL--RRSSHN 1276

Query: 145  -----AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
                 +VL     +  +A +F S GFF  T  R   +  ++  G +++  + D +WL++P
Sbjct: 1277 ATIVRSVLKVDGEELPSAFEFQSLGFFLLTQHRNRLMERLIVAGIDMLLFEADAIWLRNP 1336

Query: 200  FPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGA----KLVM 255
                   +DV      A V    ++      GK G  +       LR T       K + 
Sbjct: 1337 L------YDVNLMYSDADVIGFWNT------GKIGFGW-----TLLRATRATVKLWKEMD 1379

Query: 256  KKWIEELQAEPWSKAKK-----ANDQPAFNWALN--KTAGQVDLYLLPQSAFPTGGLYFK 308
            +++ E +Q +   +  K     AN+    N  L   +   ++ L+ L +  + +G  Y  
Sbjct: 1380 RRYDEIIQTQQDGRLDKYLGPVANEMDLLNHVLGEMRATDEIRLFTLEECDYVSGIWYDG 1439

Query: 309  N--------QTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
                     +    E      +++NN+I G   K+ R + +  W  +
Sbjct: 1440 GIGGDGAELREKCHECGRVPYVVNNNFIVGIRGKMNRAKRWRHWFAE 1486


>gi|357142041|ref|XP_003572439.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 364

 Score = 45.1 bits (105), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 30/134 (22%), Positives = 58/134 (43%), Gaps = 11/134 (8%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+IV ++++ Y         FL ++ +        D VL++A D     +       
Sbjct: 95  EDKTVIVTSINRAYAAPGSLLDLFLESFRLGEGTAALLDHVLIVAVDPGAFRRCRAVHR- 153

Query: 144 HAVLV---PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
           H  L+   P A D      F ++ + +    R      ILELG+N ++ D+D++W ++P 
Sbjct: 154 HCYLLRQSPSAVDYSGEKHFMTKDYLDMMWSRNRFQQTILELGFNFLFTDIDIMWFRNPL 213

Query: 201 PYLQGDHDVYFTDD 214
             +    D+    D
Sbjct: 214 RRIAITSDIAIASD 227


>gi|326527715|dbj|BAK08132.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 424

 Score = 45.1 bits (105), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 64/281 (22%), Positives = 113/281 (40%), Gaps = 35/281 (12%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    IIV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 141 LAKILQHVAVKKEIIVAVANSNVKETLGMWFTNIKRVGITNYLVVALDDSIENFCKSKDV 200

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PD        + G    ++ +   L   L+LGY+V+ +D+D+++ ++PF 
Sbjct: 201 P----VYRRDPDEGIDSIGKTGGNHAVSALKFRILREFLQLGYSVLLSDIDIMFFQNPFD 256

Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
           +L  D D+       D+M A    +   D P  G        R ++  S   ++RPT  +
Sbjct: 257 HLYRDSDIESMSDGHDNMTAYG-FNDVFDEPSMGWARYAHTMRIWVFNSGFFFIRPTIPS 315

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAF 300
             ++ +    L  EP     K+ DQ  FN             L+ +   +D+YL   S  
Sbjct: 316 IELLDRVAGRLSREP-----KSWDQAVFNEELFFPSHPGYEGLHASKRTMDIYLFMNSKV 370

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDF 341
               L+   +   +  K K VI+H NY    E ++K   +F
Sbjct: 371 ----LFKTVRKSAQLRKLKPVIVHLNYHPDKEARMKAVIEF 407


>gi|326497825|dbj|BAJ94775.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499500|dbj|BAJ86061.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326512430|dbj|BAJ99570.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 424

 Score = 44.7 bits (104), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 64/281 (22%), Positives = 113/281 (40%), Gaps = 35/281 (12%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    IIV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 141 LAKILQHVAVKKEIIVAVANSNVKETLGMWFTNIKRVGITNYLVVALDDSIENFCKSKDV 200

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PD        + G    ++ +   L   L+LGY+V+ +D+D+++ ++PF 
Sbjct: 201 P----VYRRDPDEGIDSIGKTGGNHAVSALKFRILREFLQLGYSVLLSDIDIMFFQNPFD 256

Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
           +L  D D+       D+M A    +   D P  G        R ++  S   ++RPT  +
Sbjct: 257 HLYRDSDIESMSDGHDNMTAYG-FNDVFDEPSMGWARYAHTMRIWVFNSGFFFIRPTIPS 315

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAF 300
             ++ +    L  EP     K+ DQ  FN             L+ +   +D+YL   S  
Sbjct: 316 IELLDRVAGRLSREP-----KSWDQAVFNEELFFPSHPGYEGLHASKRTMDIYLFMNSKV 370

Query: 301 PTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDF 341
               L+   +   +  K K VI+H NY    E ++K   +F
Sbjct: 371 ----LFKTVRKSAQLRKLKPVIVHLNYHPDKEARMKAVIEF 407


>gi|18394730|ref|NP_564082.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
 gi|8778445|gb|AAF79453.1|AC025808_35 F18O14.8 [Arabidopsis thaliana]
 gi|21593695|gb|AAM65662.1| unknown [Arabidopsis thaliana]
 gi|109134163|gb|ABG25079.1| At1g19360 [Arabidopsis thaliana]
 gi|110737961|dbj|BAF00917.1| hypothetical protein [Arabidopsis thaliana]
 gi|332191718|gb|AEE29839.1| Nucleotide-diphospho-sugar transferase family protein [Arabidopsis
           thaliana]
          Length = 428

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 70/265 (26%), Positives = 106/265 (40%), Gaps = 31/265 (11%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDY-ATLYKVNGR 140
           L++    +A +  +IV   +      L   + SI R    + ++V  +DY   L K N  
Sbjct: 145 LAKILEEIAVDKEVIVALANANVKAMLEVQIASIKRVGITNYLVVALDDYIENLCKENDV 204

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                    P  D  T  K G  G    +  +   L   L+LGY V+ +DVD+V+L++PF
Sbjct: 205 AYYKR---DPDKDVDTVGKTG--GNHAVSGLKFRVLREFLQLGYGVLLSDVDIVFLQNPF 259

Query: 201 PYLQGDHDV-YFTD--DMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
            +L  D DV   +D  D       +   D P  G        R ++  S   YLRPT  +
Sbjct: 260 SHLYRDSDVESMSDGHDNHTAYGFNDVFDEPAMGWARYAHTMRIWVFNSGFFYLRPTIPS 319

Query: 252 KLVMKKWIEEL-QAEPWSKAKKANDQPAFNWAL------NKTAGQVDLYLLPQSAFPTGG 304
             ++ +  + L +A+ W       DQ  FN  L        TA      ++    F    
Sbjct: 320 IELLDRVADRLSKAKVW-------DQAVFNEELFYPSHPEYTALHASKRVMDMYEFMNSK 372

Query: 305 LYFKNQTWVEETKGKH--VIIHNNY 327
           + FK      E K K   VI+H NY
Sbjct: 373 VLFKTVRKNHELKKKVKPVIVHVNY 397


>gi|255073125|ref|XP_002500237.1| hypothetical protein MICPUN_107680 [Micromonas sp. RCC299]
 gi|226515499|gb|ACO61495.1| hypothetical protein MICPUN_107680 [Micromonas sp. RCC299]
          Length = 439

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 92/222 (41%), Gaps = 31/222 (13%)

Query: 68  PNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYL-------PFLNNWLISISRQKH 120
           P AT   A+  D TL      +A +G ++  AVS   L         L  WL  + R K 
Sbjct: 134 PAATG--AEKGDETLKAILEEIAPDGEVLA-AVSNKALINEKGDYGMLRTWLDGVQRSKV 190

Query: 121 QDQ-VLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-H 178
           ++  V+ + E  A   K  G    H      A   +T H   +Q F         H+L  
Sbjct: 191 KNYLVICLDETVAGTMKKLGVPYWHRERKALADGDETNHGISAQKF---------HILRE 241

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL---DHSHDLPPPG---- 231
            L LGY+V+ +DVD+V L +PF +L  D DV    D    +     +   D P  G    
Sbjct: 242 FLVLGYSVLLSDVDIVTLDNPFDHLYRDSDVEGLSDGYDERTAYGWNDGIDDPKMGWARY 301

Query: 232 -KKGRTY-ICSCMIYLRPTDGAKLVMKKWIEELQ-AEPWSKA 270
            +  R + + S + YL+P+D     M      L+ A+ W +A
Sbjct: 302 AQTMRVFAMNSGLFYLKPSDRTVQFMDGITARLERAKEWDQA 343


>gi|323449948|gb|EGB05832.1| hypothetical protein AURANDRAFT_66047 [Aureococcus anophagefferens]
          Length = 584

 Score = 44.7 bits (104), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 83/205 (40%), Gaps = 30/205 (14%)

Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-DHDVYFT 212
            + AHK+G   F +    +   +  +   G +V++ D D+VW  DP P L     D  + 
Sbjct: 360 ERPAHKYGDSVFVSMMWLKVVAVYLVNRCGRDVLFMDADVVWQADPLPLLAAYGEDTLWM 419

Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYL----RPTDGAKLVMKKWIEELQAEPWS 268
           DD+A       S+   P       Y  +   YL    R T      M    + +    W 
Sbjct: 420 DDVA------RSNRYVP-------YSANTGFYLVRANRRTAHFLFAMLTHFDLIIV--WR 464

Query: 269 KAKKANDQPAFNWALNKTAGQVDL--YLLPQSAFPTGGLYFKNQTWVEETK---GKHVII 323
                + Q   N  L + A +++L   +LP+  F  G ++ +N+T+ +E      +  + 
Sbjct: 465 -----SHQHVLNALLQEHASRMNLGVRVLPRDDFQIGAIFHRNKTYADELARGVARPAVF 519

Query: 324 HNNYITGFEKKIKRFRDFGLWLVDD 348
           H  +  G + K++  R+   W V D
Sbjct: 520 HWAWTEGRDDKLRFARETNTWHVRD 544


>gi|297842255|ref|XP_002889009.1| hypothetical protein ARALYDRAFT_476651 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334850|gb|EFH65268.1| hypothetical protein ARALYDRAFT_476651 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 399

 Score = 44.7 bits (104), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 105/267 (39%), Gaps = 50/267 (18%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED-YATLYKVN-----GRWP 142
           VA N  IIV   +    P L     S+ R   Q+ +++  +D   +  + N      R P
Sbjct: 125 VAVNKEIIVVLANNNVKPMLEVQTASVKRVGIQNYLVIPLDDSIESFCRSNEVAYFKRDP 184

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
            +A+ V     S +A           +  +   L   L+LGY V+  DVD+V+L++PF +
Sbjct: 185 DNAIDVVGKSGSGSA----------VSGLKFRVLREFLQLGYGVLLADVDLVFLQNPFGH 234

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDL--PPPGKKGRTYIC-------SCMIYLRPTDGAKL 253
           L  D DV    D          +D+   P   + RT          S   YLRPT  +  
Sbjct: 235 LYRDSDVESMSDGHDNNTAYGFNDVFDDPSMTRSRTVYTTRIWVFNSGFFYLRPTLPSIE 294

Query: 254 VMKKWIEEL-QAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFP 301
           ++ +  + L ++  W       DQ  FN             L+ +   +D+Y      F 
Sbjct: 295 LLDRVTDTLSKSGGW-------DQAVFNEHLFYPSHPGYIGLHASKRAMDIY-----EFM 342

Query: 302 TGGLYFKNQTWVEETKG-KHVIIHNNY 327
              + FK     EE K  K VI+H NY
Sbjct: 343 NSRVLFKTVRKNEELKKLKPVIVHMNY 369


>gi|413923500|gb|AFW63432.1| hypothetical protein ZEAMMB73_690950 [Zea mays]
          Length = 390

 Score = 44.7 bits (104), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 48/233 (20%), Positives = 85/233 (36%), Gaps = 20/233 (8%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           F++++   +        ++++A D+          P    L     D     +F + G+ 
Sbjct: 141 FIDSFRRGVGTSSLLRHLVIVAFDFKAYEHCVKIHPYCYALPTKDVDFSEEKRFQTTGYL 200

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
               +R   L  +LE GY+ +++D D+VW ++PFP+   D D      +A    + ++ D
Sbjct: 201 EMMWKRLDFLRLVLEKGYSFVFSDADIVWFRNPFPHFYTDGDF----QIACDHYVGNATD 256

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKT 286
           L      G  Y+       +  D +    K W       P       +DQ  FN+ +   
Sbjct: 257 LRNIANGGFNYV-------KSNDQSIEFYKFWYSSRFRYP-----GYHDQDVFNF-IKHD 303

Query: 287 AGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
               D+ L  +    T   YF           K   +H N   G + KI   R
Sbjct: 304 RYTTDIGLTIKFLSTT---YFGGICEPSRDLNKVCTMHANCCIGLQSKIHDLR 353


>gi|356529430|ref|XP_003533295.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 357

 Score = 44.7 bits (104), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
           RR   L  +LE+GYN ++ D D++WL+DPF     D D     D
Sbjct: 178 RRTEFLTSVLEMGYNFVFTDTDIMWLRDPFKQFYEDADFQIACD 221


>gi|219362841|ref|NP_001136698.1| uncharacterized protein LOC100216832 [Zea mays]
 gi|194690508|gb|ACF79338.1| unknown [Zea mays]
 gi|194696684|gb|ACF82426.1| unknown [Zea mays]
 gi|414864584|tpg|DAA43141.1| TPA: hypothetical protein ZEAMMB73_234362 [Zea mays]
          Length = 368

 Score = 44.3 bits (103), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 27/116 (23%), Positives = 48/116 (41%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++    S ++    +++++ D A   +          LV    D      F + G+ 
Sbjct: 118 FLESFRSGESTRELLQHLVIVSLDTAAHARCGQVHRHCFALVTDGVDFSGQKNFMTDGYL 177

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLD 222
               RR   L  +LE G++ ++ D D+VW ++P P+   D D     D     P D
Sbjct: 178 RMMWRRVDFLREVLEKGFSFVFTDTDIVWFRNPLPHFYPDGDFQIACDHFTGDPSD 233


>gi|356534414|ref|XP_003535750.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 364

 Score = 44.3 bits (103), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 47/199 (23%), Positives = 82/199 (41%), Gaps = 22/199 (11%)

Query: 23  RNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVI---------LPWTGTPGF--MFPNAT 71
           R S T   +R +       T  V LS LV+LG +                GF  +FP+  
Sbjct: 3   RESKTIHLRRAVAASFFFAT--VSLSCLVLLGDVDSHRFLSSFHSSYSLSGFTRIFPSVY 60

Query: 72  SS-LAKWRDYTLSQAASFVA-KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQD 122
           +  +A   +Y L +  +  A K+ T+I+  +++ +         FL ++ I        +
Sbjct: 61  NDPVATSNEYPLEKILNDAAMKDRTVILTTLNEAWATPNSVIDLFLESFRIGDRTSTFLN 120

Query: 123 QVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILEL 182
            +++IA D     +          LV    D      F +  +     +R   L  +LE+
Sbjct: 121 HLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEAYFMTPRYLMMMWKRIDFLRTVLEM 180

Query: 183 GYNVMYNDVDMVWLKDPFP 201
           GYN ++ D D++W +DPFP
Sbjct: 181 GYNFVFTDADIMWFRDPFP 199


>gi|168062158|ref|XP_001783049.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665433|gb|EDQ52118.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 420

 Score = 44.3 bits (103), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 85/206 (41%), Gaps = 24/206 (11%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           +A N  +IV   +      L  W  SI R    + ++V  +D    +      P +    
Sbjct: 145 IAINEELIVSISNNNVRDMLQIWFESIQRVGVTNYLVVALDDEIASFCNEHNVPVYR--- 201

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
               D+  +      G  +  S    HLL   L LGY+V+ +DVD+V+L++PF +L  D 
Sbjct: 202 ---RDATISKSQAGTGSNHAISGLKFHLLREFLVLGYSVLLSDVDIVFLQNPFNHLHRDC 258

Query: 208 DVYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKK 257
           DV    D    + A    D S D P  G        R ++  S + Y+RPT  +  ++ +
Sbjct: 259 DVESMSDGFNNITAYGYDDVSED-PTMGWSRFAHTMRIWVFNSGLFYIRPTVPSIELLDR 317

Query: 258 WIEELQAEPWSKAKKANDQPAFNWAL 283
               L  E      KA DQ  FN  L
Sbjct: 318 VTARLSKE------KAWDQAVFNEEL 337


>gi|42571685|ref|NP_973933.1| nucleotide-diphospho-sugar transferase-like protein [Arabidopsis
           thaliana]
 gi|122213725|sp|Q3E6Y3.1|Y1869_ARATH RecName: Full=Uncharacterized protein At1g28695; Flags: Precursor
 gi|332192894|gb|AEE31015.1| nucleotide-diphospho-sugar transferase-like protein [Arabidopsis
           thaliana]
          Length = 329

 Score = 44.3 bits (103), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 36/150 (24%), Positives = 62/150 (41%), Gaps = 15/150 (10%)

Query: 84  QAASFVA---KNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAE 129
           +AA + A    N T+I+  V++ Y+            FL ++          D ++V+A 
Sbjct: 46  EAALYTAAAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAV 105

Query: 130 DYATLYKVN-GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMY 188
           D     +    R   + +      D +    F S+ F     RR   +L +L  GYNV++
Sbjct: 106 DQTAYDRCRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIF 165

Query: 189 NDVDMVWLKDPFPYLQGDHDVYFTDDMAAV 218
            D D++WL+ P   L    D+  + D   V
Sbjct: 166 TDTDVMWLRSPLSRLNMSLDMQISVDRINV 195


>gi|326525413|dbj|BAJ93085.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 357

 Score = 44.3 bits (103), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 29/141 (20%), Positives = 64/141 (45%), Gaps = 9/141 (6%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+I+ +V++ +         FL ++ + ++       ++V+A D   L +     P 
Sbjct: 87  EDRTVIMTSVNEAWAAEGSLLDSFLESFRVGLNISHLVKHIVVVAMDEGALRRCRAVHPH 146

Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
             +L+P     D   A  + ++ + +    +      +L LGYN+++ DVD+ W +DP  
Sbjct: 147 CHLLLPDVDGLDLSGAKSYMTKDYLDLVWSKLRLQHRVLLLGYNLLFTDVDVAWFRDPRV 206

Query: 202 YLQGDHDVYFTDDMAAVKPLD 222
           ++    D+  + D     P D
Sbjct: 207 HITAAADITTSSDFYFGDPDD 227


>gi|356561843|ref|XP_003549186.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 357

 Score = 44.3 bits (103), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 76/184 (41%), Gaps = 14/184 (7%)

Query: 45  VLLSLLVVLGVILPWTGTPGFMFP------NATSSLAKWRDYTLSQAASFVAKNGTIIVC 98
           + L+ L VL + L  + +P F FP      +A SS A +     S  A+   K+ T+I+ 
Sbjct: 37  MFLAGLAVLWMFLYNSASP-FEFPTSSDYFSAESSKADYDPKLESVLANASMKDKTVIIT 95

Query: 99  AVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPA 151
            ++  +         FL ++ +    Q   + ++ I  D  T  +          L    
Sbjct: 96  ILNDAWAEPGSMFDLFLESFRLGNETQWLLNHLVAITWDQKTYARCLAMHKHCYQLGTKG 155

Query: 152 PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYF 211
            +      F +  +     RR   L  +LE+GYN ++ D D++WL+DPF     D D   
Sbjct: 156 GNFTGEVFFMAPNYLQMMWRRTEFLGSVLEMGYNFVFTDTDIMWLRDPFKIFYKDADFQI 215

Query: 212 TDDM 215
             D+
Sbjct: 216 ACDV 219


>gi|255536925|ref|XP_002509529.1| conserved hypothetical protein [Ricinus communis]
 gi|223549428|gb|EEF50916.1| conserved hypothetical protein [Ricinus communis]
          Length = 254

 Score = 44.3 bits (103), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 5/73 (6%)

Query: 147 LVPPAPDSQTAHKFGSQG--FFNFTS---RRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           L+  A D+ T H   S     F+F S   RR    L +L+LGYN++Y DVD++WL++P  
Sbjct: 43  LIIIALDTHTLHYCQSIRPHCFHFKSTGQRRKELFLQVLQLGYNLVYTDVDVMWLRNPMS 102

Query: 202 YLQGDHDVYFTDD 214
              G  ++    D
Sbjct: 103 LFDGLAEISLGCD 115


>gi|356538805|ref|XP_003537891.1| PREDICTED: uncharacterized protein LOC100802215 [Glycine max]
          Length = 437

 Score = 44.3 bits (103), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 35/133 (26%), Positives = 59/133 (44%), Gaps = 4/133 (3%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IVC  +      L  W  +I R    + ++   +D    +  + + 
Sbjct: 155 LAKILEKVAVKQELIVCLANTNVKEMLEVWFTNIKRVGITNYLVAALDDETAKFCESNQV 214

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P +     P     T  + GS    +    R   L   L+LGY+V+ +DVD+V+L++PF 
Sbjct: 215 PVYKR--DPDDGVDTIGRTGSNHAVSGLKFRI--LREFLQLGYSVLLSDVDIVYLQNPFD 270

Query: 202 YLQGDHDVYFTDD 214
           +L  D DV    D
Sbjct: 271 HLYRDSDVESMSD 283


>gi|224005136|ref|XP_002296219.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586251|gb|ACI64936.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 499

 Score = 44.3 bits (103), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 20/151 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISI-SRQKHQDQVLVIAEDYATLYKV--- 137
           L Q   F  +  TIIV  V+Q     L N+L +  +R+    +VLV   D  +   +   
Sbjct: 307 LHQTTQF--EKETIIVMTVNQGQSHLLVNFLCAARARKLDVRRVLVFVTDEESKQLIEDL 364

Query: 138 -NGRWPGHAVL-----VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDV 191
            N    G  V      +   P      K+G   F +    +   +L++  LGY+V++ D 
Sbjct: 365 SNDDEVGVMVFYDKWNMEELPKGGEGVKYGDSTFTSMMFAKILCVLYVSLLGYDVLFQDA 424

Query: 192 DMVWLKDPFPY--------LQGDHDVYFTDD 214
           D+VW +DP P+        L  ++D+ F  D
Sbjct: 425 DIVWYQDPLPFFEMKDNSSLNQNYDIIFQHD 455


>gi|326490027|dbj|BAJ94087.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 379

 Score = 44.3 bits (103), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           ILELGYN ++ DVD++W +DPF ++    D+  + D+    P
Sbjct: 208 ILELGYNFLFTDVDVMWFRDPFKHISMGADIAISSDVFIGDP 249


>gi|224136554|ref|XP_002326889.1| predicted protein [Populus trichocarpa]
 gi|222835204|gb|EEE73639.1| predicted protein [Populus trichocarpa]
          Length = 419

 Score = 44.3 bits (103), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 89/212 (41%), Gaps = 20/212 (9%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IV   +      L  W  +I +   ++ ++V  +D+   +  +   
Sbjct: 135 LAKLLEEVAVRKELIVALANSNVKTMLEVWFANIKKAGIRNYLVVALDDHIVDFCKSNDV 194

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PDS       + G    +  +   L   L+LGY+V+ +DVD+++L++PF 
Sbjct: 195 P----VYKRDPDSGIDSVARTGGNHAVSGLKFRILREFLQLGYSVLLSDVDIIYLQNPFD 250

Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
           +L  D DV       D+M A    D   + P  G        R ++  S   Y+RPT  +
Sbjct: 251 HLYRDSDVESMSDGHDNMTAYG-FDDVFNEPAMGWARYAHTMRIWVYNSGFFYIRPTLPS 309

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
             ++ +    L  EP S      DQ  FN  L
Sbjct: 310 IELLDRVAGRLSREPNSW-----DQAVFNEEL 336


>gi|308809339|ref|XP_003081979.1| unnamed protein product [Ostreococcus tauri]
 gi|116060446|emb|CAL55782.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 579

 Score = 44.3 bits (103), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 50/223 (22%), Positives = 85/223 (38%), Gaps = 43/223 (19%)

Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPL 221
           S+ F    + +   +  +L+LG   + +D D+VW++DP      +    + D + +   +
Sbjct: 93  SKAFKKMGTVKTKFVQDLLDLGIAPILSDADVVWMRDPREVFN-NGTFKYADILISSDCI 151

Query: 222 DHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN- 280
           D   D             + ++Y+RPT  AK  ++KW  ++     S+     DQPA N 
Sbjct: 152 DTVGDRKDDKSCLHVNFNTGVLYIRPTTRAKEFVEKWKHKVAT---SEIAWMRDQPALNL 208

Query: 281 -----------------------------WALNKTAGQVDLYLLPQSAFPTGGLYFKNQT 311
                                        +A N T   + L +LP + F  G  +F  + 
Sbjct: 209 LVRDGHPPLEVAVRVPADKRGLPGYRSLLFAANNT---IRLGVLPVAQFSNGHTFFVQEH 265

Query: 312 WVEE-TKGKHVIIHNNYITGFEK-----KIKRFRDFGLWLVDD 348
            V     G+   +H  Y  G        K +R R  GLW +DD
Sbjct: 266 HVHHPEDGEPYSVHMTYQYGDTHDYAYGKRQRLRQHGLWYLDD 308


>gi|224125690|ref|XP_002329694.1| predicted protein [Populus trichocarpa]
 gi|222870602|gb|EEF07733.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 44.3 bits (103), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 50/230 (21%), Positives = 89/230 (38%), Gaps = 22/230 (9%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++ I    ++  + +++I+ D     +     P    L     +      F S+ + 
Sbjct: 113 FLESFQIGEKTKELLNNLVIISLDQKAHARCLAIHPHCYALKTEGLNFTREASFMSEDYL 172

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHD 226
               RR   L  +LE+GY+ ++ D D++WL++PFP      D     D     P D S+ 
Sbjct: 173 KMMWRRIEFLNTVLEMGYSFVFTDADIMWLRNPFPRFYPRVDFQIACDKYYGNPKDKSNR 232

Query: 227 LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALN-K 285
             P G  G TY+       R      L  + W    +  P +     +DQ   N   N +
Sbjct: 233 --PNG--GFTYV-------RSNLRTILFYRFWFMSRKTYPGN-----HDQDVLNKIKNDR 276

Query: 286 TAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKI 335
              +++L +        GG   +++ +      K   +H N   G E K+
Sbjct: 277 VLEKIELTMRFLDTAYFGGFCERSKDF-----NKVCTMHANCCYGLETKV 321


>gi|356561837|ref|XP_003549183.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 351

 Score = 44.3 bits (103), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
           RR   L  +LE+GYN ++ D D++WL+DPF     D D     D
Sbjct: 172 RRTEFLTSVLEMGYNFVFTDTDIMWLRDPFKQFYEDADFQIACD 215


>gi|50198979|gb|AAT70491.1| At1g14590 [Arabidopsis thaliana]
          Length = 354

 Score = 44.3 bits (103), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 33/139 (23%), Positives = 63/139 (45%), Gaps = 14/139 (10%)

Query: 81  TLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYAT 133
            LS+AA+   ++ T+++  ++  +         F  ++ I     +  D ++++A D A 
Sbjct: 56  VLSKAAT---RDRTVVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLVIVALD-AK 111

Query: 134 LYKVNGRWPGHAV-LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
            Y        H   LV    D      F ++ +     RR   L  +LE+GYN ++ D D
Sbjct: 112 AYSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYNFVFTDAD 171

Query: 193 MVWLKDPFP--YLQGDHDV 209
           ++W ++PFP  Y+  D  +
Sbjct: 172 VMWFRNPFPRFYMYADFQI 190


>gi|224067389|ref|XP_002302478.1| predicted protein [Populus trichocarpa]
 gi|222844204|gb|EEE81751.1| predicted protein [Populus trichocarpa]
          Length = 429

 Score = 44.3 bits (103), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 91/212 (42%), Gaps = 20/212 (9%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++  + VA +  +IV   +      L  W  +I +   ++ ++V  +D+   +  +   
Sbjct: 145 LAKLLAEVAVHKELIVALANSNVKAMLEVWFTNIKKAGIRNYLVVALDDHIVDFCKSNDV 204

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PDS       + G    +  +   L   L+LGY+V+ +D+D+++L++PF 
Sbjct: 205 P----VYKRDPDSGIDSVARTGGNHAVSGLKFHILREFLQLGYSVLLSDIDIIYLQNPFD 260

Query: 202 YLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGA 251
           +L  D DV       D+M A    +   D P  G        R ++  S   Y+RPT  +
Sbjct: 261 HLYRDSDVESMSDGHDNMTAYG-FNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTLPS 319

Query: 252 KLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
             ++ +    L  EP S      DQ  FN  L
Sbjct: 320 IELLDRVAGRLSREPNSW-----DQAVFNEEL 346


>gi|357114228|ref|XP_003558902.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 376

 Score = 44.3 bits (103), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 27/113 (23%), Positives = 50/113 (44%), Gaps = 5/113 (4%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++ I  + +   D +++++ D A   +          +     D      F + G+ 
Sbjct: 129 FLESFRIGDNTRHLLDHLVIVSLDNAAHRRCKQIHAHCLAVATDGVDFSGQKNFMTDGYL 188

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP--YLQGDHDV---YFTDD 214
               RR   L  +LE G++ ++ D D+VW + P P  Y +GD  +   +FT D
Sbjct: 189 KMMWRRIDFLRQVLEKGFSFIFTDTDIVWFRSPIPRLYAEGDFQIACDHFTGD 241


>gi|443710359|gb|ELU04613.1| hypothetical protein CAPTEDRAFT_225841 [Capitella teleta]
          Length = 315

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 47/200 (23%), Positives = 80/200 (40%), Gaps = 34/200 (17%)

Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVK 219
           F S  F   +  +   +  +L+LGY ++  D+D+V+LK+P        DV        VK
Sbjct: 148 FASAAFNEKSKVKLAMVYAVLQLGYKILIADLDVVFLKNPL-------DV--------VK 192

Query: 220 PLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAF 279
                 D+       +  + +  +Y +PT  +    KK  E++         K +DQ  F
Sbjct: 193 SCGKDCDIAVQNNTNKQ-LNTGFLYSKPTPKSIAFYKKITEKMVDS------KGHDQSVF 245

Query: 280 N--WALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKR 337
           N  +  N   G +++++LP      G           E +  HV  H N+      KI+ 
Sbjct: 246 NMVYKRNMVPG-INIHVLPVDVACVGSF--------AEPRKCHV-FHANFKKLLAPKIRI 295

Query: 338 FRDFGLWLVDDHAVESPLGK 357
            +  G+W  D   VE  L +
Sbjct: 296 LKQHGMWKADSRDVEEFLCR 315


>gi|242037413|ref|XP_002466101.1| hypothetical protein SORBIDRAFT_01g001260 [Sorghum bicolor]
 gi|241919955|gb|EER93099.1| hypothetical protein SORBIDRAFT_01g001260 [Sorghum bicolor]
          Length = 371

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 41/188 (21%), Positives = 74/188 (39%), Gaps = 23/188 (12%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D ++   F S+ +      R      +LELGYN ++ DVD++W +DPF ++    D+  +
Sbjct: 174 DYRSEKMFMSKDYLEMMWGRNRFQQTVLELGYNFLFTDVDVMWFRDPFRHISMAADIAIS 233

Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKK 272
            D+    P     + P  G           +++R +       + W    Q   W    K
Sbjct: 234 SDVYIGDPYS-LRNFPNGG----------FLFVRSSAKTIDFYRAW----QQGRWRFLGK 278

Query: 273 ANDQPAFNWALNKTAGQVDLYL-LPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGF 331
            ++Q  FN   ++ A ++DL +    +A+ +G               K   +H N   G 
Sbjct: 279 -HEQDVFNLIKHEMAPKLDLAIQFLDTAYISGFCQLSKDL------NKICTLHANCCVGL 331

Query: 332 EKKIKRFR 339
             K+   R
Sbjct: 332 GAKLHDLR 339


>gi|218189627|gb|EEC72054.1| hypothetical protein OsI_04963 [Oryza sativa Indica Group]
          Length = 367

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 71/175 (40%), Gaps = 26/175 (14%)

Query: 95  IIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVL 147
           II+ +V++ +         FL  +       +  D +L++A D     +     P H   
Sbjct: 105 IIMTSVNEAWAAPGSLLDLFLEGFRAGEGIARFVDHLLIVALDDGAFRRCRDVHP-HCYR 163

Query: 148 VPPAPDSQTAHK-FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGD 206
           +  A  + T  K F S+ + +    +      ILELGYN ++ DVD++W +DPF  +   
Sbjct: 164 LAVAGRNFTDEKVFMSEDYLDLVWSKVKLQQRILELGYNFLFTDVDILWFRDPFEQMSMA 223

Query: 207 HDVYFTDDM---AAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
             +  + D     A  P     + P  G           +Y+R +  A  VM+ W
Sbjct: 224 AHMVTSSDFFVGGAYNPA----NFPNTG----------FLYVRSSRRAVGVMEAW 264


>gi|302850830|ref|XP_002956941.1| hypothetical protein VOLCADRAFT_83868 [Volvox carteri f.
           nagariensis]
 gi|300257822|gb|EFJ42066.1| hypothetical protein VOLCADRAFT_83868 [Volvox carteri f.
           nagariensis]
          Length = 647

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 58/269 (21%), Positives = 109/269 (40%), Gaps = 36/269 (13%)

Query: 91  KNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED--YATLYKVN-GRWPGHAVL 147
           ++G I++   +  YL F   W+ ++ +      ++   +D     L ++N   W  +  +
Sbjct: 21  QDGLIMITWANHHYLDFAKTWVYNLKKSGVSGYMVGAMDDDMLKDLVELNINTWRMNTGI 80

Query: 148 VPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG-- 205
                 ++    +GSQ F      +   +  +L L   V+ +D+D  WLK+P PY     
Sbjct: 81  ------TKRDLGWGSQNFHLMGRFKIKLIRDVLALDVTVVVSDIDTAWLKNPIPYFHRYP 134

Query: 206 DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAE 265
           + D+  + D  +    D S +  P    G  +    M++ RP   +K  + +W++ L  +
Sbjct: 135 EADILTSTDQLSPTVKDDSLERFPDA--GSAFNIGIMLF-RPN--SKEFVDEWVKALD-D 188

Query: 266 PWSKAKKANDQPAFNWALNKTA--------------GQVDLYLLPQSAFPTGGLYFKNQT 311
           P     K  DQ AFN    K                G++ + +LP + F +G  +F    
Sbjct: 189 P-----KMWDQTAFNDLARKVGAALTPPKNLWMGYNGKLRVGVLPCALFASGHTFFVQHK 243

Query: 312 WVEETKGKHVIIHNNYITGFEKKIKRFRD 340
           + E     +V       +G   K  RFR+
Sbjct: 244 YQELGLEPYVAHATFQYSGTPGKRHRFRE 272


>gi|242049768|ref|XP_002462628.1| hypothetical protein SORBIDRAFT_02g029220 [Sorghum bicolor]
 gi|241926005|gb|EER99149.1| hypothetical protein SORBIDRAFT_02g029220 [Sorghum bicolor]
          Length = 349

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 50/245 (20%), Positives = 97/245 (39%), Gaps = 43/245 (17%)

Query: 107 FLNNWLISISRQ---KHQDQVLVIAEDYATL-------YKVNGRWPGHAVLVPPAPDSQT 156
           FL ++ + +  +   KH   V V  + YA         Y +  R  G         D  +
Sbjct: 109 FLESFRVGVRTEPLLKHLVIVAVDGKAYARCTQVHPFCYHLRARGAG-------VDDYAS 161

Query: 157 AHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMA 216
              F S+ + +   RR      +L+LGY+ ++ D+D++WL++P   +    D+  + D  
Sbjct: 162 EQSFMSKSYLDLMWRRNRFQARVLQLGYSFVFTDMDILWLRNPLLRVPVGADLAMSADYF 221

Query: 217 AVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQ 276
                D+ +DL      G        +Y + +         W E  +  P       N+Q
Sbjct: 222 YG---DNPYDLNKTANGG-------FVYAKASARTAAFYDGWYEARREHPGK-----NEQ 266

Query: 277 PAFNWALNKTAGQVDLYL-LPQSAFPTGGLYFKNQTWVEETKGKHVI--IHNNYITGFEK 333
             F+ A +  A +  + +    +A+ +G        + E  K  HV+  +H N + G + 
Sbjct: 267 DVFDQAKHALAARHGVRVQFVDTAYLSG--------FCELRKDFHVVCTVHGNCLFGLKD 318

Query: 334 KIKRF 338
           K+++ 
Sbjct: 319 KLQKL 323


>gi|219886249|gb|ACL53499.1| unknown [Zea mays]
          Length = 277

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/199 (20%), Positives = 79/199 (39%), Gaps = 25/199 (12%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+I+ +V++ +         FL ++ +  +       V+V+A D     +     P 
Sbjct: 8   EDRTVIMTSVNEAWAAPGSLLDSFLESFQVGENVSHFAKHVVVVAMDDGAFRRCQAVHPH 67

Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
             +L P     D   A  + ++ + +    +      +LELGYN+++ DVD+ W ++P  
Sbjct: 68  CHLLRPEKAGLDLSGAKSYMTKDYLDLVWSKLKLQQRVLELGYNLLFTDVDVAWFRNPLV 127

Query: 202 YLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEE 261
           ++    D+  + D     P D   + P  G           IY + T      M  W   
Sbjct: 128 HITMAADITTSSDFYFGNP-DDLGNFPNTG----------FIYFKATPRNARAMAYWHAA 176

Query: 262 LQAEPWSKAKKANDQPAFN 280
            +  P     + +DQ  FN
Sbjct: 177 RRRFP-----ENHDQFVFN 190


>gi|229914876|gb|ACQ90601.1| putative Myb DNA binding protein [Eutrema halophilum]
          Length = 397

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/167 (21%), Positives = 69/167 (41%), Gaps = 14/167 (8%)

Query: 81  TLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYAT 133
            L +AA+   ++GT+I+  +++ +         F  ++ I        + +++IA D   
Sbjct: 101 VLRRAAT---RDGTVILTTLNEAWAAPGSVIDLFFESFGIGEGTSMLLNHLVIIALDAKA 157

Query: 134 LYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDM 193
             +          L     D      F ++ +     RR   L  +LE+GYN ++ D D+
Sbjct: 158 YSRCRELHKHCFSLETEGVDFSGEAYFMTRSYLKMMWRRIDFLRSVLEMGYNFVFTDADV 217

Query: 194 VWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           +W ++PFP      D      +A    L  S+DL      G +++ S
Sbjct: 218 MWFRNPFPRFYRSADF----QIACDHYLGRSNDLQNRPNGGFSFVRS 260


>gi|384247083|gb|EIE20571.1| hypothetical protein COCSUDRAFT_37807 [Coccomyxa subellipsoidea
           C-169]
          Length = 279

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 17/111 (15%)

Query: 180 LELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT----DDMAAVKPLDHSHDLPPPG---- 231
           LELG+NV+ +DVD++ ++DPF +L  DHD+       DD  A   ++   D P  G    
Sbjct: 88  LELGWNVLLSDVDIIVVQDPFQHLHRDHDIEGMSDGFDDATAYGNINGLDD-PAMGWSRY 146

Query: 232 KKGRTYI--CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
            +G T++   S + Y++       +MK+    L  E      KA DQ  FN
Sbjct: 147 AQGTTHLNMNSGLFYIQANVRTVDLMKRVAARLAKE------KAWDQSVFN 191


>gi|15223826|ref|NP_172911.1| putative myb DNA binding protein [Arabidopsis thaliana]
 gi|7527728|gb|AAF63177.1|AC010657_13 T5E21.9 [Arabidopsis thaliana]
 gi|332191067|gb|AEE29188.1| putative myb DNA binding protein [Arabidopsis thaliana]
          Length = 386

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 33/139 (23%), Positives = 63/139 (45%), Gaps = 14/139 (10%)

Query: 81  TLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYAT 133
            LS+AA+   ++ T+++  ++  +         F  ++ I     +  D ++++A D A 
Sbjct: 88  VLSKAAT---RDRTVVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLVIVALD-AK 143

Query: 134 LYKVNGRWPGHAV-LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
            Y        H   LV    D      F ++ +     RR   L  +LE+GYN ++ D D
Sbjct: 144 AYSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYNFVFTDAD 203

Query: 193 MVWLKDPFP--YLQGDHDV 209
           ++W ++PFP  Y+  D  +
Sbjct: 204 VMWFRNPFPRFYMYADFQI 222


>gi|242055413|ref|XP_002456852.1| hypothetical protein SORBIDRAFT_03g044060 [Sorghum bicolor]
 gi|241928827|gb|EES01972.1| hypothetical protein SORBIDRAFT_03g044060 [Sorghum bicolor]
          Length = 378

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/141 (20%), Positives = 62/141 (43%), Gaps = 9/141 (6%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+I+ +V++ +         FL ++ +  +       ++V+A D     +     P 
Sbjct: 110 EDRTVIMTSVNEAWAAPGSLLDSFLESFRVGENVSHFVKHIVVVAMDDGAFRRCQAVHPH 169

Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
             +L P     D   A  + ++ + +    +      ILELGYN+++ DVD+ W ++P  
Sbjct: 170 CHLLRPEKEGLDLSGAKSYMTKDYLDLVWSKLRLQQRILELGYNLLFTDVDLAWFRNPLV 229

Query: 202 YLQGDHDVYFTDDMAAVKPLD 222
           ++    D+  + D     P D
Sbjct: 230 HITMAADITTSSDFYFGNPDD 250


>gi|384253564|gb|EIE27038.1| hypothetical protein COCSUDRAFT_11236, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 274

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 45/195 (23%), Positives = 83/195 (42%), Gaps = 23/195 (11%)

Query: 89  VAKNGTIIVCAVSQPYLP---FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
           VAK+  ++V   +  Y      L+ W+  + R    + ++V  +D    +  +  +  + 
Sbjct: 2   VAKDNEVMVAISNANYAQPGGMLDLWMAGVRRANVTNALIVALDDATQQHAESLGFTAYQ 61

Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
           + +     S  A          F   RP     +L+LGY V+ +DVD++ L++PF +L  
Sbjct: 62  MSLQKGVGSNHA-----VSGLKFRVLRP-----MLDLGYAVLLSDVDIITLQNPFQFLHR 111

Query: 206 DHDVY-FTD--DMAAVKPLDHSHDLPPPG-----KKGRTYIC-SCMIYLRPTDGAKLVMK 256
           D DV   +D  D A     +   D    G        R ++  S + Y+RPT  +  ++ 
Sbjct: 112 DSDVEGMSDGWDNATAYGYNDVRDDAAMGWARYAHSMRVFVMNSGLFYMRPTRASVDLLD 171

Query: 257 KWIEELQAEP-WSKA 270
           + +  L  E  W +A
Sbjct: 172 RIVHRLNTEDGWDQA 186


>gi|226497410|ref|NP_001143689.1| uncharacterized protein LOC100276419 [Zea mays]
 gi|195624720|gb|ACG34190.1| hypothetical protein [Zea mays]
          Length = 423

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 75/173 (43%), Gaps = 31/173 (17%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG 231
           L   L+LGY+++ +D+D+++L++PF +L  D DV    D    M A    +   D P  G
Sbjct: 230 LREFLQLGYSILLSDIDIIFLRNPFDHLYRDSDVESMSDGHNNMTAYG-FNDVFDEPSMG 288

Query: 232 -----KKGRTYI-CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN----- 280
                   R ++  S   Y+RPT  +  ++ +    L  EP     K+ DQ  FN     
Sbjct: 289 WARYAHTMRIWVYNSGFFYIRPTVPSIELLDRVAGRLSREP-----KSWDQAVFNEELFF 343

Query: 281 ------WALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                   L+ +   +D YLL  S      L+   +      K K VI+H NY
Sbjct: 344 PSHPGYEGLHASRRTMDFYLLMNSKV----LFKTVRKDAGLKKLKPVIVHLNY 392


>gi|115460156|ref|NP_001053678.1| Os04g0585400 [Oryza sativa Japonica Group]
 gi|38344278|emb|CAE03761.2| OSJNBa0013K16.10 [Oryza sativa Japonica Group]
 gi|113565249|dbj|BAF15592.1| Os04g0585400 [Oryza sativa Japonica Group]
          Length = 362

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/108 (22%), Positives = 48/108 (44%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           F++++   +      + +++IA D+    +     P    L     D     +F + G+ 
Sbjct: 113 FIDSFRSGVRTSSLINHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
               +R   L  +LE G++ +++D D++W + PFPY   D D     D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGDFQIACD 220


>gi|6691205|gb|AAF24543.1|AC007508_6 F1K23.11 [Arabidopsis thaliana]
          Length = 841

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/146 (23%), Positives = 61/146 (41%), Gaps = 15/146 (10%)

Query: 84  QAASFVA---KNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAE 129
           +AA + A    N T+I+  V++ Y+            FL ++          D ++V+A 
Sbjct: 46  EAALYTAAAGNNKTVIITMVNKAYVKEVGRGSTMLDLFLESFWEGEGTLPLLDHLMVVAV 105

Query: 130 DYATLYKVN-GRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMY 188
           D     +    R   + +      D +    F S+ F     RR   +L +L  GYNV++
Sbjct: 106 DQTAYDRCRFKRLHCYKMETEDGVDLEGEKVFMSKDFIEMMWRRTRLILDVLRRGYNVIF 165

Query: 189 NDVDMVWLKDPFPYLQGDHDVYFTDD 214
            D D++WL+ P   L    D+  + D
Sbjct: 166 TDTDVMWLRSPLSRLNMSLDMQISVD 191


>gi|125549492|gb|EAY95314.1| hypothetical protein OsI_17140 [Oryza sativa Indica Group]
          Length = 357

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/108 (22%), Positives = 48/108 (44%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           F++++   +      + +++IA D+    +     P    L     D     +F + G+ 
Sbjct: 113 FIDSFRSGVRTSSLLNHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
               +R   L  +LE G++ +++D D++W + PFPY   D D     D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGDFQIACD 220


>gi|116309821|emb|CAH66858.1| H0307D04.3 [Oryza sativa Indica Group]
          Length = 362

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/108 (22%), Positives = 48/108 (44%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           F++++   +      + +++IA D+    +     P    L     D     +F + G+ 
Sbjct: 113 FIDSFRSGVRTSSLLNHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
               +R   L  +LE G++ +++D D++W + PFPY   D D     D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGDFQIACD 220


>gi|125591429|gb|EAZ31779.1| hypothetical protein OsJ_15931 [Oryza sativa Japonica Group]
          Length = 357

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/108 (22%), Positives = 48/108 (44%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           F++++   +      + +++IA D+    +     P    L     D     +F + G+ 
Sbjct: 113 FIDSFRSGVRTSSLINHLVIIAFDWNAYKQCLKIHPYCFALGTDGVDFSEEKRFLTSGYL 172

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDD 214
               +R   L  +LE G++ +++D D++W + PFPY   D D     D
Sbjct: 173 EMMWKRIDFLRLVLESGFSFIFSDADIMWFRSPFPYFYPDGDFQIACD 220


>gi|115471611|ref|NP_001059404.1| Os07g0294800 [Oryza sativa Japonica Group]
 gi|28564589|dbj|BAC57756.1| putative regulatory protein [Oryza sativa Japonica Group]
 gi|50509709|dbj|BAD31747.1| putative regulatory protein [Oryza sativa Japonica Group]
 gi|113610940|dbj|BAF21318.1| Os07g0294800 [Oryza sativa Japonica Group]
 gi|215686845|dbj|BAG89695.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 406

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           ILELG++ ++ DVD++W + PFP+L  D  V  + D     P
Sbjct: 235 ILELGFSFLFTDVDILWFRSPFPHLSPDAQVVMSSDFFVGDP 276


>gi|449492975|ref|XP_004159157.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101227563 [Cucumis sativus]
          Length = 435

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 99/260 (38%), Gaps = 35/260 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA +  +IV   +      L  W  SI +    + ++V  +D    +      P +    
Sbjct: 158 VAIDRELIVAVANSNVKAMLELWFTSIKKAGIPNYLVVALDDEIVQFCKXNDVPVYM--- 214

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD +      + G    +  +   L   L+LGY V+ +DVD+V+L++PF +L  D D
Sbjct: 215 -RDPDEKVDSIGRTGGNHAVSGTKFRILREFLQLGYAVLLSDVDIVYLQNPFNHLYRDSD 273

Query: 209 V-YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI--------YLRPTDGAKLVMKKWI 259
           V   TD    V    ++     P      +  +  I        Y+RPT  A  ++ +  
Sbjct: 274 VESMTDGHDNVTAYGYNDVFEEPAMGWARFAHTMRIWVYNSGFFYIRPTIPAIELLDRVA 333

Query: 260 EELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFPTGGLYFK 308
             L     S+ + + DQ  FN             L  +   +D YL     F    + FK
Sbjct: 334 NRL-----SRERNSWDQAVFNEELFFPSHSNYEGLYASRRTMDFYL-----FMNSKVLFK 383

Query: 309 NQTWVEETKG-KHVIIHNNY 327
                +  K  K VIIH NY
Sbjct: 384 TVRKDDNLKKLKPVIIHVNY 403


>gi|297842253|ref|XP_002889008.1| hypothetical protein ARALYDRAFT_476650 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334849|gb|EFH65267.1| hypothetical protein ARALYDRAFT_476650 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 429

 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 66/257 (25%), Positives = 101/257 (39%), Gaps = 30/257 (11%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA N  IIV   +    P L   + SI R    + +++  +D    +  +     +    
Sbjct: 154 VAVNKEIIVVLANSNVKPMLELQIASIKRVGILNYLIIALDDSVESFCESKEVSYY---- 209

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD        S G    +  +   L   L+LGY+V+ +DVD+V+L++PF +L  D D
Sbjct: 210 KRDPDKAVDMVGKSGGNHAVSGLKFRVLREFLQLGYSVLLSDVDIVFLQNPFGHLYRDSD 269

Query: 209 VYFT----DDMAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
           V       D+M A    +   D P  G        R ++  S   YLRPT  +  ++ + 
Sbjct: 270 VESMSDGHDNMTAY-GFNDVFDEPSMGWARYAHTMRIWVFNSGFFYLRPTLPSIELLDRV 328

Query: 259 IEEL-QAEPWSKAKKANDQPAFNWAL------NKTAGQVDLYLLPQSAFPTGGLYFKNQT 311
              L ++E W       DQ  FN  L        T       ++    F    + FK   
Sbjct: 329 AYTLSKSEAW-------DQAVFNEQLFYPSHPGYTGLHASKRVMDMYEFMNSKVLFKTVR 381

Query: 312 WVEETKG-KHVIIHNNY 327
              E K  K VI+H NY
Sbjct: 382 KNHELKKLKPVIVHLNY 398


>gi|357121351|ref|XP_003562384.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 378

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 27/42 (64%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           I+ELGYN ++ DVD++W +DPF ++    D+  + D+    P
Sbjct: 207 IVELGYNFLFTDVDVMWFRDPFKHISMGADIAISSDVFIGDP 248


>gi|224067546|ref|XP_002302503.1| predicted protein [Populus trichocarpa]
 gi|222844229|gb|EEE81776.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 27/44 (61%)

Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
           F SQ F     RR   LL +L+ GYN ++ D D++WL++PF  L
Sbjct: 104 FMSQDFLKMMWRRTLLLLDVLKHGYNFIFTDTDVMWLRNPFSRL 147


>gi|407847536|gb|EKG03219.1| hypothetical protein TCSYLVIO_005741 [Trypanosoma cruzi]
          Length = 390

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 55/275 (20%), Positives = 102/275 (37%), Gaps = 30/275 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISIS-RQKHQDQVLVIAEDYATLYKVNGRWP--GHAV 146
            +NG + V   +  +   L+N++ S + R K ++ ++   +D A L   + R P     +
Sbjct: 71  GENGFVTVVIFNSFWRDHLHNFVYSFAKRAKMRNLIVASVDDTALLLCFSFRLPCLNATL 130

Query: 147 LVPPAPDSQTA-HKFGSQGFF------NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
            V P   ++       S+G F       F+  +P   + +L  GY  M  D+D+ W + P
Sbjct: 131 FVEPEKGTEKGGANASSKGGFTRKVTEEFSWVKPPLAIAVLRRGYTFMLADLDITWNRSP 190

Query: 200 FPYLQGDH-DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
            PYL  +  D+    D      ++    +  P  +   Y    M +       +  M+ +
Sbjct: 191 MPYLLKNRLDLVHQCDSRGRLSINSGIYMARPNSRNLLYFQDLMSFRTDESADQNAMRLF 250

Query: 259 IEELQAEPWSKAKKANDQPAFNWALN-----KTAGQVDLYLLPQSAFPTGGLYFKNQTWV 313
                   +      + Q    W  N     K  G V L          G   F+   + 
Sbjct: 251 AN------YDHVHGVSQQCLPKWEFNMKCNYKVEGSVRL--------EDGRQTFRWHPYP 296

Query: 314 EETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
                K V +H   ++G + KI+ F+    W +D+
Sbjct: 297 LHENPKWVAMHATCLSGAKDKIRYFKAIKAWFLDE 331


>gi|414867357|tpg|DAA45914.1| TPA: hypothetical protein ZEAMMB73_843284 [Zea mays]
          Length = 426

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 64/268 (23%), Positives = 106/268 (39%), Gaps = 37/268 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED-YATLYKVNGR 140
           L++    VA    +IV   +      L  W  +I R    + ++V  +D   +L K  G 
Sbjct: 143 LAKILEEVAVKKELIVALANTNVREMLEVWFSNIKRVGIPNYLVVALDDNIESLCKSKG- 201

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                 +    PD    +   + G    +  +   L   L+LGY+++ +D+D+++L++PF
Sbjct: 202 ----VPVYRRDPDEGIDNIAKTGGNHAVSGLKFRVLREFLQLGYSILLSDIDIIFLRNPF 257

Query: 201 PYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDG 250
            +L  D DV    D    M A    +   D P  G        R ++  S   Y+RPT  
Sbjct: 258 DHLHRDSDVESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFYIRPTVP 316

Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSA 299
           +  ++ +    L  EP     K+ DQ  FN             L+ +   +D YL   S 
Sbjct: 317 SIELLDRVAGRLSREP-----KSWDQAVFNEELFFPSHPGYEGLHASRRTMDFYLFMNSK 371

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                L+   +      K K VI+H NY
Sbjct: 372 V----LFKTVRKDAGLKKLKPVIVHLNY 395


>gi|223942815|gb|ACN25491.1| unknown [Zea mays]
 gi|413951495|gb|AFW84144.1| hypothetical protein ZEAMMB73_902211 [Zea mays]
          Length = 378

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 29/141 (20%), Positives = 62/141 (43%), Gaps = 9/141 (6%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           ++ T+I+ +V++ +         FL ++ +  +       V+V+A D     +     P 
Sbjct: 109 EDRTVIMTSVNEAWAAPGSLLDSFLESFQVGENVSHFAKHVVVVAMDDGAFRRCQAVHPH 168

Query: 144 HAVLVPPAP--DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
             +L P     D   A  + ++ + +    +      +LELGYN+++ DVD+ W ++P  
Sbjct: 169 CHLLRPEKAGLDLSGAKSYMTKDYLDLVWSKLKLQQRVLELGYNLLFTDVDVAWFRNPLV 228

Query: 202 YLQGDHDVYFTDDMAAVKPLD 222
           ++    D+  + D     P D
Sbjct: 229 HITMAADITTSSDFYFGNPDD 249


>gi|194696198|gb|ACF82183.1| unknown [Zea mays]
          Length = 426

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 64/268 (23%), Positives = 106/268 (39%), Gaps = 37/268 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAED-YATLYKVNGR 140
           L++    VA    +IV   +      L  W  +I R    + ++V  +D   +L K  G 
Sbjct: 143 LAKILEEVAVKKELIVALANTNVREMLEVWFSNIKRVGIPNYLVVALDDNIESLCKSKG- 201

Query: 141 WPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
                 +    PD    +   + G    +  +   L   L+LGY+++ +D+D+++L++PF
Sbjct: 202 ----VPVYRRDPDEGIDNIAKTGGNHAVSGLKFRVLREFLQLGYSILLSDIDIIFLRNPF 257

Query: 201 PYLQGDHDVYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDG 250
            +L  D DV    D    M A    +   D P  G        R ++  S   Y+RPT  
Sbjct: 258 DHLHRDSDVESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFYIRPTVP 316

Query: 251 AKLVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSA 299
           +  ++ +    L  EP     K+ DQ  FN             L+ +   +D YL   S 
Sbjct: 317 SIELLDRVAGRLSREP-----KSWDQAVFNEELFFPSHPGYEGLHASRRTMDFYLFMNSK 371

Query: 300 FPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                L+   +      K K VI+H NY
Sbjct: 372 V----LFKTVRKDAGLKKLKPVIVHLNY 395


>gi|300123542|emb|CBK24814.2| unnamed protein product [Blastocystis hominis]
          Length = 243

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 62/132 (46%), Gaps = 4/132 (3%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA    +I+      +LP   ++  S     H    +  A D  T YK    W   +VL+
Sbjct: 62  VAIKKNVIISVFDFAFLPQFYSFYHSSLLPLHITNFIAFAMDKRT-YKTLQDWGIPSVLL 120

Query: 149 P--PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL-QG 205
               +  + ++  +GS  F    + +   +LH+L+  YN + +DVD+V+ K+PF YL  G
Sbjct: 121 ELDISISTSSSSDYGSYAFATKANMKTLAVLHVLQCNYNPLLSDVDVVYFKNPFEYLGTG 180

Query: 206 DHDVYFTDDMAA 217
           + D+    + +A
Sbjct: 181 NWDIQIQREESA 192


>gi|71659866|ref|XP_821653.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70887037|gb|EAN99802.1| hypothetical protein Tc00.1047053508461.84 [Trypanosoma cruzi]
          Length = 714

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 55/275 (20%), Positives = 104/275 (37%), Gaps = 30/275 (10%)

Query: 90  AKNGTIIVCAVSQPYLPFLNNWLISIS-RQKHQDQVLVIAEDYATLYKVNGRWP--GHAV 146
            +NG + V   +  +   L+N++ S + R K ++ ++   +D A L  ++ R P     +
Sbjct: 395 GENGFVTVVIFNSFWRDQLHNFVYSFAKRAKMRNLIVASVDDTALLLCLSFRLPCLNATL 454

Query: 147 LVPPAPDSQTA-HKFGSQGFF------NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
            V P   ++       S+G F       F+  +P   + +L  GY  M  D+D+ W + P
Sbjct: 455 FVEPEKGTEKGGDNASSKGGFTRKVTEEFSWVKPRLAIAVLRRGYTFMLADLDITWNRSP 514

Query: 200 FPYLQGDH-DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW 258
            PYL  +  D+    D      ++    +  P      Y    M +       +  M+ +
Sbjct: 515 MPYLLKNRLDLVHQCDSRGRLSINSGLYMARPNSCNLRYFQDLMSFRTDESADQNAMRLF 574

Query: 259 IEELQAEPWSKAKKANDQPAFNWALN-----KTAGQVDLYLLPQSAFPTGGLYFKNQTWV 313
           +       +      + Q    W  N     K  G V L          G   F+ + + 
Sbjct: 575 MN------YDHVHGVSQQCLPKWDFNMKCNYKVEGSVRL--------EDGRQTFRWRPYP 620

Query: 314 EETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDD 348
                K V +H   ++G + KI+ F+    W +D+
Sbjct: 621 LNENPKWVAMHATCLSGAKDKIRYFKAIKAWFLDE 655


>gi|219130406|ref|XP_002185357.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403272|gb|EEC43226.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 647

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 61/286 (21%), Positives = 107/286 (37%), Gaps = 34/286 (11%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISI-SRQKHQDQVLVI-----AEDYATLYKVNGRWP 142
           +A   T+IV  V+      L N++ +  SR      V+V      + D AT   +   + 
Sbjct: 323 IAIENTVIVMVVNFGQTELLMNFVCAAKSRSLDLSNVIVFTTDQESTDLATSLGLTAYYD 382

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
                  P   S+ A ++G + F      +   +  +  LGY++++ DVD+VW  +P  Y
Sbjct: 383 QRNFGEIP---SEAARRYGDRRFTAMMMAKVICVQLVSMLGYDLLFQDVDIVWFSNPLEY 439

Query: 203 LQGDH---DVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWI 259
                   D++F DD A      HS    P       Y        R    + L++   I
Sbjct: 440 FAHADPGMDMFFQDDGA------HSTRYAPYSANSGLYFVRHNRRTRHFLTSLLMVGDLI 493

Query: 260 EELQAEPWSKAKKANDQPA-FNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKG 318
            +  +   +     N+  + F   +   A + D        FP G  Y  NQ   +  + 
Sbjct: 494 VKTDSHQQALVATLNEHVSLFGLRVKVLARETD-------EFPGG--YHWNQRSGKYMRA 544

Query: 319 KH------VIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
            +      V+ H ++    + K+   R  G W V +H VE    ++
Sbjct: 545 FYSGDVHPVLFHMSWTLNKKDKLLYLRQMGEWYVQEHCVEKKRSQI 590


>gi|328769745|gb|EGF79788.1| hypothetical protein BATDEDRAFT_89199 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 885

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 53/226 (23%), Positives = 93/226 (41%), Gaps = 29/226 (12%)

Query: 75  AKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQV-----LVIAE 129
           A  R   L +  S +A N  ++V  V++ +L    N   S+      + +     L + E
Sbjct: 557 ASVRSKCLKEFGSKIAHNNELVVVPVNRAFLVQAANLRCSLLHAGIDNLIYWSLDLEVHE 616

Query: 130 DYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYN 189
              +  K+       ++ +   P +      G+  F      +P  L  +L+ G+NV Y 
Sbjct: 617 SLISKGKL-------SIFLAGFPSTSERQDPGTPYFIKMMRYKPKVLAMLLDAGFNVWYM 669

Query: 190 DVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHS----HDLPPPGKKGRTYICSCMIYL 245
           D D + L+D  P +  D  V+    +   K LD S    H +PP    G       ++YL
Sbjct: 670 DADTIALQDFRPEIVADTSVHIHIALDNDKRLDMSTFLKHSVPPLASAG-------IMYL 722

Query: 246 RPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL-NKTAGQV 290
           +    +KL +K  + ++QA+    +   +DQ A    + NKT  Q 
Sbjct: 723 QNHPSSKLFIKS-VLDMQAQ----SILLDDQEAMRRVIHNKTLVQT 763


>gi|226496894|ref|NP_001150786.1| regulatory protein [Zea mays]
 gi|195641824|gb|ACG40380.1| regulatory protein [Zea mays]
          Length = 332

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 1/95 (1%)

Query: 122 DQVLVIAEDYATLYKVNGRWP-GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
           D VLV+A D           P  + + V  A    +   F S  +      +      +L
Sbjct: 142 DHVLVVAVDAMAFVHCKAVHPHCYRLEVDSATYLSSESSFLSAAYVELVWAKLSLQQRVL 201

Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM 215
           ELGYN ++ DVD+VWL++PF ++    D+  + D+
Sbjct: 202 ELGYNFLFTDVDVVWLRNPFRHISVYPDMTTSSDI 236


>gi|359492122|ref|XP_003634366.1| PREDICTED: uncharacterized protein LOC100255856 isoform 2 [Vitis
           vinifera]
          Length = 428

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 62/266 (23%), Positives = 102/266 (38%), Gaps = 34/266 (12%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA +  +IV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 145 LAKILEEVAVSKELIVALANSNVKSSLEVWFANIKRVGIPNYLVVALDDDIENFCKSNNV 204

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PD        S G    +  +   L   L+LGY+V+ +D+D+V+L++PF 
Sbjct: 205 P----VYKRDPDEGIDSVARSGGNHAVSGLKFQILREFLQLGYSVLLSDIDIVYLQNPFD 260

Query: 202 YLQGDHDV-YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI--------YLRPTDGAK 252
           YL  D DV   TD         ++     P      Y  +  I        Y+RPT  + 
Sbjct: 261 YLYRDSDVESMTDGHNNYTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPSI 320

Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFP 301
            ++ +  + L         KA DQ  FN             L+ +   +D YL   S   
Sbjct: 321 ELLDRVADRLAHS------KAWDQAVFNEELFFPSHPGYTGLHASRRTMDFYLFMNSKV- 373

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNY 327
              L+   +   +  K K VI+H NY
Sbjct: 374 ---LFKTVRKDAKLKKLKPVIVHVNY 396


>gi|125558021|gb|EAZ03557.1| hypothetical protein OsI_25693 [Oryza sativa Indica Group]
          Length = 261

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           ILELG++ ++ DVD++W + PFP+L  D  V  + D     P
Sbjct: 90  ILELGFSFLFTDVDILWFRSPFPHLSPDAQVVMSSDFFVGDP 131


>gi|413951496|gb|AFW84145.1| regulatory protein [Zea mays]
          Length = 383

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 12/121 (9%)

Query: 94  TIIVCAVSQPYLPFLNNWLISISRQKHQ---------DQVLVIAEDYATLYKVNGRWP-G 143
           T+I+ +V++ +     N L+ + R+  +         D VLV+A D           P  
Sbjct: 107 TVILTSVNEAFAR--PNSLLGLFRESFRAGEGTEHLLDHVLVVAVDAMAFVHCKAVHPHC 164

Query: 144 HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL 203
           + + V  A    +   F S  +      +      +LELGYN ++ DVD+VWL++PF ++
Sbjct: 165 YRLEVDSATYLSSESSFLSAAYVELVWAKLSLQQRVLELGYNFLFTDVDVVWLRNPFRHI 224

Query: 204 Q 204
            
Sbjct: 225 S 225


>gi|242082105|ref|XP_002445821.1| hypothetical protein SORBIDRAFT_07g026310 [Sorghum bicolor]
 gi|241942171|gb|EES15316.1| hypothetical protein SORBIDRAFT_07g026310 [Sorghum bicolor]
          Length = 375

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 11/88 (12%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
           ILELG++ ++ DVD++W ++P  ++    DV    D     P D  H+LP  G       
Sbjct: 203 ILELGFSFLFTDVDILWFRNPMRHIAVTSDVAIASDYFNGNP-DSLHNLPNGG------- 254

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEP 266
               +Y+R  +      ++W E     P
Sbjct: 255 ---FLYVRSMNRTVEFYRRWREARAGFP 279


>gi|356561841|ref|XP_003549185.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 527

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 67/161 (41%), Gaps = 19/161 (11%)

Query: 91  KNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
           K+ T+I+  ++  +         FL ++ +    +K  + ++VI  D     K + R   
Sbjct: 85  KDKTVIITTLNDAWAKPGSIFDLFLESFRLGNQTKKFLNHLVVITLDQ----KAHARCLA 140

Query: 144 ---HAVLVPPAPDSQTAHKF-GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
              H   +    D+ T   F  +  +     RR   L  +L++GYN ++ D D++WL+DP
Sbjct: 141 LHKHCYQLETKGDNFTGEAFFMTADYLQMMWRRIEFLGTVLDMGYNFVFTDTDVMWLRDP 200

Query: 200 FPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
           F     D D     D        +SHDL      G  Y+ S
Sbjct: 201 FKLFYKDVDFQIACDFFN----GNSHDLNNFPNGGFNYVKS 237


>gi|225458348|ref|XP_002283221.1| PREDICTED: uncharacterized protein LOC100255856 isoform 1 [Vitis
           vinifera]
          Length = 434

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 62/266 (23%), Positives = 102/266 (38%), Gaps = 34/266 (12%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA +  +IV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 151 LAKILEEVAVSKELIVALANSNVKSSLEVWFANIKRVGIPNYLVVALDDDIENFCKSNNV 210

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PD        S G    +  +   L   L+LGY+V+ +D+D+V+L++PF 
Sbjct: 211 P----VYKRDPDEGIDSVARSGGNHAVSGLKFQILREFLQLGYSVLLSDIDIVYLQNPFD 266

Query: 202 YLQGDHDV-YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI--------YLRPTDGAK 252
           YL  D DV   TD         ++     P      Y  +  I        Y+RPT  + 
Sbjct: 267 YLYRDSDVESMTDGHNNYTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPSI 326

Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFP 301
            ++ +  + L         KA DQ  FN             L+ +   +D YL   S   
Sbjct: 327 ELLDRVADRLAHS------KAWDQAVFNEELFFPSHPGYTGLHASRRTMDFYLFMNSKV- 379

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNY 327
              L+   +   +  K K VI+H NY
Sbjct: 380 ---LFKTVRKDAKLKKLKPVIVHVNY 402


>gi|388519423|gb|AFK47773.1| unknown [Medicago truncatula]
          Length = 429

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 74/172 (43%), Gaps = 30/172 (17%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV-YFTDDMAAVKPLDHSHDLPPPGK-- 232
           L   L+LGY+V+ +D+D+V+L++PF +L  D DV   +D    +    ++     PG   
Sbjct: 236 LREFLQLGYSVLLSDIDIVYLQNPFDHLYRDSDVESMSDGHNNMTAYGYNDVFNDPGMGW 295

Query: 233 -----KGRTYICSC-MIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNW----- 281
                  R ++ +    Y+RPT  +  ++ +    L  E      KA DQ  FN      
Sbjct: 296 SSGVFTRRIFVYNAGFFYIRPTIPSIELLDRVAARLLKE------KAWDQVVFNEELFHP 349

Query: 282 ------ALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                  L+     +D+YL   S      L+   +     +K K VIIH NY
Sbjct: 350 SYPGYDGLHAAKRTMDIYLFMNSRV----LFKTVRNNANLSKLKPVIIHLNY 397


>gi|302142458|emb|CBI19661.3| unnamed protein product [Vitis vinifera]
          Length = 446

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 62/266 (23%), Positives = 102/266 (38%), Gaps = 34/266 (12%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA +  +IV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 151 LAKILEEVAVSKELIVALANSNVKSSLEVWFANIKRVGIPNYLVVALDDDIENFCKSNNV 210

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFP 201
           P    +    PD        S G    +  +   L   L+LGY+V+ +D+D+V+L++PF 
Sbjct: 211 P----VYKRDPDEGIDSVARSGGNHAVSGLKFQILREFLQLGYSVLLSDIDIVYLQNPFD 266

Query: 202 YLQGDHDV-YFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI--------YLRPTDGAK 252
           YL  D DV   TD         ++     P      Y  +  I        Y+RPT  + 
Sbjct: 267 YLYRDSDVESMTDGHNNYTAYGYNDVFDEPAMGWARYAHTMRIWVYNSGFFYIRPTIPSI 326

Query: 253 LVMKKWIEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFP 301
            ++ +  + L         KA DQ  FN             L+ +   +D YL   S   
Sbjct: 327 ELLDRVADRLAHS------KAWDQAVFNEELFFPSHPGYTGLHASRRTMDFYLFMNSKV- 379

Query: 302 TGGLYFKNQTWVEETKGKHVIIHNNY 327
              L+   +   +  K K VI+H NY
Sbjct: 380 ---LFKTVRKDAKLKKLKPVIVHVNY 402


>gi|449442485|ref|XP_004139012.1| PREDICTED: uncharacterized protein At1g28695-like [Cucumis sativus]
 gi|449505304|ref|XP_004162430.1| PREDICTED: uncharacterized protein At1g28695-like [Cucumis sativus]
          Length = 358

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 12/125 (9%)

Query: 92  NGTIIVCAVSQPYLP------------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
           N T+I+  V++ Y              FL+++ +    +     +L++  D     +   
Sbjct: 78  NKTVIIAVVNKAYANQETGAVTTMLDVFLDSFWLGEGTRPLVKHILLVTVDQTAYDRCQF 137

Query: 140 RWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDP 199
                  LV    D      + S+ F     RR   LL +L+ GYN ++ D D++WL++P
Sbjct: 138 LHLNCFRLVTDGVDFGGEKLYMSEDFIKMMWRRTQFLLEVLKRGYNFIFTDTDVMWLRNP 197

Query: 200 FPYLQ 204
           F  L 
Sbjct: 198 FTKLS 202


>gi|125599894|gb|EAZ39470.1| hypothetical protein OsJ_23899 [Oryza sativa Japonica Group]
          Length = 261

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           ILELG++ ++ DVD++W + PFP+L  D  V  + D     P
Sbjct: 90  ILELGFSFLFTDVDILWFRSPFPHLSPDAQVVMSSDFFVGDP 131


>gi|224002126|ref|XP_002290735.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974157|gb|EED92487.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 561

 Score = 42.7 bits (99), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 61/285 (21%), Positives = 107/285 (37%), Gaps = 28/285 (9%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISI-SRQKHQDQVLVIAEDYATLYKVNGRWPGH--- 144
           + K  T++V  V++     L N++ +  SR      VLV   D  +     G    +   
Sbjct: 239 IVKENTVVVMTVNKGQSLLLTNFVCNAHSRGFDISNVLVFPTDEESRKLAEGLGLAYYYD 298

Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
            + +   P+ +  + +G   F      +   + +I  LGY+V++ DVD+ WL+DP  +  
Sbjct: 299 EINLGHMPEKEATY-YGDDTFAAMMFAKILCVYYINLLGYDVLFQDVDITWLRDPLEFFH 357

Query: 205 GDHDVYFTDDMAAVKPLD--HSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
              +       AAV+  D    HD  P  +       S   Y+R     + +   ++   
Sbjct: 358 NKTN-------AAVQSYDIAFQHDGSPQPRFAPYSANSGFYYVRWNKRTQYLFTSFLYHG 410

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAGQVDLYL-----LPQSAFPTGGLYFKNQTWVEET- 316
                  A  A+ Q      L + A    L +     +    FP GG +F     V ++ 
Sbjct: 411 DL----VATTASHQQVLVELLLEHASMFGLKVKVFDKVETDMFP-GGYHFHRDPAVMKSI 465

Query: 317 ---KGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
              K    I H ++      K+   R  G W V D  VE+ +  +
Sbjct: 466 VTGKSNAYIFHMSWTENKSNKLLFLRQLGEWYVHDKCVETSVTDI 510


>gi|307108280|gb|EFN56520.1| hypothetical protein CHLNCDRAFT_144132 [Chlorella variabilis]
          Length = 690

 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 4/125 (3%)

Query: 86  ASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNG-RWPGH 144
           AS     GT++V  ++     F  NWL  + +    D  +V A D AT  ++     P  
Sbjct: 103 ASRAGTGGTVMVAVMNSAQWDFGLNWLHHV-KAAGIDFYVVAAADQATSERLAAASEPCF 161

Query: 145 AVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
                 AP    A  +G +G+   T  +   L  +++ G+N++ +DVD+VW +DP   + 
Sbjct: 162 EWFDDEAPKLGLA--WGQEGWRRMTWSKVFVLAAVVDYGFNLVVSDVDVVWFRDPRTLMA 219

Query: 205 GDHDV 209
              DV
Sbjct: 220 AHPDV 224


>gi|412988791|emb|CCO15382.1| glycosyltransferase family 77 protein [Bathycoccus prasinos]
          Length = 841

 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 70/315 (22%), Positives = 114/315 (36%), Gaps = 86/315 (27%)

Query: 94  TIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW-------PGHAV 146
            I V   +     F  NW+ +  R K +    V A D   L  +   W        G++V
Sbjct: 209 AIFVTFATHSVRDFAKNWVNAARRLKLEPH-FVGALDEKMLEDLKS-WNVPSMLLTGNSV 266

Query: 147 LVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ-- 204
           L        TA   GS  F    + +   +  +L++  N + +D D+ W++DP  Y    
Sbjct: 267 LANRGVQFITA---GSAAFKKMGTVKTKFVQDLLDMNLNPILSDADVAWMRDPRAYFNKG 323

Query: 205 --GDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC------SCMIYLRPTDGAKLVMK 256
             G  D+  + D   V           PG K  +  C      + ++++R T+ +K  ++
Sbjct: 324 TYGKADILVSTDCIDV-----------PGDKDDSNKCAHVNFNTGILHIRATEESKNFLQ 372

Query: 257 KWIEELQAE--PWSKAKKANDQPAFN---------------------------------W 281
            W  ++      W +     DQPAFN                                 +
Sbjct: 373 AWKTKVATSTIAWMR-----DQPAFNLLMHEGVRGHALENAIKLKDMADRGKEGFRMVYY 427

Query: 282 ALNKTAGQVDLYLLPQSAFPTGGLYF---KNQTWVEETKGKHVIIHNNYITGFE-----K 333
           A N T   V L +LP   F  G  YF    ++ + E+  G+   +H  Y  G +      
Sbjct: 428 AANAT---VKLGVLPNWLFGNGHSYFVQWHHKQFPED--GEPYAVHTTYQYGDDGQYAWG 482

Query: 334 KIKRFRDFGLWLVDD 348
           K +R R  G+W  DD
Sbjct: 483 KRERMRQAGIWTADD 497


>gi|42562383|ref|NP_174191.2| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
 gi|10764853|gb|AAG22833.1|AC007508_6 F1K23.10 [Arabidopsis thaliana]
 gi|60547599|gb|AAX23763.1| hypothetical protein At1g28700 [Arabidopsis thaliana]
 gi|71905439|gb|AAZ52697.1| expressed protein [Arabidopsis thaliana]
 gi|332192895|gb|AEE31016.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
          Length = 338

 Score = 42.4 bits (98), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 30/142 (21%), Positives = 59/142 (41%), Gaps = 11/142 (7%)

Query: 89  VAKNGTIIVCAVSQPYLP-----------FLNNWLISISRQKHQDQVLVIAEDYATLYKV 137
              N T+I+  V++ Y+            FL ++      +   + ++V+A D     + 
Sbjct: 64  TGNNKTVIIAMVNKAYVEEDGGGRTMLDLFLESFWEGEGTRPLLNHLMVVAADQTAYDRC 123

Query: 138 NGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLK 197
             R      +     D +    + S+ F     RR   LL +L  GY++++ D D++WL+
Sbjct: 124 LFRRLHCYKMDTEGVDLEGEKVYMSKDFIEMMWRRTRLLLDVLSRGYHIIFTDTDVMWLR 183

Query: 198 DPFPYLQGDHDVYFTDDMAAVK 219
            P   L    D++ + D   V+
Sbjct: 184 SPLSRLNVSLDMHISVDRNNVR 205


>gi|359491833|ref|XP_002270793.2| PREDICTED: uncharacterized protein At4g15970-like [Vitis vinifera]
          Length = 346

 Score = 42.4 bits (98), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 49/215 (22%), Positives = 83/215 (38%), Gaps = 21/215 (9%)

Query: 41  TTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSL------AKWRD-YTLSQAASFVA-KN 92
           TTLL  +  + V      +   P F F N+  S       AK ++ Y L +     A  +
Sbjct: 15  TTLLRRIVKITVFTA--AFVVVPCFFFYNSAFSFQFLPISAKGKEGYKLDKILKNAAMGD 72

Query: 93  GTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHA 145
            T+I+  V++ +         FL ++ I  + Q+  + +++I  D     +     P   
Sbjct: 73  KTVILTTVNEAWAANNSLLDLFLESFRIGNNTQRLLNHLVIITLDPKAYARCTTLHPHCY 132

Query: 146 VLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
            L     D      F S  +     RR   L  +L++ YN ++ D D++W +DPF     
Sbjct: 133 ALKTKEMDFSKEAFFMSHDYLEMMWRRIDFLRSVLKMRYNFIFTDADIMWFRDPFQRFDS 192

Query: 206 DHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
             D     D        +S D+      G TY+ S
Sbjct: 193 KADFQIACDYFN----GNSSDVNNSPNGGFTYVKS 223


>gi|357126542|ref|XP_003564946.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 365

 Score = 42.4 bits (98), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 6/41 (14%)

Query: 179 ILELGYNVMYNDVDMVWLKDPF------PYLQGDHDVYFTD 213
           ILELGYN ++ DVD++W +DPF       ++    D YF D
Sbjct: 194 ILELGYNFLFTDVDIMWFRDPFERMSVAAHMVTSSDFYFGD 234


>gi|453081046|gb|EMF09096.1| hypothetical protein SEPMUDRAFT_144857 [Mycosphaerella populorum
           SO2202]
          Length = 227

 Score = 42.4 bits (98), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 51/229 (22%), Positives = 86/229 (37%), Gaps = 59/229 (25%)

Query: 172 RPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD------VYFTDDMAAVKPLDHSH 225
           RP   + IL  G +++  D D +W ++PF  +   HD      VY TD     +  D   
Sbjct: 4   RPKFFMKILSAGLDILMIDADTIWWQNPFSIVPDSHDRPAVDIVYSTDAREFYQDHDAFR 63

Query: 226 D-------LPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA-----KKA 273
           D       +PP        IC+ + +++ T  A + + K +  +   PW KA        
Sbjct: 64  DVWRRGPFVPP--------ICNGIFWMKST-TATISLWKDMLAIFETPWYKAFFRPKGFQ 114

Query: 274 NDQPAFNWALNKTAGQV-------------------------DLYLLPQSAFPTGGL--- 305
           +DQ   +  LN    QV                          + LL Q++   G L   
Sbjct: 115 DDQRGMDVLLNHGRAQVVAPFPEGIHEDQVPTAAISSSDAVLKVRLLDQTSIANGQLLMN 174

Query: 306 ----YFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHA 350
               Y +N   + ++ G  +  H N+ T    K    ++ GL L+D++ 
Sbjct: 175 RQSRYAENLAKLRKSGGDRIAAHFNWDTTVVSKEDGAKELGLVLLDENG 223


>gi|219130309|ref|XP_002185310.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403225|gb|EEC43179.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 594

 Score = 42.4 bits (98), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 68/166 (40%), Gaps = 28/166 (16%)

Query: 69  NATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQ----V 124
           N T +L K R       A+ VA + TI+V   +      L N+  + +R +  D     V
Sbjct: 250 NITQTLDKLR-----PIAASVAVDNTIVVMVCNHGQSELLLNFACA-ARARGLDTALEAV 303

Query: 125 LVIAEDYATLYKVNGRWPGHAVLVPPAP----DSQTAHKFGSQGFFNFTSRRPCHLLHIL 180
           LV A D  T     G   G +V   P        + A  +    F      +   +  + 
Sbjct: 304 LVFATDEETRDLAIGL--GLSVFYDPVVFGEMPKEAARAYADVKFRAMMMAKVYCVQLVS 361

Query: 181 ELGYNVMYNDVDMVWLKDPFPYLQGD------------HDVYFTDD 214
            LGY++++ DVD+VWL++P  Y   D            +DVYF DD
Sbjct: 362 MLGYDLLFQDVDIVWLRNPLEYFHNDTSSANDEVSPDYYDVYFQDD 407


>gi|255538576|ref|XP_002510353.1| conserved hypothetical protein [Ricinus communis]
 gi|223551054|gb|EEF52540.1| conserved hypothetical protein [Ricinus communis]
          Length = 430

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 71/172 (41%), Gaps = 29/172 (16%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV-YFTDDMAAVKPLDHSHDLPPPGKKG 234
           L   L+LGY+V+ +DVD+V+L++PF +L  D DV   TD    +    ++     P    
Sbjct: 236 LREFLQLGYSVLLSDVDIVYLQNPFDHLYRDSDVESMTDGHNNMTAYGYNDVFDEPAMGW 295

Query: 235 RTYICSCMI--------YLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN------ 280
             Y  +  I        Y+RPT  +  ++ +  + L  +P S      DQ  FN      
Sbjct: 296 ARYAHTMRIWVYNSGFFYIRPTIPSIELLDRVADRLSRQPNSW-----DQAVFNEELFFP 350

Query: 281 -----WALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNY 327
                  L+     +D Y+   S      L+   +      K K VI+H NY
Sbjct: 351 SHPGYEGLHAAKRTMDFYMFMNSKV----LFKTVRKDANLKKLKPVIVHVNY 398


>gi|224134266|ref|XP_002321777.1| predicted protein [Populus trichocarpa]
 gi|222868773|gb|EEF05904.1| predicted protein [Populus trichocarpa]
          Length = 284

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 30/137 (21%), Positives = 61/137 (44%), Gaps = 12/137 (8%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYA 132
           L++A++   +N T+I+  V++ Y+          FLN +    + +   + +L++  D A
Sbjct: 7   LAEAST---ENRTVIIAMVNKAYVEGDDKSMLDLFLNGFWFGENTRDLVNHLLLVNVDQA 63

Query: 133 TLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
           +  +          L     +      + S  F     RR   L  +L  GYN ++ D D
Sbjct: 64  SYERCKFLRLHCYKLETDGVEFDREEVYMSNEFIKMMWRRTFFLGEVLVRGYNFIFTDTD 123

Query: 193 MVWLKDPFPYLQGDHDV 209
           ++WL++PF  L  + ++
Sbjct: 124 VLWLRNPFQRLSFNENI 140


>gi|118484315|gb|ABK94035.1| unknown [Populus trichocarpa]
          Length = 435

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 57/127 (44%), Gaps = 6/127 (4%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           +A    +IV   +      L  W  SI +    + ++V  +D    +  +   P    + 
Sbjct: 160 IAVGKELIVALANSNVKDMLEVWFKSIQKVGIPNYLVVALDDEIAKFCESSDVP----VY 215

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
              PD +     G  G  +  S    H+L   L+LGY+V+ +DVD+V+L++PF YL  D 
Sbjct: 216 KRDPD-KGIDSVGKTGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPFHYLYRDS 274

Query: 208 DVYFTDD 214
           DV    D
Sbjct: 275 DVESMSD 281


>gi|222619772|gb|EEE55904.1| hypothetical protein OsJ_04572 [Oryza sativa Japonica Group]
          Length = 355

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 17/83 (20%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDM---AAVKPLDHSHDLPPPGKKGR 235
           ILELGYN ++ DVD++W +DPF  +     +  + D     A  P     + P  G    
Sbjct: 184 ILELGYNFLFTDVDILWFRDPFEQMSMAAHMVTSSDFFVGGAYNPA----NFPNTG---- 235

Query: 236 TYICSCMIYLRPTDGAKLVMKKW 258
                  +Y+R +  A  VM+ W
Sbjct: 236 ------FLYVRSSRRAVGVMEAW 252


>gi|303276088|ref|XP_003057338.1| hypothetical protein MICPUCDRAFT_62376 [Micromonas pusilla
           CCMP1545]
 gi|226461690|gb|EEH58983.1| hypothetical protein MICPUCDRAFT_62376 [Micromonas pusilla
           CCMP1545]
          Length = 727

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 4/120 (3%)

Query: 161 GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKP 220
           G + F    S +   +  +LEL    +  D D+ WL+DP PY          D + +   
Sbjct: 185 GDEHFKKMGSVKTKFIQDLLELNIAPILTDADVTWLRDPRPYFN-KGTYALADALVSTDC 243

Query: 221 LDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
           +D   D             + +++ RP++ +K  ++ W  ++ +   S      DQPAFN
Sbjct: 244 IDIPGDARDENACAHVNFNTGVLHFRPSNASKAFVETWKNKVAS---STIAWMRDQPAFN 300


>gi|224103449|ref|XP_002313061.1| predicted protein [Populus trichocarpa]
 gi|222849469|gb|EEE87016.1| predicted protein [Populus trichocarpa]
          Length = 435

 Score = 42.0 bits (97), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 57/127 (44%), Gaps = 6/127 (4%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           +A    +IV   +      L  W  SI +    + ++V  +D    +  +   P    + 
Sbjct: 160 IAVGKELIVALANSNVKDMLEVWFKSIQKVGIPNYLVVALDDEIAKFCESNDVP----VY 215

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDH 207
              PD +     G  G  +  S    H+L   L+LGY+V+ +DVD+V+L++PF YL  D 
Sbjct: 216 KRDPD-KGIDSVGKTGGNHAVSGLKFHILREFLQLGYSVLLSDVDIVYLQNPFHYLYRDS 274

Query: 208 DVYFTDD 214
           DV    D
Sbjct: 275 DVESMSD 281


>gi|302849316|ref|XP_002956188.1| hypothetical protein VOLCADRAFT_107128 [Volvox carteri f.
           nagariensis]
 gi|300258491|gb|EFJ42727.1| hypothetical protein VOLCADRAFT_107128 [Volvox carteri f.
           nagariensis]
          Length = 324

 Score = 42.0 bits (97), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 15/114 (13%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVY-FTD--DMAAVKPLDHSHDLPPPG- 231
           +   L LGY+V+ +DVD+V L++PF +L  D DV   +D  D       D  HD P  G 
Sbjct: 131 IREFLSLGYSVLLSDVDIVTLQNPFDHLYRDEDVEALSDGYDEQTAYGWDDVHDDPKMGW 190

Query: 232 ----KKGRTY-ICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
                  R + + S + Y+RP +    +M +  + L        +K  DQ  FN
Sbjct: 191 SRWAHTIRVFTLNSGLFYIRPNERTIGLMDRITDRLN------KRKEWDQAVFN 238


>gi|357122173|ref|XP_003562790.1| PREDICTED: uncharacterized protein LOC100830494 [Brachypodium
           distachyon]
          Length = 424

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 64/278 (23%), Positives = 110/278 (39%), Gaps = 43/278 (15%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA    +IV   +      L  W  +I R    + ++V  +D    +  +   P    + 
Sbjct: 148 VAVKKELIVAVANSNVKQTLEMWFTNIKRVGISNYLVVALDDSVESFCKSKDVP----VY 203

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD        + G    ++ +   L   L+LGY+V+ +D+D+++ ++PF +L  D D
Sbjct: 204 RRDPDEGIDSIGKTGGNHAVSALKFQILREFLQLGYSVLLSDIDIMFFQNPFDHLHRDSD 263

Query: 209 VYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
           +    D    M A    +   D P  G        R ++  S   ++RPT  +  ++ + 
Sbjct: 264 IESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 322

Query: 259 IEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSA--FPT--G 303
              L  EP     K+ DQ  FN             L+ +   +D+YL   S   F T   
Sbjct: 323 AGRLSREP-----KSWDQAVFNEELFFPSHPGYEGLHASKRTMDIYLFMNSKVLFKTVRK 377

Query: 304 GLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDF 341
             Y +N         K VI+H NY    E+++K   +F
Sbjct: 378 DAYLRNL--------KPVIVHLNYHPDKEERMKAVIEF 407


>gi|414869642|tpg|DAA48199.1| TPA: hypothetical protein ZEAMMB73_139999 [Zea mays]
          Length = 573

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 82/196 (41%), Gaps = 31/196 (15%)

Query: 49  LLVVLGVILPWTGTPGFMFPNATSSL----AKW----RDYTLSQAASF-----VAKNG-- 93
           LL V+   L    T  F+F  AT+ +    A W    R     +A SF     V KN   
Sbjct: 16  LLPVISFFLGAALTAAFVFLGATTDVSWRFAAWGSGARPGAGDEAKSFAELAEVLKNASM 75

Query: 94  ---TIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPG 143
              T+IV +++Q Y         FL+++          D +L++A D A L         
Sbjct: 76  EDKTVIVTSINQAYAAPGSLLDLFLDSFRAGEGTAGLLDHLLIVAVDPAALETCRS-VHR 134

Query: 144 HAVLVPPAPDSQTAHK-----FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKD 198
           H  L+ PA  +  A       F ++ + +    R      ILELG++ ++ DVD++W ++
Sbjct: 135 HCYLLRPAAGAADADLGAAKFFMTKDYLDMMWARNRLQQTILELGFSFLFTDVDILWFRN 194

Query: 199 PFPYLQGDHDVYFTDD 214
           P  ++    DV    D
Sbjct: 195 PMRHIAVTSDVAIACD 210


>gi|326435846|gb|EGD81416.1| hypothetical protein PTSG_02139 [Salpingoeca sp. ATCC 50818]
          Length = 422

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 52/120 (43%), Gaps = 11/120 (9%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYI 238
            L  G++V  ND D VWL+DP+P +  +  V+  D +A              GK     +
Sbjct: 173 CLHHGFDVWMNDADAVWLRDPWPLIHRELTVHKADIVAQRGGFPFE-----IGKAWGATM 227

Query: 239 CSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQS 298
           C   +Y+R T   KLV+      LQ  P+    + +DQ   N A+   A      LLP S
Sbjct: 228 CFGFVYMRATAATKLVVAL---ALQLAPY---LERDDQLLVNSAVYLLATGRQQLLLPLS 281


>gi|356560053|ref|XP_003548310.1| PREDICTED: uncharacterized protein At4g15970-like [Glycine max]
          Length = 379

 Score = 42.0 bits (97), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 43/98 (43%), Gaps = 5/98 (5%)

Query: 144 HAVLVPPAPDSQTAHKF-GSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
           H   V    D+ T   F  +  + +   RR   L  +L++GYN ++ D D++WL+DPF  
Sbjct: 148 HCYQVETKGDNFTGEAFFMTADYLHMMWRRIEFLGTVLDMGYNFVFTDTDIMWLRDPFKL 207

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
              D D     D        +++DL      G  Y+ S
Sbjct: 208 FYKDADFQIACDFFN----GNTYDLNNSPNGGFNYVKS 241


>gi|115447979|ref|NP_001047769.1| Os02g0686300 [Oryza sativa Japonica Group]
 gi|113537300|dbj|BAF09683.1| Os02g0686300 [Oryza sativa Japonica Group]
          Length = 393

 Score = 41.6 bits (96), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 32/62 (51%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D     +F + G+ +   +R   L  +LE GY+ +++D D+ W ++PFP+   D D    
Sbjct: 190 DFSQEKRFLTAGYLDMMWKRLDFLRLVLEKGYSFIFSDADITWFRNPFPHFYPDGDFQIA 249

Query: 213 DD 214
            D
Sbjct: 250 CD 251


>gi|222623466|gb|EEE57598.1| hypothetical protein OsJ_07965 [Oryza sativa Japonica Group]
          Length = 370

 Score = 41.6 bits (96), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 4/88 (4%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D     +F + G+ +   +R   L  +LE GY+ +++D D+ W ++PFP+   D D    
Sbjct: 190 DFSQEKRFLTAGYLDMMWKRLDFLRLVLEKGYSFIFSDADITWFRNPFPHFYPDGDF--- 246

Query: 213 DDMAAVKPLDHSHDLPPPGKKGRTYICS 240
             +A    + ++ DL      G  Y+ S
Sbjct: 247 -QIACDHYVGNATDLGNIANGGFNYVRS 273


>gi|218191377|gb|EEC73804.1| hypothetical protein OsI_08508 [Oryza sativa Indica Group]
          Length = 496

 Score = 41.6 bits (96), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 32/62 (51%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D     +F + G+ +   +R   L  +LE GY+ +++D D+ W ++PFP+   D D    
Sbjct: 190 DFSQEKRFLTAGYLDMMWKRLDFLRLVLEKGYSFIFSDADITWFRNPFPHFYPDGDFQIA 249

Query: 213 DD 214
            D
Sbjct: 250 CD 251


>gi|302835477|ref|XP_002949300.1| hypothetical protein VOLCADRAFT_117113 [Volvox carteri f.
           nagariensis]
 gi|300265602|gb|EFJ49793.1| hypothetical protein VOLCADRAFT_117113 [Volvox carteri f.
           nagariensis]
          Length = 336

 Score = 41.6 bits (96), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 40/171 (23%), Positives = 68/171 (39%), Gaps = 13/171 (7%)

Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVK 219
           F   GF +    +  ++L+ L LG++V++ D D+V ++DP PYL   HD      M    
Sbjct: 99  FHDTGFNSLGFAKVKYILNGLSLGHDVVFLDTDIVVMRDPLPYLL-VHDADLFGSMEKCM 157

Query: 220 PLDHSHDLPPPG----KKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKAND 275
             + S     P     +K    I   M+Y + +      +  W  E+++E  ++ +   D
Sbjct: 158 VYNDSLSFNSPEFKSLRKRPPAINIGMLYFKASASVTRCVYNWAWEMRSEVQTRPRLW-D 216

Query: 276 QPAFNWALNKTAG-------QVDLYLLPQSAFPTGGLYFKNQTWVEETKGK 319
           Q  F   + K           +D  L     FP  G  F N     +  G+
Sbjct: 217 QDIFGKVMVKCTAVHGLRFQVLDPRLFQSGCFPGCGCKFFNNEIRPDRIGR 267


>gi|212275955|ref|NP_001130696.1| hypothetical protein [Zea mays]
 gi|194689860|gb|ACF79014.1| unknown [Zea mays]
 gi|194707158|gb|ACF87663.1| unknown [Zea mays]
 gi|224030421|gb|ACN34286.1| unknown [Zea mays]
 gi|414887364|tpg|DAA63378.1| TPA: hypothetical protein ZEAMMB73_518373 [Zea mays]
          Length = 423

 Score = 41.6 bits (96), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 83/205 (40%), Gaps = 20/205 (9%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA    +IV   +      L  W  +I R    + ++V  +D    +  +   P    + 
Sbjct: 147 VAVKKELIVALANSNVKEMLEMWFTNIKRAGISNYLVVALDDSIENFCKSNDVP----VY 202

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD    +   + G    +  +   L   L+LGY+V+ +D+D+++ ++PF YL  D D
Sbjct: 203 RRDPDDGIDNIGKTGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPFDYLYRDSD 262

Query: 209 VYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
           V    D    M A    +   D P  G        R ++  S   ++RPT  +  ++ + 
Sbjct: 263 VESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 321

Query: 259 IEELQAEPWSKAKKANDQPAFNWAL 283
              L  EP     K+ DQ  FN  L
Sbjct: 322 AYRLSHEP-----KSWDQAVFNEEL 341


>gi|338731946|ref|YP_004670419.1| type 11 methyltransferase [Simkania negevensis Z]
 gi|336481329|emb|CCB87928.1| methyltransferase type 11 [Simkania negevensis Z]
          Length = 216

 Score = 41.6 bits (96), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 71/152 (46%), Gaps = 26/152 (17%)

Query: 186 VMYNDVDMVWLKDP--FPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMI 243
           +++ D D ++LK+P  F  LQ D  +   +++    PLDH      P K     + +  I
Sbjct: 66  IVWVDADAIFLKNPTLFESLQCDIALRTYEEL----PLDH------PSK-----VYTGTI 110

Query: 244 YLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTG 303
           YL   + A+ ++K+W EE +     +  +  DQ +   AL      VDL+ LP +     
Sbjct: 111 YLSNNEMARTLLKRWKEESERMLREEKNEVWDQVSIKQAL--LGSNVDLFPLPDTY---A 165

Query: 304 GLYFKNQTWVEETKGKHVIIHNNYITGFEKKI 335
            +Y K  T +EE     VI+H      F+K++
Sbjct: 166 TIYDKQLTSIEEA----VILHYQASRLFKKEV 193


>gi|242050722|ref|XP_002463105.1| hypothetical protein SORBIDRAFT_02g037900 [Sorghum bicolor]
 gi|241926482|gb|EER99626.1| hypothetical protein SORBIDRAFT_02g037900 [Sorghum bicolor]
          Length = 423

 Score = 41.6 bits (96), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 60/260 (23%), Positives = 102/260 (39%), Gaps = 35/260 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA    +IV   +      L  W  +I R    + ++V  +D    +  +   P    + 
Sbjct: 147 VAVKKELIVALANSNVKEMLEMWFTNIKRAGISNYLVVALDDNIENFCKSNDVP----VY 202

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD    +   + G    +  +   L   L+LGY+V+ +D+D+++ ++PF +L  D D
Sbjct: 203 RRDPDDGIDNIGKTGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPFDHLYRDSD 262

Query: 209 VYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
           V    D    M A    +   D P  G        R ++  S   ++RPT  +  ++ + 
Sbjct: 263 VESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 321

Query: 259 IEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFPTGGLYF 307
              L  EP     K+ DQ  FN             L+ +   +D+YL   S      L+ 
Sbjct: 322 AYRLSHEP-----KSWDQAVFNEELFFPSHPGYEGLHASKRTMDMYLFMNSKV----LFK 372

Query: 308 KNQTWVEETKGKHVIIHNNY 327
             +      K K VI+H NY
Sbjct: 373 TVRKDAHLRKLKPVIVHLNY 392


>gi|357500079|ref|XP_003620328.1| hypothetical protein MTR_6g081060 [Medicago truncatula]
 gi|355495343|gb|AES76546.1| hypothetical protein MTR_6g081060 [Medicago truncatula]
          Length = 379

 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 65/176 (36%), Gaps = 34/176 (19%)

Query: 171 RRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP 230
           RR   L  +L LGY+ ++ D D++WL+DPF     D D     D       D  H+ P  
Sbjct: 183 RRIEFLGAVLHLGYSFVFTDTDIMWLRDPFKQFYKDADFQIACDYFNGNSYD-MHNYPNG 241

Query: 231 GKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNK----- 285
           G            Y++         K W    +  P       +DQ      LNK     
Sbjct: 242 G----------FTYVKSNRRTIWFYKFWFNSRETYP-----TMHDQD----VLNKIKMHP 282

Query: 286 --TAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFR 339
             T  ++ +  L  S F  GG    ++ +     GK + +H N   G E K+   +
Sbjct: 283 LITNKKLKIRFLSTSYF--GGFCQSSKDF-----GKALTMHANCCVGLENKVNDLK 331


>gi|159470725|ref|XP_001693507.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283010|gb|EDP08761.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 305

 Score = 41.2 bits (95), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 53/228 (23%), Positives = 89/228 (39%), Gaps = 34/228 (14%)

Query: 122 DQVLVI----AEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFG--SQGFFNFTSRRPCH 175
           DQ LV+        A    +N R+ GH   V  A  +     FG    GF +    +  +
Sbjct: 34  DQHLVLVAWSGAALAACQALNSRY-GHTC-VRDAEHTAATGSFGFHDAGFNSLGFAKIKY 91

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQG-DHDVYFTDDMAAVKPLDHS---------H 225
           +L+ L  G++V++ D D++ L+DP PY  G   D++ + +   + P   S         H
Sbjct: 92  ILNGLSAGHDVVFLDTDIIVLQDPLPYFLGRGADMWGSMEKCMIYPDRMSFHSTEFQAFH 151

Query: 226 DLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNK 285
              PP   G       ++Y + T G    +  W+ E+  E  ++  +  DQ  +   + K
Sbjct: 152 KKLPPINIG-------VLYFKATAGVTRCVYSWLMEMYTEVLNR-PRVWDQDLYGKVMVK 203

Query: 286 TAG-------QVDLYLLPQSAFPTGGLYFKNQTWVE-ETKGKHVIIHN 325
                      +D  L   + FP  G  F +    + E  GK    H 
Sbjct: 204 CTALHELRWQVLDPRLFQSACFPGCGCSFSDSDMTDPERVGKFAKGHR 251


>gi|242062902|ref|XP_002452740.1| hypothetical protein SORBIDRAFT_04g031550 [Sorghum bicolor]
 gi|241932571|gb|EES05716.1| hypothetical protein SORBIDRAFT_04g031550 [Sorghum bicolor]
          Length = 387

 Score = 41.2 bits (95), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 32/62 (51%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT 212
           D     +F + G+     +R   L  +LE GY+ +++D D++W ++PFP+   D D    
Sbjct: 187 DFSQEKRFQTTGYLEMMWKRLDFLRLVLEKGYSFVFSDADIMWFRNPFPHFYTDGDFQIA 246

Query: 213 DD 214
            D
Sbjct: 247 CD 248


>gi|384494920|gb|EIE85411.1| hypothetical protein RO3G_10121 [Rhizopus delemar RA 99-880]
          Length = 668

 Score = 41.2 bits (95), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 104/261 (39%), Gaps = 20/261 (7%)

Query: 89  VAKNGTIIVCAVSQPYLP-FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVL 147
           V+    I++ AV+   +  +  NW+ S+ +     + LV A D   +  +     G  V 
Sbjct: 95  VSGKDRILLTAVANSGMADYTLNWIASLKKCGLDKKFLVFAIDQEMVDIMTKTGYGRRVA 154

Query: 148 VPPA-----PDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
           + P+       S    ++ S G+   T  +   +  +L  G  V ++DVD+V+       
Sbjct: 155 LIPSDWFHKELSGGFEEWQSDGYTPITHSKTLVVERLLYAGITVWFSDVDIVFTSSSIY- 213

Query: 203 LQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEEL 262
              D+ V   +   A   +  S +        +  I S    +RPT   K ++   I   
Sbjct: 214 ---DYLVMKLNSRKARTEVLFSQETE------QNLINSGFYLMRPTLLNKRILDSSIYIQ 264

Query: 263 QAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEETKGKHVI 322
             EP    ++A ++   +  LN    Q+   LL    FP G LYF  Q   +      +I
Sbjct: 265 DREPKVTQQQAMNRVLNDLNLNYRTSQIA--LLDLILFPQGRLYFDRQIPTKYNM-TPMI 321

Query: 323 IHNNYITGFEKKIKRFRDFGL 343
           +H NY  G  KK K  + FGL
Sbjct: 322 VHANYRKGDNKK-KDLQKFGL 341


>gi|4406788|gb|AAD20098.1| unknown protein [Arabidopsis thaliana]
 gi|20197486|gb|AAM15094.1| unknown protein [Arabidopsis thaliana]
          Length = 626

 Score = 41.2 bits (95), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 21/154 (13%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
           L +AA+   K+GT+I+  +++ +         F  ++ I    ++    +++IA D A  
Sbjct: 330 LRRAAT---KDGTVILTTLNEAWAAPGSVIDLFFESFRIGKGTRRLLKHLVIIALD-AKA 385

Query: 135 YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTS------RRPCHLLHILELGYNVMY 188
           Y        H   +    +++     G + +F   S      RR   L  +LE GYN ++
Sbjct: 386 YSRCQELHKHCFRL----ETEGVDFSGGEAYFMTPSYLTMMWRRISFLRSVLEKGYNFVF 441

Query: 189 NDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLD 222
            D D++W ++PF     D D     D    +P D
Sbjct: 442 TDADVMWFRNPFRRFYEDGDFQIACDHYIGRPND 475


>gi|186498694|ref|NP_001118253.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
 gi|330250447|gb|AEC05541.1| nucleotide-diphospho-sugar transferase domain-containing protein
           [Arabidopsis thaliana]
          Length = 408

 Score = 41.2 bits (95), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 63/150 (42%), Gaps = 13/150 (8%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDYATL 134
           L +AA+   K+GT+I+  +++ +         F  ++ I    ++    +++IA D A  
Sbjct: 112 LRRAAT---KDGTVILTTLNEAWAAPGSVIDLFFESFRIGKGTRRLLKHLVIIALD-AKA 167

Query: 135 YKVNGRWPGHAV-LVPPAPD-SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVD 192
           Y        H   L     D S     F +  +     RR   L  +LE GYN ++ D D
Sbjct: 168 YSRCQELHKHCFRLETEGVDFSGGEAYFMTPSYLTMMWRRISFLRSVLEKGYNFVFTDAD 227

Query: 193 MVWLKDPFPYLQGDHDVYFTDDMAAVKPLD 222
           ++W ++PF     D D     D    +P D
Sbjct: 228 VMWFRNPFRRFYEDGDFQIACDHYIGRPND 257


>gi|302852218|ref|XP_002957630.1| hypothetical protein VOLCADRAFT_77645 [Volvox carteri f.
           nagariensis]
 gi|300257042|gb|EFJ41296.1| hypothetical protein VOLCADRAFT_77645 [Volvox carteri f.
           nagariensis]
          Length = 535

 Score = 41.2 bits (95), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 57/273 (20%), Positives = 108/273 (39%), Gaps = 36/273 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           +A+NG ++V   +  Y  F+  W+  +        ++   +D+     +  ++   ++  
Sbjct: 5   IAQNGYLVVTWANYHYFDFVRTWVDHVKAVGVTGYIVGAMDDHLLREMIALKYNCFSM-- 62

Query: 149 PPAPDSQTAHKFG-SQGFFNFTSRRPCHLLHI-LELGYNVMYNDVDMVWLKDPFPYLQG- 205
                  T   FG     F    R    L+ I L+L  +V+  DVD++WL++P PY    
Sbjct: 63  ---KSGLTLGDFGWGSPTFAKMGREKIRLISIFLKLDVSVVIADVDVLWLRNPLPYFDRF 119

Query: 206 -DHDVYFTDDMA-AVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQ 263
            + D+  + D   A  P +     P  G      I   M++ + +      ++ WI+ ++
Sbjct: 120 PEADILTSSDSTFATVPDEQLERWPEAGAAANIGI---MLFRKKSLS---FVEDWIQVIE 173

Query: 264 AEPWSKAKKANDQPAFN------WAL----NKT-----AGQVDLYLLPQSAFPTGGLYFK 308
           ++     +K  DQ AFN      W +    NK       G + + +LP S F +G   F 
Sbjct: 174 SD-----EKIWDQNAFNDLFRRGWKMLDPPNKNLFLGYDGSLTMGILPVSIFCSGHTMFT 228

Query: 309 NQTWVEETKGKHVIIHNNYITGFEKKIKRFRDF 341
            +         + +      +G   K  R R+F
Sbjct: 229 QRLGYTMGLQPYAVHATFQFSGTPGKRHRMREF 261


>gi|359489310|ref|XP_002270120.2| PREDICTED: uncharacterized protein At1g28695-like [Vitis vinifera]
          Length = 346

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 9/121 (7%)

Query: 92  NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
           N T+I+  V++ Y+          FL ++ I    ++  D +LV+A D     +   R  
Sbjct: 61  NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMADHLLVVALDQTAFDRCIFRRL 120

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               +V    D      + S+ F     RR   LL +L+ GY+ ++ D D+ WL++PFP 
Sbjct: 121 HCYKMVSEDGDMDGEKLYMSKDFIKMMWRRTLLLLRVLDRGYSFIFTDTDVSWLRNPFPR 180

Query: 203 L 203
           L
Sbjct: 181 L 181


>gi|115456599|ref|NP_001051900.1| Os03g0849800 [Oryza sativa Japonica Group]
 gi|113550371|dbj|BAF13814.1| Os03g0849800, partial [Oryza sativa Japonica Group]
          Length = 247

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 29/52 (55%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ 204
           D ++   F S+ +      R      ILELGYN ++ DVD++W +DPF ++ 
Sbjct: 194 DYRSEKMFMSKDYLEMMWGRNKFQQTILELGYNFLFTDVDVMWFRDPFRHIS 245


>gi|224114682|ref|XP_002339511.1| predicted protein [Populus trichocarpa]
 gi|222832587|gb|EEE71064.1| predicted protein [Populus trichocarpa]
          Length = 275

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 27/48 (56%)

Query: 162 SQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
           S  F     RR   L  +L  GYN ++ D D++WL++PFP L  + ++
Sbjct: 84  SDEFIKMMWRRTLFLGQVLRRGYNFIFTDADVLWLRNPFPRLSFNKNI 131


>gi|359488580|ref|XP_002273650.2| PREDICTED: uncharacterized protein At4g15970-like [Vitis vinifera]
          Length = 308

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 3/104 (2%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           FL ++ + +  +K  + ++++  D           P    L  P P+   A K       
Sbjct: 67  FLESFKVGVGTKKLLNHLVIVTTDDQAFQYCKAMHPHCFPL--PTPEDFVARKPLMHPDR 124

Query: 167 NFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
           +   RR   LL  + ELGYN ++ D D++WLK+PF Y+    D+
Sbjct: 125 SKFGRRTIRLLGEVDELGYNFVFTDADVMWLKNPFLYVDPIQDL 168


>gi|312190393|gb|ADQ43193.1| unknown [Eutrema parvulum]
          Length = 395

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 4/81 (4%)

Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVK 219
           F ++ +     RR   L  +LE+GYN ++ D D++W ++PFP      D      +A   
Sbjct: 183 FMTRSYLKMMWRRIDFLRSVLEMGYNFVFTDADVMWFRNPFPRFYRYADF----QIACDH 238

Query: 220 PLDHSHDLPPPGKKGRTYICS 240
            L  S+DL      G +++ S
Sbjct: 239 YLGRSNDLENRPNGGFSFVRS 259


>gi|303283027|ref|XP_003060805.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
 gi|226458276|gb|EEH55574.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
          Length = 366

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 51/205 (24%), Positives = 81/205 (39%), Gaps = 36/205 (17%)

Query: 100 VSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP-----------GHAVLV 148
           VS  +  F+ NW   + R   ++ V+V A D AT Y    R              H V+ 
Sbjct: 153 VSDGFHEFMLNWHAHVKRLGIRN-VVVAALDEAT-YATCARHAIACYSHRSLRYTHGVVA 210

Query: 149 ---PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQG 205
               P  D+  +    +  F    + +   LL +L+ G  V+ +DVD+VWL+DP      
Sbjct: 211 TGGSPLHDANASVTLNATAFQQIGALKTQFLLTLLKRGLRVLVSDVDVVWLRDPA----- 265

Query: 206 DHDVYFTDDMAAVKPLDH----SHDLPPPGKKGRTYIC------SCMIYLRPTDGAKLVM 255
             + YF     A          S D      + +T  C      + ++Y+ PT+ A   +
Sbjct: 266 --ESYFDATDGAATAAAADIAVSTDCLSAIDEAKTRGCWHMQFNTGIMYVNPTETAMAFV 323

Query: 256 KKWIEELQAEPWSKAKKANDQPAFN 280
             W E L+A   +     +DQ  FN
Sbjct: 324 TAWGEALRA---TTHAFEHDQDVFN 345


>gi|357168283|ref|XP_003581573.1| PREDICTED: uncharacterized protein At4g15970-like [Brachypodium
           distachyon]
          Length = 389

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/176 (19%), Positives = 76/176 (43%), Gaps = 21/176 (11%)

Query: 79  DYTLSQAASFVAKNGTIIVCAVSQPYLP-------FLNNWLISISRQKHQDQVLVIAEDY 131
           D  L +A++   ++ T+I+  ++  +         F++++   +S     + +++IA D+
Sbjct: 89  DRVLQKAST---RDNTVILTTLNAAWASPGSVIDLFIDSFRSGVSTSSLLNHLVIIAFDW 145

Query: 132 ATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYN-- 189
               +         VL     D     +F + G+     +R   L  +LE GY+ +++  
Sbjct: 146 NAYKQCLKIHHYCFVLATKGVDFSEEKRFLTSGYLEMMWKRLDFLRTVLEKGYSFIFSLF 205

Query: 190 -----DVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICS 240
                D D++W ++PFP+   D D+     +A    + ++ DL      G  Y+ S
Sbjct: 206 FYAFHDADIMWFRNPFPHFYPDGDI----QIACDHYVGNATDLKSIANGGFNYVKS 257


>gi|219130412|ref|XP_002185360.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403275|gb|EEC43229.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 483

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 59/137 (43%), Gaps = 12/137 (8%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISI-SRQKHQDQVLVI-----AEDYATLYKVNGRWP 142
           +A   T+IV  V+      L N++ +  SR      V+V      + D AT   +   + 
Sbjct: 323 IAIENTVIVMVVNFGQTELLMNFVCAAKSRSLDLSNVIVFTTDQESTDLATSLGLTAYYD 382

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
                  P   S+ A ++G + F      +   +  +  LGY++++ DVD+VW  +P  Y
Sbjct: 383 QRNFGEIP---SEAARRYGDRRFTAMMMAKVICVQLVSMLGYDLLFQDVDIVWFSNPLEY 439

Query: 203 LQGDH---DVYFTDDMA 216
                   D++F DD A
Sbjct: 440 FAHADPGMDMFFQDDGA 456


>gi|297734641|emb|CBI16692.3| unnamed protein product [Vitis vinifera]
          Length = 313

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 9/121 (7%)

Query: 92  NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
           N T+I+  V++ Y+          FL ++ I    ++  D +LV+A D     +   R  
Sbjct: 61  NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMADHLLVVALDQTAFDRCIFRRL 120

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               +V    D      + S+ F     RR   LL +L+ GY+ ++ D D+ WL++PFP 
Sbjct: 121 HCYKMVSEDGDMDGEKLYMSKDFIKMMWRRTLLLLRVLDRGYSFIFTDTDVSWLRNPFPR 180

Query: 203 L 203
           L
Sbjct: 181 L 181


>gi|307102835|gb|EFN51102.1| hypothetical protein CHLNCDRAFT_141437 [Chlorella variabilis]
          Length = 683

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 76/334 (22%), Positives = 119/334 (35%), Gaps = 83/334 (24%)

Query: 92  NGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVP-- 149
           +  + V   + P+   L+NW++SI       Q LV A D  T  +   +   H  L+P  
Sbjct: 58  SALLFVTFSAAPFWELLSNWVLSIQALPGPPQFLVGALDAQTAAECEAQGWRHVTLLPAG 117

Query: 150 -----PAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELG--YNVMYNDVDMVWLKDPFPY 202
                P+ D+    +       +  + R   +L +LE      V+ +  D  WL+DP  Y
Sbjct: 118 GAARVPSRDAARRRQQAEAEDRSSAAARAGLVLGLLEEDGMEAVVVSHPDTAWLRDPAEY 177

Query: 203 LQGDHDVYFTDDMAAVKPLDHS-----------------HDLPPPGK----KGRTYICSC 241
           L   H     D + +   L H                   +   PG+    +GR +  + 
Sbjct: 178 L-AQHPA--ADLLISTGCLSHEVGGAVQAAVQAAVRVVLEEAGAPGQPLSGQGRAF-NTG 233

Query: 242 MIYLRPTDGAKLVMKKWIEEL-QAEPWSKAKKANDQPAFNWALNKTAGQVDLYL------ 294
           +  +R T  A+ ++  W + L   E  + A      P FN    + AGQ  L L      
Sbjct: 234 VYAVRKTAAARELLASWRDALGDPEAAAAAAAVQPPPPFNETAPRDAGQRALRLLLEEGQ 293

Query: 295 ------------------------LPQSAFPTGGLYFKNQTWVEETKG-KHVIIHNN--- 326
                                   LP   FP G + F  +  + E KG +  ++H N   
Sbjct: 294 DSLPSAPGDPRTVLMRGGSLAVHPLPVLLFPGGHVAFVQR--LPEGKGVQPYVVHANGDR 351

Query: 327 ---------YITGFEKKIKRFRDFGLWLVD--DH 349
                      T   K+  R R+FGLW VD  DH
Sbjct: 352 WYGADGAQPEATAAAKR-GRLREFGLWRVDGADH 384


>gi|118485729|gb|ABK94714.1| unknown [Populus trichocarpa]
          Length = 260

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 47/186 (25%), Positives = 80/186 (43%), Gaps = 20/186 (10%)

Query: 108 LNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFN 167
           L  W  +I +   ++ ++V  +D+   +  +   P    +    PDS       + G   
Sbjct: 2   LEVWFANIKKAGIRNYLVVALDDHIVDFCKSNDVP----VYKRDPDSGIDSVARTGGNHA 57

Query: 168 FTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFT----DDMAAVKPLDH 223
            +  +   L   L+LGY+V+ +DVD+++L++PF +L  D DV       D+M A    D 
Sbjct: 58  VSGLKFRILREFLQLGYSVLLSDVDIIYLQNPFDHLYRDSDVESMSDGHDNMTAYG-FDD 116

Query: 224 SHDLPPPG-----KKGRTYIC-SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQP 277
             + P  G        R ++  S   Y+RPT  +  ++ +    L  EP S      DQ 
Sbjct: 117 VFNEPAMGWARYAHTMRIWVYNSGFFYIRPTLPSIELLDRVAGRLSREPNSW-----DQA 171

Query: 278 AFNWAL 283
            FN  L
Sbjct: 172 VFNEEL 177


>gi|225450476|ref|XP_002276802.1| PREDICTED: uncharacterized protein LOC100240902 [Vitis vinifera]
          Length = 285

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 26/34 (76%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
           L   L+LGY V+++D+D+V+L++PF YL  D DV
Sbjct: 159 LGEFLQLGYIVLFSDIDIVYLQNPFDYLYRDSDV 192


>gi|159484574|ref|XP_001700329.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272370|gb|EDO98171.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 528

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 33/138 (23%), Positives = 57/138 (41%), Gaps = 18/138 (13%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGD-----HDVYFTDDMAAVKPL-DHSHDLPPPGK 232
           + E G NV+++D D+VW  DP P+         H +  TD +A   P+ D   ++     
Sbjct: 48  VYEFGVNVIHSDTDVVWFGDPLPFFHERLSGPVHVIMATDAVATGNPVGDMGLEVT---T 104

Query: 233 KGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDL 292
              T I + + +++   G   + K W++      W      +DQ  FN     +  + + 
Sbjct: 105 NPFTNINTGIYFIKQYAGGLDMFKAWLD------WQDKNIGHDQDGFNTMARGSGFRHED 158

Query: 293 YLLPQSAFP---TGGLYF 307
             LP +  P   T   YF
Sbjct: 159 KHLPPAVLPSDATANRYF 176


>gi|303278666|ref|XP_003058626.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
 gi|226459786|gb|EEH57081.1| glycosyltransferase family 77 protein [Micromonas pusilla CCMP1545]
          Length = 924

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 31/129 (24%), Positives = 56/129 (43%), Gaps = 19/129 (14%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDH-----------DVYFTDDMAAVKPLDHSHDL 227
           +L +G++V+ +DVD+VW +DP P+                D  +   +A+   +  S +L
Sbjct: 234 LLSMGFDVLMSDVDVVWRRDPRPFFSRRRDDDGDDGDGGDDGDYDASLASADVMVSSDNL 293

Query: 228 PPPG--KKGRTYIC-----SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFN 280
            P    ++G TY       + +++ R T   +     W E L       A   +DQ  FN
Sbjct: 294 SPRRDFEQGATYAARGTFNTGIVFARRTPAGERFAAAWYERLARPTGRFASLTSDQQVFN 353

Query: 281 WALNKTAGQ 289
            A+ +  G+
Sbjct: 354 -AMVRAEGR 361


>gi|296089833|emb|CBI39652.3| unnamed protein product [Vitis vinifera]
          Length = 247

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 26/34 (76%)

Query: 176 LLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
           L   L+LGY V+++D+D+V+L++PF YL  D DV
Sbjct: 115 LGEFLQLGYIVLFSDIDIVYLQNPFDYLYRDSDV 148


>gi|307104144|gb|EFN52399.1| expressed protein [Chlorella variabilis]
          Length = 705

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 66/172 (38%), Gaps = 37/172 (21%)

Query: 186 VMYNDVDMVWLKDPFPYLQGDHDVYF---TDDMAAV-----KPLDHSHDLPP--PGKKGR 235
           V+ +D D VWL+ P+ YL+    V F   TD ++A      +P D S+ L    PG    
Sbjct: 193 VVVSDSDCVWLRPPWQYLEQRQGVDFFVSTDCLSAKAEDAWQPTDRSYPLCGHVPGNSHA 252

Query: 236 TYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLL 295
               + M  +  T  ++  +K WI+ L     +K + ++       ALN   G   + + 
Sbjct: 253 AAFNTGMFAVSNTPASRAFLKAWIKMLTDPAQAKHEDSDSHVDDQLALNLLRGVTPIAV- 311

Query: 296 PQSAFPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVD 347
                                   H     + +   +K   R R+FGLW VD
Sbjct: 312 ------------------------HATFQRHPVAALKK--HRLREFGLWHVD 337


>gi|326435872|gb|EGD81442.1| hypothetical protein PTSG_02163 [Salpingoeca sp. ATCC 50818]
          Length = 418

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 53/140 (37%), Gaps = 35/140 (25%)

Query: 179 ILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGR--- 235
            L  GY+V+ +D D+ W  +P P L           +  VK    +H L   G++G    
Sbjct: 137 CLAQGYDVLMHDADVTWFSNPLPEL-----------ITTVK----THSLDLLGQRGSYPK 181

Query: 236 -------TYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAG 288
                    +C  +IY R T   +LVM   I  L+          +DQ   N AL   AG
Sbjct: 182 TVSNRWGAVLCGGLIYYRATSATRLVMDVAIGLLEYH------SPDDQVLINMALALLAG 235

Query: 289 QVDLYLLPQSAFPTGGLYFK 308
                 +P S   TG    +
Sbjct: 236 T----RMPSSVTSTGNKVLR 251


>gi|298704797|emb|CBJ48945.1| Acetyltransferase (isoleucine patch superfamily protein)
           [Ectocarpus siliculosus]
          Length = 423

 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 49/111 (44%), Gaps = 16/111 (14%)

Query: 177 LHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPP--GKKG 234
           + +L  G +V+ +D D +WL+DP   L    D +   ++ A + +     +P P   K G
Sbjct: 235 ICLLNAGQDVLMSDADAIWLRDPTDDLAS--DAFLDSNVVASRAV-----MPFPLYHKWG 287

Query: 235 RTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNK 285
            T IC    + R    A       +EE+  E        NDQ A N ALNK
Sbjct: 288 AT-ICMGFAFFRAGGSAMQAFMTVVEEIANE------LGNDQIAVNEALNK 331


>gi|147832387|emb|CAN64421.1| hypothetical protein VITISV_031574 [Vitis vinifera]
          Length = 397

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 9/121 (7%)

Query: 92  NGTIIVCAVSQPYLP---------FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWP 142
           N T+I+  V++ Y+          FL ++ I    ++  + +LV+A D     +   R  
Sbjct: 112 NKTVIITIVNKAYVEGDDTSMLDLFLESFWIGEGTREMAEHLLVVALDQTAFDRCIFRRL 171

Query: 143 GHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY 202
               +V    D      + S+ F     RR   LL +LE GY+ ++ D D+ WL++PFP 
Sbjct: 172 HCYKMVSEDGDMDGEKLYMSKDFIKMMWRRTLLLLRVLERGYSFIFTDTDVSWLRNPFPR 231

Query: 203 L 203
           L
Sbjct: 232 L 232


>gi|357115371|ref|XP_003559462.1| PREDICTED: uncharacterized protein At1g28695-like [Brachypodium
           distachyon]
          Length = 371

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 153 DSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQ--GDHDVY 210
           D  +   + S GF     +R   L  +++ GY+ ++ D+D++WL++PF  L   G+ D+ 
Sbjct: 174 DLSSEQLYMSDGFIRMMWQRIRLLGDVVKHGYSFIFTDLDVMWLRNPFQSLNRTGEEDLL 233

Query: 211 FTDDMAAVKPLDH 223
            + D    +P D+
Sbjct: 234 ISSDRFNGRPHDY 246


>gi|397604903|gb|EJK58815.1| hypothetical protein THAOC_21028 [Thalassiosira oceanica]
          Length = 645

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 4/35 (11%)

Query: 184 YNVMYNDVDMVWLKDPFPYLQ----GDHDVYFTDD 214
           + +  +DVDMVWLKDP  Y +    GD DV+F DD
Sbjct: 411 HEMAASDVDMVWLKDPRLYFESGSAGDFDVFFQDD 445


>gi|226504952|ref|NP_001140234.1| uncharacterized protein LOC100272274 [Zea mays]
 gi|194698620|gb|ACF83394.1| unknown [Zea mays]
 gi|414590698|tpg|DAA41269.1| TPA: hypothetical protein ZEAMMB73_319089 [Zea mays]
          Length = 423

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 107/275 (38%), Gaps = 37/275 (13%)

Query: 89  VAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLV 148
           VA    +IV   +      L  W  +I R    + ++V  +D    +  +   P    + 
Sbjct: 147 VAVKKELIVALANSNVKEMLEVWFTNIKRAGIPNYLVVALDDNIESFCKSKDVP----VY 202

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHD 208
              PD    +   + G    +  +   L   L+LGY+V+ +D+D+++ ++PF +L  D D
Sbjct: 203 RRDPDDGIDNIGKTGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFRNPFDHLYRDSD 262

Query: 209 VYFTDD----MAAVKPLDHSHDLPPPG-----KKGRTYI-CSCMIYLRPTDGAKLVMKKW 258
           V    D    M A    +   D P  G        R ++  S   ++RPT  +  ++ + 
Sbjct: 263 VESMSDGHNNMTAYG-FNDVFDEPSMGWARYAHTMRIWVYNSGFFFIRPTIPSIELLDRV 321

Query: 259 IEELQAEPWSKAKKANDQPAFN-----------WALNKTAGQVDLYLLPQSAFPTGGLYF 307
              L  EP     K+ DQ  FN             L+ +   +D+YL     F    + F
Sbjct: 322 AYRLSHEP-----KSWDQAVFNEELFFPSHPGYEGLHASKRTMDMYL-----FMNSKVLF 371

Query: 308 KNQTWVEETKG-KHVIIHNNYITGFEKKIKRFRDF 341
           K        +  K VI+H NY      ++K   +F
Sbjct: 372 KTVRKDANLRTLKPVIVHLNYHPDKSDRMKAVIEF 406


>gi|300176591|emb|CBK24256.2| unnamed protein product [Blastocystis hominis]
          Length = 217

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 33/146 (22%), Positives = 61/146 (41%), Gaps = 26/146 (17%)

Query: 164 GFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDH 223
            F   T+ +   +  +L +G N ++ D D+V  ++PF Y+                P D 
Sbjct: 81  AFNQLTALKLIAVQKVLNMGVNCLFFDSDVVIFQNPFKYV----------------PTDE 124

Query: 224 SHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWAL 283
             D     +K  T ICS  +Y R TD ++L ++  + ++      K +  +DQ A    +
Sbjct: 125 EFDF--IAQKDAT-ICSGFVYWRATDRSRLTIELTLNKM------KERNIHDQTALVEVV 175

Query: 284 -NKTAGQVDLYLLPQSAFPTGGLYFK 308
            N    ++   L     +  G +YF+
Sbjct: 176 DNHLVPELKSLLFEPMLYNRGSIYFE 201


>gi|307111693|gb|EFN59927.1| hypothetical protein CHLNCDRAFT_49368 [Chlorella variabilis]
          Length = 617

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 159 KFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL--QGDHDVYFTDD 214
           K+G +G+   T  +   L  + + G+N++ +DVD+VW +DP P        D+ F++D
Sbjct: 163 KWGEEGWRRMTWAKVFVLDAVADWGFNLVISDVDVVWFRDPLPLFAKHAHADLIFSED 220


>gi|302844548|ref|XP_002953814.1| hypothetical protein VOLCADRAFT_106094 [Volvox carteri f.
           nagariensis]
 gi|300260922|gb|EFJ45138.1| hypothetical protein VOLCADRAFT_106094 [Volvox carteri f.
           nagariensis]
          Length = 1007

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 32/136 (23%), Positives = 60/136 (44%), Gaps = 19/136 (13%)

Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLH-ILELGYNVMYNDVDMVWLKDPFPYLQG-----DH 207
           S T +++G    ++ T+    H++  + E+G +V+++D+D+VW  DP PY +       H
Sbjct: 150 SDTKYQWGGH-HWSMTTWNKVHIMKSVYEMGVHVVHSDMDVVWFNDPLPYFKSLLEKPVH 208

Query: 208 DVYFTDDMAAVKPLDHS--HDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAE 265
            V  TD + +  P   +    L  P       I + + ++R   G       W+      
Sbjct: 209 IVIATDAVTSENPKGDTGLEALTSP----HANINTGIYFMRQWPGGLEFFNIWLS----- 259

Query: 266 PWSKAKKANDQPAFNW 281
            W   +  +DQ  FN+
Sbjct: 260 -WQDKRIGHDQDGFNF 274


>gi|302834667|ref|XP_002948896.1| hypothetical protein VOLCADRAFT_89227 [Volvox carteri f.
           nagariensis]
 gi|300266087|gb|EFJ50276.1| hypothetical protein VOLCADRAFT_89227 [Volvox carteri f.
           nagariensis]
          Length = 284

 Score = 38.9 bits (89), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 30/120 (25%), Positives = 54/120 (45%), Gaps = 16/120 (13%)

Query: 160 FGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYL-QGDHDVYFTDDMAAV 218
           F S GF      +  ++L+ L  G++V++ D D+  L+DP PY+ Q D D++   +   V
Sbjct: 149 FHSDGFHALGFAKVKYILNSLSAGHDVLFLDADVQLLRDPVPYIYQRDGDMFVMMEKCLV 208

Query: 219 ----KPLDHSHDLP----PPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQAEPWSKA 270
               +P+      P    PP   G        +Y + T G    +  WI ++  +  ++A
Sbjct: 209 YNVSRPITFRRSEPDVWIPPLNIGS-------LYFKATAGVTRCVYNWIWDMYGQVPNRA 261


>gi|159479202|ref|XP_001697686.1| hypothetical protein CHLREDRAFT_195312 [Chlamydomonas reinhardtii]
 gi|158269942|gb|EDO96019.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 579

 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 156 TAHKFGSQGFFNFTSRRPCHLLH-ILELGYNVMYNDVDMVWLKDPFPYLQ 204
           TA+ +GS   +  T+    H++  + E G NV+++D D+VW  DP P+ Q
Sbjct: 27  TAYHWGSH-HWKLTTWNKVHIIKAVYEFGVNVIHSDTDVVWFGDPLPFFQ 75


>gi|167516356|ref|XP_001742519.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779143|gb|EDQ92757.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1057

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 6/114 (5%)

Query: 80  YTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNG 139
           +TL      VA++  ++V   S  Y   + N++  + +    DQ+++ A D   +Y+   
Sbjct: 530 FTLRDLLPVVARDNHVLVTGASYAYRDVVMNFVCQLRKLGIYDQLVIAAFD-EDMYRFGF 588

Query: 140 RWPGHAVLVPPAPDSQ--TAH--KFGSQGFFNFTSRRPCHLLHILELGYNVMYN 189
           R  G  V    A D    T+H  ++GSQ F   T  +   +L IL+LGY+V + 
Sbjct: 589 RM-GLPVFFYQASDISGLTSHDLEYGSQHFKKVTKLKSQVVLQILKLGYDVTWT 641


>gi|422293879|gb|EKU21179.1| hypothetical protein NGA_0133501 [Nannochloropsis gaditana CCMP526]
          Length = 421

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 48/203 (23%), Positives = 83/203 (40%), Gaps = 22/203 (10%)

Query: 15  PNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPWTGTPGFMFPNATSSL 74
           P  + L PR  ++ +  RP  +   R+    LL L   L  +    G    +F  A  S 
Sbjct: 140 PPSWRLKPR--ISLEGSRPPKMTHMRSQEGFLLELFSNLPDLEEKAGA---LFARAADSQ 194

Query: 75  AKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATL 134
            +     L++    +  N    VC+  Q  +   N  +IS       D+ +V   +   L
Sbjct: 195 KRLLIMALNEGDVDLLVN---FVCSARQASISVENLVVISA------DKSVVDIAEALNL 245

Query: 135 YKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMV 194
           +  +   PG   L      S+ + ++G + F      +   +   + LGY+V++ D D+V
Sbjct: 246 HAFS--HPGFGTL-----PSERSGRYGDENFQVMMWLKVVSVWIAIRLGYHVLFQDADLV 298

Query: 195 WLKDPF-PYLQGDHDVYFTDDMA 216
           WLK P+  +     D +F DD A
Sbjct: 299 WLKSPWEAFADTSIDGFFMDDGA 321


>gi|297607539|ref|NP_001060135.2| Os07g0587100 [Oryza sativa Japonica Group]
 gi|34393596|dbj|BAC83249.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|255677929|dbj|BAF22049.2| Os07g0587100 [Oryza sativa Japonica Group]
          Length = 426

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 32/134 (23%), Positives = 57/134 (42%), Gaps = 6/134 (4%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 143 LAKILQQVAIKKELIVALANSNVREMLEMWFTNIKRVGISNYLIVALDDSTETFCKSKGV 202

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P         PD +     G  G  +  S     +L   L+LGY+V+ +D+D+++ ++PF
Sbjct: 203 P----FYRRDPD-EGVDNIGKVGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPF 257

Query: 201 PYLQGDHDVYFTDD 214
            +L  D DV    D
Sbjct: 258 DHLYRDADVESMSD 271


>gi|407928253|gb|EKG21116.1| Heat shock protein Hsp70 [Macrophomina phaseolina MS6]
          Length = 251

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 8/82 (9%)

Query: 125 LVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGF------FNFTSRRPCHLLH 178
           L++A DY T Y VN  WPG+A  V   P S+ A+   + G       FN  S   C+   
Sbjct: 8   LIVAVDYGTTYTVN-EWPGNAESVFKVP-SKIAYAAENPGLDDDQWGFNVKSSMGCYTWT 65

Query: 179 ILELGYNVMYNDVDMVWLKDPF 200
            L L  N+   D D   L+D F
Sbjct: 66  KLLLDKNIRLTDFDDSKLEDVF 87


>gi|428183855|gb|EKX52712.1| hypothetical protein GUITHDRAFT_101866 [Guillardia theta CCMP2712]
          Length = 899

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 55/286 (19%), Positives = 113/286 (39%), Gaps = 40/286 (13%)

Query: 84  QAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKH-QDQVLVIAEDYATLYKVNGRWP 142
           QAA        +I+  ++  ++PF+ +WL +  +     ++ LVI  D  +  +V  R+ 
Sbjct: 517 QAAMQGMPKDVVILHFINSAFMPFMLSWLCNTQQMTGVHERTLVITTDAGSEERVK-RFS 575

Query: 143 G--HAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPF 200
              HA+ +           F S G+      R   ++ +L+    +   + D +W+ +  
Sbjct: 576 SRVHAMYLEGYNGINGNLDFDSLGYRRMLKARIEIIISVLQKSDLLFLTEPDAIWMNNLL 635

Query: 201 ---PYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKK 257
               ++    D+   DD                G KG  ++      +R       + +K
Sbjct: 636 EIKEFVNSPSDLIGFDD--------------EQGGKGAGFVL-----MRNNAAISCLWRK 676

Query: 258 WIE--ELQAEPWSKAKKA------NDQPAFNWALN--KTAGQVDLYLLPQSAFPTGGLYF 307
            +E  EL   P+            NDQ  FN  +   + AG +    +    F TG  ++
Sbjct: 677 LLERQELTLAPYEGQPNEIVTYDDNDQKYFNALIQDMEAAGMMKSSTMDTCKFRTGK-WY 735

Query: 308 KNQTWVEETKGKHV---IIHNNYITGFEKKIKRFRDFGLWLVDDHA 350
           K + W +     ++   I++ N++ G + KI+R + +G W +++  
Sbjct: 736 KREEWKDYMTCLNIVPDILNFNWMVGNQPKIQRAKLYGHWFLEEDG 781


>gi|218199927|gb|EEC82354.1| hypothetical protein OsI_26669 [Oryza sativa Indica Group]
          Length = 426

 Score = 38.1 bits (87), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 32/134 (23%), Positives = 57/134 (42%), Gaps = 6/134 (4%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 143 LAKILQQVAIKKELIVALANSNVREMLEMWFTNIKRVGISNYLIVALDDSTETFCKSKGV 202

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P         PD +     G  G  +  S     +L   L+LGY+V+ +D+D+++ ++PF
Sbjct: 203 P----FYRRDPD-EGVDNIGKVGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPF 257

Query: 201 PYLQGDHDVYFTDD 214
            +L  D DV    D
Sbjct: 258 DHLYRDADVESMSD 271


>gi|147795394|emb|CAN76537.1| hypothetical protein VITISV_034847 [Vitis vinifera]
          Length = 252

 Score = 38.1 bits (87), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 47/103 (45%), Gaps = 15/103 (14%)

Query: 107 FLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFF 166
           +L  W  SI R    + ++V  +D    + + GR    +V         +  KF      
Sbjct: 4   YLEVWFTSIKRVAILNYLVVALDDXIANFYLVGRID--SVARTGVBKDVSGLKF------ 55

Query: 167 NFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDV 209
                    L   L+LGY+V+ +DVD+V+L++PF +L  D DV
Sbjct: 56  -------LXLRDFLQLGYSVLLSDVDLVYLRNPFDHLYRDCDV 91


>gi|223999733|ref|XP_002289539.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974747|gb|EED93076.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 610

 Score = 38.1 bits (87), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 23/215 (10%)

Query: 154 SQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTD 213
           S  A ++G + F      +   +L+ L L Y+V++ DVD+VW +DP          +F D
Sbjct: 357 SGEARRYGDKNFKAMMYAKVLCVLYPLLLNYDVLFQDVDIVWYRDPM--------TFFHD 408

Query: 214 DMAAVKPLD--HSHDLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKWIEELQ-AEPWSKA 270
             A +   D    HD     +       S   Y+R    ++ +    +        W   
Sbjct: 409 KTANISHFDVLFQHDGSNSVRYAPYSANSGFYYVRANKRSQYLFTSLLYHSDLIITWDSH 468

Query: 271 KKANDQPAFN----WALNKTAGQVDLYLLPQSAFPTGGLYFKNQTWVEE-TKGK--HVII 323
           ++   Q        + LN     V ++      FP G  Y + + ++++  KG+    I 
Sbjct: 469 QQVLVQLLAEHSSLFGLN-----VKVFNRDTEMFPGGYHYHRRKDFMKKLMKGEMNSYIF 523

Query: 324 HNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358
           H ++    + K+   R  G W V D  +    G++
Sbjct: 524 HISWTENKDNKLLFLRQMGEWYVTDQCISKTAGEI 558


>gi|384249480|gb|EIE22961.1| hypothetical protein COCSUDRAFT_41969 [Coccomyxa subellipsoidea
           C-169]
          Length = 396

 Score = 38.1 bits (87), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 137 VNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWL 196
           +NGR+  H V     P   T ++ G          +   +L+IL LGY+V+Y DVD V+ 
Sbjct: 142 LNGRYSHHCVQSTNWPGHDTEYETGDAWHVAAELHKLELILNILTLGYSVLYVDVDHVFY 201

Query: 197 KDPFPYL 203
           ++P  +L
Sbjct: 202 RNPMHHL 208


>gi|302838823|ref|XP_002950969.1| hypothetical protein VOLCADRAFT_91495 [Volvox carteri f.
           nagariensis]
 gi|300263664|gb|EFJ47863.1| hypothetical protein VOLCADRAFT_91495 [Volvox carteri f.
           nagariensis]
          Length = 279

 Score = 37.7 bits (86), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 56/142 (39%), Gaps = 22/142 (15%)

Query: 149 PPAPDSQTAHKFGSQGFFNFTSRRPCHLLHILELGYNVMYNDVDMVWLKDPFPY-LQGDH 207
           P  P  Q      SQG F  TS     +L  L LG +V++ D D V+ ++P PY L  + 
Sbjct: 50  PDGPMGQMKSVDRSQGSFTKTST----ILDGLTLGVDVLFLDADQVFFRNPLPYILAREA 105

Query: 208 DVYFTDDMAAVKPLDHSH-DLPPPGKKGRTYICSCMIYLRPTDGAKLVMKKW------IE 260
           D+  + D        H+H D  P  +          +Y RPT      +  W      I 
Sbjct: 106 DILVSGDC-------HNHGDATPMERFPPINNNIGFVYFRPTAMVTRAIYNWALWLRNIA 158

Query: 261 ELQAEPWSKAKKANDQPAFNWA 282
               +PW ++  A   PA  W 
Sbjct: 159 LTGNKPWDQSTFA---PAVEWV 177


>gi|222637364|gb|EEE67496.1| hypothetical protein OsJ_24927 [Oryza sativa Japonica Group]
          Length = 390

 Score = 37.7 bits (86), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 32/134 (23%), Positives = 57/134 (42%), Gaps = 6/134 (4%)

Query: 82  LSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQKHQDQVLVIAEDYATLYKVNGRW 141
           L++    VA    +IV   +      L  W  +I R    + ++V  +D    +  +   
Sbjct: 107 LAKILQQVAIKKELIVALANSNVREMLEMWFTNIKRVGISNYLIVALDDSTETFCKSKGV 166

Query: 142 PGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLL-HILELGYNVMYNDVDMVWLKDPF 200
           P         PD +     G  G  +  S     +L   L+LGY+V+ +D+D+++ ++PF
Sbjct: 167 P----FYRRDPD-EGVDNIGKVGGNHAVSGLKFRILREFLQLGYSVLLSDIDIIFFQNPF 221

Query: 201 PYLQGDHDVYFTDD 214
            +L  D DV    D
Sbjct: 222 DHLYRDADVESMSD 235


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.138    0.449 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,514,655,720
Number of Sequences: 23463169
Number of extensions: 284650450
Number of successful extensions: 613987
Number of sequences better than 100.0: 457
Number of HSP's better than 100.0 without gapping: 307
Number of HSP's successfully gapped in prelim test: 150
Number of HSP's that attempted gapping in prelim test: 613377
Number of HSP's gapped (non-prelim): 524
length of query: 358
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 215
effective length of database: 9,003,962,200
effective search space: 1935851873000
effective search space used: 1935851873000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)