BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 010098
         (518 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255541500|ref|XP_002511814.1| UDP-galactose:MGDG galactosyltransferase, putative [Ricinus
           communis]
 gi|223548994|gb|EEF50483.1| UDP-galactose:MGDG galactosyltransferase, putative [Ricinus
           communis]
          Length = 470

 Score =  746 bits (1925), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/466 (75%), Positives = 397/466 (85%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD+KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDG+R+VTLVIPWLSL HQK VYP N
Sbjct: 1   MDQKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGDRKVTLVIPWLSLKHQKLVYPKN 60

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF+ P EQEAYVR WLEDR  F   F   FYPGKFA DK SIL VGDI++ I DEEAD+
Sbjct: 61  ITFSLPSEQEAYVRRWLEDRIAFLCDFKLCFYPGKFAKDKGSILPVGDISDFISDEEADV 120

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           A+LEEPEHLTWFHHGKRWK KFR V+GIVHTNYLEYVKREKN RL+A LL++VNSW+  +
Sbjct: 121 AILEEPEHLTWFHHGKRWKTKFRLVIGIVHTNYLEYVKREKNGRLKALLLKYVNSWVVSI 180

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+Y  SI+CNVHGVNPKFLEIG+KK+EQQ+NG++ FTKGAYYIG+MVW
Sbjct: 181 YCHKVIRLSAATQDYTKSIICNVHGVNPKFLEIGKKKLEQQKNGDQPFTKGAYYIGKMVW 240

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LL+ + KEL GLE+DLYG+GED DQ+Q AAKKL+LVVRV PGRDHADP+FH
Sbjct: 241 SKGYKELLKLLDDHQKELTGLEVDLYGSGEDSDQVQEAAKKLELVVRVNPGRDHADPVFH 300

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVFLNPSTTDVVCT TAEALAMGKIVVCANHPSNDFFKQF NC+TYD R GFVEA  K
Sbjct: 301 DYKVFLNPSTTDVVCTTTAEALAMGKIVVCANHPSNDFFKQFSNCQTYDNRKGFVEAVCK 360

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIE 420
           ALAE+PA+ TD + + LSWE+ATERFL+VAEL      K  + PS++  S SLN ++ IE
Sbjct: 361 ALAEQPAELTDEERYALSWEAATERFLKVAELDLPSAWKVERVPSKNFASTSLNLRQNIE 420

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKG 466
           DA  Y+H+V SGFE SRRA GAIPGSLQPDEQ CKELGL    GKG
Sbjct: 421 DASAYVHHVVSGFEVSRRAFGAIPGSLQPDEQQCKELGLAISAGKG 466


>gi|224130208|ref|XP_002320779.1| predicted protein [Populus trichocarpa]
 gi|222861552|gb|EEE99094.1| predicted protein [Populus trichocarpa]
          Length = 475

 Score =  726 bits (1875), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/461 (73%), Positives = 395/461 (85%), Gaps = 4/461 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD K+QHIAIFTTASLPW+TGTAVNPLFRAA+LAKDG R+VTLVIPWLSL HQK VYP N
Sbjct: 1   MDNKKQHIAIFTTASLPWMTGTAVNPLFRAAFLAKDGSRKVTLVIPWLSLQHQKLVYPNN 60

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF SP EQ+ YVR WL++R  F+  F  +FYP KFA DK+SIL+VGDI+E+IPDE+AD+
Sbjct: 61  ITFTSPSEQQVYVRQWLQERISFSPAFSIQFYPAKFAVDKRSILSVGDISEVIPDEDADV 120

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTWFHHGKRWK KFR V+GI+HTNYLEY+KREK+ R++A +++++NSW+  +
Sbjct: 121 AVLEEPEHLTWFHHGKRWKTKFRLVIGIIHTNYLEYIKREKHGRVKAVVVKYINSWVVEI 180

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQ--NGNKAFTKGAYYIGRM 238
           +CHKVIRLSAATQ+YPNSI+CNVHGVNPKFLEIG+KK+E QQ  NGN+AFTKGAYYIG+M
Sbjct: 181 YCHKVIRLSAATQDYPNSIICNVHGVNPKFLEIGKKKIELQQSGNGNQAFTKGAYYIGKM 240

Query: 239 VWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPI 298
           VWSKGY+EL+ LL    KEL GLE+DLYG+GED DQ+Q AAKKL LVVRVYPGRDHADP+
Sbjct: 241 VWSKGYKELIKLLQDNQKELIGLEVDLYGSGEDSDQVQAAAKKLDLVVRVYPGRDHADPV 300

Query: 299 FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEAT 358
           FHDYKVFLNPSTTDVVCT TAEALAMGKIVVCANHPSN+FFKQF NC TYD   GFVEAT
Sbjct: 301 FHDYKVFLNPSTTDVVCTTTAEALAMGKIVVCANHPSNEFFKQFVNCWTYDNSKGFVEAT 360

Query: 359 LKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRI 418
            +AL EEPA+ T AQ H+LSWE+ATERFL+VA+L      K +KS  ++  S SLN++  
Sbjct: 361 SRALTEEPAELTGAQRHELSWEAATERFLRVADLDQVFARKPAKSLLKNFASTSLNTR-- 418

Query: 419 IEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
           +ED   YLHYVA G ETSRRA GAIPGSLQPDE+LC+ELGL
Sbjct: 419 MEDVSAYLHYVALGSETSRRAFGAIPGSLQPDEELCQELGL 459


>gi|388492674|gb|AFK34403.1| unknown [Lotus japonicus]
          Length = 463

 Score =  714 bits (1844), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/456 (73%), Positives = 392/456 (85%), Gaps = 1/456 (0%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           K+QHIAIFTTASLPWLTGTAVNPLFRAAYL+KDGER VTLVIPWLSL  Q  VYP NITF
Sbjct: 3   KKQHIAIFTTASLPWLTGTAVNPLFRAAYLSKDGERDVTLVIPWLSLKDQALVYPSNITF 62

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
           ASP E E Y+R WLE+R GFTS F  +FYPGKF+ DK+SILAVGDI+E+IPD+EADIAVL
Sbjct: 63  ASPSEHEKYIRQWLEERVGFTSGFSIKFYPGKFSRDKRSILAVGDISEVIPDKEADIAVL 122

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTWFHHGKRWK KFR V+GI+HTNYLEYVKREKN ++QAFLL+++N+W+  ++CH
Sbjct: 123 EEPEHLTWFHHGKRWKTKFRLVIGIIHTNYLEYVKREKNGQMQAFLLKYLNNWVVGIYCH 182

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
           KVIRLSAATQ+Y  SIVCNVHGVNPKFLEIG+KK EQQQNG++AFTKGAY+IG+MVWSKG
Sbjct: 183 KVIRLSAATQDYSGSIVCNVHGVNPKFLEIGKKKREQQQNGDQAFTKGAYFIGKMVWSKG 242

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYK 303
           Y+ELL L   + KEL+ LE+DL+G+GED D++Q+AAKKL++ VRV+P RDHAD +FHDYK
Sbjct: 243 YKELLHLFKNHQKELSALEVDLFGSGEDSDEVQKAAKKLEMAVRVHPARDHADALFHDYK 302

Query: 304 VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALA 363
           +FLNPSTTDVVCT TAEALAMGKIVVCANH SN+FFKQFPNC T+D   GFV+  LKALA
Sbjct: 303 LFLNPSTTDVVCTTTAEALAMGKIVVCANHCSNEFFKQFPNCWTFDESKGFVQLILKALA 362

Query: 364 EEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAF 423
           EEPAQ TDAQ H LSWE+ATERFL+ AEL      K S+S S ++ S SLN ++ ++DA 
Sbjct: 363 EEPAQLTDAQRHDLSWEAATERFLKAAELDKPFEKKLSRSTSIYM-STSLNLQQTVDDAS 421

Query: 424 GYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
            Y+H+VASGFE SRR  GAIPGSL+PDE+L KELGL
Sbjct: 422 AYVHHVASGFEISRRMFGAIPGSLKPDEELSKELGL 457


>gi|75120871|sp|Q6DW73.1|DGDG2_LOTJA RecName: Full=Digalactosyldiacylglycerol synthase 2, chloroplastic
 gi|49617335|gb|AAT67423.1| digalactosyldiacylglycerol synthase 2 [Lotus japonicus]
          Length = 463

 Score =  714 bits (1843), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/456 (73%), Positives = 392/456 (85%), Gaps = 1/456 (0%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           K+QHIAIFTTASLPWLTGTAVNPLFRAAYL+KDGER VTLVIPWLSL  Q  VYP NITF
Sbjct: 3   KKQHIAIFTTASLPWLTGTAVNPLFRAAYLSKDGERDVTLVIPWLSLKDQALVYPNNITF 62

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
           ASP E E Y+R WLE+R GFTS F  +FYPGKF+ DK+SILAVGDI+E+IPD+EADIAVL
Sbjct: 63  ASPSEHEKYIRQWLEERVGFTSGFSIKFYPGKFSRDKRSILAVGDISEVIPDKEADIAVL 122

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTWFHHGKRWK KFR V+GI+HTNYLEYVKREKN ++QAFLL+++N+W+  ++CH
Sbjct: 123 EEPEHLTWFHHGKRWKTKFRLVIGIIHTNYLEYVKREKNGQMQAFLLKYLNNWVVGIYCH 182

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
           KVIRLSAATQ+Y  SIVCNVHGVNPKFLEIG+KK EQQQNG++AFTKGAY+IG+MVWSKG
Sbjct: 183 KVIRLSAATQDYSGSIVCNVHGVNPKFLEIGKKKREQQQNGDQAFTKGAYFIGKMVWSKG 242

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYK 303
           Y+ELL L   + KEL+ LE+DL+G+GED D++Q+AAKKL++ VRV+P RDHAD +FHDYK
Sbjct: 243 YKELLHLFKNHQKELSALEVDLFGSGEDSDEVQKAAKKLEMAVRVHPARDHADALFHDYK 302

Query: 304 VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALA 363
           +FLNPSTTDVVCT TAEALAMGKIVVCANH SN+FFKQFPNC T+D   GFV+  LKALA
Sbjct: 303 LFLNPSTTDVVCTTTAEALAMGKIVVCANHCSNEFFKQFPNCWTFDESKGFVQLILKALA 362

Query: 364 EEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAF 423
           EEPAQ TDAQ H LSWE+ATERFL+ AEL      K S+S S ++ S SLN ++ ++DA 
Sbjct: 363 EEPAQLTDAQRHDLSWEAATERFLKAAELDKPFEKKLSRSTSIYM-STSLNLQQTVDDAS 421

Query: 424 GYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
            Y+H+VASGFE SRR  GAIPGSL+PDE+L KELGL
Sbjct: 422 AYVHHVASGFEISRRMFGAIPGSLKPDEELSKELGL 457


>gi|147854598|emb|CAN80695.1| hypothetical protein VITISV_018376 [Vitis vinifera]
          Length = 476

 Score =  706 bits (1821), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/465 (72%), Positives = 387/465 (83%), Gaps = 1/465 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD KQ HIAIFTTASLPW+TGTAVNPLFR AYL K  E +VTLVIPWLSL  Q+ VYP  
Sbjct: 1   MDSKQ-HIAIFTTASLPWMTGTAVNPLFRVAYLTKGREFKVTLVIPWLSLKDQELVYPNK 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF SP EQEAYVR WL +RTGF   F  +FYPGKF+ DK+SIL VGDITEIIPDEEADI
Sbjct: 60  ITFKSPSEQEAYVRQWLGERTGFVCDFSIKFYPGKFSRDKRSILVVGDITEIIPDEEADI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHGKRWK KF  V+GIVHTNYLEYV+REKN RLQAFLL+++N+W+  +
Sbjct: 120 AVLEEPEHLTWYHHGKRWKTKFHLVLGIVHTNYLEYVRREKNGRLQAFLLKYINNWVVDI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+ P SI+CNVHGVNPKFLEIG++K E QQNG++AFTKGAYYIG+MVW
Sbjct: 180 YCHKVIRLSAATQDLPRSIICNVHGVNPKFLEIGKRKNEHQQNGDRAFTKGAYYIGKMVW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LL+   KEL GLE+DLYGNGED DQ+Q AAKKL+L VRVYPG DHADP+FH
Sbjct: 240 SKGYKELLKLLHDNQKELTGLEVDLYGNGEDSDQVQGAAKKLELDVRVYPGHDHADPLFH 299

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVFLNPSTTDV+CT TAEALAMGKIVVCANHPSNDFFKQF NCRTY   +GFV+ TLK
Sbjct: 300 DYKVFLNPSTTDVLCTTTAEALAMGKIVVCANHPSNDFFKQFTNCRTYQDSSGFVKETLK 359

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIE 420
           AL+EEPAQ TDAQ H+LSW++ATERFLQ A L   V  K + +P +   S ++N ++ ++
Sbjct: 360 ALSEEPAQLTDAQMHELSWDAATERFLQAAGLDHVVERKPTDTPPKKFMSMTMNLRKNMD 419

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGK 465
           DA  Y+H+VASG E SRR  GAIPGSLQPDE+  +ELG +  L K
Sbjct: 420 DASAYVHHVASGIEASRRVFGAIPGSLQPDEEQRQELGKLILLSK 464


>gi|351722100|ref|NP_001237488.1| digalactosyldiacylglycerol synthase 2, chloroplastic [Glycine max]
 gi|75120873|sp|Q6DW75.1|DGDG2_SOYBN RecName: Full=Digalactosyldiacylglycerol synthase 2, chloroplastic
 gi|49617331|gb|AAT67421.1| digalactosyldiacylglycerol synthase 2 [Glycine max]
          Length = 463

 Score =  702 bits (1813), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/460 (72%), Positives = 391/460 (85%), Gaps = 2/460 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD+K+ HIAIFTTASLPWLTGTAVNPLFRAAYLAK GER VTLVIPWLSL  Q+ VYP N
Sbjct: 1   MDKKE-HIAIFTTASLPWLTGTAVNPLFRAAYLAKSGERDVTLVIPWLSLKDQRLVYPNN 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITFASP E E Y+  WLE+R GFTS F  +FYPGKF+ DK+SILAVGDI+EIIPD+ ADI
Sbjct: 60  ITFASPSEHEKYICQWLEERVGFTSGFSIQFYPGKFSRDKRSILAVGDISEIIPDKVADI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHGKRWK KFR V+GI+HTNYLEYVKREKN  +QAFLL+++N+W+  +
Sbjct: 120 AVLEEPEHLTWYHHGKRWKTKFRLVIGIIHTNYLEYVKREKNGVMQAFLLKYLNNWVVSI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+Y  SI+CNVHGVNPKFLEIG+KK EQQQ G  AFTKGAY+IG+M+W
Sbjct: 180 YCHKVIRLSAATQDYTGSIICNVHGVNPKFLEIGKKKREQQQKGEHAFTKGAYFIGKMIW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LL  + KEL+ LE+DL+G+GED D++Q+AA+KL+L VRV+P RDHAD +FH
Sbjct: 240 SKGYKELLQLLKDHEKELSALEVDLFGSGEDSDEVQKAAEKLELAVRVHPARDHADALFH 299

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYK+FLNPSTTDVVCT TAEALAMGKIVVCANHPSNDFFKQFPNC TYD  +GFV+ TLK
Sbjct: 300 DYKLFLNPSTTDVVCTTTAEALAMGKIVVCANHPSNDFFKQFPNCWTYDDDDGFVKLTLK 359

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIE 420
           ALAE+PAQPTDAQ H LSWE+AT+RFL+ A+L   +  K S++ S  L + SLN +  ++
Sbjct: 360 ALAEQPAQPTDAQRHDLSWEAATKRFLKAADLDKPLERKLSRTTSNFL-AASLNLQEKVD 418

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           +A  Y+H+VASGFE SRR  GAIP SLQPDE+L KELGL 
Sbjct: 419 EASAYVHHVASGFEVSRRIFGAIPDSLQPDEELRKELGLT 458


>gi|225454079|ref|XP_002266316.1| PREDICTED: digalactosyldiacylglycerol synthase 2, chloroplastic
           [Vitis vinifera]
 gi|297745205|emb|CBI40285.3| unnamed protein product [Vitis vinifera]
          Length = 466

 Score =  702 bits (1813), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/458 (72%), Positives = 384/458 (83%), Gaps = 1/458 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD KQ HIAIFTTASLPW+TGTAVNPLFR AYL K  E +VTLVIPWLSL  Q+ VYP  
Sbjct: 1   MDSKQ-HIAIFTTASLPWMTGTAVNPLFRVAYLTKGREFKVTLVIPWLSLKDQELVYPNK 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF SP EQEAYVR WL +RTGF   F  +FYPGKF+ DK+SIL VGDITEIIPDEEADI
Sbjct: 60  ITFKSPSEQEAYVRQWLGERTGFVCDFSIKFYPGKFSRDKRSILVVGDITEIIPDEEADI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHGKRWK KF  V+GIVHTNYLEYV+REKN RLQAFLL+++N+W+  +
Sbjct: 120 AVLEEPEHLTWYHHGKRWKTKFHLVLGIVHTNYLEYVRREKNGRLQAFLLKYINNWVVDI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+ P SI+CNVHGVNPKFLEIG++K E QQNG++AFTKGAYYIG+MVW
Sbjct: 180 YCHKVIRLSAATQDLPRSIICNVHGVNPKFLEIGKRKNEHQQNGDRAFTKGAYYIGKMVW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LL+   KEL GLE+DLYGNGED DQ+Q AAKKL+L VRVYPG DHADP+FH
Sbjct: 240 SKGYKELLKLLHDNQKELTGLEVDLYGNGEDSDQVQGAAKKLELDVRVYPGHDHADPLFH 299

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVFLNPSTTDV+CT TAEALAMGKIVVCANHPSNDFFKQF NCRTY   +GFV+ TLK
Sbjct: 300 DYKVFLNPSTTDVLCTTTAEALAMGKIVVCANHPSNDFFKQFTNCRTYQDSSGFVKETLK 359

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIE 420
           AL+EEPAQ TDAQ H+LSW++ATERFLQ A L   V  K + +P +   S ++N ++ ++
Sbjct: 360 ALSEEPAQLTDAQMHELSWDAATERFLQAAGLDHVVERKPTDTPPKKFMSMTMNLRKNMD 419

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
           DA  Y+H+VASG E SRR  GAIPGSLQPDE+  +ELG
Sbjct: 420 DASAYVHHVASGIEASRRVFGAIPGSLQPDEEQRQELG 457


>gi|22328179|ref|NP_191964.2| digalactosyldiacylglycerol synthase 2 [Arabidopsis thaliana]
 gi|75161985|sp|Q8W1S1.1|DGDG2_ARATH RecName: Full=Digalactosyldiacylglycerol synthase 2, chloroplastic
 gi|18141112|gb|AAL60504.1|AF421193_1 digalactosyldiacylglycerol synthase [Arabidopsis thaliana]
 gi|33589694|gb|AAQ22613.1| At4g00550 [Arabidopsis thaliana]
 gi|110736406|dbj|BAF00170.1| digalactosyldiacylglycerol synthase [Arabidopsis thaliana]
 gi|332656498|gb|AEE81898.1| digalactosyldiacylglycerol synthase 2 [Arabidopsis thaliana]
          Length = 473

 Score =  692 bits (1787), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/458 (70%), Positives = 383/458 (83%)

Query: 2   DRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNI 61
           ++++QHIAIFTTAS+PWLTGTAVNPLFRAAYLA DGERRVTLVIPWL+L HQK VYP +I
Sbjct: 3   NQQEQHIAIFTTASIPWLTGTAVNPLFRAAYLANDGERRVTLVIPWLTLKHQKLVYPNSI 62

Query: 62  TFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           TF+SP EQEAYVR WLE+R  F   F+ RFYPGKFA DK+SIL VGDI++ IPDEEADIA
Sbjct: 63  TFSSPSEQEAYVRQWLEERVSFRLAFEIRFYPGKFAIDKRSILPVGDISDAIPDEEADIA 122

Query: 122 VLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVH 181
           VLEEPEHLTWFHHG++WK KF +V+GIVHTNYLEYVKREK  R++AF L+++NSW+  ++
Sbjct: 123 VLEEPEHLTWFHHGQKWKTKFNYVIGIVHTNYLEYVKREKQGRVKAFFLKYLNSWVVGIY 182

Query: 182 CHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWS 241
           CHKVIRLSAATQEYP SIVCNVHGVNPKFLEIG +K+EQQ+   + FTKGAYYIG+MVWS
Sbjct: 183 CHKVIRLSAATQEYPKSIVCNVHGVNPKFLEIGLRKLEQQKLQEQPFTKGAYYIGKMVWS 242

Query: 242 KGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHD 301
           KGY+ELL LL  + KELA LE+DLYG+GED ++I+ AA+KL L V VYPGRDHAD +FH+
Sbjct: 243 KGYKELLKLLEKHQKELAELEVDLYGDGEDSEEIKEAARKLDLTVNVYPGRDHADSLFHN 302

Query: 302 YKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKA 361
           YKVFLNPSTTDVVCT TAEALAMGKIVVCANH SN FFKQFPNCRTYD   GFV ATLKA
Sbjct: 303 YKVFLNPSTTDVVCTTTAEALAMGKIVVCANHISNKFFKQFPNCRTYDDGQGFVRATLKA 362

Query: 362 LAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIED 421
           L E+P+Q T+ Q H+LSWE+AT+RF++V++L        + S      S S++  + +ED
Sbjct: 363 LGEQPSQLTEQQRHELSWEAATQRFIKVSDLNRLSRADSNLSKRSVFASSSISVGKNLED 422

Query: 422 AFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
              Y+H++ASGFE SR A GAIPGSLQPDE+LC++LGL
Sbjct: 423 MSAYIHFLASGFEASRTAFGAIPGSLQPDEELCRDLGL 460


>gi|449432175|ref|XP_004133875.1| PREDICTED: digalactosyldiacylglycerol synthase 2,
           chloroplastic-like [Cucumis sativus]
 gi|449480158|ref|XP_004155815.1| PREDICTED: digalactosyldiacylglycerol synthase 2,
           chloroplastic-like [Cucumis sativus]
          Length = 469

 Score =  689 bits (1779), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/465 (69%), Positives = 377/465 (81%), Gaps = 1/465 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MDRKQ HIAIFTTA LPW+TGTAVNPLFRAAYL+ DG+R+VTLV+PWLSL  Q+ VYP N
Sbjct: 1   MDRKQ-HIAIFTTACLPWMTGTAVNPLFRAAYLSMDGDRKVTLVVPWLSLRDQELVYPSN 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
            TF  P +QE Y+R WLE+RTG    F   FYPGKFA DK+SIL VGDI+EIIPD E+DI
Sbjct: 60  TTFILPSQQEKYIRQWLEERTGSKFNFTLLFYPGKFAVDKRSILPVGDISEIIPDHESDI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHGKRWK KF  VVGI+HTNYLEYVKREKN  ++AFLLE++N W+  +
Sbjct: 120 AVLEEPEHLTWYHHGKRWKNKFTLVVGIIHTNYLEYVKREKNGAVEAFLLEYINRWVVHI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+YP SI+CNVHGVNPKFLEIG+KKMEQQQNG + F KGAYYIG+M+W
Sbjct: 180 YCHKVIRLSAATQDYPKSIICNVHGVNPKFLEIGKKKMEQQQNGGQVFGKGAYYIGKMIW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY EL+ LL+ Y K+ A LE+DL+GNGED D++Q+A + L++ V+V+PGRDH DPIFH
Sbjct: 240 SKGYRELVKLLSDYQKDFAELEVDLFGNGEDSDEVQKATQDLEVSVKVHPGRDHTDPIFH 299

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVFLNPSTTDVVCT TAEALAMGK VVCANHPSN+FFKQFPNC  YD  +GFV+A  K
Sbjct: 300 DYKVFLNPSTTDVVCTTTAEALAMGKFVVCANHPSNEFFKQFPNCLLYDNNDGFVKAVFK 359

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIE 420
           AL EEP Q TDAQ H+LSWE+ATERFL+ AEL   +  K  K+ S+   S SL   R  +
Sbjct: 360 ALLEEPEQLTDAQRHELSWEAATERFLKAAELDQSLARKPQKTRSKKFLSLSLQLGRKFD 419

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGK 465
           DA  Y+H+++ GFE SRR  GA+PGSL PDEQ CKELGL +   K
Sbjct: 420 DATAYIHHMSLGFEGSRRVFGAVPGSLHPDEQQCKELGLASPSSK 464


>gi|356532042|ref|XP_003534583.1| PREDICTED: digalactosyldiacylglycerol synthase 2,
           chloroplastic-like [Glycine max]
          Length = 463

 Score =  689 bits (1778), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/460 (70%), Positives = 386/460 (83%), Gaps = 2/460 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD+K+ HIAIFTTASLPWLTGTAVNPLFRAAYLAK GER VTLVIPWLSL  Q  VYP N
Sbjct: 1   MDKKE-HIAIFTTASLPWLTGTAVNPLFRAAYLAKSGERDVTLVIPWLSLKDQGLVYPNN 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           I FASP E E Y+R WLE+R GFT  F  +FYPGKF+ DK+SILAVGDI+EIIPD+ ADI
Sbjct: 60  IKFASPSEHEKYIRQWLEERVGFTCGFSIQFYPGKFSRDKRSILAVGDISEIIPDKVADI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHGKRWK KFR V+GI+HTNYLEYVKREKN  +QAFL++++N+W+  +
Sbjct: 120 AVLEEPEHLTWYHHGKRWKTKFRLVIGIIHTNYLEYVKREKNGTMQAFLMKYLNNWVVGI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+Y  SI+CNVHGVNPKFLEIG+KK EQQQ G +AFTKGAY+IG+M+W
Sbjct: 180 YCHKVIRLSAATQDYTGSIICNVHGVNPKFLEIGKKKREQQQKGEQAFTKGAYFIGKMIW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LL  + KEL+ LE+DL+G+GED +++Q+AA+KL+L V V+P RDHAD  FH
Sbjct: 240 SKGYKELLQLLKDHQKELSALEVDLFGSGEDSNEVQKAAEKLELAVSVHPARDHADARFH 299

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYK+FLNPSTTDVVCT TAEALAMGKIVVCANHPSNDFFKQFPNC TYD  + FV+ TLK
Sbjct: 300 DYKLFLNPSTTDVVCTTTAEALAMGKIVVCANHPSNDFFKQFPNCWTYDDDDEFVKLTLK 359

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIE 420
           ALAE+PAQPTDAQ H LSWE+AT+RFL+ A+L   +  K  ++ S  L + S+N +  ++
Sbjct: 360 ALAEQPAQPTDAQRHDLSWEAATKRFLKAADLDKPLDRKLLRTSSNFL-AASINLQEKVD 418

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           +A  Y+H+VASGFE SRR  GAIP SLQPDE+L KELGL 
Sbjct: 419 EASAYVHHVASGFEVSRRIFGAIPDSLQPDEELRKELGLT 458


>gi|297810121|ref|XP_002872944.1| hypothetical protein ARALYDRAFT_490510 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318781|gb|EFH49203.1| hypothetical protein ARALYDRAFT_490510 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 474

 Score =  685 bits (1767), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/457 (69%), Positives = 380/457 (83%)

Query: 3   RKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNIT 62
           ++QQHIAIFTTAS+PWLTGTAVNPLFRAAYLA DG+RRVTLVIPWL+L HQK VYP +IT
Sbjct: 5   QQQQHIAIFTTASIPWLTGTAVNPLFRAAYLANDGDRRVTLVIPWLTLNHQKLVYPNSIT 64

Query: 63  FASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAV 122
           F+SP E EAYVR WLE+R  F   F+ RFYPGKFA DK+SIL VGDI++ IPDEEADIA 
Sbjct: 65  FSSPSEHEAYVRHWLEERVSFPLGFEIRFYPGKFATDKRSILPVGDISDAIPDEEADIAA 124

Query: 123 LEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHC 182
           LEEPEHLTWFHHGK+WK KF +V+GIVHTNYLEYVKREK  R++AFLL+++NSW+  ++C
Sbjct: 125 LEEPEHLTWFHHGKKWKTKFNYVIGIVHTNYLEYVKREKQGRVKAFLLKYLNSWVVGIYC 184

Query: 183 HKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSK 242
           HKVIRLSAATQEYP SIVCNVHGVNPKFLEIG +K+EQQ+   + FTKGAYYIG+MVWSK
Sbjct: 185 HKVIRLSAATQEYPKSIVCNVHGVNPKFLEIGLRKLEQQKLQEQPFTKGAYYIGKMVWSK 244

Query: 243 GYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDY 302
           GY+ELL LL  + KEL  LE+DLYG+GED ++I+ AA+KL+L V VYPGRDHAD +FH+Y
Sbjct: 245 GYKELLKLLEKHQKELPELEVDLYGDGEDSEEIKAAARKLELTVNVYPGRDHADSLFHNY 304

Query: 303 KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKAL 362
           KVFLNPSTTDVVCT TAEALAMGKIVVCANH SN+FFKQFPNCRTYD   GFV AT+KAL
Sbjct: 305 KVFLNPSTTDVVCTTTAEALAMGKIVVCANHISNEFFKQFPNCRTYDDGKGFVRATVKAL 364

Query: 363 AEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDA 422
            E+P Q T+ Q H+LSWE+AT+RF++  +L      + + S      S S++  + +ED 
Sbjct: 365 GEQPLQLTEQQRHELSWEAATQRFIKACDLSRLSRAESNISKKSVFASSSISMGKNLEDM 424

Query: 423 FGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
             Y+H++ASGFE SR A GAIPGSLQPDE+LC++LGL
Sbjct: 425 SAYIHFLASGFEASRTAFGAIPGSLQPDEELCRDLGL 461


>gi|357501163|ref|XP_003620870.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
 gi|355495885|gb|AES77088.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
          Length = 464

 Score =  685 bits (1767), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/467 (70%), Positives = 386/467 (82%), Gaps = 7/467 (1%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD+K+ H AIFTTASLPWLTGTAVNPLFRAAYL K G R VTLVIPWLSL  QK VYP N
Sbjct: 1   MDQKR-HTAIFTTASLPWLTGTAVNPLFRAAYLYKAGIRNVTLVIPWLSLKDQKVVYPNN 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF SP EQE Y+R WLEDR GF S F  +FYPGKF+ DK+SILAVGDI+EIIPD++ADI
Sbjct: 60  ITFDSPAEQEKYIRQWLEDRVGFASGFSIKFYPGKFSRDKRSILAVGDISEIIPDKDADI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTWFHHGKRWK KF+ V+GI+HTNYL YVKREKN  LQAFLL+++N+W+  +
Sbjct: 120 AVLEEPEHLTWFHHGKRWKTKFKLVIGIIHTNYLAYVKREKNGTLQAFLLKYLNNWVVGI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+YP SIVCNVHGVNPKFLEIG+KK EQQQNG  AFTKGAY+IG+M+W
Sbjct: 180 YCHKVIRLSAATQDYPGSIVCNVHGVNPKFLEIGKKKREQQQNGEIAFTKGAYFIGKMIW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LLN + KEL+ LE+DL+G+GED D++Q AAKKL++ VRV+P  DHAD +FH
Sbjct: 240 SKGYKELLQLLNDHQKELSALELDLFGSGEDSDEVQEAAKKLEMTVRVHPACDHADGLFH 299

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           D+K+F+NPSTTDVVCT TAEALAMGKIVVCA+H SN+FFKQFPNC TY+    FVE TLK
Sbjct: 300 DFKLFINPSTTDVVCTTTAEALAMGKIVVCADHCSNEFFKQFPNCWTYNNHKEFVELTLK 359

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKR---SKSPSRHLESESLNSKR 417
           AL EEP QPTDAQ H LSWE+ATERFL+  +L  D  ++R   S++ S +L S SL  ++
Sbjct: 360 ALTEEPGQPTDAQRHDLSWEAATERFLKAVDL--DKPSERKLLSRTTSNYL-STSLYLQQ 416

Query: 418 IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELG 464
            +EDA  ++H+VASGFE SRR  GAIP SLQPDEQL KELG     G
Sbjct: 417 TVEDASAFVHHVASGFEISRRIFGAIPHSLQPDEQLRKELGFANTSG 463


>gi|357507141|ref|XP_003623859.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
 gi|124360223|gb|ABN08236.1| digalactosyldiacylglycerol synthase 2, putative [Medicago
           truncatula]
 gi|355498874|gb|AES80077.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
          Length = 464

 Score =  683 bits (1763), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/467 (70%), Positives = 384/467 (82%), Gaps = 7/467 (1%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD K+ HIAIFTTASLPWLTGTAVNPLFRAAYL K G + VTLVIPWLSL  QK VYP N
Sbjct: 1   MDWKR-HIAIFTTASLPWLTGTAVNPLFRAAYLYKSGIKNVTLVIPWLSLKDQKVVYPNN 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF SP EQE Y+R WLEDR GF S F  +FYPGKF+ DK+SILAVGDI+EIIPD +ADI
Sbjct: 60  ITFDSPAEQEKYIRQWLEDRVGFASGFSIKFYPGKFSRDKRSILAVGDISEIIPDNDADI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTWFHHGKRWK KF+ V+GI+HTNYL YVKREKN  LQAFLL+++N+W+  +
Sbjct: 120 AVLEEPEHLTWFHHGKRWKTKFKLVIGIIHTNYLAYVKREKNGNLQAFLLKYLNNWVVGI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+YP S+VCNVHGVNPKFLEIG+KK EQQQNG  AFTKGAY IG+M+W
Sbjct: 180 YCHKVIRLSAATQDYPGSVVCNVHGVNPKFLEIGKKKREQQQNGENAFTKGAYLIGKMIW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LLN + KEL+ LE+DL+G+GED +++Q AAKKL++ VRV+P RDHAD +FH
Sbjct: 240 SKGYKELLQLLNDHQKELSALELDLFGSGEDSNEVQEAAKKLEMTVRVHPARDHADDLFH 299

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           D+K+F+NPSTTDVVCT TAEALAMGKIVVCA+H SN+FFKQFPNC TY+    FV  TLK
Sbjct: 300 DFKLFINPSTTDVVCTTTAEALAMGKIVVCADHCSNEFFKQFPNCWTYNNPKEFVALTLK 359

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKR---SKSPSRHLESESLNSKR 417
           AL EEP QPTDAQ H LSWE+ATERFL+  +L  D  ++R   S+S S +L S SL  ++
Sbjct: 360 ALTEEPGQPTDAQRHDLSWEAATERFLKAVDL--DKPSERELLSRSTSNYL-STSLYLQQ 416

Query: 418 IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELG 464
            +EDA  ++H+VASGFE SRR  GAIP SLQPDEQL KELG     G
Sbjct: 417 TVEDASAFVHHVASGFEISRRIFGAIPHSLQPDEQLRKELGFANTSG 463


>gi|223946733|gb|ACN27450.1| unknown [Zea mays]
 gi|414865520|tpg|DAA44077.1| TPA: hypothetical protein ZEAMMB73_930994 [Zea mays]
 gi|414865521|tpg|DAA44078.1| TPA: hypothetical protein ZEAMMB73_930994 [Zea mays]
          Length = 475

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 292/458 (63%), Positives = 365/458 (79%), Gaps = 3/458 (0%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H AIFTTASLPW+TGT++NPLFRAAYLAKDG++ VTLVIPWL L  Q+ VYP NI F 
Sbjct: 4   KRHFAIFTTASLPWMTGTSINPLFRAAYLAKDGDKDVTLVIPWLCLRDQELVYPNNIVFD 63

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP EQE+YVR W+E+R  F  +F  +FYPGKF+ + +SIL VGDITE IPDE AD+AVLE
Sbjct: 64  SPSEQESYVRHWIEERIDFRPSFSIKFYPGKFSKEMRSILPVGDITECIPDEVADVAVLE 123

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG+RWK KFR V+GIVHTNYL YV+REKN ++ A  L++ N+W+ R++CHK
Sbjct: 124 EPEHLNWYHHGRRWKNKFRRVIGIVHTNYLAYVRREKNGQVIACFLKYANTWVTRIYCHK 183

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           +IRLS ATQ  P S++CNVHGVNPKFLE+G+ K+ Q QNG KAFTKGAYYIG+MVWSKGY
Sbjct: 184 IIRLSGATQNLPRSVICNVHGVNPKFLEVGKLKLRQLQNGEKAFTKGAYYIGKMVWSKGY 243

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            ELL LL+ Y  +L GLE+DLYG+GED D++  +AK+L L V V+PGRDHADP+FH+YKV
Sbjct: 244 RELLDLLSKYQSKLVGLEVDLYGSGEDSDEVCESAKRLSLSVNVHPGRDHADPLFHEYKV 303

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPSTTDVVCT TAEALAMGKIV+CANHPSN+FFKQFPNCR YD  + FV+ TL AL+E
Sbjct: 304 FINPSTTDVVCTTTAEALAMGKIVICANHPSNEFFKQFPNCRVYDNEDEFVQLTLNALSE 363

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRI---IED 421
           +PA  TD Q ++LSW++ATERF++ +++   V   R+   SR L    L ++++   +ED
Sbjct: 364 QPAPLTDTQRYELSWDAATERFIEASDINPHVPESRAHQNSRALLPAFLRTRKLKQNLED 423

Query: 422 AFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
           A  YLH   SG E +R A GA+P +LQPDE LCK+LGL
Sbjct: 424 ASVYLHQALSGLEVTRCAFGAVPKTLQPDEHLCKDLGL 461


>gi|115451541|ref|NP_001049371.1| Os03g0214400 [Oryza sativa Japonica Group]
 gi|108706834|gb|ABF94629.1| digalactosyldiacylglycerol synthase 2, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113547842|dbj|BAF11285.1| Os03g0214400 [Oryza sativa Japonica Group]
 gi|222624449|gb|EEE58581.1| hypothetical protein OsJ_09908 [Oryza sativa Japonica Group]
          Length = 476

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 297/463 (64%), Positives = 368/463 (79%), Gaps = 12/463 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+A+FTTASLPW+TGTA+NPLFRAAYLAKDG+R VTLVIPWLSL  Q+ VYP  I F 
Sbjct: 4   RRHLAVFTTASLPWMTGTAINPLFRAAYLAKDGDREVTLVIPWLSLRDQELVYPNKIVFG 63

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP E EAYVR W+E+R  F  +F  +FYP KF+ + +SIL VGDITE IPDE AD+AVLE
Sbjct: 64  SPLEHEAYVRRWIEERISFRPSFSIKFYPAKFSKELRSILPVGDITECIPDEVADVAVLE 123

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG++WK KFR V+G+VHTNY+ YVKREKN +  A  ++++N+W+ R++CHK
Sbjct: 124 EPEHLNWYHHGRKWKNKFRRVIGVVHTNYIAYVKREKNGQAIACFMKYMNTWVTRIYCHK 183

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           +IRLS ATQ+ P S+VCNVHGVNPKFLE+G+ K++Q +NG  AFTKGAYYIG+MVWSKGY
Sbjct: 184 IIRLSGATQDLPRSVVCNVHGVNPKFLEVGKLKLKQLRNGEIAFTKGAYYIGKMVWSKGY 243

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            ELL LL+ Y  +L GLE+DLYG+GED D+++++A+ L L V+V+PGRDHADP+FHDYKV
Sbjct: 244 RELLDLLSKYQSKLVGLEVDLYGSGEDSDEVRKSAELLSLAVKVHPGRDHADPLFHDYKV 303

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPSTTDVVCT TAEALAMGKIV+CANHPSN+FFKQFPNCRTYD    FV+ TL ALAE
Sbjct: 304 FINPSTTDVVCTTTAEALAMGKIVICANHPSNEFFKQFPNCRTYDSGEEFVQLTLDALAE 363

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESE-SLNS-------K 416
           +PA  TD Q+H LSWE+ATERF++ AEL   +    S  P  H  S+ SL +       K
Sbjct: 364 QPAPLTDMQSHDLSWEAATERFMEAAELNLPI----SAEPRIHQTSKVSLPAFMRTRKLK 419

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
           + +EDA  YLH   SG E +RRA GA+P +LQPDEQLC +LGL
Sbjct: 420 QSLEDASVYLHQALSGLEVTRRAFGAVPKTLQPDEQLCNDLGL 462


>gi|293333049|ref|NP_001168993.1| uncharacterized protein LOC100382823 [Zea mays]
 gi|223974317|gb|ACN31346.1| unknown [Zea mays]
          Length = 475

 Score =  637 bits (1644), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 290/458 (63%), Positives = 365/458 (79%), Gaps = 3/458 (0%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H AIFTTASLPW+TGT++NPLFRAAYLAKDG++ VTLVIPWL L  Q+ +YP NI F 
Sbjct: 4   KRHFAIFTTASLPWMTGTSINPLFRAAYLAKDGDKDVTLVIPWLCLRDQELLYPNNIVFD 63

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP EQE+YVR W+E+R  F  +F  +FYPGKF+ + +SIL VGDITE IPDE AD+AVLE
Sbjct: 64  SPSEQESYVRHWIEERIDFRPSFSIKFYPGKFSKEMRSILPVGDITECIPDEVADVAVLE 123

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG+RWK KFR V+GIVHTNYL YV+REKN ++ A  L++ N+W+ R++CHK
Sbjct: 124 EPEHLNWYHHGRRWKNKFRRVIGIVHTNYLAYVRREKNGQVIACFLKYANTWVTRIYCHK 183

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           +IRLS ATQ  P S++CNVHGVNPKFLE+G+ K+ Q QNG +AFTKGAYYIG+MVWSKGY
Sbjct: 184 IIRLSGATQNLPRSVICNVHGVNPKFLEVGKIKLRQLQNGEEAFTKGAYYIGKMVWSKGY 243

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            ELL LL+ Y  +L GLE+DLYG+GED D++  +AK+L L V V+PGRDHADP+FH+YKV
Sbjct: 244 RELLDLLSKYQSKLVGLEVDLYGSGEDSDEVCESAKRLSLSVNVHPGRDHADPLFHEYKV 303

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPSTTDVVCT TAEALAMGKIV+CANHPSN+FFKQFPNCR YD  + FV+ TL AL+E
Sbjct: 304 FINPSTTDVVCTTTAEALAMGKIVICANHPSNEFFKQFPNCRVYDNEDEFVQLTLNALSE 363

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRI---IED 421
           +PA  TD Q ++LSW++ATERF++ +++   V   R+   SR L    L ++++   +ED
Sbjct: 364 QPAPLTDTQRYELSWDAATERFIEASDINPHVPESRAHQNSRALLPAFLRTRKLKQNLED 423

Query: 422 AFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
           A  YLH   SG E +R A GA+P +LQPDE LCK+LGL
Sbjct: 424 ASVYLHQALSGLEVTRCAFGAVPKTLQPDEHLCKDLGL 461


>gi|357120344|ref|XP_003561887.1| PREDICTED: digalactosyldiacylglycerol synthase 2,
           chloroplastic-like [Brachypodium distachyon]
          Length = 579

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 290/464 (62%), Positives = 360/464 (77%), Gaps = 3/464 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M   ++H AIFTTASLPW+TGT++NPLFRAAYLAKDG+R VTLVIPWLSL  Q+ VYP  
Sbjct: 103 MSSTRRHFAIFTTASLPWMTGTSINPLFRAAYLAKDGDRDVTLVIPWLSLRDQELVYPNK 162

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           I F S  E E YVR W+E+R  F  +F  +FYPGKF+ + +SIL VGDIT+ IPDE ADI
Sbjct: 163 IVFDSLLEHEGYVRRWIEERIDFKPSFGIKFYPGKFSKEMRSILPVGDITDCIPDEVADI 222

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHL W+HHG+RWK KFR V+G+VHTNYL YV+REKN ++ A  L + N+W+ R+
Sbjct: 223 AVLEEPEHLNWYHHGRRWKKKFRRVIGVVHTNYLAYVRREKNGQVIACFLRYANTWVTRI 282

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHK+IRLS ATQ+ P S++CNVHGVNPKFLE+G+ K++Q Q G  AFTKGAYYIG+MVW
Sbjct: 283 YCHKIIRLSGATQDLPKSVICNVHGVNPKFLEVGKLKLKQLQTGEAAFTKGAYYIGKMVW 342

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY ELL LL+ Y   L GLE+DLYG+GED D+++ +A++L L V+V+PGRDHADP+FH
Sbjct: 343 SKGYRELLDLLSKYQTRLGGLEVDLYGSGEDSDEVRESAERLNLAVKVHPGRDHADPLFH 402

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVF+NPSTTDVVCT +AEALAMGKIV+CANHPSN+FFKQFPNCR Y+    FV+ TL 
Sbjct: 403 DYKVFINPSTTDVVCTTSAEALAMGKIVICANHPSNEFFKQFPNCRIYNNDEEFVQVTLN 462

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRI-- 418
           ALAE+PA  TD Q + LSWE+AT+RF+Q AE+   V   R    SR      L ++++  
Sbjct: 463 ALAEQPAPLTDMQMYDLSWEAATDRFMQAAEINLSVAEPRIHQASRAYFPTFLRTRKLTQ 522

Query: 419 -IEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVT 461
            +EDA  YLH   SG E +R A GA+P +LQPDEQL K+LGL +
Sbjct: 523 SLEDASVYLHQALSGLEVTRCAFGAVPKTLQPDEQLRKDLGLAS 566


>gi|115452143|ref|NP_001049672.1| Os03g0268300 [Oryza sativa Japonica Group]
 gi|29893654|gb|AAP06908.1| name [Oryza sativa Japonica Group]
 gi|29893655|gb|AAP06909.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707380|gb|ABF95175.1| digalactosyldiacylglycerol synthase 2, putative, expressed [Oryza
           sativa Japonica Group]
 gi|108707381|gb|ABF95176.1| digalactosyldiacylglycerol synthase 2, putative, expressed [Oryza
           sativa Japonica Group]
 gi|108707382|gb|ABF95177.1| digalactosyldiacylglycerol synthase 2, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113548143|dbj|BAF11586.1| Os03g0268300 [Oryza sativa Japonica Group]
 gi|125543255|gb|EAY89394.1| hypothetical protein OsI_10900 [Oryza sativa Indica Group]
 gi|215734947|dbj|BAG95669.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  618 bits (1593), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 291/456 (63%), Positives = 357/456 (78%), Gaps = 1/456 (0%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           ++QHIAIFTTASLPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q  VYP  + F
Sbjct: 3   RKQHIAIFTTASLPWMTGTAVNPLFRAAYLAKAGDWEVTLVVPWLSKGDQLLVYPNKMKF 62

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
           + P EQE YVR WLE+R G    F+ +FYPGKF+ +K+SIL  GDIT+ + D++ADIAVL
Sbjct: 63  SVPGEQEGYVRRWLEERIGLLPKFEIKFYPGKFSTEKRSILPAGDITQTVSDDKADIAVL 122

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTW+HHG+RWK KFR V+G+VHTNYLEYVKRE+N  + AFLL+ +NSW+  ++CH
Sbjct: 123 EEPEHLTWYHHGRRWKNKFRKVIGVVHTNYLEYVKRERNGYIHAFLLKHINSWVTDIYCH 182

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
           KVIRLSAATQE P SIVCNVHGVNPKF+EIG+ K +Q     +AF KGAYYIG+MVWSKG
Sbjct: 183 KVIRLSAATQEVPRSIVCNVHGVNPKFIEIGKLKHQQISQREQAFFKGAYYIGKMVWSKG 242

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYK 303
           Y ELL LL  + KEL+GL+M+LYG+GED D+++ +A+KL L VRVYPGRDH D IFHDYK
Sbjct: 243 YTELLQLLQKHQKELSGLKMELYGSGEDSDEVKASAEKLNLDVRVYPGRDHGDSIFHDYK 302

Query: 304 VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALA 363
           VF+NPSTTDVVCT TAEALAMGKIV+CANHPSN+FFK+FPNC  Y+    FV  T+KALA
Sbjct: 303 VFINPSTTDVVCTTTAEALAMGKIVICANHPSNEFFKRFPNCHMYNTEKEFVRLTMKALA 362

Query: 364 EEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKR-SKSPSRHLESESLNSKRIIEDA 422
           EEP   ++   H+LSWE+ATERF++VA++   +  K+ S SP   +       K+ +E+A
Sbjct: 363 EEPIPLSEELRHELSWEAATERFVRVADIAPIMSIKQHSPSPQYFMYISPDELKKNMEEA 422

Query: 423 FGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
             + H   SGFET+R   GAIP +LQPDEQ CKELG
Sbjct: 423 SAFFHNAISGFETARCVFGAIPNTLQPDEQQCKELG 458


>gi|326509111|dbj|BAJ86948.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 466

 Score =  616 bits (1589), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 292/456 (64%), Positives = 352/456 (77%), Gaps = 1/456 (0%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           ++QHI IFTTASLPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q  VYP  + F
Sbjct: 5   RRQHITIFTTASLPWMTGTAVNPLFRAAYLAKAGDWEVTLVVPWLSKGDQTLVYPNKMKF 64

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
           + P EQE YVR WLE+RTG    F+  FYPGKF+ +K+SIL VGDITE I DE ADIAVL
Sbjct: 65  SGPAEQEGYVRRWLEERTGPLPRFNINFYPGKFSTEKRSILPVGDITETISDENADIAVL 124

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTW+HHG+RWK KFR V+G+VHTNYLEYVKRE+N  + AFLL+ +NSW+  ++CH
Sbjct: 125 EEPEHLTWYHHGRRWKNKFRKVIGVVHTNYLEYVKRERNGYIHAFLLKHINSWVTDIYCH 184

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
           KVIRLS ATQE P S++CNVHGVNPKF+EIG+ K +Q     ++F KGAYYIG+MVWSKG
Sbjct: 185 KVIRLSGATQEVPRSVICNVHGVNPKFIEIGKLKHQQISQREQSFFKGAYYIGKMVWSKG 244

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYK 303
           Y ELL LL  + KEL+GL+M+LYG+GED D+++ +A+KL L VRVYPGRDHAD IFHDYK
Sbjct: 245 YTELLHLLQKHQKELSGLKMELYGSGEDSDEVKASAEKLNLDVRVYPGRDHADSIFHDYK 304

Query: 304 VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALA 363
           VF+NPSTTDVVCT TAEALAMGKIV+CANHPSN+FFK+FPNC  Y     FV  T+KAL 
Sbjct: 305 VFINPSTTDVVCTTTAEALAMGKIVICANHPSNEFFKRFPNCHMYSTEKEFVRLTMKALT 364

Query: 364 EEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPS-RHLESESLNSKRIIEDA 422
           EEP   T+   H+LSWE+ATERF++VA++      K+  S S R +       K+ +E+A
Sbjct: 365 EEPIPLTEELRHELSWEAATERFVRVADIAPATPAKQHPSSSQRFMYINPDELKKNMEEA 424

Query: 423 FGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
             + H   SGFE +R   GAIP SLQPDEQ CKELG
Sbjct: 425 SAFFHNTISGFEAARCVFGAIPNSLQPDEQQCKELG 460


>gi|326524592|dbj|BAK00679.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  615 bits (1586), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 282/460 (61%), Positives = 353/460 (76%), Gaps = 3/460 (0%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H A+FTTASLPW+TGT++NPLFRAAYLAKDG R VTL+IPWLSL  Q+ VYP  + F 
Sbjct: 4   RRHFAVFTTASLPWMTGTSINPLFRAAYLAKDGSRDVTLLIPWLSLRDQELVYPNKVVFG 63

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP EQE YVR W+E+R  F  +F  +FYP KF+ + +SIL VGDITE IPDE ADIAVLE
Sbjct: 64  SPLEQEGYVRRWIEERVDFRPSFSIKFYPAKFSKEMRSILPVGDITECIPDEAADIAVLE 123

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG+RWK KFR V+G++HTNYL YV+REKN ++ A  L + N+W+ R++CHK
Sbjct: 124 EPEHLNWYHHGRRWKNKFRRVIGVIHTNYLAYVRREKNGQVIACCLRYANTWVTRIYCHK 183

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           +IRLS ATQ+ P S+VCNVHGVNPKFLE+G  K++Q QNG  AFTKGAYYIG+MVWSKGY
Sbjct: 184 IIRLSGATQDLPKSVVCNVHGVNPKFLEVGRLKLKQLQNGEAAFTKGAYYIGKMVWSKGY 243

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            ELL LL+ Y   L  LE+DL G+GED D+I+ +A++  L V V+PGRDHADP+FHDYKV
Sbjct: 244 RELLDLLSKYQSRLGDLEVDLDGSGEDSDEIRESAERSSLAVNVHPGRDHADPLFHDYKV 303

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPSTTDVVCT +AEAL MGKIV+CANHPSN+FFKQFPNCR Y+    FV+ TL ALAE
Sbjct: 304 FINPSTTDVVCTTSAEALEMGKIVICANHPSNEFFKQFPNCRIYNNDEEFVQHTLNALAE 363

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRI---IED 421
           +P   TD Q + LSWE+AT+RF++ AE+       R +  S+      L ++++   +ED
Sbjct: 364 QPTPLTDMQMYDLSWEAATDRFMEAAEINLPTAEPRIQQASKTYFPTFLRTRKLKQSLED 423

Query: 422 AFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVT 461
           A  YLH   SG E +R A GA+P +LQPDEQL K+LGL +
Sbjct: 424 ASVYLHQALSGLEVTRCAFGAVPKTLQPDEQLRKDLGLAS 463


>gi|357112938|ref|XP_003558262.1| PREDICTED: digalactosyldiacylglycerol synthase 2,
           chloroplastic-like [Brachypodium distachyon]
          Length = 464

 Score =  614 bits (1583), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 286/456 (62%), Positives = 356/456 (78%), Gaps = 1/456 (0%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           ++QH+ I TTASLPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q  VYP  + F
Sbjct: 3   RRQHVTIVTTASLPWMTGTAVNPLFRAAYLAKTGDWEVTLVVPWLSKADQMLVYPNKMKF 62

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
            +P EQEAYVR WLE+R G    F+  FYPGKF+ +K+SIL VGDITE + D +ADIAVL
Sbjct: 63  GAPGEQEAYVRRWLEERIGALPRFNINFYPGKFSTEKRSILPVGDITETVSDAKADIAVL 122

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTW+HHG+RWK KFR V+G+VHTNYLEYVKRE+N  + AFLL+ +NSW+  ++CH
Sbjct: 123 EEPEHLTWYHHGRRWKTKFRKVIGVVHTNYLEYVKRERNGYIHAFLLKHINSWVTDIYCH 182

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
           KVIRLS ATQ+ P S++CNVHGVNPKF+EIG+ K  Q     ++F KGAYYIG+MVWSKG
Sbjct: 183 KVIRLSGATQDVPRSVICNVHGVNPKFIEIGKLKQRQISQREQSFFKGAYYIGKMVWSKG 242

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYK 303
           Y ELL L + + KEL+GL+M+LYG+GED D+++ +A+KL L VRVYPGRDHAD IFHDYK
Sbjct: 243 YTELLQLFHKHQKELSGLKMELYGSGEDADEVKASAEKLNLDVRVYPGRDHADSIFHDYK 302

Query: 304 VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALA 363
           VF+NPSTTDVVCT TAEALAMGKIV+CANHPSN+FFK+FPNC  Y+    FV+ T+KALA
Sbjct: 303 VFINPSTTDVVCTTTAEALAMGKIVICANHPSNEFFKRFPNCHMYNTEKEFVKLTMKALA 362

Query: 364 EEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNS-KRIIEDA 422
           EEP   ++   H+LSWE+ATERF++VA++   +  K++ S S+H    + +  K+ +EDA
Sbjct: 363 EEPITLSEELRHELSWEAATERFVRVADIAPAMPIKQNPSSSQHFMYINPDELKKNMEDA 422

Query: 423 FGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
             + H   SG E +R   GAIP SLQPDEQ CKELG
Sbjct: 423 SAFFHNTISGIEAARWVFGAIPNSLQPDEQQCKELG 458


>gi|223947965|gb|ACN28066.1| unknown [Zea mays]
 gi|223948123|gb|ACN28145.1| unknown [Zea mays]
 gi|238013614|gb|ACR37842.1| unknown [Zea mays]
 gi|414866033|tpg|DAA44590.1| TPA: hypothetical protein ZEAMMB73_630561 [Zea mays]
 gi|414866034|tpg|DAA44591.1| TPA: hypothetical protein ZEAMMB73_630561 [Zea mays]
          Length = 464

 Score =  613 bits (1582), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 287/459 (62%), Positives = 358/459 (77%), Gaps = 1/459 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M RK QH+AIFTTA LPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q+ VYP  
Sbjct: 1   MARKHQHVAIFTTACLPWMTGTAVNPLFRAAYLAKAGDWEVTLVVPWLSKGDQELVYPNK 60

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           + F+ P EQE YVR WLE+RTG    FD +FYPGKF+ +K+SIL VGDI++ I D++ADI
Sbjct: 61  MRFSLPAEQENYVRRWLEERTGLLPKFDIKFYPGKFSTEKRSILPVGDISQTISDDKADI 120

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHG+RWK+KF+ V+G+VHTNYLEYVKREKN  + AF+L+ +NSW+  +
Sbjct: 121 AVLEEPEHLTWYHHGRRWKSKFQKVIGVVHTNYLEYVKREKNGYISAFILKHINSWVTDI 180

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+ P S++CNVHGVNPKF+EIG+ K +Q     +AF KGAYYIG+MVW
Sbjct: 181 YCHKVIRLSAATQDVPRSVICNVHGVNPKFIEIGKLKHQQLCQREQAFFKGAYYIGKMVW 240

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY ELL LL+ +  EL+GL+M+LYG+GED D+++ +A++L L +RVYPGRDH D IFH
Sbjct: 241 SKGYTELLQLLHKHQMELSGLKMELYGSGEDADEVKASAERLSLDIRVYPGRDHGDSIFH 300

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVF+NPSTTDVVCT TAEALAMGKIV+CANHPSN FFK+FPNC  Y+    FV  T+K
Sbjct: 301 DYKVFINPSTTDVVCTTTAEALAMGKIVICANHPSNVFFKRFPNCHMYNTDEEFVRLTMK 360

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNS-KRII 419
           ALAEEP   +D   H+LSWE+ATERF++VA++   V   +    S+H    S +  ++ +
Sbjct: 361 ALAEEPIPLSDDLRHELSWEAATERFIRVADIAPTVPVGQPPPSSQHFMRISPDELQKNM 420

Query: 420 EDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
           E+A  + H   SGFE  R   GAIP +LQPDEQ CKELG
Sbjct: 421 EEASAFFHNTISGFEAVRCVFGAIPNTLQPDEQQCKELG 459


>gi|125542897|gb|EAY89036.1| hypothetical protein OsI_10520 [Oryza sativa Indica Group]
          Length = 459

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 286/449 (63%), Positives = 353/449 (78%), Gaps = 12/449 (2%)

Query: 19  LTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLE 78
           +TGTA+NPLFRAAYL KDG+R VTLVIPWLSL  Q+ VYP  I F SP E EAYVR W+E
Sbjct: 1   MTGTAINPLFRAAYLTKDGDREVTLVIPWLSLRDQELVYPNKIVFGSPLELEAYVRRWIE 60

Query: 79  DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGKRW 138
           +R  F  +F  +FYP KF+ + +SIL VGDITE IPDE AD+AVLEEPEHL W+HHG++W
Sbjct: 61  ERISFRPSFSIKFYPAKFSKELRSILPVGDITECIPDEVADVAVLEEPEHLNWYHHGRKW 120

Query: 139 KAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNS 198
           K KFR V+G+VHTNY+ YVKREKN +  A  ++++N+W+ R++CHK+IRLS ATQ+ P S
Sbjct: 121 KNKFRRVIGVVHTNYIAYVKREKNGQAIACFMKYMNTWVTRIYCHKIIRLSGATQDLPRS 180

Query: 199 IVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKEL 258
           +VCNVHGVNPKFLE+G+ K++Q +NG  AFTKGAYYIG+MVWSKGY ELL LL+ Y  +L
Sbjct: 181 VVCNVHGVNPKFLEVGKLKLKQLRNGEIAFTKGAYYIGKMVWSKGYRELLDLLSKYQSKL 240

Query: 259 AGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTAT 318
            GLE+DLYG+GED D+++++A+ L L V+V+PGRDHADP+FHDYKVF+NPSTTDVVCT T
Sbjct: 241 VGLEVDLYGSGEDSDEVRKSAELLSLAVKVHPGRDHADPLFHDYKVFINPSTTDVVCTTT 300

Query: 319 AEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLS 378
           AEALAMGKIV+CANHPSN+FFKQFPNCRTYD    FV+ TL ALAE+PA  TD Q+H LS
Sbjct: 301 AEALAMGKIVICANHPSNEFFKQFPNCRTYDSGEEFVQLTLDALAEQPAPLTDMQSHDLS 360

Query: 379 WESATERFLQVAELVGDVVTKRSKSPSRHLESE-SLNS-------KRIIEDAFGYLHYVA 430
           WE+ATERF++ AEL   +    S  P  H  S+ SL +       K+ +EDA  YLH   
Sbjct: 361 WEAATERFMEAAELNLPI----SAEPRIHQTSKASLPAFMRTRKLKQSLEDASVYLHQAL 416

Query: 431 SGFETSRRALGAIPGSLQPDEQLCKELGL 459
           SG E +RRA GA+P +LQPDEQLC +LGL
Sbjct: 417 SGLEVTRRAFGAVPKTLQPDEQLCNDLGL 445


>gi|242036203|ref|XP_002465496.1| hypothetical protein SORBIDRAFT_01g039910 [Sorghum bicolor]
 gi|241919350|gb|EER92494.1| hypothetical protein SORBIDRAFT_01g039910 [Sorghum bicolor]
          Length = 464

 Score =  612 bits (1577), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 288/459 (62%), Positives = 356/459 (77%), Gaps = 1/459 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M RKQQHIAIFTTA LPW+TGTAVNPLFRAAYLAK G   VTLV+PWLS   Q+ VYP  
Sbjct: 1   MARKQQHIAIFTTACLPWMTGTAVNPLFRAAYLAKAGNWEVTLVVPWLSKGDQELVYPNK 60

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           + F+SP EQE YVR WLE+RTG    FD +FYPGKF+ +K+SIL VGDI++ I D++ADI
Sbjct: 61  MKFSSPAEQENYVRQWLEERTGPLPKFDIKFYPGKFSTEKRSILPVGDISQTISDDKADI 120

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHG+RWK+KF  V+G+VHTNYLEYVKREKN  + AF+L+ +NSW+  +
Sbjct: 121 AVLEEPEHLTWYHHGRRWKSKFPKVIGVVHTNYLEYVKREKNGYISAFILKHINSWVTDI 180

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+ P S++CNVHGVNPKF+EIG+ K ++     +AF KGAYYIG+MVW
Sbjct: 181 YCHKVIRLSAATQDVPRSVICNVHGVNPKFIEIGKLKHQELCQREQAFFKGAYYIGKMVW 240

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY ELL LL+ +  EL+ L+M+LYG+GED D+++ +A++L L +RVYPGRDH D IFH
Sbjct: 241 SKGYTELLHLLHKHQMELSVLKMELYGSGEDADEVKASAERLSLDIRVYPGRDHGDSIFH 300

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVF+NPSTTDVVCT TAEALAMGKIV+CANHPSNDFFK+FPNC  Y     FV  T+K
Sbjct: 301 DYKVFINPSTTDVVCTTTAEALAMGKIVICANHPSNDFFKRFPNCHMYSTEKEFVTLTMK 360

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESL-NSKRII 419
           AL EEP   +D   H+LSWE+ATERF++VA++   +   +    S H    S  N ++ +
Sbjct: 361 ALGEEPIPLSDDLRHELSWEAATERFVRVADIAPTMPVGQQPPASPHFMRISPDNLQKNM 420

Query: 420 EDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
           E+A  + H   SG ET+R   GAIP +LQPDEQ CKELG
Sbjct: 421 EEASAFFHNTISGIETARCVFGAIPNTLQPDEQQCKELG 459


>gi|226504524|ref|NP_001144526.1| uncharacterized protein LOC100277520 [Zea mays]
 gi|195643464|gb|ACG41200.1| hypothetical protein [Zea mays]
          Length = 428

 Score =  571 bits (1471), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 263/408 (64%), Positives = 327/408 (80%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M RK QH+AIFTTA LPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q+ VYP  
Sbjct: 1   MARKHQHVAIFTTACLPWMTGTAVNPLFRAAYLAKAGDWEVTLVVPWLSKGDQELVYPNK 60

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           + F+ P EQE YVR WLE+RTG    FD +FYPGKF+ +K+SIL VGDI++ I D++ADI
Sbjct: 61  MRFSLPAEQENYVRRWLEERTGLLPKFDIKFYPGKFSTEKRSILPVGDISQTISDDKADI 120

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHG+RWK+KF+ V+G+VHTNYLEYVKREKN  + AF+L+ +NSW+  +
Sbjct: 121 AVLEEPEHLTWYHHGRRWKSKFQKVIGVVHTNYLEYVKREKNGYISAFILKHINSWVTDI 180

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAATQ+ P S++CNVHGVNPKF+EIG+ K +Q     +AF KGAYYIG+MVW
Sbjct: 181 YCHKVIRLSAATQDVPRSVICNVHGVNPKFIEIGKLKHQQLCQREQAFFKGAYYIGKMVW 240

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY ELL LL+ +  EL+GL+M+LYG+GED D+++ +A++L L +RVYPGRDH D IFH
Sbjct: 241 SKGYTELLQLLHKHQMELSGLKMELYGSGEDADEVKASAERLSLDIRVYPGRDHGDSIFH 300

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           DYKVF+NPSTTDVVCT TAEALAMGKIV+CANHPSN FFK+FPNC  Y+    FV  T+K
Sbjct: 301 DYKVFINPSTTDVVCTTTAEALAMGKIVICANHPSNVFFKRFPNCHMYNTDEEFVRLTMK 360

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHL 408
           ALAEEP   +D   H+LSWE+ATERF++VA++   V   +    S+H 
Sbjct: 361 ALAEEPTPLSDDLRHELSWEAATERFIRVADIAPTVPVGQPPPSSQHF 408


>gi|302816216|ref|XP_002989787.1| digalactosyldiacylglycerol synthase [Selaginella moellendorffii]
 gi|300142353|gb|EFJ09054.1| digalactosyldiacylglycerol synthase [Selaginella moellendorffii]
          Length = 808

 Score =  552 bits (1423), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 261/471 (55%), Positives = 344/471 (73%), Gaps = 6/471 (1%)

Query: 2   DRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNI 61
           D  ++ IAI TTASLPW+TGTAVNPLFRAAYLAKDG R VTL+IPWL    Q  VYP ++
Sbjct: 337 DDGRRKIAIVTTASLPWMTGTAVNPLFRAAYLAKDGMRNVTLLIPWLCKKDQALVYPSHM 396

Query: 62  TFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           +F +P+EQE+YVR WLE+R GF + F   FYPG+F+  K+SILA GDI++ IPD+EAD+A
Sbjct: 397 SFNTPEEQESYVRTWLEERIGFKADFKIAFYPGRFSTHKRSILACGDISQFIPDKEADVA 456

Query: 122 VLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVH 181
           VLEEPEHLTW++HG+RW  KF+ V+G+VHTNYLEYV+REKN  LQAFLL+ +N+W+  ++
Sbjct: 457 VLEEPEHLTWYYHGRRWNKKFKDVIGVVHTNYLEYVQREKNGALQAFLLKHINNWVVGIY 516

Query: 182 CHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWS 241
           C+KV+RLSAATQE P S VCNVHGV+PKFLEIG+K   +  +G  AFTKGAY++G+M+W 
Sbjct: 517 CNKVLRLSAATQELPKSQVCNVHGVSPKFLEIGKKVTVETDSGKSAFTKGAYFLGKMIWG 576

Query: 242 KGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHD 301
           KGY EL+ LL  Y ++LA L++D+YG+GED   ++  A+ L L +  + GRDHAD   H 
Sbjct: 577 KGYRELVNLLAQYKQDLADLKLDVYGSGEDAADVKSTAQNLGLGINFHQGRDHADESLHS 636

Query: 302 YKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKA 361
           YKVF+NPS +DVVCT TAEALAMGKIVVCA+HPSN+FF+ FPNC TY     FVE    A
Sbjct: 637 YKVFINPSVSDVVCTTTAEALAMGKIVVCADHPSNEFFRGFPNCLTYSSPTEFVEKVKVA 696

Query: 362 LAEEPAQPTDAQTHQLSWESATERFLQVAEL-VGDVVTKRSKSPSRHLESESL-----NS 415
           +A EP   +  + H+LSWE+ATERFL++ +L  G    +++K+ +      +L     N 
Sbjct: 697 MASEPVPLSAEECHRLSWEAATERFLEMTKLDRGSASEQQAKAGAAVKRPMTLSMSLPNL 756

Query: 416 KRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKG 466
           + +++      H+VASGFE +R   GAIPG++    +  K+LGL   L  G
Sbjct: 757 RDVVDGTLALAHFVASGFEPARLVSGAIPGTMHCSVEQSKDLGLPPPLFHG 807


>gi|351727022|ref|NP_001238170.1| digalactosyldiacylglycerol synthase 1, chloroplastic [Glycine max]
 gi|75120874|sp|Q6DW76.1|DGDG1_SOYBN RecName: Full=Digalactosyldiacylglycerol synthase 1, chloroplastic;
           Flags: Precursor
 gi|49617329|gb|AAT67420.1| digalactosyldiacylglycerol synthase 1 [Glycine max]
          Length = 783

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 253/464 (54%), Positives = 339/464 (73%), Gaps = 9/464 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYL++  +++VTL++PWL    Q+ VYP N+TF 
Sbjct: 310 ERHVAIVTTASLPWMTGTAVNPLFRAAYLSQSAKQKVTLLVPWLCKSDQELVYPSNLTFT 369

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQEAY+R WLE+R GF + F   FYPGKF+  ++SI+  GD ++ IP  +ADIA+LE
Sbjct: 370 SPEEQEAYIRSWLEERIGFKADFKISFYPGKFSEARRSIIPAGDTSQFIPSRDADIAILE 429

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREKN  LQAFL++ +N+W+ R +CHK
Sbjct: 430 EPEHLNWYHHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCHK 489

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S++CNVHGVNPKFL+IGEK   +++ G KAFTKGAY++G+MVW+KGY
Sbjct: 490 VLRLSAATQDLPKSVICNVHGVNPKFLKIGEKIAAERELGQKAFTKGAYFLGKMVWAKGY 549

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
           +EL+ LL  +  +L G ++D++GNGED +++Q AA++L L +    GRDHAD   H YKV
Sbjct: 550 KELIDLLAKHKADLDGFKLDVFGNGEDANEVQSAARRLDLNLNFQKGRDHADDSLHRYKV 609

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +AL  
Sbjct: 610 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCLTYRTSEDFVTKVKEALEN 669

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESL--------NSK 416
           EP   T  Q +QLSWE+AT+RF++ +EL G ++ K +      ++   L        N  
Sbjct: 670 EPYPLTPEQRYQLSWEAATQRFMEYSELDG-ILNKENNGEKSRVDKGKLIAKSASMPNLT 728

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            +++    + HY  +G E  R   GAIPG+   D+Q CK+L L+
Sbjct: 729 ELVDGGLAFAHYCLTGNEFLRLCTGAIPGTRDYDKQHCKDLHLL 772


>gi|255586525|ref|XP_002533901.1| galactolipid galactosyltransferase, putative [Ricinus communis]
 gi|223526143|gb|EEF28483.1| galactolipid galactosyltransferase, putative [Ricinus communis]
          Length = 797

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 259/479 (54%), Positives = 343/479 (71%), Gaps = 13/479 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  +++VTL++PWL    Q+ VYP N+TF+
Sbjct: 324 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSEKQKVTLLVPWLCKSDQELVYPSNLTFS 383

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE+Y+R WLEDR GF + F   FYPGKF+ +++SI+  GD ++ IP ++ADIA+LE
Sbjct: 384 SPQEQESYIRNWLEDRIGFKADFKISFYPGKFSKERRSIIPAGDTSQFIPSKDADIAILE 443

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQ+FL++ +N+W+ R +CHK
Sbjct: 444 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQSFLVKHINNWVTRAYCHK 503

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S++CNVHGVNPKFL+IGEK    ++ G +AF+KGAY++G+MVW+KGY
Sbjct: 504 VLRLSGATQDLPKSVICNVHGVNPKFLKIGEKVTADRELGQQAFSKGAYFLGKMVWAKGY 563

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
           +EL+ LL  +  EL G ++D++GNGED  ++Q AAK+L L V    GRDHAD   H YKV
Sbjct: 564 KELIDLLAKHKNELDGFKLDVFGNGEDAHEVQIAAKRLDLNVNFLKGRDHADDSLHGYKV 623

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +AL  
Sbjct: 624 FINPSVSDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCSTYRTSEDFVAKVREALEN 683

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL-------VGDVVTKRSKSPSRHLESESL-NSK 416
           EP   T  Q + LSWE+AT+RF+Q ++L        GD    R+   S  ++S SL N  
Sbjct: 684 EPQPLTPEQRYNLSWEAATQRFMQYSDLDKVLNDDQGDAKLSRASGKS-IVKSVSLPNMS 742

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKGHCHRPNGSW 475
            +++    + HY  +G E  R   GAIPG+   D+Q CK+L L+      H   P   W
Sbjct: 743 GMVDGGLAFAHYCLTGNEFLRLCTGAIPGTRDYDKQHCKDLHLLPP----HVENPIYGW 797


>gi|76800640|gb|ABA55727.1| digalactosyldiacylglycerol synthase 1 [Vigna unguiculata]
          Length = 780

 Score =  548 bits (1412), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 252/464 (54%), Positives = 338/464 (72%), Gaps = 9/464 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYL++  +++VTL++PWL    Q+ VYPG++TF 
Sbjct: 307 ERHVAIVTTASLPWMTGTAVNPLFRAAYLSQSAKQKVTLLVPWLCKSDQELVYPGSLTFT 366

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF + F   FYPGKF+ +++SI+  GD ++ IP  +ADIA+LE
Sbjct: 367 SPEEQEVYIRSWLEERIGFKADFKISFYPGKFSKERRSIIPAGDTSQFIPSRDADIAILE 426

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREKN  LQAF ++ +N+W+ R +CHK
Sbjct: 427 EPEHLNWYHHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFFVKHINNWVTRAYCHK 486

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S++CNVHGVNPKFLEIGEK   +++ G K+FTKGAY++G+MVW+KGY
Sbjct: 487 VLRLSAATQDLPKSVICNVHGVNPKFLEIGEKIATERELGQKSFTKGAYFLGKMVWAKGY 546

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
           +EL+ LL  +  +L G ++D++GNGED +++Q AA+KL L +    GRDHAD   H YKV
Sbjct: 547 KELIDLLAKHKADLDGFKLDVFGNGEDANEVQSAARKLDLNLSFQKGRDHADDSLHGYKV 606

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +AL  
Sbjct: 607 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCLTYRTSEDFVAKVKEALEN 666

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESL--------NSK 416
           EP   T  Q +QLSWE+AT+RF++ +EL   ++ K +      L+   L        N  
Sbjct: 667 EPYPLTPEQRYQLSWEAATQRFMEYSEL-DSILNKENNGEKSSLDKGKLVPKSVSMPNLT 725

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            +++    + HY  +G E  R   GAIPG+   D+Q CK+L L+
Sbjct: 726 ELVDGGLAFAHYCLTGNEFLRLCTGAIPGTRDYDKQHCKDLHLL 769


>gi|414587304|tpg|DAA37875.1| TPA: hypothetical protein ZEAMMB73_508020 [Zea mays]
          Length = 783

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 252/466 (54%), Positives = 337/466 (72%), Gaps = 15/466 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTA+NPLFRAAYLA+  +++VTLV+PWLS   Q+ VYP NITF+
Sbjct: 312 RRHVAIVTTASLPWMTGTAINPLFRAAYLARSAKQKVTLVVPWLSKSDQELVYPNNITFS 371

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WL++R GF + F   FYPGKF+ +++SI+  GD ++ I  +EADIA+LE
Sbjct: 372 SPEEQETYIRNWLQERIGFEANFKISFYPGKFSKERRSIIPAGDTSQFISSKEADIAILE 431

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  V+G+VHTNYLEY+KREKN  +Q+FL++ +N+W+ R +CHK
Sbjct: 432 EPEHLNWYHHGKRWTDKFNHVIGVVHTNYLEYIKREKNGAIQSFLVKHINNWVTRAYCHK 491

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S+VCNVHGVNPKFL +GEK    ++ G K F+KGAY++G+MVW+KGY
Sbjct: 492 VLRLSAATQDLPRSVVCNVHGVNPKFLNVGEKIAADRECGQKVFSKGAYFLGKMVWAKGY 551

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL+ +  +L G  +D+YGNGED + +Q AA+K  L +  + G+DHAD   H YKV
Sbjct: 552 RELIDLLSKHKNDLEGFMIDVYGNGEDSEAVQNAARKFDLNINFFKGKDHADDSLHGYKV 611

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFFK FPNC TY     FV    +A+A 
Sbjct: 612 FVNPSVSDVLCTATAEALAMGKFVVCADHPSNDFFKSFPNCLTYRTSAEFVARVKEAMAT 671

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKR------- 417
           EP   T  Q + LSWE+ATERF++ +EL  D V  R+  P R+     +N  R       
Sbjct: 672 EPQPLTSEQRYSLSWEAATERFMEYSEL--DKVLNRNGHPGRN---GKVNKARKIPLLPK 726

Query: 418 ---IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
              +++    + H+  +G E  R A GAIPG+   D+Q C +L L+
Sbjct: 727 LSDVVDGGLAFAHHCLTGNEILRLATGAIPGTRDYDKQQCMDLNLL 772


>gi|356505431|ref|XP_003521494.1| PREDICTED: digalactosyldiacylglycerol synthase 1,
           chloroplastic-like [Glycine max]
          Length = 783

 Score =  546 bits (1407), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 252/467 (53%), Positives = 339/467 (72%), Gaps = 9/467 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYL++  +++VTL++PWL    Q+ VYP N+TF 
Sbjct: 310 ERHVAIVTTASLPWMTGTAVNPLFRAAYLSQSAKQKVTLLVPWLCKSDQELVYPSNLTFT 369

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQEAY+R WLE+R GF + F   FYPGKF+  ++SI+  GD ++ IP  +ADIA+LE
Sbjct: 370 SPEEQEAYIRSWLEERIGFKADFKISFYPGKFSEARRSIIPAGDTSQFIPSRDADIAILE 429

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREKN  LQAFL++ +N+W+ R +CHK
Sbjct: 430 EPEHLNWYHHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCHK 489

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S++CNVHGVNPKFL+IGEK   +++ G KAFTKGAY++G++VW+KGY
Sbjct: 490 VLRLSAATQDLPKSVICNVHGVNPKFLKIGEKIAAERELGQKAFTKGAYFLGKLVWAKGY 549

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
           +EL+ LL  +  +L G ++D++GNGED +++Q AA++L L +    GRDHAD   H YKV
Sbjct: 550 KELIDLLAKHKADLDGFKLDVFGNGEDANEVQSAARRLDLNLNFQKGRDHADDSLHGYKV 609

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +AL  
Sbjct: 610 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCLTYRTSEDFVAKVKEALEN 669

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESL--------NSK 416
           EP   T  Q +QLSWE+AT+RF++ +EL   ++ K +      ++   L        N  
Sbjct: 670 EPYPLTPEQRYQLSWEAATQRFMEYSEL-DRILNKENNGEKASVDKGKLIAKSASMPNLT 728

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTEL 463
            +++    + HY  +G E  R   GAIPG+   D+Q CK+L L+  L
Sbjct: 729 ELVDGGLAFAHYCLTGNEFLRLCTGAIPGTRDYDKQHCKDLHLLPPL 775


>gi|38344037|emb|CAE01529.2| OJ991214_12.18 [Oryza sativa Japonica Group]
 gi|39545715|emb|CAD40929.3| OSJNBa0033G16.5 [Oryza sativa Japonica Group]
 gi|125590355|gb|EAZ30705.1| hypothetical protein OsJ_14764 [Oryza sativa Japonica Group]
          Length = 775

 Score =  545 bits (1405), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 250/467 (53%), Positives = 339/467 (72%), Gaps = 8/467 (1%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           +   ++H+AI TTASLPW+TGTA+NPLFRAAYLA+  +++VTLV+PWL    Q+ VYP N
Sbjct: 299 LSDNRRHVAIVTTASLPWMTGTAINPLFRAAYLARSTKQKVTLVVPWLCKSDQELVYPNN 358

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF+SP+EQE Y+R WL++R GF + F   FYPGKF+ +++SI+  GD ++ I   EADI
Sbjct: 359 ITFSSPEEQENYIRNWLQERLGFEANFKISFYPGKFSKERRSIIPAGDTSQFISSSEADI 418

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           A+LEEPEHL W+HHGKRW  KF+ V+G+VHTNYLEY+KREKN  LQAFL++ +N+W+ R 
Sbjct: 419 AILEEPEHLNWYHHGKRWTDKFKHVIGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRA 478

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKV+RLSAATQ+ P S+VCNVHGVNPKFL++GEK    +++G ++FTKGAY++G+MVW
Sbjct: 479 YCHKVLRLSAATQDLPRSVVCNVHGVNPKFLKVGEKIAADKEHGQQSFTKGAYFLGKMVW 538

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           +KGY EL+ LL+ +  +L G  +D+YGNGED   +Q AA+KL L +  + GRDHAD   H
Sbjct: 539 AKGYRELIDLLSKHKSDLEGFNVDVYGNGEDSQAVQMAARKLNLSLNFFKGRDHADSSLH 598

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
            YKVF+NPS +DV+CTATAEALAMGK V+CA+HPSN+FFK FPNC TY     FV    +
Sbjct: 599 GYKVFINPSVSDVLCTATAEALAMGKFVICADHPSNEFFKSFPNCLTYKTSEEFVARVKE 658

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAEL-------VGDVVTKRSKSPSRHLESESL 413
           A+A EP+  T  Q + LSWE+ATERF++ +EL       +G       +S  R +     
Sbjct: 659 AMASEPSPLTPEQRYSLSWEAATERFMEYSELDKVLNNKIGYSGQDGKRSKVRKIPLLPR 718

Query: 414 NSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            S+ +++    + H+  +G E  R A GAIPG+   D+Q C +L L+
Sbjct: 719 LSE-VVDGGLAFAHHCLTGNEILRLATGAIPGTRDYDKQQCMDLNLL 764


>gi|449451183|ref|XP_004143341.1| PREDICTED: digalactosyldiacylglycerol synthase 1,
           chloroplastic-like [Cucumis sativus]
 gi|449523513|ref|XP_004168768.1| PREDICTED: digalactosyldiacylglycerol synthase 1,
           chloroplastic-like [Cucumis sativus]
          Length = 790

 Score =  545 bits (1405), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 252/461 (54%), Positives = 340/461 (73%), Gaps = 5/461 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLA+  ++ VTL++PWLS+  Q+ VYP ++TF+
Sbjct: 319 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAQSAKQSVTLLVPWLSMSDQELVYPNHLTFS 378

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF   F   FYPGKF+ +++SI+  GD ++ IP ++ADIA+LE
Sbjct: 379 SPEEQETYIRKWLEERIGFKPDFKISFYPGKFSKERRSIIPAGDTSQFIPSKDADIAILE 438

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG+RW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +CHK
Sbjct: 439 EPEHLNWYHHGRRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVIRAYCHK 498

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S++CNVHGVNPKFL+IGEK  E ++ GN AF+KGAY++G+MVW+KGY
Sbjct: 499 VLRLSAATQDLPKSVICNVHGVNPKFLKIGEKVDEDRKLGNIAFSKGAYFLGKMVWAKGY 558

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL  +  +L G  +D++GNGED  ++Q AAKKL+L V    GRDHAD   H YKV
Sbjct: 559 RELIDLLAEHKHDLDGFNLDVFGNGEDAHEVQSAAKKLELNVNFLRGRDHADDSLHGYKV 618

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +AL  
Sbjct: 619 FINPSVSDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYKSSEDFVAKVKEALEN 678

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL--VGDVVTKRSKSPSRHLESESLNS---KRII 419
           EP   T  + + LSWE+AT+RFL+ ++L  V +   +   + +R +  +S+++     ++
Sbjct: 679 EPRPLTPEERYNLSWEAATQRFLEYSDLNKVLNSDKELESNTNRKVIRKSISTPSLTEVV 738

Query: 420 EDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           +    + HY  +G E  R   GAIPG+   D Q CK+L L+
Sbjct: 739 DGGLAFAHYCLTGNELLRLCTGAIPGTRDYDGQHCKDLHLL 779


>gi|242073032|ref|XP_002446452.1| hypothetical protein SORBIDRAFT_06g016250 [Sorghum bicolor]
 gi|241937635|gb|EES10780.1| hypothetical protein SORBIDRAFT_06g016250 [Sorghum bicolor]
          Length = 788

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 250/466 (53%), Positives = 335/466 (71%), Gaps = 15/466 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTA+NPLFRAAYLA+  +++VTLV+PWLS   Q+ VYP NITF+
Sbjct: 317 RRHVAIVTTASLPWMTGTAINPLFRAAYLARSAKQKVTLVVPWLSKSDQELVYPNNITFS 376

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WL++R GF + F   FYPGKF+ +++SI+  GD ++ I  +EADIA+LE
Sbjct: 377 SPEEQETYIRNWLQERIGFEAIFKISFYPGKFSKERRSIIPAGDTSQFISSKEADIAILE 436

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  V+G+VHTNYLEY+KREKN  +Q+FL++ +N+W+ R +CHK
Sbjct: 437 EPEHLNWYHHGKRWTDKFNHVIGVVHTNYLEYIKREKNGAIQSFLVKHINNWVTRAYCHK 496

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S+VCNVHGVNPKFL +G K    ++ G K F+KGAY++G+MVW+KGY
Sbjct: 497 VLRLSAATQDLPRSVVCNVHGVNPKFLNVGAKIAADRECGQKVFSKGAYFLGKMVWAKGY 556

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL+ +  +L G  +D+YGNGED + +Q AA+K  L +  + G+DHAD   H YKV
Sbjct: 557 RELIDLLSKHKNDLEGFMIDVYGNGEDSEAVQNAARKFDLSINFFKGKDHADDSLHGYKV 616

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFFK FPNC TY     FV    +A+  
Sbjct: 617 FVNPSVSDVLCTATAEALAMGKFVVCADHPSNDFFKSFPNCLTYRTSEEFVARVKEAMTT 676

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKR------- 417
           EP   T  Q + LSWE+ATERF++ +EL  D V  R+  P R+     +N  R       
Sbjct: 677 EPQPLTPEQRYNLSWEAATERFMEYSEL--DKVLNRNGHPGRN---GKINKARKIPLLPK 731

Query: 418 ---IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
              +++    + H+  +G E  R A GAIPG+   D+Q C +L L+
Sbjct: 732 LSDVVDGGLAFAHHCLTGNEILRLATGAIPGTRDYDKQQCMDLNLL 777


>gi|116310807|emb|CAH67597.1| OSIGBa0092M08.9 [Oryza sativa Indica Group]
          Length = 775

 Score =  545 bits (1403), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 250/467 (53%), Positives = 339/467 (72%), Gaps = 8/467 (1%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           +   ++H+AI TTASLPW+TGTA+NPLFRAAYLA+  +++VTLV+PWL    Q+ VYP N
Sbjct: 299 LSDNRRHVAIVTTASLPWMTGTAINPLFRAAYLARSTKQKVTLVVPWLCKSDQELVYPNN 358

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF+SP+EQE Y+R WL++R GF + F   FYPGKF+ +++SI+  GD ++ I   EADI
Sbjct: 359 ITFSSPEEQENYIRNWLQERLGFEANFKISFYPGKFSKERRSIIPAGDTSQFISSGEADI 418

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           A+LEEPEHL W+HHGKRW  KF+ V+G+VHTNYLEY+KREKN  LQAFL++ +N+W+ R 
Sbjct: 419 AILEEPEHLNWYHHGKRWTDKFKHVIGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRA 478

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKV+RLSAATQ+ P S+VCNVHGVNPKFL++GEK    +++G ++FTKGAY++G+MVW
Sbjct: 479 YCHKVLRLSAATQDLPRSVVCNVHGVNPKFLKVGEKIAADKEHGQQSFTKGAYFLGKMVW 538

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           +KGY EL+ LL+ +  +L G  +D+YGNGED   +Q AA+KL L +  + GRDHAD   H
Sbjct: 539 AKGYRELIDLLSKHKSDLEGFNVDVYGNGEDSQAVQMAARKLNLSLNFFKGRDHADSSLH 598

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
            YKVF+NPS +DV+CTATAEALAMGK V+CA+HPSN+FFK FPNC TY     FV    +
Sbjct: 599 GYKVFINPSVSDVLCTATAEALAMGKFVICADHPSNEFFKSFPNCLTYKTSEEFVARVKE 658

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAEL-------VGDVVTKRSKSPSRHLESESL 413
           A+A EP+  T  Q + LSWE+ATERF++ +EL       +G       +S  R +     
Sbjct: 659 AMASEPSPLTPEQRYSLSWEAATERFMEYSELDKVLNNKIGYSGQDGKRSKVRKIPLLPR 718

Query: 414 NSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            S+ +++    + H+  +G E  R A GAIPG+   D+Q C +L L+
Sbjct: 719 LSE-VVDGGLAFAHHCLTGNEILRLATGAIPGTRDYDKQQCMDLNLL 764


>gi|218194828|gb|EEC77255.1| hypothetical protein OsI_15847 [Oryza sativa Indica Group]
          Length = 737

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 250/467 (53%), Positives = 339/467 (72%), Gaps = 8/467 (1%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           +   ++H+AI TTASLPW+TGTA+NPLFRAAYLA+  +++VTLV+PWL    Q+ VYP N
Sbjct: 261 LSDNRRHVAIVTTASLPWMTGTAINPLFRAAYLARSTKQKVTLVVPWLCKSDQELVYPNN 320

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF+SP+EQE Y+R WL++R GF + F   FYPGKF+ +++SI+  GD ++ I   EADI
Sbjct: 321 ITFSSPEEQENYIRNWLQERLGFEANFKISFYPGKFSKERRSIIPAGDTSQFISSGEADI 380

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           A+LEEPEHL W+HHGKRW  KF+ V+G+VHTNYLEY+KREKN  LQAFL++ +N+W+ R 
Sbjct: 381 AILEEPEHLNWYHHGKRWTDKFKHVIGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRA 440

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKV+RLSAATQ+ P S+VCNVHGVNPKFL++GEK    +++G ++FTKGAY++G+MVW
Sbjct: 441 YCHKVLRLSAATQDLPRSVVCNVHGVNPKFLKVGEKIAADKEHGQQSFTKGAYFLGKMVW 500

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           +KGY EL+ LL+ +  +L G  +D+YGNGED   +Q AA+KL L +  + GRDHAD   H
Sbjct: 501 AKGYRELIDLLSKHKSDLEGFNVDVYGNGEDSQAVQMAARKLNLSLNFFKGRDHADSSLH 560

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
            YKVF+NPS +DV+CTATAEALAMGK V+CA+HPSN+FFK FPNC TY     FV    +
Sbjct: 561 GYKVFINPSVSDVLCTATAEALAMGKFVICADHPSNEFFKSFPNCLTYKTSEEFVARVKE 620

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAEL-------VGDVVTKRSKSPSRHLESESL 413
           A+A EP+  T  Q + LSWE+ATERF++ +EL       +G       +S  R +     
Sbjct: 621 AMASEPSPLTPEQRYSLSWEAATERFMEYSELDKVLNNKIGYSGQDGKRSKVRKIPLLPR 680

Query: 414 NSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            S+ +++    + H+  +G E  R A GAIPG+   D+Q C +L L+
Sbjct: 681 LSE-VVDGGLAFAHHCLTGNEILRLATGAIPGTRDYDKQQCMDLNLL 726


>gi|224140022|ref|XP_002323386.1| predicted protein [Populus trichocarpa]
 gi|222868016|gb|EEF05147.1| predicted protein [Populus trichocarpa]
          Length = 793

 Score =  543 bits (1398), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 250/462 (54%), Positives = 335/462 (72%), Gaps = 6/462 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTL++PWL    Q+ VYP N+TF 
Sbjct: 321 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSEKQNVTLLVPWLCKSDQELVYPNNLTFT 380

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP++QE Y+R WLE+R GF + F   FYPGKF+ +++SI++ GD ++ +P ++ADIA+LE
Sbjct: 381 SPEDQENYIRNWLEERVGFKADFKISFYPGKFSKERRSIISAGDTSKFVPSKDADIAILE 440

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+ + R +CHK
Sbjct: 441 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNLVTRAYCHK 500

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S++CNVHGVNPKFL+IGEK   +++ G +AF+KGAY++G+MVW+KGY
Sbjct: 501 VLRLSAATQDLPKSVICNVHGVNPKFLKIGEKVAAERELGQQAFSKGAYFLGKMVWAKGY 560

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
           +EL+ LL  +  EL G ++D++GNGED +++Q  AK+L L +    GRDHAD   H YKV
Sbjct: 561 KELIDLLAKHKNELDGFKLDVFGNGEDANEVQSTAKRLDLNLNFLKGRDHADDSLHGYKV 620

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN++F+ FPNC TY     FV    +ALA 
Sbjct: 621 FINPSISDVLCTATAEALAMGKFVVCADHPSNEYFRSFPNCLTYKTSEDFVARVKEALAN 680

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL------VGDVVTKRSKSPSRHLESESLNSKRI 418
           EP   T  Q + LSWE+AT+RF+Q +EL        DV   ++   S        N   +
Sbjct: 681 EPQPLTPEQRYNLSWEAATQRFMQYSELDRVLDSEKDVKLSKTNGKSITKAVSMPNLSEM 740

Query: 419 IEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           I+    + HY  +G E  R   GAIPG+   D+Q CK+L L+
Sbjct: 741 IDGGLAFAHYCLTGNEFLRLCTGAIPGTRDYDKQHCKDLHLL 782


>gi|15229824|ref|NP_187773.1| digalactosyldiacylglycerol synthase 1 [Arabidopsis thaliana]
 gi|75193744|sp|Q9S7D1.1|DGDG1_ARATH RecName: Full=Digalactosyldiacylglycerol synthase 1, chloroplastic;
           Flags: Precursor
 gi|5354158|gb|AAD42378.1|AF149841_1 digalactosyldiacylglycerol synthase [Arabidopsis thaliana]
 gi|5354160|gb|AAD42379.1|AF149842_1 digalactosyldiacylglycerol synthase [Arabidopsis thaliana]
 gi|6041825|gb|AAF02140.1|AC009918_12 digalactosyldiacylglycerol synthase [Arabidopsis thaliana]
 gi|18700089|gb|AAL77656.1| AT3g11670/T19F11_7 [Arabidopsis thaliana]
 gi|20855998|gb|AAM26642.1| AT3g11670/T19F11_7 [Arabidopsis thaliana]
 gi|332641560|gb|AEE75081.1| digalactosyldiacylglycerol synthase 1 [Arabidopsis thaliana]
          Length = 808

 Score =  543 bits (1398), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 254/472 (53%), Positives = 334/472 (70%), Gaps = 7/472 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP N+TF+
Sbjct: 343 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKAAKQSVTLVVPWLCESDQELVYPNNLTFS 402

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE+Y+R WLE+R GF + F   FYPGKF+ +++SI   GD ++ I  ++ADIA+LE
Sbjct: 403 SPEEQESYIRKWLEERIGFKADFKISFYPGKFSKERRSIFPAGDTSQFISSKDADIAILE 462

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W++HGKRW  KF  VVGIVHTNYLEY+KREKN  LQAF +  VN+W+ R +C K
Sbjct: 463 EPEHLNWYYHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFFVNHVNNWVTRAYCDK 522

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S+VCNVHGVNPKFL IGEK  E++  G +AF+KGAY++G+MVW+KGY
Sbjct: 523 VLRLSAATQDLPKSVVCNVHGVNPKFLMIGEKIAEERSRGEQAFSKGAYFLGKMVWAKGY 582

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L+  +  EL    +D+YGNGED  ++QRAAKK  L +    GRDHAD   H YKV
Sbjct: 583 RELIDLMAKHKSELGSFNLDVYGNGEDAVEVQRAAKKHDLNLNFLKGRDHADDALHKYKV 642

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +A+ +
Sbjct: 643 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCLTYKTSEDFVSKVQEAMTK 702

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNS-KRIIEDAF 423
           EP   T  Q + LSWE+AT+RF++ ++L  D +    +   +  +S S+ S   +++   
Sbjct: 703 EPLPLTPEQMYNLSWEAATQRFMEYSDL--DKILNNGEGGRKMRKSRSVPSFNEVVDGGL 760

Query: 424 GYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKGHCHRPNGSW 475
            + HYV +G +  R   GA P +   D Q CK+L LV      H H+P   W
Sbjct: 761 AFSHYVLTGNDFLRLCTGATPRTKDYDNQHCKDLNLVPP----HVHKPIFGW 808


>gi|297833950|ref|XP_002884857.1| hypothetical protein ARALYDRAFT_897369 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330697|gb|EFH61116.1| hypothetical protein ARALYDRAFT_897369 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 808

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 254/472 (53%), Positives = 332/472 (70%), Gaps = 7/472 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP N+TF+
Sbjct: 343 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSAKQSVTLVVPWLCESDQELVYPNNLTFS 402

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE+Y+R WLE+R GF + F   FYPGKF+ +++SI   GD ++ I  ++ADIA+LE
Sbjct: 403 SPEEQESYIRKWLEERIGFKADFKISFYPGKFSKERRSIFPAGDTSQFISSKDADIAILE 462

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREKN  LQAF +  VN+W+ R +C K
Sbjct: 463 EPEHLNWYHHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFFVNHVNNWVTRAYCDK 522

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S+VCNVHGVNPKFL IG K  E++  G +AF+KGAY++G+MVW+KGY
Sbjct: 523 VLRLSGATQDLPKSVVCNVHGVNPKFLMIGGKIAEERSRGEQAFSKGAYFLGKMVWAKGY 582

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L+  +  EL    +D+YGNGED  ++QRAAKK  L +    GRDHAD   H YKV
Sbjct: 583 RELIDLMAKHKSELGSFNLDVYGNGEDAVEVQRAAKKHDLNLNFLKGRDHADDALHKYKV 642

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +A+ +
Sbjct: 643 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCLTYKTSEDFVSKVKEAMTK 702

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNS-KRIIEDAF 423
           EP   T  Q + LSWE+AT+RF++ ++L  D +        R  +S S+ S   +++   
Sbjct: 703 EPLPLTPEQMYNLSWEAATQRFMEYSDL--DKILNDGDGGRRMRKSRSVPSFNEMVDGGL 760

Query: 424 GYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKGHCHRPNGSW 475
            + HYV +G +  R   GA P +   D+Q CK+L LV      H H+P   W
Sbjct: 761 AFTHYVLTGNDFLRLCTGATPRTKDYDKQHCKDLNLVPP----HVHKPIFGW 808


>gi|312282649|dbj|BAJ34190.1| unnamed protein product [Thellungiella halophila]
          Length = 806

 Score =  541 bits (1394), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 253/472 (53%), Positives = 335/472 (70%), Gaps = 6/472 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP N+TF+
Sbjct: 340 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSAKQSVTLVVPWLCESDQELVYPNNLTFS 399

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE+Y+R WLE+R GF + F   FYPGKF+ +++SI   GD ++ IP ++ADIA+LE
Sbjct: 400 SPEEQESYIRKWLEERIGFKADFKISFYPGKFSKERRSIFPAGDTSQFIPSKDADIAILE 459

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREK+  LQAF +  VN+W+ R +C K
Sbjct: 460 EPEHLNWYHHGKRWTDKFNHVVGIVHTNYLEYIKREKHGALQAFFVNHVNNWVTRAYCDK 519

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S+VCNVHGVNPKFL IGEK  E++  G +AF+KGAY++G+MVW+KGY
Sbjct: 520 VLRLSGATQDLPKSVVCNVHGVNPKFLMIGEKIAEERSRGEQAFSKGAYFLGKMVWAKGY 579

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L+  +  +L    +D+YGNGED  ++QRAA+KL L +    GRDHAD   H YKV
Sbjct: 580 RELIDLMAKHKSDLGSFNLDVYGNGEDAIEVQRAAQKLDLNLNFLKGRDHADDSLHTYKV 639

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +A+++
Sbjct: 640 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFRTFPNCLTYKTSEDFVSKVKEAMSK 699

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNS-KRIIEDAF 423
           EP   T  Q + LSWE+AT+RF++ ++L   ++        R  +S S+ S   +I+   
Sbjct: 700 EPLPLTPEQMYNLSWEAATQRFMEYSDL-DKILNDGDGGRRRMRKSRSVPSINEVIDGGL 758

Query: 424 GYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKGHCHRPNGSW 475
            + HYV +G +  R   GA P +   D+Q CK+L LV        H+P   W
Sbjct: 759 AFTHYVLTGNDFLRLCSGATPRTKDYDKQHCKDLNLVPP----QVHKPIFGW 806


>gi|326509741|dbj|BAJ87086.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 814

 Score =  541 bits (1393), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 252/468 (53%), Positives = 333/468 (71%), Gaps = 18/468 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLA+   ++VTLV+PWL    Q+ VYP +ITF+
Sbjct: 342 RRHVAIVTTASLPWMTGTAVNPLFRAAYLARSTMQKVTLVVPWLCKSDQQLVYPDDITFS 401

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WL++R GF + F   FYPGKF+ +++SI+  GD +E I   EADIA+LE
Sbjct: 402 SPEEQETYIRNWLQERLGFAANFKISFYPGKFSKERRSIIPAGDTSEFISSREADIAILE 461

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  V+G+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +CHK
Sbjct: 462 EPEHLNWYHHGKRWTDKFNHVIGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCHK 521

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SI+CNVHGVNPKFL IGEK +  +++G+ +F+KGAY++G+MVW+KGY
Sbjct: 522 VLRLSAATQDLPRSIICNVHGVNPKFLNIGEKVIADREHGHNSFSKGAYFLGKMVWAKGY 581

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL+ +  +L G E+D+YGNGED   +Q AA+KL L +  Y G+DHAD   H YKV
Sbjct: 582 RELIDLLSKHKNDLEGFEIDVYGNGEDSQAVQAAARKLDLGINFYKGKDHADDSLHGYKV 641

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+C +HPSN+FFK FPNC  Y     FV    +A++ 
Sbjct: 642 FINPSVSDVLCTATAEALAMGKFVICPDHPSNEFFKSFPNCLMYKAPEEFVARVKEAMSS 701

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSK-------- 416
           EP   T  + + LSWE+ATERF++ +EL  D V K        L  E +  K        
Sbjct: 702 EPQPLTPEKRYSLSWEAATERFMEYSEL--DKVLKDRNG----LYGEGVKRKGARKLPLL 755

Query: 417 ----RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
                I++    + HY A+G E  R A GA PG+   D+Q C +L L+
Sbjct: 756 PKFSDILDGGLAFAHYCATGNEILRMATGATPGTRDYDKQQCMDLNLL 803


>gi|218190926|gb|EEC73353.1| hypothetical protein OsI_07563 [Oryza sativa Indica Group]
          Length = 825

 Score =  540 bits (1392), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 252/462 (54%), Positives = 338/462 (73%), Gaps = 10/462 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLA++ ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 357 KRHVAIVTTASLPWMTGTAVNPLFRAAYLARNSKQDVTLVVPWLCKSDQELVYPNSMTFS 416

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y++ WLE+R GF S F   FYPGKF+ +++SI+  GD ++ I   EADIA+LE
Sbjct: 417 SPEEQETYIKKWLEERLGFESNFKISFYPGKFSKERRSIIPAGDTSQFISSREADIAILE 476

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG RW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 477 EPEHLNWYHHGNRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 536

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SI+CNVHGVNPKFL+IG+K M  ++NG ++F+KGAY++G+MVW+KGY
Sbjct: 537 VLRLSAATQDLPKSIICNVHGVNPKFLKIGDKIMADRENGQQSFSKGAYFLGKMVWAKGY 596

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            ELL LL+    +L G ++D+YG+GED  ++Q  AKKL L +  + GRDHAD   H YKV
Sbjct: 597 RELLDLLDKRKSDLQGFKLDVYGSGEDSQEVQSTAKKLNLNLNFFKGRDHADDSLHGYKV 656

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+A 
Sbjct: 657 FINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMSFPNCLTYRTPEEFVARVNEAMAR 716

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL---VGDVVTK---RSKSPSRHLESESLNSKRI 418
           EP   T  Q + LSWE+ATERF++ ++L   +   VT+   RSK+  R ++S   N    
Sbjct: 717 EPQPLTPEQRYNLSWEAATERFMEYSDLDKVLSQPVTEGVHRSKT-RRTIQS---NLSDA 772

Query: 419 IEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           ++    + H+  +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 773 MDGGLAFAHHCLTGSEVLRLATGAIPGTRDYDKQHCVDMGLL 814


>gi|222623009|gb|EEE57141.1| hypothetical protein OsJ_07044 [Oryza sativa Japonica Group]
          Length = 966

 Score =  540 bits (1392), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 252/462 (54%), Positives = 338/462 (73%), Gaps = 10/462 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLA++ ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 498 KRHVAIVTTASLPWMTGTAVNPLFRAAYLARNSKQDVTLVVPWLCKSDQELVYPNSMTFS 557

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y++ WLE+R GF S F   FYPGKF+ +++SI+  GD ++ I   EADIA+LE
Sbjct: 558 SPEEQETYIKKWLEERLGFESNFKISFYPGKFSKERRSIIPAGDTSQFISSREADIAILE 617

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG RW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 618 EPEHLNWYHHGNRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 677

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SI+CNVHGVNPKFL+IG+K M  ++NG ++F+KGAY++G+MVW+KGY
Sbjct: 678 VLRLSAATQDLPKSIICNVHGVNPKFLKIGDKIMADRENGQQSFSKGAYFLGKMVWAKGY 737

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            ELL LL+    +L G ++D+YG+GED  ++Q  AKKL L +  + GRDHAD   H YKV
Sbjct: 738 RELLDLLDKRKSDLQGFKLDVYGSGEDSQEVQSTAKKLNLNLNFFKGRDHADDSLHGYKV 797

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+A 
Sbjct: 798 FINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMSFPNCLTYRTPEEFVARVNEAMAR 857

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL---VGDVVTK---RSKSPSRHLESESLNSKRI 418
           EP   T  Q + LSWE+ATERF++ ++L   +   VT+   RSK+  R ++S   N    
Sbjct: 858 EPQPLTPEQRYNLSWEAATERFMEYSDLDKVLSQPVTEGVHRSKT-RRTIQS---NLSDA 913

Query: 419 IEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           ++    + H+  +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 914 MDGGLAFAHHCLTGSEVLRLATGAIPGTRDYDKQHCVDMGLL 955


>gi|115446553|ref|NP_001047056.1| Os02g0539100 [Oryza sativa Japonica Group]
 gi|50252668|dbj|BAD28837.1| putative digalactosyldiacylglycerol synthase [Oryza sativa Japonica
           Group]
 gi|113536587|dbj|BAF08970.1| Os02g0539100 [Oryza sativa Japonica Group]
 gi|215767721|dbj|BAG99949.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 802

 Score =  540 bits (1392), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 252/462 (54%), Positives = 338/462 (73%), Gaps = 10/462 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLA++ ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 334 KRHVAIVTTASLPWMTGTAVNPLFRAAYLARNSKQDVTLVVPWLCKSDQELVYPNSMTFS 393

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y++ WLE+R GF S F   FYPGKF+ +++SI+  GD ++ I   EADIA+LE
Sbjct: 394 SPEEQETYIKKWLEERLGFESNFKISFYPGKFSKERRSIIPAGDTSQFISSREADIAILE 453

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG RW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 454 EPEHLNWYHHGNRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 513

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SI+CNVHGVNPKFL+IG+K M  ++NG ++F+KGAY++G+MVW+KGY
Sbjct: 514 VLRLSAATQDLPKSIICNVHGVNPKFLKIGDKIMADRENGQQSFSKGAYFLGKMVWAKGY 573

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            ELL LL+    +L G ++D+YG+GED  ++Q  AKKL L +  + GRDHAD   H YKV
Sbjct: 574 RELLDLLDKRKSDLQGFKLDVYGSGEDSQEVQSTAKKLNLNLNFFKGRDHADDSLHGYKV 633

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+A 
Sbjct: 634 FINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMSFPNCLTYRTPEEFVARVNEAMAR 693

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL---VGDVVTK---RSKSPSRHLESESLNSKRI 418
           EP   T  Q + LSWE+ATERF++ ++L   +   VT+   RSK+  R ++S   N    
Sbjct: 694 EPQPLTPEQRYNLSWEAATERFMEYSDLDKVLSQPVTEGVHRSKT-RRTIQS---NLSDA 749

Query: 419 IEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           ++    + H+  +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 750 MDGGLAFAHHCLTGSEVLRLATGAIPGTRDYDKQHCVDMGLL 791


>gi|225439442|ref|XP_002264659.1| PREDICTED: digalactosyldiacylglycerol synthase 1,
           chloroplastic-like [Vitis vinifera]
          Length = 797

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 249/464 (53%), Positives = 337/464 (72%), Gaps = 10/464 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLA   ++ VTL++PWL    Q+ VYP ++TF+
Sbjct: 325 KRHVAIVTTASLPWMTGTAVNPLFRAAYLASYAKQNVTLLVPWLCKKDQELVYPNSLTFS 384

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF + F   FYPGKF+  ++SI+  GD ++ IP  +ADIA+LE
Sbjct: 385 SPEEQEVYIRNWLEERVGFKADFKISFYPGKFSKSRRSIIPAGDTSQFIPSRDADIAILE 444

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQAF ++ +N+W+AR +CHK
Sbjct: 445 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFFVKHINNWVARAYCHK 504

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S++CNVHGVNPKFL+IGEK  E+++ G +AF+KGAY++G+MVW+KGY
Sbjct: 505 VLRLSAATQDLPKSVICNVHGVNPKFLKIGEKLAEERELGQRAFSKGAYFLGKMVWAKGY 564

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL+ +  +L G  +D++GNGED  ++Q AAK+L L +    GRDHAD   H YKV
Sbjct: 565 RELIDLLSRHKNDLDGFNLDVFGNGEDAHEVQTAAKRLHLNLNFMKGRDHADDSLHGYKV 624

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA+HPSN+FF  FPNC TY   + FV    +ALA 
Sbjct: 625 FINPSVSDVLCTATAEALAMGKFVICADHPSNEFFSSFPNCLTYKTSDDFVAKVKEALAN 684

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL------VGDVVTKRS--KSPSRHLESESLNSK 416
           EP   T  Q + LSWE+AT+RF++ ++L        D    +S  K  +R +   +L+  
Sbjct: 685 EPQPLTPEQRYNLSWEAATQRFMEYSDLDRVLNNKDDAQLSKSCGKLITRSVSMPTLSG- 743

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            +++    + HY  +G E  R   GAIPG+   D+Q C++L L+
Sbjct: 744 -MVDGGLAFAHYCLTGNELLRLCTGAIPGTRDYDKQHCRDLHLL 786


>gi|75120872|sp|Q6DW74.1|DGDG1_LOTJA RecName: Full=Digalactosyldiacylglycerol synthase 1, chloroplastic;
           Flags: Precursor
 gi|49617333|gb|AAT67422.1| digalactosyldiacylglycerol synthase 1 [Lotus japonicus]
          Length = 786

 Score =  538 bits (1385), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 252/465 (54%), Positives = 338/465 (72%), Gaps = 11/465 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYL++  +++VTL++PWL    Q+ VYP N+TF 
Sbjct: 313 ERHVAIVTTASLPWMTGTAVNPLFRAAYLSQSEKQKVTLLVPWLCKSDQELVYPSNLTFT 372

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF + F   FYPGKF+  ++SI+  GD  + IP ++ADIA+LE
Sbjct: 373 SPEEQEGYIRNWLEERIGFKADFKISFYPGKFSQARRSIIPAGDTAQFIPSKDADIAILE 432

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG RW  KF  VVGIVHTNYLEY+KREKN  LQAFL++ +N+W+AR +C K
Sbjct: 433 EPEHLNWYHHGTRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFLVKHINNWVARAYCDK 492

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S+VCNVHGVNPKFL+IGE    +++ G K FTKGAY++G+MVW+KGY
Sbjct: 493 VLRLSAATQDLPKSVVCNVHGVNPKFLKIGESIAAERELGQKGFTKGAYFLGKMVWAKGY 552

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
           +EL+ LL  +  +L G+++D++GNGED +++Q AA++  L +    GRDHAD   H YKV
Sbjct: 553 KELIDLLAKHKADLDGVKLDVFGNGEDANEVQSAARRFDLNLNFQKGRDHADDSLHRYKV 612

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     F     +ALA 
Sbjct: 613 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCLTYKTPEDFAVKVKEALAN 672

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKS---PSRH-----LESESL-NS 415
           EP   T  Q +QLSWE+AT+RF++ +EL  D V  + K    PS++      +S S+ N 
Sbjct: 673 EPYPLTPEQRYQLSWEAATQRFMEYSEL--DKVLNKEKDGAKPSKNNRKIMAKSASMPNL 730

Query: 416 KRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
             +++    + HY  +G E  R   GA PG+   D+Q CK+L L+
Sbjct: 731 TELVDGGLAFAHYCLTGNEFLRLCTGATPGTRDYDKQHCKDLNLL 775


>gi|357167555|ref|XP_003581220.1| PREDICTED: digalactosyldiacylglycerol synthase 1,
           chloroplastic-like [Brachypodium distachyon]
          Length = 791

 Score =  536 bits (1381), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 243/463 (52%), Positives = 332/463 (71%), Gaps = 8/463 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTA+NPLFRAAYLA+  +++VTLV+PWL    Q+ VYP NITF+
Sbjct: 319 RRHVAIVTTASLPWMTGTAINPLFRAAYLARSTKQKVTLVVPWLCNSDQQLVYPNNITFS 378

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQ  Y+R WL++R GF + F   FYPGKF+ +++SI+  GD ++ I  +EADIA+LE
Sbjct: 379 SPEEQATYIRNWLQERLGFEADFKISFYPGKFSTERRSIIPAGDTSQFISSKEADIAILE 438

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG+RW  KF  V+G+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +CHK
Sbjct: 439 EPEHLNWYHHGQRWTDKFNHVIGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCHK 498

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ  P SI+CNVHGVNP+FL +GEK    +++G + F+KGAY++G+MVW+KGY
Sbjct: 499 VLRLSAATQNLPRSIICNVHGVNPRFLHVGEKVAADRESGKEVFSKGAYFLGKMVWAKGY 558

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
           +E++ LL+ +  +L G ++D+YGNGED   +Q AA+KL L +  + G+DHAD   H YKV
Sbjct: 559 KEMIDLLSKHKNDLQGFKLDVYGNGEDSQAVQSAARKLDLGISFFKGKDHADDSLHGYKV 618

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+C +HPSNDFFK F NC TY     FV    +A+A 
Sbjct: 619 FINPSVSDVLCTATAEALAMGKFVICPDHPSNDFFKSFSNCLTYKAPEEFVARVKEAMAS 678

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNS-------KR 417
           EP   T  Q + LSWE+ATERF++ +EL    +  R+    + +E   L           
Sbjct: 679 EPQPLTPEQRYSLSWEAATERFMEYSEL-DKALKNRNGQSGQGVERNRLRKIPLLPKFSD 737

Query: 418 IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           I++    + H+  +G E  R A GAIPG+   D+Q C +L L+
Sbjct: 738 IVDGGLAFAHHCLTGNEILRLATGAIPGTRDYDKQQCMDLNLL 780


>gi|242061874|ref|XP_002452226.1| hypothetical protein SORBIDRAFT_04g022040 [Sorghum bicolor]
 gi|241932057|gb|EES05202.1| hypothetical protein SORBIDRAFT_04g022040 [Sorghum bicolor]
          Length = 792

 Score =  536 bits (1381), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 247/461 (53%), Positives = 328/461 (71%), Gaps = 6/461 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+A+ TTASLPW+TGTAVNPLFRAAYLAK   + VTLV+PWL    Q+ VYP ++TF 
Sbjct: 322 KRHVAVVTTASLPWMTGTAVNPLFRAAYLAKGTRQDVTLVVPWLCKSDQELVYPNSMTFN 381

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQEAY+R WLE+R GF S F   FYPGKF+ +++SI+  GD ++ IP  EADIA+LE
Sbjct: 382 SPEEQEAYIRSWLEERLGFESNFKISFYPGKFSKERRSIIPAGDTSQFIPSREADIAILE 441

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 442 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 501

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SIVCNVHGVNPKFL+IGE+    ++ G   F+KGAY++G+MVW+KGY
Sbjct: 502 VLRLSAATQDLPKSIVCNVHGVNPKFLKIGEEITANRETGEAPFSKGAYFLGKMVWAKGY 561

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L+  +  +L G ++D+YG+GED  ++Q  A+KL L +  + GRDHAD   H YKV
Sbjct: 562 RELIDLMAKHKSDLEGFKLDVYGSGEDSQEVQSTARKLDLSLNFFKGRDHADNSLHGYKV 621

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+  
Sbjct: 622 FINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMTFPNCLTYKTSEEFVARVKEAMDR 681

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPS-----RHLESESLNSKRII 419
           EP   T  Q + LSWE+ATERF++ ++L   V+   +  P      ++  +   N   I+
Sbjct: 682 EPQPLTPEQRYNLSWEAATERFMEYSDL-DKVLNNEAAQPGQGRNRKNTRTSQPNLSDIM 740

Query: 420 EDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           +    + H   +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 741 DGGLAFAHRCLTGNEVLRLATGAIPGTRDYDKQHCIDMGLL 781


>gi|296083152|emb|CBI22788.3| unnamed protein product [Vitis vinifera]
          Length = 641

 Score =  536 bits (1380), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 247/456 (54%), Positives = 329/456 (72%), Gaps = 15/456 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLA   ++ VTL++PWL    Q+ VYP ++TF+
Sbjct: 190 KRHVAIVTTASLPWMTGTAVNPLFRAAYLASYAKQNVTLLVPWLCKKDQELVYPNSLTFS 249

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF + F   FYPGKF+  ++SI+  GD ++ IP  +ADIA+LE
Sbjct: 250 SPEEQEVYIRNWLEERVGFKADFKISFYPGKFSKSRRSIIPAGDTSQFIPSRDADIAILE 309

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQAF ++ +N+W+AR +CHK
Sbjct: 310 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFFVKHINNWVARAYCHK 369

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S++CNVHGVNPKFL+IGEK  E+++ G +AF+KGAY++G+MVW+KGY
Sbjct: 370 VLRLSAATQDLPKSVICNVHGVNPKFLKIGEKLAEERELGQRAFSKGAYFLGKMVWAKGY 429

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL+ +  +L G  +D++GNGED  ++Q AAK+L L +    GRDHAD   H YKV
Sbjct: 430 RELIDLLSRHKNDLDGFNLDVFGNGEDAHEVQTAAKRLHLNLNFMKGRDHADDSLHGYKV 489

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA+HPSN+FF  FPNC TY   + FV    +ALA 
Sbjct: 490 FINPSVSDVLCTATAEALAMGKFVICADHPSNEFFSSFPNCLTYKTSDDFVAKVKEALAN 549

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAFG 424
           EP   T  Q + LSWE+AT+RF++ ++L   +  K     S+ L                
Sbjct: 550 EPQPLTPEQRYNLSWEAATQRFMEYSDLDRVLNNKDDAQLSKRL---------------A 594

Query: 425 YLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           + HY  +G E  R   GAIPG+   D+Q C++L L+
Sbjct: 595 FAHYCLTGNELLRLCTGAIPGTRDYDKQHCRDLHLL 630


>gi|77548769|gb|ABA91566.1| digalactosyldiacylglycerol synthase 1, putative, expressed [Oryza
           sativa Japonica Group]
          Length = 729

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 252/468 (53%), Positives = 338/468 (72%), Gaps = 12/468 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 258 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSSKQDVTLVVPWLCKSDQELVYPNSMTFS 317

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQEAY+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 318 SPQEQEAYMRSWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 377

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 378 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGVIQAFFVKHINNLVARAYCHK 437

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S++CNVHGVNPKFLE+GE+   ++++G  +F+KGAY++G+MVW+KGY
Sbjct: 438 VLRLSGATQDLPKSMICNVHGVNPKFLEVGERIAAERESGQHSFSKGAYFLGKMVWAKGY 497

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   +  +L G+++D+YGNGED  ++Q AA KL L +  + GRDHAD   H YKV
Sbjct: 498 RELIDLYAKHKSDLEGIKLDIYGNGEDSHEVQSAAMKLNLNLNFHKGRDHADDSLHGYKV 557

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +A+A 
Sbjct: 558 FINPSISDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYKTSEDFVAKVKEAMAR 617

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVG-------DVVTKRS---KSPSRHLE-SESL 413
           +P   T  Q + LSWE+AT+RF++ +EL         D  T  S   KS    +E S SL
Sbjct: 618 DPQPLTPEQRYNLSWEAATQRFMEHSELDKVLSSSNRDCTTSTSGCGKSGDNKMEKSASL 677

Query: 414 -NSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            N   +++    + HY  +G E  R + GAIPG+L  ++Q   +L L+
Sbjct: 678 PNMSDMVDGGLAFAHYCFTGNELLRLSTGAIPGTLNYNKQHSLDLHLL 725


>gi|125533480|gb|EAY80028.1| hypothetical protein OsI_35196 [Oryza sativa Indica Group]
          Length = 683

 Score =  535 bits (1377), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 252/468 (53%), Positives = 338/468 (72%), Gaps = 12/468 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 205 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSSKQDVTLVVPWLCKSDQELVYPNSMTFS 264

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQEAY+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 265 SPQEQEAYMRSWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 324

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 325 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGVIQAFFVKHINNLVARAYCHK 384

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S++CNVHGVNPKFLE+GE+   ++++G  +F+KGAY++G+MVW+KGY
Sbjct: 385 VLRLSGATQDLPKSMICNVHGVNPKFLEVGERIAAERESGQHSFSKGAYFLGKMVWAKGY 444

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   +  +L G+++D+YGNGED  ++Q AA KL L +  + GRDHAD   H YKV
Sbjct: 445 RELIDLYAKHKSDLEGIKLDIYGNGEDSHEVQSAAMKLNLNLNFHKGRDHADDSLHGYKV 504

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +A+A 
Sbjct: 505 FINPSISDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYKTSEDFVAKVKEAMAR 564

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVG-------DVVTKRS---KSPSRHLE-SESL 413
           +P   T  Q + LSWE+AT+RF++ +EL         D  T  S   KS    +E S SL
Sbjct: 565 DPQPLTPEQRYNLSWEAATQRFMEHSELDKVLSSSNRDCTTSTSGCGKSGDNKMEKSASL 624

Query: 414 -NSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            N   +++    + HY  +G E  R + GAIPG+L  ++Q   +L L+
Sbjct: 625 PNMSDMVDGGLAFAHYCFTGNELLRLSTGAIPGTLNYNKQHSLDLHLL 672


>gi|115484289|ref|NP_001065806.1| Os11g0158400 [Oryza sativa Japonica Group]
 gi|77548768|gb|ABA91565.1| digalactosyldiacylglycerol synthase 1, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644510|dbj|BAF27651.1| Os11g0158400 [Oryza sativa Japonica Group]
          Length = 736

 Score =  534 bits (1376), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 252/468 (53%), Positives = 338/468 (72%), Gaps = 12/468 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 258 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSSKQDVTLVVPWLCKSDQELVYPNSMTFS 317

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQEAY+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 318 SPQEQEAYMRSWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 377

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 378 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGVIQAFFVKHINNLVARAYCHK 437

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S++CNVHGVNPKFLE+GE+   ++++G  +F+KGAY++G+MVW+KGY
Sbjct: 438 VLRLSGATQDLPKSMICNVHGVNPKFLEVGERIAAERESGQHSFSKGAYFLGKMVWAKGY 497

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   +  +L G+++D+YGNGED  ++Q AA KL L +  + GRDHAD   H YKV
Sbjct: 498 RELIDLYAKHKSDLEGIKLDIYGNGEDSHEVQSAAMKLNLNLNFHKGRDHADDSLHGYKV 557

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +A+A 
Sbjct: 558 FINPSISDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYKTSEDFVAKVKEAMAR 617

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVG-------DVVTKRS---KSPSRHLE-SESL 413
           +P   T  Q + LSWE+AT+RF++ +EL         D  T  S   KS    +E S SL
Sbjct: 618 DPQPLTPEQRYNLSWEAATQRFMEHSELDKVLSSSNRDCTTSTSGCGKSGDNKMEKSASL 677

Query: 414 -NSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            N   +++    + HY  +G E  R + GAIPG+L  ++Q   +L L+
Sbjct: 678 PNMSDMVDGGLAFAHYCFTGNELLRLSTGAIPGTLNYNKQHSLDLHLL 725


>gi|357149367|ref|XP_003575088.1| PREDICTED: digalactosyldiacylglycerol synthase 1,
           chloroplastic-like [Brachypodium distachyon]
          Length = 782

 Score =  533 bits (1374), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 248/460 (53%), Positives = 330/460 (71%), Gaps = 6/460 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++HIA+ TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF 
Sbjct: 314 KRHIAVVTTASLPWMTGTAVNPLFRAAYLAKSEKQDVTLVVPWLCRSDQELVYPYSVTFN 373

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R W+++R GF S F   FYPGKF+ +++SIL  GD ++ I   EADIA+LE
Sbjct: 374 SPEEQETYIRNWVDERLGFASNFKISFYPGKFSKERRSILPAGDTSQFISSREADIAILE 433

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 434 EPEHLNWYHHGKRWTEKFNHVVGIVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 493

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SIVCNVHGVNPKFL+IG+K    ++ G ++F+KGAY++G+MVW+KGY
Sbjct: 494 VLRLSAATQDLPKSIVCNVHGVNPKFLKIGDKLTADREGGQQSFSKGAYFLGKMVWAKGY 553

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL  +  +L G ++D+YG+GED  ++   A+KL L ++ + G DHAD   H YKV
Sbjct: 554 RELVDLLAKHKGDLEGFKLDVYGSGEDSQEVHSTARKLDLNLKFFKGMDHADDSLHRYKV 613

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA HPSN+FF  FPNC TY   + FV    +A+A 
Sbjct: 614 FINPSISDVLCTATAEALAMGKFVVCAEHPSNEFFMSFPNCLTYKTSDEFVARVKEAMAR 673

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSP----SRHLESESLNSKRIIE 420
           EP   T  + + LSWE+ATERF++ ++L  D V   + SP    +R   +   +    ++
Sbjct: 674 EPQPLTPEERYNLSWEAATERFMEYSDL--DKVLNDNSSPRLRETRSRRTSQPSFSNAVD 731

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
               + H   +G E  R A GAIPG+   DEQ C ++GL+
Sbjct: 732 GGLAFAHRCLTGSEVLRLATGAIPGTRDYDEQHCADMGLL 771


>gi|27542603|gb|AAO16600.1| digalactosyldiacylglycerol synthase, partial [Xerophyta humilis]
          Length = 465

 Score =  533 bits (1373), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 250/454 (55%), Positives = 327/454 (72%), Gaps = 5/454 (1%)

Query: 12  TTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEA 71
           TTASLPW+TGTAVNPLFRAAYLAK  ++RVTLV+PWL    Q+ VYP ++TF SP+EQEA
Sbjct: 1   TTASLPWMTGTAVNPLFRAAYLAKSAKQRVTLVVPWLCESDQELVYPNSLTFGSPEEQEA 60

Query: 72  YVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTW 131
           Y R WLE+R GF + F   FYPGKF+ +++SI+ VGD ++ I  ++ADIA+LEEPEHL W
Sbjct: 61  YTRNWLEERIGFKADFRISFYPGKFSKERRSIIPVGDTSQFISAKDADIAILEEPEHLNW 120

Query: 132 FHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAA 191
           +HHGKRW  KF  VVG+VHTNYLEY+KREK    QAFL++ +N+W+ R +CHKV+RLS+A
Sbjct: 121 YHHGKRWTDKFNHVVGVVHTNYLEYIKREKYGAFQAFLVKHINNWVTRAYCHKVLRLSSA 180

Query: 192 TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLL 251
           TQ  P S++CNVHGVNPKFL+IGE+   ++QNG++AF+KGAY++G+MVW+KGY EL+ L 
Sbjct: 181 TQNLPKSVICNVHGVNPKFLKIGERVAAERQNGHQAFSKGAYFLGKMVWAKGYRELIDLF 240

Query: 252 NIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTT 311
           + +  +L G ++D+YGNGED  ++Q  AKKL L +    GRDHAD   H YKVF+NPS +
Sbjct: 241 SKHKNDLEGFKLDVYGNGEDSQEVQSTAKKLDLNLNFLKGRDHADDSLHGYKVFINPSVS 300

Query: 312 DVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTD 371
           DV  TATAEALAMGK V+CA+HPSN+FF  FPNC TY     FV    +AL+ EP   TD
Sbjct: 301 DVYGTATAEALAMGKFVICADHPSNEFFMAFPNCLTYKTSKDFVARVKEALSSEPQPLTD 360

Query: 372 AQTHQLSWESATERFLQVAELVGD----VVTKRSKSPSRHLESESLNS-KRIIEDAFGYL 426
            Q H LSWE+AT+RF++ +EL  D    V     +   R  +S S+ S   +++    + 
Sbjct: 361 EQRHNLSWEAATQRFMEYSELNKDCKIAVAQNSMQKTLRIKKSVSMPSLSEVVDGGLAFA 420

Query: 427 HYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           HY  +G E  R A GAIPG+   D   CK+L L+
Sbjct: 421 HYCLTGNEVLRLATGAIPGTRDYDMPHCKDLHLL 454


>gi|413937158|gb|AFW71709.1| digalactosyldiacylglycerol synthase 1 [Zea mays]
          Length = 792

 Score =  533 bits (1372), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 246/461 (53%), Positives = 327/461 (70%), Gaps = 6/461 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK   + VTLV+PWL    Q+ VYP ++TF+
Sbjct: 322 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKGTRQDVTLVVPWLCKSDQELVYPNSMTFS 381

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF S F   FYPGKF+ +++SI+  GD ++ IP  EADIA+LE
Sbjct: 382 SPEEQETYIRSWLEERLGFESNFKISFYPGKFSKERRSIIPAGDTSQFIPSREADIAILE 441

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 442 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 501

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SIVCNVHGVNPKFL+IGE+    + +G   F+KGAY++G+MVW+KGY
Sbjct: 502 VLRLSAATQDLPKSIVCNVHGVNPKFLKIGEEITANRGSGEPPFSKGAYFLGKMVWAKGY 561

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L+  +  +L G ++D+YG+GED  ++Q  A++L L +  + GRDHAD   H YKV
Sbjct: 562 RELIDLMAKHKSDLEGFKLDVYGSGEDSQEVQSTARRLDLSLNFFKGRDHADNSLHGYKV 621

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+  
Sbjct: 622 FINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMSFPNCLTYKTSEEFVARVKEAMDR 681

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPS-----RHLESESLNSKRII 419
           EP   T  Q + LSWE+ATERF++ ++L   V+   +  P      ++  +   N   I+
Sbjct: 682 EPQPLTPEQRYNLSWEAATERFMEYSDL-DKVLNNEAAQPGQGRKRKNKRTSQPNLSDIM 740

Query: 420 EDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           +      H   +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 741 DGGLALAHRCLTGNEVLRLATGAIPGTRDYDKQHCVDMGLL 781


>gi|168035561|ref|XP_001770278.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678495|gb|EDQ64953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 571

 Score =  531 bits (1368), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 260/477 (54%), Positives = 331/477 (69%), Gaps = 23/477 (4%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
            ++IAI TTASLPW+TGTAVNPLFRAAYLA+ GE++VTL++PWLS   Q  VYP  +TF 
Sbjct: 88  HRNIAIVTTASLPWMTGTAVNPLFRAAYLARSGEQKVTLLVPWLSKKDQVLVYPNQMTFE 147

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP EQE++VR W+EDR GF   F   FYPGKF+ +K+SILA GDI++ IPD+EAD+AVLE
Sbjct: 148 SPAEQESFVRKWVEDRVGFKCDFKISFYPGKFSLEKRSILAAGDISQFIPDQEADVAVLE 207

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W++HG+RW  KF+ VVG+VHTNYLEYVKREKN R+QAFLL+ VN+W+ R++C K
Sbjct: 208 EPEHLNWYYHGRRWTDKFQHVVGVVHTNYLEYVKREKNGRVQAFLLKHVNNWVVRIYCDK 267

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ  P S VCNVHGVNP+FL IG    E + +  K F+KGAYY+G+MVW KGY
Sbjct: 268 VLRLSAATQNLPRSSVCNVHGVNPQFLSIGRAIAEVEGDKPK-FSKGAYYLGKMVWGKGY 326

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL    + L  + MD++G+GED D ++  A+K  L +  +PGRDHAD   H YKV
Sbjct: 327 RELMDLLAQNKEVLGNINMDIFGSGEDSDAVKDEAQKHGLALDFHPGRDHADASLHGYKV 386

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DVVCT TAEALAMGKIVVCA+HPSN+FF  FPNC  Y     FVE    AL  
Sbjct: 387 FINPSISDVVCTTTAEALAMGKIVVCADHPSNEFFMSFPNCYIYRTPEEFVEKVKLALTS 446

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAE--------------------LVGDVVTKRSKSP 404
           EP   T    H LSWE+AT+RF+  AE                    L  D   +R+ + 
Sbjct: 447 EPLPLTPELQHLLSWEAATDRFIDSAEIKRLPPRRAIMGKGKGKLRRLPIDAAKRRTMAL 506

Query: 405 SRHLESESLNSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVT 461
           S  L   +L++  +++    + HY  SG E +RRA GA+PG++  DE+  K+L L T
Sbjct: 507 SLALPKRALSN--MLDRGLAFAHYFLSGIEVARRAAGALPGTMYIDEEYRKDLNLPT 561


>gi|413937159|gb|AFW71710.1| hypothetical protein ZEAMMB73_218762 [Zea mays]
          Length = 567

 Score =  531 bits (1367), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 246/461 (53%), Positives = 327/461 (70%), Gaps = 6/461 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK   + VTLV+PWL    Q+ VYP ++TF+
Sbjct: 97  KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKGTRQDVTLVVPWLCKSDQELVYPNSMTFS 156

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF S F   FYPGKF+ +++SI+  GD ++ IP  EADIA+LE
Sbjct: 157 SPEEQETYIRSWLEERLGFESNFKISFYPGKFSKERRSIIPAGDTSQFIPSREADIAILE 216

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 217 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 276

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SIVCNVHGVNPKFL+IGE+    + +G   F+KGAY++G+MVW+KGY
Sbjct: 277 VLRLSAATQDLPKSIVCNVHGVNPKFLKIGEEITANRGSGEPPFSKGAYFLGKMVWAKGY 336

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L+  +  +L G ++D+YG+GED  ++Q  A++L L +  + GRDHAD   H YKV
Sbjct: 337 RELIDLMAKHKSDLEGFKLDVYGSGEDSQEVQSTARRLDLSLNFFKGRDHADNSLHGYKV 396

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+  
Sbjct: 397 FINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMSFPNCLTYKTSEEFVARVKEAMDR 456

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPS-----RHLESESLNSKRII 419
           EP   T  Q + LSWE+ATERF++ ++L   V+   +  P      ++  +   N   I+
Sbjct: 457 EPQPLTPEQRYNLSWEAATERFMEYSDL-DKVLNNEAAQPGQGRKRKNKRTSQPNLSDIM 515

Query: 420 EDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           +      H   +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 516 DGGLALAHRCLTGNEVLRLATGAIPGTRDYDKQHCVDMGLL 556


>gi|326489161|dbj|BAK01564.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 727

 Score =  531 bits (1367), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 245/464 (52%), Positives = 337/464 (72%), Gaps = 8/464 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTL++PWL    Q+ VYP N+TF+
Sbjct: 253 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSSKQYVTLMVPWLCKSDQELVYPNNMTFS 312

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP++QE+Y+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 313 SPEDQESYIRDWLEERVGFKTDFRISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 372

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 373 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGAIQAFFVKHINNLVARAYCHK 432

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+   S++CNVHGVNPKFLE+GE+   ++++G ++ +KGAY++G+MVW+KGY
Sbjct: 433 VLRLSGATQDLARSMICNVHGVNPKFLEVGERIAAERESGQQSMSKGAYFLGKMVWAKGY 492

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL  +  +L G ++D+YGNGED  ++Q AAKKL L +  + GRDHAD   H YKV
Sbjct: 493 RELIDLLAKHKTDLDGFKLDVYGNGEDSVEVQSAAKKLDLNLNFHKGRDHADDSLHGYKV 552

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY     FV    +A+  
Sbjct: 553 FVNPSISDVLCTATAEALAMGKFVVCADHPSNEFFRSFPNCLTYTTSEDFVAKVKEAMTR 612

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL------VGDVVTKRSKSPSRHL-ESESL-NSK 416
           +P   T  Q + LSWE+AT+RF++ +EL       GD  +    S  R + +S SL N  
Sbjct: 613 DPQPLTPEQRYNLSWEAATQRFMEHSELDKVLSSNGDCASTSGNSVDRKMRKSASLPNMS 672

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            I++    + HY  +G E  R + GA+PG+L  ++Q   ++ L+
Sbjct: 673 DIVDGGLAFAHYCFTGSELLRLSTGAVPGTLNYNKQHSVDMHLL 716


>gi|238011522|gb|ACR36796.1| unknown [Zea mays]
 gi|413925644|gb|AFW65576.1| hypothetical protein ZEAMMB73_139867 [Zea mays]
          Length = 620

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 242/469 (51%), Positives = 336/469 (71%), Gaps = 14/469 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 140 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSCKQDVTLVVPWLCKSDQELVYPNSMTFS 199

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 200 SPEEQEKYMRNWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 259

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  V+G+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 260 EPEHLNWYHHGKRWTDKFNHVIGVVHTNYLEYIKREKNGAIQAFFVKHINNLVARAYCHK 319

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           ++RLS ATQ+ P S +CNVHGVNPKFLE+GE+   ++++G ++F+KGAY++G+MVW+KGY
Sbjct: 320 ILRLSGATQDLPKSTICNVHGVNPKFLEVGERIAAERESGQQSFSKGAYFLGKMVWAKGY 379

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   ++ +L G ++D+YGNGED  ++Q AA+KL L +  + GRDHAD   H YKV
Sbjct: 380 RELIDLFAKHNSDLEGFKLDIYGNGEDSHEVQSAARKLNLNLNFHKGRDHADDSLHGYKV 439

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +A+A 
Sbjct: 440 FINPSISDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYKTSEDFVARVKEAMAR 499

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELV------------GDVVTKRSKSPSRHL--ES 410
           +P   T  Q + LSW++AT+RF++ +EL             G V T+  KS    +   +
Sbjct: 500 DPQPLTPEQRYNLSWDAATQRFMEHSELDKVLNGDSNSECGGSVGTETGKSAGTMMRRSA 559

Query: 411 ESLNSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
            + N   +++    ++HY  +G E  R + GA+PG+   ++Q   +L L
Sbjct: 560 SAPNMSDVVDGGLAFVHYCFTGSELLRLSTGAVPGTRDYNKQHSLDLRL 608


>gi|293332506|ref|NP_001169316.1| uncharacterized protein LOC100383181 [Zea mays]
 gi|224028615|gb|ACN33383.1| unknown [Zea mays]
 gi|413925643|gb|AFW65575.1| hypothetical protein ZEAMMB73_139867 [Zea mays]
          Length = 739

 Score =  530 bits (1365), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 242/469 (51%), Positives = 336/469 (71%), Gaps = 14/469 (2%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 259 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSCKQDVTLVVPWLCKSDQELVYPNSMTFS 318

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 319 SPEEQEKYMRNWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 378

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  V+G+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 379 EPEHLNWYHHGKRWTDKFNHVIGVVHTNYLEYIKREKNGAIQAFFVKHINNLVARAYCHK 438

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           ++RLS ATQ+ P S +CNVHGVNPKFLE+GE+   ++++G ++F+KGAY++G+MVW+KGY
Sbjct: 439 ILRLSGATQDLPKSTICNVHGVNPKFLEVGERIAAERESGQQSFSKGAYFLGKMVWAKGY 498

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   ++ +L G ++D+YGNGED  ++Q AA+KL L +  + GRDHAD   H YKV
Sbjct: 499 RELIDLFAKHNSDLEGFKLDIYGNGEDSHEVQSAARKLNLNLNFHKGRDHADDSLHGYKV 558

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +A+A 
Sbjct: 559 FINPSISDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYKTSEDFVARVKEAMAR 618

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELV------------GDVVTKRSKSPSRHL--ES 410
           +P   T  Q + LSW++AT+RF++ +EL             G V T+  KS    +   +
Sbjct: 619 DPQPLTPEQRYNLSWDAATQRFMEHSELDKVLNGDSNSECGGSVGTETGKSAGTMMRRSA 678

Query: 411 ESLNSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
            + N   +++    ++HY  +G E  R + GA+PG+   ++Q   +L L
Sbjct: 679 SAPNMSDVVDGGLAFVHYCFTGSELLRLSTGAVPGTRDYNKQHSLDLRL 727


>gi|226495503|ref|NP_001152532.1| digalactosyldiacylglycerol synthase 1 [Zea mays]
 gi|195657187|gb|ACG48061.1| digalactosyldiacylglycerol synthase 1 [Zea mays]
          Length = 792

 Score =  529 bits (1363), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 245/461 (53%), Positives = 326/461 (70%), Gaps = 6/461 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK   + VTLV+PWL    Q+ VYP ++TF+
Sbjct: 322 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKGTRQDVTLVVPWLCKSDQELVYPNSMTFS 381

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF S F   FYPGKF+ +++SI+  GD ++ IP  EADIA+LE
Sbjct: 382 SPEEQETYIRSWLEERLGFESNFKISFYPGKFSKERRSIIPAGDTSQFIPSREADIAILE 441

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  LQAFL++ +N+W+ R +C K
Sbjct: 442 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGALQAFLVKHINNWVTRAYCDK 501

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P SIVCNVHGVNPKFL+IGE+    + +    F+KGAY++G+MVW+KGY
Sbjct: 502 VLRLSAATQDLPKSIVCNVHGVNPKFLKIGEEITANRGSWEPPFSKGAYFLGKMVWAKGY 561

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L+  +  +L G ++D+YG+GED  ++Q  A++L L +  + GRDHAD   H YKV
Sbjct: 562 RELIDLMAKHKSDLEGFKLDVYGSGEDSQEVQSTARRLDLSLNFFKGRDHADNSLHGYKV 621

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+  
Sbjct: 622 FINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMSFPNCLTYKTSEEFVARVKEAMDR 681

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPS-----RHLESESLNSKRII 419
           EP   T  Q + LSWE+ATERF++ ++L   V+   +  P      ++  +   N   I+
Sbjct: 682 EPQPLTPEQRYNLSWEAATERFMEYSDL-DKVLNNEAAQPGQGRKRKNKRTSQPNLSDIM 740

Query: 420 EDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           +      H   +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 741 DGGLALAHRCLTGNEVLRLATGAIPGTRDYDKQHCVDMGLL 781


>gi|168018107|ref|XP_001761588.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687272|gb|EDQ73656.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 787

 Score =  529 bits (1363), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 257/485 (52%), Positives = 334/485 (68%), Gaps = 18/485 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           +++IAI TTASLPW+TGTAVNPLFRAA+LAK G++ VTL++PWL    Q+QVYP  +TF 
Sbjct: 307 RRNIAIVTTASLPWMTGTAVNPLFRAAFLAKAGKQNVTLLVPWLCKKDQEQVYPNRMTFD 366

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP++QE+YVR W+E R GF S F   FYPGKF+ DK+SILA GDI++ IP EEAD+AVLE
Sbjct: 367 SPEDQESYVRDWVEARVGFKSDFKIAFYPGKFSTDKRSILASGDISDFIPKEEADVAVLE 426

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHLTWF+HGKRW  KF+ VVGIVHTNYLEYVKREKN   +AF LE +N+W+AR +C+K
Sbjct: 427 EPEHLTWFYHGKRWTDKFQHVVGIVHTNYLEYVKREKNGAARAFALEHINNWMARAYCNK 486

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ++P S V NVHGV P FLE G++   +   GN  F+KGAYY+G+M+W KGY
Sbjct: 487 VLRLSAATQDFPRSSVVNVHGVGPIFLETGKRLAAESGEGNPTFSKGAYYLGKMIWGKGY 546

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L      +L+ +E+D++G+GED  ++   A++  L +R Y GRDH D   H YKV
Sbjct: 547 RELVDLFVKNKDQLSNVELDVFGSGEDSHEVHAEAQQNGLRMRFYQGRDHGDNTLHGYKV 606

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DVVCT TAEALAMGKI VCA+HPSNDFF+ FPNC  Y     FVE   +A+A 
Sbjct: 607 FINPSLSDVVCTTTAEALAMGKIAVCADHPSNDFFRSFPNCYFYRTPEEFVEKVQQAMAS 666

Query: 365 EPAQPTDAQTHQLSWESATERFLQVA--ELVGDVVTKRSKSPSRHLESESLNSKR----- 417
           EP   +    H LSWE+AT+RF+  A  +++     K+S+S +  L  E ++ K      
Sbjct: 667 EPVPLSPELQHLLSWEAATDRFIDSAGIDMLPPKGAKKSRSKTPALLGEEIDQKTMTLST 726

Query: 418 -------IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKGHCHR 470
                  I++    + HY+ +GF+  R  LGA P +   D Q CK+LGL       H  R
Sbjct: 727 SSPDLTDIVDKGLYFAHYLMTGFDPMRNLLGAHPQTKHIDSQHCKDLGLPPP----HVQR 782

Query: 471 PNGSW 475
           P   W
Sbjct: 783 PVYGW 787


>gi|242067509|ref|XP_002449031.1| hypothetical protein SORBIDRAFT_05g003730 [Sorghum bicolor]
 gi|241934874|gb|EES08019.1| hypothetical protein SORBIDRAFT_05g003730 [Sorghum bicolor]
          Length = 741

 Score =  527 bits (1357), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 238/454 (52%), Positives = 326/454 (71%), Gaps = 14/454 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 258 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSSKQDVTLVVPWLCKSDQELVYPNSMTFS 317

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE Y+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 318 SPQEQETYMRNWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 377

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 378 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGAIQAFFVKHINNLVARAYCHK 437

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           ++RLS ATQ+ P S++CNVHGVNPKFLE+GE+   ++++G ++F+KGAY++G+MVW+KGY
Sbjct: 438 ILRLSGATQDLPKSMICNVHGVNPKFLEVGERIAAERESGQQSFSKGAYFLGKMVWAKGY 497

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   +  +L G ++D+YGNGED  ++Q AA+KL L +  + GRDHAD   H YKV
Sbjct: 498 RELIDLFAKHKSDLEGFKLDIYGNGEDSHEVQSAARKLNLNLNFHKGRDHADDSLHGYKV 557

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +A+  
Sbjct: 558 FINPSISDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYKTSEDFVARVKEAMTR 617

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL------------VGDVVTKRSKSPSRHL--ES 410
           +P   T  Q + LSWE+AT+RF++ +EL               V T+  KS    +   +
Sbjct: 618 DPQPLTPEQRYDLSWEAATQRFMEHSELDKVLNSDSSSECTSSVTTESGKSRDTRMRRSA 677

Query: 411 ESLNSKRIIEDAFGYLHYVASGFETSRRALGAIP 444
            + N   +++    ++HY  +G E  R + GA+P
Sbjct: 678 SAPNMSDVVDGGLAFVHYCFTGSELLRLSTGAVP 711


>gi|357157408|ref|XP_003577788.1| PREDICTED: digalactosyldiacylglycerol synthase 1,
           chloroplastic-like [Brachypodium distachyon]
          Length = 739

 Score =  526 bits (1354), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 250/462 (54%), Positives = 333/462 (72%), Gaps = 8/462 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTL++PWL    Q+ VYP ++TF+
Sbjct: 269 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSSKQYVTLMVPWLCKSDQELVYPNSMTFS 328

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP++QE Y+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 329 SPQDQETYIREWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 388

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 389 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGAIQAFFVKHINNLVARAYCHK 448

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+   SIVCNVHGVNPKFLE+GE+   ++ +G ++ +KGAY++G+MVW+KGY
Sbjct: 449 VLRLSGATQDLARSIVCNVHGVNPKFLEVGERIAAERGSGQQSMSKGAYFLGKMVWAKGY 508

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   +   L G ++D+YGNGED  ++Q AAKKL L +  + GRDHAD   H YKV
Sbjct: 509 RELIDLFAKHKSALEGFKLDIYGNGEDSLEVQSAAKKLDLNLNFHKGRDHADDSLHGYKV 568

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSNDFF+ FPNC TY     FV    +A+  
Sbjct: 569 FINPSISDVLCTATAEALAMGKFVVCADHPSNDFFRSFPNCLTYTTSEDFVAKVKEAMTR 628

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHL-----ESESL-NSKRI 418
           +P   T  Q + LSWE+AT+RF++ +EL  D V   S S S H      +S SL N   +
Sbjct: 629 DPQPLTPEQRYNLSWEAATQRFMEHSEL--DKVLSSSSSTSEHATRKMKKSASLPNMSDV 686

Query: 419 IEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
           I+    + HY  +G E  R + GAIPG+L  ++Q   +L L+
Sbjct: 687 IDGGLAFAHYCFTGNELLRLSTGAIPGTLNYNKQHSLDLHLL 728


>gi|49616595|gb|AAT67159.1| digalactosyldiacylglycerol synthase [Nicotiana tabacum]
          Length = 471

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 246/460 (53%), Positives = 331/460 (71%), Gaps = 4/460 (0%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTL++PWL    Q+ VYP  +TF+
Sbjct: 1   KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSEKQNVTLLVPWLCKSDQELVYPNQLTFS 60

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP++Q  Y+R WLE+R GF + F   FYPGKF+ +++SI+  GD ++ IP  +ADIA+LE
Sbjct: 61  SPEDQVLYIRNWLEERIGFKANFKISFYPGKFSKERRSIMPAGDTSQFIPSRDADIAILE 120

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREKN  LQAF ++ VN+ + R +C K
Sbjct: 121 EPEHLNWYHHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFFVKHVNNLVTRAYCDK 180

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S+VCNVHGVNPKFL+IGEK    +Q+G + F+KGAY++G+MVW+KGY
Sbjct: 181 VLRLSAATQDLPKSLVCNVHGVNPKFLKIGEKAAADRQSGQQVFSKGAYFLGKMVWAKGY 240

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL  +  +L G  +D+YGNGED  ++Q  AK+L L V    GRDHAD   H YKV
Sbjct: 241 RELIDLLANHKTDLDGFNLDVYGNGEDAHEVQSTAKRLNLNVNFMKGRDHADDSLHGYKV 300

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DV+CTATAEALAMGK VVCA+HPSN+FF+ FPNC TY   + FV    +A++ 
Sbjct: 301 FINPSISDVLCTATAEALAMGKFVVCADHPSNEFFQAFPNCLTYRTPDDFVAKVQEAMSS 360

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHL---ESESL-NSKRIIE 420
           EP   T  + ++LSWE+AT+RF++ ++L   + ++ S    R     +S SL N + +++
Sbjct: 361 EPQPLTPEEQYKLSWEAATQRFMEYSDLDKVLASETSLDRRRQKVIGKSVSLPNLEEMVD 420

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
               + H   +G E  R   GAIP S   D++ CK+L L+
Sbjct: 421 GGLAFTHNCFTGNEFLRLCTGAIPRSRDYDKEHCKDLHLL 460


>gi|168040216|ref|XP_001772591.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676146|gb|EDQ62633.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 527

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 254/485 (52%), Positives = 333/485 (68%), Gaps = 18/485 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           +++IAI TTASLPW+TGTAVNPLFRAA+LAK G++ VTL++PWLS   Q+QVYP  +TF 
Sbjct: 47  RRNIAIVTTASLPWMTGTAVNPLFRAAFLAKTGKQNVTLLVPWLSKNDQQQVYPNRMTFD 106

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP++QE+YVR W+E R GF S F   FYPGKF+ +K+SILA GDI+E IP+EEAD+AVLE
Sbjct: 107 SPEDQESYVRDWVEARVGFKSDFKIAFYPGKFSVEKRSILAAGDISEFIPNEEADVAVLE 166

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHLTW++HGKRW  KF+ VVGIVHTNYLEYVKREKN  +QAF LE +N+W+ R +C+K
Sbjct: 167 EPEHLTWYYHGKRWTDKFQHVVGIVHTNYLEYVKREKNGAVQAFFLEHINNWMVRAYCNK 226

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQE P S V NVHGV P+FLE G++   +  +G   F+KGAYY+G+M+W KGY
Sbjct: 227 VLRLSAATQELPKSSVMNVHGVGPRFLETGKRLASEAPDGKPTFSKGAYYLGKMIWGKGY 286

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L       L+ +E+D++G+GED  ++   A++  L ++ Y GRDH D   HDYK+
Sbjct: 287 RELVDLFVDNKDLLSNVELDVFGSGEDSHEVHAEAQQNGLRMKFYQGRDHGDKSLHDYKI 346

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DVVCT TAEALAMGKIVVCA+HPSNDFF+ FPNC  Y     FVE   +A++ 
Sbjct: 347 FINPSLSDVVCTTTAEALAMGKIVVCADHPSNDFFRSFPNCYIYRTPEEFVEKVQQAMSS 406

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAEL--VGDVVTKRSKSPSRHLESESLNSKR----- 417
           EP   +    H LSWE+AT+RF+  A +  +     K+S+     L SE +  K      
Sbjct: 407 EPEPLSPELQHLLSWEAATDRFIDSAGINKLPPKGAKKSRPKEPVLLSEGVEQKTMTLST 466

Query: 418 -------IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELGKGHCHR 470
                  I++      HY+ +G +  R  +GA+P +   D Q CK+LGL       H  R
Sbjct: 467 STPDLAGIVDKGLYVSHYLLAGIDPWRSLMGALPETKHIDAQHCKDLGLPPP----HVQR 522

Query: 471 PNGSW 475
           P   W
Sbjct: 523 PVYGW 527


>gi|125585725|gb|EAZ26389.1| hypothetical protein OsJ_10273 [Oryza sativa Japonica Group]
          Length = 423

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 255/456 (55%), Positives = 318/456 (69%), Gaps = 41/456 (8%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           ++QHIAIFTTASLPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q  VYP  + F
Sbjct: 3   RKQHIAIFTTASLPWMTGTAVNPLFRAAYLAKAGDWEVTLVVPWLSKGDQLLVYPNKMKF 62

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
           + P EQE YVR WLE+R G    F+ +FYPGKF+ +K+SIL  GDIT+ + D++ADIAVL
Sbjct: 63  SVPGEQEGYVRRWLEERIGLLPKFEIKFYPGKFSTEKRSILPAGDITQTVSDDKADIAVL 122

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTW+HHG+RWK KFR V+G+VHTNYLEYVKRE+N  + AFLL+ +NSW+  ++CH
Sbjct: 123 EEPEHLTWYHHGRRWKNKFRKVIGVVHTNYLEYVKRERNGYIHAFLLKHINSWVTDIYCH 182

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
           KVIRLSAATQE P SIVCNVHGVNPKF+EIG+ K +Q     +AF KGAYYIG+MVWSKG
Sbjct: 183 KVIRLSAATQEVPRSIVCNVHGVNPKFIEIGKLKHQQISQREQAFFKGAYYIGKMVWSKG 242

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYK 303
           Y ELL LL  + KEL+GL+M+LYG+GED D+++ +A+KL L VRVYPGRDH D IFH+  
Sbjct: 243 YTELLQLLQKHQKELSGLKMELYGSGEDSDEVKASAEKLNLDVRVYPGRDHGDSIFHE-- 300

Query: 304 VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALA 363
                                                 FPNC  Y+    FV  T+KALA
Sbjct: 301 --------------------------------------FPNCHMYNTEKEFVRLTMKALA 322

Query: 364 EEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKR-SKSPSRHLESESLNSKRIIEDA 422
           EEP   ++   H+LSWE+ATERF++VA++   +  K+ S SP   +       K+ +E+A
Sbjct: 323 EEPIPLSEELRHELSWEAATERFVRVADIAPIMSIKQHSPSPQYFMYISPDELKKNMEEA 382

Query: 423 FGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
             + H   SGFET+R   GAIP +LQPDEQ CKELG
Sbjct: 383 SAFFHNAISGFETARCVFGAIPNTLQPDEQQCKELG 418


>gi|168037594|ref|XP_001771288.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677377|gb|EDQ63848.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 568

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 250/459 (54%), Positives = 327/459 (71%), Gaps = 7/459 (1%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           +++IAI TTASLPW+TGTAVNPLFRAAYLA+ GE++V L++PWL    Q  VYP  ITF 
Sbjct: 102 RRNIAIVTTASLPWMTGTAVNPLFRAAYLARSGEQKVNLLVPWLCKKDQVLVYPNQITFE 161

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           +P EQE +VR W+EDR GF   F   FYPGKF+ +K+SILA GDI++ IP++EAD+AVLE
Sbjct: 162 TPAEQERFVRKWVEDRVGFQCDFKLSFYPGKFSTEKRSILAAGDISQFIPNQEADVAVLE 221

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W++HG+RW  KF+ VVG+VHTNYLEYVKRE+N  +QAFLL+ VN+W+ R++C+K
Sbjct: 222 EPEHLNWYYHGRRWTDKFQHVVGVVHTNYLEYVKRERNGSVQAFLLKHVNNWVVRIYCNK 281

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S VCNVHGVNP+FL IG+   E + N  K F+ GAYY+G+MVW KGY
Sbjct: 282 VLRLSAATQDLPRSSVCNVHGVNPQFLRIGKGLAEIEGNEPK-FSMGAYYLGKMVWGKGY 340

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ LL    + L  + +D++G+GED D ++  A++  L +  YPGRDHAD   H YKV
Sbjct: 341 RELVDLLVQNKEVLCNINLDIFGSGEDSDAVRDEAQQNGLALNFYPGRDHADASLHGYKV 400

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DVVCT TAEALAMGKIVVCA+HPSN+FF  FPNC TY     FVE    AL+ 
Sbjct: 401 FINPSESDVVCTTTAEALAMGKIVVCADHPSNEFFMPFPNCYTYRTPEEFVEKVKLALSS 460

Query: 365 EPAQPTDAQTHQLSWESATERFLQVA----ELVGDVVTKRSKSPSRHLESESLNSKRIIE 420
           EP   T    H LSWE+AT+RF+  A     +  +    R+ S S  L  ++L++  +I+
Sbjct: 461 EPLPLTPELQHLLSWEAATDRFIDSAGRKRRIPIEAAKGRTMSLSLALPKKTLSN--MID 518

Query: 421 DAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
               + HY  SG E +R A G +PG++   E+  K+L L
Sbjct: 519 TGLAFSHYFLSGIEIARIAAGGLPGTMNIGEEYRKDLDL 557


>gi|224088090|ref|XP_002308321.1| predicted protein [Populus trichocarpa]
 gi|222854297|gb|EEE91844.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  516 bits (1329), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 239/449 (53%), Positives = 319/449 (71%), Gaps = 7/449 (1%)

Query: 19  LTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLE 78
           +TGTAVNPL+RAAYLAK  ++ VTL++PWL    Q+ VYP N+TF SP+EQE Y+R WLE
Sbjct: 1   MTGTAVNPLYRAAYLAKSEKQNVTLLVPWLCKSDQELVYPNNLTFTSPEEQENYIRNWLE 60

Query: 79  DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGKRW 138
           +R GF + F   FYPGKFA +++SI+  GD ++ IP ++ADIA+LEEPEHL W+HHGKRW
Sbjct: 61  ERIGFKADFKISFYPGKFAKERRSIIPAGDTSQFIPSKDADIAILEEPEHLNWYHHGKRW 120

Query: 139 KAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNS 198
             KF  VVG+VHTNYLEY+KREKN  LQAF ++ +N+ + R +CHKV+RLSAATQ+ P S
Sbjct: 121 TTKFNHVVGVVHTNYLEYIKREKNGALQAFFVKHINNLVTRAYCHKVLRLSAATQDLPKS 180

Query: 199 IVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKEL 258
           ++CNVHGVNPKFL+IGEK   + + G +AF+KGAY++G+MVW+KGY+EL+ LL  +  +L
Sbjct: 181 VICNVHGVNPKFLKIGEKVAAESELGQQAFSKGAYFLGKMVWAKGYKELIDLLAKHKNDL 240

Query: 259 AGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTAT 318
            G  +D++GNGED +++Q  AK+L L +    GRDHAD   H YKVF+NPS +DV+CTAT
Sbjct: 241 DGFNLDVFGNGEDANEVQTTAKRLDLNLNFLKGRDHADDSLHGYKVFINPSLSDVLCTAT 300

Query: 319 AEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLS 378
           AEALAMGK VVCA+HPSN++F+ FPNC TY     FV    +ALA EP   T  Q + LS
Sbjct: 301 AEALAMGKFVVCADHPSNEYFRSFPNCLTYKTSEDFVARVKEALANEPHPLTPEQIYNLS 360

Query: 379 WESATERFLQVAEL-------VGDVVTKRSKSPSRHLESESLNSKRIIEDAFGYLHYVAS 431
           WE+AT+RF+Q +EL         DV   ++   S      + N   +++    + HY  +
Sbjct: 361 WEAATQRFMQYSELDRVLDPEKDDVKLSKTNGRSITKAVSTPNMSEMVDGGLAFAHYCLT 420

Query: 432 GFETSRRALGAIPGSLQPDEQLCKELGLV 460
           G E  R   GAIPG+   D+Q CK+L L+
Sbjct: 421 GNELLRLCTGAIPGTRDYDKQHCKDLHLL 449


>gi|359490799|ref|XP_003634170.1| PREDICTED: LOW QUALITY PROTEIN: digalactosyldiacylglycerol synthase
           2, chloroplastic-like [Vitis vinifera]
          Length = 485

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 234/365 (64%), Positives = 285/365 (78%), Gaps = 1/365 (0%)

Query: 95  KFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYL 154
           +F+ DK+SI+ VGDIT+IIPDEEADI V EEPEHLT +HH KR K KFR V GIVHTNYL
Sbjct: 112 QFSRDKRSIVVVGDITKIIPDEEADITVXEEPEHLTXYHHRKRXKTKFRLVQGIVHTNYL 171

Query: 155 EYVKREKNDRLQAFLLEFVNSWLARVHCHK-VIRLSAATQEYPNSIVCNVHGVNPKFLEI 213
           E ++RE N  LQ FL++++N+ +  ++CH+ +IRLSAATQ+ P SI+CNV GV+PKFLEI
Sbjct: 172 EXIRRENNGWLQTFLIKYINNXVVDIYCHEALIRLSAATQDLPRSIICNVDGVHPKFLEI 231

Query: 214 GEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFD 273
           G++K E QQN + AFTKG YYI +MVW KGY+ELL LL+ + KEL GLE+DLYGNGED D
Sbjct: 232 GKRKKEHQQNDDDAFTKGVYYIRKMVWGKGYKELLKLLDDHQKELTGLEVDLYGNGEDSD 291

Query: 274 QIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH 333
           Q+Q  AKKL+  VRV+P RDHADP+FHDYKVFLNP+TTDV+C  TAEAL MGKIVVCANH
Sbjct: 292 QVQEVAKKLEPDVRVHPRRDHADPLFHDYKVFLNPNTTDVLCATTAEALVMGKIVVCANH 351

Query: 334 PSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELV 393
           PSNDFFKQF NC TY   NGFV+ TLKAL+EEP Q TDAQ H+LSW++A E+FLQ A L 
Sbjct: 352 PSNDFFKQFTNCWTYQDNNGFVKETLKALSEEPGQLTDAQMHELSWDAAIEKFLQAAGLD 411

Query: 394 GDVVTKRSKSPSRHLESESLNSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQL 453
             V  K +K+P +   S  +N  + ++D   Y+HYVASG E  RR   AIPGSLQPDE+ 
Sbjct: 412 QVVERKPTKTPPKKFMSMRMNLWKNMDDESAYVHYVASGIEAWRRVFSAIPGSLQPDEEQ 471

Query: 454 CKELG 458
            +ELG
Sbjct: 472 RQELG 476


>gi|3047114|gb|AAC13625.1| F6N23.24 gene product [Arabidopsis thaliana]
 gi|7267394|emb|CAB80864.1| predicted protein of unknown function [Arabidopsis thaliana]
          Length = 389

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 250/458 (54%), Positives = 305/458 (66%), Gaps = 84/458 (18%)

Query: 2   DRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNI 61
           ++++QHIAIFTTAS+PWLTGTA  PL           R V+++  + S            
Sbjct: 3   NQQEQHIAIFTTASIPWLTGTA--PL-----------RYVSILERYDSQ----------- 38

Query: 62  TFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
                           E   G+  T    F+  +FA DK+SIL VGDI++ IPDEEADIA
Sbjct: 39  ----------------ESSFGY-QTHTASFH--RFAIDKRSILPVGDISDAIPDEEADIA 79

Query: 122 VLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVH 181
           VLEEPEHLTWFHHG++WK KF +V+GIVHTNYLEYVKREK  R++AF L+++NSW+  ++
Sbjct: 80  VLEEPEHLTWFHHGQKWKTKFNYVIGIVHTNYLEYVKREKQGRVKAFFLKYLNSWVVGIY 139

Query: 182 CHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWS 241
           CHKVIRLSAATQEYP SIVCNVHGVNPKFLEIG +K+EQQ+   + FTKGAYYIG+MVWS
Sbjct: 140 CHKVIRLSAATQEYPKSIVCNVHGVNPKFLEIGLRKLEQQKLQEQPFTKGAYYIGKMVWS 199

Query: 242 KGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHD 301
           KGY+ELL LL  + KELA                                         D
Sbjct: 200 KGYKELLKLLEKHQKELA-----------------------------------------D 218

Query: 302 YKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKA 361
           YKVFLNPSTTDVVCT TAEALAMGKIVVCANH SN FFKQFPNCRTYD   GFV ATLKA
Sbjct: 219 YKVFLNPSTTDVVCTTTAEALAMGKIVVCANHISNKFFKQFPNCRTYDDGQGFVRATLKA 278

Query: 362 LAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIED 421
           L E+P+Q T+ Q H+LSWE+AT+RF++V++L        + S      S S++  + +ED
Sbjct: 279 LGEQPSQLTEQQRHELSWEAATQRFIKVSDLNRLSRADSNLSKRSVFASSSISVGKNLED 338

Query: 422 AFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
              Y+H++ASGFE SR A GAIPGSLQPDE+LC++LGL
Sbjct: 339 MSAYIHFLASGFEASRTAFGAIPGSLQPDEELCRDLGL 376


>gi|225444561|ref|XP_002277070.1| PREDICTED: digalactosyldiacylglycerol synthase 2, chloroplastic
           [Vitis vinifera]
 gi|297738487|emb|CBI27732.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 222/301 (73%), Positives = 255/301 (84%), Gaps = 1/301 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD KQ HIAIFTTASLPW+TGTAVNPLFR AYL K  E +VTLVIPWLS   Q+ VYP  
Sbjct: 1   MDSKQ-HIAIFTTASLPWMTGTAVNPLFRVAYLTKGREFKVTLVIPWLSPKDQELVYPNK 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           I   SP EQEAYVR WL +RTGF   F  +FYPGKF+ DK+SIL VGDITEIIPDEEADI
Sbjct: 60  IILKSPSEQEAYVRQWLGERTGFVCDFSIKFYPGKFSRDKRSILVVGDITEIIPDEEADI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHG RWK KFR VVGIVHTNYLEYV+REKN RLQAFLL+++N+W+  +
Sbjct: 120 AVLEEPEHLTWYHHGNRWKTKFRLVVGIVHTNYLEYVRREKNGRLQAFLLKYINNWVIDI 179

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVW 240
           +CHKVIRLSAA Q+ P SI+CNVHGVNPKFLEIG++K E QQNG++AF+KGAYYIG+M W
Sbjct: 180 YCHKVIRLSAAIQDLPRSIICNVHGVNPKFLEIGKRKNEHQQNGDQAFSKGAYYIGKMAW 239

Query: 241 SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
           SKGY+ELL LL+ + KEL GLE+DLYGNGED DQ+Q AAKKL+L VRV+PG DHADP+FH
Sbjct: 240 SKGYKELLKLLHDHQKELTGLEVDLYGNGEDSDQVQEAAKKLELDVRVHPGHDHADPLFH 299

Query: 301 D 301
           +
Sbjct: 300 E 300


>gi|222615552|gb|EEE51684.1| hypothetical protein OsJ_33042 [Oryza sativa Japonica Group]
          Length = 705

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 226/468 (48%), Positives = 310/468 (66%), Gaps = 42/468 (8%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP ++TF+
Sbjct: 257 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSSKQDVTLVVPWLCKSDQELVYPNSMTFS 316

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQEAY+R WLE+R GF + F   FYPGKF  +++SI+  GD ++ IP +EADIA+LE
Sbjct: 317 SPQEQEAYMRSWLEERVGFKTDFKISFYPGKFQKERRSIIPAGDTSQFIPSKEADIAILE 376

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVG+VHTNYLEY+KREKN  +QAF ++ +N+ +AR +CHK
Sbjct: 377 EPEHLNWYHHGKRWTDKFNHVVGVVHTNYLEYIKREKNGVIQAFFVKHINNLVARAYCHK 436

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S++CNVHGVNPKFLE+GE+   ++++G  +F+KGAY++G+MVW+KGY
Sbjct: 437 VLRLSGATQDLPKSMICNVHGVNPKFLEVGERIAAERESGQHSFSKGAYFLGKMVWAKGY 496

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
            EL+ L   +  +L G+++D+YGNGED  ++Q AA KL L +  + GRDHAD        
Sbjct: 497 RELIDLYAKHKSDLEGIKLDIYGNGEDSHEVQSAAMKLNLNLNFHKGRDHADD------- 549

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
                                   +  +HPSNDFF+ FPNC TY     FV    +A+A 
Sbjct: 550 -----------------------SLTWDHPSNDFFRSFPNCLTYKTSEDFVAKVKEAMAR 586

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVG-------DVVTKRS---KSPSRHLE-SESL 413
           +P   T  Q + LSWE+AT+RF++ +EL         D  T  S   KS    +E S SL
Sbjct: 587 DPQPLTPEQRYNLSWEAATQRFMEHSELDKVLSSSNRDCTTSTSGCGKSGDNKMEKSASL 646

Query: 414 -NSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            N   +++    + HY  +G E  R + GAIPG+L  ++Q   +L L+
Sbjct: 647 PNMSDMVDGGLAFAHYCFTGNELLRLSTGAIPGTLNYNKQHSLDLHLL 694


>gi|159470905|ref|XP_001693597.1| galactolipid galactosyltransferase [Chlamydomonas reinhardtii]
 gi|158283100|gb|EDP08851.1| galactolipid galactosyltransferase [Chlamydomonas reinhardtii]
          Length = 934

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 213/440 (48%), Positives = 295/440 (67%), Gaps = 19/440 (4%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDG-ERRVTLVIPWLSLIHQKQVYPGNITFA 64
           + +AI TTASLPWLTGTAVNPL RAAYLA  G +R+VTLV+PWLS   Q++V+P +++F 
Sbjct: 404 RQVAIVTTASLPWLTGTAVNPLLRAAYLASSGGDRKVTLVLPWLSQADQQRVFPADVSFN 463

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           +P+EQE +VR W  +RTG    F   FYPG++AA+K SIL VGDIT +IPD EAD+AVLE
Sbjct: 464 TPEEQEEFVRQWARNRTGLPCNFKVAFYPGRYAAEKGSILPVGDITTVIPDHEADVAVLE 523

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG+RW  KF  VVG++HTNYL+Y +RE+   ++  +L+ +N+W+ R++CHK
Sbjct: 524 EPEHLNWYHHGRRWTDKFAHVVGVMHTNYLDYARREEGGHVKEAILKHINAWMCRIYCHK 583

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           VI+LS A Q  P      VHGV+P FL++G+ K +   +G+K F+K AY++G+++W+KGY
Sbjct: 584 VIKLSDAVQPLPRQETMFVHGVSPSFLQVGQTKAQLAASGDKPFSKDAYFLGKVLWAKGY 643

Query: 245 EELLGLLNIYHKELAG--LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDY 302
            ELL  L   H +  G  + +D+YG+G D   ++  A +  L +     RDHAD    DY
Sbjct: 644 TELLDRLK-EHTQRTGQSIAVDVYGSGPDLKAVEEEASRRNLRLAFRGARDHADKSLQDY 702

Query: 303 KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKAL 362
           KVF+NPS +DVV T TAEALAMGK V+CA+HPSN FF+QFPNC  Y   + F +   +AL
Sbjct: 703 KVFINPSLSDVVATTTAEALAMGKFVLCADHPSNRFFEQFPNCLIYRTPDEFSQQLHRAL 762

Query: 363 AEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDA 422
           + +PA  +  Q H L+WE+ATERFL +AEL    ++    +               +++ 
Sbjct: 763 SSDPAPLSSQQLHSLTWEAATERFLDIAELRPGSISPLDTA---------------LDNV 807

Query: 423 FGYLHYVASGFETSRRALGA 442
               H+V +G E  R A GA
Sbjct: 808 LAAAHHVLTGVEGLRVAAGA 827


>gi|303271245|ref|XP_003054984.1| glycosyltransferase family 4 protein [Micromonas pusilla CCMP1545]
 gi|226462958|gb|EEH60236.1| glycosyltransferase family 4 protein [Micromonas pusilla CCMP1545]
          Length = 502

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 210/400 (52%), Positives = 275/400 (68%), Gaps = 15/400 (3%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGER--RVTLVIPWLSLIHQKQVYPGNITF 63
           +H  I TTASLPW+TGT+VNPL RAAYLA  GE   RVTLV+PWL L  QK V+P N  F
Sbjct: 30  RHFCIVTTASLPWMTGTSVNPLLRAAYLAHRGETSCRVTLVVPWLPLCDQKLVHP-NAIF 88

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
            +P++Q+ +VR WL  R  F  +FD RFYPG++A DK SI+ VGD+TE +PDE+AD+A+L
Sbjct: 89  ENPEQQKEHVRGWLSGRVDFDPSFDIRFYPGRYAIDKGSIVPVGDVTECVPDEDADVAIL 148

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTWFHHG RW  KFR VVGI+HTNYLEY +REK+   +  LL  VN  + R HCH
Sbjct: 149 EEPEHLTWFHHGMRWSDKFRHVVGIIHTNYLEYARREKDGERKEALLRGVNRTVTRAHCH 208

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEK---------KMEQQQNGNKAFTKGAYY 234
           KVI+LS A Q++  S+  NVHGV+P F+E+G K         +  ++ + N +FTKG Y+
Sbjct: 209 KVIKLSDAVQDFARSVTVNVHGVSPHFIEVGRKIALAAEERSRRGEESSSNSSFTKGGYF 268

Query: 235 IGRMVWSKGYEELLGLLNIYHKELA---GLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG 291
           IG++VW+KGY ELL  +  Y++  A    L MD++GNG+DF +++ A+++ +L +  +  
Sbjct: 269 IGKVVWAKGYLELLDRVKEYNETAAQKDKLVMDVFGNGDDFQEVKAASERERLALTFHGQ 328

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGR 351
            DHA      YK F+NPS +DVV T TAEALAMGK VVCA HPSN+FF  F NCRTY   
Sbjct: 329 ADHASETTVGYKFFINPSLSDVVATTTAEALAMGKFVVCARHPSNEFFSTFANCRTYANS 388

Query: 352 NGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAE 391
           + F E   + L  EP   +    H+L+W++ATERFL  AE
Sbjct: 389 DEFAECVREVLHGEPEPISPDDLHRLTWQAATERFLDAAE 428


>gi|308809281|ref|XP_003081950.1| digalactosyldiacylglycerol synthase (ISS) [Ostreococcus tauri]
 gi|116060417|emb|CAL55753.1| digalactosyldiacylglycerol synthase (ISS) [Ostreococcus tauri]
          Length = 869

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 224/478 (46%), Positives = 308/478 (64%), Gaps = 45/478 (9%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFAS 65
           + +AI TTASLPW+TGTAVNPL RAAYLA+ G   VTLVIP+L+   QK V+P N+ F +
Sbjct: 386 RSVAIVTTASLPWMTGTAVNPLLRAAYLARRGTHDVTLVIPFLAPNEQKLVHP-NMIFNT 444

Query: 66  PKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIP-DEEADIAVLE 124
           P+EQ AYV  W+E+R GF       FYPG++A DK SI+ VGD+T  IP D ++D+AVLE
Sbjct: 445 PEEQGAYVNKWVEERCGFKPQMKLSFYPGRYATDKYSIIPVGDLTSYIPSDRDSDVAVLE 504

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+H G+RW  KF+ VVGIVHTNYL+YV+ E+N  ++   L+FVN+ ++ VHCHK
Sbjct: 505 EPEHLNWYHAGQRWTDKFKHVVGIVHTNYLDYVRLEENGPIKEKALKFVNNVVSAVHCHK 564

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGN------------------- 225
           VI+LS A QE+P S   NVHGV+P FL++G +K  +  + N                   
Sbjct: 565 VIKLSDAVQEFPKSTTMNVHGVSPIFLDVGAQKALEATHANVDVVKGPLASVGRSATKKL 624

Query: 226 -----KAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAG----LEMDLYGNGEDFDQIQ 276
                  FTKGAY++G++VW KGY+ELL  ++ ++    G    LE+D++GNG+DF +++
Sbjct: 625 GKSNKPVFTKGAYFLGKVVWGKGYKELLDRVSEHNGSENGRDCPLELDVFGNGDDFTEVK 684

Query: 277 RAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
             A++  + +R +  +DHA+   HDYKVF+NPS +DVV T TAEALAMGK VVCA HPSN
Sbjct: 685 STAEERHIPLRFHGRKDHAEKDIHDYKVFVNPSLSDVVATTTAEALAMGKFVVCAKHPSN 744

Query: 337 DFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDV 396
           +FF  FPNC  YD  + F +   KAL  EP   +   +++LSWE+AT+RFL  AEL    
Sbjct: 745 EFFSTFPNCLVYDNPDEFSKCVKKALTSEPTPLSAQDSYRLSWEAATDRFLDAAEL---- 800

Query: 397 VTKRSKSPSRHLESESLNSKRIIEDAFGY-LHYVASGFETSRRALGAIPGSLQPDEQL 453
            + R  +P       +L  K   ++ F + +H   +  E  RRA GA   +L+  E+L
Sbjct: 801 -SPREINP-------TLGDK--AKEKFAHAMHTTLTSVEPIRRATGAGANTLKAPEKL 848


>gi|108707383|gb|ABF95178.1| digalactosyldiacylglycerol synthase 2, putative, expressed [Oryza
           sativa Japonica Group]
          Length = 300

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 199/298 (66%), Positives = 243/298 (81%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           ++QHIAIFTTASLPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q  VYP  + F
Sbjct: 3   RKQHIAIFTTASLPWMTGTAVNPLFRAAYLAKAGDWEVTLVVPWLSKGDQLLVYPNKMKF 62

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
           + P EQE YVR WLE+R G    F+ +FYPGKF+ +K+SIL  GDIT+ + D++ADIAVL
Sbjct: 63  SVPGEQEGYVRRWLEERIGLLPKFEIKFYPGKFSTEKRSILPAGDITQTVSDDKADIAVL 122

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTW+HHG+RWK KFR V+G+VHTNYLEYVKRE+N  + AFLL+ +NSW+  ++CH
Sbjct: 123 EEPEHLTWYHHGRRWKNKFRKVIGVVHTNYLEYVKRERNGYIHAFLLKHINSWVTDIYCH 182

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
           KVIRLSAATQE P SIVCNVHGVNPKF+EIG+ K +Q     +AF KGAYYIG+MVWSKG
Sbjct: 183 KVIRLSAATQEVPRSIVCNVHGVNPKFIEIGKLKHQQISQREQAFFKGAYYIGKMVWSKG 242

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHD 301
           Y ELL LL  + KEL+GL+M+LYG+GED D+++ +A+KL L VRVYPGRDH D IFH+
Sbjct: 243 YTELLQLLQKHQKELSGLKMELYGSGEDSDEVKASAEKLNLDVRVYPGRDHGDSIFHE 300


>gi|303284501|ref|XP_003061541.1| glycosyltransferase family 4 protein [Micromonas pusilla CCMP1545]
 gi|226456871|gb|EEH54171.1| glycosyltransferase family 4 protein [Micromonas pusilla CCMP1545]
          Length = 529

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 223/467 (47%), Positives = 293/467 (62%), Gaps = 32/467 (6%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPK 67
           IAI TTA+LPW+TGTAVNPL RAAYLA+ G   VTLVIPWL+   QK ++P +I F SP+
Sbjct: 48  IAIVTTAALPWMTGTAVNPLLRAAYLARRGLHDVTLVIPWLAPSEQKIIHP-SIVFDSPE 106

Query: 68  EQEAYVRWWLEDRTGF-TSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEP 126
           EQ AYVR W+++R GF  S     FYPG++  DK SIL VGD++E I D+  D+AVLEEP
Sbjct: 107 EQSAYVRKWVKERCGFEPSNLKMDFYPGRYHTDKYSILPVGDVSEYITDKSHDVAVLEEP 166

Query: 127 EHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVI 186
           EHL W++ G+RW  KF+ VVGIVHTNYLEY K E +  ++   L  VNSW++R+HCHK+I
Sbjct: 167 EHLNWYNSGERWSDKFQHVVGIVHTNYLEYAKMEAHGNVKEKALRLVNSWVSRLHCHKII 226

Query: 187 RLSAATQEYPNSIVCNVHGVNPKFLEIGEKK------MEQQQN----------GNKAFTK 230
           +LS A QE+P S   NVHGV+  FLE+G++K          QN          G   FTK
Sbjct: 227 KLSDAVQEFPRSETVNVHGVSEVFLEVGKRKATAAAAAMAAQNDPDSAAATSAGRAVFTK 286

Query: 231 GAYYIGRMVWSKGYEELLGLLNIYHKELAG----LEMDLYGNGEDFDQIQRAAKKLKLVV 286
           G Y++G++VW KG+ ELL  +  ++    G    LE+D+YGNGEDF  + + + +  L +
Sbjct: 287 GCYFLGKVVWGKGFHELLERVEAHNTSADGAAYPLELDVYGNGEDFHSVTQTSAEKNLPL 346

Query: 287 RVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCR 346
             +   DHA    HDYKVF+NPS +DVV T TAEALAMGK V+CA+HPSN+FF  FPNC 
Sbjct: 347 TFHGRADHASDAMHDYKVFVNPSLSDVVATTTAEALAMGKYVICASHPSNEFFSSFPNCL 406

Query: 347 TYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSR 406
           TYD    F +   KAL+ +P   +    ++LSWE+AT+RFL  AEL  + V+    S   
Sbjct: 407 TYDSPEEFSKCVKKALSTDPTPLSSRDRYRLSWEAATDRFLDAAELGEEQVSGPGTS--- 463

Query: 407 HLESESLNSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQL 453
                   + +  E     LH      E  RRA GA   ++ P E+L
Sbjct: 464 -------RTGKAGETLVHALHSNMLRREKFRRAAGAGANTMTPPERL 503


>gi|307106518|gb|EFN54763.1| hypothetical protein CHLNCDRAFT_31413 [Chlorella variabilis]
          Length = 467

 Score =  424 bits (1089), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 196/376 (52%), Positives = 257/376 (68%)

Query: 19  LTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLE 78
           +TGTAVNPL RAAYLA+D  R+VTL+IPWL+   Q +V+P N TF +P++QE YVR W++
Sbjct: 1   MTGTAVNPLLRAAYLARDRGRKVTLMIPWLAKPDQSKVFPNNTTFETPEQQEEYVRDWVK 60

Query: 79  DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGKRW 138
            RTGF S F   FYPG++A +K SIL VGD T+ +PD EAD+A+LEEPEHL W+HHG+RW
Sbjct: 61  KRTGFDSDFKVTFYPGRYAPEKCSILPVGDPTQYVPDHEADVAILEEPEHLNWYHHGRRW 120

Query: 139 KAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNS 198
             KF  VVG+VHTNYL+Y +RE+    + FLL+ +N+W+ R+HCHKV++LS A Q  P  
Sbjct: 121 TDKFNHVVGVVHTNYLDYARREEGGDTKEFLLKHINNWVCRIHCHKVVKLSDAVQPLPKQ 180

Query: 199 IVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKEL 258
               VHGV   FL++G+KK E    G K F +GAY+IG++VW+KGY ELL L+  + +  
Sbjct: 181 TTEFVHGVAENFLDVGKKKSEPAPEGGKRFARGAYFIGKVVWAKGYTELLDLMTKHCRAH 240

Query: 259 AGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTAT 318
             + MD YG GED + ++  A    L +R +  +DH D   H+Y+VF+NPST+DVV T T
Sbjct: 241 GDVAMDCYGTGEDLEAVRTEAATRHLSLRFHGAKDHLDTSMHEYQVFINPSTSDVVATTT 300

Query: 319 AEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLS 378
           AEALAMGK V+CA+HPSN FF QF NC  +     F +    ALA EP          L+
Sbjct: 301 AEALAMGKWVICADHPSNRFFSQFKNCLIFKTPEEFSQHVEHALAHEPHPMGPEDRQNLT 360

Query: 379 WESATERFLQVAELVG 394
           WE+ATERFL V EL  
Sbjct: 361 WEAATERFLDVTELTA 376


>gi|384244772|gb|EIE18270.1| hypothetical protein COCSUDRAFT_49349 [Coccomyxa subellipsoidea
           C-169]
          Length = 814

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 211/391 (53%), Positives = 268/391 (68%), Gaps = 8/391 (2%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFAS 65
           +HIAI+TTASLPW+TGTAVNPL RAAYLAK+  R VTLVIPWL+   Q  V+  N++F +
Sbjct: 354 RHIAIYTTASLPWMTGTAVNPLLRAAYLAKEKHREVTLVIPWLAPPDQALVF-NNLSFET 412

Query: 66  PKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEE 125
           P++QE YVR W   RTG    F   FYPG++AA+K SIL VGD T  IPD EAD+AVLEE
Sbjct: 413 PEQQEDYVRAWARKRTGLPCDFKVAFYPGRYAAEKCSILPVGDPTSYIPDNEADVAVLEE 472

Query: 126 PEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKV 185
           PEHL W+HHG+RW  KF+ VVGI+HTNYL+Y +RE+   L+A  L  VN  + R+HCHK+
Sbjct: 473 PEHLNWYHHGRRWTDKFQHVVGIIHTNYLDYAQREEGGALKAKALAAVNQIVCRMHCHKI 532

Query: 186 IRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYE 245
           ++LS A QE P S+   VHGV   FL +GE K +    G   FTKGAY+IG+ +W+KGY 
Sbjct: 533 VKLSDAVQEMPRSVTQFVHGVPGSFLAVGEAKAKPAPEGAPRFTKGAYFIGKAIWAKGYT 592

Query: 246 ELLGLLNIYHKELAGLEM----DLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHD 301
           ELL L+    ++ A  +M    D YG G+D ++++ A+ + KL ++ + GRDH D   HD
Sbjct: 593 ELLDLM---ERDSASRDMHTHVDCYGYGDDLEELKAASARKKLPLQFHGGRDHLDESMHD 649

Query: 302 YKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKA 361
           Y+VF+NPST+DVV T TAEALAMGK VV A  P N FFK+F NC TY     F E    A
Sbjct: 650 YRVFVNPSTSDVVATTTAEALAMGKWVVVAELPCNAFFKRFSNCLTYSTPEEFSERLRTA 709

Query: 362 LAEEPAQPTDAQTHQLSWESATERFLQVAEL 392
           L EEP   +  +  +L+WE ATERFL  AEL
Sbjct: 710 LLEEPHPMSAEERRRLTWEDATERFLDAAEL 740


>gi|302834335|ref|XP_002948730.1| hypothetical protein VOLCADRAFT_116955 [Volvox carteri f.
           nagariensis]
 gi|300265921|gb|EFJ50110.1| hypothetical protein VOLCADRAFT_116955 [Volvox carteri f.
           nagariensis]
          Length = 767

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 199/399 (49%), Positives = 267/399 (66%), Gaps = 23/399 (5%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFAS 65
           + +AI TTASLPWLTGTAVNPL RAAYLA  G+R+            Q++V+P ++TF +
Sbjct: 246 RQVAIVTTASLPWLTGTAVNPLLRAAYLASSGDRKA----------DQQRVFPADVTFET 295

Query: 66  PKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEE 125
           P+EQE +VR W  +RTG    F   FYPG++AA+K SIL VGDIT +IPD EAD+AVLEE
Sbjct: 296 PEEQEEFVRQWARNRTGLECNFKVAFYPGRYAAEKGSILPVGDITMVIPDHEADVAVLEE 355

Query: 126 PEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKV 185
           PEHL W+HHG+RW  KF  VVG++HTNYL+Y +RE+  +L+  LL+ +N+W+ R++CHKV
Sbjct: 356 PEHLNWYHHGRRWTDKFAHVVGVMHTNYLDYARREEGGQLKEALLKHINAWMCRIYCHKV 415

Query: 186 IRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYE 245
           I+LS A Q  P      VHGV+P FL++G+ K +   +G + ++K  Y++G+++W+KGY 
Sbjct: 416 IKLSDAVQPLPRQETMFVHGVSPSFLKVGQSKAQLAASGERPWSKDVYFLGKVLWAKGYT 475

Query: 246 ELLGLLNIYHKELAG--LEMDLYGNGEDFD----------QIQRAAKKLKLVVRVYPGRD 293
           ELL  L   H +  G  + +D+YG+G D             ++  A +  L +R    RD
Sbjct: 476 ELLDRLK-EHTQRTGERVPVDVYGSGPDLQARLGPPLASLAVEHEASRRNLALRFRGARD 534

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNG 353
           HAD    DYKVF+NPS +DVV T TAEALAMGK VVCA HPSN FF+QFPNC  Y   + 
Sbjct: 535 HADATLQDYKVFINPSLSDVVATTTAEALAMGKFVVCAEHPSNKFFEQFPNCLIYRSPDE 594

Query: 354 FVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAEL 392
           F +   +AL  EP   +  Q H L+WE+ATERFL +AEL
Sbjct: 595 FSQQLHRALTTEPQPLSPQQLHSLTWEAATERFLDIAEL 633


>gi|255086497|ref|XP_002509215.1| glycosyltransferase family 4 protein [Micromonas sp. RCC299]
 gi|226524493|gb|ACO70473.1| glycosyltransferase family 4 protein [Micromonas sp. RCC299]
          Length = 494

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 209/444 (47%), Positives = 289/444 (65%), Gaps = 16/444 (3%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPK 67
           +AI TTASLPW+TGTAVNPL RAAYLA+ G   VTLV+PWL+   Q+ V+P N+ F +P+
Sbjct: 31  VAIVTTASLPWMTGTAVNPLLRAAYLARRGLHEVTLVVPWLTPAEQRMVHP-NVIFDTPE 89

Query: 68  EQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPE 127
           EQ  Y+  W+++R GF       FYPG++A DK SI+ VGD++E I D + D+AVLEEPE
Sbjct: 90  EQGEYINKWVKERCGFEPKMKLDFYPGRYATDKYSIIPVGDVSEYISDGKHDVAVLEEPE 149

Query: 128 HLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIR 187
           HL W+H G RW  KFR VVG+VHTNYLEY + E++  ++   + FVNSW++RVHCHK+I+
Sbjct: 150 HLNWYHTGSRWSDKFRHVVGVVHTNYLEYARLEEHGAVKEAAMRFVNSWVSRVHCHKIIK 209

Query: 188 LSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQ-QNGNKAFTKGAYYIGRMVWSKGYEE 246
           LS A Q++P S   N+HGV+P FLE       ++ +   + F+KG Y++G++VW KG+ E
Sbjct: 210 LSDAVQDFPRSETVNIHGVSPVFLEPPPAPEPEKIRPSTEVFSKGCYFLGKVVWGKGFNE 269

Query: 247 LLGLLNIYHKELAG----LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDY 302
           LL  +  ++    G    L++D++GNGEDFD +   AK+  L ++     DHA    HDY
Sbjct: 270 LLRRVEEHNTSETGVTHPLKLDVFGNGEDFDDVTARAKQKGLPLKFKGRMDHASDAMHDY 329

Query: 303 KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKAL 362
           KVF+NPS +DVV T TAEALAMGK V+CA HPSN+FF  FPNC  Y+    F +   KAL
Sbjct: 330 KVFINPSLSDVVATTTAEALAMGKYVICAKHPSNEFFSTFPNCMVYETPEQFSQCVKKAL 389

Query: 363 AEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDA 422
           + +PA  +    ++LSWE+AT+RFL  A    D+  ++ + P   L        ++ E  
Sbjct: 390 STDPAPLSAKDRYRLSWEAATDRFLDAA----DIKEEQMRGPGTGL------GDKLGETF 439

Query: 423 FGYLHYVASGFETSRRALGAIPGS 446
           F  +H VA+  E  R  LGA  G+
Sbjct: 440 FAAVHGVAAKHEKMRGVLGAGRGT 463


>gi|145352250|ref|XP_001420465.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580699|gb|ABO98758.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 455

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 220/450 (48%), Positives = 298/450 (66%), Gaps = 30/450 (6%)

Query: 19  LTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLE 78
           +TGTAVNPL RAAYLA+ G   VTLVIP+L+   QK V+PG I F +P+EQ  YV  W+E
Sbjct: 1   MTGTAVNPLLRAAYLARRGIHDVTLVIPFLAPNEQKLVHPGMI-FNTPEEQGEYVNKWVE 59

Query: 79  DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIP-DEEADIAVLEEPEHLTWFHHGKR 137
           +R GF       FYPG++A DK SI+ VGD+T  IP D E+DIAVLEEPEHL W+H G+R
Sbjct: 60  ERCGFKPQMKLSFYPGRYATDKYSIIPVGDLTSYIPGDRESDIAVLEEPEHLNWYHSGER 119

Query: 138 WKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPN 197
           W  KF+ VVGIVHTNYL+YV+ E+N  ++   L+FVN+ ++ VHCHKVI+LS A QE+P 
Sbjct: 120 WTDKFKHVVGIVHTNYLDYVRLEENGEIKEKALKFVNNVVSSVHCHKVIKLSDAVQEFPK 179

Query: 198 SIVCNVHGVNPKFLEIG---------EKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELL 248
           S   NVHGV+P FL++G         EK +E +++    FTKGAY++G++VW KGY+ELL
Sbjct: 180 STTMNVHGVSPIFLDVGAKKAIEATKEKDVEMKRSKKPVFTKGAYFLGKVVWGKGYKELL 239

Query: 249 GLLNIYHKELAG----LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV 304
             ++ ++    G    LE+D++GNG+DF +++  A+K+KL +  +  +DHA    HDYKV
Sbjct: 240 DRVSEHNVSEDGRECPLELDVFGNGDDFAEVKSNAEKMKLPLHFHGRKDHAASDIHDYKV 299

Query: 305 FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAE 364
           F+NPS +DVV T TAEALAMGK VVCA HPSN+FF  FPNC  Y+    F +   +AL  
Sbjct: 300 FVNPSLSDVVATTTAEALAMGKFVVCAKHPSNEFFSTFPNCLVYNTPQEFTKCVKQALTS 359

Query: 365 EPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAFG 424
           EPA  +   +++LSWE+AT+RFL  AEL     + R  +P       SL  K   ++ F 
Sbjct: 360 EPAPLSAQDSYRLSWEAATDRFLDAAEL-----SPRDVNP-------SLGDK--AKETFA 405

Query: 425 Y-LHYVASGFETSRRALGAIPGSLQPDEQL 453
           + +H   +  E  RRA GA   +L+  E+L
Sbjct: 406 HAMHTTMNKIEPIRRATGAGNNTLKAPEKL 435


>gi|255080068|ref|XP_002503614.1| glycosyltransferase family 4 protein [Micromonas sp. RCC299]
 gi|226518881|gb|ACO64872.1| glycosyltransferase family 4 protein [Micromonas sp. RCC299]
          Length = 491

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 204/400 (51%), Positives = 271/400 (67%), Gaps = 14/400 (3%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDG-ERRVTLVIPWLSLIHQKQVYPGNITF 63
            +H  I TTASLPW+TGT+VNPL RAAY+A  G E  VTL++PWL+   QK V+P N  F
Sbjct: 25  DRHFTIVTTASLPWMTGTSVNPLLRAAYMANRGDECGVTLLVPWLAPCDQKLVHP-NAMF 83

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVL 123
            +P+EQ+ Y+R WL  R  F   F+  FYPG++A DK SI+ VGDIT+ +PD EAD+AVL
Sbjct: 84  QTPEEQQQYIRSWLAGRVDFDPKFEIHFYPGRYAIDKGSIVPVGDITDYVPDNEADVAVL 143

Query: 124 EEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCH 183
           EEPEHLTWFHHGKRW  KF+ VVGI+HTNYLEY +REK+   +  LL  VN+++AR HCH
Sbjct: 144 EEPEHLTWFHHGKRWTHKFQHVVGIIHTNYLEYARREKDGDKKEVLLRGVNAFVARAHCH 203

Query: 184 KVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKA-------FTKG--AYY 234
           K+I+LS A Q++  S+  NVHGV+P F+E+G K     ++  K+       F KG   Y+
Sbjct: 204 KIIKLSDAVQDFARSVTVNVHGVSPHFIEVGRKIAAAAKDNTKSEEEVGSTFGKGKVGYF 263

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAG---LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG 291
           IG++VW+KGY ELL  +  Y+   A    L MD++G+G+DF  ++ +A + +L +  +  
Sbjct: 264 IGKVVWAKGYLELLERVKEYNATAASKDKLIMDVFGDGDDFKAVKDSAARQQLALTFHGR 323

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGR 351
            DHA  +   YK F+NPS +DVV T TAEALAMGK VVCA HPSN+FF  F NCRTY   
Sbjct: 324 ADHAGDVIRGYKFFINPSLSDVVATTTAEALAMGKFVVCARHPSNEFFSTFKNCRTYSNP 383

Query: 352 NGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAE 391
           + F E   + L  EP   + +  H+L+W++ATERFL  AE
Sbjct: 384 DEFAECVREVLHGEPEPISPSDLHRLTWQAATERFLDAAE 423


>gi|308806183|ref|XP_003080403.1| unnamed protein product [Ostreococcus tauri]
 gi|116058863|emb|CAL54570.1| unnamed protein product [Ostreococcus tauri]
          Length = 500

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 207/458 (45%), Positives = 281/458 (61%), Gaps = 23/458 (5%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAK-DGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           +   + TTA+LPW+TGT+VNPL RA YLA+ D  R VTL++PWL+   Q  VYP   TF 
Sbjct: 42  RKFVVVTTAALPWMTGTSVNPLLRAVYLARGDDTREVTLMVPWLAKADQAIVYPRETTFE 101

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           +P EQE Y+R W  +R GF +     FYPG++A DK SI+ VGDI   +P  + D+A+LE
Sbjct: 102 TPSEQETYIRRWANERVGFEARIKITFYPGRYATDKGSIVPVGDIARRVPKGDRDVAILE 161

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL WFH G RW ++F+ VVGI+HTNYLEY +RE+N   +  +L ++N    R H HK
Sbjct: 162 EPEHLCWFHPGARWTSRFKHVVGIIHTNYLEYARREENGAQKEQVLRWINHLTTRCHTHK 221

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQ-QQNGNKAFTKGAYYIGRMVWSKG 243
           VI+LS A Q+Y  SI  NVHGV+  F+E G  K +  ++ G+ AF +GAY+IG+ +W+KG
Sbjct: 222 VIKLSDAVQDYARSITQNVHGVSNGFIEGGRAKAKAIKKEGSAAFRRGAYFIGKCIWAKG 281

Query: 244 YEELLGLLNIYHKELA-----GLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPI 298
           Y ELL ++  ++++ A      LEMD+YG+G+DF  ++ A  +  L +R+    DHADP 
Sbjct: 282 YSELLHVVGDFNEKYAKGKKGALEMDVYGDGDDFAVVKSAIAEQNLPLRLLGRLDHADPK 341

Query: 299 FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEAT 358
             DYKVF+NPS +DVV T +AEALAMGK VVCA H SN FF  F NCRTY   + F +  
Sbjct: 342 ILDYKVFVNPSLSDVVATTSAEALAMGKFVVCAEHASNAFFATFTNCRTYSNMDEFAKCM 401

Query: 359 LKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRI 418
            + +   P   TD + H+L+W++ATER L  A   GD                    K  
Sbjct: 402 REVMTTTPKPMTDEELHRLTWDAATERLLDSAAPCGDT---------------KYTLKSH 446

Query: 419 IEDAF-GYLHYVASGFETSRRALGAIPGSLQPDEQLCK 455
           + D F    HY     E+ R  +G   G+L+P E L K
Sbjct: 447 VADWFTARFHYALVASESLRCLIGGGAGTLEPPEDLSK 484


>gi|30681840|ref|NP_850561.1| digalactosyldiacylglycerol synthase 1 [Arabidopsis thaliana]
 gi|332641561|gb|AEE75082.1| digalactosyldiacylglycerol synthase 1 [Arabidopsis thaliana]
          Length = 639

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 178/296 (60%), Positives = 228/296 (77%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP N+TF+
Sbjct: 343 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKAAKQSVTLVVPWLCESDQELVYPNNLTFS 402

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE+Y+R WLE+R GF + F   FYPGKF+ +++SI   GD ++ I  ++ADIA+LE
Sbjct: 403 SPEEQESYIRKWLEERIGFKADFKISFYPGKFSKERRSIFPAGDTSQFISSKDADIAILE 462

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W++HGKRW  KF  VVGIVHTNYLEY+KREKN  LQAF +  VN+W+ R +C K
Sbjct: 463 EPEHLNWYYHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFFVNHVNNWVTRAYCDK 522

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLSAATQ+ P S+VCNVHGVNPKFL IGEK  E++  G +AF+KGAY++G+MVW+KGY
Sbjct: 523 VLRLSAATQDLPKSVVCNVHGVNPKFLMIGEKIAEERSRGEQAFSKGAYFLGKMVWAKGY 582

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
            EL+ L+  +  EL    +D+YGNGED  ++QRAAKK  L +    GRDHAD   H
Sbjct: 583 RELIDLMAKHKSELGSFNLDVYGNGEDAVEVQRAAKKHDLNLNFLKGRDHADDALH 638


>gi|145349066|ref|XP_001418961.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579191|gb|ABO97254.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 427

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 189/392 (48%), Positives = 262/392 (66%), Gaps = 7/392 (1%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAK-DGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           +   I TTA+LPW+TGT+VNPL RA YLA  D  R VTL++PWL+   Q+ VYP  + F 
Sbjct: 36  RKFVIVTTAALPWMTGTSVNPLLRAVYLANGDTTREVTLLVPWLARKDQRIVYPKRVEFK 95

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           +P EQEAY+  W + R GF       +YPG++A DK SI+ VGDIT  +P    D+A+LE
Sbjct: 96  TPSEQEAYIMDWTKKRVGFAPKILIAWYPGRYATDKGSIVPVGDITLRVPKASRDVAILE 155

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+H G RW ++F+ VVGI+HTNYLEY +RE++   +  +L ++N   AR H HK
Sbjct: 156 EPEHLCWYHPGARWTSRFKHVVGIIHTNYLEYARREEDGERKEQILRWINHLTARCHTHK 215

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQ-QQNGNKAFTKGAYYIGRMVWSKG 243
           VI+LS A QE+  SI  NVHGV+  F++ G +K ++ ++ G+ AF++GAY+IG+ VW+KG
Sbjct: 216 VIKLSDAVQEFARSITQNVHGVSNGFIDAGREKAKRIKKEGSGAFSRGAYFIGKCVWAKG 275

Query: 244 YEELLGLLNIYHKELAG-----LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPI 298
           Y EL+ ++  ++++ A      LEMD+YG+G+DF  ++ A  +  L + +    DHA+  
Sbjct: 276 YSELMHVVGDFNEKYAKSAKERLEMDVYGDGDDFADVKAAVAEKALPLSLLGRLDHANEK 335

Query: 299 FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEAT 358
             DYKVF+NPS +DVV T +AEALAMGK VVCA HPSN FF  FPNCRTY   + F +  
Sbjct: 336 ILDYKVFINPSLSDVVATTSAEALAMGKFVVCAEHPSNAFFATFPNCRTYSNMDEFAKCI 395

Query: 359 LKALAEEPAQPTDAQTHQLSWESATERFLQVA 390
            +     P   TD + H+L+WE+ATER L  A
Sbjct: 396 REVTTSTPKPMTDDEIHRLTWEAATERLLDAA 427


>gi|297789503|ref|XP_002862712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297308393|gb|EFH38970.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 639

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 177/296 (59%), Positives = 226/296 (76%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H+AI TTASLPW+TGTAVNPLFRAAYLAK  ++ VTLV+PWL    Q+ VYP N+TF+
Sbjct: 343 KRHVAIVTTASLPWMTGTAVNPLFRAAYLAKSAKQSVTLVVPWLCESDQELVYPNNLTFS 402

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP+EQE+Y+R WLE+R GF + F   FYPGKF+ +++SI   GD ++ I  ++ADIA+LE
Sbjct: 403 SPEEQESYIRKWLEERIGFKADFKISFYPGKFSKERRSIFPAGDTSQFISSKDADIAILE 462

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHGKRW  KF  VVGIVHTNYLEY+KREKN  LQAF +  VN+W+ R +C K
Sbjct: 463 EPEHLNWYHHGKRWTDKFNHVVGIVHTNYLEYIKREKNGALQAFFVNHVNNWVTRAYCDK 522

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           V+RLS ATQ+ P S+VCNVHGVNPKFL IG K  E++  G +AF+KGAY++G+MVW+KGY
Sbjct: 523 VLRLSGATQDLPKSVVCNVHGVNPKFLMIGGKIAEERSRGEQAFSKGAYFLGKMVWAKGY 582

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFH 300
            EL+ L+  +  EL    +D+YGNGED  ++QRAAKK  L +    GRDHAD   H
Sbjct: 583 RELIDLMAKHKSELGSFNLDVYGNGEDAVEVQRAAKKHDLNLNFLKGRDHADDALH 638


>gi|384252336|gb|EIE25812.1| hypothetical protein COCSUDRAFT_27439 [Coccomyxa subellipsoidea
           C-169]
          Length = 519

 Score =  365 bits (936), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 190/384 (49%), Positives = 248/384 (64%), Gaps = 15/384 (3%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLA---KDGERRVTLVIPWLSLIHQKQVYPGNIT 62
           +H+A+ TTASLPW TGTAVNPL RAAYLA   KD   +V+L++PWL+   Q  VYP  IT
Sbjct: 13  RHVAVITTASLPWRTGTAVNPLLRAAYLAHILKDS--KVSLLVPWLAKSEQGIVYPNGIT 70

Query: 63  FASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAV 122
           F +P+EQ  +VR W+E+RT F   FD +FYPG++ A   SI  VGD T  IPDEEAD+A+
Sbjct: 71  FENPEEQADWVRAWVEERTSFPCKFDIKFYPGRYDASFLSIFPVGDPTIYIPDEEADVAI 130

Query: 123 LEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKR---EKNDRLQAFLLEFVNSWLAR 179
           LEEPEHLTWFHHG+R+  KF  V+GI+HTNY++Y++R        L A +++  N  +  
Sbjct: 131 LEEPEHLTWFHHGRRYTEKFNHVIGIMHTNYIDYIRRGAGSAGGPLAARIVKMANWRMCD 190

Query: 180 VHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMV 239
           +H HKVI+LS A Q  P      VHGV+P FL +G+K     +     F+KGAY+IG+ V
Sbjct: 191 IHTHKVIKLSDAVQNLPRQSTHFVHGVSPAFLAVGDKMAAALKGSAYCFSKGAYFIGKAV 250

Query: 240 WSKGYEELLGLLNIYHKELAG--LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADP 297
           W KGY ELL LL + H++  G  L +D YG GED D I+  A++ +L V     RDH D 
Sbjct: 251 WGKGYTELLDLL-LAHRKAHGSNLPVDAYGTGEDSDDIKERAERYELNVSFLGARDHLDD 309

Query: 298 IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEA 357
             H Y+VF+NPST+DVV T +AEALAMGK +VC  HPSNDFF  F N   Y     F E 
Sbjct: 310 SIHPYRVFINPSTSDVVATTSAEALAMGKWLVCPEHPSNDFFATFENTLIYHSPAEFSEQ 369

Query: 358 TLKAL--AEEPAQPTDAQTHQLSW 379
              A   + +P +P D +  +L+W
Sbjct: 370 LEFAENNSPKPLKPEDRK--RLTW 391


>gi|384247393|gb|EIE20880.1| hypothetical protein COCSUDRAFT_2635, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 396

 Score =  357 bits (916), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 171/396 (43%), Positives = 245/396 (61%), Gaps = 16/396 (4%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFAS 65
           +H+AI +TA++PW+TGTAVNPL RAAY+A   + +VT V+PWL+   Q  V+P N++F +
Sbjct: 1   RHVAIVSTATIPWMTGTAVNPLLRAAYMAHCTDLKVTFVVPWLARCDQDTVFPNNLSFEA 60

Query: 66  PKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEE 125
           P +QE  +R W++ RTG +  FD  +YPG++      I  VGD+T ++ + +AD+ +LEE
Sbjct: 61  PHQQEVCMREWVKQRTGLSPEFDIVWYPGRYDRTMLGIFPVGDLTRVVMECKADVVILEE 120

Query: 126 PEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK- 184
           PEHLTWFHHG RW  +F  VVGI+HT+Y E  +R     +      F NS L  +HCHK 
Sbjct: 121 PEHLTWFHHGPRWTERFNHVVGIIHTSYRELSRRNAGIVMYGISTVF-NSLLCAIHCHKA 179

Query: 185 --------VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIG 236
                   V++LS   Q++P S+   VHG  P F++ G  K    + G K F+KGAY++G
Sbjct: 180 TRPVFLDQVVKLSDTVQQFPRSVTMCVHGAAPSFVQAGAAKAAPTE-GGKRFSKGAYFLG 238

Query: 237 RMVWSKGYEELLGLLNIYH-----KELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG 291
           ++V+ KG+EELL LL+ +      K+ +   +D YG+GE F+ ++R A+KL L +     
Sbjct: 239 KIVYGKGWEELLALLDFHQRHTKDKQTSHPTIDAYGSGEAFESVRRKAEKLNLSINFLGR 298

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGR 351
           +DH DP   DY+VF+N ST+DVV T + EALAMGK ++CA HP N F   F NC  Y   
Sbjct: 299 KDHLDPAIQDYQVFINASTSDVVATTSMEALAMGKWLICAKHPCNAFVSTFSNCLVYSSP 358

Query: 352 NGFVEATLKALAEEPAQPTDAQTHQLSWESATERFL 387
             F +    AL +EP   +  +   L WE+ATER L
Sbjct: 359 AQFSDHIEHALKQEPPPLSADELRNLGWEAATERML 394


>gi|428169992|gb|EKX38921.1| hypothetical protein GUITHDRAFT_115024 [Guillardia theta CCMP2712]
          Length = 460

 Score =  352 bits (904), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 175/382 (45%), Positives = 241/382 (63%), Gaps = 8/382 (2%)

Query: 16  LPWLTGTAVNPLFRAAYLAKDGERR-VTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVR 74
           +PW+TGTA+NPL RA YL    E+  VTL +PW+SL  Q  ++ GN+ F SP EQ  ++R
Sbjct: 1   MPWMTGTAINPLLRAVYLEMRKEQHSVTLCLPWVSLEQQPYIF-GNLRFKSPSEQAEWIR 59

Query: 75  WWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHH 134
            W++   G  ++    FY G +     SI A+GDITEIIP+ E D+ +LEEPEHLTW+H 
Sbjct: 60  DWVKKNLGRPTSIKLVFYEGHYVPIYGSIFALGDITEIIPESERDVVILEEPEHLTWYHM 119

Query: 135 GKRWKAKFRFVVGIVHTNYLEYVKREKND----RL-QAFLLEFVNSWLARVHCHKVIRLS 189
           G  W   F FVVG+VHTNY+ Y   +       RL +A +L    +   R +CH++I+LS
Sbjct: 120 GNDWTKVFNFVVGVVHTNYINYAINDHQVPPPLRLARAAVLGLATAMCTRAYCHRIIKLS 179

Query: 190 AATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLG 249
            A Q++P+SI CN+HGV   F++IG  KM     G+  F+KGAY+IG+M+WSKGY +L  
Sbjct: 180 DAVQKFPHSITCNIHGVRSNFIDIGVSKMSPSIFGSSRFSKGAYFIGKMLWSKGYRQLFV 239

Query: 250 LLNIYHKELA-GLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNP 308
            L  Y ++    L +D++G+G D + I++  K+  L    +   DHA+   HD+KV +NP
Sbjct: 240 NLKEYRRKTGENLHVDIFGSGPDEELIKKEVKQEGLDWTFHGACDHANSRIHDFKVMINP 299

Query: 309 STTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQ 368
           S +DVVCT TAEALAMGK VVCA+HPSN+FFK F NC  Y     F      A+A +PA 
Sbjct: 300 SLSDVVCTTTAEALAMGKFVVCADHPSNEFFKTFRNCFVYSTAKEFKLCIQHAMAADPAP 359

Query: 369 PTDAQTHQLSWESATERFLQVA 390
            T+   ++LSWE+ATER    A
Sbjct: 360 LTENDRYRLSWEAATERLYDAA 381


>gi|238015426|gb|ACR38748.1| unknown [Zea mays]
          Length = 242

 Score =  347 bits (890), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 158/238 (66%), Positives = 195/238 (81%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M RK QH+AIFTTA LPW+TGTAVNPLFRAAYLAK G+  VTLV+PWLS   Q+ VYP  
Sbjct: 1   MARKHQHVAIFTTACLPWMTGTAVNPLFRAAYLAKAGDWEVTLVVPWLSKGDQELVYPNK 60

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           + F+ P EQE YVR WLE+RTG    FD +FYPGKF+ +K+SIL VGDI++ I D++ADI
Sbjct: 61  MRFSLPAEQENYVRRWLEERTGLLPKFDIKFYPGKFSTEKRSILPVGDISQTISDDKADI 120

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTW+HHG+RWK+KF+ V+G+VHTNYLEYVKREKN  + AF+L+ +NSW+  +
Sbjct: 121 AVLEEPEHLTWYHHGRRWKSKFQKVIGVVHTNYLEYVKREKNGYISAFILKHINSWVTDI 180

Query: 181 HCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRM 238
           +CHKVIRLSAATQ+ P S++CNVHGVNPKF+EIG+ K +Q     +AF KGAYYIG+M
Sbjct: 181 YCHKVIRLSAATQDVPRSVICNVHGVNPKFIEIGKLKHQQLCQREQAFFKGAYYIGKM 238


>gi|412991341|emb|CCO16186.1| predicted protein [Bathycoccus prasinos]
          Length = 618

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 177/416 (42%), Positives = 254/416 (61%), Gaps = 32/416 (7%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAKD--------------GERRVTLVIPWLSLIHQ 53
             I TTA+LPW+TGT+VNPL RAAYLA                 +R+V LV+PWL    Q
Sbjct: 90  FTIVTTAALPWMTGTSVNPLLRAAYLANTRTEKDEEEREKEEEIKRKVALVVPWLPKCDQ 149

Query: 54  KQVYPGNITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEII 113
           +QV+P   +F  P+EQ   +  W+ +R GF    +  FYPG++A DK SI+ VGDI E I
Sbjct: 150 RQVFPKRQSFNYPEEQAEAMMEWVTNRVGFRPDVEVLFYPGRYATDKGSIVPVGDIIERI 209

Query: 114 PDEEADIAVLEEPEHLTWFHHGKR-WKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEF 172
           P    D+A+LEEPEHL WFH G++ WK +F  VVGI+HTNYLEY +RE+N   +   +  
Sbjct: 210 PMRLRDVAILEEPEHLNWFHCGRKGWKQEFNLVVGIIHTNYLEYARREENGEQKEAFIRG 269

Query: 173 VNSWLARVHCHKVIRLSAATQEY-PNSIVCNVHGVNPKFLEIGEKKM----EQQQNGNK- 226
           +N  + R++ HKVI+LS A Q++  +++  NVHGV+  FL++G+++     E +++G + 
Sbjct: 270 LNFTMCRIYTHKVIKLSDAVQDFGEDAVTVNVHGVSRAFLDVGKRRADFAKENEKSGERL 329

Query: 227 AFTKGAYYIGRMVWSKGYEELLGLLNIYHKELA--------GLE---MDLYGNGEDFDQI 275
            F+K  Y+I ++VW+KGY ELL ++  Y+K LA        GLE   + ++G+G+D   +
Sbjct: 330 GFSKNCYFIAKVVWAKGYHELLDVVQEYNKSLAMKKEEKEGGLEYLPVSVFGDGDDLWDV 389

Query: 276 QRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS 335
           +   +  K+ +      DH D    D+K+F+NPS +DVV T TAEALAMGK V+CA HPS
Sbjct: 390 KAECRTRKIPLDFKGRLDHLDKSIDDFKIFINPSLSDVVATTTAEALAMGKFVICAEHPS 449

Query: 336 NDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAE 391
           N FF  F NC+TY  +  F     + L  EP    D    +L+WE+AT R L  +E
Sbjct: 450 NAFFATFENCKTYASQEDFNRIMDECLRTEPKPMDDVARARLTWEAATSRLLDASE 505


>gi|298714892|emb|CBJ27648.1| Digalactosyldiacylglycerol synthase, family GT4 [Ectocarpus
           siliculosus]
          Length = 564

 Score =  327 bits (838), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 179/422 (42%), Positives = 248/422 (58%), Gaps = 37/422 (8%)

Query: 10  IFTTASLPWLTGTAVNPLFRAAYLAKDGE-RRVTLVIPWLSLIHQK-------------- 54
           + TTA LPW+TGT++NPL RAA+LA+  +   VTL++P+LSL  Q               
Sbjct: 64  VVTTACLPWMTGTSINPLLRAAFLARGRDADMVTLMVPFLSLEDQPKNRTMAAGVSQQRN 123

Query: 55  -----QVYPGNITFASPKEQEAYVRWWLEDRTGFTSTFDTR--FYPGKFAADKKSILAVG 107
                QV+P  +TF +P+EQE +VR WLED      +   R  FYPG++  D  SI  +G
Sbjct: 124 NEINPQVFPRGVTFDTPEEQEVWVRNWLEDAGLARESERLRLVFYPGRYHKDYGSIFPMG 183

Query: 108 DITEIIPDEEADIAVLEEPEHLTWFHH-GKRWKAKFRFVVGIVHTNYLEYVKREKN-DRL 165
           D+T +IP EEADI +LEEPEHL W+   G+ W+  F+ VVG+VHTNYL Y         +
Sbjct: 184 DLTLMIPPEEADICILEEPEHLNWYRAPGRSWRRTFKHVVGVVHTNYLAYSSGYSVWGPV 243

Query: 166 QAFLLEFVNSWLARVHCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEIGEKKMEQQQN 223
             F+L ++N  +AR +CHK+I+LS   Q        VCNVHGV  KFL++G      Q+ 
Sbjct: 244 LTFMLRYMNIIMARAYCHKIIKLSGVIQSLAPEKETVCNVHGVRQKFLDVG------QEY 297

Query: 224 GNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELA-GLEMDLYGNGEDFDQIQRAAKKL 282
            +K    GAY+IG+ +W+KGY+ L+ LL   +K L     MD+YG+G D + I+  + + 
Sbjct: 298 AHKPRAGGAYFIGKSLWAKGYDRLINLLEYNNKRLGRAFHMDVYGSGPDREAIEAKSCEK 357

Query: 283 KLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQF 342
              +  +P  DH++    DY VF+NPS ++V+CT  AEALAMGK VVCA H SN+FF QF
Sbjct: 358 GCDITFFPATDHSE--LGDYSVFINPSVSEVLCTTVAEALAMGKWVVCARHSSNEFFFQF 415

Query: 343 PNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSK 402
           PNC  +D    F      AL  +P   T A  H+LSW +ATER    A  V +V  ++ +
Sbjct: 416 PNCLPFDSEEDFAACVSWALRHDPEDLTPALRHKLSWAAATERLADAA--VMNVGERKRQ 473

Query: 403 SP 404
            P
Sbjct: 474 KP 475


>gi|18141114|gb|AAL60505.1|AF421194_1 digalactosyldiacylglycerol synthase [Arabidopsis thaliana]
          Length = 235

 Score =  304 bits (778), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 144/222 (64%), Positives = 175/222 (78%)

Query: 238 MVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADP 297
           MVWSKGY+ELL LL  + KELA LE+DLYG+GED ++I+ AA+KL L V VYPGRDHAD 
Sbjct: 1   MVWSKGYKELLKLLEKHQKELAELEVDLYGDGEDSEEIKEAARKLDLTVNVYPGRDHADS 60

Query: 298 IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEA 357
           +FH+YKVFLNPSTTDVVCT TAEALAMGKIVVCANH SN FFKQFPNCRTYD   GFV A
Sbjct: 61  LFHNYKVFLNPSTTDVVCTTTAEALAMGKIVVCANHISNKFFKQFPNCRTYDDGQGFVRA 120

Query: 358 TLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKR 417
           TLKAL E+P+Q T+ Q H+LSWE+AT+RF++V++L        + S      S S++  +
Sbjct: 121 TLKALGEQPSQLTEQQRHELSWEAATQRFIKVSDLNRLSRADSNLSKRSVFASSSISVGK 180

Query: 418 IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGL 459
            +ED   Y+H++ASGFE SR A GAIPGSLQPDE+LC++LGL
Sbjct: 181 NLEDMSAYIHFLASGFEASRTAFGAIPGSLQPDEELCRDLGL 222


>gi|219116326|ref|XP_002178958.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409725|gb|EEC49656.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 400

 Score =  301 bits (772), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 169/402 (42%), Positives = 233/402 (57%), Gaps = 23/402 (5%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLA---KDGERRVTLVIPWLSL-IHQKQVYPG 59
           K + I I TTAS+PW TGTAVNPL RAAYL    K     VTL++PWL   + Q+ VY  
Sbjct: 3   KSKRIEIVTTASMPWRTGTAVNPLLRAAYLTRGRKAAGGSVTLMLPWLERKLDQENVYGK 62

Query: 60  NITFASPKEQEAYVRWWLEDRTGF---TSTFDTRFYPGKFAADKKSILAVGDITEIIPDE 116
             TF SP EQE Y+R WL +       +   + R+Y     + + SI ++GDIT +IP +
Sbjct: 63  ENTFESPVEQEVYIRAWLRESANMPEASEELNIRWYTAWQNSVENSIYSMGDITALIPAD 122

Query: 117 EADIAVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSW 176
           E DI +LEEPEHL W+  G  W  KF+ VVGI+HTNY +Y   +    ++A  +  + SW
Sbjct: 123 EVDICILEEPEHLNWY--GLLWTKKFKHVVGILHTNYFQYALDQPAAFIRAPAMRLLCSW 180

Query: 177 LARVHCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEIG----EKKMEQQQNGNKAFTK 230
           + R HCH+VI+LS           +V NVHGV   FLE+     +K +      +  F  
Sbjct: 181 MCRAHCHRVIKLSGTLDVVAPEKELVENVHGVREDFLEVAAKLRDKVLAADHVKDPIFAS 240

Query: 231 GA----YYIGRMVWSKGYEELLGLLNIYHKELAGL--EMDLYGNGEDFDQIQRAAKKLKL 284
            +    Y+IG+M+WSKG   L+ LL  Y +E A L  ++D+YG+G D       AK L+L
Sbjct: 241 DSPPTVYFIGKMLWSKGLGSLMELLK-YAEESADLNVKVDMYGSGPDQGAATAKAKSLEL 299

Query: 285 VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPN 344
            +  +   DH + +   +K+F+NPST++V+CT +AEALAMGK V+  +HPSNDFF QFPN
Sbjct: 300 DMPFHGPVDHVE-LGSTHKIFVNPSTSEVLCTTSAEALAMGKFVILPSHPSNDFFAQFPN 358

Query: 345 CRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERF 386
           C  Y  +  FV     A+   P    D  +H LSWE+AT+R 
Sbjct: 359 CLAYSSKEEFVGNLYYAITHSPEPLADEYSHALSWEAATQRL 400


>gi|224014982|ref|XP_002297152.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968127|gb|EED86477.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 401

 Score =  301 bits (771), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 167/403 (41%), Positives = 238/403 (59%), Gaps = 22/403 (5%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLA---KDGERRVTLVIPWLSL-IHQKQVYPG 59
           K ++I I TTA+LPW+TGTAVNPL RAAYL+   K     VTL++PW+     Q+++Y  
Sbjct: 1   KSKNIWIVTTAALPWMTGTAVNPLLRAAYLSTGRKAEGGSVTLMLPWVEREADQERIYGK 60

Query: 60  NITFASPKEQEAYVRWWLEDRTGF---TSTFDTRFYPGKFAADKKSILAVGDITEIIPDE 116
              F  P+ QE ++R WL D       +   + R+Y       + S+ ++GDI  +IP+E
Sbjct: 61  TKMFERPEIQEEFIRGWLRDAANMKEASEDLEIRWYTAWQEVAENSLYSMGDIIGLIPEE 120

Query: 117 EADIAVLEEPEHLTWFHH-GKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNS 175
             DI VLEEPEHL W+   G+ W AK++ VVGIVHTNY  Y   +    ++A  +  + S
Sbjct: 121 ACDICVLEEPEHLNWYRAPGENWTAKYKHVVGIVHTNYFVYATEQPAAFIRAPGMRLLCS 180

Query: 176 WLARVHCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEIGE----KKMEQQQNGNKAFT 229
           W+ R HCH++I+LS     +     +V NVHGV   FL++G+    K        +  F+
Sbjct: 181 WMCRAHCHRLIKLSGTLGNFAPEKELVENVHGVRRTFLDVGDELRSKLTAPDAASDPIFS 240

Query: 230 KGA----YYIGRMVWSKGYEELLGLLNIYHKELAGLEM--DLYGNGEDFDQIQRAAKKLK 283
             A    Y+IG+M+WSKG   L+ L+  Y +E AGL++  D+YG G + D+    A K+ 
Sbjct: 241 ADADPTVYFIGKMLWSKGLASLMDLMK-YAEESAGLKVKVDMYGGGPNKDEASAKATKMG 299

Query: 284 LVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFP 343
           L +  +   DHA+ +   +K+F+NPST++V+CT  AEALAMGK VV  +HPSNDFF QFP
Sbjct: 300 LDMPFHGAIDHAE-LGWSHKIFINPSTSEVLCTTVAEALAMGKFVVLPSHPSNDFFAQFP 358

Query: 344 NCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERF 386
           NC  Y  +  FV     AL   P   +D  ++ LSWE+ATERF
Sbjct: 359 NCLPYSNKEEFVGNLYYALTHAPEPLSDEYSYALSWEAATERF 401


>gi|358348954|ref|XP_003638506.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
 gi|355504441|gb|AES85644.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
          Length = 228

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 147/230 (63%), Positives = 180/230 (78%), Gaps = 6/230 (2%)

Query: 238 MVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADP 297
           M+WSKGY+ELL LLN + KEL+ LE+DL+G+GED D++Q AAKKL++ VRV+P RDHAD 
Sbjct: 1   MIWSKGYKELLQLLNDHQKELSALELDLFGSGEDSDEVQEAAKKLEMTVRVHPARDHADG 60

Query: 298 IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEA 357
           +FHD+K+F+NPSTTDVVCT TAEALAMGKIVVC +H SN+FFKQFPNC TY+    FVE 
Sbjct: 61  LFHDFKLFINPSTTDVVCTTTAEALAMGKIVVCTDHCSNEFFKQFPNCWTYNNHKEFVEL 120

Query: 358 TLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKR---SKSPSRHLESESLN 414
           TLKAL EEP QPTDAQ H LSWE+ATERFL+  +L  D  ++R   S++ S +L S SL 
Sbjct: 121 TLKALTEEPGQPTDAQRHDLSWEAATERFLKAVDL--DKPSERKLLSRTTSNYL-STSLY 177

Query: 415 SKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLVTELG 464
            ++ +EDA  ++H+VASGFE SRR  GAIP SLQPDEQL KELG     G
Sbjct: 178 LQQTVEDASAFVHHVASGFEISRRIFGAIPHSLQPDEQLRKELGFANTSG 227


>gi|358348920|ref|XP_003638489.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
 gi|355504424|gb|AES85627.1| Digalactosyldiacylglycerol synthase [Medicago truncatula]
          Length = 189

 Score =  298 bits (764), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 139/189 (73%), Positives = 160/189 (84%), Gaps = 1/189 (0%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           MD+K+ HIAIFTTASLPWLTGTAVNPLFRAAYL K G R VTLVIPWLSL  QK VYP N
Sbjct: 1   MDQKR-HIAIFTTASLPWLTGTAVNPLFRAAYLYKAGIRNVTLVIPWLSLKDQKVVYPNN 59

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           ITF SP EQE Y+R WLEDR GF S F  +FYPGKF+ DK+SILAVGDI+EIIPD++ DI
Sbjct: 60  ITFDSPAEQEKYIRQWLEDRVGFASGFSIKFYPGKFSRDKRSILAVGDISEIIPDKDVDI 119

Query: 121 AVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           AVLEEPEHLTWFHHGKRWK KF+ V+GI+HTNYL YVKREKN  LQAFLL+++N+W+  +
Sbjct: 120 AVLEEPEHLTWFHHGKRWKTKFKLVIGIIHTNYLAYVKREKNGTLQAFLLKYLNNWVVGI 179

Query: 181 HCHKVIRLS 189
           +CHK + L+
Sbjct: 180 YCHKCVILT 188


>gi|219119254|ref|XP_002180391.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407864|gb|EEC47799.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 417

 Score =  288 bits (737), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 171/419 (40%), Positives = 227/419 (54%), Gaps = 35/419 (8%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKD---------------GERRVTLVIPWLSL 50
           +H  I TTA+LPW TGTAVNPL RAAYL +                 E  VTLVIPWL L
Sbjct: 1   RHFHIVTTAALPWFTGTAVNPLLRAAYLHEKTRQLNTPANHSTNAVSESWVTLVIPWLEL 60

Query: 51  IHQKQVYPGNITFASPKEQEAYVRWWLEDRTGF------TSTFDTRFYPGKFAADKKSIL 104
           +  ++   G + F  P+EQE Y+R WL    G        S     FYP ++ +   S+ 
Sbjct: 61  VEDQEEVYGRV-FRVPQEQETYIREWLRLEAGLPDAACPQSGLRMLFYPARYHSGLGSVF 119

Query: 105 AVGDITEIIPDEEADIAVLEEPEHLTWFHH-GKRWKAKFRFVVGIVHTNYLEYVKREKND 163
           A+GDI E +     D+ VLEEPEH  W+   G+ W  +F +VVGIVHTNY EY     + 
Sbjct: 120 AMGDIMEHMDPARMDVCVLEEPEHCNWYRAPGEGWTKRFNYVVGIVHTNYKEYASAHYSG 179

Query: 164 RLQAFLLEFVNSWLARVHCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEIGEKKMEQQ 221
              A  L  ++S + R +CHKVI+LS A Q Y        NVHGV   FL+ G ++    
Sbjct: 180 LWTAPALALMSSAMVRAYCHKVIKLSDALQTYAPEKEETSNVHGVRDDFLKEGRRRASTY 239

Query: 222 QNGNKAFTK-----GAYYIGRMVWSKGYEELLGLLNIYHKELAG--LEMDLYGNGEDFDQ 274
            N   A  +       Y+IG+++W+KG + LL L + Y K+  G    +D+YG+G D   
Sbjct: 240 ANDTMALDEVPSETTVYFIGKLLWTKGLDILLELEDDY-KQYTGQYFSIDVYGSGPDQKD 298

Query: 275 IQRAAKKLKLVVRVYPGR-DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH 333
           I RA    +     +PGR DHA  +   YKVF+NPS ++V+CT TAEALAMGK V+   H
Sbjct: 299 IMRAYLGRRKRSTTFPGRVDHAI-LTEQYKVFVNPSVSEVLCTTTAEALAMGKFVIIPVH 357

Query: 334 PSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAEL 392
           PSN FF +FPNC  Y  R  FV     AL  EP   +       SWE+AT+RF+Q + +
Sbjct: 358 PSNTFFLRFPNCLGYRNRFEFVANLRWALTHEPDPLSPELATTFSWEAATDRFIQASAI 416


>gi|299471221|emb|CBN79076.1| Digalactosyldiacylglycerol synthase, family GT4 [Ectocarpus
           siliculosus]
          Length = 649

 Score =  277 bits (709), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 166/447 (37%), Positives = 233/447 (52%), Gaps = 62/447 (13%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGER-RVTLVIPWLSLIHQKQVYPGNITF 63
           ++ I + TTA+LPW+TGT+VNPL RAAYL +  +  +V+L+IPWL L  Q  V P    +
Sbjct: 130 ERRIWVVTTAALPWMTGTSVNPLLRAAYLTRGRDPGKVSLMIPWLGLEDQHFVLPDGHRY 189

Query: 64  ASPKEQEAYVRWWLEDRTGFTSTFD---TRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
              ++QEA++R WL    G TS  D     +Y  ++      I  +GDIT +IPD+EAD+
Sbjct: 190 ERKEDQEAFIRGWLRG-AGMTSEADDLRIAWYDARYHQVAGCIFPMGDITRLIPDDEADV 248

Query: 121 AVLEEPEHLTWFHH-GKRWKAKFRFVVGIVHTNYLEYVKREKN----DRLQAFLLEFVNS 175
            ++EEPEHL WF   G  W  KF  VVG++HTNY+ Y   + N     R++A      N 
Sbjct: 249 CIMEEPEHLNWFRATGVNWSKKFTHVVGVIHTNYVHYTLADTNHWASGRVKAPFARAFNK 308

Query: 176 WLARVHCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKG-- 231
            +AR +C KVI+LSA  Q++      V NVHGV   FL +G+ + +    G + F  G  
Sbjct: 309 IMARAYCDKVIKLSATLQKFAEEKETVTNVHGVRENFLLVGDDRAKAAARG-EPFAAGNR 367

Query: 232 AYYIGRMVWSKGYEELLGLLNIY----------------------------------HKE 257
            Y++G+M+W KGY +L  LL  Y                                  H  
Sbjct: 368 PYFLGKMLWEKGYGKLWDLLEGYQAAQNQPDGEEGREASSPAALAEKKTAAAGDHRAHAT 427

Query: 258 LAGLEM-----------DLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFL 306
            AG ++             YG+G D D I+  A  + L V   P  DHA+     YK F+
Sbjct: 428 PAGDKVDAGGDGGGIILGAYGSGPDSDPIRERAAVMGLSVEFNPATDHAE--LSQYKTFV 485

Query: 307 NPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEP 366
           NPS ++V+CT  AEALAMGK VV A H SN+FF QFPN   +  +  F E    ++  EP
Sbjct: 486 NPSESEVLCTTVAEALAMGKFVVIAEHASNEFFYQFPNTLKFKSQEEFNEQLSYSMTNEP 545

Query: 367 AQPTDAQTHQLSWESATERFLQVAELV 393
              T  Q H L W +AT+R ++ A++ 
Sbjct: 546 VPLTPEQRHVLGWSAATDRLVESAKVT 572


>gi|323454331|gb|EGB10201.1| hypothetical protein AURANDRAFT_23035, partial [Aureococcus
           anophagefferens]
          Length = 475

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 167/418 (39%), Positives = 224/418 (53%), Gaps = 36/418 (8%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITF 63
           K + + I TTASLPW+TGTAVNP  RAAY+A  G   VTL++PW++    +    G   F
Sbjct: 64  KDRSVVIVTTASLPWMTGTAVNPALRAAYMAGGGYGDVTLMLPWMADGGDQTALFGACRF 123

Query: 64  ASPKEQEAYVRWWL----EDRTG--FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEE 117
             P  QEA+VR W+     D  G   +S+    +YP ++AA   SIL + DIT  IP   
Sbjct: 124 PEPAAQEAFVRAWIAENAPDAVGPDGSSSLKIGWYPARYAAGLGSILNLDDITSHIPRAC 183

Query: 118 ADIAVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAF--LLEFV-N 174
            D+ +LEEPEHL W+ +G RW ++FR VVG+ HTNY  Y   E  D    F  L E V  
Sbjct: 184 DDVVILEEPEHLNWYRNGPRWTSRFRHVVGVAHTNYEAYATLESRDGRVGFDVLAERVFT 243

Query: 175 SWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAF-TKGA- 232
             + R HC  V++LSA  +  P+S V NVHGV   FL++G         G  AF  +G  
Sbjct: 244 ETVTRAHCDVVVQLSATLRPLPHSRVANVHGVRKPFLDVG------AAGGPPAFPPRGGD 297

Query: 233 ---YYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY 289
              Y++G+ +W+KGY++LL  L     + A   +D YG G D D I   +K L + +   
Sbjct: 298 ARCYFLGKAMWAKGYDQLLVFLGDGAAD-ADARVDCYGGGPDLDDIVAKSKILGVGLDFR 356

Query: 290 PGRDHADP-IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNC--- 345
              DHAD  +F  Y VF+NPS ++V+CTATAEALAMGK VV A HPSN+FF QF  C   
Sbjct: 357 GPADHADANVFGAYDVFVNPSISEVLCTATAEALAMGKRVVIAKHPSNEFFYQFDGCHAV 416

Query: 346 --------RTYDGRNGFVEATLKAL---AEEPAQPTDAQTHQLSWESATERFLQVAEL 392
                   R             +A    +  P +        L+W++ATER +  A L
Sbjct: 417 APGDAASFRKELAAALAAAEDDRARYLPSSTPVRVVPDALRPLTWDAATERLVGAAAL 474


>gi|414865519|tpg|DAA44076.1| TPA: hypothetical protein ZEAMMB73_930994 [Zea mays]
          Length = 191

 Score =  272 bits (695), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 121/182 (66%), Positives = 150/182 (82%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           ++H AIFTTASLPW+TGT++NPLFRAAYLAKDG++ VTLVIPWL L  Q+ VYP NI F 
Sbjct: 4   KRHFAIFTTASLPWMTGTSINPLFRAAYLAKDGDKDVTLVIPWLCLRDQELVYPNNIVFD 63

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           SP EQE+YVR W+E+R  F  +F  +FYPGKF+ + +SIL VGDITE IPDE AD+AVLE
Sbjct: 64  SPSEQESYVRHWIEERIDFRPSFSIKFYPGKFSKEMRSILPVGDITECIPDEVADVAVLE 123

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL W+HHG+RWK KFR V+GIVHTNYL YV+REKN ++ A  L++ N+W+ R++CHK
Sbjct: 124 EPEHLNWYHHGRRWKNKFRRVIGIVHTNYLAYVRREKNGQVIACFLKYANTWVTRIYCHK 183

Query: 185 VI 186
           VI
Sbjct: 184 VI 185


>gi|414866032|tpg|DAA44589.1| TPA: hypothetical protein ZEAMMB73_630561 [Zea mays]
          Length = 227

 Score =  271 bits (694), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 130/222 (58%), Positives = 164/222 (73%), Gaps = 1/222 (0%)

Query: 238 MVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADP 297
           MVWSKGY ELL LL+ +  EL+GL+M+LYG+GED D+++ +A++L L +RVYPGRDH D 
Sbjct: 1   MVWSKGYTELLQLLHKHQMELSGLKMELYGSGEDADEVKASAERLSLDIRVYPGRDHGDS 60

Query: 298 IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEA 357
           IFHDYKVF+NPSTTDVVCT TAEALAMGKIV+CANHPSN FFK+FPNC  Y+    FV  
Sbjct: 61  IFHDYKVFINPSTTDVVCTTTAEALAMGKIVICANHPSNVFFKRFPNCHMYNTDEEFVRL 120

Query: 358 TLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNS-K 416
           T+KALAEEP   +D   H+LSWE+ATERF++VA++   V   +    S+H    S +  +
Sbjct: 121 TMKALAEEPIPLSDDLRHELSWEAATERFIRVADIAPTVPVGQPPPSSQHFMRISPDELQ 180

Query: 417 RIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELG 458
           + +E+A  + H   SGFE  R   GAIP +LQPDEQ CKELG
Sbjct: 181 KNMEEASAFFHNTISGFEAVRCVFGAIPNTLQPDEQQCKELG 222


>gi|323448050|gb|EGB03954.1| hypothetical protein AURANDRAFT_2181 [Aureococcus anophagefferens]
          Length = 421

 Score =  254 bits (648), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 152/425 (35%), Positives = 221/425 (52%), Gaps = 50/425 (11%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYL-AKDGERRVTLVIPWLSLIHQKQVYPGNITFA 64
           + I I TTAS+PWLTGTA+NPL RAA+L A   +  VTL +PWL    Q +VY  +I F 
Sbjct: 1   KRILIMTTASVPWLTGTAINPLLRAAHLTAGRPDGAVTLYLPWLEPERQHEVYK-DIRFE 59

Query: 65  SPKEQEAYVRWWLEDRTGFTSTFD---TRFYPGKFAADKKSILAVGDITEIIPDEE---- 117
           +  +Q  YV  W+ D  G     +     FY   +   + SI  +G   E +P  +    
Sbjct: 60  TRGDQARYVAAWVRDTAGMAEAAEKLRIAFYDAHYHTPQGSIYPMGRTVEALPRADFRRY 119

Query: 118 ------------ADIAVLEEPEHLTWFH----HGKRWKAKFRFVVGIVHTNYLEYVKREK 161
                        D+ VLEEPEHL W+         W+++   VVG+VHT+Y+ Y   E+
Sbjct: 120 DGGGGAEAGAWAPDVVVLEEPEHLNWYSFVDGDASAWRSQGH-VVGVVHTHYVSYAATER 178

Query: 162 N-------------DRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNS---IVCNVHG 205
                            +A +   ++ W+   HCH++++LS       N    +VCNVHG
Sbjct: 179 ACGGLLGELVHPVVGPFKAMMALLMSRWMTNGHCHRIVKLSNTLPRVGNDDAEVVCNVHG 238

Query: 206 VNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKE----LAGL 261
           V   FL++GE +  +   G    T+GAY+IG+++W KG ++L  LL    +E    L G 
Sbjct: 239 VRGAFLDVGEGRKARGTAG----TEGAYFIGKLIWQKGLDDLGRLLAHTAREFGGTLPGG 294

Query: 262 EMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEA 321
            + + G+G     + R+  K KL    +  RDHA+P+  D++V +NPS T+V+CT  AEA
Sbjct: 295 PVHVVGDGLHRHDVARSFAKRKLPAVFHGRRDHAEPLCQDFRVLVNPSKTEVLCTTIAEA 354

Query: 322 LAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWES 381
           LAMGK VV   HPSN+FF  FP C  ++ +  F      AL  EP   +D    +LSW +
Sbjct: 355 LAMGKWVVIRKHPSNEFFYDFPTCLPFETKAEFATHYAFALRHEPPPLSDRMRRRLSWAA 414

Query: 382 ATERF 386
           AT+RF
Sbjct: 415 ATDRF 419


>gi|412987894|emb|CCO19290.1| predicted protein [Bathycoccus prasinos]
          Length = 963

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 115/213 (53%), Positives = 154/213 (72%), Gaps = 3/213 (1%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFAS 65
           + +AI TTA+LPW+TGTAVNPL RAAYLA+ G   VTLV+P++ +  QK ++P N+ F S
Sbjct: 442 RKVAIVTTATLPWMTGTAVNPLLRAAYLARRGLHEVTLVVPFIPVDEQKTLHPNNV-FES 500

Query: 66  PKEQEAYVRWWLEDRTGF-TSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLE 124
           P++QE +VR W+++R GF        FYPG++A DK SI+ VGD++  I +   D+AVLE
Sbjct: 501 PEQQETFVRNWVKERCGFDVPNLKLNFYPGRYATDKMSIIPVGDVSSHIKNSN-DVAVLE 559

Query: 125 EPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           EPEHL WFH G RW   F  VVGI+HTNYL+YV+ E + +++   L FVNS ++RVHCHK
Sbjct: 560 EPEHLNWFHTGPRWSDTFEHVVGIIHTNYLDYVRLENHGKVKEKALGFVNSVVSRVHCHK 619

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKK 217
           VI+LS A QE+P S   NVHGV+P FL++G  K
Sbjct: 620 VIKLSDAVQEFPRSCTMNVHGVSPVFLDVGASK 652



 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 100/220 (45%), Positives = 130/220 (59%), Gaps = 17/220 (7%)

Query: 228 FTKGAYYIGRMVWSKGYEELLGLL-----NIYHKELAGLEMDLYGNGEDFDQIQRAAKKL 282
           FTKGAY++G++VW KGY ELL  +     N  + +   + MD+YGNGED + ++R A   
Sbjct: 722 FTKGAYFLGKVVWGKGYHELLDCVEKHNANAEYGQTCPISMDVYGNGEDLESVERTAMDK 781

Query: 283 KLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQF 342
           KL +      DHA+P  HDYK+F+NPS +DVV T TAEALAMGK VVCA HPSN+FF  F
Sbjct: 782 KLPLNFKGRLDHANPTVHDYKIFVNPSLSDVVATTTAEALAMGKFVVCAKHPSNEFFSSF 841

Query: 343 PNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSK 402
           PNC TY  +  F +   KA   EP   +    ++LSWE+AT+RFL  AEL  +    + K
Sbjct: 842 PNCLTYGNQEEFSQCMKKAFDTEPKPLSAEDAYRLSWEAATDRFLDAAELGPE---HKEK 898

Query: 403 SPSRHLESESLNSKRIIEDAFGYLHYVASGFETSRRALGA 442
            P     SES     +   AF    Y  +  E  R+ALGA
Sbjct: 899 QPGLSKVSES-----VAASAF----YALNNIEGVRQALGA 929


>gi|223999577|ref|XP_002289461.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974669|gb|EED92998.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 760

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 180/564 (31%), Positives = 265/564 (46%), Gaps = 118/564 (20%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLA-----------------------------K 35
            + I + TTA+LPW TGTAVNPL RAAYL                              K
Sbjct: 170 NRSIYVVTTAALPWRTGTAVNPLLRAAYLTRRTRDINKEQGEQCNAVGHDSSISNVTGTK 229

Query: 36  DGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWL------EDRTGFTSTFDT 89
             ++ VTLVIPWL L   +     N  F +P++QE Y+R WL      E+          
Sbjct: 230 PKKQYVTLVIPWLELEEDRLELYPNHKFDTPEQQETYIREWLRNDAHMEEEANPDHGLRI 289

Query: 90  RFYPGKFAADKKSILAVGDITEIIPD---EEADIA----VLEEPEHLTWFHH-GKRWKAK 141
            FYP ++ +  KSI A+GD+  ++ +   EE D++    +LEEPEHL W+   G  W   
Sbjct: 290 LFYPARYHSGLKSIFAMGDMVAVLTNQTTEEDDLSDAVCILEEPEHLNWYRAPGDGWTKV 349

Query: 142 FRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYP--NSI 199
           F +V+GIVHTNY+EY   + +    A  ++ ++S + R +CHKVI+LS   Q Y      
Sbjct: 350 FNYVIGIVHTNYVEYASTQFHGLWTAPAIQVMSSAMIRAYCHKVIKLSGVLQTYAVEKES 409

Query: 200 VCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKG--------AYYIGRMVWSKGYEELLGLL 251
           V NVHGV   F+  G ++     N     T           YYIG+++W+KG+E++L L 
Sbjct: 410 VDNVHGVREDFIREGRRRAGVSLNQTNITTASLDEEAEGQVYYIGKILWAKGFEQMLELQ 469

Query: 252 NIYHKELAG--LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHAD------------- 296
             Y+ E  G    +D+YG G + ++I+RA    +   + + G+  +D             
Sbjct: 470 EFYN-ECTGKYFAIDVYGGGPEEEEIKRAFHGRRGNSKSHKGKKVSDLEELLSREYIKKK 528

Query: 297 -------------------------------PIFH-----DYKVFLNPSTTDVVCTATAE 320
                                          P+ H      YKVF+NPS ++V+CT T E
Sbjct: 529 FHSIKTSSLEFEMPKSFHELRRKPIPANFLGPVDHALLGEKYKVFVNPSISEVLCTTTFE 588

Query: 321 ALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWE 380
           ALAMGK  +   H SN+FF +FPNC  Y  +  F  A   AL  EP   T     + +WE
Sbjct: 589 ALAMGKFAIVPIHESNEFFMKFPNCLGYRNKWEFAAALRWALTHEPEPLTPDLAQEFTWE 648

Query: 381 SATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAFGYLHYVASGF--ETSRR 438
           +AT+R +Q + +       R ++  R L    L +K++ E    + + +  G   +T R+
Sbjct: 649 AATDRLIQSSAI------SRREAQERAL----LGTKKLDERIAAFHYELGKGTKGDTLRK 698

Query: 439 ALGAIPGSLQPDEQLCKELGLVTE 462
            LG  P S Q   +L K+ GL TE
Sbjct: 699 FLGGGPISNQFRYELAKQ-GLDTE 721


>gi|219126206|ref|XP_002183353.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405109|gb|EEC45053.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 415

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 151/420 (35%), Positives = 210/420 (50%), Gaps = 46/420 (10%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRV-------------------------T 42
           + I TTA+LPW+TGTAVNPL RAAYL      R                          T
Sbjct: 1   VFIVTTAALPWMTGTAVNPLLRAAYLTLRNAARCPHQDCDGTNNSSNAITNKTNTNTTVT 60

Query: 43  LVIPWL-SLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTGFTS----TFDTRFYPGKFA 97
           LVIPWL S   +  +Y      A+   QE ++R WL  R   TS    +    FYP ++ 
Sbjct: 61  LVIPWLESAADRVLLYGDAWKHATTAVQETFIREWLVTRASLTSQQAHSVVIVFYPARYH 120

Query: 98  ADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHH-GK-RWKAKFRFVVGIVHTNYLE 155
               SI A+GD  E +   +  + +LEEPEH+  +   GK  W+ KF  VVGI+HTNY  
Sbjct: 121 HGLSSIFAMGDFCERLTVPDDAVCLLEEPEHINCYRAPGKTSWRKKFAHVVGIIHTNYKA 180

Query: 156 YVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEI 213
           Y     +  L   L+  ++SW  R +C KVI+LS   Q Y      VCNVHG+  +FL  
Sbjct: 181 YASHHYSGLLTGPLVGVLSSWCVRAYCDKVIKLSPVLQTYAAEKETVCNVHGIRDEFLHT 240

Query: 214 GEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAG-LEMDLYGNGEDF 272
                     G K      Y++G+++W+KG +++L L   Y K       MD++G+G + 
Sbjct: 241 ------PAPTGPKI-----YFLGKLLWAKGLDKMLRLQYAYRKATGSYFAMDVFGSGPEE 289

Query: 273 DQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            +I++A     L    + GR     I  DYK+ +NPS T+V+CT+TAEA+AM K V+   
Sbjct: 290 KEIRKAFLGEALEDDDFLGRQDHGTISTDYKIMVNPSITEVLCTSTAEAVAMSKFVILPT 349

Query: 333 HPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAEL 392
           HPSN FF+QFPNC  Y+    F      A +  P   T      LSW +AT R L+  ++
Sbjct: 350 HPSNVFFEQFPNCLFYETPADFCRVLQHATSHNPEPLTPECRDVLSWSAATTRLLEAGQV 409


>gi|356519373|ref|XP_003528347.1| PREDICTED: uncharacterized protein LOC100810790 [Glycine max]
          Length = 644

 Score =  235 bits (599), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 101/174 (58%), Positives = 133/174 (76%)

Query: 19  LTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLE 78
           +TG AVNPLFR AYL++  +++VTL++PWL    Q+ V P N+TF SP+EQEAY+R WLE
Sbjct: 1   MTGRAVNPLFRGAYLSEFAKQKVTLLVPWLCKSDQELVCPNNLTFTSPEEQEAYMRSWLE 60

Query: 79  DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGKRW 138
           +R GF + F   FYPGKF+  ++SI+  GD ++ IP  +ADIA+L+EPEHL W+HHGKRW
Sbjct: 61  ERIGFKADFKISFYPGKFSEARRSIIPAGDTSQFIPSRDADIAILKEPEHLNWYHHGKRW 120

Query: 139 KAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAAT 192
             KF  VVGIVHTNY+EY+KREKN  LQ FL++ +N+W+ R +CHKV RL  AT
Sbjct: 121 TNKFNHVVGIVHTNYIEYIKREKNGALQVFLVKHINNWVTRAYCHKVFRLLDAT 174


>gi|397640440|gb|EJK74120.1| hypothetical protein THAOC_04222 [Thalassiosira oceanica]
          Length = 800

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 143/396 (36%), Positives = 215/396 (54%), Gaps = 32/396 (8%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN-----ITFASPKEQEAYVRWWL 77
           A NP+      A DG+  VTLV+PWL     ++V  G      + F   +EQEAY+R WL
Sbjct: 351 ANNPMLSLGSDANDGQ--VTLVVPWLQDDDDRRVLYGTSGNQTVPFKDQEEQEAYIRNWL 408

Query: 78  EDRTGFTST---FDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHH 134
                  +     +  FYP KF     SI A+GDI E+I D  +D+ +LEEPEHL W++ 
Sbjct: 409 STEADMAAEAEELNIIFYPAKFHNKYNSIFALGDICELIDDRTSDVCILEEPEHLNWYNK 468

Query: 135 GK---RWKAKFRFVVGIVHTNYLEY-VKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSA 190
            K    W +KF   VG++HTNY  Y +   +   L A +L  VN  +   +CH+V++LS 
Sbjct: 469 PKGTSPWTSKFGHCVGVIHTNYKAYALNHGRAGVLTAPILAGVNRLVVANNCHRVVKLSG 528

Query: 191 ATQEYPNS-IVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLG 249
             QE+  S I+ N+HG+   +L+    +  +++    A    AY+IG+++W+KG+++L+ 
Sbjct: 529 VLQEFGGSEIIENIHGIRKAYLD----EGRRRRLRQGARGGRAYFIGKLLWAKGFDQLIE 584

Query: 250 LLNIYHKELAG-LEMDLYGNGEDFDQIQ------RAAKKLKLVVRV-----YPGR-DHAD 296
           L + Y        ++D+YG+G D DQI+      R   K +L  R      + GR DHA 
Sbjct: 585 LQSSYLDRTGEYFDIDIYGSGPDEDQIKEAFLSSRDPTKWRLSPREAIPANFMGRIDHAA 644

Query: 297 PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVE 356
               +Y +F+NPS T+V+CT TAEA AMGK V+  +HPSN +F++F NC  Y  R  FV 
Sbjct: 645 LAGDEYDIFVNPSVTEVLCTTTAEATAMGKWVLIPSHPSNSYFERFDNCLLYRNRREFVS 704

Query: 357 ATLKALAEEPAQPTDAQTHQLSWESATERFLQVAEL 392
               A A  P    +    +LSW +AT R ++ + +
Sbjct: 705 KLKHAKANPPPVLAEEVAEELSWNAATARCVRASAI 740



 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 18/26 (69%), Positives = 22/26 (84%)

Query: 10  IFTTASLPWLTGTAVNPLFRAAYLAK 35
           + TTA+LPW TGTAVNPL R+AYL +
Sbjct: 237 VVTTAALPWFTGTAVNPLLRSAYLLR 262


>gi|224005637|ref|XP_002291779.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972298|gb|EED90630.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 403

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 130/346 (37%), Positives = 205/346 (59%), Gaps = 20/346 (5%)

Query: 59  GNITFASPKEQEAYVRWWLEDRTGF---TSTFDTRFYPGKFAADKKSILAVGDITEIIPD 115
           G  +F   +EQE ++R WL +  G        +  FYP ++    +SI A+GDI ++IP+
Sbjct: 4   GGDSFEDEQEQEVFIREWLANEAGMPDEAKELNITFYPARYHHFAQSIFALGDICDLIPN 63

Query: 116 EEADIAVLEEPEHLTWFHH--GKRWKAKFRFVVGIVHTNYLEYVK-REKNDRLQAFLLEF 172
           ++ D+ +LEEPEHL W+       W +KFR V+GI+HTNY  YV+       L A L   
Sbjct: 64  DQTDVCILEEPEHLNWYRAPGSSPWTSKFRHVLGIIHTNYKSYVRGHAPAGFLAAPLTAG 123

Query: 173 VNSWLARVHCHKVIRLSAATQEY--PNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTK 230
           VNS + + +CH+V++LS+  Q +     +V NVHG+   +LE    +  + ++ + + +K
Sbjct: 124 VNSLVVQANCHRVVKLSSVLQSFMPGKEVVQNVHGIRSSYLE----ERRRIRSASASSSK 179

Query: 231 GAYYIGRMVWSKGYEELLGLLNIYHKELAG--LEMDLYGNGEDFDQIQRAAKKLK----L 284
            AY+IG+++W+KG+  LL L   Y+++  G   E +++G+G D ++I+RA +K +    L
Sbjct: 180 KAYFIGKLLWAKGFTHLLEL-EFYYRQKTGNYFECEIFGSGPDEEEIKRAFQKGQGDQPL 238

Query: 285 VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPN 344
            V+     DH+  +  D  VF+NPS T+V+ T TAEA+AMGK V+  +H SN+FF+QFPN
Sbjct: 239 PVKFLGRADHSS-LAGDEFVFVNPSLTEVLATTTAEAIAMGKFVIIPSHSSNEFFEQFPN 297

Query: 345 CRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVA 390
           C TY  R  FV     A+  EP   ++   + LSWE+AT R +  A
Sbjct: 298 CLTYRNRREFVSLLKYAMRNEPPPLSEELAYLLSWEAATMRCVSAA 343


>gi|299116496|emb|CBN76211.1| Digalactosyldiacylglycerol synthase, family GT4 [Ectocarpus
           siliculosus]
          Length = 579

 Score =  221 bits (563), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 117/254 (46%), Positives = 166/254 (65%), Gaps = 14/254 (5%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGER-RVTLVIPWLSLIHQKQVYPGNITFA 64
           + I +FTTASLPW+TGTAVNPL RAAYL +  ++ +VTL +PWL    Q++VYP    +A
Sbjct: 40  RRIWVFTTASLPWMTGTAVNPLLRAAYLTRGRDKGKVTLGVPWLVDEDQEKVYPKGKRYA 99

Query: 65  SPKEQEAYVRWWLEDRTGF---TSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           + ++QEAYVR WL  + G    +   D  FYP ++     SI  +GD+    PDEEAD+A
Sbjct: 100 AKEDQEAYVREWLV-KAGLEEESKKLDILFYPARYHEQFGSIFPMGDLAAQAPDEEADVA 158

Query: 122 VLEEPEHLTWFH-HGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARV 180
           VLEEPEHL +F   G  W  KF +VVGI+HTNY  Y + EK+ R++  +++   ++  R 
Sbjct: 159 VLEEPEHLNFFRAEGVPWLNKFNYVVGIIHTNYQFYARGEKHGRVKKPIVKAACAFTVRA 218

Query: 181 HCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRM 238
           HCH++I+LS A Q Y     +V NVHGV P+F E+G+   E  +NG   FT  AY+IG++
Sbjct: 219 HCHRIIKLSDALQGYAREKEMVENVHGVRPQFFEVGD---EAVKNG---FTGDAYFIGKV 272

Query: 239 VWSKGYEELLGLLN 252
           +W+KG + LL L++
Sbjct: 273 LWTKGIDILLALMH 286



 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 69/188 (36%), Positives = 103/188 (54%), Gaps = 14/188 (7%)

Query: 265 LYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAM 324
           +YGNG D D+++   +++ L V  +   DHA+     YKVF+NPS ++V+CT  AEALAM
Sbjct: 375 IYGNGSDLDEVKEKVQEMDLPVSFHDAIDHAE--LGSYKVFVNPSQSEVLCTTIAEALAM 432

Query: 325 GKIVVCANHPSND-FFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESAT 383
           GK VVCA HPSN+ FF  F  C ++     F+    KAL+E P + ++   H+LSW +AT
Sbjct: 433 GKWVVCARHPSNEFFFSNFETCLSFSDEREFLSCMQKALSETPPRLSEETRHKLSWAAAT 492

Query: 384 ERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAFGYLH-YVASGFETS--RRAL 440
           +RF+  A         R + P R+  S+S      ++     LH +   G      R+A+
Sbjct: 493 DRFMAAA--------SRPEVPGRNNTSKSARGLARLDKLLAALHQHTMKGKRGDLLRQAM 544

Query: 441 GAIPGSLQ 448
           G  P + Q
Sbjct: 545 GGGPVAKQ 552


>gi|223942621|gb|ACN25394.1| unknown [Zea mays]
 gi|413937160|gb|AFW71711.1| hypothetical protein ZEAMMB73_218762 [Zea mays]
          Length = 238

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 100/228 (43%), Positives = 141/228 (61%), Gaps = 6/228 (2%)

Query: 238 MVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADP 297
           MVW+KGY EL+ L+  +  +L G ++D+YG+GED  ++Q  A++L L +  + GRDHAD 
Sbjct: 1   MVWAKGYRELIDLMAKHKSDLEGFKLDVYGSGEDSQEVQSTARRLDLSLNFFKGRDHADN 60

Query: 298 IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEA 357
             H YKVF+NPS +DV+CTATAEALAMGK V+CA HPSN+FF  FPNC TY     FV  
Sbjct: 61  SLHGYKVFINPSISDVLCTATAEALAMGKFVICAEHPSNEFFMSFPNCLTYKTSEEFVAR 120

Query: 358 TLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPS-----RHLESES 412
             +A+  EP   T  Q + LSWE+ATERF++ ++L   V+   +  P      ++  +  
Sbjct: 121 VKEAMDREPQPLTPEQRYNLSWEAATERFMEYSDL-DKVLNNEAAQPGQGRKRKNKRTSQ 179

Query: 413 LNSKRIIEDAFGYLHYVASGFETSRRALGAIPGSLQPDEQLCKELGLV 460
            N   I++      H   +G E  R A GAIPG+   D+Q C ++GL+
Sbjct: 180 PNLSDIMDGGLALAHRCLTGNEVLRLATGAIPGTRDYDKQHCVDMGLL 227


>gi|359479615|ref|XP_002276164.2| PREDICTED: digalactosyldiacylglycerol synthase 2,
           chloroplastic-like [Vitis vinifera]
          Length = 144

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 87/142 (61%), Positives = 105/142 (73%), Gaps = 1/142 (0%)

Query: 324 MGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESAT 383
           MGKIVVCANHPSNDFFKQF NCRTY   NGFV+ TLK L+EEPAQ T AQ H+LSW++AT
Sbjct: 1   MGKIVVCANHPSNDFFKQFTNCRTYQDNNGFVKETLKTLSEEPAQLTYAQMHELSWDAAT 60

Query: 384 ERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAFGYLHYVASGFETS-RRALGA 442
           ERFLQ A L   V +K +K+P +   S  +N ++ ++DA  Y+HYVASG E S RR  GA
Sbjct: 61  ERFLQAAGLDQAVESKPTKTPLKKFMSMRMNLRKNMDDASAYVHYVASGIEASWRRVFGA 120

Query: 443 IPGSLQPDEQLCKELGLVTELG 464
           IPGSLQPDE+  +ELG    +G
Sbjct: 121 IPGSLQPDEEQRQELGWAFPIG 142


>gi|397615282|gb|EJK63335.1| hypothetical protein THAOC_16014 [Thalassiosira oceanica]
          Length = 449

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 96/223 (43%), Positives = 132/223 (59%), Gaps = 16/223 (7%)

Query: 177 LARVHCHKVIRLSAATQEYP--NSIVCNVHGVNPKFLEIGEK---------KMEQQQNGN 225
           + R HCH+VI+LS   Q++     +V NVHGV   FL+IG++          M     G 
Sbjct: 1   MCRAHCHRVIKLSGTLQQFAPEKELVENVHGVRRTFLDIGDELRCKLTTPDAMPDPVFGA 60

Query: 226 KAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEM--DLYGNGEDFDQIQRAAKKLK 283
            A T   Y+IG+M+WSKG + L+ L+  Y +E A L++  D+YG G D D+    A K+ 
Sbjct: 61  DA-TPAVYFIGKMLWSKGIDSLMELIK-YAEESADLKVKVDMYGGGPDKDEASERAAKMG 118

Query: 284 LVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFP 343
           L +  +   DH++ +   +KVF+NPST++V+CT  AEALAMGK VV  +HPSNDFF QFP
Sbjct: 119 LEMPFHGPVDHSE-LGWTHKVFINPSTSEVLCTTVAEALAMGKFVVLPSHPSNDFFAQFP 177

Query: 344 NCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERF 386
           NC  Y  +  FV     AL  EP   +   +  LSWE+ATERF
Sbjct: 178 NCLPYTNKEEFVGNLYYALTHEPEPLSSEYSRALSWEAATERF 220


>gi|62319070|dbj|BAD94208.1| digalactosyldiacylglycerol synthase [Arabidopsis thaliana]
          Length = 149

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 80/136 (58%), Positives = 101/136 (74%)

Query: 324 MGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESAT 383
           MGKIVVCANH SN FFKQFPNCRTYD   GFV ATLKAL E+P+Q T+ Q H+LSWE+AT
Sbjct: 1   MGKIVVCANHISNKFFKQFPNCRTYDDGQGFVRATLKALGEQPSQLTEQQRHELSWEAAT 60

Query: 384 ERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAFGYLHYVASGFETSRRALGAI 443
           +RF++V++L        + S      S S++  + +ED   Y+H++ASGFE SR A GAI
Sbjct: 61  QRFIKVSDLNRLSRADSNLSKRSVFASSSISVGKNLEDMSAYIHFLASGFEASRTAFGAI 120

Query: 444 PGSLQPDEQLCKELGL 459
           PGSLQPDE+LC++LGL
Sbjct: 121 PGSLQPDEELCRDLGL 136


>gi|345447320|gb|AEN92270.1| digalactosyldiacylglycerol synthase [Chromera velia]
          Length = 217

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 85/201 (42%), Positives = 133/201 (66%), Gaps = 12/201 (5%)

Query: 6   QHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGER-RVTLVIPWLSLIHQKQ--VYPGNIT 62
           ++I I TTA+LPW+TGTAVNPL RAA+L +  ++ +V+L++PWL   HQ+Q  VYP  +T
Sbjct: 14  RNITIITTAALPWMTGTAVNPLLRAAFLTRGRKKGQVSLLVPWLEK-HQQQSKVYPKGVT 72

Query: 63  FASPKEQEAYVRWWLE---DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEAD 119
           F S +EQ  ++  WL    D      + D  FY G + ++ +SIL +GDI +++PD+ AD
Sbjct: 73  FDSQEEQREFILNWLRESGDMREEAESLDLMFYNGSYHSEYRSILPIGDIIKLVPDDHAD 132

Query: 120 IAVLEEPEHLTWFH----HGKRWKAKFRFVVGIVHTNYLEYVKREKN-DRLQAFLLEFVN 174
           + VLEEPEHL W+         ++ KFR VVG++HTNY+EY + ++    L + +L +VN
Sbjct: 133 VCVLEEPEHLNWYRAVDPDVPPFQKKFRHVVGVIHTNYVEYARTQRELGYLASPILFYVN 192

Query: 175 SWLARVHCHKVIRLSAATQEY 195
           ++L R +C ++I+LS   Q++
Sbjct: 193 NFLVRCYCDRIIKLSDTLQDF 213


>gi|219111375|ref|XP_002177439.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411974|gb|EEC51902.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 726

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 107/307 (34%), Positives = 165/307 (53%), Gaps = 39/307 (12%)

Query: 5   QQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERR-VTLVIPWLSLIH-QKQVYPGNIT 62
            + I + TTA++PW TGT++NPL RA YL +   +  +TLVIPWL  I  +K++Y   + 
Sbjct: 94  NRKIWVVTTAAMPWRTGTSLNPLMRALYLTRGRPKHSITLVIPWLEDIKSRKKLYGDALC 153

Query: 63  F--ASPKEQEAYVRWWLEDRTGFT---STFDTRFYPGKFAADKKSILAVGDITEIIPDEE 117
           F     + QE ++R +  +R             F+ G++     SI  V DI  +IP +E
Sbjct: 154 FDDGGKQAQEQWIREYCRERCKCEEEEQNLRIMFWRGRYHDGFGSIFPVEDICSLIPKKE 213

Query: 118 ADIAVLEEPEHLTWFHH----GKR----------WKAKFRFVVGIV------HTNYLEYV 157
           AD+A+LEEPEHL WF      GK           W  KF+ VVG+V      HTNY  Y+
Sbjct: 214 ADVAILEEPEHLNWFRLPTKVGKNEENQDVDRLGWAHKFKHVVGVVFFLLQLHTNYGAYI 273

Query: 158 KR--EKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSI--VCNVHGVNPKFLEI 213
           ++       + A  L+ ++S + R +CH+++RLSA      + I    NVHGV  +FL  
Sbjct: 274 RQYGMGTSFVTAPALDALSSLVVRAYCHRLVRLSATLPSLDSDIEVTSNVHGVRSEFLSP 333

Query: 214 GEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAG--LEMDLYGNGED 271
            ++K E  +          Y++G+++W+KG++++L +   YH E+AG    MD+YG G+D
Sbjct: 334 PQRKSETTKP-----HAPVYFVGKLIWAKGFDKVLEVQEAYH-EVAGEYFAMDIYGGGDD 387

Query: 272 FDQIQRA 278
              IQR 
Sbjct: 388 MKAIQRG 394



 Score = 89.7 bits (221), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 48/119 (40%), Positives = 65/119 (54%), Gaps = 9/119 (7%)

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLK 360
           D+K+FLN S T+V+CT +AEALAMGK V+   H SN+FF  FPNC  ++  +  V     
Sbjct: 582 DHKIFLNMSITEVLCTTSAEALAMGKFVILPKHSSNEFFYCFPNCLAFEDMDDCVRKIQY 641

Query: 361 ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDV---------VTKRSKSPSRHLES 410
           AL  +P   TD     LSWE AT+R    + +  D          V KR K+   H++S
Sbjct: 642 ALTNKPEPLTDKFVRMLSWEGATDRLYDSSGMTRDEADKLKEAGRVEKREKAARFHVDS 700


>gi|383154099|gb|AFG59186.1| Pinus taeda anonymous locus 2_8731_01 genomic sequence
 gi|383154101|gb|AFG59187.1| Pinus taeda anonymous locus 2_8731_01 genomic sequence
 gi|383154103|gb|AFG59188.1| Pinus taeda anonymous locus 2_8731_01 genomic sequence
 gi|383154105|gb|AFG59189.1| Pinus taeda anonymous locus 2_8731_01 genomic sequence
          Length = 165

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 78/140 (55%), Positives = 99/140 (70%), Gaps = 2/140 (1%)

Query: 255 HK-ELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHD-YKVFLNPSTTD 312
           HK EL G  +D++G GED  ++Q  A KLKL +  + GRDHAD   H  YKVF+NPS +D
Sbjct: 3   HKDELDGFNLDVFGTGEDSAEVQFTAHKLKLNMNFHQGRDHADDALHSSYKVFINPSVSD 62

Query: 313 VVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDA 372
           V+CTATAEALAMGKIVVCA+HPSN+FF  FPNC  Y     FV+   +ALA EP   +  
Sbjct: 63  VLCTATAEALAMGKIVVCADHPSNEFFGSFPNCLMYKNSEEFVKKVKEALAAEPVPLSAE 122

Query: 373 QTHQLSWESATERFLQVAEL 392
           Q ++LSWE+ATERF+  A++
Sbjct: 123 QQYRLSWEAATERFINYADM 142


>gi|163797706|ref|ZP_02191654.1| hexosyltransferase; glycosyltransferase [alpha proteobacterium
           BAL199]
 gi|159177054|gb|EDP61617.1| hexosyltransferase; glycosyltransferase [alpha proteobacterium
           BAL199]
          Length = 389

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 118/375 (31%), Positives = 185/375 (49%), Gaps = 31/375 (8%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPK 67
           + + TTA+ PWLTG A   L++A  LA+ G R V  V+PW+    Q  ++ G + FA+P+
Sbjct: 11  VTVMTTAAPPWLTGPAYLSLWQACGLAELGFR-VAYVVPWVPPAGQSLLWQGQV-FATPQ 68

Query: 68  EQEAYVRWWLE--DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEE 125
           +Q A++   +    R    + F  R +  KF    +SI+ + D+    P   A + VL E
Sbjct: 69  DQLAWLSAEIRRMGRPVVPALFHYRGHASKFL---RSIVPLEDVFRAAP--PARVHVLTE 123

Query: 126 PEHLTWFHHG-KRWKAKFRFVVGIVHTNYLEYVKREKNDR--LQAFLLEFVNSWLARVHC 182
           PEHL W+     R       V+G+V T+Y  Y++R       + A L+  ++ +L R H 
Sbjct: 124 PEHLCWYPGATPRRHVDAETVLGVVMTSYDSYIRRHGGPAAWIGAPLVRQLHRFLIRRHT 183

Query: 183 HKVIRLSAA----TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRM 238
              + +S A    T  +P  +   V GV P + E+    +E  Q G        Y++GR+
Sbjct: 184 DWTVPVSPAAEGITSGHPVRL-GRVTGVLPDYAEV--PPVEPGQGG-------VYFLGRL 233

Query: 239 VWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPI 298
           VW KG   ++ +          L +++ G G D D I+  A+ L   V+          +
Sbjct: 234 VWDKGLSTVVEV-----SRRMNLPLEVLGEGPDGDAIRAMARDLAAPVKFLGPTREPWTL 288

Query: 299 FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEAT 358
            H Y+VF NPS ++V+CT TAEAL  G+ VV  + P+N+ FK +PN   Y   +G V A 
Sbjct: 289 LHRYRVFFNPSLSEVLCTTTAEALVAGRHVVLPDCPANEPFKAYPNAHFYTDVDGAVAAL 348

Query: 359 LKALAEEPAQPTDAQ 373
             A+  EP  P  A+
Sbjct: 349 SLAMTTEPVAPVAAR 363


>gi|289064560|gb|ADC80600.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Populus tremula x Populus alba]
 gi|289064562|gb|ADC80601.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Populus tremula x Populus alba]
          Length = 99

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 69/99 (69%), Positives = 82/99 (82%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAA+LAKDG R+VTLVIPWLSL HQK VYP NITF SP E + YVR WL++R  
Sbjct: 1   AVNPLFRAAFLAKDGSRKVTLVIPWLSLQHQKLVYPNNITFTSPSEHQVYVRQWLQERIS 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F+  F  +FYP KFA DK+SIL+VGDI+E+IPDE+AD+A
Sbjct: 61  FSPDFSIQFYPAKFAVDKRSILSVGDISEVIPDEDADVA 99


>gi|414589659|tpg|DAA40230.1| TPA: hypothetical protein ZEAMMB73_725439 [Zea mays]
          Length = 364

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 69/135 (51%), Positives = 90/135 (66%)

Query: 258 LAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTA 317
           L G ++D+YG+GED  ++Q  AK+L L +  + GRDHAD      KVF NPS +DV+CT 
Sbjct: 173 LEGFKLDVYGSGEDSQEVQSTAKRLDLSLNFFKGRDHADNSLCGCKVFKNPSISDVLCTT 232

Query: 318 TAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQL 377
           TAEALAMGK V+CA HPSN+FF  FPNC TY     FV    +A+  EP   T  Q + L
Sbjct: 233 TAEALAMGKFVICAEHPSNEFFMSFPNCLTYKTSEEFVARVKEAMDREPQPLTPEQRYNL 292

Query: 378 SWESATERFLQVAEL 392
           S E+ATERF++ ++L
Sbjct: 293 SREAATERFMEYSDL 307


>gi|356551464|ref|XP_003544095.1| PREDICTED: uncharacterized protein LOC100807412 [Glycine max]
          Length = 329

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 63/84 (75%), Positives = 76/84 (90%)

Query: 95  KFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYL 154
           +F+ DK+S LAVGDI+EIIPD+ ADIAVLEEPEHLTW+HHGKRWK KFR V+GI+HTNYL
Sbjct: 241 QFSRDKRSSLAVGDISEIIPDKVADIAVLEEPEHLTWYHHGKRWKTKFRLVIGIIHTNYL 300

Query: 155 EYVKREKNDRLQAFLLEFVNSWLA 178
           EYVKREKN  +QAFLL+++NSW+ 
Sbjct: 301 EYVKREKNGTMQAFLLKYLNSWVV 324


>gi|297602698|ref|NP_001052767.2| Os04g0416900 [Oryza sativa Japonica Group]
 gi|255675447|dbj|BAF14681.2| Os04g0416900 [Oryza sativa Japonica Group]
          Length = 165

 Score =  134 bits (338), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 61/116 (52%), Positives = 82/116 (70%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           +   ++H+AI TTASLPW+TGTA+NPLFRAAYLA+  +++VTLV+PWL    Q+ VYP N
Sbjct: 33  LSDNRRHVAIVTTASLPWMTGTAINPLFRAAYLARSTKQKVTLVVPWLCKSDQELVYPNN 92

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDE 116
           ITF+SP+EQE Y+R WL++R GF + F   FYPGK      S   V + T   P++
Sbjct: 93  ITFSSPEEQENYIRNWLQERLGFEANFKISFYPGKVKIQLSSDFDVFNHTLFFPND 148


>gi|163797716|ref|ZP_02191664.1| hypothetical protein BAL199_08073 [alpha proteobacterium BAL199]
 gi|159177064|gb|EDP61627.1| hypothetical protein BAL199_08073 [alpha proteobacterium BAL199]
          Length = 329

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/334 (31%), Positives = 161/334 (48%), Gaps = 26/334 (7%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPK 67
           +A+ T+A+LPW TG +   L+ A  LA  G  RV  VIPWL+   Q + + G++ FA+P+
Sbjct: 10  VAVVTSAALPWRTGPSFFSLWHACGLADLG-LRVAYVIPWLAPASQARAW-GSVRFATPE 67

Query: 68  EQEAYVRWWLE--DRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEE 125
           +Q A++    E  D  G    F  R +   FA   + I+ + D+    P   A +  L E
Sbjct: 68  DQYAWLATEAERIDCPGRPVYFCYRSW---FAPVIRGIVPLEDVFGATPPARAYM--LNE 122

Query: 126 PEHLTWFHHGK-RWKAKFRFVVGIVHTNYLEYV--KREKNDRLQAFLLEFVNSWLARVHC 182
           PEHL W+   + R +     V G+V TNY  YV   R    RL + L+   +  L R   
Sbjct: 123 PEHLCWYPWTRSRQRIPADRVAGLVMTNYEYYVGQMRVPGARLLSRLVARYHRHLIRSRT 182

Query: 183 HKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSK 242
             V+ LS A          NVH      +     ++    +G      G Y+IG+ +W K
Sbjct: 183 DVVVPLSPAVP----LDGANVHEARITGVLTAYTRVPPVGDGG-----GVYFIGKTIWEK 233

Query: 243 GYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDY 302
           G++ L+ +        + + +D+YG G D   IQ  A++    VR +   +    +  +Y
Sbjct: 234 GFDTLIEIAC-----RSAVPVDVYGTGPDAPAIQVLARERGATVRFHGPTESPWSVLGEY 288

Query: 303 KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
           +VFLNPS ++ +CT TAEAL  G+ VV    P N
Sbjct: 289 RVFLNPSLSESMCTTTAEALVAGRHVVLPVCPGN 322


>gi|289064564|gb|ADC80602.1| digalactosyldiacylglycerol synthase 1-like protein, partial [Eperua
           falcata]
          Length = 99

 Score =  130 bits (327), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 54/99 (54%), Positives = 78/99 (78%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAAYL++  +++VTL++PWL    Q+ VYPGN+TF+SP++QE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAYLSQSAKQKVTLLVPWLCKSDQELVYPGNLTFSSPEDQENYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPGKF+ +++SI+  GD ++ IP ++ADIA
Sbjct: 61  FKADFRISFYPGKFSKERRSIIPTGDTSQFIPSKDADIA 99


>gi|118592311|ref|ZP_01549703.1| hypothetical protein SIAM614_25826 [Stappia aggregata IAM 12614]
 gi|118434969|gb|EAV41618.1| hypothetical protein SIAM614_25826 [Stappia aggregata IAM 12614]
          Length = 384

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 112/390 (28%), Positives = 179/390 (45%), Gaps = 26/390 (6%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPK 67
            A+ T+A+LPW+TG +   L+ A  L   G  RV  V+PWL    Q+ ++ G   FA  +
Sbjct: 16  FAVLTSAALPWMTGPSFISLWHACGLTALG-YRVVYVLPWLDAHSQQHLW-GEARFADFE 73

Query: 68  EQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPE 127
           EQ  +++  L+   G     + R Y  ++ A   SI+ + D+    P   A   V  EPE
Sbjct: 74  EQLTWLQSELQS-FGPYKLPECRPYRARYVAGMGSIVPMEDVYRAAP--PARCLVASEPE 130

Query: 128 HLTWFHHGKRWKA-KFRFVVGIVHTNYLEYVKRE--KNDRLQAFLLEFVNSWLARVHCHK 184
           HL W+      K  +    +G+  T+Y  Y++          A L+ +++    R+    
Sbjct: 131 HLCWYPATTGRKGIRADKTIGLCMTDYETYIRMSGLPFPNSLARLVSYLHGRALRLRIDL 190

Query: 185 VIRLSAA-TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
            + LS A T       V  V GV P + ++     E         T+G Y++G  +W KG
Sbjct: 191 PLSLSPALTLPGVTMPVERVTGVMPGYAQVPLVTEE---------TEGIYFLGAFLWEKG 241

Query: 244 YEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGRDHADPIFHDY 302
            ++L  +        AG  +D+ G G D  + +  A+K    +R   P R     I   Y
Sbjct: 242 LDDLARI-----AARAGRSIDVIGGGRDEAEFRAFARKEGADLRFLGPNRRFWSDIGR-Y 295

Query: 303 KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKAL 362
           ++ +NPS ++++CTATA+AL  G+ V+  + P N  +K +PN   Y   +G +EA   AL
Sbjct: 296 RIMVNPSRSEILCTATADALVAGRHVILPDCPGNLPYKAYPNAHFYTELDGALEALDYAL 355

Query: 363 AEEPAQPTDAQTHQLSWESATERFLQVAEL 392
              P  P  A+     W SA  R   +A L
Sbjct: 356 KTVPEPPVAAR-EDFDWMSACRRLAGLAGL 384


>gi|289064578|gb|ADC80609.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Tachigali melinonii]
 gi|289584351|gb|ADD11005.1| digalactosyldiacylglycerol synthase 1-like protein [Tachigali
           melinonii]
          Length = 99

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 55/99 (55%), Positives = 76/99 (76%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAAYL++  E+ VTL++PWL    Q+ VYP N+TF+SP+EQE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAYLSQSAEQIVTLLVPWLCKSDQELVYPSNLTFSSPEEQEVYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPGKF+ +++SI+  GD ++ IP +EAD+A
Sbjct: 61  FKADFKVSFYPGKFSKERRSIIPAGDTSQFIPSKEADVA 99


>gi|289064566|gb|ADC80603.1| digalactosyldiacylglycerol synthase 1-like protein, partial [Eperua
           grandiflora]
          Length = 99

 Score =  129 bits (323), Expect = 5e-27,   Method: Composition-based stats.
 Identities = 53/98 (54%), Positives = 77/98 (78%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAAYL++  +++VTL++PWL    Q+ VYPGN+TF+SP++QE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAYLSQSAKQKVTLLVPWLCKSDQELVYPGNLTFSSPEDQENYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADI 120
           F + F   FYPGKF+ +++SI+  GD ++ IP ++ADI
Sbjct: 61  FKADFRISFYPGKFSKERRSIIPTGDTSQFIPSKDADI 98


>gi|289064576|gb|ADC80608.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Bauhinia purpurea]
 gi|289584347|gb|ADD11003.1| digalactosyldiacylglycerol synthase 1-like protein [Bauhinia
           purpurea]
          Length = 99

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 75/99 (75%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAAYL+   +++VTL++PWL    Q+ VYP NITF+SP+EQE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAYLSHSAKKKVTLLVPWLCKSDQELVYPSNITFSSPEEQELYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPGKF+ +++S++  GD ++ IP  +ADIA
Sbjct: 61  FKADFKISFYPGKFSKERRSVIPTGDTSQFIPSRDADIA 99


>gi|289064574|gb|ADC80607.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Bauhinia purpurea]
          Length = 99

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 75/99 (75%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAAYL+   +++VTL++PWL    Q+ VYP NITF+SP+EQE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAYLSHSAKKKVTLLVPWLCKSDQELVYPSNITFSSPEEQELYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPGKF+ +++S++  GD ++ IP  +ADIA
Sbjct: 61  FRADFKISFYPGKFSKERRSVIPTGDTSQFIPSRDADIA 99


>gi|289584349|gb|ADD11004.1| digalactosyldiacylglycerol synthase 1-like protein [Eperua falcata]
          Length = 99

 Score =  126 bits (317), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 51/99 (51%), Positives = 77/99 (77%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAAYL++  ++++TL++PWL    Q+ VYPGN+ F+SP++QE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAYLSQSAKQKITLLVPWLCKWDQELVYPGNLNFSSPEDQENYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPGKF+ +++SI+  GD ++ IP +++DIA
Sbjct: 61  FKADFRISFYPGKFSKERRSIIPTGDTSQFIPSKDSDIA 99


>gi|289064572|gb|ADC80606.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Bauhinia purpurea]
          Length = 99

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 75/99 (75%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAAYL+   +++VTL++PWL    Q+ VYP NITF+SP+EQE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAYLSHSAKKKVTLLVPWLCKSDQELVYPSNITFSSPEEQELYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPGKF+ +++S++  GD ++ IP  +AD+A
Sbjct: 61  FKADFKISFYPGKFSKERRSVIPTGDTSQFIPSRDADVA 99


>gi|289064568|gb|ADC80604.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Bauhinia guianensis]
          Length = 99

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 75/99 (75%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAA+L+   +++VTL++PWL    Q+ VYP N+TF+SP+EQE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAFLSHSAKKKVTLLVPWLCKSDQELVYPSNLTFSSPEEQELYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPG+F+ +++SI+  GD ++ IP  +ADIA
Sbjct: 61  FKADFKISFYPGRFSKERRSIIPTGDTSQFIPSRDADIA 99


>gi|289064570|gb|ADC80605.1| digalactosyldiacylglycerol synthase 1-like protein, partial
           [Bauhinia guianensis]
          Length = 99

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 75/99 (75%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAA+L+   +++VTL++PWL    Q+ VYP N+TF+SP+EQE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAFLSHSAKKKVTLLVPWLRKSDQELVYPSNLTFSSPEEQELYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPG+F+ +++SI+  GD ++ IP  +ADIA
Sbjct: 61  FKADFKISFYPGRFSKERRSIIPTGDTSQFIPSRDADIA 99


>gi|289584345|gb|ADD11002.1| digalactosyldiacylglycerol synthase 1-like protein [Bauhinia
           guianensis]
          Length = 99

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 75/99 (75%)

Query: 23  AVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTG 82
           AVNPLFRAA+L+   +++VTL++PWL    Q+ VYP N+TF+SP+EQE Y+R WLE+R G
Sbjct: 1   AVNPLFRAAFLSHSAKKKVTLLVPWLCKSDQELVYPSNLTFSSPEEQELYIRNWLEERIG 60

Query: 83  FTSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIA 121
           F + F   FYPG+F+ +++SI+  GD ++ IP  +ADIA
Sbjct: 61  FKADFKIPFYPGRFSKERRSIIPTGDTSQFIPSRDADIA 99


>gi|147832609|emb|CAN68228.1| hypothetical protein VITISV_027578 [Vitis vinifera]
          Length = 596

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 71/105 (67%), Gaps = 9/105 (8%)

Query: 119 DIAVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLA 178
           DIA+LEEPEHL W+++GKRW  KF  VVG+VHTNYLEY+K EKN  LQ FL+ F      
Sbjct: 427 DIAILEEPEHLNWYNYGKRWNDKFNRVVGVVHTNYLEYIKTEKNKALQVFLVIF------ 480

Query: 179 RVHCHKVIRLSA---ATQEYPNSIVCNVHGVNPKFLEIGEKKMEQ 220
           R+  H   R  A     + YP S++ NV+ VNPKF++IGEK  E+
Sbjct: 481 RLQEHNATRFFAFMLLPKIYPKSVISNVNDVNPKFMKIGEKVAEE 525


>gi|397615283|gb|EJK63336.1| hypothetical protein THAOC_16015 [Thalassiosira oceanica]
          Length = 578

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 102/206 (49%), Gaps = 36/206 (17%)

Query: 4   KQQHIAIFTTASLPWLTGTAVNPLFRAAYLA---KDGERRVTLVIPWLSLI-HQKQVYPG 59
           K ++I I TT +LPW+TGTAVNP+ RAAYL+   K     VT+++PW+     QK++Y  
Sbjct: 374 KSKNIWIVTTGALPWMTGTAVNPMLRAAYLSTGRKQAGGSVTIMLPWVERPDDQKRIYGE 433

Query: 60  NITFASPKEQEAYVRWWLEDRTGF---TSTFDTRFYPGKFAADKKSILAVGDITEIIPDE 116
              FA P++QE ++R WL +       +   + R+Y       + S+ ++GDI  +IP  
Sbjct: 434 EAKFAGPEDQEQFIRKWLRETANMKDASEELNIRWYTAWQEVLENSLYSMGDIIGLIP-- 491

Query: 117 EADIAVLEEPEHLTWFH------------------------HGKRWKAKFRFVVGIVHTN 152
              +++L      + F+                         G+ W +KF+ VVGI+HTN
Sbjct: 492 ---VSLLFGYISTSLFYADARNGFTYAGRRVRHMRTRRARAPGENWTSKFKHVVGIIHTN 548

Query: 153 YLEYVKREKNDRLQAFLLEFVNSWLA 178
           Y  Y   +    ++A  +   + W+A
Sbjct: 549 YFVYATEQPAAFIRASFVCMKSPWVA 574


>gi|224001572|ref|XP_002290458.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973880|gb|EED92210.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 963

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/100 (46%), Positives = 65/100 (65%), Gaps = 1/100 (1%)

Query: 289 YPGR-DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRT 347
           +PG  DHA  I   +K+FLNPST++V+CT +AEALAMGK V+   HPSN+FF QF NC  
Sbjct: 798 FPGVIDHAQLISVPHKIFLNPSTSEVLCTTSAEALAMGKFVILPKHPSNEFFLQFTNCLA 857

Query: 348 YDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFL 387
           Y+      E    AL  +P   ++ + H+ +WE+AT+R +
Sbjct: 858 YETLEECAEKMKWALERDPTPLSEEERHKFTWEAATDRLM 897



 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 56/172 (32%), Positives = 89/172 (51%), Gaps = 22/172 (12%)

Query: 138 WKAKFRFVVGIVHTNYLEYVKRE--KNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEY 195
           W  +F+FVVGIVHTNY  Y ++       + A  +  V++   R +CH+VI+LS     +
Sbjct: 414 WTHRFQFVVGIVHTNYEAYARQYGIGASLIAAPTIGAVSALAIRAYCHQVIKLSDTLPSF 473

Query: 196 P--NSIVCNVHGVNPKFLEIG-------EKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEE 246
                  CNVHGV  +FLE G        ++    +   K      Y+IG++VW+KG++ 
Sbjct: 474 APGKECTCNVHGVRKEFLEGGIVDYKALAEEEAANETTAKEAPAAVYFIGKLVWAKGFDL 533

Query: 247 LLGLLNIYHKELAG-LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADP 297
           +L + +I+ K+     E+D+YG G D           K +VR + GR+H+ P
Sbjct: 534 MLEVQDIFKKKNGDYFEIDVYGGGPD----------EKSIVRAFHGRNHSSP 575



 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 77/180 (42%), Gaps = 55/180 (30%)

Query: 8   IAIFTTASLPWLTGTAVNPLFRAAYLAK--DGERR------------------------- 40
           IA+ TTA++PW TGTAVNPL RA YL +  + ERR                         
Sbjct: 151 IAVVTTAAMPWRTGTAVNPLLRALYLVRFQEEERRKAELKLLETSNANAATTSTDDSNVN 210

Query: 41  --------------------VTLVIPWL-SLIHQKQVYPGNITFASP----KEQEAYVRW 75
                               V LVIPWL S   + ++Y   +TF +     KEQ  ++R 
Sbjct: 211 TNATTTSTTTTAATTKKQGQVALVIPWLESPSERTKLYGPTLTFPNGPPGMKEQSDWIRS 270

Query: 76  WLEDRTGF---TSTFDTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWF 132
           +   R G           +YP  + A   SI    D+   IP E  D+A+LEEPEHL WF
Sbjct: 271 YSSTRCGMKHEAHLLKIIWYPAFYVAGFGSIFPKVDLCNFIPKELVDVAILEEPEHLNWF 330


>gi|441477321|dbj|BAM75420.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477323|dbj|BAM75421.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477325|dbj|BAM75422.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477327|dbj|BAM75423.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477329|dbj|BAM75424.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477331|dbj|BAM75425.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477333|dbj|BAM75426.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477335|dbj|BAM75427.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477337|dbj|BAM75428.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477339|dbj|BAM75429.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477341|dbj|BAM75430.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
 gi|441477343|dbj|BAM75431.1| digalactosyldiacylglycerol synthase 2, partial [Phyllodoce
           nipponica]
          Length = 53

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 36/53 (67%), Positives = 47/53 (88%)

Query: 132 FHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHK 184
           +HHGKRWK KF  VVG+VHTNYLEYV+REKN ++QAFLL+++NSW+  ++CHK
Sbjct: 1   YHHGKRWKTKFHLVVGVVHTNYLEYVRREKNGQVQAFLLKYINSWVVSIYCHK 53


>gi|302143898|emb|CBI23003.3| unnamed protein product [Vitis vinifera]
          Length = 92

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 35/51 (68%), Positives = 43/51 (84%)

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYI 235
           +IRLSAATQ+ P SI+CNV GV+PKFLEIG++K E QQN + AFTKG Y+I
Sbjct: 19  LIRLSAATQDLPRSIICNVDGVHPKFLEIGKRKKEHQQNDDDAFTKGVYHI 69


>gi|242095340|ref|XP_002438160.1| hypothetical protein SORBIDRAFT_10g008965 [Sorghum bicolor]
 gi|241916383|gb|EER89527.1| hypothetical protein SORBIDRAFT_10g008965 [Sorghum bicolor]
          Length = 67

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 34/59 (57%), Positives = 44/59 (74%)

Query: 237 RMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHA 295
           +MV SKGY ELL LL+ Y  +L GLE+DLYG+GED D++  +AK+L L   V+PGRD A
Sbjct: 1   KMVRSKGYRELLDLLSKYQSKLVGLEVDLYGSGEDSDEVCESAKRLSLAANVHPGRDQA 59


>gi|143636048|gb|ABO93349.1| digalactosyl-diacylglycerol synthase type 2 [Vigna unguiculata]
          Length = 81

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 367 AQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRHLESESLNSKRIIEDAFGYL 426
           AQPT+A+  +LSWE+AT RFL+   +   +  + S++ S  + + SLN ++ +++A  Y+
Sbjct: 1   AQPTNAERRELSWEAATNRFLKAVGMDKQLDKRLSRNSSVFMAA-SLNLQQTVDEASAYV 59

Query: 427 HYVASGFETSRRALGAIPGSLQ 448
           H+VASGFE SRR  GAIP +LQ
Sbjct: 60  HHVASGFEVSRRFFGAIPHTLQ 81


>gi|414883306|tpg|DAA59320.1| TPA: hypothetical protein ZEAMMB73_375795, partial [Zea mays]
          Length = 89

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 33/69 (47%), Positives = 42/69 (60%)

Query: 324 MGKIVVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESAT 383
           MGK V+CA HPSN+FF  F  C TY     FV    +A+  EP   T  Q + LSWE+AT
Sbjct: 1   MGKFVICAEHPSNEFFMSFLKCLTYKTSEEFVARVKEAMDREPQPLTPEQRYNLSWETAT 60

Query: 384 ERFLQVAEL 392
           ERF++  +L
Sbjct: 61  ERFMEYLDL 69


>gi|414078441|ref|YP_006997759.1| glycosyl transferase group 1 protein [Anabaena sp. 90]
 gi|413971857|gb|AFW95946.1| glycosyl transferase group 1 protein [Anabaena sp. 90]
          Length = 419

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 94/389 (24%), Positives = 156/389 (40%), Gaps = 63/389 (16%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M  K   IA+FT    P+LTG +V    R  +L K G   V L+ P ++  + KQV  GN
Sbjct: 1   MHNKPLRIALFTGLYAPFLTGVSVAVHQRVHWLLKQG-HEVFLIHPEINDKYPKQV--GN 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVG--DITEIIPDEEA 118
              +  +E E+++ +         S++     P  F       L+      T+++   + 
Sbjct: 58  RPMSGIEELESFINF---------SSYSFPTEPLIFYKSLPQPLSYRHWSDTKLLNHFQP 108

Query: 119 DIAVLEEPEHLTWFHHG-----------KRWKAKFRFVVGIVHTNYLEYVKREKNDR--- 164
           DI V+EE   +   + G           K  K     ++ I HT+ + Y++    +    
Sbjct: 109 DIIVVEEAPQMRGLYSGFLQGYGRSVGVKYAKKTNTPIISIFHTDIIAYIQYYFGNMFFN 168

Query: 165 -----LQAFLLEFVNSW-----LARVHCHKVIRLSAATQEY-P-NSIVC-NVHGVNPKFL 211
                L   + +F  ++      +R    K  +L A   EY P   I C   H  N  + 
Sbjct: 169 LVRPLLPILVKQFSETYDLSLFSSREQLTKYQKLKAQRSEYLPYQGIDCEKFHPRNISYD 228

Query: 212 EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGED 271
            I + K                ++GR+   K   +LL +  +   ++  + + + G+G  
Sbjct: 229 PIPDDKRPT-----------ILFVGRITAEKNVNQLLDIYPLIADKIPNVHLVIVGSGPL 277

Query: 272 FDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIV 328
             +I+R A+K    V ++ G  H   +   F    +F+NPS T+  CT T EALA G  V
Sbjct: 278 DAEIRRRAEKFPNGVTIW-GESHGKELLGWFAKADIFVNPSVTENFCTTTNEALASGTPV 336

Query: 329 VCANHPSNDFFKQFPNCRTYDGRNGFVEA 357
           V    PS          +   G NGF+ A
Sbjct: 337 VAVVAPSTA-------EQVISGVNGFLAA 358


>gi|440682684|ref|YP_007157479.1| glycosyl transferase group 1 [Anabaena cylindrica PCC 7122]
 gi|428679803|gb|AFZ58569.1| glycosyl transferase group 1 [Anabaena cylindrica PCC 7122]
          Length = 422

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 101/449 (22%), Positives = 176/449 (39%), Gaps = 74/449 (16%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M  K   IA+FT    P+LTG +V    R  +L + G   V L+ P ++ ++ KQV  GN
Sbjct: 1   MKNKPLRIALFTGLFAPFLTGVSVAVHQRVRWLLEQGHE-VFLIHPEINNLYPKQV--GN 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDI-----TEIIPD 115
                 +E E +              F +  +P K     KS+    +      T+++  
Sbjct: 58  RPMPGLEELETF------------PNFSSFAFPTKPLILYKSLPQPLNYRHWNDTKLLER 105

Query: 116 EEADIAVLEEPEHLTWFHH----------GKRWKAKFRF-VVGIVHTNYLEYVK------ 158
            + DI ++EE   +   +           G ++  K +  ++ + HT+ + Y++      
Sbjct: 106 FQPDIIIVEEAAQMRGLYSFYLQGYGRPVGVKYAKKTKTPIISVFHTDIVAYIRYYLGDV 165

Query: 159 --REKNDRLQAFLLEFVNSW-----LARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFL 211
             +  N  +   + +F  S+      +R    K  +L   T EY      N    +P+  
Sbjct: 166 FFKLVNPIIPLLVKQFTQSYNLNLFPSREQLSKYQKLKCQTAEYLPYQGINCEKFHPR-- 223

Query: 212 EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGED 271
            I    + + +     F      +GR+   K   +LL    +   ++  + M + G+G  
Sbjct: 224 NICYNPIPEDKRPTILF------VGRITAEKNVTQLLDAYPLIAAKIPDVHMVIIGSGPL 277

Query: 272 FDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIV 328
             +I+R A+     V ++ G  H   +   F    VF+NPS T+  CT   EALA G  +
Sbjct: 278 DQEIRRRAQNYPSGVTIW-GESHGTELLGWFARADVFINPSVTENFCTTNNEALASGTPL 336

Query: 329 VCANHPS---------NDFFKQFPNCRTYDGR--NGFVEATLKALAEEPAQPTDAQTHQL 377
           V A  PS         N F  Q  N + +  +         LKA   + A+P+     + 
Sbjct: 337 VAAIAPSTAEQVIPGHNGFLAQPNNPKDFAQKIITILENPELKAQLSQQARPS---ILEF 393

Query: 378 SWESATERF----LQVAELVGDVVTKRSK 402
            W   T++F     Q+ E+   V    SK
Sbjct: 394 DWSVCTQKFEDRLYQLVEMPTKVELGTSK 422


>gi|427732472|ref|YP_007078709.1| glycosyltransferase [Nostoc sp. PCC 7524]
 gi|427368391|gb|AFY51112.1| glycosyltransferase [Nostoc sp. PCC 7524]
          Length = 424

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 102/433 (23%), Positives = 167/433 (38%), Gaps = 78/433 (18%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M++ +  IA+FT    P+LTG +V    RA +L + G   V LV P  +  + KQV  G+
Sbjct: 1   MNKPKLRIALFTGLYTPFLTGVSVAVHQRARWLLQQGHE-VFLVHPQFNDQYPKQV--GD 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTS-TFDTRFYPGKFAADKKSILAVG--DITEIIPDEE 117
                  E ++Y          F+S +F T   P  F       L+    D TE++ + +
Sbjct: 58  RPMPGLDELQSY--------PNFSSYSFPTE--PLIFYKSLPQPLSYRHWDDTELLIEFQ 107

Query: 118 ADIAVLEEPEHL--------------TWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKND 163
            DI V+EE   +                  + +R K     ++ + HT+ + Y++    D
Sbjct: 108 PDIIVVEESAQMRGVYSALIQGYGRPVGLQYARRTKTP---IISVFHTDIVAYIRYYLGD 164

Query: 164 RLQAFLLEFV-------------NSWLARVHCHKVIRLSAATQEYP--NSIVC-NVHGVN 207
               FL   +             N + +R    K   L     EY     I C   H  N
Sbjct: 165 VFFNFLRPIIPLLVKQFSQVYNLNVFPSREQLIKYQNLGCHPSEYVPYQGIDCEKFHPRN 224

Query: 208 PKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYG 267
            ++  I +                  ++GR+   K   +L+    +   ++  + + + G
Sbjct: 225 IRYNPIPDDD-----------RPTILFVGRITAEKNVLQLIDAYPLIAAQIPNVHLVIVG 273

Query: 268 NGEDFDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAM 324
           +G    +I+R A K    + ++ G  H   +   F    VF+NPS T+  CT T EALA 
Sbjct: 274 SGPLDAEIRRRAAKFGSGITIW-GESHGTELLGWFARADVFVNPSVTENFCTTTNEALAS 332

Query: 325 GKIVVCANHPS---------NDFFKQFPNCRTYDGR--NGFVEATLKALAEEPAQPTDAQ 373
           G  VV A  PS         N    +  N   +  +  +      LKA   E A+P+   
Sbjct: 333 GTPVVAALVPSTIEQVSSGHNGLLSEPNNPADFAQKVIDILSNPELKAAMAEQARPS--- 389

Query: 374 THQLSWESATERF 386
             Q  W +  E+F
Sbjct: 390 ILQYDWSACMEKF 402


>gi|186681749|ref|YP_001864945.1| group 1 glycosyl transferase [Nostoc punctiforme PCC 73102]
 gi|186464201|gb|ACC80002.1| glycosyl transferase, group 1 [Nostoc punctiforme PCC 73102]
          Length = 429

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 98/433 (22%), Positives = 172/433 (39%), Gaps = 76/433 (17%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M+++   IAIFT    P+++G ++    R  +L + G          + LIH     PG 
Sbjct: 1   MNKQPLRIAIFTALYAPFMSGVSMGVHQRVRWLLQQGHE--------VFLIH-----PG- 46

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSI-----LAVGDITEIIPD 115
           + +  PKE        LE+   F S F +  YP K     KS+           T+++ +
Sbjct: 47  VNYLYPKEVRDRPMLRLEEVQSF-SNFSSYAYPSKPMIFYKSLPEPLHYRYWSDTKLLLN 105

Query: 116 EEADIAVLEEPEHLTWFH------HGKRWKAKFRF-----VVGIVHTNYLEYVKREKNDR 164
            + DI V+EE   +  ++      +G+    ++       ++ I HT+ + Y++    + 
Sbjct: 106 FQPDIVVVEEAPGMIGYYSLFLQGYGRPIGTEYTKRTGTPIISIFHTDIVAYIRYYIGEW 165

Query: 165 LQAFLLEF-------------VNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFL 211
              F+                VN + ++    K   L+A   EY      +    +P+ +
Sbjct: 166 AFRFIRPIIPILIKQLSEAYDVNYFSSQEQLTKYKLLAAQRLEYLPYQGIDCEKFHPRNI 225

Query: 212 EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGED 271
                  +Q+            ++GR+   K  ++LL    +   ++  + + + G+G  
Sbjct: 226 CYDPIPGDQRPT--------LLFVGRISAEKNVKQLLDAFVLIAAKIPDVHLVIVGSGPQ 277

Query: 272 FDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIV 328
             +I+R AKK +  + V+ G  H   +   F    VF+N S T+  CT T EALA G  V
Sbjct: 278 DKEIRRRAKKFESGITVW-GESHGTELLGWFARADVFVNASVTENFCTTTNEALASGTPV 336

Query: 329 VCANHPSNDFFKQFPNCRTYDGRNGF-------------VEATLK--ALAEEPAQPTDAQ 373
           V    PS           +  GRNGF             V A L+  AL EE ++     
Sbjct: 337 VAVLAPSTSE-----QISSSSGRNGFLAEPNNPKDFAMKVIAILENPALKEEMSRQARLY 391

Query: 374 THQLSWESATERF 386
             +  W +  E+F
Sbjct: 392 IQKFDWSACMEKF 404


>gi|75908795|ref|YP_323091.1| group 1 glycosyl transferase [Anabaena variabilis ATCC 29413]
 gi|75702520|gb|ABA22196.1| Glycosyl transferase, group 1 [Anabaena variabilis ATCC 29413]
          Length = 421

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 90/400 (22%), Positives = 153/400 (38%), Gaps = 65/400 (16%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M + +  IA+F+    P+LTG +V    R  +L K G   V LV P ++    K+V  G+
Sbjct: 1   MQKSKLRIALFSGLYAPFLTGVSVAVHQRVHWLLKQG-HEVLLVHPQINDKFPKKV--GS 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVG--DITEIIPDEEA 118
              A  +E + +  +         S+F     P  F       L+      T+++ D + 
Sbjct: 58  RPMAGLEELKPFPNF---------SSFAFPTQPLIFYKSLPQPLSYRHWSDTKLLSDFQP 108

Query: 119 DIAVLEEPEHLTWFHHG-----------KRWKAKFRFVVGIVHTNYLEYVKREKNDRLQA 167
           DI V+EE   +   + G           K  K     ++ + HT+ + Y+K    D   +
Sbjct: 109 DIIVVEEAAQMRGLYSGLLQGYGRPVGVKYAKRTHTPIISVFHTDIVAYIKYYFGDIFFS 168

Query: 168 FLLEFV-------------NSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIG 214
            +   +             N + +R    K   L     EY      +    +P+   I 
Sbjct: 169 LVRPLIPLLVKQFSDNYDLNVFPSREQLKKYHNLKCQRSEYVPYQGIDCEKFHPR--NIT 226

Query: 215 EKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQ 274
              +         F      +GR+   K   +L+ +  +   ++  + + + G+G   ++
Sbjct: 227 HNPIPDDSRPTLLF------VGRITAEKNVTQLIDMYPLIAAKIPDVHLVIIGSGPLDEE 280

Query: 275 IQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA 331
           I+R  +K    +  + G  H   +   F    VF+NPS T+  CT T EALA G  VV A
Sbjct: 281 IRRRVQKFPSGITFW-GESHGTELLGWFARADVFVNPSVTENFCTTTNEALASGTPVVAA 339

Query: 332 NHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTD 371
             PS        + + + G NG        L  EP  P D
Sbjct: 340 IAPST-------SEQVFPGLNG--------LLAEPNSPED 364


>gi|354566593|ref|ZP_08985765.1| glycosyl transferase group 1 [Fischerella sp. JSC-11]
 gi|353545609|gb|EHC15060.1| glycosyl transferase group 1 [Fischerella sp. JSC-11]
          Length = 426

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 85/387 (21%), Positives = 157/387 (40%), Gaps = 63/387 (16%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M ++   IA+ T    P+LTG ++    R  +L + G   V L+ P ++  + ++V  G+
Sbjct: 5   MKKQPLRIALITGLYAPFLTGVSIAVHQRVRWLLQQG-HEVFLIHPEINDQYPEEV--GS 61

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDI-----TEIIPD 115
              A  KE +++ +            F +  +P +     KS+           T+++ +
Sbjct: 62  RPMAGLKELQSFPK------------FSSYAFPTQPLIFYKSLPQPLHYRHWSDTKLLEN 109

Query: 116 EEADIAVLEEPEHLTWFH------HGKRWKAKF-----RFVVGIVHTNYLEYVKREKNDR 164
            + DI V+EE   +  F+      +G+    ++       ++ + HT+ + Y++    ++
Sbjct: 110 FKPDIVVVEEAPQMRGFYSLYLQGYGRPVGIEYAKKTGTPIISLFHTDIVAYIRYYMGNQ 169

Query: 165 LQAFLLEFV-------------NSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFL 211
             +F+   +             N + +R    K   L +   EY      +    NP   
Sbjct: 170 FFSFIRPMLPMLIKQFSEAYDYNFFSSREQLEKYKSLKSQRSEYLPYQGIDCQKFNPA-- 227

Query: 212 EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGED 271
            I    M         F      +GR+   K   +LL +  I   ++    + + G+G  
Sbjct: 228 NISYNPMPDDHRPTLLF------VGRIAPEKSVIQLLDIYPIIAAKIPDAHLVIVGSGPQ 281

Query: 272 FDQIQRAAKKLKLVVRVYPGRDHADPIFHDY---KVFLNPSTTDVVCTATAEALAMGKIV 328
            ++I++ A K    V V+ G  H   +   Y    VF+NPS T+  CT+T EALA G  V
Sbjct: 282 QEEIRQRATKFGKGVTVW-GESHGTELLGWYARADVFVNPSATENFCTSTNEALASGTPV 340

Query: 329 VCANHPSNDFFKQFPNCRTYDGRNGFV 355
           V    PS        + +   GRNGF+
Sbjct: 341 VAVKAPST-------SEQVVPGRNGFL 360


>gi|226490906|ref|NP_001142346.1| uncharacterized protein LOC100274517 [Zea mays]
 gi|194708330|gb|ACF88249.1| unknown [Zea mays]
          Length = 142

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 6/119 (5%)

Query: 338 FFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVV 397
           F   FPNC TY     FV    +A+  EP Q T  Q + LSWE+ATERF++ ++L   V+
Sbjct: 3   FSCHFPNCLTYKTSEEFVARVKEAMDREPQQLTPEQRYNLSWEAATERFMEYSDL-DKVL 61

Query: 398 TKRSKSPSRHLESESLNSKR-----IIEDAFGYLHYVASGFETSRRALGAIPGSLQPDE 451
              +  P +  + ++  + +     I++      H   +G E  R A  AIPG+   D+
Sbjct: 62  NNEAAQPKQGRKRKNKRTSQPNLFDIMDGGLALAHRCLTGNEVLRLATEAIPGTRDYDK 120


>gi|119513458|ref|ZP_01632485.1| Glycosyl transferase, group 1 [Nodularia spumigena CCY9414]
 gi|119461890|gb|EAW42900.1| Glycosyl transferase, group 1 [Nodularia spumigena CCY9414]
          Length = 420

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 100/438 (22%), Positives = 179/438 (40%), Gaps = 68/438 (15%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M+++   IA+FT    P+LTG +V    R  +L + G   V LV P ++       YP N
Sbjct: 1   MNKQPLRIALFTGLYAPFLTGVSVAVHQRVRWLLEQG-HEVFLVHPEIN-----DQYPKN 54

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDI-----TEIIPD 115
           +  + P          LE+   F   F    +P K     KS+           T+++  
Sbjct: 55  VG-SRPMPG-------LEELQCFPK-FSAYAFPTKPLIFYKSLPQPLHYRHWSDTKLLEK 105

Query: 116 EEADIAVLEEPEHLTWFH------HGKRWKAKF-----RFVVGIVHTNYLEYVKREKND- 163
            + DI V+EE   +  F+      +G+    ++       ++ I HT+ + Y++    + 
Sbjct: 106 FQPDIVVVEEAPQMRGFYSMFLQGYGRPIGVEYAQKTGTPIISIFHTDIVAYIQYYLGNH 165

Query: 164 --RLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQ 221
              L   ++ F+      V+   +    A   +Y N + C   G    +  I  +K   +
Sbjct: 166 VFNLMRPIIPFLVKQSTEVYDRNLFPSKAQLSKY-NELNCQ-RGEYVPYQGIDCEKFHPR 223

Query: 222 QNGNKAFTKG----AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQR 277
              +            ++GR+   K   +L+ +  +   ++  + + + G+G   ++++R
Sbjct: 224 NIIHNPIPDDNRPTILFVGRITAEKNVTQLIDMFPLIAAKIPDVHLVIIGSGPLDEELRR 283

Query: 278 AAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP 334
            ++K + +     G  H   +   F    +F+NPS T+  CT   EALA G  VV    P
Sbjct: 284 ESQKFEGITMW--GESHGTELLGWFARADIFVNPSATENFCTTNNEALASGTPVVAVVAP 341

Query: 335 SNDFFKQFPNCRTYDGRNGF-------------VEATLK--ALAEEPAQPTDAQTHQLSW 379
           S   F+Q      + GRNGF             V A L+   L EE +Q       +  W
Sbjct: 342 ST--FEQ-----VFPGRNGFLAEPNNPQDFADKVVAILENSQLKEEISQQARPSILKYDW 394

Query: 380 ESATERFLQ-VAELVGDV 396
            + TE+F + + ELV +V
Sbjct: 395 SACTEKFEEKLYELVQNV 412


>gi|427706170|ref|YP_007048547.1| group 1 glycosyl transferase [Nostoc sp. PCC 7107]
 gi|427358675|gb|AFY41397.1| glycosyl transferase group 1 [Nostoc sp. PCC 7107]
          Length = 422

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 86/397 (21%), Positives = 162/397 (40%), Gaps = 59/397 (14%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M ++Q  IA+FT    P+LTG +V    R  +L + G  +V LV P ++       YP N
Sbjct: 1   MQKQQLRIALFTGLYPPFLTGVSVAVHQRVRWLLQQG-HQVLLVHPEIN-----NKYPKN 54

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDI-----TEIIPD 115
           ++   P          L++   F + F +  +P +     KS+    +      T+++ +
Sbjct: 55  VS-DRPMP-------GLDELKAFPA-FSSYTFPTEPLIFYKSLPQPLNYRHWSDTKLLAE 105

Query: 116 EEADIAVLEEPEHLTWFHHG-----------KRWKAKFRFVVGIVHTNYLEYVKREKNDR 164
            + DI V+EE   +   + G           +  K     ++ + HT+ + Y+K    ++
Sbjct: 106 FQPDIIVVEEAAQMRGLYSGFLQGYGRPVGVEYAKQTHTPIISVFHTDIVAYIKYYLGNQ 165

Query: 165 ---LQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQ 221
              L + ++ F+    +  +   +        +Y N + C      P +  I  +K   Q
Sbjct: 166 FFSLISLIVPFLTQQFSEAYDVNLFPSRELLNKYKN-LKCQRSEYVP-YQGIDSEKFHPQ 223

Query: 222 QNGNKAFTKG----AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQR 277
              +    +       ++GR+   K   +L+    +   ++  + + + G+G    +I++
Sbjct: 224 NIIHNPIPQDHRPTLLFVGRVTAEKNVTQLIDAFPLIAAKVPDVHLVIIGSGPLDQEIRK 283

Query: 278 AAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP 334
            A+K    + V+ G  H   +   F    +F+NPS T+  CT   EALA G  +V AN  
Sbjct: 284 RAEKFPNGITVW-GESHGKELLGWFARADIFVNPSVTENFCTTNMEALASGTPLVAANAG 342

Query: 335 SNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTD 371
            N         + + G NGF+         EP  P D
Sbjct: 343 GNL-------EQVFPGINGFL--------SEPNNPVD 364


>gi|298492374|ref|YP_003722551.1| group 1 glycosyl transferase ['Nostoc azollae' 0708]
 gi|298234292|gb|ADI65428.1| glycosyl transferase group 1 ['Nostoc azollae' 0708]
          Length = 421

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 97/439 (22%), Positives = 173/439 (39%), Gaps = 73/439 (16%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M ++   IA+FT    P+LTG +V    R  +L + G + V L+ P ++  + K V   N
Sbjct: 1   MKKQPLRIALFTGLFPPFLTGVSVAVHQRVRWLLEQGHQ-VFLIHPEINNQYPKIV--SN 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTS-TFDTRFYPGKFAADKKSILAVG--DITEIIPDEE 117
                 +E +++         GF+S  F T+  P  F       L       T+++   +
Sbjct: 58  RPMPGLEELQSF--------PGFSSYAFPTQ--PLIFYKSLPQPLNYRHWSDTKLLEKFQ 107

Query: 118 ADIAVLEEPEHLTWFH--------------HGKRWKAKFRFVVGIVHTNYLEYVKREKND 163
            DI ++EE   +   +              + KR K     ++ + HT+ + Y++    D
Sbjct: 108 PDIIIVEEAAQMRGLYSIFLQGYGRPIGVEYAKRTKTP---IISVFHTDIVAYIRYYLGD 164

Query: 164 RLQAFLLEFV-------------NSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKF 210
              + L   V             N + +R    K  +L     EY      N    +P+ 
Sbjct: 165 VFFSLLRPIVPLLVKQFSNAYSLNLFPSREQLSKYQKLKCKRVEYVPYQGINCEKFHPRN 224

Query: 211 LEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGE 270
           +    +  +Q+            ++GR+   K   +LL        ++  + + + G+G 
Sbjct: 225 ICYDPRPNDQRPT--------ILFVGRITAEKNVTQLLDAFPFIAAKIPDVHLVIIGSGP 276

Query: 271 DFDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKI 327
              +I+R A+     V ++ G  H   +   F    VF+NPS T+  CT   EALA G  
Sbjct: 277 LDQEIRRRAQAFPFGVTIW-GESHGTELLGWFARADVFVNPSVTENFCTTNNEALASGTP 335

Query: 328 VVCANHPS---------NDFFKQFPNCRTYDGR--NGFVEATLKALAEEPAQPTDAQTHQ 376
           VV A  PS         N F  Q  N + +  +       + LKA   + ++P+     +
Sbjct: 336 VVAAIAPSTPEQVIIGYNGFLAQPNNPKDFAEKIIKILENSDLKAQLSKQSRPS---ILE 392

Query: 377 LSWESATERFL-QVAELVG 394
             W   +E+F  ++ +LVG
Sbjct: 393 FDWSVCSEKFEDKLYQLVG 411


>gi|282900678|ref|ZP_06308620.1| Glycosyl transferase, group 1 (heterocyst glycolipid formation
           protein, HglT) [Cylindrospermopsis raciborskii CS-505]
 gi|281194478|gb|EFA69433.1| Glycosyl transferase, group 1 (heterocyst glycolipid formation
           protein, HglT) [Cylindrospermopsis raciborskii CS-505]
          Length = 418

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 82/370 (22%), Positives = 145/370 (39%), Gaps = 62/370 (16%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M++K   IA+FT    P+LTG +V    R  +L + G   V L+ P ++  + +QV  GN
Sbjct: 1   MEKKPLRIALFTGLYAPFLTGVSVAVHQRVHWLLEQGH-EVFLIHPQINKQYSQQV--GN 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDI-----TEIIPD 115
                  E + +            S F +  +P +     KS+    +      T+++ +
Sbjct: 58  RPMLGLSELQIF------------SNFSSYAFPTQPLIFYKSLPQPLNYRHWNDTKLLEN 105

Query: 116 EEADIAVLEEPEHLTWFH--------------HGKRWKAKFRFVVGIVHTNYLEYVKREK 161
            + DI V+EE   +   +              + KR K     ++ + HT+ + Y+K   
Sbjct: 106 FQPDIIVVEEAAQMRGLYSIFLQGYGRAVGVDYAKRTKTP---IISVFHTDIVAYIKYYL 162

Query: 162 NDRLQAFLLEFV-------------NSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNP 208
            D L   + + +             N + +R    K  +L     EY      N    +P
Sbjct: 163 GDILFRLMSQIIPLLVRQFSDQYTLNIFPSREQLKKYQKLQCQRCEYVPYQGINCEKFHP 222

Query: 209 KFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGN 268
           +   I    +   +     F      +GR+   K   ++L    +   ++  + + + G+
Sbjct: 223 R--NICHDPIPNDKRPTILF------VGRITAEKNVTQILEAYPLIAAKIPDVHLVIVGS 274

Query: 269 GEDFDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMG 325
           G    +I+  A+K    V ++ G  H   +   F    VF+NPS T+  CT   EALA G
Sbjct: 275 GPLDQEIRHRAQKFADGVTIW-GESHGTELLGWFARADVFVNPSVTENFCTTNNEALASG 333

Query: 326 KIVVCANHPS 335
             VV    PS
Sbjct: 334 TPVVAVMAPS 343


>gi|186680599|ref|YP_001863795.1| group 1 glycosyl transferase [Nostoc punctiforme PCC 73102]
 gi|186463051|gb|ACC78852.1| glycosyl transferase, group 1 [Nostoc punctiforme PCC 73102]
          Length = 425

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 85/403 (21%), Positives = 156/403 (38%), Gaps = 71/403 (17%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M +++  IA+FT    P+LTG +V    R  +L + G   V LV P +S  + K V  G+
Sbjct: 1   MTKQKLRIALFTGLYAPFLTGVSVAVHQRVRWLLEQGHE-VFLVHPQISDRYPKNV--GD 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVGDI-----TEIIPD 115
                  E E++              F    +P +     KS+    +      T+++  
Sbjct: 58  RPMPGLDEIESF------------PNFSAYAFPTEPLLFYKSLPQPLNYRHWSDTKLLDK 105

Query: 116 EEADIAVLEEPEHL----TWFHHGKRWKAKFRF-------VVGIVHTNYLEYVKREKNDR 164
            + DI ++EE   +    ++F  G        +       ++ + HT+ + Y+K    D+
Sbjct: 106 FKPDIILVEEAAQMRGLYSFFLQGYGRPIGVEYAKRTGTPIISLFHTDIVAYIKYYFGDK 165

Query: 165 LQAFLLEFV-------------NSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFL 211
               +   +             N + ++    K   L     EY         G++ +  
Sbjct: 166 FFNLVRPIIPTLVKQFSESYDFNYFSSKEQLTKYEELKCQRAEY-----VPYQGIDCEKF 220

Query: 212 EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGED 271
                      N N+       ++GR+   K   +LL +  +   ++  + + + G+G  
Sbjct: 221 HPRNICYNPIPNDNRP---TLLFVGRITPEKNVNQLLDIFPVIAAKIPDVHLVIVGSGPM 277

Query: 272 FDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIV 328
            ++I+  AKK    + ++ G  H   +   F    +F+NPS T+  CT   EALA G  +
Sbjct: 278 DEEIRERAKKFGSGITIW-GESHGTELLGWFARADIFVNPSVTENFCTTNNEALASGTPL 336

Query: 329 VCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTD 371
           V    PS        + + + GRNGF+         +P  PTD
Sbjct: 337 VAVVAPST-------SEQVFPGRNGFL--------AQPNNPTD 364


>gi|17232833|ref|NP_489381.1| hypothetical protein all5341 [Nostoc sp. PCC 7120]
 gi|17134480|dbj|BAB77040.1| all5341 [Nostoc sp. PCC 7120]
          Length = 421

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 86/387 (22%), Positives = 150/387 (38%), Gaps = 63/387 (16%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M + +  IA+F+    P+LTG +V    R  +L K G   V LV P ++    K V  G+
Sbjct: 1   MQKSKLRIALFSGLYAPFLTGVSVAVHQRVHWLLKQG-HEVLLVHPQINDKFPKNV--GS 57

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVG--DITEIIPDEEA 118
              A  +E + +  +         S+F     P  F       L+      ++++ D + 
Sbjct: 58  RPMAGLEELKPFPNF---------SSFAFPTQPLIFYKSLPQPLSYRHWSDSKLLSDFQP 108

Query: 119 DIAVLEEPEHLTWFHHG-----------KRWKAKFRFVVGIVHTNYLEYVKREKNDRLQA 167
           DI V+EE   +   + G           K  K     ++ + HT+ + Y+K    D   +
Sbjct: 109 DIIVVEEAAQMRGLYSGLLQGYGRPVGVKYAKRTHTPIISVFHTDIVAYIKYYFGDIFFS 168

Query: 168 FLLEFV-------------NSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIG 214
            +   +             N + +R    K   L     EY         G++       
Sbjct: 169 LVRPLIPLLVKQFSDNYDLNVFPSREQLKKYHNLKCQRSEY-----VPYQGID------C 217

Query: 215 EKKMEQQQNGNKAFTKG---AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGED 271
           EK   +    N     G     ++GR+   K   +L+ +  +   ++  + + + G+G  
Sbjct: 218 EKFHPRNIIHNPIPNDGRPTLLFVGRITAEKNVTQLIDMYPLIAAKIPDVHLVIIGSGPL 277

Query: 272 FDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIV 328
            ++I+R  +K    +  + G  H   +   F    VF+NPS T+  CT T EALA G  V
Sbjct: 278 DEEIRRRVQKFPSGITFW-GESHGTELLGWFARADVFVNPSVTENFCTTTNEALASGTPV 336

Query: 329 VCANHPSNDFFKQFPNCRTYDGRNGFV 355
           V A  PS        + + + G NG +
Sbjct: 337 VAAIAPST-------SEQVFPGMNGLL 356


>gi|427727548|ref|YP_007073785.1| glycosyltransferase [Nostoc sp. PCC 7524]
 gi|427363467|gb|AFY46188.1| glycosyltransferase [Nostoc sp. PCC 7524]
          Length = 424

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 86/403 (21%), Positives = 163/403 (40%), Gaps = 71/403 (17%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGN 60
           M++++  IAI T +  P+L+G ++    R  +L + G   V L+ P ++ ++ K+++  +
Sbjct: 3   MNKQRLRIAIITGSYAPFLSGISIGVHQRVRWLLQQGHE-VFLLHPEVNELYPKEIFKRS 61

Query: 61  ITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILA-----VGDITEIIPD 115
           I               +E+   F S F +  YP +     KS+         + T+++ D
Sbjct: 62  IP-------------GIEEVQSF-SNFSSYAYPTQPLIFYKSLPVPLHYHYWNDTKLLMD 107

Query: 116 EEADIAVLEEPEHLTWFH------HGKRWKAKFRFVVG-----IVHTNYLEYVK------ 158
            E DI V+EE   +  ++      +G+   +++  + G     + HT+ + Y++      
Sbjct: 108 FEPDIVVVEEAPAMIGYYSLFLQGYGRLIGSEYAKLTGTPTITLFHTDIVAYIRYYLGNL 167

Query: 159 --REKNDRLQAFLLEF-----VNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFL 211
             R     +   +  F     VN + ++    +  ++     EY      +    +PK  
Sbjct: 168 SFRLIRPIIPTLVKHFSEAYDVNYFSSQEQLAQYRKMKLQRGEYLPYQGVDCKKFHPK-- 225

Query: 212 EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGED 271
            I    +         F      +GR+   K   +LL    +    +  + + L G+G  
Sbjct: 226 NICYDPIPHDHRPTLLF------VGRISVEKNVRQLLKAYPLIAARIPDVHLVLVGSGPL 279

Query: 272 FDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIV 328
           + +IQ  A+K    + ++ G  H   +   F    +F+NPS ++  CT   EALA G  V
Sbjct: 280 YSEIQARAQKFGSGITMW-GESHGTELLGWFARADIFVNPSVSENFCTTNNEALASGTPV 338

Query: 329 VCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTD 371
           V A  PS          + + G NGF+         EP  P D
Sbjct: 339 VAALAPSTA-------EQVFPGYNGFL--------AEPNNPAD 366


>gi|427720731|ref|YP_007068725.1| group 1 glycosyl transferase [Calothrix sp. PCC 7507]
 gi|427353167|gb|AFY35891.1| glycosyl transferase group 1 [Calothrix sp. PCC 7507]
          Length = 414

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 97/428 (22%), Positives = 170/428 (39%), Gaps = 72/428 (16%)

Query: 3   RKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLSLIHQKQVYPGNIT 62
           +K   IA+FT+   P+LTG +V    R  +L + G   V L+ P ++  + K+V  GN  
Sbjct: 2   KKPLRIALFTSLYAPFLTGVSVAVHQRVHWLLQQG-HEVFLIHPEINNQYPKKV--GNRP 58

Query: 63  FASPKEQEAYVRWWLEDRTGFTS-TFDTR---FYPGKFAADKKSILAVGDITEIIPDEEA 118
            +  +E        L+    F+S  F T+   FY  K      S     D T+++   + 
Sbjct: 59  MSGLEE--------LQHFPNFSSYAFPTKPLIFY--KSLPQPLSYRHWSD-TKLLEKFQP 107

Query: 119 DIAVLEEPEHLTWFH------HGKRWKAKF-----RFVVGIVHTNYLEYVKREKNDR--- 164
           DI ++EE   L  F+      +G+    ++       +V I HT+ + Y++    +    
Sbjct: 108 DIIIVEEAPQLRGFYSLYLQGYGRPVGVEYAKRTGTPIVSIFHTDIIAYIQYYLGNVFFN 167

Query: 165 -----LQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKME 219
                L   + +F  ++   +   + +     T +   S      G++ +          
Sbjct: 168 LLRPILPILVKQFSETYNFNLFPSRELLAKYKTWKCQRSEYVPYQGIDCEKFHPQNIAYN 227

Query: 220 QQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAA 279
                N+       ++GR+   K   +LL    +   ++  + + + G+G    QI++ A
Sbjct: 228 PIPEDNRP---TMLFVGRITAEKNVTQLLDAFPLIAAQIPDVHLVIIGSGPFDQQIRQRA 284

Query: 280 KKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
           ++ K  V V+ G  H   +   F    +F+NPS T+  CT   EALA G  VV AN   N
Sbjct: 285 QQFKSGVTVW-GESHGLELLGWFARADIFVNPSVTENFCTTNMEALASGTPVVAANAGGN 343

Query: 337 DFFKQFPNCRTYDGRNGFVEAT------------------LKALAEEPAQPTDAQTHQLS 378
                    +   G NGF+ A                   LKA   + A+P+     +  
Sbjct: 344 P-------EQVISGVNGFLAAPNNPIDFAQKVIAILQNPELKAKVSQQARPS---ILEFD 393

Query: 379 WESATERF 386
           W S  E+F
Sbjct: 394 WSSCMEKF 401


>gi|305856063|gb|ADM67872.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Pieris japonica var. taiwanensis]
          Length = 34

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 23/34 (67%), Positives = 30/34 (88%)

Query: 167 AFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIV 200
           AFLL+++NSW+  ++CHKVIRLSAATQ+ P SIV
Sbjct: 1   AFLLKYINSWVVSIYCHKVIRLSAATQDLPRSIV 34


>gi|305856071|gb|ADM67876.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron nakaharae]
          Length = 33

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 22/33 (66%), Positives = 29/33 (87%)

Query: 167 AFLLEFVNSWLARVHCHKVIRLSAATQEYPNSI 199
           AFLL+++NSW+  ++CHKVIRLSAATQ+ P SI
Sbjct: 1   AFLLKYINSWVTSIYCHKVIRLSAATQDLPRSI 33


>gi|305856065|gb|ADM67873.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron ovatum]
 gi|305856067|gb|ADM67874.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron ellipticum]
 gi|305856073|gb|ADM67877.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron noriakianum]
 gi|305856075|gb|ADM67878.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron kanehirai]
 gi|305856077|gb|ADM67879.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron simsii]
 gi|305856079|gb|ADM67880.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron oldhamii]
 gi|305856081|gb|ADM67881.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron rubropilosum]
 gi|305856083|gb|ADM67882.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron breviperulatum]
 gi|305856085|gb|ADM67883.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron kawakamii]
 gi|305856087|gb|ADM67884.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron formosanum]
 gi|305856089|gb|ADM67885.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron hyperythrum]
 gi|305856091|gb|ADM67886.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron rubropunctatum]
 gi|305856093|gb|ADM67887.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron morii]
 gi|305856095|gb|ADM67888.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron pseudochrysanthum]
 gi|338807900|gb|AEJ07674.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron simsii]
 gi|338807902|gb|AEJ07675.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron chihsinianum]
 gi|338807904|gb|AEJ07676.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron seniavinii]
 gi|338807906|gb|AEJ07677.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron wiltonii]
 gi|338807910|gb|AEJ07679.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron scabrum]
          Length = 33

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 22/33 (66%), Positives = 29/33 (87%)

Query: 167 AFLLEFVNSWLARVHCHKVIRLSAATQEYPNSI 199
           AFLL+++NSW+  ++CHKVIRLSAATQ+ P SI
Sbjct: 1   AFLLKYINSWVISIYCHKVIRLSAATQDLPRSI 33


>gi|427735736|ref|YP_007055280.1| glycosyltransferase [Rivularia sp. PCC 7116]
 gi|427370777|gb|AFY54733.1| glycosyltransferase [Rivularia sp. PCC 7116]
          Length = 422

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/284 (22%), Positives = 117/284 (41%), Gaps = 41/284 (14%)

Query: 110 TEIIPDEEADIAVLEEPEHL----TWFHHG-------KRWKAKFRFVVGIVHTNYLEYVK 158
           T+++ D   DI ++EE   +    + F  G       +  K K   V+ + HT+ + Y++
Sbjct: 100 TKLLADFNPDIVLVEEAAQMRGLYSLFLQGYGRPVAVEYAKKKGTPVISLFHTDIVAYIQ 159

Query: 159 REKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYP--NSIVCN------VHGVNPKF 210
               ++L + +  F+   + +      + L ++ ++     ++ C         GV+ K 
Sbjct: 160 YYLGNQLFSLIRPFLPKLIKQFSESYDLSLFSSKEQLAKYQAMKCQRSEYFPYQGVDCKK 219

Query: 211 LEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGE 270
                       N N+       ++GR+   K   +LL +  I   ++  + + + G+G 
Sbjct: 220 FNPENTIFNPIPNDNRP---TLLFVGRITAEKNVNQLLDIYPIVADKIKDVHLVIVGSGP 276

Query: 271 DFDQIQRAAKKLKLVVRVYPGRDHADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKI 327
              +I++ A++    V V+ G  H + +   F    +F+NPS T+  CT   EALA G  
Sbjct: 277 IEAEIKQRAQQFGSGVTVW-GESHGEELLGWFARADLFVNPSITENFCTTNNEALASGTP 335

Query: 328 VVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQPTD 371
           VV A  PS          +   G NGF+         EP  P D
Sbjct: 336 VVAAIAPSTA-------EQVTPGSNGFL--------AEPNNPAD 364


>gi|338807908|gb|AEJ07678.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron simsii]
          Length = 33

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 21/33 (63%), Positives = 29/33 (87%)

Query: 167 AFLLEFVNSWLARVHCHKVIRLSAATQEYPNSI 199
           +FLL+++NSW+  ++CHKVIRLSAATQ+ P SI
Sbjct: 1   SFLLKYINSWVISIYCHKVIRLSAATQDLPRSI 33


>gi|393784495|ref|ZP_10372658.1| hypothetical protein HMPREF1071_03526 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665476|gb|EIY59000.1| hypothetical protein HMPREF1071_03526 [Bacteroides salyersiae
           CL02T12C01]
          Length = 381

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 51/99 (51%), Gaps = 2/99 (2%)

Query: 229 TKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VV 286
           TK    IGR+   KG++ L+ + N+  K + G E+++YG+G D+  +    K+  L   +
Sbjct: 203 TKKVLAIGRLCPQKGFDMLVDIWNVVCKNIEGWELNIYGDGPDYAALDSKIKQYSLHKTI 262

Query: 287 RVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            +YP   +   ++ +  +F  PS  +    A  EA++ G
Sbjct: 263 HMYPATTNIQSVYLNSSIFCFPSRYEGFSMALMEAMSYG 301


>gi|305856069|gb|ADM67875.1| putative UDP-galactose-dependent digalactosyldiacylglycerol
           synthase [Rhododendron mariesii]
          Length = 33

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 21/33 (63%), Positives = 28/33 (84%)

Query: 167 AFLLEFVNSWLARVHCHKVIRLSAATQEYPNSI 199
            FLL+++NSW+  ++CHKVIRLSAATQ+ P SI
Sbjct: 1   GFLLKYINSWVISIYCHKVIRLSAATQDLPRSI 33


>gi|428298425|ref|YP_007136731.1| group 1 glycosyl transferase [Calothrix sp. PCC 6303]
 gi|428234969|gb|AFZ00759.1| glycosyl transferase group 1 [Calothrix sp. PCC 6303]
          Length = 417

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 81/395 (20%), Positives = 154/395 (38%), Gaps = 79/395 (20%)

Query: 1   MDRKQQHIAIFTTASLPWLTGTAVNPLFRAAYLAKDGERRVTLVIPWLS----------- 49
           M ++   IAI T+   P+LTG +V    R  +L K G   V L+ P ++           
Sbjct: 1   MQKQPLRIAIITSLYAPFLTGVSVAVHQRVNWLLKQG-HEVFLIHPEINDKFPQKVSNRP 59

Query: 50  --LIHQKQVYPGNITFASPKEQEAYVRWWLEDRTGFTSTFDTRFYPGKFAADKKSILAVG 107
              I + Q +P   +FA P E   + +             + R +               
Sbjct: 60  MPSIEENQAFPNFYSFAFPTEPLIFYK-------SLPQPLNHRHWSD------------- 99

Query: 108 DITEIIPDEEADIAVLEEPEHLTWFH------HGKRWKAKF-----RFVVGIVHTNYLEY 156
             T+++ + + DI ++EE   L  F+      +G+     +       ++ + HT+ + Y
Sbjct: 100 --TKLLENFKPDIIIVEEAPQLRGFYSLFLQGYGRPVGIDYARKTGTPIISLFHTDIVAY 157

Query: 157 VKREKNDRL------------QAFLLEF-VNSWLARVHCHKVIRLSAATQEYPNSIVCNV 203
           ++    +++            + F  E+  N + +R    K   L +   EY      + 
Sbjct: 158 IRYYLGNQIFNLISLILPGIIKKFSEEYDANFFPSREELKKYQDLKSQRSEYLPYQGVDC 217

Query: 204 HGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEM 263
               PK   I    + + +     F      +GR+   K   +L+ +  +   ++  + +
Sbjct: 218 QKFQPK--NICYNPIPEDKRPTLLF------VGRITAEKNVTQLIDIYPLVAAKIPDVHL 269

Query: 264 DLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDY---KVFLNPSTTDVVCTATAE 320
            + G+G    +++  A+K    + ++ G  H + +   Y    +F+NPS T+  CTA  E
Sbjct: 270 VIIGSGPLDAEMRTRAEKYVPGITIW-GESHGNELLGWYTRADLFVNPSLTENFCTANNE 328

Query: 321 ALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFV 355
           ALA G  VV A  PS          +   G+NGF+
Sbjct: 329 ALASGTPVVAAKAPST-------AEQVIVGKNGFL 356


>gi|119510304|ref|ZP_01629440.1| Glycosyl transferase, group 1 [Nodularia spumigena CCY9414]
 gi|119465048|gb|EAW45949.1| Glycosyl transferase, group 1 [Nodularia spumigena CCY9414]
          Length = 439

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 19/141 (13%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           ++GR+   K   +LL +  I   ++  + + + G+G   ++I R A+K K  V ++ G  
Sbjct: 240 FVGRITPEKNVTQLLEIYPIVAAKIPDIHLVIVGSGPQDEKILRLAQKFKSGVTIW-GES 298

Query: 294 HADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDG 350
           H   +   F    +F+NPS  +  CT   EALA G  VV    PS        + + +  
Sbjct: 299 HGTELLGWFARADIFVNPSFVENFCTTNNEALASGTPVVTTFAPST-------SEQIFPS 351

Query: 351 RNGFVEATLKALAEEPAQPTD 371
            NGF+         EP  P D
Sbjct: 352 VNGFL--------AEPNNPKD 364


>gi|434404545|ref|YP_007147430.1| glycosyltransferase [Cylindrospermum stagnale PCC 7417]
 gi|428258800|gb|AFZ24750.1| glycosyltransferase [Cylindrospermum stagnale PCC 7417]
          Length = 420

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 19/141 (13%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           ++GR+   K   +LL    +   ++  + + + G+G   ++I++ A+     V V+ G  
Sbjct: 240 FVGRITAEKNVNQLLDAYPLIAAKIPDVHLVIIGSGPMDEEIRQRAQTFGSGVTVW-GES 298

Query: 294 HADPI---FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDG 350
           H   +   F    VF+NPS T+  CT   EALA G  +V    PS          + Y G
Sbjct: 299 HGTELLGWFASADVFVNPSATENFCTTNNEALASGTPLVAVVAPST-------AEQVYPG 351

Query: 351 RNGFVEATLKALAEEPAQPTD 371
           +NGF+         EP  P D
Sbjct: 352 KNGFL--------AEPNNPKD 364


>gi|282162813|ref|YP_003355198.1| putative glycosyltransferase [Methanocella paludicola SANAE]
 gi|282155127|dbj|BAI60215.1| putative glycosyltransferase [Methanocella paludicola SANAE]
          Length = 392

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 85/188 (45%), Gaps = 30/188 (15%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL----VVRVY 289
           ++GRMV  KG   LL  +    +   G ++ L G G   D ++R A++L L    V   Y
Sbjct: 213 FVGRMVPQKGIGYLLEAMPCVLRTHPGTKLVLVGRGSLCDGLRRRARELGLDGNAVFSGY 272

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
              D     +    +F+ PST +      AEA+A GK VVC +   +   ++  +    D
Sbjct: 273 VEEDELKEAYGACDLFILPSTVEPFGIVVAEAMASGKPVVCTD---SGGVREIVD----D 325

Query: 350 GRNGFV--EATLKALAEE--------------PAQPTDAQTHQLSWESATERFLQVAELV 393
           G +GFV    + +ALAE+                +  +A   +L W+   E+  +V E  
Sbjct: 326 GSSGFVVPPGSPEALAEKINTLLSDARLRADMGLKGREAAERRLDWKLIAEKTKRVYE-- 383

Query: 394 GDVVTKRS 401
            DV+++R+
Sbjct: 384 -DVLSRRA 390


>gi|293373007|ref|ZP_06619376.1| glycosyltransferase, group 1 family protein [Bacteroides ovatus SD
           CMC 3f]
 gi|292632075|gb|EFF50684.1| glycosyltransferase, group 1 family protein [Bacteroides ovatus SD
           CMC 3f]
          Length = 369

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/220 (25%), Positives = 105/220 (47%), Gaps = 21/220 (9%)

Query: 127 EHLTWFH-HGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHC--- 182
           E++   H HG R  +   F+   +H   +  V      + Q+FL++ + +W  ++ C   
Sbjct: 79  ENIQLIHAHGSRAASNVAFIARKLHIPMVYTVHGWSFHQDQSFLIKSLRAWSEKIICKLS 138

Query: 183 HKVIRLSAAT----QE---YPNSIVCNVHGVNPK-FLEIGEKKMEQQQNGNKAFTKGAYY 234
            KVI +S +     QE     N++V   +G+N K F   G +K  +++ G   FT+  + 
Sbjct: 139 RKVICVSESNWITGQETFGLKNALVIE-NGINLKRFNPDGNRKDLRKEFG---FTEDDFV 194

Query: 235 IG---RMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKK--LKLVVRVY 289
           IG   R+   K   + +  + I H++ A ++  L G G+  ++ + A ++  ++  +R  
Sbjct: 195 IGFVSRITLQKAPLDFVKSIAIAHQKDARIKALLVGEGDMEEETKEAIRQNGIEEYIRTS 254

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV 329
           P R+    + H   VF  PS  + +  A  EA+AM K +V
Sbjct: 255 PFRNDVPDVLHAINVFCLPSLWEGLSIALLEAMAMKKALV 294


>gi|432329884|ref|YP_007248027.1| glycosyltransferase [Methanoregula formicicum SMSP]
 gi|432136593|gb|AGB01520.1| glycosyltransferase [Methanoregula formicicum SMSP]
          Length = 384

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 60/127 (47%), Gaps = 11/127 (8%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLV--VRVYPG 291
           ++GR++  K  + L+  +++  KE  G+ + + G+G + + I      L L   VR+Y  
Sbjct: 209 FVGRLIREKNADLLVQAIHLLVKEFPGIRLTIIGDGPERNAIATQVTNLSLEKHVRMYGF 268

Query: 292 -RDHADPI--FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTY 348
            +DH + I      KVF+ PS+ +    A  EAL  G  VV   HP+N        C   
Sbjct: 269 IQDHDEVIAKMKAAKVFVLPSSREGFGIAALEALGCGLPVVTIRHPANAV------CDLI 322

Query: 349 DGRNGFV 355
              NGFV
Sbjct: 323 SEENGFV 329


>gi|427737277|ref|YP_007056821.1| glycosyltransferase [Rivularia sp. PCC 7116]
 gi|427372318|gb|AFY56274.1| glycosyltransferase [Rivularia sp. PCC 7116]
          Length = 445

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 79/177 (44%), Gaps = 26/177 (14%)

Query: 234 YIGRMVWSKGYEELL-GLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL----VVRV 288
           YIGR V  KG + L+    ++  K  A LE  L G+GE    +++    L L    + R 
Sbjct: 247 YIGRFVDWKGIDFLIDAFKSVADKTNAVLE--LVGDGEVRSDLEKQVADLGLENNVIFRG 304

Query: 289 YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN--------HPSNDFFK 340
           +  R+ A  +  +  VF+ PS  +    A  EA+A+G  V+           HP    + 
Sbjct: 305 WLKREEASKLACECDVFVMPSLREAGGNAVLEAMALGLPVIATKWAGPANTLHPDCGIWV 364

Query: 341 QFPNCRTYDGRNGFVEATLKALAEEP--------AQPTDAQTHQLSWESATERFLQV 389
           +  + + +   NGF EA +K LA EP        A P    T+   W+S  +R +++
Sbjct: 365 EPTSIKAF--VNGFSEAMIK-LATEPELRLQMGEAGPKRILTNYFDWDSKVDRIIEI 418


>gi|315231857|ref|YP_004072293.1| glycosyl transferase, group 1 [Thermococcus barophilus MP]
 gi|315184885|gb|ADT85070.1| glycosyl transferase, group 1 [Thermococcus barophilus MP]
          Length = 400

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 47/186 (25%), Positives = 86/186 (46%), Gaps = 16/186 (8%)

Query: 161 KNDRLQAFLLEFVNSWLAR---VHCHKVIRLSAATQEY-------PNSIVCNVHGVNPKF 210
           +++R    L EF N  + +       K+I L    + Y       P  IV   +G++ + 
Sbjct: 139 RSNRWLNLLSEFYNKSIGKWVLSKIDKIIVLDPRAKRYFIKIGVSPEKIVVIPNGIDYES 198

Query: 211 LEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLN-IYHKELAGLEMDLYGNG 269
           L+  ++ +E+ +  +K   K   Y+GR+   KG +EL+   N +  KE   L + + G+G
Sbjct: 199 LQSFKENVEKYKT-HKGNVKTILYVGRLQKEKGVKELILAFNELLKKEKFNLNLLIVGDG 257

Query: 270 EDFDQIQRAAKKLKLVVRV-YPGRDHADPIFHDY---KVFLNPSTTDVVCTATAEALAMG 325
               +++   +KL +  +V + G     P+   Y    VF+ PS  + V TA  EA+A G
Sbjct: 258 PQRQELENLVRKLGIEDKVMFTGYLTGKPLLQAYYSADVFILPSKFEGVPTAILEAMATG 317

Query: 326 KIVVCA 331
             ++  
Sbjct: 318 LPIIAT 323


>gi|312194073|ref|YP_004014134.1| group 1 glycosyl transferase [Frankia sp. EuI1c]
 gi|311225409|gb|ADP78264.1| glycosyl transferase group 1 [Frankia sp. EuI1c]
          Length = 422

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 6/105 (5%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV--- 288
           A + GR++  KG    +  L   H E  G  +D+YG+G D DQI+ A ++L +  RV   
Sbjct: 252 AVFAGRLLAWKGVRLAVAALARPHAE--GWTLDIYGSGPDEDQIRAACEQLGVADRVTLH 309

Query: 289 -YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
               RD            L PS  D      AE LA+G+ V+C +
Sbjct: 310 GQRPRDEVLKALATCDALLFPSMHDSAGWIVAETLALGRPVICLD 354


>gi|345304574|ref|YP_004826476.1| group 1 glycosyl transferase [Rhodothermus marinus SG0.5JP17-172]
 gi|345113807|gb|AEN74639.1| glycosyl transferase group 1 [Rhodothermus marinus SG0.5JP17-172]
          Length = 416

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 11/123 (8%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GR+VW KG + L   +N   KE       + G G    +++  A+  + +   Y  GR
Sbjct: 227 YVGRLVWEKGLDVLAATINRLQKEQVPHRCLIVGEGPARHELE--ARLPEAIFTGYLEGR 284

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN 352
           + A   +    VF  PS T+     T EA+A G   VCA+ P ++   +        GR 
Sbjct: 285 ELAR-AYASADVFFFPSETETFGNVTLEAMASGLPAVCADAPGSNMLIE-------HGRT 336

Query: 353 GFV 355
           GF+
Sbjct: 337 GFL 339


>gi|407712273|ref|YP_006832838.1| group 1 glucosyll transferase [Burkholderia phenoliruptrix BR3459a]
 gi|407234457|gb|AFT84656.1| group 1 glucosyll transferase [Burkholderia phenoliruptrix BR3459a]
          Length = 358

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 71/153 (46%), Gaps = 12/153 (7%)

Query: 256 KELAGLEMDLYGNGEDFDQIQRA----AKKLKLVVRVYPGRDHAD--PIFHDYKVFLNPS 309
           K++   ++ LYG  ED D  QRA    A +L +  RV      +D   +  +  VF+ PS
Sbjct: 210 KQVENAKLTLYGVVEDPDY-QRAVVALATELGIAERVEVAGPRSDVATVLSESSVFVMPS 268

Query: 310 TTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN--GFVEATLKALAEEPA 367
            ++    A  EALA G  +V +  P+  F   FP  +  D  N   + EA + ALA+E  
Sbjct: 269 RSEGHSVAFLEALASGLPIVASRIPAFAFANGFPGVQLVDTDNIRCYAEAIVTALAQERV 328

Query: 368 QPTDAQTHQLSWESATERFLQVAELVGDVVTKR 400
           Q + A    L+     ER+  +A  +   V+ R
Sbjct: 329 QRSLA---GLTLRDTAERYRAIARQICPAVSPR 358


>gi|268318276|ref|YP_003291995.1| group 1 glycosyl transferase [Rhodothermus marinus DSM 4252]
 gi|262335810|gb|ACY49607.1| glycosyl transferase group 1 [Rhodothermus marinus DSM 4252]
          Length = 416

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 11/123 (8%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GR+VW KG + L   +N   KE       + G G    +++  A+  + +   Y  GR
Sbjct: 227 YVGRLVWEKGLDVLAATINRLQKEQVPHRCLIVGEGPARHELE--ARLPEAIFTGYLEGR 284

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN 352
           + A   +    VF  PS T+     T EA+A G   VCA+ P ++   +        GR 
Sbjct: 285 ELAR-AYASADVFFFPSETETFGNVTLEAMASGLPAVCADAPGSNMLIE-------HGRT 336

Query: 353 GFV 355
           GF+
Sbjct: 337 GFL 339


>gi|427394175|ref|ZP_18887677.1| hypothetical protein HMPREF9698_01312 [Alloiococcus otitis ATCC
           51267]
 gi|425730163|gb|EKU93006.1| hypothetical protein HMPREF9698_01312 [Alloiococcus otitis ATCC
           51267]
          Length = 386

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 64/246 (26%), Positives = 103/246 (41%), Gaps = 25/246 (10%)

Query: 135 GKRWKAKFRF-VVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAAT- 192
           G+R+  K+    V   HTN+ +Y+K    D  +  +L F  +W  +  C  V   S+ T 
Sbjct: 100 GRRYAVKYGVPFVASYHTNFDQYLKSYNLDWAKG-ILNFYLNWFHQA-CQAVFAPSSVTA 157

Query: 193 -----QEYPNSIV----CNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKG 243
                Q YPN  +     + H   P  ++   K   Q +   +       Y+GR+   K 
Sbjct: 158 DQLKEQGYPNVKIWPRGVDHHHFKPAKVKDWAKHEAQAKFKLEKNKLTILYVGRLATEKS 217

Query: 244 YEELLGLLN-IYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV--YPGRDHADPIFH 300
            + L+  L  I    L  L++++ G+G    +I+R A++  L + +  +   D+   ++ 
Sbjct: 218 LDVLIDTLKEIPPHLLYKLQVNIVGDGPIRQEIKRQAREFDLPINLLGFQQGDNLSLLYQ 277

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPN---CRTYDGRNGFVE 356
              +F  PS T+       EALA G  V+ A      D  KQ  N   C       G VE
Sbjct: 278 AGDIFFFPSATETFGNVVLEALASGLPVIGAKAGGVKDLVKQAHNGILCPP-----GEVE 332

Query: 357 ATLKAL 362
           A L AL
Sbjct: 333 AFLDAL 338


>gi|300853871|ref|YP_003778855.1| glycosyl transferase family protein [Clostridium ljungdahlii DSM
           13528]
 gi|300433986|gb|ADK13753.1| putative glycosyl transferase [Clostridium ljungdahlii DSM 13528]
          Length = 406

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 75/340 (22%), Positives = 143/340 (42%), Gaps = 58/340 (17%)

Query: 88  DTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGK----RWKAKFR 143
           + +FYP  F  +K     VG +T+I+P +      L+    + W +       +  A+FR
Sbjct: 53  NVKFYPVVFQTEKLPFPVVG-MTDIMPYKSTMYRKLDNDMLIKWKNAFSEVLTKSVAEFR 111

Query: 144 FVVGIVHTNY-LEYVKREK------------NDRLQAFLLEFVNSWLARVHCHKVIRLSA 190
             + I H  + L  + +E              D  Q  L    N+++++ +C K+  ++A
Sbjct: 112 PDIIITHHLWILSAMVKEMFPLIKTIAICHGTDLRQMDLAPNFNNYVSK-YCRKIDNIAA 170

Query: 191 ATQEYPNSIVCNVH-----------GVNPKFLEIG--EKKMEQQQNGNKAFTKGAYYIGR 237
             ++  N I+               G NP        EK M++ +           Y G+
Sbjct: 171 LHEDQKNEIIKKYQIDKNKITVVGVGFNPNIFYTNDTEKNMDKIK---------LIYAGK 221

Query: 238 MVWSKGYEELLGLLNIYHKELAGLEMDLYGN--GEDFDQIQRAAK--KLKLVVRVYPGRD 293
           + ++KG   L+   N    +   +E+ L G+  G  F  I++ A+  +L+++++    + 
Sbjct: 222 LNFAKGIPSLIKSYNKLDIDRNSIELILAGSGTGSQFKAIEKMAEESRLRIILKGSISQK 281

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTYDGRN 352
               +F +  +F+ PS  + +     EALA G ++V  N P   DFF  + N      + 
Sbjct: 282 ELSKLFRESHLFVFPSFFEGLPLVLTEALASGMLIVTTNLPGVKDFFGDYIN------KK 335

Query: 353 GFVE----ATLKALAEEPAQPTDAQTHQLSWESATERFLQ 388
           G +E     +LK+L +EP +  D    +  +  A E  L+
Sbjct: 336 GLIEYVKMPSLKSL-DEPFE-EDLPNFEKEFSKAIENQLE 373


>gi|323524904|ref|YP_004227057.1| group 1 glycosyl transferase [Burkholderia sp. CCGE1001]
 gi|323381906|gb|ADX53997.1| glycosyl transferase group 1 [Burkholderia sp. CCGE1001]
          Length = 357

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 70/153 (45%), Gaps = 12/153 (7%)

Query: 256 KELAGLEMDLYGNGEDFDQIQRA----AKKLKLVVRVYPGRDHAD--PIFHDYKVFLNPS 309
           K++   ++ LYG  ED D  QRA    A +L +  RV      +D   +  +  VF+ PS
Sbjct: 209 KQVENAKLTLYGVVEDPDY-QRAVVALATELGIAERVVVAGPRSDVATVLSESSVFVMPS 267

Query: 310 TTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN--GFVEATLKALAEEPA 367
            ++    A  EALA G  +V +  P+  F   FP  +  D  N   + EA + AL +E  
Sbjct: 268 RSEGHSVAFLEALASGLPIVASRIPAFAFANGFPGVQLVDTDNIRCYAEAIVTALGQERV 327

Query: 368 QPTDAQTHQLSWESATERFLQVAELVGDVVTKR 400
           Q + A    L+     ER+  +A  +   V+ R
Sbjct: 328 QRSLA---GLTLRDTAERYRAIARQICPAVSPR 357


>gi|385206665|ref|ZP_10033533.1| glycosyltransferase [Burkholderia sp. Ch1-1]
 gi|385179003|gb|EIF28279.1| glycosyltransferase [Burkholderia sp. Ch1-1]
          Length = 357

 Score = 46.6 bits (109), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 64/144 (44%), Gaps = 12/144 (8%)

Query: 265 LYGNGEDFDQIQRA----AKKLKLVVRVYPGRDHAD--PIFHDYKVFLNPSTTDVVCTAT 318
           LYG  ED D  QRA    AK+L +  RV       D   +  +  VF+ PS ++    A 
Sbjct: 218 LYGVVEDPDY-QRAVVALAKELGIAERVVVAGPRTDVATVLSESNVFVMPSQSEAHSVAF 276

Query: 319 AEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN--GFVEATLKALAEEPAQPTDAQTHQ 376
            EALA G  +V +  P+  F   FP  +  D  N   + EA + AL +E AQ +      
Sbjct: 277 LEALASGVPIVASKIPAFAFANGFPGVQLIDTDNVQCYAEAVIAALGQERAQRS---LTG 333

Query: 377 LSWESATERFLQVAELVGDVVTKR 400
           L+     +R+  +A  V   V  R
Sbjct: 334 LTLRDTADRYRAIARQVSLAVPAR 357


>gi|91781998|ref|YP_557204.1| glycosyltransferase [Burkholderia xenovorans LB400]
 gi|91685952|gb|ABE29152.1| Putative glycosyltransferase [Burkholderia xenovorans LB400]
          Length = 358

 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 64/144 (44%), Gaps = 12/144 (8%)

Query: 265 LYGNGEDFDQIQRA----AKKLKLVVRVYPGRDHAD--PIFHDYKVFLNPSTTDVVCTAT 318
           LYG  ED D  QRA    AK+L +  RV       D   +  +  VF+ PS ++    A 
Sbjct: 219 LYGVVEDPDY-QRAVVALAKELGIAERVVVAGPRTDVATVLSESNVFVMPSQSEAHSVAF 277

Query: 319 AEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN--GFVEATLKALAEEPAQPTDAQTHQ 376
            EALA G  +V +  P+  F   FP  +  D  N   + EA + AL +E AQ +      
Sbjct: 278 LEALASGVPIVASKIPAFAFANGFPGVQLIDTDNVQCYAEAVIAALGQERAQRS---LTG 334

Query: 377 LSWESATERFLQVAELVGDVVTKR 400
           L+     +R+  +A  V   V  R
Sbjct: 335 LTLRDTADRYRAIARQVSLAVPAR 358


>gi|336251783|ref|YP_004585751.1| glycosyl transferase group 1 [Halopiger xanaduensis SH-6]
 gi|335339707|gb|AEH38945.1| glycosyl transferase group 1 [Halopiger xanaduensis SH-6]
          Length = 378

 Score = 46.6 bits (109), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 53/241 (21%), Positives = 99/241 (41%), Gaps = 31/241 (12%)

Query: 170 LEFVNSWLARV--HCHKVIRLSAATQEYPNSIVC-------------NVHGVNPKFLEIG 214
           L++ + +L R+      V RL+A    YP ++               +   V P  +++ 
Sbjct: 131 LDYWDEYLGRLAPFGKAVERLTAKAPHYPIAVSSVTADRLAELGPSRDAIAVVPNGIDV- 189

Query: 215 EKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQ 274
             ++E        F     Y GR++  K  + LL   +   +    + + + G+G + D+
Sbjct: 190 -DQIETTDPARDGF--DVLYAGRLIEDKHVDLLLEAFDRVAETAPDVTLGIIGDGPERDR 246

Query: 275 IQRAAKKLKLVVRVYPG---RDHADPIFH--DYKVFLNPSTTDVVCTATAEALAMGKIVV 329
           ++R  + L+   RV       ++ D + H     VF +PST +      AEA+A    V+
Sbjct: 247 LERQVRSLEHADRVTMLGFLEEYTDVLAHMRAADVFASPSTREGFGITFAEAMAADCTVI 306

Query: 330 CANHPSNDFFK-----QFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATE 384
            A+HP +   +      F    T D     +E TL+   E PA     +  +  W++ TE
Sbjct: 307 AADHPESAAAEVIGDAGFLASPTADDIARLLERTLE--GERPATDPARRAERFDWDTVTE 364

Query: 385 R 385
           +
Sbjct: 365 Q 365


>gi|255525605|ref|ZP_05392539.1| glycosyl transferase group 1 [Clostridium carboxidivorans P7]
 gi|296185492|ref|ZP_06853902.1| glycosyltransferase, group 1 family protein [Clostridium
           carboxidivorans P7]
 gi|255510695|gb|EET87001.1| glycosyl transferase group 1 [Clostridium carboxidivorans P7]
 gi|296050326|gb|EFG89750.1| glycosyltransferase, group 1 family protein [Clostridium
           carboxidivorans P7]
          Length = 408

 Score = 46.2 bits (108), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 70/316 (22%), Positives = 129/316 (40%), Gaps = 56/316 (17%)

Query: 88  DTRFYPGKFAADKKSILAVGDITEIIPDEEADIAVLEEPEHLTWFHHGKR--WKAKFRFV 145
           + +FYP  F  +      VG +T+I+P +      L E   + W +       K+   F 
Sbjct: 53  NVKFYPVVFETENLPFPVVG-MTDIMPYKSTMYRKLNEDMLIKWKNAFSEVLTKSVMEFK 111

Query: 146 VGIVHTNYL----EYVKR-----------EKNDRLQAFLLEFVNSWLARVHCHKVIRLSA 190
             I+ T++L      VK               D  Q  L    N++++  +C  +  ++A
Sbjct: 112 PDIIITHHLWILSAMVKEMFPLIKTIAICHGTDLRQMDLAPNFNNYVSN-YCKNIDNIAA 170

Query: 191 ATQEYPNSIVCNVH-----------GVNPKFLEIG--EKKMEQQQNGNKAFTKGAYYIGR 237
             ++  N I+               G NP        EK M++ +           Y G+
Sbjct: 171 LHEDQKNEIIKKYQIDKNKITVVGVGFNPNIFYTNDTEKNMDKIK---------LIYAGK 221

Query: 238 MVWSKGYEELLGLLNIYHKELAGLEMDLYGNG--EDFDQIQRAAK--KLKLVVRVYPGRD 293
           + ++KG   L+   N    +   +E+ L G+G    F  I++ A+  +LK++++    ++
Sbjct: 222 LNFAKGIPSLIKAYNKLDIDKNSIELILAGSGTGSQFKAIEKMAEESRLKIILKGSIPQN 281

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTYDGRN 352
               +F +  +F+ PS  + +     EALA G ++V  N P   DFF  + N      + 
Sbjct: 282 ELSKLFRESHLFVFPSFFEGLPLVLTEALASGMLIVTTNLPGVKDFFGDYIN------KK 335

Query: 353 GFVE----ATLKALAE 364
           G +E     +LK+L E
Sbjct: 336 GLIEYVKMPSLKSLDE 351


>gi|257064172|ref|YP_003143844.1| glycosyltransferase [Slackia heliotrinireducens DSM 20476]
 gi|256791825|gb|ACV22495.1| glycosyltransferase [Slackia heliotrinireducens DSM 20476]
          Length = 743

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 51/102 (50%), Gaps = 2/102 (1%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VVR 287
           K A  + R+V +K  ++ +  + +  K + GL +D+YG G     ++  A +  +  +V 
Sbjct: 280 KRAVMVTRLVDTKQVDQAIEAVALAAKSIEGLTLDVYGEGAQMAYLKDMAIQCGVADIVN 339

Query: 288 VYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV 329
                +HA  I  DY V L  S+T+ +C A  E+L  G  V+
Sbjct: 340 FRGYVEHASEIVADYDVSLLTSSTEALCLAIPESLVAGTPVI 381


>gi|414171970|ref|ZP_11426881.1| hypothetical protein HMPREF9695_00527 [Afipia broomeae ATCC 49717]
 gi|410893645|gb|EKS41435.1| hypothetical protein HMPREF9695_00527 [Afipia broomeae ATCC 49717]
          Length = 398

 Score = 46.2 bits (108), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 94/234 (40%), Gaps = 27/234 (11%)

Query: 113 IPDEEADIAVLEEPEHLTW--FHHGKRWKAKFRFVVGIVHTNYLEYVK---------REK 161
           I + + DI  +  P+ L +     G+R K     VV   HT Y  YVK         +  
Sbjct: 101 IEEFDPDIIHIAVPDILGYQALRLGRRLKVP---VVASYHTRYDTYVKFYAPLKLFQKPV 157

Query: 162 NDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKME-Q 220
            + L+ F    V  ++       V+R     Q Y  ++     GV+ +     ++  E +
Sbjct: 158 ENYLRFFYRNCVQVYVPSGSMADVLR----EQGYAENLAAWPRGVDVERFHPAKRSQEWR 213

Query: 221 QQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAK 280
            ++G         ++GR V  KG + L+  LN   ++         G+G      +RA  
Sbjct: 214 ARHGIAPDQVAIVFVGRFVREKGLDLLVDTLNELKRQNVAHRSIAVGDGP-----ERAWL 268

Query: 281 KLKLVVRVYPGRDHADPIFHDY---KVFLNPSTTDVVCTATAEALAMGKIVVCA 331
           + +L   ++PG  H + +   Y    +F  PS T+     T EA+A G   VCA
Sbjct: 269 EERLPDTIFPGFLHGEDLAQAYASSDIFFFPSQTETFGNVTLEAMASGLPAVCA 322


>gi|337286081|ref|YP_004625554.1| group 1 glycosyl transferase [Thermodesulfatator indicus DSM 15286]
 gi|335358909|gb|AEH44590.1| glycosyl transferase group 1 [Thermodesulfatator indicus DSM 15286]
          Length = 371

 Score = 46.2 bits (108), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 4/100 (4%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG- 291
           YIG+++  KG + LL  L+  HKEL+   + + G G   ++++  AKKL L  RV + G 
Sbjct: 200 YIGQLIHRKGLDILLKALS--HKELSDCFLFVVGEGPLKNELKALAKKLGLFSRVAFTGY 257

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA 331
           R+        + VF+ PS  + +     EA+ MGK VV +
Sbjct: 258 REDRLNFLRGFDVFVLPSRLEGIPRCLMEAMGMGKPVVAS 297


>gi|374586857|ref|ZP_09659949.1| glycosyl transferase group 1 [Leptonema illini DSM 21528]
 gi|373875718|gb|EHQ07712.1| glycosyl transferase group 1 [Leptonema illini DSM 21528]
          Length = 442

 Score = 46.2 bits (108), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 43/178 (24%), Positives = 74/178 (41%), Gaps = 26/178 (14%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----Y 289
           ++GR+ + K  E +L  L +  +   G+ +D+YG+G     ++  A+ L L+  V    +
Sbjct: 243 HVGRISFEKNVEIVLRALALLKERFPGITLDVYGDGPALTSMKIEARHLGLLDDVTFHGF 302

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
             R+    ++ +Y +FL  ST +       EA+A G   V  +  +     Q       +
Sbjct: 303 VSRETLPDVYPNYDLFLTASTMETQGLVVLEAMACGLPCVGVSAFALPELIQ-------E 355

Query: 350 GRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKSPSRH 407
           GRNGFV            +P     H +     TER L+   L     T+  +  S H
Sbjct: 356 GRNGFV-----------VKP----GHHIDMAERTERILKDPALYKAFSTQSLEIASEH 398


>gi|417645154|ref|ZP_12295085.1| glycosyltransferase, group 1 family protein [Staphylococcus warneri
           VCU121]
 gi|445058901|ref|YP_007384305.1| hypothetical protein A284_02680 [Staphylococcus warneri SG1]
 gi|330684089|gb|EGG95841.1| glycosyltransferase, group 1 family protein [Staphylococcus
           epidermidis VCU121]
 gi|443424958|gb|AGC89861.1| hypothetical protein A284_02680 [Staphylococcus warneri SG1]
          Length = 381

 Score = 45.8 bits (107), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 31/126 (24%), Positives = 63/126 (50%), Gaps = 10/126 (7%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGL--EMDLYGNGEDFDQIQRAAK--KLKLVVRVYPG 291
           GR+ + KG++ LL  + +   +L  L  E+ LYG+G++ D +++     +LK +V++YP 
Sbjct: 213 GRLEYEKGFDLLLESIRLIQDDLRHLNYELHLYGDGQEKDHLKQFIDQYQLKDIVQLYPA 272

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGR 351
               +      K+ + PS  +       EA+A   +V+  +  +       P+    +G+
Sbjct: 273 TPSLNQKLAQSKITVIPSRNEGFGMVILEAMAQDNVVISFDGNTG------PDSIIKNGK 326

Query: 352 NGFVEA 357
           NG++ A
Sbjct: 327 NGYLVA 332


>gi|126178638|ref|YP_001046603.1| glycosyl transferase, group 1 [Methanoculleus marisnigri JR1]
 gi|125861432|gb|ABN56621.1| glycosyl transferase, group 1 [Methanoculleus marisnigri JR1]
          Length = 348

 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 39/157 (24%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 187 RLSAATQEYPNSIVCNVHGVNPKFL----EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSK 242
           R+   T+E   +++  +H    K       I  K+   +   + A T    ++GR V  K
Sbjct: 122 RIICYTEEEKENLISILHIPESKIAVIPNGINTKQFHPRAGDHAADTINLLWVGRFVKGK 181

Query: 243 GYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL-----VVRVYPGRDHADP 297
           G E ++  ++I  KE+  L + L G G + D I+   + L+L     ++   P  D    
Sbjct: 182 GVEYIVQAMDILVKEIPSLHLTLIGEGPERDCIRELIESLELDNNINIIDFVP-YDEMPW 240

Query: 298 IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP 334
            F D  +F+ PS  + V     EA++    VV ++ P
Sbjct: 241 FFQDSDIFVLPSLHEGVPRTALEAMSCELPVVISDLP 277


>gi|126178134|ref|YP_001046099.1| glycosyl transferase, group 1 [Methanoculleus marisnigri JR1]
 gi|125860928|gb|ABN56117.1| glycosyl transferase, group 1 [Methanoculleus marisnigri JR1]
          Length = 476

 Score = 45.8 bits (107), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 5/108 (4%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----- 288
           + GR++  K  + LL  L    +E+  L   + G+G +   ++R A+ L L   V     
Sbjct: 293 FTGRLIREKNVDVLLRALVAVREEVPDLRALVVGDGPERPALERLARDLGLDESVTFTGF 352

Query: 289 YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
            P  D         +VF+ PST +    A  EA+A G  VV  +HP N
Sbjct: 353 LPDHDAVVAAMKASRVFVLPSTREGFGIAALEAMACGIPVVTTDHPGN 400


>gi|392948455|ref|ZP_10314064.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           pentosus KCA1]
 gi|392436210|gb|EIW14125.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           pentosus KCA1]
          Length = 502

 Score = 45.4 bits (106), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 62/132 (46%), Gaps = 14/132 (10%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG--EDFDQIQRAAKKLKLVVR 287
           K   Y+GR+   +  ++L+ ++ +  ++L  ++ D YG G  E    + +  ++LKL   
Sbjct: 325 KELLYVGRIAQDRQLDQLIRVIALVKQQLQDVQCDFYGYGDPEYIKTLTKLIEELKLTNN 384

Query: 288 VYPGRDHA--DPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA--NHPSNDFFKQFP 343
           V+    H   D  + DY+V LN    D    +  EA++ G  VV    N+   DF     
Sbjct: 385 VHLLDYHPDLDKRYDDYQVLLNTDLVDGGPMSMPEAMSHGIPVVSYRFNYGPKDFIN--- 441

Query: 344 NCRTYDGRNGFV 355
                DG++GFV
Sbjct: 442 -----DGQDGFV 448


>gi|307728632|ref|YP_003905856.1| group 1 glycosyl transferase [Burkholderia sp. CCGE1003]
 gi|307583167|gb|ADN56565.1| glycosyl transferase group 1 [Burkholderia sp. CCGE1003]
          Length = 360

 Score = 45.4 bits (106), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 66/146 (45%), Gaps = 10/146 (6%)

Query: 262 EMDLYGNGEDFD---QIQRAAKKLKLVVRVYPGRDHAD--PIFHDYKVFLNPSTTDVVCT 316
           ++ LYG  ED D   ++   A +L +  RV      +D   +  +  VF+ PS ++    
Sbjct: 215 KLTLYGVVEDPDYQREVVALATRLNIAERVLVAGPRSDVASVLSESSVFVMPSRSEGHSV 274

Query: 317 ATAEALAMGKIVVCANHPSNDFFKQFPNCRTY--DGRNGFVEATLKALAEEPAQPTDAQT 374
           A  EALA G  +V +  P+  F   FP  +    D  +G+ EA + AL ++  Q +    
Sbjct: 275 AFLEALASGVPIVASRIPAFAFANGFPGVQLVDTDDTSGYAEAIVTALGQQRVQRS---L 331

Query: 375 HQLSWESATERFLQVAELVGDVVTKR 400
             L+     ER+  +A  +   V+ R
Sbjct: 332 TGLTLRDTAERYRAIARQLCPAVSAR 357


>gi|350567952|ref|ZP_08936358.1| glycosyl transferase [Propionibacterium avidum ATCC 25577]
 gi|348662204|gb|EGY78873.1| glycosyl transferase [Propionibacterium avidum ATCC 25577]
          Length = 412

 Score = 45.4 bits (106), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 67/161 (41%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D ++  A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAAVELHRRGVPLRLDMYGVGPDADTLKEQAGDAPVFFNGFVEGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADLSMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDATSGEAGSPDA 323

Query: 352 NGFVEATLK---ALAEEPAQPTDAQTHQLSWESATERFLQV 389
           +G  +AT++    L  +  +    +  Q +WE++ E+ L V
Sbjct: 324 DGLADATVRLADRLGPDLREAARRRAEQFTWEASVEKMLAV 364


>gi|300310581|ref|YP_003774673.1| glycosyltransferase [Herbaspirillum seropedicae SmR1]
 gi|300073366|gb|ADJ62765.1| glycosyltransferase protein [Herbaspirillum seropedicae SmR1]
          Length = 349

 Score = 45.4 bits (106), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 70/155 (45%), Gaps = 12/155 (7%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   K  E  L L      +L G +  + G+G   +++ +   +++ +     G+D
Sbjct: 184 YVGRVAVEKNIEAFLKL------DLPGTKW-VVGDGPQLEELTQRYPEVRFLG--AKGQD 234

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNG 353
                ++   VF+ PS TD       EA+A G  V  A +P         N  +   R+ 
Sbjct: 235 ELPAYYNCADVFVFPSKTDTFGLVLLEAMACG--VPVAAYPVEGPIDVVDNGVSGILRHD 292

Query: 354 FVEATLKALAEEPAQPTDAQTHQLSWESATERFLQ 388
             +A L+AL  +  Q   A     SWESAT++FLQ
Sbjct: 293 LRQACLQALTLD-RQAVHAHARSRSWESATQQFLQ 326


>gi|240102074|ref|YP_002958382.1| Glycosyltransferase, family 1 [Thermococcus gammatolerans EJ3]
 gi|239909627|gb|ACS32518.1| Glycosyltransferase, family 1 [Thermococcus gammatolerans EJ3]
          Length = 392

 Score = 45.4 bits (106), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 55/126 (43%), Gaps = 9/126 (7%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----Y 289
           ++GR++  KG + LL  L    +EL  +   + G G +  +++R AK L L   V    +
Sbjct: 224 FVGRLISEKGVDFLLKALVKVKEELPDVRAVIVGGGPERKRLERMAKGLGLEKNVLFTGF 283

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
                   +    KVF+ PS  +       EA+A G  VV  N P N            D
Sbjct: 284 LPYKRVIALMKASKVFVLPSLREGFGMVALEAMACGLPVVTLNAPMNA-----ARFLVED 338

Query: 350 GRNGFV 355
           G+NGFV
Sbjct: 339 GKNGFV 344


>gi|359414034|ref|ZP_09206499.1| glycosyl transferase group 1 [Clostridium sp. DL-VIII]
 gi|357172918|gb|EHJ01093.1| glycosyl transferase group 1 [Clostridium sp. DL-VIII]
          Length = 401

 Score = 45.1 bits (105), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 65/136 (47%), Gaps = 5/136 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           YIGR+   K   EL+ L+    KE+  +++ + G G   + ++   KK K+  RV + G 
Sbjct: 211 YIGRIGEEKNISELIRLMPGVIKEIHNVKLLIVGGGPYLNNLKSLVKKDKIEERVIFTGM 270

Query: 293 DHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTY 348
            + + I+  YK   VF+  S ++       EAL+ G  VVC   P  N   +Q  N  +Y
Sbjct: 271 INPEEIYKYYKIAEVFVTASFSETQGLTYVEALSSGCPVVCKYDPCINGVIEQGENGFSY 330

Query: 349 DGRNGFVEATLKALAE 364
             ++ F     + L++
Sbjct: 331 KEKDEFAHYIKRILSD 346


>gi|435851632|ref|YP_007313218.1| glycosyltransferase [Methanomethylovorans hollandica DSM 15978]
 gi|433662262|gb|AGB49688.1| glycosyltransferase [Methanomethylovorans hollandica DSM 15978]
          Length = 384

 Score = 45.1 bits (105), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 4/105 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL----VVRVY 289
           +IGR V  KG + L+  LNI   +   + + + G G D D+I R      L    +++ +
Sbjct: 213 WIGRYVKGKGVDYLVDALNILKSDYPKIILTMVGTGPDKDKIIRKIHNYGLENSIILKDF 272

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP 334
                   ++H   +F+ PS  + V     EA++ G  +VC+  P
Sbjct: 273 IPNSEIVQLYHQSSIFVLPSLEEGVPRTILEAMSCGVPIVCSRLP 317


>gi|291459052|ref|ZP_06598442.1| glycosyl transferase, group 1 family [Oribacterium sp. oral taxon
           078 str. F0262]
 gi|291418306|gb|EFE92025.1| glycosyl transferase, group 1 family [Oribacterium sp. oral taxon
           078 str. F0262]
          Length = 413

 Score = 45.1 bits (105), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 55/218 (25%), Positives = 88/218 (40%), Gaps = 35/218 (16%)

Query: 205 GVNPKFLEIGEKKMEQ--QQNGNKAFTKGA------YYIGRMVWSKGYEELLGLLNIYHK 256
           G N +   + E++ E   ++ G  A+ +G        Y G++   KG   LL  L +   
Sbjct: 194 GYNRRIFRLPEERTELSLEKRGGAAWREGGRGGKRLLYAGKIAEKKGVMSLLRALRLLDP 253

Query: 257 EL---AGLEMDLYG---NGEDFDQIQRAAKKLKL--VVRVYPGRDHADPIFHDYKVFLNP 308
            L   A L + L G   N E++ +I++ +++L    V     G++     +    VF+ P
Sbjct: 254 ALFPAASLALFLAGSAGNQEEYRRIRKLSEELPFPAVFLGLLGQEELAKQYQRADVFVLP 313

Query: 309 STTDVVCTATAEALAMGKIVVCANHPS-NDFFKQ-----------FPNCRTYDGRN---- 352
           S  D +     EALA G   V +  P    FF +            P  R  D  N    
Sbjct: 314 SFFDAIPLTLVEALACGAKAVVSELPGIRRFFSENTRGANIRYVPLPGMRHADEANPEEL 373

Query: 353 -GFVEATLKALAEEPAQPTDA--QTHQLSWESATERFL 387
             F +   +A+ E    P+D+     QLSWE   E+ L
Sbjct: 374 PAFEKRLAEAVTEALLDPSDSVPDLRQLSWEGIAEKIL 411


>gi|301301025|ref|ZP_07207186.1| glycosyltransferase, group 1 family protein [Lactobacillus
           salivarius ACS-116-V-Col5a]
 gi|300851382|gb|EFK79105.1| glycosyltransferase, group 1 family protein [Lactobacillus
           salivarius ACS-116-V-Col5a]
          Length = 382

 Score = 45.1 bits (105), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 67/154 (43%), Gaps = 9/154 (5%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR-D 293
           GRMV++KG++ L+    I+ +  +  ++ L G+GE+   I+   KK  L  R+Y PG+  
Sbjct: 216 GRMVYAKGFDTLVEAFRIFAQRNSDWKLLLVGDGEELPTIKNKIKKYGLEKRIYTPGKTS 275

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVC----ANHPSNDFFKQFPNCRTYD 349
                F    V L PS  + +     E+L MG  +V     A  P      +    +   
Sbjct: 276 DIKEYFLQSSVLLLPSRWEGMPMIVLESLEMGCPIVAFDIDAMRPLVTNGMEGLIVKEKQ 335

Query: 350 GRNGFVEATLKALAEEPAQPTDAQTHQLSWESAT 383
             N + +A LK    E  +    Q HQ S + A 
Sbjct: 336 DANAYAQAMLKIAESEDLR---KQMHQASIKKAN 366


>gi|395205557|ref|ZP_10396188.1| glycosyltransferase, group 1 family protein [Propionibacterium
           humerusii P08]
 gi|422441363|ref|ZP_16518173.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL037PA3]
 gi|422472453|ref|ZP_16548941.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL037PA2]
 gi|422572140|ref|ZP_16647711.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL044PA1]
 gi|313836063|gb|EFS73777.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL037PA2]
 gi|314929599|gb|EFS93430.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL044PA1]
 gi|314970626|gb|EFT14724.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL037PA3]
 gi|328906193|gb|EGG25968.1| glycosyltransferase, group 1 family protein [Propionibacterium
           humerusii P08]
          Length = 412

 Score = 45.1 bits (105), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           YIGRM   K     +      H+    + +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YIGRMSHEKSPHLAVAAAIELHRRGVPVRLDMYGVGPDADAMKKQAGDAPVFFNGFVEGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDATSGEAGSPDA 323

Query: 352 NGFVEATLK---ALAEEPAQPTDAQTHQLSWESATERFLQV 389
           NG  +A L+    L     +    +  Q +W+++ E+ L+V
Sbjct: 324 NGLADAVLRLAPRLGPTLREAARRRAEQFTWDASVEKMLRV 364


>gi|422391313|ref|ZP_16471404.1| glycosyl transferase [Propionibacterium acnes HL103PA1]
 gi|422464207|ref|ZP_16540818.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL060PA1]
 gi|422566595|ref|ZP_16642228.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL082PA2]
 gi|314964986|gb|EFT09085.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL082PA2]
 gi|315093712|gb|EFT65688.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL060PA1]
 gi|327325532|gb|EGE67331.1| glycosyl transferase [Propionibacterium acnes HL103PA1]
          Length = 405

 Score = 45.1 bits (105), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGDAPVFFNGFVAGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    V ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADVSMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 MGLADATQRLAARLGPELRDAARHRAEQFTWDASVEAMLMI 364


>gi|90961965|ref|YP_535881.1| glycosyltransferase [Lactobacillus salivarius UCC118]
 gi|90821159|gb|ABD99798.1| Glycosyltransferase [Lactobacillus salivarius UCC118]
          Length = 382

 Score = 45.1 bits (105), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR-D 293
           GRMV++KG++ L+    I+ +  +  ++ L G+GE+   I+   KK  L  R+Y PG+  
Sbjct: 216 GRMVYAKGFDTLVEAFRIFAQRNSDWKLLLVGDGEELPTIKNKIKKYGLEKRIYTPGKTS 275

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVC----ANHPSNDFFKQFPNCRTYD 349
                F    V L PS  + +     E+L MG  +V     A  P      +    +   
Sbjct: 276 DIKEYFLQSSVLLLPSRWEGMPMIVLESLEMGCPIVAFDIDAMRPLVTNGMEGLIVKEKQ 335

Query: 350 GRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERF 386
             N + +A LK    E  +    Q HQ S + A  RF
Sbjct: 336 DANAYAQAMLKIAESEDLR---KQMHQASIKKAN-RF 368


>gi|282855168|ref|ZP_06264500.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes J139]
 gi|386070023|ref|YP_005984919.1| glycosyl transferase family protein [Propionibacterium acnes ATCC
           11828]
 gi|422458983|ref|ZP_16535632.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL050PA2]
 gi|422467551|ref|ZP_16544103.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA4]
 gi|422468995|ref|ZP_16545525.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA3]
 gi|422575082|ref|ZP_16650626.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL001PA1]
 gi|282581756|gb|EFB87141.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes J139]
 gi|314924145|gb|EFS87976.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL001PA1]
 gi|314982219|gb|EFT26312.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA3]
 gi|315090479|gb|EFT62455.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA4]
 gi|315104004|gb|EFT75980.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL050PA2]
 gi|353454390|gb|AER04909.1| glycosyl transferase [Propionibacterium acnes ATCC 11828]
          Length = 405

 Score = 45.1 bits (105), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGDAPVFFNGFVAGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    V ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADVSMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 MGLADATQRLAARLGPELRDAARHRAEQFTWDASVEAMLMI 364


>gi|339637093|emb|CCC15967.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           pentosus IG1]
          Length = 502

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 62/132 (46%), Gaps = 14/132 (10%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG--EDFDQIQRAAKKLKLVVR 287
           K   Y+GR+   +  ++L+ ++ +  ++L  ++ D YG G  E    + +  ++LKL   
Sbjct: 325 KELLYVGRIAPDRQLDQLIRVIALVKQQLQDVQCDFYGYGDPEYIKTLTKLIEELKLTNN 384

Query: 288 VYPGRDHAD--PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA--NHPSNDFFKQFP 343
           V+    H D    + DY+V LN    D    +  EA++ G  VV    N+   DF     
Sbjct: 385 VHLLDYHPDLEKRYDDYQVLLNTDLVDGGPMSMPEAMSHGIPVVSYRFNYGPKDFIN--- 441

Query: 344 NCRTYDGRNGFV 355
                DG++GFV
Sbjct: 442 -----DGQDGFV 448


>gi|334882817|emb|CCB83893.1| poly(Glycerol-phosphate) alpha-glucosyltransferas e [Lactobacillus
           pentosus MP-10]
          Length = 502

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 62/132 (46%), Gaps = 14/132 (10%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG--EDFDQIQRAAKKLKLVVR 287
           K   Y+GR+   +  ++L+ ++ +  ++L  ++ D YG G  E    + +  ++LKL   
Sbjct: 325 KELLYVGRIAPDRQLDQLIRVIALVKQQLQDVQCDFYGYGDPEYIKTLTKLIEELKLTNN 384

Query: 288 VYPGRDHA--DPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA--NHPSNDFFKQFP 343
           V+    H   D  + DY+V LN    D    +  EA++ G  VV    N+   DF     
Sbjct: 385 VHLLDYHPDLDKRYDDYQVLLNTDLVDGGPMSMPEAMSHGIPVVSYRFNYGPKDFIN--- 441

Query: 344 NCRTYDGRNGFV 355
                DG++GFV
Sbjct: 442 -----DGQDGFV 448


>gi|373115573|ref|ZP_09529743.1| hypothetical protein HMPREF0995_00579 [Lachnospiraceae bacterium
           7_1_58FAA]
 gi|371670210|gb|EHO35295.1| hypothetical protein HMPREF0995_00579 [Lachnospiraceae bacterium
           7_1_58FAA]
          Length = 381

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/134 (25%), Positives = 66/134 (49%), Gaps = 13/134 (9%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLV--VRVYPGR 292
           IGR+ + K    LL + +  ++ +    + L G+G+D  +++  A+ L L+  V +Y   
Sbjct: 215 IGRLCYQKNQRFLLNVFSDVYQNMPESRLLLVGDGDDRKELELYAESLGLLDSVIIYGTS 274

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH------PSNDFFKQFPNCR 346
           +H + +     VF  PS  + +  A  EA A G  V+C++        S+D ++      
Sbjct: 275 NHVEELLCAMDVFAFPSLFEGLGIAMIEAQASGLPVICSDQIPKESVVSDDVYRI----- 329

Query: 347 TYDGRNGFVEATLK 360
           +   R+G+V+A L+
Sbjct: 330 SVHDRDGWVKALLR 343


>gi|386265670|ref|YP_005829162.1| Lipopolysaccharide biosynthesis protein LsgC [Haemophilus
           influenzae R2846]
 gi|309972906|gb|ADO96107.1| Lipopolysaccharide biosynthesis protein LsgC [Haemophilus
           influenzae R2846]
          Length = 353

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/169 (24%), Positives = 78/169 (46%), Gaps = 12/169 (7%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAA--TQEYPNSIVCNVHGV-NPKFLEIGEKKMEQQQNGN 225
           L+  +  +LA   C K++ L+ A  TQ        N+  + NP  L I + K+E+ +N  
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTQWQEKFKRNNIISIANPNTL-IPKNKLEKLEN-- 181

Query: 226 KAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL- 284
               K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK+L + 
Sbjct: 182 ----KTILSVGHLFSYKGFDYLLKVWQVLAKQYPDWNLKIVGSGEEEENLKNLAKELDIK 237

Query: 285 -VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
             V   P  +     + +  ++  PS T+ +     EA+A G  +V  N
Sbjct: 238 DSVNFIPRTNDVSFYYENSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|253687703|ref|YP_003016893.1| group 1 glycosyl transferase [Pectobacterium carotovorum subsp.
           carotovorum PC1]
 gi|251754281|gb|ACT12357.1| glycosyl transferase group 1 [Pectobacterium carotovorum subsp.
           carotovorum PC1]
          Length = 351

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 58/116 (50%), Gaps = 6/116 (5%)

Query: 218 MEQQQNGNKAFTKGAY-YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQ 276
           ME     +K   +G + Y+GR+   KG + LL      H EL+   + + G+G   + ++
Sbjct: 180 MENHVRQDKHTYRGQFLYVGRLAEEKGLDFLLDFFRT-HPELS---LTIVGDGPQRESLE 235

Query: 277 RAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
             A +  ++ + Y      + IF ++ +F+ PST++       EALA G  V+C++
Sbjct: 236 NRASE-NVLFKGYVNNSELNSIFVEHDIFIFPSTSEPWGLVVEEALAYGLPVICSD 290


>gi|433543079|ref|ZP_20499493.1| polysaccharide biosynthesis protein [Brevibacillus agri BAB-2500]
 gi|432185618|gb|ELK43105.1| polysaccharide biosynthesis protein [Brevibacillus agri BAB-2500]
          Length = 408

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 57/130 (43%), Gaps = 18/130 (13%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG-- 291
           Y+GRMV  KG  ELL   +   +E   + + L G+  D ++ QR  K L+   R +P   
Sbjct: 207 YVGRMVGEKGIFELLDAFSRLARESGRVRLLLVGDVSDSERDQR-GKALREHCREHPQIV 265

Query: 292 ----RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCR- 346
               R     +     +F+ PS  + +  +  EA+AMGK +V  N            CR 
Sbjct: 266 LTGFRQDIPQLLAASDIFVLPSHREGLPRSIIEAMAMGKPIVATN---------IRGCRE 316

Query: 347 -TYDGRNGFV 355
              DG NG +
Sbjct: 317 EVTDGVNGIL 326


>gi|410669939|ref|YP_006922310.1| glycosyl transferase, group 1 [Methanolobus psychrophilus R15]
 gi|409169067|gb|AFV22942.1| glycosyl transferase, group 1 [Methanolobus psychrophilus R15]
          Length = 375

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 52/218 (23%), Positives = 97/218 (44%), Gaps = 17/218 (7%)

Query: 128 HLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAFL--LEFVNSWLARVHCHKV 185
           H T+  +    K  F  V+    T+ L  +K +++  ++A+L    F+N +  +      
Sbjct: 90  HFTYPPYSPLAKMPFPIVIT---THGLASIKVKRSHAIRAYLNLSFFLNPFFEKEALKNA 146

Query: 186 IRLSAATQEYPNSIVCNVHGVNPK--FLEIGEKKMEQQQNGNKAFTKGA-----YYIGRM 238
             + A ++   N +   + G NPK  +L  G      Q+ G+K           ++IGR+
Sbjct: 147 DSIIAVSKWIKNEVELAI-GPNPKIIYLPNGINFSALQEGGSKFENPSVSHPSLFFIGRL 205

Query: 239 VWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG--RDHA 295
           +  KG + L+  L    K++  + + + G G     +++ A+KL L   + + G   +H 
Sbjct: 206 IKFKGVDLLISSLYYVKKQVPDIHLYVAGQGPQASNLKKLAQKLGLESNITFLGFLSEHD 265

Query: 296 DPIF-HDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
             +F H+  VF+ PS  +       EAL  G  VV +N
Sbjct: 266 KQVFLHECDVFVIPSRYETFGIVVLEALEAGIPVVASN 303


>gi|419419721|ref|ZP_13959954.1| glycosyl transferase [Propionibacterium acnes PRP-38]
 gi|422395748|ref|ZP_16475781.1| glycosyl transferase [Propionibacterium acnes HL097PA1]
 gi|327332253|gb|EGE73989.1| glycosyl transferase [Propionibacterium acnes HL097PA1]
 gi|379979442|gb|EIA12762.1| glycosyl transferase [Propionibacterium acnes PRP-38]
          Length = 405

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGNAPVFFNGFVAGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGTHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 364


>gi|383809933|ref|ZP_09965445.1| glycosyltransferase, group 1 family protein [Rothia aeria F0474]
 gi|383447253|gb|EID50238.1| glycosyltransferase, group 1 family protein [Rothia aeria F0474]
          Length = 399

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 29/127 (22%), Positives = 64/127 (50%), Gaps = 5/127 (3%)

Query: 211 LEIGEKKMEQQQNGNKAFTK-GAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG 269
           +EIG+ ++ +    +KA  +    ++GR+   K  + L+  L++  + L  + +D+ G G
Sbjct: 194 IEIGDYELAEGATIDKAPGELRITFVGRLAQEKNIDVLIEALSLLPEHLNHVILDIAGGG 253

Query: 270 EDFDQIQRAAKKL----KLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMG 325
           E  +++++ A+      ++V R Y   +    ++    +F  P T ++    + EA++  
Sbjct: 254 ELREELEKKARDCGVSERVVFRGYVPDEDLPGVYQRADIFCQPGTAELQSLVSLEAMSAS 313

Query: 326 KIVVCAN 332
           K VV AN
Sbjct: 314 KPVVLAN 320


>gi|383319512|ref|YP_005380353.1| glycosyltransferase [Methanocella conradii HZ254]
 gi|379320882|gb|AFC99834.1| Glycosyltransferase [Methanocella conradii HZ254]
          Length = 393

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 62/138 (44%), Gaps = 13/138 (9%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL----VVRVY 289
           ++GRM+  KG   L+  + +   +    ++ L G G   ++++R    + L    +   Y
Sbjct: 212 FVGRMITQKGVPYLIEAMPMVLAKHPDAKLLLVGRGSSLEKLKRKVNAMGLEKSVIFSGY 271

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
            G +     +    +F+ PS  +V+  A  EA++  + VVC N   N    +       D
Sbjct: 272 MGEEDLKEAYGTCDIFVLPSVWEVLPIAILEAMSSSRPVVCTNAGGNAELVK-------D 324

Query: 350 GRNGFVEATL--KALAEE 365
           G NG+V      +ALAE+
Sbjct: 325 GVNGYVVPMRDPRALAEK 342


>gi|451943662|ref|YP_007464298.1| glycogen synthase [Corynebacterium halotolerans YIM 70093 = DSM
           44683]
 gi|451903049|gb|AGF71936.1| glycogen synthase [Corynebacterium halotolerans YIM 70093 = DSM
           44683]
          Length = 375

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 34/166 (20%), Positives = 67/166 (40%), Gaps = 9/166 (5%)

Query: 176 WLARVHCHKVIRLSAATQEYPNSIVCNVHGVN--PKFLEIGEKKMEQQQNGNKAFTKGAY 233
           W+  +H    + L  +      +    +  V   PK ++    + E++ +  +    G +
Sbjct: 134 WIRMLHNQAEVNLCTSAPMVEQAAAAGIRDVELWPKAVDTVGYRPERRTDRMRELLSGDH 193

Query: 234 -------YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVV 286
                  Y+GRM   K  E L+G+++   + + G  + + G+G   ++++R         
Sbjct: 194 PDAPLVVYVGRMSREKDLERLVGIMSNLRERVPGARLAMVGSGPYREELERMLDPAWTTF 253

Query: 287 RVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
             Y         F    VF+ PSTT+ +     E++A G  VV A 
Sbjct: 254 TGYLSGPELAEAFASGDVFVFPSTTETLGLVALESMASGVPVVGAR 299


>gi|399051248|ref|ZP_10741170.1| glycosyltransferase [Brevibacillus sp. CF112]
 gi|398050825|gb|EJL43170.1| glycosyltransferase [Brevibacillus sp. CF112]
          Length = 408

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 57/130 (43%), Gaps = 18/130 (13%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG-- 291
           Y+GRMV  KG  ELL   +   +E   + + L G+  D ++ QR  K L+   R +P   
Sbjct: 207 YVGRMVGEKGIFELLDAFSRLARESGRVRLLLVGDVSDSERDQR-GKALREHCRKHPQIV 265

Query: 292 ----RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCR- 346
               R     +     +F+ PS  + +  +  EA+AMGK +V  N            CR 
Sbjct: 266 LTGFRRDIPQLLAASDIFVLPSHREGLPRSIIEAMAMGKPIVATN---------IRGCRE 316

Query: 347 -TYDGRNGFV 355
              DG NG +
Sbjct: 317 EVTDGVNGIL 326


>gi|420154844|ref|ZP_14661718.1| nucleotide sugar dehydrogenase [Clostridium sp. MSTE9]
 gi|394760127|gb|EJF42751.1| nucleotide sugar dehydrogenase [Clostridium sp. MSTE9]
          Length = 837

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 77/184 (41%), Gaps = 22/184 (11%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLY-GNGEDFDQIQRAAKKLKLVVRVYPGR 292
           Y GR+   KG + L+  + I +K+ +   + ++ G+G   D +Q+ A    +      G 
Sbjct: 203 YAGRISAEKGLDTLMESIRIVNKKYSDRVLFVFAGDGPYLDALQKQALPNTVFTGFLTGE 262

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTYDGR 351
             A+ ++    VF+ PS T+       EA+A G  V+CA+     DF     N   +  R
Sbjct: 263 ALAE-LYASSDVFVFPSGTETFGNVVLEAMASGLPVICADEGGVTDFTVHRINASVF--R 319

Query: 352 NGFVEATLKALAEEPAQPTDAQT-------------HQLSWESATERFLQVAELVGDVVT 398
            G    +  +LAEE     + +T             H  SWES   + +Q  E V +   
Sbjct: 320 CG----SADSLAEEMTGMIENETLRLRLGSTSVSTAHSRSWESILGKLMQQYESVLEAAR 375

Query: 399 KRSK 402
             SK
Sbjct: 376 SESK 379


>gi|108804137|ref|YP_644074.1| group 1 glycosyl transferase [Rubrobacter xylanophilus DSM 9941]
 gi|108765380|gb|ABG04262.1| glycosyl transferase, group 1 [Rubrobacter xylanophilus DSM 9941]
          Length = 374

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 81/205 (39%), Gaps = 33/205 (16%)

Query: 145 VVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRL--SAATQEY------- 195
           +V   HTN   Y +  +      FL      +   VH    + L  S+AT EY       
Sbjct: 113 LVASYHTNVAAYARFYR----LGFLHRAARLYTRAVHNRAAVNLCTSSATLEYLRGEGIR 168

Query: 196 -----PNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGL 250
                P  + C + G +         +  ++ +G         ++GR+   KG E+L   
Sbjct: 169 ALRLWPQGVDCELFGPH-----RASGRWRERLSGGHPDAGLLLFVGRLAPEKGIEQLRAA 223

Query: 251 LNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYK---VFLN 307
           L+    ++ G+ + L G+G     +QR       V   + G  H + +   Y    +F+ 
Sbjct: 224 LD----KMPGVRLALVGDGPARPALQRVFAGTPTV---FTGVLHGEELAAAYASADLFVF 276

Query: 308 PSTTDVVCTATAEALAMGKIVVCAN 332
           PSTT+ +  A  EALA G  V+ A 
Sbjct: 277 PSTTETLGMAMLEALASGVPVIAAR 301


>gi|187736561|ref|YP_001878673.1| group 1 glycosyl transferase [Akkermansia muciniphila ATCC BAA-835]
 gi|187426613|gb|ACD05892.1| glycosyl transferase group 1 [Akkermansia muciniphila ATCC BAA-835]
          Length = 392

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 66/162 (40%), Gaps = 22/162 (13%)

Query: 197 NSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRM-VWSKGYEELLGLLNIYH 255
           N +VC  + V P  L   E+K  +             ++GR+   SKG + LL +     
Sbjct: 200 NKLVCIPNIVEPAVLSKEERKEPR-----------CLFVGRLDNPSKGVDRLLRIWEKVE 248

Query: 256 KELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVC 315
           K      +D+ G+G D D ++ +A+KL L    + G  + +P +    VF   ST +   
Sbjct: 249 KTCPEWHLDIVGDGPDADMLKDSAQKLGLSRIAFHGFQNPEPYYSRASVFCMTSTFEGFG 308

Query: 316 TATAEALAMGKIVVCANHPSNDFFKQFPNCRTY--DGRNGFV 355
               EA+  G + V         F  +P  R     G NG +
Sbjct: 309 LVLVEAMQHGCVPVA--------FDSYPAVRDIISHGENGIL 342


>gi|348026059|ref|YP_004765864.1| glycosyl transferase group 1 [Megasphaera elsdenii DSM 20460]
 gi|341822113|emb|CCC73037.1| glycosyl transferase group 1 [Megasphaera elsdenii DSM 20460]
          Length = 345

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 74/161 (45%), Gaps = 11/161 (6%)

Query: 216 KKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQI 275
           KK+ Q +N  K  +    + GR+V SK  + ++  L++ +++  G    L G+G +   +
Sbjct: 159 KKISQNRNSVKKKSGQILFTGRLVLSKRIDTIIRALSLLNRDGIGYNFVLAGDGPELSHL 218

Query: 276 QRAAKKLKLVVRVYPGRDHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            + A++L + V  + GR   D I+  Y+   +F++   ++       EAL     +VC  
Sbjct: 219 MKLAQQLDVSVN-FMGRVSHDEIYKLYQESELFISLGESESFGLTYVEALLSNCKIVCPR 277

Query: 333 HPSN-DFFKQFPNCRTY----DGRNGFVEATLKALAEEPAQ 368
                +FF  +     +    D  N  +E T+K L  +P +
Sbjct: 278 TGGQVEFFNHYSKYVGFVNPLDDLN--IENTIKQLLLKPTE 316


>gi|365852544|ref|ZP_09392926.1| glycosyltransferase, group 1 family protein [Lactobacillus
           parafarraginis F0439]
 gi|363714727|gb|EHL98214.1| glycosyltransferase, group 1 family protein [Lactobacillus
           parafarraginis F0439]
          Length = 513

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 63/135 (46%), Gaps = 10/135 (7%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VVRVYPG 291
           ++GR+   KG   L+ +    H +L  L++ +YG G+  +  ++ AK L +  VV     
Sbjct: 339 FVGRLGNGKGIVRLVDMFQQIHNKLTSLKLLIYGYGDAEESARQEAKNLGIEDVVDFKDY 398

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGR 351
           +      + + K+F+  + TDV   A  EA + G  +V  + P        P    +DG 
Sbjct: 399 QVDLKSAYAESKLFVTTTGTDVEPLAMTEAASFGLPMVAFDIPYG------PREVIHDGW 452

Query: 352 NG--FVEATLKALAE 364
           NG  F +  LK +AE
Sbjct: 453 NGLLFRDGELKEMAE 467


>gi|448569902|ref|ZP_21638985.1| glycosyltransferase [Haloferax lucentense DSM 14919]
 gi|445723706|gb|ELZ75343.1| glycosyltransferase [Haloferax lucentense DSM 14919]
          Length = 382

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 44/206 (21%), Positives = 84/206 (40%), Gaps = 46/206 (22%)

Query: 206 VNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDL 265
           V P  +++ + +     +   +F     + GR++  K    LL   +    +   + + +
Sbjct: 184 VVPNGIDVDQIRNAPLPDDGDSFD--VLFAGRLIADKNVSTLLDAFDRVADDYDDVTLGV 241

Query: 266 YGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPI-----FHDYK----------VFLNPST 310
            G+G +FD+++R A  L          DHAD +       +Y+          VF +PST
Sbjct: 242 VGDGPEFDRLERQANAL----------DHADRVSLLGFLDEYEDVLGQMRAADVFASPST 291

Query: 311 TDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFV-EATLKALAEE---- 365
            +      AEA+A    V+   HP +   +         G  G++ E T++++AE     
Sbjct: 292 REGFGITYAEAMAADCTVIGVQHPESAAIEVI-------GDAGYLAEPTVESVAESLARA 344

Query: 366 ------PAQPTDAQTHQLSWESATER 385
                 P +PT  +  Q  W+S  ++
Sbjct: 345 LGGERPPTEPT-KRAEQYDWDSVADQ 369


>gi|301301007|ref|ZP_07207168.1| glycosyltransferase, group 1 family protein [Lactobacillus
           salivarius ACS-116-V-Col5a]
 gi|300851364|gb|EFK79087.1| glycosyltransferase, group 1 family protein [Lactobacillus
           salivarius ACS-116-V-Col5a]
          Length = 382

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 6/109 (5%)

Query: 219 EQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRA 278
           E+    NK F       GRMV  K ++ L+    I+ +E +  ++ L G+GED   I+  
Sbjct: 203 EKSDMKNKVFLAA----GRMVHVKAFDNLVEAFKIFAQENSDWKLLLVGDGEDLPMIRDK 258

Query: 279 AKKLKLVVRVY-PGR-DHADPIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            KK  L  R+  PG+ D     F    V L PS  + +     EAL MG
Sbjct: 259 IKKYGLEQRIITPGKTDDIKKYFLQSSVLLLPSRWEGMPMIGLEALEMG 307


>gi|187922837|ref|YP_001894479.1| group 1 glycosyl transferase [Burkholderia phytofirmans PsJN]
 gi|187714031|gb|ACD15255.1| glycosyl transferase group 1 [Burkholderia phytofirmans PsJN]
          Length = 357

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 62/143 (43%), Gaps = 10/143 (6%)

Query: 265 LYGNGEDFD---QIQRAAKKLKLVVRVYPGRDHAD--PIFHDYKVFLNPSTTDVVCTATA 319
           LYG  ED D    +   AK L +  RV       D   +  +  VF+ PS ++    A  
Sbjct: 218 LYGVIEDPDYWRDVIALAKTLGIAERVAVAGPRTDVATVLSESNVFVMPSQSEAHSVAFL 277

Query: 320 EALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN--GFVEATLKALAEEPAQPTDAQTHQL 377
           EALA G  +V +  P+  F   FP  +  D  N   + EA + AL +  AQ + A    L
Sbjct: 278 EALASGVPIVASKIPAFAFASGFPGVQLVDTGNVQCYAEAVIVALGQARAQRSLA---GL 334

Query: 378 SWESATERFLQVAELVGDVVTKR 400
           +     +R+  +A  V   V+ R
Sbjct: 335 TLRDTADRYRAIARQVSLAVSAR 357


>gi|440712381|ref|ZP_20893002.1| glycosyl transferase, group 1 [Rhodopirellula baltica SWK14]
 gi|436442902|gb|ELP35995.1| glycosyl transferase, group 1 [Rhodopirellula baltica SWK14]
          Length = 374

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/127 (25%), Positives = 54/127 (42%), Gaps = 2/127 (1%)

Query: 208 PKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYG 267
           PK      K   ++Q      T     +GR+   KG++ L+  L     +    E+ ++G
Sbjct: 172 PKEFRTSLKPSTERQTDPGQATCHLVAVGRLEREKGFDRLIRSLGYLPNDAPNWELTIHG 231

Query: 268 NGEDFDQIQRAAKKLKLVVRV-YPGRDHAD-PIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            G   D ++  A KL L  RV +PG      P++     F+ PS  +   +A  EA+A  
Sbjct: 232 EGSQRDHLESLASKLNLASRVKFPGWTRPIWPVYQSADWFVLPSRYEGFPSALLEAMACS 291

Query: 326 KIVVCAN 332
             V+  +
Sbjct: 292 TAVLTVD 298


>gi|125973872|ref|YP_001037782.1| group 1 glycosyl transferase [Clostridium thermocellum ATCC 27405]
 gi|125714097|gb|ABN52589.1| glycosyl transferase group 1 [Clostridium thermocellum ATCC 27405]
          Length = 364

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 59/123 (47%), Gaps = 7/123 (5%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR+   K Y+ L+   +    E +   +++YG G + +++Q+   +L L  R+      
Sbjct: 184 VGRLSNQKNYDLLIDAFSDIANEYSEYRLEIYGEGPERERLQKKINRLGLNDRIQLMGTK 243

Query: 295 ADPIFH--DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN 352
            + + H  D ++F+  S  +    A AEA+A G  V+  N PS    K+       DG N
Sbjct: 244 TNVMKHIADAQLFVMSSNFEGFPNALAEAMASGLPVISTNFPSG-VAKEL----IIDGEN 298

Query: 353 GFV 355
           G+V
Sbjct: 299 GYV 301


>gi|344924331|ref|ZP_08777792.1| glycosyl transferase, group 1 [Candidatus Odyssella
           thessalonicensis L13]
          Length = 363

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 70/150 (46%), Gaps = 7/150 (4%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR++ SKG+E L+       +    L + +YG GE+  +++   + L L  RV      
Sbjct: 191 VGRLIKSKGFETLIQAFKEVAEANPDLILTIYGEGEERPRLEELIRSLNLKERVLLPGTV 250

Query: 295 ADPI--FHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN-DFFKQFPNCRTYDGR 351
            D +       +F+ PS  +    A  EA+A+G  V+ +N   N D  K   N R +   
Sbjct: 251 PDVLTRLSQADIFVFPSHYEGFPNALGEAMAVGLPVIASNCTGNIDLIKDGINGRLFPVG 310

Query: 352 NGFVEATLKALAEEPAQPTDAQTHQLSWES 381
           +    A+L +L  E  + T+ Q  +LS+ +
Sbjct: 311 DA---ASLASLMLELLKDTE-QRQRLSFHA 336


>gi|295395884|ref|ZP_06806069.1| glycosyltransferase [Brevibacterium mcbrellneri ATCC 49030]
 gi|294971157|gb|EFG47047.1| glycosyltransferase [Brevibacterium mcbrellneri ATCC 49030]
          Length = 402

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 60/126 (47%), Gaps = 7/126 (5%)

Query: 211 LEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGE 270
           +++ E     +  G+K       ++GR+   K  E+LL  +     EL  L   + G GE
Sbjct: 205 IDLSEFSPSDEPAGDKP--PSVLFVGRLSSEKHIEDLLAAVARTSPEL-NLHATIIGAGE 261

Query: 271 DFDQIQRAAKKLKLVVRV-YPGRDHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGK 326
               +++ A+ L +  RV  PG+   + +   YK    F  PST ++   AT EALA GK
Sbjct: 262 QLQALKQQAQDLGIADRVNIPGKVSQEQLVQAYKDATFFCMPSTAELQSIATLEALASGK 321

Query: 327 IVVCAN 332
            VV A+
Sbjct: 322 PVVLAD 327


>gi|289427670|ref|ZP_06429382.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes J165]
 gi|295131350|ref|YP_003582013.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes SK137]
 gi|335051032|ref|ZP_08543971.1| glycosyltransferase, group 1 family protein [Propionibacterium sp.
           409-HC1]
 gi|342211931|ref|ZP_08704656.1| glycosyltransferase, group 1 family protein [Propionibacterium sp.
           CC003-HC2]
 gi|354607543|ref|ZP_09025512.1| hypothetical protein HMPREF1003_02079 [Propionibacterium sp.
           5_U_42AFAA]
 gi|386024762|ref|YP_005943067.1| GDP-mannose-dependent alpha-(1-6)-phosphatidylinositol dimannoside
           mannosyltransferase [Propionibacterium acnes 266]
 gi|417929917|ref|ZP_12573297.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes SK182]
 gi|422386341|ref|ZP_16466461.1| glycosyl transferase [Propionibacterium acnes HL096PA3]
 gi|422388835|ref|ZP_16468935.1| glycosyl transferase [Propionibacterium acnes HL096PA2]
 gi|422392484|ref|ZP_16472553.1| glycosyl transferase [Propionibacterium acnes HL099PA1]
 gi|422425740|ref|ZP_16502670.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL043PA1]
 gi|422429683|ref|ZP_16506579.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL072PA2]
 gi|422448489|ref|ZP_16525216.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL036PA3]
 gi|422461283|ref|ZP_16537913.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL038PA1]
 gi|422475910|ref|ZP_16552354.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL056PA1]
 gi|422478472|ref|ZP_16554893.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL007PA1]
 gi|422481148|ref|ZP_16557550.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL063PA1]
 gi|422483657|ref|ZP_16560045.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL036PA1]
 gi|422484352|ref|ZP_16560730.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL043PA2]
 gi|422486886|ref|ZP_16563229.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL013PA2]
 gi|422490102|ref|ZP_16566423.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL020PA1]
 gi|422496520|ref|ZP_16572804.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL025PA1]
 gi|422497220|ref|ZP_16573495.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL002PA3]
 gi|422503558|ref|ZP_16579796.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL027PA2]
 gi|422505776|ref|ZP_16582005.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL036PA2]
 gi|422508753|ref|ZP_16584912.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL046PA2]
 gi|422512165|ref|ZP_16588300.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL087PA2]
 gi|422517696|ref|ZP_16593787.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL074PA1]
 gi|422522326|ref|ZP_16598352.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL045PA1]
 gi|422526813|ref|ZP_16602806.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL083PA1]
 gi|422530017|ref|ZP_16605982.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL053PA1]
 gi|422532978|ref|ZP_16608920.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL072PA1]
 gi|422538084|ref|ZP_16613963.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL078PA1]
 gi|422559847|ref|ZP_16635562.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA1]
 gi|422568133|ref|ZP_16643757.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL002PA2]
 gi|289159161|gb|EFD07353.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes J165]
 gi|291376923|gb|ADE00778.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes SK137]
 gi|313773116|gb|EFS39082.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL074PA1]
 gi|313808803|gb|EFS47257.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL087PA2]
 gi|313810403|gb|EFS48117.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL083PA1]
 gi|313812261|gb|EFS49975.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL025PA1]
 gi|313817982|gb|EFS55696.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL046PA2]
 gi|313819895|gb|EFS57609.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL036PA1]
 gi|313823386|gb|EFS61100.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL036PA2]
 gi|313824857|gb|EFS62571.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL063PA1]
 gi|313830099|gb|EFS67813.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL007PA1]
 gi|313832629|gb|EFS70343.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL056PA1]
 gi|314925752|gb|EFS89583.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL036PA3]
 gi|314960832|gb|EFT04933.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL002PA2]
 gi|314973052|gb|EFT17148.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL053PA1]
 gi|314975548|gb|EFT19643.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL045PA1]
 gi|314979763|gb|EFT23857.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL072PA2]
 gi|314984831|gb|EFT28923.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA1]
 gi|315079861|gb|EFT51837.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL078PA1]
 gi|315083245|gb|EFT55221.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL027PA2]
 gi|315086865|gb|EFT58841.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL002PA3]
 gi|315089957|gb|EFT61933.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL072PA1]
 gi|315096721|gb|EFT68697.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL038PA1]
 gi|327325135|gb|EGE66941.1| glycosyl transferase [Propionibacterium acnes HL096PA3]
 gi|327325236|gb|EGE67041.1| glycosyl transferase [Propionibacterium acnes HL096PA2]
 gi|327444036|gb|EGE90690.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL043PA1]
 gi|327449336|gb|EGE95990.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL013PA2]
 gi|327449436|gb|EGE96090.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL043PA2]
 gi|328756319|gb|EGF69935.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL020PA1]
 gi|328761347|gb|EGF74874.1| glycosyl transferase [Propionibacterium acnes HL099PA1]
 gi|332676220|gb|AEE73036.1| GDP-mannose-dependent alpha-(1-6)-phosphatidylinositol dimannoside
           mannosyltransferase [Propionibacterium acnes 266]
 gi|333768109|gb|EGL45313.1| glycosyltransferase, group 1 family protein [Propionibacterium sp.
           409-HC1]
 gi|340767475|gb|EGR90000.1| glycosyltransferase, group 1 family protein [Propionibacterium sp.
           CC003-HC2]
 gi|340772604|gb|EGR95105.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes SK182]
 gi|353556562|gb|EHC25932.1| hypothetical protein HMPREF1003_02079 [Propionibacterium sp.
           5_U_42AFAA]
 gi|456738914|gb|EMF63481.1| group 1 glycosyl transferase [Propionibacterium acnes FZ1/2/0]
          Length = 405

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGDAPVFFNGFVVGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 364


>gi|422553312|ref|ZP_16629098.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA3]
 gi|422555977|ref|ZP_16631738.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA2]
 gi|314986090|gb|EFT30182.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA2]
 gi|314988705|gb|EFT32796.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA3]
          Length = 405

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGDAPVFFNGFVVGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 364


>gi|407936208|ref|YP_006851850.1| group 1 glycosyl transferase [Propionibacterium acnes C1]
 gi|407904789|gb|AFU41619.1| group 1 glycosyl transferase [Propionibacterium acnes C1]
          Length = 405

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGDAPVFFNGFVVGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 364


>gi|289424836|ref|ZP_06426618.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes SK187]
 gi|365963473|ref|YP_004945039.1| glycosyl transferase family protein [Propionibacterium acnes
           TypeIA2 P.acn31]
 gi|365965716|ref|YP_004947281.1| glycosyl transferase family protein [Propionibacterium acnes
           TypeIA2 P.acn17]
 gi|365974652|ref|YP_004956211.1| glycosyl transferase family protein [Propionibacterium acnes
           TypeIA2 P.acn33]
 gi|422428710|ref|ZP_16505620.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL087PA1]
 gi|422433878|ref|ZP_16510742.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL059PA2]
 gi|422436479|ref|ZP_16513328.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL083PA2]
 gi|422438946|ref|ZP_16515783.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL092PA1]
 gi|422443938|ref|ZP_16520735.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL002PA1]
 gi|422444562|ref|ZP_16521346.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL027PA1]
 gi|422451156|ref|ZP_16527860.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL030PA2]
 gi|422453977|ref|ZP_16530658.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL087PA3]
 gi|422493880|ref|ZP_16570177.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL086PA1]
 gi|422500623|ref|ZP_16576878.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL063PA2]
 gi|422510176|ref|ZP_16586324.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL059PA1]
 gi|422517102|ref|ZP_16593207.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA2]
 gi|422531067|ref|ZP_16607016.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA1]
 gi|422540666|ref|ZP_16616531.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL013PA1]
 gi|422540799|ref|ZP_16616661.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL037PA1]
 gi|422544771|ref|ZP_16620606.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL082PA1]
 gi|422548407|ref|ZP_16624222.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL050PA3]
 gi|422550170|ref|ZP_16625969.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL050PA1]
 gi|422557883|ref|ZP_16633624.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL025PA2]
 gi|422563709|ref|ZP_16639384.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL046PA1]
 gi|422571616|ref|ZP_16647198.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL067PA1]
 gi|422579659|ref|ZP_16655178.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA4]
 gi|289154799|gb|EFD03482.1| glycosyltransferase, group 1 family protein [Propionibacterium
           acnes SK187]
 gi|313763201|gb|EFS34565.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL013PA1]
 gi|313793294|gb|EFS41352.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA1]
 gi|313801062|gb|EFS42330.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL110PA2]
 gi|313816538|gb|EFS54252.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL059PA1]
 gi|313828401|gb|EFS66115.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL063PA2]
 gi|313837985|gb|EFS75699.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL086PA1]
 gi|314914368|gb|EFS78199.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL005PA4]
 gi|314917688|gb|EFS81519.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL050PA1]
 gi|314919580|gb|EFS83411.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL050PA3]
 gi|314930170|gb|EFS94001.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL067PA1]
 gi|314957157|gb|EFT01261.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL027PA1]
 gi|314957798|gb|EFT01901.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL002PA1]
 gi|314963506|gb|EFT07606.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL082PA1]
 gi|314969911|gb|EFT14009.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL037PA1]
 gi|315097949|gb|EFT69925.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL059PA2]
 gi|315100713|gb|EFT72689.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL046PA1]
 gi|315109195|gb|EFT81171.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL030PA2]
 gi|327451458|gb|EGE98112.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL087PA3]
 gi|327451567|gb|EGE98221.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL092PA1]
 gi|327451854|gb|EGE98508.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL083PA2]
 gi|328752068|gb|EGF65684.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL087PA1]
 gi|328755534|gb|EGF69150.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL025PA2]
 gi|365740154|gb|AEW84356.1| glycosyl transferase [Propionibacterium acnes TypeIA2 P.acn31]
 gi|365742397|gb|AEW82091.1| glycosyl transferase [Propionibacterium acnes TypeIA2 P.acn17]
 gi|365744651|gb|AEW79848.1| glycosyl transferase [Propionibacterium acnes TypeIA2 P.acn33]
          Length = 405

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGDAPVFFNGFVVGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 364


>gi|406901341|gb|EKD44028.1| AprM [uncultured bacterium]
          Length = 379

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 54/114 (47%), Gaps = 7/114 (6%)

Query: 229 TKGAYY--IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGN-GEDFDQIQRAAK----K 281
           T G Y   IGR+   K  + ++   NI  K+L  L++ L GN G  F+++Q A +    K
Sbjct: 200 TYGEYILSIGRLEEKKNTKRIVEAFNILKKQLPNLKLILVGNSGAGFEEVQAAIEQSDYK 259

Query: 282 LKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS 335
             ++   +        I    KVF+ PS  +       EALA+G  VV ++ P+
Sbjct: 260 KDIICPGFMSSQQLVSILKQAKVFVFPSLYEGFGIPVLEALAVGTPVVVSDIPA 313


>gi|258511316|ref|YP_003184750.1| group 1 glycosyl transferase [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius DSM 446]
 gi|257478042|gb|ACV58361.1| glycosyl transferase group 1 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius DSM 446]
          Length = 385

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 67/297 (22%), Positives = 112/297 (37%), Gaps = 56/297 (18%)

Query: 138 WKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPN 197
           WK+    V    HTN   Y +  K + L+  L  +  +   R H +      A  +E   
Sbjct: 106 WKSHLPLVAS-YHTNVPAYARHYKLEFLEPLLWWYFRTLHNRAHLNLATS-RATLRELER 163

Query: 198 SIVCNV----HGVN-------PKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEE 246
               N+     GV+       P   E+  +   + + G++       Y+GR+   K  E 
Sbjct: 164 QGFQNLELWERGVDVELFRNAPYSEEMRRRLAPEAKPGDRVL----LYVGRLASEKNIER 219

Query: 247 LLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV-- 304
           +  +L+     +  L + + G+G    +++R     +     + G  H + +   Y+   
Sbjct: 220 MRPVLD----AIPDLHLAIVGDGPHRPELERVFAGTRTH---FTGYLHGEELAQAYRAAD 272

Query: 305 -FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFV-------- 355
            FL PSTT+ +     EA+A G  +V A+ P      +       DGR GF+        
Sbjct: 273 AFLFPSTTETLGLVLFEAMAAGLPIVAADSPPTREVLE-------DGRAGFIFDPDSTES 325

Query: 356 -EATLKALAEEPA------QPTDAQTHQLSWESAT-------ERFLQVAELVGDVVT 398
             AT+  +  + A      Q   A   QL WE  +       ER LQ   +V   VT
Sbjct: 326 LIATVDLVMRDEARREAVRQRGLAIAEQLDWEGPSKQLLGHYERVLQSFSVVAGAVT 382


>gi|408369773|ref|ZP_11167553.1| Capsular polysaccharide biosynthesis glycosyl transferase
           [Galbibacter sp. ck-I2-15]
 gi|407744827|gb|EKF56394.1| Capsular polysaccharide biosynthesis glycosyl transferase
           [Galbibacter sp. ck-I2-15]
          Length = 377

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 10/106 (9%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQR-AAKKLKLVVRVYPG- 291
           Y+GR+V  KG  EL+      H+    + + L G   DF+ IQ   + + + +++ +P  
Sbjct: 203 YVGRLVGDKGTNELIDAFTQMHQSHPEIRLLLIG---DFEDIQDPVSDRTRSLIKNHPAI 259

Query: 292 -----RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
                + +  P F    VF+ PS  +       EA AM K V+C++
Sbjct: 260 VYAGYQSNVKPFFKLMDVFVFPSYREGFGMVVMEAAAMEKPVICSD 305


>gi|171059760|ref|YP_001792109.1| group 1 glycosyl transferase [Leptothrix cholodnii SP-6]
 gi|170777205|gb|ACB35344.1| glycosyl transferase group 1 [Leptothrix cholodnii SP-6]
          Length = 379

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 45/197 (22%), Positives = 78/197 (39%), Gaps = 19/197 (9%)

Query: 178 ARVHCHKVIRLSAATQE--YPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYI 235
           A + C   +     TQE   PN +    +G+N +  E  + +  + + G          I
Sbjct: 143 ATIACGAEVERMLVTQEGIAPNRVKTVANGINLRRFESADGRRLRAELGVLPGQVLIGVI 202

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VVRVYPGRD 293
           GR+   KG+ +L+  L   ++E    +    G G+  D +Q+   +  L  VVR+   R 
Sbjct: 203 GRLHPLKGHADLIVALAQLYREGVDFQCVFVGGGDLHDALQQQVDEAGLNGVVRLLGQRS 262

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNG 353
               +     +F  PS  + +  A  E++AM + V+             P   T DG NG
Sbjct: 263 DVADVLAAIDIFAMPSRREGLPMALLESMAMARAVLATA------VGSIPEVIT-DGENG 315

Query: 354 FVEATLKALAEEPAQPT 370
            +         EP+ P+
Sbjct: 316 ML--------VEPSNPS 324


>gi|406908463|gb|EKD48960.1| glycosyltransferase [uncultured bacterium]
          Length = 283

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 39/162 (24%), Positives = 71/162 (43%), Gaps = 25/162 (15%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAA--------KKLKLV 285
           Y+GR V  K    ++ L ++  +     E+ L G G   D +Q  A        ++++ +
Sbjct: 113 YVGRFVSEKN---IMALFDVMAQLPEAYELTLVGYGNYLDLLQERAYDILGLSRERVRFI 169

Query: 286 VRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNC 345
           V+  P +      +    +FL PS TD      AEA+A    V+  + P           
Sbjct: 170 VQ--PTQQTLLDRYRSAHLFLFPSRTDTQGLVIAEAMACSTPVIAFDGPGQ-------RD 220

Query: 346 RTYDGRNGFVEATLKALAEE-PAQPTDAQTHQL----SWESA 382
              DG+NGF+ A  + +A++  +  +D   ++L    +W+SA
Sbjct: 221 SIIDGKNGFIVANQREMAQQIQSVLSDPYLYELLQKNAWQSA 262


>gi|366087881|ref|ZP_09454366.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           zeae KCTC 3804]
          Length = 505

 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 70/147 (47%), Gaps = 9/147 (6%)

Query: 189 SAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELL 248
            A  + +P + V  +  V      + +K  E   + N    K   Y+GR+   +  ++++
Sbjct: 291 DAIVKRFPTAKVTAIPAVTIDPDTLPQKPAEATTDVN---NKKLMYVGRIAEERQLDQMI 347

Query: 249 GLLNIYHKELAGLEMDLYGNGE-DFD-QIQRAAKKLKLVVRVYPGRDHA---DPIFHDYK 303
             L + + ++ G+E+DLYG G+ D+  +++   K LKL  +V+  +D+    D  +  Y 
Sbjct: 348 RALGLVNSKIPGVELDLYGYGDADYQKKLEALTKDLKLDKQVH-FKDYVTDLDQRYEQYT 406

Query: 304 VFLNPSTTDVVCTATAEALAMGKIVVC 330
           + LN + TD    A  EA   G  V+ 
Sbjct: 407 MLLNTAATDGGPLAILEAQTHGLPVIS 433


>gi|256004327|ref|ZP_05429309.1| glycosyl transferase group 1 [Clostridium thermocellum DSM 2360]
 gi|385778250|ref|YP_005687415.1| group 1 glycosyl transferase [Clostridium thermocellum DSM 1313]
 gi|419722144|ref|ZP_14249292.1| glycosyl transferase group 1 [Clostridium thermocellum AD2]
 gi|419724241|ref|ZP_14251309.1| glycosyl transferase group 1 [Clostridium thermocellum YS]
 gi|255991761|gb|EEU01861.1| glycosyl transferase group 1 [Clostridium thermocellum DSM 2360]
 gi|316939930|gb|ADU73964.1| glycosyl transferase group 1 [Clostridium thermocellum DSM 1313]
 gi|380772247|gb|EIC06099.1| glycosyl transferase group 1 [Clostridium thermocellum YS]
 gi|380781715|gb|EIC11365.1| glycosyl transferase group 1 [Clostridium thermocellum AD2]
          Length = 381

 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 59/123 (47%), Gaps = 7/123 (5%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR+   K Y+ L+   +    E +   +++YG G + +++Q+   +L L  R+      
Sbjct: 201 VGRLSNQKNYDLLIDAFSDIANEYSEYRLEIYGEGPERERLQKKINRLGLNDRIQLMGTK 260

Query: 295 ADPIFH--DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN 352
            + + H  D ++F+  S  +    A AEA+A G  V+  N PS    K+       DG N
Sbjct: 261 TNVMKHIADAQLFVMSSNFEGFPNALAEAMASGLPVISTNFPSG-VAKEL----IIDGEN 315

Query: 353 GFV 355
           G+V
Sbjct: 316 GYV 318


>gi|300728264|ref|ZP_07061632.1| glycosyltransferase, group 1 family [Prevotella bryantii B14]
 gi|299774499|gb|EFI71123.1| glycosyltransferase, group 1 family [Prevotella bryantii B14]
          Length = 366

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 60/130 (46%), Gaps = 12/130 (9%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VVR 287
           K A  +GR+ W KG++ L+    + +++     +D+YG G++ + +Q     L L  V+ 
Sbjct: 188 KRAIAVGRLTWEKGFDYLIDAWKLVNQKYKDWVLDIYGEGDEREVLQHRIDSLGLRDVIH 247

Query: 288 VYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV--CANHPSNDFFKQFPNC 345
           +     H +  +  + VF+  S ++    A  EA A G  +V    N   ++  +     
Sbjct: 248 LCGNSSHIEKEYASHSVFIMSSRSEGFPLALIEASACGLPLVSFACNQGVSEIIQ----- 302

Query: 346 RTYDGRNGFV 355
              DG NGF+
Sbjct: 303 ---DGYNGFL 309


>gi|288963135|ref|YP_003453414.1| glycosyltransferase [Azospirillum sp. B510]
 gi|288915387|dbj|BAI76870.1| glycosyltransferase [Azospirillum sp. B510]
          Length = 397

 Score = 43.1 bits (100), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 75/178 (42%), Gaps = 31/178 (17%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLV--VRVYPGR 292
           +GR+V  KG + L+  L     ELAG+E+D+ G+G     ++  A +L L   VR++   
Sbjct: 217 VGRLVAQKGIDVLIDALA--RPELAGVELDVVGDGGWRTALEEQAARLGLAGRVRLHGWL 274

Query: 293 DHA--DPIFHDYKVFLNPSTTDVVCTATAEALAMG-KIVVCANHPSNDFFKQFPNCRTYD 349
           D A    ++    +F+ PS  + +     EA+A G  +V  A   + D           +
Sbjct: 275 DRAVLAGLYRTVDIFVLPSRDEGMPNVVLEAMASGLPVVASAVAGARDL--------VVE 326

Query: 350 GRNGFV------EATLKALAEEPAQPTDAQT----------HQLSWESATERFLQVAE 391
           G  GF+      +A   AL    A P   +              SW SA   FL++ E
Sbjct: 327 GETGFLVPPEQPDALAGALLRLTADPMARRALGDRGRRRVEEHFSWRSAALSFLELVE 384


>gi|422525025|ref|ZP_16601033.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL053PA2]
 gi|315077127|gb|EFT49194.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL053PA2]
          Length = 335

 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 66/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D +++ A    +    +  GR
Sbjct: 134 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKQQAGDAPVFFNGFVVGR 193

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 194 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 253

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 254 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 294


>gi|422456864|ref|ZP_16533527.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL030PA1]
 gi|315106101|gb|EFT78077.1| glycosyltransferase, group 1 family [Propionibacterium acnes
           HL030PA1]
          Length = 405

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 65/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D ++  A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKPQAGDAPVFFNGFVAGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 364


>gi|50843281|ref|YP_056508.1| glycosyl transferase family protein [Propionibacterium acnes
           KPA171202]
 gi|335055046|ref|ZP_08547839.1| glycosyltransferase, group 1 family protein [Propionibacterium sp.
           434-HC2]
 gi|387504192|ref|YP_005945421.1| glycosyl transferase family protein [Propionibacterium acnes 6609]
 gi|50840883|gb|AAT83550.1| glycosyl transferase [Propionibacterium acnes KPA171202]
 gi|333762992|gb|EGL40466.1| glycosyltransferase, group 1 family protein [Propionibacterium sp.
           434-HC2]
 gi|335278237|gb|AEH30142.1| glycosyl transferase [Propionibacterium acnes 6609]
          Length = 405

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 65/161 (40%), Gaps = 5/161 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-PGR 292
           Y+GRM   K  +  +      H+    L +D+YG G D D ++  A    +    +  GR
Sbjct: 204 YVGRMSHEKSPQLAVAAALELHRRGVPLRLDMYGVGPDTDAMKPQAGDAPVFFNGFVAGR 263

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH-PSNDFFKQFPNCRTYDGR 351
           D     F    + ++    +    A+ EALA G  VV AN   +++              
Sbjct: 264 DEVARRFAAADISMSVCPAETFGLASLEALACGTPVVTANRGGAHEIVDVTSGEAGSPDA 323

Query: 352 NGFVEATLKALAEEPAQPTDAQTH---QLSWESATERFLQV 389
            G  +AT +  A    +  DA  H   Q +W+++ E  L +
Sbjct: 324 VGLADATQRLAARLGPELRDAARHRAEQFTWDASVETMLMI 364


>gi|339626692|ref|YP_004718335.1| group 1 glycosyl transferase [Sulfobacillus acidophilus TPY]
 gi|379006145|ref|YP_005255596.1| group 1 glycosyl transferase [Sulfobacillus acidophilus DSM 10332]
 gi|339284481|gb|AEJ38592.1| glycosyl transferase group 1 [Sulfobacillus acidophilus TPY]
 gi|361052407|gb|AEW03924.1| glycosyl transferase group 1 [Sulfobacillus acidophilus DSM 10332]
          Length = 375

 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 11/128 (8%)

Query: 218 MEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQR 277
           M Q+  G +     A Y+GR+   KG E L  L ++       L +   G+G     ++R
Sbjct: 189 MRQRLTGGQGDRPIALYVGRLALEKGLERLRVLFSVNPD----LHLAFVGDGPARPDLER 244

Query: 278 AAKKLKLVVRVYPGRDHADPIFHDY---KVFLNPSTTDVVCTATAEALAMG-KIVVCANH 333
              +       + G  H + +   Y    VF+ PSTTD +     EA+A G  IV   + 
Sbjct: 245 LFAETPTT---FVGTLHGEQLAEAYASADVFVFPSTTDTLGLVLLEAMASGLPIVAAESR 301

Query: 334 PSNDFFKQ 341
           P+++   Q
Sbjct: 302 PTHELVDQ 309


>gi|316935612|ref|YP_004110594.1| glycosyl transferase group 1 protein [Rhodopseudomonas palustris
           DX-1]
 gi|315603326|gb|ADU45861.1| glycosyl transferase group 1 [Rhodopseudomonas palustris DX-1]
          Length = 385

 Score = 42.7 bits (99), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 11/107 (10%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG-- 291
           +GR+V  KG+  LL  L     EL      + G G + D +QR A +L +  R+  PG  
Sbjct: 201 MGRLVPYKGFAVLLEALTQIDGELV-----IIGEGAERDNLQRLAARLGVSDRLQLPGFL 255

Query: 292 -RDHADPIFHDYKVFLNPSTT--DVVCTATAEALAMGKIVVCANHPS 335
            RD     FH  K+F  PS T  +       EA+A+G  VV    P+
Sbjct: 256 PRDEVKAYFHAAKLFTLPSVTIAEAFGLVQIEAMAVGLPVVNTALPT 302


>gi|398831938|ref|ZP_10590106.1| glycosyltransferase [Herbaspirillum sp. YR522]
 gi|398223949|gb|EJN10276.1| glycosyltransferase [Herbaspirillum sp. YR522]
          Length = 348

 Score = 42.7 bits (99), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 73/173 (42%), Gaps = 22/173 (12%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   K  E  L L      +L G +  + G+G   +++ R   +++ +     G D
Sbjct: 184 YVGRVAVEKNIEAFLRL------DLPGTKW-VVGDGPQCEELARRYPQVRFLG--AKGHD 234

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNG 353
                ++   VF+ PS TD       EA+A G        P   +  + P      G +G
Sbjct: 235 ELPAYYNCADVFVFPSKTDTFGLVLLEAMACGI-------PVAAYPVEGPIDVVEHGVSG 287

Query: 354 FVEATLK-----ALAEEPAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRS 401
            ++A LK     A+  +P     A   Q SWESAT++FL    LV      R+
Sbjct: 288 ILDADLKRACLQAIGLDPVA-VRAHAEQRSWESATQQFLHHLHLVRRPAAARA 339


>gi|336425370|ref|ZP_08605392.1| hypothetical protein HMPREF0994_01398 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336012340|gb|EGN42260.1| hypothetical protein HMPREF0994_01398 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 376

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 85/191 (44%), Gaps = 26/191 (13%)

Query: 176 WLARVHCHKVIRLSAATQE----YPNSI----VCNVHGVNPKFLEIGEKKMEQQQNGNKA 227
           +LAR +  K   +   T+E    +P+ I    V   + +NP F+        ++  G + 
Sbjct: 144 FLARTYFAKAAGVVLQTKEAFSFFPDRIRKKAVILKNPLNPSFIR-------ERFEGERE 196

Query: 228 FTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVR 287
                + +GRM  +K +E +L   +   ++  G  + +YGNGE  + + + A ++ L  R
Sbjct: 197 LC--IHAVGRMDANKNHEMILRAFSGLAEDFPGARLVIYGNGECREPLLKLAGQIGLADR 254

Query: 288 V-YPG--RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPN 344
           V  PG   D AD I+     F+  S ++ +     EA+ +G  V+  + P        P 
Sbjct: 255 VSLPGAVTDVADKIYKS-SAFVLSSFSEGMPNTLIEAMCLGLPVISTDCPCGG-----PA 308

Query: 345 CRTYDGRNGFV 355
               DG NG++
Sbjct: 309 ELIQDGVNGYL 319


>gi|409197527|ref|ZP_11226190.1| glycosyltransferase [Marinilabilia salmonicolor JCM 21150]
          Length = 389

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 41/161 (25%), Positives = 71/161 (44%), Gaps = 20/161 (12%)

Query: 219 EQQQNGNKAFTKGAY-------YIGRMVWSKGYEELLGLLNI--YHKELAGLEMDLYGNG 269
           E+ + G   F K  Y       ++GR+  +KG   +L  L    +H  +AG+     G+G
Sbjct: 183 EEIKEGRTVFEKKDYSGSLTFCFVGRLESAKGVGHILSALKKIGFHNRIAGIH--FVGDG 240

Query: 270 EDFDQIQRAAKKLKLVVRV--YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKI 327
            + +     +K +++ V    +  RD    I     VFL PS ++      AEA   G I
Sbjct: 241 PEKEVFVEQSKGIEISVFFHGFMSRDKVGEILEKSHVFLLPSASEGFPKVVAEAANYGCI 300

Query: 328 VVCANHPSNDFFKQFPNCRTYDGRNGFVEATLKALAEEPAQ 368
            + +N  S     Q+     +DG+NG++    + +A+E A 
Sbjct: 301 PLVSNVSS---LGQY----IFDGKNGYIVDYDEKIADELAN 334


>gi|320162016|ref|YP_004175241.1| putative glycosyltransferase [Anaerolinea thermophila UNI-1]
 gi|319995870|dbj|BAJ64641.1| putative glycosyltransferase [Anaerolinea thermophila UNI-1]
          Length = 401

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 17/128 (13%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGRD 293
           +GR+ W KGYE  L  + I  ++   +   + G+GE    ++ A  +L L   V + G+ 
Sbjct: 227 VGRLHWVKGYEYALSAIAILKQQQIPVRYCIIGDGEFRPALEFAVHQLGLQDCVTFLGKQ 286

Query: 294 HADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCANH---PSNDFFKQFPNCRT 347
             D I  + +   VFL+P+  +    A  EA AM   VVC++    P N           
Sbjct: 287 SQDVIRRELEKADVFLHPAVEEGFGNAVLEAQAMEVPVVCSDAVGLPEN----------V 336

Query: 348 YDGRNGFV 355
            DG  GFV
Sbjct: 337 QDGVTGFV 344


>gi|223477767|ref|YP_002582081.1| galactosyltransferase [Thermococcus sp. AM4]
 gi|214032993|gb|EEB73821.1| galactosyltransferase [Thermococcus sp. AM4]
          Length = 370

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 57/126 (45%), Gaps = 9/126 (7%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----Y 289
           + GR++  KG + LL  L    +E+  + + + G+G +  +++R AK L L   V    +
Sbjct: 201 FAGRLIPEKGVDLLLRALAEVKREIPDVRVVIIGDGPERKRLERMAKGLGLEKNVLFTGF 260

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
              ++   +    KVF+ PS  +       EA+A G  VV  + P N             
Sbjct: 261 LSYENVIALMKASKVFVLPSKREGFGIVVLEAMASGLPVVTLDEPMNA-----AKFLVEK 315

Query: 350 GRNGFV 355
           G+NGFV
Sbjct: 316 GKNGFV 321


>gi|149372911|ref|ZP_01891908.1| amylovoran biosynthesis glycosyl transferase AmsK [unidentified
           eubacterium SCB49]
 gi|149354404|gb|EDM42970.1| amylovoran biosynthesis glycosyl transferase AmsK [unidentified
           eubacterium SCB49]
          Length = 358

 Score = 42.7 bits (99), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 11/132 (8%)

Query: 237 RMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHAD 296
           RM W K  E  +    +  ++      DLYG+G D  Q+     K +L   V      A+
Sbjct: 188 RMTWEKNIEGCIKFAALLKEKNIPFSYDLYGDGRDLGQLYLLVDKYELSKEVTIHGSVAN 247

Query: 297 PIFHD----YKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRT----- 347
           PI  +    Y  F+  S ++ + T+  EA ++G  + C    S    +   N +T     
Sbjct: 248 PILKERLSSYDFFVQLSISEALPTSVLEAQSLG--IPCIVSNSGGLPEAVINLKTAIVQE 305

Query: 348 YDGRNGFVEATL 359
           YD     VE T+
Sbjct: 306 YDAIESLVEQTV 317


>gi|448395469|ref|ZP_21568740.1| glycosyl transferase group 1 [Haloterrigena salina JCM 13891]
 gi|445661293|gb|ELZ14083.1| glycosyl transferase group 1 [Haloterrigena salina JCM 13891]
          Length = 378

 Score = 42.7 bits (99), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 56/242 (23%), Positives = 96/242 (39%), Gaps = 27/242 (11%)

Query: 170 LEFVNSWLARV--HCHKVIRLSAATQEYPNSIVC----NVHGVNPKFLEI-------GEK 216
           L++ + +L R+      V  L+A T  YP +I       +  ++P   EI          
Sbjct: 131 LDYWDEYLGRLAPFGKAVEGLTAKTPHYPIAISSVTADRLTEISPSRDEIEIVPNGIDVD 190

Query: 217 KMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQ 276
           ++E  Q     F     Y GR++  K  + LL   +   +    + + + G+G + D ++
Sbjct: 191 QIEATQPAEDGFD--VLYAGRLIEHKNVDYLLEAFDRIAETDPNITLGIIGDGPERDALE 248

Query: 277 RAAKKLKLVVRVYPG---RDHADPIFH--DYKVFLNPSTTDVVCTATAEALAMGKIVVCA 331
           R A+ L    RV       ++ D + H     VF +PST +      AEA+A    V+ A
Sbjct: 249 RQAQNLTHADRVTMLGFLEEYDDVLAHMRAADVFASPSTREGFGITFAEAMAADCTVIAA 308

Query: 332 NHPSNDFFK-----QFPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERF 386
            HP +   +      F    T D     +E  L    E P     A+  +  W++  E+ 
Sbjct: 309 EHPESAASEVIDDAGFLASPTVDDVTDVLERALS--GERPNTEPTARAQRYDWDAVAEQA 366

Query: 387 LQ 388
            Q
Sbjct: 367 EQ 368


>gi|383807478|ref|ZP_09963038.1| hypothetical protein IMCC13023_10000 [Candidatus Aquiluna sp.
           IMCC13023]
 gi|383298832|gb|EIC91447.1| hypothetical protein IMCC13023_10000 [Candidatus Aquiluna sp.
           IMCC13023]
          Length = 482

 Score = 42.4 bits (98), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 63/128 (49%), Gaps = 12/128 (9%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV--- 288
           A ++GR+   K    LL  ++   K    L++ L G+G + ++++R A  L +  RV   
Sbjct: 213 ALFVGRLDNEKRINILLNAISKLDK-FPNLKLRLVGDGGERERLERQAGLLGISDRVEFL 271

Query: 289 -YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRT 347
            +        I+ +  VF+ PS  ++   AT EA+A G+ V+ A+  +       P+   
Sbjct: 272 GHVTDQELAGIYENCTVFVMPSIAELQSIATMEAMASGRPVIAADAMA------LPHL-V 324

Query: 348 YDGRNGFV 355
           +DG NG++
Sbjct: 325 HDGDNGYL 332


>gi|239638186|ref|ZP_04679138.1| glycosyl transferase, group 1 family protein [Staphylococcus
           warneri L37603]
 gi|239596207|gb|EEQ78752.1| glycosyl transferase, group 1 family protein [Staphylococcus
           warneri L37603]
          Length = 381

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 29/124 (23%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGL--EMDLYGNGEDFDQIQRAAKKLKL--VVRVYPG 291
           GR+ + KG++ LL  + +   +L  L  E+ LYG+G++ + +++   + +L  +V +YP 
Sbjct: 213 GRLEYEKGFDLLLESIRLIQDDLRHLNYELHLYGDGQEKEHLKQFIDQYQLQDIVHLYPA 272

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGR 351
               +      K+ + PS  +       EA+A   IV+  +  +       P+     G+
Sbjct: 273 TPSLNQKLAQSKITVIPSRNEGFGMVILEAMAQDNIVISFDGNTG------PDSIIKSGK 326

Query: 352 NGFV 355
           NG++
Sbjct: 327 NGYL 330


>gi|32474014|ref|NP_867008.1| capsular polysaccharide biosynthesis protein [Rhodopirellula
           baltica SH 1]
 gi|32444551|emb|CAD74550.1| capsular polysaccharide biosynthesis protein [Rhodopirellula
           baltica SH 1]
          Length = 374

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 31/127 (24%), Positives = 55/127 (43%), Gaps = 2/127 (1%)

Query: 208 PKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYG 267
           PK      K + ++Q+           +GR+   KG++ L+  L     +    E+ ++G
Sbjct: 172 PKEFRTRLKPITERQSDPGQAKCHLVAVGRLEREKGFDRLIRSLGDLPDDAPDWELTIHG 231

Query: 268 NGEDFDQIQRAAKKLKLVVRV-YPGRDHAD-PIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            G   D ++  A KL L  RV +PG      P++     F+ PS  +   +A  EA+A  
Sbjct: 232 EGSQRDHLESLASKLNLASRVKFPGWTRPIWPVYQSADCFVLPSRYEGFPSALLEAMACS 291

Query: 326 KIVVCAN 332
             V+  +
Sbjct: 292 TAVLTVD 298


>gi|260582796|ref|ZP_05850582.1| glycosyltransferase [Haemophilus influenzae NT127]
 gi|260094122|gb|EEW78024.1| glycosyltransferase [Haemophilus influenzae NT127]
          Length = 353

 Score = 42.0 bits (97), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 85/192 (44%), Gaps = 18/192 (9%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAA--TQEYPNSIVCNVHGV-NPKFLEIGEKKMEQQQNGN 225
           L+  +  +LA   C K++ L+ A  TQ        N+  + NP  L + + K+ + +N  
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTQWQEKFKTNNIISIANPNTL-LPKNKLAKWEN-- 181

Query: 226 KAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL- 284
               K    +G +   KG++ LL +  +  K+     + + G+GE+ + +++ AK L + 
Sbjct: 182 ----KTILSVGHLFSYKGFDYLLKVWQVLAKKYPDWNLKIVGSGEEEENLKKLAKALDIE 237

Query: 285 -VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFP 343
             V   P  +     +    ++  PS T+ +     EA+A G  +V  N  S    KQ  
Sbjct: 238 NSVNFIPRTNDVAFYYESSSIYCLPSQTEGLPLVLIEAMAFGLPIVAFNCSSG--VKQLV 295

Query: 344 NCRTYDGRNGFV 355
             +T    NGF+
Sbjct: 296 ENKT----NGFL 303


>gi|194476900|ref|YP_002049079.1| SqdX [Paulinella chromatophora]
 gi|171191907|gb|ACB42869.1| SqdX [Paulinella chromatophora]
          Length = 385

 Score = 42.0 bits (97), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 63/259 (24%), Positives = 105/259 (40%), Gaps = 30/259 (11%)

Query: 109 ITEIIPDEEADIAVLEEPEHLTWFHHGKRWKAKFRFV--VGIVHTNYLEYVKREKNDRLQ 166
           + EI+ D   D+  +  P  L     G  W AK + +  V   HT+  +Y++      L+
Sbjct: 75  VAEILEDFCPDLVHVVNPAVLGL---GGIWLAKTKNLPLVASYHTHLPKYLEHYGIGMLE 131

Query: 167 AFLLEFVNSWLARVHCHKVIRLSAAT---QEYPNSIVCNV----HGVNPKFL--EIGEKK 217
             L E + +     H   V+ L  +T   +E  N  + +      GV+      E+    
Sbjct: 132 PLLWELLKA----AHNQAVLNLCTSTAMVEELSNKGIQHTALWQRGVDTDLFRPELRSNV 187

Query: 218 MEQQQNGNKAFTKGAY-YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQ 276
           M ++    ++ T     YIGR+   K  E +L +L    + L    + L G+G    Q++
Sbjct: 188 MRERLLDGRSDTGALLLYIGRLSAEKQIERILPVL----EALPETRLALVGDGPHRQQLE 243

Query: 277 RAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
           R  K        Y G +     +     FL PS+T+ +     EA+A G  VV AN    
Sbjct: 244 RIFKDTATHFVGYLGGEELASAYASADAFLFPSSTETLGLVLLEAMAAGCPVVGANRGG- 302

Query: 337 DFFKQFPNCRTYDGRNGFV 355
                 P+  T +G NG++
Sbjct: 303 -----IPDIVT-NGVNGYL 315


>gi|323497831|ref|ZP_08102844.1| glycosyltransferase WfaO [Vibrio sinaloensis DSM 21326]
 gi|323317103|gb|EGA70101.1| glycosyltransferase WfaO [Vibrio sinaloensis DSM 21326]
          Length = 356

 Score = 42.0 bits (97), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 60/153 (39%), Gaps = 16/153 (10%)

Query: 190 AATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLG 249
           A    Y +  + NV  V P F        E   N      K   ++GR    KG + L+ 
Sbjct: 148 ADKHRYYDKFLKNV-AVIPNFTNFASVDAEYSHN------KKVLFVGRYNDMKGVDYLVD 200

Query: 250 LLNIYHKELAGLEMDLYGNGED----FDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVF 305
           ++N+ HK   G E  LYG GE      +QIQ+    L  VV V    ++    +    +F
Sbjct: 201 IINLVHKHCPGWEFTLYGEGERKAWLLEQIQKM--NLSHVVEVNEPTENISVEYQKSGIF 258

Query: 306 LNPSTTDVVCTATAEALAMGKIVV---CANHPS 335
           +  S  +       EA A G  VV   C   PS
Sbjct: 259 ILTSRNEGFPMVLLEAQAHGLPVVSFDCETGPS 291


>gi|300115049|ref|YP_003761624.1| group 1 glycosyl transferase [Nitrosococcus watsonii C-113]
 gi|299540986|gb|ADJ29303.1| glycosyl transferase group 1 [Nitrosococcus watsonii C-113]
          Length = 355

 Score = 42.0 bits (97), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 52/102 (50%), Gaps = 4/102 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLY--GNGEDFDQIQRAAKKLKLVVRVYPGR 292
           +GR+   KG+E+LL    +  K +AG  + L   G+G    ++Q  A  L+L  RV+ G 
Sbjct: 181 VGRLHPVKGFEDLLAAFALLPKSIAGRPVHLLIAGDGPLRQKLQIEAANLRLEGRVHWGG 240

Query: 293 DHADP-IFHDY-KVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
              +P +F+    +F+ PS  + +     EA + GK V+  N
Sbjct: 241 WQQNPSVFYQLADIFICPSRHEPLGNVILEAWSHGKPVIATN 282


>gi|400974762|ref|ZP_10801993.1| putative glycosyltransferase [Salinibacterium sp. PAMC 21357]
          Length = 414

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 66/138 (47%), Gaps = 14/138 (10%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----Y 289
           ++GR+   K  ++L+  + I    L   ++++ G GE   +++  A  L L  RV    +
Sbjct: 227 FLGRLADEKQIDKLIRAVAIMDPAL-NTQLEIVGGGELEGKLRALAASLGLADRVTLTGF 285

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
             +D          VF  PST ++   +T EA+A G  VV AN  +       P+   +D
Sbjct: 286 VEQDQLRDALQRGSVFAMPSTAELQSISTMEAMASGLPVVAANAMA------LPHL-VHD 338

Query: 350 GRNGFV--EATLKALAEE 365
           G NG++    +++ LAE+
Sbjct: 339 GENGYLFDPTSIEDLAEK 356


>gi|268323391|emb|CBH36979.1| putative glycosyl transferase, family 1 [uncultured archaeon]
          Length = 375

 Score = 42.0 bits (97), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 16/130 (12%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG- 291
           + GR++  K  + L+  L +  KE+  ++  + G G +  +++  A  L L   V + G 
Sbjct: 203 FAGRLIKEKNVDVLIKALKLVKKEMPEVKCKIIGEGPEKPKLENLAYDLGLESNVEFTGF 262

Query: 292 -RDHADPIFH--DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN---DFFKQFPNC 345
             D+ D I H    KV + PST +    A  EA A G  VV  NH  N   DF ++    
Sbjct: 263 LGDYNDVISHMKSSKVLMLPSTREGFGIAALEANACGLPVVTVNHKMNAVCDFIER---- 318

Query: 346 RTYDGRNGFV 355
                RNGF+
Sbjct: 319 ----DRNGFI 324


>gi|407003868|gb|EKE20382.1| mannosyltransferase B-like protein [uncultured bacterium]
          Length = 382

 Score = 42.0 bits (97), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 90/213 (42%), Gaps = 44/213 (20%)

Query: 204 HGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEM 263
             V+P+ LE+  KK    +       K   Y+G +   K  ++L+       K+L  +++
Sbjct: 184 QDVSPEKLELVRKKYALPK-------KFVLYVGTLQPRKNLDQLVMAFGNVQKDLGDVDL 236

Query: 264 DLYGN--GEDFD-QIQRAAKKLKLVVRVY-PGR-DHAD--PIFHDYKVFLNPSTTDVVCT 316
            + GN  G++FD +I  A ++L L  +V+ PG  D  D   IF    VF  PS  +    
Sbjct: 237 VICGNRKGKNFDSRIDTAVQELGLGDKVFFPGFIDEEDKRAIFASAHVFAFPSLYEGFGI 296

Query: 317 ATAEALAMGKIVVCANHPS-----------------NDFFKQFPN-CRTYDGRNGFVEAT 358
              EA++ G  V+C+N  S                 +DF K+  + C+  D RN  +   
Sbjct: 297 PPLEAMSQGVPVICSNISSLKEIATDGALYFEVSSLDDFSKKLYDICKDEDLRNKLISNG 356

Query: 359 LKALAEEPAQPTDAQTHQLSWESATERFLQVAE 391
            K ++              SW+ + ++ L + E
Sbjct: 357 KKRIS------------FFSWQKSAQKMLAIYE 377


>gi|182676823|ref|YP_001830931.1| glycosyl transferase group 1 [Beijerinckia indica subsp. indica
           ATCC 9039]
 gi|182636670|gb|ACB97442.1| glycosyl transferase group 1 [Beijerinckia indica subsp. indica
           ATCC 9039]
          Length = 380

 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 66/149 (44%), Gaps = 18/149 (12%)

Query: 258 LAGLEMDLYGNGEDFDQIQRAAKK--LKLVVRVYPGRDHAD--PIFHDYKVFLNPS---T 310
           + G   ++ GNG D+D IQ    K  LK  VR+    D AD   ++H   +FL+P    +
Sbjct: 218 MNGFIYEIAGNGPDYDAIQALINKRGLKPYVRLLGRVDDADMPALYHAADIFLHPQIDLS 277

Query: 311 TDV--VCTATAEALAMGKIVVCA-NHPSNDFFKQFPNCRTYDG-RNGFVEATLKALAEEP 366
            DV       A+A++ G  VV   +    DF          DG ++  +   +K L + P
Sbjct: 278 GDVEGFGLTIADAMSFGAAVVAGRDGGPADFVIDGATGLLVDGTKDTEIAKAIKLLIDNP 337

Query: 367 A------QPTDAQTH-QLSWESATERFLQ 388
           A      +  +   H  LSW +A ER ++
Sbjct: 338 ALRTKLGEAAERWVHDHLSWSAAAERIIE 366


>gi|428218224|ref|YP_007102689.1| group 1 glycosyl transferase [Pseudanabaena sp. PCC 7367]
 gi|427990006|gb|AFY70261.1| glycosyl transferase group 1 [Pseudanabaena sp. PCC 7367]
          Length = 379

 Score = 42.0 bits (97), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 69/163 (42%), Gaps = 15/163 (9%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRA-AKKLKLVVRVYPGR 292
           Y+GR+   KG +E+L +L    + +    + L G+G    ++++  A      V    G 
Sbjct: 205 YVGRLSAEKGIDEILPVL----QAIPNSRLALVGDGPYRQELEKIFAGTNTNFVGYLQGT 260

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTYD-- 349
           D A   F    VFL PS T+ +     EA+A G  VV AN     D      N   +D  
Sbjct: 261 DLAS-AFASSDVFLFPSRTETLGLVLLEAMAAGCPVVAANSGGIPDIVTNGLNGYMFDPI 319

Query: 350 GRNGFVEATLKALAEEPAQPTD------AQTHQLSWESATERF 386
             +G + AT + L   PA   D       +  +  WE+AT + 
Sbjct: 320 AEDGLLTATQQLLDCSPAIADDLKHNARLEAEKWGWEAATSQL 362


>gi|126667453|ref|ZP_01738424.1| Glycosyl transferase, group 1 [Marinobacter sp. ELB17]
 gi|126628045|gb|EAZ98671.1| Glycosyl transferase, group 1 [Marinobacter sp. ELB17]
          Length = 386

 Score = 42.0 bits (97), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 27/106 (25%), Positives = 52/106 (49%), Gaps = 2/106 (1%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLV--VRVYPG 291
           +IG+M+  KG  +L+ + +  +++   L + L G+G    +++R AK L  V  V     
Sbjct: 201 FIGQMIPRKGIPDLIEVFDQLYQQEPDLRLQLLGDGSQRQELERQAKTLSSVNAVEFLGF 260

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSND 337
           R     +   + +F+  S+ + +     EA+A+G  VV  + P  D
Sbjct: 261 RSDRLALLSKFSLFVMTSSLEGIPRCMMEAMAVGVPVVAYDIPGVD 306


>gi|332529608|ref|ZP_08405564.1| glycosyl transferase, group 1 [Hylemonella gracilis ATCC 19624]
 gi|332040958|gb|EGI77328.1| glycosyl transferase, group 1 [Hylemonella gracilis ATCC 19624]
          Length = 326

 Score = 41.6 bits (96), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 77/176 (43%), Gaps = 32/176 (18%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRA--AKKLKLVVRVYPGR 292
           IGR+V  KG+ +++  L I   E +   + + G G+  D+I+    A+ L  +V +    
Sbjct: 154 IGRLVLRKGFSDIIEALAILKAERSDFHLTIVGYGKKKDEIETLLDARGLNGMVTMVGRV 213

Query: 293 DHAD-----PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRT 347
           ++A+        H Y  + N   + +   A  EA++ G  V+ ++HP N  + +      
Sbjct: 214 EYAELERYYLASHAYLFYGNREGSSL---AMIEAVSYGLPVLASDHPGNRAYVK------ 264

Query: 348 YDGRNGF---------VEATLKALAE------EPAQPTDAQTHQLSWESATERFLQ 388
            DG NGF         + A ++ L E      E    + A     SW+S   R+L+
Sbjct: 265 -DGYNGFLVSYGDPLALAARMRHLLEKREAIRELGAHSVAMAETFSWKSIAARYLE 319


>gi|392395302|ref|YP_006431904.1| glycosyltransferase [Desulfitobacterium dehalogenans ATCC 51507]
 gi|390526380|gb|AFM02111.1| glycosyltransferase [Desulfitobacterium dehalogenans ATCC 51507]
          Length = 388

 Score = 41.6 bits (96), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 2/100 (2%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLV-VRVYPG- 291
           ++GR V+ K ++ L+ + +  HKE A   + + G+GE   +I++ A+   L    ++ G 
Sbjct: 215 HVGRFVYQKNHDFLIDIFSEIHKENADAVLVMVGSGELEQEIRQKAESFGLTGSALFLGI 274

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA 331
           R++ + I     VFL PS  + +  A  EA A G + + +
Sbjct: 275 RNNVNDILQAMDVFLFPSHYEGLGMAVIEAQAAGLLTIVS 314


>gi|126178643|ref|YP_001046608.1| glycosyl transferase, group 1 [Methanoculleus marisnigri JR1]
 gi|125861437|gb|ABN56626.1| glycosyl transferase, group 1 [Methanoculleus marisnigri JR1]
          Length = 360

 Score = 41.6 bits (96), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 80/189 (42%), Gaps = 9/189 (4%)

Query: 185 VIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGY 244
           VI L+   ++   +I      V P  +++ E    + + G+    K   ++GR+   KG 
Sbjct: 141 VIALTEHMKDSMQAIYSRDVVVVPNGIDLNENAEREAERGDPG--KRVLFVGRLHPVKGV 198

Query: 245 EELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR---DHADPIFH 300
             LL  ++I H++L   ++ L G+G++ + ++     L +   V + G+   +      +
Sbjct: 199 RHLLQAMSIVHQDLPEAKLILVGDGDEREHLETLTDSLGIRECVEFVGKVPHERVQDYMN 258

Query: 301 DYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTYDGRNG--FVEA 357
             + F+ PS ++       EA+A G  VV        D  +   N    D  N     EA
Sbjct: 259 QVEAFVLPSLSEGFPVTILEAMACGLPVVATRVGGIPDIIEDGTNGYLVDAMNQERMAEA 318

Query: 358 TLKALAEEP 366
            LK L  EP
Sbjct: 319 LLKVLRNEP 327


>gi|385840553|ref|YP_005863877.1| Glycosyltransferase [Lactobacillus salivarius CECT 5713]
 gi|300214674|gb|ADJ79090.1| Glycosyltransferase [Lactobacillus salivarius CECT 5713]
          Length = 328

 Score = 41.6 bits (96), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 49/109 (44%), Gaps = 6/109 (5%)

Query: 219 EQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRA 278
           E+    NK F       GRMV  K ++ L+    I+ ++    ++ L G+GED   I+  
Sbjct: 203 EKSDMKNKVFLAA----GRMVHVKAFDNLVEAFKIFAQKNPDWKLLLVGDGEDLPMIRDK 258

Query: 279 AKKLKLVVRVY-PGR-DHADPIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            KK  L  R+  PG+ D     F    V L PS  + +     EAL MG
Sbjct: 259 IKKYGLEQRIITPGKTDDIKKYFLQSSVLLLPSRWEGMPMIGLEALEMG 307


>gi|77164101|ref|YP_342626.1| group 1 glycosyl transferase [Nitrosococcus oceani ATCC 19707]
 gi|254435938|ref|ZP_05049445.1| glycosyl transferase, group 1 family protein [Nitrosococcus oceani
           AFC27]
 gi|76882415|gb|ABA57096.1| Glycosyl transferase, group 1 [Nitrosococcus oceani ATCC 19707]
 gi|207089049|gb|EDZ66321.1| glycosyl transferase, group 1 family protein [Nitrosococcus oceani
           AFC27]
          Length = 355

 Score = 41.6 bits (96), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 52/102 (50%), Gaps = 4/102 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLY--GNGEDFDQIQRAAKKLKLVVRVYPGR 292
           +GR+   KG+E+LL    +  + +AG  + L   G+G    ++Q  A  L+L  RV+ G 
Sbjct: 181 VGRLHPVKGFEDLLAAFALLPRSIAGRPVHLLIAGDGPLRQELQVKAANLRLEGRVHWGG 240

Query: 293 DHADP-IFHDY-KVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
              +P +F+    +F+ PS  + +     EA + GK V+  N
Sbjct: 241 WQQNPSVFYQLADIFICPSRHEPLGNVILEAWSHGKPVIATN 282


>gi|384135007|ref|YP_005517721.1| group 1 glycosyl transferase [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
 gi|339289092|gb|AEJ43202.1| glycosyl transferase group 1 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
          Length = 383

 Score = 41.6 bits (96), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 67/297 (22%), Positives = 111/297 (37%), Gaps = 56/297 (18%)

Query: 138 WKAKFRFVVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPN 197
           WK+    V    HTN   Y +  K + L+  L  +  +   R H +      A  +E   
Sbjct: 104 WKSHLPLVAS-YHTNVPAYARHYKLEFLEPLLWWYFRTLHNRAHLNLATS-RATLRELER 161

Query: 198 SIVCNV----HGVN-------PKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEE 246
               N+     GV+       P   E+  +   + + G++       Y+GR+   K  E 
Sbjct: 162 QGFQNLELWERGVDVELFRNAPYSEEMRRRLAPEAKPGDRVL----LYVGRLASEKNIER 217

Query: 247 LLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKV-- 304
           +  +L+     +  L + + G+G    +++R     +     + G  H + +   Y+   
Sbjct: 218 MRPVLD----AIPDLHLAIVGDGPHRPELERVFAGTRTH---FTGYLHGEELAQAYRAAD 270

Query: 305 -FLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGF--------- 354
            FL PSTT+ +     EA+A G  +V A+ P      +       DGR G          
Sbjct: 271 AFLFPSTTETLGLVLFEAMAAGLPIVAADSPPTREVLE-------DGRAGLIFDPDSTES 323

Query: 355 VEATLKALAEEPA------QPTDAQTHQLSWESAT-------ERFLQVAELVGDVVT 398
           V AT+  +  + A      Q   A   QL WE  +       ER LQ   +V   VT
Sbjct: 324 VIATVDLVMRDEARREAVRQRGLAIAEQLDWEGPSKQLLGHYERVLQSFSVVAGAVT 380


>gi|227485975|ref|ZP_03916291.1| possible glycosyltransferase [Anaerococcus lactolyticus ATCC 51172]
 gi|227236020|gb|EEI86035.1| possible glycosyltransferase [Anaerococcus lactolyticus ATCC 51172]
          Length = 406

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 42/171 (24%), Positives = 80/171 (46%), Gaps = 25/171 (14%)

Query: 182 CHKVIRLSAATQEYPNSIVCNVHGVNPKFL----------EIGEKKMEQQQNGNKAFTKG 231
           C+++I  +A T+    +++ N +G++P  +          E+ +K + ++  G +   K 
Sbjct: 168 CNRIIVPTAKTE----NLLIN-YGIDPAKIDIIPTGIHIPELYDKTLLRKALGIEEDAKV 222

Query: 232 AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLY--GNGEDFDQIQRAAKKLKLVVRVY 289
             Y+GR+   K  +E++     Y+  L   E+ LY  G G   D ++  A K+   V ++
Sbjct: 223 LLYLGRLGEEKNIQEIME----YYDRLKDSEIKLYIVGGGPYLDTLKEDAAKITKEV-IF 277

Query: 290 PGRDHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCANHPSND 337
            G   A+ +   Y+   +F+  ST++       EALA G I +C N    D
Sbjct: 278 TGMVEANSVNRYYQAADIFVTASTSETQGLTYYEALANGTIALCRNDSVLD 328


>gi|402819480|ref|ZP_10869048.1| hypothetical protein IMCC14465_02820 [alpha proteobacterium
           IMCC14465]
 gi|402511627|gb|EJW21888.1| hypothetical protein IMCC14465_02820 [alpha proteobacterium
           IMCC14465]
          Length = 355

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 48/97 (49%), Gaps = 3/97 (3%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHA 295
           GR +  KG++ L+   +I  K  A + + LYG G     +++ AK L L   ++ G  HA
Sbjct: 178 GRFLEKKGFQTLIDAASILKKTHANIHIYLYGEGPFLPNLKQQAKSLALENVIFMGW-HA 236

Query: 296 D--PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVC 330
           +    F+ Y  + +PS  +    AT EA+  G  V+ 
Sbjct: 237 NLHEEFNKYDAYCSPSLMESFGLATGEAMMHGLPVIA 273


>gi|320103439|ref|YP_004179030.1| group 1 glycosyl transferase [Isosphaera pallida ATCC 43644]
 gi|319750721|gb|ADV62481.1| glycosyl transferase group 1 [Isosphaera pallida ATCC 43644]
          Length = 408

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 48/110 (43%), Gaps = 2/110 (1%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YP 290
           A YIGR+   KG ++LL    I  +      + L G G     ++R    L L   V  P
Sbjct: 220 AAYIGRLAPEKGLDDLLRAWAIVRRSHPQALLTLVGEGPQRPALERLRDDLGLGDAVSLP 279

Query: 291 GRDH-ADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFF 339
           G  H  + +F    +F+ PS  + +  A  EA+A    +V ++ P N   
Sbjct: 280 GATHRPEALFATLDLFVLPSYEEGMSIALLEAMAHAAPIVASDIPGNRLL 329


>gi|375132569|ref|YP_005048977.1| glycosyltransferase protein [Vibrio furnissii NCTC 11218]
 gi|315181744|gb|ADT88657.1| hypothetical glycosyltransferase protein [Vibrio furnissii NCTC
           11218]
          Length = 366

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 42/167 (25%), Positives = 69/167 (41%), Gaps = 14/167 (8%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG--- 291
           +GR V  KG+ +L+   + Y +  +GL M L   G D D     +  LK   R  PG   
Sbjct: 203 VGRFVEEKGFHDLI---DAYRE--SGLSMPLVLVG-DTDHETPYSAALKEKARSTPGVIM 256

Query: 292 -----RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCR 346
                 D    +F   ++F+ PS  + +  A  EA++     + ++ P+N      P C 
Sbjct: 257 TGFLKGDALKAVFSQARLFVMPSYHEGLPIALLEAMSFSLPALVSDIPANTEVALDPTCY 316

Query: 347 TYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELV 393
              G    +   L AL+EE          +  WE   ++ +QV + V
Sbjct: 317 FPVGNVAALGEKLSALSEEARVDYAEYLAKYDWEKIAQQTMQVYQKV 363


>gi|260769207|ref|ZP_05878140.1| alpha-D-GlcNAc alpha-1,2-L-rhamnosyltransferase [Vibrio furnissii
           CIP 102972]
 gi|260614545|gb|EEX39731.1| alpha-D-GlcNAc alpha-1,2-L-rhamnosyltransferase [Vibrio furnissii
           CIP 102972]
          Length = 366

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 42/167 (25%), Positives = 69/167 (41%), Gaps = 14/167 (8%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG--- 291
           +GR V  KG+ +L+   + Y +  +GL M L   G D D     +  LK   R  PG   
Sbjct: 203 VGRFVEEKGFHDLI---DAYRE--SGLSMPLVLVG-DTDHETPYSAALKEKARSTPGVIM 256

Query: 292 -----RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCR 346
                 D    +F   ++F+ PS  + +  A  EA++     + ++ P+N      P C 
Sbjct: 257 TGFLKGDALKAVFSQARLFVMPSYHEGLPIALLEAMSFSLPALVSDIPANTEVALDPTCY 316

Query: 347 TYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAELV 393
              G    +   L AL+EE          +  WE   ++ +QV + V
Sbjct: 317 FPVGNVAALGEKLSALSEEARVDYAEYLAKYDWEKIAQQTMQVYQKV 363


>gi|366089649|ref|ZP_09456015.1| glycosyltransferase [Lactobacillus acidipiscis KCTC 13900]
          Length = 392

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 38/164 (23%), Positives = 78/164 (47%), Gaps = 11/164 (6%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVR-VYPGR- 292
           + R+ + K  + L+       K++ G+++ + G+G   DQ+ +   ++ L  + ++ G  
Sbjct: 209 LSRVAYEKDIDRLIDAFPEIQKQVPGVKLMICGDGPAKDQLSQQVAQMGLTDKIIFTGEI 268

Query: 293 DHAD--PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP-SNDFFKQFPNCRTYD 349
           DH D    +H   +F++ S ++       EA+A G  VV A+ P ++D          ++
Sbjct: 269 DHDDVPAYYHMADLFVSTSISESQGLTFIEAIAAGLKVVAASGPYTDDLLDNLNIGTIFE 328

Query: 350 GRNGFVEATLKALAEEPAQPTDAQTHQ-----LSWESATERFLQ 388
               FV+  +K L  EPA+  D +  +     +S E+  E+ +Q
Sbjct: 329 TPVQFVQEVVKYL-NEPAKFNDQRFREEKIASISAENFGEQIMQ 371


>gi|118602860|ref|YP_904075.1| glycosyl transferase, group 1 [Candidatus Ruthia magnifica str. Cm
           (Calyptogena magnifica)]
 gi|118567799|gb|ABL02604.1| glycosyl transferase, group 1 [Candidatus Ruthia magnifica str. Cm
           (Calyptogena magnifica)]
          Length = 382

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 46/182 (25%), Positives = 78/182 (42%), Gaps = 19/182 (10%)

Query: 199 IVCNVHGVNPKFLEIGEKKMEQQQNG-NKAFTKG-----AYYIGRMVWSKGYEELLGLLN 252
           ++ N      K L + EKK+  + N  N +F++G       Y+GR+  SKG  E   LL 
Sbjct: 176 VLTNFQKEKIKILGVDEKKIILKPNSLNMSFSRGDNKNGYVYVGRLEESKGIYE---LLK 232

Query: 253 IYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTT- 311
           ++ K      + + G+G+D + I+    K  ++ +    R+    I    K  +  S   
Sbjct: 233 VWEKLDEQFVLIIVGSGDDEENIRNQYNKSNIIFKGKCSREKTLQIISSVKYLIQTSVLY 292

Query: 312 DVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGFV--EATLKALAEEPAQP 369
           +       EA+  G  V+  N  +   F Q       DG NGF+  + TLK + E+  + 
Sbjct: 293 ETFGLTMIEAMNFGVPVIGFNVGTRKDFIQ-------DGVNGFICSKITLKDVIEKSFEY 345

Query: 370 TD 371
            D
Sbjct: 346 KD 347


>gi|90961964|ref|YP_535880.1| glycosyltransferase [Lactobacillus salivarius UCC118]
 gi|90821158|gb|ABD99797.1| Glycosyltransferase [Lactobacillus salivarius UCC118]
          Length = 382

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 49/109 (44%), Gaps = 6/109 (5%)

Query: 219 EQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRA 278
           E+    NK F       GRMV  K ++ L+    I+ ++    ++ L G+GED   I+  
Sbjct: 203 EKSDMKNKVFLAA----GRMVHVKAFDNLVEAFKIFAQKNPDWKLLLVGDGEDLPMIRDK 258

Query: 279 AKKLKLVVRVY-PGR-DHADPIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            KK  L  R+  PG+ D     F    V L PS  + +     EAL MG
Sbjct: 259 IKKYGLEQRIITPGKTDDIKKYFLQSSVLLLPSRWEGMPMIGLEALEMG 307


>gi|406831617|ref|ZP_11091211.1| group 1 glycosyl transferase [Schlesneria paludicola DSM 18645]
          Length = 372

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 2/107 (1%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVY-- 289
           A YIGR+   K  E+L+  +    +    L + L G G +  +++  AK++     ++  
Sbjct: 197 AGYIGRLAKQKRVEDLIWAVETLRQIRPQLHLVLVGEGPERTRLEEFAKQIGATNHIHFV 256

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
             RD A      + VF   S+ + +  +  EA++MGK V+ ++ P+N
Sbjct: 257 GHRDDAPRWMSLFDVFCLASSFEGMSNSVMEAMSMGKPVLASDIPAN 303


>gi|260890941|ref|ZP_05902204.1| glycosyl transferase [Leptotrichia hofstadii F0254]
 gi|260859494|gb|EEX73994.1| glycosyl transferase [Leptotrichia hofstadii F0254]
          Length = 379

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 68/136 (50%), Gaps = 10/136 (7%)

Query: 207 NPKFLEIGEKKMEQQQNGNKAFTKGAYY--IGRMVWSKGYEELLGLLNIYHK-ELAGLEM 263
           NP  LE   KK E      + + K  Y+  + R+   K  E L+   NIY+K +  G++ 
Sbjct: 177 NPIDLEAIRKKAENIDKKYENYLKQDYFLQVSRLTEQKQPEHLV---NIYYKLKQRGIKE 233

Query: 264 DLY--GNGEDFDQIQRAAKKLKLV--VRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATA 319
            LY  GNGE  + I++  K+ KL   V +    ++  P F + K+F++    + + T   
Sbjct: 234 KLYFIGNGEKVELIKQKIKEYKLQNDVILLGQIENPYPFFKNAKLFVHTGKYEGLPTVLL 293

Query: 320 EALAMGKIVVCANHPS 335
           E+LA+G  VV  + P+
Sbjct: 294 ESLALGTPVVAYDCPT 309


>gi|384166569|ref|YP_005547948.1| glycogen synthase [Bacillus amyloliquefaciens LL3]
 gi|328914124|gb|AEB65720.1| Glycogen synthase [Bacillus amyloliquefaciens LL3]
          Length = 442

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 50/100 (50%), Gaps = 2/100 (2%)

Query: 235 IGRMVWSKGYEELLGLLN-IYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG-R 292
           I R+   KG+  LL  L+ I    L  +E+ + G+GE    ++  A+KLKL +  + G R
Sbjct: 266 IARLGPRKGHNHLLDALSRIPSDVLDHVEVPIVGDGERRSALEEQARKLKLSMVSFLGKR 325

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           D    I  +  +F+ P+  D +  +  EA+  G  ++  +
Sbjct: 326 DDVAAILEETDIFVLPTINDSLPISIIEAMFSGSAIIATD 365


>gi|417300420|ref|ZP_12087631.1| glycosyl transferase, group 1 [Rhodopirellula baltica WH47]
 gi|327543294|gb|EGF29727.1| glycosyl transferase, group 1 [Rhodopirellula baltica WH47]
          Length = 374

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 31/127 (24%), Positives = 55/127 (43%), Gaps = 2/127 (1%)

Query: 208 PKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYG 267
           PK      K   ++Q+     T     +GR+   KG++ L+  L     +    E+ ++G
Sbjct: 172 PKEFRTSLKPSTERQSDPGQATCHLVAVGRLEREKGFDRLIRSLGDLPDDAPNWELTIHG 231

Query: 268 NGEDFDQIQRAAKKLKLVVRV-YPGRDHAD-PIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            G   + ++  A KL L  RV +PG      P++     F+ PS  +   +A  EA+A  
Sbjct: 232 EGSQRNHLESLASKLHLASRVKFPGWTRPIWPVYQSADWFVLPSRYEGFPSALLEAMACS 291

Query: 326 KIVVCAN 332
             V+  +
Sbjct: 292 TAVLTVD 298


>gi|387813101|ref|YP_005428582.1| glycosyltransferase [Marinobacter hydrocarbonoclasticus ATCC 49840]
 gi|381338112|emb|CCG94159.1| putative glycosyltransferase [Marinobacter hydrocarbonoclasticus
           ATCC 49840]
          Length = 333

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 68/160 (42%), Gaps = 10/160 (6%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGRD 293
           +GR+V +KG+++L+         L      + G+G++ + +Q   ++LKL  RV   G  
Sbjct: 171 VGRLVHAKGFDQLIDAFQSAKGSLL-----IAGDGKEHEALQERIERLKLTSRVKLIGYQ 225

Query: 294 HADP-IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP-SNDFFKQFPNCRTYD-- 349
              P +       +  S  +      AEAL  GK VV  + P +N+       C T D  
Sbjct: 226 EDIPGLMQSVDGIVISSRREGFSYVCAEALLSGKPVVSTDVPVANELLPPQHICPTGDTA 285

Query: 350 GRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQV 389
           G +  +   +  L E  A   D    +LS +S     LQV
Sbjct: 286 GLSSLLNTPIAELVESQAGVRDFAARELSVDSMVANTLQV 325


>gi|332710837|ref|ZP_08430774.1| glycosyltransferase [Moorea producens 3L]
 gi|332350390|gb|EGJ29993.1| glycosyltransferase [Moorea producens 3L]
          Length = 400

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 49/102 (48%), Gaps = 4/102 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VVRVYP-- 290
           IGR +  KG+   +     +H+E    E  L G+G + +++Q+ AK L     VR +   
Sbjct: 223 IGRFLGWKGFALGIRAFAQFHRERPTSEYWLIGDGPEKERLQKLAKDLGCGDAVRFWGKL 282

Query: 291 GRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            RD       +  V ++PS  D   T   EA+A G+ VVC +
Sbjct: 283 SRDQVLQRMAEVDVLMHPSFHDHWPTVVLEAMASGRPVVCLD 324


>gi|153869132|ref|ZP_01998816.1| glycosyl transferase, group 1 family protein [Beggiatoa sp. PS]
 gi|152074327|gb|EDN71193.1| glycosyl transferase, group 1 family protein [Beggiatoa sp. PS]
          Length = 383

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 41/182 (22%), Positives = 83/182 (45%), Gaps = 11/182 (6%)

Query: 154 LEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQE---YPNSIVCNVHGVNPKF 210
           L  +++   +RL+ ++      W  ++  +    +S A Q        +V   +GV+  F
Sbjct: 123 LMTLRKGDKNRLELYVDRLTVPWADKIIFNNKAAISLALQSEGIQEKQVVYIPNGVDTAF 182

Query: 211 LEIGE----KKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLY 266
               +    KK E+    N     G+  +GR+   KG+  LL  L I HK+     + + 
Sbjct: 183 FSTMKIDRLKKREELNISNDILIIGS--VGRLGHEKGFPYLLQSLKIIHKQFPHCVLLII 240

Query: 267 GNGEDFDQIQRAAKKLKLVVRV-YPGRDHADP-IFHDYKVFLNPSTTDVVCTATAEALAM 324
           G G   + ++  A++L +  +V + G+    P +     ++++PS ++ +  A  EA+A+
Sbjct: 241 GAGVLLEFLKSKAEELDISEKVIFLGQRIDVPELLSGMDIYIHPSLSEGMPNAVMEAMAI 300

Query: 325 GK 326
           GK
Sbjct: 301 GK 302


>gi|46578510|ref|YP_009318.1| glycosyl transferase domain-containing protein [Desulfovibrio
           vulgaris str. Hildenborough]
 gi|387151993|ref|YP_005700929.1| group 1 glycosyl transferase [Desulfovibrio vulgaris RCH1]
 gi|46447921|gb|AAS94577.1| conserved domain protein/glycosyl transferase, group 1 family
           protein [Desulfovibrio vulgaris str. Hildenborough]
 gi|311232437|gb|ADP85291.1| glycosyl transferase group 1 [Desulfovibrio vulgaris RCH1]
          Length = 816

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 52/137 (37%), Gaps = 2/137 (1%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   K    L         +  GL + L G+G   D++ R    L ++   Y   D
Sbjct: 647 YVGRLSREKNLHILAEAFRTACAQNPGLALVLVGDGPIRDELARTLTDLPVIFTGYIEGD 706

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH--PSNDFFKQFPNCRTYDGR 351
                +    +F+ PS TD       EA A G  V+  +   P  +       C   +G 
Sbjct: 707 ALAEAYASSDIFVFPSGTDTFGNVVLEAQASGLPVIVTDRGGPRENLLPGRTGCIVPEGE 766

Query: 352 NGFVEATLKALAEEPAQ 368
              + A +  LA +P +
Sbjct: 767 ADALSAAMLDLAADPVR 783


>gi|158521448|ref|YP_001529318.1| group 1 glycosyl transferase [Desulfococcus oleovorans Hxd3]
 gi|158510274|gb|ABW67241.1| glycosyl transferase group 1 [Desulfococcus oleovorans Hxd3]
          Length = 353

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 51/109 (46%), Gaps = 2/109 (1%)

Query: 228 FTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVR 287
           F+     +GR    KG++ L+  L I       L ++L G+G +  +++R  K+L L  +
Sbjct: 170 FSPSMVGMGRFSPEKGFDILIQALGILKNRGVELPLELGGDGPEKPRLERMVKQLNLDTQ 229

Query: 288 V-YPG-RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP 334
           + + G             +F+ PS  +    A  EA++ G+I+V A  P
Sbjct: 230 IRFAGWISDVAAFLRKGSIFVLPSRRESFGIAVLEAMSQGRIIVAAMAP 278


>gi|197123269|ref|YP_002135220.1| group 1 glycosyl transferase [Anaeromyxobacter sp. K]
 gi|196173118|gb|ACG74091.1| glycosyl transferase group 1 [Anaeromyxobacter sp. K]
          Length = 378

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 6/107 (5%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGR- 292
           Y+G M   KG  +LL  + + H     + +   G+G D + ++  A  L   +  +PGR 
Sbjct: 204 YVGAMTEKKGVGDLLEGIRLLHARGRPVRLTAVGSGADLEALRARASALPPGLATFPGRV 263

Query: 293 --DHADPIFHDYKVFLNPST---TDVVCTATAEALAMGKIVVCANHP 334
             D A  +  D  +   P+    T+ +     EALA    VV ++HP
Sbjct: 264 GNDEAFRLMLDSTLVCVPTRPEFTEGMPLTLTEALASRTPVVASDHP 310


>gi|427736945|ref|YP_007056489.1| glycosyltransferase [Rivularia sp. PCC 7116]
 gi|427371986|gb|AFY55942.1| glycosyltransferase [Rivularia sp. PCC 7116]
          Length = 419

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/132 (25%), Positives = 56/132 (42%), Gaps = 13/132 (9%)

Query: 205 GVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMD 264
           GV+P   ++    +  QQ G +       Y+GR+  +KG   LL  L I  ++   + + 
Sbjct: 206 GVDPSLFDV----VSHQQQGTRLL-----YVGRLAGAKGLPILLESLAILKQQHPQIVLT 256

Query: 265 LYGNGEDFDQIQRAAKKLKLVVRV----YPGRDHADPIFHDYKVFLNPSTTDVVCTATAE 320
           + G+G D  +++    KL L   V    Y  +      F +  VF+  S  + +     E
Sbjct: 257 VVGDGSDRKELEDMTFKLGLSQNVEFVGYKSQAEVREYFQNTDVFVMSSFAEGIPVVLME 316

Query: 321 ALAMGKIVVCAN 332
           A+A G  VV   
Sbjct: 317 AMAAGVPVVATQ 328


>gi|366053569|ref|ZP_09451291.1| glycosyltransferase [Lactobacillus suebicus KCTC 3549]
          Length = 480

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 46/195 (23%), Positives = 85/195 (43%), Gaps = 29/195 (14%)

Query: 178 ARVHCHKVIRLSAATQ--------EYPNS----IVCNVHGVNPKFLEIGEKKMEQQQNGN 225
           A  H  +V R+  AT+        ++PN     +   V GV+    EI   +    Q+  
Sbjct: 245 ALTHMDRVDRMITATKLQRDDLLVDFPNGQNQIVAIPVGGVSDHVEEINVDR--DMQSPI 302

Query: 226 KAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKK--LK 283
           K  +     I R+   K  + ++  +   H E   + +D+YG GE+ ++IQ+  ++   +
Sbjct: 303 KFIS-----ISRLASEKHIDLIVQAVAKVHDEGTPVALDIYGLGEEHNKIQKTIEESNAQ 357

Query: 284 LVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFP 343
             +++    DHAD ++ +Y  F++ S ++       EAL     V+  N        +F 
Sbjct: 358 DYIKLMGLTDHADQVYPNYDAFISASFSEGFGLTYIEALNAALPVITFN-------ARFG 410

Query: 344 NCR-TYDGRNGFVEA 357
                +DG NGF+E 
Sbjct: 411 ALELVHDGINGFLET 425


>gi|407800671|ref|ZP_11147518.1| glycosyl transferase, putative [Oceaniovalibus guishaninsula
           JLT2003]
 gi|407057265|gb|EKE43254.1| glycosyl transferase, putative [Oceaniovalibus guishaninsula
           JLT2003]
          Length = 424

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 4/100 (4%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----Y 289
           ++GR+   KG   LL  +   H +  GL + L G+G +   ++R A+ L +   V    Y
Sbjct: 229 FVGRLAAIKGVALLLDAVARLHADRPGLHLTLVGDGAERAGLERRARDLGIADAVTFTGY 288

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV 329
            G+D          +F+ PS  + V     EA+A G  V+
Sbjct: 289 LGQDAVAGRLAQTDIFVLPSFAEGVPVVLMEAMAAGLPVI 328


>gi|380032196|ref|YP_004889187.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum WCFS1]
 gi|342241439|emb|CCC78673.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum WCFS1]
          Length = 502

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 31/130 (23%), Positives = 58/130 (44%), Gaps = 10/130 (7%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG--EDFDQIQRAAKKLKLVVR 287
           K   Y+GR+   +  ++L+  + +  +++  ++ D YG G  E    + +  ++LKL   
Sbjct: 325 KQLLYVGRLAPDRQLDQLIRTIALVKQQVKEIQCDFYGYGDPEYLKTLNKLVEELKLTNN 384

Query: 288 VYPGRDHAD--PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNC 345
           V+    H D    + DY++ LN    D    +  EA++ G  V+         F   P  
Sbjct: 385 VHLLDYHPDLETRYDDYQLLLNTDIVDGGPMSMPEAMSHGIPVISYR------FNYGPKD 438

Query: 346 RTYDGRNGFV 355
              DG +GF+
Sbjct: 439 WIDDGHDGFI 448


>gi|308180243|ref|YP_003924371.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum subsp. plantarum ST-III]
 gi|448820850|ref|YP_007414012.1| Poly(Glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum ZJ316]
 gi|308045734|gb|ADN98277.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum subsp. plantarum ST-III]
 gi|448274347|gb|AGE38866.1| Poly(Glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum ZJ316]
          Length = 502

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/130 (23%), Positives = 58/130 (44%), Gaps = 10/130 (7%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG--EDFDQIQRAAKKLKLVVR 287
           K   Y+GR+   +  ++L+  + +  +++  ++ D YG G  E    + +  ++LKL   
Sbjct: 325 KQLLYVGRLAPDRQLDQLIRTIALVKQQVKEIQCDFYGYGDPEYLKTLNKLVEELKLTNN 384

Query: 288 VYPGRDHAD--PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNC 345
           V+    H D    + DY++ LN    D    +  EA++ G  V+         F   P  
Sbjct: 385 VHLLDYHPDLETRYDDYQLLLNTDIVDGGPMSMPEAMSHGIPVISYR------FNYGPKD 438

Query: 346 RTYDGRNGFV 355
              DG +GF+
Sbjct: 439 WIDDGHDGFI 448


>gi|352096769|ref|ZP_08957525.1| glycosyl transferase group 1 [Synechococcus sp. WH 8016]
 gi|351675991|gb|EHA59149.1| glycosyl transferase group 1 [Synechococcus sp. WH 8016]
          Length = 391

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 61/276 (22%), Positives = 108/276 (39%), Gaps = 32/276 (11%)

Query: 135 GKRWKAKFRFV--VGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAAT 192
           G  W AK + +  +   HT+  +Y++      L+  L E + +     H   V+ L  +T
Sbjct: 108 GGIWLAKSKSIPLIASYHTHLPKYLEHYGMGMLEPLLWELLKA----AHSQAVLNLCTST 163

Query: 193 ---QEYPNSIVCNV----HGVNPKFLEIGEKKMEQQQNGNKAFT-KGAY--YIGRMVWSK 242
              QE  +  + +      GV+ +      +  E +Q    A+  +G+   Y+GR+   K
Sbjct: 164 AMVQELSDKGIQHTALWQRGVDTELFRPELRSQELRQRLLGAYDDRGSLLLYVGRLSAEK 223

Query: 243 GYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDY 302
             E +  +L    + L    + L G+G    Q+++  +        Y   +     +   
Sbjct: 224 QIERIRPVL----EALPDTRLALVGDGPHRQQLEKHFEGTATTFVGYLAGEELAGAYASG 279

Query: 303 KVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTYD------GRNGFV 355
             FL PS+T+ +     EA+A G  VV AN     D      N   Y+      G    +
Sbjct: 280 DAFLFPSSTETLGLVLLEAMAAGCPVVGANRGGIPDIISDGVNGCLYEPDGADAGAGSLI 339

Query: 356 EATLKALAEEPAQPT-----DAQTHQLSWESATERF 386
           EAT K L  +  + T      ++  +  W  ATE+ 
Sbjct: 340 EATRKLLGNDLERQTLRNAARSEAERWGWAGATEQL 375


>gi|421650638|ref|ZP_16091012.1| glycosyltransferase, group 1 family protein [Acinetobacter
           baumannii OIFC0162]
 gi|408509885|gb|EKK11552.1| glycosyltransferase, group 1 family protein [Acinetobacter
           baumannii OIFC0162]
          Length = 347

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 39/162 (24%), Positives = 67/162 (41%), Gaps = 7/162 (4%)

Query: 229 TKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLV-VR 287
           +K    +GR+ + K YE LL    I   +     + +YG GE+ D + +   + KL  V+
Sbjct: 173 SKDIIAVGRLTYQKNYESLLESWEIISNKSPEWNLKIYGTGENLDDLTKIITERKLKNVK 232

Query: 288 VYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV---CANHPSNDFFKQFPN 344
           +     + D I+     F+  S  + +     EA   G  +V   C + PS +      N
Sbjct: 233 LMGAASNIDEIYKKAAFFVMSSRFEGLPMVLIEAQTFGLPIVSYDCPHGPS-EVIDHEEN 291

Query: 345 CRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERF 386
               + +N   EA  KA+     QP     +  S + + +RF
Sbjct: 292 GLLVENQNS--EALAKAIQRLIDQPEVRLEYSKSAKKSAQRF 331


>gi|254556273|ref|YP_003062690.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum JDM1]
 gi|418274861|ref|ZP_12890316.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum subsp. plantarum NC8]
 gi|254045200|gb|ACT61993.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum JDM1]
 gi|376009554|gb|EHS82881.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           plantarum subsp. plantarum NC8]
          Length = 502

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/130 (23%), Positives = 58/130 (44%), Gaps = 10/130 (7%)

Query: 230 KGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG--EDFDQIQRAAKKLKLVVR 287
           K   Y+GR+   +  ++L+  + +  +++  ++ D YG G  E    + +  ++LKL   
Sbjct: 325 KQLLYVGRLAPDRQLDQLIRTIALVKQQVKEIQCDFYGYGAPEYLKTLNKLVEELKLTNN 384

Query: 288 VYPGRDHAD--PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNC 345
           V+    H D    + DY++ LN    D    +  EA++ G  V+         F   P  
Sbjct: 385 VHLLDYHPDLETRYDDYQLLLNTDIVDGGPMSMPEAMSHGIPVISYR------FNYGPKD 438

Query: 346 RTYDGRNGFV 355
              DG +GF+
Sbjct: 439 WIDDGHDGFI 448


>gi|385331656|ref|YP_005885607.1| group 1 glycosyl transferase [Marinobacter adhaerens HP15]
 gi|311694806|gb|ADP97679.1| glycosyl transferase, group 1 [Marinobacter adhaerens HP15]
          Length = 383

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 25/106 (23%), Positives = 51/106 (48%), Gaps = 2/106 (1%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VVRVYPG 291
           ++G+++  KG  +LL      H+   GLE+ L G G    Q++    ++K    V+    
Sbjct: 200 FVGQLIPRKGLLDLLSAFEGLHEHHQGLELQLIGEGRQRPQLEEKCARMKAGQAVKFLGF 259

Query: 292 RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSND 337
           R+    +   +++F+  S+ + +     E++A+G  VV  + P  D
Sbjct: 260 REDRLELLSKFEMFVMTSSLEGIPRCLMESMAVGVPVVAYDIPGVD 305


>gi|262198731|ref|YP_003269940.1| group 1 glycosyl transferase [Haliangium ochraceum DSM 14365]
 gi|262082078|gb|ACY18047.1| glycosyl transferase group 1 [Haliangium ochraceum DSM 14365]
          Length = 810

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 47/195 (24%), Positives = 72/195 (36%), Gaps = 8/195 (4%)

Query: 145 VVGIVHTNYLEYVKREKNDRLQAFLLEFVNSWLAR------VHCHKVIRLSAATQEYPNS 198
           VVG  HT+  EY  R   D + A ++  + SW  R      V    V RL          
Sbjct: 531 VVGQYHTDVPEYTMRLMGDPMLAGVVRIITSWFYRTVDRALVPSQWVARLINDMGVPAER 590

Query: 199 IVCNVHGVNPK-FLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKE 257
           I     G++   F +    +   ++ G     K   Y+GR+   KG   L         E
Sbjct: 591 ITRIPRGIDLDLFRQAARDEHAFEEYGLNGEPK-VLYVGRVSKEKGLSHLAAGFRRLSSE 649

Query: 258 LAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTA 317
           L G  + + G+G   D++       K++       +    ++    VF  PS T+     
Sbjct: 650 LPGARLVVIGDGPYADELATQMPADKVIFTGPVTGEKLARLYASSDVFAFPSETETFGNV 709

Query: 318 TAEALAMGKIVVCAN 332
             EA A G  VV A+
Sbjct: 710 VVEAQATGLPVVVAD 724


>gi|417090450|ref|ZP_11955947.1| hypothetical protein SSUR61_0755 [Streptococcus suis R61]
 gi|353533585|gb|EHC03235.1| hypothetical protein SSUR61_0755 [Streptococcus suis R61]
          Length = 371

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 53/100 (53%), Gaps = 2/100 (2%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQR--AAKKLKLVVRVYPGR 292
           +GR+V  KG+E L+ +++   KE   +E+ L G GE   ++QR   A+++  +V +   +
Sbjct: 203 VGRLVPQKGFERLIRIVSRLAKEGYSIELSLLGTGELHSELQRLIEAEEVDSIVHLLGYK 262

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           ++      +  +F+  S  +   TA  E+L +G  V+  +
Sbjct: 263 ENPYAYVKNADLFVCSSLHEGYSTAVTESLIVGTPVLTTD 302


>gi|421139469|ref|ZP_15599508.1| hypothetical protein MHB_09248 [Pseudomonas fluorescens BBc6R8]
 gi|404509385|gb|EKA23316.1| hypothetical protein MHB_09248 [Pseudomonas fluorescens BBc6R8]
          Length = 129

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/115 (25%), Positives = 48/115 (41%), Gaps = 13/115 (11%)

Query: 215 EKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLY-------- 266
           E+ + + QNG  +F    Y            E +G L+ + K  A  E  L+        
Sbjct: 10  ERLLHEVQNGGASFAPRKYAEEEATKRAAAGEPIGNLDEFKKLAADYEALLFKHGFIESR 69

Query: 267 -----GNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCT 316
                GNGE+F   QR ++ L L+    PG +H   +  +    L+P+T D + +
Sbjct: 70  SEEECGNGENFKLTQRGSQLLSLIDSCIPGDNHPRQVLDEQADALDPATFDELAS 124


>gi|344211493|ref|YP_004795813.1| glycosyltransferase [Haloarcula hispanica ATCC 33960]
 gi|343782848|gb|AEM56825.1| glycosyltransferase [Haloarcula hispanica ATCC 33960]
          Length = 233

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 42/157 (26%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           IG +   KGY+ L+  L+   +E   +++ ++G+G   D++    K L L   V      
Sbjct: 60  IGSLRERKGYDILIEALDQIQQEYPEVQLHVFGDGPQEDELVAQVKHLDLDENV------ 113

Query: 295 ADPIFHDY-------------KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQ 341
               FH Y             + F++PS ++       EA++ G  VV  N         
Sbjct: 114 ---TFHGYVDQSIVREHLSRARAFVHPSRSEGFSMVRLEAMSTGCPVVVTNVSG------ 164

Query: 342 FPNCRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLS 378
             N    DG  GFV  T     EEP    DA T  LS
Sbjct: 165 -ANEMVRDGEEGFVVPT-----EEPEPIADAVTTLLS 195


>gi|386014207|ref|YP_005932484.1| glycosyl transferase [Pseudomonas putida BIRD-1]
 gi|313500913|gb|ADR62279.1| Glycosyl transferase, group 1 [Pseudomonas putida BIRD-1]
          Length = 308

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 4/101 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR+V SKG+E L+           GL++ + G G     +Q+    L L  RV+  R H
Sbjct: 144 VGRLVESKGFEMLIEAFARASARQPGLQLAIIGEGPQHAVLQQRIDALGLAERVHL-RGH 202

Query: 295 ADPIFHDYKVF---LNPSTTDVVCTATAEALAMGKIVVCAN 332
            + +   Y+ F   L PS ++ +     EA+     VVC++
Sbjct: 203 REDLQQLYRAFDWLLVPSRSEGLGLVVQEAVMADVPVVCSD 243


>gi|284162298|ref|YP_003400921.1| group 1 glycosyl transferase [Archaeoglobus profundus DSM 5631]
 gi|284012295|gb|ADB58248.1| glycosyl transferase group 1 [Archaeoglobus profundus DSM 5631]
          Length = 376

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 74/176 (42%), Gaps = 9/176 (5%)

Query: 165 LQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNG 224
           L+ F  + V +  AR+ C  VI +S  T    + I+  V+  N K +  G    + +Q  
Sbjct: 137 LKGFFGKVVENLTARIKCD-VIAVSERTLR--DLILIGVN--NAKLIPNGIDFEKIRQIE 191

Query: 225 NKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL 284
             +      + GR++  K    LL  + I   E+  L   + G+G + D++ + A +L L
Sbjct: 192 PASLESDIIFAGRLIRDKNVNLLLEAVRILKDEIRDLTCLIIGDGPEKDRLCKLALELDL 251

Query: 285 VVRV----YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
              V    +   D    +    KVF+ PST +       EA A G  VV   H  N
Sbjct: 252 DKNVRFLNFLEYDTLISLMKSSKVFVLPSTREGFGVTALEANACGLPVVTVRHRRN 307


>gi|334117027|ref|ZP_08491119.1| glycosyl transferase group 1 [Microcoleus vaginatus FGP-2]
 gi|333461847|gb|EGK90452.1| glycosyl transferase group 1 [Microcoleus vaginatus FGP-2]
          Length = 386

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 33/147 (22%), Positives = 63/147 (42%), Gaps = 15/147 (10%)

Query: 216 KKMEQQQNGNKAFTKGAY-YIGRMVWSKGYEELLGLLNIYHKE-LAGLEMDLYGNGEDFD 273
           K+++Q Q  N    +  + Y+GR+   KG + LL    I   +      + + G G+  +
Sbjct: 191 KRLDQTQQPNLQLKRPIFLYVGRITARKGIKTLLEACAILKNQGYVDYSLLIIGKGDQRE 250

Query: 274 QIQRAAKKLKLVVRV----YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV 329
           +++   KK     +V    +    +    F    +F+ P+  DV      EA+ +GK V+
Sbjct: 251 ELEAFIKKCDFEEQVTWAGWVEYGNLGAYFQQADIFVFPTFEDVWGMVVPEAMVLGKAVL 310

Query: 330 CANHPSNDFFKQFPNCR-TYDGRNGFV 355
           C+N  +        +C     G NG++
Sbjct: 311 CSNGAA--------SCELIMSGENGYI 329


>gi|448587634|ref|ZP_21648939.1| hexosyltransferase, glycosyltransferase [Haloferax elongans ATCC
           BAA-1513]
 gi|445737945|gb|ELZ89473.1| hexosyltransferase, glycosyltransferase [Haloferax elongans ATCC
           BAA-1513]
          Length = 406

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 61/291 (20%), Positives = 119/291 (40%), Gaps = 25/291 (8%)

Query: 128 HLTWF--HHGKRWKAKFRF-VVGIVHTN--YLEYVKREKNDRLQAFLLEFVNSWLARVHC 182
           H++W   + G R K +     +  VH N  + E   +  NDRL+        +W A    
Sbjct: 120 HMSWTSGYVGARLKDELDIPYLLTVHANRDWFERQLQSDNDRLET-------AWSAADRL 172

Query: 183 HKVIRLSAATQE-YPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWS 241
            +V R   +  + Y + +V   +G +    E       +++ G    T   + +G +   
Sbjct: 173 IRVNRRDRSKLDPYNDDVVHVPNGYDTDIFERIPTDEARKRLGLDDDTPVVFALGTLKPR 232

Query: 242 KGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----YPGRDHADP 297
           KG++ L+  +   H    G  + + G G   D+++  A  L +  +     Y   +  + 
Sbjct: 233 KGFQHLMRAMTRVHDSDPGTRLVIGGQGGMRDELESLASDLGIADQTALLGYVESETLND 292

Query: 298 IFHDYKVFLNPSTTDVVCTATAEALAMGK-IVVCANHPSNDFFKQFPNCRTYDG---RNG 353
             +   VF+ PS ++       EA+A G  +V   N  S +           DG    + 
Sbjct: 293 WMNAADVFVLPSYSESFGVVQLEAMACGTPVVATKNGGSEEVVTSEAYGLLVDGPESHDE 352

Query: 354 FVEATLKALAEE-PAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSKS 403
             +A ++AL  +  A   ++  ++ +WE+  E   ++A+L  DV+ +  KS
Sbjct: 353 LADAVVEALHRQWDADAIESYANEFTWENVCE---ELADLYVDVLDESQKS 400


>gi|298715222|emb|CBJ27894.1| UDP-sulfoquinovose: diacylglycerol alpha-sulfoquinovosyltransferase
           SQD2, C-terminal fragment, family GT4 [Ectocarpus
           siliculosus]
          Length = 268

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 79/197 (40%), Gaps = 29/197 (14%)

Query: 202 NVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGL 261
           +V   NPKF     K+M  +            Y+GR+   K   ++ G+L    K     
Sbjct: 9   DVERFNPKF---ACKEMRSRLTDGHPEDPLIIYVGRLGAEKRLRDIKGVLERNPKA---- 61

Query: 262 EMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEA 321
            + L G G D D ++      K V+      +     F    VF+ PS ++ +     E+
Sbjct: 62  RLALVGKGPDSDALKEHFSGTKTVLTGVMSGEALSQAFASADVFVMPSDSETLGFVVLES 121

Query: 322 LAMGKIVVCANHPSNDFFKQFPNCRTYDGRNGF------VEAT---LKALAEEPA----- 367
           +A G  VV AN          P+    DG+ G+      VEA    +KAL E+ A     
Sbjct: 122 MASGVPVVGANAGG------IPDL-IEDGKTGYLVPAGDVEAMSDRVKALLEDKALRGKM 174

Query: 368 -QPTDAQTHQLSWESAT 383
            +    +T + SWE+AT
Sbjct: 175 SKAGREETERWSWEAAT 191


>gi|282162814|ref|YP_003355199.1| putative glycosyltransferase [Methanocella paludicola SANAE]
 gi|282155128|dbj|BAI60216.1| putative glycosyltransferase [Methanocella paludicola SANAE]
          Length = 405

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/126 (23%), Positives = 55/126 (43%), Gaps = 11/126 (8%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL----VVRVY 289
           ++GRM+  KG   L+  + +  ++    ++ L G G   + +++    + L    +   Y
Sbjct: 212 FVGRMITQKGVPYLIDAMPLVLEKHPDTKLLLVGRGNALESLKKKVGAMGLEKSVIFSGY 271

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
              D     +    +F+ PS  +V+  A  EA++  K VVC N   N    +       D
Sbjct: 272 MTEDMLKETYGTCDMFVLPSVWEVLPIAILEAMSSSKPVVCTNAGGNAELVK-------D 324

Query: 350 GRNGFV 355
           G NG+V
Sbjct: 325 GYNGYV 330


>gi|157413792|ref|YP_001484658.1| glycosyltransferase [Prochlorococcus marinus str. MIT 9215]
 gi|157388367|gb|ABV51072.1| Glycosyltransferase [Prochlorococcus marinus str. MIT 9215]
          Length = 354

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 29/126 (23%), Positives = 59/126 (46%), Gaps = 6/126 (4%)

Query: 208 PKFLEIGE--KKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDL 265
           P F++  +  K +    N NK       Y+GR+   K    ++ +L+   KE     +D+
Sbjct: 164 PNFIDENKIAKYVSLSNNTNKKI--NLIYVGRLTKLKSVSSIIKILDRLQKEKKFFHLDI 221

Query: 266 YGNGEDFDQIQRAAKKLK--LVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALA 323
            G+G   ++I++   KL    +V ++   D+  P+     +F+ PS ++ +  +  EAL 
Sbjct: 222 VGDGPLREKIKKEISKLNNPELVTLHGFVDNPYPLLAKSNIFILPSKSEGISRSALEALF 281

Query: 324 MGKIVV 329
           +G   +
Sbjct: 282 LGNFCI 287


>gi|145635445|ref|ZP_01791146.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae PittAA]
 gi|145267319|gb|EDK07322.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae PittAA]
          Length = 353

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 38/174 (21%), Positives = 75/174 (43%), Gaps = 22/174 (12%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAATQEY------PNSIV--CNVHGVNPKFLEIGEKKMEQ 220
           L+  +  +LA   C K++ L+ A +         N+I+   N + + PK       K+ +
Sbjct: 125 LVRTIGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIISIANPNTILPK------NKLAK 178

Query: 221 QQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAK 280
            +N      K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK
Sbjct: 179 WEN------KTVLSVGHLFSYKGFDYLLKVWQVLAKKYPDWNLKIVGSGEEEENLKNLAK 232

Query: 281 KLKL--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            L +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 233 ALDIEDSVNFTPRTNDVSFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|421615265|ref|ZP_16056297.1| glycosyl transferase, group 1 [Rhodopirellula baltica SH28]
 gi|408494025|gb|EKJ98651.1| glycosyl transferase, group 1 [Rhodopirellula baltica SH28]
          Length = 374

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 57/123 (46%), Gaps = 8/123 (6%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGRD 293
           +GR+   KG++ L+  L+    +    E+ +YG G     ++  A+KL L  RV  PG  
Sbjct: 199 VGRLEHEKGFDRLIRCLSELPNDAPNWELTIYGEGSQRGDLESLARKLNLAPRVKLPGWT 258

Query: 294 HAD-PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN 352
               P++     F+ PS  +   +A  EA+A    V+  +  +    +Q  +     GRN
Sbjct: 259 RPIWPVYRSADWFVLPSRYEGFPSALLEAMACSTAVLTVD--AGGAVRQIID----HGRN 312

Query: 353 GFV 355
           G++
Sbjct: 313 GWL 315


>gi|256395524|ref|YP_003117088.1| group 1 glycosyl transferase [Catenulispora acidiphila DSM 44928]
 gi|256361750|gb|ACU75247.1| glycosyl transferase group 1 [Catenulispora acidiphila DSM 44928]
          Length = 391

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 60/134 (44%), Gaps = 15/134 (11%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           ++GR+   K    LL  L +  + +    + L G+G+    +Q  A +L L  RV + GR
Sbjct: 224 FVGRLDIQKNVARLLDALALVAEPV---RLRLVGDGDLRQDLQAQAARLGLDDRVEFAGR 280

Query: 293 DHADPIFHDY---KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
            H + +  +Y   ++F+ PS  + +     EA+A G  V+    P         N  T +
Sbjct: 281 KHGEDLVKEYADAELFVLPSDREGMALVVLEAMAAGLPVLATAVPG--------NIETVE 332

Query: 350 GRNGFVEATLKALA 363
           G    V  T +A+A
Sbjct: 333 GVGVLVAPTPQAMA 346


>gi|381398653|ref|ZP_09924056.1| glycosyl transferase group 1 [Microbacterium laevaniformans OR221]
 gi|380774144|gb|EIC07445.1| glycosyl transferase group 1 [Microbacterium laevaniformans OR221]
          Length = 403

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 11/126 (8%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           ++GR+   K  E  L  +     EL  +  D+ G G+    +++ A++L +  RV + GR
Sbjct: 227 FVGRLTTEKHVEVTLAAIAKLAPELPDIAFDIVGGGDQRRALEQTAQRLGIQDRVTFYGR 286

Query: 293 DHADPIFHDY---KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
              + +   Y    VF   S  ++   AT EA+A G  +V A+  +       P+   +D
Sbjct: 287 VEEEDLRAAYTRADVFAIASIAELQSIATMEAMASGLPIVAADAVA------LPHL-VHD 339

Query: 350 GRNGFV 355
           G NG++
Sbjct: 340 GENGYL 345


>gi|378551019|ref|ZP_09826235.1| hypothetical protein CCH26_13059 [Citricoccus sp. CH26A]
          Length = 400

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 31/122 (25%), Positives = 48/122 (39%), Gaps = 7/122 (5%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GRM   K  + LL  +    + + G  + + G+G   +Q++R       V   Y    
Sbjct: 205 YVGRMSREKDLDALLEPMRRLRQRVPGARLAMVGSGPHVEQLRRHFDPAWTVFTGYMSGP 264

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNG 353
                +    VF  PSTT+ +     E++A G  VV A      F          D R G
Sbjct: 265 ELSQAYASADVFAFPSTTETLGLVALESMASGVPVVGARAGGIPFVID-------DARTG 317

Query: 354 FV 355
           F+
Sbjct: 318 FL 319


>gi|78186192|ref|YP_374235.1| glycosyl transferase [Chlorobium luteolum DSM 273]
 gi|78166094|gb|ABB23192.1| glycosyl transferase [Chlorobium luteolum DSM 273]
          Length = 372

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 39/194 (20%), Positives = 81/194 (41%), Gaps = 9/194 (4%)

Query: 145 VVGIVHTNYLEYVKREK----NDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIV 200
           V    HT++  Y+   +       L  +L  F NS    +  ++++R +  +    N  +
Sbjct: 111 VASAYHTDFPSYLNYYRLGFAEGALWRYLAWFYNSCETVLAPNEIVRRNLLSHGIRNVGI 170

Query: 201 CNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAG 260
            +  G++ +    G +    +++ N    K   + GR VW K    ++ L   + +E   
Sbjct: 171 WS-RGIDRELFHPGRRSETLRRSWNADGRKVLVFAGRFVWYKDIRIVMDLYRRFQEEGKA 229

Query: 261 --LEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTAT 318
             +   + G+G + D ++RA  +   V   Y         +    +FL PSTT+  C  +
Sbjct: 230 DRVRFVMIGSGPEEDALRRAMPEA--VFTGYLTGTSLPEAYASGDIFLFPSTTEAFCNVS 287

Query: 319 AEALAMGKIVVCAN 332
            EA++ G   + ++
Sbjct: 288 LEAISCGLPAIVSD 301


>gi|148550012|ref|YP_001270114.1| group 1 glycosyl transferase [Pseudomonas putida F1]
 gi|395445778|ref|YP_006386031.1| group 1 glycosyl transferase [Pseudomonas putida ND6]
 gi|148514070|gb|ABQ80930.1| glycosyl transferase, group 1 [Pseudomonas putida F1]
 gi|388559775|gb|AFK68916.1| group 1 glycosyl transferase [Pseudomonas putida ND6]
          Length = 363

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 4/101 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR+V SKG+E L+           GL++ + G G     +Q+    L L  RV+  R H
Sbjct: 199 VGRLVESKGFEMLIEAFARASARQPGLQLAIIGEGPQHAVLQQRIDALGLAERVHL-RGH 257

Query: 295 ADPIFHDYKVF---LNPSTTDVVCTATAEALAMGKIVVCAN 332
            + +   Y+ F   L PS ++ +     EA+     VVC++
Sbjct: 258 REDLQQLYRAFDWLLVPSRSEGLGLVVQEAVMADVPVVCSD 298


>gi|431797950|ref|YP_007224854.1| glycosyltransferase [Echinicola vietnamensis DSM 17526]
 gi|430788715|gb|AGA78844.1| glycosyltransferase [Echinicola vietnamensis DSM 17526]
          Length = 377

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 50/104 (48%), Gaps = 2/104 (1%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR- 292
           IGR++ +KG+ +LL    +  +E +   M ++G G     ++   ++L L   V  PG  
Sbjct: 201 IGRLLNTKGWLDLLEAFALVRRERSDCTMTIFGEGALRKSMESRIEELGLTQSVSLPGNV 260

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
                  +DY  F+ PS  +    A  EA+ +G  ++ ++ P N
Sbjct: 261 PQVQERLYDYDCFVFPSWYEGFSGALIEAMMVGIPIIASDIPMN 304


>gi|26991616|ref|NP_747041.1| glycosyl transferase family protein [Pseudomonas putida KT2440]
 gi|24986708|gb|AAN70505.1|AE016692_4 glycosyl transferase, putative [Pseudomonas putida KT2440]
          Length = 379

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 4/101 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR+V SKG+E L+           GL++ + G G     +Q+    L L  RV+  R H
Sbjct: 215 VGRLVESKGFEMLIEAFARASARQPGLQLAIIGEGPQHAVLQQRIDALGLAERVHL-RGH 273

Query: 295 ADPIFHDYKVF---LNPSTTDVVCTATAEALAMGKIVVCAN 332
            + +   Y+ F   L PS ++ +     EA+     VVC++
Sbjct: 274 REDLQQLYRAFDWLLVPSRSEGLGLVVQEAVMADVPVVCSD 314


>gi|397693941|ref|YP_006531822.1| glycosyl transferase [Pseudomonas putida DOT-T1E]
 gi|397330671|gb|AFO47030.1| glycosyl transferase, putative [Pseudomonas putida DOT-T1E]
          Length = 363

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 4/101 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR+V SKG+E L+           GL++ + G G     +Q+    L L  RV+  R H
Sbjct: 199 VGRLVESKGFEMLIEAFARASARQPGLQLAIIGEGPQHAVLQQRIDALGLAERVHL-RGH 257

Query: 295 ADPIFHDYKVF---LNPSTTDVVCTATAEALAMGKIVVCAN 332
            + +   Y+ F   L PS ++ +     EA+     VVC++
Sbjct: 258 REDLQQLYRAFDWLLVPSRSEGLGLVVQEAVMADVPVVCSD 298


>gi|421523678|ref|ZP_15970307.1| glycosyl transferase family protein [Pseudomonas putida LS46]
 gi|402752664|gb|EJX13169.1| glycosyl transferase family protein [Pseudomonas putida LS46]
          Length = 363

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 4/101 (3%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRDH 294
           +GR+V SKG+E L+           GL++ + G G     +Q+    L L  RV+  R H
Sbjct: 199 VGRLVESKGFEMLIEAFARASARQPGLQLAIIGEGPQHAVLQQRIDALGLAERVHL-RGH 257

Query: 295 ADPIFHDYKVF---LNPSTTDVVCTATAEALAMGKIVVCAN 332
            + +   Y+ F   L PS ++ +     EA+     VVC++
Sbjct: 258 REDLQQLYRAFDWLLVPSRSEGLGLVVQEAVMADVPVVCSD 298


>gi|120603907|ref|YP_968307.1| group 1 glycosyl transferase [Desulfovibrio vulgaris DP4]
 gi|120564136|gb|ABM29880.1| glycosyl transferase, group 1 [Desulfovibrio vulgaris DP4]
          Length = 816

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 51/137 (37%), Gaps = 2/137 (1%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   K    L         +  GL + L G+G   D++ R    L  +   Y   D
Sbjct: 647 YVGRLSREKNLHILAEAFRTACAQNPGLALVLVGDGPIRDELARTLADLPAIFTGYIEGD 706

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANH--PSNDFFKQFPNCRTYDGR 351
                +    +F+ PS TD       EA A G  V+  +   P  +       C   +G 
Sbjct: 707 ALAEAYASSDIFVFPSGTDTFGNVVLEAQASGLPVIVTDRGGPRENLLPGRTGCIVPEGE 766

Query: 352 NGFVEATLKALAEEPAQ 368
              + A +  LA +P +
Sbjct: 767 ADALSAAMLDLAADPVR 783


>gi|366053438|ref|ZP_09451160.1| poly(glycerol-phosphate) alpha-glucosyltransferase [Lactobacillus
           suebicus KCTC 3549]
          Length = 502

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 38/168 (22%), Positives = 72/168 (42%), Gaps = 23/168 (13%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDL--YGNGEDFDQIQRAAKKLKL----VVR 287
           ++GR+ W K  +++L +    H +++   + L  YG+G D ++ ++    LKL    +V 
Sbjct: 331 HVGRLGWDKQIDQVLQVFKQIHSQVSDATLKLYGYGDGADVEKFKQQVTDLKLDEPGLVS 390

Query: 288 VYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRT 347
               R   D ++   +++L+ +  D    A  EAL  G  V+  +      +   P    
Sbjct: 391 FEGYRPDIDNVYDQAQLYLDCARIDGQPLADVEALGHGVPVLAYD------YNYGPRELI 444

Query: 348 YDGRNGFV-----------EATLKALAEEPAQPTDAQTHQLSWESATE 384
            DG+NG +           EA     ++   Q    Q ++LS  S+ E
Sbjct: 445 EDGKNGKLIKLNDVATMASEAVKLLKSKRKLQDASDQAYELSARSSAE 492


>gi|184201273|ref|YP_001855480.1| putative glycosyltransferase [Kocuria rhizophila DC2201]
 gi|183581503|dbj|BAG29974.1| putative glycosyltransferase [Kocuria rhizophila DC2201]
          Length = 384

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 75/173 (43%), Gaps = 21/173 (12%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           + GR+   K   EL+  L +  +E   + ++L G GE  D++ R A +  +  RV + G 
Sbjct: 211 FCGRLAVEKNVNELIEALALIPRE-KNVHVELAGEGEQRDRLLRLAHERGVADRVEFLGF 269

Query: 293 DHADPIFHDY---KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
              + +   Y    VF  P T ++    T EA++    VV AN       +  P+    +
Sbjct: 270 LTDEDLRRAYLRADVFCQPGTAELQSLVTLEAMSASTPVVLAN------ARALPHL-ADE 322

Query: 350 GRNGFVEATLKALAEEPAQPTD-AQTHQLSWESATERFLQVAELVGDVVTKRS 401
           GRNG++          P  P D A   QL  ++  ER   + E   ++V + S
Sbjct: 323 GRNGYLFT--------PGDPADLAAKLQLVLDADPERRRAMGERSHEMVAQHS 367


>gi|297617020|ref|YP_003702179.1| group 1 glycosyl transferase [Syntrophothermus lipocalidus DSM
           12680]
 gi|297144857|gb|ADI01614.1| glycosyl transferase group 1 [Syntrophothermus lipocalidus DSM
           12680]
          Length = 402

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 6/105 (5%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVR---VYP 290
           ++GR+   K  E LL   ++  +++    + L   G    +++R A  L L +    V+ 
Sbjct: 209 FVGRLTKEKNLEFLLEAFSMVKRQVPPAVLVLVAGGPMESELKRLATSLGLELERDVVFT 268

Query: 291 GR---DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           GR   D    ++ D  VF  PS T+      AEA+A G  VV  N
Sbjct: 269 GRLPFDELVDVYFDSTVFTFPSVTETQGLVLAEAMATGLPVVAVN 313


>gi|325577345|ref|ZP_08147793.1| pilin glycosyltransferase [Haemophilus parainfluenzae ATCC 33392]
 gi|325160669|gb|EGC72791.1| pilin glycosyltransferase [Haemophilus parainfluenzae ATCC 33392]
          Length = 386

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 16/129 (12%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----Y 289
           ++GR++  KG  E L    +  KE    +  + G  +  +    + +KL+  +      Y
Sbjct: 214 FLGRLLREKGIFEFLEAAKVVKKEKTNTQFLVLGQIDKQNPTAMSEEKLQEYIDANIIEY 273

Query: 290 PGRDHADPIF-HDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCR-- 346
            G     P +     VF+ PS  + V  +T EA+AMGK+++  +          P CR  
Sbjct: 274 LGYVKNVPDYIEKVDVFVLPSYREGVPKSTQEAMAMGKVILTTD---------VPGCRET 324

Query: 347 TYDGRNGFV 355
             DG NGF+
Sbjct: 325 VVDGVNGFL 333


>gi|291514375|emb|CBK63585.1| Glycosyltransferase [Alistipes shahii WAL 8301]
          Length = 435

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 63/297 (21%), Positives = 122/297 (41%), Gaps = 34/297 (11%)

Query: 44  VIPWLSLIHQKQVYPGNITFASPKEQEAYVRWWLEDRTGFTSTFDT---RF-YPGKFAAD 99
           ++  +S IH   +    + + SP+E E+Y   +++      ST D    R+ + GK+ A+
Sbjct: 70  ILEKMSFIH---IDSNMVPYISPEEYESYHEQYVKSGRKTWSTTDAWKQRYTFSGKYGAN 126

Query: 100 KKSILAVGDIT--EIIPDEEADIAVLEEPEHLTWFH--HGKRWKAKFRFVVGIVHTNYLE 155
               +A   +   ++  D E    V+   + LT++     KR   K      +VH +  E
Sbjct: 127 LMEEVARYAVVAAQVARDLEGQFDVIHAHDWLTYYAGIAAKRVSGKPL----VVHMHATE 182

Query: 156 YVKREKNDRLQAFLLEFVNSWLARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGE 215
           Y +  +N   Q + +E       R   H   R+ A +    N IV N +GV  + +    
Sbjct: 183 YDRSGENVNTQVYAIE-------RAGMHAADRVIAVSNLTRN-IVINRYGVPAEKIVTVH 234

Query: 216 KKMEQQQNGNKAFTKGAY-----YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGE 270
             +   QN  K   +G       ++GR+ + KG +  +       K +  +   + G+G+
Sbjct: 235 NAVRFAQNSGKVPERGVTDKVVTFLGRITYQKGPDYFVEAAAKVLKRVPDVRFVMAGSGD 294

Query: 271 DFDQIQRAAKKLKLVVRVY-----PGRDHADPIFHDYKVFLNPSTTDVVCTATAEAL 322
             + + R   +L +  R +      G D    +F    V++ PS ++    +  EA+
Sbjct: 295 MMNHVIRRVARLGIADRFHFTGFLRGED-VHKMFQLSDVYVMPSVSEPFGISPLEAM 350


>gi|117925583|ref|YP_866200.1| group 1 glycosyl transferase [Magnetococcus marinus MC-1]
 gi|117609339|gb|ABK44794.1| glycosyl transferase, group 1 [Magnetococcus marinus MC-1]
          Length = 367

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 58/270 (21%), Positives = 103/270 (38%), Gaps = 42/270 (15%)

Query: 91  FYPGKFAADKKSILAVGDITEIIPDEEADIAVLE--EPEHLTWFHHGKRWKAKFRFV--- 145
            +P +F  +KK     GD  E+I D+     ++E  +PEH+     G    A  R+    
Sbjct: 63  IHPRRF--EKKWFTLKGDNVEMIRDQGLTARLIESFQPEHIHIATEGPIGMAACRYCRNH 120

Query: 146 ----VGIVHTNYLEYVKREKNDRLQA-FLLEFVNSWLARVHCHKVIRLSAAT----QEYP 196
                   HT + EY +R     + + ++   +  +  R  C        A     +++ 
Sbjct: 121 HLAFTTTYHTQWPEYAQRRSRGLISSNWVYPLMRRFHNRAACIMATTPPMADMLKEKQFN 180

Query: 197 NSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHK 256
             I   + GV+ +         E+ +    A      Y+GR+   K  +  +      H 
Sbjct: 181 TRIALWMRGVDTQLFR-PRPATEKPEILRHAKGPIYLYVGRLAREKNLDPFI------HA 233

Query: 257 ELAGLEMDLYGNGEDFDQIQRAA--------KKLKLVVRVYPGRDHADPIFHDYKVFLNP 308
           ++AG ++ L G G D +++QR          ++ + + R Y   D          VF+ P
Sbjct: 234 DVAGTKV-LVGEGPDQERLQRLGGNTLFAGLQRGEALARFYAAAD----------VFVFP 282

Query: 309 STTDVVCTATAEALAMGKIVVCANHPSNDF 338
           S TD       EALA G  +   +H +  F
Sbjct: 283 SLTDTFGKVLIEALASGTPIAAFHHLAPAF 312


>gi|358446384|ref|ZP_09156930.1| putative group 1 glycosyl transferase [Corynebacterium casei UCMA
           3821]
 gi|356607558|emb|CCE55254.1| putative group 1 glycosyl transferase [Corynebacterium casei UCMA
           3821]
          Length = 374

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 42/99 (42%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           YIGR+   K  E L  ++ I  K + G  + + G G   +++Q++          Y   +
Sbjct: 204 YIGRISREKDLERLNNVMQIVRKHVPGTRLGIVGGGPYLEELQKSFDDSFTTFTGYLSGE 263

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
                F    VF+ PS T+ +     E+ A G  VV  N
Sbjct: 264 DLAAAFASGDVFVFPSKTETLGLVALESFASGVPVVGTN 302


>gi|421745688|ref|ZP_16183531.1| glycosyltransferase [Cupriavidus necator HPC(L)]
 gi|409775782|gb|EKN57232.1| glycosyltransferase [Cupriavidus necator HPC(L)]
          Length = 364

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 2/106 (1%)

Query: 229 TKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VV 286
           +K A  +GR+  +KG++ LL    I  +E     +D+ G+GE  D +     +L L   V
Sbjct: 175 SKIALAVGRLTHAKGFDVLLHAWKIVAREHPDWRLDIVGDGELHDALMTLRNELGLSASV 234

Query: 287 RVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           R+ P R     ++    +F   S  +       EA+A G  VV  +
Sbjct: 235 RLLPARRDLSDLYRGASLFCLSSRYEGFGLVLIEAMAHGLPVVATD 280


>gi|16273585|ref|NP_439840.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae Rd
           KW20]
 gi|260581039|ref|ZP_05848861.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae
           RdAW]
 gi|12230662|sp|O05083.1|Y1698_HAEIN RecName: Full=Uncharacterized glycosyltransferase HI_1698
 gi|1574551|gb|AAC23344.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae Rd
           KW20]
 gi|260092279|gb|EEW76220.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae
           RdAW]
          Length = 353

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 36/171 (21%), Positives = 77/171 (45%), Gaps = 16/171 (9%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAA-----TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQN 223
           L+  +  +LA   C K++ L+ A      +++  + + ++   NP  L + + K+ + +N
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIISI--ANPNTL-LPKNKLAKLEN 181

Query: 224 GNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLK 283
                 K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK L 
Sbjct: 182 ------KTILSVGHLFSYKGFDYLLKVWQVLAKKYPDWNLKIVGSGEEEENLKNLAKALD 235

Query: 284 L--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 236 IEDSVNFIPRTNDVSFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|154252361|ref|YP_001413185.1| group 1 glycosyl transferase [Parvibaculum lavamentivorans DS-1]
 gi|154156311|gb|ABS63528.1| glycosyl transferase group 1 [Parvibaculum lavamentivorans DS-1]
          Length = 378

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 47/112 (41%), Gaps = 8/112 (7%)

Query: 229 TKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV 288
           ++ A ++GR+   K   EL         + A L +D YG G +  Q++ A          
Sbjct: 198 SRRALFLGRLAPDKNAPEL-ARAACELADGASLTVDFYGRGRELPQLEAALAAAGNPAVT 256

Query: 289 YPG-RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV------CANH 333
           Y G  DH   I  DY   L PS  + +  A  EA+A G + V      C +H
Sbjct: 257 YRGFVDHPGTILRDYGYLLLPSNAEGLSNAMLEAMAHGVVPVATRVSGCVDH 308


>gi|319643012|ref|ZP_07997646.1| glycosyltransferase family 4 [Bacteroides sp. 3_1_40A]
 gi|345521494|ref|ZP_08800818.1| glycosyltransferase family 4 [Bacteroides sp. 4_3_47FAA]
 gi|254834420|gb|EET14729.1| glycosyltransferase family 4 [Bacteroides sp. 4_3_47FAA]
 gi|317385377|gb|EFV66322.1| glycosyltransferase family 4 [Bacteroides sp. 3_1_40A]
          Length = 381

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 2/101 (1%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAK--KLKLVVRVYPGRD 293
           GR+ W KG++ L+   NI  K      +D++G G   D + R  K  KL+  + +YP   
Sbjct: 210 GRLTWQKGFDRLVDAWNIVQKHHPDWILDIFGEGFYKDSLTRQIKDRKLEYSITIYPFTQ 269

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP 334
           +    + +  +    S  +       EA+++G   V  + P
Sbjct: 270 NITQEYLNSSILALSSNYEGFGLVLIEAMSLGVPCVSFDCP 310


>gi|152975172|ref|YP_001374689.1| group 1 glycosyl transferase [Bacillus cytotoxicus NVH 391-98]
 gi|152023924|gb|ABS21694.1| glycosyl transferase group 1 [Bacillus cytotoxicus NVH 391-98]
          Length = 689

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 2/98 (2%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV--YPGRD 293
           G +   KGY+ LL  L    KE    E+ ++G+G + D++++  K  +L V+   +  R+
Sbjct: 203 GNLYHLKGYDILLQALIALKKENLLYELTMFGDGSERDKLEQIVKMHQLPVKFKGHVPRE 262

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA 331
                   + VF+ PS  +    +  EA+A G  ++C+
Sbjct: 263 VLQEELPKFDVFVQPSRLENFPFSVIEAMASGCAIICS 300


>gi|410454129|ref|ZP_11308071.1| group 1 glycosyl transferase [Bacillus bataviensis LMG 21833]
 gi|409932440|gb|EKN69401.1| group 1 glycosyl transferase [Bacillus bataviensis LMG 21833]
          Length = 398

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 8/117 (6%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLY--GNGEDFDQIQRAAKKLKLVVRVYPGR 292
           + R+V  KG    L   NI  +E  G E+  Y  GNG  ++ +++  K++ +        
Sbjct: 234 VARLVDEKGVFFALEACNILIEE--GFEISWYLIGNGPLYEALKKKTKEIGIERNFILLG 291

Query: 293 DHADP--IFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRT 347
           + A+P        +++ PS T+  C A  EA+A+ K +V  + PS  F  Q  N +T
Sbjct: 292 EQANPYPFMSLGDIYVQPSKTEAHCVAVEEAIALKKPIVVTDIPS--FNHQIQNEKT 346


>gi|258543628|ref|YP_003189061.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-01]
 gi|384043546|ref|YP_005482290.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-12]
 gi|384052063|ref|YP_005479126.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-03]
 gi|384055172|ref|YP_005488266.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-07]
 gi|384058405|ref|YP_005491072.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-22]
 gi|384061046|ref|YP_005500174.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-26]
 gi|384064338|ref|YP_005484980.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-32]
 gi|384120351|ref|YP_005502975.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-01-42C]
 gi|256634706|dbj|BAI00682.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-01]
 gi|256637762|dbj|BAI03731.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-03]
 gi|256640816|dbj|BAI06778.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-07]
 gi|256643871|dbj|BAI09826.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-22]
 gi|256646926|dbj|BAI12874.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-26]
 gi|256649979|dbj|BAI15920.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-32]
 gi|256652969|dbj|BAI18903.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-01-42C]
 gi|256656023|dbj|BAI21950.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus IFO 3283-12]
          Length = 363

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 49/104 (47%), Gaps = 3/104 (2%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIY--HKELAGLEMDLYGNGEDFDQIQRAAKKLKLV-VRV 288
           A  +GR+  +KGY+ L   L     +  L   E+ + G G++   +++  K+LKL  +R 
Sbjct: 186 AGSLGRLHIAKGYDILCTALKRLQAYTSLPAFEVTVAGEGQERAHLEQMVKELKLTNIRF 245

Query: 289 YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
                 +    H   ++L PS  +  C A  EA+  G  V+ ++
Sbjct: 246 SGFASSSQDFLHTQHLYLQPSRREGFCIAMHEAMQAGLPVIGSD 289


>gi|448545187|ref|ZP_21625855.1| glycosyltransferase, family protein 1 [Haloferax sp. ATCC BAA-646]
 gi|448547522|ref|ZP_21626943.1| glycosyltransferase, family protein 1 [Haloferax sp. ATCC BAA-645]
 gi|448556446|ref|ZP_21632057.1| glycosyltransferase, family protein 1 [Haloferax sp. ATCC BAA-644]
 gi|445704383|gb|ELZ56299.1| glycosyltransferase, family protein 1 [Haloferax sp. ATCC BAA-646]
 gi|445715892|gb|ELZ67644.1| glycosyltransferase, family protein 1 [Haloferax sp. ATCC BAA-645]
 gi|445716474|gb|ELZ68216.1| glycosyltransferase, family protein 1 [Haloferax sp. ATCC BAA-644]
          Length = 223

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 39/160 (24%), Positives = 68/160 (42%), Gaps = 16/160 (10%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLV--VRVYPG 291
           + GR++  K    LL   +    +   + + + G+G +FD+++R    L     VR+   
Sbjct: 51  FAGRLIADKNVSILLDAFDRVADDYDDVTLGVVGDGPEFDRLERQTSALDHADRVRMLGF 110

Query: 292 RDHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNC--- 345
            D  + +    +   VF +PST +      AEA+A    V+   HP +   +   +    
Sbjct: 111 LDEYEDVLGQMRAADVFASPSTREGFGITYAEAMAADCTVIGVQHPESAASEVIGDAGYL 170

Query: 346 --RTYDGRNGFVEATLKALAEE--PAQPTDAQTHQLSWES 381
              T D      E+  +ALA E  P +PT  +  Q  W+S
Sbjct: 171 AEPTVD---SVAESVARALAGETPPTEPT-KRADQYDWDS 206


>gi|387594448|gb|EIJ89472.1| hypothetical protein NEQG_00242 [Nematocida parisii ERTm3]
          Length = 397

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 62/150 (41%), Gaps = 9/150 (6%)

Query: 189 SAATQEYPNSIVCNVH--GVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEE 246
           +A T+E+    + N H    NP    +   +       N+        + R+   KG   
Sbjct: 174 NAVTEEFKPKTITNTHVTQTNPGITRVLNIRQSNGWTNNEIIIA---VVSRLTTRKGSVL 230

Query: 247 LLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGRDHADPI---FHDY 302
           L  +L    K    + + + G+GE  + +++  +K KL  +V + G  H   I    +  
Sbjct: 231 LSDILPSIFKINPLIRLIIAGDGEKKELLEQTVEKYKLKDKVKFLGGIHPSVINLVLNQS 290

Query: 303 KVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            +FLN S TD  C +  EA A G  VV  N
Sbjct: 291 NLFLNTSLTDAFCISIIEAAACGLYVVSTN 320


>gi|409405042|ref|ZP_11253515.1| glycosyltransferase [Herbaspirillum sp. GW103]
 gi|386435809|gb|EIJ48633.1| glycosyltransferase [Herbaspirillum sp. GW103]
          Length = 349

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 67/155 (43%), Gaps = 12/155 (7%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   K  E  L L      +L G +  + G+G   + +     +++ +     G D
Sbjct: 184 YVGRVAVEKNIEAFLKL------DLPGTKW-VVGDGPQLEDLTARYPEVRFLGA--KGHD 234

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNG 353
                ++   VF+ PS TD       EA+A G  +  A +P         N  +   R+ 
Sbjct: 235 ELPAYYNCADVFVFPSKTDTFGLVLLEAMACG--IPVAAYPVEGPIDVVDNGVSGVLRHD 292

Query: 354 FVEATLKALAEEPAQPTDAQTHQLSWESATERFLQ 388
             +A L+AL  +  +   A     SWESAT++FLQ
Sbjct: 293 LHQACLQALKLD-HEAVRAHAATRSWESATQQFLQ 326


>gi|145633632|ref|ZP_01789359.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 3655]
 gi|229845233|ref|ZP_04465366.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 6P18H1]
 gi|229847319|ref|ZP_04467421.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 7P49H1]
 gi|144985509|gb|EDJ92325.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 3655]
 gi|229809744|gb|EEP45468.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 7P49H1]
 gi|229811828|gb|EEP47524.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 6P18H1]
          Length = 353

 Score = 39.3 bits (90), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 38/172 (22%), Positives = 75/172 (43%), Gaps = 18/172 (10%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAATQEY------PNSIVCNVHGVNPKFLEIGEKKMEQQQ 222
           L+  +  +LA   C K++ L+ A +         N+I+      NP  L + + K+ + +
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIIT---IANPNTL-LPKNKLAKWE 180

Query: 223 NGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL 282
           N      K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK L
Sbjct: 181 N------KTILSVGHLFSYKGFDYLLKVWQVLAKKYPDWNLKIVGSGEEEENLKNLAKAL 234

Query: 283 KL--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 235 DIEDSVNFTPRTNDVSFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|423333106|ref|ZP_17310887.1| hypothetical protein HMPREF1075_02538 [Parabacteroides distasonis
           CL03T12C09]
 gi|409228588|gb|EKN21477.1| hypothetical protein HMPREF1075_02538 [Parabacteroides distasonis
           CL03T12C09]
          Length = 362

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 25/107 (23%), Positives = 48/107 (44%), Gaps = 5/107 (4%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL--VVRVYPGR 292
           +GR+V+ KG++ L+ +    H +     +++YG+G    ++Q    K  L    +     
Sbjct: 191 VGRLVYQKGFDTLIEIFGNIHLKYPDWVVEIYGSGVLLRELQSQVDKAGLTSCFKFMGVT 250

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV---CANHPSN 336
           D  +  +H   +F  PS  +       EA++ G   +   C N PS+
Sbjct: 251 DRIECEYHKASIFAMPSRFEGFPMVLVEAMSQGLACISFDCPNGPSD 297


>gi|68250302|ref|YP_249414.1| UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 86-028NP]
 gi|68058501|gb|AAX88754.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 86-028NP]
          Length = 353

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 36/171 (21%), Positives = 77/171 (45%), Gaps = 16/171 (9%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAA-----TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQN 223
           L+  +  +LA   C K++ L+ A      +++  + + ++   NP  L + + K+ + +N
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIISI--ANPNTL-LPKNKLAKWEN 181

Query: 224 GNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLK 283
                 K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK L 
Sbjct: 182 ------KTVLSVGHLFSYKGFDYLLKVWQVLAKKYPDWNLKIAGSGEEEENLKNLAKALN 235

Query: 284 L--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 236 IEDSVNFIPRTNDVAFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|398308522|ref|ZP_10511996.1| glycosyl transferase group 1 family protein [Bacillus mojavensis
           RO-H-1]
          Length = 673

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 71/156 (45%), Gaps = 11/156 (7%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPG 291
           A  I R+   K     +    +  KE+   ++D++G+GEDF++I++  ++ KL   V+  
Sbjct: 351 AVIISRLASMKNLTHAVKAFGLVVKEIPEAKLDIFGSGEDFEKIKKEIEEHKLQNNVFLK 410

Query: 292 RDHADPIFHDYKVFLNPSTT--DVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
               +P     K +L  ST+  +    +  EAL+ G  VV  ++               D
Sbjct: 411 GYTNNPDLEFQKAWLTISTSHFEGFGLSNMEALSNGCPVVTYDYDYG------ARSLVSD 464

Query: 350 GRNGFV--EATLKALAEE-PAQPTDAQTHQLSWESA 382
           G NG+V  +  ++ LAE   +   D +THQ   E A
Sbjct: 465 GVNGYVIEQYNIEKLAEGIISLMRDEKTHQKFSEQA 500


>gi|145631594|ref|ZP_01787360.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae R3021]
 gi|144982794|gb|EDJ90320.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae R3021]
          Length = 311

 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 36/171 (21%), Positives = 77/171 (45%), Gaps = 16/171 (9%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAA-----TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQN 223
           L+  +  +LA   C K++ L+ A      +++  + + ++   NP  L + + K+ + +N
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIISI--ANPNTL-LPKNKLAKWEN 181

Query: 224 GNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLK 283
                 K    +G +   KG++ LL    +  K      + + G+GE+ + ++  AK+L 
Sbjct: 182 ------KTILSVGHLFSYKGFDYLLKAWQVLAKNHPNWNLKIVGSGEEEENLKNLAKELD 235

Query: 284 L--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           +   V   P  +     + +  ++  PS T+ +     EA+A G  +V  N
Sbjct: 236 IKDSVNFIPRTNDVSFYYENSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|449667223|ref|XP_002167427.2| PREDICTED: phosphatidylinositol N-acetylglucosaminyltransferase
           subunit A-like [Hydra magnipapillata]
          Length = 440

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 40/176 (22%), Positives = 78/176 (44%), Gaps = 24/176 (13%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD- 293
           + R+V+ KG + L+G++ +  ++   ++  + G+G     I+   +K +L  RV+   + 
Sbjct: 195 VSRIVYRKGVDLLVGIIPLICEKFPDVQFLIGGDGPRKISIEELIEKHRLHKRVFLVGEL 254

Query: 294 ---HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN-------HPSNDFFKQFP 343
                  I     +FLN S T+  C +  EA + G  VV  N        P +  +   P
Sbjct: 255 QHYEVRNILVQGDIFLNTSLTEAFCMSIVEAASCGLKVVSTNVGGIPEVLPPDMIYLCEP 314

Query: 344 NCRTYDGRNGFVEATLKALAEEPAQ--PTDAQTHQ-----LSWESATERFLQVAEL 392
           + +      G  EA +KA+++  A       + H+      +W++ T R  +V +L
Sbjct: 315 SIK------GVYEAVVKAISDHQAGLFVCSYEYHERIKSMYTWDNITSRTEKVYDL 364


>gi|220918058|ref|YP_002493362.1| group 1 glycosyl transferase [Anaeromyxobacter dehalogenans 2CP-1]
 gi|219955912|gb|ACL66296.1| glycosyl transferase group 1 [Anaeromyxobacter dehalogenans 2CP-1]
          Length = 378

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 28/112 (25%), Positives = 51/112 (45%), Gaps = 16/112 (14%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+G M   KG  +LL  + + H     + +   G+G D + ++  AK L   +  +PG  
Sbjct: 204 YVGAMTEKKGVGDLLEGMRLLHARGRPVRLTAVGSGADLEALRARAKSLPPGLATFPG-- 261

Query: 294 HADPIFHD--YKVFLN---------PSTTDVVCTATAEALAMGKIVVCANHP 334
              P+ ++  +++ L+         P  T+ +     EALA    VV ++HP
Sbjct: 262 ---PVGNEEAFRLMLDSTLVCVPTRPEFTEGMPLTLTEALASRTPVVASDHP 310


>gi|421849041|ref|ZP_16282026.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus NBRC 101655]
 gi|371460310|dbj|GAB27229.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus NBRC 101655]
          Length = 372

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 49/104 (47%), Gaps = 3/104 (2%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIY--HKELAGLEMDLYGNGEDFDQIQRAAKKLKLV-VRV 288
           A  +GR+  +KGY+ L   L     +  L   E+ + G G++   +++  K+LKL  +R 
Sbjct: 195 AGSLGRLHIAKGYDILCTALKRLQAYTSLPAFEVTVAGEGQERAHLEQMVKELKLTNIRF 254

Query: 289 YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
                 +    H   ++L PS  +  C A  EA+  G  V+ ++
Sbjct: 255 SGFASSSQDFLHTQHLYLQPSRREGFCIAMHEAMQAGLPVIGSD 298


>gi|295397156|ref|ZP_06807261.1| glycosyltransferase [Aerococcus viridans ATCC 11563]
 gi|294974599|gb|EFG50321.1| glycosyltransferase [Aerococcus viridans ATCC 11563]
          Length = 382

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 3/104 (2%)

Query: 230 KGAYYIGRMVW-SKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAK--KLKLVV 286
           K   ++GR+ + +KG + LL  + I  +  + +E+++ GNG D +++ +  K  KL+  V
Sbjct: 210 KSILFVGRIYYLTKGIDLLLKSMKIITQSGSRIELNIVGNGPDMERLVQDLKNHKLEHSV 269

Query: 287 RVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVC 330
            V    D     +    + + PST +       EAL MG  VV 
Sbjct: 270 NVLGSTDDVKSHYLQNSMLVLPSTIEGFGLVVTEALEMGLPVVS 313


>gi|148825758|ref|YP_001290511.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae PittEE]
 gi|148715918|gb|ABQ98128.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae PittEE]
          Length = 353

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 38/172 (22%), Positives = 75/172 (43%), Gaps = 18/172 (10%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAATQEY------PNSIVCNVHGVNPKFLEIGEKKMEQQQ 222
           L+  +  +LA   C K++ L+ A +         N+I+      NP  L + + K+ + +
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIIT---IANPNTL-LPKNKLAKWE 180

Query: 223 NGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL 282
           N      K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK L
Sbjct: 181 N------KTILSVGHLFSYKGFDYLLKVWQVLAKKYPDWNLKIVGSGEEEENLKNLAKAL 234

Query: 283 KL--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 235 DIEDSVNFTPRTNDVSFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|380696370|ref|ZP_09861229.1| putative glycosyl transferase [Bacteroides faecis MAJ27]
          Length = 351

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 32/143 (22%), Positives = 67/143 (46%), Gaps = 9/143 (6%)

Query: 213 IGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDF 272
           IG   M+ +   N   +K    +GR+V +KG++ LL +     K  +   + + G+GE+ 
Sbjct: 162 IGNFVMDAKFKSN-LMSKKVLSVGRLVHTKGFDILLDIWGDISKRHSDWILQIVGSGEEK 220

Query: 273 DQIQRAAKKLKLV--VRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVC 330
           + +++  ++L L   V++ P   + +  + D  ++++ +  +      AEA + G  +VC
Sbjct: 221 ENLRKKIEELHLEENVQMIPTVKNIEDYYIDASIYVSAARLEPWGLVLAEAKSFGLPIVC 280

Query: 331 ANHPSNDFFKQFPNCRTYDGRNG 353
            + P        P     DG +G
Sbjct: 281 FDCPHG------PKNIVRDGMDG 297


>gi|421852983|ref|ZP_16285665.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus subsp. pasteurianus LMG 1262 = NBRC 106471]
 gi|371478834|dbj|GAB30868.1| lipopolysaccharide glycosyl transferase group 1 [Acetobacter
           pasteurianus subsp. pasteurianus LMG 1262 = NBRC 106471]
          Length = 372

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 49/104 (47%), Gaps = 3/104 (2%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIY--HKELAGLEMDLYGNGEDFDQIQRAAKKLKLV-VRV 288
           A  +GR+  +KGY+ L   L     +  L   E+ + G G++   +++  K+LKL  +R 
Sbjct: 195 AGSLGRLHIAKGYDILCTALKRLQAYTSLPAFEVTVAGEGQERAHLEQMVKELKLTNIRF 254

Query: 289 YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
                 +    H   ++L PS  +  C A  EA+  G  V+ ++
Sbjct: 255 SGFASSSQDFLHTQHLYLQPSRREGFCIAMHEAMQAGLPVIGSD 298


>gi|417844872|ref|ZP_12490911.1| putative glycosyl transferase, group 1 [Haemophilus haemolyticus
           M21639]
 gi|341956352|gb|EGT82782.1| putative glycosyl transferase, group 1 [Haemophilus haemolyticus
           M21639]
          Length = 353

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 37/171 (21%), Positives = 75/171 (43%), Gaps = 16/171 (9%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAA-----TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQN 223
           L+  +  +LA   C KV+ L+ +      +++  + + ++   NP  L + + K+ + +N
Sbjct: 125 LVRTLGKYLASTRCDKVVTLTESEKTLWQEKFKTNNIISI--ANPNTL-LPKNKLAKWEN 181

Query: 224 GNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLK 283
                 K    +G +   KG++ LL    +  K      + + G+GE+ D ++  AK L 
Sbjct: 182 ------KTILSVGHLFSYKGFDYLLKAWQVLAKNHPDWNLKIVGSGEEEDNLKNLAKALD 235

Query: 284 L--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 236 IEDSVNFIPRTNDVAFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|56963617|ref|YP_175348.1| glycosyltransferase [Bacillus clausii KSM-K16]
 gi|56909860|dbj|BAD64387.1| glycosyltransferase [Bacillus clausii KSM-K16]
          Length = 406

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 55/125 (44%), Gaps = 11/125 (8%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGRD 293
           IGRM   K  E LL  +     EL  ++  + G+G     ++  A  L +   V + G  
Sbjct: 214 IGRMAEEKNMEALLYAIKSLESELDHVKTVMVGDGPVRKSLEALAVSLGISEHVRFTGAV 273

Query: 294 HADPIFHDY---KVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDG 350
             + I H Y    +F++ STT+       EA+A G IVV  + PS    K+       DG
Sbjct: 274 EWNQIHHYYHLGDLFVSASTTEAQGLTYIEAMASGCIVVAKSDPS---IKRI----VLDG 326

Query: 351 RNGFV 355
             GFV
Sbjct: 327 TTGFV 331


>gi|319776042|ref|YP_004138530.1| UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae F3047]
 gi|319896850|ref|YP_004135045.1| udp-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae F3031]
 gi|329123201|ref|ZP_08251770.1| UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus aegyptius ATCC 11116]
 gi|317432354|emb|CBY80709.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae F3031]
 gi|317450633|emb|CBY86853.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae F3047]
 gi|327471546|gb|EGF16990.1| UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus aegyptius ATCC 11116]
          Length = 353

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 38/172 (22%), Positives = 75/172 (43%), Gaps = 18/172 (10%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAATQEY------PNSIVCNVHGVNPKFLEIGEKKMEQQQ 222
           L+  +  +LA   C K++ L+ A +         N+I+      NP  L + + K+ + +
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIIT---IANPNTL-LPKNKLAKWE 180

Query: 223 NGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL 282
           N      K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK L
Sbjct: 181 N------KTILSVGHLFSYKGFDYLLKVWQVLAKKYPDWNLKIVGSGEEEENLKNLAKVL 234

Query: 283 KL--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 235 DIEDSVNFIPRTNDVSFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|283458527|ref|YP_003363155.1| glycosyltransferase [Rothia mucilaginosa DY-18]
 gi|283134570|dbj|BAI65335.1| glycosyltransferase [Rothia mucilaginosa DY-18]
          Length = 440

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 49/103 (47%), Gaps = 4/103 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL----KLVVRVY 289
           ++GR+   K  + L+  L++    L  + +D+ G GE  + +++ A       +++ R +
Sbjct: 257 FVGRLAEEKNVDVLIEALSLLPDTLEHVVLDIAGGGELREALEQKAHDCGVSDRVIFRGF 316

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
              +H   ++    VF  P T ++    + EA++    VV AN
Sbjct: 317 VPDEHLPEVYQMADVFCQPGTAELQSLVSLEAMSASTPVVLAN 359


>gi|402830647|ref|ZP_10879344.1| glycosyltransferase, group 1 family protein [Capnocytophaga sp.
           CM59]
 gi|402283996|gb|EJU32501.1| glycosyltransferase, group 1 family protein [Capnocytophaga sp.
           CM59]
          Length = 372

 Score = 39.3 bits (90), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 93/214 (43%), Gaps = 26/214 (12%)

Query: 189 SAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELL 248
           S   ++Y N IVC   G++   L I +  +   +N  K + K  + +GR+V+ KG+E L+
Sbjct: 152 SNTLKKYKNKIVCIPIGIDNNELFIDQNTLNSLKNKYKGY-KIIFSLGRLVYYKGFEYLI 210

Query: 249 GLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGRDHADPIFHDYK---V 304
             +     ++  L   + G GE  +++Q    K  L  RV   G+   + +   Y+   +
Sbjct: 211 ETVKSLPNDIIIL---IAGIGELKEKLQEHISKHNLQDRVKLLGKIPFEELGAYYQLCDI 267

Query: 305 FLNPST--TDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDG-------RNGFV 355
           F  PST  ++       EA+A GK V+  +   +     + N     G        N   
Sbjct: 268 FCLPSTERSEAFGVVQIEAMAFGKPVISTSIKGSGV--DWVNLNNVSGIIVPPKDTNRLT 325

Query: 356 EATLKALAEEPAQPTDAQTHQLSWESATERFLQV 389
           EA ++ L       TD + +QL    A +R+ +V
Sbjct: 326 EAIMELL-------TDEKKYQLLSIGAKKRYEEV 352


>gi|424835811|ref|ZP_18260470.1| group 1 glycosyl transferase family protein [Clostridium sporogenes
           PA 3679]
 gi|365977681|gb|EHN13779.1| group 1 glycosyl transferase family protein [Clostridium sporogenes
           PA 3679]
          Length = 414

 Score = 39.3 bits (90), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 34/136 (25%), Positives = 57/136 (41%), Gaps = 13/136 (9%)

Query: 205 GVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMD 264
           G NP    I EKK+    N NK       Y G++ +SKG   L+  +N+  K +  +E+ 
Sbjct: 206 GFNPDTFYINEKKI----NSNKI---KIVYAGKLSYSKGVPFLINSINLIKKYVNNIELY 258

Query: 265 LYGNGEDFDQI------QRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTAT 318
           L G+G   + I         +    + V     +     +F +  + + PS  + +    
Sbjct: 259 LAGSGAGDETISIIKLCNHNSCNFNINVLGSISQKKLSDLFRECDILVLPSLYEGLPLVL 318

Query: 319 AEALAMGKIVVCANHP 334
            EA+A G  +VC   P
Sbjct: 319 IEAMACGLKIVCTELP 334


>gi|148827120|ref|YP_001291873.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae
           PittGG]
 gi|148718362|gb|ABQ99489.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae
           PittGG]
          Length = 296

 Score = 39.3 bits (90), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 37/172 (21%), Positives = 75/172 (43%), Gaps = 18/172 (10%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAATQEY------PNSIVCNVHGVNPKFLEIGEKKMEQQQ 222
           L+  +  +LA   C K++ L+ + +         N+I+      NP  L + + K+ + +
Sbjct: 68  LVRTLGKYLAVTTCDKIVTLTESEKTLWQEKFKTNNII---SIANPNTL-LPKNKLAKLE 123

Query: 223 NGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL 282
           N      K    +G +   KG++ LL +  +  K+     + + G+GE+ + ++  AK L
Sbjct: 124 N------KTILSVGHLFSYKGFDYLLKVWQVLEKKYPNWNLKIVGSGEEEENLKNLAKAL 177

Query: 283 KL--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 178 DIEDSVNFIPRTNDVSFYYESSSIYCLPSQTEGLPLVLIEAMAFGLPIVAFN 229


>gi|329113796|ref|ZP_08242567.1| Glycosyltransferase [Acetobacter pomorum DM001]
 gi|326696806|gb|EGE48476.1| Glycosyltransferase [Acetobacter pomorum DM001]
          Length = 372

 Score = 39.3 bits (90), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 50/104 (48%), Gaps = 3/104 (2%)

Query: 232 AYYIGRMVWSKGYEELLGLLNIY--HKELAGLEMDLYGNGEDFDQIQRAAKKLKLV-VRV 288
           A  +GR+  +KGY+ L   L     +  L   E+ + G G++   +++ AK+LKL  +R 
Sbjct: 195 AGSLGRLHVAKGYDILCAALKRLQAYTSLPAFEVTVAGEGQERAHLEQMAKELKLTNIRF 254

Query: 289 YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
                 +    +   ++L PS  +  C A  EA+  G  V+ ++
Sbjct: 255 SGFASSSQDFLYTQHLYLQPSRREGFCIAMHEAMQAGLPVIGSD 298


>gi|387772327|ref|ZP_10128275.1| glycosyltransferase, group 1 family protein [Haemophilus
           parahaemolyticus HK385]
 gi|386906584|gb|EIJ71310.1| glycosyltransferase, group 1 family protein [Haemophilus
           parahaemolyticus HK385]
          Length = 619

 Score = 39.3 bits (90), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 16/129 (12%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----Y 289
           ++GR++  KG  E L    +  KE    +  + G  +  +    + +KL+  +      Y
Sbjct: 447 FLGRLLREKGIFEFLEAAKVVKKEKPNTQFLVLGQIDKQNPTAMSEEKLQEYIDANIIEY 506

Query: 290 PGRDHADPIF-HDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCR-- 346
            G     P +     VF+ PS  + V  +T EA+AMGK+++  +          P CR  
Sbjct: 507 LGYVKNVPDYIEKTDVFVLPSYREGVPKSTQEAMAMGKVILTTD---------VPGCRET 557

Query: 347 TYDGRNGFV 355
             DG NGF+
Sbjct: 558 VVDGVNGFL 566


>gi|354595093|ref|ZP_09013130.1| glycosyltransferase [Commensalibacter intestini A911]
 gi|353671932|gb|EHD13634.1| glycosyltransferase [Commensalibacter intestini A911]
          Length = 362

 Score = 39.3 bits (90), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 27/114 (23%), Positives = 54/114 (47%), Gaps = 7/114 (6%)

Query: 222 QNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKE--LAGLEMDLYGNGEDFDQIQRAA 279
           QNG    T     +GR+   KGY+ L+    +  K   L   ++ +YG G+++D + +  
Sbjct: 175 QNGQ---TIQIVSMGRLHPVKGYDILIEACRLLSKRPNLPSYKVTIYGEGQEYDTLNKRI 231

Query: 280 KKLKL--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA 331
           ++ +L   +++     H +       ++L PS ++  C A  EA+  G  V+ +
Sbjct: 232 QQYQLEQYIKLAGFTPHVEQALIQSHLYLQPSRSEGFCVAAHEAMQAGLPVIAS 285


>gi|227326699|ref|ZP_03830723.1| glycosyltransferase-like protein [Pectobacterium carotovorum subsp.
           carotovorum WPP14]
          Length = 342

 Score = 39.3 bits (90), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 49/99 (49%), Gaps = 5/99 (5%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   KG + LL     +  +   L + + G+G   + ++R A +  +  + Y    
Sbjct: 188 YVGRLSEEKGLDFLLD----FFCKRPELSLTIVGDGPQRESLERKAGE-NIQFKGYVNNS 242

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
             + IF ++ VF+ PST++       EAL  G  V+C++
Sbjct: 243 ELNSIFVEHDVFIFPSTSEPWGLVVEEALVYGLPVICSD 281


>gi|83309364|ref|YP_419628.1| glycosyltransferase [Magnetospirillum magneticum AMB-1]
 gi|82944205|dbj|BAE49069.1| Glycosyltransferase [Magnetospirillum magneticum AMB-1]
          Length = 347

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 66/159 (41%), Gaps = 22/159 (13%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   K  E  L L      +L G +  + G+G   D+++R   ++      + G D
Sbjct: 190 YVGRVAVEKNIEAFLAL------DLPGAKA-VVGDGPQLDEMKRRFPEVHFAGARF-GED 241

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRNG 353
            A   +    VF+ PS TD       EALA G  V     P        PN    D  +G
Sbjct: 242 LAR-HYAAADVFVFPSRTDTFGLVLLEALASGLPVAAYPVPG-------PNDVIGDSPSG 293

Query: 354 FVEATL-----KALAEEPAQPTDAQTHQLSWESATERFL 387
            ++  L      AL+ +PA    A     SWE+ T +FL
Sbjct: 294 VLDQDLGKAARAALSIDPAL-CRAHALTFSWEACTRQFL 331


>gi|422325343|ref|ZP_16406379.1| hypothetical protein HMPREF0737_01489 [Rothia mucilaginosa M508]
 gi|353343347|gb|EHB87665.1| hypothetical protein HMPREF0737_01489 [Rothia mucilaginosa M508]
          Length = 401

 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 49/103 (47%), Gaps = 4/103 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL----KLVVRVY 289
           ++GR+   K  + L+  L++    L  + +D+ G GE  + +++ A       +++ R +
Sbjct: 218 FVGRLAEEKNVDVLIEALSLLPDTLEHVVLDIAGGGELREALEQKAHDCGVSDRVIFRGF 277

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
              +H   ++    VF  P T ++    + EA++    VV AN
Sbjct: 278 VPDEHLPEVYQMADVFCQPGTAELQSLVSLEAMSASTPVVLAN 320


>gi|255327238|ref|ZP_05368312.1| glycosyl transferase, group 1 [Rothia mucilaginosa ATCC 25296]
 gi|255295518|gb|EET74861.1| glycosyl transferase, group 1 [Rothia mucilaginosa ATCC 25296]
          Length = 401

 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 49/103 (47%), Gaps = 4/103 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL----KLVVRVY 289
           ++GR+   K  + L+  L++    L  + +D+ G GE  + +++ A       +++ R +
Sbjct: 218 FVGRLAEEKNVDVLIEALSLLPDTLEHVVLDIAGGGELREALEQKAHDCGVSDRVIFRGF 277

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
              +H   ++    VF  P T ++    + EA++    VV AN
Sbjct: 278 VPDEHLPEVYQMADVFCQPGTAELQSLVSLEAMSASTPVVLAN 320


>gi|428298841|ref|YP_007137147.1| group 1 glycosyl transferase [Calothrix sp. PCC 6303]
 gi|428235385|gb|AFZ01175.1| glycosyl transferase group 1 [Calothrix sp. PCC 6303]
          Length = 389

 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 66/148 (44%), Gaps = 11/148 (7%)

Query: 233 YYIGRMVWSKGYEELLGLLNIYHKE-LAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YP 290
           + IG+++  KG  EL+    I  ++      + + G G    ++   AKK  L   + + 
Sbjct: 207 FSIGQIIPRKGIRELIEACYILQQQGYEKYSLLIAGEGNQRQELDELAKKYNLATNISWL 266

Query: 291 GR---DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCA-NHPSNDFFKQFPNCR 346
           G+    H    FH   VF+ PS  DV      EA+++GK V+ + N  S +  ++  N  
Sbjct: 267 GQVNYSHLGSYFHLIDVFVFPSHEDVWGMVLLEAMSLGKAVIASQNAGSAELIREGDNGY 326

Query: 347 TYDGRNGFVEA---TLKALAEEPAQPTD 371
           T++    F E    ++K   + P Q  D
Sbjct: 327 TFNP--SFPEELANSMKECMDNPPQIGD 352


>gi|407983029|ref|ZP_11163690.1| glycosyl transferases group 1 family protein [Mycobacterium
           hassiacum DSM 44199]
 gi|407375312|gb|EKF24267.1| glycosyl transferases group 1 family protein [Mycobacterium
           hassiacum DSM 44199]
          Length = 409

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 43/102 (42%), Gaps = 6/102 (5%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           YIGR+ + KG  +L+  L    +   G  + + G G  FD +    +K + VVR      
Sbjct: 216 YIGRLEYEKGIHDLIAALPRIRRSHPGARLTIAGTGTQFDFLVEQTRKHR-VVRAVTFAG 274

Query: 294 HAD-----PIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVC 330
           H D      + HD  V + PS  +       EA A G  VV 
Sbjct: 275 HLDHEQLVHLLHDADVAVLPSHYEPFGIVALEAAATGVPVVA 316


>gi|332300154|ref|YP_004442075.1| group 1 glycosyl transferase [Porphyromonas asaccharolytica DSM
           20707]
 gi|332177217|gb|AEE12907.1| glycosyl transferase group 1 [Porphyromonas asaccharolytica DSM
           20707]
          Length = 384

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 52/228 (22%), Positives = 92/228 (40%), Gaps = 19/228 (8%)

Query: 177 LARVHCHKVIRLSAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIG 236
           ++  H  +V    A  +E+ +  V   +GVN  +LE     ++  Q G+   T    Y+G
Sbjct: 154 ISPAHIKRVEAWRACPREFASKTVNIPNGVNSYWLEQISCDLKSHQRGD---TWHFLYVG 210

Query: 237 RMVWSKGYEELL-GLLNIYHKELAGLEMDLY---GNGEDFDQIQRAAKKLKLVVR---VY 289
           R    K    L+  +L++  K   G  +DL+   G G+D  +I+  AK+   V     V 
Sbjct: 211 RFTSRKNLPRLMQAILSLKDK---GDSLDLHIVGGKGDDTKRIETLAKRHPEVFYLHGVV 267

Query: 290 PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMG-KIVVCANHPSNDFFK----QFPN 344
             ++    I     +F  PS T+       EAL+ G  I+       + FF     +  N
Sbjct: 268 QDKEKIRAIMQQCHIFTMPSLTETFGLVYVEALSQGLPILYTEGEGVDGFFSSSYGERCN 327

Query: 345 CRTYDGRNGFVEATLKALAEEPAQPTDAQTHQLSWESATERFLQVAEL 392
            ++       +E  L    E     +  + H   W+   +++L + EL
Sbjct: 328 PKSVQDIAEKLERMLSHYEEYSIDDSYLKEH-FDWDIVADKYLAIMEL 374


>gi|251799716|ref|YP_003014447.1| group 1 glycosyl transferase [Paenibacillus sp. JDR-2]
 gi|247547342|gb|ACT04361.1| glycosyl transferase group 1 [Paenibacillus sp. JDR-2]
          Length = 372

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 6/81 (7%)

Query: 256 KELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV---YPGRD-HADPIFHDYKVFLNPSTT 311
           +++   E+ LYG+G     ++  +K LKL V      P +  HA+    DY +F++PS  
Sbjct: 218 QDIGEFEVVLYGDGPQRAALEAESKILKLPVTFRGNLPSKQLHAE--LADYDIFVHPSRM 275

Query: 312 DVVCTATAEALAMGKIVVCAN 332
           +    +  EALA G  V+C++
Sbjct: 276 ESFGLSVTEALASGCAVICSD 296


>gi|254515337|ref|ZP_05127398.1| glycosyl transferase, group 1 [gamma proteobacterium NOR5-3]
 gi|219677580|gb|EED33945.1| glycosyl transferase, group 1 [gamma proteobacterium NOR5-3]
          Length = 372

 Score = 38.9 bits (89), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 30/129 (23%), Positives = 58/129 (44%), Gaps = 2/129 (1%)

Query: 211 LEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGE 270
           +++ E + E ++       K   +IG+++  K   +LL + N    E   +E+ L G+G+
Sbjct: 176 VDLSEVEAEAERPRPTTSKKRIGFIGQLISRKNVPDLLWIFNKLWSERDDVELHLLGDGD 235

Query: 271 DFDQIQRAAKKLKLVVRVY--PGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIV 328
           +   ++  A  L     ++    RD    + H + +F+  ST + +     EA AMG  V
Sbjct: 236 ERMNLEMQAGMLSASPDIHFLGFRDDRLELLHSFDLFVMTSTLEGIPRCLMEACAMGVPV 295

Query: 329 VCANHPSND 337
              + P  D
Sbjct: 296 AAYDIPGID 304


>gi|57641668|ref|YP_184146.1| glycosyl transferase family protein [Thermococcus kodakarensis
           KOD1]
 gi|57159992|dbj|BAD85922.1| glycosyltransferase, family 1 [Thermococcus kodakarensis KOD1]
          Length = 369

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 15/126 (11%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG- 291
           ++GR++  K  E L+  + I   E+  +++ + G+G +  +++R +++L LV  V + G 
Sbjct: 195 FVGRLIREKNVELLVRAVQILKTEIPDIKVLIIGDGPERRRLERLSEELNLVENVDFKGF 254

Query: 292 RDHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCANHPSN----------DF 338
            D  + +    K   VF+ PS  +       EA A G  VV   HP N          + 
Sbjct: 255 LDEYEAVISHMKASRVFVLPSIREGFGITALEANASGIPVVTVVHPLNAAAELIVHEYNG 314

Query: 339 FKQFPN 344
           F  FPN
Sbjct: 315 FLAFPN 320


>gi|407003967|gb|EKE20461.1| group 1 glycosyl transferase, partial [uncultured bacterium]
          Length = 252

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 39/174 (22%), Positives = 74/174 (42%), Gaps = 26/174 (14%)

Query: 237 RMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG---R 292
           R+   KG   L+  L I   +   L + + G+G++ + +    KK+KL   V + G   R
Sbjct: 81  RITARKGTNYLVEALGILAPKYPNLLLRILGDGDEKNNLIDLTKKMKLENFVQFLGRIPR 140

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYDGRN 352
           +   P + +  +F+ PS  + +  A  EALA G  ++       +   +       DG N
Sbjct: 141 EETAPYYQEASLFVLPSLNEGMSNAMLEALASGLPILATQTGGTEELVE-------DGIN 193

Query: 353 GFV-------------EATLK--ALAEEPAQPTDAQTHQLSWESATERFLQVAE 391
           GFV             E  LK   L  +    +  +  ++SW++  +++ ++ E
Sbjct: 194 GFVIKMKDPKDLADKIEMILKDDGLRVQMGLASRKKAQEMSWKNVAQKYFELYE 247


>gi|123967089|ref|YP_001012170.1| SqdX [Prochlorococcus marinus str. MIT 9515]
 gi|123201455|gb|ABM73063.1| SqdX [Prochlorococcus marinus str. MIT 9515]
          Length = 377

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 67/162 (41%), Gaps = 16/162 (9%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRVYPGRD 293
           Y+GR+   K  E +  +L    + + G  + L G+G    Q+++  +  K     Y   +
Sbjct: 205 YVGRLSAEKQIERIKPVL----ESIPGACLALVGDGPYRSQLEKIFENTKTNFVGYLSGE 260

Query: 294 HADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPS-NDFFKQFPNCRTYD--- 349
                +    +FL PS+T+ +     EA+A G  V+ AN     D      N   YD   
Sbjct: 261 ELASAYASGDIFLFPSSTETLGLVLLEAMAAGCPVIGANKGGIPDIINDGINGCLYDPDE 320

Query: 350 ---GRNGFVEATLKALAE---EPAQPTDA--QTHQLSWESAT 383
              G    +EAT K LA+   + A   +A  +  Q  W  AT
Sbjct: 321 KDNGEKSLIEATKKILADKNKKEAMRIEARKEAEQWDWNQAT 362


>gi|410723605|ref|ZP_11362839.1| glycosyltransferase [Clostridium sp. Maddingley MBC34-26]
 gi|410603069|gb|EKQ57514.1| glycosyltransferase [Clostridium sp. Maddingley MBC34-26]
          Length = 402

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 57/125 (45%), Gaps = 11/125 (8%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           YIGR+   K   E+L LL    +++  +++ + G G   D ++   KK  +   V + G 
Sbjct: 211 YIGRIGEEKNISEILMLLPNIIRKIKNVKLLVVGGGPYLDNLKTLVKKQNIEGNVIFTGM 270

Query: 293 DHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTYD 349
            + + ++  YK   +F+  ST++       EAL+ G  VVC      D   +        
Sbjct: 271 INCEEVYKYYKLSEIFVTASTSETQGLTYIEALSSGCPVVCKYDSCIDGVIE-------Q 323

Query: 350 GRNGF 354
           G+NGF
Sbjct: 324 GKNGF 328


>gi|260588647|ref|ZP_05854560.1| glycosyltransferase [Blautia hansenii DSM 20583]
 gi|260541122|gb|EEX21691.1| glycosyltransferase [Blautia hansenii DSM 20583]
          Length = 422

 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 43/184 (23%), Positives = 82/184 (44%), Gaps = 22/184 (11%)

Query: 235 IGRMVWSKGYEELLGLLN--IYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG 291
           +GR+   K  EE+L      IY +E   L + + G+G D +++++ ++ L L  +V + G
Sbjct: 219 VGRLAKEKNLEEILNYFARLIYEEEKQNLTLLIVGDGPDRERLEKISESLSLTGKVIFTG 278

Query: 292 RDHADPIFHDYK---VFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRTY 348
             +   +   Y+   VF+  S ++       EALA G+  +C          +  N    
Sbjct: 279 MVNPQDVGIYYQLGDVFVCASNSETQGITYIEALASGRPALCRKD-------ECLNQVIT 331

Query: 349 DGRNGFVEAT-------LKALAEEPAQPTDA--QTHQLSWESATERFLQVAELVGDVVTK 399
           DG NGF   T       L+ + E+  +  +   +  + ++  +T  F  +AE +   V K
Sbjct: 332 DGYNGFQYETYEYFKMHLEYILEDETRKAEMGLRAKETAYLYSTWNFCTMAESLYKEVLK 391

Query: 400 RSKS 403
           R ++
Sbjct: 392 RQEA 395


>gi|424665844|ref|ZP_18102880.1| hypothetical protein HMPREF1205_01719 [Bacteroides fragilis HMW
           616]
 gi|404574097|gb|EKA78848.1| hypothetical protein HMPREF1205_01719 [Bacteroides fragilis HMW
           616]
          Length = 411

 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 51/112 (45%), Gaps = 2/112 (1%)

Query: 220 QQQNGNKAFTKGAYYIGRM-VWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRA 278
           Q  N N    K   ++GRM  + K  + LL +  + +K++   E+ L G G++   +Q  
Sbjct: 225 QCPNVNLNKKKQILFVGRMSYYDKRVDRLLDIWGMIYKKVPAWELILVGGGKELPLLQAK 284

Query: 279 AKKLKLV-VRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV 329
           ++K+KL  +R     DH D  + D  V    S  +       EA A G + V
Sbjct: 285 SQKMKLQRIRFAGHSDHVDDFYRDASVLCLTSAFEGWPLCLTEAQANGVVPV 336


>gi|378697937|ref|YP_005179895.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 10810]
 gi|301170453|emb|CBW30060.1| putative UDP-galactose--lipooligosaccharide galactosyltransferase
           [Haemophilus influenzae 10810]
          Length = 353

 Score = 38.9 bits (89), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 36/171 (21%), Positives = 76/171 (44%), Gaps = 16/171 (9%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAA-----TQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQN 223
           L+  +  +LA   C K++ L+ A      +++  + + ++   NP  L + + K+ + +N
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIISI--ANPNTL-LPKNKLAKLEN 181

Query: 224 GNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLK 283
                 K    +G +   KG++ LL    +  K+     + + G+GE+ + ++  AK L 
Sbjct: 182 ------KTILSVGHLFSYKGFDYLLKAWQVLAKKYPDWNLKIVGSGEEEENLKNLAKALD 235

Query: 284 L--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
           +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 236 IEDSVNFIPRTNDVSFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|448609621|ref|ZP_21660652.1| hexosyltransferase, glycosyltransferase [Haloferax mucosum ATCC
           BAA-1512]
 gi|445746638|gb|ELZ98099.1| hexosyltransferase, glycosyltransferase [Haloferax mucosum ATCC
           BAA-1512]
          Length = 405

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 61/293 (20%), Positives = 120/293 (40%), Gaps = 31/293 (10%)

Query: 128 HLTWF--HHGKRWKAKFRF-VVGIVHTN--YLEYVKREKNDRLQAFLLEFVNSWLARVHC 182
           H++W   + G R KA      V  VH N  + E   + +N+RL+        +W    + 
Sbjct: 119 HMSWTAGYVGARLKAALDIPYVLTVHANRDWFETQLQSENERLR-------EAW---TNA 168

Query: 183 HKVIRLS----AATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRM 238
            ++IR++    +  + Y + +V   +G +    E  +      + G  A T   + +G +
Sbjct: 169 DRLIRVNRRDRSKLEAYNDDVVHISNGYDTDVFERLDSDDACDRLGIDADTPLVFALGTL 228

Query: 239 VWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV----YPGRDH 294
              KG++ L+  +   H+      + + G G   D+++  A+ L +  R     Y   + 
Sbjct: 229 KPRKGFQHLMHAMTRVHESEPDARLVIGGQGRVRDELESLAESLGIADRTDLLGYVESET 288

Query: 295 ADPIFHDYKVFLNPSTTDVVCTATAEALAMGK-IVVCANHPSNDFFKQFPNCRTYDG--- 350
            +   +   VF+ PS ++       EA+A G  +V   N  S +           DG   
Sbjct: 289 LNDWMNAADVFVLPSYSESFGVVQLEAMACGTPVVATENGGSEEVIASEEYGLLVDGPEA 348

Query: 351 RNGFVEATLKALAEE-PAQPTDAQTHQLSWESATERFLQVAELVGDVVTKRSK 402
            +   +A + AL  +  A   ++  ++ +WE   E   ++A+L  DV+    K
Sbjct: 349 HDELGDAVVTALRRDWDADAIESYANKFTWEYVCE---EIADLYVDVLEATRK 398


>gi|325958643|ref|YP_004290109.1| group 1 glycosyl transferase [Methanobacterium sp. AL-21]
 gi|325330075|gb|ADZ09137.1| glycosyl transferase group 1 [Methanobacterium sp. AL-21]
          Length = 380

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 75/163 (46%), Gaps = 11/163 (6%)

Query: 212 EIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELL-GLLNIYHKELA-GLEMDLYGNG 269
           EI  KK  + +N  K  T    Y G ++  KG + +L  L ++ HK+ A  + + + G+G
Sbjct: 183 EINLKKDRKDKNSKK--TINLLYTGHIIPRKGVDYILKSLHSLVHKKGADNVVLTVVGDG 240

Query: 270 EDFDQIQRAAKKLKLVVRV----YPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMG 325
            + +++ + A +L++   +    +  R+    +  +  +++  S ++    A AEALAMG
Sbjct: 241 PEKNKLIKMADELEIADHINWMSFLSREELIDLIRESDIYMLLSRSEAYGIAVAEALAMG 300

Query: 326 KIVVCANHPSNDFFKQFPNCRTYD---GRNGFVEATLKALAEE 365
              V +N  +   F     C   D     +   +  LK  AE+
Sbjct: 301 TPCVISNSTALTEFSNEIGCYAVDYPPVPDDVADTVLKIYAED 343


>gi|119720626|ref|YP_921121.1| glycosyl transferase, group 1 [Thermofilum pendens Hrk 5]
 gi|119525746|gb|ABL79118.1| glycosyl transferase, group 1 [Thermofilum pendens Hrk 5]
          Length = 380

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 65/130 (50%), Gaps = 6/130 (4%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELA--GLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YP 290
           ++GR++  KG + LL ++ + + EL    ++  + G+G   + I++ A+K + VV + Y 
Sbjct: 198 FVGRLLKDKGVDTLLRIIYLINDELNLHDVKFTIVGSGPLEEDIKKLAQKYQNVVFLGYV 257

Query: 291 GRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSN-DFFKQFPNCRTYD 349
             ++   I+ +  +FL PS ++ +  +  EA A G   V +  P   D  +     R  D
Sbjct: 258 KHENMPSIYREANLFLLPSRSEGMPLSLLEAQACGLPAVASKIPGVLDIVRDGVTGRLVD 317

Query: 350 GRN--GFVEA 357
             +  GFV A
Sbjct: 318 AEDVRGFVSA 327


>gi|429752475|ref|ZP_19285335.1| glycosyltransferase, group 1 family protein [Capnocytophaga sp.
           oral taxon 326 str. F0382]
 gi|429176756|gb|EKY18114.1| glycosyltransferase, group 1 family protein [Capnocytophaga sp.
           oral taxon 326 str. F0382]
          Length = 371

 Score = 38.5 bits (88), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 71/153 (46%), Gaps = 16/153 (10%)

Query: 189 SAATQEYPNSIVCNVHGVNPKFLEIGEKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELL 248
           S+  ++Y + I C   G++ K L + E K+ + +   K   K  +Y+GR+V+ KG E L+
Sbjct: 152 SSTLKKYKDKITCIPIGIDNKELTVNETKLNELKKEYKG-KKIVFYLGRLVYYKGIEYLI 210

Query: 249 GLLNIYHKELAGLEMDLY-GNGEDFDQIQRAA------KKLKLVVRVYPGRDHADPIFHD 301
                  K L    + L  G GE  D++Q+         K+KL+ ++    +     +  
Sbjct: 211 EA----SKSLPDDTIILIAGIGELKDKLQKQIHSYNLEDKVKLLGKI--PFEELGAYYQL 264

Query: 302 YKVFLNPST--TDVVCTATAEALAMGKIVVCAN 332
             +F  PST  ++       EA+A GK V+  +
Sbjct: 265 CDIFCLPSTERSEAFGVVQIEAMAFGKPVISTS 297


>gi|417566732|ref|ZP_12217604.1| glycosyltransferase, group 1 family protein [Acinetobacter
           baumannii OIFC143]
 gi|445450086|ref|ZP_21444421.1| glycosyltransferase, group 1 family protein [Acinetobacter
           baumannii WC-A-92]
 gi|395552404|gb|EJG18412.1| glycosyltransferase, group 1 family protein [Acinetobacter
           baumannii OIFC143]
 gi|444756069|gb|ELW80628.1| glycosyltransferase, group 1 family protein [Acinetobacter
           baumannii WC-A-92]
          Length = 348

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 32/128 (25%), Positives = 52/128 (40%), Gaps = 7/128 (5%)

Query: 229 TKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKL-VVR 287
           TK    IGR+ + K +  L+    + + +  G  +++YG+GED D +     K KL  V 
Sbjct: 172 TKKVLAIGRLTYQKNFHALIEAWKLLNDKTLGWHLEIYGDGEDKDSLNDLIIKYKLENVF 231

Query: 288 VYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHPSNDFFKQFPNCRT 347
           +         ++      +  S  +       EAL+ G  VV  N P        P+   
Sbjct: 232 IKANTSSLSEVYKQAAFLVMSSRYEGFGMVLVEALSFGLPVVSFNCPHG------PSEII 285

Query: 348 YDGRNGFV 355
            D  NGF+
Sbjct: 286 DDSVNGFL 293


>gi|195953073|ref|YP_002121363.1| group 1 glycosyl transferase [Hydrogenobaculum sp. Y04AAS1]
 gi|195932685|gb|ACG57385.1| glycosyl transferase group 1 [Hydrogenobaculum sp. Y04AAS1]
          Length = 378

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 54/236 (22%), Positives = 99/236 (41%), Gaps = 23/236 (9%)

Query: 109 ITEIIPDEEADIAVLEEPEHLTWFHHGKRWKAKFRFVVGIVHTNYLEYVKREKNDRLQAF 168
           I  II  E+ DI  +     L  F    + K  FR ++G  H+N    +++  ++ L+  
Sbjct: 77  IANIIQYEKPDIVEIGSLFLLPSFIKKLKEKLNFR-MIGFFHSN----LEKSLSNILKLN 131

Query: 169 LLEFVNSWLARVHCHK-------VIRLSAATQEYPNSI-VCNV----HGVNPKFLE--IG 214
             + V S + R + +K       +I  S   ++Y N++ + NV    HG++ +F E    
Sbjct: 132 KEDSVVSKMTRKYIYKAYSDMDLIIAPSFYVKDYLNTLGLYNVEVVYHGIDVRFFENQAP 191

Query: 215 EKKMEQQQNGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQ 274
           +K ++    G  A      Y+GR    K + ELL +    +     + + L G+G D   
Sbjct: 192 DKDLQNALKGKIALI----YVGRFSKDKNFLELLDIFKTVYSLDNRIHLILVGDGPDKKH 247

Query: 275 IQRAAKKLKLVVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVC 330
           I +   +   V      +     ++    +F++ S +D    +  EA A G  VV 
Sbjct: 248 INKILDRGFTVFDYIKDKKTLAGLYKASDIFVSASKSDTFGYSIIEAQACGLPVVA 303


>gi|257388796|ref|YP_003178569.1| group 1 glycosyl transferase [Halomicrobium mukohataei DSM 12286]
 gi|257171103|gb|ACV48862.1| glycosyl transferase group 1 [Halomicrobium mukohataei DSM 12286]
          Length = 392

 Score = 38.5 bits (88), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 68/154 (44%), Gaps = 28/154 (18%)

Query: 235 IGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPG-- 291
           IGR+   KGY+ LL  L    + +  + +D++G G +   ++  A +L +   V + G  
Sbjct: 220 IGRLERRKGYDVLLDALATVVEAVPNVHLDVFGEGPEEQALREQAARLGVADNVTFHGYV 279

Query: 292 -----RDHADPIFHDYKVFLNPSTTDVVCTATAEALAMG-KIVVCANHPSNDFFKQFPNC 345
                RDH        + F++PS ++       EA+A+G  +VV     +++  +     
Sbjct: 280 DQSVVRDH----LARARAFVHPSRSESFSLVRLEAMAVGCPVVVTDTSGAHEMVR----- 330

Query: 346 RTYDGRNGFVEATLKALAEEPAQP-TDAQTHQLS 378
              DG  GFV  T      E A+P  DA    LS
Sbjct: 331 ---DGNEGFVVPT------EAAEPIADALLELLS 355


>gi|145637637|ref|ZP_01793292.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae
           PittHH]
 gi|145269160|gb|EDK09108.1| lipopolysaccharide biosynthesis protein [Haemophilus influenzae
           PittHH]
          Length = 353

 Score = 38.5 bits (88), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 38/172 (22%), Positives = 74/172 (43%), Gaps = 18/172 (10%)

Query: 169 LLEFVNSWLARVHCHKVIRLSAATQEY------PNSIVCNVHGVNPKFLEIGEKKMEQQQ 222
           L+  +  +LA   C K++ L+ A +         N+I+      NP  L + + K+ + +
Sbjct: 125 LVRTLGKYLAVTTCDKIVTLTEAEKTLWQEKFKTNNIIT---IANPNTL-LPKNKLAKWE 180

Query: 223 NGNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKL 282
           N      K    +G +   KG++ LL    +  K+     + + G+GE+ + ++  AK L
Sbjct: 181 N------KTILSVGHLFSYKGFDYLLKAWQVLEKKYPDWNLKIVGSGEEEENLKNLAKAL 234

Query: 283 KL--VVRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCAN 332
            +   V   P  +     +    ++  PS T+ +     EA+A G  +V  N
Sbjct: 235 DIEDSVNFIPRTNDVAFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286


>gi|406659118|ref|ZP_11067256.1| accessory Sec system glycosylation protein GtfA [Streptococcus
           iniae 9117]
 gi|405577227|gb|EKB51375.1| accessory Sec system glycosylation protein GtfA [Streptococcus
           iniae 9117]
          Length = 505

 Score = 38.5 bits (88), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 41/168 (24%), Positives = 73/168 (43%), Gaps = 27/168 (16%)

Query: 237 RMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQ----IQRAAKKLKLVVRVYPGR 292
           R+   K  + L+  + + H +L GL  D+YG G +  +    IQ+ + +  + ++   G 
Sbjct: 327 RLAGEKHIDWLVSAVILAHDKLPGLTFDIYGEGGERSKLGELIQKGSAQNYISLK---GH 383

Query: 293 DHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVVCANHP-SNDFFKQFPNCRTYDGR 351
                I+ +Y+V+L  ST++       EA+  G  ++  + P  N  F +       DG+
Sbjct: 384 QDLRDIYQNYQVYLTASTSEGFGLTLMEAVGSGLPLIGLDVPYGNQTFVE-------DGK 436

Query: 352 NGFV---------EATLKALAE---EPAQPTDAQTHQLSWESATERFL 387
           NG++         +   KA AE   +  Q  D +  Q    S  E FL
Sbjct: 437 NGYLIPRQEPDDSKQMAKAFAEKIIQYFQDMDKEKAQEYSYSKAENFL 484


>gi|313147945|ref|ZP_07810138.1| glycosyltransferase family 4 [Bacteroides fragilis 3_1_12]
 gi|313136712|gb|EFR54072.1| glycosyltransferase family 4 [Bacteroides fragilis 3_1_12]
          Length = 397

 Score = 38.5 bits (88), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 51/112 (45%), Gaps = 2/112 (1%)

Query: 220 QQQNGNKAFTKGAYYIGRM-VWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRA 278
           Q  N N    K   ++GRM  + K  + LL +  + +K++   E+ L G G++   +Q  
Sbjct: 211 QCPNVNLNKKKQILFVGRMSYYDKRVDRLLDIWGMIYKKVPDWELILVGGGKELPLLQAK 270

Query: 279 AKKLKLV-VRVYPGRDHADPIFHDYKVFLNPSTTDVVCTATAEALAMGKIVV 329
           ++K+KL  +R     DH D  + D  V    S  +       EA A G + V
Sbjct: 271 SQKMKLQRIRFAGHSDHVDDFYRDASVLCLTSAFEGWPLCLTEAQANGVVPV 322


>gi|332299663|ref|YP_004441584.1| group 1 glycosyl transferase [Porphyromonas asaccharolytica DSM
           20707]
 gi|332176726|gb|AEE12416.1| glycosyl transferase group 1 [Porphyromonas asaccharolytica DSM
           20707]
          Length = 384

 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           Y+GR+   K    LL    +   E   L++ + G+G    ++++ A  LKL  RV + G+
Sbjct: 208 YVGRLDKVKNLPRLLEAFRLLLDEGLDLQLLMVGDGPMDAELRQQATMLKLEERVTFAGK 267

Query: 293 DHADPIFHDYKV---FLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
                ++  Y +    + PST ++     AEALA+G  V  +   S+
Sbjct: 268 QQGSDLYAHYGLGDALILPSTNELFGATVAEALALGMPVTVSEVASS 314


>gi|291534639|emb|CBL07751.1| Glycosyltransferase [Roseburia intestinalis M50/1]
          Length = 371

 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 38/163 (23%), Positives = 72/163 (44%), Gaps = 15/163 (9%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           Y+G ++  KG + LL  L    K    + + + G G++ D +Q+   +LK+  RV + G 
Sbjct: 199 YVGSLIQRKGLDLLLPAL---AKTPQNIRLWIVGEGQEKDLLQKQCTELKISDRVEFLGY 255

Query: 293 DHADPIFHDYKV---FLNPSTTDVVCTATAEALAMGKIVVCANHP--SNDFFKQFPNCRT 347
              +P+   Y+    F+ P+  D       EA+     ++ + +   + D      N   
Sbjct: 256 QEGEPLQKLYRTADAFVLPTREDCFGLVLLEAMCASLPIISSKYADGARDLVSDGENGYI 315

Query: 348 YDGR--NGFVEATLKALAEEPA----QPTDAQTHQLSWESATE 384
            D    +GF EA  KA A + A    + +  + H+ S+E  ++
Sbjct: 316 IDPEDVDGFAEAIQKAAAGDMASVMGKKSYEKAHEFSFEQVSK 358


>gi|313887450|ref|ZP_07821139.1| glycosyltransferase, group 1 family protein [Porphyromonas
           asaccharolytica PR426713P-I]
 gi|312923092|gb|EFR33912.1| glycosyltransferase, group 1 family protein [Porphyromonas
           asaccharolytica PR426713P-I]
          Length = 384

 Score = 38.5 bits (88), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 234 YIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLKLVVRV-YPGR 292
           Y+GR+   K    LL    +   E   L++ + G+G    ++++ A  LKL  RV + G+
Sbjct: 208 YVGRLDKVKNLPRLLEAFRLLLDEGLDLQLLMVGDGPMDAELRQQATMLKLEERVTFAGK 267

Query: 293 DHADPIFHDYKV---FLNPSTTDVVCTATAEALAMGKIVVCANHPSN 336
                ++  Y +    + PST ++     AEALA+G  V  +   S+
Sbjct: 268 QQGSDLYAHYGLGDALILPSTNELFGATVAEALALGMPVTVSEVASS 314


>gi|406662564|ref|ZP_11070657.1| Glycogen synthase [Cecembia lonarensis LW9]
 gi|405553501|gb|EKB48723.1| Glycogen synthase [Cecembia lonarensis LW9]
          Length = 391

 Score = 38.5 bits (88), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 28/48 (58%)

Query: 236 GRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNGEDFDQIQRAAKKLK 283
            R+V SKG+EE+ G   I  K+   L + + GNG D+   Q  AK+L+
Sbjct: 210 SRVVRSKGFEEMFGAFEILAKKYPNLYLIVVGNGYDYVHYQERAKQLE 257


>gi|15896828|ref|NP_350177.1| glycosyltransferase [Clostridium acetobutylicum ATCC 824]
 gi|337738801|ref|YP_004638248.1| glycosyltransferase [Clostridium acetobutylicum DSM 1731]
 gi|384460313|ref|YP_005672733.1| Glycosyltransferase [Clostridium acetobutylicum EA 2018]
 gi|15026692|gb|AAK81517.1|AE007856_1 Glycosyltransferase [Clostridium acetobutylicum ATCC 824]
 gi|325511002|gb|ADZ22638.1| Glycosyltransferase [Clostridium acetobutylicum EA 2018]
 gi|336291812|gb|AEI32946.1| glycosyltransferase [Clostridium acetobutylicum DSM 1731]
          Length = 398

 Score = 38.5 bits (88), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 81/185 (43%), Gaps = 12/185 (6%)

Query: 212 EIGEKKMEQQQN--GNKAFTKGAYYIGRMVWSKGYEELLGLLNIYHKELAGLEMDLYGNG 269
           E+  K+ E+  N  G K   K   Y+GR+   K  +E++ L      EL  +++ + G G
Sbjct: 186 ELSSKEREKILNHYGWKTKDKILVYVGRVAEEKNIDEIINLFKKGLNELKDIKLLIVGGG 245

Query: 270 EDFDQIQRAAKKLKL--VVRVYPGRDHADPIFHDYKV---FLNPSTTDVVCTATAEALAM 324
               Q++    +  +  +V+ + G   +D ++  YK+   F+  S ++       EALA 
Sbjct: 246 PYLSQLKELVSRYGIEDIVK-FTGMVDSDQVYKYYKMGIAFVTASQSETQGLTYIEALAS 304

Query: 325 GKIVVCANHPS-NDFFKQFPNCRTYDGRNGFVEA--TLKALAEEPAQPTDAQTHQLSWES 381
           G  V+C   P   +          Y   + FV+A  +LK+  E   +   +   Q S E 
Sbjct: 305 GCPVICKWDPCIKNLIVNGVTGFAYTDTSEFVKAVESLKS-NEILRRKIISNAKQKSCEY 363

Query: 382 ATERF 386
           +TE F
Sbjct: 364 STENF 368


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.136    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,523,183,216
Number of Sequences: 23463169
Number of extensions: 365632159
Number of successful extensions: 814959
Number of sequences better than 100.0: 456
Number of HSP's better than 100.0 without gapping: 155
Number of HSP's successfully gapped in prelim test: 301
Number of HSP's that attempted gapping in prelim test: 814373
Number of HSP's gapped (non-prelim): 502
length of query: 518
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 371
effective length of database: 8,910,109,524
effective search space: 3305650633404
effective search space used: 3305650633404
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)