RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3650
         (435 letters)



>gnl|CDD|215385 PLN02726, PLN02726, dolichyl-phosphate beta-D-mannosyltransferase.
          Length = 243

 Score =  394 bits (1014), Expect = e-138
 Identities = 154/238 (64%), Positives = 192/238 (80%), Gaps = 1/238 (0%)

Query: 198 VKNKYTVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIY 257
              KY++++PTYNE+ N+ +IVYLI K + +    +EIIV+DDGSPDGT D  KQLQ +Y
Sbjct: 7   GAMKYSIIVPTYNERLNIALIVYLIFKALQDVK-DFEIIVVDDGSPDGTQDVVKQLQKVY 65

Query: 258 GSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDV 317
           G ++I+L+PR  KLGLGTAY+HGLK+A+G+F++IMDADLSHHPK++P  IK Q++   D+
Sbjct: 66  GEDRILLRPRPGKLGLGTAYIHGLKHASGDFVVIMDADLSHHPKYLPSFIKKQRETGADI 125

Query: 318 VTGTRYVGTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVSSC 377
           VTGTRYV  GGV+GWD +RKL SRGAN L Q LL PGVSDLTGSFRLYK+  LE+LVSS 
Sbjct: 126 VTGTRYVKGGGVHGWDLRRKLTSRGANVLAQTLLWPGVSDLTGSFRLYKRSALEDLVSSV 185

Query: 378 VSKGYVFQMEMVIRARQYNYTIGEVPISFVDRVYGESKLGGTEIFQFAKALLYLFATT 435
           VSKGYVFQME+++RA +  Y I EVPI+FVDRVYGESKLGG+EI Q+ K LLYL  TT
Sbjct: 186 VSKGYVFQMEIIVRASRKGYRIEEVPITFVDRVYGESKLGGSEIVQYLKGLLYLLLTT 243



 Score =  317 bits (814), Expect = e-107
 Identities = 122/178 (68%), Positives = 149/178 (83%)

Query: 8   YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 67
           +EIIV+DDGSPDGT D  KQLQ +YG ++I+L+PR  KLGLGTAY+HGLK+A+G+F++IM
Sbjct: 41  FEIIVVDDGSPDGTQDVVKQLQKVYGEDRILLRPRPGKLGLGTAYIHGLKHASGDFVVIM 100

Query: 68  DADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLTQLLLR 127
           DADLSHHPK++P  IK Q++   D+VTGTRYV  GGV+GWD +RKL SRGAN L Q LL 
Sbjct: 101 DADLSHHPKYLPSFIKKQRETGADIVTGTRYVKGGGVHGWDLRRKLTSRGANVLAQTLLW 160

Query: 128 PGVSDLTGSFRLYKKQVLENLVSSCVSKGYVFQMEMVIRARQYNYTIGEVPISFVDRV 185
           PGVSDLTGSFRLYK+  LE+LVSS VSKGYVFQME+++RA +  Y I EVPI+FVDRV
Sbjct: 161 PGVSDLTGSFRLYKRSALEDLVSSVVSKGYVFQMEIIVRASRKGYRIEEVPITFVDRV 218


>gnl|CDD|133062 cd06442, DPM1_like, DPM1_like represents putative enzymes similar
           to eukaryotic DPM1.  Proteins similar to eukaryotic
           DPM1, including enzymes from bacteria and archaea; DPM1
           is the catalytic subunit of eukaryotic
           dolichol-phosphate mannose (DPM) synthase. DPM synthase
           is required for synthesis of the
           glycosylphosphatidylinositol (GPI) anchor, N-glycan
           precursor, protein O-mannose, and C-mannose. In higher
           eukaryotes,the enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. In lower eukaryotes, such
           as Saccharomyces cerevisiae and Trypanosoma brucei, DPM
           synthase consists of a single component (Dpm1p and
           TbDpm1, respectively) that possesses one predicted
           transmembrane region near the C terminus for anchoring
           to the ER membrane. In contrast, the Dpm1 homologues of
           higher eukaryotes, namely fission yeast, fungi, and
           animals, have no transmembrane region, suggesting the
           existence of adapter molecules for membrane anchoring.
           This family also includes bacteria and archaea DPM1_like
           enzymes. However, the enzyme structure and mechanism of
           function are not well understood. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 224

 Score =  360 bits (926), Expect = e-125
 Identities = 117/228 (51%), Positives = 160/228 (70%), Gaps = 4/228 (1%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIV 263
           +++PTYNE+EN+P ++  +   +      YEIIV+DD SPDGT +  ++L   Y   ++ 
Sbjct: 1   IIIPTYNERENIPELIERLDAALK--GIDYEIIVVDDNSPDGTAEIVRELAKEYP--RVR 56

Query: 264 LKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRY 323
           L  R  K GLG+AY+ G K A G+ I++MDADLSH P++IPE+++ Q +   D+V G+RY
Sbjct: 57  LIVRPGKRGLGSAYIEGFKAARGDVIVVMDADLSHPPEYIPELLEAQLEGGADLVIGSRY 116

Query: 324 VGTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVSSCVSKGYV 383
           V  GGV GW  KRKL+SRGAN L +LLL   VSD T  FR Y+++VLE L+ S VSKGY 
Sbjct: 117 VEGGGVEGWGLKRKLISRGANLLARLLLGRKVSDPTSGFRAYRREVLEKLIDSLVSKGYK 176

Query: 384 FQMEMVIRARQYNYTIGEVPISFVDRVYGESKLGGTEIFQFAKALLYL 431
           FQ+E+++RAR+  Y I EVPI+FVDR +GESKLGG EI ++ K LL L
Sbjct: 177 FQLELLVRARRLGYRIVEVPITFVDREHGESKLGGKEIVEYLKGLLRL 224



 Score =  297 bits (762), Expect = e-100
 Identities = 96/181 (53%), Positives = 127/181 (70%), Gaps = 2/181 (1%)

Query: 5   NYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFI 64
              YEIIV+DD SPDGT +  ++L   Y   ++ L  R  K GLG+AY+ G K A G+ I
Sbjct: 25  GIDYEIIVVDDNSPDGTAEIVRELAKEYP--RVRLIVRPGKRGLGSAYIEGFKAARGDVI 82

Query: 65  IIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLTQL 124
           ++MDADLSH P++IPE+++ Q +   D+V G+RYV  GGV GW  KRKL+SRGAN L +L
Sbjct: 83  VVMDADLSHPPEYIPELLEAQLEGGADLVIGSRYVEGGGVEGWGLKRKLISRGANLLARL 142

Query: 125 LLRPGVSDLTGSFRLYKKQVLENLVSSCVSKGYVFQMEMVIRARQYNYTIGEVPISFVDR 184
           LL   VSD T  FR Y+++VLE L+ S VSKGY FQ+E+++RAR+  Y I EVPI+FVDR
Sbjct: 143 LLGRKVSDPTSGFRAYRREVLEKLIDSLVSKGYKFQLELLVRARRLGYRIVEVPITFVDR 202

Query: 185 V 185
            
Sbjct: 203 E 203


>gnl|CDD|133022 cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member
           of the Glycosyltransferase 2 superfamily.  DPM1 is the
           catalytic subunit of eukaryotic dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. In higher eukaryotes,the enzyme has three
           subunits, DPM1, DPM2 and DPM3. DPM is synthesized from
           dolichol phosphate and GDP-Man on the cytosolic surface
           of the ER membrane by DPM synthase and then is flipped
           onto the luminal side and used as a donor substrate. In
           lower eukaryotes, such as Saccharomyces cerevisiae and
           Trypanosoma brucei, DPM synthase consists of a single
           component (Dpm1p and TbDpm1, respectively) that
           possesses one predicted transmembrane region near the C
           terminus for anchoring to the ER membrane. In contrast,
           the Dpm1 homologues of higher eukaryotes, namely fission
           yeast, fungi, and animals, have no transmembrane region,
           suggesting the existence of adapter molecules for
           membrane anchoring. This family also includes bacteria
           and archaea DPM1_like enzymes. However, the enzyme
           structure and mechanism of function are not well
           understood. The UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate. This protein family belongs to
           Glycosyltransferase 2 superfamily.
          Length = 185

 Score =  216 bits (552), Expect = 8e-69
 Identities = 76/189 (40%), Positives = 117/189 (61%), Gaps = 4/189 (2%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIV 263
           V++P YNE+EN+P +V  +   ++EG Y YEIIV+DDGS DGT + A++L +     +++
Sbjct: 1   VVIPAYNEEENIPELVERLLAVLEEG-YDYEIIVVDDGSTDGTAEIARELAARVPRVRVI 59

Query: 264 LKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRY 323
              R    G G A   G K A G+ ++ MDADL H P+ IP++++   +   DVV G+R+
Sbjct: 60  RLSRNF--GKGAAVRAGFKAARGDIVVTMDADLQHPPEDIPKLLEKLLEGGADVVIGSRF 117

Query: 324 VGTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVSSCVSKGYV 383
           V  GG  G    R+L SR  N+L +LLL   +SD    FRL++++VLE L+S   S G+ 
Sbjct: 118 VR-GGGAGMPLLRRLGSRLFNFLIRLLLGVRISDTQSGFRLFRREVLEALLSLLESNGFE 176

Query: 384 FQMEMVIRA 392
           F +E+++ A
Sbjct: 177 FGLELLVGA 185



 Score =  190 bits (486), Expect = 5e-59
 Identities = 66/165 (40%), Positives = 99/165 (60%), Gaps = 3/165 (1%)

Query: 3   EGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGN 62
           E  Y YEIIV+DDGS DGT + A++L +     +++   R    G G A   G K A G+
Sbjct: 24  EEGYDYEIIVVDDGSTDGTAEIARELAARVPRVRVIRLSRNF--GKGAAVRAGFKAARGD 81

Query: 63  FIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLT 122
            ++ MDADL H P+ IP++++   +   DVV G+R+V  GG  G    R+L SR  N+L 
Sbjct: 82  IVVTMDADLQHPPEDIPKLLEKLLEGGADVVIGSRFVR-GGGAGMPLLRRLGSRLFNFLI 140

Query: 123 QLLLRPGVSDLTGSFRLYKKQVLENLVSSCVSKGYVFQMEMVIRA 167
           +LLL   +SD    FRL++++VLE L+S   S G+ F +E+++ A
Sbjct: 141 RLLLGVRISDTQSGFRLFRREVLEALLSLLESNGFEFGLELLVGA 185


>gnl|CDD|133031 cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked
           glycosylation.  UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate.
          Length = 211

 Score =  140 bits (355), Expect = 2e-39
 Identities = 67/214 (31%), Positives = 114/214 (53%), Gaps = 5/214 (2%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDE-GNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKI 262
           V++P YNE++ LP  +    +Y++E  ++ YEIIV+DDGS DGT + A++L     +   
Sbjct: 1   VVIPAYNEEKRLPPTLEEAVEYLEERPSFSYEIIVVDDGSKDGTAEVARKLARKNPALIR 60

Query: 263 VLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTR 322
           VL    K  G G A   G+  A G++I+  DADL+   + + ++ +  +    D+  G+R
Sbjct: 61  VLT-LPKNRGKGGAVRAGMLAARGDYILFADADLATPFEELEKLEEALKTSGYDIAIGSR 119

Query: 323 YV-GTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVSSCVSKG 381
               +  V    + R L+ RG N+L +LLL  G+ D    F+L+ +     L      + 
Sbjct: 120 AHLASAAVVKRSWLRNLLGRGFNFLVRLLLGLGIKDTQCGFKLFTRDAARRLFPRLHLER 179

Query: 382 YVFQMEMVIRARQYNYTIGEVPISFVDRVYGESK 415
           + F +E+++ AR+  Y I EVP+ +V+     SK
Sbjct: 180 WAFDVELLVLARRLGYPIEEVPVRWVEI--PGSK 211



 Score =  125 bits (316), Expect = 8e-34
 Identities = 56/180 (31%), Positives = 95/180 (52%), Gaps = 2/180 (1%)

Query: 5   NYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFI 64
           ++ YEIIV+DDGS DGT + A++L     +   VL    K  G G A   G+  A G++I
Sbjct: 28  SFSYEIIVVDDGSKDGTAEVARKLARKNPALIRVLT-LPKNRGKGGAVRAGMLAARGDYI 86

Query: 65  IIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYV-GTGGVYGWDFKRKLVSRGANYLTQ 123
           +  DADL+   + + ++ +  +    D+  G+R    +  V    + R L+ RG N+L +
Sbjct: 87  LFADADLATPFEELEKLEEALKTSGYDIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVR 146

Query: 124 LLLRPGVSDLTGSFRLYKKQVLENLVSSCVSKGYVFQMEMVIRARQYNYTIGEVPISFVD 183
           LLL  G+ D    F+L+ +     L      + + F +E+++ AR+  Y I EVP+ +V+
Sbjct: 147 LLLGLGIKDTQCGFKLFTRDAARRLFPRLHLERWAFDVELLVLARRLGYPIEEVPVRWVE 206


>gnl|CDD|133030 cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to
           eukaryotic DPM1.  A family of  bacterial enzymes related
           to eukaryotic DPM1; Although the mechanism of eukaryotic
           enzyme is well studied, the mechanism of the  bacterial
           enzymes is not well understood. The eukaryotic DPM1 is
           the catalytic subunit of eukaryotic Dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. The enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 181

 Score =  126 bits (318), Expect = 2e-34
 Identities = 57/176 (32%), Positives = 91/176 (51%), Gaps = 16/176 (9%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIV 263
           +++P YNE+ENLP +   +   ++   Y YEII +DDGS D TL+  ++L +     K++
Sbjct: 1   IVVPVYNEEENLPELYERLKAVLESLGYDYEIIFVDDGSTDRTLEILRELAARDPRVKVI 60

Query: 264 LKPRKKKL----GLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVT 319
                 +L    G   A + GL +A G+ +I MDADL   P+ IPEM+  + +E  DVV 
Sbjct: 61  ------RLSRNFGQQAALLAGLDHARGDAVITMDADLQDPPELIPEMLA-KWEEGYDVVY 113

Query: 320 GTRYVGTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVS 375
           G R           + ++L S+    L   L    + D  G FRL  ++V++ L+ 
Sbjct: 114 GVRKNRKES-----WLKRLTSKLFYRLINKLSGVDIPDNGGDFRLMDRKVVDALLL 164



 Score =  107 bits (269), Expect = 2e-27
 Identities = 49/154 (31%), Positives = 77/154 (50%), Gaps = 16/154 (10%)

Query: 1   MDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKL----GLGTAYMHGL 56
           ++   Y YEII +DDGS D TL+  ++L +     K++      +L    G   A + GL
Sbjct: 23  LESLGYDYEIIFVDDGSTDRTLEILRELAARDPRVKVI------RLSRNFGQQAALLAGL 76

Query: 57  KYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSR 116
            +A G+ +I MDADL   P+ IPEM+  + +E  DVV G R           + ++L S+
Sbjct: 77  DHARGDAVITMDADLQDPPELIPEMLA-KWEEGYDVVYGVRKNRKES-----WLKRLTSK 130

Query: 117 GANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVS 150
               L   L    + D  G FRL  ++V++ L+ 
Sbjct: 131 LFYRLINKLSGVDIPDNGGDFRLMDRKVVDALLL 164


>gnl|CDD|215980 pfam00535, Glycos_transf_2, Glycosyl transferase family 2.  Diverse
           family, transferring sugar from UDP-glucose,
           UDP-N-acetyl- galactosamine, GDP-mannose or
           CDP-abequose, to a range of substrates including
           cellulose, dolichol phosphate and teichoic acids.
          Length = 168

 Score =  121 bits (306), Expect = 6e-33
 Identities = 53/172 (30%), Positives = 91/172 (52%), Gaps = 6/172 (3%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKI 262
           +V++PTYNE++ L      +   +++    +EIIV+DDGS DGT++ A++        ++
Sbjct: 1   SVIIPTYNEEKYLE---ETLESLLNQTYKNFEIIVVDDGSTDGTVEIAEEYAKNDP--RV 55

Query: 263 VLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTR 322
            +   ++ LG   A   GLK ATG++I  +DAD    P ++ ++++L ++   D+V G+R
Sbjct: 56  RVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVAPDWLEKLVELLEKNGADIVIGSR 115

Query: 323 YVGTGGVYGWDF-KRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENL 373
            V  G    +    R  +      L    L   V  L GS  LY+++VLE L
Sbjct: 116 VVINGETRLYGRALRFELLLLLGKLGARSLGLKVLFLIGSNALYRREVLEEL 167



 Score =  112 bits (282), Expect = 1e-29
 Identities = 46/144 (31%), Positives = 75/144 (52%), Gaps = 3/144 (2%)

Query: 6   YPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFII 65
             +EIIV+DDGS DGT++ A++        ++ +   ++ LG   A   GLK ATG++I 
Sbjct: 26  KNFEIIVVDDGSTDGTVEIAEEYAKNDP--RVRVIRLEENLGKAAARNAGLKLATGDYIA 83

Query: 66  IMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDF-KRKLVSRGANYLTQL 124
            +DAD    P ++ ++++L ++   D+V G+R V  G    +    R  +      L   
Sbjct: 84  FLDADDEVAPDWLEKLVELLEKNGADIVIGSRVVINGETRLYGRALRFELLLLLGKLGAR 143

Query: 125 LLRPGVSDLTGSFRLYKKQVLENL 148
            L   V  L GS  LY+++VLE L
Sbjct: 144 SLGLKVLFLIGSNALYRREVLEEL 167


>gnl|CDD|223539 COG0463, WcaA, Glycosyltransferases involved in cell wall
           biogenesis [Cell envelope biogenesis, outer membrane].
          Length = 291

 Score = 94.0 bits (232), Expect = 1e-21
 Identities = 55/230 (23%), Positives = 99/230 (43%), Gaps = 21/230 (9%)

Query: 199 KNKYTVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYG 258
             K +V++PTYNE+E LP  +  +     +    +EIIV+DDGS DGT + A +  +   
Sbjct: 2   MPKVSVVIPTYNEEEYLPEALESLLNQTYK---DFEIIVVDDGSTDGTTEIAIEYGAKDV 58

Query: 259 SEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVV 318
             +++    ++  GLG A   GL+YA G++I+ +DAD  H P+ IP +         D  
Sbjct: 59  --RVIRLINERNGGLGAARNAGLEYARGDYIVFLDADDQHPPELIPLV-----AAGGDGD 111

Query: 319 TGTRYVGTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVSSCV 378
              R      ++        + +    L   LL   + D  G  RL  +  ++ L +   
Sbjct: 112 YIARLDDRDDIWLPRKLLSKLVK---LLGNRLLGVLIPDGFGDLRLLVRDAVDGLRA--F 166

Query: 379 SKGYVFQMEMVIRARQYNYTIGEVPISFVDRVYGESKLGGTEIFQFAKAL 428
            +G    + +++R          +    ++ +     LG ++     K L
Sbjct: 167 LEGRSRFLRLLLRKLVL------IRREVLEYLLLYRLLGASKRVLLGKLL 210



 Score = 75.9 bits (185), Expect = 2e-15
 Identities = 39/144 (27%), Positives = 65/144 (45%), Gaps = 10/144 (6%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIII 66
            +EIIV+DDGS DGT + A +  +     +++    ++  GLG A   GL+YA G++I+ 
Sbjct: 32  DFEIIVVDDGSTDGTTEIAIEYGAKDV--RVIRLINERNGGLGAARNAGLEYARGDYIVF 89

Query: 67  MDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLTQLLL 126
           +DAD  H P+ IP +         D     R      ++        + +    L   LL
Sbjct: 90  LDADDQHPPELIPLV-----AAGGDGDYIARLDDRDDIWLPRKLLSKLVK---LLGNRLL 141

Query: 127 RPGVSDLTGSFRLYKKQVLENLVS 150
              + D  G  RL  +  ++ L +
Sbjct: 142 GVLIPDGFGDLRLLVRDAVDGLRA 165


>gnl|CDD|132997 cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A)
           includes diverse families of glycosyl transferases with
           a common GT-A type structural fold.
           Glycosyltransferases (GTs) are enzymes that synthesize
           oligosaccharides, polysaccharides, and glycoconjugates
           by transferring the sugar moiety from an activated
           nucleotide-sugar donor to an acceptor molecule, which
           may be a growing oligosaccharide, a lipid, or a protein.
            Based on the stereochemistry of the donor and acceptor
           molecules, GTs are classified as either retaining or
           inverting enzymes. To date, all GT structures adopt one
           of two possible folds, termed GT-A fold and GT-B fold.
           This hierarchy includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. The majority of the proteins in this
           superfamily are Glycosyltransferase family 2 (GT-2)
           proteins. But it also includes families GT-43, GT-6,
           GT-8, GT13 and GT-7; which are evolutionarily related to
           GT-2 and share structure similarities.
          Length = 156

 Score = 87.6 bits (217), Expect = 2e-20
 Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 6/119 (5%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIV 263
           V++P YNE+  L   +  +          +E+IV+DDGS DGTL+  ++          V
Sbjct: 1   VIIPAYNEEPYLERCLESLLAQ---TYPNFEVIVVDDGSTDGTLEILEEYAKKDPRVIRV 57

Query: 264 LKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIK-LQQQENLDVVTGT 321
           +   ++  GL  A   GLK A G +I+ +DAD    P ++  ++  L      D V G 
Sbjct: 58  IN--EENQGLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAVGGP 114



 Score = 76.8 bits (189), Expect = 1e-16
 Identities = 29/91 (31%), Positives = 45/91 (49%), Gaps = 3/91 (3%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIII 66
            +E+IV+DDGS DGTL+  ++          V+   ++  GL  A   GLK A G +I+ 
Sbjct: 26  NFEVIVVDDGSTDGTLEILEEYAKKDPRVIRVIN--EENQGLAAARNAGLKAARGEYILF 83

Query: 67  MDADLSHHPKFIPEMIK-LQQQENLDVVTGT 96
           +DAD    P ++  ++  L      D V G 
Sbjct: 84  LDADDLLLPDWLERLVAELLADPEADAVGGP 114


>gnl|CDD|240336 PTZ00260, PTZ00260, dolichyl-phosphate beta-glucosyltransferase;
           Provisional.
          Length = 333

 Score = 84.4 bits (209), Expect = 4e-18
 Identities = 59/219 (26%), Positives = 112/219 (51%), Gaps = 14/219 (6%)

Query: 203 TVLLPTYNEKENLPI-----IVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQL--QS 255
           ++++P YNE++ LP      I YL ++   +  + YEII+++DGS D TL  AK    Q+
Sbjct: 73  SIVIPAYNEEDRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLKVAKDFWRQN 132

Query: 256 IYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDAD----LSHHPKFIPEMIKLQQ 311
           I  +  I L    +  G G A   G+  + G +I+++DAD    +    K    M+K++Q
Sbjct: 133 INPNIDIRLLSLLRNKGKGGAVRIGMLASRGKYILMVDADGATDIDDFDKLEDIMLKIEQ 192

Query: 312 QENLDVVTGTR--YVGTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQV 369
              L +V G+R   V +  V    + R ++  G +++   +    + D    F+L+ ++ 
Sbjct: 193 NG-LGIVFGSRNHLVDSDVVAKRKWYRNILMYGFHFIVNTICGTNLKDTQCGFKLFTRET 251

Query: 370 LENLVSSCVSKGYVFQMEMVIRARQYNYTIGEVPISFVD 408
              +  S   + + F +E+V+ A++ N  I EVP+++ +
Sbjct: 252 ARIIFPSLHLERWAFDIEIVMIAQKLNLPIAEVPVNWTE 290



 Score = 72.9 bits (179), Expect = 3e-14
 Identities = 50/189 (26%), Positives = 95/189 (50%), Gaps = 9/189 (4%)

Query: 3   EGNYPYEIIVIDDGSPDGTLDAAKQL--QSIYGSEKIVLKPRKKKLGLGTAYMHGLKYAT 60
           +  + YEII+++DGS D TL  AK    Q+I  +  I L    +  G G A   G+  + 
Sbjct: 103 DPKFKYEIIIVNDGSKDKTLKVAKDFWRQNINPNIDIRLLSLLRNKGKGGAVRIGMLASR 162

Query: 61  GNFIIIMDAD----LSHHPKFIPEMIKLQQQENLDVVTGTR--YVGTGGVYGWDFKRKLV 114
           G +I+++DAD    +    K    M+K++Q   L +V G+R   V +  V    + R ++
Sbjct: 163 GKYILMVDADGATDIDDFDKLEDIMLKIEQNG-LGIVFGSRNHLVDSDVVAKRKWYRNIL 221

Query: 115 SRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVSSCVSKGYVFQMEMVIRARQYNYTI 174
             G +++   +    + D    F+L+ ++    +  S   + + F +E+V+ A++ N  I
Sbjct: 222 MYGFHFIVNTICGTNLKDTQCGFKLFTRETARIIFPSLHLERWAFDIEIVMIAQKLNLPI 281

Query: 175 GEVPISFVD 183
            EVP+++ +
Sbjct: 282 AEVPVNWTE 290


>gnl|CDD|182669 PRK10714, PRK10714, undecaprenyl phosphate
           4-deoxy-4-formamido-L-arabinose transferase;
           Provisional.
          Length = 325

 Score = 81.7 bits (202), Expect = 4e-17
 Identities = 61/217 (28%), Positives = 103/217 (47%), Gaps = 14/217 (6%)

Query: 200 NKYTVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGS 259
            K +V++P YNE+E+LP ++   T   +     YEI++IDDGS D + +   +      S
Sbjct: 6   KKVSVVIPVYNEQESLPELIRRTTAACESLGKEYEILLIDDGSSDNSAEMLVEAAQAPDS 65

Query: 260 EKI-VLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVV 318
             + +L  R    G  +A M G  + TG+ II +DADL + P+ IP ++  +  E  DVV
Sbjct: 66  HIVAILLNRN--YGQHSAIMAGFSHVTGDLIITLDADLQNPPEEIPRLVA-KADEGYDVV 122

Query: 319 TGTRYVGTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENLVSSCV 378
              R       +     RK  S+  N L Q      + D     R Y++ +++ ++  C 
Sbjct: 123 GTVRQNRQDSWF-----RKTASKMINRLIQRTTGKAMGDYGCMLRAYRRHIVDAML-HCH 176

Query: 379 SKGYVFQMEMVIRARQYNYTIGEVPISFVDRVYGESK 415
            +     +     AR+    I E+P+   +R +G+SK
Sbjct: 177 ERSTFIPILANTFARR---AI-EIPVHHAEREFGDSK 209



 Score = 59.4 bits (144), Expect = 1e-09
 Identities = 41/141 (29%), Positives = 65/141 (46%), Gaps = 9/141 (6%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKI-VLKPRKKKLGLGTAYMHGLKYATGNFII 65
            YEI++IDDGS D + +   +      S  + +L  R    G  +A M G  + TG+ II
Sbjct: 38  EYEILLIDDGSSDNSAEMLVEAAQAPDSHIVAILLNRN--YGQHSAIMAGFSHVTGDLII 95

Query: 66  IMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLTQLL 125
            +DADL + P+ IP ++  +  E  DVV   R       +     RK  S+  N L Q  
Sbjct: 96  TLDADLQNPPEEIPRLVA-KADEGYDVVGTVRQNRQDSWF-----RKTASKMINRLIQRT 149

Query: 126 LRPGVSDLTGSFRLYKKQVLE 146
               + D     R Y++ +++
Sbjct: 150 TGKAMGDYGCMLRAYRRHIVD 170


>gnl|CDD|133045 cd06423, CESA_like, CESA_like is  the cellulose synthase
           superfamily.  The cellulose synthase (CESA) superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains. The
           members include cellulose synthase catalytic subunit,
           chitin synthase, glucan biosynthesis protein and other
           families of CESA-like proteins. Cellulose synthase
           catalyzes the polymerization reaction of cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues in  plants, most algae, some bacteria
           and fungi, and even some animals. In bacteria, algae and
           lower eukaryotes, there is a second unrelated type of
           cellulose synthase (Type II), which produces acylated
           cellulose, a derivative of cellulose. Chitin synthase
           catalyzes the incorporation of GlcNAc from substrate
           UDP-GlcNAc into chitin, which is a linear homopolymer of
           beta-(1,4)-linked GlcNAc residues and Glucan
           Biosynthesis protein catalyzes the elongation of
           beta-1,2 polyglucose chains of Glucan.
          Length = 180

 Score = 71.1 bits (175), Expect = 2e-14
 Identities = 43/183 (23%), Positives = 82/183 (44%), Gaps = 27/183 (14%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSEKI 262
           +++P YNE+  +   +    + +   +YP  E+IV+DDGS D TL+  ++L ++Y    +
Sbjct: 1   IIVPAYNEEAVIERTI----ESLLALDYPKLEVIVVDDGSTDDTLEILEELAALYIRRVL 56

Query: 263 VL--KPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQ-ENLDVVT 319
           V+  K    K G   A   GL++A G+ ++++DAD    P  +  ++        +  V 
Sbjct: 57  VVRDKENGGKAG---ALNAGLRHAKGDIVVVLDADTILEPDALKRLVVPFFADPKVGAVQ 113

Query: 320 GTRYVGTGGVYGWDFKRKLVSRGAN---YLTQLLLRPGVSDL------TGSFRLYKKQVL 370
           G   V        +    L++R           L R   S L      +G+F  ++++ L
Sbjct: 114 GRVRVR-------NGSENLLTRLQAIEYLSIFRLGRRAQSALGGVLVLSGAFGAFRREAL 166

Query: 371 ENL 373
             +
Sbjct: 167 REV 169



 Score = 62.6 bits (153), Expect = 1e-11
 Identities = 39/157 (24%), Positives = 70/157 (44%), Gaps = 23/157 (14%)

Query: 5   NYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVL--KPRKKKLGLGTAYMHGLKYATG 61
           +YP  E+IV+DDGS D TL+  ++L ++Y    +V+  K    K G   A   GL++A G
Sbjct: 23  DYPKLEVIVVDDGSTDDTLEILEELAALYIRRVLVVRDKENGGKAG---ALNAGLRHAKG 79

Query: 62  NFIIIMDADLSHHPKFIPEMIKLQQQ-ENLDVVTGTRYVGTGGVYGWDFKRKLVSRGAN- 119
           + ++++DAD    P  +  ++        +  V G   V        +    L++R    
Sbjct: 80  DIVVVLDADTILEPDALKRLVVPFFADPKVGAVQGRVRVR-------NGSENLLTRLQAI 132

Query: 120 --YLTQLLLRPGVSDL------TGSFRLYKKQVLENL 148
                  L R   S L      +G+F  ++++ L  +
Sbjct: 133 EYLSIFRLGRRAQSALGGVLVLSGAFGAFRREALREV 169


>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
           wall biogenesis [Cell envelope biogenesis, outer
           membrane].
          Length = 439

 Score = 68.8 bits (168), Expect = 1e-12
 Identities = 49/182 (26%), Positives = 81/182 (44%), Gaps = 16/182 (8%)

Query: 201 KYTVLLPTYNE-KENLPIIVYLITKYMDEGNYP-YEIIVIDDGSPDGTLDAAKQLQSIYG 258
           K +V++P YNE  E L   +  +       +YP YE+IV+DDGS D T +  ++L + YG
Sbjct: 55  KVSVIIPAYNEEPEVLEETLESLLSQ----DYPRYEVIVVDDGSTDETYEILEELGAEYG 110

Query: 259 SEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVV 318
               V+ P KK  G   A  +GLK A G+ ++I+DAD    P  + E++   +   +  V
Sbjct: 111 PNFRVIYPEKKNGGKAGALNNGLKRAKGDVVVILDADTVPEPDALRELVSPFEDPPVGAV 170

Query: 319 TGTRYVGTGGVY-GWDFKRKLVSRGANYLTQLLLRPGVSD------LTGSFRLYKKQVLE 371
            GT  +           + + +                S       L+GS   +++  LE
Sbjct: 171 VGTPRIRNRPDPSNLLGRIQAIE---YLSAFYFRLRAASKGGLISFLSGSSSAFRRSALE 227

Query: 372 NL 373
            +
Sbjct: 228 EV 229



 Score = 59.2 bits (143), Expect = 2e-09
 Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 11/152 (7%)

Query: 5   NYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNF 63
           +YP YE+IV+DDGS D T +  ++L + YG    V+ P KK  G   A  +GLK A G+ 
Sbjct: 81  DYPRYEVIVVDDGSTDETYEILEELGAEYGPNFRVIYPEKKNGGKAGALNNGLKRAKGDV 140

Query: 64  IIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVY-GWDFKRKLVSRGANYLT 122
           ++I+DAD    P  + E++   +   +  V GT  +           + + +        
Sbjct: 141 VVILDADTVPEPDALRELVSPFEDPPVGAVVGTPRIRNRPDPSNLLGRIQAIE---YLSA 197

Query: 123 QLLLRPGVSD------LTGSFRLYKKQVLENL 148
                   S       L+GS   +++  LE +
Sbjct: 198 FYFRLRAASKGGLISFLSGSSSAFRRSALEEV 229


>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
           biosynthesis of succinoglycan.  Succinoglycan
           Biosynthesis Protein ExoA catalyzes the formation of a
           beta-1,3 linkage of the second sugar (glucose) of the
           succinoglycan with the galactose on the lipid carrie.
           Succinoglycan is an acidic exopolysaccharide that is
           important for invasion of the nodules. Succinoglycan is
           a high-molecular-weight polymer composed of repeating
           octasaccharide units. These units are synthesized on
           membrane-bound isoprenoid lipid carriers, beginning with
           galactose followed by seven glucose molecules, and
           modified by the addition of acetate, succinate, and
           pyruvate. ExoA is a membrane protein with a
           transmembrance domain at c-terminus.
          Length = 249

 Score = 64.9 bits (159), Expect = 6e-12
 Identities = 43/183 (23%), Positives = 79/183 (43%), Gaps = 26/183 (14%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYP---YEIIVIDDGSPDGTLDAAKQLQSIYGS 259
           ++++P  NE++    I  L+   +   +YP    EIIV+D GS DGT +  ++  +    
Sbjct: 3   SIIIPVRNEEKY---IEELLESLL-NQSYPKDLIEIIVVDGGSTDGTREIVQEYAAKD-- 56

Query: 260 EKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPK-FIPEMIKLQQQENLDVV 318
            +I L    K++        G++ + G+ II +DA  + +PK +I E+++  ++   D V
Sbjct: 57  PRIRLIDNPKRI-QSAGLNIGIRNSRGDIIIRVDAH-AVYPKDYILELVEALKRTGADNV 114

Query: 319 TGTRYV----GTGGVYGWDFKRKLVSRGANYLTQLLLRPGVSDL----TGSFRLYKKQVL 370
            G                     L S G+ Y      R G   +    T     Y+++V 
Sbjct: 115 GGPMETIGESKFQKAIAVAQSSPLGSGGSAY------RGGAVKIGYVDTVHHGAYRREVF 168

Query: 371 ENL 373
           E +
Sbjct: 169 EKV 171



 Score = 61.1 bits (149), Expect = 1e-10
 Identities = 36/151 (23%), Positives = 63/151 (41%), Gaps = 19/151 (12%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIII 66
             EIIV+D GS DGT +  ++  +     +I L    K++        G++ + G+ II 
Sbjct: 31  LIEIIVVDGGSTDGTREIVQEYAAKD--PRIRLIDNPKRI-QSAGLNIGIRNSRGDIIIR 87

Query: 67  MDADLSHHPK-FIPEMIKLQQQENLDVVTGTRYV----GTGGVYGWDFKRKLVSRGANYL 121
           +DA  + +PK +I E+++  ++   D V G                     L S G+ Y 
Sbjct: 88  VDAH-AVYPKDYILELVEALKRTGADNVGGPMETIGESKFQKAIAVAQSSPLGSGGSAY- 145

Query: 122 TQLLLRPGVSDL----TGSFRLYKKQVLENL 148
                R G   +    T     Y+++V E +
Sbjct: 146 -----RGGAVKIGYVDTVHHGAYRREVFEKV 171


>gnl|CDD|222281 pfam13641, Glyco_tranf_2_3, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 229

 Score = 60.4 bits (147), Expect = 2e-10
 Identities = 29/127 (22%), Positives = 59/127 (46%), Gaps = 6/127 (4%)

Query: 201 KYTVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSE 260
           +  V++PT+NE     ++   +   + +     E+IV+ DGS D TLD A++L + Y   
Sbjct: 2   RVAVVVPTWNE---DKVLGRTLRSILAQDYPRLEVIVVVDGSDDETLDVARELAAAYPDV 58

Query: 261 KIVLKPRKKKLGLGT---AYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDV 317
           ++ +  R +  G      A    L+    + ++++DAD    P  +  ++     + +  
Sbjct: 59  RVHVVRRPRPPGPTGKARALNEALRAIKSDLVVLLDADSVVDPDTLRRLLPFFLSKGVGA 118

Query: 318 VTGTRYV 324
           V G  +V
Sbjct: 119 VQGPVFV 125



 Score = 55.8 bits (135), Expect = 5e-09
 Identities = 39/200 (19%), Positives = 75/200 (37%), Gaps = 32/200 (16%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGT---AYMHGLKYATGNF 63
             E+IV+ DGS D TLD A++L + Y   ++ +  R +  G      A    L+    + 
Sbjct: 30  RLEVIVVVDGSDDETLDVARELAAAYPDVRVHVVRRPRPPGPTGKARALNEALRAIKSDL 89

Query: 64  IIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGG----------VYGWDFKRKL 113
           ++++DAD    P  +  ++     + +  V G  +V                    +   
Sbjct: 90  VVLLDADSVVDPDTLRRLLPFFLSKGVGAVQGPVFVLNLRTAVAPLYALEFALRHLRFMA 149

Query: 114 VSRGANYLTQLLLRPGVSDLTGSFRLYKKQVLENL----VSSCVSKGYVFQMEMVIRARQ 169
           + R            GV+ L GS  L+++ VLE +        + +     +    R R+
Sbjct: 150 LRRAL----------GVAPLAGSGSLFRRSVLEEIGGFDPGFLLGEDKELGL----RLRR 195

Query: 170 YNYTIGEVPISFVDRVVFTT 189
             +    VP + V   +  +
Sbjct: 196 AGWRTAYVPGAAV-YELSPS 214


>gnl|CDD|133035 cd04192, GT_2_like_e, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 229

 Score = 55.4 bits (134), Expect = 8e-09
 Identities = 37/146 (25%), Positives = 63/146 (43%), Gaps = 26/146 (17%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYP---YEIIVIDDGSPDGT---LDAA-----KQ 252
           V++   NE ENLP ++  ++      +YP   +E+I++DD S DGT   L+ A      Q
Sbjct: 1   VVIAARNEAENLPRLLQSLSAL----DYPKEKFEVILVDDHSTDGTVQILEFAAAKPNFQ 56

Query: 253 LQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQ 312
           L+ +  S    +  +K       A    +K A G++I+  DAD      ++   +   Q+
Sbjct: 57  LKILNNSRV-SISGKKN------ALTTAIKAAKGDWIVTTDADCVVPSNWLLTFVAFIQK 109

Query: 313 ENLDVVTGTRYVGTGGVYGWDFKRKL 338
           E + +V G          G     K 
Sbjct: 110 EQIGLVAG----PVIYFKGKSLLAKF 131



 Score = 48.1 bits (115), Expect = 3e-06
 Identities = 30/120 (25%), Positives = 50/120 (41%), Gaps = 22/120 (18%)

Query: 5   NYP---YEIIVIDDGSPDGT---LDAA-----KQLQSIYGSEKIVLKPRKKKLGLGTAYM 53
           +YP   +E+I++DD S DGT   L+ A      QL+ +  S    +  +K       A  
Sbjct: 23  DYPKEKFEVILVDDHSTDGTVQILEFAAAKPNFQLKILNNSRV-SISGKKN------ALT 75

Query: 54  HGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKL 113
             +K A G++I+  DAD      ++   +   Q+E + +V G          G     K 
Sbjct: 76  TAIKAAKGDWIVTTDADCVVPSNWLLTFVAFIQKEQIGLVAG----PVIYFKGKSLLAKF 131


>gnl|CDD|182223 PRK10073, PRK10073, putative glycosyl transferase; Provisional.
          Length = 328

 Score = 53.9 bits (130), Expect = 5e-08
 Identities = 31/136 (22%), Positives = 60/136 (44%), Gaps = 10/136 (7%)

Query: 196 DSVKNKYTVLLPTYNEKENLP-IIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQ 254
            +   K ++++P YN  ++    +  LI +         EII+++DGS D +++ AK   
Sbjct: 2   MNSTPKLSIIIPLYNAGKDFRAFMESLIAQTWTA----LEIIIVNDGSTDNSVEIAKHYA 57

Query: 255 SIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQEN 314
             Y   +++    +   G+  A   GL  ATG ++   DAD   +P     ++ +  +++
Sbjct: 58  ENYPHVRLL---HQANAGVSVARNTGLAVATGKYVAFPDADDVVYPTMYETLMTMALEDD 114

Query: 315 LDVVT--GTRYVGTGG 328
           LDV            G
Sbjct: 115 LDVAQCNADWCFRDTG 130



 Score = 50.0 bits (120), Expect = 1e-06
 Identities = 25/98 (25%), Positives = 45/98 (45%), Gaps = 5/98 (5%)

Query: 8   YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 67
            EII+++DGS D +++ AK     Y   +++    +   G+  A   GL  ATG ++   
Sbjct: 36  LEIIIVNDGSTDNSVEIAKHYAENYPHVRLL---HQANAGVSVARNTGLAVATGKYVAFP 92

Query: 68  DADLSHHPKFIPEMIKLQQQENLDVVT--GTRYVGTGG 103
           DAD   +P     ++ +  +++LDV            G
Sbjct: 93  DADDVVYPTMYETLMTMALEDDLDVAQCNADWCFRDTG 130


>gnl|CDD|133061 cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose
           synthase (CESA) superfamily.  This is a subfamily of
           cellulose synthase (CESA) superfamily.  CESA superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains.  The
           members of the superfamily include cellulose synthase
           catalytic subunit, chitin synthase, glucan biosynthesis
           protein and other families of CESA-like proteins.
          Length = 251

 Score = 53.4 bits (129), Expect = 6e-08
 Identities = 36/133 (27%), Positives = 59/133 (44%), Gaps = 12/133 (9%)

Query: 201 KYTVLLPTYNEKENLPIIVYLITKYMDEGNYP---YEIIVIDDGSPDGTLDAAKQLQSIY 257
             T+++P YNE+    I   L        +YP    EIIV+ DGS DGT + A++    Y
Sbjct: 30  TVTIIIPAYNEEAV--IEAKLENLLAL--DYPRDRLEIIVVSDGSTDGTAEIARE----Y 81

Query: 258 GSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDV 317
             + + L    ++ G   A    L  ATG  ++  DA+    P  +  +++     ++  
Sbjct: 82  ADKGVKLLRFPERRGKAAALNRALALATGEIVVFTDANALLDPDALRLLVRHFADPSVGA 141

Query: 318 VTGT-RYVGTGGV 329
           V+G    V  GG 
Sbjct: 142 VSGELVIVDGGGS 154



 Score = 46.4 bits (111), Expect = 9e-06
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 8/104 (7%)

Query: 5   NYP---YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATG 61
           +YP    EIIV+ DGS DGT + A++    Y  + + L    ++ G   A    L  ATG
Sbjct: 55  DYPRDRLEIIVVSDGSTDGTAEIARE----YADKGVKLLRFPERRGKAAALNRALALATG 110

Query: 62  NFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGT-RYVGTGGV 104
             ++  DA+    P  +  +++     ++  V+G    V  GG 
Sbjct: 111 EIVVFTDANALLDPDALRLLVRHFADPSVGAVSGELVIVDGGGS 154


>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 278

 Score = 53.2 bits (128), Expect = 7e-08
 Identities = 26/114 (22%), Positives = 43/114 (37%), Gaps = 5/114 (4%)

Query: 203 TVLLPTYNEKENLPII--VYLITKYMDEGNYPY--EIIVIDDGSPDGTL-DAAKQLQSIY 257
           +V++P Y       I+  +      +         EIIV+D  SP     +  K +    
Sbjct: 1   SVIIPVYLSLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSFAKELKKIIAKNG 60

Query: 258 GSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQ 311
               I  K  K       A   G +Y++ +FI  +D D    P  + ++IK  Q
Sbjct: 61  AINYISHKTHKDLFSPAKARNRGAEYSSSDFIFFLDVDCLISPDTLEKIIKHFQ 114



 Score = 49.3 bits (118), Expect = 1e-06
 Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 1/79 (1%)

Query: 9   EIIVIDDGSPDGTL-DAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 67
           EIIV+D  SP     +  K +        I  K  K       A   G +Y++ +FI  +
Sbjct: 36  EIIVVDGDSPLSFAKELKKIIAKNGAINYISHKTHKDLFSPAKARNRGAEYSSSDFIFFL 95

Query: 68  DADLSHHPKFIPEMIKLQQ 86
           D D    P  + ++IK  Q
Sbjct: 96  DVDCLISPDTLEKIIKHFQ 114


>gnl|CDD|200433 TIGR04182, glyco_TIGR04182, glycosyltransferase, TIGR04182 family. 
           Members of this family are glycosyltransferases
           restricted to the archaea. All but two members are from
           species with the PGF-CTERM/archaeosortase A system, a
           proposed maturation system for exported, glycosylated
           proteins as are found often in S-layers.
          Length = 293

 Score = 53.1 bits (128), Expect = 8e-08
 Identities = 55/237 (23%), Positives = 102/237 (43%), Gaps = 33/237 (13%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKI 262
            VL+PT NE      I  +I  + + G    +I+VID  S DGT + AK+  +     ++
Sbjct: 3   CVLIPTLNEA---ATIGDVIDGFQELGYS--DILVIDGNSTDGTQEIAKEAGA-----RV 52

Query: 263 VLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTR 322
           V++  K K   G A     +     +++++D D ++ P     +++       D V G R
Sbjct: 53  VIQSGKGK---GQAVREAFELIDAPYVLMLDGDGTYLPADADALLEPLLSGRADHVIGNR 109

Query: 323 YVGTGGVYGWDFKRKLVSRGANYLTQLLLR--PGV--SDLTGSFRLYKKQVLENLVSSCV 378
           +          F R  ++   N L   L R   GV   D+   +R + ++ +  +  +  
Sbjct: 110 FADMEPG---AFTR--LNLVGNRLINRLFRIIHGVDLRDILSGYRAFTRESIRRMELT-- 162

Query: 379 SKGYVFQMEMVIRARQYNYTIGEVPISFVDRVYG-ESKLG--------GTEIFQFAK 426
             G+  + E+ +   + N  +  VPI++  R  G ++KL         G  I++ AK
Sbjct: 163 ETGFEIETEIAVECVKKNLRVEVVPITYRARPGGSDTKLNPLRDGFKIGLTIYRLAK 219



 Score = 40.0 bits (94), Expect = 0.002
 Identities = 39/181 (21%), Positives = 78/181 (43%), Gaps = 19/181 (10%)

Query: 8   YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 67
            +I+VID  S DGT + AK+  +     ++V++  K K   G A     +     +++++
Sbjct: 28  SDILVIDGNSTDGTQEIAKEAGA-----RVVIQSGKGK---GQAVREAFELIDAPYVLML 79

Query: 68  DADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLTQLLLR 127
           D D ++ P     +++       D V G R+          F R  ++   N L   L R
Sbjct: 80  DGDGTYLPADADALLEPLLSGRADHVIGNRFADMEPG---AFTR--LNLVGNRLINRLFR 134

Query: 128 --PGV--SDLTGSFRLYKKQVLENLVSSCVSKGYVFQMEMVIRARQYNYTIGEVPISFVD 183
              GV   D+   +R + ++ +  +  +    G+  + E+ +   + N  +  VPI++  
Sbjct: 135 IIHGVDLRDILSGYRAFTRESIRRMELT--ETGFEIETEIAVECVKKNLRVEVVPITYRA 192

Query: 184 R 184
           R
Sbjct: 193 R 193


>gnl|CDD|236882 PRK11204, PRK11204, N-glycosyltransferase; Provisional.
          Length = 420

 Score = 52.2 bits (126), Expect = 2e-07
 Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 10/121 (8%)

Query: 203 TVLLPTYNEKENL-PIIVYLITKYMDEGNYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSE 260
           ++L+P YNE EN+   I +L+        YP YE+I I+DGS D T +   +L +     
Sbjct: 57  SILVPCYNEGENVEETISHLLAL-----RYPNYEVIAINDGSSDNTGEILDRLAAQI--P 109

Query: 261 KIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIK-LQQQENLDVVT 319
           ++ +    +  G   A   G   A   +++ +D D    P     M++       +  VT
Sbjct: 110 RLRVIHLAENQGKANALNTGAAAARSEYLVCIDGDALLDPDAAAYMVEHFLHNPRVGAVT 169

Query: 320 G 320
           G
Sbjct: 170 G 170



 Score = 43.4 bits (103), Expect = 1e-04
 Identities = 23/93 (24%), Positives = 39/93 (41%), Gaps = 4/93 (4%)

Query: 5   NYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNF 63
            YP YE+I I+DGS D T +   +L +     ++ +    +  G   A   G   A   +
Sbjct: 80  RYPNYEVIAINDGSSDNTGEILDRLAAQI--PRLRVIHLAENQGKANALNTGAAAARSEY 137

Query: 64  IIIMDADLSHHPKFIPEMIK-LQQQENLDVVTG 95
           ++ +D D    P     M++       +  VTG
Sbjct: 138 LVCIDGDALLDPDAAAYMVEHFLHNPRVGAVTG 170


>gnl|CDD|224137 COG1216, COG1216, Predicted glycosyltransferases [General function
           prediction only].
          Length = 305

 Score = 51.3 bits (123), Expect = 4e-07
 Identities = 49/234 (20%), Positives = 96/234 (41%), Gaps = 35/234 (14%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKI 262
           ++++ TYN  E+L  +  L +           I+V+D+GS DG+L+A K          +
Sbjct: 6   SIIIVTYNRGEDL--VECLASLAAQT-YPDDVIVVVDNGSTDGSLEALK----ARFFPNV 58

Query: 263 VLKPRKKKLGLGTAYMHGLKYATGN---FIIIMDADLSHHPKFIPEMIKLQQQENLDVVT 319
            L    + LG    +  G+KYA      ++++++ D    P  + E++K  +++    V 
Sbjct: 59  RLIENGENLGFAGGFNRGIKYALAKGDDYVLLLNPDTVVEPDLLEELLKAAEEDPAAGVV 118

Query: 320 GTRYVGTGG---VYGWDFKRKLVSRGANYLTQLLLRP-------GVSDLTGSFRLYKKQV 369
           G           +     +   ++ G      L + P        V+ L+G+  L +++ 
Sbjct: 119 GPLIRNYDESLYIDRRGGESDGLTGGWRASPLLEIAPDLSSYLEVVASLSGACLLIRREA 178

Query: 370 LENLVSSCVSKG--------YVFQMEMVIRARQYNYTIGEVPISFVDRVYGESK 415
            E +       G        Y   +++ +RAR+  Y I  VP + +    G SK
Sbjct: 179 FEKV-------GGFDERFFIYYEDVDLCLRARKAGYKIYYVPDAIIYHKIGSSK 225



 Score = 44.4 bits (105), Expect = 5e-05
 Identities = 40/197 (20%), Positives = 80/197 (40%), Gaps = 32/197 (16%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGN---F 63
              I+V+D+GS DG+L+A K          + L    + LG    +  G+KYA      +
Sbjct: 32  DDVIVVVDNGSTDGSLEALK----ARFFPNVRLIENGENLGFAGGFNRGIKYALAKGDDY 87

Query: 64  IIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGG---VYGWDFKRKLVSRGANY 120
           +++++ D    P  + E++K  +++    V G           +     +   ++ G   
Sbjct: 88  VLLLNPDTVVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRGGESDGLTGGWRA 147

Query: 121 LTQLLLRP-------GVSDLTGSFRLYKKQVLENLVSSCVSKG--------YVFQMEMVI 165
              L + P        V+ L+G+  L +++  E +       G        Y   +++ +
Sbjct: 148 SPLLEIAPDLSSYLEVVASLSGACLLIRREAFEKV-------GGFDERFFIYYEDVDLCL 200

Query: 166 RARQYNYTIGEVPISFV 182
           RAR+  Y I  VP + +
Sbjct: 201 RARKAGYKIYYVPDAII 217


>gnl|CDD|188452 TIGR03937, PgaC_IcaA, poly-beta-1,6 N-acetyl-D-glucosamine
           synthase.  Members of this protein family are
           biofilm-forming enzymes that polymerize
           N-acetyl-D-glucosamine residues in beta(1,6) linkage.
           One named members is IcaA (intercellular adhesin protein
           A), an enzyme that acts (with aid of subunit IcaD) in
           Polysaccharide Intercellular Adhesin (PIA) biosynthesis
           in Staphylococcus epidermis). The homologous member in
           E. coli is designated PgaC. Members are often encoded
           next to a polysaccharide deacetylase and involved in
           biofilm formation. Note that chitin, although also made
           from N-acetylglucosamine, is formed with beta-1,4
           linkages.
          Length = 407

 Score = 50.7 bits (122), Expect = 8e-07
 Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 10/121 (8%)

Query: 203 TVLLPTYNEKENL-PIIVYLITKYMDEGNYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSE 260
           ++L+P YNE  N+   I +L+        YP +EII I+DGS D T +   +L +     
Sbjct: 48  SILVPCYNEGANVEETISHLLAL-----RYPNFEIIAINDGSKDNTAEILDRLAAQD--P 100

Query: 261 KIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIK-LQQQENLDVVT 319
           ++ +    +  G   A   GL  A   +++ +D D    P     M++       +  VT
Sbjct: 101 RLRVIHLAENQGKANALNTGLLAAKYEYLVCIDGDALLDPDAAYWMVEHFLSNPRVGAVT 160

Query: 320 G 320
           G
Sbjct: 161 G 161



 Score = 41.1 bits (97), Expect = 8e-04
 Identities = 24/93 (25%), Positives = 40/93 (43%), Gaps = 4/93 (4%)

Query: 5   NYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNF 63
            YP +EII I+DGS D T +   +L +     ++ +    +  G   A   GL  A   +
Sbjct: 71  RYPNFEIIAINDGSKDNTAEILDRLAAQD--PRLRVIHLAENQGKANALNTGLLAAKYEY 128

Query: 64  IIIMDADLSHHPKFIPEMIK-LQQQENLDVVTG 95
           ++ +D D    P     M++       +  VTG
Sbjct: 129 LVCIDGDALLDPDAAYWMVEHFLSNPRVGAVTG 161


>gnl|CDD|237556 PRK13915, PRK13915, putative glucosyl-3-phosphoglycerate synthase;
           Provisional.
          Length = 306

 Score = 50.3 bits (121), Expect = 8e-07
 Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 8/106 (7%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQL-QSIYGSEK 261
           +V+LP  NE+E +  +V  I   + E     E+IVID GS D T + A      +   E+
Sbjct: 34  SVVLPALNEEETVGKVVDSIRPLLMEPL-VDELIVIDSGSTDATAERAAAAGARVVSREE 92

Query: 262 IV--LKPRKKKLGLGTAYMHGLKYATGNFIIIMDADL-SHHPKFIP 304
           I+  L PR  K   G A    L   TG+ ++ +DADL +  P F+P
Sbjct: 93  ILPELPPRPGK---GEALWRSLAATTGDIVVFVDADLINFDPMFVP 135



 Score = 41.4 bits (98), Expect = 6e-04
 Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 7/75 (9%)

Query: 9   EIIVIDDGSPDGTLDAAKQL-QSIYGSEKIV--LKPRKKKLGLGTAYMHGLKYATGNFII 65
           E+IVID GS D T + A      +   E+I+  L PR  K   G A    L   TG+ ++
Sbjct: 64  ELIVIDSGSTDATAERAAAAGARVVSREEILPELPPRPGK---GEALWRSLAATTGDIVV 120

Query: 66  IMDADL-SHHPKFIP 79
            +DADL +  P F+P
Sbjct: 121 FVDADLINFDPMFVP 135


>gnl|CDD|133043 cd06421, CESA_CelA_like, CESA_CelA_like are involved in the
           elongation of the glucan chain of cellulose.  Family of
           proteins related to  Agrobacterium tumefaciens CelA and 
           Gluconacetobacter xylinus BscA. These proteins are
           involved in the elongation of the glucan chain of
           cellulose, an aggregate of unbranched polymers of
           beta-1,4-linked glucose residues. They are putative
           catalytic subunit of cellulose synthase, which is a
           glycosyltransferase using UDP-glucose as the substrate.
           The catalytic subunit is an integral membrane protein
           with 6 transmembrane segments and it is postulated that
           the protein is anchored in the membrane at the
           N-terminal end.
          Length = 234

 Score = 49.1 bits (118), Expect = 1e-06
 Identities = 29/106 (27%), Positives = 42/106 (39%), Gaps = 16/106 (15%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYP---YEIIVIDDGSPDGTLDAAKQLQSIYGSE 260
           V +PTYNE   +          +D   YP     + V+DDG        A +L   YG  
Sbjct: 5   VFIPTYNEPLEIVRKTLRAALAID---YPHDKLRVYVLDDGRRPELRALAAELGVEYGYR 61

Query: 261 KIV-LKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPE 305
            +     R  K G      + L + TG+F+ I+DAD       +P 
Sbjct: 62  YLTRPDNRHAKAG---NLNNALAHTTGDFVAILDAD------HVPT 98



 Score = 41.4 bits (98), Expect = 4e-04
 Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 13/80 (16%)

Query: 5  NYP---YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIV-LKPRKKKLGLGTAYMHGLKYAT 60
          +YP     + V+DDG        A +L   YG   +     R  K G      + L + T
Sbjct: 28 DYPHDKLRVYVLDDGRRPELRALAAELGVEYGYRYLTRPDNRHAKAG---NLNNALAHTT 84

Query: 61 GNFIIIMDADLSHHPKFIPE 80
          G+F+ I+DAD       +P 
Sbjct: 85 GDFVAILDAD------HVPT 98


>gnl|CDD|133042 cd06420, GT2_Chondriotin_Pol_N, N-terminal domain of Chondroitin
           polymerase functions as a GalNAc transferase.
           Chondroitin polymerase is a two domain, bi-functional
           protein. The N-terminal domain functions as a GalNAc
           transferase. The bacterial chondroitin polymerase
           catalyzes elongation of the chondroitin chain by
           alternatively transferring the GlcUA and GalNAc moiety
           from UDP-GlcUA and UDP-GalNAc to the non-reducing ends
           of the chondroitin chain. The enzyme consists of
           N-terminal and C-terminal domains in which the two
           active sites catalyze the addition of GalNAc and GlcUA,
           respectively. Chondroitin chains range from 40 to over
           100 repeating units of the disaccharide. Sulfated
           chondroitins are involved in the regulation of various
           biological functions such as central nervous system
           development, wound repair, infection, growth factor
           signaling, and morphogenesis, in addition to its
           conventional structural roles. In Caenorhabditis
           elegans, chondroitin is an essential factor for the worm
           to undergo cytokinesis and cell division. Chondroitin is
           synthesized as proteoglycans, sulfated and secreted to
           the cell surface or extracellular matrix.
          Length = 182

 Score = 48.3 bits (116), Expect = 1e-06
 Identities = 28/109 (25%), Positives = 55/109 (50%), Gaps = 9/109 (8%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIV 263
           +++ TYN  E L +++      +++   P+E+I+ DDGS + T +  ++ +S      I 
Sbjct: 1   LIITTYNRPEALELVL---KSVLNQSILPFEVIIADDGSTEETKELIEEFKS---QFPIP 54

Query: 264 LKP-RKKKLG--LGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKL 309
           +K   ++  G          +  A G+++I +D D   HP FI + I+L
Sbjct: 55  IKHVWQEDEGFRKAKIRNKAIAAAKGDYLIFIDGDCIPHPDFIADHIEL 103



 Score = 40.3 bits (95), Expect = 6e-04
 Identities = 23/82 (28%), Positives = 41/82 (50%), Gaps = 6/82 (7%)

Query: 6   YPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKP-RKKKLG--LGTAYMHGLKYATGN 62
            P+E+I+ DDGS + T +  ++ +S      I +K   ++  G          +  A G+
Sbjct: 25  LPFEVIIADDGSTEETKELIEEFKS---QFPIPIKHVWQEDEGFRKAKIRNKAIAAAKGD 81

Query: 63  FIIIMDADLSHHPKFIPEMIKL 84
           ++I +D D   HP FI + I+L
Sbjct: 82  YLIFIDGDCIPHPDFIADHIEL 103


>gnl|CDD|133005 cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4
           glucosyltransferase is required for biosynthesis of
           lipooligosaccharide.  UDP-glucose: lipooligosaccharide
           (LOS)  beta-1-4-glucosyltransferase catalyzes the
           addition of the first residue, glucose, of the
           lacto-N-neotetrase structure to HepI of the LOS inner
           core.  LOS is the major constituent of the outer leaflet
           of the outer membrane of gram-positive bacteria. It
           consists of a short oligosaccharide chain of variable
           composition (alpha chain) attached to a branched inner
           core which is lined in turn to lipid A. Beta 1,4
           glucosyltransferase is required to attach the alpha
           chain to the inner core.
          Length = 229

 Score = 46.9 bits (112), Expect = 6e-06
 Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 20/96 (20%)

Query: 203 TVLLPTYNEKENLP-II--VYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGS 259
           +V++ T NE+ N+   +  V              EIIV+D GS D T++ AK+    YG+
Sbjct: 3   SVVIITKNEERNIERCLESVKWAVD---------EIIVVDSGSTDRTVEIAKE----YGA 49

Query: 260 EKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDAD 295
            K+    ++   G G      L+ AT ++++ +DAD
Sbjct: 50  -KVY---QRWWDGFGAQRNFALELATNDWVLSLDAD 81



 Score = 44.2 bits (105), Expect = 5e-05
 Identities = 22/62 (35%), Positives = 36/62 (58%), Gaps = 8/62 (12%)

Query: 9  EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMD 68
          EIIV+D GS D T++ AK+    YG+ K+    ++   G G      L+ AT ++++ +D
Sbjct: 28 EIIVVDSGSTDRTVEIAKE----YGA-KVY---QRWWDGFGAQRNFALELATNDWVLSLD 79

Query: 69 AD 70
          AD
Sbjct: 80 AD 81


>gnl|CDD|213815 TIGR03469, HpnB, hopene-associated glycosyltransferase HpnB.  This
           family of genes include a glycosyl transferase, group 2
           domain (pfam00535) which are responsible, generally for
           the transfer of nucleotide-diphosphate sugars to
           substrates such as polysaccharides and lipids. The genes
           of this family are often found in the same genetic locus
           with squalene-hopene cyclase genes, and are never
           associated with genes for the metabolism of phytoene.
           Indeed, the members of this family appear to never be
           found in a genome lacking squalene-hopene cyclase (SHC),
           although not all genomes encoding SHC have this glycosyl
           transferase. In the organism Zymomonas mobilis the
           linkage of this gene to hopanoid biosynthesis has been
           noted and the gene named HpnB. Hopanoids are known to
           feature polar glycosyl head groups in many organisms.
          Length = 384

 Score = 47.4 bits (113), Expect = 8e-06
 Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 18/105 (17%)

Query: 5   NYP--YEIIVIDDGSPDGTLDAAKQLQSIYGSEK--IVL--KPRKK----KL-----GLG 49
           +YP    +I++DD S DGT D A+     YG      V+  +P       KL     G+ 
Sbjct: 66  DYPGKLHVILVDDHSTDGTADIARAAARAYGRGDRLTVVSGQPLPPGWSGKLWAVSQGIA 125

Query: 50  TAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVT 94
            A          +++++ DAD++H P  +  ++   + E LD+V+
Sbjct: 126 AA---RTLAPPADYLLLTDADIAHGPDNLARLVARARAEGLDLVS 167



 Score = 47.4 bits (113), Expect = 8e-06
 Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 18/105 (17%)

Query: 230 NYP--YEIIVIDDGSPDGTLDAAKQLQSIYGSEK--IVL--KPRKK----KL-----GLG 274
           +YP    +I++DD S DGT D A+     YG      V+  +P       KL     G+ 
Sbjct: 66  DYPGKLHVILVDDHSTDGTADIARAAARAYGRGDRLTVVSGQPLPPGWSGKLWAVSQGIA 125

Query: 275 TAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVT 319
            A          +++++ DAD++H P  +  ++   + E LD+V+
Sbjct: 126 AA---RTLAPPADYLLLTDADIAHGPDNLARLVARARAEGLDLVS 167


>gnl|CDD|234532 TIGR04283, glyco_like_mftF, transferase 2,
           rSAM/selenodomain-associated.  This enzyme may transfer
           a nucleotide, or it sugar moiety, as part of a
           biosynthetic pathway. Other proposed members of the
           pathway include another transferase (TIGR04282), a
           phosphoesterase, and a radical SAM enzyme (TIGR04167)
           whose C-terminal domain (pfam12345) frequently contains
           a selenocysteine [Unknown function, Enzymes of unknown
           specificity].
          Length = 220

 Score = 45.6 bits (109), Expect = 1e-05
 Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 17/95 (17%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKI 262
           ++++P  NE   LP ++  +     +     E+IV+D GS DGT++ A+ L       K+
Sbjct: 2   SIIIPVLNEAATLPELLADLQALPGDA----EVIVVDGGSTDGTVEIARSL-----GAKV 52

Query: 263 VLKPRKKKLGLGTAY-M-HGLKYATGNFIIIMDAD 295
           +  P+      G A  M  G   A G+ ++ + AD
Sbjct: 53  IHSPK------GRARQMNAGAALAKGDILLFLHAD 81



 Score = 36.7 bits (86), Expect = 0.014
 Identities = 21/67 (31%), Positives = 33/67 (49%), Gaps = 13/67 (19%)

Query: 6  YPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAY-M-HGLKYATGNF 63
             E+IV+D GS DGT++ A+ L       K++  P+      G A  M  G   A G+ 
Sbjct: 26 GDAEVIVVDGGSTDGTVEIARSL-----GAKVIHSPK------GRARQMNAGAALAKGDI 74

Query: 64 IIIMDAD 70
          ++ + AD
Sbjct: 75 LLFLHAD 81


>gnl|CDD|133059 cd06437, CESA_CaSu_A2, Cellulose synthase catalytic subunit A2
           (CESA2) is a catalytic subunit or a catalytic subunit
           substitute of the cellulose synthase complex.  Cellulose
           synthase (CESA) catalyzes the polymerization reaction of
           cellulose using UDP-glucose as the substrate. Cellulose
           is an aggregate of unbranched polymers of
           beta-1,4-linked glucose residues, which is an abundant
           polysaccharide produced by plants and in varying degrees
           by several other organisms including algae, bacteria,
           fungi, and even some animals. Genomes from higher plants
           harbor multiple CESA genes. There are ten in
           Arabidopsis. At least three different CESA proteins are
           required to form a functional complex. In Arabidopsis,
           CESA1, 3 and 6 and CESA4, 7 and 8, are required for
           cellulose biosynthesis during primary and secondary cell
           wall formation. CESA2 is very closely related to CESA6
           and is viewed as a prime substitute for CESA6. They
           functionally compensate each other. The cesa2 and cesa6
           double mutant plants were significantly smaller, while
           the single mutant plants were almost normal.
          Length = 232

 Score = 45.4 bits (108), Expect = 2e-05
 Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 35/158 (22%)

Query: 203 TVLLPTYNEKENLPIIVYLI--TKYMDEGNYP---YEIIVIDDGSPDGTLDAAKQLQSIY 257
           TV LP +NEK    ++  LI     +D   YP    EI V+DD S D T+  A+++   Y
Sbjct: 4   TVQLPVFNEKY---VVERLIEAACALD---YPKDRLEIQVLDD-STDETVRLAREIVEEY 56

Query: 258 GSEKIVLKPRKK------KLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQ 311
            ++ + +K  ++      K G   A   G+K A G ++ I DAD    P F+ +      
Sbjct: 57  AAQGVNIKHVRRADRTGYKAG---ALAEGMKVAKGEYVAIFDADFVPPPDFLQKTPPYFA 113

Query: 312 QENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLTQL 349
              L  V             W      ++   + LT++
Sbjct: 114 DPKLGFVQT----------RWGH----INANYSLLTRV 137



 Score = 35.7 bits (83), Expect = 0.028
 Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 24/122 (19%)

Query: 9   EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKK------KLGLGTAYMHGLKYATGN 62
           EI V+DD S D T+  A+++   Y ++ + +K  ++      K G   A   G+K A G 
Sbjct: 34  EIQVLDD-STDETVRLAREIVEEYAAQGVNIKHVRRADRTGYKAG---ALAEGMKVAKGE 89

Query: 63  FIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSRGANYLT 122
           ++ I DAD    P F+ +         L  V             W      ++   + LT
Sbjct: 90  YVAIFDADFVPPPDFLQKTPPYFADPKLGFVQT----------RWGH----INANYSLLT 135

Query: 123 QL 124
           ++
Sbjct: 136 RV 137


>gnl|CDD|133013 cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase
           family-2 subfamily with unknown function.
           Glycosyltransferase family 2 (GT-2) subfamily of unknown
           function. GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 221

 Score = 44.9 bits (107), Expect = 2e-05
 Identities = 31/96 (32%), Positives = 46/96 (47%), Gaps = 16/96 (16%)

Query: 202 YTVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEK 261
            ++++PT NE ENLP    L+         P EIIV+D GS DGT+  A+          
Sbjct: 1   LSIIIPTLNEAENLPR---LLASLRRLNPLPLEIIVVDGGSTDGTVAIARSA------GV 51

Query: 262 IVLKPRKKKLGLGTAY-M-HGLKYATGNFIIIMDAD 295
           +V+   K     G A  M  G   A G++++ + AD
Sbjct: 52  VVISSPK-----GRARQMNAGAAAARGDWLLFLHAD 82



 Score = 33.3 bits (77), Expect = 0.15
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 13/67 (19%)

Query: 6  YPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAY-M-HGLKYATGNF 63
           P EIIV+D GS DGT+  A+          +V+   K     G A  M  G   A G++
Sbjct: 27 LPLEIIVVDGGSTDGTVAIARSA------GVVVISSPK-----GRARQMNAGAAAARGDW 75

Query: 64 IIIMDAD 70
          ++ + AD
Sbjct: 76 LLFLHAD 82


>gnl|CDD|133049 cd06427, CESA_like_2, CESA_like_2 is a member of the cellulose
           synthase superfamily.  The cellulose synthase (CESA)
           superfamily includes a wide variety of
           glycosyltransferase family 2 enzymes that share the
           common characteristic of catalyzing the elongation of
           polysaccharide chains.  The members include cellulose
           synthase catalytic subunit, chitin synthase, Glucan
           Biosynthesis protein and other families of CESA-like
           proteins. Cellulose synthase catalyzes the
           polymerization reaction of cellulose, an aggregate of
           unbranched polymers of beta-1,4-linked glucose residues
           in  plants, most algae, some bacteria and fungi, and
           even some animals. In bacteria, algae and lower
           eukaryotes, there is a second unrelated type of
           cellulose synthase (Type II), which produces acylated
           cellulose, a derivative of cellulose.  Chitin synthase
           catalyzes the incorporation of GlcNAc from substrate
           UDP-GlcNAc into chitin, which is a linear homopolymer of
           beta-(1,4)-linked GlcNAc residues and Glucan
           Biosynthesis protein catalyzes the elongation of
           beta-1,2 polyglucose chains of glucan.
          Length = 241

 Score = 44.6 bits (106), Expect = 4e-05
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 16/101 (15%)

Query: 202 YTVLLPTYNEKENLPIIVYLITKYMDEGNYPYE----IIVIDDGSPDGTLDAAKQLQSIY 257
           YT+L+P Y E E LP ++      +   +YP       +++++   + T+ AA+ L+   
Sbjct: 3   YTILVPLYKEAEVLPQLI----ASLSALDYPRSKLDVKLLLEEDDEE-TIAAARALRLPS 57

Query: 258 GSEKIVL---KPRKKKLGLGTAYMHGLKYATGNFIIIMDAD 295
               +V+   +PR K      A  + L +A G +++I DA+
Sbjct: 58  IFRVVVVPPSQPRTKP----KACNYALAFARGEYVVIYDAE 94


>gnl|CDD|133029 cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 166

 Score = 42.5 bits (101), Expect = 8e-05
 Identities = 19/81 (23%), Positives = 44/81 (54%), Gaps = 5/81 (6%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIII 66
            +E+IV+D+ S DG+++  ++L       ++ L    + LG G     G++ A G+++++
Sbjct: 26  DFEVIVVDNASTDGSVELLRELF-----PEVRLIRNGENLGFGAGNNQGIREAKGDYVLL 80

Query: 67  MDADLSHHPKFIPEMIKLQQQ 87
           ++ D    P  + E++   +Q
Sbjct: 81  LNPDTVVEPGALLELLDAAEQ 101



 Score = 42.5 bits (101), Expect = 8e-05
 Identities = 19/81 (23%), Positives = 44/81 (54%), Gaps = 5/81 (6%)

Query: 232 PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIII 291
            +E+IV+D+ S DG+++  ++L       ++ L    + LG G     G++ A G+++++
Sbjct: 26  DFEVIVVDNASTDGSVELLRELF-----PEVRLIRNGENLGFGAGNNQGIREAKGDYVLL 80

Query: 292 MDADLSHHPKFIPEMIKLQQQ 312
           ++ D    P  + E++   +Q
Sbjct: 81  LNPDTVVEPGALLELLDAAEQ 101


>gnl|CDD|133017 cd02526, GT2_RfbF_like, RfbF is a putative dTDP-rhamnosyl
           transferase.  Shigella flexneri RfbF protein is a
           putative dTDP-rhamnosyl transferase. dTDP rhamnosyl
           transferases of Shigella flexneri  add rhamnose sugars
           to N-acetyl-glucosamine in the O-antigen tetrasaccharide
           repeat. Lipopolysaccharide O antigens are important
           virulence determinants for many bacteria. The variations
           of sugar composition, the sequence of the sugars and the
           linkages in the O antigen provide structural diversity
           of the O antigen.
          Length = 237

 Score = 43.4 bits (103), Expect = 8e-05
 Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 8/83 (9%)

Query: 9   EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGN---FII 65
           +++V+D+ S         +L+    SEKI L    + LG+  A   G+K A  N   +++
Sbjct: 26  KVVVVDNSS-----GNDIELRLRLNSEKIELIHLGENLGIAKALNIGIKAALENGADYVL 80

Query: 66  IMDADLSHHPKFIPEMIKLQQQE 88
           + D D    P  + +++  +   
Sbjct: 81  LFDQDSVPPPDMVEKLLAYKILS 103



 Score = 43.4 bits (103), Expect = 8e-05
 Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 8/83 (9%)

Query: 234 EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGN---FII 290
           +++V+D+ S         +L+    SEKI L    + LG+  A   G+K A  N   +++
Sbjct: 26  KVVVVDNSS-----GNDIELRLRLNSEKIELIHLGENLGIAKALNIGIKAALENGADYVL 80

Query: 291 IMDADLSHHPKFIPEMIKLQQQE 313
           + D D    P  + +++  +   
Sbjct: 81  LFDQDSVPPPDMVEKLLAYKILS 103


>gnl|CDD|133055 cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen
           biosynthesis.  Escherichia coli WfgS and Shigella
           dysenteriae WfeV are glycosyltransferase 2 family
           enzymes involved in O-antigen biosynthesis. GT-2 enzymes
           have GT-A type structural fold, which has two tightly
           associated beta/alpha/beta domains that tend to form a
           continuous central sheet of at least eight beta-strands.
           These are enzymes that catalyze the transfer of sugar
           moieties from activated donor molecules to specific
           acceptor molecules, forming glycosidic bonds.
           Glycosyltransferases have been classified into more than
           90 distinct sequence based families.
          Length = 202

 Score = 41.8 bits (99), Expect = 2e-04
 Identities = 42/190 (22%), Positives = 71/190 (37%), Gaps = 17/190 (8%)

Query: 230 NYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNF 288
            YP  E IVID GS DGT+D  K+ +        + +P K   G+  A   G+  ATG+ 
Sbjct: 24  TYPNIEYIVIDGGSTDGTVDIIKKYEDKI--TYWISEPDK---GIYDAMNKGIALATGDI 78

Query: 289 IIIMDA-DLSHHPKFIPEMIKLQQQENLDVVTG-TRYVGTGGVYGWDFKRKLVSRGANYL 346
           I  +++ D       +  +    +   +DVV G    V   G      +     R   +L
Sbjct: 79  IGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGDVLLVDENG------RVIGRRRPPPFL 132

Query: 347 TQLLLRPGVSDLTGSFRLYKKQVLENLVSSCVSKGYVFQMEMVIRARQYNYTIGEVPISF 406
            + LL         +F  +++ + E       S       ++++R          +P   
Sbjct: 133 DKFLLYGMPICHQATF--FRRSLFEKYGGFDESYRIAADYDLLLRLLLAGKIFKYLPEVL 190

Query: 407 VD-RVYGESK 415
              R+ G S 
Sbjct: 191 AAFRLGGVSS 200



 Score = 41.0 bits (97), Expect = 5e-04
 Identities = 30/106 (28%), Positives = 45/106 (42%), Gaps = 8/106 (7%)

Query: 5   NYP-YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNF 63
            YP  E IVID GS DGT+D  K+ +        + +P K   G+  A   G+  ATG+ 
Sbjct: 24  TYPNIEYIVIDGGSTDGTVDIIKKYEDKI--TYWISEPDK---GIYDAMNKGIALATGDI 78

Query: 64  IIIMDA-DLSHHPKFIPEMIKLQQQENLDVVTG-TRYVGTGGVYGW 107
           I  +++ D       +  +    +   +DVV G    V   G    
Sbjct: 79  IGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGDVLLVDENGRVIG 124


>gnl|CDD|133039 cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 214

 Score = 41.8 bits (99), Expect = 2e-04
 Identities = 29/102 (28%), Positives = 43/102 (42%), Gaps = 22/102 (21%)

Query: 203 TVLLPTYN-EKENLPIIVYLITKYMDE--------GNYPYEIIVIDDGSPDGTLDAAKQL 253
            VL+ TYN EK            Y+ E             E+I+ DDGS DGT++  K+ 
Sbjct: 1   AVLMATYNGEK------------YLREQLDSILAQTYKNDELIISDDGSTDGTVEIIKEY 48

Query: 254 QSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDAD 295
                   I+L    K LG+   +   L+ A G+++   D D
Sbjct: 49  IDKD-PFIIILIRNGKNLGVARNFESLLQAADGDYVFFCDQD 89



 Score = 40.3 bits (95), Expect = 7e-04
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 1/62 (1%)

Query: 9  EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMD 68
          E+I+ DDGS DGT++  K+         I+L    K LG+   +   L+ A G+++   D
Sbjct: 29 ELIISDDGSTDGTVEIIKEYIDKD-PFIIILIRNGKNLGVARNFESLLQAADGDYVFFCD 87

Query: 69 AD 70
           D
Sbjct: 88 QD 89


>gnl|CDD|234095 TIGR03030, CelA, cellulose synthase catalytic subunit
           (UDP-forming).  Cellulose synthase catalyzes the
           beta-1,4 polymerization of glucose residues in the
           formation of cellulose. In bacteria, the substrate is
           UDP-glucose. The synthase consists of two subunits (or
           domains in the frequent cases where it is encoded as a
           single polypeptide), the catalytic domain modelled here
           and the regulatory domain (pfam03170). The regulatory
           domain binds the allosteric activator cyclic di-GMP. The
           protein is membrane-associated and probably assembles
           into multimers such that the individual cellulose
           strands can self-assemble into multi-strand fibrils.
          Length = 713

 Score = 42.7 bits (101), Expect = 3e-04
 Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 24/114 (21%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYPYE---IIVIDDGSPDGTLDAAKQLQSIYGSE 260
           V +PTYNE   +     L  K MD   YP +   + ++DDG  D   +     Q+     
Sbjct: 135 VFIPTYNEDLEIVATTVLAAKNMD---YPADKFRVWILDDGGTDQKRNDPDPEQAEAAQR 191

Query: 261 KIVLKPRKKKLGLGTAYM--------------HGLKYATGNFIIIMDADLSHHP 300
           +  LK   +KLG+   Y+              + LK+  G  I+I DAD  H P
Sbjct: 192 REELKEFCRKLGVN--YITRPRNVHAKAGNINNALKHTDGELILIFDAD--HVP 241



 Score = 29.2 bits (66), Expect = 4.4
 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 21/87 (24%)

Query: 6   YPYE---IIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYM--------- 53
           YP +   + ++DDG  D   +     Q+     +  LK   +KLG+   Y+         
Sbjct: 159 YPADKFRVWILDDGGTDQKRNDPDPEQAEAAQRREELKEFCRKLGVN--YITRPRNVHAK 216

Query: 54  -----HGLKYATGNFIIIMDADLSHHP 75
                + LK+  G  I+I DAD  H P
Sbjct: 217 AGNINNALKHTDGELILIFDAD--HVP 241


>gnl|CDD|133056 cd06434, GT2_HAS, Hyaluronan synthases catalyze polymerization of
           hyaluronan.  Hyaluronan synthases (HASs) are
           bi-functional glycosyltransferases that catalyze
           polymerization of hyaluronan. HASs transfer both GlcUA
           and GlcNAc in beta-(1,3) and beta-(1,4) linkages,
           respectively to the hyaluronan chain using UDP-GlcNAc
           and UDP-GlcUA as substrates. HA is made as a free
           glycan, not attached to a protein or lipid. HASs do not
           need a primer for HA synthesis; they initiate HA
           biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA,
           and Mg2+. Hyaluronan (HA) is a linear
           heteropolysaccharide composed of (1-3)-linked
           beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can
           be found in vertebrates and a few microbes and is
           typically on the cell surface or in the extracellular
           space, but is also found inside mammalian cells.
           Hyaluronan has several physiochemical and biological
           functions such as space filling, lubrication, and
           providing a hydrated matrix through which cells can
           migrate.
          Length = 235

 Score = 41.5 bits (98), Expect = 4e-04
 Identities = 26/106 (24%), Positives = 45/106 (42%), Gaps = 8/106 (7%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKI 262
           TV++P Y+E  ++     L +        P EIIV+ DG           L        I
Sbjct: 3   TVIIPVYDEDPDVFREC-LRSIL---RQKPLEIIVVTDGD---DEPYLSILSQTVKYGGI 55

Query: 263 VLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIK 308
            +       G   A   G+++ T + ++++D+D    P  +PEM+K
Sbjct: 56  FVITVPHP-GKRRALAEGIRHVTTDIVVLLDSDTVWPPNALPEMLK 100



 Score = 35.7 bits (83), Expect = 0.030
 Identities = 20/79 (25%), Positives = 33/79 (41%), Gaps = 4/79 (5%)

Query: 5   NYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFI 64
             P EIIV+ DG           L        I +       G   A   G+++ T + +
Sbjct: 26  QKPLEIIVVTDGD---DEPYLSILSQTVKYGGIFVITVPHP-GKRRALAEGIRHVTTDIV 81

Query: 65  IIMDADLSHHPKFIPEMIK 83
           +++D+D    P  +PEM+K
Sbjct: 82  VLLDSDTVWPPNALPEMLK 100


>gnl|CDD|133028 cd04185, GT_2_like_b, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 202

 Score = 38.8 bits (91), Expect = 0.002
 Identities = 29/113 (25%), Positives = 50/113 (44%), Gaps = 6/113 (5%)

Query: 7   PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYA--TG-NF 63
           P  IIVID+ S DGT +    L S+   + IV     + LG    +  G++ A   G ++
Sbjct: 26  PDHIIVIDNASTDGTAEW---LTSLGDLDNIVYLRLPENLGGAGGFYEGVRRAYELGYDW 82

Query: 64  IIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSR 116
           I +MD D    P  + +++    ++N   +        G   G    R++V +
Sbjct: 83  IWLMDDDAIPDPDALEKLLAYADKDNPQFLAPLVLDPDGSFVGVLISRRVVEK 135



 Score = 38.8 bits (91), Expect = 0.002
 Identities = 29/113 (25%), Positives = 50/113 (44%), Gaps = 6/113 (5%)

Query: 232 PYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYA--TG-NF 288
           P  IIVID+ S DGT +    L S+   + IV     + LG    +  G++ A   G ++
Sbjct: 26  PDHIIVIDNASTDGTAEW---LTSLGDLDNIVYLRLPENLGGAGGFYEGVRRAYELGYDW 82

Query: 289 IIIMDADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRKLVSR 341
           I +MD D    P  + +++    ++N   +        G   G    R++V +
Sbjct: 83  IWLMDDDAIPDPDALEKLLAYADKDNPQFLAPLVLDPDGSFVGVLISRRVVEK 135


>gnl|CDD|234419 TIGR03965, mycofact_glyco, mycofactocin system glycosyltransferase.
            Members of this protein family are putative
           glycosyltransferases, members of pfam00535 (glycosyl
           transferase family 2). Members appear mostly in the
           Actinobacteria, where they appear to be part of a system
           for converting a precursor peptide (TIGR03969) into a
           novel redox carrier designated mycofactocin. A radical
           SAM enzyme, TIGR03962, is a proposed to be a key
           maturase for mycofactocin.
          Length = 467

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 4/74 (5%)

Query: 9   EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMD 68
           E+IV+DDGS D     A +   +    +++  PR  + G   A   G + A   F+   D
Sbjct: 107 EVIVVDDGSEDPVPTRAARGARL--PVRVIRHPR--RQGPAAARNAGARAARTEFVAFTD 162

Query: 69  ADLSHHPKFIPEMI 82
           +D+   P ++  ++
Sbjct: 163 SDVVPRPGWLRALL 176



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 4/74 (5%)

Query: 234 EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMD 293
           E+IV+DDGS D     A +   +    +++  PR  + G   A   G + A   F+   D
Sbjct: 107 EVIVVDDGSEDPVPTRAARGARL--PVRVIRHPR--RQGPAAARNAGARAARTEFVAFTD 162

Query: 294 ADLSHHPKFIPEMI 307
           +D+   P ++  ++
Sbjct: 163 SDVVPRPGWLRALL 176


>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
          mucin-type O-linked glycans.  UDP-GalNAc: polypeptide
          alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
          initiate the formation of mucin-type, O-linked glycans
          by catalyzing the transfer of
          alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
          hydroxyl groups of Ser or Thr residues of core proteins
          to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
          enzymes are type II membrane proteins with a GT-A type
          catalytic domain and a lectin domain located on the
          lumen side of the Golgi apparatus. In human, there are
          15 isozymes of pp-GalNAc-Ts, representing the largest
          of all glycosyltransferase families. Each isozyme has
          unique but partially redundant substrate specificity
          for glycosylation sites on acceptor proteins.
          Length = 299

 Score = 38.7 bits (91), Expect = 0.003
 Identities = 18/61 (29%), Positives = 31/61 (50%)

Query: 9  EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMD 68
          EII++DD S    L    +        K+ +   KK+ GL  A + G + ATG+ ++ +D
Sbjct: 32 EIILVDDFSDKPELKLLLEEYYKKYLPKVKVLRLKKREGLIRARIAGARAATGDVLVFLD 91

Query: 69 A 69
          +
Sbjct: 92 S 92



 Score = 38.7 bits (91), Expect = 0.003
 Identities = 18/61 (29%), Positives = 31/61 (50%)

Query: 234 EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMD 293
           EII++DD S    L    +        K+ +   KK+ GL  A + G + ATG+ ++ +D
Sbjct: 32  EIILVDDFSDKPELKLLLEEYYKKYLPKVKVLRLKKREGLIRARIAGARAATGDVLVFLD 91

Query: 294 A 294
           +
Sbjct: 92  S 92


>gnl|CDD|184755 PRK14583, hmsR, N-glycosyltransferase; Provisional.
          Length = 444

 Score = 35.3 bits (81), Expect = 0.063
 Identities = 29/127 (22%), Positives = 48/127 (37%), Gaps = 10/127 (7%)

Query: 203 TVLLPTYNEKENL--PIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSE 260
           ++L+P +NE  N    I   L   Y +      E+I I+DGS D T      L +     
Sbjct: 78  SILVPCFNEGLNARETIHAALAQTYTN-----IEVIAINDGSSDDTAQVLDALLAEDPRL 132

Query: 261 KIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIK-LQQQENLDVVT 319
           +++        G   A   G   A   +++ +D D       +P ++  L        VT
Sbjct: 133 RVI--HLAHNQGKAIALRMGAAAARSEYLVCIDGDALLDKNAVPYLVAPLIANPRTGAVT 190

Query: 320 GTRYVGT 326
           G   + T
Sbjct: 191 GNPRIRT 197



 Score = 30.0 bits (67), Expect = 3.0
 Identities = 21/94 (22%), Positives = 35/94 (37%), Gaps = 3/94 (3%)

Query: 9   EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMD 68
           E+I I+DGS D T      L +     +++        G   A   G   A   +++ +D
Sbjct: 106 EVIAINDGSSDDTAQVLDALLAEDPRLRVI--HLAHNQGKAIALRMGAAAARSEYLVCID 163

Query: 69  ADLSHHPKFIPEMIK-LQQQENLDVVTGTRYVGT 101
            D       +P ++  L        VTG   + T
Sbjct: 164 GDALLDKNAVPYLVAPLIANPRTGAVTGNPRIRT 197


>gnl|CDD|132155 TIGR03111, glyc2_xrt_Gpos1, putative glycosyltransferase,
           exosortase G-associated.  Members of this protein family
           are probable glycosyltransferases of family 2, whose
           genes are near those for the exosortase homolog XrtG
           (TIGR03110), which is restricted to Gram-positive
           bacteria. Other genes in the conserved gene neighborhood
           include a 6-pyruvoyl tetrahydropterin synthase homolog
           (TIGR03112) and an uncharacterized intergral membrane
           protein (TIGR03766).
          Length = 439

 Score = 34.7 bits (80), Expect = 0.097
 Identities = 26/123 (21%), Positives = 54/123 (43%), Gaps = 10/123 (8%)

Query: 203 TVLLPTYNEKENLPIIVYLITKYMDEGNYPYE---IIVIDDGSPDGTLDAAKQLQSIYGS 259
           T+++P YN ++ L   +    + +    YP E   II+ ++ S D +     + Q+ +  
Sbjct: 52  TIIIPVYNSEDTLFNCI----ESIYNQTYPIELIDIILANNQSTDDSFQVFCRAQNEF-- 105

Query: 260 EKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMI-KLQQQENLDVV 318
             + L+      G   A    +  + G +II +D+D   H   I  M+ + +   ++  +
Sbjct: 106 PGLSLRYMNSDQGKAKALNAAIYNSIGKYIIHIDSDGKLHKDAIKNMVTRFENNPDIHAM 165

Query: 319 TGT 321
           TG 
Sbjct: 166 TGV 168



 Score = 29.3 bits (66), Expect = 4.3
 Identities = 21/100 (21%), Positives = 42/100 (42%), Gaps = 6/100 (6%)

Query: 1   MDEGNYPYE---IIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLK 57
           +    YP E   II+ ++ S D +     + Q+ +    + L+      G   A    + 
Sbjct: 71  IYNQTYPIELIDIILANNQSTDDSFQVFCRAQNEF--PGLSLRYMNSDQGKAKALNAAIY 128

Query: 58  YATGNFIIIMDADLSHHPKFIPEMI-KLQQQENLDVVTGT 96
            + G +II +D+D   H   I  M+ + +   ++  +TG 
Sbjct: 129 NSIGKYIIHIDSDGKLHKDAIKNMVTRFENNPDIHAMTGV 168


>gnl|CDD|222329 pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2.
          Members of this family of prokaryotic proteins include
          putative glucosyltransferases,.
          Length = 97

 Score = 32.2 bits (74), Expect = 0.10
 Identities = 14/66 (21%), Positives = 33/66 (50%), Gaps = 9/66 (13%)

Query: 9  EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRK--KKLGLGTAYMHGL--KYATGNFI 64
            +++D+ S DGT +    L  +      V++     +       +++ L  +YA G+++
Sbjct: 21 HFLIVDNDSDDGTAEILAALPDV-----GVIRTDLSYRDARRQVDWLNALIRRYADGDWV 75

Query: 65 IIMDAD 70
          +++DAD
Sbjct: 76 LVLDAD 81



 Score = 32.2 bits (74), Expect = 0.10
 Identities = 14/66 (21%), Positives = 33/66 (50%), Gaps = 9/66 (13%)

Query: 234 EIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRK--KKLGLGTAYMHGL--KYATGNFI 289
             +++D+ S DGT +    L  +      V++     +       +++ L  +YA G+++
Sbjct: 21  HFLIVDNDSDDGTAEILAALPDV-----GVIRTDLSYRDARRQVDWLNALIRRYADGDWV 75

Query: 290 IIMDAD 295
           +++DAD
Sbjct: 76  LVLDAD 81


>gnl|CDD|133012 cd02520, Glucosylceramide_synthase, Glucosylceramide synthase
           catalyzes the first glycosylation step of
           glycosphingolipid synthesis.
           UDP-glucose:N-acylsphingosine D-glucosyltransferase
           (glucosylceramide synthase or ceramide
           glucosyltransferase) catalyzes the first glycosylation
           step of glycosphingolipid synthesis. Its product,
           glucosylceramide, serves as the core of more than 300
           glycosphingolipids (GSL). GSLs are a group of membrane
           components that have the lipid portion embedded in the
           outer plasma membrane leaflet and the sugar chains
           extended to the outer environment. Several lines of
           evidence suggest the importance of GSLs in various
           cellular processes such as differentiation, adhesion,
           proliferation, and cell-cell recognition. In pathogenic
           fungus Cryptococcus neoformans,  glucosylceramide serves
           as an antigen that elicits an antibody response in
           patients and it is essential for fungal growth in host
           extracellular environment.
          Length = 196

 Score = 33.7 bits (78), Expect = 0.12
 Identities = 23/100 (23%), Positives = 43/100 (43%), Gaps = 14/100 (14%)

Query: 5   NYP-YEII-VIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLG-------LGTAYMHG 55
           +YP YEI+  + D   D  +   ++L + Y +    L    +K+G       L   Y   
Sbjct: 27  DYPKYEILFCVQDED-DPAIPVVRKLIAKYPNVDARLLIGGEKVGINPKVNNLIKGYEE- 84

Query: 56  LKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTG 95
              A  + ++I D+D+S  P ++  M+       + +VT 
Sbjct: 85  ---ARYDILVISDSDISVPPDYLRRMVAPLMDPGVGLVTC 121



 Score = 33.7 bits (78), Expect = 0.12
 Identities = 23/100 (23%), Positives = 43/100 (43%), Gaps = 14/100 (14%)

Query: 230 NYP-YEII-VIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLG-------LGTAYMHG 280
           +YP YEI+  + D   D  +   ++L + Y +    L    +K+G       L   Y   
Sbjct: 27  DYPKYEILFCVQDED-DPAIPVVRKLIAKYPNVDARLLIGGEKVGINPKVNNLIKGYEE- 84

Query: 281 LKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTG 320
              A  + ++I D+D+S  P ++  M+       + +VT 
Sbjct: 85  ---ARYDILVISDSDISVPPDYLRRMVAPLMDPGVGLVTC 121


>gnl|CDD|133060 cd06438, EpsO_like, EpsO protein participates in the methanolan
           synthesis.  The Methylobacillus sp EpsO protein is
           predicted to participate in the methanolan synthesis.
           Methanolan is an exopolysaccharide (EPS), composed of
           glucose, mannose and galactose.  A 21 genes cluster was
           predicted to participate in the methanolan synthesis.
           Gene disruption analysis revealed that EpsO is one of
           the glycosyltransferase enzymes involved in the
           synthesis of repeating sugar units onto the lipid
           carrier.
          Length = 183

 Score = 33.0 bits (76), Expect = 0.19
 Identities = 28/130 (21%), Positives = 49/130 (37%), Gaps = 24/130 (18%)

Query: 204 VLLPTYNEKENLPIIVYLIT--KYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEK 261
           +L+P +NE+    +I   +   K  D     Y I V+ D   D T   A+       +  
Sbjct: 1   ILIPAHNEEA---VIGNTVRSLKAQDYPRELYRIFVVADNCTDDTAQVARA------AGA 51

Query: 262 IVLK---PRKKKLGLGTAYMHGLKYATG-----NFIIIMDADLSHHPKFIPEM-IKLQQQ 312
            VL+   P ++  G G A   G ++        + +++ DAD    P  + E+  +    
Sbjct: 52  TVLERHDPERR--GKGYALDFGFRHLLNLADDPDAVVVFDADNLVDPNALEELNARFAAG 109

Query: 313 ENLDVVTGTR 322
               VV    
Sbjct: 110 A--RVVQAYY 117



 Score = 31.8 bits (73), Expect = 0.43
 Identities = 23/105 (21%), Positives = 39/105 (37%), Gaps = 22/105 (20%)

Query: 5   NYP---YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLK---PRKKKLGLGTAYMHGLKY 58
           +YP   Y I V+ D   D T   A+       +   VL+   P ++  G G A   G ++
Sbjct: 23  DYPRELYRIFVVADNCTDDTAQVARA------AGATVLERHDPERR--GKGYALDFGFRH 74

Query: 59  ATG-----NFIIIMDADLSHHPKFIPEM-IKLQQQENLDVVTGTR 97
                   + +++ DAD    P  + E+  +        VV    
Sbjct: 75  LLNLADDPDAVVVFDADNLVDPNALEELNARFAAGA--RVVQAYY 117


>gnl|CDD|133038 cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in
           exopolysaccharide amylovora biosynthesis.  AmsE is a
           glycosyltransferase involved in exopolysaccharide
           amylovora biosynthesis in Erwinia amylovora. Amylovara
           is one of the three exopolysaccharide produced by E.
           amylovora. Amylovara-deficient mutants are
           non-pathogenic. It is a subfamily of Glycosyltransferase
           Family GT2, which includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 201

 Score = 33.1 bits (76), Expect = 0.19
 Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 5/92 (5%)

Query: 7   PYEIIVIDDGS-PDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFII 65
           P E++++ DG       +  ++ +      K+V     K  GLG A   GLK+ T +++ 
Sbjct: 29  PDEVVLVKDGPVTQSLNEVLEEFKRKLPL-KVVPLE--KNRGLGKALNEGLKHCTYDWVA 85

Query: 66  IMDADLSHHPKFIPEMIK-LQQQENLDVVTGT 96
            MD D    P    + +  +++   +D+V G 
Sbjct: 86  RMDTDDISLPDRFEKQLDFIEKNPEIDIVGGG 117



 Score = 33.1 bits (76), Expect = 0.19
 Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 5/92 (5%)

Query: 232 PYEIIVIDDGS-PDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFII 290
           P E++++ DG       +  ++ +      K+V     K  GLG A   GLK+ T +++ 
Sbjct: 29  PDEVVLVKDGPVTQSLNEVLEEFKRKLPL-KVVPLE--KNRGLGKALNEGLKHCTYDWVA 85

Query: 291 IMDADLSHHPKFIPEMIK-LQQQENLDVVTGT 321
            MD D    P    + +  +++   +D+V G 
Sbjct: 86  RMDTDDISLPDRFEKQLDFIEKNPEIDIVGGG 117


>gnl|CDD|237453 PRK13633, PRK13633, cobalt transporter ATP-binding subunit;
           Provisional.
          Length = 280

 Score = 32.8 bits (75), Expect = 0.30
 Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 2/33 (6%)

Query: 210 NEKENLPIIVYLITKYMDEGNYPYEIIVIDDGS 242
           N+K  + II  LIT YM+E      IIV+D G 
Sbjct: 191 NKKYGITII--LITHYMEEAVEADRIIVMDSGK 221


>gnl|CDD|133027 cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins
           are required for O-antigen biosynthesis.  The rfbC gene
           encodes a predicted protein of 1,276 amino acids, which
           is required for O-antigen biosynthesis in Myxococcus
           xanthus. It is a subfamily of Glycosyltransferase Family
           GT2, which includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. These are enzymes that catalyze the
           transfer of sugar moieties from activated donor
           molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 202

 Score = 31.8 bits (73), Expect = 0.47
 Identities = 24/101 (23%), Positives = 41/101 (40%), Gaps = 21/101 (20%)

Query: 204 VLLPTYNEKENLPIIVYL-------ITKYMDEGNYPY-EIIVIDDGSPD-GTLDAAKQLQ 254
           +++P YN         YL         +      YP  E+ + DD S D       K+  
Sbjct: 5   IVMPVYN-----TPEKYLREAIESVRAQT-----YPNWELCIADDASTDPEVKRVLKKYA 54

Query: 255 SIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDAD 295
           +     +I +  R++  G+  A    L+ ATG F+ ++D D
Sbjct: 55  AQ--DPRIKVVFREENGGISAATNSALELATGEFVALLDHD 93



 Score = 30.6 bits (70), Expect = 1.2
 Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 6  YPY-EIIVIDDGSPD-GTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNF 63
          YP  E+ + DD S D       K+  +     +I +  R++  G+  A    L+ ATG F
Sbjct: 29 YPNWELCIADDASTDPEVKRVLKKYAAQ--DPRIKVVFREENGGISAATNSALELATGEF 86

Query: 64 IIIMDAD 70
          + ++D D
Sbjct: 87 VALLDHD 93


>gnl|CDD|188524 TIGR04009, wcaE, colanic acid biosynthesis glycosyl transferase
           WcaE.  This gene is one of the glycosyl transferases
           involved in the biosynthesis of colanic acid, an
           exopolysaccharide expressed in Enterobacteraceae
           species.
          Length = 248

 Score = 30.6 bits (69), Expect = 1.2
 Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 4/88 (4%)

Query: 8   YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 67
           +E IV+D GS DGT +  ++L   +    +  K      G+  A   G+  A G + + +
Sbjct: 34  FEWIVVDGGSNDGTAEFLEKLNGEFNLRFVSEKDN----GIYDAMNKGIAMAQGRYTLFL 89

Query: 68  DADLSHHPKFIPEMIKLQQQENLDVVTG 95
           ++    H      + +L +Q++  +  G
Sbjct: 90  NSGDIFHEDVADFVRQLARQKDNAMYIG 117



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 4/88 (4%)

Query: 233 YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 292
           +E IV+D GS DGT +  ++L   +    +  K      G+  A   G+  A G + + +
Sbjct: 34  FEWIVVDGGSNDGTAEFLEKLNGEFNLRFVSEKDN----GIYDAMNKGIAMAQGRYTLFL 89

Query: 293 DADLSHHPKFIPEMIKLQQQENLDVVTG 320
           ++    H      + +L +Q++  +  G
Sbjct: 90  NSGDIFHEDVADFVRQLARQKDNAMYIG 117


>gnl|CDD|130619 TIGR01556, rhamnosyltran, L-rhamnosyltransferase.  This model
           subfamily is comprised of gamma proteobacteria whose
           proteins function as L-rhamnosyltransferases in the
           synthesis of their respective surface polysaccharides.
           Rhamnolipids are glycolipids containing mono- or di-
           L-rhamnose molecules. Rhamnolipid synthesis occurs by
           sequential glycosyltransferase reactions involving two
           distinct rhamnosyltransferase enzymes. In P.aeruginosa,
           the synthesis of mono-rhamnolipids is catalyzed by
           rhamnosyltransferase 1, and proceeds by a
           glycosyltransfer reaction catalyzed by
           rhamnosyltransferase 2 to yield di-rhamnolipids [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 281

 Score = 30.9 bits (70), Expect = 1.2
 Identities = 23/114 (20%), Positives = 39/114 (34%), Gaps = 12/114 (10%)

Query: 208 TYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPR 267
           T+N   +L  +  LIT    + +    II +D+              +    +KI L   
Sbjct: 2   TFNP--DLEHLGELITSLPKQVDR---IIAVDNSP----HSDQPLKNARLRGQKIALIHL 52

Query: 268 KKKLGLGTAYMHGLKYATGN---FIIIMDADLSHHPKFIPEMIKLQQQENLDVV 318
               G+  A   GL  +       ++++D D      F+    KL   EN    
Sbjct: 53  GDNQGIAGAQNQGLDASFRRGVQGVLLLDQDSRPGNAFLAAQWKLLSAENGQAC 106



 Score = 30.9 bits (70), Expect = 1.2
 Identities = 18/86 (20%), Positives = 27/86 (31%), Gaps = 6/86 (6%)

Query: 11  IVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGN---FIIIM 67
           I+  D SP                +KI L       G+  A   GL  +       ++++
Sbjct: 24  IIAVDNSPHSDQPLKNARLRG---QKIALIHLGDNQGIAGAQNQGLDASFRRGVQGVLLL 80

Query: 68  DADLSHHPKFIPEMIKLQQQENLDVV 93
           D D      F+    KL   EN    
Sbjct: 81  DQDSRPGNAFLAAQWKLLSAENGQAC 106


>gnl|CDD|222336 pfam13712, Glyco_tranf_2_5, Glycosyltransferase like family.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferases, which are involved in
           bacterial capsule biosynthesis.
          Length = 213

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 13/93 (13%), Positives = 39/93 (41%), Gaps = 7/93 (7%)

Query: 26  KQLQSIYG--SEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHH-PKFIPEMI 82
           + ++ +     E I +   +    + +AY   +  A   + + +  D+      F+ +++
Sbjct: 18  RSIRRLKVPPGELIAIDNIRNAKSMASAYNEAISRAKAKYKVFIHQDVFLINRDFLYDLL 77

Query: 83  -KLQQQENLDV--VTGTRYVGTGGVYGWDFKRK 112
              ++   L +  V G++++   G+  W+ K  
Sbjct: 78  ELFKRDPKLGMIGVAGSQFLPPNGI-WWEGKNL 109



 Score = 30.7 bits (70), Expect = 1.2
 Identities = 13/93 (13%), Positives = 39/93 (41%), Gaps = 7/93 (7%)

Query: 251 KQLQSIYG--SEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHH-PKFIPEMI 307
           + ++ +     E I +   +    + +AY   +  A   + + +  D+      F+ +++
Sbjct: 18  RSIRRLKVPPGELIAIDNIRNAKSMASAYNEAISRAKAKYKVFIHQDVFLINRDFLYDLL 77

Query: 308 -KLQQQENLDV--VTGTRYVGTGGVYGWDFKRK 337
              ++   L +  V G++++   G+  W+ K  
Sbjct: 78  ELFKRDPKLGMIGVAGSQFLPPNGI-WWEGKNL 109


>gnl|CDD|182218 PRK10063, PRK10063, putative glycosyl transferase; Provisional.
          Length = 248

 Score = 30.5 bits (69), Expect = 1.4
 Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 6/105 (5%)

Query: 8   YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 67
           +E IV+D GS DGT +  + L  I+ + + V +P     G+  A   G+  A G F + +
Sbjct: 34  FEWIVVDGGSNDGTREFLENLNGIF-NLRFVSEPDN---GIYDAMNKGIAMAQGRFALFL 89

Query: 68  DADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRK 112
           ++    H      + +L+ Q++  ++ G   +  G   G   KR 
Sbjct: 90  NSGDIFHQDAANFVRQLKMQKDNAMIIGDALLDFGD--GHKIKRS 132



 Score = 30.5 bits (69), Expect = 1.4
 Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 6/105 (5%)

Query: 233 YEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIM 292
           +E IV+D GS DGT +  + L  I+ + + V +P     G+  A   G+  A G F + +
Sbjct: 34  FEWIVVDGGSNDGTREFLENLNGIF-NLRFVSEPDN---GIYDAMNKGIAMAQGRFALFL 89

Query: 293 DADLSHHPKFIPEMIKLQQQENLDVVTGTRYVGTGGVYGWDFKRK 337
           ++    H      + +L+ Q++  ++ G   +  G   G   KR 
Sbjct: 90  NSGDIFHQDAANFVRQLKMQKDNAMIIGDALLDFGD--GHKIKRS 132


>gnl|CDD|222183 pfam13506, Glyco_transf_21, Glycosyl transferase family 21.  This
           is a family of ceramide beta-glucosyltransferases -
           EC:2.4.1.80.
          Length = 171

 Score = 29.1 bits (66), Expect = 2.8
 Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 10/98 (10%)

Query: 51  AYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGT-RYVGTGGVYGWDF 109
           A     +Y   + ++I D+D+   P ++ E++       + +VTG        G      
Sbjct: 25  ALDAKARY---DLLVISDSDVRVPPDYLRELLAPLADPKVGLVTGPPYGADRRGFAA--- 78

Query: 110 KRKLVSRGANYLTQLLLR-PGVSDLTGSFRLYKKQVLE 146
              L +   N L  +L    G+    G     +++ LE
Sbjct: 79  --ALEAAFLNTLAGVLQALAGLGFAVGMSMALRREALE 114



 Score = 29.1 bits (66), Expect = 2.8
 Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 10/98 (10%)

Query: 276 AYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKLQQQENLDVVTGT-RYVGTGGVYGWDF 334
           A     +Y   + ++I D+D+   P ++ E++       + +VTG        G      
Sbjct: 25  ALDAKARY---DLLVISDSDVRVPPDYLRELLAPLADPKVGLVTGPPYGADRRGFAA--- 78

Query: 335 KRKLVSRGANYLTQLLLR-PGVSDLTGSFRLYKKQVLE 371
              L +   N L  +L    G+    G     +++ LE
Sbjct: 79  --ALEAAFLNTLAGVLQALAGLGFAVGMSMALRREALE 114


>gnl|CDD|133058 cd06436, GlcNAc-1-P_transferase, N-acetyl-glucosamine transferase
           is involved in the synthesis of
           Poly-beta-1,6-N-acetyl-D-glucosamine.
           N-acetyl-glucosamine transferase is responsible for the
           synthesis of bacteria
           Poly-beta-1,6-N-acetyl-D-glucosamine (PGA).
           Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer
           that serves as an adhesion for the maintenance of
           biofilm structural stability in diverse eubacteria.
           N-acetyl-glucosamine transferase is the product of gene
           pgaC. Genetic analysis indicated that all four genes of
           the pgaABCD locus were required for the PGA production,
           pgaC being a glycosyltransferase.
          Length = 191

 Score = 29.3 bits (66), Expect = 3.1
 Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPDGTLDAAK 251
           VL+P  NE+  +   +  + +  ++ N+   ++VIDD S D T    +
Sbjct: 1   VLVPCLNEEAVIQRTLASLLR--NKPNF--LVLVIDDASDDDTAGIVR 44



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 7/17 (41%), Positives = 10/17 (58%)

Query: 10 IIVIDDGSPDGTLDAAK 26
          ++VIDD S D T    +
Sbjct: 28 VLVIDDASDDDTAGIVR 44


>gnl|CDD|224680 COG1766, fliF, Flagellar basal body M-ring protein [Cell motility
           and secretion].
          Length = 545

 Score = 29.7 bits (67), Expect = 3.9
 Identities = 16/74 (21%), Positives = 28/74 (37%), Gaps = 13/74 (17%)

Query: 185 VVFTTQAIMSGDSVKNKYTVLLPTYNEKENLPIIVYLITKYMDEGNYPYEIIVIDDGSPD 244
            +      +   S + KY  L    +E++   I+  L     D+ N PY++      S D
Sbjct: 33  ALVAVLVALLLWSRQPKYRTLYSNLSEEDAGRIVQEL-----DQMNIPYKV------SSD 81

Query: 245 GT--LDAAKQLQSI 256
           G   L    Q+  +
Sbjct: 82  GGTILVPEDQVYEL 95


>gnl|CDD|236918 PRK11498, bcsA, cellulose synthase catalytic subunit; Provisional.
          Length = 852

 Score = 29.2 bits (66), Expect = 4.7
 Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 16/101 (15%)

Query: 204 VLLPTYNEKENLPIIVYLITKYMDEG-NYPYE---IIVIDDGSPDGTLDAAKQLQSIYGS 259
           + +PTYNE     + V   T Y   G ++P +   I ++DDG  +     A+++    G 
Sbjct: 264 IFVPTYNE----DLNVVKNTIYASLGIDWPKDKLNIWILDDGGREEFRQFAQEV----GV 315

Query: 260 EKIVLKPRKKKLGLGTAYMHGLKYATGNFIIIMDADLSHHP 300
           + I  +P  +    G    + LKYA G F+ I D D  H P
Sbjct: 316 KYIA-RPTHEHAKAGNIN-NALKYAKGEFVAIFDCD--HVP 352


>gnl|CDD|173391 PTZ00098, PTZ00098, phosphoethanolamine N-methyltransferase;
           Provisional.
          Length = 263

 Score = 28.8 bits (64), Expect = 5.7
 Identities = 19/60 (31%), Positives = 27/60 (45%), Gaps = 5/60 (8%)

Query: 224 KYMDEGNYPYEIIVIDDGSPDGTLDAAKQLQSIYGSEKIVLKPRKKKLGLGTAYMHGLKY 283
           +Y DEG   YE I  +D    G ++A  ++ S      I L    K L +G+    G KY
Sbjct: 15  QYSDEGIKAYEFIFGEDYISSGGIEATTKILS-----DIELNENSKVLDIGSGLGGGCKY 69


>gnl|CDD|171849 PRK13033, PRK13033, formyl peptide receptor-like 1 inhibitory
           protein; Reviewed.
          Length = 133

 Score = 27.7 bits (61), Expect = 7.4
 Identities = 12/38 (31%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 219 VYLITKYMDEGNYPYEIIVIDDGSPDG--TLDAAKQLQ 254
           +YLI K + +G+    ++ I DG P    T D  + L+
Sbjct: 76  LYLIVKKLSQGDVKKAVVRIKDGGPRDYYTFDLTRPLE 113


>gnl|CDD|236319 PRK08641, sdhA, succinate dehydrogenase flavoprotein subunit;
           Reviewed.
          Length = 589

 Score = 28.4 bits (64), Expect = 8.6
 Identities = 14/36 (38%), Positives = 14/36 (38%), Gaps = 7/36 (19%)

Query: 49  GTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKL 84
              Y  G  YA G FI I       HP  IP   KL
Sbjct: 218 SRVYQQGAYYANGEFIQI-------HPTAIPGDDKL 246



 Score = 28.4 bits (64), Expect = 8.6
 Identities = 14/36 (38%), Positives = 14/36 (38%), Gaps = 7/36 (19%)

Query: 274 GTAYMHGLKYATGNFIIIMDADLSHHPKFIPEMIKL 309
              Y  G  YA G FI I       HP  IP   KL
Sbjct: 218 SRVYQQGAYYANGEFIQI-------HPTAIPGDDKL 246


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.139    0.403 

Gapped
Lambda     K      H
   0.267   0.0717    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,932,188
Number of extensions: 2300865
Number of successful extensions: 2024
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1928
Number of HSP's successfully gapped: 127
Length of query: 435
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 335
Effective length of database: 6,502,202
Effective search space: 2178237670
Effective search space used: 2178237670
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.9 bits)