RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7267
         (569 letters)



>gnl|CDD|133031 cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked
           glycosylation.  UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate.
          Length = 211

 Score =  321 bits (824), Expect = e-107
 Identities = 119/230 (51%), Positives = 147/230 (63%), Gaps = 19/230 (8%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKCG 153
           V++PAYNE+ RL P L+E +E+L ER    P+F YEIIVV DGS D T +V  +   K  
Sbjct: 1   VVIPAYNEEKRLPPTLEEAVEYLEER----PSFSYEIIVVDDGSKDGTAEVARKLARKNP 56

Query: 154 EDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDGDY 213
             I R L L KNRGKGGAV  G   ARG  ILFADAD AT F +LEKLE+ LK      Y
Sbjct: 57  ALI-RVLTLPKNRGKGGAVRAGMLAARGDYILFADADLATPFEELEKLEEALKT---SGY 112

Query: 214 IEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIVWFTGVRTIRDTQCGFK 273
                       + +GSRAHL + A V+RS+ RN+L +GF+F+V       I+DTQCGFK
Sbjct: 113 -----------DIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVRLLLGLGIKDTQCGFK 161

Query: 274 LFTRKSALQLFSSIHVQRWAFDVELLFIAEVLHIPMAEVSVNWTEIEGSK 323
           LFTR +A +LF  +H++RWAFDVELL +A  L  P+ EV V W EI GSK
Sbjct: 162 LFTRDAARRLFPRLHLERWAFDVELLVLARRLGYPIEEVPVRWVEIPGSK 211


>gnl|CDD|240336 PTZ00260, PTZ00260, dolichyl-phosphate beta-glucosyltransferase;
           Provisional.
          Length = 333

 Score =  322 bits (828), Expect = e-106
 Identities = 144/345 (41%), Positives = 204/345 (59%), Gaps = 39/345 (11%)

Query: 24  PILTEFEIH-CYIALFLVCGI--LSYAYRSFKTKNPFPDIDFLSKGERVFLDVKTGQELE 80
            I      H   I L LV G+  L Y Y S+               ++V   VK+    E
Sbjct: 3   NIWKVIFKHRMLIVLGLVVGLALLFYPYISWPDD------------DKVIRQVKSSVIHE 50

Query: 81  FPSLDD----------EPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEI 130
                D          +  V+LS+++PAYNE+DRL  ML ETI++L  R +K P FKYEI
Sbjct: 51  KSKEVDKENYINNILKDSDVDLSIVIPAYNEEDRLPKMLKETIKYLESRSRKDPKFKYEI 110

Query: 131 IVVSDGSTDKTMQVVHQY-TEKCGEDI-VRGLKLLKNRGKGGAVTLGTKCARGSIILFAD 188
           I+V+DGS DKT++V   +  +    +I +R L LL+N+GKGGAV +G   +RG  IL  D
Sbjct: 111 IIVNDGSKDKTLKVAKDFWRQNINPNIDIRLLSLLRNKGKGGAVRIGMLASRGKYILMVD 170

Query: 189 ADGATKFADLEKLEDKLKELTDGDYIEDKNNAAGCNGVIVGSRAHL-EALANVQRSFFRN 247
           ADGAT   D +KLED + ++                G++ GSR HL ++    +R ++RN
Sbjct: 171 ADGATDIDDFDKLEDIMLKIEQNGL-----------GIVFGSRNHLVDSDVVAKRKWYRN 219

Query: 248 ILMKGFHFIVWFTGVRTIRDTQCGFKLFTRKSALQLFSSIHVQRWAFDVELLFIAEVLHI 307
           ILM GFHFIV       ++DTQCGFKLFTR++A  +F S+H++RWAFD+E++ IA+ L++
Sbjct: 220 ILMYGFHFIVNTICGTNLKDTQCGFKLFTRETARIIFPSLHLERWAFDIEIVMIAQKLNL 279

Query: 308 PMAEVSVNWTEIEGSKIVPVFSWIQMGWDVLNIWLHYTLGLWKIK 352
           P+AEV VNWTE+EGSK+  + + IQM  D+L +   Y LG+WK+K
Sbjct: 280 PIAEVPVNWTEVEGSKLNVISASIQMARDILLVRSFYLLGIWKVK 324


>gnl|CDD|133022 cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member
           of the Glycosyltransferase 2 superfamily.  DPM1 is the
           catalytic subunit of eukaryotic dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. In higher eukaryotes,the enzyme has three
           subunits, DPM1, DPM2 and DPM3. DPM is synthesized from
           dolichol phosphate and GDP-Man on the cytosolic surface
           of the ER membrane by DPM synthase and then is flipped
           onto the luminal side and used as a donor substrate. In
           lower eukaryotes, such as Saccharomyces cerevisiae and
           Trypanosoma brucei, DPM synthase consists of a single
           component (Dpm1p and TbDpm1, respectively) that
           possesses one predicted transmembrane region near the C
           terminus for anchoring to the ER membrane. In contrast,
           the Dpm1 homologues of higher eukaryotes, namely fission
           yeast, fungi, and animals, have no transmembrane region,
           suggesting the existence of adapter molecules for
           membrane anchoring. This family also includes bacteria
           and archaea DPM1_like enzymes. However, the enzyme
           structure and mechanism of function are not well
           understood. The UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate. This protein family belongs to
           Glycosyltransferase 2 superfamily.
          Length = 185

 Score =  175 bits (447), Expect = 4e-52
 Identities = 69/209 (33%), Positives = 102/209 (48%), Gaps = 24/209 (11%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKCG 153
           V++PAYNE++ +  +++  +  L E       + YEIIVV DGSTD T ++  +   +  
Sbjct: 1   VVIPAYNEEENIPELVERLLAVLEE------GYDYEIIVVDDGSTDGTAEIARELAARVP 54

Query: 154 EDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDGDY 213
              VR ++L +N GKG AV  G K ARG I++  DAD      D+ KL +KL E   G  
Sbjct: 55  R--VRVIRLSRNFGKGAAVRAGFKAARGDIVVTMDADLQHPPEDIPKLLEKLLE---GGA 109

Query: 214 IEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIVWFTGVRTIRDTQCGFK 273
             D         V++GSR              R +  + F+F++       I DTQ GF+
Sbjct: 110 --D---------VVIGSRFVRGG--GAGMPLLRRLGSRLFNFLIRLLLGVRISDTQSGFR 156

Query: 274 LFTRKSALQLFSSIHVQRWAFDVELLFIA 302
           LF R+    L S +    + F +ELL  A
Sbjct: 157 LFRREVLEALLSLLESNGFEFGLELLVGA 185


>gnl|CDD|215980 pfam00535, Glycos_transf_2, Glycosyl transferase family 2.  Diverse
           family, transferring sugar from UDP-glucose,
           UDP-N-acetyl- galactosamine, GDP-mannose or
           CDP-abequose, to a range of substrates including
           cellulose, dolichol phosphate and teichoic acids.
          Length = 168

 Score = 88.3 bits (219), Expect = 2e-20
 Identities = 54/192 (28%), Positives = 84/192 (43%), Gaps = 24/192 (12%)

Query: 93  SVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKC 152
           SVI+P YNE+  L+    ET+E L  +  K     +EIIVV DGSTD T+++  +Y +  
Sbjct: 1   SVIIPTYNEEKYLE----ETLESLLNQTYK----NFEIIVVDDGSTDGTVEIAEEYAKN- 51

Query: 153 GEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDGD 212
            +  VR ++L +N GK  A   G K A G  I F DAD       LEKL + L++     
Sbjct: 52  -DPRVRVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVAPDWLEKLVELLEK----- 105

Query: 213 YIEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIVWFTGVRTIRDTQCGF 272
                        +++GSR  +     +     R  L+     +   +    +       
Sbjct: 106 ---------NGADIVIGSRVVINGETRLYGRALRFELLLLLGKLGARSLGLKVLFLIGSN 156

Query: 273 KLFTRKSALQLF 284
            L+ R+   +L 
Sbjct: 157 ALYRREVLEELL 168


>gnl|CDD|133062 cd06442, DPM1_like, DPM1_like represents putative enzymes similar
           to eukaryotic DPM1.  Proteins similar to eukaryotic
           DPM1, including enzymes from bacteria and archaea; DPM1
           is the catalytic subunit of eukaryotic
           dolichol-phosphate mannose (DPM) synthase. DPM synthase
           is required for synthesis of the
           glycosylphosphatidylinositol (GPI) anchor, N-glycan
           precursor, protein O-mannose, and C-mannose. In higher
           eukaryotes,the enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. In lower eukaryotes, such
           as Saccharomyces cerevisiae and Trypanosoma brucei, DPM
           synthase consists of a single component (Dpm1p and
           TbDpm1, respectively) that possesses one predicted
           transmembrane region near the C terminus for anchoring
           to the ER membrane. In contrast, the Dpm1 homologues of
           higher eukaryotes, namely fission yeast, fungi, and
           animals, have no transmembrane region, suggesting the
           existence of adapter molecules for membrane anchoring.
           This family also includes bacteria and archaea DPM1_like
           enzymes. However, the enzyme structure and mechanism of
           function are not well understood. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 224

 Score = 86.8 bits (216), Expect = 2e-19
 Identities = 60/235 (25%), Positives = 103/235 (43%), Gaps = 30/235 (12%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKCG 153
           +I+P YNE++ +  +++     L           YEIIVV D S D T ++V +  ++  
Sbjct: 1   IIIPTYNERENIPELIERLDAALKG-------IDYEIIVVDDNSPDGTAEIVRELAKEYP 53

Query: 154 ED--IVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDG 211
               IVR       RG G A   G K ARG +I+  DAD +     + +L +   E    
Sbjct: 54  RVRLIVRP----GKRGLGSAYIEGFKAARGDVIVVMDADLSHPPEYIPELLEAQLE-GGA 108

Query: 212 DYIEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIVWFTGVRTIRDTQCG 271
           D             +++GSR ++E          R ++ +G + +      R + D   G
Sbjct: 109 D-------------LVIGSR-YVEGGGVEGWGLKRKLISRGANLLARLLLGRKVSDPTSG 154

Query: 272 FKLFTRKSALQLFSSIHVQRWAFDVELLFIAEVLHIPMAEVSVNWTE-IEG-SKI 324
           F+ + R+   +L  S+  + + F +ELL  A  L   + EV + + +   G SK+
Sbjct: 155 FRAYRREVLEKLIDSLVSKGYKFQLELLVRARRLGYRIVEVPITFVDREHGESKL 209


>gnl|CDD|133030 cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to
           eukaryotic DPM1.  A family of  bacterial enzymes related
           to eukaryotic DPM1; Although the mechanism of eukaryotic
           enzyme is well studied, the mechanism of the  bacterial
           enzymes is not well understood. The eukaryotic DPM1 is
           the catalytic subunit of eukaryotic Dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. The enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 181

 Score = 84.5 bits (210), Expect = 5e-19
 Identities = 46/186 (24%), Positives = 83/186 (44%), Gaps = 30/186 (16%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKCG 153
           ++VP YNE++ L  + +     L         + YEII V DGSTD+T++++ +   +  
Sbjct: 1   IVVPVYNEEENLPELYERLKAVLES-----LGYDYEIIFVDDGSTDRTLEILRELAARDP 55

Query: 154 EDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDGDY 213
              V+ ++L +N G+  A+  G   ARG  ++  DAD       + ++  K +E  D   
Sbjct: 56  R--VKVIRLSRNFGQQAALLAGLDHARGDAVITMDADLQDPPELIPEMLAKWEEGYD--- 110

Query: 214 IEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIV-WFTGVRTIRDTQCGF 272
                       V+ G R       N + S+ + +  K F+ ++   +GV    +    F
Sbjct: 111 ------------VVYGVR------KNRKESWLKRLTSKLFYRLINKLSGVDIPDNG-GDF 151

Query: 273 KLFTRK 278
           +L  RK
Sbjct: 152 RLMDRK 157


>gnl|CDD|132997 cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A)
           includes diverse families of glycosyl transferases with
           a common GT-A type structural fold.
           Glycosyltransferases (GTs) are enzymes that synthesize
           oligosaccharides, polysaccharides, and glycoconjugates
           by transferring the sugar moiety from an activated
           nucleotide-sugar donor to an acceptor molecule, which
           may be a growing oligosaccharide, a lipid, or a protein.
            Based on the stereochemistry of the donor and acceptor
           molecules, GTs are classified as either retaining or
           inverting enzymes. To date, all GT structures adopt one
           of two possible folds, termed GT-A fold and GT-B fold.
           This hierarchy includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. The majority of the proteins in this
           superfamily are Glycosyltransferase family 2 (GT-2)
           proteins. But it also includes families GT-43, GT-6,
           GT-8, GT13 and GT-7; which are evolutionarily related to
           GT-2 and share structure similarities.
          Length = 156

 Score = 80.2 bits (198), Expect = 9e-18
 Identities = 51/173 (29%), Positives = 76/173 (43%), Gaps = 17/173 (9%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKCG 153
           VI+PAYNE    +P L+  +E L           +E+IVV DGSTD T++++ +Y +K  
Sbjct: 1   VIIPAYNE----EPYLERCLESL----LAQTYPNFEVIVVDDGSTDGTLEILEEYAKK-- 50

Query: 154 EDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDGDY 213
           +  V  +   +N+G   A   G K ARG  ILF DAD          L D L+ L     
Sbjct: 51  DPRVIRVINEENQGLAAARNAGLKAARGEYILFLDADDLL-------LPDWLERLVAELL 103

Query: 214 IEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIVWFTGVRTIR 266
            + + +A G  G ++  R  LE +     +         F   +   G    R
Sbjct: 104 ADPEADAVGGPGNLLFRRELLEEIGGFDEALLSGEEDDDFLLRLLRGGKVAFR 156


>gnl|CDD|223539 COG0463, WcaA, Glycosyltransferases involved in cell wall
           biogenesis [Cell envelope biogenesis, outer membrane].
          Length = 291

 Score = 81.3 bits (199), Expect = 7e-17
 Identities = 55/282 (19%), Positives = 99/282 (35%), Gaps = 24/282 (8%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           +SV++P YNE++ L   L+  +    +         +EIIVV DGSTD T ++  +Y  K
Sbjct: 5   VSVVIPTYNEEEYLPEALESLLNQTYK--------DFEIIVVDDGSTDGTTEIAIEYGAK 56

Query: 152 CGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDG 211
               I    +  +N G G A   G + ARG  I+F DAD         +L   +    DG
Sbjct: 57  DVRVIRLINE--RNGGLGAARNAGLEYARGDYIVFLDADDQHP----PELIPLVAAGGDG 110

Query: 212 DYIEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIVWFTGVRTIRDTQCG 271
           DYI   ++       I   R  L  L  +  +    +L+      +       +RD   G
Sbjct: 111 DYIARLDDR----DDIWLPRKLLSKLVKLLGNRLLGVLIPDGFGDLRL----LVRDAVDG 162

Query: 272 FKLFTRKSALQLFSSIHVQRWAFDVELLFIAEVLHIPMAEVSVNWTEIE--GSKIVPVFS 329
            + F    +  L   +          L ++     +  ++  +    +      +     
Sbjct: 163 LRAFLEGRSRFLRLLLRKLVLIRREVLEYLLLYRLLGASKRVLLGKLLRLGLLLLDSKLK 222

Query: 330 WIQMGWDVLNIWLHYTLGLWKIKAMENKYRAVYVPSQTSPVY 371
            + +   +L       L +  +  +    R +          
Sbjct: 223 AVPLDLLLLLGLFIALLSILLLLLLLLLRRLLGANLSEKGKP 264


>gnl|CDD|133045 cd06423, CESA_like, CESA_like is  the cellulose synthase
           superfamily.  The cellulose synthase (CESA) superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains. The
           members include cellulose synthase catalytic subunit,
           chitin synthase, glucan biosynthesis protein and other
           families of CESA-like proteins. Cellulose synthase
           catalyzes the polymerization reaction of cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues in  plants, most algae, some bacteria
           and fungi, and even some animals. In bacteria, algae and
           lower eukaryotes, there is a second unrelated type of
           cellulose synthase (Type II), which produces acylated
           cellulose, a derivative of cellulose. Chitin synthase
           catalyzes the incorporation of GlcNAc from substrate
           UDP-GlcNAc into chitin, which is a linear homopolymer of
           beta-(1,4)-linked GlcNAc residues and Glucan
           Biosynthesis protein catalyzes the elongation of
           beta-1,2 polyglucose chains of Glucan.
          Length = 180

 Score = 73.0 bits (180), Expect = 5e-15
 Identities = 34/98 (34%), Positives = 55/98 (56%), Gaps = 11/98 (11%)

Query: 94  VIVPAYNEQDRLKPMLDETIE-FLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKC 152
           +IVPAYNE+     +++ TIE  L     K      E+IVV DGSTD T++++ +     
Sbjct: 1   IIVPAYNEEA----VIERTIESLLALDYPK-----LEVIVVDDGSTDDTLEILEELAALY 51

Query: 153 GEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
              ++  ++  +N GK GA+  G + A+G I++  DAD
Sbjct: 52  IRRVLV-VRDKENGGKAGALNAGLRHAKGDIVVVLDAD 88


>gnl|CDD|133061 cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose
           synthase (CESA) superfamily.  This is a subfamily of
           cellulose synthase (CESA) superfamily.  CESA superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains.  The
           members of the superfamily include cellulose synthase
           catalytic subunit, chitin synthase, glucan biosynthesis
           protein and other families of CESA-like proteins.
          Length = 251

 Score = 73.4 bits (181), Expect = 1e-14
 Identities = 41/123 (33%), Positives = 64/123 (52%), Gaps = 18/123 (14%)

Query: 71  LDVKTGQELEFPSLDDEPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEI 130
           L     +    P     P+V  ++I+PAYNE+  ++  L+  +  L+  R ++     EI
Sbjct: 12  LARLRPKPPSLPDPAYLPTV--TIIIPAYNEEAVIEAKLENLLA-LDYPRDRL-----EI 63

Query: 131 IVVSDGSTDKTMQVVHQYTEKCGEDIVRGLKLL---KNRGKGGAVTLGTKCARGSIILFA 187
           IVVSDGSTD T ++  +Y +K       G+KLL   + RGK  A+      A G I++F 
Sbjct: 64  IVVSDGSTDGTAEIAREYADK-------GVKLLRFPERRGKAAALNRALALATGEIVVFT 116

Query: 188 DAD 190
           DA+
Sbjct: 117 DAN 119


>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
           biosynthesis of succinoglycan.  Succinoglycan
           Biosynthesis Protein ExoA catalyzes the formation of a
           beta-1,3 linkage of the second sugar (glucose) of the
           succinoglycan with the galactose on the lipid carrie.
           Succinoglycan is an acidic exopolysaccharide that is
           important for invasion of the nodules. Succinoglycan is
           a high-molecular-weight polymer composed of repeating
           octasaccharide units. These units are synthesized on
           membrane-bound isoprenoid lipid carriers, beginning with
           galactose followed by seven glucose molecules, and
           modified by the addition of acetate, succinate, and
           pyruvate. ExoA is a membrane protein with a
           transmembrance domain at c-terminus.
          Length = 249

 Score = 71.5 bits (176), Expect = 7e-14
 Identities = 32/102 (31%), Positives = 51/102 (50%), Gaps = 15/102 (14%)

Query: 93  SVIVPAYNEQDRLKPMLDETIE-FLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           S+I+P  NE+  ++    E +E  LN+     P    EIIVV  GSTD T ++V +Y  K
Sbjct: 3   SIIIPVRNEEKYIE----ELLESLLNQSY---PKDLIEIIVVDGGSTDGTREIVQEYAAK 55

Query: 152 CGEDIVRGLKLLKNRGK--GGAVTLGTKCARGSIILFADADG 191
                   ++L+ N  +     + +G + +RG II+  DA  
Sbjct: 56  DPR-----IRLIDNPKRIQSAGLNIGIRNSRGDIIIRVDAHA 92


>gnl|CDD|236882 PRK11204, PRK11204, N-glycosyltransferase; Provisional.
          Length = 420

 Score = 71.5 bits (176), Expect = 3e-13
 Identities = 33/109 (30%), Positives = 56/109 (51%), Gaps = 12/109 (11%)

Query: 82  PSLDDEPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKT 141
             L + P V  S++VP YNE + +    +ETI  L     + P   YE+I ++DGS+D T
Sbjct: 48  AQLKEYPGV--SILVPCYNEGENV----EETISHLL--ALRYP--NYEVIAINDGSSDNT 97

Query: 142 MQVVHQYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
            +++ +   +     +R + L +N+GK  A+  G   AR   ++  D D
Sbjct: 98  GEILDRLAAQ--IPRLRVIHLAENQGKANALNTGAAAARSEYLVCIDGD 144


>gnl|CDD|188452 TIGR03937, PgaC_IcaA, poly-beta-1,6 N-acetyl-D-glucosamine
           synthase.  Members of this protein family are
           biofilm-forming enzymes that polymerize
           N-acetyl-D-glucosamine residues in beta(1,6) linkage.
           One named members is IcaA (intercellular adhesin protein
           A), an enzyme that acts (with aid of subunit IcaD) in
           Polysaccharide Intercellular Adhesin (PIA) biosynthesis
           in Staphylococcus epidermis). The homologous member in
           E. coli is designated PgaC. Members are often encoded
           next to a polysaccharide deacetylase and involved in
           biofilm formation. Note that chitin, although also made
           from N-acetylglucosamine, is formed with beta-1,4
           linkages.
          Length = 407

 Score = 69.2 bits (170), Expect = 2e-12
 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 16/124 (12%)

Query: 67  ERVFLDVKTGQELEFPSLDDEPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTF 126
           ER +   +T      P LD+ P V  S++VP YNE   +    +ETI  L   R   P  
Sbjct: 28  ERKWPLPRTPP----PPLDEYPGV--SILVPCYNEGANV----EETISHLLALR--YP-- 73

Query: 127 KYEIIVVSDGSTDKTMQVVHQYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILF 186
            +EII ++DGS D T +++ +   +  +  +R + L +N+GK  A+  G   A+   ++ 
Sbjct: 74  NFEIIAINDGSKDNTAEILDRLAAQ--DPRLRVIHLAENQGKANALNTGLLAAKYEYLVC 131

Query: 187 ADAD 190
            D D
Sbjct: 132 IDGD 135


>gnl|CDD|234532 TIGR04283, glyco_like_mftF, transferase 2,
           rSAM/selenodomain-associated.  This enzyme may transfer
           a nucleotide, or it sugar moiety, as part of a
           biosynthetic pathway. Other proposed members of the
           pathway include another transferase (TIGR04282), a
           phosphoesterase, and a radical SAM enzyme (TIGR04167)
           whose C-terminal domain (pfam12345) frequently contains
           a selenocysteine [Unknown function, Enzymes of unknown
           specificity].
          Length = 220

 Score = 61.4 bits (150), Expect = 1e-10
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 20/100 (20%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           LS+I+P  NE   L  +L +      +          E+IVV  GSTD T+++       
Sbjct: 1   LSIIIPVLNEAATLPELLADLQALPGD---------AEVIVVDGGSTDGTVEIARSL--- 48

Query: 152 CGEDIVRGLKLLKN-RGKGGAVTLGTKCARGSIILFADAD 190
                  G K++ + +G+   +  G   A+G I+LF  AD
Sbjct: 49  -------GAKVIHSPKGRARQMNAGAALAKGDILLFLHAD 81


>gnl|CDD|133035 cd04192, GT_2_like_e, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 229

 Score = 59.2 bits (144), Expect = 7e-10
 Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 15/102 (14%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKCG 153
           V++ A NE + L  +L  ++  L+      P  K+E+I+V D STD T+Q++     K  
Sbjct: 1   VVIAARNEAENLPRLLQ-SLSALD-----YPKEKFEVILVDDHSTDGTVQILEFAAAK-- 52

Query: 154 EDIVRGLKLLKNR-----GKGGAVTLGTKCARGSIILFADAD 190
               + LK+L N      GK  A+T   K A+G  I+  DAD
Sbjct: 53  -PNFQ-LKILNNSRVSISGKKNALTTAIKAAKGDWIVTTDAD 92


>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
           wall biogenesis [Cell envelope biogenesis, outer
           membrane].
          Length = 439

 Score = 60.7 bits (147), Expect = 1e-09
 Identities = 37/103 (35%), Positives = 57/103 (55%), Gaps = 15/103 (14%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           +SVI+PAYNE+        E +E   E        +YE+IVV DGSTD+T +++    E+
Sbjct: 56  VSVIIPAYNEEP-------EVLEETLESLLSQDYPRYEVIVVDDGSTDETYEIL----EE 104

Query: 152 CGEDIVRGLKLL----KNRGKGGAVTLGTKCARGSIILFADAD 190
            G +     +++    KN GK GA+  G K A+G +++  DAD
Sbjct: 105 LGAEYGPNFRVIYPEKKNGGKAGALNNGLKRAKGDVVVILDAD 147


>gnl|CDD|182669 PRK10714, PRK10714, undecaprenyl phosphate
           4-deoxy-4-formamido-L-arabinose transferase;
           Provisional.
          Length = 325

 Score = 57.1 bits (138), Expect = 7e-09
 Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 27/191 (14%)

Query: 87  EPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVH 146
            P   +SV++P YNEQ+ L  ++  T        K     +YEI+++ DGS+D + +++ 
Sbjct: 3   HPIKKVSVVIPVYNEQESLPELIRRTTAACESLGK-----EYEILLIDDGSSDNSAEMLV 57

Query: 147 QYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLK 206
           +  +     IV  L L +N G+  A+  G     G +I+  DAD      ++ +L  K  
Sbjct: 58  EAAQAPDSHIVAIL-LNRNYGQHSAIMAGFSHVTGDLIITLDADLQNPPEEIPRLVAKAD 116

Query: 207 ELTDGDYIEDKNNAAGCNGVIVGSRAHLEALANVQRSFFRNILMKGFHFIVWFTGVRTIR 266
           E  D                +VG+        N Q S+FR    K  + ++  T  + + 
Sbjct: 117 EGYD----------------VVGTVR-----QNRQDSWFRKTASKMINRLIQRTTGKAMG 155

Query: 267 DTQCGFKLFTR 277
           D  C  + + R
Sbjct: 156 DYGCMLRAYRR 166


>gnl|CDD|133013 cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase
           family-2 subfamily with unknown function.
           Glycosyltransferase family 2 (GT-2) subfamily of unknown
           function. GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 221

 Score = 55.7 bits (135), Expect = 8e-09
 Identities = 30/99 (30%), Positives = 42/99 (42%), Gaps = 17/99 (17%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           LS+I+P  NE + L       +  L  RR        EIIVV  GSTD T+ +       
Sbjct: 1   LSIIIPTLNEAENLPR----LLASL--RRLNPLPL--EIIVVDGGSTDGTVAIARSA--- 49

Query: 152 CGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
            G  ++        +G+   +  G   ARG  +LF  AD
Sbjct: 50  -GVVVISS-----PKGRARQMNAGAAAARGDWLLFLHAD 82


>gnl|CDD|215385 PLN02726, PLN02726, dolichyl-phosphate beta-D-mannosyltransferase.
          Length = 243

 Score = 55.9 bits (135), Expect = 1e-08
 Identities = 40/103 (38%), Positives = 50/103 (48%), Gaps = 16/103 (15%)

Query: 93  SVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFK-YEIIVVSDGSTDKTMQVVHQYTEK 151
           S+IVP YNE+  +       I +L    K +   K +EIIVV DGS D T  VV Q  + 
Sbjct: 12  SIIVPTYNERLNIA-----LIVYL--IFKALQDVKDFEIIVVDDGSPDGTQDVVKQLQKV 64

Query: 152 CGEDIV----RGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
            GED +    R  KL    G G A   G K A G  ++  DAD
Sbjct: 65  YGEDRILLRPRPGKL----GLGTAYIHGLKHASGDFVVIMDAD 103


>gnl|CDD|182223 PRK10073, PRK10073, putative glycosyl transferase; Provisional.
          Length = 328

 Score = 54.7 bits (132), Expect = 5e-08
 Identities = 51/204 (25%), Positives = 69/204 (33%), Gaps = 46/204 (22%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           LS+I+P YN     +  + E++               EII+V+DGSTD ++++   Y E 
Sbjct: 8   LSIIIPLYNAGKDFRAFM-ESLIAQTWTA-------LEIIIVNDGSTDNSVEIAKHYAEN 59

Query: 152 CGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKL-----EDKLK 206
                VR L    N G   A   G   A G  + F DAD        E L     ED L 
Sbjct: 60  YPH--VRLLH-QANAGVSVARNTGLAVATGKYVAFPDADDVVYPTMYETLMTMALEDDLD 116

Query: 207 ELT-DGDYIEDKNNAA---------GCNGVIVGSRAHLEALA----------NVQRSFFR 246
               + D+                    GV+ G      AL+           V R  F 
Sbjct: 117 VAQCNADWCFRDTGETWQSIPSDRLRSTGVLSGPDWLRMALSSRRWTHVVWLGVYRRDF- 175

Query: 247 NILMKGFHF--------IVWFTGV 262
            I+     F        I W T V
Sbjct: 176 -IVKNNIKFEPGLHHQDIPWTTEV 198


>gnl|CDD|234419 TIGR03965, mycofact_glyco, mycofactocin system glycosyltransferase.
            Members of this protein family are putative
           glycosyltransferases, members of pfam00535 (glycosyl
           transferase family 2). Members appear mostly in the
           Actinobacteria, where they appear to be part of a system
           for converting a precursor peptide (TIGR03969) into a
           novel redox carrier designated mycofactocin. A radical
           SAM enzyme, TIGR03962, is a proposed to be a key
           maturase for mycofactocin.
          Length = 467

 Score = 50.5 bits (121), Expect = 1e-06
 Identities = 30/110 (27%), Positives = 44/110 (40%), Gaps = 13/110 (11%)

Query: 82  PSLDDE-PSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDK 140
           P+L    PSV  +V+VP  N    L  +L              P  + E+IVV DGS D 
Sbjct: 67  PALLPSPPSV--TVVVPVRNRPAGLARLLAAL------LALDYPRDRLEVIVVDDGSEDP 118

Query: 141 TMQVVHQYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
                     +     VR ++  + +G   A   G + AR   + F D+D
Sbjct: 119 V-PTRAARGARLP---VRVIRHPRRQGPAAARNAGARAARTEFVAFTDSD 164


>gnl|CDD|184755 PRK14583, hmsR, N-glycosyltransferase; Provisional.
          Length = 444

 Score = 48.1 bits (114), Expect = 9e-06
 Identities = 35/118 (29%), Positives = 56/118 (47%), Gaps = 14/118 (11%)

Query: 74  KTGQELEFPSLDDEPSVNLSVIVPAYNEQDRLKPMLDETIEF-LNERRKKIPTFKYEIIV 132
             G ++  P L   P V  S++VP +NE    +    ETI   L +    I     E+I 
Sbjct: 61  PWGDDVPAPELKGHPLV--SILVPCFNEGLNAR----ETIHAALAQTYTNI-----EVIA 109

Query: 133 VSDGSTDKTMQVVHQYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
           ++DGS+D T QV+     +  +  +R + L  N+GK  A+ +G   AR   ++  D D
Sbjct: 110 INDGSSDDTAQVLDALLAE--DPRLRVIHLAHNQGKAIALRMGAAAARSEYLVCIDGD 165


>gnl|CDD|237556 PRK13915, PRK13915, putative glucosyl-3-phosphoglycerate synthase;
           Provisional.
          Length = 306

 Score = 46.1 bits (110), Expect = 3e-05
 Identities = 33/114 (28%), Positives = 50/114 (43%), Gaps = 38/114 (33%)

Query: 93  SVIVPAYNEQ-------DRLKPMLDET-IEFLNERRKKIPTFKYEIIVVSDGSTDKTMQ- 143
           SV++PA NE+       D ++P+L E  ++              E+IV+  GSTD T + 
Sbjct: 34  SVVLPALNEEETVGKVVDSIRPLLMEPLVD--------------ELIVIDSGSTDATAER 79

Query: 144 -------VVHQYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
                  VV +      E+I+  L      GKG A+        G I++F DAD
Sbjct: 80  AAAAGARVVSR------EEILPELPPR--PGKGEALWRSLAATTGDIVVFVDAD 125


>gnl|CDD|222281 pfam13641, Glyco_tranf_2_3, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 229

 Score = 42.7 bits (101), Expect = 2e-04
 Identities = 26/102 (25%), Positives = 43/102 (42%), Gaps = 11/102 (10%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQY--- 148
           ++V+VP +NE   L   L   +     R         E+IVV DGS D+T+ V  +    
Sbjct: 3   VAVVVPTWNEDKVLGRTLRSILAQDYPR--------LEVIVVVDGSDDETLDVARELAAA 54

Query: 149 TEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
                  +VR  +     GK  A+    +  +  +++  DAD
Sbjct: 55  YPDVRVHVVRRPRPPGPTGKARALNEALRAIKSDLVVLLDAD 96


>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 278

 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 25/119 (21%), Positives = 38/119 (31%), Gaps = 7/119 (5%)

Query: 93  SVIVPAYNEQDRLKPMLDETIEFLNERRKKI-PTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           SVI+P Y         + + ++    R          EIIVV   S     + + +   K
Sbjct: 1   SVIIPVYL--SLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSFAKELKKIIAK 58

Query: 152 CGEDIVRGLKLLKNR-GKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELT 209
            G       K  K+      A   G + +    I F D D        + LE  +K   
Sbjct: 59  NGAINYISHKTHKDLFSPAKARNRGAEYSSSDFIFFLDVD---CLISPDTLEKIIKHFQ 114


>gnl|CDD|133005 cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4
           glucosyltransferase is required for biosynthesis of
           lipooligosaccharide.  UDP-glucose: lipooligosaccharide
           (LOS)  beta-1-4-glucosyltransferase catalyzes the
           addition of the first residue, glucose, of the
           lacto-N-neotetrase structure to HepI of the LOS inner
           core.  LOS is the major constituent of the outer leaflet
           of the outer membrane of gram-positive bacteria. It
           consists of a short oligosaccharide chain of variable
           composition (alpha chain) attached to a branched inner
           core which is lined in turn to lipid A. Beta 1,4
           glucosyltransferase is required to attach the alpha
           chain to the inner core.
          Length = 229

 Score = 40.7 bits (96), Expect = 9e-04
 Identities = 26/100 (26%), Positives = 45/100 (45%), Gaps = 19/100 (19%)

Query: 91  NLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTE 150
            LSV++   NE+  ++  L E++++  +          EIIVV  GSTD+T+++  +Y  
Sbjct: 1   TLSVVIITKNEERNIERCL-ESVKWAVD----------EIIVVDSGSTDRTVEIAKEYGA 49

Query: 151 KCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
           K    + +        G G       + A    +L  DAD
Sbjct: 50  K----VYQ----RWWDGFGAQRNFALELATNDWVLSLDAD 81


>gnl|CDD|200433 TIGR04182, glyco_TIGR04182, glycosyltransferase, TIGR04182 family. 
           Members of this family are glycosyltransferases
           restricted to the archaea. All but two members are from
           species with the PGF-CTERM/archaeosortase A system, a
           proposed maturation system for exported, glycosylated
           proteins as are found often in S-layers.
          Length = 293

 Score = 39.6 bits (93), Expect = 0.003
 Identities = 49/229 (21%), Positives = 90/229 (39%), Gaps = 52/229 (22%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFK----YEIIVVSDGSTDKTMQVVHQYT 149
           V++P  NE   +  ++D               F+     +I+V+   STD T ++  +  
Sbjct: 4   VLIPTLNEAATIGDVIDG--------------FQELGYSDILVIDGNSTDGTQEIAKEA- 48

Query: 150 EKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELT 209
              G  +V    +   +GKG AV    +      +L  D DG    AD + L + L    
Sbjct: 49  ---GARVV----IQSGKGKGQAVREAFELIDAPYVLMLDGDGTYLPADADALLEPL---- 97

Query: 210 DGDYIEDKNNAAGCNGVIVGSR-AHLEALANVQRSFFRN-ILMKGFHFIVWFTGVRTIRD 267
                      +G    ++G+R A +E  A  + +   N ++ + F  I    GV  +RD
Sbjct: 98  ----------LSGRADHVIGNRFADMEPGAFTRLNLVGNRLINRLFRII---HGVD-LRD 143

Query: 268 TQCGFKLFTRKSALQLFSSIHVQRWAFDVELLFIAEVLH--IPMAEVSV 314
              G++ FTR+S       + +    F++E     E +   + +  V +
Sbjct: 144 ILSGYRAFTRES----IRRMELTETGFEIETEIAVECVKKNLRVEVVPI 188


>gnl|CDD|133039 cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 214

 Score = 39.2 bits (92), Expect = 0.003
 Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 7/65 (10%)

Query: 129 EIIVVSDGSTDKTMQVVHQYTEKCGEDIVRGLKLLKNRGKGGAVT---LGTKCARGSIIL 185
           E+I+  DGSTD T++++ +Y +K        + L++N    G         + A G  + 
Sbjct: 29  ELIISDDGSTDGTVEIIKEYIDK----DPFIIILIRNGKNLGVARNFESLLQAADGDYVF 84

Query: 186 FADAD 190
           F D D
Sbjct: 85  FCDQD 89


>gnl|CDD|133056 cd06434, GT2_HAS, Hyaluronan synthases catalyze polymerization of
           hyaluronan.  Hyaluronan synthases (HASs) are
           bi-functional glycosyltransferases that catalyze
           polymerization of hyaluronan. HASs transfer both GlcUA
           and GlcNAc in beta-(1,3) and beta-(1,4) linkages,
           respectively to the hyaluronan chain using UDP-GlcNAc
           and UDP-GlcUA as substrates. HA is made as a free
           glycan, not attached to a protein or lipid. HASs do not
           need a primer for HA synthesis; they initiate HA
           biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA,
           and Mg2+. Hyaluronan (HA) is a linear
           heteropolysaccharide composed of (1-3)-linked
           beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can
           be found in vertebrates and a few microbes and is
           typically on the cell surface or in the extracellular
           space, but is also found inside mammalian cells.
           Hyaluronan has several physiochemical and biological
           functions such as space filling, lubrication, and
           providing a hydrated matrix through which cells can
           migrate.
          Length = 235

 Score = 38.4 bits (90), Expect = 0.006
 Identities = 23/99 (23%), Positives = 43/99 (43%), Gaps = 13/99 (13%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           ++VI+P Y+E        D   E L    ++ P    EIIVV+DG  +     +   ++ 
Sbjct: 2   VTVIIPVYDED------PDVFRECLRSILRQKP---LEIIVVTDGDDEP---YLSILSQT 49

Query: 152 CGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
                +  +  + + GK  A+  G +     I++  D+D
Sbjct: 50  VKYGGIF-VITVPHPGKRRALAEGIRHVTTDIVVLLDSD 87


>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
           mucin-type O-linked glycans.  UDP-GalNAc: polypeptide
           alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
           initiate the formation of mucin-type, O-linked glycans
           by catalyzing the transfer of
           alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
           hydroxyl groups of Ser or Thr residues of core proteins
           to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
           enzymes are type II membrane proteins with a GT-A type
           catalytic domain and a lectin domain located on the
           lumen side of the Golgi apparatus. In human, there are
           15 isozymes of pp-GalNAc-Ts, representing the largest of
           all glycosyltransferase families. Each isozyme has
           unique but partially redundant substrate specificity for
           glycosylation sites on acceptor proteins.
          Length = 299

 Score = 38.3 bits (90), Expect = 0.007
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 11/100 (11%)

Query: 93  SVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKY---EIIVVSDGSTDKTMQVVHQYT 149
           SVI+  +NE       L  T+  +  R     T      EII+V D S    ++++ +  
Sbjct: 1   SVIIIFHNEA---LSTLLRTVHSVINR-----TPPELLKEIILVDDFSDKPELKLLLEEY 52

Query: 150 EKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADA 189
            K     V+ L+L K  G   A   G + A G +++F D+
Sbjct: 53  YKKYLPKVKVLRLKKREGLIRARIAGARAATGDVLVFLDS 92


>gnl|CDD|133029 cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 166

 Score = 37.2 bits (87), Expect = 0.009
 Identities = 33/139 (23%), Positives = 55/139 (39%), Gaps = 27/139 (19%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKCG 153
           +I+  YN  + LK  LD     L +         +E+IVV + STD +++++ +      
Sbjct: 1   IIIVNYNSLEYLKACLDS---LLAQTYPD-----FEVIVVDNASTDGSVELLREL----- 47

Query: 154 EDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFADLEKLEDKLKELTDGDY 213
              VR ++  +N G G     G + A+G  +L  +        D       L EL   D 
Sbjct: 48  FPEVRLIRNGENLGFGAGNNQGIREAKGDYVLLLN-------PDTVVEPGALLELL--DA 98

Query: 214 IEDKNNAAGCNGVIVGSRA 232
            E   +       IVG + 
Sbjct: 99  AEQDPDVG-----IVGPKV 112


>gnl|CDD|224137 COG1216, COG1216, Predicted glycosyltransferases [General function
           prediction only].
          Length = 305

 Score = 36.7 bits (85), Expect = 0.023
 Identities = 35/200 (17%), Positives = 64/200 (32%), Gaps = 31/200 (15%)

Query: 91  NLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTE 150
            +S+I+  YN         ++ +E L     +       I+VV +GSTD +++ +     
Sbjct: 4   KISIIIVTYNR-------GEDLVECLASLAAQTYPDDV-IVVVDNGSTDGSLEALKAR-- 53

Query: 151 KCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGATKFA----DLEKLEDKLK 206
                 VR ++  +N G  G    G        I +A A G         D     D L+
Sbjct: 54  --FFPNVRLIENGENLGFAGGFNRG--------IKYALAKGDDYVLLLNPDTVVEPDLLE 103

Query: 207 ELTDGDYIEDKNNAAGCNGVIVGS--RAHLEALANVQRSFFRNILMKGFHFIVWFTGVRT 264
           EL      E+   A     +I       +++     +               +       
Sbjct: 104 EL--LKAAEEDPAAGVVGPLIRNYDESLYIDRRG-GESDGLTGGWRASPLLEIAPDLSSY 160

Query: 265 --IRDTQCGFKLFTRKSALQ 282
             +  +  G  L  R+ A +
Sbjct: 161 LEVVASLSGACLLIRREAFE 180


>gnl|CDD|133055 cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen
           biosynthesis.  Escherichia coli WfgS and Shigella
           dysenteriae WfeV are glycosyltransferase 2 family
           enzymes involved in O-antigen biosynthesis. GT-2 enzymes
           have GT-A type structural fold, which has two tightly
           associated beta/alpha/beta domains that tend to form a
           continuous central sheet of at least eight beta-strands.
           These are enzymes that catalyze the transfer of sugar
           moieties from activated donor molecules to specific
           acceptor molecules, forming glycosidic bonds.
           Glycosyltransferases have been classified into more than
           90 distinct sequence based families.
          Length = 202

 Score = 35.2 bits (82), Expect = 0.054
 Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 29/130 (22%)

Query: 93  SVIVPAYNEQDRLKPMLDETIE-FLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           S+I P YN+ + L+    ETI+  L++    I     E IV+  GSTD T+ ++ +Y +K
Sbjct: 1   SIITPTYNQAETLE----ETIDSVLSQTYPNI-----EYIVIDGGSTDGTVDIIKKYEDK 51

Query: 152 CGEDIVRGLKLLKNRGKG--GAVTLGTKCARGSIILF--AD---ADGATKFADLEKLEDK 204
                      +    KG   A+  G   A G II F  +D     GA     L  +   
Sbjct: 52  I-------TYWISEPDKGIYDAMNKGIALATGDIIGFLNSDDTLLPGA-----LLAVVAA 99

Query: 205 LKELTDGDYI 214
             E  + D +
Sbjct: 100 FAEHPEVDVV 109


>gnl|CDD|213815 TIGR03469, HpnB, hopene-associated glycosyltransferase HpnB.  This
           family of genes include a glycosyl transferase, group 2
           domain (pfam00535) which are responsible, generally for
           the transfer of nucleotide-diphosphate sugars to
           substrates such as polysaccharides and lipids. The genes
           of this family are often found in the same genetic locus
           with squalene-hopene cyclase genes, and are never
           associated with genes for the metabolism of phytoene.
           Indeed, the members of this family appear to never be
           found in a genome lacking squalene-hopene cyclase (SHC),
           although not all genomes encoding SHC have this glycosyl
           transferase. In the organism Zymomonas mobilis the
           linkage of this gene to hopanoid biosynthesis has been
           noted and the gene named HpnB. Hopanoids are known to
           feature polar glycosyl head groups in many organisms.
          Length = 384

 Score = 35.8 bits (83), Expect = 0.061
 Identities = 36/119 (30%), Positives = 48/119 (40%), Gaps = 19/119 (15%)

Query: 82  PSLDDEPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKT 141
           PS +  P+V    +VPA NE D    ++ E +  L    +  P  K  +I+V D STD T
Sbjct: 34  PSPEAWPAV--VAVVPARNEAD----VIGECVTSL--LEQDYPG-KLHVILVDDHSTDGT 84

Query: 142 ----MQVVHQYTEKCGEDIVRGLKLLKN-RGKGGAVTLGTKCARGSI-----ILFADAD 190
                     Y       +V G  L     GK  AV+ G   AR        +L  DAD
Sbjct: 85  ADIARAAARAYGRGDRLTVVSGQPLPPGWSGKLWAVSQGIAAARTLAPPADYLLLTDAD 143


>gnl|CDD|132155 TIGR03111, glyc2_xrt_Gpos1, putative glycosyltransferase,
           exosortase G-associated.  Members of this protein family
           are probable glycosyltransferases of family 2, whose
           genes are near those for the exosortase homolog XrtG
           (TIGR03110), which is restricted to Gram-positive
           bacteria. Other genes in the conserved gene neighborhood
           include a 6-pyruvoyl tetrahydropterin synthase homolog
           (TIGR03112) and an uncharacterized intergral membrane
           protein (TIGR03766).
          Length = 439

 Score = 35.5 bits (82), Expect = 0.067
 Identities = 23/100 (23%), Positives = 46/100 (46%), Gaps = 8/100 (8%)

Query: 92  LSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEK 151
           +++I+P YN +D L   ++          +  P    +II+ ++ STD + QV  +   +
Sbjct: 51  ITIIIPVYNSEDTLFNCIESIYN------QTYPIELIDIILANNQSTDDSFQVFCRAQNE 104

Query: 152 CGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADG 191
                +R +    ++GK  A+      + G  I+  D+DG
Sbjct: 105 FPGLSLRYMN--SDQGKAKALNAAIYNSIGKYIIHIDSDG 142


>gnl|CDD|133042 cd06420, GT2_Chondriotin_Pol_N, N-terminal domain of Chondroitin
           polymerase functions as a GalNAc transferase.
           Chondroitin polymerase is a two domain, bi-functional
           protein. The N-terminal domain functions as a GalNAc
           transferase. The bacterial chondroitin polymerase
           catalyzes elongation of the chondroitin chain by
           alternatively transferring the GlcUA and GalNAc moiety
           from UDP-GlcUA and UDP-GalNAc to the non-reducing ends
           of the chondroitin chain. The enzyme consists of
           N-terminal and C-terminal domains in which the two
           active sites catalyze the addition of GalNAc and GlcUA,
           respectively. Chondroitin chains range from 40 to over
           100 repeating units of the disaccharide. Sulfated
           chondroitins are involved in the regulation of various
           biological functions such as central nervous system
           development, wound repair, infection, growth factor
           signaling, and morphogenesis, in addition to its
           conventional structural roles. In Caenorhabditis
           elegans, chondroitin is an essential factor for the worm
           to undergo cytokinesis and cell division. Chondroitin is
           synthesized as proteoglycans, sulfated and secreted to
           the cell surface or extracellular matrix.
          Length = 182

 Score = 33.7 bits (78), Expect = 0.13
 Identities = 14/59 (23%), Positives = 31/59 (52%), Gaps = 8/59 (13%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKC 152
           +I+  YN  + L+ +L      LN+         +E+I+  DGST++T +++ ++  + 
Sbjct: 1   LIITTYNRPEALELVLKS---VLNQSILP-----FEVIIADDGSTEETKELIEEFKSQF 51


>gnl|CDD|219174 pfam06780, Erp_C, Erp protein C-terminus.  This family represents
           the C-terminus of bacterial Erp proteins that seem to be
           specific to Borrelia burgdorferi (a causative agent of
           Lyme disease). Borrelia Erp proteins are particularly
           heterogeneous, which might enable them to interact with
           a wide variety of host components.
          Length = 141

 Score = 32.8 bits (75), Expect = 0.20
 Identities = 15/34 (44%), Positives = 22/34 (64%), Gaps = 2/34 (5%)

Query: 197 DLEKLEDKLKELTDGDYIEDKNNAAGCNGVIVGS 230
           DLEKL+ KL+E+   +Y+EDK+N     G +  S
Sbjct: 110 DLEKLKSKLEEVK--EYLEDKDNFEEIKGYVKDS 141


>gnl|CDD|211965 TIGR04242, nodulat_NodC, chitooligosaccharide synthase NodC.
           Members of this family are NodC, an
           N-acetylglucosaminyltransferase involved in the
           production of nodulation factors through which rhizobia
           establish symbioses with leguminous plants.
          Length = 395

 Score = 33.9 bits (78), Expect = 0.23
 Identities = 33/117 (28%), Positives = 48/117 (41%), Gaps = 21/117 (17%)

Query: 80  EFPSLDDEPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKY----EIIVVSD 135
           +    D  PSV+  VIVP +NE  R           L+E    I    Y     + VV D
Sbjct: 40  DALPSDPLPSVD--VIVPCFNEDPRT----------LSECLASIAAQDYAGKLRVYVVDD 87

Query: 136 GSTDKT--MQVVHQYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
           GST++   + V   Y     +     + L KN GK  A     + + G ++L  D+D
Sbjct: 88  GSTNRDALVPVHDAYA---SDPRFNFILLPKNVGKRKAQIAAIRRSSGDLVLNVDSD 141


>gnl|CDD|215127 PLN02210, PLN02210, UDP-glucosyl transferase.
          Length = 456

 Score = 33.5 bits (76), Expect = 0.29
 Identities = 21/67 (31%), Positives = 27/67 (40%), Gaps = 17/67 (25%)

Query: 31  IHCYIALFLVCGILSYAYRSFKTKNPFPDIDFLSKGERVFLDVKTGQELEFPSLD----- 85
           I C I     CG  S  YR +   N FPD++ L+            Q +E P+L      
Sbjct: 125 IPCAILWIQACGAYSVYYRYYMKTNSFPDLEDLN------------QTVELPALPLLEVR 172

Query: 86  DEPSVNL 92
           D PS  L
Sbjct: 173 DLPSFML 179


>gnl|CDD|133059 cd06437, CESA_CaSu_A2, Cellulose synthase catalytic subunit A2
           (CESA2) is a catalytic subunit or a catalytic subunit
           substitute of the cellulose synthase complex.  Cellulose
           synthase (CESA) catalyzes the polymerization reaction of
           cellulose using UDP-glucose as the substrate. Cellulose
           is an aggregate of unbranched polymers of
           beta-1,4-linked glucose residues, which is an abundant
           polysaccharide produced by plants and in varying degrees
           by several other organisms including algae, bacteria,
           fungi, and even some animals. Genomes from higher plants
           harbor multiple CESA genes. There are ten in
           Arabidopsis. At least three different CESA proteins are
           required to form a functional complex. In Arabidopsis,
           CESA1, 3 and 6 and CESA4, 7 and 8, are required for
           cellulose biosynthesis during primary and secondary cell
           wall formation. CESA2 is very closely related to CESA6
           and is viewed as a prime substitute for CESA6. They
           functionally compensate each other. The cesa2 and cesa6
           double mutant plants were significantly smaller, while
           the single mutant plants were almost normal.
          Length = 232

 Score = 31.1 bits (71), Expect = 1.1
 Identities = 30/109 (27%), Positives = 52/109 (47%), Gaps = 18/109 (16%)

Query: 88  PSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQ 147
           P V  +V +P +NE+     +++  IE         P  + EI V+ D STD+T+++  +
Sbjct: 1   PMV--TVQLPVFNEKY----VVERLIE--AACALDYPKDRLEIQVLDD-STDETVRLARE 51

Query: 148 YTEKCGEDIVRGLKLL----KNRG--KGGAVTLGTKCARGSIILFADAD 190
             E+      +G+ +      +R   K GA+  G K A+G  +   DAD
Sbjct: 52  IVEEYAA---QGVNIKHVRRADRTGYKAGALAEGMKVAKGEYVAIFDAD 97


>gnl|CDD|190261 pfam02257, RFX_DNA_binding, RFX DNA-binding domain.  RFX is a
           regulatory factor which binds to the X box of MHC class
           II genes and is essential for their expression. The
           DNA-binding domain of RFX is the central domain of the
           protein and binds ssDNA as either a monomer or
           homodimer.
          Length = 73

 Score = 28.3 bits (64), Expect = 1.8
 Identities = 11/38 (28%), Positives = 22/38 (57%), Gaps = 1/38 (2%)

Query: 427 HCSSVVCRPMNAVTFGRTVSESSPGIGTIHKFGVSGKS 464
           HC+ +  +P++A +FG+ +    PG+    + G  G+S
Sbjct: 36  HCAELGLKPLSAASFGKLIRSVFPGL-KTRRLGTRGQS 72


>gnl|CDD|133027 cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins
           are required for O-antigen biosynthesis.  The rfbC gene
           encodes a predicted protein of 1,276 amino acids, which
           is required for O-antigen biosynthesis in Myxococcus
           xanthus. It is a subfamily of Glycosyltransferase Family
           GT2, which includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. These are enzymes that catalyze the
           transfer of sugar moieties from activated donor
           molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 202

 Score = 30.2 bits (69), Expect = 1.9
 Identities = 30/137 (21%), Positives = 55/137 (40%), Gaps = 17/137 (12%)

Query: 92  LSVIVPAYN-EQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTD-KTMQVVHQYT 149
           +S+++P YN  +  L+  ++                 +E+ +  D STD +  +V+ +Y 
Sbjct: 3   ISIVMPVYNTPEKYLREAIESVRAQTYPN--------WELCIADDASTDPEVKRVLKKYA 54

Query: 150 EKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADADGA-TKFADLEKLEDKLKEL 208
            +  +  ++ +   +N G   A     + A G  +   D D      A  E +   L E 
Sbjct: 55  AQ--DPRIKVVFREENGGISAATNSALELATGEFVALLDHDDELAPHALYEVV-KALNEH 111

Query: 209 TDGDYI---EDKNNAAG 222
            D D I   EDK +  G
Sbjct: 112 PDADLIYSDEDKIDEGG 128


>gnl|CDD|133063 cd06913, beta3GnTL1_like, Beta 1, 3-N-acetylglucosaminyltransferase
           is essential for the formation of
           poly-N-acetyllactosamine .  This family includes human
           Beta3GnTL1 and related eukaryotic proteins. Human
           Beta3GnTL1 is a putative
           beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is
           expressed at various levels in most of tissues examined.
           Beta 1, 3-N-acetylglucosaminyltransferase has been found
           to be essential for the formation of
           poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a
           unique carbohydrate composed of N-acetyllactosamine
           repeats. It is often an important part of
           cell-type-specific oligosaccharide structures and some
           functional oligosaccharides. It has been shown that the
           structure and biosynthesis of poly-N-acetyllactosamine
           display a dramatic change during development and
           oncogenesis. Several members of beta-1,
           3-N-acetylglucosaminyltransferase have been identified.
          Length = 219

 Score = 30.1 bits (68), Expect = 2.1
 Identities = 23/101 (22%), Positives = 46/101 (45%), Gaps = 11/101 (10%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVVHQYTEKC- 152
           +I+P +N +      LDE +E + ++  +      E+ V +D STDK+ +++ ++ +K  
Sbjct: 1   IILPVHNGE----QWLDECLESVLQQDFEGTL---ELSVFNDASTDKSAEIIEKWRKKLE 53

Query: 153 ---GEDIVRGLKLLKNRGKGGAVTLGTKCARGSIILFADAD 190
                 +V        +G G A       + G  + F D+D
Sbjct: 54  DSGVIVLVGSHNSPSPKGVGYAKNQAIAQSSGRYLCFLDSD 94


>gnl|CDD|216924 pfam02191, OLF, Olfactomedin-like domain. 
          Length = 250

 Score = 29.9 bits (68), Expect = 3.1
 Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 14/44 (31%)

Query: 38  FLVCGILSYAYRSFKTKNPFPDIDFLSKGERVFL--DVKTGQEL 79
           F+VCG+L Y  RS+  ++            R+F   D  TG E 
Sbjct: 177 FMVCGVL-YVTRSYSLRD-----------TRIFYAYDTSTGTEE 208


>gnl|CDD|133060 cd06438, EpsO_like, EpsO protein participates in the methanolan
           synthesis.  The Methylobacillus sp EpsO protein is
           predicted to participate in the methanolan synthesis.
           Methanolan is an exopolysaccharide (EPS), composed of
           glucose, mannose and galactose.  A 21 genes cluster was
           predicted to participate in the methanolan synthesis.
           Gene disruption analysis revealed that EpsO is one of
           the glycosyltransferase enzymes involved in the
           synthesis of repeating sugar units onto the lipid
           carrier.
          Length = 183

 Score = 29.5 bits (67), Expect = 3.2
 Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 6/52 (11%)

Query: 94  VIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDGSTDKTMQVV 145
           +++PA+NE+  +   +       + + +  P   Y I VV+D  TD T QV 
Sbjct: 1   ILIPAHNEEAVIGNTVR------SLKAQDYPRELYRIFVVADNCTDDTAQVA 46


>gnl|CDD|177282 PHA00447, PHA00447, lysozyme.
          Length = 142

 Score = 29.0 bits (65), Expect = 4.0
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 544 NHHSVPVCVSGGIDAEG 560
           N +SV VC+ GGID +G
Sbjct: 72  NSNSVGVCLVGGIDDKG 88


>gnl|CDD|215756 pfam00155, Aminotran_1_2, Aminotransferase class I and II. 
          Length = 357

 Score = 29.6 bits (67), Expect = 4.8
 Identities = 19/74 (25%), Positives = 30/74 (40%), Gaps = 9/74 (12%)

Query: 179 ARGSIILFADADGATKFADLEKLEDKLKELTDGD---YIEDKNNAAGCNGVIVGSRAHLE 235
           A G ++ +          D + LE  LKE T+G+        +N  G     V +   LE
Sbjct: 110 AGGEVVRYPLYSSNDFHLDFDALEAALKEATEGNKVVLHTSPHNPTG----TVATLEELE 165

Query: 236 ALANVQRSFFRNIL 249
            L ++ +    NIL
Sbjct: 166 KLLDLAKE--HNIL 177


>gnl|CDD|211918 TIGR04195, S_glycosyl_SunS, peptide S-glycosyltransferase, SunS
           family.  Members of this family include SunS, the
           S-glycosyltransferase that transfers a sugar (substrate
           is variable in reconstitution assays) onto the precursor
           of the glycopeptide sublancin, which once was thought to
           be a lantibiotic.
          Length = 422

 Score = 29.6 bits (66), Expect = 4.9
 Identities = 21/75 (28%), Positives = 30/75 (40%), Gaps = 14/75 (18%)

Query: 72  DVKTGQELEFP-SLDDEPSVNLSVIVPAYNEQDRLKPMLDETIEFLNERRKKIPTFKYEI 130
           D     EL       D PS+   +I   YNE+  +K  LD   +  +E           I
Sbjct: 40  DSMNINELVAGYEKADTPSITCGIIT--YNEEKCIKKCLDSLGDDFDE-----------I 86

Query: 131 IVVSDGSTDKTMQVV 145
           IV+   STD T  ++
Sbjct: 87  IVLDSHSTDDTTDII 101


>gnl|CDD|182218 PRK10063, PRK10063, putative glycosyl transferase; Provisional.
          Length = 248

 Score = 29.4 bits (66), Expect = 5.1
 Identities = 24/98 (24%), Positives = 36/98 (36%), Gaps = 10/98 (10%)

Query: 124 PTFKYEIIVVSDGSTDKTMQVVHQYTEKCGEDIVRGLKLLKNRGKGGAVTLGTKCARGSI 183
           P   +E IVV  GS D T + +           V       + G   A+  G   A+G  
Sbjct: 30  PGISFEWIVVDGGSNDGTREFLENLNGIFNLRFVSE----PDNGIYDAMNKGIAMAQGRF 85

Query: 184 ILFADA-----DGATKFADLEKLEDKLKELTDGDYIED 216
            LF ++       A  F    K++ K   +  GD + D
Sbjct: 86  ALFLNSGDIFHQDAANFVRQLKMQ-KDNAMIIGDALLD 122


>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family protein.
             [Transport and binding proteins, Other].
          Length = 1394

 Score = 29.7 bits (67), Expect = 6.1
 Identities = 14/48 (29%), Positives = 26/48 (54%)

Query: 479  YYYYFWYPQIIICPEMSKTGLLNKNLIIFWNLKCVFFCGYKLLSQSII 526
             +++ WY  +      SKTG +++  ++FW L  +FF  +  L Q +I
Sbjct: 1191 IFFFIWYYPVGFYWNASKTGQVHERGVLFWLLSTMFFLYFSTLGQMVI 1238


>gnl|CDD|235392 PRK05291, trmE, tRNA modification GTPase TrmE; Reviewed.
          Length = 449

 Score = 29.3 bits (67), Expect = 6.8
 Identities = 12/48 (25%), Positives = 22/48 (45%), Gaps = 6/48 (12%)

Query: 196 ADLEKLEDKLKELTDGDYIEDKNNAAGCNGVIVGSRAHLEALANVQRS 243
             +++L + +KEL  G +  ++       GV + +  HLEAL      
Sbjct: 356 EGIDELREAIKELAFGGFGGNQE------GVFLTNARHLEALERALEH 397


>gnl|CDD|221463 pfam12211, LMWSLP_N, Low molecular weight S layer protein N
           terminal.  This family of proteins is found in bacteria.
           Proteins in this family are typically between 328 and
           381 amino acids in length. There is a conserved LGDG
           sequence motif. Clostridial species have a layer of
           surface proteins surrounding their membrane. This layer
           is comprised of a high molecular weight protein and a
           low molecular weight protein. This domain is the N
           terminal domain of the low molecular weight protein. It
           is a structural domain.
          Length = 253

 Score = 28.5 bits (64), Expect = 7.6
 Identities = 13/30 (43%), Positives = 18/30 (60%), Gaps = 4/30 (13%)

Query: 190 DGATKFADLEKL----EDKLKELTDGDYIE 215
           D A K A  +KL    ++KL  L DGDY++
Sbjct: 49  DAADKSAAADKLYNLVDNKLDNLGDGDYVD 78


>gnl|CDD|206175 pfam14005, YpjP, YpjP-like protein.  The YpjP-like protein family
           includes the B. subtilis YpjP protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are
           approximately 200 amino acids in length.
          Length = 137

 Score = 27.6 bits (62), Expect = 9.5
 Identities = 13/83 (15%), Positives = 34/83 (40%), Gaps = 10/83 (12%)

Query: 94  VIVPAYNE--QDRLKPMLDETIEFLNERRKKIPTFKYEIIVVSDG-STDKTMQVVHQYTE 150
           VI   + E    +++  + + +        ++P    + + +S+  +  +  ++ H Y  
Sbjct: 32  VIEDEFREEILPKIEEAIADLLA-------ELPEDDLQDLEISEAPAGGQGEKIFHIYNR 84

Query: 151 KCGEDIVRGLKLLKNRGKGGAVT 173
           K GED++R      +  + G   
Sbjct: 85  KTGEDLLRFHVRRDHPPQDGYWF 107


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.139    0.433 

Gapped
Lambda     K      H
   0.267   0.0739    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,936,067
Number of extensions: 2837928
Number of successful extensions: 2541
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2482
Number of HSP's successfully gapped: 55
Length of query: 569
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 467
Effective length of database: 6,413,494
Effective search space: 2995101698
Effective search space used: 2995101698
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.6 bits)