RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8674
         (439 letters)



>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
           mucin-type O-linked glycans.  UDP-GalNAc: polypeptide
           alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
           initiate the formation of mucin-type, O-linked glycans
           by catalyzing the transfer of
           alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
           hydroxyl groups of Ser or Thr residues of core proteins
           to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
           enzymes are type II membrane proteins with a GT-A type
           catalytic domain and a lectin domain located on the
           lumen side of the Golgi apparatus. In human, there are
           15 isozymes of pp-GalNAc-Ts, representing the largest of
           all glycosyltransferase families. Each isozyme has
           unique but partially redundant substrate specificity for
           glycosylation sites on acceptor proteins.
          Length = 299

 Score =  290 bits (745), Expect = 3e-96
 Identities = 96/183 (52%), Positives = 130/183 (71%), Gaps = 4/183 (2%)

Query: 1   DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
           +L   LE+Y +++  KV+++R  +REGLIR R  GA+ + G+V+VFLD+HCEV + WL P
Sbjct: 44  ELKLLLEEYYKKYLPKVKVLRLKKREGLIRARIAGARAATGDVLVFLDSHCEVNVGWLEP 103

Query: 61  LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
           LLA I  +RK +  P+ID ID  T+E+R         RG F+W + +K   LPE E ++R
Sbjct: 104 LLARIAENRKTVVCPIIDVIDADTFEYRGSSGDA---RGGFDWSLHFKWLPLPEEE-RRR 159

Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
           +  + P +SPT AGGLFA+DR +FLELGGYD G+ +WGGEN ELSFK+W CGGSIE VPC
Sbjct: 160 ESPTAPIRSPTMAGGLFAIDREWFLELGGYDEGMDIWGGENLELSFKVWQCGGSIEIVPC 219

Query: 181 SRI 183
           SR+
Sbjct: 220 SRV 222



 Score =  240 bits (615), Expect = 1e-76
 Identities = 81/159 (50%), Positives = 111/159 (69%), Gaps = 3/159 (1%)

Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
           SVI++FHNE  S+L+RTVHS+I RTP + L+EIILVDDFS K +L   LE+Y +++  KV
Sbjct: 1   SVIIIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKV 60

Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 391
           +++R  +REGLIR R  GA+ + G+V+VFLD+HCEV + WL PLLA I  +RK +  P+I
Sbjct: 61  KVLRLKKREGLIRARIAGARAATGDVLVFLDSHCEVNVGWLEPLLARIAENRKTVVCPII 120

Query: 392 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
           D ID  T+E+R         RG F+W + +K   LPE E
Sbjct: 121 DVIDADTFEYRGSSGDA---RGGFDWSLHFKWLPLPEEE 156


>gnl|CDD|215980 pfam00535, Glycos_transf_2, Glycosyl transferase family 2.  Diverse
           family, transferring sugar from UDP-glucose,
           UDP-N-acetyl- galactosamine, GDP-mannose or
           CDP-abequose, to a range of substrates including
           cellulose, dolichol phosphate and teichoic acids.
          Length = 168

 Score = 93.7 bits (233), Expect = 1e-22
 Identities = 41/155 (26%), Positives = 68/155 (43%), Gaps = 5/155 (3%)

Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
           SVI+  +NE    L  T+ S++ +T   +  EII+VDD  S     +  E+Y +  + +V
Sbjct: 1   SVIIPTYNEE-KYLEETLESLLNQTYKNF--EIIVVDD-GSTDGTVEIAEEYAKN-DPRV 55

Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 391
           R+IR  E  G    R+ G K + G+ I FLDA  EV  +WL  L+  +  +   + +   
Sbjct: 56  RVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVAPDWLEKLVELLEKNGADIVIGSR 115

Query: 392 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 426
             I+ +T  +      +                 L
Sbjct: 116 VVINGETRLYGRALRFELLLLLGKLGARSLGLKVL 150



 Score = 67.1 bits (164), Expect = 3e-13
 Identities = 26/139 (18%), Positives = 52/139 (37%), Gaps = 15/139 (10%)

Query: 6   LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
            E+Y +  + +VR+IR  E  G    R+ G K + G+ I FLDA  EV  +WL  L+  +
Sbjct: 45  AEEYAKN-DPRVRVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVAPDWLEKLVELL 103

Query: 66  YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 125
             +   + +     I+ +T  +                 + ++   L  +   +      
Sbjct: 104 EKNGADIVIGSRVVINGETRLYG--------------RALRFELLLLLGKLGARSLGLKV 149

Query: 126 PYKSPTHAGGLFAMDRAFF 144
            +   ++A     +     
Sbjct: 150 LFLIGSNALYRREVLEELL 168


>gnl|CDD|132997 cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A)
           includes diverse families of glycosyl transferases with
           a common GT-A type structural fold.
           Glycosyltransferases (GTs) are enzymes that synthesize
           oligosaccharides, polysaccharides, and glycoconjugates
           by transferring the sugar moiety from an activated
           nucleotide-sugar donor to an acceptor molecule, which
           may be a growing oligosaccharide, a lipid, or a protein.
            Based on the stereochemistry of the donor and acceptor
           molecules, GTs are classified as either retaining or
           inverting enzymes. To date, all GT structures adopt one
           of two possible folds, termed GT-A fold and GT-B fold.
           This hierarchy includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. The majority of the proteins in this
           superfamily are Glycosyltransferase family 2 (GT-2)
           proteins. But it also includes families GT-43, GT-6,
           GT-8, GT13 and GT-7; which are evolutionarily related to
           GT-2 and share structure similarities.
          Length = 156

 Score = 64.4 bits (157), Expect = 2e-12
 Identities = 39/113 (34%), Positives = 65/113 (57%), Gaps = 5/113 (4%)

Query: 273 VILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVR 332
           VI+  +NE    L R + S++ +T   +  E+I+VDD S+   L + LE+Y ++   +V 
Sbjct: 1   VIIPAYNEE-PYLERCLESLLAQTYPNF--EVIVVDDGSTDGTL-EILEEYAKKDP-RVI 55

Query: 333 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
            + N E +GL   R+ G K +RGE I+FLDA   +  +WL  L+A + +D + 
Sbjct: 56  RVINEENQGLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEA 108



 Score = 44.8 bits (106), Expect = 1e-05
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 4   KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
           + LE+Y ++   +V  + N E +GL   R+ G K +RGE I+FLDA   +  +WL  L+A
Sbjct: 42  EILEEYAKKDP-RVIRVINEENQGLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVA 100

Query: 64  PIYSDRKI 71
            + +D + 
Sbjct: 101 ELLADPEA 108


>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
           biosynthesis of succinoglycan.  Succinoglycan
           Biosynthesis Protein ExoA catalyzes the formation of a
           beta-1,3 linkage of the second sugar (glucose) of the
           succinoglycan with the galactose on the lipid carrie.
           Succinoglycan is an acidic exopolysaccharide that is
           important for invasion of the nodules. Succinoglycan is
           a high-molecular-weight polymer composed of repeating
           octasaccharide units. These units are synthesized on
           membrane-bound isoprenoid lipid carriers, beginning with
           galactose followed by seven glucose molecules, and
           modified by the addition of acetate, succinate, and
           pyruvate. ExoA is a membrane protein with a
           transmembrance domain at c-terminus.
          Length = 249

 Score = 58.0 bits (141), Expect = 1e-09
 Identities = 26/105 (24%), Positives = 55/105 (52%), Gaps = 8/105 (7%)

Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
             S+I+   NE    +   + S++ ++  + L EII+VD  S+    +  +++Y  +   
Sbjct: 1   FVSIIIPVRNEE-KYIEELLESLLNQSYPKDLIEIIVVDGGSTDGTRE-IVQEYAAKDP- 57

Query: 330 KVRLIRNTEREGLIRT--RSRGAKESRGEVIVFLDAHCEVGLNWL 372
           ++RLI N      I++   + G + SRG++I+ +DAH     +++
Sbjct: 58  RIRLIDN---PKRIQSAGLNIGIRNSRGDIIIRVDAHAVYPKDYI 99



 Score = 44.9 bits (107), Expect = 3e-05
 Identities = 38/195 (19%), Positives = 74/195 (37%), Gaps = 35/195 (17%)

Query: 6   LEDYIERFNGKVRLIRNTEREGLIRT--RSRGAKESRGEVIVFLDAHCEVGLNWLPPLL- 62
           +++Y  +   ++RLI N      I++   + G + SRG++I+ +DAH     +++  L+ 
Sbjct: 49  VQEYAAKDP-RIRLIDN---PKRIQSAGLNIGIRNSRGDIIIRVDAHAVYPKDYILELVE 104

Query: 63  APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 122
           A   +    +  P+      ++   +++        G                      Y
Sbjct: 105 ALKRTGADNVGGPME--TIGESKFQKAIAVAQSSPLGS-----------------GGSAY 145

Query: 123 NSEPYK----SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
                K       H G   A  R  F ++GG+D  L+    E+ EL++++   G  I   
Sbjct: 146 RGGAVKIGYVDTVHHG---AYRREVFEKVGGFDESLVR--NEDAELNYRLRKAGYKIWLS 200

Query: 179 PCSRIVSLIRPVFKA 193
           P  R+    R   K 
Sbjct: 201 PDIRVYYYPRSTLKK 215


>gnl|CDD|223539 COG0463, WcaA, Glycosyltransferases involved in cell wall
           biogenesis [Cell envelope biogenesis, outer membrane].
          Length = 291

 Score = 55.1 bits (131), Expect = 2e-08
 Identities = 41/165 (24%), Positives = 71/165 (43%), Gaps = 6/165 (3%)

Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
            +PK SV++  +NE    L   + S++ +T   +  EII+VDD S+     +   +Y  +
Sbjct: 1   MMPKVSVVIPTYNEE-EYLPEALESLLNQTYKDF--EIIVVDDGSTD-GTTEIAIEYGAK 56

Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
            + +V  + N    GL   R+ G + +RG+ IVFLDA  +     L PL+A       I 
Sbjct: 57  -DVRVIRLINERNGGLGAARNAGLEYARGDYIVFLDAD-DQHPPELIPLVAAGGDGDYIA 114

Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 431
            +   D I         + +   +          + +  L  R+A
Sbjct: 115 RLDDRDDIWLPRKLLSKLVKLLGNRLLGVLIPDGFGDLRLLVRDA 159



 Score = 33.1 bits (74), Expect = 0.24
 Identities = 24/102 (23%), Positives = 39/102 (38%), Gaps = 1/102 (0%)

Query: 16  KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
           +V  + N    GL   R+ G + +RG+ IVFLDA  +     L PL+A       I  + 
Sbjct: 59  RVIRLINERNGGLGAARNAGLEYARGDYIVFLDAD-DQHPPELIPLVAAGGDGDYIARLD 117

Query: 76  VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 117
             D I         + +   +          + +  L  R+A
Sbjct: 118 DRDDIWLPRKLLSKLVKLLGNRLLGVLIPDGFGDLRLLVRDA 159


>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
           wall biogenesis [Cell envelope biogenesis, outer
           membrane].
          Length = 439

 Score = 55.7 bits (134), Expect = 2e-08
 Identities = 36/127 (28%), Positives = 59/127 (46%), Gaps = 4/127 (3%)

Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
           D    LPK SVI+  +NE    L  T+ S++ +   +Y  E+I+VDD  S  +  + LE+
Sbjct: 48  DADKLLPKVSVIIPAYNEEPEVLEETLESLLSQDYPRY--EVIVVDD-GSTDETYEILEE 104

Query: 323 YIQRFNGKVRLIR-NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
               +    R+I    +  G     + G K ++G+V+V LDA      + L  L++P   
Sbjct: 105 LGAEYGPNFRVIYPEKKNGGKAGALNNGLKRAKGDVVVILDADTVPEPDALRELVSPFED 164

Query: 382 DRKIMTV 388
                 V
Sbjct: 165 PPVGAVV 171



 Score = 42.6 bits (100), Expect = 3e-04
 Identities = 33/188 (17%), Positives = 63/188 (33%), Gaps = 27/188 (14%)

Query: 4   KKLEDYIERFNGKVRLIR-NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 62
           + LE+    +    R+I    +  G     + G K ++G+V+V LDA      + L  L+
Sbjct: 100 EILEELGAEYGPNFRVIYPEKKNGGKAGALNNGLKRAKGDVVVILDADTVPEPDALRELV 159

Query: 63  APIYSDRKIMTV--PVID-GIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPEREAK 118
           +P         V  P I    D      R    E    +                     
Sbjct: 160 SPFEDPPVGAVVGTPRIRNRPDPSNLLGRIQAIEYLSAFYFRLRAASKGG---------- 209

Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
                         +G   A  R+   E+GG+    +    E+ +L+ ++ + G  + +V
Sbjct: 210 ---------LISFLSGSSSAFRRSALEEVGGWLEDTI---TEDADLTLRLHLRGYRVVYV 257

Query: 179 PCSRIVSL 186
           P + + + 
Sbjct: 258 PEAIVWTE 265


>gnl|CDD|133045 cd06423, CESA_like, CESA_like is  the cellulose synthase
           superfamily.  The cellulose synthase (CESA) superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains. The
           members include cellulose synthase catalytic subunit,
           chitin synthase, glucan biosynthesis protein and other
           families of CESA-like proteins. Cellulose synthase
           catalyzes the polymerization reaction of cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues in  plants, most algae, some bacteria
           and fungi, and even some animals. In bacteria, algae and
           lower eukaryotes, there is a second unrelated type of
           cellulose synthase (Type II), which produces acylated
           cellulose, a derivative of cellulose. Chitin synthase
           catalyzes the incorporation of GlcNAc from substrate
           UDP-GlcNAc into chitin, which is a linear homopolymer of
           beta-(1,4)-linked GlcNAc residues and Glucan
           Biosynthesis protein catalyzes the elongation of
           beta-1,2 polyglucose chains of Glucan.
          Length = 180

 Score = 51.5 bits (124), Expect = 9e-08
 Identities = 30/113 (26%), Positives = 61/113 (53%), Gaps = 4/113 (3%)

Query: 273 VILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVR 332
           +I+  +NE  + + RT+ S++     +   E+I+VDD  S  D  + LE+    +  +V 
Sbjct: 1   IIVPAYNEE-AVIERTIESLLALDYPKL--EVIVVDD-GSTDDTLEILEELAALYIRRVL 56

Query: 333 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
           ++R+ E  G     + G + ++G+++V LDA   +  + L  L+ P ++D K+
Sbjct: 57  VVRDKENGGKAGALNAGLRHAKGDIVVVLDADTILEPDALKRLVVPFFADPKV 109



 Score = 29.9 bits (68), Expect = 1.6
 Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 8   DYIERFNGKVRLIRNTEREGLIRTRSR----GAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
           + +E             R+     ++     G + ++G+++V LDA   +  + L  L+ 
Sbjct: 42  EILEELAALYIRRVLVVRDKENGGKAGALNAGLRHAKGDIVVVLDADTILEPDALKRLVV 101

Query: 64  PIYSDRKI 71
           P ++D K+
Sbjct: 102 PFFADPKV 109


>gnl|CDD|234419 TIGR03965, mycofact_glyco, mycofactocin system glycosyltransferase.
            Members of this protein family are putative
           glycosyltransferases, members of pfam00535 (glycosyl
           transferase family 2). Members appear mostly in the
           Actinobacteria, where they appear to be part of a system
           for converting a precursor peptide (TIGR03969) into a
           novel redox carrier designated mycofactocin. A radical
           SAM enzyme, TIGR03962, is a proposed to be a key
           maturase for mycofactocin.
          Length = 467

 Score = 52.5 bits (126), Expect = 3e-07
 Identities = 45/169 (26%), Positives = 70/169 (41%), Gaps = 21/169 (12%)

Query: 17  VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
           VR+IR+  R+G    R+ GA+ +R E + F D+       WL  LLA        +  P 
Sbjct: 131 VRVIRHPRRQGPAAARNAGARAARTEFVAFTDSDVVPRPGWLRALLAHFDDPGVALVAPR 190

Query: 77  IDGID-YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
           +  +    T   R  YE     R   + G           EA  R      Y  P+ A  
Sbjct: 191 VVALPAEDTRLAR--YE---AVRSSLDLG---------PEEAVVRPRGPVSY-VPSAA-- 233

Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
              + R   LE+GG+D  L V  GE+ +L +++   GG + + P + + 
Sbjct: 234 -LLVRRRALLEVGGFDERLEV--GEDVDLCWRLCEAGGRVRYEPAAVVA 279



 Score = 49.8 bits (119), Expect = 2e-06
 Identities = 40/159 (25%), Positives = 65/159 (40%), Gaps = 10/159 (6%)

Query: 221 PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNE 280
           P A R  G    G   +  ETS   +  R + D  +   +        P  +V++   N 
Sbjct: 29  PAAARLLGGGRGGGLEVRDETS--AALLRRLLDSGVVHPR-PALLPSPPSVTVVVPVRNR 85

Query: 281 GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG-KVRLIRNTER 339
             + L R + +++     +   E+I+VDD S        +     R     VR+IR+  R
Sbjct: 86  P-AGLARLLAALLALDYPRDRLEVIVVDDGSEDP-----VPTRAARGARLPVRVIRHPRR 139

Query: 340 EGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
           +G    R+ GA+ +R E + F D+       WL  LLA 
Sbjct: 140 QGPAAARNAGARAARTEFVAFTDSDVVPRPGWLRALLAH 178


>gnl|CDD|133029 cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 166

 Score = 49.1 bits (118), Expect = 6e-07
 Identities = 28/113 (24%), Positives = 55/113 (48%), Gaps = 8/113 (7%)

Query: 273 VILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVR 332
           +I+V +N     L   + S++ +T   +  E+I+VD+ S+   ++      ++    +VR
Sbjct: 1   IIIVNYN-SLEYLKACLDSLLAQTYPDF--EVIVVDNASTDGSVE-----LLRELFPEVR 52

Query: 333 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
           LIRN E  G     ++G +E++G+ ++ L+    V    L  LL     D  +
Sbjct: 53  LIRNGENLGFGAGNNQGIREAKGDYVLLLNPDTVVEPGALLELLDAAEQDPDV 105



 Score = 41.0 bits (97), Expect = 3e-04
 Identities = 32/177 (18%), Positives = 60/177 (33%), Gaps = 55/177 (31%)

Query: 8   DYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 67
           + +     +VRLIRN E  G     ++G +E++G+ ++ L+    V    L  LL     
Sbjct: 42  ELLRELFPEVRLIRNGENLGFGAGNNQGIREAKGDYVLLLNPDTVVEPGALLELLDAAEQ 101

Query: 68  DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 127
           D                                 + G++                     
Sbjct: 102 DP--------------------------------DVGIV--------------------- 108

Query: 128 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
             P  +G    + R  F E+GG+D    ++  E+ +L  +  + G  + +VP + I 
Sbjct: 109 -GPKVSGAFLLVRREVFEEVGGFDEDFFLYY-EDVDLCLRARLAGYRVLYVPQAVIY 163


>gnl|CDD|133035 cd04192, GT_2_like_e, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 229

 Score = 49.2 bits (118), Expect = 9e-07
 Identities = 30/116 (25%), Positives = 50/116 (43%), Gaps = 18/116 (15%)

Query: 273 VILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK-- 330
           V++   NE   +L R + S+      +   E+ILVDD S+    D      ++    K  
Sbjct: 1   VVIAARNEA-ENLPRLLQSLSALDYPKEKFEVILVDDHST----DGT-VQILEFAAAKPN 54

Query: 331 --VRLIRNTEREG-----LIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
             ++++ N+          + T     K ++G+ IV  DA C V  NWL   +A I
Sbjct: 55  FQLKILNNSRVSISGKKNALTT---AIKAAKGDWIVTTDADCVVPSNWLLTFVAFI 107



 Score = 33.4 bits (77), Expect = 0.15
 Identities = 13/31 (41%), Positives = 17/31 (54%)

Query: 35  GAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
             K ++G+ IV  DA C V  NWL   +A I
Sbjct: 77  AIKAAKGDWIVTTDADCVVPSNWLLTFVAFI 107


>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 278

 Score = 48.9 bits (117), Expect = 2e-06
 Identities = 34/171 (19%), Positives = 64/171 (37%), Gaps = 25/171 (14%)

Query: 3   DKKLEDYIERFNGKVRLIRNTERE--GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
            K+L+  I +      +   T ++     + R+RGA+ S  + I FLD  C +  + L  
Sbjct: 49  AKELKKIIAKNGAINYISHKTHKDLFSPAKARNRGAEYSSSDFIFFLDVDCLISPDTLEK 108

Query: 61  LLAPIYSDRKI----MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 116
           ++      +      + +P +      +  F S ++       I E              
Sbjct: 109 IIKHFQELQTNPNAFLALPCLYLSKEGSEIFLSDFKYLLREE-ILE-------------- 153

Query: 117 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 167
                  +        A     ++R FFL++GG+D      GGE+FEL ++
Sbjct: 154 ----DAITGKSTFFALASSCILINRDFFLKIGGFDENFRGHGGEDFELLYR 200



 Score = 41.6 bits (98), Expect = 5e-04
 Identities = 25/100 (25%), Positives = 47/100 (47%), Gaps = 7/100 (7%)

Query: 272 SVIL-VFHNEGFSSLMRTVHSIIKRT---PAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
           SVI+ V+ +     +++ + S + R     +    EII+VD  S  +   ++L+  I + 
Sbjct: 1   SVIIPVYLSLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSF-AKELKKIIAKN 59

Query: 328 NGKVRLIRNTERE--GLIRTRSRGAKESRGEVIVFLDAHC 365
                +   T ++     + R+RGA+ S  + I FLD  C
Sbjct: 60  GAINYISHKTHKDLFSPAKARNRGAEYSSSDFIFFLDVDC 99


>gnl|CDD|217196 pfam02709, Glyco_transf_7C, N-terminal domain of
           galactosyltransferase.  This is the N-terminal domain of
           a family of galactosyltransferases from a wide range of
           Metazoa with three related galactosyltransferases
           activities, all three of which are possessed by one
           sequence in some cases. EC:2.4.1.90, N-acetyllactosamine
           synthase; EC:2.4.1.38,
           Beta-N-acetylglucosaminyl-glycopeptide beta-1,4-
           galactosyltransferase; and EC:2.4.1.22 Lactose synthase.
           Note that N-acetyllactosamine synthase is a component of
           Lactose synthase along with alpha-lactalbumin, in the
           absence of alpha-lactalbumin EC:2.4.1.90 is the
           catalyzed reaction.
          Length = 78

 Score = 44.1 bits (105), Expect = 4e-06
 Identities = 14/58 (24%), Positives = 26/58 (44%)

Query: 127 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
                + GG+ A  +  FL++ G+      WGGE+ +L  ++ + G  IE    +   
Sbjct: 14  LPYKGYFGGVLAFSKEDFLKVNGFSNNFWGWGGEDDDLYARLLLAGLKIERPKFAIGR 71


>gnl|CDD|222281 pfam13641, Glyco_tranf_2_3, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 229

 Score = 47.3 bits (113), Expect = 4e-06
 Identities = 34/171 (19%), Positives = 60/171 (35%), Gaps = 28/171 (16%)

Query: 14  NGKVRLIRNTEREGLIRTRSRGAKE----SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 69
           + +V ++R     G    ++R   E     + +++V LDA   V  + L  LL    S  
Sbjct: 57  DVRVHVVRRPRPPG-PTGKARALNEALRAIKSDLVVLLDADSVVDPDTLRRLLPFFLSKG 115

Query: 70  KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML-YKENELPEREAKKRKYNSEPYK 128
                  +  +  Q   F       +    +     L +    L     ++         
Sbjct: 116 -------VGAV--QGPVF-----VLNLRTAVAPLYALEFALRHLRFMALRRA------LG 155

Query: 129 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
               AG      R+   E+GG+DPG L+   E+ EL  ++   G    +VP
Sbjct: 156 VAPLAGSGSLFRRSVLEEIGGFDPGFLLG--EDKELGLRLRRAGWRTAYVP 204



 Score = 33.1 bits (76), Expect = 0.18
 Identities = 30/120 (25%), Positives = 53/120 (44%), Gaps = 10/120 (8%)

Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF- 327
           P+ +V++   NE    L RT+ SI+ +   +   E+I+V D S    LD    +    + 
Sbjct: 1   PRVAVVVPTWNED-KVLGRTLRSILAQDYPRL--EVIVVVDGSDDETLD-VARELAAAYP 56

Query: 328 NGKVRLIRNTEREGLIRTRSRGAKE----SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
           + +V ++R     G    ++R   E     + +++V LDA   V  + L  LL    S  
Sbjct: 57  DVRVHVVRRPRPPG-PTGKARALNEALRAIKSDLVVLLDADSVVDPDTLRRLLPFFLSKG 115


>gnl|CDD|133022 cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member
           of the Glycosyltransferase 2 superfamily.  DPM1 is the
           catalytic subunit of eukaryotic dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. In higher eukaryotes,the enzyme has three
           subunits, DPM1, DPM2 and DPM3. DPM is synthesized from
           dolichol phosphate and GDP-Man on the cytosolic surface
           of the ER membrane by DPM synthase and then is flipped
           onto the luminal side and used as a donor substrate. In
           lower eukaryotes, such as Saccharomyces cerevisiae and
           Trypanosoma brucei, DPM synthase consists of a single
           component (Dpm1p and TbDpm1, respectively) that
           possesses one predicted transmembrane region near the C
           terminus for anchoring to the ER membrane. In contrast,
           the Dpm1 homologues of higher eukaryotes, namely fission
           yeast, fungi, and animals, have no transmembrane region,
           suggesting the existence of adapter molecules for
           membrane anchoring. This family also includes bacteria
           and archaea DPM1_like enzymes. However, the enzyme
           structure and mechanism of function are not well
           understood. The UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate. This protein family belongs to
           Glycosyltransferase 2 superfamily.
          Length = 185

 Score = 46.0 bits (110), Expect = 7e-06
 Identities = 26/94 (27%), Positives = 45/94 (47%), Gaps = 9/94 (9%)

Query: 273 VILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD---QKLEDYIQRFNG 329
           V++  +NE   ++   V  ++      Y  EII+VDD S+    D   +   +   R   
Sbjct: 1   VVIPAYNEE-ENIPELVERLLAVLEEGYDYEIIVVDDGST----DGTAEIARELAAR-VP 54

Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
           +VR+IR +   G       G K +RG+++V +DA
Sbjct: 55  RVRVIRLSRNFGKGAAVRAGFKAARGDIVVTMDA 88



 Score = 31.0 bits (71), Expect = 0.67
 Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 4  KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 49
          +   +   R   +VR+IR +   G       G K +RG+++V +DA
Sbjct: 44 EIARELAAR-VPRVRVIRLSRNFGKGAAVRAGFKAARGDIVVTMDA 88


>gnl|CDD|224137 COG1216, COG1216, Predicted glycosyltransferases [General function
           prediction only].
          Length = 305

 Score = 46.3 bits (110), Expect = 1e-05
 Identities = 37/149 (24%), Positives = 59/149 (39%), Gaps = 10/149 (6%)

Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
           PK S+I+V +N G   L+  + S+  +T     + I++VD+ S+    D  LE    RF 
Sbjct: 3   PKISIIIVTYNRG-EDLVECLASLAAQTY--PDDVIVVVDNGST----DGSLEALKARFF 55

Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEV---IVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
             VRLI N E  G     +RG K +  +    ++ L+    V  + L  LL     D   
Sbjct: 56  PNVRLIENGENLGFAGGFNRGIKYALAKGDDYVLLLNPDTVVEPDLLEELLKAAEEDPAA 115

Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
             V  +     ++              G 
Sbjct: 116 GVVGPLIRNYDESLYIDRRGGESDGLTGG 144



 Score = 45.9 bits (109), Expect = 2e-05
 Identities = 41/185 (22%), Positives = 66/185 (35%), Gaps = 13/185 (7%)

Query: 3   DKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEV---IVFLDAHCEVGLNWLP 59
           D  LE    RF   VRLI N E  G     +RG K +  +    ++ L+    V  + L 
Sbjct: 44  DGSLEALKARFFPNVRLIENGENLGFAGGFNRGIKYALAKGDDYVLLLNPDTVVEPDLLE 103

Query: 60  PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
            LL     D     V  +     ++             RG    G+       P  E   
Sbjct: 104 ELLKAAEEDPAAGVVGPLIRNYDESLYI--------DRRGGESDGLTGGWRASPLLE-IA 154

Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
              +S      + +G    + R  F ++GG+D    ++  E+ +L  +    G  I +VP
Sbjct: 155 PDLSSYLEVVASLSGACLLIRREAFEKVGGFDERFFIY-YEDVDLCLRARKAGYKIYYVP 213

Query: 180 CSRIV 184
            + I 
Sbjct: 214 DAIIY 218


>gnl|CDD|133031 cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked
           glycosylation.  UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate.
          Length = 211

 Score = 45.3 bits (108), Expect = 2e-05
 Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 1/61 (1%)

Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
           EII+VDD  SK    +      ++    +R++   +  G       G   +RG+ I+F D
Sbjct: 32  EIIVVDD-GSKDGTAEVARKLARKNPALIRVLTLPKNRGKGGAVRAGMLAARGDYILFAD 90

Query: 363 A 363
           A
Sbjct: 91  A 91



 Score = 33.7 bits (78), Expect = 0.11
 Identities = 9/46 (19%), Positives = 18/46 (39%)

Query: 4  KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 49
          +       +    +R++   +  G       G   +RG+ I+F DA
Sbjct: 46 EVARKLARKNPALIRVLTLPKNRGKGGAVRAGMLAARGDYILFADA 91


>gnl|CDD|133062 cd06442, DPM1_like, DPM1_like represents putative enzymes similar
           to eukaryotic DPM1.  Proteins similar to eukaryotic
           DPM1, including enzymes from bacteria and archaea; DPM1
           is the catalytic subunit of eukaryotic
           dolichol-phosphate mannose (DPM) synthase. DPM synthase
           is required for synthesis of the
           glycosylphosphatidylinositol (GPI) anchor, N-glycan
           precursor, protein O-mannose, and C-mannose. In higher
           eukaryotes,the enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. In lower eukaryotes, such
           as Saccharomyces cerevisiae and Trypanosoma brucei, DPM
           synthase consists of a single component (Dpm1p and
           TbDpm1, respectively) that possesses one predicted
           transmembrane region near the C terminus for anchoring
           to the ER membrane. In contrast, the Dpm1 homologues of
           higher eukaryotes, namely fission yeast, fungi, and
           animals, have no transmembrane region, suggesting the
           existence of adapter molecules for membrane anchoring.
           This family also includes bacteria and archaea DPM1_like
           enzymes. However, the enzyme structure and mechanism of
           function are not well understood. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 224

 Score = 41.0 bits (97), Expect = 5e-04
 Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 2/61 (3%)

Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
           EII+VDD S   D   ++   + +   +VRLI    + GL      G K +RG+VIV +D
Sbjct: 29  EIIVVDDNS--PDGTAEIVRELAKEYPRVRLIVRPGKRGLGSAYIEGFKAARGDVIVVMD 86

Query: 363 A 363
           A
Sbjct: 87  A 87



 Score = 32.9 bits (76), Expect = 0.24
 Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 14 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 49
            +VRLI    + GL      G K +RG+VIV +DA
Sbjct: 52 YPRVRLIVRPGKRGLGSAYIEGFKAARGDVIVVMDA 87


>gnl|CDD|133027 cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins
           are required for O-antigen biosynthesis.  The rfbC gene
           encodes a predicted protein of 1,276 amino acids, which
           is required for O-antigen biosynthesis in Myxococcus
           xanthus. It is a subfamily of Glycosyltransferase Family
           GT2, which includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. These are enzymes that catalyze the
           transfer of sugar moieties from activated donor
           molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 202

 Score = 38.3 bits (90), Expect = 0.003
 Identities = 19/96 (19%), Positives = 44/96 (45%), Gaps = 7/96 (7%)

Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE--EIILVDDFSSKADLDQKLEDYIQR 326
           P  S+++  +N     L   + S+     AQ     E+ + DD S+  ++ + L+ Y  +
Sbjct: 1   PLISIVMPVYNTPEKYLREAIESVR----AQTYPNWELCIADDASTDPEVKRVLKKYAAQ 56

Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
            + +++++   E  G+    +   + + GE +  LD
Sbjct: 57  -DPRIKVVFREENGGISAATNSALELATGEFVALLD 91


>gnl|CDD|133061 cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose
           synthase (CESA) superfamily.  This is a subfamily of
           cellulose synthase (CESA) superfamily.  CESA superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains.  The
           members of the superfamily include cellulose synthase
           catalytic subunit, chitin synthase, glucan biosynthesis
           protein and other families of CESA-like proteins.
          Length = 251

 Score = 38.7 bits (91), Expect = 0.003
 Identities = 28/105 (26%), Positives = 45/105 (42%), Gaps = 23/105 (21%)

Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRT---------PAQYLEEIILVDDFSSKADLDQ 318
           LP  ++I+  +NE          ++I+           P   LE II+V D  S     +
Sbjct: 28  LPTVTIIIPAYNE---------EAVIEAKLENLLALDYPRDRLE-IIVVSD-GSTDGTAE 76

Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
              +Y  +    V+L+R  ER G     +R    + GE++VF DA
Sbjct: 77  IAREYADK---GVKLLRFPERRGKAAALNRALALATGEIVVFTDA 118



 Score = 31.4 bits (72), Expect = 0.73
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 15  GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 49
             V+L+R  ER G     +R    + GE++VF DA
Sbjct: 84  KGVKLLRFPERRGKAAALNRALALATGEIVVFTDA 118


>gnl|CDD|133030 cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to
           eukaryotic DPM1.  A family of  bacterial enzymes related
           to eukaryotic DPM1; Although the mechanism of eukaryotic
           enzyme is well studied, the mechanism of the  bacterial
           enzymes is not well understood. The eukaryotic DPM1 is
           the catalytic subunit of eukaryotic Dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. The enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 181

 Score = 37.8 bits (89), Expect = 0.004
 Identities = 23/96 (23%), Positives = 45/96 (46%), Gaps = 12/96 (12%)

Query: 273 VILVFHNE--GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD---QKLEDYIQRF 327
           +++  +NE      L   + ++++     Y  EII VDD S+    D   + L +   R 
Sbjct: 1   IVVPVYNEEENLPELYERLKAVLESLGYDY--EIIFVDDGST----DRTLEILRELAAR- 53

Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
           + +V++IR +   G       G   +RG+ ++ +DA
Sbjct: 54  DPRVKVIRLSRNFGQQAALLAGLDHARGDAVITMDA 89


>gnl|CDD|133039 cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 214

 Score = 37.6 bits (88), Expect = 0.005
 Identities = 16/60 (26%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
           E+I+ DD S+   ++  +++YI +    + LIRN +  G+ R      + + G+ + F D
Sbjct: 29  ELIISDDGSTDGTVEI-IKEYIDKDPFIIILIRNGKNLGVARNFESLLQAADGDYVFFCD 87



 Score = 31.1 bits (71), Expect = 0.93
 Identities = 11/43 (25%), Positives = 23/43 (53%)

Query: 6  LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 48
          +++YI++    + LIRN +  G+ R      + + G+ + F D
Sbjct: 45 IKEYIDKDPFIIILIRNGKNLGVARNFESLLQAADGDYVFFCD 87


>gnl|CDD|182223 PRK10073, PRK10073, putative glycosyl transferase; Provisional.
          Length = 328

 Score = 37.7 bits (88), Expect = 0.008
 Identities = 27/99 (27%), Positives = 48/99 (48%), Gaps = 14/99 (14%)

Query: 269 PKASVILVFHNEG--FSSLMRTVHSIIKRTPAQYLE--EIILVDDFSSKADLDQKLEDYI 324
           PK S+I+  +N G  F + M ++        AQ     EII+V+D S+   + +  + Y 
Sbjct: 6   PKLSIIIPLYNAGKDFRAFMESLI-------AQTWTALEIIIVNDGSTDNSV-EIAKHYA 57

Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
           + +   VRL+ +    G+   R+ G   + G+ + F DA
Sbjct: 58  ENY-PHVRLL-HQANAGVSVARNTGLAVATGKYVAFPDA 94


>gnl|CDD|133056 cd06434, GT2_HAS, Hyaluronan synthases catalyze polymerization of
           hyaluronan.  Hyaluronan synthases (HASs) are
           bi-functional glycosyltransferases that catalyze
           polymerization of hyaluronan. HASs transfer both GlcUA
           and GlcNAc in beta-(1,3) and beta-(1,4) linkages,
           respectively to the hyaluronan chain using UDP-GlcNAc
           and UDP-GlcUA as substrates. HA is made as a free
           glycan, not attached to a protein or lipid. HASs do not
           need a primer for HA synthesis; they initiate HA
           biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA,
           and Mg2+. Hyaluronan (HA) is a linear
           heteropolysaccharide composed of (1-3)-linked
           beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can
           be found in vertebrates and a few microbes and is
           typically on the cell surface or in the extracellular
           space, but is also found inside mammalian cells.
           Hyaluronan has several physiochemical and biological
           functions such as space filling, lubrication, and
           providing a hydrated matrix through which cells can
           migrate.
          Length = 235

 Score = 36.8 bits (86), Expect = 0.011
 Identities = 26/114 (22%), Positives = 49/114 (42%), Gaps = 9/114 (7%)

Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
           +VI+  ++E        + SI+++ P     EII+V D   +  L    +    ++ G  
Sbjct: 3   TVIIPVYDEDPDVFRECLRSILRQKPL----EIIVVTDGDDEPYLSILSQTV--KYGGIF 56

Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
             +      G  R  + G +    +++V LD+      N LP +L P + D K+
Sbjct: 57  --VITVPHPGKRRALAEGIRHVTTDIVVLLDSDTVWPPNALPEMLKP-FEDPKV 107


>gnl|CDD|133013 cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase
           family-2 subfamily with unknown function.
           Glycosyltransferase family 2 (GT-2) subfamily of unknown
           function. GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 221

 Score = 35.6 bits (83), Expect = 0.032
 Identities = 33/137 (24%), Positives = 50/137 (36%), Gaps = 35/137 (25%)

Query: 34  RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 93
            GA  +RG+ ++FL A   +  +W          D  I+     DG       FR  ++ 
Sbjct: 66  AGAAAARGDWLLFLHADTRLPPDW----------DAAIIETLRADG--AVAGAFRLRFDD 113

Query: 94  DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG--GLFAMDRAFFLELGGYD 151
                 + E G             + R +   PY      G  GLF + R  F ELGG+ 
Sbjct: 114 PGPRLRLLELGAN----------LRSRLFGL-PY------GDQGLF-IRRELFEELGGFP 155

Query: 152 PGLLVWGGENFELSFKI 168
              L    E+ EL  ++
Sbjct: 156 ELPL---MEDVELVRRL 169


>gnl|CDD|188532 TIGR04017, WcaA, colanic acid biosynthesis glycosyl transferase
           WcaA.  This gene is one of the glycosyl transferases
           involved in the biosynthesis of colanic acid, an
           exopolysaccharide expressed in Enterobacteraceae
           species.
          Length = 279

 Score = 33.9 bits (77), Expect = 0.13
 Identities = 24/93 (25%), Positives = 47/93 (50%), Gaps = 10/93 (10%)

Query: 286 MRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG-KVRLIRNTEREGLIR 344
           +R + S++++    +  E+I+VDD SS     ++L+ +++  N  +VR   N    G   
Sbjct: 21  IRAIKSVLRQDYDNW--EMIIVDDCSSSY---EQLQQFVEALNDPRVRYTHNDINSGACA 75

Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
            R++   +++GE I  +D   E    W P  L+
Sbjct: 76  VRNQAIMQAKGEYITGIDDDDE----WTPNRLS 104


>gnl|CDD|222273 pfam13632, Glyco_trans_2_3, Glycosyl transferase family group 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferases, which are involved in
           bacterial capsule biosynthesis.
          Length = 194

 Score = 32.7 bits (75), Expect = 0.23
 Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 3/51 (5%)

Query: 133 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
            G      R+   E+GG+D   +    E+F+ + ++   G  + +VP S +
Sbjct: 73  VGSGALFRRSALQEVGGFDESSV---SEDFDFALRLRRAGYRVRFVPESGV 120


>gnl|CDD|182197 PRK10018, PRK10018, putative glycosyl transferase; Provisional.
          Length = 279

 Score = 32.3 bits (73), Expect = 0.39
 Identities = 23/93 (24%), Positives = 46/93 (49%), Gaps = 10/93 (10%)

Query: 286 MRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG-KVRLIRNTEREGLIR 344
           +R + S++++  + +  E+I+VDD S+     ++L+ Y+   N  ++  I N    G   
Sbjct: 21  IRAIKSVLRQDYSNW--EMIIVDDCSTSW---EQLQQYVTALNDPRITYIHNDINSGACA 75

Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
            R++    ++GE I  +D   E    W P  L+
Sbjct: 76  VRNQAIMLAQGEYITGIDDDDE----WTPNRLS 104


>gnl|CDD|234532 TIGR04283, glyco_like_mftF, transferase 2,
           rSAM/selenodomain-associated.  This enzyme may transfer
           a nucleotide, or it sugar moiety, as part of a
           biosynthetic pathway. Other proposed members of the
           pathway include another transferase (TIGR04282), a
           phosphoesterase, and a radical SAM enzyme (TIGR04167)
           whose C-terminal domain (pfam12345) frequently contains
           a selenocysteine [Unknown function, Enzymes of unknown
           specificity].
          Length = 220

 Score = 31.7 bits (73), Expect = 0.61
 Identities = 28/142 (19%), Positives = 52/142 (36%), Gaps = 45/142 (31%)

Query: 34  RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDG-----IDYQTWEFR 88
            GA  ++G++++FL A   +  ++L  +   +     +       G      D      R
Sbjct: 65  AGAALAKGDILLFLHADTRLPKDFLEAIRRALAPPGAVA------GCFDLRFDGPGLLLR 118

Query: 89  SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG--GLFAMDRAFFLE 146
                      + EWG+             + +    PY      G  GLF + R+ F +
Sbjct: 119 -----------LIEWGV-----------NLRSRLTGIPY------GDQGLF-VRRSLFEQ 149

Query: 147 LGGYDPGLLVWGGENFELSFKI 168
           +GG+    L    E+ ELS ++
Sbjct: 150 IGGFPDIPL---MEDIELSRRL 168


>gnl|CDD|240336 PTZ00260, PTZ00260, dolichyl-phosphate beta-glucosyltransferase;
           Provisional.
          Length = 333

 Score = 31.7 bits (72), Expect = 0.70
 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 10/67 (14%)

Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGK------VRLIRNTEREGLIRTRSRGAKESRGE 356
           EII+V+D  SK    +  +D+ ++          + L+RN  + G +R    G   SRG+
Sbjct: 109 EIIIVND-GSKDKTLKVAKDFWRQNINPNIDIRLLSLLRNKGKGGAVR---IGMLASRGK 164

Query: 357 VIVFLDA 363
            I+ +DA
Sbjct: 165 YILMVDA 171


>gnl|CDD|226577 COG4092, COG4092, Predicted glycosyltransferase involved in capsule
           biosynthesis [Cell envelope biogenesis, outer membrane].
          Length = 346

 Score = 31.1 bits (70), Expect = 1.1
 Identities = 27/124 (21%), Positives = 46/124 (37%), Gaps = 13/124 (10%)

Query: 43  VIVFLDAHCEVGLNWLPPLL--APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 100
           +++FLD  C    +    +L  A I   R  +  P++  + +       V+         
Sbjct: 97  LVLFLDVDCFGSSDNFAKMLSIATIKKMRTNIDAPLVLPVYHLNKADTQVFFDVEDMF-- 154

Query: 101 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 160
                +  E+ L E   K+  +   PY        +F ++R  F   GGYD      G E
Sbjct: 155 --LDAMIFESPLAEFR-KEDNFFIAPYT------NIFLINRRMFSLTGGYDERFRGHGSE 205

Query: 161 NFEL 164
           +FE 
Sbjct: 206 DFEF 209


>gnl|CDD|237843 PRK14869, PRK14869, putative manganese-dependent inorganic
           pyrophosphatase; Provisional.
          Length = 546

 Score = 31.3 bits (72), Expect = 1.2
 Identities = 17/63 (26%), Positives = 26/63 (41%), Gaps = 21/63 (33%)

Query: 239 METSNHISF--DRTIPDLR--MEECKYWDYP-----------------LDLPKASVILVF 277
           M T + ++F  D  + D++  M + +Y  YP                 L   +  VILV 
Sbjct: 252 MTTEDLVTFSKDDYLEDVKEVMLKSRYRSYPVVDEDGKVVGVISRYHLLSPVRKKVILVD 311

Query: 278 HNE 280
           HNE
Sbjct: 312 HNE 314


>gnl|CDD|215385 PLN02726, PLN02726, dolichyl-phosphate beta-D-mannosyltransferase.
          Length = 243

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 5/97 (5%)

Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
            K S+I+  +NE  + +   V+ I K        EII+VDD S   D  Q +   +Q+  
Sbjct: 9   MKYSIIVPTYNERLN-IALIVYLIFKALQDVKDFEIIVVDDGSP--DGTQDVVKQLQKVY 65

Query: 329 G--KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
           G  ++ L     + GL      G K + G+ +V +DA
Sbjct: 66  GEDRILLRPRPGKLGLGTAYIHGLKHASGDFVVIMDA 102


>gnl|CDD|218633 pfam05553, DUF761, Cotton fibre expressed protein.  This family
           consists of several plant proteins of unknown function.
           Three of the sequences (from Gossypium hirsutum) in this
           family are described as cotton fibre expressed proteins.
           The remaining sequences, found in Arabidopsis thaliana,
           are uncharacterized.
          Length = 38

 Score = 26.7 bits (60), Expect = 2.3
 Identities = 8/22 (36%), Positives = 16/22 (72%)

Query: 312 SKADLDQKLEDYIQRFNGKVRL 333
           S  ++D++ E++I RF  ++RL
Sbjct: 1   SDDEVDRRAEEFIARFREQLRL 22



 Score = 26.4 bits (59), Expect = 3.7
 Identities = 7/19 (36%), Positives = 15/19 (78%)

Query: 1  DLDKKLEDYIERFNGKVRL 19
          ++D++ E++I RF  ++RL
Sbjct: 4  EVDRRAEEFIARFREQLRL 22


>gnl|CDD|133057 cd06435, CESA_NdvC_like, NdvC_like  proteins in this family are
           putative bacterial beta-(1,6)-glucosyltransferase.
           NdvC_like  proteins in this family are putative
           bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium
           japonicum synthesizes periplasmic cyclic
           beta-(1,3),beta-(1,6)-D-glucans during growth under
           hypoosmotic conditions. Two genes (ndvB, ndvC) are
           involved in the beta-(1, 3), beta-(1,6)-glucan
           synthesis. The ndvC mutant strain resulted in synthesis
           of altered cyclic beta-glucans composed almost entirely
           of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic
           beta-(1,3),beta-(1,6)-D-glucans function for
           osmoregulation. The ndvC mutation also affects the
           ability of the bacteria to establish a successful
           symbiotic interaction with host plant. Thus, the
           beta-glucans may function as suppressors of a host
           defense response.
          Length = 236

 Score = 29.3 bits (66), Expect = 3.0
 Identities = 30/123 (24%), Positives = 52/123 (42%), Gaps = 11/123 (8%)

Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREG-----LIRTRSRGAKESRGEV 357
           E+I++D+ +    L + +E +  +   + R        G     L     R A ++  E+
Sbjct: 30  EVIVIDNNTKDEALWKPVEAHCAQLGERFRFFHVEPLPGAKAGALNYALERTAPDA--EI 87

Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFE 416
           I  +DA  +V  +WL   L PI+ D ++  V      DY+  E           Y+G F+
Sbjct: 88  IAVIDADYQVEPDWLKR-LVPIFDDPRVGFVQAPQ--DYRDGEESLFKRMCYAEYKGFFD 144

Query: 417 WGM 419
            GM
Sbjct: 145 IGM 147


>gnl|CDD|225979 COG3448, COG3448, CBS-domain-containing membrane protein [Signal
           transduction mechanisms].
          Length = 382

 Score = 29.8 bits (67), Expect = 3.3
 Identities = 25/97 (25%), Positives = 36/97 (37%), Gaps = 15/97 (15%)

Query: 262 WDYPLDLPKA--SVILV-----FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
           +++ L  P A  S+ILV     ++N    +      S                  FSS  
Sbjct: 149 YNFVL-WPVALNSLILVGLALLYNNLTRRTYPHHWLSPPANLHGTADPPPSQRVGFSS-E 206

Query: 315 DLDQKLEDY-----IQRFNGKVRLIRNTEREGLIRTR 346
           DLD  L+       I R +   RL+R TE + L R  
Sbjct: 207 DLDAALQRLGETLDIDR-DDLERLLRETELQALRRRM 242


>gnl|CDD|213239 cd03272, ABC_SMC3_euk, ATP-binding cassette domain of eukaryotic
           SMC3 proteins.  The structural maintenance of
           chromosomes (SMC) proteins are large (approximately 110
           to 170 kDa), and each is arranged into five recognizable
           domains. Amino-acid sequence homology of SMC proteins
           between species is largely confined to the amino- and
           carboxy-terminal globular domains. The amino-terminal
           domain contains a 'Walker A' nucleotide-binding domain
           (GxxGxGKS/T, in the single-letter amino-acid code),
           which by mutational studies has been shown to be
           essential in several proteins. The carboxy-terminal
           domain contains a sequence (the DA-box) that resembles a
           'Walker B' motif, and a motif with homology to the
           signature sequence of the ATP-binding cassette (ABC)
           family of ATPases. The sequence homology within the
           carboxy-terminal domain is relatively high within the
           SMC1-SMC4 group, whereas SMC5 and SMC6 show some
           divergence in both of these sequences. In eukaryotic
           cells, the proteins are found as heterodimers of SMC1
           paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
           (formerly known as Rad18).
          Length = 243

 Score = 29.2 bits (66), Expect = 3.8
 Identities = 12/51 (23%), Positives = 20/51 (39%)

Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
           A V ++F N      +      ++RT     +E  L     +K D+   LE
Sbjct: 74  AYVEIIFDNSDNRFPIDKEEVRLRRTIGLKKDEYFLDKKNVTKNDVMNLLE 124


>gnl|CDD|184755 PRK14583, hmsR, N-glycosyltransferase; Provisional.
          Length = 444

 Score = 28.8 bits (64), Expect = 5.8
 Identities = 31/137 (22%), Positives = 62/137 (45%), Gaps = 13/137 (9%)

Query: 260 KYWDYPLDL--------PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
           ++W +  D+        P  S+++   NEG  +   T+H+ + +T      E+I ++D S
Sbjct: 58  RHWPWGDDVPAPELKGHPLVSILVPCFNEG-LNARETIHAALAQTYTNI--EVIAINDGS 114

Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
           S         D +   + ++R+I     +G       GA  +R E +V +D    +  N 
Sbjct: 115 SDDTAQVL--DALLAEDPRLRVIHLAHNQGKAIALRMGAAAARSEYLVCIDGDALLDKNA 172

Query: 372 LPPLLAPIYSDRKIMTV 388
           +P L+AP+ ++ +   V
Sbjct: 173 VPYLVAPLIANPRTGAV 189


>gnl|CDD|133038 cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in
           exopolysaccharide amylovora biosynthesis.  AmsE is a
           glycosyltransferase involved in exopolysaccharide
           amylovora biosynthesis in Erwinia amylovora. Amylovara
           is one of the three exopolysaccharide produced by E.
           amylovora. Amylovara-deficient mutants are
           non-pathogenic. It is a subfamily of Glycosyltransferase
           Family GT2, which includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 201

 Score = 28.0 bits (63), Expect = 7.2
 Identities = 21/93 (22%), Positives = 46/93 (49%), Gaps = 5/93 (5%)

Query: 272 SVIL-VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
           SV++ V+  E    L   + SI+K+T     +E++LV D      L++ LE++ ++    
Sbjct: 1   SVLMSVYIKEKPEFLREALESILKQTLPP--DEVVLVKDGPVTQSLNEVLEEFKRKLP-- 56

Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
           ++++   +  GL +  + G K    + +  +D 
Sbjct: 57  LKVVPLEKNRGLGKALNEGLKHCTYDWVARMDT 89


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.141    0.440 

Gapped
Lambda     K      H
   0.267   0.0708    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 23,725,351
Number of extensions: 2386209
Number of successful extensions: 1964
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1936
Number of HSP's successfully gapped: 72
Length of query: 439
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 339
Effective length of database: 6,502,202
Effective search space: 2204246478
Effective search space used: 2204246478
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.9 bits)