RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10463
         (605 letters)



>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
           mucin-type O-linked glycans.  UDP-GalNAc: polypeptide
           alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
           initiate the formation of mucin-type, O-linked glycans
           by catalyzing the transfer of
           alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
           hydroxyl groups of Ser or Thr residues of core proteins
           to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
           enzymes are type II membrane proteins with a GT-A type
           catalytic domain and a lectin domain located on the
           lumen side of the Golgi apparatus. In human, there are
           15 isozymes of pp-GalNAc-Ts, representing the largest of
           all glycosyltransferase families. Each isozyme has
           unique but partially redundant substrate specificity for
           glycosylation sites on acceptor proteins.
          Length = 299

 Score =  453 bits (1167), Expect = e-157
 Identities = 165/303 (54%), Positives = 210/303 (69%), Gaps = 5/303 (1%)

Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 260
           S++I+FHNEA S LLRTVHSVI+R+P  +LKEI+LVDD S +  LK  L+EY  K     
Sbjct: 1   SVIIIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKV 60

Query: 261 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVI 320
           +V+R   R GLI+AR+ GAR A G++LVFLD+HCE  +GWLE L+AR+AE+R  VVCP+I
Sbjct: 61  KVLRLKKREGLIRARIAGARAATGDVLVFLDSHCEVNVGWLEPLLARIAENRKTVVCPII 120

Query: 321 DIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAI 380
           D+I   TF Y  S     G F+W LHF+W      +   +R+  T P ++P MAGGLFAI
Sbjct: 121 DVIDADTFEYRGSSGDARGGFDWSLHFKWLPLPEEE--RRRESPTAPIRSPTMAGGLFAI 178

Query: 381 DRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR-KASPYSFPGG 439
           DR +F  +G YDE M +WGGENLE+SF+VWQCGGSIEI PCS V H+FR K  PY+FPGG
Sbjct: 179 DREWFLELGGYDEGMDIWGGENLELSFKVWQCGGSIEIVPCSRVGHIFRRKRKPYTFPGG 238

Query: 440 VSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLT 499
              VL  N  RVA VWMDE+ E+++K  PE     D   +  R  LR++LKC SFKWYL 
Sbjct: 239 SGTVLR-NYKRVAEVWMDEYKEYFYKARPELRNI-DYGDLSERKALRERLKCKSFKWYLE 296

Query: 500 HVW 502
           +V+
Sbjct: 297 NVY 299



 Score =  158 bits (403), Expect = 4e-44
 Identities = 55/97 (56%), Positives = 71/97 (73%)

Query: 93  SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
           S++I+FHNEA S LLRTVHSVI+R+P  +LKEI+LVDD S +  LK  L+EY  K     
Sbjct: 1   SVIIIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKV 60

Query: 153 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
           +V+R   R GLI+AR+ GAR A G++LVFLD+HCE  
Sbjct: 61  KVLRLKKREGLIRARIAGARAATGDVLVFLDSHCEVN 97


>gnl|CDD|215980 pfam00535, Glycos_transf_2, Glycosyl transferase family 2.  Diverse
           family, transferring sugar from UDP-glucose,
           UDP-N-acetyl- galactosamine, GDP-mannose or
           CDP-abequose, to a range of substrates including
           cellulose, dolichol phosphate and teichoic acids.
          Length = 168

 Score =  107 bits (270), Expect = 3e-27
 Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 17/185 (9%)

Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 260
           S++I  +NE    L  T+ S+++++ +    EI++VDD ST        +EY AK     
Sbjct: 1   SVIIPTYNEE-KYLEETLESLLNQTYK--NFEIIVVDDGSTDG-TVEIAEEY-AKNDPRV 55

Query: 261 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVI 320
           RVIR    +G   AR  G + A G+ + FLDA  E    WLE LV  + ++   +V    
Sbjct: 56  RVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVAPDWLEKLVELLEKNGADIVIGSR 115

Query: 321 DIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAI 380
            +I+  T  Y R+          +L  R                   F   + A     +
Sbjct: 116 VVINGETRLYGRALRFELLLLLGKLGARSL------------GLKVLFLIGSNALYRREV 163

Query: 381 DRAYF 385
                
Sbjct: 164 LEELL 168



 Score = 81.0 bits (200), Expect = 7e-18
 Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 9/107 (8%)

Query: 93  SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
           S++I  +NE    L  T+ S+++++ +    EI++VDD ST        +EY AK     
Sbjct: 1   SVIIPTYNEE-KYLEETLESLLNQTYK--NFEIIVVDDGSTDG-TVEIAEEY-AKNDPRV 55

Query: 153 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
           RVIR    +G   AR  G + A G+ + FLDA  E       ++L K
Sbjct: 56  RVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVA----PDWLEK 98


>gnl|CDD|132997 cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A)
           includes diverse families of glycosyl transferases with
           a common GT-A type structural fold.
           Glycosyltransferases (GTs) are enzymes that synthesize
           oligosaccharides, polysaccharides, and glycoconjugates
           by transferring the sugar moiety from an activated
           nucleotide-sugar donor to an acceptor molecule, which
           may be a growing oligosaccharide, a lipid, or a protein.
            Based on the stereochemistry of the donor and acceptor
           molecules, GTs are classified as either retaining or
           inverting enzymes. To date, all GT structures adopt one
           of two possible folds, termed GT-A fold and GT-B fold.
           This hierarchy includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. The majority of the proteins in this
           superfamily are Glycosyltransferase family 2 (GT-2)
           proteins. But it also includes families GT-43, GT-6,
           GT-8, GT13 and GT-7; which are evolutionarily related to
           GT-2 and share structure similarities.
          Length = 156

 Score = 69.5 bits (170), Expect = 5e-14
 Identities = 39/116 (33%), Positives = 59/116 (50%), Gaps = 5/116 (4%)

Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 261
           ++I  +NE    L R + S+++++  +   E+++VDD ST   L+  L+EY  K     R
Sbjct: 1   VIIPAYNEE-PYLERCLESLLAQTYPNF--EVIVVDDGSTDGTLEI-LEEYAKKDPRVIR 56

Query: 262 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVC 317
           VI    + GL  AR  G + A GE ++FLDA       WLE LVA +  D      
Sbjct: 57  VINEENQ-GLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAV 111



 Score = 57.9 bits (140), Expect = 5e-10
 Identities = 43/150 (28%), Positives = 66/150 (44%), Gaps = 21/150 (14%)

Query: 94  IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 153
           ++I  +NE    L R + S+++++  +   E+++VDD ST   L+  L+EY  K     R
Sbjct: 1   VIIPAYNEE-PYLERCLESLLAQTYPNF--EVIVVDDGSTDGTLEI-LEEYAKKDPRVIR 56

Query: 154 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT----LVFNEEFLPKSSIVIVFHNE 209
           VI    + GL  AR  G + A GE ++FLDA              E L       V    
Sbjct: 57  VINEENQ-GLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAVGGP- 114

Query: 210 AWSALLRTVHSVISRSPRSMLKEILLVDDA 239
             + L R          R +L+EI   D+A
Sbjct: 115 -GNLLFR----------RELLEEIGGFDEA 133


>gnl|CDD|234419 TIGR03965, mycofact_glyco, mycofactocin system glycosyltransferase.
            Members of this protein family are putative
           glycosyltransferases, members of pfam00535 (glycosyl
           transferase family 2). Members appear mostly in the
           Actinobacteria, where they appear to be part of a system
           for converting a precursor peptide (TIGR03969) into a
           novel redox carrier designated mycofactocin. A radical
           SAM enzyme, TIGR03962, is a proposed to be a key
           maturase for mycofactocin.
          Length = 467

 Score = 67.9 bits (166), Expect = 6e-12
 Identities = 77/296 (26%), Positives = 119/296 (40%), Gaps = 50/296 (16%)

Query: 151 PTRVIRSPGRVGLIKARLLGARQAEGEIL------VFLDAHCECTLVFNEEFLPKS---- 200
           P R++R    +    ARLLG  +  G  +        L    +  +V     L  S    
Sbjct: 21  PLRLLR----LAPAAARLLGGGRGGGLEVRDETSAALLRRLLDSGVVHPRPALLPSPPSV 76

Query: 201 SIVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP 259
           ++V+   N   + L R + ++++   PR  L E+++VDD S                 +P
Sbjct: 77  TVVVPVRNRP-AGLARLLAALLALDYPRDRL-EVIVVDDGSEDP----VPTRAARGARLP 130

Query: 260 TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA-----RVAEDRTR 314
            RVIR P R G   AR  GAR A  E + F D+      GWL  L+A      VA    R
Sbjct: 131 VRVIRHPRRQGPAAARNAGARAARTEFVAFTDSDVVPRPGWLRALLAHFDDPGVALVAPR 190

Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRW-YTYGSSDAIIKRKDFTEPFKTPAM 373
           VV    +      +  VRS  L  G     +  R   +Y  S A++ R            
Sbjct: 191 VVALPAEDTRLARYEAVRS-SLDLGPEEAVVRPRGPVSYVPSAALLVR------------ 237

Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
                   R     +G +DE ++V  GE++++ +R+ + GG +   P + VAH  R
Sbjct: 238 --------RRALLEVGGFDERLEV--GEDVDLCWRLCEAGGRVRYEPAAVVAHDHR 283



 Score = 49.8 bits (119), Expect = 3e-06
 Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 7/100 (7%)

Query: 89  LPKSSIVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDASTREFLKSSLDEYVAK 147
            P  ++V+   N   + L R + ++++   PR  L E+++VDD S               
Sbjct: 73  PPSVTVVVPVRNRP-AGLARLLAALLALDYPRDRL-EVIVVDDGSEDP----VPTRAARG 126

Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
             +P RVIR P R G   AR  GAR A  E + F D+   
Sbjct: 127 ARLPVRVIRHPRRQGPAAARNAGARAARTEFVAFTDSDVV 166


>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 278

 Score = 64.7 bits (158), Expect = 2e-11
 Identities = 49/246 (19%), Positives = 84/246 (34%), Gaps = 30/246 (12%)

Query: 200 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLK---EILLVDDASTREF---LKSSLDE-- 251
           S I+ V+ +     +L+ + S ++R    +     EI++VD  S   F   LK  + +  
Sbjct: 1   SVIIPVYLSLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSFAKELKKIIAKNG 60

Query: 252 --YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
                       +  SP      KAR  GA  +  + + FLD  C  +   LE ++    
Sbjct: 61  AINYISHKTHKDLF-SPA-----KARNRGAEYSSSDFIFFLDVDCLISPDTLEKIIKHFQ 114

Query: 310 EDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFK 369
           E +T     +      ++      F          L    Y        I     T    
Sbjct: 115 ELQTNPNAFLALPCLYLSKEGSEIF----------LSDFKYL---LREEILEDAITGKST 161

Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
             A+A     I+R +F  IG +DE  +  GGE+ E+ +R+              + +   
Sbjct: 162 FFALASSCILINRDFFLKIGGFDENFRGHGGEDFELLYRLLLYYKKFPPPK-DLLTYDEY 220

Query: 430 KASPYS 435
           K     
Sbjct: 221 KWPITY 226



 Score = 37.4 bits (87), Expect = 0.014
 Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 16/116 (13%)

Query: 92  SSIVIVFHNEAWSALLRTVHSVISRSPRSMLK---EILLVDDASTREF---LKSSLDE-- 143
           S I+ V+ +     +L+ + S ++R    +     EI++VD  S   F   LK  + +  
Sbjct: 1   SVIIPVYLSLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSFAKELKKIIAKNG 60

Query: 144 --YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
                       +  SP      KAR  GA  +  + + FLD  C  +    E+ +
Sbjct: 61  AINYISHKTHKDLF-SPA-----KARNRGAEYSSSDFIFFLDVDCLISPDTLEKII 110


>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
           biosynthesis of succinoglycan.  Succinoglycan
           Biosynthesis Protein ExoA catalyzes the formation of a
           beta-1,3 linkage of the second sugar (glucose) of the
           succinoglycan with the galactose on the lipid carrie.
           Succinoglycan is an acidic exopolysaccharide that is
           important for invasion of the nodules. Succinoglycan is
           a high-molecular-weight polymer composed of repeating
           octasaccharide units. These units are synthesized on
           membrane-bound isoprenoid lipid carriers, beginning with
           galactose followed by seven glucose molecules, and
           modified by the addition of acetate, succinate, and
           pyruvate. ExoA is a membrane protein with a
           transmembrance domain at c-terminus.
          Length = 249

 Score = 62.2 bits (152), Expect = 8e-11
 Identities = 47/220 (21%), Positives = 87/220 (39%), Gaps = 29/220 (13%)

Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLS 257
           SI+I   NE    +   + S++++S    L EI++VD  S   TRE ++       A   
Sbjct: 3   SIIIPVRNEE-KYIEELLESLLNQSYPKDLIEIIVVDGGSTDGTREIVQE-----YAAKD 56

Query: 258 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVC 317
              R+I +P R+       +G R + G+I++ +DAH      ++  LV   A  RT    
Sbjct: 57  PRIRLIDNPKRI-QSAGLNIGIRNSRGDIIIRVDAHAVYPKDYILELVE--ALKRTGADN 113

Query: 318 PVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTY-GSSDAIIKRKDFTEPFKTPAMAGG 376
               + +                F   +     +  GS  +  +     +      +  G
Sbjct: 114 VGGPMETIGE-----------SKFQKAIAVAQSSPLGSGGSAYRGGAV-KIGYVDTVHHG 161

Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
            +   R  F  +G +DE +     E+ E+++R+ + G  I
Sbjct: 162 AY--RREVFEKVGGFDESLVR--NEDAELNYRLRKAGYKI 197



 Score = 51.8 bits (125), Expect = 3e-07
 Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 10/98 (10%)

Query: 93  SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLS 149
           SI+I   NE    +   + S++++S    L EI++VD  S   TRE ++       A   
Sbjct: 3   SIIIPVRNEE-KYIEELLESLLNQSYPKDLIEIIVVDGGSTDGTREIVQE-----YAAKD 56

Query: 150 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
              R+I +P R+       +G R + G+I++ +DAH  
Sbjct: 57  PRIRLIDNPKRI-QSAGLNIGIRNSRGDIIIRVDAHAV 93


>gnl|CDD|133035 cd04192, GT_2_like_e, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 229

 Score = 57.7 bits (140), Expect = 2e-09
 Identities = 47/208 (22%), Positives = 80/208 (38%), Gaps = 33/208 (15%)

Query: 202 IVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDAS---TREFLKSSLDE--YVAK 255
           +VI   NEA + L R + S+ +   P+    E++LVDD S   T + L+ +  +  +  K
Sbjct: 1   VVIAARNEAEN-LPRLLQSLSALDYPKEKF-EVILVDDHSTDGTVQILEFAAAKPNFQLK 58

Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
           +   +RV  S  +  L  A     + A+G+ +V  DA C     WL   VA + +++  +
Sbjct: 59  ILNNSRVSISGKKNALTTA----IKAAKGDWIVTTDADCVVPSNWLLTFVAFIQKEQIGL 114

Query: 316 VC-PVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
           V  PVI        A  +  +                 G                 P M 
Sbjct: 115 VAGPVIYFKGKSLLAKFQRLD------------WLSLLGLIAGSFG-------LGKPFMC 155

Query: 375 GGL-FAIDRAYFFHIGAYDEEMQVWGGE 401
            G   A  +  FF +G ++    +  G+
Sbjct: 156 NGANMAYRKEAFFEVGGFEGNDHIASGD 183



 Score = 41.9 bits (99), Expect = 4e-04
 Identities = 29/99 (29%), Positives = 49/99 (49%), Gaps = 12/99 (12%)

Query: 94  IVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDAS---TREFLKSSLDE--YVAK 147
           +VI   NEA + L R + S+ +   P+    E++LVDD S   T + L+ +  +  +  K
Sbjct: 1   VVIAARNEAEN-LPRLLQSLSALDYPKEKF-EVILVDDHSTDGTVQILEFAAAKPNFQLK 58

Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC 186
           +   +RV  S  +  L  A     + A+G+ +V  DA C
Sbjct: 59  ILNNSRVSISGKKNALTTA----IKAAKGDWIVTTDADC 93


>gnl|CDD|133061 cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose
           synthase (CESA) superfamily.  This is a subfamily of
           cellulose synthase (CESA) superfamily.  CESA superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains.  The
           members of the superfamily include cellulose synthase
           catalytic subunit, chitin synthase, glucan biosynthesis
           protein and other families of CESA-like proteins.
          Length = 251

 Score = 56.4 bits (137), Expect = 7e-09
 Identities = 28/117 (23%), Positives = 49/117 (41%), Gaps = 19/117 (16%)

Query: 75  RKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST- 133
           R       + +  +LP  +I+I  +NE  + +   + ++++        EI++V D ST 
Sbjct: 14  RLRPKPPSLPDPAYLPTVTIIIPAYNEE-AVIEAKLENLLALDYPRDRLEIIVVSDGSTD 72

Query: 134 ------REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
                 RE+    +           +++R P R G   A       A GEI+VF DA
Sbjct: 73  GTAEIAREYADKGV-----------KLLRFPERRGKAAALNRALALATGEIVVFTDA 118



 Score = 55.3 bits (134), Expect = 2e-08
 Identities = 37/162 (22%), Positives = 63/162 (38%), Gaps = 21/162 (12%)

Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST------- 241
             + +  +LP  +I+I  +NE  + +   + ++++        EI++V D ST       
Sbjct: 20  PSLPDPAYLPTVTIIIPAYNEE-AVIEAKLENLLALDYPRDRLEIIVVSDGSTDGTAEIA 78

Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
           RE+    +           +++R P R G   A       A GEI+VF DA+       L
Sbjct: 79  REYADKGV-----------KLLRFPERRGKAAALNRALALATGEIVVFTDANALLDPDAL 127

Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNW 343
             LV   A+     V   + I+            L+W   NW
Sbjct: 128 RLLVRHFADPSVGAVSGELVIVDGGGSG--SGEGLYWKYENW 167


>gnl|CDD|133029 cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 166

 Score = 54.5 bits (132), Expect = 1e-08
 Identities = 42/227 (18%), Positives = 84/227 (37%), Gaps = 63/227 (27%)

Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 261
           I+IV +N +   L   + S+++++      E+++VD+AST   +     E + +L    R
Sbjct: 1   IIIVNYN-SLEYLKACLDSLLAQTYPD--FEVIVVDNASTDGSV-----ELLRELFPEVR 52

Query: 262 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED-RTRVVCPVI 320
           +IR+   +G       G R+A+G+ ++ L+       G L  L+    +D    +V P +
Sbjct: 53  LIRNGENLGFGAGNNQGIREAKGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPKV 112

Query: 321 DIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAI 380
                             GAF                                      +
Sbjct: 113 -----------------SGAF------------------------------------LLV 119

Query: 381 DRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
            R  F  +G +DE+  ++  E++++  R    G  +   P + + H 
Sbjct: 120 RREVFEEVGGFDEDFFLY-YEDVDLCLRARLAGYRVLYVPQAVIYHH 165



 Score = 44.9 bits (107), Expect = 2e-05
 Identities = 23/90 (25%), Positives = 48/90 (53%), Gaps = 8/90 (8%)

Query: 94  IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 153
           I+IV +N +   L   + S+++++      E+++VD+AST   +     E + +L    R
Sbjct: 1   IIIVNYN-SLEYLKACLDSLLAQTYPD--FEVIVVDNASTDGSV-----ELLRELFPEVR 52

Query: 154 VIRSPGRVGLIKARLLGARQAEGEILVFLD 183
           +IR+   +G       G R+A+G+ ++ L+
Sbjct: 53  LIRNGENLGFGAGNNQGIREAKGDYVLLLN 82


>gnl|CDD|133045 cd06423, CESA_like, CESA_like is  the cellulose synthase
           superfamily.  The cellulose synthase (CESA) superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains. The
           members include cellulose synthase catalytic subunit,
           chitin synthase, glucan biosynthesis protein and other
           families of CESA-like proteins. Cellulose synthase
           catalyzes the polymerization reaction of cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues in  plants, most algae, some bacteria
           and fungi, and even some animals. In bacteria, algae and
           lower eukaryotes, there is a second unrelated type of
           cellulose synthase (Type II), which produces acylated
           cellulose, a derivative of cellulose. Chitin synthase
           catalyzes the incorporation of GlcNAc from substrate
           UDP-GlcNAc into chitin, which is a linear homopolymer of
           beta-(1,4)-linked GlcNAc residues and Glucan
           Biosynthesis protein catalyzes the elongation of
           beta-1,2 polyglucose chains of Glucan.
          Length = 180

 Score = 53.8 bits (130), Expect = 2e-08
 Identities = 46/204 (22%), Positives = 75/204 (36%), Gaps = 38/204 (18%)

Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLK------EILLVDDASTREFLKSSLDEYVAK 255
           I++  +NE          +VI R+  S+L       E+++VDD ST + L+  L+E  A 
Sbjct: 1   IIVPAYNEE---------AVIERTIESLLALDYPKLEVIVVDDGSTDDTLEI-LEELAAL 50

Query: 256 LSVPTRVIRSPGRVGLIKARLL--GARQAEGEILVFLDAHCECTLGWLENLVARVAED-R 312
                 V+R     G  KA  L  G R A+G+I+V LDA        L+ LV     D +
Sbjct: 51  YIRRVLVVRDKENGG--KAGALNAGLRHAKGDIVVVLDADTILEPDALKRLVVPFFADPK 108

Query: 313 TRVVCPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
              V   + + +       R   + +   F      +                       
Sbjct: 109 VGAVQGRVRVRNGSENLLTRLQAIEYLSIFRLGRRAQSAL----------------GGVL 152

Query: 372 AMAGGLFAIDRAYFFHIGAYDEEM 395
            ++G   A  R     +G +DE+ 
Sbjct: 153 VLSGAFGAFRREALREVGGWDEDT 176



 Score = 46.8 bits (112), Expect = 5e-06
 Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 20/99 (20%)

Query: 94  IVIVFHNEAWSALLRTVHSVISRSPRSMLK------EILLVDDASTREFLKSSLDEYVAK 147
           I++  +NE          +VI R+  S+L       E+++VDD ST + L+  L+E  A 
Sbjct: 1   IIVPAYNEE---------AVIERTIESLLALDYPKLEVIVVDDGSTDDTLEI-LEELAAL 50

Query: 148 LSVPTRVIRSPGRVGLIKARLL--GARQAEGEILVFLDA 184
                 V+R     G  KA  L  G R A+G+I+V LDA
Sbjct: 51  YIRRVLVVRDKENGG--KAGALNAGLRHAKGDIVVVLDA 87


>gnl|CDD|224137 COG1216, COG1216, Predicted glycosyltransferases [General function
           prediction only].
          Length = 305

 Score = 50.9 bits (122), Expect = 7e-07
 Identities = 57/277 (20%), Positives = 105/277 (37%), Gaps = 24/277 (8%)

Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 257
           PK SI+IV +N     L+  + S+ +++       I++VD+ ST      SL+   A+  
Sbjct: 3   PKISIIIVTYNRG-EDLVECLASLAAQTYPD--DVIVVVDNGSTDG----SLEALKARFF 55

Query: 258 VPTRVIRSPGRVGLIKARLLGARQAEGE---ILVFLDAHCECTLGWLENLVARVAEDRTR 314
              R+I +   +G       G + A  +    ++ L+         LE L+    ED   
Sbjct: 56  PNVRLIENGENLGFAGGFNRGIKYALAKGDDYVLLLNPDTVVEPDLLEELLKAAEEDPAA 115

Query: 315 -VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
            VV P+I    +  +   R      G  +  L   W      +       + E     ++
Sbjct: 116 GVVGPLIRNYDESLYIDRR------GGESDGLTGGWRASPLLEIAPDLSSYLEVV--ASL 167

Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
           +G    I R  F  +G +DE   ++  E++++  R  + G  I   P + + H    +S 
Sbjct: 168 SGACLLIRREAFEKVGGFDERFFIYY-EDVDLCLRARKAGYKIYYVPDAIIYH-KIGSSK 225

Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA 470
            S   G     Y +L R  + +     +     +P  
Sbjct: 226 GSSILG---TFYYHLRRNRIYFAKLNEKGLLLRSPLR 259


>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
           wall biogenesis [Cell envelope biogenesis, outer
           membrane].
          Length = 439

 Score = 50.3 bits (120), Expect = 2e-06
 Identities = 50/238 (21%), Positives = 92/238 (38%), Gaps = 26/238 (10%)

Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSS 248
             ++ LPK S++I  +NE    L  T+ S++S+       E+++VDD S   T E L+  
Sbjct: 48  DADKLLPKVSVIIPAYNEEPEVLEETLESLLSQDYPR--YEVIVVDDGSTDETYEILEEL 105

Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
             EY     V   +       G   A   G ++A+G+++V LDA        L  LV+  
Sbjct: 106 GAEYGPNFRV---IYPEKKNGGKAGALNNGLKRAKGDVVVILDADTVPEPDALRELVSPF 162

Query: 309 AEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
            +     V     I +    +  +   +     +    +FR     S   +I        
Sbjct: 163 EDPPVGAVVGTPRIRNRPDPSNLLGRIQAIE--YLSAFYFRL-RAASKGGLI-------- 211

Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
                ++G   A  R+    +G + E+      E+ +++ R+   G  +   P + V 
Sbjct: 212 ---SFLSGSSSAFRRSALEEVGGWLEDT---ITEDADLTLRLHLRGYRVVYVPEAIVW 263



 Score = 45.7 bits (108), Expect = 6e-05
 Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 9/118 (7%)

Query: 70  TLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD 129
            LP  R  K K     ++ LPK S++I  +NE    L  T+ S++S+       E+++VD
Sbjct: 35  VLPLSRPRK-KLPKDADKLLPKVSVIIPAYNEEPEVLEETLESLLSQDYPR--YEVIVVD 91

Query: 130 DAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
           D S   T E L+    EY     V   +       G   A   G ++A+G+++V LDA
Sbjct: 92  DGSTDETYEILEELGAEYGPNFRV---IYPEKKNGGKAGALNNGLKRAKGDVVVILDA 146


>gnl|CDD|223539 COG0463, WcaA, Glycosyltransferases involved in cell wall
           biogenesis [Cell envelope biogenesis, outer membrane].
          Length = 291

 Score = 49.7 bits (117), Expect = 2e-06
 Identities = 52/289 (17%), Positives = 91/289 (31%), Gaps = 23/289 (7%)

Query: 88  FLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEY 144
            +PK S+VI  +NE    L   + S+++++ +    EI++VDD S   T E       + 
Sbjct: 1   MMPKVSVVIPTYNEE-EYLPEALESLLNQTYK--DFEIIVVDDGSTDGTTEIAIEYGAKD 57

Query: 145 VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC--ECTLVFNEEFLPKSSI 202
           V         + +    GL  AR  G   A G+ +VFLDA       L+           
Sbjct: 58  V-----RVIRLINERNGGLGAARNAGLEYARGDYIVFLDADDQHPPELIPLVAAGGDGDY 112

Query: 203 VIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRV 262
           +    +     L R + S + +   + L  +L+ D       L     + +         
Sbjct: 113 IARLDDRDDIWLPRKLLSKLVKLLGNRLLGVLIPDGFGDLRLLVRDAVDGLRAFLEGRSR 172

Query: 263 IRSPGRVGLIK------ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
                   L+         LL  R       V L       L  L++ +  V  D   ++
Sbjct: 173 FLRLLLRKLVLIRREVLEYLLLYRLLGASKRVLLGKLLRLGLLLLDSKLKAVPLDLLLLL 232

Query: 317 CPVIDIISDV----TFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
              I ++S +         R    +       L            +   
Sbjct: 233 GLFIALLSILLLLLLLLLRRLLGANLSEKGKPLLLIKLLILLKLILRLL 281


>gnl|CDD|133022 cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member
           of the Glycosyltransferase 2 superfamily.  DPM1 is the
           catalytic subunit of eukaryotic dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. In higher eukaryotes,the enzyme has three
           subunits, DPM1, DPM2 and DPM3. DPM is synthesized from
           dolichol phosphate and GDP-Man on the cytosolic surface
           of the ER membrane by DPM synthase and then is flipped
           onto the luminal side and used as a donor substrate. In
           lower eukaryotes, such as Saccharomyces cerevisiae and
           Trypanosoma brucei, DPM synthase consists of a single
           component (Dpm1p and TbDpm1, respectively) that
           possesses one predicted transmembrane region near the C
           terminus for anchoring to the ER membrane. In contrast,
           the Dpm1 homologues of higher eukaryotes, namely fission
           yeast, fungi, and animals, have no transmembrane region,
           suggesting the existence of adapter molecules for
           membrane anchoring. This family also includes bacteria
           and archaea DPM1_like enzymes. However, the enzyme
           structure and mechanism of function are not well
           understood. The UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate. This protein family belongs to
           Glycosyltransferase 2 superfamily.
          Length = 185

 Score = 48.0 bits (115), Expect = 2e-06
 Identities = 26/94 (27%), Positives = 39/94 (41%), Gaps = 9/94 (9%)

Query: 94  IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLSV 150
           +VI  +NE    +   V  +++        EI++VDD S   T E  +            
Sbjct: 1   VVIPAYNEE-ENIPELVERLLAVLEEGYDYEIIVVDDGSTDGTAEIARELAARVPRV--- 56

Query: 151 PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
             RVIR     G   A   G + A G+I+V +DA
Sbjct: 57  --RVIRLSRNFGKGAAVRAGFKAARGDIVVTMDA 88



 Score = 48.0 bits (115), Expect = 2e-06
 Identities = 26/94 (27%), Positives = 39/94 (41%), Gaps = 9/94 (9%)

Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLSV 258
           +VI  +NE    +   V  +++        EI++VDD S   T E  +            
Sbjct: 1   VVIPAYNEE-ENIPELVERLLAVLEEGYDYEIIVVDDGSTDGTAEIARELAARVPRV--- 56

Query: 259 PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
             RVIR     G   A   G + A G+I+V +DA
Sbjct: 57  --RVIRLSRNFGKGAAVRAGFKAARGDIVVTMDA 88


>gnl|CDD|133013 cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase
           family-2 subfamily with unknown function.
           Glycosyltransferase family 2 (GT-2) subfamily of unknown
           function. GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 221

 Score = 45.6 bits (109), Expect = 2e-05
 Identities = 57/216 (26%), Positives = 76/216 (35%), Gaps = 57/216 (26%)

Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL-SVP 259
           SI+I   NEA   L R + S+   +P     EI++VD  ST        D  VA   S  
Sbjct: 2   SIIIPTLNEA-ENLPRLLASLRRLNPLP--LEIIVVDGGST--------DGTVAIARSAG 50

Query: 260 TRVIRS-PGRVGLIKARLL--GARQAEGEILVFLDAHCECTL--GWLENLVARVAEDRTR 314
             VI S  GR     AR +  GA  A G+ L+FL  H +  L   W   ++  +  D   
Sbjct: 51  VVVISSPKGR-----ARQMNAGAAAARGDWLLFL--HADTRLPPDWDAAIIETLRADGAV 103

Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
                +               L  GA    L  R                   F  P   
Sbjct: 104 AGAFRLRFDDPGPRL----RLLELGAN---LRSRL------------------FGLPYGD 138

Query: 375 GGLFAIDRAYFFHIGAYDEE--MQVWGGENLEMSFR 408
            GLF I R  F  +G + E   M     E++E+  R
Sbjct: 139 QGLF-IRRELFEELGGFPELPLM-----EDVELVRR 168



 Score = 38.3 bits (90), Expect = 0.006
 Identities = 36/102 (35%), Positives = 46/102 (45%), Gaps = 22/102 (21%)

Query: 93  SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL-SVP 151
           SI+I   NEA   L R + S+   +P     EI++VD  ST        D  VA   S  
Sbjct: 2   SIIIPTLNEA-ENLPRLLASLRRLNPLP--LEIIVVDGGST--------DGTVAIARSAG 50

Query: 152 TRVIRS-PGRVGLIKARLL--GARQAEGEILVFLDAHCECTL 190
             VI S  GR     AR +  GA  A G+ L+FL  H +  L
Sbjct: 51  VVVISSPKGR-----ARQMNAGAAAARGDWLLFL--HADTRL 85


>gnl|CDD|234532 TIGR04283, glyco_like_mftF, transferase 2,
           rSAM/selenodomain-associated.  This enzyme may transfer
           a nucleotide, or it sugar moiety, as part of a
           biosynthetic pathway. Other proposed members of the
           pathway include another transferase (TIGR04282), a
           phosphoesterase, and a radical SAM enzyme (TIGR04167)
           whose C-terminal domain (pfam12345) frequently contains
           a selenocysteine [Unknown function, Enzymes of unknown
           specificity].
          Length = 220

 Score = 44.4 bits (106), Expect = 6e-05
 Identities = 37/127 (29%), Positives = 55/127 (43%), Gaps = 33/127 (25%)

Query: 201 SIVIVFHNEA--WSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAK 255
           SI+I   NEA     LL  + ++   +      E+++VD  S   T E  +S        
Sbjct: 2   SIIIPVLNEAATLPELLADLQALPGDA------EVIVVDGGSTDGTVEIARSLGA----- 50

Query: 256 LSVPTRVIRSP-GRVGLIKARLL--GARQAEGEILVFLDAHCECTL--GWLENLVARVAE 310
                +VI SP GR     AR +  GA  A+G+IL+FL  H +  L   +LE +   +A 
Sbjct: 51  -----KVIHSPKGR-----ARQMNAGAALAKGDILLFL--HADTRLPKDFLEAIRRALAP 98

Query: 311 DRTRVVC 317
                 C
Sbjct: 99  PGAVAGC 105



 Score = 43.3 bits (103), Expect = 1e-04
 Identities = 33/106 (31%), Positives = 48/106 (45%), Gaps = 31/106 (29%)

Query: 93  SIVIVFHNEA--WSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAK 147
           SI+I   NEA     LL  + ++   +      E+++VD  S   T E  +S        
Sbjct: 2   SIIIPVLNEAATLPELLADLQALPGDA------EVIVVDGGSTDGTVEIARSLGA----- 50

Query: 148 LSVPTRVIRSP-GRVGLIKARLL--GARQAEGEILVFLDAHCECTL 190
                +VI SP GR     AR +  GA  A+G+IL+FL  H +  L
Sbjct: 51  -----KVIHSPKGR-----ARQMNAGAALAKGDILLFL--HADTRL 84


>gnl|CDD|133042 cd06420, GT2_Chondriotin_Pol_N, N-terminal domain of Chondroitin
           polymerase functions as a GalNAc transferase.
           Chondroitin polymerase is a two domain, bi-functional
           protein. The N-terminal domain functions as a GalNAc
           transferase. The bacterial chondroitin polymerase
           catalyzes elongation of the chondroitin chain by
           alternatively transferring the GlcUA and GalNAc moiety
           from UDP-GlcUA and UDP-GalNAc to the non-reducing ends
           of the chondroitin chain. The enzyme consists of
           N-terminal and C-terminal domains in which the two
           active sites catalyze the addition of GalNAc and GlcUA,
           respectively. Chondroitin chains range from 40 to over
           100 repeating units of the disaccharide. Sulfated
           chondroitins are involved in the regulation of various
           biological functions such as central nervous system
           development, wound repair, infection, growth factor
           signaling, and morphogenesis, in addition to its
           conventional structural roles. In Caenorhabditis
           elegans, chondroitin is an essential factor for the worm
           to undergo cytokinesis and cell division. Chondroitin is
           synthesized as proteoglycans, sulfated and secreted to
           the cell surface or extracellular matrix.
          Length = 182

 Score = 43.7 bits (104), Expect = 7e-05
 Identities = 45/221 (20%), Positives = 78/221 (35%), Gaps = 62/221 (28%)

Query: 202 IVIVFHNEAWS-ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 260
           ++I  +N     AL   + SV+++S   +  E+++ DD ST E  K  ++E+ ++  +P 
Sbjct: 1   LIITTYN--RPEALELVLKSVLNQS--ILPFEVIIADDGSTEE-TKELIEEFKSQFPIPI 55

Query: 261 RVIRSPGRVG--LIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCP 318
           + +      G    K R      A+G+ L+F+D  C                       P
Sbjct: 56  KHVWQED-EGFRKAKIRNKAIAAAKGDYLIFIDGDC----------------------IP 92

Query: 319 VIDIISDVTFAYVRSFELHWGAFNWELHFR-----WYTYGSSDAIIKRKDFTEPFKTPAM 373
             D I+D                    H        +  GS      R    E      +
Sbjct: 93  HPDFIAD--------------------HIELAEPGVFLSGS------RVLLNEKLTERGI 126

Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGG 414
            G   +  +     +  +DEE   WGGE+ E+  R+   G 
Sbjct: 127 RGCNMSFWKKDLLAVNGFDEEFTGWGGEDSELVARLLNSGI 167



 Score = 35.6 bits (83), Expect = 0.034
 Identities = 25/96 (26%), Positives = 48/96 (50%), Gaps = 9/96 (9%)

Query: 94  IVIVFHNEAWS-ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
           ++I  +N     AL   + SV+++S   +  E+++ DD ST E  K  ++E+ ++  +P 
Sbjct: 1   LIITTYN--RPEALELVLKSVLNQS--ILPFEVIIADDGSTEE-TKELIEEFKSQFPIPI 55

Query: 153 RVIRSPGRVG--LIKARLLGARQAEGEILVFLDAHC 186
           + +      G    K R      A+G+ L+F+D  C
Sbjct: 56  KHVWQED-EGFRKAKIRNKAIAAAKGDYLIFIDGDC 90


>gnl|CDD|217196 pfam02709, Glyco_transf_7C, N-terminal domain of
           galactosyltransferase.  This is the N-terminal domain of
           a family of galactosyltransferases from a wide range of
           Metazoa with three related galactosyltransferases
           activities, all three of which are possessed by one
           sequence in some cases. EC:2.4.1.90, N-acetyllactosamine
           synthase; EC:2.4.1.38,
           Beta-N-acetylglucosaminyl-glycopeptide beta-1,4-
           galactosyltransferase; and EC:2.4.1.22 Lactose synthase.
           Note that N-acetyllactosamine synthase is a component of
           Lactose synthase along with alpha-lactalbumin, in the
           absence of alpha-lactalbumin EC:2.4.1.90 is the
           catalyzed reaction.
          Length = 78

 Score = 39.5 bits (93), Expect = 3e-04
 Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH-VAHLFR 429
            GG+ A  +  F  +  +      WGGE+ ++  R+   G  IE    +    H+ R
Sbjct: 20  FGGVLAFSKEDFLKVNGFSNNFWGWGGEDDDLYARLLLAGLKIERPKFAIGRYHMLR 76


>gnl|CDD|133027 cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins
           are required for O-antigen biosynthesis.  The rfbC gene
           encodes a predicted protein of 1,276 amino acids, which
           is required for O-antigen biosynthesis in Myxococcus
           xanthus. It is a subfamily of Glycosyltransferase Family
           GT2, which includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. These are enzymes that catalyze the
           transfer of sugar moieties from activated donor
           molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 202

 Score = 41.4 bits (98), Expect = 5e-04
 Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 3/94 (3%)

Query: 90  PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 149
           P  SIV+  +N     L   + SV +++  +   E+ + DDAST   +K  L +Y A+  
Sbjct: 1   PLISIVMPVYNTPEKYLREAIESVRAQTYPNW--ELCIADDASTDPEVKRVLKKYAAQDP 58

Query: 150 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLD 183
              +V+      G+  A       A GE +  LD
Sbjct: 59  R-IKVVFREENGGISAATNSALELATGEFVALLD 91



 Score = 41.4 bits (98), Expect = 5e-04
 Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 3/94 (3%)

Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 257
           P  SIV+  +N     L   + SV +++  +   E+ + DDAST   +K  L +Y A+  
Sbjct: 1   PLISIVMPVYNTPEKYLREAIESVRAQTYPNW--ELCIADDASTDPEVKRVLKKYAAQDP 58

Query: 258 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLD 291
              +V+      G+  A       A GE +  LD
Sbjct: 59  R-IKVVFREENGGISAATNSALELATGEFVALLD 91


>gnl|CDD|222281 pfam13641, Glyco_tranf_2_3, Glycosyltransferase like family 2.
           Members of this family of prokaryotic proteins include
           putative glucosyltransferase, which are involved in
           bacterial capsule biosynthesis.
          Length = 229

 Score = 39.6 bits (93), Expect = 0.002
 Identities = 53/239 (22%), Positives = 80/239 (33%), Gaps = 35/239 (14%)

Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD--DASTREFLKSSLDEYVAK 255
           P+ ++V+   NE    L RT+ S I       L+ I++VD  D  T +       E  A 
Sbjct: 1   PRVAVVVPTWNED-KVLGRTLRS-ILAQDYPRLEVIVVVDGSDDETLDVA----RELAAA 54

Query: 256 LS-VPTRVIRSPGRVGLI-KARLL--GARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
              V   V+R P   G   KAR L    R  + +++V LDA               V  D
Sbjct: 55  YPDVRVHVVRRPRPPGPTGKARALNEALRAIKSDLVVLLDADS------------VVDPD 102

Query: 312 RTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDA--IIKRKDFTEPFK 369
             R + P       V       F L+              Y    A   ++         
Sbjct: 103 TLRRLLPFFLS-KGVGAVQGPVFVLNLR------TAVAPLYALEFALRHLRFMALRRALG 155

Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
              +AG      R+    IG +D    +   E+ E+  R+ + G      P + V  L 
Sbjct: 156 VAPLAGSGSLFRRSVLEEIGGFDPGFLLG--EDKELGLRLRRAGWRTAYVPGAAVYELS 212



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 31/101 (30%), Positives = 45/101 (44%), Gaps = 12/101 (11%)

Query: 90  PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD--DASTREFLKSSLDEYVAK 147
           P+ ++V+   NE    L RT+ S I       L+ I++VD  D  T +       E  A 
Sbjct: 1   PRVAVVVPTWNED-KVLGRTLRS-ILAQDYPRLEVIVVVDGSDDETLDVA----RELAAA 54

Query: 148 LS-VPTRVIRSPGRVGLI-KARLL--GARQAEGEILVFLDA 184
              V   V+R P   G   KAR L    R  + +++V LDA
Sbjct: 55  YPDVRVHVVRRPRPPGPTGKARALNEALRAIKSDLVVLLDA 95


>gnl|CDD|133031 cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked
           glycosylation.  UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate.
          Length = 211

 Score = 38.3 bits (90), Expect = 0.005
 Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 7/64 (10%)

Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
           EI++VDD S   T E  +    +   K     RV+  P   G   A   G   A G+ ++
Sbjct: 32  EIIVVDDGSKDGTAEVAR----KLARKNPALIRVLTLPKNRGKGGAVRAGMLAARGDYIL 87

Query: 181 FLDA 184
           F DA
Sbjct: 88  FADA 91



 Score = 38.3 bits (90), Expect = 0.005
 Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 7/64 (10%)

Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
           EI++VDD S   T E  +    +   K     RV+  P   G   A   G   A G+ ++
Sbjct: 32  EIIVVDDGSKDGTAEVAR----KLARKNPALIRVLTLPKNRGKGGAVRAGMLAARGDYIL 87

Query: 289 FLDA 292
           F DA
Sbjct: 88  FADA 91


>gnl|CDD|214672 smart00458, RICIN, Ricin-type beta-trefoil.  Carbohydrate-binding
           domain formed from presumed gene triplication.
          Length = 118

 Score = 36.3 bits (84), Expect = 0.009
 Identities = 19/87 (21%), Positives = 29/87 (33%), Gaps = 9/87 (10%)

Query: 515 GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
           G IR   T  C+                +L  C       Q + +     +   D   CL
Sbjct: 39  GAIRIKDTDLCLTAN------GNTGSTVTLYSCDGTN-DNQYWEVNKDGTIRNPDSGKCL 91

Query: 575 DVPEYENDISPRVRILACSGFNRQRWT 601
           DV   + +   +V +  CSG   Q+W 
Sbjct: 92  DV--KDGNTGTKVILWTCSGNPNQKWI 116


>gnl|CDD|133062 cd06442, DPM1_like, DPM1_like represents putative enzymes similar
           to eukaryotic DPM1.  Proteins similar to eukaryotic
           DPM1, including enzymes from bacteria and archaea; DPM1
           is the catalytic subunit of eukaryotic
           dolichol-phosphate mannose (DPM) synthase. DPM synthase
           is required for synthesis of the
           glycosylphosphatidylinositol (GPI) anchor, N-glycan
           precursor, protein O-mannose, and C-mannose. In higher
           eukaryotes,the enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. In lower eukaryotes, such
           as Saccharomyces cerevisiae and Trypanosoma brucei, DPM
           synthase consists of a single component (Dpm1p and
           TbDpm1, respectively) that possesses one predicted
           transmembrane region near the C terminus for anchoring
           to the ER membrane. In contrast, the Dpm1 homologues of
           higher eukaryotes, namely fission yeast, fungi, and
           animals, have no transmembrane region, suggesting the
           existence of adapter molecules for membrane anchoring.
           This family also includes bacteria and archaea DPM1_like
           enzymes. However, the enzyme structure and mechanism of
           function are not well understood. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 224

 Score = 37.5 bits (88), Expect = 0.012
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 8/64 (12%)

Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
           EI++VDD S   T E ++      +AK     R+I  PG+ GL  A + G + A G+++V
Sbjct: 29  EIIVVDDNSPDGTAEIVRE-----LAKEYPRVRLIVRPGKRGLGSAYIEGFKAARGDVIV 83

Query: 181 FLDA 184
            +DA
Sbjct: 84  VMDA 87



 Score = 37.5 bits (88), Expect = 0.012
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 8/64 (12%)

Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
           EI++VDD S   T E ++      +AK     R+I  PG+ GL  A + G + A G+++V
Sbjct: 29  EIIVVDDNSPDGTAEIVRE-----LAKEYPRVRLIVRPGKRGLGSAYIEGFKAARGDVIV 83

Query: 289 FLDA 292
            +DA
Sbjct: 84  VMDA 87


>gnl|CDD|133030 cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to
           eukaryotic DPM1.  A family of  bacterial enzymes related
           to eukaryotic DPM1; Although the mechanism of eukaryotic
           enzyme is well studied, the mechanism of the  bacterial
           enzymes is not well understood. The eukaryotic DPM1 is
           the catalytic subunit of eukaryotic Dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. The enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 181

 Score = 35.5 bits (83), Expect = 0.035
 Identities = 26/96 (27%), Positives = 39/96 (40%), Gaps = 12/96 (12%)

Query: 94  IVIVFHNEAWS--ALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKL 148
           IV+  +NE  +   L   + +V+         EI+ VDD S   T E L+          
Sbjct: 1   IVVPVYNEEENLPELYERLKAVLESLGYDY--EIIFVDDGSTDRTLEILRELAARD---- 54

Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
               +VIR     G   A L G   A G+ ++ +DA
Sbjct: 55  -PRVKVIRLSRNFGQQAALLAGLDHARGDAVITMDA 89



 Score = 35.5 bits (83), Expect = 0.035
 Identities = 26/96 (27%), Positives = 39/96 (40%), Gaps = 12/96 (12%)

Query: 202 IVIVFHNEAWS--ALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKL 256
           IV+  +NE  +   L   + +V+         EI+ VDD S   T E L+          
Sbjct: 1   IVVPVYNEEENLPELYERLKAVLESLGYDY--EIIFVDDGSTDRTLEILRELAARD---- 54

Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
               +VIR     G   A L G   A G+ ++ +DA
Sbjct: 55  -PRVKVIRLSRNFGQQAALLAGLDHARGDAVITMDA 89


>gnl|CDD|133039 cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 214

 Score = 33.4 bits (77), Expect = 0.22
 Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 7/63 (11%)

Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
           E+++ DD S   T E +K    EY+ K      +IR+   +G+ +      + A+G+ + 
Sbjct: 29  ELIISDDGSTDGTVEIIK----EYIDKDPFIIILIRNGKNLGVARNFESLLQAADGDYVF 84

Query: 181 FLD 183
           F D
Sbjct: 85  FCD 87



 Score = 33.4 bits (77), Expect = 0.22
 Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 7/63 (11%)

Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
           E+++ DD S   T E +K    EY+ K      +IR+   +G+ +      + A+G+ + 
Sbjct: 29  ELIISDDGSTDGTVEIIK----EYIDKDPFIIILIRNGKNLGVARNFESLLQAADGDYVF 84

Query: 289 FLD 291
           F D
Sbjct: 85  FCD 87


>gnl|CDD|215385 PLN02726, PLN02726, dolichyl-phosphate beta-D-mannosyltransferase.
          Length = 243

 Score = 32.0 bits (73), Expect = 0.69
 Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 6/64 (9%)

Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
           EI++VDD S   T++ +K     Y         +   PG++GL  A + G + A G+ +V
Sbjct: 42  EIIVVDDGSPDGTQDVVKQLQKVYGEDR---ILLRPRPGKLGLGTAYIHGLKHASGDFVV 98

Query: 181 FLDA 184
            +DA
Sbjct: 99  IMDA 102



 Score = 32.0 bits (73), Expect = 0.69
 Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 6/64 (9%)

Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
           EI++VDD S   T++ +K     Y         +   PG++GL  A + G + A G+ +V
Sbjct: 42  EIIVVDDGSPDGTQDVVKQLQKVYGEDR---ILLRPRPGKLGLGTAYIHGLKHASGDFVV 98

Query: 289 FLDA 292
            +DA
Sbjct: 99  IMDA 102


>gnl|CDD|133058 cd06436, GlcNAc-1-P_transferase, N-acetyl-glucosamine transferase
           is involved in the synthesis of
           Poly-beta-1,6-N-acetyl-D-glucosamine.
           N-acetyl-glucosamine transferase is responsible for the
           synthesis of bacteria
           Poly-beta-1,6-N-acetyl-D-glucosamine (PGA).
           Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer
           that serves as an adhesion for the maintenance of
           biofilm structural stability in diverse eubacteria.
           N-acetyl-glucosamine transferase is the product of gene
           pgaC. Genetic analysis indicated that all four genes of
           the pgaABCD locus were required for the PGA production,
           pgaC being a glycosyltransferase.
          Length = 191

 Score = 31.6 bits (72), Expect = 0.74
 Identities = 28/114 (24%), Positives = 49/114 (42%), Gaps = 23/114 (20%)

Query: 208 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 267
           NE  + + RT+ S + R+  + L  +L++DDAS         D+    +    R+  +  
Sbjct: 7   NEE-AVIQRTLAS-LLRNKPNFL--VLVIDDASD--------DDTAGIV----RLAITDS 50

Query: 268 RVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVID 321
           RV L++  L  AR  +G+ L             +  ++     D  RV+  VID
Sbjct: 51  RVHLLRRHLPNARTGKGDALNA-------AYDQIRQILIEEGADPERVIIAVID 97



 Score = 28.5 bits (64), Expect = 6.9
 Identities = 26/107 (24%), Positives = 47/107 (43%), Gaps = 17/107 (15%)

Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
           NE  + + RT+ S + R+  + L  +L++DDAS         D+    +    R+  +  
Sbjct: 7   NEE-AVIQRTLAS-LLRNKPNFL--VLVIDDASD--------DDTAGIV----RLAITDS 50

Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVF 206
           RV L++  L  AR  +G+ L          ++  E   P+  I+ V 
Sbjct: 51  RVHLLRRHLPNARTGKGDALNAAYDQI-RQILIEEGADPERVIIAVI 96


>gnl|CDD|212674 cd10232, ScSsz1p_like_NBD, Nucleotide-binding domain of
           Saccharmomyces cerevisiae Ssz1pp and similar proteins.
           Saccharomyces cerevisiae Ssz1p (also known as
           /Pdr13p/YHR064C) belongs to the heat shock protein 70
           (HSP70) family of chaperones that assist in protein
           folding and assembly and can direct incompetent "client"
           proteins towards degradation. Typically, HSP70s have a
           nucleotide-binding domain (NBD) and a substrate-binding
           domain (SBD). The nucleotide sits in a deep cleft formed
           between the two lobes of the NBD. The two subdomains of
           each lobe change conformation between ATP-bound,
           ADP-bound, and nucleotide-free states. ATP binding opens
           up the substrate-binding site; substrate-binding
           increases the rate of ATP hydrolysis. HSP70 chaperone
           activity is regulated by various co-chaperones: J-domain
           proteins and nucleotide exchange factors (NEFs). Some
           family members are not chaperones but rather, function
           as NEFs for their Hsp70 partners, while other family
           members function as both chaperones and NEFs. Ssz1 does
           not function as a chaperone; it facilitates the
           interaction between the HSP70 Ssb protein and its
           partner J-domain protein Zuo1 (also known as zuotin) on
           the ribosome. Ssz1 is found in a stable heterodimer
           (called RAC, ribosome associated complex) with Zuo1.
           Zuo1 can only stimulate the ATPase activity of Ssb, when
           it is in complex with Ssz1. Ssz1 binds ATP but neither
           nucleotide-binding, hydrolysis, or its SBD, is needed
           for its in vivo function.
          Length = 386

 Score = 32.4 bits (74), Expect = 0.76
 Identities = 29/122 (23%), Positives = 44/122 (36%), Gaps = 14/122 (11%)

Query: 221 VISRSPRSMLKEILLVDDASTREF--LKSSLDEYVAK------LSVPTRVIRSPGRVGLI 272
           V  +      + IL V + + R    LK + ++++ K      LSVPT          L+
Sbjct: 99  VQEKEEPVPKETILTVHEVTVRFLRRLKEAAEDFLGKKVAGAVLSVPTW-FSDEQTEALV 157

Query: 273 KARLLGARQAEGEILVFLDAHCECTLGWLENL-VARVAEDRTRVVCPVIDIISDVTFAYV 331
           KA    A  A   +L  +       L +         A DR  VV       +DV+   V
Sbjct: 158 KA----AEAAGLPVLQLIPEPAAALLAYDAGEPTEDEALDRNVVVADFGGTRTDVSVIAV 213

Query: 332 RS 333
           R 
Sbjct: 214 RG 215


>gnl|CDD|216044 pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain. 
          Length = 124

 Score = 30.6 bits (69), Expect = 0.87
 Identities = 13/33 (39%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 573 CLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
           CLDVP    D  P V +  C G   Q WT   +
Sbjct: 14  CLDVPGGSADGGP-VGLYPCHGGGNQLWTLTGD 45


>gnl|CDD|240336 PTZ00260, PTZ00260, dolichyl-phosphate beta-glucosyltransferase;
           Provisional.
          Length = 333

 Score = 32.0 bits (73), Expect = 0.95
 Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 11/101 (10%)

Query: 93  SIVIVFHNEAWSALLRTVHSVI------SRSPRSMLKEILLVDDAS---TREFLKSSLDE 143
           SIVI  +NE    L + +   I      SR       EI++V+D S   T +  K    +
Sbjct: 73  SIVIPAYNEE-DRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLKVAKDFWRQ 131

Query: 144 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
            +   ++  R++      G   A  +G   + G+ ++ +DA
Sbjct: 132 NINP-NIDIRLLSLLRNKGKGGAVRIGMLASRGKYILMVDA 171



 Score = 32.0 bits (73), Expect = 0.95
 Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 11/101 (10%)

Query: 201 SIVIVFHNEAWSALLRTVHSVI------SRSPRSMLKEILLVDDAS---TREFLKSSLDE 251
           SIVI  +NE    L + +   I      SR       EI++V+D S   T +  K    +
Sbjct: 73  SIVIPAYNEE-DRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLKVAKDFWRQ 131

Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
            +   ++  R++      G   A  +G   + G+ ++ +DA
Sbjct: 132 NINP-NIDIRLLSLLRNKGKGGAVRIGMLASRGKYILMVDA 171


>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
           Provisional.
          Length = 522

 Score = 32.0 bits (73), Expect = 1.1
 Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 464 FKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPH 504
           F    E   Q +K  ++ R  +R+ +K H+++ Y  + W H
Sbjct: 55  FNKIDEVYYQNEKLNIKRRESVREAMK-HAWEGYKEYAWGH 94


>gnl|CDD|223207 COG0129, IlvD, Dihydroxyacid dehydratase/phosphogluconate
           dehydratase [Amino acid transport and metabolism /
           Carbohydrate transport and metabolism].
          Length = 575

 Score = 31.0 bits (71), Expect = 1.9
 Identities = 26/85 (30%), Positives = 36/85 (42%), Gaps = 24/85 (28%)

Query: 373 MAGGLFAI----DRAYFFHIGAYDEEMQVWG---GENLEMSFRVWQCGGSIEIAPCSHVA 425
            AGG+ A+     +A   H     + + V G   GENLE            EI   +   
Sbjct: 328 RAGGVPAVMKELLKAGLLH----GDVLTVTGKTLGENLEG----------AEIRD-TPDR 372

Query: 426 HLFRKAS-PYSFPGGVSEVLYGNLA 449
            + R    P+S  GG++ VL GNLA
Sbjct: 373 DVIRPLDNPFSADGGLA-VLKGNLA 396


>gnl|CDD|133038 cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in
           exopolysaccharide amylovora biosynthesis.  AmsE is a
           glycosyltransferase involved in exopolysaccharide
           amylovora biosynthesis in Erwinia amylovora. Amylovara
           is one of the three exopolysaccharide produced by E.
           amylovora. Amylovara-deficient mutants are
           non-pathogenic. It is a subfamily of Glycosyltransferase
           Family GT2, which includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 201

 Score = 29.6 bits (67), Expect = 3.1
 Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 4/88 (4%)

Query: 97  VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
           V+  E    L   + S++ ++      E++LV D    + L   L+E+  KL  P +V+ 
Sbjct: 6   VYIKEKPEFLREALESILKQT--LPPDEVVLVKDGPVTQSLNEVLEEFKRKL--PLKVVP 61

Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDA 184
                GL KA   G +    + +  +D 
Sbjct: 62  LEKNRGLGKALNEGLKHCTYDWVARMDT 89



 Score = 29.6 bits (67), Expect = 3.1
 Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 4/88 (4%)

Query: 205 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 264
           V+  E    L   + S++ ++      E++LV D    + L   L+E+  KL  P +V+ 
Sbjct: 6   VYIKEKPEFLREALESILKQT--LPPDEVVLVKDGPVTQSLNEVLEEFKRKL--PLKVVP 61

Query: 265 SPGRVGLIKARLLGARQAEGEILVFLDA 292
                GL KA   G +    + +  +D 
Sbjct: 62  LEKNRGLGKALNEGLKHCTYDWVARMDT 89


>gnl|CDD|167649 PRK03963, PRK03963, V-type ATP synthase subunit E; Provisional.
          Length = 198

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 2/101 (1%)

Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGR-VGLIK 273
           L     +IS    ++ + +  + +    E LK+   E V +L     V+RS  R + LI 
Sbjct: 76  LAVQEELISEVLEAVRERLAELPEDEYFETLKALTKEAVEELGEDKVVVRSNERTLKLID 135

Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWL-ENLVARVAEDRT 313
           +RL   R   G++ + L    E   G + E     +  D T
Sbjct: 136 SRLEEIRDELGDVEIELGEPIETIGGVIVETKDGTIRVDNT 176


>gnl|CDD|226577 COG4092, COG4092, Predicted glycosyltransferase involved in capsule
           biosynthesis [Cell envelope biogenesis, outer membrane].
          Length = 346

 Score = 29.2 bits (65), Expect = 6.2
 Identities = 28/147 (19%), Positives = 51/147 (34%), Gaps = 23/147 (15%)

Query: 273 KARLLGA----RQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR------VVCPVIDI 322
                GA     + E  +++FLD  C  +      +++     + R      +V PV  +
Sbjct: 80  ICANNGADYSHEKCESNLVLFLDVDCFGSSDNFAKMLSIATIKKMRTNIDAPLVLPVYHL 139

Query: 323 ISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDR 382
               T  +    ++   A  +E             + + +     F  P     +F I+R
Sbjct: 140 NKADTQVFFDVEDMFLDAMIFE-----------SPLAEFRKEDNFFIAPYT--NIFLINR 186

Query: 383 AYFFHIGAYDEEMQVWGGENLEMSFRV 409
             F   G YDE  +  G E+ E   R+
Sbjct: 187 RMFSLTGGYDERFRGHGSEDFEFLTRL 213


>gnl|CDD|165267 PHA02961, PHA02961, hypothetical protein; Provisional.
          Length = 658

 Score = 29.5 bits (66), Expect = 7.1
 Identities = 19/64 (29%), Positives = 28/64 (43%), Gaps = 9/64 (14%)

Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHF--RWYTYGSSDAIIKRKDF 364
            + ED T+ +C  IDI       Y +  EL   AF W L++    +     D I  +KD 
Sbjct: 288 NITEDETKFICKHIDI-------YKKKSELFVNAFRWHLYYCDNMHPQKIFDPITCKKDK 340

Query: 365 TEPF 368
            E +
Sbjct: 341 HEKY 344


>gnl|CDD|217382 pfam03135, CagE_TrbE_VirB, CagE, TrbE, VirB family, component of
           type IV transporter system.  This family includes the
           Helicobacter pylori protein CagE, which together with
           other proteins from the cag pathogenicity island (PAI),
           encodes a type IV transporter secretion system. The
           precise role of CagE is not known, but studies in animal
           models have shown that it is essential for pathogenesis
           in Helicobacter pylori induced gastritis and peptic
           ulceration. Indeed, the expression of the cag PAI has
           been shown to be essential for stimulating human gastric
           epithelial cell apoptosis in vitro. Similar type IV
           transport systems are also found in other bacteria. This
           family includes the TrbE and VirB proteins from the
           respective trb and Vir conjugal transfer systems in
           Agrobacterium tumefaciens. Homologues of VirB proteins
           from other species are also members of this family, e.g.
           VirB from Brucella suis.
          Length = 205

 Score = 28.8 bits (65), Expect = 7.3
 Identities = 20/74 (27%), Positives = 27/74 (36%), Gaps = 7/74 (9%)

Query: 136 FLKSSLDEYVAKLSVP----TRVIRSPG---RVGLIKARLLGARQAEGEILVFLDAHCEC 188
                LDEY+A   VP      +IR  G     G+I  +    +   G + V LD   E 
Sbjct: 34  PRGRLLDEYIATSRVPFGKDGFIIRHAGNEKFAGIISIKEYPDKTRSGALDVLLDLDFEF 93

Query: 189 TLVFNEEFLPKSSI 202
            L  +  F  K   
Sbjct: 94  ILTQSFSFKSKQKA 107


>gnl|CDD|238148 cd00244, AlgLyase, Alginate Lyase A1-III; enzymatically
           depolymerizes alginate, a complex copolymer of
           beta-D-mannuronate and alpha-L-guluronate, by cleaving
           the beta-(1,4) glycosidic bond.
          Length = 339

 Score = 28.7 bits (64), Expect = 9.3
 Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 1/37 (2%)

Query: 341 FNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGL 377
           F W L      YG +  + + K  T PFK   + G +
Sbjct: 300 FAW-LEPYCALYGCAPDVRELKFMTVPFKDFRLGGDV 335


>gnl|CDD|211965 TIGR04242, nodulat_NodC, chitooligosaccharide synthase NodC.
           Members of this family are NodC, an
           N-acetylglucosaminyltransferase involved in the
           production of nodulation factors through which rhizobia
           establish symbioses with leguminous plants.
          Length = 395

 Score = 28.9 bits (65), Expect = 9.4
 Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 6/110 (5%)

Query: 77  PKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST-RE 135
           P   +     + LP   +++   NE    L   + S+ ++     L+ + +VDD ST R+
Sbjct: 35  PATSSDALPSDPLPSVDVIVPCFNEDPRTLSECLASIAAQDYAGKLR-VYVVDDGSTNRD 93

Query: 136 FLKSSLDEYVAKLSVPT-RVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
            L    D Y    S P    I  P  VG  KA++   R++ G++++ +D+
Sbjct: 94  ALVPVHDAYA---SDPRFNFILLPKNVGKRKAQIAAIRRSSGDLVLNVDS 140


>gnl|CDD|224508 COG1592, COG1592, Rubrerythrin [Energy production and conversion].
          Length = 166

 Score = 28.1 bits (63), Expect = 9.8
 Identities = 14/59 (23%), Positives = 19/59 (32%), Gaps = 3/59 (5%)

Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
           H  +  +          V      NL         E  E Y  F   AE++  K+A RS
Sbjct: 52  HAKNHLKLLGKLLL---VLGDTRENLEEAIEGETYEITEMYPVFAEVAEEEGFKEAARS 107


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.138    0.426 

Gapped
Lambda     K      H
   0.267   0.0634    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,544,805
Number of extensions: 3135760
Number of successful extensions: 3194
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3141
Number of HSP's successfully gapped: 81
Length of query: 605
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 502
Effective length of database: 6,369,140
Effective search space: 3197308280
Effective search space used: 3197308280
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 62 (27.9 bits)