RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3172
         (157 letters)



>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
           mucin-type O-linked glycans.  UDP-GalNAc: polypeptide
           alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
           initiate the formation of mucin-type, O-linked glycans
           by catalyzing the transfer of
           alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
           hydroxyl groups of Ser or Thr residues of core proteins
           to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
           enzymes are type II membrane proteins with a GT-A type
           catalytic domain and a lectin domain located on the
           lumen side of the Golgi apparatus. In human, there are
           15 isozymes of pp-GalNAc-Ts, representing the largest of
           all glycosyltransferase families. Each isozyme has
           unique but partially redundant substrate specificity for
           glycosylation sites on acceptor proteins.
          Length = 299

 Score =  161 bits (410), Expect = 7e-50
 Identities = 60/93 (64%), Positives = 66/93 (70%), Gaps = 5/93 (5%)

Query: 70  SVIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELE-----RIEKV 124
           SVII FHNEA S LLR+V S+INRTP  LLKEIILVDDFSD P     LE      + KV
Sbjct: 1   SVIIIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKV 60

Query: 125 SVLRNDKREGLMRSRVKGADAATAPVLTFLDSH 157
            VLR  KREGL+R+R+ GA AAT  VL FLDSH
Sbjct: 61  KVLRLKKREGLIRARIAGARAATGDVLVFLDSH 93


>gnl|CDD|215980 pfam00535, Glycos_transf_2, Glycosyl transferase family 2.  Diverse
           family, transferring sugar from UDP-glucose,
           UDP-N-acetyl- galactosamine, GDP-mannose or
           CDP-abequose, to a range of substrates including
           cellulose, dolichol phosphate and teichoic acids.
          Length = 168

 Score = 67.5 bits (165), Expect = 6e-15
 Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 8/92 (8%)

Query: 70  SVIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIEK----VS 125
           SVII  +NE    L  ++ S++N+T  +   EII+VDD S D +  E  E   K    V 
Sbjct: 1   SVIIPTYNEE-KYLEETLESLLNQT--YKNFEIIVVDDGSTDGT-VEIAEEYAKNDPRVR 56

Query: 126 VLRNDKREGLMRSRVKGADAATAPVLTFLDSH 157
           V+R ++  G   +R  G   AT   + FLD+ 
Sbjct: 57  VIRLEENLGKAAARNAGLKLATGDYIAFLDAD 88


>gnl|CDD|132997 cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A)
           includes diverse families of glycosyl transferases with
           a common GT-A type structural fold.
           Glycosyltransferases (GTs) are enzymes that synthesize
           oligosaccharides, polysaccharides, and glycoconjugates
           by transferring the sugar moiety from an activated
           nucleotide-sugar donor to an acceptor molecule, which
           may be a growing oligosaccharide, a lipid, or a protein.
            Based on the stereochemistry of the donor and acceptor
           molecules, GTs are classified as either retaining or
           inverting enzymes. To date, all GT structures adopt one
           of two possible folds, termed GT-A fold and GT-B fold.
           This hierarchy includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. The majority of the proteins in this
           superfamily are Glycosyltransferase family 2 (GT-2)
           proteins. But it also includes families GT-43, GT-6,
           GT-8, GT13 and GT-7; which are evolutionarily related to
           GT-2 and share structure similarities.
          Length = 156

 Score = 50.2 bits (120), Expect = 2e-08
 Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 8/91 (8%)

Query: 71  VIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIEK----VSV 126
           VII  +NE    L R + S++ +T  +   E+I+VDD S D +  E LE   K    V  
Sbjct: 1   VIIPAYNEEPY-LERCLESLLAQTYPNF--EVIVVDDGSTDGT-LEILEEYAKKDPRVIR 56

Query: 127 LRNDKREGLMRSRVKGADAATAPVLTFLDSH 157
           + N++ +GL  +R  G  AA    + FLD+ 
Sbjct: 57  VINEENQGLAAARNAGLKAARGEYILFLDAD 87


>gnl|CDD|223539 COG0463, WcaA, Glycosyltransferases involved in cell wall
           biogenesis [Cell envelope biogenesis, outer membrane].
          Length = 291

 Score = 50.9 bits (120), Expect = 3e-08
 Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 6/90 (6%)

Query: 70  SVIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIEKVS---V 126
           SV+I  +NE    L  ++ S++N+T      EII+VDD S D +    +E   K      
Sbjct: 6   SVVIPTYNEEEY-LPEALESLLNQTYKDF--EIIVVDDGSTDGTTEIAIEYGAKDVRVIR 62

Query: 127 LRNDKREGLMRSRVKGADAATAPVLTFLDS 156
           L N++  GL  +R  G + A    + FLD+
Sbjct: 63  LINERNGGLGAARNAGLEYARGDYIVFLDA 92


>gnl|CDD|234419 TIGR03965, mycofact_glyco, mycofactocin system glycosyltransferase.
            Members of this protein family are putative
           glycosyltransferases, members of pfam00535 (glycosyl
           transferase family 2). Members appear mostly in the
           Actinobacteria, where they appear to be part of a system
           for converting a precursor peptide (TIGR03969) into a
           novel redox carrier designated mycofactocin. A radical
           SAM enzyme, TIGR03962, is a proposed to be a key
           maturase for mycofactocin.
          Length = 467

 Score = 43.6 bits (103), Expect = 1e-05
 Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 6/101 (5%)

Query: 59  RHALPRDLPATSVIITFHN--EARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGE 116
           R AL    P+ +V++   N     + LL +++++          E+I+VDD S+DP    
Sbjct: 66  RPALLPSPPSVTVVVPVRNRPAGLARLLAALLALDYPRDRL---EVIVVDDGSEDPVPTR 122

Query: 117 ELERIE-KVSVLRNDKREGLMRSRVKGADAATAPVLTFLDS 156
                   V V+R+ +R+G   +R  GA AA    + F DS
Sbjct: 123 AARGARLPVRVIRHPRRQGPAAARNAGARAARTEFVAFTDS 163


>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
           wall biogenesis [Cell envelope biogenesis, outer
           membrane].
          Length = 439

 Score = 43.0 bits (101), Expect = 2e-05
 Identities = 28/99 (28%), Positives = 43/99 (43%), Gaps = 11/99 (11%)

Query: 65  DLPATSVIITFHNEARSALLRSVVSIINRT-PAHLLKEIILVDDFSDDPSD------GEE 117
            LP  SVII  +NE    L  ++ S++++  P +   E+I+VDD S D +       G E
Sbjct: 52  LLPKVSVIIPAYNEEPEVLEETLESLLSQDYPRY---EVIVVDDGSTDETYEILEELGAE 108

Query: 118 LERIEKVSVLRNDKREGLMRSRVKGADAATAPVLTFLDS 156
                +V      K  G   +   G   A   V+  LD+
Sbjct: 109 YGPNFRVIYPE-KKNGGKAGALNNGLKRAKGDVVVILDA 146


>gnl|CDD|133022 cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member
           of the Glycosyltransferase 2 superfamily.  DPM1 is the
           catalytic subunit of eukaryotic dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. In higher eukaryotes,the enzyme has three
           subunits, DPM1, DPM2 and DPM3. DPM is synthesized from
           dolichol phosphate and GDP-Man on the cytosolic surface
           of the ER membrane by DPM synthase and then is flipped
           onto the luminal side and used as a donor substrate. In
           lower eukaryotes, such as Saccharomyces cerevisiae and
           Trypanosoma brucei, DPM synthase consists of a single
           component (Dpm1p and TbDpm1, respectively) that
           possesses one predicted transmembrane region near the C
           terminus for anchoring to the ER membrane. In contrast,
           the Dpm1 homologues of higher eukaryotes, namely fission
           yeast, fungi, and animals, have no transmembrane region,
           suggesting the existence of adapter molecules for
           membrane anchoring. This family also includes bacteria
           and archaea DPM1_like enzymes. However, the enzyme
           structure and mechanism of function are not well
           understood. The UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate. This protein family belongs to
           Glycosyltransferase 2 superfamily.
          Length = 185

 Score = 41.4 bits (98), Expect = 4e-05
 Identities = 24/94 (25%), Positives = 42/94 (44%), Gaps = 14/94 (14%)

Query: 71  VIITFHNEARSALLRSVVSIINRTPAHLLK----EIILVDDFSDDPSDGEELERI----E 122
           V+I  +NE  +     +  ++ R  A L +    EII+VDD S D +  E    +     
Sbjct: 1   VVIPAYNEEEN-----IPELVERLLAVLEEGYDYEIIVVDDGSTDGT-AEIARELAARVP 54

Query: 123 KVSVLRNDKREGLMRSRVKGADAATAPVLTFLDS 156
           +V V+R  +  G   +   G  AA   ++  +D+
Sbjct: 55  RVRVIRLSRNFGKGAAVRAGFKAARGDIVVTMDA 88


>gnl|CDD|133061 cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose
           synthase (CESA) superfamily.  This is a subfamily of
           cellulose synthase (CESA) superfamily.  CESA superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains.  The
           members of the superfamily include cellulose synthase
           catalytic subunit, chitin synthase, glucan biosynthesis
           protein and other families of CESA-like proteins.
          Length = 251

 Score = 39.1 bits (92), Expect = 3e-04
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 8/97 (8%)

Query: 63  PRDLPATSVIITFHNEAR--SALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELER 120
           P  LP  ++II  +NE     A L +++++    P   L EII+V D S D +  E    
Sbjct: 25  PAYLPTVTIIIPAYNEEAVIEAKLENLLAL--DYPRDRL-EIIVVSDGSTDGTA-EIARE 80

Query: 121 I--EKVSVLRNDKREGLMRSRVKGADAATAPVLTFLD 155
              + V +LR  +R G   +  +    AT  ++ F D
Sbjct: 81  YADKGVKLLRFPERRGKAAALNRALALATGEIVVFTD 117


>gnl|CDD|133013 cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase
           family-2 subfamily with unknown function.
           Glycosyltransferase family 2 (GT-2) subfamily of unknown
           function. GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 221

 Score = 36.8 bits (86), Expect = 0.002
 Identities = 29/88 (32%), Positives = 37/88 (42%), Gaps = 13/88 (14%)

Query: 70  SVIITFHNEARS--ALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIEKVSVL 127
           S+II   NEA +   LL S+  +          EII+VD  S D      + R   V V+
Sbjct: 2   SIIIPTLNEAENLPRLLASLRRLNPLP-----LEIIVVDGGSTD--GTVAIARSAGVVVI 54

Query: 128 RNDK-REGLMRSRVKGADAATAPVLTFL 154
            + K R   M     GA AA    L FL
Sbjct: 55  SSPKGRARQMN---AGAAAARGDWLLFL 79


>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
           biosynthesis of succinoglycan.  Succinoglycan
           Biosynthesis Protein ExoA catalyzes the formation of a
           beta-1,3 linkage of the second sugar (glucose) of the
           succinoglycan with the galactose on the lipid carrie.
           Succinoglycan is an acidic exopolysaccharide that is
           important for invasion of the nodules. Succinoglycan is
           a high-molecular-weight polymer composed of repeating
           octasaccharide units. These units are synthesized on
           membrane-bound isoprenoid lipid carriers, beginning with
           galactose followed by seven glucose molecules, and
           modified by the addition of acetate, succinate, and
           pyruvate. ExoA is a membrane protein with a
           transmembrance domain at c-terminus.
          Length = 249

 Score = 36.4 bits (85), Expect = 0.003
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 5/44 (11%)

Query: 70  SVIITFHNEARS--ALLRSVVSIINRTPAHLLKEIILVDDFSDD 111
           S+II   NE +    LL S   ++N++    L EII+VD  S D
Sbjct: 3   SIIIPVRNEEKYIEELLES---LLNQSYPKDLIEIIVVDGGSTD 43


>gnl|CDD|234532 TIGR04283, glyco_like_mftF, transferase 2,
           rSAM/selenodomain-associated.  This enzyme may transfer
           a nucleotide, or it sugar moiety, as part of a
           biosynthetic pathway. Other proposed members of the
           pathway include another transferase (TIGR04282), a
           phosphoesterase, and a radical SAM enzyme (TIGR04167)
           whose C-terminal domain (pfam12345) frequently contains
           a selenocysteine [Unknown function, Enzymes of unknown
           specificity].
          Length = 220

 Score = 34.8 bits (81), Expect = 0.009
 Identities = 27/95 (28%), Positives = 38/95 (40%), Gaps = 22/95 (23%)

Query: 70  SVIITFHNEARSALLRSVVSIINRTPAHLL-----KEIILVDDFSDDPSDGEELERIEKV 124
           S+II   NEA +         +    A L       E+I+VD  S D +   E+ R    
Sbjct: 2   SIIIPVLNEAAT---------LPELLADLQALPGDAEVIVVDGGSTDGT--VEIARSLGA 50

Query: 125 SVLRNDK-REGLMRSRVKGADAATAPVLTFL--DS 156
            V+ + K R   M     GA  A   +L FL  D+
Sbjct: 51  KVIHSPKGRARQMN---AGAALAKGDILLFLHADT 82


>gnl|CDD|133035 cd04192, GT_2_like_e, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 229

 Score = 33.8 bits (78), Expect = 0.020
 Identities = 18/67 (26%), Positives = 30/67 (44%), Gaps = 5/67 (7%)

Query: 71  VIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPS----DGEELERIEKVSV 126
           V+I   NEA + L R + S+          E+ILVDD S D +    +    +   ++ +
Sbjct: 1   VVIAARNEAEN-LPRLLQSLSALDYPKEKFEVILVDDHSTDGTVQILEFAAAKPNFQLKI 59

Query: 127 LRNDKRE 133
           L N +  
Sbjct: 60  LNNSRVS 66


>gnl|CDD|133027 cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins
           are required for O-antigen biosynthesis.  The rfbC gene
           encodes a predicted protein of 1,276 amino acids, which
           is required for O-antigen biosynthesis in Myxococcus
           xanthus. It is a subfamily of Glycosyltransferase Family
           GT2, which includes diverse families of glycosyl
           transferases with a common GT-A type structural fold,
           which has two tightly associated beta/alpha/beta domains
           that tend to form a continuous central sheet of at least
           eight beta-strands. These are enzymes that catalyze the
           transfer of sugar moieties from activated donor
           molecules to specific acceptor molecules, forming
           glycosidic bonds.
          Length = 202

 Score = 33.3 bits (77), Expect = 0.024
 Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 6/94 (6%)

Query: 67  PATSVIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERI----E 122
           P  S+++  +N     L  ++ S+  +T  +   E+ + DD S DP     L++      
Sbjct: 1   PLISIVMPVYNTPEKYLREAIESVRAQTYPNW--ELCIADDASTDPEVKRVLKKYAAQDP 58

Query: 123 KVSVLRNDKREGLMRSRVKGADAATAPVLTFLDS 156
           ++ V+  ++  G+  +     + AT   +  LD 
Sbjct: 59  RIKVVFREENGGISAATNSALELATGEFVALLDH 92


>gnl|CDD|133029 cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 166

 Score = 33.3 bits (77), Expect = 0.025
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 71  VIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIEKVSVLRND 130
           +II  +N     L   + S++ +T      E+I+VD+ S D S     E   +V ++RN 
Sbjct: 1   IIIVNYNSLEY-LKACLDSLLAQTYPDF--EVIVVDNASTDGSVELLRELFPEVRLIRNG 57

Query: 131 KREG 134
           +  G
Sbjct: 58  ENLG 61


>gnl|CDD|133030 cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to
           eukaryotic DPM1.  A family of  bacterial enzymes related
           to eukaryotic DPM1; Although the mechanism of eukaryotic
           enzyme is well studied, the mechanism of the  bacterial
           enzymes is not well understood. The eukaryotic DPM1 is
           the catalytic subunit of eukaryotic Dolichol-phosphate
           mannose (DPM) synthase. DPM synthase is required for
           synthesis of the glycosylphosphatidylinositol (GPI)
           anchor, N-glycan precursor, protein O-mannose, and
           C-mannose. The enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 181

 Score = 32.4 bits (75), Expect = 0.047
 Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 7/64 (10%)

Query: 71  VIITFHNEARS--ALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDG---EELERIEKVS 125
           +++  +NE  +   L   + +++         EII VDD S D +     E   R  +V 
Sbjct: 1   IVVPVYNEEENLPELYERLKAVLESLGYDY--EIIFVDDGSTDRTLEILRELAARDPRVK 58

Query: 126 VLRN 129
           V+R 
Sbjct: 59  VIRL 62


>gnl|CDD|133062 cd06442, DPM1_like, DPM1_like represents putative enzymes similar
           to eukaryotic DPM1.  Proteins similar to eukaryotic
           DPM1, including enzymes from bacteria and archaea; DPM1
           is the catalytic subunit of eukaryotic
           dolichol-phosphate mannose (DPM) synthase. DPM synthase
           is required for synthesis of the
           glycosylphosphatidylinositol (GPI) anchor, N-glycan
           precursor, protein O-mannose, and C-mannose. In higher
           eukaryotes,the enzyme has three subunits, DPM1, DPM2 and
           DPM3. DPM is synthesized from dolichol phosphate and
           GDP-Man on the cytosolic surface of the ER membrane by
           DPM synthase and then is flipped onto the luminal side
           and used as a donor substrate. In lower eukaryotes, such
           as Saccharomyces cerevisiae and Trypanosoma brucei, DPM
           synthase consists of a single component (Dpm1p and
           TbDpm1, respectively) that possesses one predicted
           transmembrane region near the C terminus for anchoring
           to the ER membrane. In contrast, the Dpm1 homologues of
           higher eukaryotes, namely fission yeast, fungi, and
           animals, have no transmembrane region, suggesting the
           existence of adapter molecules for membrane anchoring.
           This family also includes bacteria and archaea DPM1_like
           enzymes. However, the enzyme structure and mechanism of
           function are not well understood. This protein family
           belongs to Glycosyltransferase 2 superfamily.
          Length = 224

 Score = 32.5 bits (75), Expect = 0.049
 Identities = 25/94 (26%), Positives = 46/94 (48%), Gaps = 10/94 (10%)

Query: 71  VII-TFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDG--EELERI-EKVSV 126
           +II T+ NE  +  +  ++  ++     +  EII+VDD S D +     EL +   +V +
Sbjct: 1   IIIPTY-NEREN--IPELIERLDAALKGIDYEIIVVDDNSPDGTAEIVRELAKEYPRVRL 57

Query: 127 LRNDKREGLMRSRVKGADAATAPVLTFLD---SH 157
           +    + GL  + ++G  AA   V+  +D   SH
Sbjct: 58  IVRPGKRGLGSAYIEGFKAARGDVIVVMDADLSH 91


>gnl|CDD|133045 cd06423, CESA_like, CESA_like is  the cellulose synthase
           superfamily.  The cellulose synthase (CESA) superfamily
           includes a wide variety of glycosyltransferase family 2
           enzymes that share the common characteristic of
           catalyzing the elongation of polysaccharide chains. The
           members include cellulose synthase catalytic subunit,
           chitin synthase, glucan biosynthesis protein and other
           families of CESA-like proteins. Cellulose synthase
           catalyzes the polymerization reaction of cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues in  plants, most algae, some bacteria
           and fungi, and even some animals. In bacteria, algae and
           lower eukaryotes, there is a second unrelated type of
           cellulose synthase (Type II), which produces acylated
           cellulose, a derivative of cellulose. Chitin synthase
           catalyzes the incorporation of GlcNAc from substrate
           UDP-GlcNAc into chitin, which is a linear homopolymer of
           beta-(1,4)-linked GlcNAc residues and Glucan
           Biosynthesis protein catalyzes the elongation of
           beta-1,2 polyglucose chains of Glucan.
          Length = 180

 Score = 31.4 bits (72), Expect = 0.11
 Identities = 17/70 (24%), Positives = 32/70 (45%), Gaps = 12/70 (17%)

Query: 71  VIITFHNEARS--ALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIE----KV 124
           +I+  +NE       + S++++      +   E+I+VDD S D    E LE +     + 
Sbjct: 1   IIVPAYNEEAVIERTIESLLALD-----YPKLEVIVVDDGSTD-DTLEILEELAALYIRR 54

Query: 125 SVLRNDKREG 134
            ++  DK  G
Sbjct: 55  VLVVRDKENG 64


>gnl|CDD|182223 PRK10073, PRK10073, putative glycosyl transferase; Provisional.
          Length = 328

 Score = 30.4 bits (69), Expect = 0.30
 Identities = 27/93 (29%), Positives = 42/93 (45%), Gaps = 9/93 (9%)

Query: 67  PATSVIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIEK--- 123
           P  S+II  +N  +      + S+I +T   L  EII+V+D S D S  E  +   +   
Sbjct: 6   PKLSIIIPLYNAGKD-FRAFMESLIAQTWTAL--EIIIVNDGSTDNS-VEIAKHYAENYP 61

Query: 124 -VSVLRNDKREGLMRSRVKGADAATAPVLTFLD 155
            V +L +    G+  +R  G   AT   + F D
Sbjct: 62  HVRLL-HQANAGVSVARNTGLAVATGKYVAFPD 93


>gnl|CDD|188452 TIGR03937, PgaC_IcaA, poly-beta-1,6 N-acetyl-D-glucosamine
           synthase.  Members of this protein family are
           biofilm-forming enzymes that polymerize
           N-acetyl-D-glucosamine residues in beta(1,6) linkage.
           One named members is IcaA (intercellular adhesin protein
           A), an enzyme that acts (with aid of subunit IcaD) in
           Polysaccharide Intercellular Adhesin (PIA) biosynthesis
           in Staphylococcus epidermis). The homologous member in
           E. coli is designated PgaC. Members are often encoded
           next to a polysaccharide deacetylase and involved in
           biofilm formation. Note that chitin, although also made
           from N-acetylglucosamine, is formed with beta-1,4
           linkages.
          Length = 407

 Score = 30.3 bits (69), Expect = 0.34
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 16/66 (24%)

Query: 64  RDLPATSVIITFHNEARSALLRSVVSIINRTPAHLLK------EIILVDDFSDDPSDGEE 117
            + P  S+++  +NE   A +   +S       HLL       EII ++D S D +  E 
Sbjct: 42  DEYPGVSILVPCYNE--GANVEETIS-------HLLALRYPNFEIIAINDGSKDNT-AEI 91

Query: 118 LERIEK 123
           L+R+  
Sbjct: 92  LDRLAA 97


>gnl|CDD|133005 cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4
           glucosyltransferase is required for biosynthesis of
           lipooligosaccharide.  UDP-glucose: lipooligosaccharide
           (LOS)  beta-1-4-glucosyltransferase catalyzes the
           addition of the first residue, glucose, of the
           lacto-N-neotetrase structure to HepI of the LOS inner
           core.  LOS is the major constituent of the outer leaflet
           of the outer membrane of gram-positive bacteria. It
           consists of a short oligosaccharide chain of variable
           composition (alpha chain) attached to a branched inner
           core which is lined in turn to lipid A. Beta 1,4
           glucosyltransferase is required to attach the alpha
           chain to the inner core.
          Length = 229

 Score = 29.6 bits (67), Expect = 0.50
 Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 7/41 (17%)

Query: 71  VIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDD 111
           VIIT  NE R+ + R + S+        + EII+VD  S D
Sbjct: 5   VIITK-NEERN-IERCLESVKW-----AVDEIIVVDSGSTD 38


>gnl|CDD|132155 TIGR03111, glyc2_xrt_Gpos1, putative glycosyltransferase,
           exosortase G-associated.  Members of this protein family
           are probable glycosyltransferases of family 2, whose
           genes are near those for the exosortase homolog XrtG
           (TIGR03110), which is restricted to Gram-positive
           bacteria. Other genes in the conserved gene neighborhood
           include a 6-pyruvoyl tetrahydropterin synthase homolog
           (TIGR03112) and an uncharacterized intergral membrane
           protein (TIGR03766).
          Length = 439

 Score = 30.1 bits (68), Expect = 0.51
 Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 59  RHALPRDLPATSVIITFHNEARSALLRSVVSIINRT-PAHLLKEIILVDDFSDDPS 113
           R      LP  ++II  +N     L   + SI N+T P  L+  IIL ++ S D S
Sbjct: 41  RLKFIGKLPDITIIIPVYNSE-DTLFNCIESIYNQTYPIELID-IILANNQSTDDS 94


>gnl|CDD|213815 TIGR03469, HpnB, hopene-associated glycosyltransferase HpnB.  This
           family of genes include a glycosyl transferase, group 2
           domain (pfam00535) which are responsible, generally for
           the transfer of nucleotide-diphosphate sugars to
           substrates such as polysaccharides and lipids. The genes
           of this family are often found in the same genetic locus
           with squalene-hopene cyclase genes, and are never
           associated with genes for the metabolism of phytoene.
           Indeed, the members of this family appear to never be
           found in a genome lacking squalene-hopene cyclase (SHC),
           although not all genomes encoding SHC have this glycosyl
           transferase. In the organism Zymomonas mobilis the
           linkage of this gene to hopanoid biosynthesis has been
           noted and the gene named HpnB. Hopanoids are known to
           feature polar glycosyl head groups in many organisms.
          Length = 384

 Score = 29.6 bits (67), Expect = 0.66
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 2/49 (4%)

Query: 63  PRDLPATSVIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDD 111
           P   PA   ++   NEA   +   V S++ +     L  +ILVDD S D
Sbjct: 36  PEAWPAVVAVVPARNEA-DVIGECVTSLLEQDYPGKLH-VILVDDHSTD 82


>gnl|CDD|133031 cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked
           glycosylation.  UDP-glucose:dolichyl-phosphate
           glucosyltransferase (DPG_synthase) is a
           transmembrane-bound enzyme of the endoplasmic reticulum
           involved in protein N-linked glycosylation. This enzyme
           catalyzes the transfer of glucose from UDP-glucose to
           dolichyl phosphate.
          Length = 211

 Score = 29.1 bits (66), Expect = 0.79
 Identities = 18/59 (30%), Positives = 24/59 (40%), Gaps = 4/59 (6%)

Query: 101 EIILVDDFSDD--PSDGEEL--ERIEKVSVLRNDKREGLMRSRVKGADAATAPVLTFLD 155
           EII+VDD S D       +L  +    + VL   K  G   +   G  AA    + F D
Sbjct: 32  EIIVVDDGSKDGTAEVARKLARKNPALIRVLTLPKNRGKGGAVRAGMLAARGDYILFAD 90


>gnl|CDD|222329 pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2.  Members
           of this family of prokaryotic proteins include putative
           glucosyltransferases,.
          Length = 97

 Score = 27.9 bits (63), Expect = 0.99
 Identities = 14/52 (26%), Positives = 21/52 (40%), Gaps = 5/52 (9%)

Query: 90  IINRTPAHLLK----EIILVDDFSDDPSDGEELERIEKVSVLRNDKREGLMR 137
            +    AH         ++VD+ SDD +  E L  +  V V+R D      R
Sbjct: 6   FLPEFLAHHRALGFDHFLIVDNDSDDGTA-EILAALPDVGVIRTDLSYRDAR 56


>gnl|CDD|211965 TIGR04242, nodulat_NodC, chitooligosaccharide synthase NodC.
           Members of this family are NodC, an
           N-acetylglucosaminyltransferase involved in the
           production of nodulation factors through which rhizobia
           establish symbioses with leguminous plants.
          Length = 395

 Score = 28.9 bits (65), Expect = 1.00
 Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 2/58 (3%)

Query: 55  PACTRHALPRD-LPATSVIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDD 111
           PA +  ALP D LP+  VI+   NE    L   + SI  +  A  L+ + +VDD S +
Sbjct: 35  PATSSDALPSDPLPSVDVIVPCFNEDPRTLSECLASIAAQDYAGKLR-VYVVDDGSTN 91


>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
           Provisional.
          Length = 606

 Score = 29.0 bits (65), Expect = 1.1
 Identities = 10/41 (24%), Positives = 20/41 (48%), Gaps = 3/41 (7%)

Query: 9   VRYFDESAY---LEKGALRPGQDPYESNKYNQAASDNTPSN 46
           ++Y+++  +   L      P Q+PY+    N  AS+ T + 
Sbjct: 548 IKYYEDKFFTSRLLPPTFVPTQEPYDPRANNNEASEETETI 588


>gnl|CDD|226390 COG3872, COG3872, Predicted metal-dependent enzyme [General
           function prediction only].
          Length = 318

 Score = 29.0 bits (65), Expect = 1.1
 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 2/48 (4%)

Query: 84  LRSVVSIINRTPAHLLKEIILVDDFSD-DPSDGEELERIEKVSVLRND 130
           LR  VS+I  +    +K +    D  D DP + ++ +  EK S L   
Sbjct: 60  LRGFVSLI-ESLVIGIKHLNFSADRYDEDPKEDDDEKLSEKKSKLAMI 106


>gnl|CDD|236882 PRK11204, PRK11204, N-glycosyltransferase; Provisional.
          Length = 420

 Score = 28.7 bits (65), Expect = 1.2
 Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 16/65 (24%)

Query: 65  DLPATSVIITFHNEARSALLRSVVSIINRTPAHLLK------EIILVDDFSDDPSDGEEL 118
           + P  S+++  +NE  +         +  T +HLL       E+I ++D S D + GE L
Sbjct: 52  EYPGVSILVPCYNEGEN---------VEETISHLLALRYPNYEVIAINDGSSDNT-GEIL 101

Query: 119 ERIEK 123
           +R+  
Sbjct: 102 DRLAA 106


>gnl|CDD|119312 pfam10792, DUF2605, Protein of unknown function (DUF2605).  This
           family is conserved in Cyanobacteria. The function is
           not known.
          Length = 99

 Score = 27.6 bits (62), Expect = 1.4
 Identities = 15/51 (29%), Positives = 26/51 (50%), Gaps = 3/51 (5%)

Query: 98  LLKEII--LVDDFSDDPSDGEELERIEKVSVLRNDKREGLMRSRVKGADAA 146
           LLK ++  L++DF        EL   E +S +  ++++ L+  RVK A   
Sbjct: 8   LLKTLLEPLLEDFQHWFQRSRELLETEPISFMSPEEQQALL-ERVKEAQKE 57


>gnl|CDD|182669 PRK10714, PRK10714, undecaprenyl phosphate
           4-deoxy-4-formamido-L-arabinose transferase;
           Provisional.
          Length = 325

 Score = 28.2 bits (63), Expect = 1.8
 Identities = 18/49 (36%), Positives = 24/49 (48%), Gaps = 10/49 (20%)

Query: 70  SVIITFHNEARSALLRSVVSIINRTPAHLLK-----EIILVDDFSDDPS 113
           SV+I  +NE  S     +  +I RT A         EI+L+DD S D S
Sbjct: 9   SVVIPVYNEQES-----LPELIRRTTAACESLGKEYEILLIDDGSSDNS 52


>gnl|CDD|234160 TIGR03295, frhA, coenzyme F420 hydrogenase, subunit alpha.  This
           model represents that clade of F420-dependent
           hydrogenases (FRH) beta subunits found exclusively and
           universally in methanogenic archaea. This protein is a
           member of the Nickel-dependent hydrogenase superfamily
           represented by Pfam model, pfam00374.
          Length = 411

 Score = 27.9 bits (62), Expect = 2.3
 Identities = 12/46 (26%), Positives = 21/46 (45%)

Query: 83  LLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELERIEKVSVLR 128
           LLR +    NR  +H L   ++  DF  +       E I+++  +R
Sbjct: 92  LLRELTGCANRLHSHALHHFLIAPDFIPETDTKLVREVIKRIQRMR 137


>gnl|CDD|224629 COG1715, Mrr, Restriction endonuclease [Defense mechanisms].
          Length = 308

 Score = 27.4 bits (61), Expect = 3.2
 Identities = 15/86 (17%), Positives = 28/86 (32%), Gaps = 10/86 (11%)

Query: 82  ALLRSVVSIINRTPAHLLKEII--LVDDFSDDPSDGEELERIEKVSVLRND--------K 131
             +  V+          ++E I    ++F  D  +  E+    +     N         K
Sbjct: 8   TFMEPVLKYTADGQERSVREKIHAAAEEFGLDEQERAEVLPSGRQQTYANRIGWALTNLK 67

Query: 132 REGLMRSRVKGADAATAPVLTFLDSH 157
           R GL+ S ++G    T      L + 
Sbjct: 68  RAGLLDSPLRGYVQITDLGRQILKAF 93


>gnl|CDD|237230 PRK12849, groEL, chaperonin GroEL; Reviewed.
          Length = 542

 Score = 27.5 bits (62), Expect = 4.0
 Identities = 10/16 (62%), Positives = 10/16 (62%)

Query: 72 IITFHNEARSALLRSV 87
          II F  EAR AL R V
Sbjct: 4  IIKFDEEARRALERGV 19


>gnl|CDD|132942 cd06611, STKc_SLK_like, Catalytic domain of Ste20-like kinase-like
           Protein Serine/Threonine Kinases.  Serine/threonine
           kinases (STKs), Ste20-like kinase (SLK)-like subfamily,
           catalytic (c) domain. STKs catalyze the transfer of the
           gamma-phosphoryl group from ATP to serine/threonine
           residues on protein substrates. The SLK-like subfamily
           is part of a larger superfamily that includes the
           catalytic domains of other protein STKs, protein
           tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. Members of the subfamily include SLK, STK10
           (also called LOK for lymphocyte-oriented kinase), SmSLK
           (Schistosoma mansoni SLK), and related proteins. SLK
           promotes apoptosis through apoptosis signal-regulating
           kinase 1 (ASK1) and the mitogen-activated protein kinase
           (MAPK) p38. It also plays a role in mediating actin
           reorganization. STK10 is responsible in regulating the
           CD28 responsive element in T cells, as well as leukocyte
           function associated antigen (LFA-1)-mediated lymphocyte
           adhesion. SmSLK is capable of activating the MAPK Jun
           N-terminal kinase (JNK) pathway in human embryonic
           kidney (HEK) cells as well as in Xenopus oocytes. It may
           participate in regulating MAPK cascades during
           host-parasite interactions.
          Length = 280

 Score = 27.0 bits (60), Expect = 5.1
 Identities = 16/61 (26%), Positives = 30/61 (49%), Gaps = 11/61 (18%)

Query: 64  RDLPATSVIITFHNEAR------SALLRSVV----SIINRTPAHLLKEIILVDDFSDDPS 113
           RDL A ++++T   + +      SA  +S +    + I  TP  +  E++  + F D+P 
Sbjct: 127 RDLKAGNILLTLDGDVKLADFGVSAKNKSTLQKRDTFIG-TPYWMAPEVVACETFKDNPY 185

Query: 114 D 114
           D
Sbjct: 186 D 186


>gnl|CDD|221345 pfam11965, DUF3479, Domain of unknown function (DUF3479).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 160 amino acids in length. This
           domain is found associated with pfam02514.
          Length = 164

 Score = 26.0 bits (58), Expect = 7.2
 Identities = 12/50 (24%), Positives = 18/50 (36%)

Query: 71  VIITFHNEARSALLRSVVSIINRTPAHLLKEIILVDDFSDDPSDGEELER 120
           VI+T       AL R+   +       L   +    +   DP+  EEL  
Sbjct: 4   VIVTLDAHYNGALARAAERLNRERFPGLELSVFAAAELERDPAALEELAA 53


>gnl|CDD|163441 TIGR03729, acc_ester, putative phosphoesterase.  Members of this
          protein family belong to the larger family pfam00149
          (calcineurin-like phosphoesterase), a family largely
          defined by small motifs of metal-chelating residues.
          The subfamily in this model shows a good but imperfect
          co-occurrence in species with domain TIGR03715 that
          defines a novel class of signal peptide typical of the
          accessory secretory system.
          Length = 239

 Score = 26.1 bits (58), Expect = 9.1
 Identities = 7/17 (41%), Positives = 10/17 (58%)

Query: 4  DNHVDVRYFDESAYLEK 20
          D H+D+ +FD    LE 
Sbjct: 7  DLHIDLNHFDTEEMLET 23


>gnl|CDD|109608 pfam00558, Vpu, Vpu protein.  The Vpu protein contains an
           N-terminal transmembrane spanning region and a
           C-terminal cytoplasmic region. The HIV-1 Vpu protein
           stimulates virus production by enhancing the release of
           viral particles from infected cells. The VPU protein
           binds specifically to CD4.
          Length = 81

 Score = 25.0 bits (55), Expect = 9.7
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 98  LLKEIILVDDFSDDPSDGEELERIEK 123
           L+K I    + S + S+G+E E +  
Sbjct: 42  LIKRIRERAEDSGNESNGDEEEELAD 67


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.376 

Gapped
Lambda     K      H
   0.267   0.0790    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,158,769
Number of extensions: 742860
Number of successful extensions: 548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 539
Number of HSP's successfully gapped: 45
Length of query: 157
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 68
Effective length of database: 6,990,096
Effective search space: 475326528
Effective search space used: 475326528
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 55 (24.8 bits)