RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10463
(605 letters)
>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
mucin-type O-linked glycans. UDP-GalNAc: polypeptide
alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
initiate the formation of mucin-type, O-linked glycans
by catalyzing the transfer of
alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
hydroxyl groups of Ser or Thr residues of core proteins
to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
enzymes are type II membrane proteins with a GT-A type
catalytic domain and a lectin domain located on the
lumen side of the Golgi apparatus. In human, there are
15 isozymes of pp-GalNAc-Ts, representing the largest of
all glycosyltransferase families. Each isozyme has
unique but partially redundant substrate specificity for
glycosylation sites on acceptor proteins.
Length = 299
Score = 453 bits (1167), Expect = e-157
Identities = 165/303 (54%), Positives = 210/303 (69%), Gaps = 5/303 (1%)
Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 260
S++I+FHNEA S LLRTVHSVI+R+P +LKEI+LVDD S + LK L+EY K
Sbjct: 1 SVIIIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKV 60
Query: 261 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVI 320
+V+R R GLI+AR+ GAR A G++LVFLD+HCE +GWLE L+AR+AE+R VVCP+I
Sbjct: 61 KVLRLKKREGLIRARIAGARAATGDVLVFLDSHCEVNVGWLEPLLARIAENRKTVVCPII 120
Query: 321 DIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAI 380
D+I TF Y S G F+W LHF+W + +R+ T P ++P MAGGLFAI
Sbjct: 121 DVIDADTFEYRGSSGDARGGFDWSLHFKWLPLPEEE--RRRESPTAPIRSPTMAGGLFAI 178
Query: 381 DRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR-KASPYSFPGG 439
DR +F +G YDE M +WGGENLE+SF+VWQCGGSIEI PCS V H+FR K PY+FPGG
Sbjct: 179 DREWFLELGGYDEGMDIWGGENLELSFKVWQCGGSIEIVPCSRVGHIFRRKRKPYTFPGG 238
Query: 440 VSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLT 499
VL N RVA VWMDE+ E+++K PE D + R LR++LKC SFKWYL
Sbjct: 239 SGTVLR-NYKRVAEVWMDEYKEYFYKARPELRNI-DYGDLSERKALRERLKCKSFKWYLE 296
Query: 500 HVW 502
+V+
Sbjct: 297 NVY 299
Score = 158 bits (403), Expect = 4e-44
Identities = 55/97 (56%), Positives = 71/97 (73%)
Query: 93 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
S++I+FHNEA S LLRTVHSVI+R+P +LKEI+LVDD S + LK L+EY K
Sbjct: 1 SVIIIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKV 60
Query: 153 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
+V+R R GLI+AR+ GAR A G++LVFLD+HCE
Sbjct: 61 KVLRLKKREGLIRARIAGARAATGDVLVFLDSHCEVN 97
>gnl|CDD|215980 pfam00535, Glycos_transf_2, Glycosyl transferase family 2. Diverse
family, transferring sugar from UDP-glucose,
UDP-N-acetyl- galactosamine, GDP-mannose or
CDP-abequose, to a range of substrates including
cellulose, dolichol phosphate and teichoic acids.
Length = 168
Score = 107 bits (270), Expect = 3e-27
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 17/185 (9%)
Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 260
S++I +NE L T+ S+++++ + EI++VDD ST +EY AK
Sbjct: 1 SVIIPTYNEE-KYLEETLESLLNQTYK--NFEIIVVDDGSTDG-TVEIAEEY-AKNDPRV 55
Query: 261 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVI 320
RVIR +G AR G + A G+ + FLDA E WLE LV + ++ +V
Sbjct: 56 RVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVAPDWLEKLVELLEKNGADIVIGSR 115
Query: 321 DIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAI 380
+I+ T Y R+ +L R F + A +
Sbjct: 116 VVINGETRLYGRALRFELLLLLGKLGARSL------------GLKVLFLIGSNALYRREV 163
Query: 381 DRAYF 385
Sbjct: 164 LEELL 168
Score = 81.0 bits (200), Expect = 7e-18
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 9/107 (8%)
Query: 93 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
S++I +NE L T+ S+++++ + EI++VDD ST +EY AK
Sbjct: 1 SVIIPTYNEE-KYLEETLESLLNQTYK--NFEIIVVDDGSTDG-TVEIAEEY-AKNDPRV 55
Query: 153 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RVIR +G AR G + A G+ + FLDA E ++L K
Sbjct: 56 RVIRLEENLGKAAARNAGLKLATGDYIAFLDADDEVA----PDWLEK 98
>gnl|CDD|132997 cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A)
includes diverse families of glycosyl transferases with
a common GT-A type structural fold.
Glycosyltransferases (GTs) are enzymes that synthesize
oligosaccharides, polysaccharides, and glycoconjugates
by transferring the sugar moiety from an activated
nucleotide-sugar donor to an acceptor molecule, which
may be a growing oligosaccharide, a lipid, or a protein.
Based on the stereochemistry of the donor and acceptor
molecules, GTs are classified as either retaining or
inverting enzymes. To date, all GT structures adopt one
of two possible folds, termed GT-A fold and GT-B fold.
This hierarchy includes diverse families of glycosyl
transferases with a common GT-A type structural fold,
which has two tightly associated beta/alpha/beta domains
that tend to form a continuous central sheet of at least
eight beta-strands. The majority of the proteins in this
superfamily are Glycosyltransferase family 2 (GT-2)
proteins. But it also includes families GT-43, GT-6,
GT-8, GT13 and GT-7; which are evolutionarily related to
GT-2 and share structure similarities.
Length = 156
Score = 69.5 bits (170), Expect = 5e-14
Identities = 39/116 (33%), Positives = 59/116 (50%), Gaps = 5/116 (4%)
Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 261
++I +NE L R + S+++++ + E+++VDD ST L+ L+EY K R
Sbjct: 1 VIIPAYNEE-PYLERCLESLLAQTYPNF--EVIVVDDGSTDGTLEI-LEEYAKKDPRVIR 56
Query: 262 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVC 317
VI + GL AR G + A GE ++FLDA WLE LVA + D
Sbjct: 57 VINEENQ-GLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAV 111
Score = 57.9 bits (140), Expect = 5e-10
Identities = 43/150 (28%), Positives = 66/150 (44%), Gaps = 21/150 (14%)
Query: 94 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 153
++I +NE L R + S+++++ + E+++VDD ST L+ L+EY K R
Sbjct: 1 VIIPAYNEE-PYLERCLESLLAQTYPNF--EVIVVDDGSTDGTLEI-LEEYAKKDPRVIR 56
Query: 154 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT----LVFNEEFLPKSSIVIVFHNE 209
VI + GL AR G + A GE ++FLDA E L V
Sbjct: 57 VINEENQ-GLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAVGGP- 114
Query: 210 AWSALLRTVHSVISRSPRSMLKEILLVDDA 239
+ L R R +L+EI D+A
Sbjct: 115 -GNLLFR----------RELLEEIGGFDEA 133
>gnl|CDD|234419 TIGR03965, mycofact_glyco, mycofactocin system glycosyltransferase.
Members of this protein family are putative
glycosyltransferases, members of pfam00535 (glycosyl
transferase family 2). Members appear mostly in the
Actinobacteria, where they appear to be part of a system
for converting a precursor peptide (TIGR03969) into a
novel redox carrier designated mycofactocin. A radical
SAM enzyme, TIGR03962, is a proposed to be a key
maturase for mycofactocin.
Length = 467
Score = 67.9 bits (166), Expect = 6e-12
Identities = 77/296 (26%), Positives = 119/296 (40%), Gaps = 50/296 (16%)
Query: 151 PTRVIRSPGRVGLIKARLLGARQAEGEIL------VFLDAHCECTLVFNEEFLPKS---- 200
P R++R + ARLLG + G + L + +V L S
Sbjct: 21 PLRLLR----LAPAAARLLGGGRGGGLEVRDETSAALLRRLLDSGVVHPRPALLPSPPSV 76
Query: 201 SIVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP 259
++V+ N + L R + ++++ PR L E+++VDD S +P
Sbjct: 77 TVVVPVRNRP-AGLARLLAALLALDYPRDRL-EVIVVDDGSEDP----VPTRAARGARLP 130
Query: 260 TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA-----RVAEDRTR 314
RVIR P R G AR GAR A E + F D+ GWL L+A VA R
Sbjct: 131 VRVIRHPRRQGPAAARNAGARAARTEFVAFTDSDVVPRPGWLRALLAHFDDPGVALVAPR 190
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRW-YTYGSSDAIIKRKDFTEPFKTPAM 373
VV + + VRS L G + R +Y S A++ R
Sbjct: 191 VVALPAEDTRLARYEAVRS-SLDLGPEEAVVRPRGPVSYVPSAALLVR------------ 237
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
R +G +DE ++V GE++++ +R+ + GG + P + VAH R
Sbjct: 238 --------RRALLEVGGFDERLEV--GEDVDLCWRLCEAGGRVRYEPAAVVAHDHR 283
Score = 49.8 bits (119), Expect = 3e-06
Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 7/100 (7%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDASTREFLKSSLDEYVAK 147
P ++V+ N + L R + ++++ PR L E+++VDD S
Sbjct: 73 PPSVTVVVPVRNRP-AGLARLLAALLALDYPRDRL-EVIVVDDGSEDP----VPTRAARG 126
Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+P RVIR P R G AR GAR A E + F D+
Sbjct: 127 ARLPVRVIRHPRRQGPAAARNAGARAARTEFVAFTDSDVV 166
>gnl|CDD|220577 pfam10111, Glyco_tranf_2_2, Glycosyltransferase like family 2.
Members of this family of prokaryotic proteins include
putative glucosyltransferase, which are involved in
bacterial capsule biosynthesis.
Length = 278
Score = 64.7 bits (158), Expect = 2e-11
Identities = 49/246 (19%), Positives = 84/246 (34%), Gaps = 30/246 (12%)
Query: 200 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLK---EILLVDDASTREF---LKSSLDE-- 251
S I+ V+ + +L+ + S ++R + EI++VD S F LK + +
Sbjct: 1 SVIIPVYLSLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSFAKELKKIIAKNG 60
Query: 252 --YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
+ SP KAR GA + + + FLD C + LE ++
Sbjct: 61 AINYISHKTHKDLF-SPA-----KARNRGAEYSSSDFIFFLDVDCLISPDTLEKIIKHFQ 114
Query: 310 EDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFK 369
E +T + ++ F L Y I T
Sbjct: 115 ELQTNPNAFLALPCLYLSKEGSEIF----------LSDFKYL---LREEILEDAITGKST 161
Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
A+A I+R +F IG +DE + GGE+ E+ +R+ + +
Sbjct: 162 FFALASSCILINRDFFLKIGGFDENFRGHGGEDFELLYRLLLYYKKFPPPK-DLLTYDEY 220
Query: 430 KASPYS 435
K
Sbjct: 221 KWPITY 226
Score = 37.4 bits (87), Expect = 0.014
Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 16/116 (13%)
Query: 92 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLK---EILLVDDASTREF---LKSSLDE-- 143
S I+ V+ + +L+ + S ++R + EI++VD S F LK + +
Sbjct: 1 SVIIPVYLSLRTRDILKRLQSCLNRLSYFLSDPDVEIIVVDGDSPLSFAKELKKIIAKNG 60
Query: 144 --YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ SP KAR GA + + + FLD C + E+ +
Sbjct: 61 AINYISHKTHKDLF-SPA-----KARNRGAEYSSSDFIFFLDVDCLISPDTLEKII 110
>gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the
biosynthesis of succinoglycan. Succinoglycan
Biosynthesis Protein ExoA catalyzes the formation of a
beta-1,3 linkage of the second sugar (glucose) of the
succinoglycan with the galactose on the lipid carrie.
Succinoglycan is an acidic exopolysaccharide that is
important for invasion of the nodules. Succinoglycan is
a high-molecular-weight polymer composed of repeating
octasaccharide units. These units are synthesized on
membrane-bound isoprenoid lipid carriers, beginning with
galactose followed by seven glucose molecules, and
modified by the addition of acetate, succinate, and
pyruvate. ExoA is a membrane protein with a
transmembrance domain at c-terminus.
Length = 249
Score = 62.2 bits (152), Expect = 8e-11
Identities = 47/220 (21%), Positives = 87/220 (39%), Gaps = 29/220 (13%)
Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLS 257
SI+I NE + + S++++S L EI++VD S TRE ++ A
Sbjct: 3 SIIIPVRNEE-KYIEELLESLLNQSYPKDLIEIIVVDGGSTDGTREIVQE-----YAAKD 56
Query: 258 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVC 317
R+I +P R+ +G R + G+I++ +DAH ++ LV A RT
Sbjct: 57 PRIRLIDNPKRI-QSAGLNIGIRNSRGDIIIRVDAHAVYPKDYILELVE--ALKRTGADN 113
Query: 318 PVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTY-GSSDAIIKRKDFTEPFKTPAMAGG 376
+ + F + + GS + + + + G
Sbjct: 114 VGGPMETIGE-----------SKFQKAIAVAQSSPLGSGGSAYRGGAV-KIGYVDTVHHG 161
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
+ R F +G +DE + E+ E+++R+ + G I
Sbjct: 162 AY--RREVFEKVGGFDESLVR--NEDAELNYRLRKAGYKI 197
Score = 51.8 bits (125), Expect = 3e-07
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 93 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLS 149
SI+I NE + + S++++S L EI++VD S TRE ++ A
Sbjct: 3 SIIIPVRNEE-KYIEELLESLLNQSYPKDLIEIIVVDGGSTDGTREIVQE-----YAAKD 56
Query: 150 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
R+I +P R+ +G R + G+I++ +DAH
Sbjct: 57 PRIRLIDNPKRI-QSAGLNIGIRNSRGDIIIRVDAHAV 93
>gnl|CDD|133035 cd04192, GT_2_like_e, Subfamily of Glycosyltransferase Family GT2
of unknown function. GT-2 includes diverse families of
glycosyltransferases with a common GT-A type structural
fold, which has two tightly associated beta/alpha/beta
domains that tend to form a continuous central sheet of
at least eight beta-strands. These are enzymes that
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds. Glycosyltransferases have been
classified into more than 90 distinct sequence based
families.
Length = 229
Score = 57.7 bits (140), Expect = 2e-09
Identities = 47/208 (22%), Positives = 80/208 (38%), Gaps = 33/208 (15%)
Query: 202 IVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDAS---TREFLKSSLDE--YVAK 255
+VI NEA + L R + S+ + P+ E++LVDD S T + L+ + + + K
Sbjct: 1 VVIAARNEAEN-LPRLLQSLSALDYPKEKF-EVILVDDHSTDGTVQILEFAAAKPNFQLK 58
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ +RV S + L A + A+G+ +V DA C WL VA + +++ +
Sbjct: 59 ILNNSRVSISGKKNALTTA----IKAAKGDWIVTTDADCVVPSNWLLTFVAFIQKEQIGL 114
Query: 316 VC-PVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
V PVI A + + G P M
Sbjct: 115 VAGPVIYFKGKSLLAKFQRLD------------WLSLLGLIAGSFG-------LGKPFMC 155
Query: 375 GGL-FAIDRAYFFHIGAYDEEMQVWGGE 401
G A + FF +G ++ + G+
Sbjct: 156 NGANMAYRKEAFFEVGGFEGNDHIASGD 183
Score = 41.9 bits (99), Expect = 4e-04
Identities = 29/99 (29%), Positives = 49/99 (49%), Gaps = 12/99 (12%)
Query: 94 IVIVFHNEAWSALLRTVHSVISRS-PRSMLKEILLVDDAS---TREFLKSSLDE--YVAK 147
+VI NEA + L R + S+ + P+ E++LVDD S T + L+ + + + K
Sbjct: 1 VVIAARNEAEN-LPRLLQSLSALDYPKEKF-EVILVDDHSTDGTVQILEFAAAKPNFQLK 58
Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC 186
+ +RV S + L A + A+G+ +V DA C
Sbjct: 59 ILNNSRVSISGKKNALTTA----IKAAKGDWIVTTDADC 93
>gnl|CDD|133061 cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose
synthase (CESA) superfamily. This is a subfamily of
cellulose synthase (CESA) superfamily. CESA superfamily
includes a wide variety of glycosyltransferase family 2
enzymes that share the common characteristic of
catalyzing the elongation of polysaccharide chains. The
members of the superfamily include cellulose synthase
catalytic subunit, chitin synthase, glucan biosynthesis
protein and other families of CESA-like proteins.
Length = 251
Score = 56.4 bits (137), Expect = 7e-09
Identities = 28/117 (23%), Positives = 49/117 (41%), Gaps = 19/117 (16%)
Query: 75 RKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST- 133
R + + +LP +I+I +NE + + + ++++ EI++V D ST
Sbjct: 14 RLRPKPPSLPDPAYLPTVTIIIPAYNEE-AVIEAKLENLLALDYPRDRLEIIVVSDGSTD 72
Query: 134 ------REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
RE+ + +++R P R G A A GEI+VF DA
Sbjct: 73 GTAEIAREYADKGV-----------KLLRFPERRGKAAALNRALALATGEIVVFTDA 118
Score = 55.3 bits (134), Expect = 2e-08
Identities = 37/162 (22%), Positives = 63/162 (38%), Gaps = 21/162 (12%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST------- 241
+ + +LP +I+I +NE + + + ++++ EI++V D ST
Sbjct: 20 PSLPDPAYLPTVTIIIPAYNEE-AVIEAKLENLLALDYPRDRLEIIVVSDGSTDGTAEIA 78
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
RE+ + +++R P R G A A GEI+VF DA+ L
Sbjct: 79 REYADKGV-----------KLLRFPERRGKAAALNRALALATGEIVVFTDANALLDPDAL 127
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNW 343
LV A+ V + I+ L+W NW
Sbjct: 128 RLLVRHFADPSVGAVSGELVIVDGGGSG--SGEGLYWKYENW 167
>gnl|CDD|133029 cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2
of unknown function. GT-2 includes diverse families of
glycosyltransferases with a common GT-A type structural
fold, which has two tightly associated beta/alpha/beta
domains that tend to form a continuous central sheet of
at least eight beta-strands. These are enzymes that
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds. Glycosyltransferases have been
classified into more than 90 distinct sequence based
families.
Length = 166
Score = 54.5 bits (132), Expect = 1e-08
Identities = 42/227 (18%), Positives = 84/227 (37%), Gaps = 63/227 (27%)
Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 261
I+IV +N + L + S+++++ E+++VD+AST + E + +L R
Sbjct: 1 IIIVNYN-SLEYLKACLDSLLAQTYPD--FEVIVVDNASTDGSV-----ELLRELFPEVR 52
Query: 262 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED-RTRVVCPVI 320
+IR+ +G G R+A+G+ ++ L+ G L L+ +D +V P +
Sbjct: 53 LIRNGENLGFGAGNNQGIREAKGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPKV 112
Query: 321 DIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAI 380
GAF +
Sbjct: 113 -----------------SGAF------------------------------------LLV 119
Query: 381 DRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
R F +G +DE+ ++ E++++ R G + P + + H
Sbjct: 120 RREVFEEVGGFDEDFFLY-YEDVDLCLRARLAGYRVLYVPQAVIYHH 165
Score = 44.9 bits (107), Expect = 2e-05
Identities = 23/90 (25%), Positives = 48/90 (53%), Gaps = 8/90 (8%)
Query: 94 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 153
I+IV +N + L + S+++++ E+++VD+AST + E + +L R
Sbjct: 1 IIIVNYN-SLEYLKACLDSLLAQTYPD--FEVIVVDNASTDGSV-----ELLRELFPEVR 52
Query: 154 VIRSPGRVGLIKARLLGARQAEGEILVFLD 183
+IR+ +G G R+A+G+ ++ L+
Sbjct: 53 LIRNGENLGFGAGNNQGIREAKGDYVLLLN 82
>gnl|CDD|133045 cd06423, CESA_like, CESA_like is the cellulose synthase
superfamily. The cellulose synthase (CESA) superfamily
includes a wide variety of glycosyltransferase family 2
enzymes that share the common characteristic of
catalyzing the elongation of polysaccharide chains. The
members include cellulose synthase catalytic subunit,
chitin synthase, glucan biosynthesis protein and other
families of CESA-like proteins. Cellulose synthase
catalyzes the polymerization reaction of cellulose, an
aggregate of unbranched polymers of beta-1,4-linked
glucose residues in plants, most algae, some bacteria
and fungi, and even some animals. In bacteria, algae and
lower eukaryotes, there is a second unrelated type of
cellulose synthase (Type II), which produces acylated
cellulose, a derivative of cellulose. Chitin synthase
catalyzes the incorporation of GlcNAc from substrate
UDP-GlcNAc into chitin, which is a linear homopolymer of
beta-(1,4)-linked GlcNAc residues and Glucan
Biosynthesis protein catalyzes the elongation of
beta-1,2 polyglucose chains of Glucan.
Length = 180
Score = 53.8 bits (130), Expect = 2e-08
Identities = 46/204 (22%), Positives = 75/204 (36%), Gaps = 38/204 (18%)
Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLK------EILLVDDASTREFLKSSLDEYVAK 255
I++ +NE +VI R+ S+L E+++VDD ST + L+ L+E A
Sbjct: 1 IIVPAYNEE---------AVIERTIESLLALDYPKLEVIVVDDGSTDDTLEI-LEELAAL 50
Query: 256 LSVPTRVIRSPGRVGLIKARLL--GARQAEGEILVFLDAHCECTLGWLENLVARVAED-R 312
V+R G KA L G R A+G+I+V LDA L+ LV D +
Sbjct: 51 YIRRVLVVRDKENGG--KAGALNAGLRHAKGDIVVVLDADTILEPDALKRLVVPFFADPK 108
Query: 313 TRVVCPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
V + + + R + + F +
Sbjct: 109 VGAVQGRVRVRNGSENLLTRLQAIEYLSIFRLGRRAQSAL----------------GGVL 152
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEM 395
++G A R +G +DE+
Sbjct: 153 VLSGAFGAFRREALREVGGWDEDT 176
Score = 46.8 bits (112), Expect = 5e-06
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 20/99 (20%)
Query: 94 IVIVFHNEAWSALLRTVHSVISRSPRSMLK------EILLVDDASTREFLKSSLDEYVAK 147
I++ +NE +VI R+ S+L E+++VDD ST + L+ L+E A
Sbjct: 1 IIVPAYNEE---------AVIERTIESLLALDYPKLEVIVVDDGSTDDTLEI-LEELAAL 50
Query: 148 LSVPTRVIRSPGRVGLIKARLL--GARQAEGEILVFLDA 184
V+R G KA L G R A+G+I+V LDA
Sbjct: 51 YIRRVLVVRDKENGG--KAGALNAGLRHAKGDIVVVLDA 87
>gnl|CDD|224137 COG1216, COG1216, Predicted glycosyltransferases [General function
prediction only].
Length = 305
Score = 50.9 bits (122), Expect = 7e-07
Identities = 57/277 (20%), Positives = 105/277 (37%), Gaps = 24/277 (8%)
Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 257
PK SI+IV +N L+ + S+ +++ I++VD+ ST SL+ A+
Sbjct: 3 PKISIIIVTYNRG-EDLVECLASLAAQTYPD--DVIVVVDNGSTDG----SLEALKARFF 55
Query: 258 VPTRVIRSPGRVGLIKARLLGARQAEGE---ILVFLDAHCECTLGWLENLVARVAEDRTR 314
R+I + +G G + A + ++ L+ LE L+ ED
Sbjct: 56 PNVRLIENGENLGFAGGFNRGIKYALAKGDDYVLLLNPDTVVEPDLLEELLKAAEEDPAA 115
Query: 315 -VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VV P+I + + R G + L W + + E ++
Sbjct: 116 GVVGPLIRNYDESLYIDRR------GGESDGLTGGWRASPLLEIAPDLSSYLEVV--ASL 167
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
+G I R F +G +DE ++ E++++ R + G I P + + H +S
Sbjct: 168 SGACLLIRREAFEKVGGFDERFFIYY-EDVDLCLRARKAGYKIYYVPDAIIYH-KIGSSK 225
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA 470
S G Y +L R + + + +P
Sbjct: 226 GSSILG---TFYYHLRRNRIYFAKLNEKGLLLRSPLR 259
>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
wall biogenesis [Cell envelope biogenesis, outer
membrane].
Length = 439
Score = 50.3 bits (120), Expect = 2e-06
Identities = 50/238 (21%), Positives = 92/238 (38%), Gaps = 26/238 (10%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSS 248
++ LPK S++I +NE L T+ S++S+ E+++VDD S T E L+
Sbjct: 48 DADKLLPKVSVIIPAYNEEPEVLEETLESLLSQDYPR--YEVIVVDDGSTDETYEILEEL 105
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
EY V + G A G ++A+G+++V LDA L LV+
Sbjct: 106 GAEYGPNFRV---IYPEKKNGGKAGALNNGLKRAKGDVVVILDADTVPEPDALRELVSPF 162
Query: 309 AEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ V I + + + + + +FR S +I
Sbjct: 163 EDPPVGAVVGTPRIRNRPDPSNLLGRIQAIE--YLSAFYFRL-RAASKGGLI-------- 211
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
++G A R+ +G + E+ E+ +++ R+ G + P + V
Sbjct: 212 ---SFLSGSSSAFRRSALEEVGGWLEDT---ITEDADLTLRLHLRGYRVVYVPEAIVW 263
Score = 45.7 bits (108), Expect = 6e-05
Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 9/118 (7%)
Query: 70 TLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD 129
LP R K K ++ LPK S++I +NE L T+ S++S+ E+++VD
Sbjct: 35 VLPLSRPRK-KLPKDADKLLPKVSVIIPAYNEEPEVLEETLESLLSQDYPR--YEVIVVD 91
Query: 130 DAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
D S T E L+ EY V + G A G ++A+G+++V LDA
Sbjct: 92 DGSTDETYEILEELGAEYGPNFRV---IYPEKKNGGKAGALNNGLKRAKGDVVVILDA 146
>gnl|CDD|223539 COG0463, WcaA, Glycosyltransferases involved in cell wall
biogenesis [Cell envelope biogenesis, outer membrane].
Length = 291
Score = 49.7 bits (117), Expect = 2e-06
Identities = 52/289 (17%), Positives = 91/289 (31%), Gaps = 23/289 (7%)
Query: 88 FLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEY 144
+PK S+VI +NE L + S+++++ + EI++VDD S T E +
Sbjct: 1 MMPKVSVVIPTYNEE-EYLPEALESLLNQTYK--DFEIIVVDDGSTDGTTEIAIEYGAKD 57
Query: 145 VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC--ECTLVFNEEFLPKSSI 202
V + + GL AR G A G+ +VFLDA L+
Sbjct: 58 V-----RVIRLINERNGGLGAARNAGLEYARGDYIVFLDADDQHPPELIPLVAAGGDGDY 112
Query: 203 VIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRV 262
+ + L R + S + + + L +L+ D L + +
Sbjct: 113 IARLDDRDDIWLPRKLLSKLVKLLGNRLLGVLIPDGFGDLRLLVRDAVDGLRAFLEGRSR 172
Query: 263 IRSPGRVGLIK------ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
L+ LL R V L L L++ + V D ++
Sbjct: 173 FLRLLLRKLVLIRREVLEYLLLYRLLGASKRVLLGKLLRLGLLLLDSKLKAVPLDLLLLL 232
Query: 317 CPVIDIISDV----TFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
I ++S + R + L +
Sbjct: 233 GLFIALLSILLLLLLLLLRRLLGANLSEKGKPLLLIKLLILLKLILRLL 281
>gnl|CDD|133022 cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member
of the Glycosyltransferase 2 superfamily. DPM1 is the
catalytic subunit of eukaryotic dolichol-phosphate
mannose (DPM) synthase. DPM synthase is required for
synthesis of the glycosylphosphatidylinositol (GPI)
anchor, N-glycan precursor, protein O-mannose, and
C-mannose. In higher eukaryotes,the enzyme has three
subunits, DPM1, DPM2 and DPM3. DPM is synthesized from
dolichol phosphate and GDP-Man on the cytosolic surface
of the ER membrane by DPM synthase and then is flipped
onto the luminal side and used as a donor substrate. In
lower eukaryotes, such as Saccharomyces cerevisiae and
Trypanosoma brucei, DPM synthase consists of a single
component (Dpm1p and TbDpm1, respectively) that
possesses one predicted transmembrane region near the C
terminus for anchoring to the ER membrane. In contrast,
the Dpm1 homologues of higher eukaryotes, namely fission
yeast, fungi, and animals, have no transmembrane region,
suggesting the existence of adapter molecules for
membrane anchoring. This family also includes bacteria
and archaea DPM1_like enzymes. However, the enzyme
structure and mechanism of function are not well
understood. The UDP-glucose:dolichyl-phosphate
glucosyltransferase (DPG_synthase) is a
transmembrane-bound enzyme of the endoplasmic reticulum
involved in protein N-linked glycosylation. This enzyme
catalyzes the transfer of glucose from UDP-glucose to
dolichyl phosphate. This protein family belongs to
Glycosyltransferase 2 superfamily.
Length = 185
Score = 48.0 bits (115), Expect = 2e-06
Identities = 26/94 (27%), Positives = 39/94 (41%), Gaps = 9/94 (9%)
Query: 94 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLSV 150
+VI +NE + V +++ EI++VDD S T E +
Sbjct: 1 VVIPAYNEE-ENIPELVERLLAVLEEGYDYEIIVVDDGSTDGTAEIARELAARVPRV--- 56
Query: 151 PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
RVIR G A G + A G+I+V +DA
Sbjct: 57 --RVIRLSRNFGKGAAVRAGFKAARGDIVVTMDA 88
Score = 48.0 bits (115), Expect = 2e-06
Identities = 26/94 (27%), Positives = 39/94 (41%), Gaps = 9/94 (9%)
Query: 202 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLSV 258
+VI +NE + V +++ EI++VDD S T E +
Sbjct: 1 VVIPAYNEE-ENIPELVERLLAVLEEGYDYEIIVVDDGSTDGTAEIARELAARVPRV--- 56
Query: 259 PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
RVIR G A G + A G+I+V +DA
Sbjct: 57 --RVIRLSRNFGKGAAVRAGFKAARGDIVVTMDA 88
>gnl|CDD|133013 cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase
family-2 subfamily with unknown function.
Glycosyltransferase family 2 (GT-2) subfamily of unknown
function. GT-2 includes diverse families of
glycosyltransferases with a common GT-A type structural
fold, which has two tightly associated beta/alpha/beta
domains that tend to form a continuous central sheet of
at least eight beta-strands. These are enzymes that
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds. Glycosyltransferases have been
classified into more than 90 distinct sequence based
families.
Length = 221
Score = 45.6 bits (109), Expect = 2e-05
Identities = 57/216 (26%), Positives = 76/216 (35%), Gaps = 57/216 (26%)
Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL-SVP 259
SI+I NEA L R + S+ +P EI++VD ST D VA S
Sbjct: 2 SIIIPTLNEA-ENLPRLLASLRRLNPLP--LEIIVVDGGST--------DGTVAIARSAG 50
Query: 260 TRVIRS-PGRVGLIKARLL--GARQAEGEILVFLDAHCECTL--GWLENLVARVAEDRTR 314
VI S GR AR + GA A G+ L+FL H + L W ++ + D
Sbjct: 51 VVVISSPKGR-----ARQMNAGAAAARGDWLLFL--HADTRLPPDWDAAIIETLRADGAV 103
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
+ L GA L R F P
Sbjct: 104 AGAFRLRFDDPGPRL----RLLELGAN---LRSRL------------------FGLPYGD 138
Query: 375 GGLFAIDRAYFFHIGAYDEE--MQVWGGENLEMSFR 408
GLF I R F +G + E M E++E+ R
Sbjct: 139 QGLF-IRRELFEELGGFPELPLM-----EDVELVRR 168
Score = 38.3 bits (90), Expect = 0.006
Identities = 36/102 (35%), Positives = 46/102 (45%), Gaps = 22/102 (21%)
Query: 93 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL-SVP 151
SI+I NEA L R + S+ +P EI++VD ST D VA S
Sbjct: 2 SIIIPTLNEA-ENLPRLLASLRRLNPLP--LEIIVVDGGST--------DGTVAIARSAG 50
Query: 152 TRVIRS-PGRVGLIKARLL--GARQAEGEILVFLDAHCECTL 190
VI S GR AR + GA A G+ L+FL H + L
Sbjct: 51 VVVISSPKGR-----ARQMNAGAAAARGDWLLFL--HADTRL 85
>gnl|CDD|234532 TIGR04283, glyco_like_mftF, transferase 2,
rSAM/selenodomain-associated. This enzyme may transfer
a nucleotide, or it sugar moiety, as part of a
biosynthetic pathway. Other proposed members of the
pathway include another transferase (TIGR04282), a
phosphoesterase, and a radical SAM enzyme (TIGR04167)
whose C-terminal domain (pfam12345) frequently contains
a selenocysteine [Unknown function, Enzymes of unknown
specificity].
Length = 220
Score = 44.4 bits (106), Expect = 6e-05
Identities = 37/127 (29%), Positives = 55/127 (43%), Gaps = 33/127 (25%)
Query: 201 SIVIVFHNEA--WSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAK 255
SI+I NEA LL + ++ + E+++VD S T E +S
Sbjct: 2 SIIIPVLNEAATLPELLADLQALPGDA------EVIVVDGGSTDGTVEIARSLGA----- 50
Query: 256 LSVPTRVIRSP-GRVGLIKARLL--GARQAEGEILVFLDAHCECTL--GWLENLVARVAE 310
+VI SP GR AR + GA A+G+IL+FL H + L +LE + +A
Sbjct: 51 -----KVIHSPKGR-----ARQMNAGAALAKGDILLFL--HADTRLPKDFLEAIRRALAP 98
Query: 311 DRTRVVC 317
C
Sbjct: 99 PGAVAGC 105
Score = 43.3 bits (103), Expect = 1e-04
Identities = 33/106 (31%), Positives = 48/106 (45%), Gaps = 31/106 (29%)
Query: 93 SIVIVFHNEA--WSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAK 147
SI+I NEA LL + ++ + E+++VD S T E +S
Sbjct: 2 SIIIPVLNEAATLPELLADLQALPGDA------EVIVVDGGSTDGTVEIARSLGA----- 50
Query: 148 LSVPTRVIRSP-GRVGLIKARLL--GARQAEGEILVFLDAHCECTL 190
+VI SP GR AR + GA A+G+IL+FL H + L
Sbjct: 51 -----KVIHSPKGR-----ARQMNAGAALAKGDILLFL--HADTRL 84
>gnl|CDD|133042 cd06420, GT2_Chondriotin_Pol_N, N-terminal domain of Chondroitin
polymerase functions as a GalNAc transferase.
Chondroitin polymerase is a two domain, bi-functional
protein. The N-terminal domain functions as a GalNAc
transferase. The bacterial chondroitin polymerase
catalyzes elongation of the chondroitin chain by
alternatively transferring the GlcUA and GalNAc moiety
from UDP-GlcUA and UDP-GalNAc to the non-reducing ends
of the chondroitin chain. The enzyme consists of
N-terminal and C-terminal domains in which the two
active sites catalyze the addition of GalNAc and GlcUA,
respectively. Chondroitin chains range from 40 to over
100 repeating units of the disaccharide. Sulfated
chondroitins are involved in the regulation of various
biological functions such as central nervous system
development, wound repair, infection, growth factor
signaling, and morphogenesis, in addition to its
conventional structural roles. In Caenorhabditis
elegans, chondroitin is an essential factor for the worm
to undergo cytokinesis and cell division. Chondroitin is
synthesized as proteoglycans, sulfated and secreted to
the cell surface or extracellular matrix.
Length = 182
Score = 43.7 bits (104), Expect = 7e-05
Identities = 45/221 (20%), Positives = 78/221 (35%), Gaps = 62/221 (28%)
Query: 202 IVIVFHNEAWS-ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 260
++I +N AL + SV+++S + E+++ DD ST E K ++E+ ++ +P
Sbjct: 1 LIITTYN--RPEALELVLKSVLNQS--ILPFEVIIADDGSTEE-TKELIEEFKSQFPIPI 55
Query: 261 RVIRSPGRVG--LIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCP 318
+ + G K R A+G+ L+F+D C P
Sbjct: 56 KHVWQED-EGFRKAKIRNKAIAAAKGDYLIFIDGDC----------------------IP 92
Query: 319 VIDIISDVTFAYVRSFELHWGAFNWELHFR-----WYTYGSSDAIIKRKDFTEPFKTPAM 373
D I+D H + GS R E +
Sbjct: 93 HPDFIAD--------------------HIELAEPGVFLSGS------RVLLNEKLTERGI 126
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGG 414
G + + + +DEE WGGE+ E+ R+ G
Sbjct: 127 RGCNMSFWKKDLLAVNGFDEEFTGWGGEDSELVARLLNSGI 167
Score = 35.6 bits (83), Expect = 0.034
Identities = 25/96 (26%), Positives = 48/96 (50%), Gaps = 9/96 (9%)
Query: 94 IVIVFHNEAWS-ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
++I +N AL + SV+++S + E+++ DD ST E K ++E+ ++ +P
Sbjct: 1 LIITTYN--RPEALELVLKSVLNQS--ILPFEVIIADDGSTEE-TKELIEEFKSQFPIPI 55
Query: 153 RVIRSPGRVG--LIKARLLGARQAEGEILVFLDAHC 186
+ + G K R A+G+ L+F+D C
Sbjct: 56 KHVWQED-EGFRKAKIRNKAIAAAKGDYLIFIDGDC 90
>gnl|CDD|217196 pfam02709, Glyco_transf_7C, N-terminal domain of
galactosyltransferase. This is the N-terminal domain of
a family of galactosyltransferases from a wide range of
Metazoa with three related galactosyltransferases
activities, all three of which are possessed by one
sequence in some cases. EC:2.4.1.90, N-acetyllactosamine
synthase; EC:2.4.1.38,
Beta-N-acetylglucosaminyl-glycopeptide beta-1,4-
galactosyltransferase; and EC:2.4.1.22 Lactose synthase.
Note that N-acetyllactosamine synthase is a component of
Lactose synthase along with alpha-lactalbumin, in the
absence of alpha-lactalbumin EC:2.4.1.90 is the
catalyzed reaction.
Length = 78
Score = 39.5 bits (93), Expect = 3e-04
Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH-VAHLFR 429
GG+ A + F + + WGGE+ ++ R+ G IE + H+ R
Sbjct: 20 FGGVLAFSKEDFLKVNGFSNNFWGWGGEDDDLYARLLLAGLKIERPKFAIGRYHMLR 76
>gnl|CDD|133027 cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins
are required for O-antigen biosynthesis. The rfbC gene
encodes a predicted protein of 1,276 amino acids, which
is required for O-antigen biosynthesis in Myxococcus
xanthus. It is a subfamily of Glycosyltransferase Family
GT2, which includes diverse families of glycosyl
transferases with a common GT-A type structural fold,
which has two tightly associated beta/alpha/beta domains
that tend to form a continuous central sheet of at least
eight beta-strands. These are enzymes that catalyze the
transfer of sugar moieties from activated donor
molecules to specific acceptor molecules, forming
glycosidic bonds.
Length = 202
Score = 41.4 bits (98), Expect = 5e-04
Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 3/94 (3%)
Query: 90 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 149
P SIV+ +N L + SV +++ + E+ + DDAST +K L +Y A+
Sbjct: 1 PLISIVMPVYNTPEKYLREAIESVRAQTYPNW--ELCIADDASTDPEVKRVLKKYAAQDP 58
Query: 150 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLD 183
+V+ G+ A A GE + LD
Sbjct: 59 R-IKVVFREENGGISAATNSALELATGEFVALLD 91
Score = 41.4 bits (98), Expect = 5e-04
Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 3/94 (3%)
Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 257
P SIV+ +N L + SV +++ + E+ + DDAST +K L +Y A+
Sbjct: 1 PLISIVMPVYNTPEKYLREAIESVRAQTYPNW--ELCIADDASTDPEVKRVLKKYAAQDP 58
Query: 258 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLD 291
+V+ G+ A A GE + LD
Sbjct: 59 R-IKVVFREENGGISAATNSALELATGEFVALLD 91
>gnl|CDD|222281 pfam13641, Glyco_tranf_2_3, Glycosyltransferase like family 2.
Members of this family of prokaryotic proteins include
putative glucosyltransferase, which are involved in
bacterial capsule biosynthesis.
Length = 229
Score = 39.6 bits (93), Expect = 0.002
Identities = 53/239 (22%), Positives = 80/239 (33%), Gaps = 35/239 (14%)
Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD--DASTREFLKSSLDEYVAK 255
P+ ++V+ NE L RT+ S I L+ I++VD D T + E A
Sbjct: 1 PRVAVVVPTWNED-KVLGRTLRS-ILAQDYPRLEVIVVVDGSDDETLDVA----RELAAA 54
Query: 256 LS-VPTRVIRSPGRVGLI-KARLL--GARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
V V+R P G KAR L R + +++V LDA V D
Sbjct: 55 YPDVRVHVVRRPRPPGPTGKARALNEALRAIKSDLVVLLDADS------------VVDPD 102
Query: 312 RTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDA--IIKRKDFTEPFK 369
R + P V F L+ Y A ++
Sbjct: 103 TLRRLLPFFLS-KGVGAVQGPVFVLNLR------TAVAPLYALEFALRHLRFMALRRALG 155
Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+AG R+ IG +D + E+ E+ R+ + G P + V L
Sbjct: 156 VAPLAGSGSLFRRSVLEEIGGFDPGFLLG--EDKELGLRLRRAGWRTAYVPGAAVYELS 212
Score = 29.6 bits (67), Expect = 3.3
Identities = 31/101 (30%), Positives = 45/101 (44%), Gaps = 12/101 (11%)
Query: 90 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD--DASTREFLKSSLDEYVAK 147
P+ ++V+ NE L RT+ S I L+ I++VD D T + E A
Sbjct: 1 PRVAVVVPTWNED-KVLGRTLRS-ILAQDYPRLEVIVVVDGSDDETLDVA----RELAAA 54
Query: 148 LS-VPTRVIRSPGRVGLI-KARLL--GARQAEGEILVFLDA 184
V V+R P G KAR L R + +++V LDA
Sbjct: 55 YPDVRVHVVRRPRPPGPTGKARALNEALRAIKSDLVVLLDA 95
>gnl|CDD|133031 cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked
glycosylation. UDP-glucose:dolichyl-phosphate
glucosyltransferase (DPG_synthase) is a
transmembrane-bound enzyme of the endoplasmic reticulum
involved in protein N-linked glycosylation. This enzyme
catalyzes the transfer of glucose from UDP-glucose to
dolichyl phosphate.
Length = 211
Score = 38.3 bits (90), Expect = 0.005
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 7/64 (10%)
Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
EI++VDD S T E + + K RV+ P G A G A G+ ++
Sbjct: 32 EIIVVDDGSKDGTAEVAR----KLARKNPALIRVLTLPKNRGKGGAVRAGMLAARGDYIL 87
Query: 181 FLDA 184
F DA
Sbjct: 88 FADA 91
Score = 38.3 bits (90), Expect = 0.005
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 7/64 (10%)
Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
EI++VDD S T E + + K RV+ P G A G A G+ ++
Sbjct: 32 EIIVVDDGSKDGTAEVAR----KLARKNPALIRVLTLPKNRGKGGAVRAGMLAARGDYIL 87
Query: 289 FLDA 292
F DA
Sbjct: 88 FADA 91
>gnl|CDD|214672 smart00458, RICIN, Ricin-type beta-trefoil. Carbohydrate-binding
domain formed from presumed gene triplication.
Length = 118
Score = 36.3 bits (84), Expect = 0.009
Identities = 19/87 (21%), Positives = 29/87 (33%), Gaps = 9/87 (10%)
Query: 515 GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
G IR T C+ +L C Q + + + D CL
Sbjct: 39 GAIRIKDTDLCLTAN------GNTGSTVTLYSCDGTN-DNQYWEVNKDGTIRNPDSGKCL 91
Query: 575 DVPEYENDISPRVRILACSGFNRQRWT 601
DV + + +V + CSG Q+W
Sbjct: 92 DV--KDGNTGTKVILWTCSGNPNQKWI 116
>gnl|CDD|133062 cd06442, DPM1_like, DPM1_like represents putative enzymes similar
to eukaryotic DPM1. Proteins similar to eukaryotic
DPM1, including enzymes from bacteria and archaea; DPM1
is the catalytic subunit of eukaryotic
dolichol-phosphate mannose (DPM) synthase. DPM synthase
is required for synthesis of the
glycosylphosphatidylinositol (GPI) anchor, N-glycan
precursor, protein O-mannose, and C-mannose. In higher
eukaryotes,the enzyme has three subunits, DPM1, DPM2 and
DPM3. DPM is synthesized from dolichol phosphate and
GDP-Man on the cytosolic surface of the ER membrane by
DPM synthase and then is flipped onto the luminal side
and used as a donor substrate. In lower eukaryotes, such
as Saccharomyces cerevisiae and Trypanosoma brucei, DPM
synthase consists of a single component (Dpm1p and
TbDpm1, respectively) that possesses one predicted
transmembrane region near the C terminus for anchoring
to the ER membrane. In contrast, the Dpm1 homologues of
higher eukaryotes, namely fission yeast, fungi, and
animals, have no transmembrane region, suggesting the
existence of adapter molecules for membrane anchoring.
This family also includes bacteria and archaea DPM1_like
enzymes. However, the enzyme structure and mechanism of
function are not well understood. This protein family
belongs to Glycosyltransferase 2 superfamily.
Length = 224
Score = 37.5 bits (88), Expect = 0.012
Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 8/64 (12%)
Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
EI++VDD S T E ++ +AK R+I PG+ GL A + G + A G+++V
Sbjct: 29 EIIVVDDNSPDGTAEIVRE-----LAKEYPRVRLIVRPGKRGLGSAYIEGFKAARGDVIV 83
Query: 181 FLDA 184
+DA
Sbjct: 84 VMDA 87
Score = 37.5 bits (88), Expect = 0.012
Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 8/64 (12%)
Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
EI++VDD S T E ++ +AK R+I PG+ GL A + G + A G+++V
Sbjct: 29 EIIVVDDNSPDGTAEIVRE-----LAKEYPRVRLIVRPGKRGLGSAYIEGFKAARGDVIV 83
Query: 289 FLDA 292
+DA
Sbjct: 84 VMDA 87
>gnl|CDD|133030 cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to
eukaryotic DPM1. A family of bacterial enzymes related
to eukaryotic DPM1; Although the mechanism of eukaryotic
enzyme is well studied, the mechanism of the bacterial
enzymes is not well understood. The eukaryotic DPM1 is
the catalytic subunit of eukaryotic Dolichol-phosphate
mannose (DPM) synthase. DPM synthase is required for
synthesis of the glycosylphosphatidylinositol (GPI)
anchor, N-glycan precursor, protein O-mannose, and
C-mannose. The enzyme has three subunits, DPM1, DPM2 and
DPM3. DPM is synthesized from dolichol phosphate and
GDP-Man on the cytosolic surface of the ER membrane by
DPM synthase and then is flipped onto the luminal side
and used as a donor substrate. This protein family
belongs to Glycosyltransferase 2 superfamily.
Length = 181
Score = 35.5 bits (83), Expect = 0.035
Identities = 26/96 (27%), Positives = 39/96 (40%), Gaps = 12/96 (12%)
Query: 94 IVIVFHNEAWS--ALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKL 148
IV+ +NE + L + +V+ EI+ VDD S T E L+
Sbjct: 1 IVVPVYNEEENLPELYERLKAVLESLGYDY--EIIFVDDGSTDRTLEILRELAARD---- 54
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
+VIR G A L G A G+ ++ +DA
Sbjct: 55 -PRVKVIRLSRNFGQQAALLAGLDHARGDAVITMDA 89
Score = 35.5 bits (83), Expect = 0.035
Identities = 26/96 (27%), Positives = 39/96 (40%), Gaps = 12/96 (12%)
Query: 202 IVIVFHNEAWS--ALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKL 256
IV+ +NE + L + +V+ EI+ VDD S T E L+
Sbjct: 1 IVVPVYNEEENLPELYERLKAVLESLGYDY--EIIFVDDGSTDRTLEILRELAARD---- 54
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
+VIR G A L G A G+ ++ +DA
Sbjct: 55 -PRVKVIRLSRNFGQQAALLAGLDHARGDAVITMDA 89
>gnl|CDD|133039 cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2
of unknown function. GT-2 includes diverse families of
glycosyltransferases with a common GT-A type structural
fold, which has two tightly associated beta/alpha/beta
domains that tend to form a continuous central sheet of
at least eight beta-strands. These are enzymes that
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds. Glycosyltransferases have been
classified into more than 90 distinct sequence based
families.
Length = 214
Score = 33.4 bits (77), Expect = 0.22
Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 7/63 (11%)
Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
E+++ DD S T E +K EY+ K +IR+ +G+ + + A+G+ +
Sbjct: 29 ELIISDDGSTDGTVEIIK----EYIDKDPFIIILIRNGKNLGVARNFESLLQAADGDYVF 84
Query: 181 FLD 183
F D
Sbjct: 85 FCD 87
Score = 33.4 bits (77), Expect = 0.22
Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 7/63 (11%)
Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
E+++ DD S T E +K EY+ K +IR+ +G+ + + A+G+ +
Sbjct: 29 ELIISDDGSTDGTVEIIK----EYIDKDPFIIILIRNGKNLGVARNFESLLQAADGDYVF 84
Query: 289 FLD 291
F D
Sbjct: 85 FCD 87
>gnl|CDD|215385 PLN02726, PLN02726, dolichyl-phosphate beta-D-mannosyltransferase.
Length = 243
Score = 32.0 bits (73), Expect = 0.69
Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 6/64 (9%)
Query: 124 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
EI++VDD S T++ +K Y + PG++GL A + G + A G+ +V
Sbjct: 42 EIIVVDDGSPDGTQDVVKQLQKVYGEDR---ILLRPRPGKLGLGTAYIHGLKHASGDFVV 98
Query: 181 FLDA 184
+DA
Sbjct: 99 IMDA 102
Score = 32.0 bits (73), Expect = 0.69
Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 6/64 (9%)
Query: 232 EILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 288
EI++VDD S T++ +K Y + PG++GL A + G + A G+ +V
Sbjct: 42 EIIVVDDGSPDGTQDVVKQLQKVYGEDR---ILLRPRPGKLGLGTAYIHGLKHASGDFVV 98
Query: 289 FLDA 292
+DA
Sbjct: 99 IMDA 102
>gnl|CDD|133058 cd06436, GlcNAc-1-P_transferase, N-acetyl-glucosamine transferase
is involved in the synthesis of
Poly-beta-1,6-N-acetyl-D-glucosamine.
N-acetyl-glucosamine transferase is responsible for the
synthesis of bacteria
Poly-beta-1,6-N-acetyl-D-glucosamine (PGA).
Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer
that serves as an adhesion for the maintenance of
biofilm structural stability in diverse eubacteria.
N-acetyl-glucosamine transferase is the product of gene
pgaC. Genetic analysis indicated that all four genes of
the pgaABCD locus were required for the PGA production,
pgaC being a glycosyltransferase.
Length = 191
Score = 31.6 bits (72), Expect = 0.74
Identities = 28/114 (24%), Positives = 49/114 (42%), Gaps = 23/114 (20%)
Query: 208 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 267
NE + + RT+ S + R+ + L +L++DDAS D+ + R+ +
Sbjct: 7 NEE-AVIQRTLAS-LLRNKPNFL--VLVIDDASD--------DDTAGIV----RLAITDS 50
Query: 268 RVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVID 321
RV L++ L AR +G+ L + ++ D RV+ VID
Sbjct: 51 RVHLLRRHLPNARTGKGDALNA-------AYDQIRQILIEEGADPERVIIAVID 97
Score = 28.5 bits (64), Expect = 6.9
Identities = 26/107 (24%), Positives = 47/107 (43%), Gaps = 17/107 (15%)
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NE + + RT+ S + R+ + L +L++DDAS D+ + R+ +
Sbjct: 7 NEE-AVIQRTLAS-LLRNKPNFL--VLVIDDASD--------DDTAGIV----RLAITDS 50
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVF 206
RV L++ L AR +G+ L ++ E P+ I+ V
Sbjct: 51 RVHLLRRHLPNARTGKGDALNAAYDQI-RQILIEEGADPERVIIAVI 96
>gnl|CDD|212674 cd10232, ScSsz1p_like_NBD, Nucleotide-binding domain of
Saccharmomyces cerevisiae Ssz1pp and similar proteins.
Saccharomyces cerevisiae Ssz1p (also known as
/Pdr13p/YHR064C) belongs to the heat shock protein 70
(HSP70) family of chaperones that assist in protein
folding and assembly and can direct incompetent "client"
proteins towards degradation. Typically, HSP70s have a
nucleotide-binding domain (NBD) and a substrate-binding
domain (SBD). The nucleotide sits in a deep cleft formed
between the two lobes of the NBD. The two subdomains of
each lobe change conformation between ATP-bound,
ADP-bound, and nucleotide-free states. ATP binding opens
up the substrate-binding site; substrate-binding
increases the rate of ATP hydrolysis. HSP70 chaperone
activity is regulated by various co-chaperones: J-domain
proteins and nucleotide exchange factors (NEFs). Some
family members are not chaperones but rather, function
as NEFs for their Hsp70 partners, while other family
members function as both chaperones and NEFs. Ssz1 does
not function as a chaperone; it facilitates the
interaction between the HSP70 Ssb protein and its
partner J-domain protein Zuo1 (also known as zuotin) on
the ribosome. Ssz1 is found in a stable heterodimer
(called RAC, ribosome associated complex) with Zuo1.
Zuo1 can only stimulate the ATPase activity of Ssb, when
it is in complex with Ssz1. Ssz1 binds ATP but neither
nucleotide-binding, hydrolysis, or its SBD, is needed
for its in vivo function.
Length = 386
Score = 32.4 bits (74), Expect = 0.76
Identities = 29/122 (23%), Positives = 44/122 (36%), Gaps = 14/122 (11%)
Query: 221 VISRSPRSMLKEILLVDDASTREF--LKSSLDEYVAK------LSVPTRVIRSPGRVGLI 272
V + + IL V + + R LK + ++++ K LSVPT L+
Sbjct: 99 VQEKEEPVPKETILTVHEVTVRFLRRLKEAAEDFLGKKVAGAVLSVPTW-FSDEQTEALV 157
Query: 273 KARLLGARQAEGEILVFLDAHCECTLGWLENL-VARVAEDRTRVVCPVIDIISDVTFAYV 331
KA A A +L + L + A DR VV +DV+ V
Sbjct: 158 KA----AEAAGLPVLQLIPEPAAALLAYDAGEPTEDEALDRNVVVADFGGTRTDVSVIAV 213
Query: 332 RS 333
R
Sbjct: 214 RG 215
>gnl|CDD|216044 pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain.
Length = 124
Score = 30.6 bits (69), Expect = 0.87
Identities = 13/33 (39%), Positives = 15/33 (45%), Gaps = 1/33 (3%)
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
CLDVP D P V + C G Q WT +
Sbjct: 14 CLDVPGGSADGGP-VGLYPCHGGGNQLWTLTGD 45
>gnl|CDD|240336 PTZ00260, PTZ00260, dolichyl-phosphate beta-glucosyltransferase;
Provisional.
Length = 333
Score = 32.0 bits (73), Expect = 0.95
Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 11/101 (10%)
Query: 93 SIVIVFHNEAWSALLRTVHSVI------SRSPRSMLKEILLVDDAS---TREFLKSSLDE 143
SIVI +NE L + + I SR EI++V+D S T + K +
Sbjct: 73 SIVIPAYNEE-DRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLKVAKDFWRQ 131
Query: 144 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
+ ++ R++ G A +G + G+ ++ +DA
Sbjct: 132 NINP-NIDIRLLSLLRNKGKGGAVRIGMLASRGKYILMVDA 171
Score = 32.0 bits (73), Expect = 0.95
Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 11/101 (10%)
Query: 201 SIVIVFHNEAWSALLRTVHSVI------SRSPRSMLKEILLVDDAS---TREFLKSSLDE 251
SIVI +NE L + + I SR EI++V+D S T + K +
Sbjct: 73 SIVIPAYNEE-DRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLKVAKDFWRQ 131
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
+ ++ R++ G A +G + G+ ++ +DA
Sbjct: 132 NINP-NIDIRLLSLLRNKGKGGAVRIGMLASRGKYILMVDA 171
>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
Provisional.
Length = 522
Score = 32.0 bits (73), Expect = 1.1
Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 464 FKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPH 504
F E Q +K ++ R +R+ +K H+++ Y + W H
Sbjct: 55 FNKIDEVYYQNEKLNIKRRESVREAMK-HAWEGYKEYAWGH 94
>gnl|CDD|223207 COG0129, IlvD, Dihydroxyacid dehydratase/phosphogluconate
dehydratase [Amino acid transport and metabolism /
Carbohydrate transport and metabolism].
Length = 575
Score = 31.0 bits (71), Expect = 1.9
Identities = 26/85 (30%), Positives = 36/85 (42%), Gaps = 24/85 (28%)
Query: 373 MAGGLFAI----DRAYFFHIGAYDEEMQVWG---GENLEMSFRVWQCGGSIEIAPCSHVA 425
AGG+ A+ +A H + + V G GENLE EI +
Sbjct: 328 RAGGVPAVMKELLKAGLLH----GDVLTVTGKTLGENLEG----------AEIRD-TPDR 372
Query: 426 HLFRKAS-PYSFPGGVSEVLYGNLA 449
+ R P+S GG++ VL GNLA
Sbjct: 373 DVIRPLDNPFSADGGLA-VLKGNLA 396
>gnl|CDD|133038 cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in
exopolysaccharide amylovora biosynthesis. AmsE is a
glycosyltransferase involved in exopolysaccharide
amylovora biosynthesis in Erwinia amylovora. Amylovara
is one of the three exopolysaccharide produced by E.
amylovora. Amylovara-deficient mutants are
non-pathogenic. It is a subfamily of Glycosyltransferase
Family GT2, which includes diverse families of
glycosyltransferases with a common GT-A type structural
fold, which has two tightly associated beta/alpha/beta
domains that tend to form a continuous central sheet of
at least eight beta-strands. These are enzymes that
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds.
Length = 201
Score = 29.6 bits (67), Expect = 3.1
Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 4/88 (4%)
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
V+ E L + S++ ++ E++LV D + L L+E+ KL P +V+
Sbjct: 6 VYIKEKPEFLREALESILKQT--LPPDEVVLVKDGPVTQSLNEVLEEFKRKL--PLKVVP 61
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDA 184
GL KA G + + + +D
Sbjct: 62 LEKNRGLGKALNEGLKHCTYDWVARMDT 89
Score = 29.6 bits (67), Expect = 3.1
Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 4/88 (4%)
Query: 205 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 264
V+ E L + S++ ++ E++LV D + L L+E+ KL P +V+
Sbjct: 6 VYIKEKPEFLREALESILKQT--LPPDEVVLVKDGPVTQSLNEVLEEFKRKL--PLKVVP 61
Query: 265 SPGRVGLIKARLLGARQAEGEILVFLDA 292
GL KA G + + + +D
Sbjct: 62 LEKNRGLGKALNEGLKHCTYDWVARMDT 89
>gnl|CDD|167649 PRK03963, PRK03963, V-type ATP synthase subunit E; Provisional.
Length = 198
Score = 29.7 bits (67), Expect = 3.2
Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 2/101 (1%)
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGR-VGLIK 273
L +IS ++ + + + + E LK+ E V +L V+RS R + LI
Sbjct: 76 LAVQEELISEVLEAVRERLAELPEDEYFETLKALTKEAVEELGEDKVVVRSNERTLKLID 135
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWL-ENLVARVAEDRT 313
+RL R G++ + L E G + E + D T
Sbjct: 136 SRLEEIRDELGDVEIELGEPIETIGGVIVETKDGTIRVDNT 176
>gnl|CDD|226577 COG4092, COG4092, Predicted glycosyltransferase involved in capsule
biosynthesis [Cell envelope biogenesis, outer membrane].
Length = 346
Score = 29.2 bits (65), Expect = 6.2
Identities = 28/147 (19%), Positives = 51/147 (34%), Gaps = 23/147 (15%)
Query: 273 KARLLGA----RQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR------VVCPVIDI 322
GA + E +++FLD C + +++ + R +V PV +
Sbjct: 80 ICANNGADYSHEKCESNLVLFLDVDCFGSSDNFAKMLSIATIKKMRTNIDAPLVLPVYHL 139
Query: 323 ISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDR 382
T + ++ A +E + + + F P +F I+R
Sbjct: 140 NKADTQVFFDVEDMFLDAMIFE-----------SPLAEFRKEDNFFIAPYT--NIFLINR 186
Query: 383 AYFFHIGAYDEEMQVWGGENLEMSFRV 409
F G YDE + G E+ E R+
Sbjct: 187 RMFSLTGGYDERFRGHGSEDFEFLTRL 213
>gnl|CDD|165267 PHA02961, PHA02961, hypothetical protein; Provisional.
Length = 658
Score = 29.5 bits (66), Expect = 7.1
Identities = 19/64 (29%), Positives = 28/64 (43%), Gaps = 9/64 (14%)
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHF--RWYTYGSSDAIIKRKDF 364
+ ED T+ +C IDI Y + EL AF W L++ + D I +KD
Sbjct: 288 NITEDETKFICKHIDI-------YKKKSELFVNAFRWHLYYCDNMHPQKIFDPITCKKDK 340
Query: 365 TEPF 368
E +
Sbjct: 341 HEKY 344
>gnl|CDD|217382 pfam03135, CagE_TrbE_VirB, CagE, TrbE, VirB family, component of
type IV transporter system. This family includes the
Helicobacter pylori protein CagE, which together with
other proteins from the cag pathogenicity island (PAI),
encodes a type IV transporter secretion system. The
precise role of CagE is not known, but studies in animal
models have shown that it is essential for pathogenesis
in Helicobacter pylori induced gastritis and peptic
ulceration. Indeed, the expression of the cag PAI has
been shown to be essential for stimulating human gastric
epithelial cell apoptosis in vitro. Similar type IV
transport systems are also found in other bacteria. This
family includes the TrbE and VirB proteins from the
respective trb and Vir conjugal transfer systems in
Agrobacterium tumefaciens. Homologues of VirB proteins
from other species are also members of this family, e.g.
VirB from Brucella suis.
Length = 205
Score = 28.8 bits (65), Expect = 7.3
Identities = 20/74 (27%), Positives = 27/74 (36%), Gaps = 7/74 (9%)
Query: 136 FLKSSLDEYVAKLSVP----TRVIRSPG---RVGLIKARLLGARQAEGEILVFLDAHCEC 188
LDEY+A VP +IR G G+I + + G + V LD E
Sbjct: 34 PRGRLLDEYIATSRVPFGKDGFIIRHAGNEKFAGIISIKEYPDKTRSGALDVLLDLDFEF 93
Query: 189 TLVFNEEFLPKSSI 202
L + F K
Sbjct: 94 ILTQSFSFKSKQKA 107
>gnl|CDD|238148 cd00244, AlgLyase, Alginate Lyase A1-III; enzymatically
depolymerizes alginate, a complex copolymer of
beta-D-mannuronate and alpha-L-guluronate, by cleaving
the beta-(1,4) glycosidic bond.
Length = 339
Score = 28.7 bits (64), Expect = 9.3
Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 1/37 (2%)
Query: 341 FNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGL 377
F W L YG + + + K T PFK + G +
Sbjct: 300 FAW-LEPYCALYGCAPDVRELKFMTVPFKDFRLGGDV 335
>gnl|CDD|211965 TIGR04242, nodulat_NodC, chitooligosaccharide synthase NodC.
Members of this family are NodC, an
N-acetylglucosaminyltransferase involved in the
production of nodulation factors through which rhizobia
establish symbioses with leguminous plants.
Length = 395
Score = 28.9 bits (65), Expect = 9.4
Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 6/110 (5%)
Query: 77 PKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST-RE 135
P + + LP +++ NE L + S+ ++ L+ + +VDD ST R+
Sbjct: 35 PATSSDALPSDPLPSVDVIVPCFNEDPRTLSECLASIAAQDYAGKLR-VYVVDDGSTNRD 93
Query: 136 FLKSSLDEYVAKLSVPT-RVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
L D Y S P I P VG KA++ R++ G++++ +D+
Sbjct: 94 ALVPVHDAYA---SDPRFNFILLPKNVGKRKAQIAAIRRSSGDLVLNVDS 140
>gnl|CDD|224508 COG1592, COG1592, Rubrerythrin [Energy production and conversion].
Length = 166
Score = 28.1 bits (63), Expect = 9.8
Identities = 14/59 (23%), Positives = 19/59 (32%), Gaps = 3/59 (5%)
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
H + + V NL E E Y F AE++ K+A RS
Sbjct: 52 HAKNHLKLLGKLLL---VLGDTRENLEEAIEGETYEITEMYPVFAEVAEEEGFKEAARS 107
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.324 0.138 0.426
Gapped
Lambda K H
0.267 0.0634 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,544,805
Number of extensions: 3135760
Number of successful extensions: 3194
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3141
Number of HSP's successfully gapped: 81
Length of query: 605
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 502
Effective length of database: 6,369,140
Effective search space: 3197308280
Effective search space used: 3197308280
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 62 (27.9 bits)