RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780918|ref|YP_003065331.1| glycosyl transferase family protein [Candidatus Liberibacter asiaticus str. psy62] (623 letters) >gnl|CDD|133027 cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Length = 202 Score = 336 bits (864), Expect = 1e-92 Identities = 124/204 (60%), Positives = 153/204 (75%), Gaps = 2/204 (0%) Query: 67 PLISVIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRI 126 PLIS++MPVY +++L AIESVR+Q Y +WELCIA+D S D E +LKKYA D RI Sbjct: 1 PLISIVMPVYNTPEKYLREAIESVRAQTYPNWELCIADDASTDPEVKRVLKKYAAQDPRI 60 Query: 127 KVVFRAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIYSD 186 KVVFR +NG ISAA+NSA +LAT E++ALLDHDD L P ALY V A+N +P+A++IYSD Sbjct: 61 KVVFREENGGISAATNSALELATGEFVALLDHDDELAPHALYEVVKALNEHPDADLIYSD 120 Query: 187 EDKINENQIRSGPYFKYDFNPELFHVHNMITHLGVYRTETFKKIGGFREKFEGAQDYDLV 246 EDKI+E RS P+FK D++P+L N I HL VYR +++GGFRE FEGAQDYDLV Sbjct: 121 EDKIDEGGKRSEPFFKPDWSPDLLLSQNYIGHLLVYRRSLVRQVGGFREGFEGAQDYDLV 180 Query: 247 LRFLENIDLSQIIHIPRVLYHWRM 270 LR E+ D +I HIPRVLYHWR Sbjct: 181 LRVSEHTD--RIAHIPRVLYHWRA 202 Score = 53.4 bits (129), Expect = 2e-07 Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 7/99 (7%) Query: 324 PLVSIIIPTYN-HHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPN 382 PL+SI++P YN L+ +ES+ TY ++E+ I D+ S D + L+K + P Sbjct: 1 PLISIVMPVYNTPEKYLREAIESV-RAQTYPNWELCIADDASTDPEVKRVLKKYAAQDPR 59 Query: 383 LRVI--TDNTHPFNYSRINNNATLHAKGQYFCFLNNDTE 419 ++V+ +N S N+A A G++ L++D E Sbjct: 60 IKVVFREENGGI---SAATNSALELATGEFVALLDHDDE 95 >gnl|CDD|133055 cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Length = 202 Score = 127 bits (322), Expect = 9e-30 Identities = 67/209 (32%), Positives = 95/209 (45%), Gaps = 8/209 (3%) Query: 70 SVIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKVV 129 S+I P Y + E LE I+SV SQ Y + E + + S D TV ++KKY D + Sbjct: 1 SIITPTYN-QAETLEETIDSVLSQTYPNIEYIVIDGGSTD-GTVDIIKKYE--DKITYWI 56 Query: 130 FRAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIYSDEDK 189 G I A N LAT + + L+ DD L P AL V A +P +++Y D Sbjct: 57 SEPDKG-IYDAMNKGIALATGDIIGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGDVLL 115 Query: 190 INENQIRSGPYFKYDFNPELFHVHNMITHLGV-YRTETFKKIGGFREKFEGAQDYDLVLR 248 ++EN G F + I H +R F+K GGF E + A DYDL+LR Sbjct: 116 VDENGRVIGRRRPPPFLDKFLLYGMPICHQATFFRRSLFEKYGGFDESYRIAADYDLLLR 175 Query: 249 FLENIDLSQIIHIPRVLYHWRMHDNSTAQ 277 L ++P VL +R+ S+ Sbjct: 176 LLLA--GKIFKYLPEVLAAFRLGGVSSTS 202 Score = 61.8 bits (151), Expect = 5e-10 Identities = 56/226 (24%), Positives = 86/226 (38%), Gaps = 40/226 (17%) Query: 327 SIIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRVI 386 SII PTYN L+ ++S+ +T Y + E I+ID S D T ++K + K I Sbjct: 1 SIITPTYNQAETLEETIDSVLSQT-YPNIEYIVIDGGSTDG-TVDIIKKYEDK--ITYWI 56 Query: 387 T--DNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAVGAR 444 + D Y + N A G FLN+D ++ G L+ + A P+V V Sbjct: 57 SEPDKGI---YDAM-NKGIALATGDIIGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGD 112 Query: 445 LWYRRKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTHSISAVTG 504 + + G ++G +K + + QA Sbjct: 113 VLLVDE-----------NGRVIGRRRPPPFLDKFLLYGMPICH-QATFFR---------- 150 Query: 505 ACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNVWTPH 550 + F GGFDE + +D DL LR+L AG + P Sbjct: 151 ------RSLFEKYGGFDESYR--IAADYDLLLRLLLAGKIFKYLPE 188 >gnl|CDD|133029 cd04186, GT_2_like_c, Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Length = 166 Score = 115 bits (291), Expect = 3e-26 Identities = 43/132 (32%), Positives = 64/132 (48%), Gaps = 10/132 (7%) Query: 328 IIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRVIT 387 III YN LK CL+S+ TY FEVI++DN S D ++ +++ +P +R+I Sbjct: 1 IIIVNYNSLEYLKACLDSLL-AQTYPDFEVIVVDNASTD----GSVELLRELFPEVRLIR 55 Query: 388 DNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAVGARLW- 446 N + NN AKG Y LN DT V G L + P VG VG ++ Sbjct: 56 -NGENLGFGAGNNQGIREAKGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPKVSG 114 Query: 447 ---YRRKKLWKR 455 R+++++ Sbjct: 115 AFLLVRREVFEE 126 Score = 68.4 bits (168), Expect = 6e-12 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%) Query: 502 VTGACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNVWTPHADLYHDE 557 V+GA +++ ++ F VGGFDE + D+DLCLR AGYR ++ P A +YH Sbjct: 112 VSGAFLLVRREVFEEVGGFDEDFFLY-YEDVDLCLRARLAGYRVLYVPQAVIYHHG 166 Score = 61.8 bits (151), Expect = 5e-10 Identities = 44/186 (23%), Positives = 83/186 (44%), Gaps = 46/186 (24%) Query: 71 VIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKVVF 130 +I+ Y E+L+ ++S+ +Q Y +E+ + ++ S D +V LL++ ++++ Sbjct: 1 IIIVNYN-SLEYLKACLDSLLAQTYPDFEVIVVDNASTD-GSVELLRELF---PEVRLIR 55 Query: 131 RAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIYSDEDKI 190 +N A +N + A +++ LL+ D ++ P AL + DA +P+ I+ Sbjct: 56 NGENLGFGAGNNQGIREAKGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIV------- 108 Query: 191 NENQIRSGPYFKYDFNPELFHVHNMITHLG---VYRTETFKKIGGFREKF----EGAQDY 243 GP ++ G + R E F+++GGF E F E D Sbjct: 109 -------GP---------------KVS--GAFLLVRREVFEEVGGFDEDFFLYYE---DV 141 Query: 244 DLVLRF 249 DL LR Sbjct: 142 DLCLRA 147 >gnl|CDD|144210 pfam00535, Glycos_transf_2, Glycosyl transferase family 2. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids. Length = 168 Score = 105 bits (263), Expect = 5e-23 Identities = 46/170 (27%), Positives = 81/170 (47%), Gaps = 10/170 (5%) Query: 70 SVIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKVV 129 SVI+P Y ++++LE +ES+ +Q Y ++E+ + +D S D TV + ++YA D RI+V+ Sbjct: 1 SVIIPTYN-EEKYLEECLESLLNQTYPNFEIIVVDDGSTD-GTVEIAEEYAKKDPRIRVI 58 Query: 130 FRAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIYS---- 185 +N +AA N+ +LAT +++ LD DD + P L + + + N +I S Sbjct: 59 RLEENLGKAAARNAGLKLATGDYILFLDADDEVAPDWLEKLVELLEKNGADIVIGSRVVI 118 Query: 186 ----DEDKINENQIRSGPYFKYDFNPELFHVHNMITHLGVYRTETFKKIG 231 D K V +I +YR E +++ Sbjct: 119 FGETRLDGRALRFELLLLLGKLGDRSLGLKVLFLIGSNALYRREALEELL 168 Score = 91.0 bits (226), Expect = 8e-19 Identities = 42/192 (21%), Positives = 72/192 (37%), Gaps = 24/192 (12%) Query: 327 SIIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRVI 386 S+IIPTYN L+ CLES+ TY +FE+I++D+ S D T ++ KK P +RVI Sbjct: 1 SVIIPTYNEEKYLEECLESLL-NQTYPNFEIIVVDDGSTDG-TVEIAEEYAKKDPRIRVI 58 Query: 387 TDNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAVGARLW 446 + N A G Y FL+ D EV WL +++ + + V Sbjct: 59 R-LEENLGKAAARNAGLKLATGDYILFLDADDEVAPD-WLEKLVELLEKNGADIV----- 111 Query: 447 YRRKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTHSISAVTGAC 506 G + I + + + + + + G+ Sbjct: 112 ---------------IGSRVVIFGETRLDGRALRFELLLLLGKLGDRSLGLKVLFLIGSN 156 Query: 507 MVMSKKCFMHVG 518 + ++ + Sbjct: 157 ALYRREALEELL 168 >gnl|CDD|132997 cd00761, Glyco_tranf_GTA_type, Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it also includes families GT-43, GT-6, GT-8, GT13 and GT-7; which are evolutionarily related to GT-2 and share structure similarities. Length = 156 Score = 94.1 bits (234), Expect = 1e-19 Identities = 51/183 (27%), Positives = 87/183 (47%), Gaps = 34/183 (18%) Query: 71 VIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKVVF 130 VI+P Y ++ +LE +ES+ +Q Y ++E+ + +D S D T+ +L++YA D R+ V Sbjct: 1 VIIPAYN-EEPYLERCLESLLAQTYPNFEVIVVDDGSTD-GTLEILEEYAKKDPRVIRVI 58 Query: 131 RAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIYSDEDKI 190 +N ++AA N+ + A E++ LD DDLL P L + + +P A+ + Sbjct: 59 NEENQGLAAARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAVGG----- 113 Query: 191 NENQIRSGPYFKYDFNPELFHVHNMITHLGVYRTETFKKIGGFREKFE-GAQDYDLVLRF 249 N++ +R E ++IGGF E G +D D +LR Sbjct: 114 ---------------------PGNLL-----FRRELLEEIGGFDEALLSGEEDDDFLLRL 147 Query: 250 LEN 252 L Sbjct: 148 LRG 150 Score = 77.9 bits (192), Expect = 8e-15 Identities = 39/127 (30%), Positives = 64/127 (50%), Gaps = 5/127 (3%) Query: 328 IIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRVIT 387 +IIP YN L+ CLES+ TY +FEVI++D+ S D T L++ KK P + + Sbjct: 1 VIIPAYNEEPYLERCLESLL-AQTYPNFEVIVVDDGSTD-GTLEILEEYAKKDPRVIRVI 58 Query: 388 DNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAVG--ARL 445 + + + N A+G+Y FL+ D ++ + + + P+ AVG L Sbjct: 59 NEENQGLAAAR-NAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAVGGPGNL 117 Query: 446 WYRRKKL 452 +RR+ L Sbjct: 118 LFRRELL 124 >gnl|CDD|31409 COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]. Length = 305 Score = 90.6 bits (224), Expect = 1e-18 Identities = 64/244 (26%), Positives = 110/244 (45%), Gaps = 25/244 (10%) Query: 324 PLVSIIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLS-DDSKTFLYLQKIQKKYPN 382 P +SIII TYN L CL S+ TY ++++DN S D S L + +PN Sbjct: 3 PKISIIIVTYNRGEDLVECLASLA-AQTYPDDVIVVVDNGSTDGSLEALK----ARFFPN 57 Query: 383 LRVITDNTHPFNYSRINNNATLHAKGQ---YFCFLNNDTEVINGQWLSEMM-GIASQPQV 438 +R+I + + ++ N +A + Y LN DT V L E++ P Sbjct: 58 VRLIENGENLG-FAGGFNRGIKYALAKGDDYVLLLNPDTVVEPD-LLEELLKAAEEDPAA 115 Query: 439 GAVGARLWYRRKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTHS 498 G VG + + L+ + + G+ G + + Sbjct: 116 GVVGPLIRNYDESLYIDRRGGESDGLTGGWRASPLLEIAPDLSSYLEV------------ 163 Query: 499 ISAVTGACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNVWTPHADLYHDES 558 +++++GAC+++ ++ F VGGFDE+ + + D+DLCLR +AGY+ + P A +YH Sbjct: 164 VASLSGACLLIRREAFEKVGGFDER-FFIYYEDVDLCLRARKAGYKIYYVPDAIIYHKIG 222 Query: 559 RTRK 562 ++ Sbjct: 223 SSKG 226 Score = 55.1 bits (132), Expect = 5e-08 Identities = 45/205 (21%), Positives = 86/205 (41%), Gaps = 25/205 (12%) Query: 65 NKPLISVIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDS 124 + P IS+I+ Y + E L + S+ +Q Y + + ++ S D ++ LK Sbjct: 1 SMPKISIIIVTY-NRGEDLVECLASLAAQTYPDDVIVVVDNGSTD-GSLEALKARFF--P 56 Query: 125 RIKVVFRAKNGHISAASNSAAQLATSE---WLALLDHDDLLHPTALYYVADAINNNPNAE 181 ++++ +N + N + A ++ ++ LL+ D ++ P L + A +P A Sbjct: 57 NVRLIENGENLGFAGGFNRGIKYALAKGDDYVLLLNPDTVVEPDLLEELLKAAEEDPAAG 116 Query: 182 II---------------YSDEDKINENQIRSGPYFKY--DFNPELFHVHNMITHLGVYRT 224 ++ E R+ P + D + L V ++ + R Sbjct: 117 VVGPLIRNYDESLYIDRRGGESDGLTGGWRASPLLEIAPDLSSYLEVVASLSGACLLIRR 176 Query: 225 ETFKKIGGFREKFEG-AQDYDLVLR 248 E F+K+GGF E+F +D DL LR Sbjct: 177 EAFEKVGGFDERFFIYYEDVDLCLR 201 >gnl|CDD|30811 COG0463, WcaA, Glycosyltransferases involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]. Length = 291 Score = 80.6 bits (196), Expect = 1e-15 Identities = 50/277 (18%), Positives = 97/277 (35%), Gaps = 3/277 (1%) Query: 65 NKPLISVIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDS 124 P +SV++P Y ++E+L A+ES+ +Q Y +E+ + +D S D T + +Y D Sbjct: 1 MMPKVSVVIPTYN-EEEYLPEALESLLNQTYKDFEIIVVDDGSTD-GTTEIAIEYGAKDV 58 Query: 125 RIKVVFRAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIY 184 R+ + +NG + AA N+ + A +++ LD DD HP L + A + + Sbjct: 59 RVIRLINERNGGLGAARNAGLEYARGDYIVFLDADD-QHPPELIPLVAAGGDGDYIARLD 117 Query: 185 SDEDKINENQIRSGPYFKYDFNPELFHVHNMITHLGVYRTETFKKIGGFREKFEGAQDYD 244 +D ++ S + + L + + + F E Sbjct: 118 DRDDIWLPRKLLSKLVKLLGNRLLGVLIPDGFGDLRLLVRDAVDGLRAFLEGRSRFLRLL 177 Query: 245 LVLRFLENIDLSQIIHIPRVLYHWRMHDNSTAQKIGNKNYAGKAGERALNEHFQRTGIAA 304 L L ++ + + + R+L + ++G K L+ A Sbjct: 178 LRKLVLIRREVLEYLLLYRLLGASKRVLLGKLLRLGLLLLDSKLKAVPLDLLLLLGLFIA 237 Query: 305 KAVFDGAQYRTHYMIPNPPPLVSIIIPTYNHHHLLKI 341 L P L+ + Sbjct: 238 LLSILLLLLLLLLRRLLGANLSEKGKPLLLIKLLILL 274 Score = 57.8 bits (137), Expect = 9e-09 Identities = 41/151 (27%), Positives = 59/151 (39%), Gaps = 4/151 (2%) Query: 324 PLVSIIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNL 383 P VS++IPTYN L LES+ TY FE+I++D+ S D T + K + Sbjct: 3 PKVSVVIPTYNEEEYLPEALESLL-NQTYKDFEIIVVDDGSTD-GTTEIAIEYGAKDVRV 60 Query: 384 RVITDNTHPFNYSRINNNATLHAKGQYFCFLNND-TEVINGQWLSEMMGIASQPQVGAVG 442 + N N +A+G Y FL+ D L G Sbjct: 61 IRLI-NERNGGLGAARNAGLEYARGDYIVFLDADDQHPPELIPLVAAGGDGDYIARLDDR 119 Query: 443 ARLWYRRKKLWKRSKRLQHGGVIMGINNIAG 473 +W RK L K K L + + + I + G Sbjct: 120 DDIWLPRKLLSKLVKLLGNRLLGVLIPDGFG 150 >gnl|CDD|31408 COG1215, COG1215, Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]. Length = 439 Score = 75.4 bits (184), Expect = 4e-14 Identities = 60/239 (25%), Positives = 91/239 (38%), Gaps = 33/239 (13%) Query: 319 IPNPPPLVSIIIPTYN-HHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQ 377 P VS+IIP YN +L+ LES+ Y +EVI++D+ S D +T+ L+++ Sbjct: 49 ADKLLPKVSVIIPAYNEEPEVLEETLESLL-SQDYPRYEVIVVDDGSTD-ETYEILEELG 106 Query: 378 KKYPNLRVITDNTHPFNYSRIN-NNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQP 436 +Y + NN AKG L+ DT V L E++ P Sbjct: 107 AEYGPNFRVIYPEKKNGGKAGALNNGLKRAKGDVVVILDADT-VPEPDALRELVSPFEDP 165 Query: 437 QVGAVGARLWYRRKK----LWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFA 492 VGAV R + L R + +++ A Sbjct: 166 PVGAVVGTPRIRNRPDPSNLLGRIQAIEYLSAFYFRLRAASKGG---------------- 209 Query: 493 MHFTHSISAVTGACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNVWTPHA 551 IS ++G+ + VGG+ E + D DL LR+ GYR V+ P A Sbjct: 210 -----LISFLSGSSSAFRRSALEEVGGWLEDT---ITEDADLTLRLHLRGYRVVYVPEA 260 Score = 63.4 bits (153), Expect = 2e-10 Identities = 48/222 (21%), Positives = 85/222 (38%), Gaps = 19/222 (8%) Query: 42 SSLFHDTSPQFTKKILQNISQWQNKPLISVIMPVYKIKKEWLEMAIESVRSQIYSHWELC 101 P + + P +SVI+P Y + E LE +ES+ SQ Y +E+ Sbjct: 29 LGYLLLVLPLSRPRKKLPKDADKLLPKVSVIIPAYNEEPEVLEETLESLLSQDYPRYEVI 88 Query: 102 IAEDCSGDIETVSLLKKYANMDSRIKVVFR--AKNGHISAASNSAAQLATSEWLALLDHD 159 + +D S D ET +L++ V KNG + A N+ + A + + +LD D Sbjct: 89 VVDDGSTD-ETYEILEELGAEYGPNFRVIYPEKKNGGKAGALNNGLKRAKGDVVVILDAD 147 Query: 160 DLLHPTALYYVADAINNNPNAEIIYSDEDKINENQIRSGPYFK---YDFNPELFHVHNMI 216 + P AL + + P ++ + I S + ++ + Sbjct: 148 TVPEPDALRELVSPFEDPPVGAVVGTPR--IRNRPDPSNLLGRIQAIEYLSAFYFRLRAA 205 Query: 217 THLG----------VYRTETFKKIGGFREKFEGAQDYDLVLR 248 + G +R +++GG+ E +D DL LR Sbjct: 206 SKGGLISFLSGSSSAFRRSALEEVGGWLEDT-ITEDADLTLR 246 >gnl|CDD|133016 cd02525, Succinoglycan_BP_ExoA, ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus. Length = 249 Score = 73.4 bits (181), Expect = 1e-13 Identities = 52/241 (21%), Positives = 86/241 (35%), Gaps = 44/241 (18%) Query: 325 LVSIIIPTYNHHHLLKICLESIYHKTTYS--SFEVIIIDNLSDDSKTFLYLQKIQKKYPN 382 VSIIIP N ++ LES+ +Y E+I++D S D T +Q+ K P Sbjct: 1 FVSIIIPVRNEEKYIEELLESL-LNQSYPKDLIEIIVVDGGSTD-GTREIVQEYAAKDPR 58 Query: 383 LRVITDNTHPFNYSRIN----NNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQV 438 +R+I N RI N +++G ++ ++ E++ + Sbjct: 59 IRLID------NPKRIQSAGLNIGIRNSRGDIIIRVDAHAVYPKD-YILELVEALKRTGA 111 Query: 439 GAV-GARLWYRRKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTH 497 V G K K Q + G + ++ K V Sbjct: 112 DNVGGPMETIGESKFQKAIAVAQSSPLGSG---GSAYRGGAVKIG-YVD----------- 156 Query: 498 SISAVTGACMVMSKKCFMHVGGFDEKNTPVVFS---DIDLCLRILEAGYRNVWTPHADLY 554 T ++ F VGGFDE D +L R+ +AGY+ +P +Y Sbjct: 157 -----TVHHGAYRREVFEKVGGFDES-----LVRNEDAELNYRLRKAGYKIWLSPDIRVY 206 Query: 555 H 555 + Sbjct: 207 Y 207 Score = 66.5 bits (163), Expect = 2e-11 Identities = 46/203 (22%), Positives = 84/203 (41%), Gaps = 28/203 (13%) Query: 68 LISVIMPVYKIKKEWLEMAIESVRSQIYSH--WELCIAEDCSGDIETVSLLKKYANMDSR 125 +S+I+PV +K ++E +ES+ +Q Y E+ + + S D T ++++YA D R Sbjct: 1 FVSIIIPVRNEEK-YIEELLESLLNQSYPKDLIEIIVVDGGSTD-GTREIVQEYAAKDPR 58 Query: 126 IKVVFRAKNGHISAASNSAAQLATSEWLALLD-HDDLLHPTALYYVADAINN--NPNA-- 180 I+++ K SA N + + + + +D H P Y+ + + A Sbjct: 59 IRLIDNPK-RIQSAGLNIGIRNSRGDIIIRVDAHAVY--PKD--YILELVEALKRTGADN 113 Query: 181 -----EIIYSDEDKINENQIRSGP------YFKYDFNPELFHVHNMITHLGVYRTETFKK 229 E I + + +S P ++ + H G YR E F+K Sbjct: 114 VGGPMETIGESKFQKAIAVAQSSPLGSGGSAYRGGAVKIGYVDT---VHHGAYRREVFEK 170 Query: 230 IGGFREKFEGAQDYDLVLRFLEN 252 +GGF E +D +L R + Sbjct: 171 VGGFDESLVRNEDAELNYRLRKA 193 >gnl|CDD|133039 cd04196, GT_2_like_d, Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Length = 214 Score = 69.2 bits (170), Expect = 3e-12 Identities = 51/230 (22%), Positives = 94/230 (40%), Gaps = 47/230 (20%) Query: 70 SVIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKVV 129 +V+M Y +K +L ++S+ +Q Y + EL I++D S D TV ++K+Y + D I ++ Sbjct: 1 AVLMATYNGEK-YLREQLDSILAQTYKNDELIISDDGSTD-GTVEIIKEYIDKDPFIIIL 58 Query: 130 FRAKNGHISAASN--SAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIYSDE 187 R ++ A N S Q A +++ D DD+ P L + A + ++YSD Sbjct: 59 IRNG-KNLGVARNFESLLQAADGDYVFFCDQDDIWLPDKLERLLKAFLKDDKPLLVYSDL 117 Query: 188 DKINENQIRSGPYF------KYDFNPELFHVHNMITHLGVYRTETFKKIGG----FREKF 237 + ++EN G F K + N++T G F + Sbjct: 118 ELVDENGNPIGESFFEYQKIKPGTSFNNLLFQNVVT--------------GCTMAFNREL 163 Query: 238 EGAQDYDLVLRFLENIDLS-------------QIIHIPRVLYHWRMHDNS 274 +L L F + + +++ + L +R H N+ Sbjct: 164 -----LELALPFPDADVIMHDWWLALLASAFGKVVFLDEPLILYRQHGNN 208 >gnl|CDD|133045 cd06423, CESA_like, CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the elongation of beta-1,2 polyglucose chains of Glucan. Length = 180 Score = 68.0 bits (167), Expect = 6e-12 Identities = 46/200 (23%), Positives = 78/200 (39%), Gaps = 27/200 (13%) Query: 328 IIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRVIT 387 II+P YN +++ +ES+ Y EVI++D+ S D T L+++ Y ++ Sbjct: 1 IIVPAYNEEAVIERTIESL-LALDYPKLEVIVVDDGSTD-DTLEILEELAALYIRRVLVV 58 Query: 388 DNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEM-MGIASQPQVGAVGARLW 446 + + N HAKG L+ DT ++ L + + + P+VGAV R+ Sbjct: 59 RDKENGGKAGALNAGLRHAKGDIVVVLDADT-ILEPDALKRLVVPFFADPKVGAVQGRVR 117 Query: 447 YR--RKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTHSISAVTG 504 R + L RLQ + G A + ++G Sbjct: 118 VRNGSENLL---TRLQAIEYLSIFR--LGR----------------RAQSALGGVLVLSG 156 Query: 505 ACMVMSKKCFMHVGGFDEKN 524 A ++ VGG+DE Sbjct: 157 AFGAFRREALREVGGWDEDT 176 Score = 61.9 bits (151), Expect = 5e-10 Identities = 37/177 (20%), Positives = 74/177 (41%), Gaps = 13/177 (7%) Query: 71 VIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKVVF 130 +I+P Y ++ +E IES+ + Y E+ + +D S D +T+ +L++ A + R +V Sbjct: 1 IIVPAYN-EEAVIERTIESLLALDYPKLEVIVVDDGSTD-DTLEILEELAALYIRRVLVV 58 Query: 131 RAKNGH-ISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEI------I 183 R K + A N+ + A + + +LD D +L P AL + +P + Sbjct: 59 RDKENGGKAGALNAGLRHAKGDIVVVLDADTILEPDALKRLVVPFFADPKVGAVQGRVRV 118 Query: 184 YSDEDKINENQIRSGPYFKYDFNPELFHVHNMITHL----GVYRTETFKKIGGFREK 236 + + + + + L G +R E +++GG+ E Sbjct: 119 RNGSENLLTRLQAIEYLSIFRLGRRAQSALGGVLVLSGAFGAFRREALREVGGWDED 175 >gnl|CDD|133042 cd06420, GT2_Chondriotin_Pol_N, N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm to undergo cytokinesis and cell division. Chondroitin is synthesized as proteoglycans, sulfated and secreted to the cell surface or extracellular matrix. Length = 182 Score = 58.4 bits (142), Expect = 6e-09 Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 29/188 (15%) Query: 71 VIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDS-RIKVV 129 +I+ Y + E LE+ ++SV +Q +E+ IA+D S + ET L++++ + IK V Sbjct: 1 LIITTYN-RPEALELVLKSVLNQSILPFEVIIADDGSTE-ETKELIEEFKSQFPIPIKHV 58 Query: 130 FRAKNGHISAAS-NSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEIIYSDED 188 ++ G A N A A ++L +D D + HP ++AD I AE Sbjct: 59 WQEDEGFRKAKIRNKAIAAAKGDYLIFIDGDCIPHP---DFIADHIE---LAE------- 105 Query: 189 KINENQIRSGPYFKYD--FNPELFHVHNMITHLGVYRTETFKKIGGFREKFE--GAQDYD 244 SG + NM ++ + + GF E+F G +D + Sbjct: 106 ---PGVFLSGSRVLLNEKLTERGIRGCNM----SFWKKD-LLAVNGFDEEFTGWGGEDSE 157 Query: 245 LVLRFLEN 252 LV R L + Sbjct: 158 LVARLLNS 165 Score = 46.8 bits (112), Expect = 2e-05 Identities = 27/91 (29%), Positives = 50/91 (54%), Gaps = 3/91 (3%) Query: 328 IIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYP-NLRVI 386 +II TYN L++ L+S+ +++ FEVII D+ S + L +++ + ++P ++ + Sbjct: 1 LIITTYNRPEALELVLKSVLNQSIL-PFEVIIADDGSTEETKEL-IEEFKSQFPIPIKHV 58 Query: 387 TDNTHPFNYSRINNNATLHAKGQYFCFLNND 417 F ++I N A AKG Y F++ D Sbjct: 59 WQEDEGFRKAKIRNKAIAAAKGDYLIFIDGD 89 >gnl|CDD|133035 cd04192, GT_2_like_e, Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Length = 229 Score = 55.4 bits (134), Expect = 4e-08 Identities = 53/224 (23%), Positives = 87/224 (38%), Gaps = 31/224 (13%) Query: 328 IIIPTYNHHHLLKICLESIYH-KTTYSSFEVIIIDNLSDDSKTFLYLQKI-QKKYPNLRV 385 ++I N L L+S+ FEVI++D+ S D T L+ K L++ Sbjct: 1 VVIAARNEAENLPRLLQSLSALDYPKEKFEVILVDDHSTD-GTVQILEFAAAKPNFQLKI 59 Query: 386 ITDNTHPFNYSRINNNATL--HAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAV-G 442 + +N+ + N T AKG + + D V+ WL + + Q+G V G Sbjct: 60 L-NNSRVSISGKKNALTTAIKAAKGDWIVTTDADC-VVPSNWLLTFVAFIQKEQIGLVAG 117 Query: 443 ARLWYRRKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTHSISAV 502 ++++ K L + +RL ++ I G F + Sbjct: 118 PVIYFKGKSLLAKFQRLDWLSLLGLIAGSFGLG-----------------KPFMCN---- 156 Query: 503 TGACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNV 546 GA M K+ F VGGF E N + D +L L + + Y V Sbjct: 157 -GANMAYRKEAFFEVGGF-EGNDHIASGDDELLLAKVASKYPKV 198 >gnl|CDD|133063 cd06913, beta3GnTL1_like, Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified. Length = 219 Score = 50.1 bits (120), Expect = 2e-06 Identities = 54/216 (25%), Positives = 87/216 (40%), Gaps = 51/216 (23%) Query: 328 IIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRVIT 387 II+P +N L CLES+ + + E+ + ++ S D K+ ++K +KK + VI Sbjct: 1 IILPVHNGEQWLDECLESVLQQDFEGTLELSVFNDASTD-KSAEIIEKWRKKLEDSGVIV 59 Query: 388 -----DNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAVG 442 ++ P N A + G+Y CFL++D +V MM PQ Sbjct: 60 LVGSHNSPSPKGVGYAKNQAIAQSSGRYLCFLDSD-DV--------MM-----PQ----- 100 Query: 443 ARLWYRRKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTHS---- 498 R+ RLQ+ + N+I G + + S Y + T Sbjct: 101 -RI------------RLQYEAALQHPNSIIGCQVRRIPED-STERYTRWINTLTREQLLT 146 Query: 499 -ISAVTGACMVM-----SKKCFMHVGGFDE--KNTP 526 + G ++M S++ F HVG FDE K P Sbjct: 147 QVYTSHGPTVIMPTWFCSREWFSHVGPFDEGGKGVP 182 Score = 44.8 bits (106), Expect = 8e-05 Identities = 52/227 (22%), Positives = 97/227 (42%), Gaps = 26/227 (11%) Query: 71 VIMPVYKIKKEWLEMAIESVRSQIYSH-WELCIAEDCSGDIETVSLLKKYANMDSRIKVV 129 +I+PV+ ++WL+ +ESV Q + EL + D S D ++ +++K+ V+ Sbjct: 1 IILPVHN-GEQWLDECLESVLQQDFEGTLELSVFNDASTD-KSAEIIEKWRKKLEDSGVI 58 Query: 130 FRAKNGH------ISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEII 183 + + + A N A ++ +L LD DD++ P + +A +PN+ II Sbjct: 59 VLVGSHNSPSPKGVGYAKNQAIAQSSGRYLCFLDSDDVMMPQRIRLQYEAALQHPNS-II 117 Query: 184 YSDEDKINENQIRSGPYFKYDFN--------PELFHVH--NMITHLGVYRTETFKKIGGF 233 +I E+ S + N +++ H +I E F +G F Sbjct: 118 GCQVRRIPED---STERYTRWINTLTREQLLTQVYTSHGPTVIMPTWFCSREWFSHVGPF 174 Query: 234 REKFEGAQDYDLVLRFLENIDLS-QIIHIPRVLYHWRMHDNSTAQKI 279 E +G + DL L F E++ + + R L +R H +T + Sbjct: 175 DEGGKGVPE-DL-LFFYEHLRKGGGVYRVDRCLLLYRYHPGATTHSV 219 >gnl|CDD|133038 cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Length = 201 Score = 48.1 bits (115), Expect = 8e-06 Identities = 53/191 (27%), Positives = 82/191 (42%), Gaps = 13/191 (6%) Query: 70 SVIMPVY-KIKKEWLEMAIESVRSQIYSHWELCIAED--CSGDIETVSLLKKYANMDSRI 126 SV+M VY K K E+L A+ES+ Q E+ + +D + + V L+++ + Sbjct: 1 SVLMSVYIKEKPEFLREALESILKQTLPPDEVVLVKDGPVTQSLNEV--LEEFKR-KLPL 57 Query: 127 KVVFRAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTALYYVADAINNNPNAEII--Y 184 KVV KN + A N + T +W+A +D DD+ P D I NP +I+ Sbjct: 58 KVVPLEKNRGLGKALNEGLKHCTYDWVARMDTDDISLPDRFEKQLDFIEKNPEIDIVGGG 117 Query: 185 SDEDKINENQI--RSGPYFKYDFNPELFHVHNMITHLGV-YRTETFKKIGGFREKFEGAQ 241 E + N I R P D + + H V +R +GG++ + Sbjct: 118 VLEFDSDGNDIGKRRLPTSHDDIL-KFARRRSPFNHPTVMFRKSKVLAVGGYQ-DLPLVE 175 Query: 242 DYDLVLRFLEN 252 DY L R L N Sbjct: 176 DYALWARMLAN 186 >gnl|CDD|133030 cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily. Length = 181 Score = 45.2 bits (108), Expect = 6e-05 Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 8/85 (9%) Query: 86 AIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKVVFRAKN-GHISAASNSA 144 +++V + +E+ +D S D T+ +L++ A D R+KV+ ++N G +A Sbjct: 18 RLKAVLESLGYDYEIIFVDDGSTD-RTLEILRELAARDPRVKVIRLSRNFGQQAA---LL 73 Query: 145 AQL--ATSEWLALLDHDDLLHPTAL 167 A L A + + +D DL P L Sbjct: 74 AGLDHARGDAVITMD-ADLQDPPEL 97 Score = 40.2 bits (95), Expect = 0.002 Identities = 19/66 (28%), Positives = 35/66 (53%), Gaps = 13/66 (19%) Query: 328 IIIPTYN-------HHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKY 380 I++P YN + LK LES+ + +E+I +D+ S D +T L+++ + Sbjct: 1 IVVPVYNEEENLPELYERLKAVLESLGY-----DYEIIFVDDGSTD-RTLEILRELAARD 54 Query: 381 PNLRVI 386 P ++VI Sbjct: 55 PRVKVI 60 >gnl|CDD|133022 cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the existence of adapter molecules for membrane anchoring. This family also includes bacteria and archaea DPM1_like enzymes. However, the enzyme structure and mechanism of function are not well understood. The UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate. This protein family belongs to Glycosyltransferase 2 superfamily. Length = 185 Score = 44.1 bits (105), Expect = 1e-04 Identities = 16/69 (23%), Positives = 32/69 (46%), Gaps = 5/69 (7%) Query: 328 IIIPTYNHHHLLKICLESIY-HKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRVI 386 ++IP YN + +E + +E+I++D+ S D T +++ + P +RVI Sbjct: 1 VVIPAYNEEENIPELVERLLAVLEEGYDYEIIVVDDGSTD-GTAEIARELAARVPRVRVI 59 Query: 387 TDNTHPFNY 395 N+ Sbjct: 60 ---RLSRNF 65 Score = 39.9 bits (94), Expect = 0.002 Identities = 23/97 (23%), Positives = 49/97 (50%), Gaps = 7/97 (7%) Query: 71 VIMPVYKIKKEWLEMAIESVRS--QIYSHWELCIAEDCSGDIETVSLLKKYANMDSRIKV 128 V++P Y ++E + +E + + + +E+ + +D S D T + ++ A R++V Sbjct: 1 VVIPAYN-EEENIPELVERLLAVLEEGYDYEIIVVDDGSTD-GTAEIARELAARVPRVRV 58 Query: 129 VFRAKN-GHISAASNSAAQLATSEWLALLDHDDLLHP 164 + ++N G AA + + A + + +D DL HP Sbjct: 59 IRLSRNFGK-GAAVRAGFKAARGDIVVTMD-ADLQHP 93 >gnl|CDD|133013 cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Length = 221 Score = 43.3 bits (103), Expect = 2e-04 Identities = 44/217 (20%), Positives = 72/217 (33%), Gaps = 52/217 (23%) Query: 326 VSIIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLRV 385 +SIIIPT N L L S+ E+I++D S D + + V Sbjct: 1 LSIIIPTLNEAENLPRLLASLRRLNPL-PLEIIVVDGGSTDGTV----AIARSA--GVVV 53 Query: 386 ITDNTHPFNYSR---INNNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAVG 442 I+ R +N A A+G + FL+ DT + W + ++ A Sbjct: 54 ISSPK-----GRARQMNAGAA-AARGDWLLFLHADTRLPPD-WDAAIIETLRADGAVAGA 106 Query: 443 ARLWYRRKKLWKRSKRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQAFAMHFTHSISAV 502 RL + R + +G N R Sbjct: 107 FRLRFDDPGPRLR-------LLELGAN-----------LRS----------RLFGLP--- 135 Query: 503 TG-ACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRI 538 G + + ++ F +GGF E P + D++L R+ Sbjct: 136 YGDQGLFIRRELFEELGGFPE--LP-LMEDVELVRRL 169 >gnl|CDD|133061 cd06439, CESA_like_1, CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Length = 251 Score = 43.0 bits (102), Expect = 2e-04 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 6/104 (5%) Query: 66 KPLISVIMPVYKIKKEWLEMAIESVRSQIYSH--WELCIAEDCSGDIETVSLLKKYANMD 123 P +++I+P Y ++ +E +E++ + Y E+ + D S D T + ++YA D Sbjct: 28 LPTVTIIIPAYN-EEAVIEAKLENLLALDYPRDRLEIIVVSDGSTD-GTAEIAREYA--D 83 Query: 124 SRIKVVFRAKNGHISAASNSAAQLATSEWLALLDHDDLLHPTAL 167 +K++ + +AA N A LAT E + D + LL P AL Sbjct: 84 KGVKLLRFPERRGKAAALNRALALATGEIVVFTDANALLDPDAL 127 Score = 28.7 bits (65), Expect = 5.4 Identities = 9/15 (60%), Positives = 10/15 (66%) Query: 320 PNPPPLVSIIIPTYN 334 P P V+IIIP YN Sbjct: 25 PAYLPTVTIIIPAYN 39 >gnl|CDD|133028 cd04185, GT_2_like_b, Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Length = 202 Score = 41.5 bits (98), Expect = 6e-04 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 2/52 (3%) Query: 329 IIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLS-DDSKTFLYLQKIQKK 379 ++ TYN LLK CL+++ T +I+IDN S D + +L Sbjct: 2 VVVTYNRLDLLKECLDAL-LAQTRPPDHIIVIDNASTDGTAEWLTSLGDLDN 52 >gnl|CDD|133056 cd06434, GT2_HAS, Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate. Length = 235 Score = 41.5 bits (98), Expect = 7e-04 Identities = 35/131 (26%), Positives = 50/131 (38%), Gaps = 11/131 (8%) Query: 326 VSIIIPTYNHH-HLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNLR 384 V++IIP Y+ + + CL SI + V+ + L + KY + Sbjct: 2 VTVIIPVYDEDPDVFRECLRSIL-RQKPLEIIVVTDGDDEPY----LSILSQTVKYGGIF 56 Query: 385 VITDNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEMMGIASQPQVGAVGA- 443 VIT HP R H L++DT V L EM+ P+VG VG Sbjct: 57 VITV-PHP-GKRRALAEGIRHVTTDIVVLLDSDT-VWPPNALPEMLKPFEDPKVGGVGTN 113 Query: 444 -RLWYRRKKLW 453 R+ R W Sbjct: 114 QRILRPRDSKW 124 Score = 28.0 bits (63), Expect = 9.1 Identities = 7/25 (28%), Positives = 13/25 (52%) Query: 69 ISVIMPVYKIKKEWLEMAIESVRSQ 93 ++VI+PVY + + S+ Q Sbjct: 2 VTVIIPVYDEDPDVFRECLRSILRQ 26 >gnl|CDD|133031 cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate. Length = 211 Score = 40.6 bits (96), Expect = 0.001 Identities = 28/101 (27%), Positives = 43/101 (42%), Gaps = 20/101 (19%) Query: 328 IIIPTYNHHHLLKICLESIY---HKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYPNL- 383 ++IP YN L LE + S+E+I++D+ S D T +K+ +K P L Sbjct: 1 VVIPAYNEEKRLPPTLEEAVEYLEERPSFSYEIIVVDDGSKD-GTAEVARKLARKNPALI 59 Query: 384 RVITDNTHPFNYSRIN-------NNATLHAKGQYFCFLNND 417 RV+T + N L A+G Y F + D Sbjct: 60 RVLT-------LPK-NRGKGGAVRAGMLAARGDYILFADAD 92 Score = 29.1 bits (66), Expect = 3.6 Identities = 23/93 (24%), Positives = 42/93 (45%), Gaps = 5/93 (5%) Query: 71 VIMPVYKIKK---EWLEMAIESVRSQIYSHWELCIAEDCSGDIETVSLLKKYAN-MDSRI 126 V++P Y +K LE A+E + + +E+ + +D S D T + +K A + I Sbjct: 1 VVIPAYNEEKRLPPTLEEAVEYLEERPSFSYEIIVVDDGSKD-GTAEVARKLARKNPALI 59 Query: 127 KVVFRAKNGHISAASNSAAQLATSEWLALLDHD 159 +V+ KN A + A +++ D D Sbjct: 60 RVLTLPKNRGKGGAVRAGMLAARGDYILFADAD 92 >gnl|CDD|133012 cd02520, Glucosylceramide_synthase, Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment. Length = 196 Score = 39.9 bits (94), Expect = 0.002 Identities = 33/147 (22%), Positives = 57/147 (38%), Gaps = 29/147 (19%) Query: 324 PLVSIIIP-------TYNHHHLLKICLESIYHKTTYSSFEVII-IDNLSDDSKTFLYLQK 375 P VSI+ P Y LES + + Y +E++ + + D + ++K Sbjct: 1 PGVSILKPLCGVDPNLYE-------NLESFFQQD-YPKYEILFCVQDEDDPA--IPVVRK 50 Query: 376 IQKKYPNL--RVITDNTHPFNYSRINN--NATLHAKGQYFCFLNNDTEVINGQWLSEMMG 431 + KYPN+ R++ ++NN A+ ++D V +L M+ Sbjct: 51 LIAKYPNVDARLLIGGEKVGINPKVNNLIKGYEEARYDILVISDSDISVPPD-YLRRMVA 109 Query: 432 IASQPQVGAV------GARLWYRRKKL 452 P VG V G + RR+ L Sbjct: 110 PLMDPGVGLVTCLCAFGKSMALRREVL 136 >gnl|CDD|133062 cd06442, DPM1_like, DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the existence of adapter molecules for membrane anchoring. This family also includes bacteria and archaea DPM1_like enzymes. However, the enzyme structure and mechanism of function are not well understood. This protein family belongs to Glycosyltransferase 2 superfamily. Length = 224 Score = 39.8 bits (94), Expect = 0.002 Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 5/61 (8%) Query: 328 IIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDDSK--TFLYLQKIQKKYPNLRV 385 IIIPTYN + +E + +E+I++D D+S T ++++ K+YP +R+ Sbjct: 1 IIIPTYNERENIPELIERLDAALKGIDYEIIVVD---DNSPDGTAEIVRELAKEYPRVRL 57 Query: 386 I 386 I Sbjct: 58 I 58 Score = 38.7 bits (91), Expect = 0.005 Identities = 21/95 (22%), Positives = 48/95 (50%), Gaps = 4/95 (4%) Query: 71 VIMPVYKIKKEWLEMAIESVRSQIYS-HWELCIAEDCSGDIETVSLLKKYANMDSRIKVV 129 +I+P Y ++E + IE + + + +E+ + +D S D T ++++ A R++++ Sbjct: 1 IIIPTYN-ERENIPELIERLDAALKGIDYEIIVVDDNSPD-GTAEIVRELAKEYPRVRLI 58 Query: 130 FRAKNGHISAASNSAAQLATSEWLALLDHDDLLHP 164 R + +A + A + + ++D DL HP Sbjct: 59 VRPGKRGLGSAYIEGFKAARGDVIVVMDA-DLSHP 92 >gnl|CDD|38189 KOG2978, KOG2978, KOG2978, Dolichol-phosphate mannosyltransferase [General function prediction only]. Length = 238 Score = 38.4 bits (89), Expect = 0.006 Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 6/95 (6%) Query: 327 SIIIPTYNHHHLLKICLESI--YHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKY--PN 382 S+I+PTYN L I I Y +E+II+D+ S D T + +QK Y N Sbjct: 6 SVILPTYNEKENLPIITRLIAKYMSEEGKKYEIIIVDDASPDG-TQEVAKALQKIYGEDN 64 Query: 383 LRVITDNTHPFNYSRINNNATLHAKGQYFCFLNND 417 + + T + HA G + ++ D Sbjct: 65 ILLKP-RTKKLGLGTAYIHGLKHATGDFIVIMDAD 98 >gnl|CDD|38188 KOG2977, KOG2977, KOG2977, Glycosyltransferase [General function prediction only]. Length = 323 Score = 35.7 bits (82), Expect = 0.043 Identities = 27/104 (25%), Positives = 45/104 (43%), Gaps = 17/104 (16%) Query: 326 VSIIIPTYNHHHLLKIC-------LESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQK 378 +S+I+P YN + LE Y ++E+I++D+ S DS + L+ +K Sbjct: 69 LSVIVPAYNEEGRIGAMLDEAVDYLEKRYLSDKSFTYEIIVVDDGSTDSTVEVALKFSRK 128 Query: 379 -KYPNLRVITDNTHPFNYSRINNNAT----LHAKGQYFCFLNND 417 N+RVI +R A L ++GQ F + D Sbjct: 129 LGDDNIRVI-----KLKKNRGKGGAVRKGMLSSRGQKILFADAD 167 >gnl|CDD|38940 KOG3736, KOG3736, KOG3736, Polypeptide N-acetylgalactosaminyltransferase [Posttranslational modification, protein turnover, chaperones]. Length = 578 Score = 31.9 bits (72), Expect = 0.49 Identities = 25/109 (22%), Positives = 48/109 (44%), Gaps = 10/109 (9%) Query: 324 PLVSIIIPTYNHHHLLKICLESIYHKTTYSS----FEVIIIDNLSDDSKTFLYLQKIQKK 379 P S+II +N L +++ + E+I++D+ SD L++ K+ Sbjct: 142 PTTSVIIIFHNEA--WSTLLRTVHSVINRTPPYLLKEIILVDDFSDRDHLKDKLEEYVKR 199 Query: 380 YPNLRVI-TDNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLS 427 + +R++ T +R+ + A G+ FL++ EV G WL Sbjct: 200 FSKVRILRTKKREGLIRARLLGAS--MATGEVLTFLDSHCEVNVG-WLE 245 >gnl|CDD|133005 cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide. UDP-glucose: lipooligosaccharide (LOS) beta-1-4-glucosyltransferase catalyzes the addition of the first residue, glucose, of the lacto-N-neotetrase structure to HepI of the LOS inner core. LOS is the major constituent of the outer leaflet of the outer membrane of gram-positive bacteria. It consists of a short oligosaccharide chain of variable composition (alpha chain) attached to a branched inner core which is lined in turn to lipid A. Beta 1,4 glucosyltransferase is required to attach the alpha chain to the inner core. Length = 229 Score = 31.5 bits (72), Expect = 0.70 Identities = 28/103 (27%), Positives = 44/103 (42%), Gaps = 29/103 (28%) Query: 69 ISVIMPVY----KIKKEWLEMAIESVRSQIYSHWELCIAE-----DCSGDIETVSLLKKY 119 +SV++ I++ +ESV+ + E S D TV + K+Y Sbjct: 2 LSVVIITKNEERNIER-----CLESVK--------WAVDEIIVVDSGSTD-RTVEIAKEY 47 Query: 120 ANMDSRIKVVFRAKNGHISAASNSAAQLATSEWLALLDHDDLL 162 KV R +G A N A +LAT++W+ LD D+ L Sbjct: 48 -----GAKVYQRWWDGF-GAQRNFALELATNDWVLSLDADERL 84 Score = 30.7 bits (70), Expect = 1.2 Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 4/41 (9%) Query: 326 VSIIIPTYNHHHLLKICLESIYHKTTYSSFEVIIIDNLSDD 366 +S++I T N ++ CLES+ E+I++D+ S D Sbjct: 2 LSVVIITKNEERNIERCLESVKWAVD----EIIVVDSGSTD 38 >gnl|CDD|132897 cd07057, BMC_CcmK, Carbon dioxide concentrating mechanism (CcmK); Bacterial Micro-Compartment (BCM) domain. CcmK1-4 and CcmL proteins found in Synechocystis sp. strain PCC 6803 make up the beta carboxysome shell. These CcmK proteins have been shown to form hexameric units, while the CcmL proteins have been shown to form pentameric units. Together these proteins further assemble into the flat facets of the polyhedral carboxysome shell. The structures suggest that the central pores and the gaps between hexamers limit the transport of metabolites into and out of the the carboxysome. Length = 88 Score = 31.1 bits (71), Expect = 0.84 Identities = 9/31 (29%), Positives = 15/31 (48%) Query: 301 GIAAKAVFDGAQYRTHYMIPNPPPLVSIIIP 331 GI A G + +H +IP P + ++P Sbjct: 57 GIEAAERVHGGEVLSHTIIPRPHENLEAVLP 87 >gnl|CDD|133017 cd02526, GT2_RfbF_like, RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen. Length = 237 Score = 30.7 bits (70), Expect = 1.3 Identities = 23/114 (20%), Positives = 37/114 (32%), Gaps = 13/114 (11%) Query: 137 ISAASNSAAQLATS---EWLALLDHDDLLHPTA---LYYVADAINNNPNAEII---YSDE 187 I+ A N + A +++ L D D + P L + N N + D Sbjct: 60 IAKALNIGIKAALENGADYVLLFDQDSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDR 119 Query: 188 DKINENQIRSGPYFKYDFNPELFH----VHNMITHLGVYRTETFKKIGGFREKF 237 + +K E V +IT + E +K+GGF E Sbjct: 120 RTGENSPGVRKSGYKLRIQKEGEEGLKEVDFLITSGSLISLEALEKVGGFDEDL 173 Score = 30.7 bits (70), Expect = 1.4 Identities = 18/52 (34%), Positives = 24/52 (46%), Gaps = 7/52 (13%) Query: 507 MVMSKKCFMHVGGFDEKNTPVVF---SDIDLCLRILEAGYRNVWTPHADLYH 555 ++S + VGGFDE +F D + CLR GY+ P A L H Sbjct: 156 SLISLEALEKVGGFDED----LFIDYVDTEWCLRARSKGYKIYVVPDAVLKH 203 >gnl|CDD|38403 KOG3193, KOG3193, KOG3193, K+ channel subunit [Inorganic ion transport and metabolism]. Length = 1087 Score = 30.5 bits (68), Expect = 1.5 Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 7/68 (10%) Query: 342 CLESIYHKTTYSSFEVIIIDNLSD-------DSKTFLYLQKIQKKYPNLRVITDNTHPFN 394 L+ + S+ V+++ + D T + +QKI + +P LR+IT+ TH N Sbjct: 773 NLDCLLRAGVSSAEHVVVVKETAVMAEEHTADCNTIITVQKIHRMFPRLRMITELTHATN 832 Query: 395 YSRINNNA 402 + N Sbjct: 833 MRFVQFNP 840 >gnl|CDD|38942 KOG3738, KOG3738, KOG3738, Predicted polypeptide N-acetylgalactosaminyltransferase [Posttranslational modification, protein turnover, chaperones]. Length = 559 Score = 30.0 bits (67), Expect = 1.8 Identities = 27/109 (24%), Positives = 47/109 (43%), Gaps = 8/109 (7%) Query: 324 PLVSIIIPTYNHHH--LLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQKKYP 381 P S+II +N LL+ + + + E+I++D+ S D + L++I K Sbjct: 124 PPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIHEIILVDDFSQDPEDGKLLKRIP-KVR 182 Query: 382 NLRVITDNTHPFNYSRINNNATLHAKGQYFCFLNNDTEVINGQWLSEMM 430 LR + SR+ A+ FL++ EV N WL ++ Sbjct: 183 VLR--NNEREGLIRSRVRGADV--AQATVLTFLDSHCEV-NEGWLEPLL 226 >gnl|CDD|37758 KOG2547, KOG2547, KOG2547, Ceramide glucosyltransferase [Lipid transport and metabolism, Cell wall/membrane/envelope biogenesis]. Length = 431 Score = 30.3 bits (68), Expect = 1.8 Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 9/119 (7%) Query: 55 KILQNISQWQNKPLISVIMPVYKIKKEWLEMAIESVRSQIYSHWELCIAEDCSGD--IET 112 ++ + P +S+I P+ K L +ES + Y +EL + S D IE Sbjct: 73 RLHKKSKPDPKLPGVSIIKPL-KGVDPNLYHNLESFFTSQYHKYELLFCVESSEDPAIEV 131 Query: 113 VS-LLKKYANMDSRIKVVFRAKNGHISAASNS---AAQLATSEWLALLDHDDLLHPTAL 167 V LLKKY N+D+ K+ F + ++ N+ + A + + + D + P + Sbjct: 132 VERLLKKYPNVDA--KLFFGGEKVGLNPKINNMMPGYRAAKYDLVLISDSGIFMKPDTI 188 >gnl|CDD|146567 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1 family). Length = 619 Score = 29.8 bits (67), Expect = 2.1 Identities = 23/95 (24%), Positives = 46/95 (48%), Gaps = 6/95 (6%) Query: 11 SLLRKIKLKFPSRHKRRIIKFENFEKEYRE-WSSLFHDTSPQFTKKILQN---ISQWQNK 66 L K+ ++ K+R K ++ ++ +E W+ L S + K+ ++ +SQ K Sbjct: 210 DKLDKMLENLAAQKKQRADKIDDLREQIQELWNRL--QISDEEQKRFVREATILSQESIK 267 Query: 67 PLISVIMPVYKIKKEWLEMAIESVRSQIYSHWELC 101 L + + +KK+ L+ IE +R +I W+L Sbjct: 268 RLEEEVERLEALKKQNLKKFIEDLRIEIQELWDLL 302 >gnl|CDD|38795 KOG3588, KOG3588, KOG3588, Chondroitin synthase 1 [Carbohydrate transport and metabolism]. Length = 494 Score = 29.2 bits (65), Expect = 3.2 Identities = 14/47 (29%), Positives = 24/47 (51%) Query: 509 MSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNVWTPHADLYH 555 + F+ VGGFD + D+DL + + +G + + TP L+H Sbjct: 390 QYRSDFLTVGGFDMEIKGWGGEDVDLYRKYVHSGLKVIRTPEPGLFH 436 >gnl|CDD|37255 KOG2044, KOG2044, KOG2044, 5'-3' exonuclease HKE1/RAT1 [Replication, recombination and repair, RNA processing and modification]. Length = 931 Score = 28.8 bits (64), Expect = 4.2 Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 3/30 (10%) Query: 160 DLLHPTALYYVADAINNNP---NAEIIYSD 186 D L YY+ D +N++P N ++I SD Sbjct: 168 DRLAKALRYYIHDRLNSDPGWKNIKVILSD 197 >gnl|CDD|133057 cd06435, CESA_NdvC_like, NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response. Length = 236 Score = 28.5 bits (64), Expect = 5.4 Identities = 56/239 (23%), Positives = 87/239 (36%), Gaps = 53/239 (22%) Query: 327 SIIIPTYNHH-HLLKICLESIYHKTTYSSFEVIIIDNLSDDSKTFLYLQKIQK--KYPNL 383 SI +P Y ++K L+S+ Y +FEVI+IDN + D + + ++ Sbjct: 1 SIHVPCYEEPPEMVKETLDSL-AALDYPNFEVIVIDNNTKDEALW---KPVEAHCAQLGE 56 Query: 384 RVITDNTHPFNYSRINNNATLHAKGQYFC--FLNNDTEVI---------NGQWLSEMMGI 432 R F + + A + D E+I WL ++ I Sbjct: 57 R--------FRFFHVEPLPGAKAGALNYALERTAPDAEIIAVIDADYQVEPDWLKRLVPI 108 Query: 433 ASQPQVGAVGARLWYRRKKLWKRS--KRLQHGGVIMGINNIAGHKNKHHKARCSVPNYQA 490 P+VG V A YR + S KR+ + A +K S A Sbjct: 109 FDDPRVGFVQAPQDYRD---GEESLFKRMCY----------AEYKGFFDIGMVSRNERNA 155 Query: 491 FAMHFTHSISAVTGACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNVWTP 549 H T C++ + VGG+DE + D +L LR+ EAGY V+ Sbjct: 156 IIQHGT--------MCLI-RRSALDDVGGWDEW---CITEDSELGLRMHEAGYIGVYVA 202 >gnl|CDD|37204 KOG1993, KOG1993, KOG1993, Nuclear transport receptor KAP120 (importin beta superfamily) [Nuclear structure, Intracellular trafficking, secretion, and vesicular transport]. Length = 978 Score = 28.4 bits (63), Expect = 6.9 Identities = 10/47 (21%), Positives = 13/47 (27%) Query: 512 KCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYRNVWTPHADLYHDES 558 M V F P FS C LE ++ + L Sbjct: 267 IKLMKVLAFLFNRHPFSFSFYSPCPVKLEFSIDYIFDEYDFLGQISG 313 >gnl|CDD|32766 COG2943, MdoH, Membrane glycosyltransferase [Cell envelope biogenesis, outer membrane]. Length = 736 Score = 28.0 bits (62), Expect = 8.7 Identities = 8/35 (22%), Positives = 18/35 (51%) Query: 151 EWLALLDHDDLLHPTALYYVADAINNNPNAEIIYS 185 ++ +LD D ++ L + + NP+A +I + Sbjct: 242 SYMLVLDADSVMTGDCLVRLVRLMEANPDAGLIQT 276 >gnl|CDD|36140 KOG0922, KOG0922, KOG0922, DEAH-box RNA helicase [RNA processing and modification]. Length = 674 Score = 27.9 bits (62), Expect = 8.9 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 4/38 (10%) Query: 353 SSFEVIIIDNLSDDSKT----FLYLQKIQKKYPNLRVI 386 S + VII+D + S L+KI KK P+L++I Sbjct: 162 SKYSVIILDEAHERSLHTDILLGLLKKILKKRPDLKLI 199 >gnl|CDD|133049 cd06427, CESA_like_2, CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Length = 241 Score = 27.6 bits (62), Expect = 9.5 Identities = 15/41 (36%), Positives = 18/41 (43%), Gaps = 3/41 (7%) Query: 504 GACMVMSKKCFMHVGGFDEKNTPVVFSDIDLCLRILEAGYR 544 G +GG+D N V D DL LR+ AGYR Sbjct: 163 GTSNHFRTDVLRELGGWDPFN---VTEDADLGLRLARAGYR 200 >gnl|CDD|35375 KOG0153, KOG0153, KOG0153, Predicted RNA-binding protein (RRM superfamily) [General function prediction only]. Length = 377 Score = 27.7 bits (61), Expect = 9.8 Identities = 15/61 (24%), Positives = 23/61 (37%), Gaps = 2/61 (3%) Query: 520 FDEKNTPVV--FSDIDLCLRILEAGYRNVWTPHADLYHDESRTRKYDHEDPAKMIVFQEA 577 + ++N P + F C R E YR+ P L + R Y DP + + A Sbjct: 155 YYKRNRPHICSFFVKGECKRGAECPYRHEKPPDDPLSLQNIKDRYYGLNDPVALKILNRA 214 Query: 578 C 578 Sbjct: 215 G 215 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.321 0.136 0.422 Gapped Lambda K H 0.267 0.0690 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 8,036,511 Number of extensions: 438335 Number of successful extensions: 1242 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1191 Number of HSP's successfully gapped: 74 Length of query: 623 Length of database: 6,263,737 Length adjustment: 100 Effective length of query: 523 Effective length of database: 4,102,837 Effective search space: 2145783751 Effective search space used: 2145783751 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 60 (27.0 bits)