Query psy6218
Match_columns 533
No_of_seqs 260 out of 1284
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 20:43:26 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6218.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6218hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF02434 Fringe: Fringe-like; 100.0 1.3E-30 2.9E-35 261.3 9.3 188 332-527 4-207 (252)
2 KOG2246|consensus 100.0 1.3E-29 2.9E-34 265.7 14.5 186 329-529 86-275 (364)
3 PLN03133 beta-1,3-galactosyltr 99.9 8.3E-26 1.8E-30 249.4 18.3 232 270-526 341-596 (636)
4 PLN03193 beta-1,3-galactosyltr 99.9 7.3E-26 1.6E-30 238.0 15.4 187 331-525 136-353 (408)
5 KOG2287|consensus 99.9 1.2E-24 2.6E-29 227.7 16.9 186 333-524 94-303 (349)
6 PF01762 Galactosyl_T: Galacto 99.9 1.4E-23 3.1E-28 201.4 13.6 167 350-522 2-194 (195)
7 PLN03153 hypothetical protein; 99.9 1.3E-21 2.8E-26 210.2 20.0 193 331-530 119-320 (537)
8 KOG3708|consensus 99.8 7.4E-19 1.6E-23 186.4 7.9 186 321-529 14-200 (681)
9 KOG2288|consensus 99.8 4.2E-18 9.2E-23 168.7 12.0 184 331-522 8-220 (274)
10 PTZ00210 UDP-GlcNAc-dependent 99.7 6.3E-16 1.4E-20 161.4 13.4 176 332-516 78-307 (382)
11 KOG3708|consensus 99.6 6.9E-17 1.5E-21 171.6 3.7 153 7-172 125-310 (681)
12 PF05679 CHGN: Chondroitin N-a 99.3 1.7E-12 3.7E-17 142.3 2.5 58 54-111 1-67 (499)
13 PF02434 Fringe: Fringe-like; 98.8 1.7E-09 3.8E-14 109.0 0.6 87 7-93 114-209 (252)
14 KOG3588|consensus 98.1 9E-07 1.9E-11 92.6 0.7 80 43-133 1-93 (494)
15 PLN03153 hypothetical protein; 97.4 0.00014 3E-09 80.0 4.7 106 7-119 238-344 (537)
16 PF04646 DUF604: Protein of un 97.0 0.00098 2.1E-08 67.5 5.9 58 471-529 10-68 (255)
17 KOG2246|consensus 94.5 0.034 7.4E-07 59.5 3.9 69 7-86 196-268 (364)
18 PF04646 DUF604: Protein of un 93.0 0.1 2.2E-06 53.1 4.0 82 36-119 11-93 (255)
19 PF13506 Glyco_transf_21: Glyc 91.8 1.1 2.3E-05 43.0 9.1 104 414-523 30-147 (175)
20 PF01755 Glyco_transf_25: Glyc 90.9 2.1 4.5E-05 40.9 10.2 148 338-491 4-189 (200)
21 PF13641 Glyco_tranf_2_3: Glyc 88.3 1.8 3.8E-05 41.4 7.6 109 404-522 77-202 (228)
22 cd04186 GT_2_like_c Subfamily 82.6 4.1 8.8E-05 36.1 6.6 90 415-529 74-164 (166)
23 PF01762 Galactosyl_T: Galacto 79.7 1.3 2.8E-05 42.6 2.5 48 27-78 140-187 (195)
24 TIGR03472 HpnI hopanoid biosyn 76.5 75 0.0016 33.7 14.8 116 399-523 113-246 (373)
25 cd02520 Glucosylceramide_synth 75.7 11 0.00023 35.6 7.4 85 415-524 86-171 (196)
26 cd06532 Glyco_transf_25 Glycos 70.0 9.6 0.00021 34.3 5.4 107 339-492 3-118 (128)
27 cd06421 CESA_CelA_like CESA_Ce 69.8 16 0.00035 34.6 7.2 102 415-524 84-205 (234)
28 cd02526 GT2_RfbF_like RfbF is 68.5 27 0.00059 33.3 8.5 103 415-523 75-197 (237)
29 PTZ00210 UDP-GlcNAc-dependent 68.0 4.7 0.0001 43.6 3.3 68 7-80 226-307 (382)
30 PF13632 Glyco_trans_2_3: Glyc 67.7 19 0.00042 33.6 7.1 100 418-524 1-118 (193)
31 cd04185 GT_2_like_b Subfamily 67.1 18 0.0004 33.7 6.9 85 414-519 78-163 (202)
32 cd06420 GT2_Chondriotin_Pol_N 63.8 42 0.00092 30.5 8.5 92 415-520 79-170 (182)
33 TIGR01556 rhamnosyltran L-rham 63.1 36 0.00078 34.1 8.5 106 404-518 65-189 (281)
34 TIGR03469 HonB hopene-associat 62.8 28 0.00062 37.1 8.1 98 415-518 133-250 (384)
35 PF15167 DUF4581: Domain of un 62.7 1.9 4E-05 38.9 -0.7 14 140-153 31-44 (128)
36 KOG2287|consensus 60.4 6.5 0.00014 41.9 2.7 49 30-82 249-297 (349)
37 cd04196 GT_2_like_d Subfamily 58.2 35 0.00075 31.7 7.0 93 415-513 79-188 (214)
38 cd04192 GT_2_like_e Subfamily 58.0 38 0.00082 31.8 7.2 102 405-512 74-191 (229)
39 cd06434 GT2_HAS Hyaluronan syn 55.1 30 0.00065 33.0 6.1 106 415-521 77-204 (235)
40 cd06435 CESA_NdvC_like NdvC_li 50.1 54 0.0012 31.4 7.0 112 405-524 74-203 (236)
41 cd02525 Succinoglycan_BP_ExoA 50.0 83 0.0018 29.9 8.3 101 415-521 81-199 (249)
42 cd06437 CESA_CaSu_A2 Cellulose 49.5 42 0.00091 32.3 6.1 111 405-524 79-207 (232)
43 cd06439 CESA_like_1 CESA_like_ 47.4 64 0.0014 31.2 7.1 108 405-524 101-223 (251)
44 cd04187 DPM1_like_bac Bacteria 47.4 91 0.002 28.5 7.8 74 415-489 80-164 (181)
45 cd06427 CESA_like_2 CESA_like_ 46.8 54 0.0012 31.9 6.5 110 404-522 75-204 (241)
46 PRK11204 N-glycosyltransferase 46.6 59 0.0013 34.6 7.3 111 405-524 126-254 (420)
47 cd06433 GT_2_WfgS_like WfgS an 44.9 1.5E+02 0.0032 26.8 8.8 106 405-519 67-183 (202)
48 cd04184 GT2_RfbC_Mx_like Myxoc 43.2 86 0.0019 29.0 7.0 104 415-524 83-195 (202)
49 cd04179 DPM_DPG-synthase_like 42.7 1.1E+02 0.0024 27.7 7.5 74 415-488 79-167 (185)
50 cd06442 DPM1_like DPM1_like re 42.2 96 0.0021 29.1 7.3 73 415-488 78-167 (224)
51 cd02522 GT_2_like_a GT_2_like_ 39.9 92 0.002 29.2 6.7 97 415-518 72-175 (221)
52 PRK14716 bacteriophage N4 adso 37.9 65 0.0014 36.3 6.1 109 414-524 157-283 (504)
53 PLN03133 beta-1,3-galactosyltr 37.6 19 0.00041 41.6 1.9 47 30-78 536-582 (636)
54 PF00535 Glycos_transf_2: Glyc 35.4 32 0.0007 29.9 2.7 80 405-487 70-168 (169)
55 PF13704 Glyco_tranf_2_4: Glyc 34.2 40 0.00086 28.2 2.9 28 404-432 61-88 (97)
56 cd06436 GlcNAc-1-P_transferase 33.0 75 0.0016 29.9 4.9 73 415-487 89-178 (191)
57 cd00761 Glyco_tranf_GTA_type G 32.0 1.5E+02 0.0033 24.7 6.3 72 415-513 77-149 (156)
58 PRK14583 hmsR N-glycosyltransf 31.5 1.3E+02 0.0027 32.9 6.9 111 405-524 147-275 (444)
59 cd06438 EpsO_like EpsO protein 31.3 1.1E+02 0.0024 28.2 5.7 73 414-486 80-169 (183)
60 cd04191 Glucan_BSP_ModH Glucan 29.3 1.5E+02 0.0032 30.1 6.6 107 414-523 94-224 (254)
61 cd06423 CESA_like CESA_like is 26.4 1.4E+02 0.0031 25.6 5.3 26 415-440 78-103 (180)
62 cd02510 pp-GalNAc-T pp-GalNAc- 26.0 1.8E+02 0.0039 29.4 6.6 27 415-441 83-109 (299)
63 PRK11234 nfrB bacteriophage N4 25.9 3.3E+02 0.0072 32.2 9.4 107 415-523 155-279 (727)
64 PLN03193 beta-1,3-galactosyltr 25.8 50 0.0011 36.3 2.6 40 33-76 302-341 (408)
65 PRK10714 undecaprenyl phosphat 23.2 2.7E+02 0.0058 29.2 7.4 75 415-489 90-174 (325)
66 TIGR03030 CelA cellulose synth 22.6 1.9E+02 0.0042 33.8 6.7 108 405-522 220-350 (713)
67 PF08209 Sgf11: Sgf11 (transcr 22.2 31 0.00067 24.7 0.1 11 50-60 18-28 (33)
68 COG1216 Predicted glycosyltran 21.8 2.8E+02 0.006 28.4 7.1 112 404-524 76-214 (305)
69 COG1215 Glycosyltransferases, 20.3 2.7E+02 0.0058 29.4 6.8 109 405-523 129-258 (439)
No 1
>PF02434 Fringe: Fringe-like; InterPro: IPR003378 The Notch receptor is a large, cell surface transmembrane protein involved in a wide variety of developmental processes in higher organisms []. It becomes activated when its extracellular region binds to ligands located on adjacent cells. Much of this extracellular region is composed of EGF-like repeats, many of which can be O-fucosylated. A number of these O-fucosylated repeats can in turn be further modified by the action of a beta-1,3-N-acetylglucosaminyltransferase enzyme known as Fringe []. Fringe potentiates the activation of Notch by Delta ligands, while inhibiting activation by Serrate/Jagged ligands. This regulation of Notch signalling by Fringe is important in many processes []. Four distinct Fringe proteins have so far been studied in detail; Drosophila Fringe (Dfng) and its three mammalian homologues Lunatic Fringe (Lfng), Radical Fringe (Rfng) and Manic Fringe (Mfng). Dfng, Lfng and Rfng have all been shown to play important roles in developmental processes within their host, though the phenotype of mutants can vary between species e.g. Rfng mutants are retarded in wing development in chickens, but have no obvious phenotype in mice [, , ]. Mfng mutants have not, so far, been charcterised. Biochemical studies indicate that the Fringe proteins are fucose-specific transferases requiring manganese for activity and utilising UDP-N-acetylglucosamine as a donor substrate []. The three mammalian proteins show distinct variations in their catalytic efficiencies with different substrates. Dfng is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands which is localised to the Golgi apparatus [] (not secreted as previously thought). Modification of Notch occurs through glycosylation by Dfng. This entry consists of Fringe proteins and related glycosyltransferase enzymes including: Beta-1,3-glucosyltransferase, which glucosylates O-linked fucosylglycan on thrombospondin type 1 repeat domains []. Core 1 beta1,3-galactosyltransferase 1, generates the core T antigen, which is a precursor for many extended O-glycans in glycoproteins and plays a central role in many processes, such as angiogenesis, thrombopoiesis and kidney homeostasis development []. ; GO: 0016757 transferase activity, transferring glycosyl groups, 0016020 membrane; PDB: 2J0B_A 2J0A_A.
Probab=99.96 E-value=1.3e-30 Score=261.29 Aligned_cols=188 Identities=28% Similarity=0.455 Sum_probs=110.1
Q ss_pred CCceEEEEEEcCCCChHHHHHHHHHhhhccCCCcEEE-EecCCCCCCCCCCCCCc-----EEecCCCCCCC-chHHHHHH
Q psy6218 332 ESGLLFVGVMTANSYLNTRATSVYDTWARDIPGRVAF-FTSESSTLPAARPDLPL-----VKLRGVDDSYP-PQKKSFLM 404 (533)
Q Consensus 332 ~~~~LlI~V~Tsp~~~~tRr~AIR~TWgr~~~~rvvF-fsg~~~~i~~~d~~lpv-----V~L~~~dD~Yn-lt~Kt~~~ 404 (533)
...+|+|+|+|++++|++|+++|++||+++|+ ++.| |++.++ ++++. +..++.++.+. .+.+.++.
T Consensus 4 ~~~dI~i~V~T~~k~h~tR~~~I~~TW~~~~~-~~~~ifsd~~d------~~l~~~~~~~l~~~~~~~~~~~~~~~~~~~ 76 (252)
T PF02434_consen 4 TLDDIFIAVKTTKKFHKTRAPAIKQTWAKRCN-KQTFIFSDAED------PSLPTVTGVHLVNPNCDAGHCRKTLSCKMA 76 (252)
T ss_dssp -GGGEEEEEE--GGGTTTTHHHHHHTGGGGSG-GGEEEEESS--------HHHHHHHGGGEEE-------------HHHH
T ss_pred ccccEEEEEEeCHHHHHHHHHHHHHHHHhhcC-CceEEecCccc------cccccccccccccCCCcchhhHHHHHHHHH
Confidence 45799999999999999999999999999997 5556 687653 33432 23444444432 22233323
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCcccc-cc----ccCCCCCCCccCCCceee
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEF-GL----LSLEYDENFCMGGPGVIM 479 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey-~~----~~~~~~~~Y~sGGAGYVL 479 (533)
..|-+ ++.+++|||+++||||||+++||+++|.+|||++|+|+|+++.....+- .. ........|++|||||||
T Consensus 77 ~~y~~-~~~~~~~Wf~~~DDDtyv~~~~L~~~L~~~~~~~~~yiG~~~~~~~~~~~~~~~~~~~~~~~~~f~~GGaG~vl 155 (252)
T PF02434_consen 77 YEYDH-FLNSDKDWFCFADDDTYVNVENLRRLLSKYDPSEPIYIGRPSGDRPIEIIHRFNPNKSKDSGFWFATGGAGYVL 155 (252)
T ss_dssp HHHHH-HHHHT-SEEEEEETTEEE-HHHHHHHHTTS-TTS--EEE-EE----------------------EE-GGG-EEE
T ss_pred HHHHh-hhcCCceEEEEEeCCceecHHHHHHHHhhCCCccCEEeeeeccCccceeeccccccccCcCceEeeCCCeeHHH
Confidence 33321 2234789999999999999999999999999999999999875432110 00 011235689999999999
Q ss_pred CHHHHHHHHHhHHHhhcc----CCCCchhHHHHHHHHhccCCceeecCCCcc
Q psy6218 480 SRATLALVAPHIKYCLKN----LYTTHEDVELGRCVQKFAGIPCTWSYEVSI 527 (533)
Q Consensus 480 Sr~aLrkL~~~~~~C~~~----~~~~~EDV~LG~CL~~~lGV~~t~s~e~~~ 527 (533)
|+++|++|.+....|... ....+||+.||.|++.++||+++++..+++
T Consensus 156 Sr~~~~k~~~~~~~~~~~~~~~~~~~~dD~~lG~ci~~~lgv~lt~s~~fhs 207 (252)
T PF02434_consen 156 SRALLKKMSPWASGCKCPSTDEKIRLPDDMTLGYCIENLLGVPLTHSPLFHS 207 (252)
T ss_dssp EHHHHHHHHHHHTT-TTS--TTTTTS-HHHHHHHHHHHTT---EEE-TT---
T ss_pred hHHHHHHHhhhcccccccCCcCCCCCcccChhhhhHHhcCCcceeechhhcc
Confidence 999999998765544322 124589999999999989999999988765
No 2
>KOG2246|consensus
Probab=99.96 E-value=1.3e-29 Score=265.73 Aligned_cols=186 Identities=27% Similarity=0.477 Sum_probs=156.3
Q ss_pred CCCCCceEEEEEEcCCCChHHHHHHHHHhhhccCCCcEEEEecCCCCCCCCCCCCCcEEecCCCCCCCchHHHHHHHHHH
Q psy6218 329 GLSESGLLFVGVMTANSYLNTRATSVYDTWARDIPGRVAFFTSESSTLPAARPDLPLVKLRGVDDSYPPQKKSFLMLQYM 408 (533)
Q Consensus 329 ~l~~~~~LlI~V~Tsp~~~~tRr~AIR~TWgr~~~~rvvFfsg~~~~i~~~d~~lpvV~L~~~dD~Ynlt~Kt~~~L~~l 408 (533)
.+..+.+|+|+|+|++.++.+|++++.+||+++|+ +..|++..- .+.+..++.|.++..+..++++.|+..+++|+
T Consensus 86 ~l~r~~~v~cwv~t~~~~~~~~~~~v~~TW~~rc~-~~~f~s~~~---s~~~~~f~~v~~~~~~g~~~~~~ktr~~~~yv 161 (364)
T KOG2246|consen 86 WLSRSGRVLCWVLTSPMRHVTRADAVKETWLKRCD-KGIFFSPTL---SKDDSRFPTVYYNLPDGYRSLWRKTRIAFKYV 161 (364)
T ss_pred ccCCCceEEEEEEecCcCceeehhhhhcccccccC-cceecCccC---CCCCCcCceeeccCCcchHHHHHHHHHHHHHH
Confidence 34678899999999999999999999999999997 899998431 12356788887754433346899999999999
Q ss_pred HHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCccccccccCCCCCCCccCCCceeeCHHHHHHHH
Q psy6218 409 WERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVA 488 (533)
Q Consensus 409 ~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~ 488 (533)
++++.+++|||+|+|||||+++|||+++|.+|||++|+|+|+..... ....|.+|||||++|+++++.++
T Consensus 162 ~~~~~~~~dWf~~aDDDTy~i~eNLr~~L~~yDp~~p~YiG~~~~~~----------~~~~y~~g~ag~~ls~aa~~~la 231 (364)
T KOG2246|consen 162 YDHILKDYDWFLKADDDTYFIMENLRYVLSKYDPEKPVYLGYRSKSY----------FQNGYSSGGAGYVLSFAALRRLA 231 (364)
T ss_pred HHhccCCCCeEEeccCCeEEeHHHHHHHHhhcCCCCcEEeccccccc----------cccccccCCCCcceeHHHHHHHH
Confidence 99998999999999999999999999999999999999999987642 12339999999999999999998
Q ss_pred HhH----HHhhccCCCCchhHHHHHHHHhccCCceeecCCCcccc
Q psy6218 489 PHI----KYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYEVSITS 529 (533)
Q Consensus 489 ~~~----~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e~~~~~ 529 (533)
+.+ +.|..+.....||+.||+||++ +||++++++|.....
T Consensus 232 ~~l~~~~~~C~~~~~~~~eD~~i~~Cl~~-~GV~~~d~~d~dg~~ 275 (364)
T KOG2246|consen 232 ERLLNNEDKCPQRYPSYGEDRRIGRCLAE-VGVPATDERDEDGRG 275 (364)
T ss_pred HHHhcchhhcccccCCchhHHHHHHHHHH-hCCCccCchhhhccc
Confidence 754 5687652225999999999999 899999997765543
No 3
>PLN03133 beta-1,3-galactosyltransferase; Provisional
Probab=99.94 E-value=8.3e-26 Score=249.41 Aligned_cols=232 Identities=19% Similarity=0.216 Sum_probs=163.7
Q ss_pred cccccccceeeeehhcccccccccccccccccCCCCCCCCCCCCCCCCCCCCCCcccccCCCCCceEEEEEEcCCCChHH
Q psy6218 270 VGLSESGLLFVGVMTANSYLNTRATSVYDTWARDIPATQGQTDGWSSSGGEKGRGISDVGLSESGLLFVGVMTANSYLNT 349 (533)
Q Consensus 270 ~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~~~~~~p~~~~q~~~~~~~~~~~~~~~~~~~l~~~~~LlI~V~Tsp~~~~t 349 (533)
..|+-+|++.+-+|.+..++.+.+. .+.-++ +.|.+ .++....+.+|+|+|.|++++++
T Consensus 341 ~~l~V~GDv~l~SV~a~~~p~~~~~----~~~~d~-------e~lkA---------ppL~~~~~~~LlI~V~Sap~nf~- 399 (636)
T PLN03133 341 SEVRISGDLKLISVLASGLPTSEDS----EHVIDL-------EALKS---------PPLSPKKPLDLFIGVFSTANNFK- 399 (636)
T ss_pred eEEEEeCcEEEEEEEeeCCCCCCch----hcccch-------HHhcC---------CCCCCCCceEEEEEEeCCcccHH-
Confidence 4578889999999999977654431 222222 22211 11212355799999999999986
Q ss_pred HHHHHHHhhhccCC-----CcEEEEecCCCCCC------CCCCCCCcEEecCCCCCC-CchHHHHHHHHHHHHHhCCCCc
Q psy6218 350 RATSVYDTWARDIP-----GRVAFFTSESSTLP------AARPDLPLVKLRGVDDSY-PPQKKSFLMLQYMWERYGDKYE 417 (533)
Q Consensus 350 Rr~AIR~TWgr~~~-----~rvvFfsg~~~~i~------~~d~~lpvV~L~~~dD~Y-nlt~Kt~~~L~~l~e~~~~~~d 417 (533)
||+|||+|||+... .+++|++|.+.+.. .+...+..|.+.++.|+| |+++|+++++.|.. ++ .+++
T Consensus 400 rR~AIR~TWg~~~~~~~~~v~~rFvVG~s~n~~l~~~L~~Ea~~ygDIIq~dF~DsY~NLTlKtl~~~~wa~-~c-~~ak 477 (636)
T PLN03133 400 RRMAVRRTWMQYDAVRSGAVAVRFFVGLHKNQMVNEELWNEARTYGDIQLMPFVDYYSLITWKTLAICIFGT-EV-VSAK 477 (636)
T ss_pred HHHHHHHhhccccccCCCceEEEEEEecCCcHHHHHHHHHHHHHcCCeEEEeeechhhhhHHHHHHHHHHHH-hC-CCce
Confidence 78999999998531 25789999764210 122233344444577777 57999999999874 34 5899
Q ss_pred eEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCC---c---ccccc---cc-CCCCCCCccCCCceeeCHHHHHHH
Q psy6218 418 WFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGN---Q---EEFGL---LS-LEYDENFCMGGPGVIMSRATLALV 487 (533)
Q Consensus 418 WFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~---~---~ey~~---~~-~~~~~~Y~sGGAGYVLSr~aLrkL 487 (533)
|+||+|||+|||+++|.++|+..++.+.+|+|+..... + ..|+. .+ ...+|+||+| +|||||++++++|
T Consensus 478 FilK~DDDvFVnv~~Ll~~L~~~~~~~~Ly~G~v~~~~~PiRd~~sKWYVs~~eyp~~~YPpYasG-~gYVlS~Dla~~L 556 (636)
T PLN03133 478 YVMKTDDDAFVRVDEVLASLKRTNVSHGLLYGLINSDSQPHRNPDSKWYISPEEWPEETYPPWAHG-PGYVVSRDIAKEV 556 (636)
T ss_pred EEEEcCCceEEcHHHHHHHHHhcCCCCceEEEEeccCCCcccCCCCCCCCCHHHCCCCCCCCCCCc-CEEEEcHHHHHHH
Confidence 99999999999999999999988778889999875421 1 22321 12 2368999975 9999999999999
Q ss_pred HHhHHHhhccCCCCchhHHHHHHHHhc--cCCceeecCCCc
Q psy6218 488 APHIKYCLKNLYTTHEDVELGRCVQKF--AGIPCTWSYEVS 526 (533)
Q Consensus 488 ~~~~~~C~~~~~~~~EDV~LG~CL~~~--lGV~~t~s~e~~ 526 (533)
+....... -++++.|||++|.|++.+ +|+++.+..+..
T Consensus 557 ~~~s~s~~-l~~f~lEDVyvGi~l~~l~k~gl~v~~~~~~r 596 (636)
T PLN03133 557 YKRHKEGR-LKMFKLEDVAMGIWIAEMKKEGLEVKYENDGR 596 (636)
T ss_pred HHhhhhcc-cCcCChhhHhHHHHHHHhcccCCCceeeCCCc
Confidence 97653221 246889999999998642 577776666543
No 4
>PLN03193 beta-1,3-galactosyltransferase; Provisional
Probab=99.93 E-value=7.3e-26 Score=237.99 Aligned_cols=187 Identities=18% Similarity=0.126 Sum_probs=140.9
Q ss_pred CCCceEEEEEEcCCCChHHHHHHHHHhhhccCC----------CcEEEEecCCCCC-C-------CCCCCCCcEEecCCC
Q psy6218 331 SESGLLFVGVMTANSYLNTRATSVYDTWARDIP----------GRVAFFTSESSTL-P-------AARPDLPLVKLRGVD 392 (533)
Q Consensus 331 ~~~~~LlI~V~Tsp~~~~tRr~AIR~TWgr~~~----------~rvvFfsg~~~~i-~-------~~d~~lpvV~L~~~d 392 (533)
..+..|+|+|.|++++++ ||+|||+||++... .+++|++|.+.+. . .+......+.+.++.
T Consensus 136 ~~~~~LvIgI~Sap~~~~-RR~AIR~TWg~~~~~~~kle~~~gv~vrFVIG~s~~~~~~ldr~Le~Ea~~ygDIL~lDfv 214 (408)
T PLN03193 136 KRRYLMVVGINTAFSSRK-RRDSVRATWMPQGEKRKKLEEEKGIIIRFVIGHSATSGGILDRAIEAEDRKHGDFLRLDHV 214 (408)
T ss_pred cceEEEEEEEeCCCCCHH-HHHHHHHHHcCCcccccccccCCcEEEEEEeecCCCcchHHHHHHHHHHHHhCCEEEEecc
Confidence 456799999999999986 88999999997421 3688999987420 0 112223333333466
Q ss_pred CCC-CchHHHHHHHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCc-----ccccc--cc-
Q psy6218 393 DSY-PPQKKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQ-----EEFGL--LS- 463 (533)
Q Consensus 393 D~Y-nlt~Kt~~~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~-----~ey~~--~~- 463 (533)
|.| |++.|++++++|+++.+ +++|+||+|||+|||+++|..+|++......+|+|....+.. ..|+. .+
T Consensus 215 DsY~NLT~KTl~~f~wA~~~~--dAkF~mK~DDDvfVnv~~L~~~L~~~~~~~rlYiG~m~~gPvr~~~~~ky~epe~w~ 292 (408)
T PLN03193 215 EGYLELSAKTKTYFATAVAMW--DADFYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVLSQKGVRYHEPEYWK 292 (408)
T ss_pred cccccchHHHHHHHHHHHHcC--CCeEEEEcCCCceEcHHHHHHHHHhcCCCCCEEEEecccCccccCCCCcCcCccccc
Confidence 777 68999999999998764 899999999999999999999998776666799999754321 11110 11
Q ss_pred ----CCCCCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCCC
Q psy6218 464 ----LEYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYEV 525 (533)
Q Consensus 464 ----~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e~ 525 (533)
...+|+||+| +|||||+++++.|+.+...- ..+..|||+||.||.. ++|...|++.+
T Consensus 293 ~~~~~~~YPpyAsG-~gYVlS~DLa~~I~~n~~~L---~~y~~EDV~vG~Wl~~-L~V~~vdd~~f 353 (408)
T PLN03193 293 FGENGNKYFRHATG-QLYAISKDLASYISINQHVL---HKYANEDVSLGSWFIG-LDVEHIDDRRL 353 (408)
T ss_pred ccCccccCCCCCCc-ceEEehHHHHHHHHhChhhh---cccCcchhhhhhHhcc-CCceeeecccc
Confidence 1357999976 99999999999999765543 2367999999999987 78888777765
No 5
>KOG2287|consensus
Probab=99.92 E-value=1.2e-24 Score=227.75 Aligned_cols=186 Identities=19% Similarity=0.252 Sum_probs=147.6
Q ss_pred CceEEEEEEcCCCChHHHHHHHHHhhhccCC-----CcEEEEecCCCCCC-------CCCCCCCcEEecCCCCCC-CchH
Q psy6218 333 SGLLFVGVMTANSYLNTRATSVYDTWARDIP-----GRVAFFTSESSTLP-------AARPDLPLVKLRGVDDSY-PPQK 399 (533)
Q Consensus 333 ~~~LlI~V~Tsp~~~~tRr~AIR~TWgr~~~-----~rvvFfsg~~~~i~-------~~d~~lpvV~L~~~dD~Y-nlt~ 399 (533)
..+|+++|.|.+++++ ||+|||+|||+... .+++|++|...... .+.+....|.+.++.|+| |++.
T Consensus 94 ~~~lLl~V~S~~~~fa-rR~aiR~TW~~~~~v~~~~v~~~FLvG~~~~~~~~~~~l~~Ea~~ygDIi~~df~Dty~nltl 172 (349)
T KOG2287|consen 94 PPELLLLVKSAPDNFA-RRNAIRKTWGNENNVRGGRVRVLFLVGLPSNEDKLNKLLADEARLYGDIIQVDFEDTYFNLTL 172 (349)
T ss_pred CceEEEEEecCCCCHH-HHHHHHHHhcCccccCCCcEEEEEEecCCCcHHHHHHHHHHHHHHhCCEEEEecccchhchHH
Confidence 4789999999999997 77999999999764 37899999876310 111223445555677777 6899
Q ss_pred HHHHHHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcC-CCCCCeEEEeecCCC------cccccc---ccC-CCCC
Q psy6218 400 KSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSV-DSSKPQFIGQAGRGN------QEEFGL---LSL-EYDE 468 (533)
Q Consensus 400 Kt~~~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~-dpseplYiG~~~~~~------~~ey~~---~~~-~~~~ 468 (533)
|++++++|..+++ +++++++|+|||+||++++|..+|++. ++++.+|+|+..... ...|+. .++ ..+|
T Consensus 173 Ktl~~l~w~~~~c-p~akfi~K~DDDvfv~~~~L~~~L~~~~~~~~~~~~G~v~~~~~p~R~~~~KwyVp~~~y~~~~YP 251 (349)
T KOG2287|consen 173 KTLAILLWGVSKC-PDAKFILKIDDDVFVNPDNLLEYLDKLNDPSSDLYYGRVIQNAPPIRDKTSKWYVPESEYPCSVYP 251 (349)
T ss_pred HHHHHHHHHHhcC-CcceEEEeccCceEEcHHHHHHHHhccCCCCcceEEEeecccCCCCCCCCCCCccCHHHCCCCCCC
Confidence 9999999998776 599999999999999999999999999 899999999976531 122221 123 3689
Q ss_pred CCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 469 NFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 469 ~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
+||+| +|||+|++++++|++.... ..++..|||++|.|+++.+||.+.+...
T Consensus 252 ~Y~sG-~gYvis~~~a~~l~~~s~~---~~~~~iEDV~~g~~l~~~~gi~~~~~~~ 303 (349)
T KOG2287|consen 252 PYASG-PGYVISGDAARRLLKASKH---LKFFPIEDVFVGGCLAEDLGIKPVNHPG 303 (349)
T ss_pred CcCCC-ceeEecHHHHHHHHHHhcC---CCccchHHHHHHHHHHHhcCCCcccCcc
Confidence 99975 9999999999999985433 3568899999999999955999887766
No 6
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=99.90 E-value=1.4e-23 Score=201.38 Aligned_cols=167 Identities=22% Similarity=0.335 Sum_probs=128.2
Q ss_pred HHHHHHHhhhccCC-----CcEEEEecCCC--CC------CCCCCCCCcEEecCCCCCC-CchHHHHHHHHHHHHHhCCC
Q psy6218 350 RATSVYDTWARDIP-----GRVAFFTSESS--TL------PAARPDLPLVKLRGVDDSY-PPQKKSFLMLQYMWERYGDK 415 (533)
Q Consensus 350 Rr~AIR~TWgr~~~-----~rvvFfsg~~~--~i------~~~d~~lpvV~L~~~dD~Y-nlt~Kt~~~L~~l~e~~~~~ 415 (533)
||++||+||++... .+++|++|.+. +. .++.+....+.+.++.|+| |++.|++++++|+.++| ++
T Consensus 2 rR~~IR~TW~~~~~~~~~~~~~~FvvG~~~~~~~~~~~~l~~E~~~y~Dil~~d~~D~y~nlt~K~~~~~~w~~~~c-~~ 80 (195)
T PF01762_consen 2 RRQAIRETWGNQRNFKGVRVKVVFVVGESPNSDSDLQEALQEEAEKYGDILQGDFVDSYRNLTLKTLAGLKWASKHC-PN 80 (195)
T ss_pred hHHHHHHHHhcccccCCCcEEEEEEEecCCCCcHHHHHHhhhhhhhcCceEeeecccccchhhHHHHHHHHHHHhhC-Cc
Confidence 78999999998753 37899999876 21 1223334444444566666 68999999999999876 58
Q ss_pred CceEEEEcCCccccHHHHHHHHhcC--CCCCCeEEEeecCCCc------ccccc---cc-CCCCCCCccCCCceeeCHHH
Q psy6218 416 YEWFMRADDDLYVRGERLARMLRSV--DSSKPQFIGQAGRGNQ------EEFGL---LS-LEYDENFCMGGPGVIMSRAT 483 (533)
Q Consensus 416 ~dWFlKvDDDTYVnvdNL~~~L~~~--dpseplYiG~~~~~~~------~ey~~---~~-~~~~~~Y~sGGAGYVLSr~a 483 (533)
++|++|+|||+|||+++|..+|... ++.+..+.|....... ..++. .+ ...+|+||+| +||+||+++
T Consensus 81 ~~~v~k~DDD~~vn~~~l~~~L~~~~~~~~~~~~~g~~~~~~~~~r~~~~kw~v~~~~y~~~~yP~y~~G-~~yvls~~~ 159 (195)
T PF01762_consen 81 AKYVLKVDDDVFVNPDRLVSFLKSLKQDPSKNSIYGGCIKNGPPIRDPSSKWYVSEEEYPDDYYPPYCSG-GGYVLSSDV 159 (195)
T ss_pred hhheeecCcEEEEehHHhhhhhhhcccCccccccccccccCCccccccccCceeeeeecccccCCCcCCC-CeEEecHHH
Confidence 9999999999999999999999987 6667777787654321 11211 11 3367999985 899999999
Q ss_pred HHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeec
Q psy6218 484 LALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWS 522 (533)
Q Consensus 484 LrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s 522 (533)
++.|+.....- +.++.|||++|.|+++ +||+++|.
T Consensus 160 v~~i~~~~~~~---~~~~~eDv~iGi~~~~-~~i~~~~~ 194 (195)
T PF01762_consen 160 VKRIYKASSHT---PFFPLEDVFIGILAEK-LGIKPIHD 194 (195)
T ss_pred HHHHHHHhhcC---CCCCchHHHHHHHHHH-CCCCccCC
Confidence 99999875543 4578999999999999 89999875
No 7
>PLN03153 hypothetical protein; Provisional
Probab=99.88 E-value=1.3e-21 Score=210.20 Aligned_cols=193 Identities=16% Similarity=0.266 Sum_probs=142.4
Q ss_pred CCCceEEEEEEcCCCChHHHHHHHHHhhhccCCCcEEEEecCCCCCCCCCCCCCcEEecCCCCC--C----Cch--HHHH
Q psy6218 331 SESGLLFVGVMTANSYLNTRATSVYDTWARDIPGRVAFFTSESSTLPAARPDLPLVKLRGVDDS--Y----PPQ--KKSF 402 (533)
Q Consensus 331 ~~~~~LlI~V~Tsp~~~~tRr~AIR~TWgr~~~~rvvFfsg~~~~i~~~d~~lpvV~L~~~dD~--Y----nlt--~Kt~ 402 (533)
..-.+|+++|.++.+.+++|+..|+.||........+|+...... ...+..+|.+.+....+. | .+. ..+.
T Consensus 119 t~~~hIvF~I~~s~~~w~~R~~yik~wW~p~~~rg~v~ld~~~~~-~~~~~~~P~i~is~d~s~f~y~~~~Gh~sa~rI~ 197 (537)
T PLN03153 119 LSLNHIMFGIAGSSQLWKRRKELVRLWWRPNQMRGHVWLEEQVSP-EEGDDSLPPIMVSEDTSRFRYTNPTGHPSGLRIS 197 (537)
T ss_pred CccccEEEEEEEchhhhhhhhhhhhhhcCcccceeEEEecccCCC-CCCcCCCCCEEeCCCcccccccCCCCcHHHHHHH
Confidence 456799999999999999999999999998543245555333221 123566787776422221 2 122 2222
Q ss_pred HHHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCccccccccCCCCCCCccCCCceeeCHH
Q psy6218 403 LMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRA 482 (533)
Q Consensus 403 ~~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~ 482 (533)
.+....++...+++|||+++|||||++++||+++|++||+++++|+|........+. .....|++|||||+||++
T Consensus 198 rmv~et~~~~~pd~kWfVf~DDDTyf~~~NLv~~Ls~YDptkp~YIGs~Se~~~qn~-----~f~~~fA~GGAG~~LSrP 272 (537)
T PLN03153 198 RIVLESFRLGLPDVRWFVLGDDDTIFNADNLVAVLSKYDPSEMVYVGGPSESHSANS-----YFSHNMAFGGGGIAISYP 272 (537)
T ss_pred HHHHHHHHhhCCCCCEEEEecCCccccHHHHHHHHhhcCCCCCEEeccccccccccc-----ccccccccCCceEEEcHH
Confidence 233434444457999999999999999999999999999999999999876532111 012468899999999999
Q ss_pred HHHHHHHhHHHhhccC-CCCchhHHHHHHHHhccCCceeecCCCccccc
Q psy6218 483 TLALVAPHIKYCLKNL-YTTHEDVELGRCVQKFAGIPCTWSYEVSITSL 530 (533)
Q Consensus 483 aLrkL~~~~~~C~~~~-~~~~EDV~LG~CL~~~lGV~~t~s~e~~~~~~ 530 (533)
++++|.+..+.|.... ....+|..||+|+++ +||++++...++|.-.
T Consensus 273 Lae~L~~~~d~C~~rY~~~~~gD~rL~~CL~e-lGV~LT~~~gfhQ~D~ 320 (537)
T PLN03153 273 LAEALSRILDDCLDRYPKLYGSDDRLHACITE-LGVPLSREPGFHQWDI 320 (537)
T ss_pred HHHHHHHHhhhhhhhcccCCCcHHHHHHHHHH-cCCCceecCCcccccc
Confidence 9999999988997531 235789999999998 7999999999988643
No 8
>KOG3708|consensus
Probab=99.76 E-value=7.4e-19 Score=186.44 Aligned_cols=186 Identities=24% Similarity=0.404 Sum_probs=149.8
Q ss_pred CCCcccccCCCCCceEEEEEEcCCCChHHHHHHHHHhhhccCCCcEEEEecCCCCCCCCCCCCCcEEecCCCCCCCchHH
Q psy6218 321 KGRGISDVGLSESGLLFVGVMTANSYLNTRATSVYDTWARDIPGRVAFFTSESSTLPAARPDLPLVKLRGVDDSYPPQKK 400 (533)
Q Consensus 321 ~~~~~~~~~l~~~~~LlI~V~Tsp~~~~tRr~AIR~TWgr~~~~rvvFfsg~~~~i~~~d~~lpvV~L~~~dD~Ynlt~K 400 (533)
+.+|+++ +++.+.+|+++|+|.. +-+.+|.+|-+++++ ++.||.+.+.. ......+.+|.. .|. +.++.
T Consensus 14 RtRy~~t-ELG~RErl~~aVmte~----tlA~a~NrT~ahhvp-rv~~F~~~~~i-~~~~a~~~~vs~---~d~-r~~~~ 82 (681)
T KOG3708|consen 14 RTRYAAT-ELGTRERLMAAVMTES----TLALAINRTLAHHVP-RVHLFADSSRI-DNDLAQLTNVSP---YDL-RGQKT 82 (681)
T ss_pred hhhhhhh-hhhhHHHHHHHHHHHH----HHHHHHHHHHHhhcc-eeEEeeccccc-cccHhhccccCc---ccc-Ccccc
Confidence 4556665 4899999999999922 577899999999997 99999887642 000111222221 111 23567
Q ss_pred HHHHHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCccccccccCCCCCCCccCCCceeeC
Q psy6218 401 SFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMS 480 (533)
Q Consensus 401 t~~~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLS 480 (533)
..+.++++++++.++||||+.+.|+|||+...|++++..++-++++|||.-..+ ...=|++++||+||
T Consensus 83 ~s~vl~~l~~~~~~~YDwFll~~D~tYv~a~~L~~l~~hmsin~dlymGEe~~~------------gs~rC~l~~G~LLS 150 (681)
T KOG3708|consen 83 HSMVLGLLFNMVHNNYDWFLLAKDSTYVNAFVLLRLIDHMSINEDLYMGEEAED------------GSGRCRLDTGMLLS 150 (681)
T ss_pred HHHHHHHHHHhhccccceEEEecCcceecHHHHHHHHhhcccccccccchhhhC------------ccCccccccceeec
Confidence 778899999999999999999999999999999999999999999999954331 12229999999999
Q ss_pred HHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCCCc-ccc
Q psy6218 481 RATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYEVS-ITS 529 (533)
Q Consensus 481 r~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e~~-~~~ 529 (533)
+.++++|..+.+.|.......-+|+++|+|++..+||.|+...++. +++
T Consensus 151 ~s~l~~lrnnle~C~~~~lsad~d~~lgrCi~~At~v~C~~~hQGvrq~s 200 (681)
T KOG3708|consen 151 QSLLHALRNNLEGCRNDILSADPDEWLGRCIQDATGVGCKPLHQGVRQYS 200 (681)
T ss_pred HHHHHHHHhhHHHhhcccccCCcHHHHHHHHHHhhcCCccchhhhHHhhh
Confidence 9999999999999988777788899999999998999999999988 444
No 9
>KOG2288|consensus
Probab=99.75 E-value=4.2e-18 Score=168.75 Aligned_cols=184 Identities=21% Similarity=0.255 Sum_probs=135.8
Q ss_pred CCCceEEEEEEcCCCChHHHHHHHHHhhhccCC----------CcEEEEecCCCCC-------CCCCCCC-CcEEecCCC
Q psy6218 331 SESGLLFVGVMTANSYLNTRATSVYDTWARDIP----------GRVAFFTSESSTL-------PAARPDL-PLVKLRGVD 392 (533)
Q Consensus 331 ~~~~~LlI~V~Tsp~~~~tRr~AIR~TWgr~~~----------~rvvFfsg~~~~i-------~~~d~~l-pvV~L~~~d 392 (533)
..+...+|+|.|++...+ ||+++|+||....+ ..+.|++|..... .+++... ..+.|++..
T Consensus 8 ~~k~l~vigI~T~f~s~~-RR~~vR~TWmp~~~~l~rle~e~gv~~RFvIG~~~~g~~~~r~ie~E~~~~~DfllLd~h~ 86 (274)
T KOG2288|consen 8 RRKVLLVIGINTAFSSRK-RRDSVRQTWMPSGEGLKRLEEEKGVIIRFVIGTATLGASLDRALEEENAQHGDFLLLDRHE 86 (274)
T ss_pred ccceEEEEEeecccchhh-hHHHHHHhhcCCccchhhhccccceEEEEEeccCCccHHHHHHHHHHHHhcCCeEeechhH
Confidence 357789999999999876 88999999998732 1378999984321 1112222 345566555
Q ss_pred CCC-CchHHHHHHHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCc--ccccccc------
Q psy6218 393 DSY-PPQKKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQ--EEFGLLS------ 463 (533)
Q Consensus 393 D~Y-nlt~Kt~~~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~--~ey~~~~------ 463 (533)
+.| ++..|++.++.++..++ +++||+|+|||+||+++.|...|+++.....+|+|..-.+.. ++-++++
T Consensus 87 E~Y~~Ls~Kt~~~f~~A~~~~--daeFyvKvDDDv~v~l~~L~~~la~~r~~pr~YiGcmksg~v~~~~~~kw~EpeWkf 164 (274)
T KOG2288|consen 87 EAYEELSAKTKAFFSAAVAHW--DAEFYVKVDDDVYVRLARLGTLLARERSHPRLYIGCMKSGPVLTQPGGKWYEPEWKF 164 (274)
T ss_pred HHHHHHHHHHHHHHHHHHHhc--cceEEEEccccceecHHHHHHHHHhhccCCceEEEEecCCccccCCCCcccChhhhc
Confidence 667 58999999999999988 899999999999999999999999987667899999754322 1011111
Q ss_pred CC--CCCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeec
Q psy6218 464 LE--YDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWS 522 (533)
Q Consensus 464 ~~--~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s 522 (533)
.+ .+.+|+.| +||+||++++..|.-+.+.-. .+..|||.+|.|+-- +.|.-.|+
T Consensus 165 g~~g~YfrhA~G-~~YvlS~dLa~yi~in~~lL~---~y~nEDVSlGaW~~g-ldV~h~dd 220 (274)
T KOG2288|consen 165 GDNGNYFRHATG-GGYVLSKDLATYISINRQLLH---KYANEDVSLGAWMIG-LDVEHVDD 220 (274)
T ss_pred CcccccchhccC-ceEEeeHHHHHHHHHhHHHHH---hhccCCcccceeeee-eeeeEecC
Confidence 11 26788876 899999999999998765532 268999999999964 44444443
No 10
>PTZ00210 UDP-GlcNAc-dependent glycosyltransferase; Provisional
Probab=99.66 E-value=6.3e-16 Score=161.38 Aligned_cols=176 Identities=15% Similarity=0.213 Sum_probs=121.3
Q ss_pred CCceEEEEEEcCCCC-hHHHHHHHHHhhhccCC---------C--cEEEEecCCCCC----C----CCCCCCC-cEEecC
Q psy6218 332 ESGLLFVGVMTANSY-LNTRATSVYDTWARDIP---------G--RVAFFTSESSTL----P----AARPDLP-LVKLRG 390 (533)
Q Consensus 332 ~~~~LlI~V~Tsp~~-~~tRr~AIR~TWgr~~~---------~--rvvFfsg~~~~i----~----~~d~~lp-vV~L~~ 390 (533)
..-.++++|.|..++ ..+||.++|+||-+-.. + -+.|+++...+. . ++..... +|.++
T Consensus 78 ~~~lv~~Gi~S~d~~~r~~rR~lqr~t~w~y~~va~~~n~ftg~~lv~y~l~~H~~~~~~~~~~L~eEA~~~~DIVilp- 156 (382)
T PTZ00210 78 QRFLAVLGIPSVDNSERSRRRDLQRQTCWKYSGVATRSNNFSGSLLPLYLLAPHQSNSYLISHSLKEEAARTHDIITLP- 156 (382)
T ss_pred CCceEEEeccCCCchHHHHHHHHHHhhhhcchhhhhhccCCchhhhhhhhhccCCccchhhhHHHHHHHHHhCCEEEEe-
Confidence 345778889888765 23588999999987421 1 256777765321 0 1111122 33332
Q ss_pred CC------------------CCC-CchHHHHHHHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEee
Q psy6218 391 VD------------------DSY-PPQKKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQA 451 (533)
Q Consensus 391 ~d------------------D~Y-nlt~Kt~~~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~ 451 (533)
+. |.+ ++++|+++.++|++..+ ++++|+||+|||+||+++++..+|+.. |+..+|+|+.
T Consensus 157 f~d~~~tTnKkiG~~g~WG~e~e~~mT~KT~l~~~wA~~~c-P~a~YImKgDDDvFVrVp~lL~~Lr~~-prr~LY~G~v 234 (382)
T PTZ00210 157 TNDVSPSTRKKIGENGNWGIEAEVAMSRKTYLWLRFALHMF-PNVSYIVKGDDDIFIRVPKYLADLRVM-PRHGLYMGRY 234 (382)
T ss_pred cccCccccccccccCCcccchhhcchhHHHHHHHHHHHHhC-CCCCeEEEcCCCeEeeHHHHHHHHhhC-CCCceEEEee
Confidence 21 222 36899999999998876 599999999999999999999999664 6677999997
Q ss_pred cCCCccccccccCCCCCCCccCCCceeeCHHHHHHHHHhHHHhhc--------------cCCCCchhHHHHHHHHhccC
Q psy6218 452 GRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKYCLK--------------NLYTTHEDVELGRCVQKFAG 516 (533)
Q Consensus 452 ~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~--------------~~~~~~EDV~LG~CL~~~lG 516 (533)
..... ......++||+| .||+||+++++.|++......- ......||+.+|.+|+..++
T Consensus 235 ~~~~~-----p~Rd~~PpY~~G-~gYvLSrDVA~~Lvs~~pl~rL~~~pys~~~~~~y~~~~~~~EDiMvG~vLr~~~k 307 (382)
T PTZ00210 235 NYYNR-----IWRRNQLTYVNG-YCITLSRDTAQAIISYKPLERLVNMPFSMWDYFDFLDLGMFYEDVMVGMILREKVV 307 (382)
T ss_pred CCCCc-----cccCCCCCcccc-ceeeccHHHHHHHHhhChHhHhhcCCCchHHHHHHHHhhcCchHHHHHHHHHHhcC
Confidence 65321 111235899975 9999999999999976322110 02368999999999975443
No 11
>KOG3708|consensus
Probab=99.64 E-value=6.9e-17 Score=171.64 Aligned_cols=153 Identities=18% Similarity=0.271 Sum_probs=103.0
Q ss_pred ccceeeccCCCCCcchhcccCCccCCCcccCCCceeeeHHHHHHhcccHHHHHhhccCCCCccccchhhhhhcCCccccc
Q psy6218 7 RDKSFYTHGGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPFLSS 86 (533)
Q Consensus 7 ~~~~~~g~~~~g~~~~~~~~~~~~~~~yC~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s~~~D~~lGRCi~~~~~~~C~~~ 86 (533)
..+||||. |+ .+ +..-|.++.||+||+++|++|+|||+.|++++.|.++|+||||||++++||.|+..
T Consensus 125 n~dlymGE-------e~--~~---gs~rC~l~~G~LLS~s~l~~lrnnle~C~~~~lsad~d~~lgrCi~~At~v~C~~~ 192 (681)
T KOG3708|consen 125 NEDLYMGE-------EA--ED---GSGRCRLDTGMLLSQSLLHALRNNLEGCRNDILSADPDEWLGRCIQDATGVGCKPL 192 (681)
T ss_pred ccccccch-------hh--hC---ccCccccccceeecHHHHHHHHhhHHHhhcccccCCcHHHHHHHHHHhhcCCccch
Confidence 35899993 44 22 22229999999999999999999999999999999999999999999999999998
Q ss_pred c---------cccc---c-cccccCCcccccccccccc------eeeccc-------c------CCCCCcccccceeeec
Q psy6218 87 R---------NRRE---D-VDMIGESQPLRKQTSVQDK------IRLGRQ-------S------NPRPSTYKASEVSLST 134 (533)
Q Consensus 87 ~---------~~~~---~-~~~~~~~~~f~~a~tv~~~------~~~~~~-------~------~~~~~~~~~~~~~~~~ 134 (533)
+ +..+ | +.+.+.+++|++|+||||- -+||.- + -=.-..|.-++.+.+.
T Consensus 193 hQGvrq~s~~~dspgr~~~~~e~~~s~aFr~A~tv~pv~~p~d~yrLH~yfsr~elera~~ei~~leaei~rla~~~~~~ 272 (681)
T KOG3708|consen 193 HQGVRQYSEREDSPGRHDSIPEWEGSPAFRSALTVHPVLSPADMYRLHKYFSRVELERAYREIIKLEAEISRLAELEARG 272 (681)
T ss_pred hhhHHhhhHhhcCCCccccchhhcCChHHhhhhccCccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccC
Confidence 2 1112 2 3333999999999999996 233321 0 0001122334445555
Q ss_pred eeecccccchhhhhhcccccchhhhhhh-hhcCCCcccc
Q psy6218 135 KFKLGLWSPAIELFFLCPQDFAWASTWA-SSCGRNFSVS 172 (533)
Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 172 (533)
+-..--|-+++---+--|.-|. ..+|- .-=+|||+-+
T Consensus 273 ~~~ai~WP~glPpps~P~sR~e-VltWdyft~th~F~~~ 310 (681)
T KOG3708|consen 273 TGEAISWPVGLPPPSKPPSRFE-VLTWDYFTETHLFSCA 310 (681)
T ss_pred ccceecCCCCCCCCCCCcchhe-eeehhhhccccccccC
Confidence 5444457777655555454443 23443 2345666654
No 12
>PF05679 CHGN: Chondroitin N-acetylgalactosaminyltransferase; InterPro: IPR008428 This family represents Chondroitin N-acetylgalactosaminyltransferase. Proteins have a type II transmembrane topology. The enzyme is involved in the biosynthetic initiation and elongation of chondroitin sulphate and is the key enzyme responsible for the selective chain assembly of chondroitin/dermatan sulphate on the linkage region tetrasaccharide common to various proteoglycans containing chondroitin/dermatan sulphate or heparin/heparan sulphate chains. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0032580 Golgi cisterna membrane
Probab=99.26 E-value=1.7e-12 Score=142.34 Aligned_cols=58 Identities=36% Similarity=0.661 Sum_probs=51.6
Q ss_pred cHHHHHhhccCCCCccccchhhhhhcCCccccc--------cccc-cccccccCCcccccccccccc
Q psy6218 54 HIKYCLKNLYTTHEDVELGRCVQKFAGIPFLSS--------RNRR-EDVDMIGESQPLRKQTSVQDK 111 (533)
Q Consensus 54 ~l~~C~~~~~s~~~D~~lGRCi~~~~~~~C~~~--------~~~~-~~~~~~~~~~~f~~a~tv~~~ 111 (533)
||++|+++++|+||||||||||++++||+||++ |++. +++...+++++|.+|+||||-
T Consensus 1 hl~~C~~~~~s~~~Dv~lGRCI~~~~gi~Ct~~~q~l~y~~~~~~~~~~~~~~~~~~~~~AiTlHPv 67 (499)
T PF05679_consen 1 HLDWCLKNIYSNHEDVELGRCIKKFTGISCTWSYQGLFYHNYELNKNDFIGDLKNKEFHNAITLHPV 67 (499)
T ss_pred ChhHHhhhcCCCCchhHHHHHHHHhcCCCeeecccceEEEeeccCCCcccccccchhhhcceeeccC
Confidence 899999999999999999999999999999999 4555 344444889999999999997
No 13
>PF02434 Fringe: Fringe-like; InterPro: IPR003378 The Notch receptor is a large, cell surface transmembrane protein involved in a wide variety of developmental processes in higher organisms []. It becomes activated when its extracellular region binds to ligands located on adjacent cells. Much of this extracellular region is composed of EGF-like repeats, many of which can be O-fucosylated. A number of these O-fucosylated repeats can in turn be further modified by the action of a beta-1,3-N-acetylglucosaminyltransferase enzyme known as Fringe []. Fringe potentiates the activation of Notch by Delta ligands, while inhibiting activation by Serrate/Jagged ligands. This regulation of Notch signalling by Fringe is important in many processes []. Four distinct Fringe proteins have so far been studied in detail; Drosophila Fringe (Dfng) and its three mammalian homologues Lunatic Fringe (Lfng), Radical Fringe (Rfng) and Manic Fringe (Mfng). Dfng, Lfng and Rfng have all been shown to play important roles in developmental processes within their host, though the phenotype of mutants can vary between species e.g. Rfng mutants are retarded in wing development in chickens, but have no obvious phenotype in mice [, , ]. Mfng mutants have not, so far, been charcterised. Biochemical studies indicate that the Fringe proteins are fucose-specific transferases requiring manganese for activity and utilising UDP-N-acetylglucosamine as a donor substrate []. The three mammalian proteins show distinct variations in their catalytic efficiencies with different substrates. Dfng is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands which is localised to the Golgi apparatus [] (not secreted as previously thought). Modification of Notch occurs through glycosylation by Dfng. This entry consists of Fringe proteins and related glycosyltransferase enzymes including: Beta-1,3-glucosyltransferase, which glucosylates O-linked fucosylglycan on thrombospondin type 1 repeat domains []. Core 1 beta1,3-galactosyltransferase 1, generates the core T antigen, which is a precursor for many extended O-glycans in glycoproteins and plays a central role in many processes, such as angiogenesis, thrombopoiesis and kidney homeostasis development []. ; GO: 0016757 transferase activity, transferring glycosyl groups, 0016020 membrane; PDB: 2J0B_A 2J0A_A.
Probab=98.76 E-value=1.7e-09 Score=108.99 Aligned_cols=87 Identities=22% Similarity=0.330 Sum_probs=46.4
Q ss_pred ccceeeccCCCCCcchhcc-----cCCccCCCcccCCCceeeeHHHHHHhcccHHHHHhhccC----CCCccccchhhhh
Q psy6218 7 RDKSFYTHGGRGNQEEFGL-----LSLEYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYT----THEDVELGRCVQK 77 (533)
Q Consensus 7 ~~~~~~g~~~~g~~~~~~~-----~~~~~~~~yC~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s----~~~D~~lGRCi~~ 77 (533)
.+++|||++...++.+-.. -....+-.||+||+|++|||+++++|.|+...|.-.-.. ..+|+.||+||..
T Consensus 114 ~~~~yiG~~~~~~~~~~~~~~~~~~~~~~~~~f~~GGaG~vlSr~~~~k~~~~~~~~~~~~~~~~~~~~dD~~lG~ci~~ 193 (252)
T PF02434_consen 114 SEPIYIGRPSGDRPIEIIHRFNPNKSKDSGFWFATGGAGYVLSRALLKKMSPWASGCKCPSTDEKIRLPDDMTLGYCIEN 193 (252)
T ss_dssp TS--EEE-EE----------------------EE-GGG-EEEEHHHHHHHHHHHTT-TTS--TTTTTS-HHHHHHHHHHH
T ss_pred ccCEEeeeeccCccceeeccccccccCcCceEeeCCCeeHHHhHHHHHHHhhhcccccccCCcCCCCCcccChhhhhHHh
Confidence 5799999987766655210 023456679999999999999999999988877432222 3789999999999
Q ss_pred hcCCcccccccccccc
Q psy6218 78 FAGIPFLSSRNRREDV 93 (533)
Q Consensus 78 ~~~~~C~~~~~~~~~~ 93 (533)
.+||+|+.+..+..+.
T Consensus 194 ~lgv~lt~s~~fhs~~ 209 (252)
T PF02434_consen 194 LLGVPLTHSPLFHSHL 209 (252)
T ss_dssp TT---EEE-TT---SS
T ss_pred cCCcceeechhhcccC
Confidence 9999999995544433
No 14
>KOG3588|consensus
Probab=98.08 E-value=9e-07 Score=92.65 Aligned_cols=80 Identities=35% Similarity=0.541 Sum_probs=55.6
Q ss_pred eeHHHHHHhcccHHHHHhhccCCCCccccchhhhhhcCCccccccccccccccccCCcccccccccccceeeccccCCC-
Q psy6218 43 MSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPFLSSRNRREDVDMIGESQPLRKQTSVQDKIRLGRQSNPR- 121 (533)
Q Consensus 43 ~Sr~~l~~~~p~l~~C~~~~~s~~~D~~lGRCi~~~~~~~C~~~~~~~~~~~~~~~~~~f~~a~tv~~~~~~~~~~~~~- 121 (533)
|||++|+ ||+.|++.+++.|||+||||||+++..-=-+. |-... ....-++-.-.+.+||||++|||.
T Consensus 1 Msr~~l~----hl~~cL~~m~t~hedve~grc~r~~mq~Lf~~------n~~an-k~sy~~~~~~~~~aitlhp~K~p~~ 69 (494)
T KOG3588|consen 1 MSRDTLL----HLESCLQHMLTSHEDVELGRCIRKHMQKLFHN------NQSAN-KESYAKNMKELKDAITLHPIKDPAV 69 (494)
T ss_pred CcHHHHH----HHHHHHHHHHhhccCcchhHHHHHHHHHHhhh------hhhhc-ChhhhccchhhhhhhccccccCHHH
Confidence 6899998 99999999999999999999999994310111 11111 111122244567889999999994
Q ss_pred ------------CCcccccceeee
Q psy6218 122 ------------PSTYKASEVSLS 133 (533)
Q Consensus 122 ------------~~~~~~~~~~~~ 133 (533)
-|+-+++++-+|
T Consensus 70 ~~r~h~~~l~~k~sear~~~~l~s 93 (494)
T KOG3588|consen 70 MRRVHLRNLEIKLSEARAKRSLLS 93 (494)
T ss_pred HHHHHHHHHHHHhhhhhhhhhhhc
Confidence 356667766665
No 15
>PLN03153 hypothetical protein; Provisional
Probab=97.42 E-value=0.00014 Score=79.95 Aligned_cols=106 Identities=15% Similarity=0.286 Sum_probs=69.4
Q ss_pred ccceeeccCCCCCcchhcccCCccCCCcccCCCceeeeHHHHHHhcccHHHHHhhccC-CCCccccchhhhhhcCCcccc
Q psy6218 7 RDKSFYTHGGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYT-THEDVELGRCVQKFAGIPFLS 85 (533)
Q Consensus 7 ~~~~~~g~~~~g~~~~~~~~~~~~~~~yC~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s-~~~D~~lGRCi~~~~~~~C~~ 85 (533)
+++.|||. +-|+..-.......|-.||+||+||+.++++|.++++.|...--. .-+|..||.|+.+. ||.=+.
T Consensus 238 tkp~YIGs-----~Se~~~qn~~f~~~fA~GGAG~~LSrPLae~L~~~~d~C~~rY~~~~~gD~rL~~CL~el-GV~LT~ 311 (537)
T PLN03153 238 SEMVYVGG-----PSESHSANSYFSHNMAFGGGGIAISYPLAEALSRILDDCLDRYPKLYGSDDRLHACITEL-GVPLSR 311 (537)
T ss_pred CCCEEecc-----cccccccccccccccccCCceEEEcHHHHHHHHHHhhhhhhhcccCCCcHHHHHHHHHHc-CCCcee
Confidence 68999998 555421111122368999999999999999999999999753222 35888999999975 888877
Q ss_pred ccccccccccccCCcccccccccccceeeccccC
Q psy6218 86 SRNRREDVDMIGESQPLRKQTSVQDKIRLGRQSN 119 (533)
Q Consensus 86 ~~~~~~~~~~~~~~~~f~~a~tv~~~~~~~~~~~ 119 (533)
+-.|.+ +|+-++-.-|-+|--+-|-+.||--+.
T Consensus 312 ~~gfhQ-~D~~Gd~~G~les~p~~P~vSlHH~~~ 344 (537)
T PLN03153 312 EPGFHQ-WDIRGNAHGLLSSHPIAPFVSIHHVEA 344 (537)
T ss_pred cCCccc-cccCCCcchHhhcCCCCCceeeeeccc
Confidence 733221 111122233444444445566666554
No 16
>PF04646 DUF604: Protein of unknown function, DUF604; InterPro: IPR006740 This family includes a conserved region found in several uncharacterised plant proteins.
Probab=97.05 E-value=0.00098 Score=67.48 Aligned_cols=58 Identities=19% Similarity=0.294 Sum_probs=50.3
Q ss_pred ccCCCceeeCHHHHHHHHHhHHHhhcc-CCCCchhHHHHHHHHhccCCceeecCCCcccc
Q psy6218 471 CMGGPGVIMSRATLALVAPHIKYCLKN-LYTTHEDVELGRCVQKFAGIPCTWSYEVSITS 529 (533)
Q Consensus 471 ~sGGAGYVLSr~aLrkL~~~~~~C~~~-~~~~~EDV~LG~CL~~~lGV~~t~s~e~~~~~ 529 (533)
+.||+|+.+|.++++.|.+..+.|... +.....|-.|..|+.+ +||+.|.+..++|+-
T Consensus 10 AfGGgG~~iS~pLa~~L~~~~d~C~~r~~~~~g~D~~i~~C~~~-lgv~LT~e~g~hQ~D 68 (255)
T PF04646_consen 10 AFGGGGFAISYPLAKALAKMQDDCIERYPHLYGGDQRIQACIAE-LGVPLTKEPGFHQMD 68 (255)
T ss_pred cccCceeEEcHHHHHHHHHHHHHHHHhcCCCCCchHHHHHHHHH-hCCCceecCCceeEe
Confidence 459999999999999999999999864 2235689999999998 899999999888864
No 17
>KOG2246|consensus
Probab=94.50 E-value=0.034 Score=59.53 Aligned_cols=69 Identities=20% Similarity=0.413 Sum_probs=53.0
Q ss_pred ccceeeccCCCCCcchhcccCCccCCCcccCCCceeeeHHHHHHh----cccHHHHHhhccCCCCccccchhhhhhcCCc
Q psy6218 7 RDKSFYTHGGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALV----APHIKYCLKNLYTTHEDVELGRCVQKFAGIP 82 (533)
Q Consensus 7 ~~~~~~g~~~~g~~~~~~~~~~~~~~~yC~~g~G~~~Sr~~l~~~----~p~l~~C~~~~~s~~~D~~lGRCi~~~~~~~ 82 (533)
.+++|||. .-+. .. .++ |=.||+|.++|++.++++ -+..+.|.++-..--||..||+|+.. .||.
T Consensus 196 ~~p~YiG~-----~~~~---~~-~~~-y~~g~ag~~ls~aa~~~la~~l~~~~~~C~~~~~~~~eD~~i~~Cl~~-~GV~ 264 (364)
T KOG2246|consen 196 EKPVYLGY-----RSKS---YF-QNG-YSSGGAGYVLSFAALRRLAERLLNNEDKCPQRYPSYGEDRRIGRCLAE-VGVP 264 (364)
T ss_pred CCcEEecc-----cccc---cc-ccc-cccCCCCcceeHHHHHHHHHHHhcchhhcccccCCchhHHHHHHHHHH-hCCC
Confidence 46999999 4441 11 222 999999999999998884 56678899876667899999999976 4667
Q ss_pred cccc
Q psy6218 83 FLSS 86 (533)
Q Consensus 83 C~~~ 86 (533)
=+++
T Consensus 265 ~~d~ 268 (364)
T KOG2246|consen 265 ATDE 268 (364)
T ss_pred ccCc
Confidence 7777
No 18
>PF04646 DUF604: Protein of unknown function, DUF604; InterPro: IPR006740 This family includes a conserved region found in several uncharacterised plant proteins.
Probab=92.97 E-value=0.1 Score=53.12 Aligned_cols=82 Identities=16% Similarity=0.329 Sum_probs=55.7
Q ss_pred cCCCceeeeHHHHHHhcccHHHHHhhccCC-CCccccchhhhhhcCCccccccccccccccccCCcccccccccccceee
Q psy6218 36 MGGPGVIMSRATLALVAPHIKYCLKNLYTT-HEDVELGRCVQKFAGIPFLSSRNRREDVDMIGESQPLRKQTSVQDKIRL 114 (533)
Q Consensus 36 ~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s~-~~D~~lGRCi~~~~~~~C~~~~~~~~~~~~~~~~~~f~~a~tv~~~~~~ 114 (533)
.||.||.+|+.|.++|.+.+|.|++....- --|.-|-.||.+ +||+=|.+-.|.+ +|.-++---|-.|.-+-|-++|
T Consensus 11 fGGgG~~iS~pLa~~L~~~~d~C~~r~~~~~g~D~~i~~C~~~-lgv~LT~e~g~hQ-~Di~Gd~~G~~~a~~~~pl~Sl 88 (255)
T PF04646_consen 11 FGGGGFAISYPLAKALAKMQDDCIERYPHLYGGDQRIQACIAE-LGVPLTKEPGFHQ-MDIRGDPSGFLEAHPLAPLVSL 88 (255)
T ss_pred ccCceeEEcHHHHHHHHHHHHHHHHhcCCCCCchHHHHHHHHH-hCCCceecCCcee-EeeccCcceeeecCCCCceeee
Confidence 489999999999999999999999764333 479999999965 7888766622211 1111222335555555556677
Q ss_pred ccccC
Q psy6218 115 GRQSN 119 (533)
Q Consensus 115 ~~~~~ 119 (533)
|-.+.
T Consensus 89 HH~~~ 93 (255)
T PF04646_consen 89 HHWDS 93 (255)
T ss_pred eehhh
Confidence 76543
No 19
>PF13506 Glyco_transf_21: Glycosyl transferase family 21
Probab=91.79 E-value=1.1 Score=42.96 Aligned_cols=104 Identities=18% Similarity=0.256 Sum_probs=66.9
Q ss_pred CCCceEEEEcCCccccHHHHHHHHhcCC-CCCCeEEEeecCCCccccc-------------cccCCCCCCCccCCCceee
Q psy6218 414 DKYEWFMRADDDLYVRGERLARMLRSVD-SSKPQFIGQAGRGNQEEFG-------------LLSLEYDENFCMGGPGVIM 479 (533)
Q Consensus 414 ~~~dWFlKvDDDTYVnvdNL~~~L~~~d-pseplYiG~~~~~~~~ey~-------------~~~~~~~~~Y~sGGAGYVL 479 (533)
.++|+++..|+|+.|..+-|.+++..+. |.-.+.-|.........+. .+......+++.| +.+++
T Consensus 30 a~~d~~~~~DsDi~v~p~~L~~lv~~l~~p~vglVt~~~~~~~~~~~~~~l~~~~~~~~~~~~~a~~~~~~~~G-~~m~~ 108 (175)
T PF13506_consen 30 AKYDYLVISDSDIRVPPDYLRELVAPLADPGVGLVTGLPRGVPARGFWSRLEAAFFNFLPGVLQALGGAPFAWG-GSMAF 108 (175)
T ss_pred CCCCEEEEECCCeeECHHHHHHHHHHHhCCCCcEEEecccccCCcCHHHHHHHHHHhHHHHHHHHhcCCCceec-ceeee
Confidence 4899999999999999999999998775 3333332322211111110 0001134667766 77999
Q ss_pred CHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecC
Q psy6218 480 SRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSY 523 (533)
Q Consensus 480 Sr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~ 523 (533)
.+++++++-- +..-. + .-.||..+|+.+.+ .|....-+.
T Consensus 109 rr~~L~~~GG-~~~l~-~--~ladD~~l~~~~~~-~G~~v~~~~ 147 (175)
T PF13506_consen 109 RREALEEIGG-FEALA-D--YLADDYALGRRLRA-RGYRVVLSP 147 (175)
T ss_pred EHHHHHHccc-HHHHh-h--hhhHHHHHHHHHHH-CCCeEEEcc
Confidence 9999998742 11111 1 46899999999998 687755444
No 20
>PF01755 Glyco_transf_25: Glycosyltransferase family 25 (LPS biosynthesis protein); InterPro: IPR002654 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 25 GT25 from CAZY comprises enzymes with only one known activity; as a lipopolysaccharide biosynthesis protein. These enzymes catalyse the transfer of various sugars onto the growing lipopolysaccharide chain during its biosynthesis [].; GO: 0009103 lipopolysaccharide biosynthetic process
Probab=90.88 E-value=2.1 Score=40.90 Aligned_cols=148 Identities=10% Similarity=0.046 Sum_probs=75.6
Q ss_pred EEEEcCCCChHHHHHHHHHhhhccCCCcEEEEecCCCCCCCCC---CCCC-cEEecCCCCCC-----CchHHHHHHHHHH
Q psy6218 338 VGVMTANSYLNTRATSVYDTWARDIPGRVAFFTSESSTLPAAR---PDLP-LVKLRGVDDSY-----PPQKKSFLMLQYM 408 (533)
Q Consensus 338 I~V~Tsp~~~~tRr~AIR~TWgr~~~~rvvFfsg~~~~i~~~d---~~lp-vV~L~~~dD~Y-----nlt~Kt~~~L~~l 408 (533)
|.|.+=+... .|+..+.+...+..- .+.||.+..+...... ..++ ........... -.+..++.+|+.+
T Consensus 4 i~vInL~~~~-~Rr~~~~~~~~~~~~-~~e~~~Avdg~~l~~~~~~~~~~~~~~~~~~~~~lt~gEiGC~lSH~~~w~~~ 81 (200)
T PF01755_consen 4 IYVINLDRST-ERRERIQQQLAKLGI-NFEFFDAVDGRDLSEDELFRRYDPELFKKRYGRPLTPGEIGCALSHIKAWQRI 81 (200)
T ss_pred EEEEECCCCH-HHHHHHHHHHHHcCC-ceEEEEeecccccchHHHHHHhhhhhhhccccccCCcceEeehhhHHHHHHHH
Confidence 3455556554 488888777766543 6778866654211000 0000 00000000000 0256677788888
Q ss_pred HHHhCCCCceEEEEcCCccccHHHHHHHHh----cCCCCCCeEEEeecCCC----------ccccc----cccC------
Q psy6218 409 WERYGDKYEWFMRADDDLYVRGERLARMLR----SVDSSKPQFIGQAGRGN----------QEEFG----LLSL------ 464 (533)
Q Consensus 409 ~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~----~~dpseplYiG~~~~~~----------~~ey~----~~~~------ 464 (533)
.+. +.++.+...||+++..+ +.+.|. .....+-++.|...... ...+. ....
T Consensus 82 v~~---~~~~~lIlEDDv~~~~~-f~~~l~~~~~~~~~~~~l~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (200)
T PF01755_consen 82 VDS---GLEYALILEDDVIFDPD-FKEFLEEILSHIPDWDFLRLGGWKDNSYSPGDIFLSRLSTFLSRSKRYKRKPIPPF 157 (200)
T ss_pred HHc---CCCeEEEEecccccccc-HHHHHHHHHhhcccccchhhccccccccccccccceeeeehhhhhhhcccCccccc
Confidence 763 68999999999999744 333333 32222233443211100 00000 0000
Q ss_pred -----CCCCCCccCCCceeeCHHHHHHHHHhH
Q psy6218 465 -----EYDENFCMGGPGVIMSRATLALVAPHI 491 (533)
Q Consensus 465 -----~~~~~Y~sGGAGYVLSr~aLrkL~~~~ 491 (533)
.....+..|..||++|+.++++|++..
T Consensus 158 ~~~~~~~~~~~~~~t~aY~Is~~gA~kLL~~~ 189 (200)
T PF01755_consen 158 GSRKLIRPAKYPYGTCAYLISRKGARKLLEAS 189 (200)
T ss_pred CCceEEeecCCCCcceeeeeCHHHHHHHHHhC
Confidence 001234456679999999999999764
No 21
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=88.32 E-value=1.8 Score=41.41 Aligned_cols=109 Identities=20% Similarity=0.261 Sum_probs=59.9
Q ss_pred HHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcC-CCCCCeEEEeecCCCcc---------ccccc----c---CCC
Q psy6218 404 MLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSV-DSSKPQFIGQAGRGNQE---------EFGLL----S---LEY 466 (533)
Q Consensus 404 ~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~-dpseplYiG~~~~~~~~---------ey~~~----~---~~~ 466 (533)
.+.+..+.. +++|++..|||+.+..+-|..+++.+ ++.-....|........ .+... . ...
T Consensus 77 a~n~~~~~~--~~d~i~~lD~D~~~~p~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (228)
T PF13641_consen 77 ALNEALAAA--RGDYILFLDDDTVLDPDWLERLLAAFADPGVGAVGGPVFPDNDRNWLTRLQDLFFARWHLRFRSGRRAL 154 (228)
T ss_dssp HHHHHHHH-----SEEEEE-SSEEE-CHHHHHHHHHHHBSS--EEEEEEEETTCCCEEEE-TT--S-EETTTS-TT-B--
T ss_pred HHHHHHHhc--CCCEEEEECCCcEECHHHHHHHHHHHHhCCCCeEeeeEeecCCCCHHHHHHHHHHhhhhhhhhhhhccc
Confidence 456666654 59999999999999999999988877 45444444443211110 00000 0 011
Q ss_pred CCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeec
Q psy6218 467 DENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWS 522 (533)
Q Consensus 467 ~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s 522 (533)
...++. |++.++.+++++++... +. ....||..++.-+.. .|......
T Consensus 155 ~~~~~~-G~~~~~rr~~~~~~g~f-d~-----~~~~eD~~l~~r~~~-~G~~~~~~ 202 (228)
T PF13641_consen 155 GVAFLS-GSGMLFRRSALEEVGGF-DP-----FILGEDFDLCLRLRA-AGWRIVYA 202 (228)
T ss_dssp --S-B---TEEEEEHHHHHHH-S---S-----SSSSHHHHHHHHHHH-TT--EEEE
T ss_pred ceeecc-CcEEEEEHHHHHHhCCC-CC-----CCcccHHHHHHHHHH-CCCcEEEE
Confidence 123344 59999999999999632 11 244599999988887 68776543
No 22
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=82.61 E-value=4.1 Score=36.09 Aligned_cols=90 Identities=18% Similarity=0.221 Sum_probs=61.7
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCC-CeEEEeecCCCccccccccCCCCCCCccCCCceeeCHHHHHHHHHhHHH
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSK-PQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKY 493 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpse-plYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~ 493 (533)
+.+|++..|||.++..+.|..+++.+.... -..++.. + .|+..++++++++++...-..
T Consensus 74 ~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~~~~~-------------------~-~~~~~~~~~~~~~~~~~~~~~ 133 (166)
T cd04186 74 KGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPK-------------------V-SGAFLLVRREVFEEVGGFDED 133 (166)
T ss_pred CCCEEEEECCCcEECccHHHHHHHHHHhCCCceEEEcc-------------------C-ceeeEeeeHHHHHHcCCCChh
Confidence 689999999999999998988887543322 1222221 2 358899999999987532122
Q ss_pred hhccCCCCchhHHHHHHHHhccCCceeecCCCcccc
Q psy6218 494 CLKNLYTTHEDVELGRCVQKFAGIPCTWSYEVSITS 529 (533)
Q Consensus 494 C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e~~~~~ 529 (533)
. ....||..+...+.. .|.+....++..-+|
T Consensus 134 ~----~~~~eD~~~~~~~~~-~g~~i~~~~~~~~~h 164 (166)
T cd04186 134 F----FLYYEDVDLCLRARL-AGYRVLYVPQAVIYH 164 (166)
T ss_pred h----hccccHHHHHHHHHH-cCCeEEEccceEEEe
Confidence 1 125689999887776 588877776654444
No 23
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=79.71 E-value=1.3 Score=42.56 Aligned_cols=48 Identities=21% Similarity=0.359 Sum_probs=35.8
Q ss_pred CCccCCCcccCCCceeeeHHHHHHhcccHHHHHhhccCCCCccccchhhhhh
Q psy6218 27 SLEYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKF 78 (533)
Q Consensus 27 ~~~~~~~yC~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s~~~D~~lGRCi~~~ 78 (533)
..+.=..||. |+|++||+.++++|.. .....-.-..||+.+|-|..+.
T Consensus 140 ~~~~yP~y~~-G~~yvls~~~v~~i~~---~~~~~~~~~~eDv~iGi~~~~~ 187 (195)
T PF01762_consen 140 PDDYYPPYCS-GGGYVLSSDVVKRIYK---ASSHTPFFPLEDVFIGILAEKL 187 (195)
T ss_pred ccccCCCcCC-CCeEEecHHHHHHHHH---HhhcCCCCCchHHHHHHHHHHC
Confidence 4445678997 6799999999999985 2323333458999999998764
No 24
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=76.51 E-value=75 Score=33.71 Aligned_cols=116 Identities=17% Similarity=0.196 Sum_probs=68.8
Q ss_pred HHHHHHHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCC-CCCCeEEEeecCCCcccc-c---------cccC---
Q psy6218 399 KKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVD-SSKPQFIGQAGRGNQEEF-G---------LLSL--- 464 (533)
Q Consensus 399 ~Kt~~~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~d-pseplYiG~~~~~~~~ey-~---------~~~~--- 464 (533)
.|...+.+ +.+.. +.+|++..|+|+.+..+-|.+.++.+. |...+.-|.......+.+ . ...+
T Consensus 113 ~K~~~l~~-~~~~a--~ge~i~~~DaD~~~~p~~L~~lv~~~~~~~v~~V~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 189 (373)
T TIGR03472 113 RKVSNLIN-MLPHA--RHDILVIADSDISVGPDYLRQVVAPLADPDVGLVTCLYRGRPVPGFWSRLGAMGINHNFLPSVM 189 (373)
T ss_pred hHHHHHHH-HHHhc--cCCEEEEECCCCCcChhHHHHHHHHhcCCCcceEeccccCCCCCCHHHHHHHHHhhhhhhHHHH
Confidence 35444333 33433 799999999999999999999998874 332333332111110000 0 0000
Q ss_pred ----CCCCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecC
Q psy6218 465 ----EYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSY 523 (533)
Q Consensus 465 ----~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~ 523 (533)
.....++. |+.+++.+++++++--.-.. .....||..++.=+.+ .|....-..
T Consensus 190 ~~~~~~~~~~~~-G~~~a~RR~~l~~iGGf~~~----~~~~~ED~~l~~~i~~-~G~~v~~~~ 246 (373)
T TIGR03472 190 VARALGRARFCF-GATMALRRATLEAIGGLAAL----AHHLADDYWLGELVRA-LGLRVVLAP 246 (373)
T ss_pred HHHhccCCcccc-ChhhheeHHHHHHcCChHHh----cccchHHHHHHHHHHH-cCCeEEecc
Confidence 01224454 58899999999998743211 1134699999999988 686654443
No 25
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=75.65 E-value=11 Score=35.64 Aligned_cols=85 Identities=19% Similarity=0.207 Sum_probs=58.8
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCC-CCCCeEEEeecCCCccccccccCCCCCCCccCCCceeeCHHHHHHHHHhHHH
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVD-SSKPQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKY 493 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~d-pseplYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~ 493 (533)
+.+|++..|+|+.+..+-|.+++..+. +...+..|. ++ .|++.++.+++++++.-. ..
T Consensus 86 ~~d~i~~~D~D~~~~~~~l~~l~~~~~~~~~~~v~~~-------------------~~-~g~~~~~r~~~~~~~ggf-~~ 144 (196)
T cd02520 86 RYDILVISDSDISVPPDYLRRMVAPLMDPGVGLVTCL-------------------CA-FGKSMALRREVLDAIGGF-EA 144 (196)
T ss_pred CCCEEEEECCCceEChhHHHHHHHHhhCCCCCeEEee-------------------cc-cCceeeeEHHHHHhccCh-HH
Confidence 689999999999999999988887653 222222222 23 458899999999988532 11
Q ss_pred hhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 494 CLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 494 C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
. .....||..++.-+.+ .|.......+
T Consensus 145 ~---~~~~~eD~~l~~rl~~-~G~~i~~~~~ 171 (196)
T cd02520 145 F---ADYLAEDYFLGKLIWR-LGYRVVLSPY 171 (196)
T ss_pred H---hHHHHHHHHHHHHHHH-cCCeEEEcch
Confidence 1 1124699999999887 5766554444
No 26
>cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4) to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis.
Probab=69.96 E-value=9.6 Score=34.33 Aligned_cols=107 Identities=15% Similarity=0.146 Sum_probs=63.2
Q ss_pred EEEcCCCChHHHHHHHHHhhhccCCCcEEEEecCCCCCCCCCCCCCcEE----ecCCCCC-----CCchHHHHHHHHHHH
Q psy6218 339 GVMTANSYLNTRATSVYDTWARDIPGRVAFFTSESSTLPAARPDLPLVK----LRGVDDS-----YPPQKKSFLMLQYMW 409 (533)
Q Consensus 339 ~V~Tsp~~~~tRr~AIR~TWgr~~~~rvvFfsg~~~~i~~~d~~lpvV~----L~~~dD~-----Ynlt~Kt~~~L~~l~ 409 (533)
.|++-+... .|+..+.+......- .+.||.+....... ...+.... ....... .-.+..++.+|+.+.
T Consensus 3 ~vInL~~~~-~Rr~~~~~~~~~~~~-~~~~~~Avd~~~~~-~~~~~~~~~~~~~~~~~~~l~~gEiGC~lSH~~~w~~~~ 79 (128)
T cd06532 3 FVINLDRST-DRRERMEAQLAALGL-DFEFFDAVDGKDLS-EEELAALYDALFLPRYGRPLTPGEIGCFLSHYKLWQKIV 79 (128)
T ss_pred EEEECCCCH-HHHHHHHHHHHHcCC-CeEEEeccccccCC-HHHHHHHhHHHhhhhcCCCCChhhHHHHHHHHHHHHHHH
Confidence 344555554 477788776655442 67777666542100 00000000 0000000 012456667788776
Q ss_pred HHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCccccccccCCCCCCCccCCCceeeCHHHHHHHHH
Q psy6218 410 ERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAP 489 (533)
Q Consensus 410 e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~~ 489 (533)
+ .+.+|.+..+||+.+..+ |..||++|+.++++|+.
T Consensus 80 ~---~~~~~alIlEDDv~~~~~-----------------------------------------~~~~Y~vs~~~A~~ll~ 115 (128)
T cd06532 80 E---SNLEYALILEDDAILDPD-----------------------------------------GTAGYLVSRKGAKKLLA 115 (128)
T ss_pred H---cCCCeEEEEccCcEECCC-----------------------------------------CceEEEeCHHHHHHHHH
Confidence 5 267999999999998766 34789999999999997
Q ss_pred hHH
Q psy6218 490 HIK 492 (533)
Q Consensus 490 ~~~ 492 (533)
...
T Consensus 116 ~~~ 118 (128)
T cd06532 116 ALE 118 (128)
T ss_pred hCC
Confidence 543
No 27
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=69.80 E-value=16 Score=34.63 Aligned_cols=102 Identities=16% Similarity=0.161 Sum_probs=61.6
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCCC--eEEEeecCCCccc-------cc--------cc-cC--CCCCCCccCC
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSKP--QFIGQAGRGNQEE-------FG--------LL-SL--EYDENFCMGG 474 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpsep--lYiG~~~~~~~~e-------y~--------~~-~~--~~~~~Y~sGG 474 (533)
+.+|++..|+|+++..+-|..+++.+..+.. +..|......... +. .. .. .....++. |
T Consensus 84 ~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g 162 (234)
T cd06421 84 TGDFVAILDADHVPTPDFLRRTLGYFLDDPKVALVQTPQFFYNPDPFDWLADGAPNEQELFYGVIQPGRDRWGAAFCC-G 162 (234)
T ss_pred CCCEEEEEccccCcCccHHHHHHHHHhcCCCeEEEecceEEecCCcchhHHHHHHHHHHHHHHHHHHHHhhcCCceec-C
Confidence 6899999999999999999999987754222 2222211100000 00 00 00 01234455 4
Q ss_pred CceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 475 PGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 475 AGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
++.++++++++++... .. ....||..++.=+.+ .|.......+
T Consensus 163 ~~~~~r~~~~~~ig~~-~~-----~~~~eD~~l~~r~~~-~g~~i~~~~~ 205 (234)
T cd06421 163 SGAVVRREALDEIGGF-PT-----DSVTEDLATSLRLHA-KGWRSVYVPE 205 (234)
T ss_pred ceeeEeHHHHHHhCCC-Cc-----cceeccHHHHHHHHH-cCceEEEecC
Confidence 8999999999987532 11 134799999977776 5766544433
No 28
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=68.53 E-value=27 Score=33.31 Aligned_cols=103 Identities=16% Similarity=0.129 Sum_probs=57.5
Q ss_pred CCceEEEEcCCccccHHHHHHHH---hcCCCCCC-eEEEeecCCCc-cc--cccc--------cC---C--CCCCCccCC
Q psy6218 415 KYEWFMRADDDLYVRGERLARML---RSVDSSKP-QFIGQAGRGNQ-EE--FGLL--------SL---E--YDENFCMGG 474 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L---~~~dpsep-lYiG~~~~~~~-~e--y~~~--------~~---~--~~~~Y~sGG 474 (533)
+++|++..|+|+.+..+-|..++ ..+..... ..+|....... .. +... .. . ....++. |
T Consensus 75 ~~d~v~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 153 (237)
T cd02526 75 GADYVLLFDQDSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGYKLRIQKEGEEGLKEVDFLI-T 153 (237)
T ss_pred CCCEEEEECCCCCcCHhHHHHHHHHHHhhccCCCeEEEeeeEEcCCCCeeccceeccCccceecccccCCceEeeeee-c
Confidence 68999999999999999999885 33322222 22333211110 00 0000 00 0 0112333 4
Q ss_pred CceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecC
Q psy6218 475 PGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSY 523 (533)
Q Consensus 475 AGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~ 523 (533)
+|.++++++++++-..-... +...||+.+..-+.+ .|......+
T Consensus 154 ~~~~~rr~~~~~~ggfd~~~----~~~~eD~d~~~r~~~-~G~~~~~~~ 197 (237)
T cd02526 154 SGSLISLEALEKVGGFDEDL----FIDYVDTEWCLRARS-KGYKIYVVP 197 (237)
T ss_pred cceEEcHHHHHHhCCCCHHH----cCccchHHHHHHHHH-cCCcEEEEc
Confidence 78899999999975321221 234689988877776 576554333
No 29
>PTZ00210 UDP-GlcNAc-dependent glycosyltransferase; Provisional
Probab=68.04 E-value=4.7 Score=43.58 Aligned_cols=68 Identities=16% Similarity=0.054 Sum_probs=45.5
Q ss_pred ccceeeccCCCCCcchhcccCCccCCCcccCCCceeeeHHHHHHhcccHHHHHh--------------hccCCCCccccc
Q psy6218 7 RDKSFYTHGGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKYCLK--------------NLYTTHEDVELG 72 (533)
Q Consensus 7 ~~~~~~g~~~~g~~~~~~~~~~~~~~~yC~~g~G~~~Sr~~l~~~~p~l~~C~~--------------~~~s~~~D~~lG 72 (533)
+..+|+|+--... +........||. |+|++|||.+++.|.-......- .+.-.|||+-+|
T Consensus 226 rr~LY~G~v~~~~-----~p~Rd~~PpY~~-G~gYvLSrDVA~~Lvs~~pl~rL~~~pys~~~~~~y~~~~~~~EDiMvG 299 (382)
T PTZ00210 226 RHGLYMGRYNYYN-----RIWRRNQLTYVN-GYCITLSRDTAQAIISYKPLERLVNMPFSMWDYFDFLDLGMFYEDVMVG 299 (382)
T ss_pred CCceEEEeeCCCC-----ccccCCCCCccc-cceeeccHHHHHHHHhhChHhHhhcCCCchHHHHHHHHhhcCchHHHHH
Confidence 4569999722110 011112467865 68999999999999887443321 255679999999
Q ss_pred hhhhhhcC
Q psy6218 73 RCVQKFAG 80 (533)
Q Consensus 73 RCi~~~~~ 80 (533)
+=+.+...
T Consensus 300 ~vLr~~~k 307 (382)
T PTZ00210 300 MILREKVV 307 (382)
T ss_pred HHHHHhcC
Confidence 99977764
No 30
>PF13632 Glyco_trans_2_3: Glycosyl transferase family group 2
Probab=67.69 E-value=19 Score=33.57 Aligned_cols=100 Identities=20% Similarity=0.304 Sum_probs=60.6
Q ss_pred eEEEEcCCccccHHHHHHHHhcCC-CCCCeEEEeecCCCcc---------ccc----ccc----CCCCCCCccCCCceee
Q psy6218 418 WFMRADDDLYVRGERLARMLRSVD-SSKPQFIGQAGRGNQE---------EFG----LLS----LEYDENFCMGGPGVIM 479 (533)
Q Consensus 418 WFlKvDDDTYVnvdNL~~~L~~~d-pseplYiG~~~~~~~~---------ey~----~~~----~~~~~~Y~sGGAGYVL 479 (533)
|++..|+||-+..+-|.+.+..+. |+-.+.-|........ ++. ... ......+++ |+|.++
T Consensus 1 ~v~~~DaDt~~~~d~l~~~~~~~~~~~~~~vq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-G~~~~~ 79 (193)
T PF13632_consen 1 YVLFLDADTRLPPDFLERLVAALEDPKVDAVQGPIIFRNRGSLLTRLQDFEYAISHGLSRLSQSSLGRPLFLS-GSGMLF 79 (193)
T ss_pred CEEEEcCCCCCChHHHHHHHHHHhCCCceEEEccEEecCCCChhheeehhhhhhhhhhhHHHHHhcCCCcccc-Ccceee
Confidence 789999999999998888888776 3222322332211110 110 000 012234555 599999
Q ss_pred CHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 480 SRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 480 Sr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
++++++++..- + ......||..++.=+.+ .|..+...++
T Consensus 80 r~~~l~~vg~~-~----~~~~~~ED~~l~~~l~~-~G~~~~~~~~ 118 (193)
T PF13632_consen 80 RREALREVGGF-D----DPFSIGEDMDLGFRLRR-AGYRIVYVPD 118 (193)
T ss_pred eHHHHHHhCcc-c----ccccccchHHHHHHHHH-CCCEEEEecc
Confidence 99999988521 1 12356799999877766 5766554444
No 31
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=67.11 E-value=18 Score=33.73 Aligned_cols=85 Identities=22% Similarity=0.248 Sum_probs=56.3
Q ss_pred CCCceEEEEcCCccccHHHHHHHHhcCC-CCCCeEEEeecCCCccccccccCCCCCCCccCCCceeeCHHHHHHHHHhHH
Q psy6218 414 DKYEWFMRADDDLYVRGERLARMLRSVD-SSKPQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIK 492 (533)
Q Consensus 414 ~~~dWFlKvDDDTYVnvdNL~~~L~~~d-pseplYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~ 492 (533)
.+++|+++.|||+.+..+.|..+++.++ +.-.++.|..... .. .+.|.++.+++++++--...
T Consensus 78 ~~~d~v~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~------------~~----~~~~~~~~~~~~~~~g~~~~ 141 (202)
T cd04185 78 LGYDWIWLMDDDAIPDPDALEKLLAYADKDNPQFLAPLVLDP------------DG----SFVGVLISRRVVEKIGLPDK 141 (202)
T ss_pred cCCCEEEEeCCCCCcChHHHHHHHHHHhcCCceEecceeEcC------------CC----ceEEEEEeHHHHHHhCCCCh
Confidence 3789999999999999988888887765 3333444443221 11 23678999999988742111
Q ss_pred HhhccCCCCchhHHHHHHHHhccCCce
Q psy6218 493 YCLKNLYTTHEDVELGRCVQKFAGIPC 519 (533)
Q Consensus 493 ~C~~~~~~~~EDV~LG~CL~~~lGV~~ 519 (533)
. .+...||+.+..=+.. .|...
T Consensus 142 ~----~~~~~eD~~~~~r~~~-~G~~i 163 (202)
T cd04185 142 E----FFIWGDDTEYTLRASK-AGPGI 163 (202)
T ss_pred h----hhccchHHHHHHHHHH-cCCcE
Confidence 1 1245689998887776 56554
No 32
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm
Probab=63.75 E-value=42 Score=30.47 Aligned_cols=92 Identities=17% Similarity=0.097 Sum_probs=56.4
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCccccccccCCCCCCCccCCCceeeCHHHHHHHHHhHHHh
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKYC 494 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C 494 (533)
+.+|++..|+|+.+..+-|.++++.+++. ....|....... ........ |+++++.+..+.++... ...
T Consensus 79 ~g~~i~~lD~D~~~~~~~l~~~~~~~~~~-~~v~g~~~~~~~--------~~~~~~~~-~~~~~~~r~~~~~~ggf-~~~ 147 (182)
T cd06420 79 KGDYLIFIDGDCIPHPDFIADHIELAEPG-VFLSGSRVLLNE--------KLTERGIR-GCNMSFWKKDLLAVNGF-DEE 147 (182)
T ss_pred cCCEEEEEcCCcccCHHHHHHHHHHhCCC-cEEecceeeccc--------ccceeEec-cceEEEEHHHHHHhCCC-Ccc
Confidence 68999999999999998899888877443 333444332110 01112223 47788888888755432 221
Q ss_pred hccCCCCchhHHHHHHHHhccCCcee
Q psy6218 495 LKNLYTTHEDVELGRCVQKFAGIPCT 520 (533)
Q Consensus 495 ~~~~~~~~EDV~LG~CL~~~lGV~~t 520 (533)
. .....||+.++.=+.+ .|+...
T Consensus 148 ~--~~~~~eD~~l~~r~~~-~g~~~~ 170 (182)
T cd06420 148 F--TGWGGEDSELVARLLN-SGIKFR 170 (182)
T ss_pred c--ccCCcchHHHHHHHHH-cCCcEE
Confidence 1 1124799988877776 575443
No 33
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=63.13 E-value=36 Score=34.05 Aligned_cols=106 Identities=18% Similarity=0.213 Sum_probs=57.2
Q ss_pred HHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCC--CCeEEEeecCCCc--ccccc----------ccCC----
Q psy6218 404 MLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSS--KPQFIGQAGRGNQ--EEFGL----------LSLE---- 465 (533)
Q Consensus 404 ~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dps--eplYiG~~~~~~~--~ey~~----------~~~~---- 465 (533)
+++++.+ .++||++..|||+.+..+-|..+++.++.. ....+|-...... ..+.. ....
T Consensus 65 Gi~~a~~---~~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 141 (281)
T TIGR01556 65 GLDASFR---RGVQGVLLLDQDSRPGNAFLAAQWKLLSAENGQACALGPRFFDRGTSRRLPAIHLDGLLLRQISLDGLTT 141 (281)
T ss_pred HHHHHHH---CCCCEEEEECCCCCCCHHHHHHHHHHHHhcCCceEEECCeEEcCCCcccCCceeecccceeeecccccCC
Confidence 5676654 368999999999999988888887755432 2233443211100 00000 0000
Q ss_pred -CCCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCc
Q psy6218 466 -YDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIP 518 (533)
Q Consensus 466 -~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~ 518 (533)
....++. ++|.++++++++++----+. .+...||..+..=+.+ .|..
T Consensus 142 ~~~~~~~~-~sg~li~~~~~~~iG~fde~----~fi~~~D~e~~~R~~~-~G~~ 189 (281)
T TIGR01556 142 PQKTSFLI-SSGCLITREVYQRLGMMDEE----LFIDHVDTEWSLRAQN-YGIP 189 (281)
T ss_pred ceeccEEE-cCcceeeHHHHHHhCCccHh----hcccchHHHHHHHHHH-CCCE
Confidence 0112333 46789999999988531111 1234578766433333 3544
No 34
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=62.83 E-value=28 Score=37.07 Aligned_cols=98 Identities=13% Similarity=0.130 Sum_probs=58.5
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCC-CeEEEeecCCCccccc------------ccc-------CCCCCCCccCC
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSK-PQFIGQAGRGNQEEFG------------LLS-------LEYDENFCMGG 474 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpse-plYiG~~~~~~~~ey~------------~~~-------~~~~~~Y~sGG 474 (533)
+.+|++.+|+|+.+..+.|.++++.+.... .+.-|.........+. ... +.....++. |
T Consensus 133 ~gd~llflDaD~~~~p~~l~~lv~~~~~~~~~~vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-G 211 (384)
T TIGR03469 133 PADYLLLTDADIAHGPDNLARLVARARAEGLDLVSLMVRLRCESFWEKLLIPAFVFFFQKLYPFRWVNDPRRRTAAAA-G 211 (384)
T ss_pred CCCEEEEECCCCCCChhHHHHHHHHHHhCCCCEEEecccccCCCHHHHHHHHHHHHHHHHhcchhhhcCCCccceeec-c
Confidence 489999999999999999988887664322 2322222111000000 000 001112233 5
Q ss_pred CceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCc
Q psy6218 475 PGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIP 518 (533)
Q Consensus 475 AGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~ 518 (533)
+.+++++++++++--.-... ....||+.+++-+++ .|..
T Consensus 212 ~~~lirr~~~~~vGGf~~~~----~~~~ED~~L~~r~~~-~G~~ 250 (384)
T TIGR03469 212 GCILIRREALERIGGIAAIR----GALIDDCTLAAAVKR-SGGR 250 (384)
T ss_pred eEEEEEHHHHHHcCCHHHHh----hCcccHHHHHHHHHH-cCCc
Confidence 78999999999985332211 135799999999998 5644
No 35
>PF15167 DUF4581: Domain of unknown function (DUF4581)
Probab=62.71 E-value=1.9 Score=38.88 Aligned_cols=14 Identities=29% Similarity=0.940 Sum_probs=12.3
Q ss_pred cccchhhhhhcccc
Q psy6218 140 LWSPAIELFFLCPQ 153 (533)
Q Consensus 140 ~~~~~~~~~~~~~~ 153 (533)
-|.|.+-.|.|||+
T Consensus 31 yadPgvSfYVlCPd 44 (128)
T PF15167_consen 31 YADPGVSFYVLCPD 44 (128)
T ss_pred ccCCCCceEEEeec
Confidence 48899999999998
No 36
>KOG2287|consensus
Probab=60.39 E-value=6.5 Score=41.89 Aligned_cols=49 Identities=22% Similarity=0.325 Sum_probs=38.9
Q ss_pred cCCCcccCCCceeeeHHHHHHhcccHHHHHhhccCCCCccccchhhhhhcCCc
Q psy6218 30 YDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIP 82 (533)
Q Consensus 30 ~~~~yC~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s~~~D~~lGRCi~~~~~~~ 82 (533)
.=..||+ |+|+|+|+.+.++|.- .+.+--+-.-|||-+|-|+.+.+||.
T Consensus 249 ~YP~Y~s-G~gYvis~~~a~~l~~---~s~~~~~~~iEDV~~g~~l~~~~gi~ 297 (349)
T KOG2287|consen 249 VYPPYAS-GPGYVISGDAARRLLK---ASKHLKFFPIEDVFVGGCLAEDLGIK 297 (349)
T ss_pred CCCCcCC-CceeEecHHHHHHHHH---HhcCCCccchHHHHHHHHHHHhcCCC
Confidence 3445775 6799999999999976 56554455589999999999998766
No 37
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=58.21 E-value=35 Score=31.70 Aligned_cols=93 Identities=17% Similarity=0.188 Sum_probs=54.4
Q ss_pred CCceEEEEcCCccccHHHHHHHHhc-CCCCCC-eEEEeecC--CCcccc--ccc--cCC---------CCCCCccCCCce
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRS-VDSSKP-QFIGQAGR--GNQEEF--GLL--SLE---------YDENFCMGGPGV 477 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~-~dpsep-lYiG~~~~--~~~~ey--~~~--~~~---------~~~~Y~sGGAGY 477 (533)
+.+|++..|+|..+..+.|..+++. ...... ++.|.... ...... ... ... ....+. .|+++
T Consensus 79 ~g~~v~~ld~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 157 (214)
T cd04196 79 DGDYVFFCDQDDIWLPDKLERLLKAFLKDDKPLLVYSDLELVDENGNPIGESFFEYQKIKPGTSFNNLLFQNVV-TGCTM 157 (214)
T ss_pred CCCEEEEECCCcccChhHHHHHHHHHhcCCCceEEecCcEEECCCCCCcccccccccccCCccCHHHHHHhCcc-CCcee
Confidence 7899999999999998888888886 333333 44443211 000000 000 000 001233 36889
Q ss_pred eeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHh
Q psy6218 478 IMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQK 513 (533)
Q Consensus 478 VLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~ 513 (533)
++.+++++++...-.. ....+|..+...+..
T Consensus 158 ~~r~~~~~~~~~~~~~-----~~~~~D~~~~~~~~~ 188 (214)
T cd04196 158 AFNRELLELALPFPDA-----DVIMHDWWLALLASA 188 (214)
T ss_pred eEEHHHHHhhcccccc-----ccccchHHHHHHHHH
Confidence 9999999998653211 135678888777765
No 38
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=58.00 E-value=38 Score=31.83 Aligned_cols=102 Identities=17% Similarity=0.139 Sum_probs=57.7
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCC-CCCeEEEeecCCCccc----ccc---------c--cCCCCC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDS-SKPQFIGQAGRGNQEE----FGL---------L--SLEYDE 468 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dp-seplYiG~~~~~~~~e----y~~---------~--~~~~~~ 468 (533)
+.+..+.. +.+|++.+|+|+.+..+-|.+++..+.. ....+.|......... +.. . ......
T Consensus 74 ~n~g~~~~--~~d~i~~~D~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (229)
T cd04192 74 LTTAIKAA--KGDWIVTTDADCVVPSNWLLTFVAFIQKEQIGLVAGPVIYFKGKSLLAKFQRLDWLSLLGLIAGSFGLGK 151 (229)
T ss_pred HHHHHHHh--cCCEEEEECCCcccCHHHHHHHHHHhhcCCCcEEeeeeeecCCccHHHHHHHHHHHHHHHHHhhHHHhcC
Confidence 44444433 6899999999999999888888885543 3345666543221100 000 0 001112
Q ss_pred CCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHH
Q psy6218 469 NFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQ 512 (533)
Q Consensus 469 ~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~ 512 (533)
++..-|+++++++++++++--.-. ......||..++.-+.
T Consensus 152 ~~~~~g~~~~~rr~~~~~~ggf~~----~~~~~~eD~~~~~~~~ 191 (229)
T cd04192 152 PFMCNGANMAYRKEAFFEVGGFEG----NDHIASGDDELLLAKV 191 (229)
T ss_pred ccccccceEEEEHHHHHHhcCCcc----ccccccCCHHHHHHHH
Confidence 333346889999999999853211 1123456665554333
No 39
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate.
Probab=55.13 E-value=30 Score=32.97 Aligned_cols=106 Identities=12% Similarity=0.038 Sum_probs=59.9
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCC-CCCCeEEEeecCCCc--ccc--------ccc----c---CCCCCCCccCCCc
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVD-SSKPQFIGQAGRGNQ--EEF--------GLL----S---LEYDENFCMGGPG 476 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~d-pseplYiG~~~~~~~--~ey--------~~~----~---~~~~~~Y~sGGAG 476 (533)
+.+|++.+|+|+.+..+.|..+++.++ +.-....|....... ..+ ... . ......++..|+.
T Consensus 77 ~~d~v~~lD~D~~~~~~~l~~l~~~~~~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~ 156 (235)
T cd06434 77 TTDIVVLLDSDTVWPPNALPEMLKPFEDPKVGGVGTNQRILRPRDSKWSFLAAEYLERRNEEIRAAMSYDGGVPCLSGRT 156 (235)
T ss_pred CCCEEEEECCCceeChhHHHHHHHhccCCCEeEEcCceEeecCcccHHHHHHHHHHHHHHHHHHHHHhhCCCEEEccCcH
Confidence 689999999999999999999999885 222222222211100 000 000 0 0011223344577
Q ss_pred eeeCHHHHHHHHHhHH---Hh-hccCCCCchhHHHHHHHHhccCCceee
Q psy6218 477 VIMSRATLALVAPHIK---YC-LKNLYTTHEDVELGRCVQKFAGIPCTW 521 (533)
Q Consensus 477 YVLSr~aLrkL~~~~~---~C-~~~~~~~~EDV~LG~CL~~~lGV~~t~ 521 (533)
.++.+++++++.-... .+ ........||..++.=+.+ .|....-
T Consensus 157 ~~~rr~~l~~~~~~~~~~~~~~~~~~~~~~eD~~l~~~~~~-~g~~~~~ 204 (235)
T cd06434 157 AAYRTEILKDFLFLEEFTNETFMGRRLNAGDDRFLTRYVLS-HGYKTVY 204 (235)
T ss_pred HHHHHHHHhhhhhHHHhhhhhhcCCCCCcCchHHHHHHHHH-CCCeEEE
Confidence 8888998887653211 11 1112356799998776666 4665443
No 40
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=50.08 E-value=54 Score=31.35 Aligned_cols=112 Identities=22% Similarity=0.253 Sum_probs=64.0
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEE-EeecC-CC-ccccc--------ccc-------CCC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFI-GQAGR-GN-QEEFG--------LLS-------LEY 466 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYi-G~~~~-~~-~~ey~--------~~~-------~~~ 466 (533)
+.+..+....++||++..|+|+.+..+.|.+++..+...+--.+ |.... .. ...+. ... ...
T Consensus 74 ~n~g~~~a~~~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (236)
T cd06435 74 LNYALERTAPDAEIIAVIDADYQVEPDWLKRLVPIFDDPRVGFVQAPQDYRDGEESLFKRMCYAEYKGFFDIGMVSRNER 153 (236)
T ss_pred HHHHHHhcCCCCCEEEEEcCCCCcCHHHHHHHHHHhcCCCeeEEecCccccCCCccHHHHHHhHHHHHHHHHHhcccccc
Confidence 44444443335899999999999999999999987753221222 21111 00 00000 000 000
Q ss_pred CCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 467 DENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 467 ~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
...+++ |++.++++++++++--. +.+ ...||..++.=+.+ .|......++
T Consensus 154 ~~~~~~-g~~~~~rr~~~~~iGgf-~~~-----~~~eD~dl~~r~~~-~G~~~~~~~~ 203 (236)
T cd06435 154 NAIIQH-GTMCLIRRSALDDVGGW-DEW-----CITEDSELGLRMHE-AGYIGVYVAQ 203 (236)
T ss_pred CceEEe-cceEEEEHHHHHHhCCC-CCc-----cccchHHHHHHHHH-CCcEEEEcch
Confidence 112344 47889999999998531 221 34799999877776 5766544443
No 41
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=49.95 E-value=83 Score=29.91 Aligned_cols=101 Identities=14% Similarity=0.114 Sum_probs=56.7
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCC-CCCCeEEEeecCCCcccc------------ccc---c--CCCCCCCccCCCc
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVD-SSKPQFIGQAGRGNQEEF------------GLL---S--LEYDENFCMGGPG 476 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~d-pseplYiG~~~~~~~~ey------------~~~---~--~~~~~~Y~sGGAG 476 (533)
+.+|++..|||+.+..+-|.++++.+. +...+..|.........+ ... . ......++..|.+
T Consensus 81 ~~d~v~~lD~D~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (249)
T cd02525 81 RGDIIIRVDAHAVYPKDYILELVEALKRTGADNVGGPMETIGESKFQKAIAVAQSSPLGSGGSAYRGGAVKIGYVDTVHH 160 (249)
T ss_pred CCCEEEEECCCccCCHHHHHHHHHHHhcCCCCEEecceecCCCChHHHHHHHHhhchhccCCcccccccccccccccccc
Confidence 689999999999999888888886543 333344454322111000 000 0 0001022334577
Q ss_pred eeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceee
Q psy6218 477 VIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTW 521 (533)
Q Consensus 477 YVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~ 521 (533)
.++++++++++.- ++. .+...||..++.=+.+ .|.....
T Consensus 161 ~~~~~~~~~~~g~-~~~----~~~~~eD~~l~~r~~~-~G~~~~~ 199 (249)
T cd02525 161 GAYRREVFEKVGG-FDE----SLVRNEDAELNYRLRK-AGYKIWL 199 (249)
T ss_pred ceEEHHHHHHhCC-CCc----ccCccchhHHHHHHHH-cCcEEEE
Confidence 8899999888742 111 1244689888755554 4655443
No 42
>cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we
Probab=49.50 E-value=42 Score=32.27 Aligned_cols=111 Identities=15% Similarity=0.078 Sum_probs=60.6
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCC-eEEEeecC-CCcccc-------ccc--c-----C-CCC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKP-QFIGQAGR-GNQEEF-------GLL--S-----L-EYD 467 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpsep-lYiG~~~~-~~~~ey-------~~~--~-----~-~~~ 467 (533)
+....+.. +.+|++.+|.|+.+..+-|...+..+...+- ...|.... .....+ ... . . ...
T Consensus 79 ~n~g~~~a--~~~~i~~~DaD~~~~~~~l~~~~~~~~~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (232)
T cd06437 79 LAEGMKVA--KGEYVAIFDADFVPPPDFLQKTPPYFADPKLGFVQTRWGHINANYSLLTRVQAMSLDYHFTIEQVARSST 156 (232)
T ss_pred HHHHHHhC--CCCEEEEEcCCCCCChHHHHHhhhhhcCCCeEEEecceeeEcCCCchhhHhhhhhHHhhhhHhHhhHhhc
Confidence 34444433 7899999999999999999986655432221 22232211 000000 000 0 0 001
Q ss_pred CCC-ccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 468 ENF-CMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 468 ~~Y-~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
..+ +..|++.++.+++++++.- ++. . ...||+.++.-+.. .|......++
T Consensus 157 ~~~~~~~g~~~~~rr~~~~~vgg-~~~---~--~~~ED~~l~~rl~~-~G~~~~~~~~ 207 (232)
T cd06437 157 GLFFNFNGTAGVWRKECIEDAGG-WNH---D--TLTEDLDLSYRAQL-KGWKFVYLDD 207 (232)
T ss_pred CCeEEeccchhhhhHHHHHHhCC-CCC---C--cchhhHHHHHHHHH-CCCeEEEecc
Confidence 111 1234666788888887642 111 1 35799999998887 6766554443
No 43
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=47.40 E-value=64 Score=31.18 Aligned_cols=108 Identities=12% Similarity=0.110 Sum_probs=59.0
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCC-CeEEEeecCCCccc----------ccc-c--cC-CCCCC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSK-PQFIGQAGRGNQEE----------FGL-L--SL-EYDEN 469 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpse-plYiG~~~~~~~~e----------y~~-~--~~-~~~~~ 469 (533)
+....+.. +.||++..|+|+.+..+-|..+++.+...+ .+..|......... +.. . .. .....
T Consensus 101 ~n~gi~~a--~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (251)
T cd06439 101 LNRALALA--TGEIVVFTDANALLDPDALRLLVRHFADPSVGAVSGELVIVDGGGSGSGEGLYWKYENWLKRAESRLGST 178 (251)
T ss_pred HHHHHHHc--CCCEEEEEccccCcCHHHHHHHHHHhcCCCccEEEeEEEecCCcccchhHHHHHHHHHHHHHHHHhcCCe
Confidence 34444443 469999999999999888999888875333 34444433211100 000 0 00 01112
Q ss_pred CccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 470 FCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 470 Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
+...|+.+++.+++++ .. ......||..++.=+.. .|......++
T Consensus 179 ~~~~g~~~~~rr~~~~----~~-----~~~~~~eD~~l~~~~~~-~G~~~~~~~~ 223 (251)
T cd06439 179 VGANGAIYAIRRELFR----PL-----PADTINDDFVLPLRIAR-QGYRVVYEPD 223 (251)
T ss_pred eeecchHHHhHHHHhc----CC-----CcccchhHHHHHHHHHH-cCCeEEeccc
Confidence 2233455666666665 11 11245789888776665 5766554443
No 44
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=47.36 E-value=91 Score=28.51 Aligned_cols=74 Identities=18% Similarity=0.103 Sum_probs=48.6
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCcccc--------ccc-c--CCCCCCCccCCCceeeCHHH
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEF--------GLL-S--LEYDENFCMGGPGVIMSRAT 483 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey--------~~~-~--~~~~~~Y~sGGAGYVLSr~a 483 (533)
..+|++..|+|.-...+-|..+++.+++...+.+|.......... ... . .....+.. +|+..++++++
T Consensus 80 ~~d~i~~~D~D~~~~~~~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~r~~ 158 (181)
T cd04187 80 RGDAVITMDADLQDPPELIPEMLAKWEEGYDVVYGVRKNRKESWLKRLTSKLFYRLINKLSGVDIPDN-GGDFRLMDRKV 158 (181)
T ss_pred CCCEEEEEeCCCCCCHHHHHHHHHHHhCCCcEEEEEecCCcchHHHHHHHHHHHHHHHHHcCCCCCCC-CCCEEEEcHHH
Confidence 469999999999999888888888766666677787553321100 000 0 11122233 45778999999
Q ss_pred HHHHHH
Q psy6218 484 LALVAP 489 (533)
Q Consensus 484 LrkL~~ 489 (533)
++.+..
T Consensus 159 ~~~i~~ 164 (181)
T cd04187 159 VDALLL 164 (181)
T ss_pred HHHHHh
Confidence 999874
No 45
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=46.82 E-value=54 Score=31.89 Aligned_cols=110 Identities=15% Similarity=0.139 Sum_probs=64.3
Q ss_pred HHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCC--CCeEEEeec-CCC-ccc---------cccc----cC--
Q psy6218 404 MLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSS--KPQFIGQAG-RGN-QEE---------FGLL----SL-- 464 (533)
Q Consensus 404 ~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dps--eplYiG~~~-~~~-~~e---------y~~~----~~-- 464 (533)
++.+..++. +.+|++..|+|+.+..+.|.+.++.+... +-.++|... ... ... +... .+
T Consensus 75 a~n~g~~~a--~gd~i~~~DaD~~~~~~~l~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (241)
T cd06427 75 ACNYALAFA--RGEYVVIYDAEDAPDPDQLKKAVAAFARLDDKLACVQAPLNYYNARENWLTRMFALEYAAWFDYLLPGL 152 (241)
T ss_pred HHHHHHHhc--CCCEEEEEcCCCCCChHHHHHHHHHHHhcCCCEEEEeCceEeeCCCccHHHHHHHHHHHHHHHHHHHHH
Confidence 345555543 67999999999999999999999876532 223333221 110 000 0000 00
Q ss_pred -CCCCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeec
Q psy6218 465 -EYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWS 522 (533)
Q Consensus 465 -~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s 522 (533)
........+|++.++++++++.+... + .....||..++.=+.. .|..+...
T Consensus 153 ~~~~~~~~~~g~~~~~rr~~~~~vgg~-~-----~~~~~eD~~l~~rl~~-~G~r~~~~ 204 (241)
T cd06427 153 ARLGLPIPLGGTSNHFRTDVLRELGGW-D-----PFNVTEDADLGLRLAR-AGYRTGVL 204 (241)
T ss_pred HhcCCeeecCCchHHhhHHHHHHcCCC-C-----cccchhhHHHHHHHHH-CCceEEEe
Confidence 01112233678899999999998532 1 1234699988876665 57665544
No 46
>PRK11204 N-glycosyltransferase; Provisional
Probab=46.62 E-value=59 Score=34.64 Aligned_cols=111 Identities=17% Similarity=0.202 Sum_probs=65.7
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcC--CCCCCeEEEeecCCCcc---------ccccc----c---CCC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSV--DSSKPQFIGQAGRGNQE---------EFGLL----S---LEY 466 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~--dpseplYiG~~~~~~~~---------ey~~~----~---~~~ 466 (533)
+....+.. +.||++..|+|+.+..+-|.++++.+ +|+-....|.+...... +|... . ...
T Consensus 126 ln~g~~~a--~~d~i~~lDaD~~~~~d~L~~l~~~~~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (420)
T PRK11204 126 LNTGAAAA--RSEYLVCIDGDALLDPDAAAYMVEHFLHNPRVGAVTGNPRIRNRSTLLGRIQVGEFSSIIGLIKRAQRVY 203 (420)
T ss_pred HHHHHHHc--CCCEEEEECCCCCCChhHHHHHHHHHHhCCCeEEEECCceeccchhHHHHHHHHHHHHhhhHHHHHHHHh
Confidence 44444443 68999999999999999999998877 33322333332111100 11000 0 001
Q ss_pred CCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 467 DENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 467 ~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
...++.+|++.++.+++++++-.. + .....||+.++.=+.+ .|....-.++
T Consensus 204 ~~~~~~~G~~~~~rr~~l~~vgg~-~-----~~~~~ED~~l~~rl~~-~G~~i~~~p~ 254 (420)
T PRK11204 204 GRVFTVSGVITAFRKSALHEVGYW-S-----TDMITEDIDISWKLQL-RGWDIRYEPR 254 (420)
T ss_pred CCceEecceeeeeeHHHHHHhCCC-C-----CCcccchHHHHHHHHH-cCCeEEeccc
Confidence 122333568889999999887421 1 1135799999988887 6876554443
No 47
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=44.95 E-value=1.5e+02 Score=26.84 Aligned_cols=106 Identities=11% Similarity=-0.057 Sum_probs=58.7
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcC--CCCCCeEEEeecCC--Cccccc-c-----ccCC-CCCCCccC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSV--DSSKPQFIGQAGRG--NQEEFG-L-----LSLE-YDENFCMG 473 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~--dpseplYiG~~~~~--~~~ey~-~-----~~~~-~~~~Y~sG 473 (533)
+....+.. +.+|++..|+|..+..+.+...+... ++...+..|..... ...... . .... ....+. .
T Consensus 67 ~n~~~~~a--~~~~v~~ld~D~~~~~~~~~~~~~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 143 (202)
T cd06433 67 MNKGIALA--TGDIIGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGDVLLVDENGRVIGRRRPPPFLDKFLLYGMPI-C 143 (202)
T ss_pred HHHHHHHc--CCCEEEEeCCCcccCchHHHHHHHHHHhCCCccEEEeeeEEEcCCCCcccCCCCcchhhhHHhhcCcc-c
Confidence 34444443 68999999999999988888887322 33445666654221 100000 0 0000 112333 4
Q ss_pred CCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCce
Q psy6218 474 GPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPC 519 (533)
Q Consensus 474 GAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~ 519 (533)
+++.++++++++++...... +...+|..+..=+.+ .|...
T Consensus 144 ~~~~~~~~~~~~~~~~f~~~-----~~~~~D~~~~~r~~~-~g~~~ 183 (202)
T cd06433 144 HQATFFRRSLFEKYGGFDES-----YRIAADYDLLLRLLL-AGKIF 183 (202)
T ss_pred CcceEEEHHHHHHhCCCchh-----hCchhhHHHHHHHHH-cCCce
Confidence 57899999999988631111 234567766544444 35444
No 48
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=43.22 E-value=86 Score=28.96 Aligned_cols=104 Identities=10% Similarity=0.017 Sum_probs=59.8
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCC--CCCCeEEEeecCCC-ccccc-cc-cC----CCCCCCccCCCceeeCHHHHH
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVD--SSKPQFIGQAGRGN-QEEFG-LL-SL----EYDENFCMGGPGVIMSRATLA 485 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~d--pseplYiG~~~~~~-~~ey~-~~-~~----~~~~~Y~sGGAGYVLSr~aLr 485 (533)
+.+|++..|+|..+..+.|..+++.++ +.-.+..|...... ..... .. .+ ......+..+.+.++++++++
T Consensus 83 ~~d~i~~ld~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 162 (202)
T cd04184 83 TGEFVALLDHDDELAPHALYEVVKALNEHPDADLIYSDEDKIDEGGKRSEPFFKPDWSPDLLLSQNYIGHLLVYRRSLVR 162 (202)
T ss_pred cCCEEEEECCCCcCChHHHHHHHHHHHhCCCCCEEEccHHhccCCCCEeccccCCCCCHHHhhhcCCccceEeEEHHHHH
Confidence 579999999999999998988888762 33334444322110 00000 00 00 001122233466789999998
Q ss_pred HHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 486 LVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 486 kL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
++.-- +. .+...||..++.=+.+ .|....+.++
T Consensus 163 ~iggf-~~----~~~~~eD~~l~~rl~~-~g~~~~~~~~ 195 (202)
T cd04184 163 QVGGF-RE----GFEGAQDYDLVLRVSE-HTDRIAHIPR 195 (202)
T ss_pred HhCCC-Cc----CcccchhHHHHHHHHh-ccceEEEccH
Confidence 87521 11 1235689888777766 5766655544
No 49
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=42.71 E-value=1.1e+02 Score=27.74 Aligned_cols=74 Identities=15% Similarity=0.075 Sum_probs=46.5
Q ss_pred CCceEEEEcCCccccHHHHHHHHhc-CCCCCCeEEEeecCCCc---cc-ccc--------c-c-CCCCCCCccCCCceee
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRS-VDSSKPQFIGQAGRGNQ---EE-FGL--------L-S-LEYDENFCMGGPGVIM 479 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~-~dpseplYiG~~~~~~~---~e-y~~--------~-~-~~~~~~Y~sGGAGYVL 479 (533)
..+|++..|+|+.+..+-|.+++.. .++...+.+|....... .. +.. . . ..........|+-+++
T Consensus 79 ~gd~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (185)
T cd04179 79 RGDIVVTMDADLQHPPEDIPKLLEKLLEGGADVVIGSRFVRGGGAGMPLLRRLGSRLFNFLIRLLLGVRISDTQSGFRLF 158 (185)
T ss_pred cCCEEEEEeCCCCCCHHHHHHHHHHHhccCCcEEEEEeecCCCcccchHHHHHHHHHHHHHHHHHcCCCCcCCCCceeee
Confidence 3599999999999999999999986 45555677777543221 00 000 0 0 0111122223455789
Q ss_pred CHHHHHHHH
Q psy6218 480 SRATLALVA 488 (533)
Q Consensus 480 Sr~aLrkL~ 488 (533)
++++++.+.
T Consensus 159 ~r~~~~~i~ 167 (185)
T cd04179 159 RREVLEALL 167 (185)
T ss_pred HHHHHHHHH
Confidence 999999996
No 50
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi,
Probab=42.18 E-value=96 Score=29.15 Aligned_cols=73 Identities=18% Similarity=0.129 Sum_probs=46.1
Q ss_pred CCceEEEEcCCccccHHHHHHHHhc-CCCCCCeEEEeecCCCc--cccc--------------cccCCCCCCCccCCCce
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRS-VDSSKPQFIGQAGRGNQ--EEFG--------------LLSLEYDENFCMGGPGV 477 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~-~dpseplYiG~~~~~~~--~ey~--------------~~~~~~~~~Y~sGGAGY 477 (533)
..+|++..|+|..+..+.|..+++. .++...+..|....... ..+. ..........++| +..
T Consensus 78 ~gd~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 156 (224)
T cd06442 78 RGDVIVVMDADLSHPPEYIPELLEAQLEGGADLVIGSRYVEGGGVEGWGLKRKLISRGANLLARLLLGRKVSDPTS-GFR 156 (224)
T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHhcCCCCEEEEeeeecCCccCCCcHHHHHHHHHHHHHHHHHcCCCCCCCCC-ccc
Confidence 4699999999999999999999886 34444566665432111 1110 0001122334444 557
Q ss_pred eeCHHHHHHHH
Q psy6218 478 IMSRATLALVA 488 (533)
Q Consensus 478 VLSr~aLrkL~ 488 (533)
++++++++++.
T Consensus 157 ~~~r~~~~~ig 167 (224)
T cd06442 157 AYRREVLEKLI 167 (224)
T ss_pred hhhHHHHHHHh
Confidence 89999999997
No 51
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=39.87 E-value=92 Score=29.22 Aligned_cols=97 Identities=16% Similarity=0.136 Sum_probs=54.3
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCc-ccccc------ccCCCCCCCccCCCceeeCHHHHHHH
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQ-EEFGL------LSLEYDENFCMGGPGVIMSRATLALV 487 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~-~ey~~------~~~~~~~~Y~sGGAGYVLSr~aLrkL 487 (533)
+.+|++..|+|+++..+.|.+++...........+....... ..... ...........++.|.++++++++++
T Consensus 72 ~~~~i~~~D~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~ 151 (221)
T cd02522 72 RGDWLLFLHADTRLPPDWDAAIIETLRADGAVAGAFRLRFDDPGPRLRLLELGANLRSRLFGLPYGDQGLFIRRELFEEL 151 (221)
T ss_pred cCCEEEEEcCCCCCChhHHHHHHHHhhcCCcEEEEEEeeecCCccchhhhhhcccceecccCCCcCCceEEEEHHHHHHh
Confidence 479999999999999998888877665443322222211110 00000 00111112223457899999998877
Q ss_pred HHhHHHhhccCCCCchhHHHHHHHHhccCCc
Q psy6218 488 APHIKYCLKNLYTTHEDVELGRCVQKFAGIP 518 (533)
Q Consensus 488 ~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~ 518 (533)
-.. +. . ...||..+..=+.+ .|..
T Consensus 152 G~f-d~----~-~~~ED~d~~~r~~~-~G~~ 175 (221)
T cd02522 152 GGF-PE----L-PLMEDVELVRRLRR-RGRP 175 (221)
T ss_pred CCC-Cc----c-ccccHHHHHHHHHh-CCCE
Confidence 532 11 1 26788877655554 4543
No 52
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=37.92 E-value=65 Score=36.32 Aligned_cols=109 Identities=17% Similarity=0.144 Sum_probs=63.0
Q ss_pred CCCceEEEEcCCccccHHHHHHHHhcCCCCC-----CeEEEeecCCCc------ccccccc----C--C-CCCCCccCCC
Q psy6218 414 DKYEWFMRADDDLYVRGERLARMLRSVDSSK-----PQFIGQAGRGNQ------EEFGLLS----L--E-YDENFCMGGP 475 (533)
Q Consensus 414 ~~~dWFlKvDDDTYVnvdNL~~~L~~~dpse-----plYiG~~~~~~~------~ey~~~~----~--~-~~~~Y~sGGA 475 (533)
.++++++..|-|+.+..+-|..+...+ ++. |++.+....... .+|.... + . -......+|.
T Consensus 157 ~~~d~vvi~DAD~~v~Pd~Lr~~~~~~-~~~~~VQ~pv~~~~~~~~~~~ag~y~~ef~~~~~~~l~~r~~LG~~~~~~Gt 235 (504)
T PRK14716 157 IRFAIIVLHDAEDVIHPLELRLYNYLL-PRHDFVQLPVFSLPRDWGEWVAGTYMDEFAESHLKDLPVREALGGLIPSAGV 235 (504)
T ss_pred CCcCEEEEEcCCCCcCccHHHHHHhhc-CCCCEEecceeccCCchhHHHHHHHHHHHHHHHHHHHHHHHhcCCccccCCe
Confidence 367999999999999999998764433 222 122111100000 1111000 0 0 1122234579
Q ss_pred ceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 476 GVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 476 GYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
|+++++++++++........=+...-.||..+|.-+.. .|..+.-.++
T Consensus 236 g~afRR~aLe~l~~~~GG~~fd~~sLTED~dLglRL~~-~G~rv~y~p~ 283 (504)
T PRK14716 236 GTAFSRRALERLAAERGGQPFDSDSLTEDYDIGLRLKR-AGFRQIFVRV 283 (504)
T ss_pred eEEeEHHHHHHHHhhcCCCCCCCCCcchHHHHHHHHHH-CCCEEEEecc
Confidence 99999999999864211100011257899999999988 6888765444
No 53
>PLN03133 beta-1,3-galactosyltransferase; Provisional
Probab=37.59 E-value=19 Score=41.60 Aligned_cols=47 Identities=23% Similarity=0.256 Sum_probs=32.6
Q ss_pred cCCCcccCCCceeeeHHHHHHhcccHHHHHhhccCCCCccccchhhhhh
Q psy6218 30 YDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKF 78 (533)
Q Consensus 30 ~~~~yC~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s~~~D~~lGRCi~~~ 78 (533)
.=..||. |+|+|||+.++++|.--.....-. +-..|||-+|-|+...
T Consensus 536 ~YPpYas-G~gYVlS~Dla~~L~~~s~s~~l~-~f~lEDVyvGi~l~~l 582 (636)
T PLN03133 536 TYPPWAH-GPGYVVSRDIAKEVYKRHKEGRLK-MFKLEDVAMGIWIAEM 582 (636)
T ss_pred CCCCCCC-cCEEEEcHHHHHHHHHhhhhcccC-cCChhhHhHHHHHHHh
Confidence 3456886 789999999999985432221112 3348999999998643
No 54
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=35.40 E-value=32 Score=29.90 Aligned_cols=80 Identities=15% Similarity=0.214 Sum_probs=43.0
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCC-CCCeEEEeecCCCc--------ccc---ccc-------cCC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDS-SKPQFIGQAGRGNQ--------EEF---GLL-------SLE 465 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dp-seplYiG~~~~~~~--------~ey---~~~-------~~~ 465 (533)
+....+.. +.+|++.+|||+++..+.|..+++.++. ...+.+|....... ..+ ... ...
T Consensus 70 ~n~~~~~a--~~~~i~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 147 (169)
T PF00535_consen 70 RNRGIKHA--KGEYILFLDDDDIISPDWLEELVEALEKNPPDVVIGSVIYIDDDNRYPDRRLRFSFWNRFERKIFNNIRF 147 (169)
T ss_dssp HHHHHHH----SSEEEEEETTEEE-TTHHHHHHHHHHHCTTEEEEEEEEEEECTTETEECCCTSEEEECCHCHHHHTTHS
T ss_pred cccccccc--ceeEEEEeCCCceEcHHHHHHHHHHHHhCCCcEEEEEEEEecCCccccccccchhhhhhhhhHHHHhhhc
Confidence 34444443 4679999999999977666666654433 33355555421100 000 000 011
Q ss_pred CCCCCccCCCceeeCHHHHHHH
Q psy6218 466 YDENFCMGGPGVIMSRATLALV 487 (533)
Q Consensus 466 ~~~~Y~sGGAGYVLSr~aLrkL 487 (533)
....++.| ++.++++++++++
T Consensus 148 ~~~~~~~~-~~~~~rr~~~~~~ 168 (169)
T PF00535_consen 148 WKISFFIG-SCALFRRSVFEEI 168 (169)
T ss_dssp TTSSEESS-SCEEEEEHHHHHC
T ss_pred CCcccccc-cEEEEEHHHHHhh
Confidence 23445544 7899999998875
No 55
>PF13704 Glyco_tranf_2_4: Glycosyl transferase family 2
Probab=34.23 E-value=40 Score=28.21 Aligned_cols=28 Identities=18% Similarity=0.237 Sum_probs=19.8
Q ss_pred HHHHHHHHhCCCCceEEEEcCCccccHHH
Q psy6218 404 MLQYMWERYGDKYEWFMRADDDLYVRGER 432 (533)
Q Consensus 404 ~L~~l~e~~~~~~dWFlKvDDDTYVnvdN 432 (533)
..+.+.+.+ .+.+|++.+|-|-|+.++.
T Consensus 61 ~~~~~~~~~-~~~dWvl~~D~DEfl~~~~ 88 (97)
T PF13704_consen 61 WRNALIERA-FDADWVLFLDADEFLVPPP 88 (97)
T ss_pred HHHHHHHhC-CCCCEEEEEeeeEEEecCC
Confidence 334444443 3789999999999996654
No 56
>cd06436 GlcNAc-1-P_transferase N-acetyl-glucosamine transferase is involved in the synthesis of Poly-beta-1,6-N-acetyl-D-glucosamine. N-acetyl-glucosamine transferase is responsible for the synthesis of bacteria Poly-beta-1,6-N-acetyl-D-glucosamine (PGA). Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer that serves as an adhesion for the maintenance of biofilm structural stability in diverse eubacteria. N-acetyl-glucosamine transferase is the product of gene pgaC. Genetic analysis indicated that all four genes of the pgaABCD locus were required for the PGA production, pgaC being a glycosyltransferase.
Probab=33.01 E-value=75 Score=29.86 Aligned_cols=73 Identities=14% Similarity=0.136 Sum_probs=42.4
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCC-CeEEEeec-CCCcc---------ccccc------cCCCCCCCccCCCce
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSK-PQFIGQAG-RGNQE---------EFGLL------SLEYDENFCMGGPGV 477 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpse-plYiG~~~-~~~~~---------ey~~~------~~~~~~~Y~sGGAGY 477 (533)
+.+|++..|.|+.+..+-|...+..+...+ ....|... ..... +|... ..........||.|.
T Consensus 89 ~~d~v~~~DaD~~~~~~~l~~~~~~~~~~~v~~v~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~G~~~ 168 (191)
T cd06436 89 ERVIIAVIDADGRLDPNALEAVAPYFSDPRVAGTQSRVRMYNRHKNLLTILQDLEFFIIIAATQSLRALTGTVGLGGNGQ 168 (191)
T ss_pred CccEEEEECCCCCcCHhHHHHHHHhhcCCceEEEeeeEEEecCCCCHHHHHHHHHHHHHHHHHHHHHHhcCcEEECCeeE
Confidence 357999999999999999988766554222 22222221 11111 11100 001111223478999
Q ss_pred eeCHHHHHHH
Q psy6218 478 IMSRATLALV 487 (533)
Q Consensus 478 VLSr~aLrkL 487 (533)
++++++++.+
T Consensus 169 ~~r~~~l~~v 178 (191)
T cd06436 169 FMRLSALDGL 178 (191)
T ss_pred EEeHHHHHHh
Confidence 9999999998
No 57
>cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als
Probab=31.99 E-value=1.5e+02 Score=24.71 Aligned_cols=72 Identities=15% Similarity=0.208 Sum_probs=45.2
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCCC-eEEEeecCCCccccccccCCCCCCCccCCCceeeCHHHHHHHHHhHHH
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSKP-QFIGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALVAPHIKY 493 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpsep-lYiG~~~~~~~~ey~~~~~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~ 493 (533)
+.+|++.+|+|..+..+.+..++..+...+. ..++ +.+++++++++++++......
T Consensus 77 ~~d~v~~~d~D~~~~~~~~~~~~~~~~~~~~~~~v~-----------------------~~~~~~~~~~~~~~~~~~~~~ 133 (156)
T cd00761 77 RGEYILFLDADDLLLPDWLERLVAELLADPEADAVG-----------------------GPGNLLFRRELLEEIGGFDEA 133 (156)
T ss_pred cCCEEEEECCCCccCccHHHHHHHHHhcCCCceEEe-----------------------ccchheeeHHHHHHhCCcchH
Confidence 6899999999999999888887443322211 1111 116789999999988643221
Q ss_pred hhccCCCCchhHHHHHHHHh
Q psy6218 494 CLKNLYTTHEDVELGRCVQK 513 (533)
Q Consensus 494 C~~~~~~~~EDV~LG~CL~~ 513 (533)
- ....||..+..-+..
T Consensus 134 ~----~~~~ed~~~~~~~~~ 149 (156)
T cd00761 134 L----LSGEEDDDFLLRLLR 149 (156)
T ss_pred h----cCCcchHHHHHHHHh
Confidence 1 122567766655544
No 58
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=31.48 E-value=1.3e+02 Score=32.87 Aligned_cols=111 Identities=15% Similarity=0.191 Sum_probs=66.9
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcC--CCCCCeEEEeecCCCcc---------ccccc----c---CCC
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSV--DSSKPQFIGQAGRGNQE---------EFGLL----S---LEY 466 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~--dpseplYiG~~~~~~~~---------ey~~~----~---~~~ 466 (533)
++...+. .+.||++..|+|+.+..+.|.++++.+ +|+-...-|.+...... +|... . ...
T Consensus 147 lN~gl~~--a~~d~iv~lDAD~~~~~d~L~~lv~~~~~~~~~g~v~g~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ 224 (444)
T PRK14583 147 LRMGAAA--ARSEYLVCIDGDALLDKNAVPYLVAPLIANPRTGAVTGNPRIRTRSTLIGRVQVGEFSSIIGLIKRTQRVY 224 (444)
T ss_pred HHHHHHh--CCCCEEEEECCCCCcCHHHHHHHHHHHHhCCCeEEEEccceecCCCcchhhHHHHHHHHHHHHHHHHHHHh
Confidence 4554443 278999999999999999999888765 33322333332111100 11100 0 011
Q ss_pred CCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 467 DENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 467 ~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
...++.+|++.++.+++++++--.. .....||..++.-+.. .|-.....++
T Consensus 225 g~~~~~sG~~~~~rr~al~~vGg~~------~~~i~ED~dl~~rl~~-~G~~i~~~p~ 275 (444)
T PRK14583 225 GQVFTVSGVVAAFRRRALADVGYWS------PDMITEDIDISWKLQL-KHWSVFFEPR 275 (444)
T ss_pred CCceEecCceeEEEHHHHHHcCCCC------CCcccccHHHHHHHHH-cCCeEEEeec
Confidence 2344555788899999998875211 1134699999999987 6876554443
No 59
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=31.35 E-value=1.1e+02 Score=28.23 Aligned_cols=73 Identities=18% Similarity=0.178 Sum_probs=46.5
Q ss_pred CCCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCcc-c---------ccc---cc----CCCCCCCccCCCc
Q psy6218 414 DKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQE-E---------FGL---LS----LEYDENFCMGGPG 476 (533)
Q Consensus 414 ~~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~-e---------y~~---~~----~~~~~~Y~sGGAG 476 (533)
.+.+|++..|.|+.+..+-|..++..+.....+..|........ . +.. .. ..........|+|
T Consensus 80 ~~~d~v~~~DaD~~~~p~~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~ 159 (183)
T cd06438 80 DDPDAVVVFDADNLVDPNALEELNARFAAGARVVQAYYNSKNPDDSWITRLYAFAFLVFNRLRPLGRSNLGLSCQLGGTG 159 (183)
T ss_pred CCCCEEEEEcCCCCCChhHHHHHHHHHhhCCCeeEEEEeeeCCccCHHHHHHHHHHHHHHHHHHHHHHHcCCCeeecCch
Confidence 36899999999999999999999887765545666654321110 0 000 00 0011122235689
Q ss_pred eeeCHHHHHH
Q psy6218 477 VIMSRATLAL 486 (533)
Q Consensus 477 YVLSr~aLrk 486 (533)
++++++++++
T Consensus 160 ~~~rr~~l~~ 169 (183)
T cd06438 160 MCFPWAVLRQ 169 (183)
T ss_pred hhhHHHHHHh
Confidence 9999999998
No 60
>cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane.
Probab=29.27 E-value=1.5e+02 Score=30.06 Aligned_cols=107 Identities=16% Similarity=0.166 Sum_probs=61.4
Q ss_pred CCCceEEEEcCCccccHHHHHHHHhcC--CCCCCeEEEeecCCC-cccc-ccc------------cC-----CCCCCCcc
Q psy6218 414 DKYEWFMRADDDLYVRGERLARMLRSV--DSSKPQFIGQAGRGN-QEEF-GLL------------SL-----EYDENFCM 472 (533)
Q Consensus 414 ~~~dWFlKvDDDTYVnvdNL~~~L~~~--dpseplYiG~~~~~~-~~ey-~~~------------~~-----~~~~~Y~s 472 (533)
.++++++..|.|+.+..+-|.+++..+ ||.--+ +..+.... ...+ ..+ .. .....++.
T Consensus 94 ~~~~~i~~~DaD~~~~p~~l~~~v~~~~~~~~vg~-vq~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (254)
T cd04191 94 SRYDYMVVLDADSLMSGDTIVRLVRRMEANPRAGI-IQTAPKLIGAETLFARLQQFANRLYGPVFGRGLAAWQGGEGNYW 172 (254)
T ss_pred CCCCEEEEEeCCCCCCHHHHHHHHHHHHhCCCEEE-EeCCceeECCCCHHHHHHHHHHHHHHHHHHHHHHHhcCCccCcc
Confidence 478999999999999999999999876 333111 11111110 0000 000 00 01123443
Q ss_pred CCCceeeCHHHHHHHHH--hHHHhhcc-CCCCchhHHHHHHHHhccCCceeecC
Q psy6218 473 GGPGVIMSRATLALVAP--HIKYCLKN-LYTTHEDVELGRCVQKFAGIPCTWSY 523 (533)
Q Consensus 473 GGAGYVLSr~aLrkL~~--~~~~C~~~-~~~~~EDV~LG~CL~~~lGV~~t~s~ 523 (533)
|+.+++.++++..+.- .+...... ...-.||..+|.-+.. .|-...-.+
T Consensus 173 -G~~~~~Rr~al~~~~~~~~i~g~g~~~~~~l~eD~~l~~~~~~-~G~ri~~~~ 224 (254)
T cd04191 173 -GHNAIIRVAAFMEHCALPVLPGRPPFGGHILSHDFVEAALMRR-AGWEVRLAP 224 (254)
T ss_pred -ceEEEEEHHHHHHhcCCccccCCCCCCCCeecHHHHHHHHHHH-cCCEEEEcc
Confidence 5899999999988631 11111100 1246899999999997 585544433
No 61
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=26.39 E-value=1.4e+02 Score=25.61 Aligned_cols=26 Identities=19% Similarity=0.256 Sum_probs=22.0
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcC
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSV 440 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~ 440 (533)
+.+|++..|+|.++..+-|..++..+
T Consensus 78 ~~~~i~~~D~D~~~~~~~l~~~~~~~ 103 (180)
T cd06423 78 KGDIVVVLDADTILEPDALKRLVVPF 103 (180)
T ss_pred CCCEEEEECCCCCcChHHHHHHHHHh
Confidence 68999999999999988888885544
No 62
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=25.98 E-value=1.8e+02 Score=29.38 Aligned_cols=27 Identities=15% Similarity=0.085 Sum_probs=23.1
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCC
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVD 441 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~d 441 (533)
+.+|++..|+|+.+..+-|..+|+.+.
T Consensus 83 ~gd~i~fLD~D~~~~~~wL~~ll~~l~ 109 (299)
T cd02510 83 TGDVLVFLDSHCEVNVGWLEPLLARIA 109 (299)
T ss_pred cCCEEEEEeCCcccCccHHHHHHHHHH
Confidence 689999999999998888888887553
No 63
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=25.85 E-value=3.3e+02 Score=32.22 Aligned_cols=107 Identities=14% Similarity=0.122 Sum_probs=62.1
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEe--ecCCCc---------cccc----cccC--C-CCCCCccCCCc
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQ--AGRGNQ---------EEFG----LLSL--E-YDENFCMGGPG 476 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~--~~~~~~---------~ey~----~~~~--~-~~~~Y~sGGAG 476 (533)
+++.++..|-|+.|..+.|. .+..+.+.....-+- +....+ .+|. .... . -......+|.|
T Consensus 155 ~~~vvvi~DAD~~v~pd~L~-~~~~l~~~~~~VQ~p~~p~~~~~~~~~~~~~~~EFa~~~~~~~~~~~~lgg~~~l~G~~ 233 (727)
T PRK11234 155 AFAGFILHDAEDVISPMELR-LFNYLVERKDLIQIPVYPFEREWTHFTSGTYIDEFAELHGKDVPVREALAGQVPSAGVG 233 (727)
T ss_pred cccEEEEEcCCCCCChhHHH-HHHhhcCCCCeEeecccCCCccHHHHHHHHHHHHHHHHhhhhhHHHHHcCCCcccCCce
Confidence 56778889999999999998 334333222221111 111011 1222 0000 0 11223456799
Q ss_pred eeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecC
Q psy6218 477 VIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSY 523 (533)
Q Consensus 477 YVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~ 523 (533)
.++||.+++.+.+....+.-....-.||..+|.=+.. .|..+.-.+
T Consensus 234 ~af~Rr~l~al~~~ggg~~~~~~~lTED~dlg~rL~~-~G~~v~f~~ 279 (727)
T PRK11234 234 TCFSRRAVTALLEDGDGIAFDVQSLTEDYDIGFRLKE-KGMREIFVR 279 (727)
T ss_pred EEEecccHHHHHHhcCCCCcCCCcchHHHHHHHHHHH-CCCEEEEcc
Confidence 9999998877776432221123467899999999997 688875544
No 64
>PLN03193 beta-1,3-galactosyltransferase; Provisional
Probab=25.83 E-value=50 Score=36.29 Aligned_cols=40 Identities=20% Similarity=0.291 Sum_probs=30.2
Q ss_pred CcccCCCceeeeHHHHHHhcccHHHHHhhccCCCCccccchhhh
Q psy6218 33 NFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQ 76 (533)
Q Consensus 33 ~yC~~g~G~~~Sr~~l~~~~p~l~~C~~~~~s~~~D~~lGRCi~ 76 (533)
.|| .|+|++||+.+.+.|.-+-. .-..+ ..|||-+|-|+.
T Consensus 302 pyA-sG~gYVlS~DLa~~I~~n~~--~L~~y-~~EDV~vG~Wl~ 341 (408)
T PLN03193 302 RHA-TGQLYAISKDLASYISINQH--VLHKY-ANEDVSLGSWFI 341 (408)
T ss_pred CCC-CcceEEehHHHHHHHHhChh--hhccc-CcchhhhhhHhc
Confidence 345 56899999999999985554 22344 499999999984
No 65
>PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional
Probab=23.23 E-value=2.7e+02 Score=29.17 Aligned_cols=75 Identities=13% Similarity=0.066 Sum_probs=46.7
Q ss_pred CCceEEEEcCCccccHHHHHHHHhcCCCCCCeEEEeecCCCcccc--------ccc-cCCCCCCCccCCCce-eeCHHHH
Q psy6218 415 KYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEF--------GLL-SLEYDENFCMGGPGV-IMSRATL 484 (533)
Q Consensus 415 ~~dWFlKvDDDTYVnvdNL~~~L~~~dpseplYiG~~~~~~~~ey--------~~~-~~~~~~~Y~sGGAGY-VLSr~aL 484 (533)
+.||++..|+|.-..++.+.++++.+...-++..|.......+.+ ..+ ..-...++...++|+ +++++++
T Consensus 90 ~gd~vv~~DaD~q~~p~~i~~l~~~~~~~~DvV~~~r~~~~~~~~r~~~s~~~~~l~~~~~g~~~~d~~~gfr~~~r~~~ 169 (325)
T PRK10714 90 TGDLIITLDADLQNPPEEIPRLVAKADEGYDVVGTVRQNRQDSWFRKTASKMINRLIQRTTGKAMGDYGCMLRAYRRHIV 169 (325)
T ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHhhCCEEEEEEcCCCCcHHHHHHHHHHHHHHHHHcCCCCCCCCcCeEEEcHHHH
Confidence 689999999999999999999998775433455555432111101 000 000112233344666 8999999
Q ss_pred HHHHH
Q psy6218 485 ALVAP 489 (533)
Q Consensus 485 rkL~~ 489 (533)
+.+..
T Consensus 170 ~~l~~ 174 (325)
T PRK10714 170 DAMLH 174 (325)
T ss_pred HHHHH
Confidence 99863
No 66
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.
Probab=22.59 E-value=1.9e+02 Score=33.83 Aligned_cols=108 Identities=15% Similarity=0.184 Sum_probs=64.2
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCC-eEEEeecC-CCcc----c--------------ccccc-
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKP-QFIGQAGR-GNQE----E--------------FGLLS- 463 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpsep-lYiG~~~~-~~~~----e--------------y~~~~- 463 (533)
+.+..++. +.+|++..|.|+.+..+-|.+.+..+..+.. -.++.+.. ..+. + |....
T Consensus 220 LN~al~~a--~gd~Il~lDAD~v~~pd~L~~~v~~f~~dp~v~~Vqtp~~f~~p~~~~~nl~~~~~~~~e~~~f~~~i~~ 297 (713)
T TIGR03030 220 INNALKHT--DGELILIFDADHVPTRDFLQRTVGWFVEDPKLFLVQTPHFFVSPDPIERNLGTFRRMPNENELFYGLIQD 297 (713)
T ss_pred HHHHHHhc--CCCEEEEECCCCCcChhHHHHHHHHHHhCCCEEEEeCCeeccCCCHHhhhhHHHHHhhhHHHHHHHHHHH
Confidence 45555554 6899999999999999999988876632222 12221110 0000 0 00000
Q ss_pred --CCCCCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeec
Q psy6218 464 --LEYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWS 522 (533)
Q Consensus 464 --~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s 522 (533)
......++. |++.++.+++++++--... ....||..++..+.+ .|-.....
T Consensus 298 g~~~~~~~~~~-Gs~~~iRR~al~~iGGf~~------~~vtED~~l~~rL~~-~G~~~~y~ 350 (713)
T TIGR03030 298 GNDFWNAAFFC-GSAAVLRREALDEIGGIAG------ETVTEDAETALKLHR-RGWNSAYL 350 (713)
T ss_pred HHhhhCCeeec-CceeEEEHHHHHHcCCCCC------CCcCcHHHHHHHHHH-cCCeEEEe
Confidence 001223443 5889999999998753211 135799999999987 68765433
No 67
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=22.20 E-value=31 Score=24.72 Aligned_cols=11 Identities=36% Similarity=0.878 Sum_probs=9.4
Q ss_pred HhcccHHHHHh
Q psy6218 50 LVAPHIKYCLK 60 (533)
Q Consensus 50 ~~~p~l~~C~~ 60 (533)
+..||||.|+.
T Consensus 18 RfA~HLekCmg 28 (33)
T PF08209_consen 18 RFAPHLEKCMG 28 (33)
T ss_dssp GHHHHHHHHTC
T ss_pred hhHHHHHHHHc
Confidence 67899999984
No 68
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=21.83 E-value=2.8e+02 Score=28.41 Aligned_cols=112 Identities=16% Similarity=0.224 Sum_probs=65.9
Q ss_pred HHHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCC-CeEEEeecCCCc-----cc--------cccc-------
Q psy6218 404 MLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSK-PQFIGQAGRGNQ-----EE--------FGLL------- 462 (533)
Q Consensus 404 ~L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpse-plYiG~~~~~~~-----~e--------y~~~------- 462 (533)
..+++.+. ..+|++..++||.+..+-|.++++..+... ....|....... +. ....
T Consensus 76 g~~~a~~~---~~~~~l~LN~D~~~~~~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (305)
T COG1216 76 GIKYALAK---GDDYVLLLNPDTVVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRGGESDGLTGGWRASPLLE 152 (305)
T ss_pred HHHHHhcC---CCcEEEEEcCCeeeChhHHHHHHHHHHhCCCCeEeeeeEecCCCCcchheeccccccccccceeccccc
Confidence 55665442 233999999999999999988888554332 233343321100 00 0000
Q ss_pred cCCC------CCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecCC
Q psy6218 463 SLEY------DENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524 (533)
Q Consensus 463 ~~~~------~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~e 524 (533)
.... ...+++ |+..++++++++++-- ++. ..+...||+-++.=+.+ +|.+..-.+.
T Consensus 153 ~~~~~~~~~~~~~~~~-G~~~li~~~~~~~vG~-~de---~~F~y~eD~D~~~R~~~-~G~~i~~~p~ 214 (305)
T COG1216 153 IAPDLSSYLEVVASLS-GACLLIRREAFEKVGG-FDE---RFFIYYEDVDLCLRARK-AGYKIYYVPD 214 (305)
T ss_pred ccccccchhhhhhhcc-eeeeEEcHHHHHHhCC-CCc---ccceeehHHHHHHHHHH-cCCeEEEeec
Confidence 0000 011244 4889999999999974 222 24568899999888887 6876543333
No 69
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=20.28 E-value=2.7e+02 Score=29.42 Aligned_cols=109 Identities=17% Similarity=0.162 Sum_probs=69.9
Q ss_pred HHHHHHHhCCCCceEEEEcCCccccHHHHHHHHhcCCCCCCe-EEEeecCCCc------------cccccc------c--
Q psy6218 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQ-FIGQAGRGNQ------------EEFGLL------S-- 463 (533)
Q Consensus 405 L~~l~e~~~~~~dWFlKvDDDTYVnvdNL~~~L~~~dpsepl-YiG~~~~~~~------------~ey~~~------~-- 463 (533)
+.+..... +.|+++..|.|+.+..+-|.+.+..+...... +.|.+..... -+|... .
T Consensus 129 l~~~l~~~--~~d~V~~~DaD~~~~~d~l~~~~~~f~~~~~~~v~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 206 (439)
T COG1215 129 LNNGLKRA--KGDVVVILDADTVPEPDALRELVSPFEDPPVGAVVGTPRIRNRPDPSNLLGRIQAIEYLSAFYFRLRAAS 206 (439)
T ss_pred HHHHHhhc--CCCEEEEEcCCCCCChhHHHHHHhhhcCCCeeEEeCCceeeecCChhhhcchhcchhhhhhHHHhhhhhh
Confidence 45544433 59999999999999999999999987644333 5554421110 011000 0
Q ss_pred CCCCCCCccCCCceeeCHHHHHHHHHhHHHhhccCCCCchhHHHHHHHHhccCCceeecC
Q psy6218 464 LEYDENFCMGGPGVIMSRATLALVAPHIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSY 523 (533)
Q Consensus 464 ~~~~~~Y~sGGAGYVLSr~aLrkL~~~~~~C~~~~~~~~EDV~LG~CL~~~lGV~~t~s~ 523 (533)
......++ .|++.++-+++++++..... ..-.||..++.-+.. .|......+
T Consensus 207 ~~g~~~~~-~G~~~~~rr~aL~~~g~~~~------~~i~ED~~lt~~l~~-~G~~~~~~~ 258 (439)
T COG1215 207 KGGLISFL-SGSSSAFRRSALEEVGGWLE------DTITEDADLTLRLHL-RGYRVVYVP 258 (439)
T ss_pred hcCCeEEE-cceeeeEEHHHHHHhCCCCC------CceeccHHHHHHHHH-CCCeEEEee
Confidence 01123445 46999999999999983222 256799999999986 676655443
Done!