Query 009021
Match_columns 546
No_of_seqs 333 out of 985
Neff 5.7
Searched_HMMs 46136
Date Thu Mar 28 19:26:55 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/009021.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/009021hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03153 hypothetical protein; 100.0 7E-128 1E-132 1024.7 39.6 522 17-546 2-537 (537)
2 PF04646 DUF604: Protein of un 100.0 7.5E-74 1.6E-78 565.3 21.1 252 261-517 1-255 (255)
3 KOG2246 Galactosyltransferases 100.0 2.1E-43 4.5E-48 370.9 11.6 274 122-427 86-363 (364)
4 PF02434 Fringe: Fringe-like; 100.0 1.1E-33 2.3E-38 284.9 8.4 210 123-353 2-233 (252)
5 KOG3708 Uncharacterized conser 99.5 2.8E-14 6E-19 152.1 8.7 177 111-320 13-191 (681)
6 KOG2287 Galactosyltransferases 99.2 1.4E-10 2.9E-15 122.6 12.2 208 127-355 95-329 (349)
7 PLN03193 beta-1,3-galactosyltr 99.1 1.3E-09 2.9E-14 115.9 13.9 114 209-327 227-356 (408)
8 PF01762 Galactosyl_T: Galacto 99.0 1.1E-09 2.3E-14 106.0 10.2 110 208-322 69-195 (195)
9 PLN03133 beta-1,3-galactosyltr 99.0 6.8E-09 1.5E-13 116.3 14.6 181 128-320 386-591 (636)
10 PTZ00210 UDP-GlcNAc-dependent 98.6 1.9E-07 4.2E-12 98.5 10.6 142 208-355 188-347 (382)
11 KOG2288 Galactosyltransferases 97.9 9.9E-06 2.2E-10 81.5 4.1 104 217-325 106-224 (274)
12 cd04186 GT_2_like_c Subfamily 93.4 0.19 4.1E-06 45.1 6.1 84 219-322 74-158 (166)
13 PF13506 Glyco_transf_21: Glyc 93.3 0.44 9.6E-06 45.7 8.8 100 217-322 29-147 (175)
14 cd02520 Glucosylceramide_synth 90.7 0.52 1.1E-05 44.8 5.9 85 218-322 85-170 (196)
15 cd02526 GT2_RfbF_like RfbF is 90.2 1.3 2.9E-05 42.7 8.4 99 218-321 74-196 (237)
16 PF13641 Glyco_tranf_2_3: Glyc 89.7 0.63 1.4E-05 44.8 5.6 104 208-321 77-202 (228)
17 PF05679 CHGN: Chondroitin N-a 86.9 0.28 6.1E-06 54.8 1.3 69 289-358 1-74 (499)
18 cd04185 GT_2_like_b Subfamily 86.5 2.2 4.8E-05 40.2 7.0 86 217-318 77-163 (202)
19 TIGR01556 rhamnosyltran L-rham 85.4 2.4 5.3E-05 42.7 7.1 99 218-322 72-194 (281)
20 cd04192 GT_2_like_e Subfamily 85.1 3.9 8.5E-05 38.8 8.0 94 218-316 81-195 (229)
21 PF13632 Glyco_trans_2_3: Glyc 84.1 3.5 7.5E-05 38.8 7.1 94 222-321 1-116 (193)
22 cd06421 CESA_CelA_like CESA_Ce 83.8 2.3 5.1E-05 40.7 5.9 92 219-318 84-200 (234)
23 cd06438 EpsO_like EpsO protein 82.7 2 4.2E-05 40.3 4.7 69 217-285 79-169 (183)
24 cd06437 CESA_CaSu_A2 Cellulose 82.3 4.1 8.9E-05 39.5 7.0 96 218-320 86-204 (232)
25 cd06434 GT2_HAS Hyaluronan syn 82.1 4 8.6E-05 39.3 6.8 28 218-245 76-103 (235)
26 PF01755 Glyco_transf_25: Glyc 80.7 3.2 6.9E-05 39.8 5.5 22 269-290 168-189 (200)
27 cd06427 CESA_like_2 CESA_like_ 80.1 4.9 0.00011 39.5 6.8 96 218-320 83-203 (241)
28 cd06435 CESA_NdvC_like NdvC_li 79.1 11 0.00024 36.3 8.9 96 218-320 83-200 (236)
29 cd06420 GT2_Chondriotin_Pol_N 78.5 8.7 0.00019 35.2 7.6 91 218-318 78-169 (182)
30 cd06436 GlcNAc-1-P_transferase 78.5 3.3 7.2E-05 39.3 4.8 67 219-286 89-178 (191)
31 cd04195 GT2_AmsE_like GT2_AmsE 78.3 5.1 0.00011 37.5 6.0 96 218-320 79-192 (201)
32 PRK11204 N-glycosyltransferase 77.8 5.1 0.00011 43.0 6.5 98 218-322 133-253 (420)
33 cd04188 DPG_synthase DPG_synth 77.1 14 0.00031 35.2 8.8 99 219-323 82-202 (211)
34 TIGR03472 HpnI hopanoid biosyn 76.4 8.9 0.00019 40.9 7.8 99 218-322 125-246 (373)
35 cd04184 GT2_RfbC_Mx_like Myxoc 75.3 12 0.00025 35.0 7.6 99 218-322 82-194 (202)
36 PRK14583 hmsR N-glycosyltransf 72.7 7.6 0.00016 42.4 6.3 98 218-322 154-274 (444)
37 cd06442 DPM1_like DPM1_like re 72.1 19 0.0004 34.3 8.2 38 219-256 78-116 (224)
38 cd02525 Succinoglycan_BP_ExoA 72.1 16 0.00034 35.2 7.8 95 219-319 81-198 (249)
39 cd06532 Glyco_transf_25 Glycos 70.9 3.8 8.1E-05 37.1 2.9 38 218-291 81-118 (128)
40 cd02522 GT_2_like_a GT_2_like_ 70.1 16 0.00034 34.7 7.2 91 219-317 72-175 (221)
41 TIGR03469 HonB hopene-associat 70.1 11 0.00025 40.2 6.9 97 217-319 131-252 (384)
42 cd04196 GT_2_like_d Subfamily 70.0 22 0.00047 33.3 8.0 92 217-314 77-189 (214)
43 cd06433 GT_2_WfgS_like WfgS an 69.5 20 0.00043 32.9 7.6 95 218-318 74-183 (202)
44 cd06439 CESA_like_1 CESA_like_ 69.3 12 0.00025 36.6 6.2 37 219-255 109-146 (251)
45 PLN02726 dolichyl-phosphate be 67.1 26 0.00057 34.4 8.3 99 218-322 92-211 (243)
46 cd06913 beta3GnTL1_like Beta 1 63.2 18 0.00039 34.7 6.1 38 218-255 83-120 (219)
47 cd00761 Glyco_tranf_GTA_type G 62.8 21 0.00045 30.4 5.9 53 219-289 77-130 (156)
48 cd04187 DPM1_like_bac Bacteria 59.4 21 0.00045 33.0 5.6 70 219-288 80-164 (181)
49 cd04179 DPM_DPG-synthase_like 57.1 20 0.00044 32.8 5.1 38 220-257 80-118 (185)
50 COG1215 Glycosyltransferases, 56.5 29 0.00063 36.9 6.9 99 218-323 136-259 (439)
51 cd06423 CESA_like CESA_like is 55.7 17 0.00036 31.9 4.1 26 219-244 78-103 (180)
52 cd02510 pp-GalNAc-T pp-GalNAc- 54.6 30 0.00066 35.2 6.4 27 218-244 82-108 (299)
53 COG1216 Predicted glycosyltran 52.4 42 0.00092 34.6 7.1 98 222-324 87-215 (305)
54 PTZ00260 dolichyl-phosphate be 49.7 65 0.0014 34.0 8.1 99 218-322 161-285 (333)
55 cd04191 Glucan_BSP_ModH Glucan 48.2 53 0.0011 33.4 6.9 103 218-322 94-224 (254)
56 PLN03181 glycosyltransferase; 47.6 75 0.0016 35.2 8.1 45 200-245 180-224 (453)
57 PRK10714 undecaprenyl phosphat 47.2 38 0.00083 35.6 5.9 71 218-288 89-174 (325)
58 PRK14716 bacteriophage N4 adso 45.9 67 0.0014 36.3 7.8 101 218-321 157-281 (504)
59 PF00535 Glycos_transf_2: Glyc 40.7 14 0.0003 32.5 1.1 38 219-256 78-116 (169)
60 TIGR03030 CelA cellulose synth 38.1 81 0.0018 37.0 7.2 94 218-319 227-348 (713)
61 PF05637 Glyco_transf_34: gala 35.9 34 0.00075 34.6 3.3 31 202-233 60-90 (239)
62 PRK13183 psbN photosystem II r 31.5 56 0.0012 25.1 2.9 29 39-67 6-34 (46)
63 CHL00020 psbN photosystem II p 30.7 60 0.0013 24.6 2.9 28 40-67 4-31 (43)
64 KOG2246 Galactosyltransferases 29.7 32 0.00069 37.2 1.9 45 121-165 66-111 (364)
65 PRK11234 nfrB bacteriophage N4 27.6 1.8E+02 0.0039 34.5 7.7 101 219-322 155-279 (727)
66 TIGR03111 glyc2_xrt_Gpos1 puta 27.1 1.2E+02 0.0026 33.2 5.9 94 218-318 130-256 (439)
67 PF13704 Glyco_tranf_2_4: Glyc 23.5 66 0.0014 26.9 2.4 25 217-241 69-97 (97)
68 KOG4748 Subunit of Golgi manno 22.3 89 0.0019 33.9 3.6 35 198-233 155-189 (364)
69 PF10111 Glyco_tranf_2_2: Glyc 21.9 3.9E+02 0.0084 27.2 8.1 97 218-319 87-212 (281)
70 cd02514 GT13_GLCNAC-TI GT13_GL 21.7 2.3E+02 0.0049 30.5 6.5 76 208-286 88-174 (334)
71 PRK05454 glucosyltransferase M 21.6 3.4E+02 0.0073 32.1 8.4 35 210-244 211-245 (691)
72 PF02468 PsbN: Photosystem II 20.5 93 0.002 23.6 2.3 28 40-67 4-31 (43)
73 PF06072 Herpes_US9: Alphaherp 20.0 34 0.00075 27.6 -0.0 17 29-45 26-42 (60)
No 1
>PLN03153 hypothetical protein; Provisional
Probab=100.00 E-value=6.8e-128 Score=1024.71 Aligned_cols=522 Identities=76% Similarity=1.287 Sum_probs=481.5
Q ss_pred cCcccCCCCCCC--ccccccccccccchhHHHHHHHHHHHHhheeEEeecccccceeecccccCCCCCCCCcccCCCCCc
Q 009021 17 LHEKNSCRSPLL--RSRNRRWRHSRIGTAVVMVAALLLSTTVWLSVVFSATTCHFWIRVREWEGSPRSRSHFHTHNHRHS 94 (546)
Q Consensus 17 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (546)
++|+++|+|||| |+|+||||+++++++++.++++||++++|+++|++.++..|||++++||++|+ +++|+.+..
T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 77 (537)
T PLN03153 2 SSPHRPCSSPLSLSRTRSRRWRHSRTGTAVVAVAALLLSTTAWLSLVFSGTTARCWHRFKDWEGSPD----TLLWNKRYH 77 (537)
T ss_pred CCCCCCCCCCCCcCchhhhhhhhccccchHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc----ccccCcccc
Confidence 689999999999 77899999999999999999999999999999999999999999999999999 999965553
Q ss_pred cCCCCCCCCCCCCCCC--------CCCCCCc----cccccCCCCCcEEEEEecCcccHHHHHHHHHHhhhccCCcEEEEe
Q 009021 95 LPWISTPPPPPPSPSP--------NRSLSLS----EEKQEELSLKHIVFGIAGSSHLWKRRKEFVRLWWLPNLMRGHVWL 162 (546)
Q Consensus 95 ~~~~~~~~~p~~~~~~--------~~~~s~~----~~~~~~~~~~~Ivf~I~Ts~~~~~tR~~~ik~tW~~~~~r~~vFs 162 (546)
++...+.++|.+.+++ .++.+.. +++..+++++||+|||+|++++|++|++|++.||+++.+|++||+
T Consensus 78 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~hIvF~I~~s~~~w~~R~~yik~wW~p~~~rg~v~l 157 (537)
T PLN03153 78 HPIVTPLPPPPSSPSLPSSLLLDHFRNRSLSEIERLKVEAELSLNHIMFGIAGSSQLWKRRKELVRLWWRPNQMRGHVWL 157 (537)
T ss_pred CcccccccccccccccccccccccccccccCCCCCcccCCCCccccEEEEEEEchhhhhhhhhhhhhhcCcccceeEEEe
Confidence 3433332222222222 2333322 345678999999999999999999999999999999999999999
Q ss_pred cCCccccCCCCCCCCCCCceeecCCccccccCCCCCCCCCchhhHHHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHh
Q 009021 163 EDNYKLQKGQGDDSLQLPPIMVSEDISRFRYTNPTGHPSGLRISRIVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLS 242 (546)
Q Consensus 163 D~~~~v~~~~~~~~~~LP~v~is~dts~f~Y~~~~G~~~a~ri~rmv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~Ls 242 (546)
|.. +....++.+ +|+++++.|+++|+|+++.||+++++|.+|+.|+++.+.|++||||++||||||+++||+++|+
T Consensus 158 d~~--~~~~~~~~~--~P~i~is~d~s~f~y~~~~Gh~sa~rI~rmv~et~~~~~pd~kWfVf~DDDTyf~~~NLv~~Ls 233 (537)
T PLN03153 158 EEQ--VSPEEGDDS--LPPIMVSEDTSRFRYTNPTGHPSGLRISRIVLESFRLGLPDVRWFVLGDDDTIFNADNLVAVLS 233 (537)
T ss_pred ccc--CCCCCCcCC--CCCEEeCCCcccccccCCCCcHHHHHHHHHHHHHHHhhCCCCCEEEEecCCccccHHHHHHHHh
Confidence 987 443335666 9999999999999999999999999999999999998899999999999999999999999999
Q ss_pred cCCCCCCeEEeccCCCCCCCccccccccccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 243 KYDPTEMVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 243 kyD~~ep~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
+||+++++|||..++...++..|++.||+|||||+||++||++|.+++++|..+|+..++||.+|++||+++||+||+++
T Consensus 234 ~YDptkp~YIGs~Se~~~qn~~f~~~fA~GGAG~~LSrPLae~L~~~~d~C~~rY~~~~~gD~rL~~CL~elGV~LT~~~ 313 (537)
T PLN03153 234 KYDPSEMVYVGGPSESHSANSYFSHNMAFGGGGIAISYPLAEALSRILDDCLDRYPKLYGSDDRLHACITELGVPLSREP 313 (537)
T ss_pred hcCCCCCEEecccccccccccccccccccCCceEEEcHHHHHHHHHHhhhhhhhcccCCCcHHHHHHHHHHcCCCceecC
Confidence 99999999999999999888888888999999999999999999999999999888789999999999999999999999
Q ss_pred CCCCCCCCCCCCCccCCCCCCCceecccCCCCCCCCCCCCcHHHHHHHHHHhhcCCCCeeeEEEeecCCCceEEEEEece
Q 009021 323 GFHQWDIRGNAHGLLSSHPIAPFVSIHHVEAVDPFYPGLTSLDSLKLFTKAMRADPRSFLQRSICYDLAGHFTLVVSLGY 402 (546)
Q Consensus 323 gfhq~d~~gd~~g~~es~~~~PlvSlHH~~~~~~~fP~~~~~~~~~~l~~aa~~~~~~~lqr~i~~D~~~~w~~~vs~Gy 402 (546)
||||+|+.||++||+++|+.+|+|||||++.++|+||++++.+||+++++||++|++++|||+||||..++|+|+|||||
T Consensus 314 gfhQ~D~~Gd~~G~les~p~~P~vSlHH~~~~~p~fP~~~~~~~~~~l~~a~~~d~~~~lq~siCyd~~~~w~fsvSwGy 393 (537)
T PLN03153 314 GFHQWDIRGNAHGLLSSHPIAPFVSIHHVEAVDPFYPGLSSLDSLKLFTRAMKVDPRSFLQRSICYDHTHHLTFSISLGY 393 (537)
T ss_pred CccccccCCCcchHhhcCCCCCceeeeeccccccccCCcchHHHHHHHHHHhhcCchhHHHHHHhhhcccceeEEEeccE
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEeCCCCCChhhhhHHHHHHhhhcCCCCCccccCCCCCCCCCCCCceEEEEccccccCCcEEEEEeeecccccccccc
Q 009021 403 VVQVFPNIVLPRDLDRSEKTYTAWNRLSHRNEFDFDTRDPHRSICKKPVLFFFKDVKRHGNATVGSYARATGKDYLKRKV 482 (546)
Q Consensus 403 Si~~y~~~~~~~dl~~~e~Tf~~w~~~~~~~~~~f~~r~~~~~~~~r~~~~~l~~v~~~~~~v~~~Y~r~~~~~~~~~~~ 482 (546)
||++|++++.+.||+++|+||.+|++..+...|+||||++.+++|++|++|||+++..++++|+|+|+|++.+++.+..+
T Consensus 394 sV~~y~~~~~~~dl~~~e~Tf~~w~~~~~~~~f~fntr~~~r~~c~~p~~f~l~~~~~~~~~~~~~Y~r~~~~~~~~~~~ 473 (537)
T PLN03153 394 VVQVFPSIVLPRDLERSELTYSAWNKISHRNEFDLDTRDPIKSVCKKPILFFLKDVGREGNATLGTYSRARMKDDLKRKV 473 (537)
T ss_pred EEEEecCCCCchhhhhhHhhhhhhcccCCCCCccccCCCCCCCcccCceEEEeeeccccCCeeEEEEEEecccccccccc
Confidence 99999999999999999999999999998889999999999999999999999999877889999999998776667668
Q ss_pred CCCCCCCCCCCceeEEEEcccCCCCcccccCcceeeeccCCCCEEEEEEeeecCCceeeeccCC
Q 009021 483 LCFPRSPPLNHVKNIQVLGFPLRKNWHLVPRRLCCKINQSSDAILKLTVGQCRKGALSSVTDSV 546 (546)
Q Consensus 483 ~c~~~~~~~~~~~~i~v~~~~~~~~w~~~prr~cc~~~~~~~~~~~~~~~~c~~~e~~~~~~~~ 546 (546)
.|.+.+.|+.+|+.|+|+++|++++|++||||+||+|+++++++|+|+||+|++||++++.+++
T Consensus 474 ~C~~~~~~~~~v~~i~V~~~~~~~~w~~aprr~CC~v~~~~~~~~~i~v~~C~~~e~~~~~~~~ 537 (537)
T PLN03153 474 FCFPRSLPLPYVEKIQVLGFPLSKNWHLVPRRLCCRLNQTSDELLTLTVGQCEKGSLGSFIDSV 537 (537)
T ss_pred cccccCCChhhceEEEEecCCCccchhhcchhhheeccCCCCCcEEEEEEeccCCcccccccCC
Confidence 8999877888999999999999999999999999999999999999999999999999987764
No 2
>PF04646 DUF604: Protein of unknown function, DUF604; InterPro: IPR006740 This family includes a conserved region found in several uncharacterised plant proteins.
Probab=100.00 E-value=7.5e-74 Score=565.34 Aligned_cols=252 Identities=50% Similarity=0.950 Sum_probs=243.5
Q ss_pred CCccccccccccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecCCCCCCCCCCCCCCccCCC
Q 009021 261 ANTYFSHSMAFGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEPGFHQWDIRGNAHGLLSSH 340 (546)
Q Consensus 261 ~~~~~g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~pgfhq~d~~gd~~g~~es~ 340 (546)
||..|+|.||||||||+||++|+++|.+++|.|+++|+.+|++|.+|..||+++||+||.++||||+|++||++||+++|
T Consensus 1 Qn~~fs~~MAfGGgG~~iS~pLa~~L~~~~d~C~~r~~~~~g~D~~i~~C~~~lgv~LT~e~g~hQ~Di~Gd~~G~~~a~ 80 (255)
T PF04646_consen 1 QNVMFSYNMAFGGGGFAISYPLAKALAKMQDDCIERYPHLYGGDQRIQACIAELGVPLTKEPGFHQMDIRGDPSGFLEAH 80 (255)
T ss_pred CCceeeccccccCceeEEcHHHHHHHHHHHHHHHHhcCCCCCchHHHHHHHHHhCCCceecCCceeEeeccCcceeeecC
Confidence 57889999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCceecccCCCCCCCCCCCCcHHHHHHHHHHhhcCCCCeeeEEEeecCCCceEEEEEeceEEEEeCCCCCChhhhhHH
Q 009021 341 PIAPFVSIHHVEAVDPFYPGLTSLDSLKLFTKAMRADPRSFLQRSICYDLAGHFTLVVSLGYVVQVFPNIVLPRDLDRSE 420 (546)
Q Consensus 341 ~~~PlvSlHH~~~~~~~fP~~~~~~~~~~l~~aa~~~~~~~lqr~i~~D~~~~w~~~vs~GySi~~y~~~~~~~dl~~~e 420 (546)
+..|++||||++.++|+||+|++.+||++|++||++|++.+|||+||||+.++|+||||||||||+|++.++++||++++
T Consensus 81 ~~~pl~SlHH~~~~~PifP~~~~~~al~~L~~a~~~d~~~~lqqsicyd~~~~wsvsVSwGYsVqvy~~~l~~~dLe~~~ 160 (255)
T PF04646_consen 81 PLAPLVSLHHWDSVDPIFPNMSRLQALRHLLKAAKVDPARILQQSICYDRRRNWSVSVSWGYSVQVYRGILTPRDLETPE 160 (255)
T ss_pred CCCceeeeeehhhccccCCCCCHHHHHHHHHHHHhhChHhhhheeeeccCceEEEEEEEccEEEEEECCCCChHHHhhHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhhcCCCCCccccCCCCCCCCCCCCceEEEEccccc--cCCcEEEEEeeeccccccccccCCCCCCC-CCCCceeE
Q 009021 421 KTYTAWNRLSHRNEFDFDTRDPHRSICKKPVLFFFKDVKR--HGNATVGSYARATGKDYLKRKVLCFPRSP-PLNHVKNI 497 (546)
Q Consensus 421 ~Tf~~w~~~~~~~~~~f~~r~~~~~~~~r~~~~~l~~v~~--~~~~v~~~Y~r~~~~~~~~~~~~c~~~~~-~~~~~~~i 497 (546)
+||.+|++.++..+|+|||||+++++|+||++|||++|.. +++++++.|+|...+ .++|.+.+. |+.+|+.|
T Consensus 161 rTF~~W~~~~~~~~f~FnTRp~~~dpC~rP~vffL~~v~~~~~~~~t~s~Y~r~~~~-----~~~C~~~~~~p~~~v~~I 235 (255)
T PF04646_consen 161 RTFRTWYRRSDRTPFAFNTRPVPRDPCQRPTVFFLSSVRSDSGSNQTVSSYVRHRVR-----NPNCCWPMADPLSKVQRI 235 (255)
T ss_pred HHhhcccCcCcCCceeccCCCCcCCCCCCCeEEEEeeeeecCCCCeEEEEEEecccC-----CCCCCCCCCCchhhceEE
Confidence 9999999999989999999999999999999999999985 688999999999876 368999875 77999999
Q ss_pred EEEcccCCCCcccccCccee
Q 009021 498 QVLGFPLRKNWHLVPRRLCC 517 (546)
Q Consensus 498 ~v~~~~~~~~w~~~prr~cc 517 (546)
+|+++++++.|++|||||||
T Consensus 236 ~V~~k~~~~~w~~aPRR~CC 255 (255)
T PF04646_consen 236 RVLKKPDPDLWKKAPRRQCC 255 (255)
T ss_pred EEEcccCCcccccCccccCC
Confidence 99999999999999999999
No 3
>KOG2246 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=100.00 E-value=2.1e-43 Score=370.89 Aligned_cols=274 Identities=35% Similarity=0.496 Sum_probs=235.7
Q ss_pred cCCCCCcEEEEEecCcccHHHHHHHHHHhhhccCCcEEEEecCCccccCCCCCCCCCCCceeecCCccccc---cCCCCC
Q 009021 122 EELSLKHIVFGIAGSSHLWKRRKEFVRLWWLPNLMRGHVWLEDNYKLQKGQGDDSLQLPPIMVSEDISRFR---YTNPTG 198 (546)
Q Consensus 122 ~~~~~~~Ivf~I~Ts~~~~~tR~~~ik~tW~~~~~r~~vFsD~~~~v~~~~~~~~~~LP~v~is~dts~f~---Y~~~~G 198 (546)
.......|+|.|+|++..+.+|++.+++||.++|.++.+|+. + ++.++++|+ |++++|
T Consensus 86 ~l~r~~~v~cwv~t~~~~~~~~~~~v~~TW~~rc~~~~f~s~------------------~-~s~~~~~f~~v~~~~~~g 146 (364)
T KOG2246|consen 86 WLSRSGRVLCWVLTSPMRHVTRADAVKETWLKRCDKGIFFSP------------------T-LSKDDSRFPTVYYNLPDG 146 (364)
T ss_pred ccCCCceEEEEEEecCcCceeehhhhhcccccccCcceecCc------------------c-CCCCCCcCceeeccCCcc
Confidence 345556899999999999999999999999999988666653 1 222333333 556678
Q ss_pred CCCCchhhHHHHHHHH-hcCCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCCccccccccccCcccc
Q 009021 199 HPSGLRISRIVIECFR-LGLPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSANTYFSHSMAFGGGGIA 277 (546)
Q Consensus 199 ~~~a~ri~rmv~e~~~-~~~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~~~~g~~fa~GGaG~v 277 (546)
.+.+|++.+++.++++ ++.+++|||+++||||||+++||+++|.+|||++|+|||..++...++. +++.+|.||+|++
T Consensus 147 ~~~~~~ktr~~~~yv~~~~~~~~dWf~~aDDDTy~i~eNLr~~L~~yDp~~p~YiG~~~~~~~~~~-y~~g~ag~~ls~a 225 (364)
T KOG2246|consen 147 YRSLWRKTRIAFKYVYDHILKDYDWFLKADDDTYFIMENLRYVLSKYDPEKPVYLGYRSKSYFQNG-YSSGGAGYVLSFA 225 (364)
T ss_pred hHHHHHHHHHHHHHHHHhccCCCCeEEeccCCeEEeHHHHHHHHhhcCCCCcEEeccccccccccc-cccCCCCcceeHH
Confidence 8899999999999886 7889999999999999999999999999999999999999999887775 4434444444555
Q ss_pred cCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecCCCCCCCCCCCCCCccCCCCCCCceecccCCCCCCC
Q 009021 278 ISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEPGFHQWDIRGNAHGLLSSHPIAPFVSIHHVEAVDPF 357 (546)
Q Consensus 278 LSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~pgfhq~d~~gd~~g~~es~~~~PlvSlHH~~~~~~~ 357 (546)
.++.++++|.+..+.|..++.. +++|.+||+||+++||+++.+ ||.|..+...|+..+|+..|++++||+.-+ +
T Consensus 226 a~~~la~~l~~~~~~C~~~~~~-~~eD~~i~~Cl~~~GV~~~d~---~d~dg~~rf~~~~p~~~~~p~~s~~~~~~~--~ 299 (364)
T KOG2246|consen 226 ALRRLAERLLNNEDKCPQRYPS-YGEDRRIGRCLAEVGVPATDE---RDEDGRGRFLPLLPAHPIAPLVSLHHLWLV--Y 299 (364)
T ss_pred HHHHHHHHHhcchhhcccccCC-chhHHHHHHHHHHhCCCccCc---hhhhcccccCCCChhhccCCccccccceee--e
Confidence 5555555566677889988765 789999999999999999998 999999999999999999999999999988 9
Q ss_pred CCCCCcHHHHHHHHHHhhcCCCCeeeEEEeecCCCceEEEEEeceEEEEeCCCCCChhhhhHHHHHHhhh
Q 009021 358 YPGLTSLDSLKLFTKAMRADPRSFLQRSICYDLAGHFTLVVSLGYVVQVFPNIVLPRDLDRSEKTYTAWN 427 (546)
Q Consensus 358 fP~~~~~~~~~~l~~aa~~~~~~~lqr~i~~D~~~~w~~~vs~GySi~~y~~~~~~~dl~~~e~Tf~~w~ 427 (546)
||+++...+.++++.+++.++. .+|+.+|||..+.|+++++|||.+++++.... ++++||.+|+
T Consensus 300 fp~~~~~~~~s~~~vsfh~~~~-~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~t~~~~~ 363 (364)
T KOG2246|consen 300 FPNQNGSGCCSDLAVSFHYLSP-IEMQSFCYDIYRLRTFGVSWGYTVQIIRPNLS-----RPSRTFSSWN 363 (364)
T ss_pred cCCCchhhHHHHhhHhhccCCH-HHHHHHhhhhhheeeccccccccccccccccc-----ccccccCCCC
Confidence 9999999999999999999988 99999999999999999999999999998876 8889999986
No 4
>PF02434 Fringe: Fringe-like; InterPro: IPR003378 The Notch receptor is a large, cell surface transmembrane protein involved in a wide variety of developmental processes in higher organisms []. It becomes activated when its extracellular region binds to ligands located on adjacent cells. Much of this extracellular region is composed of EGF-like repeats, many of which can be O-fucosylated. A number of these O-fucosylated repeats can in turn be further modified by the action of a beta-1,3-N-acetylglucosaminyltransferase enzyme known as Fringe []. Fringe potentiates the activation of Notch by Delta ligands, while inhibiting activation by Serrate/Jagged ligands. This regulation of Notch signalling by Fringe is important in many processes []. Four distinct Fringe proteins have so far been studied in detail; Drosophila Fringe (Dfng) and its three mammalian homologues Lunatic Fringe (Lfng), Radical Fringe (Rfng) and Manic Fringe (Mfng). Dfng, Lfng and Rfng have all been shown to play important roles in developmental processes within their host, though the phenotype of mutants can vary between species e.g. Rfng mutants are retarded in wing development in chickens, but have no obvious phenotype in mice [, , ]. Mfng mutants have not, so far, been charcterised. Biochemical studies indicate that the Fringe proteins are fucose-specific transferases requiring manganese for activity and utilising UDP-N-acetylglucosamine as a donor substrate []. The three mammalian proteins show distinct variations in their catalytic efficiencies with different substrates. Dfng is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands which is localised to the Golgi apparatus [] (not secreted as previously thought). Modification of Notch occurs through glycosylation by Dfng. This entry consists of Fringe proteins and related glycosyltransferase enzymes including: Beta-1,3-glucosyltransferase, which glucosylates O-linked fucosylglycan on thrombospondin type 1 repeat domains []. Core 1 beta1,3-galactosyltransferase 1, generates the core T antigen, which is a precursor for many extended O-glycans in glycoproteins and plays a central role in many processes, such as angiogenesis, thrombopoiesis and kidney homeostasis development []. ; GO: 0016757 transferase activity, transferring glycosyl groups, 0016020 membrane; PDB: 2J0B_A 2J0A_A.
Probab=100.00 E-value=1.1e-33 Score=284.88 Aligned_cols=210 Identities=29% Similarity=0.447 Sum_probs=121.3
Q ss_pred CCCCCcEEEEEecCcccHHHHHHHHHHhhhccCCcEE-EEecCCccccCCCCCCCCCCCce----eecCCccccccCCCC
Q 009021 123 ELSLKHIVFGIAGSSHLWKRRKEFVRLWWLPNLMRGH-VWLEDNYKLQKGQGDDSLQLPPI----MVSEDISRFRYTNPT 197 (546)
Q Consensus 123 ~~~~~~Ivf~I~Ts~~~~~tR~~~ik~tW~~~~~r~~-vFsD~~~~v~~~~~~~~~~LP~v----~is~dts~f~Y~~~~ 197 (546)
++++++|+|+|+|++++|++|+++|+.||+++|.+.. +|+|.++ .. +|.+ .+..+++ .
T Consensus 2 ~~~~~dI~i~V~T~~k~h~tR~~~I~~TW~~~~~~~~~ifsd~~d--------~~--l~~~~~~~l~~~~~~-------~ 64 (252)
T PF02434_consen 2 PVTLDDIFIAVKTTKKFHKTRAPAIKQTWAKRCNKQTFIFSDAED--------PS--LPTVTGVHLVNPNCD-------A 64 (252)
T ss_dssp ---GGGEEEEEE--GGGTTTTHHHHHHTGGGGSGGGEEEEESS----------HH--HHHHHGGGEEE------------
T ss_pred CcccccEEEEEEeCHHHHHHHHHHHHHHHHhhcCCceEEecCccc--------cc--cccccccccccCCCc-------c
Confidence 6889999999999999999999999999999887544 4899763 12 4433 1223332 2
Q ss_pred CCC-CCchhhHHHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCCc----------ccc
Q 009021 198 GHP-SGLRISRIVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSANT----------YFS 266 (546)
Q Consensus 198 G~~-~a~ri~rmv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~~----------~~g 266 (546)
++. +++.+ .|..+...+..+++|||+++|||||++++||+++|++||+++++|||.++....... ..+
T Consensus 65 ~~~~~~~~~-~~~~~y~~~~~~~~~Wf~~~DDDtyv~~~~L~~~L~~~~~~~~~yiG~~~~~~~~~~~~~~~~~~~~~~~ 143 (252)
T PF02434_consen 65 GHCRKTLSC-KMAYEYDHFLNSDKDWFCFADDDTYVNVENLRRLLSKYDPSEPIYIGRPSGDRPIEIIHRFNPNKSKDSG 143 (252)
T ss_dssp --------H-HHHHHHHHHHHHT-SEEEEEETTEEE-HHHHHHHHTTS-TTS--EEE-EE--------------------
T ss_pred hhhHHHHHH-HHHHHHHhhhcCCceEEEEEeCCceecHHHHHHHHhhCCCccCEEeeeeccCccceeeccccccccCcCc
Confidence 333 34444 333333333457899999999999999999999999999999999999987553222 124
Q ss_pred ccccccCcccccCHHHHHHHHhhhccc--cccCCC-CCCchHHHHHHHHH-cCCCceecCCCCCCCCC--CCCCCccCCC
Q 009021 267 HSMAFGGGGIAISYPLAEALSNIHDDC--IDRYPK-LYGSDDRLHACITE-LGIPLTVEPGFHQWDIR--GNAHGLLSSH 340 (546)
Q Consensus 267 ~~fa~GGaG~vLSr~LlkkL~~~~d~C--~~~~~~-~~~~D~~Lg~Cl~~-lGV~LT~~pgfhq~d~~--gd~~g~~es~ 340 (546)
+.|++|||||+||++||++|.+....| ...... ...+|..||.||+. +||++|+.+.|||+-.. ......+..
T Consensus 144 ~~f~~GGaG~vlSr~~~~k~~~~~~~~~~~~~~~~~~~~dD~~lG~ci~~~lgv~lt~s~~fhs~~~~l~~~~~~~l~~- 222 (252)
T PF02434_consen 144 FWFATGGAGYVLSRALLKKMSPWASGCKCPSTDEKIRLPDDMTLGYCIENLLGVPLTHSPLFHSHLENLQDYNPETLHR- 222 (252)
T ss_dssp --EE-GGG-EEEEHHHHHHHHHHHTT-TTS--TTTTTS-HHHHHHHHHHHTT---EEE-TT---SSS-GGG--TTTGGG-
T ss_pred eEeeCCCeeHHHhHHHHHHHhhhcccccccCCcCCCCCcccChhhhhHHhcCCcceeechhhcccCcccccCCHHHhcc-
Confidence 679999999999999999998765544 332221 24578899999999 99999999999997533 222233333
Q ss_pred CCCCceecccCCC
Q 009021 341 PIAPFVSIHHVEA 353 (546)
Q Consensus 341 ~~~PlvSlHH~~~ 353 (546)
+..||+|+...
T Consensus 223 --q~~~s~~~~~~ 233 (252)
T PF02434_consen 223 --QVPISYHKFEN 233 (252)
T ss_dssp ---SEEE-EEETT
T ss_pred --CCCeecCCCcC
Confidence 34589999963
No 5
>KOG3708 consensus Uncharacterized conserved protein [Function unknown]
Probab=99.51 E-value=2.8e-14 Score=152.09 Aligned_cols=177 Identities=21% Similarity=0.282 Sum_probs=134.9
Q ss_pred CCCCCCccccccCCCCCcEEEEEecCcccHHHHHHHHHHhhhccCCcEEEEecCCccccCCCCCCCCCCCceeecCCccc
Q 009021 111 NRSLSLSEEKQEELSLKHIVFGIAGSSHLWKRRKEFVRLWWLPNLMRGHVWLEDNYKLQKGQGDDSLQLPPIMVSEDISR 190 (546)
Q Consensus 111 ~~~~s~~~~~~~~~~~~~Ivf~I~Ts~~~~~tR~~~ik~tW~~~~~r~~vFsD~~~~v~~~~~~~~~~LP~v~is~dts~ 190 (546)
.|+++.++ +.-+.++++++|+|.. +-+.+++.|.+++++|+.+|.|... + +.. +-...+. |
T Consensus 13 ~RtRy~~t---ELG~RErl~~aVmte~----tlA~a~NrT~ahhvprv~~F~~~~~-i-----~~~--~a~~~~v---s- 73 (681)
T KOG3708|consen 13 TRTRYAAT---ELGTRERLMAAVMTES----TLALAINRTLAHHVPRVHLFADSSR-I-----DND--LAQLTNV---S- 73 (681)
T ss_pred chhhhhhh---hhhhHHHHHHHHHHHH----HHHHHHHHHHHhhcceeEEeecccc-c-----ccc--Hhhcccc---C-
Confidence 46777776 2334458999999922 7788899999999999999999761 1 221 2212111 1
Q ss_pred cccCCCCCCCCCchhhHHHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCCcccccccc
Q 009021 191 FRYTNPTGHPSGLRISRIVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSANTYFSHSMA 270 (546)
Q Consensus 191 f~Y~~~~G~~~a~ri~rmv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~~~~g~~fa 270 (546)
....+..|+.+..+...+.++..++|||+++-||||++...|++++.+.+.++++|+|.-.++.. .-|
T Consensus 74 -----~~d~r~~~~~s~vl~~l~~~~~~~YDwFll~~D~tYv~a~~L~~l~~hmsin~dlymGEe~~~gs-------~rC 141 (681)
T KOG3708|consen 74 -----PYDLRGQKTHSMVLGLLFNMVHNNYDWFLLAKDSTYVNAFVLLRLIDHMSINEDLYMGEEAEDGS-------GRC 141 (681)
T ss_pred -----ccccCccccHHHHHHHHHHhhccccceEEEecCcceecHHHHHHHHhhcccccccccchhhhCcc-------Ccc
Confidence 01224577777777777777889999999999999999999999999999999999996655322 138
Q ss_pred ccCcccccCHHHHHHHHhhhccccccCCCCC-CchHHHHHHHHH-cCCCcee
Q 009021 271 FGGGGIAISYPLAEALSNIHDDCIDRYPKLY-GSDDRLHACITE-LGIPLTV 320 (546)
Q Consensus 271 ~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~-~~D~~Lg~Cl~~-lGV~LT~ 320 (546)
.+|.|++||++++.+|.++.+.|.... +- -.|..||+||.. +||.|+.
T Consensus 142 ~l~~G~LLS~s~l~~lrnnle~C~~~~--lsad~d~~lgrCi~~At~v~C~~ 191 (681)
T KOG3708|consen 142 RLDTGMLLSQSLLHALRNNLEGCRNDI--LSADPDEWLGRCIQDATGVGCKP 191 (681)
T ss_pred ccccceeecHHHHHHHHhhHHHhhccc--ccCCcHHHHHHHHHHhhcCCccc
Confidence 999999999999999999999997531 22 246689999986 7998864
No 6
>KOG2287 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=99.18 E-value=1.4e-10 Score=122.64 Aligned_cols=208 Identities=19% Similarity=0.274 Sum_probs=130.0
Q ss_pred CcEEEEEecCcccHHHHHHHHHHhhhccC------CcEEEEecCCccc---cCCCCCCCCCCCceeecCCccccccCCCC
Q 009021 127 KHIVFGIAGSSHLWKRRKEFVRLWWLPNL------MRGHVWLEDNYKL---QKGQGDDSLQLPPIMVSEDISRFRYTNPT 197 (546)
Q Consensus 127 ~~Ivf~I~Ts~~~~~tR~~~ik~tW~~~~------~r~~vFsD~~~~v---~~~~~~~~~~LP~v~is~dts~f~Y~~~~ 197 (546)
-+|+++|+|..... .|..+++.||+... .+.++++-...+. +..-..++-.--.+.+ .|-.+ .|.
T Consensus 95 ~~lLl~V~S~~~~f-arR~aiR~TW~~~~~v~~~~v~~~FLvG~~~~~~~~~~~l~~Ea~~ygDIi~-~df~D-ty~--- 168 (349)
T KOG2287|consen 95 PELLLLVKSAPDNF-ARRNAIRKTWGNENNVRGGRVRVLFLVGLPSNEDKLNKLLADEARLYGDIIQ-VDFED-TYF--- 168 (349)
T ss_pred ceEEEEEecCCCCH-HHHHHHHHHhcCccccCCCcEEEEEEecCCCcHHHHHHHHHHHHHHhCCEEE-Eeccc-chh---
Confidence 47999999988776 46678888887643 3444554433100 0000000000011111 11100 121
Q ss_pred CCCCCchhh-HHHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcC-CCCCCeEEeccCCCCCCC----ccc------
Q 009021 198 GHPSGLRIS-RIVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKY-DPTEMVYIGTPSESHSAN----TYF------ 265 (546)
Q Consensus 198 G~~~a~ri~-rmv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~Lsky-D~~ep~YIG~~se~~~~~----~~~------ 265 (546)
.+.+. -++.++...++|+++.++++|||+||++++|+++|.+. ++.+.+|.|......... ..|
T Consensus 169 ----nltlKtl~~l~w~~~~cp~akfi~K~DDDvfv~~~~L~~~L~~~~~~~~~~~~G~v~~~~~p~R~~~~KwyVp~~~ 244 (349)
T KOG2287|consen 169 ----NLTLKTLAILLWGVSKCPDAKFILKIDDDVFVNPDNLLEYLDKLNDPSSDLYYGRVIQNAPPIRDKTSKWYVPESE 244 (349)
T ss_pred ----chHHHHHHHHHHHHhcCCcceEEEeccCceEEcHHHHHHHHhccCCCCcceEEEeecccCCCCCCCCCCCccCHHH
Confidence 22221 24456666679999999999999999999999999999 999999999887752211 111
Q ss_pred ----cccccccCcccccCHHHHHHHHhhhccccccCCCCC-CchHHHHHHHHHc-CCCceecCCCCCCCCCCCCCCccCC
Q 009021 266 ----SHSMAFGGGGIAISYPLAEALSNIHDDCIDRYPKLY-GSDDRLHACITEL-GIPLTVEPGFHQWDIRGNAHGLLSS 339 (546)
Q Consensus 266 ----g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~-~~D~~Lg~Cl~~l-GV~LT~~pgfhq~d~~gd~~g~~es 339 (546)
.|+-..+|+||+||+.++++|......- ..+ -+|..+|.||++. |+.....+++......-+.+.+
T Consensus 245 y~~~~YP~Y~sG~gYvis~~~a~~l~~~s~~~-----~~~~iEDV~~g~~l~~~~gi~~~~~~~~~~~~~~~~~~~~--- 316 (349)
T KOG2287|consen 245 YPCSVYPPYASGPGYVISGDAARRLLKASKHL-----KFFPIEDVFVGGCLAEDLGIKPVNHPGFFEIPLSFDPCCY--- 316 (349)
T ss_pred CCCCCCCCcCCCceeEecHHHHHHHHHHhcCC-----CccchHHHHHHHHHHHhcCCCcccCcccccccccCCCCcc---
Confidence 2443457889999999999999843221 234 4777999999986 9998887774443222222222
Q ss_pred CCCCCceecccCCCCC
Q 009021 340 HPIAPFVSIHHVEAVD 355 (546)
Q Consensus 340 ~~~~PlvSlHH~~~~~ 355 (546)
.-+++.|..++.+
T Consensus 317 ---~~~~~~H~~~p~e 329 (349)
T KOG2287|consen 317 ---RDLLAVHRLSPNE 329 (349)
T ss_pred ---cceEEEecCCHHH
Confidence 3478999998654
No 7
>PLN03193 beta-1,3-galactosyltransferase; Provisional
Probab=99.08 E-value=1.3e-09 Score=115.95 Aligned_cols=114 Identities=20% Similarity=0.158 Sum_probs=82.3
Q ss_pred HHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCC---C------ccc-------ccccccc
Q 009021 209 VIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSA---N------TYF-------SHSMAFG 272 (546)
Q Consensus 209 v~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~---~------~~~-------g~~fa~G 272 (546)
...+... +.++++|+++|||+|+++++|+++|++......+|+|........ . ..| .|+....
T Consensus 227 ~f~wA~~-~~dAkF~mK~DDDvfVnv~~L~~~L~~~~~~~rlYiG~m~~gPvr~~~~~ky~epe~w~~~~~~~~YPpyAs 305 (408)
T PLN03193 227 YFATAVA-MWDADFYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVLSQKGVRYHEPEYWKFGENGNKYFRHAT 305 (408)
T ss_pred HHHHHHH-cCCCeEEEEcCCCceEcHHHHHHHHHhcCCCCCEEEEecccCccccCCCCcCcCcccccccCccccCCCCCC
Confidence 3344333 568999999999999999999999988766667999998542110 0 011 1322246
Q ss_pred CcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecCCCCCC
Q 009021 273 GGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEPGFHQW 327 (546)
Q Consensus 273 GaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~pgfhq~ 327 (546)
|+||+||+.+++.|......- ..| -.+|+.+|.||..++|...+.+.|+..
T Consensus 306 G~gYVlS~DLa~~I~~n~~~L-~~y---~~EDV~vG~Wl~~L~V~~vdd~~fcc~ 356 (408)
T PLN03193 306 GQLYAISKDLASYISINQHVL-HKY---ANEDVSLGSWFIGLDVEHIDDRRLCCG 356 (408)
T ss_pred cceEEehHHHHHHHHhChhhh-ccc---CcchhhhhhHhccCCceeeecccccCC
Confidence 779999999999998654322 122 257889999998888888888888764
No 8
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=99.03 E-value=1.1e-09 Score=106.00 Aligned_cols=110 Identities=21% Similarity=0.305 Sum_probs=82.5
Q ss_pred HHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcC--CCCCCeEEeccCCCCCC----Cccc----------cc-ccc
Q 009021 208 IVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKY--DPTEMVYIGTPSESHSA----NTYF----------SH-SMA 270 (546)
Q Consensus 208 mv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~Lsky--D~~ep~YIG~~se~~~~----~~~~----------g~-~fa 270 (546)
+..+|...+++++++++++|||+|+++++|.+.|.+. +..+..+.|........ ...| .| +||
T Consensus 69 ~~~~w~~~~c~~~~~v~k~DDD~~vn~~~l~~~L~~~~~~~~~~~~~g~~~~~~~~~r~~~~kw~v~~~~y~~~~yP~y~ 148 (195)
T PF01762_consen 69 AGLKWASKHCPNAKYVLKVDDDVFVNPDRLVSFLKSLKQDPSKNSIYGGCIKNGPPIRDPSSKWYVSEEEYPDDYYPPYC 148 (195)
T ss_pred HHHHHHHhhCCchhheeecCcEEEEehHHhhhhhhhcccCccccccccccccCCccccccccCceeeeeecccccCCCcC
Confidence 3456666679999999999999999999999999987 77777787877654321 1111 12 455
Q ss_pred ccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 271 FGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 271 ~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
.|+||+||+.+++.|...... .+...-+|..+|.|+.++||+.++.|
T Consensus 149 -~G~~yvls~~~v~~i~~~~~~----~~~~~~eDv~iGi~~~~~~i~~~~~~ 195 (195)
T PF01762_consen 149 -SGGGYVLSSDVVKRIYKASSH----TPFFPLEDVFIGILAEKLGIKPIHDP 195 (195)
T ss_pred -CCCeEEecHHHHHHHHHHhhc----CCCCCchHHHHHHHHHHCCCCccCCC
Confidence 577999999999999975432 22334578889999999999987653
No 9
>PLN03133 beta-1,3-galactosyltransferase; Provisional
Probab=98.97 E-value=6.8e-09 Score=116.35 Aligned_cols=181 Identities=14% Similarity=0.217 Sum_probs=105.6
Q ss_pred cEEEEEecCcccHHHHHHHHHHhhhccC------CcEEEEecCCcc--ccCCCCCCCCCCCceeecCCccccccCCCCCC
Q 009021 128 HIVFGIAGSSHLWKRRKEFVRLWWLPNL------MRGHVWLEDNYK--LQKGQGDDSLQLPPIMVSEDISRFRYTNPTGH 199 (546)
Q Consensus 128 ~Ivf~I~Ts~~~~~tR~~~ik~tW~~~~------~r~~vFsD~~~~--v~~~~~~~~~~LP~v~is~dts~f~Y~~~~G~ 199 (546)
.++++|.|+.... .|..+|+.||++.. ....+++-...+ ++..-..++...-.+.+ .|.-+ .|.
T Consensus 386 ~LlI~V~Sap~nf-~rR~AIR~TWg~~~~~~~~~v~~rFvVG~s~n~~l~~~L~~Ea~~ygDIIq-~dF~D-sY~----- 457 (636)
T PLN03133 386 DLFIGVFSTANNF-KRRMAVRRTWMQYDAVRSGAVAVRFFVGLHKNQMVNEELWNEARTYGDIQL-MPFVD-YYS----- 457 (636)
T ss_pred EEEEEEeCCcccH-HHHHHHHHhhccccccCCCceEEEEEEecCCcHHHHHHHHHHHHHcCCeEE-Eeeec-hhh-----
Confidence 6899999888765 57778888888631 123344432210 00000000000011111 11100 121
Q ss_pred CCCchhhHHHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCC----Cccc----------
Q 009021 200 PSGLRISRIVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSA----NTYF---------- 265 (546)
Q Consensus 200 ~~a~ri~rmv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~----~~~~---------- 265 (546)
.+.+.-++...+..+++++++++++|||+|+++++|+++|.+.+..+.+|+|........ ...|
T Consensus 458 --NLTlKtl~~~~wa~~c~~akFilK~DDDvFVnv~~Ll~~L~~~~~~~~Ly~G~v~~~~~PiRd~~sKWYVs~~eyp~~ 535 (636)
T PLN03133 458 --LITWKTLAICIFGTEVVSAKYVMKTDDDAFVRVDEVLASLKRTNVSHGLLYGLINSDSQPHRNPDSKWYISPEEWPEE 535 (636)
T ss_pred --hhHHHHHHHHHHHHhCCCceEEEEcCCceEEcHHHHHHHHHhcCCCCceEEEEeccCCCcccCCCCCCCCCHHHCCCC
Confidence 222222222233346899999999999999999999999998888888999987653221 1111
Q ss_pred cccccccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHH---cCCCcee
Q 009021 266 SHSMAFGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITE---LGIPLTV 320 (546)
Q Consensus 266 g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~---lGV~LT~ 320 (546)
.|+-..+|+||+||+.+++.|........-. ...-+|+.+|.|+++ +|+++..
T Consensus 536 ~YPpYasG~gYVlS~Dla~~L~~~s~s~~l~--~f~lEDVyvGi~l~~l~k~gl~v~~ 591 (636)
T PLN03133 536 TYPPWAHGPGYVVSRDIAKEVYKRHKEGRLK--MFKLEDVAMGIWIAEMKKEGLEVKY 591 (636)
T ss_pred CCCCCCCcCEEEEcHHHHHHHHHhhhhcccC--cCChhhHhHHHHHHHhcccCCCcee
Confidence 2433346789999999999998754322111 123478899999975 4655543
No 10
>PTZ00210 UDP-GlcNAc-dependent glycosyltransferase; Provisional
Probab=98.61 E-value=1.9e-07 Score=98.52 Aligned_cols=142 Identities=20% Similarity=0.185 Sum_probs=89.6
Q ss_pred HHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCCccccccccccCcccccCHHHHHHHH
Q 009021 208 IVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALS 287 (546)
Q Consensus 208 mv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~ 287 (546)
|..++....+|++++++++|||+|++++++++.|.. .+.+.+|+|...........-.-+| .+|.||+||+.+++.|.
T Consensus 188 l~~~wA~~~cP~a~YImKgDDDvFVrVp~lL~~Lr~-~prr~LY~G~v~~~~~p~Rd~~PpY-~~G~gYvLSrDVA~~Lv 265 (382)
T PTZ00210 188 LWLRFALHMFPNVSYIVKGDDDIFIRVPKYLADLRV-MPRHGLYMGRYNYYNRIWRRNQLTY-VNGYCITLSRDTAQAII 265 (382)
T ss_pred HHHHHHHHhCCCCCeEEEcCCCeEeeHHHHHHHHhh-CCCCceEEEeeCCCCccccCCCCCc-cccceeeccHHHHHHHH
Confidence 344555567899999999999999999999999955 5677899998876432111111245 46779999999999998
Q ss_pred hhhcc--c-cccCC----------CCCCchHHHHHHHHH-cC-CCc-eecC-CCCCCCCCCCCCCccCCCCCC-Cceecc
Q 009021 288 NIHDD--C-IDRYP----------KLYGSDDRLHACITE-LG-IPL-TVEP-GFHQWDIRGNAHGLLSSHPIA-PFVSIH 349 (546)
Q Consensus 288 ~~~d~--C-~~~~~----------~~~~~D~~Lg~Cl~~-lG-V~L-T~~p-gfhq~d~~gd~~g~~es~~~~-PlvSlH 349 (546)
....- - ...|. ....+|.++|.-|.. ++ -++ +... .-|-+|..... +. +.+. -.|-+|
T Consensus 266 s~~pl~rL~~~pys~~~~~~y~~~~~~~EDiMvG~vLr~~~k~~~l~~V~~~~c~Fhd~~~~~-~~---~~v~~~sVvvH 341 (382)
T PTZ00210 266 SYKPLERLVNMPFSMWDYFDFLDLGMFYEDVMVGMILREKVVYRNLISVEMGRCHFHNAGKFG-VR---KSVRNMSVVIH 341 (382)
T ss_pred hhChHhHhhcCCCchHHHHHHHHhhcCchHHHHHHHHHHhcCcCceeeeccccccceecCCCC-Cc---cccccceEEEE
Confidence 75221 1 00111 123589999999954 32 222 2223 33333443211 11 1122 358999
Q ss_pred cCCCCC
Q 009021 350 HVEAVD 355 (546)
Q Consensus 350 H~~~~~ 355 (546)
|+++.+
T Consensus 342 hike~d 347 (382)
T PTZ00210 342 HIQEAD 347 (382)
T ss_pred ecCHHH
Confidence 999864
No 11
>KOG2288 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=97.89 E-value=9.9e-06 Score=81.49 Aligned_cols=104 Identities=23% Similarity=0.300 Sum_probs=72.0
Q ss_pred CCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCC--C-Ccc-------c-----cccccccCcccccCHH
Q 009021 217 LPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHS--A-NTY-------F-----SHSMAFGGGGIAISYP 281 (546)
Q Consensus 217 ~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~--~-~~~-------~-----g~~fa~GGaG~vLSr~ 281 (546)
.=++++|+++|||+|+++..|...|+++-....+|||......- + ... | -|++|+| +||+||+.
T Consensus 106 ~~daeFyvKvDDDv~v~l~~L~~~la~~r~~pr~YiGcmksg~v~~~~~~kw~EpeWkfg~~g~YfrhA~G-~~YvlS~d 184 (274)
T KOG2288|consen 106 HWDAEFYVKVDDDVYVRLARLGTLLARERSHPRLYIGCMKSGPVLTQPGGKWYEPEWKFGDNGNYFRHATG-GGYVLSKD 184 (274)
T ss_pred hccceEEEEccccceecHHHHHHHHHhhccCCceEEEEecCCccccCCCCcccChhhhcCcccccchhccC-ceEEeeHH
Confidence 45899999999999999999999999987778899999764321 0 000 1 1256765 58999999
Q ss_pred HHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecCCCC
Q 009021 282 LAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEPGFH 325 (546)
Q Consensus 282 LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~pgfh 325 (546)
|+.-|.-..+- +..| -.+|+-||.-|.-+.|.-.+.+.++
T Consensus 185 La~yi~in~~l-L~~y---~nEDVSlGaW~~gldV~h~dd~rlC 224 (274)
T KOG2288|consen 185 LATYISINRQL-LHKY---ANEDVSLGAWMIGLDVEHVDDPRLC 224 (274)
T ss_pred HHHHHHHhHHH-HHhh---ccCCcccceeeeeeeeeEecCCccc
Confidence 99988765433 3222 2367788887654444444444443
No 12
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=93.39 E-value=0.19 Score=45.06 Aligned_cols=84 Identities=18% Similarity=0.264 Sum_probs=58.5
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCC-CeEEeccCCCCCCCccccccccccCcccccCHHHHHHHHhhhccccccC
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTE-MVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALSNIHDDCIDRY 297 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~e-p~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~ 297 (546)
+.+|++++|||.++..+.|.+++..+.... -..+|.. ..|+++++++.+++++....+..
T Consensus 74 ~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~---- 134 (166)
T cd04186 74 KGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPK---------------VSGAFLLVRREVFEEVGGFDEDF---- 134 (166)
T ss_pred CCCEEEEECCCcEECccHHHHHHHHHHhCCCceEEEcc---------------CceeeEeeeHHHHHHcCCCChhh----
Confidence 799999999999999888888877554332 2233333 46788999999999875432222
Q ss_pred CCCCCchHHHHHHHHHcCCCceecC
Q 009021 298 PKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 298 ~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
..++.|..+...+.+.|.++...|
T Consensus 135 -~~~~eD~~~~~~~~~~g~~i~~~~ 158 (166)
T cd04186 135 -FLYYEDVDLCLRARLAGYRVLYVP 158 (166)
T ss_pred -hccccHHHHHHHHHHcCCeEEEcc
Confidence 125678888888877787765443
No 13
>PF13506 Glyco_transf_21: Glycosyl transferase family 21
Probab=93.32 E-value=0.44 Score=45.69 Aligned_cols=100 Identities=18% Similarity=0.301 Sum_probs=64.5
Q ss_pred CCCccEEEEEcCCccccHHHHHHHHhcC-CCCCCeEEe----ccCCCCCCC--------------ccccccccccCcccc
Q 009021 217 LPNVRWFVLADDDTIFNSDNLVAVLSKY-DPTEMVYIG----TPSESHSAN--------------TYFSHSMAFGGGGIA 277 (546)
Q Consensus 217 ~p~~kWfv~~DDDTy~~~~nL~~~Lsky-D~~ep~YIG----~~se~~~~~--------------~~~g~~fa~GGaG~v 277 (546)
....+++++.|+|+.+..+-|.++++.+ |++..+.-| .+..+.... ..-+..++.| +.++
T Consensus 29 ~a~~d~~~~~DsDi~v~p~~L~~lv~~l~~p~vglVt~~~~~~~~~~~~~~l~~~~~~~~~~~~~a~~~~~~~~G-~~m~ 107 (175)
T PF13506_consen 29 GAKYDYLVISDSDIRVPPDYLRELVAPLADPGVGLVTGLPRGVPARGFWSRLEAAFFNFLPGVLQALGGAPFAWG-GSMA 107 (175)
T ss_pred hCCCCEEEEECCCeeECHHHHHHHHHHHhCCCCcEEEecccccCCcCHHHHHHHHHHhHHHHHHHHhcCCCceec-ceee
Confidence 3789999999999999999998888876 344443333 222211100 0012345554 4699
Q ss_pred cCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 278 ISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 278 LSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
+.+.+++++-.. +.-. ...++|..|++.+.+.|.++...+
T Consensus 108 ~rr~~L~~~GG~-~~l~----~~ladD~~l~~~~~~~G~~v~~~~ 147 (175)
T PF13506_consen 108 FRREALEEIGGF-EALA----DYLADDYALGRRLRARGYRVVLSP 147 (175)
T ss_pred eEHHHHHHcccH-HHHh----hhhhHHHHHHHHHHHCCCeEEEcc
Confidence 999999886432 1111 234678899999999998876554
No 14
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=90.74 E-value=0.52 Score=44.82 Aligned_cols=85 Identities=19% Similarity=0.250 Sum_probs=58.4
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcC-CCCCCeEEeccCCCCCCCccccccccccCcccccCHHHHHHHHhhhcccccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKY-DPTEMVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALSNIHDDCIDR 296 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Lsky-D~~ep~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~ 296 (546)
...+|++++|+|+.+..+-|.+++..+ ++......|. ...|+++++.+.+++++... +.-
T Consensus 85 a~~d~i~~~D~D~~~~~~~l~~l~~~~~~~~~~~v~~~---------------~~~g~~~~~r~~~~~~~ggf-~~~--- 145 (196)
T cd02520 85 ARYDILVISDSDISVPPDYLRRMVAPLMDPGVGLVTCL---------------CAFGKSMALRREVLDAIGGF-EAF--- 145 (196)
T ss_pred CCCCEEEEECCCceEChhHHHHHHHHhhCCCCCeEEee---------------cccCceeeeEHHHHHhccCh-HHH---
Confidence 568999999999999888888888765 3332222222 24578899999999987543 211
Q ss_pred CCCCCCchHHHHHHHHHcCCCceecC
Q 009021 297 YPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 297 ~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
....++|..|..-+.+.|.+....+
T Consensus 146 -~~~~~eD~~l~~rl~~~G~~i~~~~ 170 (196)
T cd02520 146 -ADYLAEDYFLGKLIWRLGYRVVLSP 170 (196)
T ss_pred -hHHHHHHHHHHHHHHHcCCeEEEcc
Confidence 1123578888888888887665433
No 15
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=90.25 E-value=1.3 Score=42.73 Aligned_cols=99 Identities=15% Similarity=0.101 Sum_probs=56.4
Q ss_pred CCccEEEEEcCCccccHHHHHHHH---hcCCCCCCe-EEeccCCCCCCCc-----cc-------------cc--cccccC
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVL---SKYDPTEMV-YIGTPSESHSANT-----YF-------------SH--SMAFGG 273 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~L---skyD~~ep~-YIG~~se~~~~~~-----~~-------------g~--~fa~GG 273 (546)
.+.+|++++|+|+.+..+.|.+++ ..+...... .+|.......... .+ .. .....|
T Consensus 74 ~~~d~v~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (237)
T cd02526 74 NGADYVLLFDQDSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGYKLRIQKEGEEGLKEVDFLIT 153 (237)
T ss_pred CCCCEEEEECCCCCcCHhHHHHHHHHHHhhccCCCeEEEeeeEEcCCCCeeccceeccCccceecccccCCceEeeeeec
Confidence 378999999999999988888874 333222222 2232221110000 00 00 011236
Q ss_pred cccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceec
Q 009021 274 GGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVE 321 (546)
Q Consensus 274 aG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~ 321 (546)
+|.++++.+++++....+.. ...++|..+..-+.+.|..+...
T Consensus 154 ~~~~~rr~~~~~~ggfd~~~-----~~~~eD~d~~~r~~~~G~~~~~~ 196 (237)
T cd02526 154 SGSLISLEALEKVGGFDEDL-----FIDYVDTEWCLRARSKGYKIYVV 196 (237)
T ss_pred cceEEcHHHHHHhCCCCHHH-----cCccchHHHHHHHHHcCCcEEEE
Confidence 78899999999876532222 12356888877777778665443
No 16
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=89.74 E-value=0.63 Score=44.80 Aligned_cols=104 Identities=19% Similarity=0.277 Sum_probs=56.6
Q ss_pred HHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcC-CCCCCeEEeccCCCCCCC---------------------ccc
Q 009021 208 IVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKY-DPTEMVYIGTPSESHSAN---------------------TYF 265 (546)
Q Consensus 208 mv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~Lsky-D~~ep~YIG~~se~~~~~---------------------~~~ 265 (546)
.+.+.++. -..+|++++|||+.+.++-|.++++.+ ++.-...-|........+ ...
T Consensus 77 a~n~~~~~--~~~d~i~~lD~D~~~~p~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (228)
T PF13641_consen 77 ALNEALAA--ARGDYILFLDDDTVLDPDWLERLLAAFADPGVGAVGGPVFPDNDRNWLTRLQDLFFARWHLRFRSGRRAL 154 (228)
T ss_dssp HHHHHHHH-----SEEEEE-SSEEE-CHHHHHHHHHHHBSS--EEEEEEEETTCCCEEEE-TT--S-EETTTS-TT-B--
T ss_pred HHHHHHHh--cCCCEEEEECCCcEECHHHHHHHHHHHHhCCCCeEeeeEeecCCCCHHHHHHHHHHhhhhhhhhhhhccc
Confidence 34454443 248999999999999988888888777 444443333332111110 001
Q ss_pred cccccccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceec
Q 009021 266 SHSMAFGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVE 321 (546)
Q Consensus 266 g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~ 321 (546)
+. ....|+++++.+.+++++-.. +. ...++|..++.-+.+.|.++...
T Consensus 155 ~~-~~~~G~~~~~rr~~~~~~g~f-d~------~~~~eD~~l~~r~~~~G~~~~~~ 202 (228)
T PF13641_consen 155 GV-AFLSGSGMLFRRSALEEVGGF-DP------FILGEDFDLCLRLRAAGWRIVYA 202 (228)
T ss_dssp ---S-B--TEEEEEHHHHHHH-S---S------SSSSHHHHHHHHHHHTT--EEEE
T ss_pred ce-eeccCcEEEEEHHHHHHhCCC-CC------CCcccHHHHHHHHHHCCCcEEEE
Confidence 11 223578999999999998642 22 13468888888888888776543
No 17
>PF05679 CHGN: Chondroitin N-acetylgalactosaminyltransferase; InterPro: IPR008428 This family represents Chondroitin N-acetylgalactosaminyltransferase. Proteins have a type II transmembrane topology. The enzyme is involved in the biosynthetic initiation and elongation of chondroitin sulphate and is the key enzyme responsible for the selective chain assembly of chondroitin/dermatan sulphate on the linkage region tetrasaccharide common to various proteoglycans containing chondroitin/dermatan sulphate or heparin/heparan sulphate chains. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0032580 Golgi cisterna membrane
Probab=86.89 E-value=0.28 Score=54.80 Aligned_cols=69 Identities=22% Similarity=0.358 Sum_probs=43.0
Q ss_pred hhccccccCCCCCCchHHHHHHHHH-cCCCceecCC---CCCCCCC-CCCCCccCCCCCCCceecccCCCCCCCC
Q 009021 289 IHDDCIDRYPKLYGSDDRLHACITE-LGIPLTVEPG---FHQWDIR-GNAHGLLSSHPIAPFVSIHHVEAVDPFY 358 (546)
Q Consensus 289 ~~d~C~~~~~~~~~~D~~Lg~Cl~~-lGV~LT~~pg---fhq~d~~-gd~~g~~es~~~~PlvSlHH~~~~~~~f 358 (546)
+++.|..... ...+|.+||+|+.+ +||+||.+.. +|..... .+..+..+..-+...||+|+++....+|
T Consensus 1 hl~~C~~~~~-s~~~Dv~lGRCI~~~~gi~Ct~~~q~l~y~~~~~~~~~~~~~~~~~~~~~AiTlHPvk~p~~mY 74 (499)
T PF05679_consen 1 HLDWCLKNIY-SNHEDVELGRCIKKFTGISCTWSYQGLFYHNYELNKNDFIGDLKNKEFHNAITLHPVKSPAYMY 74 (499)
T ss_pred ChhHHhhhcC-CCCchhHHHHHHHHhcCCCeeecccceEEEeeccCCCcccccccchhhhcceeeccCCCHHHHH
Confidence 3567987531 23478899999987 8999998751 2222222 1222333333356679999999875444
No 18
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=86.47 E-value=2.2 Score=40.23 Aligned_cols=86 Identities=21% Similarity=0.268 Sum_probs=57.0
Q ss_pred CCCccEEEEEcCCccccHHHHHHHHhcCC-CCCCeEEeccCCCCCCCccccccccccCcccccCHHHHHHHHhhhccccc
Q 009021 217 LPNVRWFVLADDDTIFNSDNLVAVLSKYD-PTEMVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALSNIHDDCID 295 (546)
Q Consensus 217 ~p~~kWfv~~DDDTy~~~~nL~~~LskyD-~~ep~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~ 295 (546)
..+.+|++++|||..+..+.|.++++.+. +.-.++.|...... + .++|+++.+.+++++.. .+.-
T Consensus 77 ~~~~d~v~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~------~-----~~~~~~~~~~~~~~~g~-~~~~-- 142 (202)
T cd04185 77 ELGYDWIWLMDDDAIPDPDALEKLLAYADKDNPQFLAPLVLDPD------G-----SFVGVLISRRVVEKIGL-PDKE-- 142 (202)
T ss_pred ccCCCEEEEeCCCCCcChHHHHHHHHHHhcCCceEecceeEcCC------C-----ceEEEEEeHHHHHHhCC-CChh--
Confidence 35789999999999999888877777665 33334433332211 1 34678999999988742 2211
Q ss_pred cCCCCCCchHHHHHHHHHcCCCc
Q 009021 296 RYPKLYGSDDRLHACITELGIPL 318 (546)
Q Consensus 296 ~~~~~~~~D~~Lg~Cl~~lGV~L 318 (546)
+ ..+++|..+..=+.+.|..+
T Consensus 143 -~-~~~~eD~~~~~r~~~~G~~i 163 (202)
T cd04185 143 -F-FIWGDDTEYTLRASKAGPGI 163 (202)
T ss_pred -h-hccchHHHHHHHHHHcCCcE
Confidence 2 24677888877777777665
No 19
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=85.37 E-value=2.4 Score=42.68 Aligned_cols=99 Identities=18% Similarity=0.187 Sum_probs=53.8
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCC--CCeEEeccCCCCCCCcc----------------------ccccccccC
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPT--EMVYIGTPSESHSANTY----------------------FSHSMAFGG 273 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~--ep~YIG~~se~~~~~~~----------------------~g~~fa~GG 273 (546)
.+.+|++++|||+.+..+.|.++++.++.. .-..+|....+...... ....+ ..+
T Consensus 72 ~~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 150 (281)
T TIGR01556 72 RGVQGVLLLDQDSRPGNAFLAAQWKLLSAENGQACALGPRFFDRGTSRRLPAIHLDGLLLRQISLDGLTTPQKTSF-LIS 150 (281)
T ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHhcCCceEEECCeEEcCCCcccCCceeecccceeeecccccCCceeccE-EEc
Confidence 478999999999999977777766655432 22334432111000000 00011 235
Q ss_pred cccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 274 GGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 274 aG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
+|.++++.+++++-. +++- + ..+..|.++..=+.+.|..+...|
T Consensus 151 sg~li~~~~~~~iG~-fde~---~-fi~~~D~e~~~R~~~~G~~i~~~~ 194 (281)
T TIGR01556 151 SGCLITREVYQRLGM-MDEE---L-FIDHVDTEWSLRAQNYGIPLYIDP 194 (281)
T ss_pred CcceeeHHHHHHhCC-ccHh---h-cccchHHHHHHHHHHCCCEEEEeC
Confidence 678999999998754 2221 1 123456655433445676554433
No 20
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=85.07 E-value=3.9 Score=38.82 Aligned_cols=94 Identities=13% Similarity=0.136 Sum_probs=56.9
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCC-CCCCeEEeccCCCCCCC--------------------ccccccccccCccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYD-PTEMVYIGTPSESHSAN--------------------TYFSHSMAFGGGGI 276 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD-~~ep~YIG~~se~~~~~--------------------~~~g~~fa~GGaG~ 276 (546)
.+.+|++++|+|+.+..+-|.+++..+. +....+.|......... ...+..+..-|+++
T Consensus 81 ~~~d~i~~~D~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 160 (229)
T cd04192 81 AKGDWIVTTDADCVVPSNWLLTFVAFIQKEQIGLVAGPVIYFKGKSLLAKFQRLDWLSLLGLIAGSFGLGKPFMCNGANM 160 (229)
T ss_pred hcCCEEEEECCCcccCHHHHHHHHHHhhcCCCcEEeeeeeecCCccHHHHHHHHHHHHHHHHHhhHHHhcCccccccceE
Confidence 4689999999999998888877777554 34456666544321110 00122333457888
Q ss_pred ccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCC
Q 009021 277 AISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGI 316 (546)
Q Consensus 277 vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV 316 (546)
++++.+++++-..... .....+|..+..-+...|.
T Consensus 161 ~~rr~~~~~~ggf~~~-----~~~~~eD~~~~~~~~~~g~ 195 (229)
T cd04192 161 AYRKEAFFEVGGFEGN-----DHIASGDDELLLAKVASKY 195 (229)
T ss_pred EEEHHHHHHhcCCccc-----cccccCCHHHHHHHHHhCC
Confidence 9999999997653211 1234567766554444444
No 21
>PF13632 Glyco_trans_2_3: Glycosyl transferase family group 2
Probab=84.13 E-value=3.5 Score=38.84 Aligned_cols=94 Identities=15% Similarity=0.207 Sum_probs=57.5
Q ss_pred EEEEEcCCccccHHHHHHHHhcCC-CCCCeEEeccCCCCCC-------Cc--------------cccccccccCcccccC
Q 009021 222 WFVLADDDTIFNSDNLVAVLSKYD-PTEMVYIGTPSESHSA-------NT--------------YFSHSMAFGGGGIAIS 279 (546)
Q Consensus 222 Wfv~~DDDTy~~~~nL~~~LskyD-~~ep~YIG~~se~~~~-------~~--------------~~g~~fa~GGaG~vLS 279 (546)
|++++|+||-+..+-|.+++..++ ++-...-|.......+ .. ..+.....-|+|.+++
T Consensus 1 ~v~~~DaDt~~~~d~l~~~~~~~~~~~~~~vq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~r 80 (193)
T PF13632_consen 1 YVLFLDADTRLPPDFLERLVAALEDPKVDAVQGPIIFRNRGSLLTRLQDFEYAISHGLSRLSQSSLGRPLFLSGSGMLFR 80 (193)
T ss_pred CEEEEcCCCCCChHHHHHHHHHHhCCCceEEEccEEecCCCChhheeehhhhhhhhhhhHHHHHhcCCCccccCcceeee
Confidence 799999999999988888887776 2222222222211100 00 0122234568999999
Q ss_pred HHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceec
Q 009021 280 YPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVE 321 (546)
Q Consensus 280 r~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~ 321 (546)
+++++++....+. ...++|..++.=+.+.|.++...
T Consensus 81 ~~~l~~vg~~~~~------~~~~ED~~l~~~l~~~G~~~~~~ 116 (193)
T PF13632_consen 81 REALREVGGFDDP------FSIGEDMDLGFRLRRAGYRIVYV 116 (193)
T ss_pred HHHHHHhCccccc------ccccchHHHHHHHHHCCCEEEEe
Confidence 9999987532101 23568888887777777655433
No 22
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=83.77 E-value=2.3 Score=40.72 Aligned_cols=92 Identities=12% Similarity=0.115 Sum_probs=56.8
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCCC-eEEecc-CC-CCCCC----c------------------cccccccccC
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTEM-VYIGTP-SE-SHSAN----T------------------YFSHSMAFGG 273 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep-~YIG~~-se-~~~~~----~------------------~~g~~fa~GG 273 (546)
+.+|++++|+|+++..+-|.++++.+..+.. -.++.. .. ..... . .++..+ ..|
T Consensus 84 ~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g 162 (234)
T cd06421 84 TGDFVAILDADHVPTPDFLRRTLGYFLDDPKVALVQTPQFFYNPDPFDWLADGAPNEQELFYGVIQPGRDRWGAAF-CCG 162 (234)
T ss_pred CCCEEEEEccccCcCccHHHHHHHHHhcCCCeEEEecceEEecCCcchhHHHHHHHHHHHHHHHHHHHHhhcCCce-ecC
Confidence 7899999999999998888888887754322 233322 11 00000 0 001112 346
Q ss_pred cccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCc
Q 009021 274 GGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPL 318 (546)
Q Consensus 274 aG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~L 318 (546)
+|.++++.+++++... +. ..+++|..++.=+...|..+
T Consensus 163 ~~~~~r~~~~~~ig~~-~~------~~~~eD~~l~~r~~~~g~~i 200 (234)
T cd06421 163 SGAVVRREALDEIGGF-PT------DSVTEDLATSLRLHAKGWRS 200 (234)
T ss_pred ceeeEeHHHHHHhCCC-Cc------cceeccHHHHHHHHHcCceE
Confidence 7899999999987543 21 13467888877666666554
No 23
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=82.68 E-value=2 Score=40.33 Aligned_cols=69 Identities=13% Similarity=0.233 Sum_probs=45.8
Q ss_pred CCCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCC-CCc---------------------cccccccccCc
Q 009021 217 LPNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHS-ANT---------------------YFSHSMAFGGG 274 (546)
Q Consensus 217 ~p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~-~~~---------------------~~g~~fa~GGa 274 (546)
..+.+|++++|.|+.+.++.|.++++.+........|....... .+. .++..+..+|+
T Consensus 79 ~~~~d~v~~~DaD~~~~p~~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~ 158 (183)
T cd06438 79 ADDPDAVVVFDADNLVDPNALEELNARFAAGARVVQAYYNSKNPDDSWITRLYAFAFLVFNRLRPLGRSNLGLSCQLGGT 158 (183)
T ss_pred CCCCCEEEEEcCCCCCChhHHHHHHHHHhhCCCeeEEEEeeeCCccCHHHHHHHHHHHHHHHHHHHHHHHcCCCeeecCc
Confidence 35799999999999999988888888775544555555432111 110 01222345788
Q ss_pred ccccCHHHHHH
Q 009021 275 GIAISYPLAEA 285 (546)
Q Consensus 275 G~vLSr~Llkk 285 (546)
|+++++.++++
T Consensus 159 ~~~~rr~~l~~ 169 (183)
T cd06438 159 GMCFPWAVLRQ 169 (183)
T ss_pred hhhhHHHHHHh
Confidence 88888888887
No 24
>cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we
Probab=82.31 E-value=4.1 Score=39.52 Aligned_cols=96 Identities=11% Similarity=0.048 Sum_probs=56.9
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCC--CCCCC-------c----c----------ccccccccCc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSE--SHSAN-------T----Y----------FSHSMAFGGG 274 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se--~~~~~-------~----~----------~g~~fa~GGa 274 (546)
.+.+|++++|+|+.+..+-|.+++..+...+--.++.... +...+ . . .+..+...|+
T Consensus 86 a~~~~i~~~DaD~~~~~~~l~~~~~~~~~~~v~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 165 (232)
T cd06437 86 AKGEYVAIFDADFVPPPDFLQKTPPYFADPKLGFVQTRWGHINANYSLLTRVQAMSLDYHFTIEQVARSSTGLFFNFNGT 165 (232)
T ss_pred CCCCEEEEEcCCCCCChHHHHHhhhhhcCCCeEEEecceeeEcCCCchhhHhhhhhHHhhhhHhHhhHhhcCCeEEeccc
Confidence 4789999999999999888888665553333223333211 00000 0 0 0111123466
Q ss_pred ccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCcee
Q 009021 275 GIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTV 320 (546)
Q Consensus 275 G~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~ 320 (546)
+.++.+.+++++... +. ..+.+|..|..-+...|.++..
T Consensus 166 ~~~~rr~~~~~vgg~-~~------~~~~ED~~l~~rl~~~G~~~~~ 204 (232)
T cd06437 166 AGVWRKECIEDAGGW-NH------DTLTEDLDLSYRAQLKGWKFVY 204 (232)
T ss_pred hhhhhHHHHHHhCCC-CC------CcchhhHHHHHHHHHCCCeEEE
Confidence 677888888887532 22 1246888898888777766543
No 25
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate.
Probab=82.06 E-value=4 Score=39.30 Aligned_cols=28 Identities=25% Similarity=0.459 Sum_probs=25.5
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCC
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYD 245 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD 245 (546)
.+.+|++++|+|+.+..+.|.+++..++
T Consensus 76 a~~d~v~~lD~D~~~~~~~l~~l~~~~~ 103 (235)
T cd06434 76 VTTDIVVLLDSDTVWPPNALPEMLKPFE 103 (235)
T ss_pred hCCCEEEEECCCceeChhHHHHHHHhcc
Confidence 3789999999999999999999998886
No 26
>PF01755 Glyco_transf_25: Glycosyltransferase family 25 (LPS biosynthesis protein); InterPro: IPR002654 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 25 GT25 from CAZY comprises enzymes with only one known activity; as a lipopolysaccharide biosynthesis protein. These enzymes catalyse the transfer of various sugars onto the growing lipopolysaccharide chain during its biosynthesis [].; GO: 0009103 lipopolysaccharide biosynthetic process
Probab=80.69 E-value=3.2 Score=39.82 Aligned_cols=22 Identities=23% Similarity=0.167 Sum_probs=18.8
Q ss_pred ccccCcccccCHHHHHHHHhhh
Q 009021 269 MAFGGGGIAISYPLAEALSNIH 290 (546)
Q Consensus 269 fa~GGaG~vLSr~LlkkL~~~~ 290 (546)
+..|.+||+||+..|++|.+..
T Consensus 168 ~~~~t~aY~Is~~gA~kLL~~~ 189 (200)
T PF01755_consen 168 YPYGTCAYLISRKGARKLLEAS 189 (200)
T ss_pred CCCcceeeeeCHHHHHHHHHhC
Confidence 4568889999999999998764
No 27
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=80.14 E-value=4.9 Score=39.45 Aligned_cols=96 Identities=15% Similarity=0.192 Sum_probs=59.4
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCC--CCeEEeccCCCC--CCCc---------------------ccccccccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPT--EMVYIGTPSESH--SANT---------------------YFSHSMAFG 272 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~--ep~YIG~~se~~--~~~~---------------------~~g~~fa~G 272 (546)
...+|++++|+|+.+..+.|.++++.+... +-.++|...... .++. ..+.....+
T Consensus 83 a~gd~i~~~DaD~~~~~~~l~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (241)
T cd06427 83 ARGEYVVIYDAEDAPDPDQLKKAVAAFARLDDKLACVQAPLNYYNARENWLTRMFALEYAAWFDYLLPGLARLGLPIPLG 162 (241)
T ss_pred cCCCEEEEEcCCCCCChHHHHHHHHHHHhcCCCEEEEeCceEeeCCCccHHHHHHHHHHHHHHHHHHHHHHhcCCeeecC
Confidence 356999999999999998888888776432 223444332111 0000 011223468
Q ss_pred CcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCcee
Q 009021 273 GGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTV 320 (546)
Q Consensus 273 GaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~ 320 (546)
|+++++++.+++++... +. ..+.+|..|..=+.+.|..+..
T Consensus 163 g~~~~~rr~~~~~vgg~-~~------~~~~eD~~l~~rl~~~G~r~~~ 203 (241)
T cd06427 163 GTSNHFRTDVLRELGGW-DP------FNVTEDADLGLRLARAGYRTGV 203 (241)
T ss_pred CchHHhhHHHHHHcCCC-Cc------ccchhhHHHHHHHHHCCceEEE
Confidence 88999999999987553 11 1245787776666666666543
No 28
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=79.13 E-value=11 Score=36.32 Aligned_cols=96 Identities=16% Similarity=0.229 Sum_probs=58.5
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCCc-------cccc---------------cccccCcc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSANT-------YFSH---------------SMAFGGGG 275 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~~-------~~g~---------------~fa~GGaG 275 (546)
.+.+|++++|+|+.+..+.|.++++.+...+--.++.......... .+.+ .....|++
T Consensus 83 ~~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 162 (236)
T cd06435 83 PDAEIIAVIDADYQVEPDWLKRLVPIFDDPRVGFVQAPQDYRDGEESLFKRMCYAEYKGFFDIGMVSRNERNAIIQHGTM 162 (236)
T ss_pred CCCCEEEEEcCCCCcCHHHHHHHHHHhcCCCeeEEecCccccCCCccHHHHHHhHHHHHHHHHHhccccccCceEEecce
Confidence 4589999999999999988888887775323223433321111000 0000 01124667
Q ss_pred cccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCcee
Q 009021 276 IAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTV 320 (546)
Q Consensus 276 ~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~ 320 (546)
.++++.+++++-. ++++ .+.+|..++.=+.+.|.++..
T Consensus 163 ~~~rr~~~~~iGg-f~~~------~~~eD~dl~~r~~~~G~~~~~ 200 (236)
T cd06435 163 CLIRRSALDDVGG-WDEW------CITEDSELGLRMHEAGYIGVY 200 (236)
T ss_pred EEEEHHHHHHhCC-CCCc------cccchHHHHHHHHHCCcEEEE
Confidence 8999999998754 3333 246788887777777766543
No 29
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm
Probab=78.53 E-value=8.7 Score=35.25 Aligned_cols=91 Identities=18% Similarity=0.231 Sum_probs=57.5
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCCccccccccccCcccccCHHHHHHHHhhhccccccC
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALSNIHDDCIDRY 297 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~ 297 (546)
...+|++++|+|+.+..+.|.++++..++. ....|.......... .....|+++++.+..+.++.. ++...
T Consensus 78 a~g~~i~~lD~D~~~~~~~l~~~~~~~~~~-~~v~g~~~~~~~~~~----~~~~~~~~~~~~r~~~~~~gg-f~~~~--- 148 (182)
T cd06420 78 AKGDYLIFIDGDCIPHPDFIADHIELAEPG-VFLSGSRVLLNEKLT----ERGIRGCNMSFWKKDLLAVNG-FDEEF--- 148 (182)
T ss_pred hcCCEEEEEcCCcccCHHHHHHHHHHhCCC-cEEecceeecccccc----eeEeccceEEEEHHHHHHhCC-CCccc---
Confidence 467999999999999888788887776433 344455443221111 123456778888888886543 33332
Q ss_pred CCCC-CchHHHHHHHHHcCCCc
Q 009021 298 PKLY-GSDDRLHACITELGIPL 318 (546)
Q Consensus 298 ~~~~-~~D~~Lg~Cl~~lGV~L 318 (546)
..+ ++|..+..=+.+.|+.+
T Consensus 149 -~~~~~eD~~l~~r~~~~g~~~ 169 (182)
T cd06420 149 -TGWGGEDSELVARLLNSGIKF 169 (182)
T ss_pred -ccCCcchHHHHHHHHHcCCcE
Confidence 123 47878877777777544
No 30
>cd06436 GlcNAc-1-P_transferase N-acetyl-glucosamine transferase is involved in the synthesis of Poly-beta-1,6-N-acetyl-D-glucosamine. N-acetyl-glucosamine transferase is responsible for the synthesis of bacteria Poly-beta-1,6-N-acetyl-D-glucosamine (PGA). Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer that serves as an adhesion for the maintenance of biofilm structural stability in diverse eubacteria. N-acetyl-glucosamine transferase is the product of gene pgaC. Genetic analysis indicated that all four genes of the pgaABCD locus were required for the PGA production, pgaC being a glycosyltransferase.
Probab=78.47 E-value=3.3 Score=39.31 Aligned_cols=67 Identities=15% Similarity=0.218 Sum_probs=42.2
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccC--CCCCCCc---------------------cccccccccCcc
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPS--ESHSANT---------------------YFSHSMAFGGGG 275 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~s--e~~~~~~---------------------~~g~~fa~GGaG 275 (546)
+.+|++++|.|+.+.++.|.+++..+...+-..++... .+...+. ..+ ....||.|
T Consensus 89 ~~d~v~~~DaD~~~~~~~l~~~~~~~~~~~v~~v~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~-~~~~~G~~ 167 (191)
T cd06436 89 ERVIIAVIDADGRLDPNALEAVAPYFSDPRVAGTQSRVRMYNRHKNLLTILQDLEFFIIIAATQSLRALTG-TVGLGGNG 167 (191)
T ss_pred CccEEEEECCCCCcCHhHHHHHHHhhcCCceEEEeeeEEEecCCCCHHHHHHHHHHHHHHHHHHHHHHhcC-cEEECCee
Confidence 45799999999999988888866655322222222221 1111110 012 24579999
Q ss_pred cccCHHHHHHH
Q 009021 276 IAISYPLAEAL 286 (546)
Q Consensus 276 ~vLSr~LlkkL 286 (546)
.++++.+++++
T Consensus 168 ~~~r~~~l~~v 178 (191)
T cd06436 168 QFMRLSALDGL 178 (191)
T ss_pred EEEeHHHHHHh
Confidence 99999999998
No 31
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=78.27 E-value=5.1 Score=37.52 Aligned_cols=96 Identities=19% Similarity=0.148 Sum_probs=55.3
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCC--CCCCeEEeccCCCCCCC-----c----------ccc-ccccccCcccccC
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYD--PTEMVYIGTPSESHSAN-----T----------YFS-HSMAFGGGGIAIS 279 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD--~~ep~YIG~~se~~~~~-----~----------~~g-~~fa~GGaG~vLS 279 (546)
.+.+|++++|+|.++..+.|.+++..+. ++-.++.|......... . .+. ......|+++++.
T Consensus 79 a~gd~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 158 (201)
T cd04195 79 CTYDWVARMDTDDISLPDRFEKQLDFIEKNPEIDIVGGGVLEFDSDGNDIGKRRLPTSHDDILKFARRRSPFNHPTVMFR 158 (201)
T ss_pred cCCCEEEEeCCccccCcHHHHHHHHHHHhCCCeEEEcccEEEECCCCCeeccccCCCCHHHHHHHhccCCCCCChHHhhh
Confidence 4689999999999999888877777653 22233333321100000 0 000 0122345667888
Q ss_pred HHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCcee
Q 009021 280 YPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTV 320 (546)
Q Consensus 280 r~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~ 320 (546)
+.+++++... +. ..+++|..+...+...|.++..
T Consensus 159 r~~~~~~g~~-~~------~~~~eD~~~~~r~~~~g~~~~~ 192 (201)
T cd04195 159 KSKVLAVGGY-QD------LPLVEDYALWARMLANGARFAN 192 (201)
T ss_pred HHHHHHcCCc-CC------CCCchHHHHHHHHHHcCCceec
Confidence 8877775432 11 1356788888888777765443
No 32
>PRK11204 N-glycosyltransferase; Provisional
Probab=77.79 E-value=5.1 Score=43.01 Aligned_cols=98 Identities=13% Similarity=0.235 Sum_probs=62.1
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcC--CCCCCeEEeccCCCCCCC---------------------ccccccccccCc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKY--DPTEMVYIGTPSESHSAN---------------------TYFSHSMAFGGG 274 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Lsky--D~~ep~YIG~~se~~~~~---------------------~~~g~~fa~GGa 274 (546)
.+.+|+++.|+|+.+..+.|.++++.+ |++-....|.+......+ ..++..++.+|+
T Consensus 133 a~~d~i~~lDaD~~~~~d~L~~l~~~~~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~ 212 (420)
T PRK11204 133 ARSEYLVCIDGDALLDPDAAAYMVEHFLHNPRVGAVTGNPRIRNRSTLLGRIQVGEFSSIIGLIKRAQRVYGRVFTVSGV 212 (420)
T ss_pred cCCCEEEEECCCCCCChhHHHHHHHHHHhCCCeEEEECCceeccchhHHHHHHHHHHHHhhhHHHHHHHHhCCceEecce
Confidence 468999999999999998888888877 333223333322111000 001222345688
Q ss_pred ccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 275 GIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 275 G~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
+.++.+.+++++... +. ...++|..++.-+.+.|.++...|
T Consensus 213 ~~~~rr~~l~~vgg~-~~------~~~~ED~~l~~rl~~~G~~i~~~p 253 (420)
T PRK11204 213 ITAFRKSALHEVGYW-ST------DMITEDIDISWKLQLRGWDIRYEP 253 (420)
T ss_pred eeeeeHHHHHHhCCC-CC------CcccchHHHHHHHHHcCCeEEecc
Confidence 889999999886432 21 134688888888888887765544
No 33
>cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate.
Probab=77.11 E-value=14 Score=35.19 Aligned_cols=99 Identities=17% Similarity=0.129 Sum_probs=63.0
Q ss_pred CccEEEEEcCCccccHHHHHHHHhc-CCCCCCeEEeccCCCCCCC----cc----------------ccccccc-cCccc
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSK-YDPTEMVYIGTPSESHSAN----TY----------------FSHSMAF-GGGGI 276 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~Lsk-yD~~ep~YIG~~se~~~~~----~~----------------~g~~fa~-GGaG~ 276 (546)
..+|++++|+|..+.++.|.++++. .+....+.+|......... .. .+..+.. ..+..
T Consensus 82 ~gd~i~~ld~D~~~~~~~l~~l~~~~~~~~~~~v~g~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~g~~ 161 (211)
T cd04188 82 RGDYILFADADLATPFEELEKLEEALKTSGYDIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVRLLLGLGIKDTQCGFK 161 (211)
T ss_pred cCCEEEEEeCCCCCCHHHHHHHHHHHhccCCcEEEEEeeccCCcccccccHHHHHHHHHHHHHHHHHcCCCCcccccCce
Confidence 3599999999999999888888887 4566678888776433210 00 0111111 22347
Q ss_pred ccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecCC
Q 009021 277 AISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEPG 323 (546)
Q Consensus 277 vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~pg 323 (546)
++++.+++++..... ...|..|..|..-+.+.|.++...|-
T Consensus 162 ~~~r~~~~~~~~~~~------~~~~~~d~el~~r~~~~g~~~~~vpi 202 (211)
T cd04188 162 LFTRDAARRLFPRLH------LERWAFDVELLVLARRLGYPIEEVPV 202 (211)
T ss_pred eEcHHHHHHHHhhhh------ccceEeeHHHHHHHHHcCCeEEEcCc
Confidence 899999988763211 12356688766666677877666553
No 34
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=76.36 E-value=8.9 Score=40.86 Aligned_cols=99 Identities=18% Similarity=0.245 Sum_probs=62.3
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCC-CCCCeEEeccCCCCCCCcc------------c----------cccccccCc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYD-PTEMVYIGTPSESHSANTY------------F----------SHSMAFGGG 274 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD-~~ep~YIG~~se~~~~~~~------------~----------g~~fa~GGa 274 (546)
.+.+|+++.|+|+.+..+-|.++++.+. ++....-| .......... + +......|+
T Consensus 125 a~ge~i~~~DaD~~~~p~~L~~lv~~~~~~~v~~V~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~G~ 203 (373)
T TIGR03472 125 ARHDILVIADSDISVGPDYLRQVVAPLADPDVGLVTC-LYRGRPVPGFWSRLGAMGINHNFLPSVMVARALGRARFCFGA 203 (373)
T ss_pred ccCCEEEEECCCCCcChhHHHHHHHHhcCCCcceEec-cccCCCCCCHHHHHHHHHhhhhhhHHHHHHHhccCCccccCh
Confidence 5789999999999999999988888874 33333222 2221110000 0 101113467
Q ss_pred ccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 275 GIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 275 G~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
.+++.+++++++-... .-. ...++|..|+.=+.+.|.++...+
T Consensus 204 ~~a~RR~~l~~iGGf~-~~~----~~~~ED~~l~~~i~~~G~~v~~~~ 246 (373)
T TIGR03472 204 TMALRRATLEAIGGLA-ALA----HHLADDYWLGELVRALGLRVVLAP 246 (373)
T ss_pred hhheeHHHHHHcCChH-Hhc----ccchHHHHHHHHHHHcCCeEEecc
Confidence 7899999999876542 111 124578899999998887765444
No 35
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=75.28 E-value=12 Score=35.02 Aligned_cols=99 Identities=13% Similarity=0.152 Sum_probs=57.5
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCC--CCCCeEEeccCCCCC-C-------Cccc----cccccccCcccccCHHHH
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYD--PTEMVYIGTPSESHS-A-------NTYF----SHSMAFGGGGIAISYPLA 283 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD--~~ep~YIG~~se~~~-~-------~~~~----g~~fa~GGaG~vLSr~Ll 283 (546)
...+|++++|+|..+..+.|.++++.++ +.-.+..|....... . ...+ .....+.|++.++++.++
T Consensus 82 a~~d~i~~ld~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 161 (202)
T cd04184 82 ATGEFVALLDHDDELAPHALYEVVKALNEHPDADLIYSDEDKIDEGGKRSEPFFKPDWSPDLLLSQNYIGHLLVYRRSLV 161 (202)
T ss_pred hcCCEEEEECCCCcCChHHHHHHHHHHHhCCCCCEEEccHHhccCCCCEeccccCCCCCHHHhhhcCCccceEeEEHHHH
Confidence 3579999999999999888888887763 222333332221100 0 0000 112344566778899998
Q ss_pred HHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 284 EALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 284 kkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
+++... +.. + ..++|..|..=+.+.|.++...|
T Consensus 162 ~~iggf-~~~---~--~~~eD~~l~~rl~~~g~~~~~~~ 194 (202)
T cd04184 162 RQVGGF-REG---F--EGAQDYDLVLRVSEHTDRIAHIP 194 (202)
T ss_pred HHhCCC-CcC---c--ccchhHHHHHHHHhccceEEEcc
Confidence 887643 221 1 13567777665666676665544
No 36
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=72.66 E-value=7.6 Score=42.44 Aligned_cols=98 Identities=11% Similarity=0.179 Sum_probs=63.2
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcC--CCCCCeEEeccCCCCCCC---------------------ccccccccccCc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKY--DPTEMVYIGTPSESHSAN---------------------TYFSHSMAFGGG 274 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Lsky--D~~ep~YIG~~se~~~~~---------------------~~~g~~fa~GGa 274 (546)
.+.+++++.|+|+.+..+.|.++++.+ |++-...-|.+......+ ..+|-.++.+|+
T Consensus 154 a~~d~iv~lDAD~~~~~d~L~~lv~~~~~~~~~g~v~g~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~g~~~~~sG~ 233 (444)
T PRK14583 154 ARSEYLVCIDGDALLDKNAVPYLVAPLIANPRTGAVTGNPRIRTRSTLIGRVQVGEFSSIIGLIKRTQRVYGQVFTVSGV 233 (444)
T ss_pred CCCCEEEEECCCCCcCHHHHHHHHHHHHhCCCeEEEEccceecCCCcchhhHHHHHHHHHHHHHHHHHHHhCCceEecCc
Confidence 578999999999999998888888766 443333333322111100 011223456788
Q ss_pred ccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 275 GIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 275 G~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
+.++.+.+++++..... ...++|..++.-+...|-....+|
T Consensus 234 ~~~~rr~al~~vGg~~~-------~~i~ED~dl~~rl~~~G~~i~~~p 274 (444)
T PRK14583 234 VAAFRRRALADVGYWSP-------DMITEDIDISWKLQLKHWSVFFEP 274 (444)
T ss_pred eeEEEHHHHHHcCCCCC-------CcccccHHHHHHHHHcCCeEEEee
Confidence 88999999888643211 124688899999988887766554
No 37
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi,
Probab=72.13 E-value=19 Score=34.25 Aligned_cols=38 Identities=16% Similarity=0.177 Sum_probs=29.0
Q ss_pred CccEEEEEcCCccccHHHHHHHHhc-CCCCCCeEEeccC
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSK-YDPTEMVYIGTPS 256 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~Lsk-yD~~ep~YIG~~s 256 (546)
..+|++++|+|..+.++.|..++.. .+....+..|...
T Consensus 78 ~gd~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~~~ 116 (224)
T cd06442 78 RGDVIVVMDADLSHPPEYIPELLEAQLEGGADLVIGSRY 116 (224)
T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHhcCCCCEEEEeee
Confidence 3489999999999988888888876 4555566667543
No 38
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=72.09 E-value=16 Score=35.16 Aligned_cols=95 Identities=9% Similarity=0.099 Sum_probs=53.5
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCC-CCCCeEEeccCCCCCC---------------C--ccc-----cccccccCcc
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYD-PTEMVYIGTPSESHSA---------------N--TYF-----SHSMAFGGGG 275 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD-~~ep~YIG~~se~~~~---------------~--~~~-----g~~fa~GGaG 275 (546)
+.+|++++|||+.+...-|.++++.+. +......|........ . ..+ ...++..|++
T Consensus 81 ~~d~v~~lD~D~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (249)
T cd02525 81 RGDIIIRVDAHAVYPKDYILELVEALKRTGADNVGGPMETIGESKFQKAIAVAQSSPLGSGGSAYRGGAVKIGYVDTVHH 160 (249)
T ss_pred CCCEEEEECCCccCCHHHHHHHHHHHhcCCCCEEecceecCCCChHHHHHHHHhhchhccCCcccccccccccccccccc
Confidence 689999999999998877877776543 3333343433211000 0 000 0012345667
Q ss_pred cccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCce
Q 009021 276 IAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLT 319 (546)
Q Consensus 276 ~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT 319 (546)
.++++.+++++.. +++.. ..++|..+..=+.+.|..+.
T Consensus 161 ~~~~~~~~~~~g~-~~~~~-----~~~eD~~l~~r~~~~G~~~~ 198 (249)
T cd02525 161 GAYRREVFEKVGG-FDESL-----VRNEDAELNYRLRKAGYKIW 198 (249)
T ss_pred ceEEHHHHHHhCC-CCccc-----CccchhHHHHHHHHcCcEEE
Confidence 7889998888643 33221 23467766555555665554
No 39
>cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4) to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis.
Probab=70.88 E-value=3.8 Score=37.10 Aligned_cols=38 Identities=26% Similarity=0.454 Sum_probs=32.9
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCCccccccccccCcccccCHHHHHHHHhhhc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALSNIHD 291 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~~~~d 291 (546)
.+.++.++.|||..+..+ |.+||++|+.++++|.....
T Consensus 81 ~~~~~alIlEDDv~~~~~------------------------------------~~~~Y~vs~~~A~~ll~~~~ 118 (128)
T cd06532 81 SNLEYALILEDDAILDPD------------------------------------GTAGYLVSRKGAKKLLAALE 118 (128)
T ss_pred cCCCeEEEEccCcEECCC------------------------------------CceEEEeCHHHHHHHHHhCC
Confidence 567999999999998876 67799999999999987643
No 40
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=70.15 E-value=16 Score=34.72 Aligned_cols=91 Identities=18% Similarity=0.225 Sum_probs=54.5
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccC-CCCCCC------------ccccccccccCcccccCHHHHHH
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPS-ESHSAN------------TYFSHSMAFGGGGIAISYPLAEA 285 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~s-e~~~~~------------~~~g~~fa~GGaG~vLSr~Llkk 285 (546)
..+|++++|+|.++..+.|.+++..+..... .+|... ...... ........+++.|+++++.++++
T Consensus 72 ~~~~i~~~D~D~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~ 150 (221)
T cd02522 72 RGDWLLFLHADTRLPPDWDAAIIETLRADGA-VAGAFRLRFDDPGPRLRLLELGANLRSRLFGLPYGDQGLFIRRELFEE 150 (221)
T ss_pred cCCEEEEEcCCCCCChhHHHHHHHHhhcCCc-EEEEEEeeecCCccchhhhhhcccceecccCCCcCCceEEEEHHHHHH
Confidence 3799999999999998888887766544433 333321 111100 00011345677899999998887
Q ss_pred HHhhhccccccCCCCCCchHHHHHHHHHcCCC
Q 009021 286 LSNIHDDCIDRYPKLYGSDDRLHACITELGIP 317 (546)
Q Consensus 286 L~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~ 317 (546)
+-. +++. .+++|..+..=+.+.|..
T Consensus 151 ~G~-fd~~------~~~ED~d~~~r~~~~G~~ 175 (221)
T cd02522 151 LGG-FPEL------PLMEDVELVRRLRRRGRP 175 (221)
T ss_pred hCC-CCcc------ccccHHHHHHHHHhCCCE
Confidence 753 3322 256777766555566644
No 41
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=70.08 E-value=11 Score=40.25 Aligned_cols=97 Identities=19% Similarity=0.148 Sum_probs=58.3
Q ss_pred CCCccEEEEEcCCccccHHHHHHHHhcCCC-CCCeEEeccCCCCCC--Cc---------------------cc-cccccc
Q 009021 217 LPNVRWFVLADDDTIFNSDNLVAVLSKYDP-TEMVYIGTPSESHSA--NT---------------------YF-SHSMAF 271 (546)
Q Consensus 217 ~p~~kWfv~~DDDTy~~~~nL~~~LskyD~-~ep~YIG~~se~~~~--~~---------------------~~-g~~fa~ 271 (546)
.++.+|+++.|+|+.+.++.|.++++.+.. ...+.-|.+...... .. .. ....+
T Consensus 131 ~~~gd~llflDaD~~~~p~~l~~lv~~~~~~~~~~vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 209 (384)
T TIGR03469 131 APPADYLLLTDADIAHGPDNLARLVARARAEGLDLVSLMVRLRCESFWEKLLIPAFVFFFQKLYPFRWVNDPRRRTAAA- 209 (384)
T ss_pred CCCCCEEEEECCCCCCChhHHHHHHHHHHhCCCCEEEecccccCCCHHHHHHHHHHHHHHHHhcchhhhcCCCccceee-
Confidence 345899999999999998888887776532 223322222110000 00 00 00112
Q ss_pred cCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCce
Q 009021 272 GGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLT 319 (546)
Q Consensus 272 GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT 319 (546)
-|+++++++++.+++-...+.- ....+|..|++-+++.|.++.
T Consensus 210 ~G~~~lirr~~~~~vGGf~~~~-----~~~~ED~~L~~r~~~~G~~v~ 252 (384)
T TIGR03469 210 AGGCILIRREALERIGGIAAIR-----GALIDDCTLAAAVKRSGGRIW 252 (384)
T ss_pred cceEEEEEHHHHHHcCCHHHHh-----hCcccHHHHHHHHHHcCCcEE
Confidence 3667999999999986543211 124678899999998875544
No 42
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=69.96 E-value=22 Score=33.31 Aligned_cols=92 Identities=16% Similarity=0.215 Sum_probs=54.0
Q ss_pred CCCccEEEEEcCCccccHHHHHHHHhc-CCCCCCeEEeccCC----CCC--CCccc---c-----------ccccccCcc
Q 009021 217 LPNVRWFVLADDDTIFNSDNLVAVLSK-YDPTEMVYIGTPSE----SHS--ANTYF---S-----------HSMAFGGGG 275 (546)
Q Consensus 217 ~p~~kWfv~~DDDTy~~~~nL~~~Lsk-yD~~ep~YIG~~se----~~~--~~~~~---g-----------~~fa~GGaG 275 (546)
..+.+|++++|+|.++.++.|.++++. .......+++.... ... ....+ . ......|++
T Consensus 77 ~~~g~~v~~ld~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (214)
T cd04196 77 AADGDYVFFCDQDDIWLPDKLERLLKAFLKDDKPLLVYSDLELVDENGNPIGESFFEYQKIKPGTSFNNLLFQNVVTGCT 156 (214)
T ss_pred hCCCCEEEEECCCcccChhHHHHHHHHHhcCCCceEEecCcEEECCCCCCcccccccccccCCccCHHHHHHhCccCCce
Confidence 467999999999999998888888876 33333334443221 100 00000 0 012346888
Q ss_pred cccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHc
Q 009021 276 IAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITEL 314 (546)
Q Consensus 276 ~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~l 314 (546)
+++.+.+++++...... ..+..|..+...+...
T Consensus 157 ~~~r~~~~~~~~~~~~~------~~~~~D~~~~~~~~~~ 189 (214)
T cd04196 157 MAFNRELLELALPFPDA------DVIMHDWWLALLASAF 189 (214)
T ss_pred eeEEHHHHHhhcccccc------ccccchHHHHHHHHHc
Confidence 99999999887653211 0245677666666553
No 43
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=69.55 E-value=20 Score=32.86 Aligned_cols=95 Identities=16% Similarity=0.160 Sum_probs=55.6
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcC--CCCCCeEEeccCCCCCCC--c-----------cccccccccCcccccCHHH
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKY--DPTEMVYIGTPSESHSAN--T-----------YFSHSMAFGGGGIAISYPL 282 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Lsky--D~~ep~YIG~~se~~~~~--~-----------~~g~~fa~GGaG~vLSr~L 282 (546)
-+.+|++++|+|.++..+.+.++|..+ +++..+..|......... . .........|+|+++++.+
T Consensus 74 a~~~~v~~ld~D~~~~~~~~~~~~~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (202)
T cd06433 74 ATGDIIGFLNSDDTLLPGALLAVVAAFAEHPEVDVVYGDVLLVDENGRVIGRRRPPPFLDKFLLYGMPICHQATFFRRSL 153 (202)
T ss_pred cCCCEEEEeCCCcccCchHHHHHHHHHHhCCCccEEEeeeEEEcCCCCcccCCCCcchhhhHHhhcCcccCcceEEEHHH
Confidence 357999999999999988888777322 333445555433211000 0 0111334567789999999
Q ss_pred HHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCc
Q 009021 283 AEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPL 318 (546)
Q Consensus 283 lkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~L 318 (546)
++++.. ++. .+ .+++|..+..=+.+.|...
T Consensus 154 ~~~~~~-f~~---~~--~~~~D~~~~~r~~~~g~~~ 183 (202)
T cd06433 154 FEKYGG-FDE---SY--RIAADYDLLLRLLLAGKIF 183 (202)
T ss_pred HHHhCC-Cch---hh--CchhhHHHHHHHHHcCCce
Confidence 998754 221 11 2456776655555556554
No 44
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=69.31 E-value=12 Score=36.62 Aligned_cols=37 Identities=14% Similarity=0.254 Sum_probs=27.4
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCC-CeEEecc
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTE-MVYIGTP 255 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~e-p~YIG~~ 255 (546)
..+|++++|+|+++..+-|.++++.+...+ .+..|..
T Consensus 109 ~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~ 146 (251)
T cd06439 109 TGEIVVFTDANALLDPDALRLLVRHFADPSVGAVSGEL 146 (251)
T ss_pred CCCEEEEEccccCcCHHHHHHHHHHhcCCCccEEEeEE
Confidence 359999999999999888888888875333 3344433
No 45
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase
Probab=67.08 E-value=26 Score=34.42 Aligned_cols=99 Identities=14% Similarity=0.072 Sum_probs=56.7
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCC-CCCCeEEeccCCCCCCC------cc-------------ccccc-cccCccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYD-PTEMVYIGTPSESHSAN------TY-------------FSHSM-AFGGGGI 276 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD-~~ep~YIG~~se~~~~~------~~-------------~g~~f-a~GGaG~ 276 (546)
...+|++++|+|..+.++.|.+++..+. ..-.+.+|......... .. .+... ...|+-.
T Consensus 92 a~g~~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~r~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~d~~g~~~ 171 (243)
T PLN02726 92 ASGDFVVIMDADLSHHPKYLPSFIKKQRETGADIVTGTRYVKGGGVHGWDLRRKLTSRGANVLAQTLLWPGVSDLTGSFR 171 (243)
T ss_pred cCCCEEEEEcCCCCCCHHHHHHHHHHHHhcCCcEEEEccccCCCCcCCccHHHHHHHHHHHHHHHHHhCCCCCcCCCccc
Confidence 3578999999999999888878777653 34567777654321110 00 00011 1223345
Q ss_pred ccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 277 AISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 277 vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
++++.+++++....+. ..|..|.+|..=+...|.++...|
T Consensus 172 ~~rr~~~~~i~~~~~~------~~~~~~~el~~~~~~~g~~i~~vp 211 (243)
T PLN02726 172 LYKRSALEDLVSSVVS------KGYVFQMEIIVRASRKGYRIEEVP 211 (243)
T ss_pred ceeHHHHHHHHhhccC------CCcEEehHHHHHHHHcCCcEEEeC
Confidence 7799999988653221 123445544333345677766555
No 46
>cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified.
Probab=63.16 E-value=18 Score=34.66 Aligned_cols=38 Identities=13% Similarity=0.180 Sum_probs=26.7
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEecc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTP 255 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~ 255 (546)
...+|++++|+|+.+..+.|.+++..+.......+|..
T Consensus 83 a~gd~i~~lD~D~~~~~~~l~~~~~~~~~~~~~~v~~~ 120 (219)
T cd06913 83 SSGRYLCFLDSDDVMMPQRIRLQYEAALQHPNSIIGCQ 120 (219)
T ss_pred cCCCEEEEECCCccCChhHHHHHHHHHHhCCCcEEEEE
Confidence 46799999999999998777766655432333455654
No 47
>cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als
Probab=62.78 E-value=21 Score=30.39 Aligned_cols=53 Identities=15% Similarity=0.228 Sum_probs=36.6
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCC-CeEEeccCCCCCCCccccccccccCcccccCHHHHHHHHhh
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTE-MVYIGTPSESHSANTYFSHSMAFGGGGIAISYPLAEALSNI 289 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~e-p~YIG~~se~~~~~~~~g~~fa~GGaG~vLSr~LlkkL~~~ 289 (546)
+.+|++++|+|..+..+.+..++..+-..+ ...+ .|.++++++++.++++...
T Consensus 77 ~~d~v~~~d~D~~~~~~~~~~~~~~~~~~~~~~~v------------------~~~~~~~~~~~~~~~~~~~ 130 (156)
T cd00761 77 RGEYILFLDADDLLLPDWLERLVAELLADPEADAV------------------GGPGNLLFRRELLEEIGGF 130 (156)
T ss_pred cCCEEEEECCCCccCccHHHHHHHHHhcCCCceEE------------------eccchheeeHHHHHHhCCc
Confidence 799999999999998888877644332221 1111 1117889999999988654
No 48
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=59.38 E-value=21 Score=32.98 Aligned_cols=70 Identities=14% Similarity=0.088 Sum_probs=46.9
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCC--------------ccccccc-cccCcccccCHHHH
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSAN--------------TYFSHSM-AFGGGGIAISYPLA 283 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~--------------~~~g~~f-a~GGaG~vLSr~Ll 283 (546)
..+|++++|+|.....+-|.++++.++....+.+|......... ...+... ..+|+.+++++.++
T Consensus 80 ~~d~i~~~D~D~~~~~~~l~~l~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 159 (181)
T cd04187 80 RGDAVITMDADLQDPPELIPEMLAKWEEGYDVVYGVRKNRKESWLKRLTSKLFYRLINKLSGVDIPDNGGDFRLMDRKVV 159 (181)
T ss_pred CCCEEEEEeCCCCCCHHHHHHHHHHHhCCCcEEEEEecCCcchHHHHHHHHHHHHHHHHHcCCCCCCCCCCEEEEcHHHH
Confidence 35999999999999887777777776666677778766433110 0011111 23556678999999
Q ss_pred HHHHh
Q 009021 284 EALSN 288 (546)
Q Consensus 284 kkL~~ 288 (546)
+++..
T Consensus 160 ~~i~~ 164 (181)
T cd04187 160 DALLL 164 (181)
T ss_pred HHHHh
Confidence 98764
No 49
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=57.14 E-value=20 Score=32.82 Aligned_cols=38 Identities=21% Similarity=0.194 Sum_probs=29.7
Q ss_pred ccEEEEEcCCccccHHHHHHHHhc-CCCCCCeEEeccCC
Q 009021 220 VRWFVLADDDTIFNSDNLVAVLSK-YDPTEMVYIGTPSE 257 (546)
Q Consensus 220 ~kWfv~~DDDTy~~~~nL~~~Lsk-yD~~ep~YIG~~se 257 (546)
.+|++++|+|+.+..+-|.++++. ........+|....
T Consensus 80 gd~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~g~~~~ 118 (185)
T cd04179 80 GDIVVTMDADLQHPPEDIPKLLEKLLEGGADVVIGSRFV 118 (185)
T ss_pred CCEEEEEeCCCCCCHHHHHHHHHHHhccCCcEEEEEeec
Confidence 399999999999998888888886 44555677776553
No 50
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=56.50 E-value=29 Score=36.95 Aligned_cols=99 Identities=18% Similarity=0.198 Sum_probs=68.7
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCe-EEeccCCCC----------CCC--------------cccccccccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMV-YIGTPSESH----------SAN--------------TYFSHSMAFG 272 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~-YIG~~se~~----------~~~--------------~~~g~~fa~G 272 (546)
...+++++.|.|+....+-|.+++..++...-. +.|.+.-.. .+. ...|.....+
T Consensus 136 ~~~d~V~~~DaD~~~~~d~l~~~~~~f~~~~~~~v~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 215 (439)
T COG1215 136 AKGDVVVILDADTVPEPDALRELVSPFEDPPVGAVVGTPRIRNRPDPSNLLGRIQAIEYLSAFYFRLRAASKGGLISFLS 215 (439)
T ss_pred cCCCEEEEEcCCCCCChhHHHHHHhhhcCCCeeEEeCCceeeecCChhhhcchhcchhhhhhHHHhhhhhhhcCCeEEEc
Confidence 459999999999999999999999988654433 666652111 000 0112244578
Q ss_pred CcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecCC
Q 009021 273 GGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEPG 323 (546)
Q Consensus 273 GaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~pg 323 (546)
|++.++-+++++++......+ .++|..|+.=+...|......+.
T Consensus 216 G~~~~~rr~aL~~~g~~~~~~-------i~ED~~lt~~l~~~G~~~~~~~~ 259 (439)
T COG1215 216 GSSSAFRRSALEEVGGWLEDT-------ITEDADLTLRLHLRGYRVVYVPE 259 (439)
T ss_pred ceeeeEEHHHHHHhCCCCCCc-------eeccHHHHHHHHHCCCeEEEeec
Confidence 999999999999987433322 46788999888887876655443
No 51
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=55.72 E-value=17 Score=31.88 Aligned_cols=26 Identities=27% Similarity=0.406 Sum_probs=21.3
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcC
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKY 244 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~Lsky 244 (546)
+.+|++++|+|.++....|.+++..+
T Consensus 78 ~~~~i~~~D~D~~~~~~~l~~~~~~~ 103 (180)
T cd06423 78 KGDIVVVLDADTILEPDALKRLVVPF 103 (180)
T ss_pred CCCEEEEECCCCCcChHHHHHHHHHh
Confidence 78999999999999887777774443
No 52
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=54.63 E-value=30 Score=35.19 Aligned_cols=27 Identities=19% Similarity=0.174 Sum_probs=20.9
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcC
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKY 244 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Lsky 244 (546)
-..+|++++|+|+.+...-|.++|+.+
T Consensus 82 A~gd~i~fLD~D~~~~~~wL~~ll~~l 108 (299)
T cd02510 82 ATGDVLVFLDSHCEVNVGWLEPLLARI 108 (299)
T ss_pred ccCCEEEEEeCCcccCccHHHHHHHHH
Confidence 357999999999999866666666544
No 53
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=52.39 E-value=42 Score=34.57 Aligned_cols=98 Identities=19% Similarity=0.245 Sum_probs=59.3
Q ss_pred EEEEEcCCccccHHHHHHHHhcCCCCC-CeEEeccCCCCC--------C---------------Ccc-c---c--cccc-
Q 009021 222 WFVLADDDTIFNSDNLVAVLSKYDPTE-MVYIGTPSESHS--------A---------------NTY-F---S--HSMA- 270 (546)
Q Consensus 222 Wfv~~DDDTy~~~~nL~~~LskyD~~e-p~YIG~~se~~~--------~---------------~~~-~---g--~~fa- 270 (546)
|+++.++||++..+.|.++|+..+... ...+|....+.. . ... . . ...+
T Consensus 87 ~~l~LN~D~~~~~~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (305)
T COG1216 87 YVLLLNPDTVVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRGGESDGLTGGWRASPLLEIAPDLSSYLEVVAS 166 (305)
T ss_pred EEEEEcCCeeeChhHHHHHHHHHHhCCCCeEeeeeEecCCCCcchheeccccccccccceecccccccccccchhhhhhh
Confidence 999999999998777766665543322 222222221110 0 000 0 0 0112
Q ss_pred ccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecCCC
Q 009021 271 FGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEPGF 324 (546)
Q Consensus 271 ~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~pgf 324 (546)
.-|+.+++++.+++++-. +|+ +| ..+.+|..++.=+.++|.++--.|..
T Consensus 167 ~~G~~~li~~~~~~~vG~-~de---~~-F~y~eD~D~~~R~~~~G~~i~~~p~a 215 (305)
T COG1216 167 LSGACLLIRREAFEKVGG-FDE---RF-FIYYEDVDLCLRARKAGYKIYYVPDA 215 (305)
T ss_pred cceeeeEEcHHHHHHhCC-CCc---cc-ceeehHHHHHHHHHHcCCeEEEeecc
Confidence 467889999999999876 443 22 35678888888888899877655543
No 54
>PTZ00260 dolichyl-phosphate beta-glucosyltransferase; Provisional
Probab=49.73 E-value=65 Score=34.04 Aligned_cols=99 Identities=10% Similarity=0.141 Sum_probs=59.3
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcC----CCCCCeEEeccCCCCCC----Cc-----------------ccccccccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKY----DPTEMVYIGTPSESHSA----NT-----------------YFSHSMAFG 272 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Lsky----D~~ep~YIG~~se~~~~----~~-----------------~~g~~fa~G 272 (546)
...+|++++|.|+...++.+.+++..+ ++.-.+.+|........ .. ..+..+..-
T Consensus 161 a~gd~I~~~DaD~~~~~~~l~~l~~~l~~~~~~~~dvV~GsR~~~~~~~~~~~~~~~r~~~~~~~~~l~~~~~~~~i~D~ 240 (333)
T PTZ00260 161 SRGKYILMVDADGATDIDDFDKLEDIMLKIEQNGLGIVFGSRNHLVDSDVVAKRKWYRNILMYGFHFIVNTICGTNLKDT 240 (333)
T ss_pred ccCCEEEEEeCCCCCCHHHHHHHHHHHHHhhccCCceEEeeccccccCcccccCcHHHHHHHHHHHHHHHHHcCCCcccC
Confidence 356999999999998876655555443 34456888987532110 00 012233334
Q ss_pred Cccc-ccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 273 GGGI-AISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 273 GaG~-vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
..|+ ++++.+++.+.+.. -. ..|.-|.++-..+...|.++...|
T Consensus 241 ~~Gfk~~~r~~~~~i~~~~--~~----~~~~fd~Ell~~a~~~g~~I~EvP 285 (333)
T PTZ00260 241 QCGFKLFTRETARIIFPSL--HL----ERWAFDIEIVMIAQKLNLPIAEVP 285 (333)
T ss_pred CCCeEEEeHHHHHHHhhhc--cc----cCccchHHHHHHHHHcCCCEEEEc
Confidence 4564 78999998875431 11 134567766666667787766555
No 55
>cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane.
Probab=48.20 E-value=53 Score=33.38 Aligned_cols=103 Identities=16% Similarity=0.232 Sum_probs=59.8
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcC--CCCCCeEEeccCCCCCCCcc----------------------c-ccccccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKY--DPTEMVYIGTPSESHSANTY----------------------F-SHSMAFG 272 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Lsky--D~~ep~YIG~~se~~~~~~~----------------------~-g~~fa~G 272 (546)
.+.++++++|.|+.+.++.|.+++..+ ||+-- .+..+......... | +....+.
T Consensus 94 ~~~~~i~~~DaD~~~~p~~l~~~v~~~~~~~~vg-~vq~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (254)
T cd04191 94 SRYDYMVVLDADSLMSGDTIVRLVRRMEANPRAG-IIQTAPKLIGAETLFARLQQFANRLYGPVFGRGLAAWQGGEGNYW 172 (254)
T ss_pred CCCCEEEEEeCCCCCCHHHHHHHHHHHHhCCCEE-EEeCCceeECCCCHHHHHHHHHHHHHHHHHHHHHHHhcCCccCcc
Confidence 578999999999999999999998876 44311 11111110000000 0 0011234
Q ss_pred CcccccCHHHHHHHHhhh--ccccccC-CCCCCchHHHHHHHHHcCCCceecC
Q 009021 273 GGGIAISYPLAEALSNIH--DDCIDRY-PKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 273 GaG~vLSr~LlkkL~~~~--d~C~~~~-~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
|+++++.+.++.++.... .... .+ .....+|..++.-+...|-.+.-.|
T Consensus 173 G~~~~~Rr~al~~~~~~~~i~g~g-~~~~~~l~eD~~l~~~~~~~G~ri~~~~ 224 (254)
T cd04191 173 GHNAIIRVAAFMEHCALPVLPGRP-PFGGHILSHDFVEAALMRRAGWEVRLAP 224 (254)
T ss_pred ceEEEEEHHHHHHhcCCccccCCC-CCCCCeecHHHHHHHHHHHcCCEEEEcc
Confidence 788999999888754321 1110 11 1123578899999988886665444
No 56
>PLN03181 glycosyltransferase; Provisional
Probab=47.57 E-value=75 Score=35.15 Aligned_cols=45 Identities=20% Similarity=0.347 Sum_probs=30.6
Q ss_pred CCCchhhHHHHHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcCC
Q 009021 200 PSGLRISRIVIECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKYD 245 (546)
Q Consensus 200 ~~a~ri~rmv~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~LskyD 245 (546)
+..|.-.+++++++. ..|+++||..+|.|++|-=.++.--|.+|+
T Consensus 180 p~~WaKipalRaAM~-a~PeAEWfWWLDsDALIMNp~~sLPl~ry~ 224 (453)
T PLN03181 180 NSYWAKLPVVRAAML-AHPEAEWIWWVDSDAVFTDMDFKLPLHRYR 224 (453)
T ss_pred chhhhHHHHHHHHHH-HCCCceEEEEecCCceeecCCCCCCHhhcC
Confidence 456654477777764 489999999999999886332221255554
No 57
>PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional
Probab=47.19 E-value=38 Score=35.57 Aligned_cols=71 Identities=13% Similarity=0.039 Sum_probs=47.7
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCC--------------ccccccccccCccc-ccCHHH
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSAN--------------TYFSHSMAFGGGGI-AISYPL 282 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~--------------~~~g~~fa~GGaG~-vLSr~L 282 (546)
.+.+|++++|+|.-.+++.+.++++.....-++..|......... ...+..+...++|| ++++.+
T Consensus 89 A~gd~vv~~DaD~q~~p~~i~~l~~~~~~~~DvV~~~r~~~~~~~~r~~~s~~~~~l~~~~~g~~~~d~~~gfr~~~r~~ 168 (325)
T PRK10714 89 VTGDLIITLDADLQNPPEEIPRLVAKADEGYDVVGTVRQNRQDSWFRKTASKMINRLIQRTTGKAMGDYGCMLRAYRRHI 168 (325)
T ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHhhCCEEEEEEcCCCCcHHHHHHHHHHHHHHHHHcCCCCCCCCcCeEEEcHHH
Confidence 367999999999999998888888776433345555443221100 01133456677788 789999
Q ss_pred HHHHHh
Q 009021 283 AEALSN 288 (546)
Q Consensus 283 lkkL~~ 288 (546)
++++..
T Consensus 169 ~~~l~~ 174 (325)
T PRK10714 169 VDAMLH 174 (325)
T ss_pred HHHHHH
Confidence 999864
No 58
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=45.94 E-value=67 Score=36.29 Aligned_cols=101 Identities=14% Similarity=0.054 Sum_probs=59.2
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCC-----CeEEeccCCCCC----CC--------------ccccccccccCc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTE-----MVYIGTPSESHS----AN--------------TYFSHSMAFGGG 274 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~e-----p~YIG~~se~~~----~~--------------~~~g~~fa~GGa 274 (546)
..++++++.|.|+.+.++.|..+-. ..++. +++.+....... .. ...|.....+|.
T Consensus 157 ~~~d~vvi~DAD~~v~Pd~Lr~~~~-~~~~~~~VQ~pv~~~~~~~~~~~ag~y~~ef~~~~~~~l~~r~~LG~~~~~~Gt 235 (504)
T PRK14716 157 IRFAIIVLHDAEDVIHPLELRLYNY-LLPRHDFVQLPVFSLPRDWGEWVAGTYMDEFAESHLKDLPVREALGGLIPSAGV 235 (504)
T ss_pred CCcCEEEEEcCCCCcCccHHHHHHh-hcCCCCEEecceeccCCchhHHHHHHHHHHHHHHHHHHHHHHHhcCCccccCCe
Confidence 3579999999999999888875533 22222 222221111000 00 001222345699
Q ss_pred ccccCHHHHHHHHhhhccccccCC-CCCCchHHHHHHHHHcCCCceec
Q 009021 275 GIAISYPLAEALSNIHDDCIDRYP-KLYGSDDRLHACITELGIPLTVE 321 (546)
Q Consensus 275 G~vLSr~LlkkL~~~~d~C~~~~~-~~~~~D~~Lg~Cl~~lGV~LT~~ 321 (546)
|+++++.+++++....... .+. ....+|..|+.-+...|.+....
T Consensus 236 g~afRR~aLe~l~~~~GG~--~fd~~sLTED~dLglRL~~~G~rv~y~ 281 (504)
T PRK14716 236 GTAFSRRALERLAAERGGQ--PFDSDSLTEDYDIGLRLKRAGFRQIFV 281 (504)
T ss_pred eEEeEHHHHHHHHhhcCCC--CCCCCCcchHHHHHHHHHHCCCEEEEe
Confidence 9999999999984321110 010 12468999999999888776543
No 59
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=40.74 E-value=14 Score=32.54 Aligned_cols=38 Identities=24% Similarity=0.371 Sum_probs=22.2
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCC-CCCeEEeccC
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDP-TEMVYIGTPS 256 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~-~ep~YIG~~s 256 (546)
..+|++++|||+++....|.++++.++. .....+|...
T Consensus 78 ~~~~i~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~ 116 (169)
T PF00535_consen 78 KGEYILFLDDDDIISPDWLEELVEALEKNPPDVVIGSVI 116 (169)
T ss_dssp -SSEEEEEETTEEE-TTHHHHHHHHHHHCTTEEEEEEEE
T ss_pred ceeEEEEeCCCceEcHHHHHHHHHHHHhCCCcEEEEEEE
Confidence 3449999999999996555444444333 2334555443
No 60
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.
Probab=38.05 E-value=81 Score=37.02 Aligned_cols=94 Identities=13% Similarity=0.185 Sum_probs=58.9
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCC-eEEeccCCCC--C---CC----------------------ccccccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEM-VYIGTPSESH--S---AN----------------------TYFSHSM 269 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep-~YIG~~se~~--~---~~----------------------~~~g~~f 269 (546)
.+.+|+++.|.|+....+-|.+++..+..+.. -.++.+.... . .+ ..++-.+
T Consensus 227 a~gd~Il~lDAD~v~~pd~L~~~v~~f~~dp~v~~Vqtp~~f~~p~~~~~nl~~~~~~~~e~~~f~~~i~~g~~~~~~~~ 306 (713)
T TIGR03030 227 TDGELILIFDADHVPTRDFLQRTVGWFVEDPKLFLVQTPHFFVSPDPIERNLGTFRRMPNENELFYGLIQDGNDFWNAAF 306 (713)
T ss_pred cCCCEEEEECCCCCcChhHHHHHHHHHHhCCCEEEEeCCeeccCCCHHhhhhHHHHHhhhHHHHHHHHHHHHHhhhCCee
Confidence 45799999999999999888888877632222 2333221100 0 00 0011122
Q ss_pred cccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCce
Q 009021 270 AFGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLT 319 (546)
Q Consensus 270 a~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT 319 (546)
+.|++.++.+++++++.....+ ...+|..++..+.+.|-+..
T Consensus 307 -~~Gs~~~iRR~al~~iGGf~~~-------~vtED~~l~~rL~~~G~~~~ 348 (713)
T TIGR03030 307 -FCGSAAVLRREALDEIGGIAGE-------TVTEDAETALKLHRRGWNSA 348 (713)
T ss_pred -ecCceeEEEHHHHHHcCCCCCC-------CcCcHHHHHHHHHHcCCeEE
Confidence 4578899999999987543211 24689999999988886643
No 61
>PF05637 Glyco_transf_34: galactosyl transferase GMA12/MNN10 family; InterPro: IPR008630 This family contains a number of glycosyltransferase enzymes that contain a DXD motif. This family includes a number of Caenorhabditis elegans homologues where the DXD is replaced by DXH. Some members of this family are included in glycosyltransferase family 34.; GO: 0016758 transferase activity, transferring hexosyl groups, 0016021 integral to membrane; PDB: 2P72_B 2P73_A 2P6W_A.
Probab=35.87 E-value=34 Score=34.58 Aligned_cols=31 Identities=13% Similarity=0.297 Sum_probs=21.5
Q ss_pred CchhhHHHHHHHHhcCCCccEEEEEcCCcccc
Q 009021 202 GLRISRIVIECFRLGLPNVRWFVLADDDTIFN 233 (546)
Q Consensus 202 a~ri~rmv~e~~~~~~p~~kWfv~~DDDTy~~ 233 (546)
.|.-.+++++++. ..|+++|++.+|.|++|.
T Consensus 60 ~W~K~~~lr~~m~-~~P~~~wv~~lD~Dali~ 90 (239)
T PF05637_consen 60 SWAKIPALRAAMK-KYPEAEWVWWLDSDALIM 90 (239)
T ss_dssp HHTHHHHHHHHHH-H-TT-SEEEEE-TTEEE-
T ss_pred hhHHHHHHHHHHH-hCCCCCEEEEEcCCeEEE
Confidence 4644477888775 479999999999999987
No 62
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=31.52 E-value=56 Score=25.11 Aligned_cols=29 Identities=14% Similarity=0.036 Sum_probs=23.1
Q ss_pred ccchhHHHHHHHHHHHHhheeEEeecccc
Q 009021 39 RIGTAVVMVAALLLSTTVWLSVVFSATTC 67 (546)
Q Consensus 39 ~~~~~~~~~~~~~~~~~~w~~~~~~~~~~ 67 (546)
++-.+++++.+|||+++.+..|.-.++-+
T Consensus 6 ~A~~~~i~i~~lL~~~TgyaiYtaFGppS 34 (46)
T PRK13183 6 PALSLAITILAILLALTGFGIYTAFGPPS 34 (46)
T ss_pred hhHHHHHHHHHHHHHHhhheeeeccCCcc
Confidence 34456678889999999999999887644
No 63
>CHL00020 psbN photosystem II protein N
Probab=30.66 E-value=60 Score=24.61 Aligned_cols=28 Identities=14% Similarity=0.129 Sum_probs=22.8
Q ss_pred cchhHHHHHHHHHHHHhheeEEeecccc
Q 009021 40 IGTAVVMVAALLLSTTVWLSVVFSATTC 67 (546)
Q Consensus 40 ~~~~~~~~~~~~~~~~~w~~~~~~~~~~ 67 (546)
+-.+.+++.+||++++.+.+|.-.++-+
T Consensus 4 A~~~~i~i~~ll~~~Tgy~iYtaFGppS 31 (43)
T CHL00020 4 ATLVAIFISGLLVSFTGYALYTAFGQPS 31 (43)
T ss_pred hhhHHHHHHHHHHHhhheeeeeccCCch
Confidence 4456778889999999999999887644
No 64
>KOG2246 consensus Galactosyltransferases [Carbohydrate transport and metabolism]
Probab=29.69 E-value=32 Score=37.25 Aligned_cols=45 Identities=33% Similarity=0.468 Sum_probs=39.5
Q ss_pred ccCCCCCcEEEE-EecCcccHHHHHHHHHHhhhccCCcEEEEecCC
Q 009021 121 QEELSLKHIVFG-IAGSSHLWKRRKEFVRLWWLPNLMRGHVWLEDN 165 (546)
Q Consensus 121 ~~~~~~~~Ivf~-I~Ts~~~~~tR~~~ik~tW~~~~~r~~vFsD~~ 165 (546)
...+++.++++| |.++...|..|..+|.-||....+++.++++..
T Consensus 66 ~~~~~i~~~~~g~~~~s~~~~l~r~~~v~cwv~t~~~~~~~~~~~v 111 (364)
T KOG2246|consen 66 SLTTDILHLVFGIIASSIALWLSRSGRVLCWVLTSPMRHVTRADAV 111 (364)
T ss_pred ccccchhhhccCCccccchhccCCCceEEEEEEecCcCceeehhhh
Confidence 347889999999 889999999999999999998888988998864
No 65
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=27.64 E-value=1.8e+02 Score=34.46 Aligned_cols=101 Identities=14% Similarity=0.079 Sum_probs=59.7
Q ss_pred CccEEEEEcCCccccHHHHHHHHhcCCCCCCeEEeccCCCCCCC----------c--------------cccccccccCc
Q 009021 219 NVRWFVLADDDTIFNSDNLVAVLSKYDPTEMVYIGTPSESHSAN----------T--------------YFSHSMAFGGG 274 (546)
Q Consensus 219 ~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~YIG~~se~~~~~----------~--------------~~g~~fa~GGa 274 (546)
..+-+++.|.|+.+-++.|. +++.+.......-+ +......+ . ..+.....+|.
T Consensus 155 ~~~vvvi~DAD~~v~pd~L~-~~~~l~~~~~~VQ~-p~~p~~~~~~~~~~~~~~~EFa~~~~~~~~~~~~lgg~~~l~G~ 232 (727)
T PRK11234 155 AFAGFILHDAEDVISPMELR-LFNYLVERKDLIQI-PVYPFEREWTHFTSGTYIDEFAELHGKDVPVREALAGQVPSAGV 232 (727)
T ss_pred cccEEEEEcCCCCCChhHHH-HHHhhcCCCCeEee-cccCCCccHHHHHHHHHHHHHHHHhhhhhHHHHHcCCCcccCCc
Confidence 45678999999999999997 44444332222111 11111100 0 01223467899
Q ss_pred ccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCceecC
Q 009021 275 GIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLTVEP 322 (546)
Q Consensus 275 G~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT~~p 322 (546)
|+++||.+++.|.+.-..+.- ....-++|..||.=+...|....-.+
T Consensus 233 ~~af~Rr~l~al~~~ggg~~~-~~~~lTED~dlg~rL~~~G~~v~f~~ 279 (727)
T PRK11234 233 GTCFSRRAVTALLEDGDGIAF-DVQSLTEDYDIGFRLKEKGMREIFVR 279 (727)
T ss_pred eEEEecccHHHHHHhcCCCCc-CCCcchHHHHHHHHHHHCCCEEEEcc
Confidence 999998877766654322210 01224689999999999887765444
No 66
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=27.13 E-value=1.2e+02 Score=33.18 Aligned_cols=94 Identities=14% Similarity=0.222 Sum_probs=54.4
Q ss_pred CCccEEEEEcCCccccHHHHHHHHhcCCCCCCe--EEeccCCCCC---C----------C-ccc----------------
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLSKYDPTEMV--YIGTPSESHS---A----------N-TYF---------------- 265 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~LskyD~~ep~--YIG~~se~~~---~----------~-~~~---------------- 265 (546)
.+.+|++++|+|+.+..+.|.++++.+..++.+ .-|....... . . ..+
T Consensus 130 s~g~~v~~~DaD~~~~~d~L~~l~~~f~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~l~~r~~~s~~ 209 (439)
T TIGR03111 130 SIGKYIIHIDSDGKLHKDAIKNMVTRFENNPDIHAMTGVILTDKELIEKTKGRFLKLIRRCEYFEYAQAFLAGRNFESQV 209 (439)
T ss_pred ccCCEEEEECCCCCcChHHHHHHHHHHHhCCCeEEEEeEEecCchhhhhhcchhhhHhHHhHHHHHHHHHHhhhHHHHhc
Confidence 457899999999999988888888877433322 2232211100 0 0 000
Q ss_pred cccccccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHH-cCCCc
Q 009021 266 SHSMAFGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITE-LGIPL 318 (546)
Q Consensus 266 g~~fa~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~-lGV~L 318 (546)
+..++.+|++.++.+++++++... +. ...++|..++.=+.. .|-..
T Consensus 210 ~~~~~~sGa~~~~Rr~~l~~vggf-~~------~~i~ED~~l~~rl~~~~g~kv 256 (439)
T TIGR03111 210 NSLFTLSGAFSAFRRETILKTQLY-NS------ETVGEDTDMTFQIRELLDGKV 256 (439)
T ss_pred CCeEEEccHHHhhhHHHHHHhCCC-CC------CCcCccHHHHHHHHHhcCCeE
Confidence 112344677888999988876432 21 124788887765543 34443
No 67
>PF13704 Glyco_tranf_2_4: Glycosyl transferase family 2
Probab=23.47 E-value=66 Score=26.91 Aligned_cols=25 Identities=20% Similarity=0.403 Sum_probs=19.5
Q ss_pred CCCccEEEEEcCCccccHH----HHHHHH
Q 009021 217 LPNVRWFVLADDDTIFNSD----NLVAVL 241 (546)
Q Consensus 217 ~p~~kWfv~~DDDTy~~~~----nL~~~L 241 (546)
..+.+|++++|-|-|+..+ +|.++|
T Consensus 69 ~~~~dWvl~~D~DEfl~~~~~~~~l~~~L 97 (97)
T PF13704_consen 69 AFDADWVLFLDADEFLVPPPGRRSLRDFL 97 (97)
T ss_pred CCCCCEEEEEeeeEEEecCCCCCCHHHhC
Confidence 4589999999999999863 355554
No 68
>KOG4748 consensus Subunit of Golgi mannosyltransferase complex [Carbohydrate transport and metabolism; Cell wall/membrane/envelope biogenesis]
Probab=22.25 E-value=89 Score=33.91 Aligned_cols=35 Identities=14% Similarity=0.264 Sum_probs=28.7
Q ss_pred CCCCCchhhHHHHHHHHhcCCCccEEEEEcCCcccc
Q 009021 198 GHPSGLRISRIVIECFRLGLPNVRWFVLADDDTIFN 233 (546)
Q Consensus 198 G~~~a~ri~rmv~e~~~~~~p~~kWfv~~DDDTy~~ 233 (546)
+.+.-|...+|++++.+. +|+++|+=.+|-|+.+-
T Consensus 155 e~~~~W~KiP~Ir~tM~k-yP~AeWIWWlD~DAlim 189 (364)
T KOG4748|consen 155 ELPGVWAKLPAIRQTMLK-YPDAEWIWWLDQDALIM 189 (364)
T ss_pred cccchhHHhHHHHHHHHH-CCCCcEEEEecccchhh
Confidence 345667666999999864 89999999999999775
No 69
>PF10111 Glyco_tranf_2_2: Glycosyltransferase like family 2; InterPro: IPR019290 This conserved domain is found in a set of prokaryotic proteins including putative glucosyltransferases, which are involved in bacterial capsule biosynthesis [, ].
Probab=21.89 E-value=3.9e+02 Score=27.19 Aligned_cols=97 Identities=15% Similarity=0.185 Sum_probs=57.5
Q ss_pred CCccEEEEEcCCccccHHHHHHHHh---cCCCCC-CeEEeccC-CCCCCC------------------------cccccc
Q 009021 218 PNVRWFVLADDDTIFNSDNLVAVLS---KYDPTE-MVYIGTPS-ESHSAN------------------------TYFSHS 268 (546)
Q Consensus 218 p~~kWfv~~DDDTy~~~~nL~~~Ls---kyD~~e-p~YIG~~s-e~~~~~------------------------~~~g~~ 268 (546)
-..+|++++|.|.++..+.|.+++. ..+.+. ..+++... .+.... ..++ .
T Consensus 87 A~~d~l~flD~D~i~~~~~i~~~~~~~~~l~~~~~~~~~~p~~yl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 165 (281)
T PF10111_consen 87 ARGDYLIFLDADCIPSPDFIEKLLNHVKKLDKNPNAFLVYPCLYLSEEGSEKFYSQFKNLWDHEFLESFISGKNSLWE-F 165 (281)
T ss_pred cCCCEEEEEcCCeeeCHHHHHHHHHHHHHHhcCCCceEEEeeeeccchhhHHHhhcchhcchHHHHHHHhhccccccc-c
Confidence 4799999999999999988888887 444332 33332221 111000 0011 1
Q ss_pred ccccCcccccCHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHcCCCce
Q 009021 269 MAFGGGGIAISYPLAEALSNIHDDCIDRYPKLYGSDDRLHACITELGIPLT 319 (546)
Q Consensus 269 fa~GGaG~vLSr~LlkkL~~~~d~C~~~~~~~~~~D~~Lg~Cl~~lGV~LT 319 (546)
.+..|+-+++++....++... |+ .|....++|..++.=|.+.|..+.
T Consensus 166 ~~~~s~~~~i~r~~f~~iGGf-DE---~f~G~G~ED~D~~~RL~~~~~~~~ 212 (281)
T PF10111_consen 166 IAFASSCFLINREDFLEIGGF-DE---RFRGWGYEDIDFGYRLKKAGYKFK 212 (281)
T ss_pred ccccceEEEEEHHHHHHhCCC-Cc---cccCCCcchHHHHHHHHHcCCcEe
Confidence 234457788999988887653 22 231122478877766777776654
No 70
>cd02514 GT13_GLCNAC-TI GT13_GLCNAC-TI is involved in an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GLCNAC-T I , GNT-I) transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide, an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus. The catalytic domain is located at the C-terminus. These proteins are members of the glycosy transferase family 13.
Probab=21.73 E-value=2.3e+02 Score=30.45 Aligned_cols=76 Identities=16% Similarity=0.251 Sum_probs=47.9
Q ss_pred HHHHHHHhcCCCccEEEEEcCCccccHH---HHHHHHhcCCCCCCeEEeccC-CC-CC------CCccccccccccCccc
Q 009021 208 IVIECFRLGLPNVRWFVLADDDTIFNSD---NLVAVLSKYDPTEMVYIGTPS-ES-HS------ANTYFSHSMAFGGGGI 276 (546)
Q Consensus 208 mv~e~~~~~~p~~kWfv~~DDDTy~~~~---nL~~~LskyD~~ep~YIG~~s-e~-~~------~~~~~g~~fa~GGaG~ 276 (546)
.+...+.. -+++-++++|||-.+.++ -+.+.|..|..++.++.=+.. .. .. ....+. .-.+.|.|.
T Consensus 88 aln~vF~~--~~~~~vIILEDDl~~sPdFf~yf~~~l~~y~~D~~v~~ISa~NdnG~~~~~~~~~~~lyr-s~ff~glGW 164 (334)
T cd02514 88 ALTQTFNL--FGYSFVIILEDDLDIAPDFFSYFQATLPLLEEDPSLWCISAWNDNGKEHFVDDTPSLLYR-TDFFPGLGW 164 (334)
T ss_pred HHHHHHHh--cCCCEEEEECCCCccCHhHHHHHHHHHHHHhcCCCEEEEEeeccCCcccccCCCcceEEE-ecCCCchHH
Confidence 34444432 379999999999999876 667777777666554332211 11 10 111111 234688999
Q ss_pred ccCHHHHHHH
Q 009021 277 AISYPLAEAL 286 (546)
Q Consensus 277 vLSr~LlkkL 286 (546)
++.+.+.+++
T Consensus 165 ml~r~~W~e~ 174 (334)
T cd02514 165 MLTRKLWKEL 174 (334)
T ss_pred HHHHHHHHHh
Confidence 9999999988
No 71
>PRK05454 glucosyltransferase MdoH; Provisional
Probab=21.65 E-value=3.4e+02 Score=32.07 Aligned_cols=35 Identities=20% Similarity=0.289 Sum_probs=28.8
Q ss_pred HHHHHhcCCCccEEEEEcCCccccHHHHHHHHhcC
Q 009021 210 IECFRLGLPNVRWFVLADDDTIFNSDNLVAVLSKY 244 (546)
Q Consensus 210 ~e~~~~~~p~~kWfv~~DDDTy~~~~nL~~~Lsky 244 (546)
.++.+....++++++..|.|+.+..+-|.+++..+
T Consensus 211 ~~~~~~~~~~~eyivvLDADs~m~~d~L~~lv~~m 245 (691)
T PRK05454 211 ADFCRRWGGAYDYMVVLDADSLMSGDTLVRLVRLM 245 (691)
T ss_pred HHHHHhcCCCcCEEEEEcCCCCCCHHHHHHHHHHH
Confidence 44454445689999999999999999999998876
No 72
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=20.54 E-value=93 Score=23.63 Aligned_cols=28 Identities=14% Similarity=0.153 Sum_probs=22.7
Q ss_pred cchhHHHHHHHHHHHHhheeEEeecccc
Q 009021 40 IGTAVVMVAALLLSTTVWLSVVFSATTC 67 (546)
Q Consensus 40 ~~~~~~~~~~~~~~~~~w~~~~~~~~~~ 67 (546)
+-++.+++.++||+++.+..|...++.+
T Consensus 4 a~~~~i~i~~~lv~~Tgy~iYtaFGppS 31 (43)
T PF02468_consen 4 ATVLAIFISCLLVSITGYAIYTAFGPPS 31 (43)
T ss_pred eeeHHHHHHHHHHHHHhhhhhheeCCCc
Confidence 3456778889999999999999887654
No 73
>PF06072 Herpes_US9: Alphaherpesvirus tegument protein US9; InterPro: IPR009278 This family consists of several US9 and related proteins from the Alphaherpesviruses. The function of the US9 protein is unknown although in Bovine herpesvirus 5 Us9 is essential for the anterograde spread of the virus from the olfactory mucosa to the bulb [].; GO: 0019033 viral tegument
Probab=20.03 E-value=34 Score=27.61 Aligned_cols=17 Identities=35% Similarity=0.309 Sum_probs=8.5
Q ss_pred ccccccccccccchhHH
Q 009021 29 RSRNRRWRHSRIGTAVV 45 (546)
Q Consensus 29 ~~~~~~~~~~~~~~~~~ 45 (546)
|+|.|+|+-.-+++++|
T Consensus 26 r~RrRrc~~~v~~v~~~ 42 (60)
T PF06072_consen 26 RRRRRRCRLAVAIVFAV 42 (60)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44555565544444444
Done!