Query psy10879
Match_columns 245
No_of_seqs 119 out of 1188
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 20:44:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10879.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10879hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2287|consensus 100.0 1.2E-54 2.6E-59 383.8 22.3 241 3-245 76-330 (349)
2 PLN03133 beta-1,3-galactosyltr 100.0 5.2E-52 1.1E-56 383.5 23.2 222 17-245 382-617 (636)
3 PLN03193 beta-1,3-galactosyltr 100.0 7.9E-50 1.7E-54 350.4 24.1 225 13-242 132-379 (408)
4 PF01762 Galactosyl_T: Galacto 100.0 4.2E-48 9.1E-53 317.4 16.6 184 34-219 1-195 (195)
5 KOG2288|consensus 100.0 1.6E-40 3.4E-45 271.9 16.0 207 17-224 8-226 (274)
6 PTZ00210 UDP-GlcNAc-dependent 100.0 2.2E-35 4.8E-40 255.8 16.8 212 17-244 77-347 (382)
7 PF02434 Fringe: Fringe-like; 99.9 5.5E-26 1.2E-30 192.9 12.0 208 20-241 6-232 (252)
8 KOG2246|consensus 99.9 2.3E-22 4.9E-27 177.8 11.8 168 17-218 88-268 (364)
9 PLN03153 hypothetical protein; 99.7 1.1E-16 2.4E-21 144.4 12.7 180 20-224 122-318 (537)
10 KOG3708|consensus 99.1 1.7E-10 3.8E-15 103.4 6.1 158 20-216 26-190 (681)
11 PF13641 Glyco_tranf_2_3: Glyc 97.6 0.0019 4.1E-08 53.2 13.1 189 20-219 1-203 (228)
12 TIGR03472 HpnI hopanoid biosyn 97.3 0.0077 1.7E-07 54.1 13.7 192 18-218 39-245 (373)
13 PF04646 DUF604: Protein of un 97.2 0.00063 1.4E-08 57.1 5.1 52 172-223 11-66 (255)
14 cd02520 Glucosylceramide_synth 97.1 0.06 1.3E-06 43.4 15.7 139 52-220 31-171 (196)
15 PF01755 Glyco_transf_25: Glyc 96.9 0.026 5.6E-07 45.9 12.0 160 24-195 4-193 (200)
16 cd04186 GT_2_like_c Subfamily 96.8 0.11 2.5E-06 39.7 14.4 90 105-221 71-160 (166)
17 cd06439 CESA_like_1 CESA_like_ 96.7 0.053 1.1E-06 45.2 12.7 190 16-219 25-222 (251)
18 TIGR03469 HonB hopene-associat 96.6 0.39 8.4E-06 43.3 18.3 190 19-215 39-251 (384)
19 PF13506 Glyco_transf_21: Glyc 96.5 0.007 1.5E-07 48.6 5.8 123 94-220 17-148 (175)
20 cd04192 GT_2_like_e Subfamily 96.4 0.15 3.2E-06 41.5 13.3 156 52-213 29-195 (229)
21 cd02525 Succinoglycan_BP_ExoA 96.2 0.55 1.2E-05 38.7 16.8 162 50-220 30-202 (249)
22 cd06421 CESA_CelA_like CESA_Ce 96.1 0.18 3.9E-06 41.3 12.7 115 101-218 77-203 (234)
23 cd04184 GT2_RfbC_Mx_like Myxoc 96.1 0.47 1E-05 37.9 14.8 120 98-220 73-195 (202)
24 cd02510 pp-GalNAc-T pp-GalNAc- 96.0 0.46 9.9E-06 41.0 15.0 120 98-217 73-216 (299)
25 cd04196 GT_2_like_d Subfamily 95.8 0.61 1.3E-05 37.4 14.2 155 51-212 27-190 (214)
26 cd06427 CESA_like_2 CESA_like_ 95.8 0.43 9.4E-06 39.7 13.6 120 98-220 74-206 (241)
27 PRK11204 N-glycosyltransferase 95.7 0.61 1.3E-05 42.3 15.3 188 16-219 50-253 (420)
28 cd06532 Glyco_transf_25 Glycos 95.4 0.22 4.7E-06 37.7 9.5 112 24-193 2-119 (128)
29 cd06420 GT2_Chondriotin_Pol_N 95.3 1.1 2.3E-05 35.1 14.6 100 99-215 70-169 (182)
30 PLN02726 dolichyl-phosphate be 95.2 1.4 3E-05 36.7 14.7 160 51-219 40-211 (243)
31 cd04185 GT_2_like_b Subfamily 95.1 1.4 3E-05 35.2 14.0 96 96-215 68-163 (202)
32 TIGR03111 glyc2_xrt_Gpos1 puta 94.9 0.46 9.9E-06 43.7 11.9 190 16-214 45-255 (439)
33 PF00535 Glycos_transf_2: Glyc 94.9 1.2 2.6E-05 33.6 12.9 155 22-187 2-168 (169)
34 PRK14583 hmsR N-glycosyltransf 94.8 0.87 1.9E-05 41.9 13.5 188 18-219 73-274 (444)
35 cd06434 GT2_HAS Hyaluronan syn 94.4 1.4 3E-05 36.1 12.6 157 52-218 29-205 (235)
36 cd06433 GT_2_WfgS_like WfgS an 93.9 2.5 5.4E-05 33.2 14.8 120 97-219 64-187 (202)
37 COG1215 Glycosyltransferases, 93.8 3 6.5E-05 37.8 14.7 191 20-221 54-260 (439)
38 cd04191 Glucan_BSP_ModH Glucan 93.1 2.4 5.1E-05 36.1 12.0 189 24-220 3-225 (254)
39 PF13632 Glyco_trans_2_3: Glyc 93.1 0.4 8.6E-06 38.3 6.9 107 111-219 1-117 (193)
40 PF10111 Glyco_tranf_2_2: Glyc 93.0 5.3 0.00011 34.3 14.4 163 50-216 33-212 (281)
41 cd02526 GT2_RfbF_like RfbF is 92.9 4.4 9.5E-05 33.1 15.1 111 108-219 75-197 (237)
42 COG1216 Predicted glycosyltran 92.7 6.2 0.00013 34.2 16.0 190 21-219 4-213 (305)
43 cd04195 GT2_AmsE_like GT2_AmsE 92.6 3.1 6.7E-05 33.0 11.5 116 99-219 71-194 (201)
44 cd06423 CESA_like CESA_like is 92.4 1.4 3.1E-05 33.3 9.0 93 97-189 67-171 (180)
45 cd06442 DPM1_like DPM1_like re 92.2 5.2 0.00011 32.3 14.2 113 102-219 72-196 (224)
46 cd06435 CESA_NdvC_like NdvC_li 92.2 2.7 6E-05 34.4 11.1 120 98-220 72-203 (236)
47 PRK14716 bacteriophage N4 adso 92.0 11 0.00024 35.5 16.3 193 18-219 64-282 (504)
48 cd04187 DPM1_like_bac Bacteria 89.4 4.5 9.8E-05 31.6 9.4 132 51-191 29-166 (181)
49 TIGR01556 rhamnosyltran L-rham 88.6 13 0.00028 31.5 12.4 117 98-216 64-191 (281)
50 cd02522 GT_2_like_a GT_2_like_ 86.5 6.1 0.00013 31.8 8.7 105 103-216 67-177 (221)
51 TIGR03030 CelA cellulose synth 86.1 37 0.0008 33.4 16.7 116 99-218 219-350 (713)
52 PRK11234 nfrB bacteriophage N4 85.8 38 0.00082 33.5 15.0 190 18-217 61-277 (727)
53 cd04179 DPM_DPG-synthase_like 85.2 7.6 0.00017 30.1 8.5 130 52-188 29-167 (185)
54 cd00761 Glyco_tranf_GTA_type G 84.7 13 0.00028 27.0 11.8 85 98-212 67-151 (156)
55 PRK05454 glucosyltransferase M 84.2 24 0.00051 34.7 12.8 198 15-220 119-350 (691)
56 cd04188 DPG_synthase DPG_synth 83.4 21 0.00046 28.5 12.5 161 51-219 30-201 (211)
57 PF02485 Branch: Core-2/I-Bran 82.4 16 0.00034 30.5 9.7 213 24-244 3-239 (244)
58 cd06437 CESA_CaSu_A2 Cellulose 82.3 25 0.00055 28.6 14.8 114 99-219 78-206 (232)
59 PF05212 DUF707: Protein of un 79.4 7.1 0.00015 33.8 6.5 177 39-217 26-244 (294)
60 cd02514 GT13_GLCNAC-TI GT13_GL 76.6 55 0.0012 29.1 12.8 84 98-187 87-174 (334)
61 cd04190 Chitin_synth_C C-termi 75.5 8.9 0.00019 31.9 6.1 110 108-217 73-211 (244)
62 PLN03181 glycosyltransferase; 74.6 49 0.0011 30.4 10.6 99 20-121 95-211 (453)
63 cd06913 beta3GnTL1_like Beta 1 72.6 48 0.001 26.6 10.9 44 100-144 76-119 (219)
64 PRK11498 bcsA cellulose syntha 68.5 1.4E+02 0.003 30.2 15.2 117 99-218 330-461 (852)
65 PRK10714 undecaprenyl phosphat 64.1 1E+02 0.0022 27.1 11.7 132 51-190 38-175 (325)
66 cd06438 EpsO_like EpsO protein 62.1 73 0.0016 24.8 11.6 85 98-186 70-169 (183)
67 PHA01631 hypothetical protein 61.1 57 0.0012 25.9 7.2 94 75-191 38-134 (176)
68 PRK10073 putative glycosyl tra 60.2 1.2E+02 0.0026 26.6 12.1 105 20-133 6-110 (328)
69 PF13704 Glyco_tranf_2_4: Glyc 58.9 49 0.0011 22.9 6.3 39 86-125 49-88 (97)
70 COG4092 Predicted glycosyltran 57.9 99 0.0021 26.8 8.7 179 33-214 19-220 (346)
71 PHA02688 ORF059 IMV protein VP 56.3 16 0.00035 32.0 3.8 95 93-193 97-203 (323)
72 PF04666 Glyco_transf_54: N-Ac 53.3 88 0.0019 27.3 8.0 23 102-124 161-185 (297)
73 KOG2547|consensus 51.4 56 0.0012 29.6 6.5 110 18-135 85-197 (431)
74 PRK10018 putative glycosyl tra 50.2 1.6E+02 0.0036 25.2 9.7 36 99-134 76-111 (279)
75 PF03213 Pox_P35: Poxvirus P35 44.8 59 0.0013 28.6 5.5 94 92-194 98-206 (325)
76 PF00483 NTP_transferase: Nucl 43.2 1.7E+02 0.0036 24.0 8.1 94 95-189 85-192 (248)
77 PF03071 GNT-I: GNT-I family; 41.5 2.7E+02 0.0058 25.8 9.5 92 94-190 173-272 (434)
78 COG3306 Glycosyltransferase in 34.6 44 0.00096 28.5 3.2 52 171-222 154-207 (255)
79 PF03452 Anp1: Anp1; InterPro 33.6 3.1E+02 0.0068 23.6 10.0 127 13-146 18-177 (269)
80 PF03142 Chitin_synth_2: Chiti 33.4 3E+02 0.0064 26.3 8.7 29 108-136 201-229 (527)
81 PF06306 CgtA: Beta-1,4-N-acet 33.2 64 0.0014 28.6 3.9 66 67-132 129-199 (347)
82 PTZ00260 dolichyl-phosphate be 31.5 3.6E+02 0.0078 23.7 17.0 192 17-218 67-284 (333)
83 PF13712 Glyco_tranf_2_5: Glyc 31.4 59 0.0013 26.8 3.4 115 97-215 43-187 (217)
84 PF03490 Varsurf_PPLC: Variant 31.1 28 0.0006 21.7 1.0 22 41-62 9-34 (51)
85 cd06436 GlcNAc-1-P_transferase 30.9 33 0.0007 27.2 1.8 76 109-187 90-178 (191)
86 cd02511 Beta4Glucosyltransfera 29.4 1.3E+02 0.0027 24.6 5.1 37 98-134 61-97 (229)
87 PF09258 Glyco_transf_64: Glyc 28.7 92 0.002 26.3 4.2 104 108-216 75-188 (247)
88 KOG2336|consensus 28.6 54 0.0012 28.6 2.7 38 8-47 171-215 (422)
89 PLN03182 xyloglucan 6-xylosylt 27.6 4.9E+02 0.011 24.0 8.9 87 31-121 103-210 (429)
90 PRK15489 nfrB bacteriophage N4 27.3 6.3E+02 0.014 25.1 15.8 192 18-217 69-285 (703)
91 TIGR02148 Fibro_Slime fibro-sl 27.1 38 0.00083 24.0 1.3 17 109-125 20-36 (90)
92 PLN02458 transferase, transfer 24.4 5.1E+02 0.011 23.1 11.6 37 96-133 190-226 (346)
93 PF13075 DUF3939: Protein of u 20.5 2.1E+02 0.0045 22.1 4.2 45 98-144 18-63 (140)
94 PF02709 Glyco_transf_7C: N-te 20.3 1.4E+02 0.0031 20.3 3.1 47 171-217 18-64 (78)
No 1
>KOG2287|consensus
Probab=100.00 E-value=1.2e-54 Score=383.79 Aligned_cols=241 Identities=27% Similarity=0.364 Sum_probs=216.9
Q ss_pred cccccCCCCCCCCCCCC--eeEEEEEeCCCCCHHHHHHHHHHHcCCcC--CCcEEEEEEeecCch--HHHHHHHHHhhhC
Q psy10879 3 ISANQTGLSKFKNLKHS--YFIVILILTSSKNYIRRKNIRDTWIKNIE--KYQVKYLFSIGNKEN--VVDVKLKEEIRRY 76 (245)
Q Consensus 3 ~~~~~~~l~~~~~c~~~--~~lli~V~S~~~~~~~R~~IR~TW~~~~~--~~~~~~~Fv~g~~~~--~~~~~l~~E~~~~ 76 (245)
++.....+..|+.|.+. ++|+++|.|+++++++|++||+|||+... +.+++.+|++|.+++ ..+..|.+|++.|
T Consensus 76 ~~~~~~~l~~p~~~~~~~~~~lLl~V~S~~~~farR~aiR~TW~~~~~v~~~~v~~~FLvG~~~~~~~~~~~l~~Ea~~y 155 (349)
T KOG2287|consen 76 FQKFFYLLYLPEICDPDRPPELLLLVKSAPDNFARRNAIRKTWGNENNVRGGRVRVLFLVGLPSNEDKLNKLLADEARLY 155 (349)
T ss_pred hccChhhhcCChhhcCCCCceEEEEEecCCCCHHHHHHHHHHhcCccccCCCcEEEEEEecCCCcHHHHHHHHHHHHHHh
Confidence 44555667778777544 79999999999999999999999999985 788999999999983 4588999999999
Q ss_pred CCEEEcCCccccCCchhHHHHHHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhc-CcCCcccEEEEecCCCeeecC
Q psy10879 77 DDILLLNQVPDEYSSLSQKVLHSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENK-FHYEKKLYWGFFDGRARVRRA 154 (245)
Q Consensus 77 ~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~-~~~~~~~y~G~~~~~~~~~r~ 154 (245)
||||+ .||.|+|.|+|+|+++++.|..++| +++|++|+|||+||++++|.++|... .+. +.+|+|++..+..|.|+
T Consensus 156 gDIi~-~df~Dty~nltlKtl~~l~w~~~~cp~akfi~K~DDDvfv~~~~L~~~L~~~~~~~-~~~~~G~v~~~~~p~R~ 233 (349)
T KOG2287|consen 156 GDIIQ-VDFEDTYFNLTLKTLAILLWGVSKCPDAKFILKIDDDVFVNPDNLLEYLDKLNDPS-SDLYYGRVIQNAPPIRD 233 (349)
T ss_pred CCEEE-EecccchhchHHHHHHHHHHHHhcCCcceEEEeccCceEEcHHHHHHHHhccCCCC-cceEEEeecccCCCCCC
Confidence 99995 5899999999999999999999999 99999999999999999999999999 776 89999999988888885
Q ss_pred --CCCcccC-cccCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhC-CCeEeecCCccccc-CCCC
Q psy10879 155 --GKWKEKN-WFLCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSAL-DITRYHDVRFDTEF-QSRG 229 (245)
Q Consensus 155 --~k~~~~~-~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~-~v~~~~~~~f~~~~-~~~~ 229 (245)
+|||+++ .|+++.|||||+|+||++|+++|+.|++++.+.+.+.+|||++|+|+++. ||.++++++|.... ...+
T Consensus 234 ~~~KwyVp~~~y~~~~YP~Y~sG~gYvis~~~a~~l~~~s~~~~~~~iEDV~~g~~l~~~~gi~~~~~~~~~~~~~~~~~ 313 (349)
T KOG2287|consen 234 KTSKWYVPESEYPCSVYPPYASGPGYVISGDAARRLLKASKHLKFFPIEDVFVGGCLAEDLGIKPVNHPGFFEIPLSFDP 313 (349)
T ss_pred CCCCCccCHHHCCCCCCCCcCCCceeEecHHHHHHHHHHhcCCCccchHHHHHHHHHHHhcCCCcccCcccccccccCCC
Confidence 8999995 89999999999999999999999999999999999999999999999998 99999999855322 2367
Q ss_pred cc-CCeEEEeccCCCCC
Q psy10879 230 CN-NSYLIVHKQNMHQL 245 (245)
Q Consensus 230 c~-~~~~~~h~~~p~~m 245 (245)
|+ .+.+++|.++|.+|
T Consensus 314 ~~~~~~~~~H~~~p~e~ 330 (349)
T KOG2287|consen 314 CCYRDLLAVHRLSPNEM 330 (349)
T ss_pred CcccceEEEecCCHHHH
Confidence 77 59999999998875
No 2
>PLN03133 beta-1,3-galactosyltransferase; Provisional
Probab=100.00 E-value=5.2e-52 Score=383.46 Aligned_cols=222 Identities=27% Similarity=0.488 Sum_probs=197.0
Q ss_pred CCCeeEEEEEeCCCCCHHHHHHHHHHHcCCcC--CCcEEEEEEeecCc-hHHHHHHHHHhhhCCCEEEcCCccccCCchh
Q psy10879 17 KHSYFIVILILTSSKNYIRRKNIRDTWIKNIE--KYQVKYLFSIGNKE-NVVDVKLKEEIRRYDDILLLNQVPDEYSSLS 93 (245)
Q Consensus 17 ~~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~--~~~~~~~Fv~g~~~-~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~ 93 (245)
..+++|+|+|+|+++|+++|++||+|||+... +..++.+|++|.+. +..+..|..|+++|||||+. ||.|+|.|+|
T Consensus 382 ~~~~~LlI~V~Sap~nf~rR~AIR~TWg~~~~~~~~~v~~rFvVG~s~n~~l~~~L~~Ea~~ygDIIq~-dF~DsY~NLT 460 (636)
T PLN03133 382 KKPLDLFIGVFSTANNFKRRMAVRRTWMQYDAVRSGAVAVRFFVGLHKNQMVNEELWNEARTYGDIQLM-PFVDYYSLIT 460 (636)
T ss_pred CCceEEEEEEeCCcccHHHHHHHHHhhccccccCCCceEEEEEEecCCcHHHHHHHHHHHHHcCCeEEE-eeechhhhhH
Confidence 35689999999999999999999999999653 45689999999987 77889999999999999965 7999999999
Q ss_pred HHHHHHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecC--CCCccc-CcccCCCCC
Q psy10879 94 QKVLHSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRA--GKWKEK-NWFLCDRYL 169 (245)
Q Consensus 94 ~K~~~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~--~k~~~~-~~~~~~~yp 169 (245)
+|+++++.|+. +| +++|+||+|||+|||+++|+++|...... +.+|+|++..+..|.|+ +|||++ +.|+.+.||
T Consensus 461 lKtl~~~~wa~-~c~~akFilK~DDDvFVnv~~Ll~~L~~~~~~-~~Ly~G~v~~~~~PiRd~~sKWYVs~~eyp~~~YP 538 (636)
T PLN03133 461 WKTLAICIFGT-EVVSAKYVMKTDDDAFVRVDEVLASLKRTNVS-HGLLYGLINSDSQPHRNPDSKWYISPEEWPEETYP 538 (636)
T ss_pred HHHHHHHHHHH-hCCCceEEEEcCCceEEcHHHHHHHHHhcCCC-CceEEEEeccCCCcccCCCCCCCCCHHHCCCCCCC
Confidence 99999999985 56 99999999999999999999999877665 78999999988889986 899998 589999999
Q ss_pred CccCCCeeeecHHHHHHHHHhc--ccCCCCCCChhHHHHHHHh---CCCeEeec--CCcccccCCCCccCCeEEEeccCC
Q psy10879 170 PYALGGGYVLSHLLVKFISENS--HMLSSYISEDVSVGVWLSA---LDITRYHD--VRFDTEFQSRGCNNSYLIVHKQNM 242 (245)
Q Consensus 170 ~y~~G~gyvlS~~~~~~l~~~~--~~~~~~~~eDv~iG~~l~~---~~v~~~~~--~~f~~~~~~~~c~~~~~~~h~~~p 242 (245)
|||+|+||+||+++|+.|+.++ ..++.+.+|||++|+|+.. +|+.+.+. .+|+ .++|..++|++|+++|
T Consensus 539 pYasG~gYVlS~Dla~~L~~~s~s~~l~~f~lEDVyvGi~l~~l~k~gl~v~~~~~~r~~----~~~C~~~~i~~H~~sP 614 (636)
T PLN03133 539 PWAHGPGYVVSRDIAKEVYKRHKEGRLKMFKLEDVAMGIWIAEMKKEGLEVKYENDGRIY----NEGCKDGYVVAHYQSP 614 (636)
T ss_pred CCCCcCEEEEcHHHHHHHHHhhhhcccCcCChhhHhHHHHHHHhcccCCCceeeCCCccc----CCcCCCCeEEEecCCH
Confidence 9999999999999999999875 4688999999999999874 46665443 3444 5789999999999999
Q ss_pred CCC
Q psy10879 243 HQL 245 (245)
Q Consensus 243 ~~m 245 (245)
++|
T Consensus 615 ~eM 617 (636)
T PLN03133 615 REM 617 (636)
T ss_pred HHH
Confidence 987
No 3
>PLN03193 beta-1,3-galactosyltransferase; Provisional
Probab=100.00 E-value=7.9e-50 Score=350.37 Aligned_cols=225 Identities=26% Similarity=0.421 Sum_probs=192.8
Q ss_pred CCCCCCCeeEEEEEeCCCCCHHHHHHHHHHHcCCcC-------CCcEEEEEEeecCc---hHHHHHHHHHhhhCCCEEEc
Q psy10879 13 FKNLKHSYFIVILILTSSKNYIRRKNIRDTWIKNIE-------KYQVKYLFSIGNKE---NVVDVKLKEEIRRYDDILLL 82 (245)
Q Consensus 13 ~~~c~~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~-------~~~~~~~Fv~g~~~---~~~~~~l~~E~~~~~Dii~~ 82 (245)
+.+-+++++++|+|.|++++++||++||+||++..+ ..+++++|++|.+. ...+..|++|+++|||||++
T Consensus 132 ~~~~~~~~~LvIgI~Sap~~~~RR~AIR~TWg~~~~~~~kle~~~gv~vrFVIG~s~~~~~~ldr~Le~Ea~~ygDIL~l 211 (408)
T PLN03193 132 QSSGKRRYLMVVGINTAFSSRKRRDSVRATWMPQGEKRKKLEEEKGIIIRFVIGHSATSGGILDRAIEAEDRKHGDFLRL 211 (408)
T ss_pred CCCCcceEEEEEEEeCCCCCHHHHHHHHHHHcCCcccccccccCCcEEEEEEeecCCCcchHHHHHHHHHHHHhCCEEEE
Confidence 344567799999999999999999999999998642 36799999999986 46888999999999999976
Q ss_pred CCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecC---CCCcc
Q psy10879 83 NQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRA---GKWKE 159 (245)
Q Consensus 83 ~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~---~k~~~ 159 (245)
|+.|+|.|++.||+++++|+.++|+++|+||+|||+|||+++|..+|...... .++|+|++..+ |.|+ .||+.
T Consensus 212 -DfvDsY~NLT~KTl~~f~wA~~~~dAkF~mK~DDDvfVnv~~L~~~L~~~~~~-~rlYiG~m~~g--Pvr~~~~~ky~e 287 (408)
T PLN03193 212 -DHVEGYLELSAKTKTYFATAVAMWDADFYVKVDDDVHVNIATLGETLVRHRKK-PRVYIGCMKSG--PVLSQKGVRYHE 287 (408)
T ss_pred -ecccccccchHHHHHHHHHHHHcCCCeEEEEcCCCceEcHHHHHHHHHhcCCC-CCEEEEecccC--ccccCCCCcCcC
Confidence 79999999999999999999998899999999999999999999999877654 67999999764 4453 45677
Q ss_pred cCcc----cCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCCcccccCCCCcc----
Q psy10879 160 KNWF----LCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDVRFDTEFQSRGCN---- 231 (245)
Q Consensus 160 ~~~~----~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~f~~~~~~~~c~---- 231 (245)
+++| +.+.|||||+|+||+||+++++.|+.+...++.+.+|||++|.|+.+++|+.+|+++|++. ....|.
T Consensus 288 pe~w~~~~~~~~YPpyAsG~gYVlS~DLa~~I~~n~~~L~~y~~EDV~vG~Wl~~L~V~~vdd~~fcc~-~~~~C~~~~~ 366 (408)
T PLN03193 288 PEYWKFGENGNKYFRHATGQLYAISKDLASYISINQHVLHKYANEDVSLGSWFIGLDVEHIDDRRLCCG-TPPDCEWKAQ 366 (408)
T ss_pred cccccccCccccCCCCCCcceEEehHHHHHHHHhChhhhcccCcchhhhhhHhccCCceeeecccccCC-CCcccccccc
Confidence 7655 5689999999999999999999999999889999999999999999999999999999874 223443
Q ss_pred --CCeEEEeccCC
Q psy10879 232 --NSYLIVHKQNM 242 (245)
Q Consensus 232 --~~~~~~h~~~p 242 (245)
.-.++++.+++
T Consensus 367 ~~~~c~~~~~~~c 379 (408)
T PLN03193 367 AGNICVASFDWSC 379 (408)
T ss_pred CCCeeEEEecccC
Confidence 12356665554
No 4
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=100.00 E-value=4.2e-48 Score=317.37 Aligned_cols=184 Identities=33% Similarity=0.481 Sum_probs=168.7
Q ss_pred HHHHHHHHHHcCCcC--CCcEEEEEEeecCc---hHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhcc-
Q psy10879 34 IRRKNIRDTWIKNIE--KYQVKYLFSIGNKE---NVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHF- 107 (245)
Q Consensus 34 ~~R~~IR~TW~~~~~--~~~~~~~Fv~g~~~---~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~- 107 (245)
+||++||+||++... ..+++++|++|.++ ...+..|.+|+++|||||+. |+.|+|.|+++|++++++|+.++|
T Consensus 1 ~rR~~IR~TW~~~~~~~~~~~~~~FvvG~~~~~~~~~~~~l~~E~~~y~Dil~~-d~~D~y~nlt~K~~~~~~w~~~~c~ 79 (195)
T PF01762_consen 1 ERRQAIRETWGNQRNFKGVRVKVVFVVGESPNSDSDLQEALQEEAEKYGDILQG-DFVDSYRNLTLKTLAGLKWASKHCP 79 (195)
T ss_pred ChHHHHHHHHhcccccCCCcEEEEEEEecCCCCcHHHHHHhhhhhhhcCceEee-ecccccchhhHHHHHHHHHHHhhCC
Confidence 489999999999874 68899999999998 45678899999999999965 899999999999999999999999
Q ss_pred ceeEEEEecCceeeehHHHHHHHhhc--CcCCcccEEEEecCCCeeecC--CCCccc-CcccCCCCCCccCCCeeeecHH
Q psy10879 108 DFQYLLKCDDDTFVRVPNIIHELENK--FHYEKKLYWGFFDGRARVRRA--GKWKEK-NWFLCDRYLPYALGGGYVLSHL 182 (245)
Q Consensus 108 ~~~~~~k~DDD~~v~~~~L~~~L~~~--~~~~~~~y~G~~~~~~~~~r~--~k~~~~-~~~~~~~yp~y~~G~gyvlS~~ 182 (245)
+++|++|+|||+|||+++|.++|... .+. ...+.|.+..+.+|.|+ +||+++ +.|+.+.|||||+|+||+||++
T Consensus 80 ~~~~v~k~DDD~~vn~~~l~~~L~~~~~~~~-~~~~~g~~~~~~~~~r~~~~kw~v~~~~y~~~~yP~y~~G~~yvls~~ 158 (195)
T PF01762_consen 80 NAKYVLKVDDDVFVNPDRLVSFLKSLKQDPS-KNSIYGGCIKNGPPIRDPSSKWYVSEEEYPDDYYPPYCSGGGYVLSSD 158 (195)
T ss_pred chhheeecCcEEEEehHHhhhhhhhcccCcc-ccccccccccCCccccccccCceeeeeecccccCCCcCCCCeEEecHH
Confidence 79999999999999999999999988 444 67888888777777775 889998 4799999999999999999999
Q ss_pred HHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 183 LVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 183 ~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
+++.|+.++..++.++.|||++|+|+.++||+++|+|
T Consensus 159 ~v~~i~~~~~~~~~~~~eDv~iGi~~~~~~i~~~~~~ 195 (195)
T PF01762_consen 159 VVKRIYKASSHTPFFPLEDVFIGILAEKLGIKPIHDP 195 (195)
T ss_pred HHHHHHHHhhcCCCCCchHHHHHHHHHHCCCCccCCC
Confidence 9999999999999999999999999999999999875
No 5
>KOG2288|consensus
Probab=100.00 E-value=1.6e-40 Score=271.92 Aligned_cols=207 Identities=34% Similarity=0.562 Sum_probs=192.2
Q ss_pred CCCeeEEEEEeCCCCCHHHHHHHHHHHcCCc------C-CCcEEEEEEeecCc--hHHHHHHHHHhhhCCCEEEcCCccc
Q psy10879 17 KHSYFIVILILTSSKNYIRRKNIRDTWIKNI------E-KYQVKYLFSIGNKE--NVVDVKLKEEIRRYDDILLLNQVPD 87 (245)
Q Consensus 17 ~~~~~lli~V~S~~~~~~~R~~IR~TW~~~~------~-~~~~~~~Fv~g~~~--~~~~~~l~~E~~~~~Dii~~~d~~d 87 (245)
+++++++|+|.|++....||+++|+||+... + ..++..+|++|... .+...+|++|.++|+|.+.+.+.+|
T Consensus 8 ~~k~l~vigI~T~f~s~~RR~~vR~TWmp~~~~l~rle~e~gv~~RFvIG~~~~g~~~~r~ie~E~~~~~DfllLd~h~E 87 (274)
T KOG2288|consen 8 RRKVLLVIGINTAFSSRKRRDSVRQTWMPSGEGLKRLEEEKGVIIRFVIGTATLGASLDRALEEENAQHGDFLLLDRHEE 87 (274)
T ss_pred ccceEEEEEeecccchhhhHHHHHHhhcCCccchhhhccccceEEEEEeccCCccHHHHHHHHHHHHhcCCeEeechhHH
Confidence 7789999999999999999999999999993 2 67899999999955 7889999999999999999876999
Q ss_pred cCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeec-CCCCcccCcccCC
Q psy10879 88 EYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRR-AGKWKEKNWFLCD 166 (245)
Q Consensus 88 ~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r-~~k~~~~~~~~~~ 166 (245)
.|.+|+.||+.++.++..+++++|++|+|||+||++..|...|...... ..+|+|.+.++..+.+ .+||+.|+|..++
T Consensus 88 ~Y~~Ls~Kt~~~f~~A~~~~daeFyvKvDDDv~v~l~~L~~~la~~r~~-pr~YiGcmksg~v~~~~~~kw~EpeWkfg~ 166 (274)
T KOG2288|consen 88 AYEELSAKTKAFFSAAVAHWDAEFYVKVDDDVYVRLARLGTLLARERSH-PRLYIGCMKSGPVLTQPGGKWYEPEWKFGD 166 (274)
T ss_pred HHHHHHHHHHHHHHHHHHhccceEEEEccccceecHHHHHHHHHhhccC-CceEEEEecCCccccCCCCcccChhhhcCc
Confidence 9999999999999999999999999999999999999999999887765 7899999998877766 6999999987766
Q ss_pred C--CCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCCcccc
Q psy10879 167 R--YLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDVRFDTE 224 (245)
Q Consensus 167 ~--yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~f~~~ 224 (245)
. |.+|..|++|+||++++.-|.-+.+.+..+..|||.+|.|+..++|+.+|++++++.
T Consensus 167 ~g~YfrhA~G~~YvlS~dLa~yi~in~~lL~~y~nEDVSlGaW~~gldV~h~dd~rlC~~ 226 (274)
T KOG2288|consen 167 NGNYFRHATGGGYVLSKDLATYISINRQLLHKYANEDVSLGAWMIGLDVEHVDDPRLCCS 226 (274)
T ss_pred ccccchhccCceEEeeHHHHHHHHHhHHHHHhhccCCcccceeeeeeeeeEecCCccccc
Confidence 5 999999999999999999999999999999999999999999999999999999863
No 6
>PTZ00210 UDP-GlcNAc-dependent glycosyltransferase; Provisional
Probab=100.00 E-value=2.2e-35 Score=255.76 Aligned_cols=212 Identities=25% Similarity=0.336 Sum_probs=172.1
Q ss_pred CCCeeEEEEEeCCCCC--HHHHHHHHHHHcCCcC--------CCcEEEEEEeecCc-h--HHHHHHHHHhhhCCCEEEcC
Q psy10879 17 KHSYFIVILILTSSKN--YIRRKNIRDTWIKNIE--------KYQVKYLFSIGNKE-N--VVDVKLKEEIRRYDDILLLN 83 (245)
Q Consensus 17 ~~~~~lli~V~S~~~~--~~~R~~IR~TW~~~~~--------~~~~~~~Fv~g~~~-~--~~~~~l~~E~~~~~Dii~~~ 83 (245)
.++..++++|+|...+ +.||+++|+||.+-.. ...+-++|++|..+ . +.++++.+|++.|||||.+
T Consensus 77 ~~~~lv~~Gi~S~d~~~r~~rR~lqr~t~w~y~~va~~~n~ftg~~lv~y~l~~H~~~~~~~~~~L~eEA~~~~DIVil- 155 (382)
T PTZ00210 77 AQRFLAVLGIPSVDNSERSRRRDLQRQTCWKYSGVATRSNNFSGSLLPLYLLAPHQSNSYLISHSLKEEAARTHDIITL- 155 (382)
T ss_pred cCCceEEEeccCCCchHHHHHHHHHHhhhhcchhhhhhccCCchhhhhhhhhccCCccchhhhHHHHHHHHHhCCEEEE-
Confidence 3567899999999988 8889999999988752 46788899999998 4 8899999999999999976
Q ss_pred Cc------------------cccCCchhHHHHHHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEE
Q psy10879 84 QV------------------PDEYSSLSQKVLHSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGF 144 (245)
Q Consensus 84 d~------------------~d~y~~l~~K~~~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~ 144 (245)
+| .|++.++++||+++++|+.+.| +++|++|+|||+||++++++++|+.... ..+|+|+
T Consensus 156 pf~d~~~tTnKkiG~~g~WG~e~e~~mT~KT~l~~~wA~~~cP~a~YImKgDDDvFVrVp~lL~~Lr~~pr--r~LY~G~ 233 (382)
T PTZ00210 156 PTNDVSPSTRKKIGENGNWGIEAEVAMSRKTYLWLRFALHMFPNVSYIVKGDDDIFIRVPKYLADLRVMPR--HGLYMGR 233 (382)
T ss_pred ecccCccccccccccCCcccchhhcchhHHHHHHHHHHHHhCCCCCeEEEcCCCeEeeHHHHHHHHhhCCC--CceEEEe
Confidence 58 6777889999999999999999 9999999999999999999999966654 6799999
Q ss_pred ecCCCeeecCCCCcccCcccCCCCCCccCCCeeeecHHHHHHHHHhccc--C---------------CCCCCChhHHHHH
Q psy10879 145 FDGRARVRRAGKWKEKNWFLCDRYLPYALGGGYVLSHLLVKFISENSHM--L---------------SSYISEDVSVGVW 207 (245)
Q Consensus 145 ~~~~~~~~r~~k~~~~~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~--~---------------~~~~~eDv~iG~~ 207 (245)
+.....|.|++ +||||+|+||+||+++|+.|++.... + -.+..||+++|.+
T Consensus 234 v~~~~~p~Rd~------------~PpY~~G~gYvLSrDVA~~Lvs~~pl~rL~~~pys~~~~~~y~~~~~~~EDiMvG~v 301 (382)
T PTZ00210 234 YNYYNRIWRRN------------QLTYVNGYCITLSRDTAQAIISYKPLERLVNMPFSMWDYFDFLDLGMFYEDVMVGMI 301 (382)
T ss_pred eCCCCccccCC------------CCCccccceeeccHHHHHHHHhhChHhHhhcCCCchHHHHHHHHhhcCchHHHHHHH
Confidence 98776666642 59999999999999999999998532 1 1246899999999
Q ss_pred HHh-CC---CeEeec--CCcccccCCCCc----cCCeEEEeccCCCC
Q psy10879 208 LSA-LD---ITRYHD--VRFDTEFQSRGC----NNSYLIVHKQNMHQ 244 (245)
Q Consensus 208 l~~-~~---v~~~~~--~~f~~~~~~~~c----~~~~~~~h~~~p~~ 244 (245)
|+. ++ +--+.. .+||... ...| ..+.|++|++.++|
T Consensus 302 Lr~~~k~~~l~~V~~~~c~Fhd~~-~~~~~~~v~~~sVvvHhike~d 347 (382)
T PTZ00210 302 LREKVVYRNLISVEMGRCHFHNAG-KFGVRKSVRNMSVVIHHIQEAD 347 (382)
T ss_pred HHHhcCcCceeeeccccccceecC-CCCCccccccceEEEEecCHHH
Confidence 964 43 223333 3555422 2233 36899999998764
No 7
>PF02434 Fringe: Fringe-like; InterPro: IPR003378 The Notch receptor is a large, cell surface transmembrane protein involved in a wide variety of developmental processes in higher organisms []. It becomes activated when its extracellular region binds to ligands located on adjacent cells. Much of this extracellular region is composed of EGF-like repeats, many of which can be O-fucosylated. A number of these O-fucosylated repeats can in turn be further modified by the action of a beta-1,3-N-acetylglucosaminyltransferase enzyme known as Fringe []. Fringe potentiates the activation of Notch by Delta ligands, while inhibiting activation by Serrate/Jagged ligands. This regulation of Notch signalling by Fringe is important in many processes []. Four distinct Fringe proteins have so far been studied in detail; Drosophila Fringe (Dfng) and its three mammalian homologues Lunatic Fringe (Lfng), Radical Fringe (Rfng) and Manic Fringe (Mfng). Dfng, Lfng and Rfng have all been shown to play important roles in developmental processes within their host, though the phenotype of mutants can vary between species e.g. Rfng mutants are retarded in wing development in chickens, but have no obvious phenotype in mice [, , ]. Mfng mutants have not, so far, been charcterised. Biochemical studies indicate that the Fringe proteins are fucose-specific transferases requiring manganese for activity and utilising UDP-N-acetylglucosamine as a donor substrate []. The three mammalian proteins show distinct variations in their catalytic efficiencies with different substrates. Dfng is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands which is localised to the Golgi apparatus [] (not secreted as previously thought). Modification of Notch occurs through glycosylation by Dfng. This entry consists of Fringe proteins and related glycosyltransferase enzymes including: Beta-1,3-glucosyltransferase, which glucosylates O-linked fucosylglycan on thrombospondin type 1 repeat domains []. Core 1 beta1,3-galactosyltransferase 1, generates the core T antigen, which is a precursor for many extended O-glycans in glycoproteins and plays a central role in many processes, such as angiogenesis, thrombopoiesis and kidney homeostasis development []. ; GO: 0016757 transferase activity, transferring glycosyl groups, 0016020 membrane; PDB: 2J0B_A 2J0A_A.
Probab=99.93 E-value=5.5e-26 Score=192.94 Aligned_cols=208 Identities=20% Similarity=0.220 Sum_probs=111.3
Q ss_pred eeEEEEEeCCCCCHHHHH-HHHHHHcCCcCCCcEEEEE-EeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHH
Q psy10879 20 YFIVILILTSSKNYIRRK-NIRDTWIKNIEKYQVKYLF-SIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVL 97 (245)
Q Consensus 20 ~~lli~V~S~~~~~~~R~-~IR~TW~~~~~~~~~~~~F-v~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~ 97 (245)
-+|+|+|+|++++++.|. +|++||+++++ +..| +.....+ .+..+ ...+++ .+++...+...+++.+
T Consensus 6 ~dI~i~V~T~~k~h~tR~~~I~~TW~~~~~----~~~~ifsd~~d~----~l~~~--~~~~l~-~~~~~~~~~~~~~~~~ 74 (252)
T PF02434_consen 6 DDIFIAVKTTKKFHKTRAPAIKQTWAKRCN----KQTFIFSDAEDP----SLPTV--TGVHLV-NPNCDAGHCRKTLSCK 74 (252)
T ss_dssp GGEEEEEE--GGGTTTTHHHHHHTGGGGSG----GGEEEEESS--H----HHHHH--HGGGEE-E-------------HH
T ss_pred ccEEEEEEeCHHHHHHHHHHHHHHHHhhcC----CceEEecCcccc----ccccc--cccccc-cCCCcchhhHHHHHHH
Confidence 379999999999999999 99999999994 2345 3433222 22222 233455 4555544444344444
Q ss_pred HHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCc-cCCC
Q psy10879 98 HSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPY-ALGG 175 (245)
Q Consensus 98 ~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y-~~G~ 175 (245)
+.+.+-...- +++||+++||||||++++|.++|...+++ +++|+|+...........+.... ......| .| .||+
T Consensus 75 ~~~~y~~~~~~~~~Wf~~~DDDtyv~~~~L~~~L~~~~~~-~~~yiG~~~~~~~~~~~~~~~~~-~~~~~~~-~f~~GGa 151 (252)
T PF02434_consen 75 MAYEYDHFLNSDKDWFCFADDDTYVNVENLRRLLSKYDPS-EPIYIGRPSGDRPIEIIHRFNPN-KSKDSGF-WFATGGA 151 (252)
T ss_dssp HHHHHHHHHHHT-SEEEEEETTEEE-HHHHHHHHTTS-TT-S--EEE-EE------------------------EE-GGG
T ss_pred HHHHHHhhhcCCceEEEEEeCCceecHHHHHHHHhhCCCc-cCEEeeeeccCccceeecccccc-ccCcCce-EeeCCCe
Confidence 4444432223 78999999999999999999999999998 99999998654321110000000 0001112 25 5789
Q ss_pred eeeecHHHHHHHHHhcccCC---C----CCCChhHHHHHHHh-CCCeEeecCCcccccCC-------CCccCCeEEEecc
Q psy10879 176 GYVLSHLLVKFISENSHMLS---S----YISEDVSVGVWLSA-LDITRYHDVRFDTEFQS-------RGCNNSYLIVHKQ 240 (245)
Q Consensus 176 gyvlS~~~~~~l~~~~~~~~---~----~~~eDv~iG~~l~~-~~v~~~~~~~f~~~~~~-------~~c~~~~~~~h~~ 240 (245)
||+|||.++++|......+. . -..||+.||.|+.. +||+++|.+.||...+. .-+.+-.|++|..
T Consensus 152 G~vlSr~~~~k~~~~~~~~~~~~~~~~~~~~dD~~lG~ci~~~lgv~lt~s~~fhs~~~~l~~~~~~~l~~q~~~s~~~~ 231 (252)
T PF02434_consen 152 GYVLSRALLKKMSPWASGCKCPSTDEKIRLPDDMTLGYCIENLLGVPLTHSPLFHSHLENLQDYNPETLHRQVPISYHKF 231 (252)
T ss_dssp -EEEEHHHHHHHHHHHTT-TTS--TTTTTS-HHHHHHHHHHHTT---EEE-TT---SSS-GGG--TTTGGG-SEEE-EEE
T ss_pred eHHHhHHHHHHHhhhcccccccCCcCCCCCcccChhhhhHHhcCCcceeechhhcccCcccccCCHHHhccCCCeecCCC
Confidence 99999999999966543222 1 24899999999999 99999999999976543 2345678888877
Q ss_pred C
Q psy10879 241 N 241 (245)
Q Consensus 241 ~ 241 (245)
.
T Consensus 232 ~ 232 (252)
T PF02434_consen 232 E 232 (252)
T ss_dssp T
T ss_pred c
Confidence 3
No 8
>KOG2246|consensus
Probab=99.88 E-value=2.3e-22 Score=177.84 Aligned_cols=168 Identities=28% Similarity=0.502 Sum_probs=135.4
Q ss_pred CCCeeEEEEEeCCCCCHHHHH-HHHHHHcCCcCCCcEEEEEEe---ecCchHHHHHHHHHhhhCCCEEEcCCccccCCch
Q psy10879 17 KHSYFIVILILTSSKNYIRRK-NIRDTWIKNIEKYQVKYLFSI---GNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSL 92 (245)
Q Consensus 17 ~~~~~lli~V~S~~~~~~~R~-~IR~TW~~~~~~~~~~~~Fv~---g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l 92 (245)
..+.+|+++|+|++.+...|. .+-+||+++|+ +..|+. .+.. ..+.-|. . +..|+|+++
T Consensus 88 ~r~~~v~cwv~t~~~~~~~~~~~v~~TW~~rc~----~~~f~s~~~s~~~-----------~~f~~v~-~-~~~~g~~~~ 150 (364)
T KOG2246|consen 88 SRSGRVLCWVLTSPMRHVTRADAVKETWLKRCD----KGIFFSPTLSKDD-----------SRFPTVY-Y-NLPDGYRSL 150 (364)
T ss_pred CCCceEEEEEEecCcCceeehhhhhcccccccC----cceecCccCCCCC-----------CcCceee-c-cCCcchHHH
Confidence 567899999999999999999 99999999995 455655 3322 2222232 4 579999999
Q ss_pred hHHHHHHHHHHHhcc--ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCC
Q psy10879 93 SQKVLHSFKYIYEHF--DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLP 170 (245)
Q Consensus 93 ~~K~~~~l~w~~~~~--~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~ 170 (245)
++|+..+++++.++. +++||+|+|||||+.++||..+|..++|+ +++|+|+...... ...|
T Consensus 151 ~~ktr~~~~yv~~~~~~~~dWf~~aDDDTy~i~eNLr~~L~~yDp~-~p~YiG~~~~~~~--------------~~~y-- 213 (364)
T KOG2246|consen 151 WRKTRIAFKYVYDHILKDYDWFLKADDDTYFIMENLRYVLSKYDPE-KPVYLGYRSKSYF--------------QNGY-- 213 (364)
T ss_pred HHHHHHHHHHHHHhccCCCCeEEeccCCeEEeHHHHHHHHhhcCCC-CcEEecccccccc--------------cccc--
Confidence 999999999999776 89999999999999999999999999999 9999998654211 1111
Q ss_pred ccCCCeeeecHHHHHHHHHhc----ccCCC-C--CCChhHHHHHHHhCCCeEeec
Q psy10879 171 YALGGGYVLSHLLVKFISENS----HMLSS-Y--ISEDVSVGVWLSALDITRYHD 218 (245)
Q Consensus 171 y~~G~gyvlS~~~~~~l~~~~----~~~~~-~--~~eDv~iG~~l~~~~v~~~~~ 218 (245)
-.+|+||++|+++++.+++.. ..|+. . ..||..||+|+..+||.+.+.
T Consensus 214 ~~g~ag~~ls~aa~~~la~~l~~~~~~C~~~~~~~~eD~~i~~Cl~~~GV~~~d~ 268 (364)
T KOG2246|consen 214 SSGGAGYVLSFAALRRLAERLLNNEDKCPQRYPSYGEDRRIGRCLAEVGVPATDE 268 (364)
T ss_pred ccCCCCcceeHHHHHHHHHHHhcchhhcccccCCchhHHHHHHHHHHhCCCccCc
Confidence 357999999999998877654 23443 2 389999999999999998876
No 9
>PLN03153 hypothetical protein; Provisional
Probab=99.71 E-value=1.1e-16 Score=144.42 Aligned_cols=180 Identities=18% Similarity=0.177 Sum_probs=120.3
Q ss_pred eeEEEEEeCCCCCHHHHH-HHHHHHcCCcCCCcEE-EEEEeecCchHHHHHHHHHhhhCCCEEEcCCcccc----CC---
Q psy10879 20 YFIVILILTSSKNYIRRK-NIRDTWIKNIEKYQVK-YLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDE----YS--- 90 (245)
Q Consensus 20 ~~lli~V~S~~~~~~~R~-~IR~TW~~~~~~~~~~-~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~----y~--- 90 (245)
-.|+++|.++.+....|. .|+.+|.... ++ .+|+.....+. .+...-.-|. +. .|+ |.
T Consensus 122 ~hIvF~I~~s~~~w~~R~~yik~wW~p~~----~rg~v~ld~~~~~~------~~~~~~P~i~-is--~d~s~f~y~~~~ 188 (537)
T PLN03153 122 NHIMFGIAGSSQLWKRRKELVRLWWRPNQ----MRGHVWLEEQVSPE------EGDDSLPPIM-VS--EDTSRFRYTNPT 188 (537)
T ss_pred ccEEEEEEEchhhhhhhhhhhhhhcCccc----ceeEEEecccCCCC------CCcCCCCCEE-eC--CCcccccccCCC
Confidence 389999999999887776 8999998754 22 45554433210 0001111122 21 121 22
Q ss_pred ch--hHHHHHHHHHHHhc-c-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCC
Q psy10879 91 SL--SQKVLHSFKYIYEH-F-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCD 166 (245)
Q Consensus 91 ~l--~~K~~~~l~w~~~~-~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~ 166 (245)
+. +..+..+....... . +++||+++|||||++++||+.+|..++++ ++.|+|.......... .+
T Consensus 189 Gh~sa~rI~rmv~et~~~~~pd~kWfVf~DDDTyf~~~NLv~~Ls~YDpt-kp~YIGs~Se~~~qn~--~f--------- 256 (537)
T PLN03153 189 GHPSGLRISRIVLESFRLGLPDVRWFVLGDDDTIFNADNLVAVLSKYDPS-EMVYVGGPSESHSANS--YF--------- 256 (537)
T ss_pred CcHHHHHHHHHHHHHHHhhCCCCCEEEEecCCccccHHHHHHHHhhcCCC-CCEEeccccccccccc--cc---------
Confidence 22 22222234444444 4 89999999999999999999999999998 9999998764321100 00
Q ss_pred CCCCccCCCeeeecHHHHHHHHHhcccCC----CCCCChhHHHHHHHhCCCeEeecCCcccc
Q psy10879 167 RYLPYALGGGYVLSHLLVKFISENSHMLS----SYISEDVSVGVWLSALDITRYHDVRFDTE 224 (245)
Q Consensus 167 ~yp~y~~G~gyvlS~~~~~~l~~~~~~~~----~~~~eDv~iG~~l~~~~v~~~~~~~f~~~ 224 (245)
.|--..||+||+||+.+++.|.+....|. ....+|..||.|+..+||.+++.++||..
T Consensus 257 ~~~fA~GGAG~~LSrPLae~L~~~~d~C~~rY~~~~~gD~rL~~CL~elGV~LT~~~gfhQ~ 318 (537)
T PLN03153 257 SHNMAFGGGGIAISYPLAEALSRILDDCLDRYPKLYGSDDRLHACITELGVPLSREPGFHQW 318 (537)
T ss_pred ccccccCCceEEEcHHHHHHHHHHhhhhhhhcccCCCcHHHHHHHHHHcCCCceecCCcccc
Confidence 01013589999999999999888754332 23568999999999999999999999953
No 10
>KOG3708|consensus
Probab=99.08 E-value=1.7e-10 Score=103.43 Aligned_cols=158 Identities=13% Similarity=0.132 Sum_probs=114.4
Q ss_pred eeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHH
Q psy10879 20 YFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHS 99 (245)
Q Consensus 20 ~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~ 99 (245)
-+|+++|+|. ..---+|.+|-+...+ ++.||.+...-..+-+.. .++ .+.|.... ....+.
T Consensus 26 Erl~~aVmte---~tlA~a~NrT~ahhvp----rv~~F~~~~~i~~~~a~~-------~~v---s~~d~r~~--~~~s~v 86 (681)
T KOG3708|consen 26 ERLMAAVMTE---STLALAINRTLAHHVP----RVHLFADSSRIDNDLAQL-------TNV---SPYDLRGQ--KTHSMV 86 (681)
T ss_pred HHHHHHHHHH---HHHHHHHHHHHHhhcc----eeEEeeccccccccHhhc-------ccc---CccccCcc--ccHHHH
Confidence 3778888882 2444599999999984 788888886511110000 122 23333322 445677
Q ss_pred HHHHHhcc--ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCcc-CCCe
Q psy10879 100 FKYIYEHF--DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPYA-LGGG 176 (245)
Q Consensus 100 l~w~~~~~--~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y~-~G~g 176 (245)
++++.+++ +++||+.+-|+||||...|.+++.+++.+ +++|+|.-...+ ++ =| .|.|
T Consensus 87 l~~l~~~~~~~YDwFll~~D~tYv~a~~L~~l~~hmsin-~dlymGEe~~~g---------------s~----rC~l~~G 146 (681)
T KOG3708|consen 87 LGLLFNMVHNNYDWFLLAKDSTYVNAFVLLRLIDHMSIN-EDLYMGEEAEDG---------------SG----RCRLDTG 146 (681)
T ss_pred HHHHHHhhccccceEEEecCcceecHHHHHHHHhhcccc-cccccchhhhCc---------------cC----ccccccc
Confidence 88888888 89999999999999999999999999998 999999422211 11 16 4899
Q ss_pred eeecHHHHHHHHHhcccCCCC---CCChhHHHHHHHhC-CCeEe
Q psy10879 177 YVLSHLLVKFISENSHMLSSY---ISEDVSVGVWLSAL-DITRY 216 (245)
Q Consensus 177 yvlS~~~~~~l~~~~~~~~~~---~~eDv~iG~~l~~~-~v~~~ 216 (245)
|+||+.++.+|..+-..|.-+ .-+|+.+|+|+..+ ||..+
T Consensus 147 ~LLS~s~l~~lrnnle~C~~~~lsad~d~~lgrCi~~At~v~C~ 190 (681)
T KOG3708|consen 147 MLLSQSLLHALRNNLEGCRNDILSADPDEWLGRCIQDATGVGCK 190 (681)
T ss_pred eeecHHHHHHHHhhHHHhhcccccCCcHHHHHHHHHHhhcCCcc
Confidence 999999999999998766533 45789999999975 77644
No 11
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A.
Probab=97.62 E-value=0.0019 Score=53.21 Aligned_cols=189 Identities=11% Similarity=0.052 Sum_probs=92.0
Q ss_pred eeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCE--EEcCCccccCCch--hHH
Q psy10879 20 YFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDI--LLLNQVPDEYSSL--SQK 95 (245)
Q Consensus 20 ~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Di--i~~~d~~d~y~~l--~~K 95 (245)
|.|.|+|++.-+...-++.++.--...- .+++++++...++++..+.+.+-.+.++.. -.+.. + .+. ..|
T Consensus 1 P~v~Vvip~~~~~~~l~~~l~sl~~~~~--~~~~v~vvd~~~~~~~~~~~~~~~~~~~~~~v~vi~~---~-~~~g~~~k 74 (228)
T PF13641_consen 1 PRVSVVIPAYNEDDVLRRCLESLLAQDY--PRLEVVVVDDGSDDETAEILRALAARYPRVRVRVIRR---P-RNPGPGGK 74 (228)
T ss_dssp --EEEE--BSS-HHHHHHHHHHHTTSHH--HTEEEEEEEE-SSS-GCTTHHHHHHTTGG-GEEEEE--------HHHHHH
T ss_pred CEEEEEEEecCCHHHHHHHHHHHHcCCC--CCeEEEEEECCCChHHHHHHHHHHHHcCCCceEEeec---C-CCCCcchH
Confidence 3466777776554455555555443322 235666666444433334455445556553 22211 1 221 123
Q ss_pred HHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhc-CcCCcccEEEEecCCCe---eecCCCCc--cc--Cccc-CC
Q psy10879 96 VLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENK-FHYEKKLYWGFFDGRAR---VRRAGKWK--EK--NWFL-CD 166 (245)
Q Consensus 96 ~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~-~~~~~~~y~G~~~~~~~---~~r~~k~~--~~--~~~~-~~ 166 (245)
-.++.+..+..+.+|++.+|||+.+..+-|.+++... .+. -....|.+..... ..+-..+. .. .... ..
T Consensus 75 -~~a~n~~~~~~~~d~i~~lD~D~~~~p~~l~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (228)
T PF13641_consen 75 -ARALNEALAAARGDYILFLDDDTVLDPDWLERLLAAFADPG-VGAVGGPVFPDNDRNWLTRLQDLFFARWHLRFRSGRR 152 (228)
T ss_dssp -HHHHHHHHHH---SEEEEE-SSEEE-CHHHHHHHHHHHBSS---EEEEEEEETTCCCEEEE-TT--S-EETTTS-TT-B
T ss_pred -HHHHHHHHHhcCCCEEEEECCCcEECHHHHHHHHHHHHhCC-CCeEeeeEeecCCCCHHHHHHHHHHhhhhhhhhhhhc
Confidence 3455666677789999999999999999888888877 443 4444455432210 01101110 00 0011 11
Q ss_pred CCC-CccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 167 RYL-PYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 167 ~yp-~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
.+. .++.|++.++++++++.+-.... ....||..++.-+.+.|.+....+
T Consensus 153 ~~~~~~~~G~~~~~rr~~~~~~g~fd~---~~~~eD~~l~~r~~~~G~~~~~~~ 203 (228)
T PF13641_consen 153 ALGVAFLSGSGMLFRRSALEEVGGFDP---FILGEDFDLCLRLRAAGWRIVYAP 203 (228)
T ss_dssp ----S-B--TEEEEEHHHHHHH-S--S---SSSSHHHHHHHHHHHTT--EEEEE
T ss_pred ccceeeccCcEEEEEHHHHHHhCCCCC---CCcccHHHHHHHHHHCCCcEEEEC
Confidence 111 34679999999999999864332 445699999999999988776544
No 12
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano
Probab=97.29 E-value=0.0077 Score=54.11 Aligned_cols=192 Identities=15% Similarity=0.159 Sum_probs=106.6
Q ss_pred CCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCC--EEEcCCccccCCchhHH
Q psy10879 18 HSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDD--ILLLNQVPDEYSSLSQK 95 (245)
Q Consensus 18 ~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~D--ii~~~d~~d~y~~l~~K 95 (245)
..+.+-|+|++.-+...-.+.++ +..++. ....+++++.+.+.+...+.+++-.+.|.+ +..+.+-.. .....|
T Consensus 39 ~~p~VSViiP~~nee~~l~~~L~-Sl~~q~-Yp~~EIivvdd~s~D~t~~iv~~~~~~~p~~~i~~v~~~~~--~G~~~K 114 (373)
T TIGR03472 39 AWPPVSVLKPLHGDEPELYENLA-SFCRQD-YPGFQMLFGVQDPDDPALAVVRRLRADFPDADIDLVIDARR--HGPNRK 114 (373)
T ss_pred CCCCeEEEEECCCCChhHHHHHH-HHHhcC-CCCeEEEEEeCCCCCcHHHHHHHHHHhCCCCceEEEECCCC--CCCChH
Confidence 34556677776655544455554 444443 223678877665553222333333456665 432211111 122246
Q ss_pred HHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCc------cc-Cccc----
Q psy10879 96 VLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWK------EK-NWFL---- 164 (245)
Q Consensus 96 ~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~------~~-~~~~---- 164 (245)
.-...+ +.+..+.+|++..|+|+.+..+-|...+.....++-.+..|..... +.. +-|. .. .+++
T Consensus 115 ~~~l~~-~~~~a~ge~i~~~DaD~~~~p~~L~~lv~~~~~~~v~~V~~~~~~~--~~~-~~~~~l~~~~~~~~~~~~~~~ 190 (373)
T TIGR03472 115 VSNLIN-MLPHARHDILVIADSDISVGPDYLRQVVAPLADPDVGLVTCLYRGR--PVP-GFWSRLGAMGINHNFLPSVMV 190 (373)
T ss_pred HHHHHH-HHHhccCCEEEEECCCCCcChhHHHHHHHHhcCCCcceEeccccCC--CCC-CHHHHHHHHHhhhhhhHHHHH
Confidence 655544 4466799999999999999999988888776432122333321111 111 1010 00 0111
Q ss_pred --CCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeec
Q psy10879 165 --CDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHD 218 (245)
Q Consensus 165 --~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~ 218 (245)
...-+.+|.|+++++.|++++.+--... ......||..+|.-+.+.|.+..-.
T Consensus 191 ~~~~~~~~~~~G~~~a~RR~~l~~iGGf~~-~~~~~~ED~~l~~~i~~~G~~v~~~ 245 (373)
T TIGR03472 191 ARALGRARFCFGATMALRRATLEAIGGLAA-LAHHLADDYWLGELVRALGLRVVLA 245 (373)
T ss_pred HHhccCCccccChhhheeHHHHHHcCChHH-hcccchHHHHHHHHHHHcCCeEEec
Confidence 0011245889999999999998754432 1222359999999999988776543
No 13
>PF04646 DUF604: Protein of unknown function, DUF604; InterPro: IPR006740 This family includes a conserved region found in several uncharacterised plant proteins.
Probab=97.18 E-value=0.00063 Score=57.12 Aligned_cols=52 Identities=17% Similarity=0.265 Sum_probs=44.5
Q ss_pred cCCCeeeecHHHHHHHHHhcccC----CCCCCChhHHHHHHHhCCCeEeecCCccc
Q psy10879 172 ALGGGYVLSHLLVKFISENSHML----SSYISEDVSVGVWLSALDITRYHDVRFDT 223 (245)
Q Consensus 172 ~~G~gyvlS~~~~~~l~~~~~~~----~~~~~eDv~iG~~l~~~~v~~~~~~~f~~ 223 (245)
.||+|+.||..+++.|.+....| +.+.-.|-.|..|+..+||.++..++||.
T Consensus 11 fGGgG~~iS~pLa~~L~~~~d~C~~r~~~~~g~D~~i~~C~~~lgv~LT~e~g~hQ 66 (255)
T PF04646_consen 11 FGGGGFAISYPLAKALAKMQDDCIERYPHLYGGDQRIQACIAELGVPLTKEPGFHQ 66 (255)
T ss_pred ccCceeEEcHHHHHHHHHHHHHHHHhcCCCCCchHHHHHHHHHhCCCceecCCcee
Confidence 48999999999999999875432 44556899999999999999999999995
No 14
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment.
Probab=97.06 E-value=0.06 Score=43.40 Aligned_cols=139 Identities=18% Similarity=0.191 Sum_probs=84.6
Q ss_pred EEEEEEeecCchHHHHHHHHHhhhCC--CEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHH
Q psy10879 52 VKYLFSIGNKENVVDVKLKEEIRRYD--DILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHE 129 (245)
Q Consensus 52 ~~~~Fv~g~~~~~~~~~l~~E~~~~~--Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~ 129 (245)
+++++|...+.+...+.+++-.+.+. ++..+..-... ....|.- .+....+..+.+|++..|+|+.+..+-|...
T Consensus 31 ~eiivVdd~s~d~t~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~-~~n~g~~~a~~d~i~~~D~D~~~~~~~l~~l 107 (196)
T cd02520 31 YEILFCVQDEDDPAIPVVRKLIAKYPNVDARLLIGGEKV--GINPKVN-NLIKGYEEARYDILVISDSDISVPPDYLRRM 107 (196)
T ss_pred eEEEEEeCCCcchHHHHHHHHHHHCCCCcEEEEecCCcC--CCCHhHH-HHHHHHHhCCCCEEEEECCCceEChhHHHHH
Confidence 67777776666333444444455554 33212111111 1223432 2344455668999999999999988888877
Q ss_pred HhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHH
Q psy10879 130 LENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLS 209 (245)
Q Consensus 130 L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~ 209 (245)
+.......-.+..|. ++.|++.++.+++++++-.... ......||..++.-+.
T Consensus 108 ~~~~~~~~~~~v~~~--------------------------~~~g~~~~~r~~~~~~~ggf~~-~~~~~~eD~~l~~rl~ 160 (196)
T cd02520 108 VAPLMDPGVGLVTCL--------------------------CAFGKSMALRREVLDAIGGFEA-FADYLAEDYFLGKLIW 160 (196)
T ss_pred HHHhhCCCCCeEEee--------------------------cccCceeeeEHHHHHhccChHH-HhHHHHHHHHHHHHHH
Confidence 766432202222221 5678999999999998743321 2223469999999999
Q ss_pred hCCCeEeecCC
Q psy10879 210 ALDITRYHDVR 220 (245)
Q Consensus 210 ~~~v~~~~~~~ 220 (245)
..|.+....+.
T Consensus 161 ~~G~~i~~~~~ 171 (196)
T cd02520 161 RLGYRVVLSPY 171 (196)
T ss_pred HcCCeEEEcch
Confidence 88877765544
No 15
>PF01755 Glyco_transf_25: Glycosyltransferase family 25 (LPS biosynthesis protein); InterPro: IPR002654 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 25 GT25 from CAZY comprises enzymes with only one known activity; as a lipopolysaccharide biosynthesis protein. These enzymes catalyse the transfer of various sugars onto the growing lipopolysaccharide chain during its biosynthesis [].; GO: 0009103 lipopolysaccharide biosynthetic process
Probab=96.88 E-value=0.026 Score=45.89 Aligned_cols=160 Identities=17% Similarity=0.192 Sum_probs=80.1
Q ss_pred EEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCc---hH--HHHHHHHHhhh--CCCEEEcCCccccCCchhHHH
Q psy10879 24 ILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKE---NV--VDVKLKEEIRR--YDDILLLNQVPDEYSSLSQKV 96 (245)
Q Consensus 24 i~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~---~~--~~~~l~~E~~~--~~Dii~~~d~~d~y~~l~~K~ 96 (245)
|.|.|-+...+||+.+.+...... +.+-|+-|-.. .. .......+... ++-.+ .-+.-.-++-.
T Consensus 4 i~vInL~~~~~Rr~~~~~~~~~~~----~~~e~~~Avdg~~l~~~~~~~~~~~~~~~~~~~~~l-----t~gEiGC~lSH 74 (200)
T PF01755_consen 4 IYVINLDRSTERRERIQQQLAKLG----INFEFFDAVDGRDLSEDELFRRYDPELFKKRYGRPL-----TPGEIGCALSH 74 (200)
T ss_pred EEEEECCCCHHHHHHHHHHHHHcC----CceEEEEeecccccchHHHHHHhhhhhhhccccccC-----CcceEeehhhH
Confidence 557778888999998888777653 34555544433 11 11111111111 11111 11222234556
Q ss_pred HHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhc----CcCCcccEEEEecCCC--e----e-------ecCCCCcc
Q psy10879 97 LHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENK----FHYEKKLYWGFFDGRA--R----V-------RRAGKWKE 159 (245)
Q Consensus 97 ~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~----~~~~~~~y~G~~~~~~--~----~-------~r~~k~~~ 159 (245)
+..|+-+.+. +.++.+...||+.+..+ +...|... ... .-+..|...... . . .+..++..
T Consensus 75 ~~~w~~~v~~-~~~~~lIlEDDv~~~~~-f~~~l~~~~~~~~~~-~~l~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (200)
T PF01755_consen 75 IKAWQRIVDS-GLEYALILEDDVIFDPD-FKEFLEEILSHIPDW-DFLRLGGWKDNSYSPGDIFLSRLSTFLSRSKRYKR 151 (200)
T ss_pred HHHHHHHHHc-CCCeEEEEecccccccc-HHHHHHHHHhhcccc-cchhhccccccccccccccceeeeehhhhhhhccc
Confidence 6677666533 57899999999999864 44444332 221 233333221100 0 0 00000000
Q ss_pred cCc--c-cCC--CCCC-ccCCCeeeecHHHHHHHHHhcccCC
Q psy10879 160 KNW--F-LCD--RYLP-YALGGGYVLSHLLVKFISENSHMLS 195 (245)
Q Consensus 160 ~~~--~-~~~--~yp~-y~~G~gyvlS~~~~~~l~~~~~~~~ 195 (245)
... + ... .... .++.+||++|+..|++|++......
T Consensus 152 ~~~~~~~~~~~~~~~~~~~~t~aY~Is~~gA~kLL~~~~~~~ 193 (200)
T PF01755_consen 152 KPIPPFGSRKLIRPAKYPYGTCAYLISRKGARKLLEASKPIR 193 (200)
T ss_pred CcccccCCceEEeecCCCCcceeeeeCHHHHHHHHHhCcCCC
Confidence 000 0 000 1112 3567899999999999999985433
No 16
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=96.76 E-value=0.11 Score=39.67 Aligned_cols=90 Identities=12% Similarity=-0.006 Sum_probs=61.7
Q ss_pred hccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCccCCCeeeecHHHH
Q psy10879 105 EHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPYALGGGYVLSHLLV 184 (245)
Q Consensus 105 ~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y~~G~gyvlS~~~~ 184 (245)
+..+.+|++..|||..+..+.+..++...........+|.. +.|++.+++++++
T Consensus 71 ~~~~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~~~~ 124 (166)
T cd04186 71 REAKGDYVLLLNPDTVVEPGALLELLDAAEQDPDVGIVGPK--------------------------VSGAFLLVRREVF 124 (166)
T ss_pred hhCCCCEEEEECCCcEECccHHHHHHHHHHhCCCceEEEcc--------------------------CceeeEeeeHHHH
Confidence 33489999999999999988888888754332022222211 5788999999999
Q ss_pred HHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCCc
Q psy10879 185 KFISENSHMLSSYISEDVSVGVWLSALDITRYHDVRF 221 (245)
Q Consensus 185 ~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~f 221 (245)
+.+-........ ..||..+..-+...|.+....+..
T Consensus 125 ~~~~~~~~~~~~-~~eD~~~~~~~~~~g~~i~~~~~~ 160 (166)
T cd04186 125 EEVGGFDEDFFL-YYEDVDLCLRARLAGYRVLYVPQA 160 (166)
T ss_pred HHcCCCChhhhc-cccHHHHHHHHHHcCCeEEEccce
Confidence 876433322222 679999988777778877666543
No 17
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins.
Probab=96.66 E-value=0.053 Score=45.23 Aligned_cols=190 Identities=9% Similarity=0.015 Sum_probs=98.5
Q ss_pred CCCCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHH
Q psy10879 16 LKHSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQK 95 (245)
Q Consensus 16 c~~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K 95 (245)
....+.+-|+|++.-+...-...|+.-..+.......+++++...+.+...+.+.+..+. .+..+.. . .|. .|
T Consensus 25 ~~~~~~isVvip~~n~~~~l~~~l~si~~q~~~~~~~eiivvdd~s~d~t~~~~~~~~~~--~v~~i~~-~---~~~-g~ 97 (251)
T cd06439 25 PAYLPTVTIIIPAYNEEAVIEAKLENLLALDYPRDRLEIIVVSDGSTDGTAEIAREYADK--GVKLLRF-P---ERR-GK 97 (251)
T ss_pred CCCCCEEEEEEecCCcHHHHHHHHHHHHhCcCCCCcEEEEEEECCCCccHHHHHHHHhhC--cEEEEEc-C---CCC-Ch
Confidence 344556777777765544445566655554432223566666544443323333322222 2332321 1 111 22
Q ss_pred HHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCee--ecCC--CCcccC----cccCCC
Q psy10879 96 VLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARV--RRAG--KWKEKN----WFLCDR 167 (245)
Q Consensus 96 ~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~--~r~~--k~~~~~----~~~~~~ 167 (245)
. ..+....+..+.+|++.+|+|+.+..+-|.+++.........+..|........ .+.. .|.... ......
T Consensus 98 ~-~a~n~gi~~a~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (251)
T cd06439 98 A-AALNRALALATGEIVVFTDANALLDPDALRLLVRHFADPSVGAVSGELVIVDGGGSGSGEGLYWKYENWLKRAESRLG 176 (251)
T ss_pred H-HHHHHHHHHcCCCEEEEEccccCcCHHHHHHHHHHhcCCCccEEEeEEEecCCcccchhHHHHHHHHHHHHHHHHhcC
Confidence 2 344455555667999999999999987788877776533134555554321110 0100 011000 000011
Q ss_pred CCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 168 YLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 168 yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
....+.|+++.+.+++++ ........||..++.-+...|......+
T Consensus 177 ~~~~~~g~~~~~rr~~~~------~~~~~~~~eD~~l~~~~~~~G~~~~~~~ 222 (251)
T cd06439 177 STVGANGAIYAIRRELFR------PLPADTINDDFVLPLRIARQGYRVVYEP 222 (251)
T ss_pred CeeeecchHHHhHHHHhc------CCCcccchhHHHHHHHHHHcCCeEEecc
Confidence 122356777777777766 1122335799999988888887665544
No 18
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms.
Probab=96.56 E-value=0.39 Score=43.32 Aligned_cols=190 Identities=11% Similarity=0.046 Sum_probs=96.7
Q ss_pred CeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCC---CEEEcCCccccCCchhHH
Q psy10879 19 SYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYD---DILLLNQVPDEYSSLSQK 95 (245)
Q Consensus 19 ~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~---Dii~~~d~~d~y~~l~~K 95 (245)
.+.+-|+|++.-+...-.+.++.--.+.- ....+++++-+.+.+...+.+++-.+++. .+..+. ..+.-.+-..|
T Consensus 39 ~p~VSVIIpa~Ne~~~L~~~L~sL~~q~y-p~~~eIIVVDd~StD~T~~i~~~~~~~~~~~~~i~vi~-~~~~~~g~~Gk 116 (384)
T TIGR03469 39 WPAVVAVVPARNEADVIGECVTSLLEQDY-PGKLHVILVDDHSTDGTADIARAAARAYGRGDRLTVVS-GQPLPPGWSGK 116 (384)
T ss_pred CCCEEEEEecCCcHhHHHHHHHHHHhCCC-CCceEEEEEeCCCCCcHHHHHHHHHHhcCCCCcEEEec-CCCCCCCCcch
Confidence 44566666665444333444443333222 22356666655554332233332233343 344332 12211222234
Q ss_pred H---HHHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCc---cc-------C
Q psy10879 96 V---LHSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWK---EK-------N 161 (245)
Q Consensus 96 ~---~~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~---~~-------~ 161 (245)
. -.+++.+.+.. +.+|++..|+|+.+..+.|.+.+.........+..|..... ..+.|. .+ .
T Consensus 117 ~~A~n~g~~~A~~~~~~gd~llflDaD~~~~p~~l~~lv~~~~~~~~~~vs~~~~~~----~~~~~~~~~~~~~~~~~~~ 192 (384)
T TIGR03469 117 LWAVSQGIAAARTLAPPADYLLLTDADIAHGPDNLARLVARARAEGLDLVSLMVRLR----CESFWEKLLIPAFVFFFQK 192 (384)
T ss_pred HHHHHHHHHHHhccCCCCCEEEEECCCCCCChhHHHHHHHHHHhCCCCEEEeccccc----CCCHHHHHHHHHHHHHHHH
Confidence 3 33444444344 48999999999999998888888765433123333322110 001010 00 0
Q ss_pred ccc------CCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeE
Q psy10879 162 WFL------CDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITR 215 (245)
Q Consensus 162 ~~~------~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~ 215 (245)
.++ ......++.|++.++++++.+++--.... .....||+.++.-+++.|-+.
T Consensus 193 ~~~~~~~~~~~~~~~~~~G~~~lirr~~~~~vGGf~~~-~~~~~ED~~L~~r~~~~G~~v 251 (384)
T TIGR03469 193 LYPFRWVNDPRRRTAAAAGGCILIRREALERIGGIAAI-RGALIDDCTLAAAVKRSGGRI 251 (384)
T ss_pred hcchhhhcCCCccceeecceEEEEEHHHHHHcCCHHHH-hhCcccHHHHHHHHHHcCCcE
Confidence 000 00111245699999999999987333221 122579999999999887443
No 19
>PF13506 Glyco_transf_21: Glycosyl transferase family 21
Probab=96.47 E-value=0.007 Score=48.59 Aligned_cols=123 Identities=19% Similarity=0.175 Sum_probs=80.1
Q ss_pred HHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCc------c---cCccc
Q psy10879 94 QKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWK------E---KNWFL 164 (245)
Q Consensus 94 ~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~------~---~~~~~ 164 (245)
.|.-.+...+....++++++..|+|+.|..+-|.+.+.....+.-.+..|... .. +.+ +-|. . +..+.
T Consensus 17 ~Kv~nL~~~~~~~a~~d~~~~~DsDi~v~p~~L~~lv~~l~~p~vglVt~~~~-~~-~~~-~~~~~l~~~~~~~~~~~~~ 93 (175)
T PF13506_consen 17 PKVNNLAQGLEAGAKYDYLVISDSDIRVPPDYLRELVAPLADPGVGLVTGLPR-GV-PAR-GFWSRLEAAFFNFLPGVLQ 93 (175)
T ss_pred hHHHHHHHHHHhhCCCCEEEEECCCeeECHHHHHHHHHHHhCCCCcEEEeccc-cc-CCc-CHHHHHHHHHHhHHHHHHH
Confidence 67777777765435999999999999999999999988765421333333221 11 111 1111 0 01110
Q ss_pred CCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCC
Q psy10879 165 CDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDVR 220 (245)
Q Consensus 165 ~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~ 220 (245)
.-.-.++|.|+++++.+++++.+- ........-.||..+|..+.+.|.+..-.+.
T Consensus 94 a~~~~~~~~G~~m~~rr~~L~~~G-G~~~l~~~ladD~~l~~~~~~~G~~v~~~~~ 148 (175)
T PF13506_consen 94 ALGGAPFAWGGSMAFRREALEEIG-GFEALADYLADDYALGRRLRARGYRVVLSPY 148 (175)
T ss_pred HhcCCCceecceeeeEHHHHHHcc-cHHHHhhhhhHHHHHHHHHHHCCCeEEEcch
Confidence 112356889999999999998762 1122334568999999999999988776653
No 20
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=96.36 E-value=0.15 Score=41.50 Aligned_cols=156 Identities=10% Similarity=-0.068 Sum_probs=80.9
Q ss_pred EEEEEEeecCchHHHHHHH-HHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHH
Q psy10879 52 VKYLFSIGNKENVVDVKLK-EEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHEL 130 (245)
Q Consensus 52 ~~~~Fv~g~~~~~~~~~l~-~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L 130 (245)
++++.+-+.+.+...+.+. .....+..+..+. ... ..+. .|. .++.+..+++..+|++.+|+|..+..+-|.+++
T Consensus 29 ~eiivvdd~s~d~t~~~~~~~~~~~~~~v~~~~-~~~-~~~~-g~~-~a~n~g~~~~~~d~i~~~D~D~~~~~~~l~~l~ 104 (229)
T cd04192 29 FEVILVDDHSTDGTVQILEFAAAKPNFQLKILN-NSR-VSIS-GKK-NALTTAIKAAKGDWIVTTDADCVVPSNWLLTFV 104 (229)
T ss_pred eEEEEEcCCCCcChHHHHHHHHhCCCcceEEee-ccC-cccc-hhH-HHHHHHHHHhcCCEEEEECCCcccCHHHHHHHH
Confidence 5666665554433333333 2222233444442 222 1221 222 445666667789999999999999988888877
Q ss_pred hhcCcCCcccEEEEecCCCeeecC-CCCcc-----c----CcccCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCC
Q psy10879 131 ENKFHYEKKLYWGFFDGRARVRRA-GKWKE-----K----NWFLCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISE 200 (245)
Q Consensus 131 ~~~~~~~~~~y~G~~~~~~~~~r~-~k~~~-----~----~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~e 200 (245)
..........+.|..... .+... ..+.. . ........+..+.|+++++++++++++--.... .....|
T Consensus 105 ~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~rr~~~~~~ggf~~~-~~~~~e 182 (229)
T cd04192 105 AFIQKEQIGLVAGPVIYF-KGKSLLAKFQRLDWLSLLGLIAGSFGLGKPFMCNGANMAYRKEAFFEVGGFEGN-DHIASG 182 (229)
T ss_pred HHhhcCCCcEEeeeeeec-CCccHHHHHHHHHHHHHHHHHhhHHHhcCccccccceEEEEHHHHHHhcCCccc-cccccC
Confidence 754433144566654322 11000 00000 0 001112344456799999999999987543322 234456
Q ss_pred hhHHHHHHHhCCC
Q psy10879 201 DVSVGVWLSALDI 213 (245)
Q Consensus 201 Dv~iG~~l~~~~v 213 (245)
|..++.-+...|.
T Consensus 183 D~~~~~~~~~~g~ 195 (229)
T cd04192 183 DDELLLAKVASKY 195 (229)
T ss_pred CHHHHHHHHHhCC
Confidence 6666544444444
No 21
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=96.22 E-value=0.55 Score=38.66 Aligned_cols=162 Identities=8% Similarity=-0.029 Sum_probs=86.1
Q ss_pred CcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHH
Q psy10879 50 YQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHE 129 (245)
Q Consensus 50 ~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~ 129 (245)
...+++.+-+.+.++....+++..+++..+.++.+ .. .+. -.++....+..+.+|++.+|+|..+...-|.+.
T Consensus 30 ~~~evivvd~~s~d~~~~~~~~~~~~~~~v~~i~~-~~--~~~----~~a~N~g~~~a~~d~v~~lD~D~~~~~~~l~~~ 102 (249)
T cd02525 30 DLIEIIVVDGGSTDGTREIVQEYAAKDPRIRLIDN-PK--RIQ----SAGLNIGIRNSRGDIIIRVDAHAVYPKDYILEL 102 (249)
T ss_pred CccEEEEEeCCCCccHHHHHHHHHhcCCeEEEEeC-CC--CCc----hHHHHHHHHHhCCCEEEEECCCccCCHHHHHHH
Confidence 34566666555544444444444445444554322 11 111 134555555558999999999999988878887
Q ss_pred HhhcCcCCcccEEEEecCC-Ceeec-------CCCCccc-CcccCCCC--CCccCCCeeeecHHHHHHHHHhcccCCCCC
Q psy10879 130 LENKFHYEKKLYWGFFDGR-ARVRR-------AGKWKEK-NWFLCDRY--LPYALGGGYVLSHLLVKFISENSHMLSSYI 198 (245)
Q Consensus 130 L~~~~~~~~~~y~G~~~~~-~~~~r-------~~k~~~~-~~~~~~~y--p~y~~G~gyvlS~~~~~~l~~~~~~~~~~~ 198 (245)
+.........+..|..... ..+.. ...+... ..+..... -.++.|++.++++++++++--... ....
T Consensus 103 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--~~~~ 180 (249)
T cd02525 103 VEALKRTGADNVGGPMETIGESKFQKAIAVAQSSPLGSGGSAYRGGAVKIGYVDTVHHGAYRREVFEKVGGFDE--SLVR 180 (249)
T ss_pred HHHHhcCCCCEEecceecCCCChHHHHHHHHhhchhccCCccccccccccccccccccceEEHHHHHHhCCCCc--ccCc
Confidence 7654432133444543211 11100 0000000 00000000 114568888999999887643222 2234
Q ss_pred CChhHHHHHHHhCCCeEeecCC
Q psy10879 199 SEDVSVGVWLSALDITRYHDVR 220 (245)
Q Consensus 199 ~eDv~iG~~l~~~~v~~~~~~~ 220 (245)
.||..++.-+.+.|.+....+.
T Consensus 181 ~eD~~l~~r~~~~G~~~~~~~~ 202 (249)
T cd02525 181 NEDAELNYRLRKAGYKIWLSPD 202 (249)
T ss_pred cchhHHHHHHHHcCcEEEEcCC
Confidence 6999998777877877665543
No 22
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end.
Probab=96.13 E-value=0.18 Score=41.29 Aligned_cols=115 Identities=13% Similarity=0.005 Sum_probs=66.8
Q ss_pred HHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcc-cEEEEecC-CCeee-cCCC-Cccc-Ccc-----cC--CCC
Q psy10879 101 KYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKK-LYWGFFDG-RARVR-RAGK-WKEK-NWF-----LC--DRY 168 (245)
Q Consensus 101 ~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~-~y~G~~~~-~~~~~-r~~k-~~~~-~~~-----~~--~~y 168 (245)
....+..+.+|++.+|+|+++..+.|..++.......+. ...|.... ..... .-.. +... ..+ .. ...
T Consensus 77 n~~~~~a~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (234)
T cd06421 77 NNALAHTTGDFVAILDADHVPTPDFLRRTLGYFLDDPKVALVQTPQFFYNPDPFDWLADGAPNEQELFYGVIQPGRDRWG 156 (234)
T ss_pred HHHHHhCCCCEEEEEccccCcCccHHHHHHHHHhcCCCeEEEecceEEecCCcchhHHHHHHHHHHHHHHHHHHHHhhcC
Confidence 344445589999999999999998888888776542022 22232111 10000 0000 0000 000 00 011
Q ss_pred CCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeec
Q psy10879 169 LPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHD 218 (245)
Q Consensus 169 p~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~ 218 (245)
..++.|++.++++++++++-.-. .....||..++.-+...|.+....
T Consensus 157 ~~~~~g~~~~~r~~~~~~ig~~~---~~~~~eD~~l~~r~~~~g~~i~~~ 203 (234)
T cd06421 157 AAFCCGSGAVVRREALDEIGGFP---TDSVTEDLATSLRLHAKGWRSVYV 203 (234)
T ss_pred CceecCceeeEeHHHHHHhCCCC---ccceeccHHHHHHHHHcCceEEEe
Confidence 34577999999999998874322 234579999998888888766543
No 23
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=96.10 E-value=0.47 Score=37.85 Aligned_cols=120 Identities=15% Similarity=0.076 Sum_probs=67.2
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhc-CcCCcccEEEEecCCC-eeecCCCCcccCcccCCCCCCc-cCC
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENK-FHYEKKLYWGFFDGRA-RVRRAGKWKEKNWFLCDRYLPY-ALG 174 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~-~~~~~~~y~G~~~~~~-~~~r~~k~~~~~~~~~~~yp~y-~~G 174 (245)
.++....+..+.+|++..|+|..+..+.|...+... ..+...+.+|...... .......+..+.+ ....+... +.|
T Consensus 73 ~a~n~g~~~a~~d~i~~ld~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 151 (202)
T cd04184 73 AATNSALELATGEFVALLDHDDELAPHALYEVVKALNEHPDADLIYSDEDKIDEGGKRSEPFFKPDW-SPDLLLSQNYIG 151 (202)
T ss_pred HHHHHHHHhhcCCEEEEECCCCcCChHHHHHHHHHHHhCCCCCEEEccHHhccCCCCEeccccCCCC-CHHHhhhcCCcc
Confidence 344455555578999999999999998888888775 2221334444322100 0000000111111 00111111 335
Q ss_pred CeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCC
Q psy10879 175 GGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDVR 220 (245)
Q Consensus 175 ~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~ 220 (245)
++-++++++++++-.-.. .....||..++.-+.+.|.+..+.+.
T Consensus 152 ~~~~~~r~~~~~iggf~~--~~~~~eD~~l~~rl~~~g~~~~~~~~ 195 (202)
T cd04184 152 HLLVYRRSLVRQVGGFRE--GFEGAQDYDLVLRVSEHTDRIAHIPR 195 (202)
T ss_pred ceEeEEHHHHHHhCCCCc--CcccchhHHHHHHHHhccceEEEccH
Confidence 666789988887643222 12357999998888888888777664
No 24
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins.
Probab=95.95 E-value=0.46 Score=41.03 Aligned_cols=120 Identities=12% Similarity=0.006 Sum_probs=69.7
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCC---Ce---e----ecC-CCCccc-Ccc--
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGR---AR---V----RRA-GKWKEK-NWF-- 163 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~---~~---~----~r~-~k~~~~-~~~-- 163 (245)
.+.....+....+|++..|+|+.+...-|..++.........+..|.+... .. . .+. ..|... .+.
T Consensus 73 ~a~N~g~~~A~gd~i~fLD~D~~~~~~wL~~ll~~l~~~~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (299)
T cd02510 73 RARIAGARAATGDVLVFLDSHCEVNVGWLEPLLARIAENRKTVVCPIIDVIDADTFEYRGSSGDARGGFDWSLHFKWLPL 152 (299)
T ss_pred HHHHHHHHHccCCEEEEEeCCcccCccHHHHHHHHHHhCCCeEEEeeeccccCCCeeEecCCCceeEEecccceeccccC
Confidence 344555555679999999999999888887777765433123333332110 00 0 000 011110 000
Q ss_pred ---------c-CCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEee
Q psy10879 164 ---------L-CDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYH 217 (245)
Q Consensus 164 ---------~-~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~ 217 (245)
+ ....-+++.|+++++++++.+.+-.-.........||+-+..=+...|-+.+.
T Consensus 153 ~~~~~~~~~~~~~~~~~~~~g~~~~irr~~~~~vGgfDe~~~~~~~ED~Dl~~R~~~~G~~i~~ 216 (299)
T cd02510 153 PEEERRRESPTAPIRSPTMAGGLFAIDREWFLELGGYDEGMDIWGGENLELSFKVWQCGGSIEI 216 (299)
T ss_pred CHHHhhhcCCCCCccCccccceeeEEEHHHHHHhCCCCCcccccCchhHHHHHHHHHcCCeEEE
Confidence 0 11122456799999999999988544444444457999988777777765543
No 25
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=95.77 E-value=0.61 Score=37.37 Aligned_cols=155 Identities=14% Similarity=0.105 Sum_probs=79.8
Q ss_pred cEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHH
Q psy10879 51 QVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHEL 130 (245)
Q Consensus 51 ~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L 130 (245)
.++++++-..+++...+.+++-+.+++..+.+-. .+.-.+. ...+......++.+|++..|+|..+.++.|..++
T Consensus 27 ~~eiiVvddgS~d~t~~~~~~~~~~~~~~~~~~~-~~~~~G~----~~~~n~g~~~~~g~~v~~ld~Dd~~~~~~l~~~~ 101 (214)
T cd04196 27 NDELIISDDGSTDGTVEIIKEYIDKDPFIIILIR-NGKNLGV----ARNFESLLQAADGDYVFFCDQDDIWLPDKLERLL 101 (214)
T ss_pred CeEEEEEeCCCCCCcHHHHHHHHhcCCceEEEEe-CCCCccH----HHHHHHHHHhCCCCEEEEECCCcccChhHHHHHH
Confidence 4566666554443333344444555543332211 1111121 2233333456689999999999999888888888
Q ss_pred hh-cCcCCcccEEEEec---CCCeeecCCCCcccCc-----ccCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCCh
Q psy10879 131 EN-KFHYEKKLYWGFFD---GRARVRRAGKWKEKNW-----FLCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISED 201 (245)
Q Consensus 131 ~~-~~~~~~~~y~G~~~---~~~~~~r~~k~~~~~~-----~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eD 201 (245)
.. .......++.|... ..........+..... +.......++.|+++++.+++++++....... ...||
T Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~--~~~~D 179 (214)
T cd04196 102 KAFLKDDKPLLVYSDLELVDENGNPIGESFFEYQKIKPGTSFNNLLFQNVVTGCTMAFNRELLELALPFPDAD--VIMHD 179 (214)
T ss_pred HHHhcCCCceEEecCcEEECCCCCCcccccccccccCCccCHHHHHHhCccCCceeeEEHHHHHhhccccccc--cccch
Confidence 76 23221334444322 1111111111100000 00011123567889999999999876543322 46788
Q ss_pred hHHHHHHHhCC
Q psy10879 202 VSVGVWLSALD 212 (245)
Q Consensus 202 v~iG~~l~~~~ 212 (245)
.++...+...|
T Consensus 180 ~~~~~~~~~~~ 190 (214)
T cd04196 180 WWLALLASAFG 190 (214)
T ss_pred HHHHHHHHHcC
Confidence 88877666643
No 26
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot
Probab=95.77 E-value=0.43 Score=39.68 Aligned_cols=120 Identities=8% Similarity=0.002 Sum_probs=71.1
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcc-cEEE-EecCCCeeec-CCCCcccC---c----cc---
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKK-LYWG-FFDGRARVRR-AGKWKEKN---W----FL--- 164 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~-~y~G-~~~~~~~~~r-~~k~~~~~---~----~~--- 164 (245)
.++....++.+.+|++.+|+|+.+..+.|.+.+......... .++| .+........ -.+....+ . .+
T Consensus 74 ~a~n~g~~~a~gd~i~~~DaD~~~~~~~l~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (241)
T cd06427 74 KACNYALAFARGEYVVIYDAEDAPDPDQLKKAVAAFARLDDKLACVQAPLNYYNARENWLTRMFALEYAAWFDYLLPGLA 153 (241)
T ss_pred HHHHHHHHhcCCCEEEEEcCCCCCChHHHHHHHHHHHhcCCCEEEEeCceEeeCCCccHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666666788999999999999999999888776431022 2332 2211100000 00000000 0 00
Q ss_pred CCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCC
Q psy10879 165 CDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDVR 220 (245)
Q Consensus 165 ~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~ 220 (245)
....+..++|++.++++++++.+-.... ....||..++.-+...|.+....+.
T Consensus 154 ~~~~~~~~~g~~~~~rr~~~~~vgg~~~---~~~~eD~~l~~rl~~~G~r~~~~~~ 206 (241)
T cd06427 154 RLGLPIPLGGTSNHFRTDVLRELGGWDP---FNVTEDADLGLRLARAGYRTGVLNS 206 (241)
T ss_pred hcCCeeecCCchHHhhHHHHHHcCCCCc---ccchhhHHHHHHHHHCCceEEEecc
Confidence 0122234678889999999998754322 2357999999888888877766543
No 27
>PRK11204 N-glycosyltransferase; Provisional
Probab=95.69 E-value=0.61 Score=42.33 Aligned_cols=188 Identities=13% Similarity=0.066 Sum_probs=102.0
Q ss_pred CCCCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCc-hHHHHHHHHHhhhCCCEEEcCCccccCCchhH
Q psy10879 16 LKHSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKE-NVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQ 94 (245)
Q Consensus 16 c~~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~-~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~ 94 (245)
..+.+.+-|+|++--+...-++.++.-=.... ...+++ ++..++ ++..+.+++..+++..+..+. .. .|. .
T Consensus 50 ~~~~p~vsViIp~yne~~~i~~~l~sl~~q~y--p~~eii-VvdD~s~d~t~~~l~~~~~~~~~v~~i~-~~---~n~-G 121 (420)
T PRK11204 50 LKEYPGVSILVPCYNEGENVEETISHLLALRY--PNYEVI-AINDGSSDNTGEILDRLAAQIPRLRVIH-LA---ENQ-G 121 (420)
T ss_pred cCCCCCEEEEEecCCCHHHHHHHHHHHHhCCC--CCeEEE-EEECCCCccHHHHHHHHHHhCCcEEEEE-cC---CCC-C
Confidence 34456777878776544333444433222221 234444 444443 444555555566666665432 11 221 2
Q ss_pred HHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcC-CcccEEEEecCCCeeecCCCCccc---Cccc------
Q psy10879 95 KVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHY-EKKLYWGFFDGRARVRRAGKWKEK---NWFL------ 164 (245)
Q Consensus 95 K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~-~~~~y~G~~~~~~~~~r~~k~~~~---~~~~------ 164 (245)
| -.++....+.++.+|++..|+|+.+..+.|.+.++....+ +-....|.. .......+... ..|.
T Consensus 122 k-a~aln~g~~~a~~d~i~~lDaD~~~~~d~L~~l~~~~~~~~~v~~v~g~~----~~~~~~~~~~~~~~~~~~~~~~~~ 196 (420)
T PRK11204 122 K-ANALNTGAAAARSEYLVCIDGDALLDPDAAAYMVEHFLHNPRVGAVTGNP----RIRNRSTLLGRIQVGEFSSIIGLI 196 (420)
T ss_pred H-HHHHHHHHHHcCCCEEEEECCCCCCChhHHHHHHHHHHhCCCeEEEECCc----eeccchhHHHHHHHHHHHHhhhHH
Confidence 3 3445566666789999999999999999888888776322 022222321 11110000000 0000
Q ss_pred ---CCCC--CCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 165 ---CDRY--LPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 165 ---~~~y--p~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
...+ ....+|++.++++++++++-... +....||+.++.-+.+.|.+....+
T Consensus 197 ~~~~~~~~~~~~~~G~~~~~rr~~l~~vgg~~---~~~~~ED~~l~~rl~~~G~~i~~~p 253 (420)
T PRK11204 197 KRAQRVYGRVFTVSGVITAFRKSALHEVGYWS---TDMITEDIDISWKLQLRGWDIRYEP 253 (420)
T ss_pred HHHHHHhCCceEecceeeeeeHHHHHHhCCCC---CCcccchHHHHHHHHHcCCeEEecc
Confidence 0011 11346888999999998763222 2234699999998888887766544
No 28
>cd06532 Glyco_transf_25 Glycosyltransferase family 25 [lipooligosaccharide (LOS) biosynthesis protein] is a family of glycosyltransferases involved in LOS biosynthesis. The members include the beta(1,4) galactosyltransferases: Lgt2 of Moraxella catarrhalis, LgtB and LgtE of Neisseria gonorrhoeae and Lic2A of Haemophilus influenzae. M. catarrhalis Lgt2 catalyzes the addition of galactose (Gal) to the growing chain of LOS on the cell surface. N. gonorrhoeae LgtB and LgtE link Gal-beta(1,4) to GlcNAc (N-acetylglucosamine) and Glc (glucose), respectively. The genes encoding LgtB and LgtE are two genes of a five gene locus involved in the synthesis of gonococcal LOS. LgtE is believed to perform the first step in LOS biosynthesis.
Probab=95.40 E-value=0.22 Score=37.71 Aligned_cols=112 Identities=15% Similarity=0.162 Sum_probs=67.9
Q ss_pred EEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHH-HHHHHHHh-----hhCCCEEEcCCccccCCchhHHHH
Q psy10879 24 ILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVV-DVKLKEEI-----RRYDDILLLNQVPDEYSSLSQKVL 97 (245)
Q Consensus 24 i~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~-~~~l~~E~-----~~~~Dii~~~d~~d~y~~l~~K~~ 97 (245)
+.|.+-+...+||+.+++...... +.+-|+-|-...+. ...+.... ..++-- +..+.-.-.+..+
T Consensus 2 i~vInL~~~~~Rr~~~~~~~~~~~----~~~~~~~Avd~~~~~~~~~~~~~~~~~~~~~~~~-----l~~gEiGC~lSH~ 72 (128)
T cd06532 2 IFVINLDRSTDRRERMEAQLAALG----LDFEFFDAVDGKDLSEEELAALYDALFLPRYGRP-----LTPGEIGCFLSHY 72 (128)
T ss_pred EEEEECCCCHHHHHHHHHHHHHcC----CCeEEEeccccccCCHHHHHHHhHHHhhhhcCCC-----CChhhHHHHHHHH
Confidence 456777888899999888655543 34556655443111 11111111 112222 2222233345556
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCccCCCee
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPYALGGGY 177 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y~~G~gy 177 (245)
..|+-+.+. +.++.+..+||+.+..+ +..||
T Consensus 73 ~~w~~~~~~-~~~~alIlEDDv~~~~~------------------------------------------------~~~~Y 103 (128)
T cd06532 73 KLWQKIVES-NLEYALILEDDAILDPD------------------------------------------------GTAGY 103 (128)
T ss_pred HHHHHHHHc-CCCeEEEEccCcEECCC------------------------------------------------CceEE
Confidence 666655432 56789999999999887 34689
Q ss_pred eecHHHHHHHHHhccc
Q psy10879 178 VLSHLLVKFISENSHM 193 (245)
Q Consensus 178 vlS~~~~~~l~~~~~~ 193 (245)
++|+.++++|+.....
T Consensus 104 ~vs~~~A~~ll~~~~~ 119 (128)
T cd06532 104 LVSRKGAKKLLAALEP 119 (128)
T ss_pred EeCHHHHHHHHHhCCC
Confidence 9999999999998754
No 29
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm
Probab=95.30 E-value=1.1 Score=35.10 Aligned_cols=100 Identities=12% Similarity=-0.080 Sum_probs=62.6
Q ss_pred HHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCccCCCeee
Q psy10879 99 SFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPYALGGGYV 178 (245)
Q Consensus 99 ~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y~~G~gyv 178 (245)
.+....+.++.+|++..|+|..+..+-|...++...+ .....|.......... .....|+++.
T Consensus 70 ~~n~g~~~a~g~~i~~lD~D~~~~~~~l~~~~~~~~~--~~~v~g~~~~~~~~~~---------------~~~~~~~~~~ 132 (182)
T cd06420 70 IRNKAIAAAKGDYLIFIDGDCIPHPDFIADHIELAEP--GVFLSGSRVLLNEKLT---------------ERGIRGCNMS 132 (182)
T ss_pred HHHHHHHHhcCCEEEEEcCCcccCHHHHHHHHHHhCC--CcEEecceeecccccc---------------eeEeccceEE
Confidence 3444445568899999999999988888887777643 3333343221110000 0234677788
Q ss_pred ecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeE
Q psy10879 179 LSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITR 215 (245)
Q Consensus 179 lS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~ 215 (245)
+.+..+..+-...........||..++.=+.+.|+..
T Consensus 133 ~~r~~~~~~ggf~~~~~~~~~eD~~l~~r~~~~g~~~ 169 (182)
T cd06420 133 FWKKDLLAVNGFDEEFTGWGGEDSELVARLLNSGIKF 169 (182)
T ss_pred EEHHHHHHhCCCCcccccCCcchHHHHHHHHHcCCcE
Confidence 8888877544333333334589999998888888443
No 30
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase
Probab=95.18 E-value=1.4 Score=36.74 Aligned_cols=160 Identities=9% Similarity=-0.050 Sum_probs=84.5
Q ss_pred cEEEEEEeecCchHHHHHHHHHhhhCCC--EEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHH
Q psy10879 51 QVKYLFSIGNKENVVDVKLKEEIRRYDD--ILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIH 128 (245)
Q Consensus 51 ~~~~~Fv~g~~~~~~~~~l~~E~~~~~D--ii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~ 128 (245)
.++++++-+.+.+...+.+.+-.++|++ +..+. .. .|.- + -.++.......+.+|++.+|.|..+.++.|..
T Consensus 40 ~~eiivvDdgS~D~t~~i~~~~~~~~~~~~v~~~~--~~--~n~G-~-~~a~n~g~~~a~g~~i~~lD~D~~~~~~~l~~ 113 (243)
T PLN02726 40 DFEIIVVDDGSPDGTQDVVKQLQKVYGEDRILLRP--RP--GKLG-L-GTAYIHGLKHASGDFVVIMDADLSHHPKYLPS 113 (243)
T ss_pred CeEEEEEeCCCCCCHHHHHHHHHHhcCCCcEEEEe--cC--CCCC-H-HHHHHHHHHHcCCCEEEEEcCCCCCCHHHHHH
Confidence 4566666555543333334443445543 33221 11 1111 1 23444444555789999999999999988888
Q ss_pred HHhhcCcCCcccEEEEecCCCeeecCCCCc---cc-------CcccCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCC
Q psy10879 129 ELENKFHYEKKLYWGFFDGRARVRRAGKWK---EK-------NWFLCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYI 198 (245)
Q Consensus 129 ~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~---~~-------~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~ 198 (245)
++.........+..|.......-.....|. .+ ........ +.+.|++.++++++++.+....... ..
T Consensus 114 l~~~~~~~~~~~v~g~r~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~-~d~~g~~~~~rr~~~~~i~~~~~~~--~~ 190 (243)
T PLN02726 114 FIKKQRETGADIVTGTRYVKGGGVHGWDLRRKLTSRGANVLAQTLLWPGV-SDLTGSFRLYKRSALEDLVSSVVSK--GY 190 (243)
T ss_pred HHHHHHhcCCcEEEEccccCCCCcCCccHHHHHHHHHHHHHHHHHhCCCC-CcCCCcccceeHHHHHHHHhhccCC--Cc
Confidence 887653321567777643211000000010 00 00001111 2356788899999999997543322 23
Q ss_pred CChhHHHHHHHhCCCeEeecC
Q psy10879 199 SEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 199 ~eDv~iG~~l~~~~v~~~~~~ 219 (245)
..|.-+..-+...|.+.++.|
T Consensus 191 ~~~~el~~~~~~~g~~i~~vp 211 (243)
T PLN02726 191 VFQMEIIVRASRKGYRIEEVP 211 (243)
T ss_pred EEehHHHHHHHHcCCcEEEeC
Confidence 346667666667777666554
No 31
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=95.07 E-value=1.4 Score=35.22 Aligned_cols=96 Identities=13% Similarity=0.076 Sum_probs=60.9
Q ss_pred HHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCccCCC
Q psy10879 96 VLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPYALGG 175 (245)
Q Consensus 96 ~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y~~G~ 175 (245)
.-.+++++. ..+.+|++..|+|..+..+.|..++.....+.-.++.|..... ++ .++
T Consensus 68 ~n~~~~~a~-~~~~d~v~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~-----~~-----------------~~~ 124 (202)
T cd04185 68 FYEGVRRAY-ELGYDWIWLMDDDAIPDPDALEKLLAYADKDNPQFLAPLVLDP-----DG-----------------SFV 124 (202)
T ss_pred HHHHHHHHh-ccCCCEEEEeCCCCCcChHHHHHHHHHHhcCCceEecceeEcC-----CC-----------------ceE
Confidence 344556654 3488999999999999988777777665422133333332211 00 345
Q ss_pred eeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeE
Q psy10879 176 GYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITR 215 (245)
Q Consensus 176 gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~ 215 (245)
|.++++++++.+--.... -....||+.++.-+...|...
T Consensus 125 ~~~~~~~~~~~~g~~~~~-~~~~~eD~~~~~r~~~~G~~i 163 (202)
T cd04185 125 GVLISRRVVEKIGLPDKE-FFIWGDDTEYTLRASKAGPGI 163 (202)
T ss_pred EEEEeHHHHHHhCCCChh-hhccchHHHHHHHHHHcCCcE
Confidence 688999999877322111 123569999998888888665
No 32
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602).
Probab=94.90 E-value=0.46 Score=43.71 Aligned_cols=190 Identities=9% Similarity=0.010 Sum_probs=95.3
Q ss_pred CCCCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHH
Q psy10879 16 LKHSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQK 95 (245)
Q Consensus 16 c~~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K 95 (245)
....+.+-|+|++--+...-++.|+.--.+.-+...++++.+-+.+.++..+.+++-.+++..+.... .+.-. .|
T Consensus 45 ~~~~P~vsVIIP~yNe~~~l~~~l~sl~~q~yp~~~~eIiVVDd~StD~T~~il~~~~~~~~~v~v~~--~~~~~---Gk 119 (439)
T TIGR03111 45 IGKLPDITIIIPVYNSEDTLFNCIESIYNQTYPIELIDIILANNQSTDDSFQVFCRAQNEFPGLSLRY--MNSDQ---GK 119 (439)
T ss_pred cCCCCCEEEEEEeCCChHHHHHHHHHHHhcCCCCCCeEEEEEECCCChhHHHHHHHHHHhCCCeEEEE--eCCCC---CH
Confidence 34445666666665444444555555433332222344444433333333333343345555543211 11111 23
Q ss_pred HHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcC-CcccEEEEecCCCee-ecCCCC---cccC--c--cc--
Q psy10879 96 VLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHY-EKKLYWGFFDGRARV-RRAGKW---KEKN--W--FL-- 164 (245)
Q Consensus 96 ~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~-~~~~y~G~~~~~~~~-~r~~k~---~~~~--~--~~-- 164 (245)
-.++.+..++++.+|++.+|.|..+..+.|.+.+.....+ .-....|.+...... .....+ ...+ . |.
T Consensus 120 -a~AlN~gl~~s~g~~v~~~DaD~~~~~d~L~~l~~~f~~~~~v~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 198 (439)
T TIGR03111 120 -AKALNAAIYNSIGKYIIHIDSDGKLHKDAIKNMVTRFENNPDIHAMTGVILTDKELIEKTKGRFLKLIRRCEYFEYAQA 198 (439)
T ss_pred -HHHHHHHHHHccCCEEEEECCCCCcChHHHHHHHHHHHhCCCeEEEEeEEecCchhhhhhcchhhhHhHHhHHHHHHHH
Confidence 3455566666788999999999999999998888776432 023334554321110 000000 0000 0 00
Q ss_pred --------C-CCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHH-hCCCe
Q psy10879 165 --------C-DRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLS-ALDIT 214 (245)
Q Consensus 165 --------~-~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~-~~~v~ 214 (245)
. ..-+..++|++.++.+++++++--.. +....||..++.=+. ..|-+
T Consensus 199 ~l~~r~~~s~~~~~~~~sGa~~~~Rr~~l~~vggf~---~~~i~ED~~l~~rl~~~~g~k 255 (439)
T TIGR03111 199 FLAGRNFESQVNSLFTLSGAFSAFRRETILKTQLYN---SETVGEDTDMTFQIRELLDGK 255 (439)
T ss_pred HHhhhHHHHhcCCeEEEccHHHhhhHHHHHHhCCCC---CCCcCccHHHHHHHHHhcCCe
Confidence 0 01112367888889999887753222 222479999986554 33443
No 33
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=94.88 E-value=1.2 Score=33.60 Aligned_cols=155 Identities=14% Similarity=0.106 Sum_probs=77.5
Q ss_pred EEEEEeCCCCCHHHHH-HHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHH
Q psy10879 22 IVILILTSSKNYIRRK-NIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSF 100 (245)
Q Consensus 22 lli~V~S~~~~~~~R~-~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l 100 (245)
|+|.+.-.++...+-- .+++. . ....+++++-..+.++....+++-.+....+..+. .. .|. -.-.++
T Consensus 2 vvip~~n~~~~l~~~l~sl~~q--~---~~~~eiivvdd~s~d~~~~~~~~~~~~~~~i~~i~-~~---~n~--g~~~~~ 70 (169)
T PF00535_consen 2 VVIPTYNEAEYLERTLESLLKQ--T---DPDFEIIVVDDGSTDETEEILEEYAESDPNIRYIR-NP---ENL--GFSAAR 70 (169)
T ss_dssp EEEEESS-TTTHHHHHHHHHHH--S---GCEEEEEEEECS-SSSHHHHHHHHHCCSTTEEEEE-HC---CCS--HHHHHH
T ss_pred EEEEeeCCHHHHHHHHHHHhhc--c---CCCEEEEEecccccccccccccccccccccccccc-cc---ccc--cccccc
Confidence 3444444545555443 56666 1 22345655555443444444444443244444331 12 232 234445
Q ss_pred HHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecC---CCe-eecCCC---Cccc-C---cccCCCCC
Q psy10879 101 KYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDG---RAR-VRRAGK---WKEK-N---WFLCDRYL 169 (245)
Q Consensus 101 ~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~---~~~-~~r~~k---~~~~-~---~~~~~~yp 169 (245)
....++.+.+|++.+|||.++..+.|..+++..........+|.... ... ..+... +... . ......--
T Consensus 71 n~~~~~a~~~~i~~ld~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (169)
T PF00535_consen 71 NRGIKHAKGEYILFLDDDDIISPDWLEELVEALEKNPPDVVIGSVIYIDDDNRYPDRRLRFSFWNRFERKIFNNIRFWKI 150 (169)
T ss_dssp HHHHHH--SSEEEEEETTEEE-TTHHHHHHHHHHHCTTEEEEEEEEEEECTTETEECCCTSEEEECCHCHHHHTTHSTTS
T ss_pred cccccccceeEEEEeCCCceEcHHHHHHHHHHHHhCCCcEEEEEEEEecCCccccccccchhhhhhhhhHHHHhhhcCCc
Confidence 55556666779999999999999877777766554214566665432 111 111110 1110 0 01111222
Q ss_pred CccCCCeeeecHHHHHHH
Q psy10879 170 PYALGGGYVLSHLLVKFI 187 (245)
Q Consensus 170 ~y~~G~gyvlS~~~~~~l 187 (245)
+++.|++.++++++++++
T Consensus 151 ~~~~~~~~~~rr~~~~~~ 168 (169)
T PF00535_consen 151 SFFIGSCALFRRSVFEEI 168 (169)
T ss_dssp SEESSSCEEEEEHHHHHC
T ss_pred ccccccEEEEEHHHHHhh
Confidence 467889999999998875
No 34
>PRK14583 hmsR N-glycosyltransferase; Provisional
Probab=94.83 E-value=0.87 Score=41.92 Aligned_cols=188 Identities=11% Similarity=0.070 Sum_probs=102.1
Q ss_pred CCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHH
Q psy10879 18 HSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVL 97 (245)
Q Consensus 18 ~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~ 97 (245)
..+.+-|+|++.-+...-++.|+.- .++. ..+.+++++...+.++..+.+.+..+++..+..+. .. .|. .| -
T Consensus 73 ~~p~vsViIP~yNE~~~i~~~l~sl-l~q~-yp~~eIivVdDgs~D~t~~~~~~~~~~~~~v~vv~-~~---~n~-Gk-a 144 (444)
T PRK14583 73 GHPLVSILVPCFNEGLNARETIHAA-LAQT-YTNIEVIAINDGSSDDTAQVLDALLAEDPRLRVIH-LA---HNQ-GK-A 144 (444)
T ss_pred CCCcEEEEEEeCCCHHHHHHHHHHH-HcCC-CCCeEEEEEECCCCccHHHHHHHHHHhCCCEEEEE-eC---CCC-CH-H
Confidence 3456777777775554444444432 2222 22456555544444444444555556666554332 11 121 13 2
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcC-CcccEEEEecCCCeeecC---CCCcccCccc--------C
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHY-EKKLYWGFFDGRARVRRA---GKWKEKNWFL--------C 165 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~-~~~~y~G~~~~~~~~~r~---~k~~~~~~~~--------~ 165 (245)
.+++.....++.+|++..|.|+.+..+.|...+.....+ +.....|..... .++ ++....++.. .
T Consensus 145 ~AlN~gl~~a~~d~iv~lDAD~~~~~d~L~~lv~~~~~~~~~g~v~g~~~~~---~~~~~~~~~~~~e~~~~~~~~~~~~ 221 (444)
T PRK14583 145 IALRMGAAAARSEYLVCIDGDALLDKNAVPYLVAPLIANPRTGAVTGNPRIR---TRSTLIGRVQVGEFSSIIGLIKRTQ 221 (444)
T ss_pred HHHHHHHHhCCCCEEEEECCCCCcCHHHHHHHHHHHHhCCCeEEEEccceec---CCCcchhhHHHHHHHHHHHHHHHHH
Confidence 455555566689999999999999999888888765322 022222321110 011 1100000000 0
Q ss_pred CCC-CC-ccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 166 DRY-LP-YALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 166 ~~y-p~-y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
..+ +. .++|++..+.+++++++--.. +....||..+|.-+...|.+....+
T Consensus 222 ~~~g~~~~~sG~~~~~rr~al~~vGg~~---~~~i~ED~dl~~rl~~~G~~i~~~p 274 (444)
T PRK14583 222 RVYGQVFTVSGVVAAFRRRALADVGYWS---PDMITEDIDISWKLQLKHWSVFFEP 274 (444)
T ss_pred HHhCCceEecCceeEEEHHHHHHcCCCC---CCcccccHHHHHHHHHcCCeEEEee
Confidence 111 12 356888999999998874322 2234699999999998887765544
No 35
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate.
Probab=94.38 E-value=1.4 Score=36.07 Aligned_cols=157 Identities=10% Similarity=-0.026 Sum_probs=82.5
Q ss_pred EEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHh
Q psy10879 52 VKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELE 131 (245)
Q Consensus 52 ~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~ 131 (245)
.+++++...+.+...+.+ ++...+..+.++. .+. . .|.- ++....+..+.+|++.+|+|+.+..+.|...+.
T Consensus 29 ~eiivvdd~s~d~~~~~l-~~~~~~~~~~v~~--~~~-~---g~~~-a~n~g~~~a~~d~v~~lD~D~~~~~~~l~~l~~ 100 (235)
T cd06434 29 LEIIVVTDGDDEPYLSIL-SQTVKYGGIFVIT--VPH-P---GKRR-ALAEGIRHVTTDIVVLLDSDTVWPPNALPEMLK 100 (235)
T ss_pred CEEEEEeCCCChHHHHHH-HhhccCCcEEEEe--cCC-C---ChHH-HHHHHHHHhCCCEEEEECCCceeChhHHHHHHH
Confidence 466666555543333333 3455666666432 111 2 2322 233333444899999999999999999988888
Q ss_pred hcCcCCcccEEEEecCCCeeecCCCCcc------c---C-ccc-CCCC-C-CccCCCeeeecHHHHHHHHHhcc------
Q psy10879 132 NKFHYEKKLYWGFFDGRARVRRAGKWKE------K---N-WFL-CDRY-L-PYALGGGYVLSHLLVKFISENSH------ 192 (245)
Q Consensus 132 ~~~~~~~~~y~G~~~~~~~~~r~~k~~~------~---~-~~~-~~~y-p-~y~~G~gyvlS~~~~~~l~~~~~------ 192 (245)
....+.-....|....... . .+.|.. . . .+. ...+ . +.+.|++.++.+++++.+.-...
T Consensus 101 ~~~~~~v~~v~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~rr~~l~~~~~~~~~~~~~~ 178 (235)
T cd06434 101 PFEDPKVGGVGTNQRILRP-R-DSKWSFLAAEYLERRNEEIRAAMSYDGGVPCLSGRTAAYRTEILKDFLFLEEFTNETF 178 (235)
T ss_pred hccCCCEeEEcCceEeecC-c-ccHHHHHHHHHHHHHHHHHHHHHhhCCCEEEccCcHHHHHHHHHhhhhhHHHhhhhhh
Confidence 8753212222232211100 0 011100 0 0 000 0011 1 23567888888888876543221
Q ss_pred -cCCCCCCChhHHHHHHHhCCCeEeec
Q psy10879 193 -MLSSYISEDVSVGVWLSALDITRYHD 218 (245)
Q Consensus 193 -~~~~~~~eDv~iG~~l~~~~v~~~~~ 218 (245)
..+....||..++.-+.+.|....-.
T Consensus 179 ~~~~~~~~eD~~l~~~~~~~g~~~~~~ 205 (235)
T cd06434 179 MGRRLNAGDDRFLTRYVLSHGYKTVYQ 205 (235)
T ss_pred cCCCCCcCchHHHHHHHHHCCCeEEEe
Confidence 12345679999988888888765543
No 36
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=93.89 E-value=2.5 Score=33.15 Aligned_cols=120 Identities=8% Similarity=-0.129 Sum_probs=69.9
Q ss_pred HHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhc--CcCCcccEEEEecCCC-eeecCCCCcccCccc-CCCCCCcc
Q psy10879 97 LHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENK--FHYEKKLYWGFFDGRA-RVRRAGKWKEKNWFL-CDRYLPYA 172 (245)
Q Consensus 97 ~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~--~~~~~~~y~G~~~~~~-~~~r~~k~~~~~~~~-~~~yp~y~ 172 (245)
-.++..+.+.++.+|++..|+|..+..+.+...+... .+. ..+..|...... .......+....... .-....+.
T Consensus 64 ~~a~n~~~~~a~~~~v~~ld~D~~~~~~~~~~~~~~~~~~~~-~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (202)
T cd06433 64 YDAMNKGIALATGDIIGFLNSDDTLLPGALLAVVAAFAEHPE-VDVVYGDVLLVDENGRVIGRRRPPPFLDKFLLYGMPI 142 (202)
T ss_pred HHHHHHHHHHcCCCEEEEeCCCcccCchHHHHHHHHHHhCCC-ccEEEeeeEEEcCCCCcccCCCCcchhhhHHhhcCcc
Confidence 3445566667788999999999999999999887443 222 456666543210 000011110000011 11122356
Q ss_pred CCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 173 LGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 173 ~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
.|++.++++++++.+-..... ....+|..+..-+...|......+
T Consensus 143 ~~~~~~~~~~~~~~~~~f~~~--~~~~~D~~~~~r~~~~g~~~~~~~ 187 (202)
T cd06433 143 CHQATFFRRSLFEKYGGFDES--YRIAADYDLLLRLLLAGKIFKYLP 187 (202)
T ss_pred cCcceEEEHHHHHHhCCCchh--hCchhhHHHHHHHHHcCCceEecc
Confidence 778889999999887532211 224578888777777776665444
No 37
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=93.83 E-value=3 Score=37.75 Aligned_cols=191 Identities=9% Similarity=0.006 Sum_probs=116.2
Q ss_pred eeEEEEEeCCCCCH-HHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCC-CEEEcCCccccCCchhHHHH
Q psy10879 20 YFIVILILTSSKNY-IRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYD-DILLLNQVPDEYSSLSQKVL 97 (245)
Q Consensus 20 ~~lli~V~S~~~~~-~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~-Dii~~~d~~d~y~~l~~K~~ 97 (245)
+.+-++|++--+.. -..+.++..=...- .+.+++.+.+.+.++..+.+.+-.++++ ++... .. ......-.
T Consensus 54 p~vsviiP~ynE~~~~~~~~l~s~~~~dy--p~~evivv~d~~~d~~~~~~~~~~~~~~~~~~~~--~~---~~~~~gK~ 126 (439)
T COG1215 54 PKVSVIIPAYNEEPEVLEETLESLLSQDY--PRYEVIVVDDGSTDETYEILEELGAEYGPNFRVI--YP---EKKNGGKA 126 (439)
T ss_pred CceEEEEecCCCchhhHHHHHHHHHhCCC--CCceEEEECCCCChhHHHHHHHHHhhcCcceEEE--ec---cccCccch
Confidence 77778888876666 33334444444432 2256666666555555555666666775 44422 11 11112335
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCccc-EEEEecCCCeee-cC--CCC----cccC-----ccc
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKL-YWGFFDGRARVR-RA--GKW----KEKN-----WFL 164 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~-y~G~~~~~~~~~-r~--~k~----~~~~-----~~~ 164 (245)
.++.+.....+.++++..|-|+.+..+.|.+.+.....+ ... +.|.......+. .+ ++. +... ...
T Consensus 127 ~al~~~l~~~~~d~V~~~DaD~~~~~d~l~~~~~~f~~~-~~~~v~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 205 (439)
T COG1215 127 GALNNGLKRAKGDVVVILDADTVPEPDALRELVSPFEDP-PVGAVVGTPRIRNRPDPSNLLGRIQAIEYLSAFYFRLRAA 205 (439)
T ss_pred HHHHHHHhhcCCCEEEEEcCCCCCChhHHHHHHhhhcCC-CeeEEeCCceeeecCChhhhcchhcchhhhhhHHHhhhhh
Confidence 666777767789999999999999999999999887654 233 444332111110 00 000 0000 001
Q ss_pred -CCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCCc
Q psy10879 165 -CDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDVRF 221 (245)
Q Consensus 165 -~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~f 221 (245)
......+|.|++.++.+++++.+- ...+..--||..++..+...|.+....+.-
T Consensus 206 ~~~g~~~~~~G~~~~~rr~aL~~~g---~~~~~~i~ED~~lt~~l~~~G~~~~~~~~~ 260 (439)
T COG1215 206 SKGGLISFLSGSSSAFRRSALEEVG---GWLEDTITEDADLTLRLHLRGYRVVYVPEA 260 (439)
T ss_pred hhcCCeEEEcceeeeEEHHHHHHhC---CCCCCceeccHHHHHHHHHCCCeEEEeecc
Confidence 122466799999999999999887 223444579999999999888877765544
No 38
>cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane.
Probab=93.11 E-value=2.4 Score=36.06 Aligned_cols=189 Identities=9% Similarity=0.058 Sum_probs=101.7
Q ss_pred EEEeCCCCCHH-HHHHHHHHHcC---CcCCCcEEEEEEeecCc-hHHHHH-------HHHHhhhCCCEEEcCCccccCCc
Q psy10879 24 ILILTSSKNYI-RRKNIRDTWIK---NIEKYQVKYLFSIGNKE-NVVDVK-------LKEEIRRYDDILLLNQVPDEYSS 91 (245)
Q Consensus 24 i~V~S~~~~~~-~R~~IR~TW~~---~~~~~~~~~~Fv~g~~~-~~~~~~-------l~~E~~~~~Dii~~~d~~d~y~~ 91 (245)
|+|++.-...+ -.+.++..+.. +....++++ |++..+. ++.... +.+|....-.+.... ...-.+
T Consensus 3 IliP~~ne~~~~l~~~l~~~~~~~~~~~~~~~~eI-~vldD~~d~~~~~~~~~~~~~l~~~~~~~~~v~~~~--r~~~~g 79 (254)
T cd04191 3 IVMPVYNEDPARVFAGLRAMYESLAKTGLADHFDF-FILSDTRDPDIWLAEEAAWLDLCEELGAQGRIYYRR--RRENTG 79 (254)
T ss_pred EEEeCCCCCHHHHHHHHHHHHHHHHhcCCcCceEE-EEECCCCChHHHHHHHHHHHHHHHHhCCCCcEEEEE--cCCCCC
Confidence 66777777766 56677776642 111123555 8887766 432211 222222223344221 222222
Q ss_pred hhHHHHHHHHHHHhc-cceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeee-cCCCCcc----c-Ccc-
Q psy10879 92 LSQKVLHSFKYIYEH-FDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVR-RAGKWKE----K-NWF- 163 (245)
Q Consensus 92 l~~K~~~~l~w~~~~-~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~-r~~k~~~----~-~~~- 163 (245)
+|.-..-..+... .+++|++..|.|+.+..+-|.+.+.....+ .. +|-+....... +++.+.. . ..+
T Consensus 80 --~Kag~l~~~~~~~~~~~~~i~~~DaD~~~~p~~l~~~v~~~~~~-~~--vg~vq~~~~~~n~~~~~~~~~~~~~~~~~ 154 (254)
T cd04191 80 --RKAGNIADFCRRWGSRYDYMVVLDADSLMSGDTIVRLVRRMEAN-PR--AGIIQTAPKLIGAETLFARLQQFANRLYG 154 (254)
T ss_pred --ccHHHHHHHHHHhCCCCCEEEEEeCCCCCCHHHHHHHHHHHHhC-CC--EEEEeCCceeECCCCHHHHHHHHHHHHHH
Confidence 4555444444333 378999999999999999999999876422 11 22222111110 1111100 0 000
Q ss_pred c----C-C---CCCCccCCCeeeecHHHHHHHHHhc-----ccC-CCCCCChhHHHHHHHhCCCeEeecCC
Q psy10879 164 L----C-D---RYLPYALGGGYVLSHLLVKFISENS-----HML-SSYISEDVSVGVWLSALDITRYHDVR 220 (245)
Q Consensus 164 ~----~-~---~yp~y~~G~gyvlS~~~~~~l~~~~-----~~~-~~~~~eDv~iG~~l~~~~v~~~~~~~ 220 (245)
+ . . .--.+|.|...++.+++++.+.... ... ...-.||..+|..+...|-+.+-.+.
T Consensus 155 ~~~~~~~~~~~~~~~~~~G~~~~~Rr~al~~~~~~~~i~g~g~~~~~~l~eD~~l~~~~~~~G~ri~~~~~ 225 (254)
T cd04191 155 PVFGRGLAAWQGGEGNYWGHNAIIRVAAFMEHCALPVLPGRPPFGGHILSHDFVEAALMRRAGWEVRLAPD 225 (254)
T ss_pred HHHHHHHHHhcCCccCccceEEEEEHHHHHHhcCCccccCCCCCCCCeecHHHHHHHHHHHcCCEEEEccC
Confidence 0 0 0 0012566999999999988753221 111 22457999999999999988776654
No 39
>PF13632 Glyco_trans_2_3: Glycosyl transferase family group 2
Probab=93.08 E-value=0.4 Score=38.30 Aligned_cols=107 Identities=14% Similarity=-0.003 Sum_probs=64.8
Q ss_pred EEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccC-------cc---cCCCCCCccCCCeeeec
Q psy10879 111 YLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKN-------WF---LCDRYLPYALGGGYVLS 180 (245)
Q Consensus 111 ~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~-------~~---~~~~yp~y~~G~gyvlS 180 (245)
|++.+|+|+.+..+-|.+.+.....++-...-|.+........-.++...+ .. ..-..+.++.|++.+++
T Consensus 1 ~v~~~DaDt~~~~d~l~~~~~~~~~~~~~~vq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~r 80 (193)
T PF13632_consen 1 YVLFLDADTRLPPDFLERLVAALEDPKVDAVQGPIIFRNRGSLLTRLQDFEYAISHGLSRLSQSSLGRPLFLSGSGMLFR 80 (193)
T ss_pred CEEEEcCCCCCChHHHHHHHHHHhCCCceEEEccEEecCCCChhheeehhhhhhhhhhhHHHHHhcCCCccccCcceeee
Confidence 689999999999998888887766211333334332110000000111100 00 01233556889999999
Q ss_pred HHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 181 HLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 181 ~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
+++++++-... -.....||..++.-+.+.|.+....+
T Consensus 81 ~~~l~~vg~~~--~~~~~~ED~~l~~~l~~~G~~~~~~~ 117 (193)
T PF13632_consen 81 REALREVGGFD--DPFSIGEDMDLGFRLRRAGYRIVYVP 117 (193)
T ss_pred HHHHHHhCccc--ccccccchHHHHHHHHHCCCEEEEec
Confidence 99999875333 23455699999988888887665443
No 40
>PF10111 Glyco_tranf_2_2: Glycosyltransferase like family 2; InterPro: IPR019290 This conserved domain is found in a set of prokaryotic proteins including putative glucosyltransferases, which are involved in bacterial capsule biosynthesis [, ].
Probab=93.01 E-value=5.3 Score=34.26 Aligned_cols=163 Identities=11% Similarity=-0.029 Sum_probs=94.9
Q ss_pred CcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCc-cccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHH
Q psy10879 50 YQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQV-PDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIH 128 (245)
Q Consensus 50 ~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~-~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~ 128 (245)
..++++++=+.++.+..+.|.+..+.++-+..+... ....-+. -.+..-..+.++.+|++..|.|+.+..+.+..
T Consensus 33 ~~~eiIvvd~~s~~~~~~~l~~~~~~~~~~~~i~~~~~~~~f~~----a~arN~g~~~A~~d~l~flD~D~i~~~~~i~~ 108 (281)
T PF10111_consen 33 PDFEIIVVDDGSSDEFDEELKKLCEKNGFIRYIRHEDNGEPFSR----AKARNIGAKYARGDYLIFLDADCIPSPDFIEK 108 (281)
T ss_pred CCEEEEEEECCCchhHHHHHHHHHhccCceEEEEcCCCCCCcCH----HHHHHHHHHHcCCCEEEEEcCCeeeCHHHHHH
Confidence 456777766655555556677777777766212211 1111121 23344445566899999999999999999999
Q ss_pred HHh---hcCcCCcccEEEEecCCCeeec----C-C--CCcc--cC-ccc--CCCCC-CccCCCeeeecHHHHHHHHHhcc
Q psy10879 129 ELE---NKFHYEKKLYWGFFDGRARVRR----A-G--KWKE--KN-WFL--CDRYL-PYALGGGYVLSHLLVKFISENSH 192 (245)
Q Consensus 129 ~L~---~~~~~~~~~y~G~~~~~~~~~r----~-~--k~~~--~~-~~~--~~~yp-~y~~G~gyvlS~~~~~~l~~~~~ 192 (245)
.+. ........+.++-+.....+.. . . .|.. .+ ... .+.+. ....|++.+++++.-.++--...
T Consensus 109 ~~~~~~~l~~~~~~~~~~p~~yl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~i~r~~f~~iGGfDE 188 (281)
T PF10111_consen 109 LLNHVKKLDKNPNAFLVYPCLYLSEEGSEKFYSQFKNLWDHEFLESFISGKNSLWEFIAFASSCFLINREDFLEIGGFDE 188 (281)
T ss_pred HHHHHHHHhcCCCceEEEeeeeccchhhHHHhhcchhcchHHHHHHHhhccccccccccccceEEEEEHHHHHHhCCCCc
Confidence 888 4432213444443322111110 0 0 0110 00 000 11221 12346999999999998877776
Q ss_pred cCCCCCCChhHHHHHHHhCCCeEe
Q psy10879 193 MLSSYISEDVSVGVWLSALDITRY 216 (245)
Q Consensus 193 ~~~~~~~eDv~iG~~l~~~~v~~~ 216 (245)
.......||.-++.=|.+.|....
T Consensus 189 ~f~G~G~ED~D~~~RL~~~~~~~~ 212 (281)
T PF10111_consen 189 RFRGWGYEDIDFGYRLKKAGYKFK 212 (281)
T ss_pred cccCCCcchHHHHHHHHHcCCcEe
Confidence 677778999999877887776553
No 41
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen.
Probab=92.92 E-value=4.4 Score=33.07 Aligned_cols=111 Identities=11% Similarity=-0.062 Sum_probs=60.4
Q ss_pred ceeEEEEecCceeeehHHHHHHH---hhcCcCCcccEEEEe-cCCCeeecCCCCcccC------cccCCCC--CCccCCC
Q psy10879 108 DFQYLLKCDDDTFVRVPNIIHEL---ENKFHYEKKLYWGFF-DGRARVRRAGKWKEKN------WFLCDRY--LPYALGG 175 (245)
Q Consensus 108 ~~~~~~k~DDD~~v~~~~L~~~L---~~~~~~~~~~y~G~~-~~~~~~~r~~k~~~~~------~~~~~~y--p~y~~G~ 175 (245)
+++|++..|+|+.+..+.|..++ ........-..+|.. ...........+.... ....... ..++.|+
T Consensus 75 ~~d~v~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (237)
T cd02526 75 GADYVLLFDQDSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGYKLRIQKEGEEGLKEVDFLITS 154 (237)
T ss_pred CCCEEEEECCCCCcCHhHHHHHHHHHHhhccCCCeEEEeeeEEcCCCCeeccceeccCccceecccccCCceEeeeeecc
Confidence 57999999999999988888875 222221022233322 1111110000000000 0000111 1245678
Q ss_pred eeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 176 GYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 176 gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
|.++++++++.+-...... ....||..++.-+...|.+....+
T Consensus 155 ~~~~rr~~~~~~ggfd~~~-~~~~eD~d~~~r~~~~G~~~~~~~ 197 (237)
T cd02526 155 GSLISLEALEKVGGFDEDL-FIDYVDTEWCLRARSKGYKIYVVP 197 (237)
T ss_pred ceEEcHHHHHHhCCCCHHH-cCccchHHHHHHHHHcCCcEEEEc
Confidence 8899999998874332211 234689999888888887665544
No 42
>COG1216 Predicted glycosyltransferases [General function prediction only]
Probab=92.69 E-value=6.2 Score=34.22 Aligned_cols=190 Identities=14% Similarity=0.079 Sum_probs=101.0
Q ss_pred eEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCc-hHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHH
Q psy10879 21 FIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKE-NVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHS 99 (245)
Q Consensus 21 ~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~-~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~ 99 (245)
.+-++|.|--.-....+.++..=..... ...+.++..++ +...+.+.... +.++..+. ..++.- ...=.-.+
T Consensus 4 ~i~~iiv~yn~~~~l~~~l~~l~~~~~~---~~~iv~vDn~s~d~~~~~~~~~~--~~~v~~i~-~~~NlG-~agg~n~g 76 (305)
T COG1216 4 KISIIIVTYNRGEDLVECLASLAAQTYP---DDVIVVVDNGSTDGSLEALKARF--FPNVRLIE-NGENLG-FAGGFNRG 76 (305)
T ss_pred ceEEEEEecCCHHHHHHHHHHHhcCCCC---CcEEEEccCCCCCCCHHHHHhhc--CCcEEEEE-cCCCcc-chhhhhHH
Confidence 4455555554344444455554444431 12233555444 33233333222 67777553 222211 11112245
Q ss_pred HHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCC----eeec-C-------CCCcccC------
Q psy10879 100 FKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRA----RVRR-A-------GKWKEKN------ 161 (245)
Q Consensus 100 l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~----~~~r-~-------~k~~~~~------ 161 (245)
++.+...... |++..++|+.+..+-|.++++...........|...... .+.+ . ..|....
T Consensus 77 ~~~a~~~~~~-~~l~LN~D~~~~~~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305)
T COG1216 77 IKYALAKGDD-YVLLLNPDTVVEPDLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRGGESDGLTGGWRASPLLEIAP 155 (305)
T ss_pred HHHHhcCCCc-EEEEEcCCeeeChhHHHHHHHHHHhCCCCeEeeeeEecCCCCcchheeccccccccccceecccccccc
Confidence 5555444322 999999999999999999998876542344455433210 0110 0 1111111
Q ss_pred -cccCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 162 -WFLCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 162 -~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
.........++.|++.++++++++++-- ...-=....||+-++.=++++|.+..-.+
T Consensus 156 ~~~~~~~~~~~~~G~~~li~~~~~~~vG~-~de~~F~y~eD~D~~~R~~~~G~~i~~~p 213 (305)
T COG1216 156 DLSSYLEVVASLSGACLLIRREAFEKVGG-FDERFFIYYEDVDLCLRARKAGYKIYYVP 213 (305)
T ss_pred cccchhhhhhhcceeeeEEcHHHHHHhCC-CCcccceeehHHHHHHHHHHcCCeEEEee
Confidence 0001112225789999999999999876 22222347999999999999997655433
No 43
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=92.56 E-value=3.1 Score=33.05 Aligned_cols=116 Identities=10% Similarity=-0.010 Sum_probs=64.2
Q ss_pred HHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcC-CcccEEEEecC---CCeeecCCCCccc---Cccc-CCCCCC
Q psy10879 99 SFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHY-EKKLYWGFFDG---RARVRRAGKWKEK---NWFL-CDRYLP 170 (245)
Q Consensus 99 ~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~-~~~~y~G~~~~---~~~~~r~~k~~~~---~~~~-~~~yp~ 170 (245)
++......++.+|++..|+|..+..+.|...+...... ...++.|.... ...... .+.... ..+. ....-+
T Consensus 71 a~N~g~~~a~gd~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 149 (201)
T cd04195 71 ALNEGLKHCTYDWVARMDTDDISLPDRFEKQLDFIEKNPEIDIVGGGVLEFDSDGNDIG-KRRLPTSHDDILKFARRRSP 149 (201)
T ss_pred HHHHHHHhcCCCEEEEeCCccccCcHHHHHHHHHHHhCCCeEEEcccEEEECCCCCeec-cccCCCCHHHHHHHhccCCC
Confidence 34455555689999999999999998888888775322 13455554321 111110 010000 0000 000011
Q ss_pred ccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 171 YALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 171 y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
..|++.++.+.+++++-... +....||..+..-+...|......+
T Consensus 150 -~~~~~~~~rr~~~~~~g~~~---~~~~~eD~~~~~r~~~~g~~~~~~~ 194 (201)
T cd04195 150 -FNHPTVMFRKSKVLAVGGYQ---DLPLVEDYALWARMLANGARFANLP 194 (201)
T ss_pred -CCChHHhhhHHHHHHcCCcC---CCCCchHHHHHHHHHHcCCceeccc
Confidence 23566777777776653322 2267899999888877776655443
No 44
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the
Probab=92.35 E-value=1.4 Score=33.27 Aligned_cols=93 Identities=9% Similarity=-0.088 Sum_probs=51.0
Q ss_pred HHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcC-CcccEEEEecCCCe---eecC---CCCccc-C----ccc
Q psy10879 97 LHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHY-EKKLYWGFFDGRAR---VRRA---GKWKEK-N----WFL 164 (245)
Q Consensus 97 ~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~-~~~~y~G~~~~~~~---~~r~---~k~~~~-~----~~~ 164 (245)
-..+.+..+.++.+|++.+|+|..+..+.|..++...... .-.+..|....... .... ..+... . ...
T Consensus 67 ~~~~n~~~~~~~~~~i~~~D~D~~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (180)
T cd06423 67 AGALNAGLRHAKGDIVVVLDADTILEPDALKRLVVPFFADPKVGAVQGRVRVRNGSENLLTRLQAIEYLSIFRLGRRAQS 146 (180)
T ss_pred hHHHHHHHHhcCCCEEEEECCCCCcChHHHHHHHHHhccCCCeeeEeeeEEEecCcCcceeccchheecceeeeeeehhh
Confidence 3455555666689999999999999877777774554332 12344444321111 0100 000000 0 000
Q ss_pred CCCCCCccCCCeeeecHHHHHHHHH
Q psy10879 165 CDRYLPYALGGGYVLSHLLVKFISE 189 (245)
Q Consensus 165 ~~~yp~y~~G~gyvlS~~~~~~l~~ 189 (245)
.......+.|+++++++++++++-.
T Consensus 147 ~~~~~~~~~g~~~~~~~~~~~~~gg 171 (180)
T cd06423 147 ALGGVLVLSGAFGAFRREALREVGG 171 (180)
T ss_pred eecceeecCchHHHHHHHHHHHhCC
Confidence 1223346789999999999887643
No 45
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi,
Probab=92.22 E-value=5.2 Score=32.25 Aligned_cols=113 Identities=12% Similarity=-0.056 Sum_probs=61.5
Q ss_pred HHHhccceeEEEEecCceeeehHHHHHHHhh-cCcCCcccEEEEecCCCeeecCCCCcc--------cCccc---CCCCC
Q psy10879 102 YIYEHFDFQYLLKCDDDTFVRVPNIIHELEN-KFHYEKKLYWGFFDGRARVRRAGKWKE--------KNWFL---CDRYL 169 (245)
Q Consensus 102 w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~-~~~~~~~~y~G~~~~~~~~~r~~k~~~--------~~~~~---~~~yp 169 (245)
...+....+|++.+|+|..+..+.|..++.. ..+. ..+..|......... ..|.. ...+. ...-.
T Consensus 72 ~g~~~a~gd~i~~lD~D~~~~~~~l~~l~~~~~~~~-~~~v~g~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 148 (224)
T cd06442 72 EGFKAARGDVIVVMDADLSHPPEYIPELLEAQLEGG-ADLVIGSRYVEGGGV--EGWGLKRKLISRGANLLARLLLGRKV 148 (224)
T ss_pred HHHHHcCCCEEEEEECCCCCCHHHHHHHHHHHhcCC-CCEEEEeeeecCCcc--CCCcHHHHHHHHHHHHHHHHHcCCCC
Confidence 3334455699999999999998888888876 3333 556666543211111 11110 00000 01112
Q ss_pred CccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 170 PYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 170 ~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
+.+.|++.++++++++++.... ......+|.-+..-+.+.|......+
T Consensus 149 ~~~~~~~~~~~r~~~~~ig~~~--~~~~~~~~~~l~~~~~~~g~~i~~~p 196 (224)
T cd06442 149 SDPTSGFRAYRREVLEKLIDSL--VSKGYKFQLELLVRARRLGYRIVEVP 196 (224)
T ss_pred CCCCCccchhhHHHHHHHhhhc--cCCCcEEeHHHHHHHHHcCCeEEEeC
Confidence 3467888899999999987221 11112223334444556666655544
No 46
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response.
Probab=92.21 E-value=2.7 Score=34.38 Aligned_cols=120 Identities=13% Similarity=0.015 Sum_probs=69.6
Q ss_pred HHHHHHHhcc--ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecC-CC--eee-cCCCCcccCccc----C-C
Q psy10879 98 HSFKYIYEHF--DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDG-RA--RVR-RAGKWKEKNWFL----C-D 166 (245)
Q Consensus 98 ~~l~w~~~~~--~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~-~~--~~~-r~~k~~~~~~~~----~-~ 166 (245)
.++.+..+.+ +++|++..|+|+.+..+.|.+++.......-.+..|.... .. .+. +...|.....+. . .
T Consensus 72 ~a~n~g~~~a~~~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (236)
T cd06435 72 GALNYALERTAPDAEIIAVIDADYQVEPDWLKRLVPIFDDPRVGFVQAPQDYRDGEESLFKRMCYAEYKGFFDIGMVSRN 151 (236)
T ss_pred HHHHHHHHhcCCCCCEEEEEcCCCCcCHHHHHHHHHHhcCCCeeEEecCccccCCCccHHHHHHhHHHHHHHHHHhcccc
Confidence 3566666666 4799999999999999999988877643212222221110 00 000 000010000000 0 0
Q ss_pred CC-CCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecCC
Q psy10879 167 RY-LPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDVR 220 (245)
Q Consensus 167 ~y-p~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~~ 220 (245)
.. -.++.|++.++++++++.+-.... .+..||..++.=+.+.|.+....+.
T Consensus 152 ~~~~~~~~g~~~~~rr~~~~~iGgf~~---~~~~eD~dl~~r~~~~G~~~~~~~~ 203 (236)
T cd06435 152 ERNAIIQHGTMCLIRRSALDDVGGWDE---WCITEDSELGLRMHEAGYIGVYVAQ 203 (236)
T ss_pred ccCceEEecceEEEEHHHHHHhCCCCC---ccccchHHHHHHHHHCCcEEEEcch
Confidence 00 124678889999999998743222 2357999999888888887765543
No 47
>PRK14716 bacteriophage N4 adsorption protein B; Provisional
Probab=91.97 E-value=11 Score=35.47 Aligned_cols=193 Identities=13% Similarity=0.114 Sum_probs=96.2
Q ss_pred CCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHH
Q psy10879 18 HSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVL 97 (245)
Q Consensus 18 ~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~ 97 (245)
+.+.+-|+|+.--+..--.+.|+..=.+.. ..+.+++++.....++..+.+++=+.+|..+..+- ... .+-+-|.-
T Consensus 64 ~~p~vaIlIPA~NE~~vI~~~l~s~L~~ld-Y~~~eIiVv~d~ndd~T~~~v~~l~~~~p~v~~vv-~~~--~gp~~Ka~ 139 (504)
T PRK14716 64 PEKRIAIFVPAWREADVIGRMLEHNLATLD-YENYRIFVGTYPNDPATLREVDRLAARYPRVHLVI-VPH--DGPTSKAD 139 (504)
T ss_pred CCCceEEEEeccCchhHHHHHHHHHHHcCC-CCCeEEEEEECCCChhHHHHHHHHHHHCCCeEEEE-eCC--CCCCCHHH
Confidence 355666666665443323334443311111 12344444443222333344443356677653221 111 11123433
Q ss_pred HHHHHHH------hcc---ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCC----cccCccc
Q psy10879 98 HSFKYIY------EHF---DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKW----KEKNWFL 164 (245)
Q Consensus 98 ~~l~w~~------~~~---~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~----~~~~~~~ 164 (245)
.+.|+. +.+ ++++++..|-|..+.++.|..+- ...+. ..+.-..+..... ..+.| +..++..
T Consensus 140 -aLN~~l~~~~~~e~~~G~~~d~vvi~DAD~~v~Pd~Lr~~~-~~~~~-~~~VQ~pv~~~~~--~~~~~~ag~y~~ef~~ 214 (504)
T PRK14716 140 -CLNWIYQAIFAFERERGIRFAIIVLHDAEDVIHPLELRLYN-YLLPR-HDFVQLPVFSLPR--DWGEWVAGTYMDEFAE 214 (504)
T ss_pred -HHHHHHHHHHHhhhhcCCCcCEEEEEcCCCCcCccHHHHHH-hhcCC-CCEEecceeccCC--chhHHHHHHHHHHHHH
Confidence 333332 222 46999999999999998887643 33332 2221111111100 01111 1111100
Q ss_pred --------CCC--CCCccCCCeeeecHHHHHHHHHhcc---cCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 165 --------CDR--YLPYALGGGYVLSHLLVKFISENSH---MLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 165 --------~~~--yp~y~~G~gyvlS~~~~~~l~~~~~---~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
-+. -+-.+.|.|+++++++++++..... ..+..--||..+|.-+...|.+..-.+
T Consensus 215 ~~~~~l~~r~~LG~~~~~~Gtg~afRR~aLe~l~~~~GG~~fd~~sLTED~dLglRL~~~G~rv~y~p 282 (504)
T PRK14716 215 SHLKDLPVREALGGLIPSAGVGTAFSRRALERLAAERGGQPFDSDSLTEDYDIGLRLKRAGFRQIFVR 282 (504)
T ss_pred HHHHHHHHHHhcCCccccCCeeEEeEHHHHHHHHhhcCCCCCCCCCcchHHHHHHHHHHCCCEEEEec
Confidence 011 1224679999999999999865332 223456899999999999998876543
No 48
>cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily.
Probab=89.35 E-value=4.5 Score=31.59 Aligned_cols=132 Identities=11% Similarity=0.032 Sum_probs=71.7
Q ss_pred cEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHH
Q psy10879 51 QVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHEL 130 (245)
Q Consensus 51 ~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L 130 (245)
..+++.+-+.+.+...+.+....+++..+..+. .. .|. .+ -.++....+....+|++.+|+|.....+-|..++
T Consensus 29 ~~eiivvdd~s~d~t~~~~~~~~~~~~~i~~i~-~~---~n~-G~-~~a~n~g~~~a~~d~i~~~D~D~~~~~~~l~~l~ 102 (181)
T cd04187 29 DYEIIFVDDGSTDRTLEILRELAARDPRVKVIR-LS---RNF-GQ-QAALLAGLDHARGDAVITMDADLQDPPELIPEML 102 (181)
T ss_pred CeEEEEEeCCCCccHHHHHHHHHhhCCCEEEEE-ec---CCC-Cc-HHHHHHHHHhcCCCEEEEEeCCCCCCHHHHHHHH
Confidence 356566555444333333444444555554332 12 221 11 2344444455667999999999999888788888
Q ss_pred hhcCcCCcccEEEEecCCCeeecCCCCcc------cCcccCCCCCCccCCCeeeecHHHHHHHHHhc
Q psy10879 131 ENKFHYEKKLYWGFFDGRARVRRAGKWKE------KNWFLCDRYLPYALGGGYVLSHLLVKFISENS 191 (245)
Q Consensus 131 ~~~~~~~~~~y~G~~~~~~~~~r~~k~~~------~~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~ 191 (245)
+..... ..+.+|.......+. ..++.. .... ....-+...|++.++++.+++.+....
T Consensus 103 ~~~~~~-~~~v~g~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~i~~~d 166 (181)
T cd04187 103 AKWEEG-YDVVYGVRKNRKESW-LKRLTSKLFYRLINKL-SGVDIPDNGGDFRLMDRKVVDALLLLP 166 (181)
T ss_pred HHHhCC-CcEEEEEecCCcchH-HHHHHHHHHHHHHHHH-cCCCCCCCCCCEEEEcHHHHHHHHhcC
Confidence 775554 567777654321100 001100 0000 111123466888999999999887544
No 49
>TIGR01556 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids are glycolipids containing mono- or di- L-rhamnose molecules. Rhamnolipid synthesis occurs by sequential glycosyltransferase reactions involving two distinct rhamnosyltransferase enzymes. In P.aeruginosa, the synthesis of mono-rhamnolipids is catalyzed by rhamnosyltransferase 1, and proceeds by a glycosyltransfer reaction catalyzed by rhamnosyltransferase 2 to yield di-rhamnolipids.
Probab=88.63 E-value=13 Score=31.54 Aligned_cols=117 Identities=12% Similarity=0.052 Sum_probs=61.1
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCC-cccEEEE-ecCCCeee-----cCCCCcccCcccC---CC
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYE-KKLYWGF-FDGRARVR-----RAGKWKEKNWFLC---DR 167 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~-~~~y~G~-~~~~~~~~-----r~~k~~~~~~~~~---~~ 167 (245)
.+++++.+ .+++|++..|||+.+..+.|..++....... ...++|- +....... +...+........ ..
T Consensus 64 ~Gi~~a~~-~~~d~i~~lD~D~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (281)
T TIGR01556 64 QGLDASFR-RGVQGVLLLDQDSRPGNAFLAAQWKLLSAENGQACALGPRFFDRGTSRRLPAIHLDGLLLRQISLDGLTTP 142 (281)
T ss_pred HHHHHHHH-CCCCEEEEECCCCCCCHHHHHHHHHHHHhcCCceEEECCeEEcCCCcccCCceeecccceeeecccccCCc
Confidence 35555532 2789999999999999887777776554320 1223332 11111000 0000000000000 00
Q ss_pred C-CCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEe
Q psy10879 168 Y-LPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRY 216 (245)
Q Consensus 168 y-p~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~ 216 (245)
. ..++.++|.++++++++++-.-.... .+..||+-+..=+.+.|.+..
T Consensus 143 ~~~~~~~~sg~li~~~~~~~iG~fde~~-fi~~~D~e~~~R~~~~G~~i~ 191 (281)
T TIGR01556 143 QKTSFLISSGCLITREVYQRLGMMDEEL-FIDHVDTEWSLRAQNYGIPLY 191 (281)
T ss_pred eeccEEEcCcceeeHHHHHHhCCccHhh-cccchHHHHHHHHHHCCCEEE
Confidence 1 12445667789999998874321111 235689988766677776544
No 50
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=86.46 E-value=6.1 Score=31.78 Aligned_cols=105 Identities=15% Similarity=-0.066 Sum_probs=59.2
Q ss_pred HHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecC----CCeeec--CCCCcccCcccCCCCCCccCCCe
Q psy10879 103 IYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDG----RARVRR--AGKWKEKNWFLCDRYLPYALGGG 176 (245)
Q Consensus 103 ~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~----~~~~~r--~~k~~~~~~~~~~~yp~y~~G~g 176 (245)
..+.++.+|++.+|+|..+..+.+..++...... ...+|.... .....+ ..++.... .....+ .++.+
T Consensus 67 g~~~a~~~~i~~~D~D~~~~~~~l~~l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~~~ 140 (221)
T cd02522 67 GAAAARGDWLLFLHADTRLPPDWDAAIIETLRAD--GAVAGAFRLRFDDPGPRLRLLELGANLRS---RLFGLP-YGDQG 140 (221)
T ss_pred HHHhccCCEEEEEcCCCCCChhHHHHHHHHhhcC--CcEEEEEEeeecCCccchhhhhhccccee---cccCCC-cCCce
Confidence 3344578999999999999887777766655443 333343211 110101 00000000 000111 24568
Q ss_pred eeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEe
Q psy10879 177 YVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRY 216 (245)
Q Consensus 177 yvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~ 216 (245)
.++++++.+.+-..... +..||.-++.-+.+.|....
T Consensus 141 ~~~r~~~~~~~G~fd~~---~~~ED~d~~~r~~~~G~~~~ 177 (221)
T cd02522 141 LFIRRELFEELGGFPEL---PLMEDVELVRRLRRRGRPAL 177 (221)
T ss_pred EEEEHHHHHHhCCCCcc---ccccHHHHHHHHHhCCCEEE
Confidence 89999988877543322 27899988877777775543
No 51
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils.
Probab=86.09 E-value=37 Score=33.45 Aligned_cols=116 Identities=14% Similarity=0.091 Sum_probs=67.5
Q ss_pred HHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcc-cEEEEec--CCCee-ecC-CCCc-cc-C---ccc----
Q psy10879 99 SFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKK-LYWGFFD--GRARV-RRA-GKWK-EK-N---WFL---- 164 (245)
Q Consensus 99 ~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~-~y~G~~~--~~~~~-~r~-~k~~-~~-~---~~~---- 164 (245)
.+....++.+.+|++..|.|+.+..+-|.+.+.....+ .. ..++... .+..+ .++ +... .+ + +|.
T Consensus 219 nLN~al~~a~gd~Il~lDAD~v~~pd~L~~~v~~f~~d-p~v~~Vqtp~~f~~p~~~~~nl~~~~~~~~e~~~f~~~i~~ 297 (713)
T TIGR03030 219 NINNALKHTDGELILIFDADHVPTRDFLQRTVGWFVED-PKLFLVQTPHFFVSPDPIERNLGTFRRMPNENELFYGLIQD 297 (713)
T ss_pred HHHHHHHhcCCCEEEEECCCCCcChhHHHHHHHHHHhC-CCEEEEeCCeeccCCCHHhhhhHHHHHhhhHHHHHHHHHHH
Confidence 34555666788999999999999999888888765322 22 2222111 11111 111 0000 00 0 010
Q ss_pred -CCCC-CCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeec
Q psy10879 165 -CDRY-LPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHD 218 (245)
Q Consensus 165 -~~~y-p~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~ 218 (245)
.+.+ ..++.|++.++.|++++.+--.. .....||..+|.-+...|.+....
T Consensus 298 g~~~~~~~~~~Gs~~~iRR~al~~iGGf~---~~~vtED~~l~~rL~~~G~~~~y~ 350 (713)
T TIGR03030 298 GNDFWNAAFFCGSAAVLRREALDEIGGIA---GETVTEDAETALKLHRRGWNSAYL 350 (713)
T ss_pred HHhhhCCeeecCceeEEEHHHHHHcCCCC---CCCcCcHHHHHHHHHHcCCeEEEe
Confidence 0111 23566889999999998763222 122479999999999888875543
No 52
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=85.81 E-value=38 Score=33.50 Aligned_cols=190 Identities=9% Similarity=0.070 Sum_probs=100.0
Q ss_pred CCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHH
Q psy10879 18 HSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVL 97 (245)
Q Consensus 18 ~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~ 97 (245)
+.+.+-|+|+.--+..--.+.|...=. ..+..+.+++++.....+...+.+++-.++|+++..+..-... + +.|.
T Consensus 61 ~~~~vsIlVPa~nE~~vi~~~i~~ll~-~ldYP~~eI~vi~~~nD~~T~~~~~~l~~~~p~~~~v~~~~~g-~--~gKa- 135 (727)
T PRK11234 61 DEKPLAIMVPAWNETGVIGNMAELAAT-TLDYENYHIFVGTYPNDPATQADVDAVCARFPNVHKVVCARPG-P--TSKA- 135 (727)
T ss_pred CCCCEEEEEecCcchhhHHHHHHHHHH-hCCCCCeEEEEEecCCChhHHHHHHHHHHHCCCcEEEEeCCCC-C--CCHH-
Confidence 446677777764444333333333211 2222235556655422244445556556778875322111111 1 2343
Q ss_pred HHHHHHHhcc---------ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecC-CCC----cccCcc
Q psy10879 98 HSFKYIYEHF---------DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRA-GKW----KEKNWF 163 (245)
Q Consensus 98 ~~l~w~~~~~---------~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~-~k~----~~~~~~ 163 (245)
.++.|+...+ +++.++..|-|+.|.++.|. .+...... ....-+-.. +..++ +.| +..++.
T Consensus 136 ~aLN~~l~~~~~~e~~~~~~~~vvvi~DAD~~v~pd~L~-~~~~l~~~-~~~VQ~p~~---p~~~~~~~~~~~~~~~EFa 210 (727)
T PRK11234 136 DCLNNVLDAITQFERSANFAFAGFILHDAEDVISPMELR-LFNYLVER-KDLIQIPVY---PFEREWTHFTSGTYIDEFA 210 (727)
T ss_pred HHHHHHHHHHHhhhcccCCcccEEEEEcCCCCCChhHHH-HHHhhcCC-CCeEeeccc---CCCccHHHHHHHHHHHHHH
Confidence 3344444432 34567889999999999997 44444432 222222111 11111 111 111111
Q ss_pred ----c----CCC--CCCccCCCeeeecHHHHHHHHHhc---ccCCCCCCChhHHHHHHHhCCCeEee
Q psy10879 164 ----L----CDR--YLPYALGGGYVLSHLLVKFISENS---HMLSSYISEDVSVGVWLSALDITRYH 217 (245)
Q Consensus 164 ----~----~~~--yp~y~~G~gyvlS~~~~~~l~~~~---~~~~~~~~eDv~iG~~l~~~~v~~~~ 217 (245)
. ... -+-.++|.|+.+||.+++.+.... ......--||.-+|.-+...|.+..-
T Consensus 211 ~~~~~~~~~~~~lgg~~~l~G~~~af~Rr~l~al~~~ggg~~~~~~~lTED~dlg~rL~~~G~~v~f 277 (727)
T PRK11234 211 ELHGKDVPVREALAGQVPSAGVGTCFSRRAVTALLEDGDGIAFDVQSLTEDYDIGFRLKEKGMREIF 277 (727)
T ss_pred HHhhhhhHHHHHcCCCcccCCceEEEecccHHHHHHhcCCCCcCCCcchHHHHHHHHHHHCCCEEEE
Confidence 0 011 233578999999999877776655 33445568999999999999987653
No 53
>cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex
Probab=85.21 E-value=7.6 Score=30.14 Aligned_cols=130 Identities=8% Similarity=-0.060 Sum_probs=67.6
Q ss_pred EEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHh
Q psy10879 52 VKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELE 131 (245)
Q Consensus 52 ~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~ 131 (245)
.+++.+-..+.+.....+.+..+++..+..+. ..... + .-.++....+....+|++..|+|..+..+.|..++.
T Consensus 29 ~eiivvd~~s~d~~~~~~~~~~~~~~~~~~~~-~~~n~-G----~~~a~n~g~~~a~gd~i~~lD~D~~~~~~~l~~l~~ 102 (185)
T cd04179 29 YEIIVVDDGSTDGTAEIARELAARVPRVRVIR-LSRNF-G----KGAAVRAGFKAARGDIVVTMDADLQHPPEDIPKLLE 102 (185)
T ss_pred EEEEEEcCCCCCChHHHHHHHHHhCCCeEEEE-ccCCC-C----ccHHHHHHHHHhcCCEEEEEeCCCCCCHHHHHHHHH
Confidence 44454444443333444444455555543221 12221 1 123334444455569999999999999888888888
Q ss_pred h-cCcCCcccEEEEecCCCeee-cC--CCCc---ccCc-ccCCCCC-CccCCCeeeecHHHHHHHH
Q psy10879 132 N-KFHYEKKLYWGFFDGRARVR-RA--GKWK---EKNW-FLCDRYL-PYALGGGYVLSHLLVKFIS 188 (245)
Q Consensus 132 ~-~~~~~~~~y~G~~~~~~~~~-r~--~k~~---~~~~-~~~~~yp-~y~~G~gyvlS~~~~~~l~ 188 (245)
. .... ..+.+|......... .. .++. .... ......+ ....|+.+++++++++.+-
T Consensus 103 ~~~~~~-~~~v~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~i~ 167 (185)
T cd04179 103 KLLEGG-ADVVIGSRFVRGGGAGMPLLRRLGSRLFNFLIRLLLGVRISDTQSGFRLFRREVLEALL 167 (185)
T ss_pred HHhccC-CcEEEEEeecCCCcccchHHHHHHHHHHHHHHHHHcCCCCcCCCCceeeeHHHHHHHHH
Confidence 6 3333 567777644221100 00 0010 0000 0001122 2356778899999999986
No 54
>cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als
Probab=84.65 E-value=13 Score=26.97 Aligned_cols=85 Identities=15% Similarity=-0.002 Sum_probs=50.3
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCCCccCCCee
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYLPYALGGGY 177 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp~y~~G~gy 177 (245)
.++..+.+.++.+|++.+|+|..+..+.+...+...........++. .+++
T Consensus 67 ~~~~~~~~~~~~d~v~~~d~D~~~~~~~~~~~~~~~~~~~~~~~v~~-----------------------------~~~~ 117 (156)
T cd00761 67 AARNAGLKAARGEYILFLDADDLLLPDWLERLVAELLADPEADAVGG-----------------------------PGNL 117 (156)
T ss_pred HHHHHHHHHhcCCEEEEECCCCccCccHHHHHHHHHhcCCCceEEec-----------------------------cchh
Confidence 33344444448999999999999988888876444332201111111 0678
Q ss_pred eecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCC
Q psy10879 178 VLSHLLVKFISENSHMLSSYISEDVSVGVWLSALD 212 (245)
Q Consensus 178 vlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~ 212 (245)
++++++++.+......... ..||..++.-+...|
T Consensus 118 ~~~~~~~~~~~~~~~~~~~-~~ed~~~~~~~~~~g 151 (156)
T cd00761 118 LFRRELLEEIGGFDEALLS-GEEDDDFLLRLLRGG 151 (156)
T ss_pred eeeHHHHHHhCCcchHhcC-CcchHHHHHHHHhhc
Confidence 8888888876544332221 267777765555443
No 55
>PRK05454 glucosyltransferase MdoH; Provisional
Probab=84.16 E-value=24 Score=34.67 Aligned_cols=198 Identities=9% Similarity=0.036 Sum_probs=106.8
Q ss_pred CCCCCeeEEEEEeCCCCCHHH-HHHHHHHHcCCc---CCCcEEEEEEeecCc-hHHH----HHHHHHhhhC---CCEEEc
Q psy10879 15 NLKHSYFIVILILTSSKNYIR-RKNIRDTWIKNI---EKYQVKYLFSIGNKE-NVVD----VKLKEEIRRY---DDILLL 82 (245)
Q Consensus 15 ~c~~~~~lli~V~S~~~~~~~-R~~IR~TW~~~~---~~~~~~~~Fv~g~~~-~~~~----~~l~~E~~~~---~Dii~~ 82 (245)
..+..+++.|+|++.-+..++ ++.|+.+..+-. ...+.++ |++..+. ++.. ..+.+-.+++ +.+...
T Consensus 119 ~~~~~~~VaVliP~yNEd~~~v~~~L~a~~~Sl~~~~~~~~~e~-~vLdD~~d~~~~~~e~~~~~~L~~~~~~~~~i~yr 197 (691)
T PRK05454 119 PPPPEARTAILMPIYNEDPARVFAGLRAMYESLAATGHGAHFDF-FILSDTRDPDIAAAEEAAWLELRAELGGEGRIFYR 197 (691)
T ss_pred CCCCCCceEEEEeCCCCChHHHHHHHHHHHHHHHhcCCCCCEEE-EEEECCCChhHHHHHHHHHHHHHHhcCCCCcEEEE
Confidence 455667888888888766543 346666664321 1223454 7777766 3321 1122223344 345532
Q ss_pred CCccccCCchhHHHHHHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeec-CCCCccc
Q psy10879 83 NQVPDEYSSLSQKVLHSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRR-AGKWKEK 160 (245)
Q Consensus 83 ~d~~d~y~~l~~K~~~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r-~~k~~~~ 160 (245)
. -..|.-+|.-..-.++.... +++|++-.|-|+.+..+.|.+.+..+..+ .. +|-+.....+.. ++-+..-
T Consensus 198 ~----R~~n~~~KaGNl~~~~~~~~~~~eyivvLDADs~m~~d~L~~lv~~m~~d-P~--vGlVQt~~~~~n~~slfaR~ 270 (691)
T PRK05454 198 R----RRRNVGRKAGNIADFCRRWGGAYDYMVVLDADSLMSGDTLVRLVRLMEAN-PR--AGLIQTLPVAVGADTLFARL 270 (691)
T ss_pred E----CCcCCCccHHHHHHHHHhcCCCcCEEEEEcCCCCCCHHHHHHHHHHHhhC-cC--EEEEeCCccCcCCCCHHHHH
Confidence 1 11333357666666664333 78999999999999999999999876432 22 344332211111 1101000
Q ss_pred CcccCCCCC--------------CccCCCeeeecHHHHHHHHH-----hccc-CCCCCCChhHHHHHHHhCCCeEeecCC
Q psy10879 161 NWFLCDRYL--------------PYALGGGYVLSHLLVKFISE-----NSHM-LSSYISEDVSVGVWLSALDITRYHDVR 220 (245)
Q Consensus 161 ~~~~~~~yp--------------~y~~G~gyvlS~~~~~~l~~-----~~~~-~~~~~~eDv~iG~~l~~~~v~~~~~~~ 220 (245)
..+....|- -.+.|...++.+++....-. .... -...-.||...|..+...|-+.+..+.
T Consensus 271 qqf~~~~y~~~~~~G~~~w~~~~g~f~G~naIiR~~af~~~~glp~L~g~~p~~~~~LseD~~~a~~l~~~GyrV~~~pd 350 (691)
T PRK05454 271 QQFATRVYGPLFAAGLAWWQGGEGNYWGHNAIIRVKAFAEHCGLPPLPGRGPFGGHILSHDFVEAALMRRAGWGVWLAPD 350 (691)
T ss_pred HHHHHHHHHHHHHhhhhhhccCccccccceEEEEHHHHHHhcCCccccccCCCCCCcccHHHHHHHHHHHCCCEEEEcCc
Confidence 000000000 01235556788887765421 1111 122357899999999999988776555
No 56
>cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate.
Probab=83.44 E-value=21 Score=28.54 Aligned_cols=161 Identities=12% Similarity=-0.009 Sum_probs=86.7
Q ss_pred cEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHH
Q psy10879 51 QVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHEL 130 (245)
Q Consensus 51 ~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L 130 (245)
..+++.+-+.+.+...+.+++..++++..+++-....+ .+ + -.++....+.+..+|++.+|.|....++.+..++
T Consensus 30 ~~eiivvdd~S~D~t~~~~~~~~~~~~~~i~~i~~~~n-~G---~-~~a~~~g~~~a~gd~i~~ld~D~~~~~~~l~~l~ 104 (211)
T cd04188 30 SYEIIVVDDGSKDGTAEVARKLARKNPALIRVLTLPKN-RG---K-GGAVRAGMLAARGDYILFADADLATPFEELEKLE 104 (211)
T ss_pred CEEEEEEeCCCCCchHHHHHHHHHhCCCcEEEEEcccC-CC---c-HHHHHHHHHHhcCCEEEEEeCCCCCCHHHHHHHH
Confidence 35666665555444444455555666654211011111 11 1 2333344455567999999999999999998888
Q ss_pred hh-cCcCCcccEEEEecCCCeee-cCCCCc---ccC----cccCCCCCCc--cCCCeeeecHHHHHHHHHhcccCCCCCC
Q psy10879 131 EN-KFHYEKKLYWGFFDGRARVR-RAGKWK---EKN----WFLCDRYLPY--ALGGGYVLSHLLVKFISENSHMLSSYIS 199 (245)
Q Consensus 131 ~~-~~~~~~~~y~G~~~~~~~~~-r~~k~~---~~~----~~~~~~yp~y--~~G~gyvlS~~~~~~l~~~~~~~~~~~~ 199 (245)
.. .... ..+.+|......... ....|. .+. +...-.-+++ +..+..++++.+++.+..... .. ...
T Consensus 105 ~~~~~~~-~~~v~g~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~g~~~~~r~~~~~~~~~~~-~~-~~~ 181 (211)
T cd04188 105 EALKTSG-YDIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVRLLLGLGIKDTQCGFKLFTRDAARRLFPRLH-LE-RWA 181 (211)
T ss_pred HHHhccC-CcEEEEEeeccCCcccccccHHHHHHHHHHHHHHHHHcCCCCcccccCceeEcHHHHHHHHhhhh-cc-ceE
Confidence 87 3333 567778654221100 000111 000 0000001112 234668999999999874432 22 234
Q ss_pred ChhHHHHHHHhCCCeEeecC
Q psy10879 200 EDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 200 eDv~iG~~l~~~~v~~~~~~ 219 (245)
+|..+-.-+.+.|....+.|
T Consensus 182 ~d~el~~r~~~~g~~~~~vp 201 (211)
T cd04188 182 FDVELLVLARRLGYPIEEVP 201 (211)
T ss_pred eeHHHHHHHHHcCCeEEEcC
Confidence 58877777777888777666
No 57
>PF02485 Branch: Core-2/I-Branching enzyme; InterPro: IPR003406 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This is the glycosyltransferase family 14 GT14 from CAZY, a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme (2.4.1.150 from EC) and core-2 branching enzyme (2.4.1.102 from EC). I-branching enzyme, an integral membrane protein, converts linear into branched poly-N-acetyllactosaminoglycans in the glycosylation pathway, and is responsible for the production of the blood group I-antigen during embryonic development []. Core-2 branching enzyme, also an integral membrane protein, forms crucial side-chain branches in O-glycans in the glycosylation pathway [].; GO: 0008375 acetylglucosaminyltransferase activity, 0016020 membrane; PDB: 3OTK_D 2GAM_A 2GAK_B.
Probab=82.44 E-value=16 Score=30.45 Aligned_cols=213 Identities=15% Similarity=0.203 Sum_probs=96.7
Q ss_pred EEEeCCCCCHHH-HHHHHHHHcCCcCCCcEEEEEEeecCc-hHHHHHHHHHhhhCCCEEEcCCccc---cCCchhHHHHH
Q psy10879 24 ILILTSSKNYIR-RKNIRDTWIKNIEKYQVKYLFSIGNKE-NVVDVKLKEEIRRYDDILLLNQVPD---EYSSLSQKVLH 98 (245)
Q Consensus 24 i~V~S~~~~~~~-R~~IR~TW~~~~~~~~~~~~Fv~g~~~-~~~~~~l~~E~~~~~Dii~~~d~~d---~y~~l~~K~~~ 98 (245)
.+|+....+.+. +..++.....+. .+.+.+.... ....+.+..-...++.+..+++-.+ +-.++..=++.
T Consensus 3 ylil~h~~~~~~~~~l~~~l~~~~~-----~f~iHiD~k~~~~~~~~~~~~~~~~~nv~~v~~r~~v~WG~~S~v~A~l~ 77 (244)
T PF02485_consen 3 YLILAHKNDPEQLERLLRLLYHPDN-----DFYIHIDKKSPDYFYEEIKKLISCFPNVHFVPKRVDVRWGGFSLVEATLN 77 (244)
T ss_dssp EEEEESS--HHHHHHHHHHH--TTS-----EEEEEE-TTS-HHHHHHHHHHHCT-TTEEE-SS-----TTSHHHHHHHHH
T ss_pred EEEEecCCCHHHHHHHHHHhcCCCC-----EEEEEEcCCCChHHHHHHHHhcccCCceeecccccccccCCccHHHHHHH
Confidence 344443333333 346666664444 4555566653 3344445554677788765652222 23456677888
Q ss_pred HHHHHHhc-cceeEEEEecCceeee--hHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccC----CCCCCc
Q psy10879 99 SFKYIYEH-FDFQYLLKCDDDTFVR--VPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLC----DRYLPY 171 (245)
Q Consensus 99 ~l~w~~~~-~~~~~~~k~DDD~~v~--~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~----~~yp~y 171 (245)
.++.+.+. .+.+|++.+-.+-|.- .+.+.++|+..... ...+-+....... .+ .+|........ ..-..|
T Consensus 78 ll~~al~~~~~~~y~~llSg~D~Pl~s~~~i~~~l~~~~~~-~~f~~~~~~~~~~-~~-~r~~~~~~~~~~~~~~~~~~~ 154 (244)
T PF02485_consen 78 LLREALKRDGDWDYFILLSGQDYPLKSNEEIHEFLESNNGD-NNFIESFSDEDPR-ES-GRYNPRIYDPFRPFFRKRTLY 154 (244)
T ss_dssp HHHHHHHH-S---EEEEEETTEEESS-HHHHHHHHHHTTT---B---BEE--GGG-G--HHHHEEEETTEEEEEEEE--E
T ss_pred HHHHHHhcCCCCcEEEEcccccccccchHHHHHHHHhcCCC-Ccceecccccccc-hh-hcceeeeeeeccccccccccc
Confidence 89999885 4899988888777764 47788888777432 2222222111110 00 11111100000 000225
Q ss_pred cCCCeeeecHHHHHHHHHhcc-------cC-CCCCCChhHHHHHHHhCCCe-E---eecCCcccccCCCCccCCeEEEec
Q psy10879 172 ALGGGYVLSHLLVKFISENSH-------ML-SSYISEDVSVGVWLSALDIT-R---YHDVRFDTEFQSRGCNNSYLIVHK 239 (245)
Q Consensus 172 ~~G~gyvlS~~~~~~l~~~~~-------~~-~~~~~eDv~iG~~l~~~~v~-~---~~~~~f~~~~~~~~c~~~~~~~h~ 239 (245)
-|..-+.||+++++-|+.... .+ .....|..++.-++...+-. . .+.-++..+.+..+|..+.+....
T Consensus 155 ~GSqW~~Ltr~~v~~il~~~~~~~~~~~~~~~~~~pDE~ffqTll~n~~~~~~~~~~~~~r~i~W~~~~~~~p~~~~~~~ 234 (244)
T PF02485_consen 155 KGSQWFSLTRDFVEYILDDPNYRPKLKKYFRFSLCPDESFFQTLLNNSGHFKDTIVNRNLRYIDWSRRGGCHPKTLTICD 234 (244)
T ss_dssp EE-S--EEEHHHHHHHHH-HHHHHHHHHHT-TSSSGGGTHHHHH--SSGGG-B-TTTSSSEEE-BTGT-SS---SSEEEE
T ss_pred ccceeeEeeHHHHHHhhhhHHHHHHHHHhhcCccCcchhhHHHhhcccchhcccccCCCEEEEECCCCCCCCCCeeeeee
Confidence 566669999999999885432 12 23467888887665543211 1 112344443335666666666776
Q ss_pred cCCCC
Q psy10879 240 QNMHQ 244 (245)
Q Consensus 240 ~~p~~ 244 (245)
+++.+
T Consensus 235 ~~~~d 239 (244)
T PF02485_consen 235 LGPED 239 (244)
T ss_dssp --GGG
T ss_pred eCHHH
Confidence 66654
No 58
>cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we
Probab=82.30 E-value=25 Score=28.57 Aligned_cols=114 Identities=7% Similarity=-0.128 Sum_probs=62.8
Q ss_pred HHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcc-----c-C-cc--------
Q psy10879 99 SFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKE-----K-N-WF-------- 163 (245)
Q Consensus 99 ~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~-----~-~-~~-------- 163 (245)
++....+..+.+|++.+|.|+.+..+.|...+.....+.-....|.+... .....|.. . . .+
T Consensus 78 a~n~g~~~a~~~~i~~~DaD~~~~~~~l~~~~~~~~~~~v~~v~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (232)
T cd06437 78 ALAEGMKVAKGEYVAIFDADFVPPPDFLQKTPPYFADPKLGFVQTRWGHI---NANYSLLTRVQAMSLDYHFTIEQVARS 154 (232)
T ss_pred HHHHHHHhCCCCEEEEEcCCCCCChHHHHHhhhhhcCCCeEEEecceeeE---cCCCchhhHhhhhhHHhhhhHhHhhHh
Confidence 34555566689999999999999998888855444332112233322110 01111110 0 0 00
Q ss_pred cCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeecC
Q psy10879 164 LCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHDV 219 (245)
Q Consensus 164 ~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~~ 219 (245)
....+ ..+.|++-++.+++++.+--.. .....||+.++.-+...|.+....+
T Consensus 155 ~~~~~-~~~~g~~~~~rr~~~~~vgg~~---~~~~~ED~~l~~rl~~~G~~~~~~~ 206 (232)
T cd06437 155 STGLF-FNFNGTAGVWRKECIEDAGGWN---HDTLTEDLDLSYRAQLKGWKFVYLD 206 (232)
T ss_pred hcCCe-EEeccchhhhhHHHHHHhCCCC---CCcchhhHHHHHHHHHCCCeEEEec
Confidence 01111 1134556678888887763222 1235799999988888887665544
No 59
>PF05212 DUF707: Protein of unknown function (DUF707); InterPro: IPR007877 This family consists of uncharacterised proteins from Arabidopsis thaliana.
Probab=79.43 E-value=7.1 Score=33.80 Aligned_cols=177 Identities=17% Similarity=0.106 Sum_probs=88.9
Q ss_pred HHHHHcCCcC----CCcEEEEEEeecCc-hHHHHHHHHHhhhCCCEEEc--CCccccCCch-------------hHHHHH
Q psy10879 39 IRDTWIKNIE----KYQVKYLFSIGNKE-NVVDVKLKEEIRRYDDILLL--NQVPDEYSSL-------------SQKVLH 98 (245)
Q Consensus 39 IR~TW~~~~~----~~~~~~~Fv~g~~~-~~~~~~l~~E~~~~~Dii~~--~d~~d~y~~l-------------~~K~~~ 98 (245)
+|+-|++..+ ..+.-+.+-+|..+ ..++..+.+.++ .=||+++ .+..|.+.++ ..|-..
T Consensus 26 ~r~lw~~p~~~~~~~~k~Lla~~VG~kqk~~vd~~v~Kf~~-nF~i~LfhYDg~vd~w~~~~ws~~aiHv~~~kqtKww~ 104 (294)
T PF05212_consen 26 LRPLWGNPSEDLPKKPKYLLAMTVGIKQKDNVDAIVKKFSD-NFDIMLFHYDGRVDEWDDFEWSDRAIHVSARKQTKWWF 104 (294)
T ss_pred eeecCCCccccccCCCceEEEEEecHHHHhhhhHHHhhhcc-CceEEEEEecCCcCchhhcccccceEEEEeccceEEee
Confidence 3556887764 22344555666665 445555555533 3344433 2333443321 011111
Q ss_pred HHHHHHhcc--ceeEEEEecCceeeehHHHHHHHh---hc--CcC-----CcccEEEEec----CCCeeecCCCCcccCc
Q psy10879 99 SFKYIYEHF--DFQYLLKCDDDTFVRVPNIIHELE---NK--FHY-----EKKLYWGFFD----GRARVRRAGKWKEKNW 162 (245)
Q Consensus 99 ~l~w~~~~~--~~~~~~k~DDD~~v~~~~L~~~L~---~~--~~~-----~~~~y~G~~~----~~~~~~r~~k~~~~~~ 162 (245)
+.++++=-. .++||+.-|||.-|.--...++|+ .. ... ....++-+.. ......|... ..+.+
T Consensus 105 akrfLHPdiv~~YdYiflwDeDL~vd~f~~~ry~~Ivk~~gLeISQPALd~~~~~~~~~iT~R~~~~~vhr~~~-~~~~~ 183 (294)
T PF05212_consen 105 AKRFLHPDIVAPYDYIFLWDEDLGVDHFDINRYFEIVKKEGLEISQPALDPDSSEIHHPITKRRPDSEVHRKTR-GGPRC 183 (294)
T ss_pred hhhhcChhhhccceeEEecCCccCcCcCCHHHHHHHHHHhCCcccCcccCCCCceeeeeEEeecCCceeEeccC-CCCCc
Confidence 222221111 799999999999986655555553 32 111 0112222211 1111112100 01123
Q ss_pred ccCCCCCCc---cCCCeeeecHHHHHHHHHhccc-CCCCCCChhHHHHHHHhC--CCeEee
Q psy10879 163 FLCDRYLPY---ALGGGYVLSHLLVKFISENSHM-LSSYISEDVSVGVWLSAL--DITRYH 217 (245)
Q Consensus 163 ~~~~~yp~y---~~G~gyvlS~~~~~~l~~~~~~-~~~~~~eDv~iG~~l~~~--~v~~~~ 217 (245)
....+.||+ ...++=|+||++.+.+....+. +...+-=|..+|.|+... +|..++
T Consensus 184 ~~~~~~ppct~fVEiMAPVFSr~Awrcvw~miqNDLvhGWGLDf~~~~c~~~~~~kiGVVD 244 (294)
T PF05212_consen 184 CDDSTGPPCTGFVEIMAPVFSRAAWRCVWHMIQNDLVHGWGLDFKWGYCAGDRHKKIGVVD 244 (294)
T ss_pred CCCCCCCCcceEEEEecceechHHHHHHHhcccCCCccccchhhhHHHHhccccccEEEEe
Confidence 334567775 3478889999999988775542 233345588899999532 444444
No 60
>cd02514 GT13_GLCNAC-TI GT13_GLCNAC-TI is involved in an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GLCNAC-T I , GNT-I) transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide, an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus. The catalytic domain is located at the C-terminus. These proteins are members of the glycosy transferase family 13.
Probab=76.64 E-value=55 Score=29.13 Aligned_cols=84 Identities=15% Similarity=0.154 Sum_probs=47.6
Q ss_pred HHHHHHHhccceeEEEEecCceeeehH---HHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcccCcccCCCCC-CccC
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVP---NIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKEKNWFLCDRYL-PYAL 173 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~---~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~~~~~~~~~yp-~y~~ 173 (245)
.++.|+.+..++++++.+|||..+.++ -+...|..+..+ +.+++ +.+.+ .+++......-+...|- .|+.
T Consensus 87 ~aln~vF~~~~~~~vIILEDDl~~sPdFf~yf~~~l~~y~~D-~~v~~--ISa~N---dnG~~~~~~~~~~~lyrs~ff~ 160 (334)
T cd02514 87 WALTQTFNLFGYSFVIILEDDLDIAPDFFSYFQATLPLLEED-PSLWC--ISAWN---DNGKEHFVDDTPSLLYRTDFFP 160 (334)
T ss_pred HHHHHHHHhcCCCEEEEECCCCccCHhHHHHHHHHHHHHhcC-CCEEE--EEeec---cCCcccccCCCcceEEEecCCC
Confidence 366666655579999999999999987 444555444443 33332 22211 01211100000111111 3678
Q ss_pred CCeeeecHHHHHHH
Q psy10879 174 GGGYVLSHLLVKFI 187 (245)
Q Consensus 174 G~gyvlS~~~~~~l 187 (245)
|.|.++.+++-+.+
T Consensus 161 glGWml~r~~W~e~ 174 (334)
T cd02514 161 GLGWMLTRKLWKEL 174 (334)
T ss_pred chHHHHHHHHHHHh
Confidence 99999999999887
No 61
>cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin. Chitin synthase, also called UDP-N-acetyl-D-glucosamine:chitin 4-beta-N-acetylglucosaminyltransferase, catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of GlcNAc residues formed by covalent beta-1,4 linkages. Chitin is an important component of the cell wall of fungi and bacteria and it is synthesized on the cytoplasmic surface of the cell membrane by membrane bound chitin synthases. Studies with fungi have revealed that most of them contain more than one chitin synthase gene. At least five subclasses of chitin synthases have been identified.
Probab=75.53 E-value=8.9 Score=31.94 Aligned_cols=110 Identities=13% Similarity=-0.004 Sum_probs=64.1
Q ss_pred ceeEEEEecCceeeehHHHHHHHhhcCcC-CcccEEEEecCCCeeecC-C-----CCcccC-----cccCCCCCCccCCC
Q psy10879 108 DFQYLLKCDDDTFVRVPNIIHELENKFHY-EKKLYWGFFDGRARVRRA-G-----KWKEKN-----WFLCDRYLPYALGG 175 (245)
Q Consensus 108 ~~~~~~k~DDD~~v~~~~L~~~L~~~~~~-~~~~y~G~~~~~~~~~r~-~-----k~~~~~-----~~~~~~yp~y~~G~ 175 (245)
+.+|++.+|.|+.+..+.|..++.....+ +-....|.+......... . .|.... ....-.....+.|+
T Consensus 73 ~~e~i~~~DaD~~~~~~~l~~l~~~~~~~p~vg~v~g~~~~~~~~~~~~~~~q~~ey~~~~~~~~~~~s~~g~~~~~~G~ 152 (244)
T cd04190 73 DPEFILLVDADTKFDPDSIVQLYKAMDKDPEIGGVCGEIHPMGKKQGPLVMYQVFEYAISHWLDKAFESVFGFVTCLPGC 152 (244)
T ss_pred CCCEEEEECCCCcCCHhHHHHHHHHHHhCCCEEEEEeeeEEcCCcchhHHHhHheehhhhhhhcccHHHcCCceEECCCc
Confidence 89999999999999999988888776322 022344543221110000 0 000000 00011234567899
Q ss_pred eeeecHHHHHHHHHhccc----------C-------CCCCCChhHHHHHHHhCCCeEee
Q psy10879 176 GYVLSHLLVKFISENSHM----------L-------SSYISEDVSVGVWLSALDITRYH 217 (245)
Q Consensus 176 gyvlS~~~~~~l~~~~~~----------~-------~~~~~eDv~iG~~l~~~~v~~~~ 217 (245)
+.++.+++++.+...... . .....||..++..+...|.+...
T Consensus 153 ~~~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ED~~l~~~l~~~G~~~~~ 211 (244)
T cd04190 153 FSMYRIEALKGDNGGKGPLLDYAYLTNTVDSLHKKNNLDLGEDRILCTLLLKAGPKRKY 211 (244)
T ss_pred eEEEEehhhcCCccccccchhhccccCcccchHHHHHHhHhcccceeHHHhccCCccEE
Confidence 999999998876332211 0 11247999999888877766544
No 62
>PLN03181 glycosyltransferase; Provisional
Probab=74.62 E-value=49 Score=30.38 Aligned_cols=99 Identities=17% Similarity=0.179 Sum_probs=59.8
Q ss_pred eeEEEEEeCCCCCHHHHHHHHHHHcCCcC----CCcEEEEEEeecCc--------hH-HHHHH--HHH-hhhCC-CEEEc
Q psy10879 20 YFIVILILTSSKNYIRRKNIRDTWIKNIE----KYQVKYLFSIGNKE--------NV-VDVKL--KEE-IRRYD-DILLL 82 (245)
Q Consensus 20 ~~lli~V~S~~~~~~~R~~IR~TW~~~~~----~~~~~~~Fv~g~~~--------~~-~~~~l--~~E-~~~~~-Dii~~ 82 (245)
+.+-..+-....+.+.+ |+.|.+..+ +.+-+++.|.|..+ +. +...+ ..+ +++|| ++...
T Consensus 95 p~~~ytl~~~i~~wD~k---R~~Wl~~~p~~~~~~~prVViVT~Sdp~~C~~~~gD~~LlriikNR~dYArrHGY~lf~~ 171 (453)
T PLN03181 95 PDLSYSIEKPIKNWDEK---RAEWLKLHPSFAPGAEERVVMVTGSQPTPCKNPIGDHLLLRFFKNKVDYCRIHGYDIFYN 171 (453)
T ss_pred CCCceecCCCcCCHHHH---HHHHHHhCCCCCCCCCCCEEEEECCCCCCCCCcccHHHHHHHHHHHHHHHHHhCCcEEEe
Confidence 33333333333454444 788887653 45567777777652 11 12222 112 66777 55532
Q ss_pred CCccc-cCCchhHHHHHHHHHHHhccceeEEEEecCceee
Q psy10879 83 NQVPD-EYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFV 121 (245)
Q Consensus 83 ~d~~d-~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v 121 (245)
....+ .+...+.|...+-.-+.++-+++||.-+|.|+++
T Consensus 172 ~a~Ld~~~p~~WaKipalRaAM~a~PeAEWfWWLDsDALI 211 (453)
T PLN03181 172 NALLHPKMNSYWAKLPVVRAAMLAHPEAEWIWWVDSDAVF 211 (453)
T ss_pred ccccCccCchhhhHHHHHHHHHHHCCCceEEEEecCCcee
Confidence 21223 5677788888777777665599999999999987
No 63
>cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified.
Probab=72.61 E-value=48 Score=26.60 Aligned_cols=44 Identities=16% Similarity=0.077 Sum_probs=30.3
Q ss_pred HHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEE
Q psy10879 100 FKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGF 144 (245)
Q Consensus 100 l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~ 144 (245)
.....+..+.+|++.+|+|..+..+.+...+...... ....+|.
T Consensus 76 ~N~g~~~a~gd~i~~lD~D~~~~~~~l~~~~~~~~~~-~~~~v~~ 119 (219)
T cd06913 76 KNQAIAQSSGRYLCFLDSDDVMMPQRIRLQYEAALQH-PNSIIGC 119 (219)
T ss_pred HHHHHHhcCCCEEEEECCCccCChhHHHHHHHHHHhC-CCcEEEE
Confidence 3444455588999999999999998888776655433 2334443
No 64
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional
Probab=68.54 E-value=1.4e+02 Score=30.25 Aligned_cols=117 Identities=11% Similarity=0.039 Sum_probs=67.7
Q ss_pred HHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecC--CCee-ecC-CCC--cccC---ccc-----
Q psy10879 99 SFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDG--RARV-RRA-GKW--KEKN---WFL----- 164 (245)
Q Consensus 99 ~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~--~~~~-~r~-~k~--~~~~---~~~----- 164 (245)
.+....++++.+|++..|.|+.+..+-|.+.+.....+.+-..++.... +..| .++ +.. ...+ +|.
T Consensus 330 nLN~aL~~a~GEyIavlDAD~ip~pdfL~~~V~~f~~dP~VglVQtp~~f~n~dp~~rnl~~~~~~~~e~~~fy~~iq~g 409 (852)
T PRK11498 330 NINNALKYAKGEFVAIFDCDHVPTRSFLQMTMGWFLKDKKLAMMQTPHHFFSPDPFERNLGRFRKTPNEGTLFYGLVQDG 409 (852)
T ss_pred HHHHHHHhCCCCEEEEECCCCCCChHHHHHHHHHHHhCCCeEEEEcceeccCCchHHHhhHHHhhcccchhHHHHHHHhH
Confidence 4566666678999999999999998888877755322102222221110 1101 111 000 0000 010
Q ss_pred CCC-CCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeec
Q psy10879 165 CDR-YLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHD 218 (245)
Q Consensus 165 ~~~-yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~ 218 (245)
.+. --.+|.|++.++.+++++.+---.. ....||..++.-+...|.+....
T Consensus 410 ~~~~~a~~~~Gs~aviRReaLeeVGGfd~---~titED~dlslRL~~~Gyrv~yl 461 (852)
T PRK11498 410 NDMWDATFFCGSCAVIRRKPLDEIGGIAV---ETVTEDAHTSLRLHRRGYTSAYM 461 (852)
T ss_pred HHhhcccccccceeeeEHHHHHHhcCCCC---CccCccHHHHHHHHHcCCEEEEE
Confidence 000 0135678999999999998843322 12469999999999998876643
No 65
>PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional
Probab=64.13 E-value=1e+02 Score=27.07 Aligned_cols=132 Identities=11% Similarity=-0.011 Sum_probs=67.5
Q ss_pred cEEEEEEeecCchHHHHHHHHHhhhCCC-EEEcCCccccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHH
Q psy10879 51 QVKYLFSIGNKENVVDVKLKEEIRRYDD-ILLLNQVPDEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHE 129 (245)
Q Consensus 51 ~~~~~Fv~g~~~~~~~~~l~~E~~~~~D-ii~~~d~~d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~ 129 (245)
..+++++-+.+.+...+.+.+.++.+++ ++.+. ...++ . |- .++....++++.+|++.+|.|.-..++.+.++
T Consensus 38 ~~EIIvVDDgS~D~T~~il~~~~~~~~~~v~~i~-~~~n~-G---~~-~A~~~G~~~A~gd~vv~~DaD~q~~p~~i~~l 111 (325)
T PRK10714 38 EYEILLIDDGSSDNSAEMLVEAAQAPDSHIVAIL-LNRNY-G---QH-SAIMAGFSHVTGDLIITLDADLQNPPEEIPRL 111 (325)
T ss_pred CEEEEEEeCCCCCcHHHHHHHHHhhcCCcEEEEE-eCCCC-C---HH-HHHHHHHHhCCCCEEEEECCCCCCCHHHHHHH
Confidence 4677777665553333333333344443 43221 12221 1 21 23333445568899999999999999999998
Q ss_pred HhhcCcCCcccEEEEecCC-CeeecC--CCCcc--cCcccCCCCCCccCCCeeeecHHHHHHHHHh
Q psy10879 130 LENKFHYEKKLYWGFFDGR-ARVRRA--GKWKE--KNWFLCDRYLPYALGGGYVLSHLLVKFISEN 190 (245)
Q Consensus 130 L~~~~~~~~~~y~G~~~~~-~~~~r~--~k~~~--~~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~ 190 (245)
++..... .++..|..... .++.|. ++... -.......++.. .++.-++++++++.+...
T Consensus 112 ~~~~~~~-~DvV~~~r~~~~~~~~r~~~s~~~~~l~~~~~g~~~~d~-~~gfr~~~r~~~~~l~~~ 175 (325)
T PRK10714 112 VAKADEG-YDVVGTVRQNRQDSWFRKTASKMINRLIQRTTGKAMGDY-GCMLRAYRRHIVDAMLHC 175 (325)
T ss_pred HHHHHhh-CCEEEEEEcCCCCcHHHHHHHHHHHHHHHHHcCCCCCCC-CcCeEEEcHHHHHHHHHC
Confidence 8876543 45555543221 111111 11000 000112223222 223358999999998643
No 66
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=62.14 E-value=73 Score=24.76 Aligned_cols=85 Identities=11% Similarity=0.076 Sum_probs=51.2
Q ss_pred HHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcc---------cC-cc--c
Q psy10879 98 HSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKE---------KN-WF--L 164 (245)
Q Consensus 98 ~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~---------~~-~~--~ 164 (245)
.+++++...- +.+|++..|.|+.+.++-|..++...... ..+..|...... +. ..|.. .. .. .
T Consensus 70 ~g~~~a~~~~~~~d~v~~~DaD~~~~p~~l~~l~~~~~~~-~~~v~g~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~ 145 (183)
T cd06438 70 FGFRHLLNLADDPDAVVVFDADNLVDPNALEELNARFAAG-ARVVQAYYNSKN-PD--DSWITRLYAFAFLVFNRLRPLG 145 (183)
T ss_pred HHHHHHHhcCCCCCEEEEEcCCCCCChhHHHHHHHHHhhC-CCeeEEEEeeeC-Cc--cCHHHHHHHHHHHHHHHHHHHH
Confidence 3444443222 68999999999999998888888777554 456666543211 11 11110 00 00 0
Q ss_pred C--CCCCCccCCCeeeecHHHHHH
Q psy10879 165 C--DRYLPYALGGGYVLSHLLVKF 186 (245)
Q Consensus 165 ~--~~yp~y~~G~gyvlS~~~~~~ 186 (245)
. -.-+.++.|++++++++++++
T Consensus 146 ~~~~~~~~~~~G~~~~~rr~~l~~ 169 (183)
T cd06438 146 RSNLGLSCQLGGTGMCFPWAVLRQ 169 (183)
T ss_pred HHHcCCCeeecCchhhhHHHHHHh
Confidence 0 112235789999999999988
No 67
>PHA01631 hypothetical protein
Probab=61.14 E-value=57 Score=25.95 Aligned_cols=94 Identities=10% Similarity=0.216 Sum_probs=53.0
Q ss_pred hCCCEEEcCCccccCCchhHHHHHHHHHHHhc---cceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCee
Q psy10879 75 RYDDILLLNQVPDEYSSLSQKVLHSFKYIYEH---FDFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARV 151 (245)
Q Consensus 75 ~~~Dii~~~d~~d~y~~l~~K~~~~l~w~~~~---~~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~ 151 (245)
-+.+|++.. ..+ .....+..-++..+.++ .+-+.++.+|.|++|+.-. .. .+. ..++.=+. |
T Consensus 38 ~~~~Ii~~~--t~~-e~Rr~RIAk~Ll~Iln~~s~i~DDi~~iIDSDV~ipn~~--~~----~~~-~~v~t~Ci-----P 102 (176)
T PHA01631 38 YQEKIIWIM--TNT-EIRWLRIAKQLLTIVNFAKNIEDDIIAIIDSDLIIPNLR--EI----IPN-ERVFTPCY-----W 102 (176)
T ss_pred cCCceEEec--ccc-hhHHHHHHHHHHHHHHhhccCCccEEEEeccceEecCcc--cc----ccC-CCccceee-----e
Confidence 355777542 222 22234444555566554 3678889999999997532 11 111 12222111 1
Q ss_pred ecCCCCcccCcccCCCCCCccCCCeeeecHHHHHHHHHhc
Q psy10879 152 RRAGKWKEKNWFLCDRYLPYALGGGYVLSHLLVKFISENS 191 (245)
Q Consensus 152 ~r~~k~~~~~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~ 191 (245)
.+ .|| .+..-+||.|--|++.++.+..|....
T Consensus 103 A~-~kp-------~~~v~~FC~sTNf~~pr~~l~~l~~vi 134 (176)
T PHA01631 103 LY-YDW-------ANEIRPFCSGTNYIFRKSLLPYLEYTI 134 (176)
T ss_pred ee-ecC-------CCcEEEEEccccEEeeHHHhHHHHHHH
Confidence 11 111 223447999999999999998887664
No 68
>PRK10073 putative glycosyl transferase; Provisional
Probab=60.24 E-value=1.2e+02 Score=26.62 Aligned_cols=105 Identities=12% Similarity=-0.024 Sum_probs=54.6
Q ss_pred eeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCCccccCCchhHHHHHH
Q psy10879 20 YFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKVLHS 99 (245)
Q Consensus 20 ~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~~~~ 99 (245)
+.|-|+|++--...--++.+..- .++. ...++++.+-..+++...+.+.+-++++..+..+. . .|. -.-.+
T Consensus 6 p~vSVIIP~yN~~~~L~~~l~Sl-~~Qt-~~~~EIIiVdDgStD~t~~i~~~~~~~~~~i~vi~--~---~n~--G~~~a 76 (328)
T PRK10073 6 PKLSIIIPLYNAGKDFRAFMESL-IAQT-WTALEIIIVNDGSTDNSVEIAKHYAENYPHVRLLH--Q---ANA--GVSVA 76 (328)
T ss_pred CeEEEEEeccCCHHHHHHHHHHH-HhCC-CCCeEEEEEeCCCCccHHHHHHHHHhhCCCEEEEE--C---CCC--ChHHH
Confidence 34556566543222223333322 2222 12356666654444333333333345555555432 1 121 12333
Q ss_pred HHHHHhccceeEEEEecCceeeehHHHHHHHhhc
Q psy10879 100 FKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENK 133 (245)
Q Consensus 100 l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~ 133 (245)
.....+.++.+|++++|+|-++.++.+..++...
T Consensus 77 rN~gl~~a~g~yi~flD~DD~~~p~~l~~l~~~~ 110 (328)
T PRK10073 77 RNTGLAVATGKYVAFPDADDVVYPTMYETLMTMA 110 (328)
T ss_pred HHHHHHhCCCCEEEEECCCCccChhHHHHHHHHH
Confidence 4445566688999999999999988777777654
No 69
>PF13704 Glyco_tranf_2_4: Glycosyl transferase family 2
Probab=58.89 E-value=49 Score=22.91 Aligned_cols=39 Identities=23% Similarity=0.281 Sum_probs=25.4
Q ss_pred cccCCchhHHHHHHHHHHHhcc-ceeEEEEecCceeeehHH
Q psy10879 86 PDEYSSLSQKVLHSFKYIYEHF-DFQYLLKCDDDTFVRVPN 125 (245)
Q Consensus 86 ~d~y~~l~~K~~~~l~w~~~~~-~~~~~~k~DDD~~v~~~~ 125 (245)
...+..-... ....+.+.+.. +++|++.+|-|-|+.++.
T Consensus 49 ~~~~~~~~~~-~~~~~~~~~~~~~~dWvl~~D~DEfl~~~~ 88 (97)
T PF13704_consen 49 VDPYRDERRQ-RAWRNALIERAFDADWVLFLDADEFLVPPP 88 (97)
T ss_pred CCCccchHHH-HHHHHHHHHhCCCCCEEEEEeeeEEEecCC
Confidence 4445432233 33444454454 899999999999998765
No 70
>COG4092 Predicted glycosyltransferase involved in capsule biosynthesis [Cell envelope biogenesis, outer membrane]
Probab=57.92 E-value=99 Score=26.80 Aligned_cols=179 Identities=13% Similarity=0.053 Sum_probs=98.5
Q ss_pred HHHHHHHHHHHcCCcC-CCcEEEEEEeecCchHHHHHHHHHhhhCCCEEEcCC-ccccCCchhHHHHHHHHHHHhcccee
Q psy10879 33 YIRRKNIRDTWIKNIE-KYQVKYLFSIGNKENVVDVKLKEEIRRYDDILLLNQ-VPDEYSSLSQKVLHSFKYIYEHFDFQ 110 (245)
Q Consensus 33 ~~~R~~IR~TW~~~~~-~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Dii~~~d-~~d~y~~l~~K~~~~l~w~~~~~~~~ 110 (245)
.+.|+..|-.-.+--. ...+.++++-|.. .....|..=......++.+.+ ..+.+..-+.--..+..|..+.|+..
T Consensus 19 ~~~R~f~~~~~~k~fts~~~~~vi~~~~~~--~~d~~i~~~i~~~~~~~yl~~~s~~~F~s~~~c~n~ga~Ysh~~~~Sn 96 (346)
T COG4092 19 TDSRQFSRTSAVKVFTSSDITMVICLRAHE--VMDRLIRSYIDPMPRVLYLDFGSPEPFASETICANNGADYSHEKCESN 96 (346)
T ss_pred hHHHHHhhHhhhhhccccccEEEEEEecch--hHHHHHHHHhccccceEEEecCCCccccchhhhhhccchhhhcccccc
Confidence 4556655544333322 3345555555543 233444444555555554532 23455554666778889999999999
Q ss_pred EEEEecCceeeehHHHHHHHhhcC-----cCC-cccEEEEecCCCeee-------cCCCCccc--C-----cccCCCCCC
Q psy10879 111 YLLKCDDDTFVRVPNIIHELENKF-----HYE-KKLYWGFFDGRARVR-------RAGKWKEK--N-----WFLCDRYLP 170 (245)
Q Consensus 111 ~~~k~DDD~~v~~~~L~~~L~~~~-----~~~-~~~y~G~~~~~~~~~-------r~~k~~~~--~-----~~~~~~yp~ 170 (245)
++++.|-|+|...++....|+-.. ... ..+.+- +....+.. .+-+|..- + .-.++.|.-
T Consensus 97 ~vlFlDvDc~~S~dnF~k~l~~~~ikk~~tnI~a~~vlP-V~~LNk~~~~v~f~~~d~f~d~~i~es~~~~~~~~~~ff~ 175 (346)
T COG4092 97 LVLFLDVDCFGSSDNFAKMLSIATIKKMRTNIDAPLVLP-VYHLNKADTQVFFDVEDMFLDAMIFESPLAEFRKEDNFFI 175 (346)
T ss_pred EEEEEeccccccHHHHHHHHHHHHHHHHHhccCcceeee-eeecchhhhhHHHHHHHHhhhhHhhhhHHHHhCccccccc
Confidence 999999999999999999884321 111 111211 11111100 01122211 1 001233311
Q ss_pred ccCCCeeeecHHHHHHHHHhcccCCCCCCChh-HHHHHHHhCCCe
Q psy10879 171 YALGGGYVLSHLLVKFISENSHMLSSYISEDV-SVGVWLSALDIT 214 (245)
Q Consensus 171 y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv-~iG~~l~~~~v~ 214 (245)
-...+.+++.++.-.+.--+.......+.||. ++.++...++..
T Consensus 176 ~~~T~~~liN~~~F~~tgGydE~F~GhG~EDfe~~~R~~l~~~~~ 220 (346)
T COG4092 176 APYTNIFLINRRMFSLTGGYDERFRGHGSEDFEFLTRLGLYIKNL 220 (346)
T ss_pred ccccceEEEehhHHHHhcCCccccccCCchhHHHHHHHHHHHhcc
Confidence 12345688888887777766777777889998 446666655443
No 71
>PHA02688 ORF059 IMV protein VP55; Provisional
Probab=56.28 E-value=16 Score=32.03 Aligned_cols=95 Identities=11% Similarity=0.048 Sum_probs=49.8
Q ss_pred hHHHHHHHH--HHHhcc--ceeEEEEecCceeeeh-HHHHHHHhhcCcCCcccEEEEe---cCCC--eeecCCCCcccCc
Q psy10879 93 SQKVLHSFK--YIYEHF--DFQYLLKCDDDTFVRV-PNIIHELENKFHYEKKLYWGFF---DGRA--RVRRAGKWKEKNW 162 (245)
Q Consensus 93 ~~K~~~~l~--w~~~~~--~~~~~~k~DDD~~v~~-~~L~~~L~~~~~~~~~~y~G~~---~~~~--~~~r~~k~~~~~~ 162 (245)
..+.++.|+ +..+.- +.+|++-++||+.++- ..+...+..+..+ .+.+=+. ..+. +..-..- +.+..
T Consensus 97 ia~HlsLWes~~~~~~k~~~~~yivVlEDDnTi~~~~~~~~~I~~M~~n--~idilQLre~~~~~~~~~~~~~~-~~~~~ 173 (323)
T PHA02688 97 IARHLSLWESYANADIKDKEDEYIVVVEDDNTLRDITTLHPIIKAMKEK--NIDILQLRETLHNNNVRTLLNQE-GNPAL 173 (323)
T ss_pred HHHHHHHHHhhccCCccccCCCeEEEEcCCCcccccHHHHHHHHHHHhc--CeEEEEeehhhhCCcccccccCC-CCcce
Confidence 345566666 333333 5899999999999875 3344444554432 2333222 1111 1100000 01111
Q ss_pred ccCCCCCC-c-cCCCeeeecHHHHHHHHHhccc
Q psy10879 163 FLCDRYLP-Y-ALGGGYVLSHLLVKFISENSHM 193 (245)
Q Consensus 163 ~~~~~yp~-y-~~G~gyvlS~~~~~~l~~~~~~ 193 (245)
..|++ | .+=++|+++...+.+|+.....
T Consensus 174 ---~~Y~ggydvSLsAYIIr~~~a~kl~~~~ik 203 (323)
T PHA02688 174 ---YSYTGGYDVSLSAYIIRVSTAKKLYDEIIK 203 (323)
T ss_pred ---EEecCCcceeeEEEEEeHHHHHHHHHHHHh
Confidence 11221 3 3448899999999999987543
No 72
>PF04666 Glyco_transf_54: N-Acetylglucosaminyltransferase-IV (GnT-IV) conserved region; InterPro: IPR006759 The complex-type of oligosaccharides are synthesised through elongation by glycosyltransferases after trimming of the precursor oligosaccharides transferred to proteins in the endoplasmic reticulum. N-Acetylglucosaminyltransferases (GnTs) take part in the formation of branches in the biosynthesis of complex-type sugar chains. In vertebrates, six GnTs, designated as GnT-I to -VI, which catalyse the transfer of GlcNAc to the core mannose residues of Asn-linked sugar chains, have been identified. GnT-IV (2.4.1.145 from EC) catalyzes the transfer of GlcNAc from UDP-GlcNAc to the GlcNAc1-2Man1-3 arm of core oligosaccharide [Gn2(22)core oligosaccharide] and forms a GlcNAc1-4(GlcNAc1-2)Man1-3 structure on the core oligosaccharide (Gn3(2,4,2)core oligosaccharide). In some members the conserved region occupies all but the very N-terminal, where there is a signal sequence on all members. For other members the conserved region does not occupy the entire protein but is still to the N-terminal end of the protein [].; GO: 0016758 transferase activity, transferring hexosyl groups, 0005975 carbohydrate metabolic process, 0016020 membrane
Probab=53.31 E-value=88 Score=27.35 Aligned_cols=23 Identities=17% Similarity=0.267 Sum_probs=17.7
Q ss_pred HHHhcc--ceeEEEEecCceeeehH
Q psy10879 102 YIYEHF--DFQYLLKCDDDTFVRVP 124 (245)
Q Consensus 102 w~~~~~--~~~~~~k~DDD~~v~~~ 124 (245)
++..+| .++||+-+.||+.....
T Consensus 161 ~Lm~y~~~~~~YyL~LEDDVia~~~ 185 (297)
T PF04666_consen 161 FLMNYCQNLGDYYLQLEDDVIAAPG 185 (297)
T ss_pred HHHHHHHhcCCeEEEecCCeEechh
Confidence 344566 68999999999887664
No 73
>KOG2547|consensus
Probab=51.35 E-value=56 Score=29.57 Aligned_cols=110 Identities=14% Similarity=0.243 Sum_probs=63.1
Q ss_pred CCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCchHHHHHHHHHhhhCCCE---EEcCCccccCCchhH
Q psy10879 18 HSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKENVVDVKLKEEIRRYDDI---LLLNQVPDEYSSLSQ 94 (245)
Q Consensus 18 ~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~~Di---i~~~d~~d~y~~l~~ 94 (245)
+.+.++--++-..+|. -+.+|.-.-. . ..+.+++|.+-.+++..-+.++.-..+|..+ +++++ ++. .+..
T Consensus 85 PgVSiikPl~G~d~nl--~~Nlesffts-~-Y~~~ElLfcv~s~eDpAi~vv~~Ll~kyp~VdAklf~gG-~~v--g~np 157 (431)
T KOG2547|consen 85 PGVSIIKPLKGVDPNL--YHNLESFFTS-Q-YHKYELLFCVESSEDPAIEVVERLLKKYPNVDAKLFFGG-EKV--GLNP 157 (431)
T ss_pred CCceEEeecccCCchh--HHhHHHHHhh-c-cCceEEEEEEccCCCcHHHHHHHHHhhCCCcceEEEEcc-ccc--ccCh
Confidence 4455555555554442 2233322222 2 2367899999887733334455556677633 12332 222 2224
Q ss_pred HHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcCc
Q psy10879 95 KVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKFH 135 (245)
Q Consensus 95 K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~~ 135 (245)
|.-.++-- ++..++++++..|+|+++..+.+.++...+..
T Consensus 158 KInN~mpg-y~~a~ydlvlisDsgI~m~pdtildm~t~M~s 197 (431)
T KOG2547|consen 158 KINNMMPG-YRAAKYDLVLISDSGIFMKPDTILDMATTMMS 197 (431)
T ss_pred hhhccCHH-HHHhcCCEEEEecCCeeecCchHHHHHHhhhc
Confidence 54444432 12336789999999999999999999888653
No 74
>PRK10018 putative glycosyl transferase; Provisional
Probab=50.21 E-value=1.6e+02 Score=25.18 Aligned_cols=36 Identities=17% Similarity=-0.020 Sum_probs=28.3
Q ss_pred HHHHHHhccceeEEEEecCceeeehHHHHHHHhhcC
Q psy10879 99 SFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKF 134 (245)
Q Consensus 99 ~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~ 134 (245)
+.....+.++.+|+++.|+|..+.++.|..++....
T Consensus 76 a~N~gi~~a~g~~I~~lDaDD~~~p~~l~~~~~~~~ 111 (279)
T PRK10018 76 VRNQAIMLAQGEYITGIDDDDEWTPNRLSVFLAHKQ 111 (279)
T ss_pred HHHHHHHHcCCCEEEEECCCCCCCccHHHHHHHHHH
Confidence 344445567899999999999999998888877653
No 75
>PF03213 Pox_P35: Poxvirus P35 protein; InterPro: IPR004900 The Poxvirus P35 protein is an immunodominant envelope protein. It binds to heparan sulphate on the cell surface to provide virion attachment to target cell [].; GO: 0019031 viral envelope
Probab=44.80 E-value=59 Score=28.58 Aligned_cols=94 Identities=13% Similarity=0.100 Sum_probs=52.6
Q ss_pred hhHHHHHHHHHHHhc--c--ceeEEEEecCceeeeh-HHHHHHHhhcCcCCcccEEEEecC---CCeeecCC--CCcccC
Q psy10879 92 LSQKVLHSFKYIYEH--F--DFQYLLKCDDDTFVRV-PNIIHELENKFHYEKKLYWGFFDG---RARVRRAG--KWKEKN 161 (245)
Q Consensus 92 l~~K~~~~l~w~~~~--~--~~~~~~k~DDD~~v~~-~~L~~~L~~~~~~~~~~y~G~~~~---~~~~~r~~--k~~~~~ 161 (245)
...+.++.|+...+- - +.+|++-++||+-+.- ..+...+..+.. ..+++=++.. +.. .|.. .-..+.
T Consensus 98 sia~HlsLWe~~~~~~~~~~~~~yivVvEddnT~~~~~~l~~~I~aM~~--k~idilQLre~~~~~~-~rt~~~~~~~p~ 174 (325)
T PF03213_consen 98 SIANHLSLWESISNADIKDPEDKYIVVVEDDNTLRDITTLHPIIKAMKK--KNIDILQLRETYHNSN-VRTLLPQEGNPS 174 (325)
T ss_pred HHHHHHHHHHHhhccCcccCCCCeEEEEeCCCcccccHHHHHHHHHHHH--cCceEEEEehhhhccc-cccccccccCcc
Confidence 345677788877322 2 6899999998877754 444445555544 2344433321 111 1110 001111
Q ss_pred cccCCCCCCccCC-----CeeeecHHHHHHHHHhcccC
Q psy10879 162 WFLCDRYLPYALG-----GGYVLSHLLVKFISENSHML 194 (245)
Q Consensus 162 ~~~~~~yp~y~~G-----~gyvlS~~~~~~l~~~~~~~ 194 (245)
.-.|.|| ++|++.-..+.+|+......
T Consensus 175 ------~~~Y~GGyD~SLsAYIIr~~~a~kl~~~i~~~ 206 (325)
T PF03213_consen 175 ------MYSYTGGYDVSLSAYIIRVSTAKKLYNEIIKN 206 (325)
T ss_pred ------eeeecCccceeeEEEEEEhHHHHHHHHHHHhC
Confidence 1124455 78999999999999886433
No 76
>PF00483 NTP_transferase: Nucleotidyl transferase This Prosite entry is only a sub-family of the Pfam entry.; InterPro: IPR005835 Nucleotidyl transferases transfer nucleotides from one compound to another. This domain is found in a number of enzymes that transfer nucleotides onto phosphosugars.; GO: 0016779 nucleotidyltransferase activity, 0009058 biosynthetic process; PDB: 1YP2_C 1YP4_D 1YP3_B 1H5S_D 1H5R_C 1H5T_C 2E3D_B 1JYL_C 1JYK_A 1MP5_C ....
Probab=43.22 E-value=1.7e+02 Score=23.97 Aligned_cols=94 Identities=10% Similarity=-0.002 Sum_probs=50.5
Q ss_pred HHHHHHHHHHhccc----eeEEEEecCceeeehHHHHHHHhhcCcCCc--ccEEEEecCCC-----eeecCCCCcccCcc
Q psy10879 95 KVLHSFKYIYEHFD----FQYLLKCDDDTFVRVPNIIHELENKFHYEK--KLYWGFFDGRA-----RVRRAGKWKEKNWF 163 (245)
Q Consensus 95 K~~~~l~w~~~~~~----~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~--~~y~G~~~~~~-----~~~r~~k~~~~~~~ 163 (245)
-+-.++..+..... .+.++.+.+|++... .+.++|+.+..... .+.++...... ....++.-.+.+..
T Consensus 85 Gta~al~~a~~~i~~~~~~~~~lv~~gD~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~d~~~~V~~~~ 163 (248)
T PF00483_consen 85 GTAGALLQALDFIEEEDDDEDFLVLNGDIIFDD-DLQDMLEFHRESNADGTVTLLVVPVEDPSRYGVVEVDEDGRVIRIV 163 (248)
T ss_dssp CHHHHHHHTHHHHTTSEE-SEEEEETTEEEEST-THHHHHHHHHHHSSCESEEEEEEESSGGGGSEEEEEETTSEEEEEE
T ss_pred chhHHHHHHHHHhhhccccceEEEEeccccccc-hhhhHHHhhhccccccccccccccccccccceeeeeccceeEEEEe
Confidence 44555555555552 346899999998888 77777766543212 23444332111 11111111111111
Q ss_pred cCCCC---CCccCCCeeeecHHHHHHHHH
Q psy10879 164 LCDRY---LPYALGGGYVLSHLLVKFISE 189 (245)
Q Consensus 164 ~~~~y---p~y~~G~gyvlS~~~~~~l~~ 189 (245)
..+.- ..++.++.|+|++.++..+.+
T Consensus 164 EKP~~~~~~~~~~~G~Y~~~~~~~~~~~~ 192 (248)
T PF00483_consen 164 EKPDNPNASNLINTGIYIFKPEIFDFLLE 192 (248)
T ss_dssp ESCSSHSHSSEEEEEEEEEETHHHHHHHH
T ss_pred ccCcccccceeccCceEEEcchHHHHHhh
Confidence 11111 236778889999999998866
No 77
>PF03071 GNT-I: GNT-I family; InterPro: IPR004139 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) 2.4.1.101 from EC transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus, and is probably distributed in all tissues. The catalytic domain is located at the C terminus []. These proteins are members of the glycosyl transferase family 13 (GH13 from CAZY); GO: 0003827 alpha-1,3-mannosylglycoprotein 2-beta-N-acetylglucosaminyltransferase activity, 0006487 protein N-linked glycosylation, 0000139 Golgi membrane; PDB: 2APC_A 2AM4_A 1FO9_A 2AM3_A 1FOA_A 2AM5_A 1FO8_A.
Probab=41.49 E-value=2.7e+02 Score=25.84 Aligned_cols=92 Identities=11% Similarity=0.087 Sum_probs=43.4
Q ss_pred HHHHHHHHHHHhcc----ceeEEEEecCceeeehHHHHHHHhhc---CcCCcccEEEEecC-CCeeecCCCCcccCcccC
Q psy10879 94 QKVLHSFKYIYEHF----DFQYLLKCDDDTFVRVPNIIHELENK---FHYEKKLYWGFFDG-RARVRRAGKWKEKNWFLC 165 (245)
Q Consensus 94 ~K~~~~l~w~~~~~----~~~~~~k~DDD~~v~~~~L~~~L~~~---~~~~~~~y~G~~~~-~~~~~r~~k~~~~~~~~~ 165 (245)
.|.-.-++|+.++. +++.++.++||.-+-++=+.-+-... ..+ ..+++=.-.. ++....-..-.....|..
T Consensus 173 ~~IA~HYk~aL~~vF~~~~~~~vIIlEDDL~isPDFf~Yf~~~~~ll~~D-~sl~ciSawNdnG~~~~~~~~~~~~lyRs 251 (434)
T PF03071_consen 173 YKIARHYKWALSQVFNKFKYSSVIILEDDLEISPDFFEYFSATLPLLEND-PSLWCISAWNDNGKEHFVDDSRPSLLYRS 251 (434)
T ss_dssp HHHHHHHHHHHHHHHHTS--SEEEEEETTEEE-TTHHHHHHHHHHHHHH--TTEEEEES--TT-BGGGS-TT-TT-EEEE
T ss_pred HHHHHHHHHHHHHHHHhcCCceEEEEecCcccCccHHHHHHHHHHHHhcC-CCeEEEEccccCCccccccCCCccceEec
Confidence 34556666766654 67889999999999775443332221 222 3455422111 111000000001112222
Q ss_pred CCCCCccCCCeeeecHHHHHHHHHh
Q psy10879 166 DRYLPYALGGGYVLSHLLVKFISEN 190 (245)
Q Consensus 166 ~~yp~y~~G~gyvlS~~~~~~l~~~ 190 (245)
+ |..|-|.+|++++-..|...
T Consensus 252 d----ffpglGWml~r~~w~el~~~ 272 (434)
T PF03071_consen 252 D----FFPGLGWMLTRELWDELEPK 272 (434)
T ss_dssp S----S---SSEEEEHHHHHHHGGG
T ss_pred c----cCCchHHHhhHHHHHhhccc
Confidence 2 55789999999999866544
No 78
>COG3306 Glycosyltransferase involved in LPS biosynthesis [Cell envelope biogenesis, outer membrane]
Probab=34.63 E-value=44 Score=28.49 Aligned_cols=52 Identities=12% Similarity=-0.055 Sum_probs=33.3
Q ss_pred ccCCCeeeecHHHHHHHHHhcccCCCC-CCChhHHHHHHHhCCCeE-eecCCcc
Q psy10879 171 YALGGGYVLSHLLVKFISENSHMLSSY-ISEDVSVGVWLSALDITR-YHDVRFD 222 (245)
Q Consensus 171 y~~G~gyvlS~~~~~~l~~~~~~~~~~-~~eDv~iG~~l~~~~v~~-~~~~~f~ 222 (245)
+.+-+||++|+.+++.+++.....+.. +.|+.+.-.++...|... ...|..+
T Consensus 154 ~~gt~gYiis~~aAk~fl~~~~~~~~~~pvD~~~~~~~~~~~~~~vyq~~p~~~ 207 (255)
T COG3306 154 HLGTAGYIISRKAAKKFLELTESFKVVLPVDWFMFLEFLHGVGNKVYQPFPAIC 207 (255)
T ss_pred ccCccceeecHHHHHHHHHHhhhcccccChhHHHhhhhccccCceEEEeccccc
Confidence 467899999999999999998754443 455555445553434333 2344444
No 79
>PF03452 Anp1: Anp1; InterPro: IPR005109 The members of this family (Anp1, Van1 and Mnn9) are membrane proteins required for proper Golgi function. These proteins colocalize within the cis Golgi, where they are physically associated in two distinct complexes [].
Probab=33.60 E-value=3.1e+02 Score=23.61 Aligned_cols=127 Identities=11% Similarity=0.131 Sum_probs=69.5
Q ss_pred CCCCCCCeeEEEEEeCCCCCHHHHHHHHHHHcCCcC----CCcEEEEEEeecCc--hHHHHHHHHH----------hhhC
Q psy10879 13 FKNLKHSYFIVILILTSSKNYIRRKNIRDTWIKNIE----KYQVKYLFSIGNKE--NVVDVKLKEE----------IRRY 76 (245)
Q Consensus 13 ~~~c~~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~----~~~~~~~Fv~g~~~--~~~~~~l~~E----------~~~~ 76 (245)
+..-..+..|||+.+-.-.. +- +-+-|-+-.. ...+.+-|+++.+. ....+.++.+ ...|
T Consensus 18 ~~~~~~~e~VLILtplrna~--~~--l~~y~~~L~~L~YP~~lIsLgfLv~d~~e~d~t~~~l~~~~~~~q~~~~~~~~F 93 (269)
T PF03452_consen 18 SDAARNKESVLILTPLRNAA--SF--LPDYFDNLLSLTYPHELISLGFLVSDSSEFDNTLKILEAALKKLQSHGPESKRF 93 (269)
T ss_pred CcccccCCeEEEEEecCCch--HH--HHHHHHHHHhCCCCchheEEEEEcCCCchhHHHHHHHHHHHHHHhccCcccCCc
Confidence 33445555666555442211 11 3333433321 45689999999987 4444444433 3456
Q ss_pred CCEEEcC-CccccC-------C-----chhHHHHHHH-HHHHhcc---ceeEEEEecCceeeehHHHHHHHhhcCcCCcc
Q psy10879 77 DDILLLN-QVPDEY-------S-----SLSQKVLHSF-KYIYEHF---DFQYLLKCDDDTFVRVPNIIHELENKFHYEKK 139 (245)
Q Consensus 77 ~Dii~~~-d~~d~y-------~-----~l~~K~~~~l-~w~~~~~---~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~ 139 (245)
+.|..+. ||.+.- + ..-++.++-. .|+.... ..+|++-.|-|+.-.++.|++.|-..+. +
T Consensus 94 ~~itIl~~df~~~~~~~~~~RH~~~~Q~~RR~~mAraRN~LL~~aL~p~~swVlWlDaDIv~~P~~lI~dli~~~k---d 170 (269)
T PF03452_consen 94 RSITILRKDFGQQLSQDRSERHAFEVQRPRRRAMARARNFLLSSALGPWHSWVLWLDADIVETPPTLIQDLIAHDK---D 170 (269)
T ss_pred ceEEEEcCCCcccccCchhhccchhhHHHHHHHHHHHHHHHHHhhcCCcccEEEEEecCcccCChHHHHHHHhCCC---C
Confidence 6776442 443321 0 0111222111 2333332 6899999999999999999998877654 4
Q ss_pred cEEEEec
Q psy10879 140 LYWGFFD 146 (245)
Q Consensus 140 ~y~G~~~ 146 (245)
+++=+|.
T Consensus 171 IivPn~~ 177 (269)
T PF03452_consen 171 IIVPNCW 177 (269)
T ss_pred EEcccee
Confidence 5554443
No 80
>PF03142 Chitin_synth_2: Chitin synthase; InterPro: IPR004835 Chitin synthase (2.4.1.16 from EC), also known as chitin-UDP acetyl-glucosaminyl transferase, is a plasma membrane-bound protein which catalyses the conversion of UDP-N-acettyl-D-glucosamine and {(1,4)-(N-acetyl- beta-D-glucosaminyl)}(N) to UDP and {(1,4)-(N-acetyl-beta-D- glucosaminyl)}(N+1). It plays a major role in cell wall biogenesis. ; GO: 0016758 transferase activity, transferring hexosyl groups
Probab=33.36 E-value=3e+02 Score=26.26 Aligned_cols=29 Identities=17% Similarity=0.195 Sum_probs=26.3
Q ss_pred ceeEEEEecCceeeehHHHHHHHhhcCcC
Q psy10879 108 DFQYLLKCDDDTFVRVPNIIHELENKFHY 136 (245)
Q Consensus 108 ~~~~~~k~DDD~~v~~~~L~~~L~~~~~~ 136 (245)
.++|++-+|-||-+.++.+.+++.....+
T Consensus 201 ~~~~il~~DaDt~~~p~~~~~lv~~m~~d 229 (527)
T PF03142_consen 201 FYEYILMVDADTKFDPDSVNRLVDAMERD 229 (527)
T ss_pred ceEEEEEecCCceEcHHHHHHHHHHHcCC
Confidence 68999999999999999999999987653
No 81
>PF06306 CgtA: Beta-1,4-N-acetylgalactosaminyltransferase (CgtA); InterPro: IPR010446 This family consists of several beta-1,4-N-acetylgalactosaminyltransferase proteins from Campylobacter jejuni [].
Probab=33.15 E-value=64 Score=28.57 Aligned_cols=66 Identities=11% Similarity=0.237 Sum_probs=42.5
Q ss_pred HHHHHHhhhCCCEEEc--C-C-ccccCCchhHHHHHHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhh
Q psy10879 67 VKLKEEIRRYDDILLL--N-Q-VPDEYSSLSQKVLHSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELEN 132 (245)
Q Consensus 67 ~~l~~E~~~~~Dii~~--~-d-~~d~y~~l~~K~~~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~ 132 (245)
+.|.+=.++|.+-+-+ | . ....-.+...+...-..|+.... +.+|++|+|.|-......|...+-.
T Consensus 129 Eiil~fckkyP~fip~~Ypy~v~~~n~~~~~n~l~~YYNy~ls~ipk~~w~iKID~DhIy~~~KL~ksfY~ 199 (347)
T PF06306_consen 129 EIILEFCKKYPSFIPIKYPYEVIIKNPKSEENSLYNYYNYVLSFIPKNEWAIKIDADHIYDTKKLYKSFYI 199 (347)
T ss_pred HHHHHHHHhCcccccccCcchhhccCCchhhhhhhhhhhhhhcccccceEEEEeccceeecHHHHhhhhee
Confidence 3344446677765522 1 0 00111223345667778887777 8999999999999999999776543
No 82
>PTZ00260 dolichyl-phosphate beta-glucosyltransferase; Provisional
Probab=31.50 E-value=3.6e+02 Score=23.67 Aligned_cols=192 Identities=14% Similarity=0.074 Sum_probs=93.0
Q ss_pred CCCeeEEEEEeCCCCCHHHHHHHHHHHcCC------cCCCcEEEEEEeecCchHHHHHHHHHhhhC----CCEEEcCCcc
Q psy10879 17 KHSYFIVILILTSSKNYIRRKNIRDTWIKN------IEKYQVKYLFSIGNKENVVDVKLKEEIRRY----DDILLLNQVP 86 (245)
Q Consensus 17 ~~~~~lli~V~S~~~~~~~R~~IR~TW~~~------~~~~~~~~~Fv~g~~~~~~~~~l~~E~~~~----~Dii~~~d~~ 86 (245)
...+.+-|+|++--+...-...++++.... ......++++|-..+.+...+.+.+-.+.+ .++-.+. ..
T Consensus 67 ~~~~~isVVIP~yNe~~~i~~~L~~l~~~~~~~~~~~~~~~~EIIVVDDgStD~T~~i~~~~~~~~~~~~~~i~vi~-~~ 145 (333)
T PTZ00260 67 DSDVDLSIVIPAYNEEDRLPKMLKETIKYLESRSRKDPKFKYEIIIVNDGSKDKTLKVAKDFWRQNINPNIDIRLLS-LL 145 (333)
T ss_pred CCCeEEEEEEeeCCCHHHHHHHHHHHHHHHHhhhccCCCCCEEEEEEeCCCCCchHHHHHHHHHhcCCCCCcEEEEE-cC
Confidence 345567777766544433344666655421 112245666665444432222222223332 1233221 11
Q ss_pred ccCCchhHHHHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcC----cCCcccEEEEecCCCe--eecCCCCcc-
Q psy10879 87 DEYSSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKF----HYEKKLYWGFFDGRAR--VRRAGKWKE- 159 (245)
Q Consensus 87 d~y~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~----~~~~~~y~G~~~~~~~--~~r~~k~~~- 159 (245)
.|. .|- .+++...+....+|++.+|.|....++.+..++.... +. .++.+|....... ......|+.
T Consensus 146 ---~N~-G~~-~A~~~Gi~~a~gd~I~~~DaD~~~~~~~l~~l~~~l~~~~~~~-~dvV~GsR~~~~~~~~~~~~~~~r~ 219 (333)
T PTZ00260 146 ---RNK-GKG-GAVRIGMLASRGKYILMVDADGATDIDDFDKLEDIMLKIEQNG-LGIVFGSRNHLVDSDVVAKRKWYRN 219 (333)
T ss_pred ---CCC-ChH-HHHHHHHHHccCCEEEEEeCCCCCCHHHHHHHHHHHHHhhccC-CceEEeeccccccCcccccCcHHHH
Confidence 221 121 2233333445789999999999998877666665542 22 4678887542110 000111210
Q ss_pred --c-------CcccCCCCCCccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEeec
Q psy10879 160 --K-------NWFLCDRYLPYALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYHD 218 (245)
Q Consensus 160 --~-------~~~~~~~yp~y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~~ 218 (245)
. ........+ =...+.-++++++++.++.... ...+ .-|+.+-..+.+.|.+..+.
T Consensus 220 ~~~~~~~~l~~~~~~~~i~-D~~~Gfk~~~r~~~~~i~~~~~-~~~~-~fd~Ell~~a~~~g~~I~Ev 284 (333)
T PTZ00260 220 ILMYGFHFIVNTICGTNLK-DTQCGFKLFTRETARIIFPSLH-LERW-AFDIEIVMIAQKLNLPIAEV 284 (333)
T ss_pred HHHHHHHHHHHHHcCCCcc-cCCCCeEEEeHHHHHHHhhhcc-ccCc-cchHHHHHHHHHcCCCEEEE
Confidence 0 011111221 1234556999999998865432 2222 23666666666777655443
No 83
>PF13712 Glyco_tranf_2_5: Glycosyltransferase like family; PDB: 2QGI_A 2NXV_B.
Probab=31.43 E-value=59 Score=26.83 Aligned_cols=115 Identities=11% Similarity=0.039 Sum_probs=54.0
Q ss_pred HHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcC-cCCcccEEEEecCCCeeecCCCCccc---------------
Q psy10879 97 LHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKF-HYEKKLYWGFFDGRARVRRAGKWKEK--------------- 160 (245)
Q Consensus 97 ~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~-~~~~~~y~G~~~~~~~~~r~~k~~~~--------------- 160 (245)
-.++..+.+..+++|++.++||+++.-+++...|...- ...+--++|-.-....|....-|..+
T Consensus 43 ~~~yN~a~~~a~~~ylvflHqDv~i~~~~~l~~il~~~~~~~~~G~iGvaG~~~~~~~~~~w~~~~~~g~~~~~~~~~~~ 122 (217)
T PF13712_consen 43 AAAYNEAMEKAKAKYLVFLHQDVFIINENWLEDILEIFEEDPNIGMIGVAGSKRLPPNGVWWESPNKVGKVREYGRIMHG 122 (217)
T ss_dssp TTHHHHHGGG--SSEEEEEETTEE-SSHHHHHHHHHHHHH-TTEEEEESEEEESS-S-TTS---EEEEEETTEEEE----
T ss_pred HHHHHHHHHhCCCCEEEEEeCCeEEcchhHHHHHHHHHhhCCCccEEEeecCCcCCCCCccccccccccccccccccccc
Confidence 45777777778999999999999997666544443332 22023344532211111111111110
Q ss_pred -------C-cccC---CCCCC--ccCCCeeeecHHHHHHHHHhccc-CCCCCCChhHHHHHHHhCCCeE
Q psy10879 161 -------N-WFLC---DRYLP--YALGGGYVLSHLLVKFISENSHM-LSSYISEDVSVGVWLSALDITR 215 (245)
Q Consensus 161 -------~-~~~~---~~yp~--y~~G~gyvlS~~~~~~l~~~~~~-~~~~~~eDv~iG~~l~~~~v~~ 215 (245)
+ .+.. ..+-+ -..|..+++.+++. ..... +..+..-|+-.-.-+++.|-..
T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~V~avDg~ll~~~~dv~----~fde~~~~gfH~Ydvd~cl~~~~~G~~v 187 (217)
T PF13712_consen 123 HGPNSAGEVRYGGPRNDPPEEVQAVDGLLLATQKDVP----RFDEDLFTGFHFYDVDQCLEARRAGYRV 187 (217)
T ss_dssp E-------------ES-SSEEEEEE-TTEEEEETTB---------SS--SSSSHHHHHHHHHHHTT-EE
T ss_pred ccccccccccccccccCCceeEEEecceEEEEEcccC----CCCccccCCcchHHHHHHHHHHHhCCEE
Confidence 0 0000 01111 13478888888877 33322 4667778888877777777654
No 84
>PF03490 Varsurf_PPLC: Variant-surface-glycoprotein phospholipase C; InterPro: IPR003633 Variant-surface-glycoprotein phospholipase C, by hydrolysis of the attached glycolipid, releases soluble variant surface glycoprotein containing phosphoinositol from the cell wall after lysis. It catalyses the conversion of variant-surface-glycoprotein 1,2 didecanoyl-SN-phosphatidylinositol and water to 1,2-didecanoylglycerol and the soluble variant-surface-glycoprotein. It also cleaves similar membrane anchors on some mammalian proteins.; GO: 0047396 glycosylphosphatidylinositol diacylglycerol-lyase activity, 0006650 glycerophospholipid metabolic process
Probab=31.13 E-value=28 Score=21.74 Aligned_cols=22 Identities=18% Similarity=0.644 Sum_probs=17.9
Q ss_pred HHHcCCcC----CCcEEEEEEeecCc
Q psy10879 41 DTWIKNIE----KYQVKYLFSIGNKE 62 (245)
Q Consensus 41 ~TW~~~~~----~~~~~~~Fv~g~~~ 62 (245)
++|+++.+ +..+..+|++|...
T Consensus 9 qSWM~DLrS~I~~~~I~ql~ipGsHn 34 (51)
T PF03490_consen 9 QSWMSDLRSSIGEMAITQLFIPGSHN 34 (51)
T ss_pred HHHHHHHHHHHhcceeeeEEeccccc
Confidence 68888874 67789999999875
No 85
>cd06436 GlcNAc-1-P_transferase N-acetyl-glucosamine transferase is involved in the synthesis of Poly-beta-1,6-N-acetyl-D-glucosamine. N-acetyl-glucosamine transferase is responsible for the synthesis of bacteria Poly-beta-1,6-N-acetyl-D-glucosamine (PGA). Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer that serves as an adhesion for the maintenance of biofilm structural stability in diverse eubacteria. N-acetyl-glucosamine transferase is the product of gene pgaC. Genetic analysis indicated that all four genes of the pgaABCD locus were required for the PGA production, pgaC being a glycosyltransferase.
Probab=30.88 E-value=33 Score=27.19 Aligned_cols=76 Identities=8% Similarity=-0.144 Sum_probs=42.5
Q ss_pred eeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecCCCCcc----cCccc--------CCCC-CCccCCC
Q psy10879 109 FQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRAGKWKE----KNWFL--------CDRY-LPYALGG 175 (245)
Q Consensus 109 ~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~~k~~~----~~~~~--------~~~y-p~y~~G~ 175 (245)
.+|++.+|.|+.+..+.|.........+.-....|.+... .++.+|.. -++.. .... .-+++|.
T Consensus 90 ~d~v~~~DaD~~~~~~~l~~~~~~~~~~~v~~v~~~~~~~---~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~G~ 166 (191)
T cd06436 90 RVIIAVIDADGRLDPNALEAVAPYFSDPRVAGTQSRVRMY---NRHKNLLTILQDLEFFIIIAATQSLRALTGTVGLGGN 166 (191)
T ss_pred ccEEEEECCCCCcCHhHHHHHHHhhcCCceEEEeeeEEEe---cCCCCHHHHHHHHHHHHHHHHHHHHHHhcCcEEECCe
Confidence 4799999999999999888855544322012222332111 11222311 01110 0001 1135899
Q ss_pred eeeecHHHHHHH
Q psy10879 176 GYVLSHLLVKFI 187 (245)
Q Consensus 176 gyvlS~~~~~~l 187 (245)
|.++++++++.+
T Consensus 167 ~~~~r~~~l~~v 178 (191)
T cd06436 167 GQFMRLSALDGL 178 (191)
T ss_pred eEEEeHHHHHHh
Confidence 999999999988
No 86
>cd02511 Beta4Glucosyltransferase UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide. UDP-glucose: lipooligosaccharide (LOS) beta-1-4-glucosyltransferase catalyzes the addition of the first residue, glucose, of the lacto-N-neotetrase structure to HepI of the LOS inner core. LOS is the major constituent of the outer leaflet of the outer membrane of gram-positive bacteria. It consists of a short oligosaccharide chain of variable composition (alpha chain) attached to a branched inner core which is lined in turn to lipid A. Beta 1,4 glucosyltransferase is required to attach the alpha chain to the inner core.
Probab=29.41 E-value=1.3e+02 Score=24.56 Aligned_cols=37 Identities=14% Similarity=-0.008 Sum_probs=28.5
Q ss_pred HHHHHHHhccceeEEEEecCceeeehHHHHHHHhhcC
Q psy10879 98 HSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENKF 134 (245)
Q Consensus 98 ~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~~ 134 (245)
.+..++.+.+..+|++.+|.|..+..+.+..++....
T Consensus 61 ~~~n~~~~~a~~d~vl~lDaD~~~~~~~~~~l~~~~~ 97 (229)
T cd02511 61 AQRNFALELATNDWVLSLDADERLTPELADEILALLA 97 (229)
T ss_pred HHHHHHHHhCCCCEEEEEeCCcCcCHHHHHHHHHHHh
Confidence 4556677777888999999999998877766665544
No 87
>PF09258 Glyco_transf_64: Glycosyl transferase family 64 domain; InterPro: IPR015338 Members of this entry catalyse the transfer reaction of N-acetylglucosamine and N-acetylgalactosamine from the respective UDP-sugars to the non-reducing end of [glucuronic acid]beta 1-3[galactose]beta 1-O-naphthalenemethanol, an acceptor substrate analogue of the natural common linker of various glycosylaminoglycans. They are also required for the biosynthesis of heparan-sulphate []. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0031227 intrinsic to endoplasmic reticulum membrane; PDB: 1ON6_B 1OMZ_B 1OMX_B 1ON8_B.
Probab=28.72 E-value=92 Score=26.32 Aligned_cols=104 Identities=15% Similarity=0.235 Sum_probs=53.4
Q ss_pred ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeee-cCCCCcccCcccCCCCCCc--cCCCeeeecHHHH
Q psy10879 108 DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVR-RAGKWKEKNWFLCDRYLPY--ALGGGYVLSHLLV 184 (245)
Q Consensus 108 ~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~-r~~k~~~~~~~~~~~yp~y--~~G~gyvlS~~~~ 184 (245)
+.+-++-+|||+.+..+.|...+...... .+-++|...+..... ..++|.-. ...--.| ...++-++.+.-+
T Consensus 75 ~T~AVl~~DDDv~~~~~~l~faF~~W~~~-pdrlVGf~~R~h~~~~~~~~~~Y~----~~~~~~ySmvLt~aaf~h~~yl 149 (247)
T PF09258_consen 75 ETDAVLSLDDDVMLSCDELEFAFQVWREF-PDRLVGFPPRSHSWDPSSGRWKYT----SEWSNEYSMVLTGAAFYHRYYL 149 (247)
T ss_dssp -SSEEEEEETTEEE-HHHHHHHHHHHCCS-TTSEEES-EEEEEEE-ETTEEEEE-----SSS--BSEE-TTEEEEETHHH
T ss_pred CcceEEEecCCcccCHHHHHHHHHHHHhC-hhheeCCccceeecCCCccccccc----cCCCCcchhhhhhhHhhcchHH
Confidence 67889999999999999998877776655 567777543222111 12333211 0000113 3355666677665
Q ss_pred HHHHHhccc------CCCCCCChhHHHHHHHhC-CCeEe
Q psy10879 185 KFISENSHM------LSSYISEDVSVGVWLSAL-DITRY 216 (245)
Q Consensus 185 ~~l~~~~~~------~~~~~~eDv~iG~~l~~~-~v~~~ 216 (245)
......... -....=||+.+--++..+ |-.|+
T Consensus 150 ~~Y~~~~p~~~r~~Vd~~~NCEDI~mNflvs~~T~~pPi 188 (247)
T PF09258_consen 150 ELYTHWLPASIREYVDEHFNCEDIAMNFLVSNLTGKPPI 188 (247)
T ss_dssp HHHHT-S-HHHHHHHHHHTS-HHHHHHHHHHHHHSS-SE
T ss_pred HHHhcCcHHHHHHHHhccCCHHHHHHHHHHHHhccCCCC
Confidence 543321110 113456899886665543 44443
No 88
>KOG2336|consensus
Probab=28.63 E-value=54 Score=28.59 Aligned_cols=38 Identities=21% Similarity=0.280 Sum_probs=27.2
Q ss_pred CCCCCCCCCCCCeeEEEEEeCCCCCHHHHHHH-------HHHHcCCc
Q psy10879 8 TGLSKFKNLKHSYFIVILILTSSKNYIRRKNI-------RDTWIKNI 47 (245)
Q Consensus 8 ~~l~~~~~c~~~~~lli~V~S~~~~~~~R~~I-------R~TW~~~~ 47 (245)
.++..-..|+.++ +=+|+|+.+|++-|.+| ++||+...
T Consensus 171 ~~is~g~~~~gkp--vDLVLSCVDNfEARMavN~ACNE~~q~WmESG 215 (422)
T KOG2336|consen 171 DRISNGSLCPGKP--VDLVLSCVDNFEARMAVNQACNELNQTWMESG 215 (422)
T ss_pred HHhhcCCCCCCCc--ceEEeeehhhHHHHHHHHHHHHHhhhHHHHcc
Confidence 3455555676553 33488999999999976 56999875
No 89
>PLN03182 xyloglucan 6-xylosyltransferase; Provisional
Probab=27.61 E-value=4.9e+02 Score=24.00 Aligned_cols=87 Identities=15% Similarity=0.243 Sum_probs=56.4
Q ss_pred CCHHHHHHHHHHHcCCcC------CCcEEEEEEeecCc-h--------HHHHHHH---HHhhhCCCEEEcCCc---cccC
Q psy10879 31 KNYIRRKNIRDTWIKNIE------KYQVKYLFSIGNKE-N--------VVDVKLK---EEIRRYDDILLLNQV---PDEY 89 (245)
Q Consensus 31 ~~~~~R~~IR~TW~~~~~------~~~~~~~Fv~g~~~-~--------~~~~~l~---~E~~~~~Dii~~~d~---~d~y 89 (245)
.+.+.+ |+.|.+..+ ..+.+++.|.|..+ + -++..++ .=+++||=-+.. ++ .+..
T Consensus 103 ~~wd~~---R~~wl~~~p~~~~~~~g~prVviVT~sdp~~c~n~~gd~yLlks~kNK~dYAr~HGY~~fy-n~~~ld~~~ 178 (429)
T PLN03182 103 SDWDEQ---RRRWLRKNPGFPSFVNGKPRVLLVTGSQPKPCENPVGDHYLLKSLKNKIDYCRLHGIEIFY-NMAHLDAEM 178 (429)
T ss_pred CCHHHH---HHHHHHhCCCCCCccCCCCCEEEEeCCCCCcCCCcccHHHHHHHHHHHHHHHHHhCCEEEe-ehhhcCcCC
Confidence 444444 888988764 24568888887665 1 1122221 115677733323 22 2344
Q ss_pred CchhHHHHHHHHHHHhccceeEEEEecCceee
Q psy10879 90 SSLSQKVLHSFKYIYEHFDFQYLLKCDDDTFV 121 (245)
Q Consensus 90 ~~l~~K~~~~l~w~~~~~~~~~~~k~DDD~~v 121 (245)
...+-|...+.+.+.++-+++||.-+|.|+++
T Consensus 179 p~~WaKlpaLR~aM~~~PeaEWiWWLDsDALI 210 (429)
T PLN03182 179 AGFWAKLPLLRKLMLAHPEVEWIWWMDSDALF 210 (429)
T ss_pred CcchhHHHHHHHHHHHCCCceEEEEecCCcee
Confidence 67788988888888766699999999999987
No 90
>PRK15489 nfrB bacteriophage N4 adsorption protein B; Provisional
Probab=27.30 E-value=6.3e+02 Score=25.10 Aligned_cols=192 Identities=13% Similarity=0.071 Sum_probs=96.6
Q ss_pred CCeeEEEEEeCCCCCHHHHHHHHHHHcCCcCCCcEEEEEEeecCc-hHHHHHHHHHhhhCCCEEEcCCccccCCchhHHH
Q psy10879 18 HSYFIVILILTSSKNYIRRKNIRDTWIKNIEKYQVKYLFSIGNKE-NVVDVKLKEEIRRYDDILLLNQVPDEYSSLSQKV 96 (245)
Q Consensus 18 ~~~~lli~V~S~~~~~~~R~~IR~TW~~~~~~~~~~~~Fv~g~~~-~~~~~~l~~E~~~~~Dii~~~d~~d~y~~l~~K~ 96 (245)
+...+-|+|+---+..--.+.|+..=.+.. ..+. .+|++.... .+....+..-...|..+-.+. ..+.- -+-|-
T Consensus 69 ~~~~vsIlVPa~nE~~VI~~~v~~ll~~ld-Yp~~-~I~v~~~~nD~~T~~~~~~~~~~~p~~~~v~-~~~~g--p~gKa 143 (703)
T PRK15489 69 DEQPLAIMVPAWKEYDVIAKMIENMLATLD-YRRY-VIFVGTYPNDAETITEVERMRRRYKRLVRVE-VPHDG--PTCKA 143 (703)
T ss_pred CCCceEEEEeCCCcHHHHHHHHHHHHhcCC-CCCe-EEEEEecCCCccHHHHHHHHhccCCcEEEEE-cCCCC--CCCHH
Confidence 445666777665444333444544322211 1122 355544343 444555555555555333221 12211 11232
Q ss_pred HHHHHHHHhc----c-----ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEEecCCCeeecC--CCCcccCccc-
Q psy10879 97 LHSFKYIYEH----F-----DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGFFDGRARVRRA--GKWKEKNWFL- 164 (245)
Q Consensus 97 ~~~l~w~~~~----~-----~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~~~~~~~~~r~--~k~~~~~~~~- 164 (245)
.+++|+... + .++.++-.|-|-.+.++.|.. +....+. .++.-|-+.....+..+ ++.+..|+..
T Consensus 144 -~ALN~~l~~~~~~e~~~~~~fa~vvi~DAEd~~~P~~L~~-~~~~~~~-~~~iQ~pV~~~~~~~~~~l~~~~~~Efa~~ 220 (703)
T PRK15489 144 -DCLNWIIQAIFRYEAGHGIEFAGVILHDSEDVLHPLELKY-FNYLLPR-KDLVQLPVLSLERKWYEWVAGTYMDEFAEW 220 (703)
T ss_pred -HHHHHHHHHHHhhhhhccCccceEEEEcCCCCCChhHHHH-HHhhcCC-cceeeeeeccCCCccccHHHHHHHHHHHHH
Confidence 244444433 2 234489999999999999854 4544443 44433323321111000 1111111100
Q ss_pred --------CCC-CCCccCCCeeeecHHHHHHHHHhcc---cCCCCCCChhHHHHHHHhCCCeEee
Q psy10879 165 --------CDR-YLPYALGGGYVLSHLLVKFISENSH---MLSSYISEDVSVGVWLSALDITRYH 217 (245)
Q Consensus 165 --------~~~-yp~y~~G~gyvlS~~~~~~l~~~~~---~~~~~~~eDv~iG~~l~~~~v~~~~ 217 (245)
... -|--.+|-|..+++++++.+.+... .....-=||.-+|.=+...|.+..-
T Consensus 221 ~~~~l~~r~~l~~~ipl~Gv~~~frr~aL~~l~~~gg~~~~n~~sLTED~Dlg~RL~~~G~r~~f 285 (703)
T PRK15489 221 HQKDLVVRESLTGTVPSAGVGTCFSRRALLALMKERGNQPFNTSSLTEDYDFSFRLAELGMQEIF 285 (703)
T ss_pred hhhHHHHHHHcCCceeccCcceeeeHHHHHHHHHhcCCCCCCCCCchHhHHHHHHHHHCCCceEE
Confidence 011 1112678899999999999877642 2223347999999889888887554
No 91
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=27.11 E-value=38 Score=24.02 Aligned_cols=17 Identities=29% Similarity=0.483 Sum_probs=14.2
Q ss_pred eeEEEEecCceeeehHH
Q psy10879 109 FQYLLKCDDDTFVRVPN 125 (245)
Q Consensus 109 ~~~~~k~DDD~~v~~~~ 125 (245)
-.|-+.+|||++|.+++
T Consensus 20 e~F~F~GDDDvWVFIn~ 36 (90)
T TIGR02148 20 QYFEFRGDDDVWVFINN 36 (90)
T ss_pred cEEEEEcCCeEEEEECC
Confidence 45678899999999965
No 92
>PLN02458 transferase, transferring glycosyl groups
Probab=24.43 E-value=5.1e+02 Score=23.13 Aligned_cols=37 Identities=8% Similarity=-0.034 Sum_probs=26.9
Q ss_pred HHHHHHHHHhccceeEEEEecCceeeehHHHHHHHhhc
Q psy10879 96 VLHSFKYIYEHFDFQYLLKCDDDTFVRVPNIIHELENK 133 (245)
Q Consensus 96 ~~~~l~w~~~~~~~~~~~k~DDD~~v~~~~L~~~L~~~ 133 (245)
--.+|+|+.++-..--+.++|||....++ |.+.++..
T Consensus 190 RN~AL~~IR~h~l~GVVyFADDdNtYsl~-LFeEmR~i 226 (346)
T PLN02458 190 RNLALRHIEHHKLSGIVHFAGLSNVYDLD-FFDEIRDI 226 (346)
T ss_pred HHHHHHHHHhcCcCceEEEccCCCcccHH-HHHHHhcC
Confidence 46899999887755668899999888874 44555443
No 93
>PF13075 DUF3939: Protein of unknown function (DUF3939)
Probab=20.51 E-value=2.1e+02 Score=22.05 Aligned_cols=45 Identities=9% Similarity=0.103 Sum_probs=34.1
Q ss_pred HHHHHHHhcc-ceeEEEEecCceeeehHHHHHHHhhcCcCCcccEEEE
Q psy10879 98 HSFKYIYEHF-DFQYLLKCDDDTFVRVPNIIHELENKFHYEKKLYWGF 144 (245)
Q Consensus 98 ~~l~w~~~~~-~~~~~~k~DDD~~v~~~~L~~~L~~~~~~~~~~y~G~ 144 (245)
++.++..+.- ....-+.++||.-++.++|..+|..... ++.|+.+
T Consensus 18 Av~~f~~~lp~gi~rt~lv~~d~~iD~~~L~~yL~g~p~--q~FymSk 63 (140)
T PF13075_consen 18 AVHQFEEDLPKGINRTILVNDDQSIDFERLAPYLGGIPD--QRFYMSK 63 (140)
T ss_pred HHHHHHHhCccCCceEEEEcCCceecHHHHhhhcCCCCC--cceeeeH
Confidence 3444443323 6778899999999999999999988766 5777765
No 94
>PF02709 Glyco_transf_7C: N-terminal domain of galactosyltransferase; InterPro: IPR003859 This is a family of galactosyltransferases from a wide range of metazoa with three related galactosyltransferase activities; all three of which are possessed by one sequence in some cases. The three functions are N-acetyllactosamine synthase (2.4.1.90 from EC); beta-N-acetylglucosaminyl-glycopeptide beta-1,4-galactosyltransferase (2.4.1.38 from EC); and lactose synthase (2.4.1.22 from EC). Note that N-acetyllactosamine synthase is a component of lactose synthase along with alpha-lactalbumin, in the absence of alpha-lactalbumin N-acetyllactosamine synthase is used.; GO: 0016757 transferase activity, transferring glycosyl groups, 0005975 carbohydrate metabolic process; PDB: 2AGD_B 3EE5_A 2AE7_B 2AEC_A 2FYA_A 2AES_B 2AH9_A 2FYB_A 2FY7_A 3LW6_A ....
Probab=20.30 E-value=1.4e+02 Score=20.26 Aligned_cols=47 Identities=13% Similarity=0.057 Sum_probs=30.4
Q ss_pred ccCCCeeeecHHHHHHHHHhcccCCCCCCChhHHHHHHHhCCCeEee
Q psy10879 171 YALGGGYVLSHLLVKFISENSHMLSSYISEDVSVGVWLSALDITRYH 217 (245)
Q Consensus 171 y~~G~gyvlS~~~~~~l~~~~~~~~~~~~eDv~iG~~l~~~~v~~~~ 217 (245)
.+.|+.+.+++....++=--.........||.-+..=+..+|++...
T Consensus 18 ~~~Gg~~~~~~~~f~~vnGfde~f~gWG~ED~Dl~~Rl~~~g~~~~~ 64 (78)
T PF02709_consen 18 NFFGGVFAISREDFEKVNGFDERFWGWGGEDDDLYNRLWKAGLKIVR 64 (78)
T ss_dssp T---SEEEEEHHHHHHTTSS-SS-TSCSSHHHHHHHHHHHTT---B-
T ss_pred CeeEEEEEEeHHHHHHcCCCCccccccCccHHHHHHHHHHcCCeEEe
Confidence 35688899999988887655555667789999998777878877554
Done!