Query gi|254780365|ref|YP_003064778.1| UTP-glucose-1-phosphate uridylyltransferase protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 299 No_of_seqs 134 out of 5514 Neff 7.5 Searched_HMMs 39220 Date Sun May 29 17:40:13 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780365.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01099 galU UTP-glucose-1-p 100.0 0 0 611.1 11.2 265 7-272 1-270 (270) 2 PRK13389 UTP--glucose-1-phosph 100.0 0 0 573.9 18.6 293 1-296 1-299 (302) 3 COG1210 GalU UDP-glucose pyrop 100.0 0 0 574.1 17.0 290 4-297 1-290 (291) 4 PRK10122 UTP--glucose-1-phosph 100.0 0 0 566.8 18.5 291 1-297 1-297 (297) 5 cd02541 UGPase_prokaryotic Pro 100.0 0 0 488.1 18.3 267 8-279 1-267 (267) 6 TIGR01208 rmlA_long glucose-1- 100.0 0 0 467.0 13.3 232 9-276 1-238 (361) 7 cd02538 G1P_TT_short G1P_TT_sh 100.0 0 0 399.1 14.4 232 9-276 2-237 (240) 8 COG1209 RfbA dTDP-glucose pyro 100.0 0 0 391.1 13.2 231 9-276 2-235 (286) 9 cd06428 M1P_guanylylT_A_like_N 100.0 0 0 390.1 13.0 232 10-274 1-255 (257) 10 cd06425 M1P_guanylylT_B_like_N 100.0 0 0 389.8 12.9 228 9-276 2-232 (233) 11 cd06915 NTP_transferase_WcbM_l 100.0 0 0 376.4 13.3 222 10-273 1-223 (223) 12 cd04189 G1P_TT_long G1P_TT_lon 100.0 0 0 370.6 16.2 230 9-276 2-233 (236) 13 pfam00483 NTP_transferase Nucl 100.0 0 0 375.0 12.2 235 9-276 1-244 (247) 14 cd06422 NTP_transferase_like_1 100.0 0 0 367.4 13.9 219 9-272 1-221 (221) 15 PRK05293 glgC glucose-1-phosph 100.0 0 0 366.0 14.4 245 2-277 2-258 (381) 16 cd02524 G1P_cytidylyltransfera 100.0 0 0 363.1 13.9 244 10-276 1-245 (253) 17 COG1208 GCD1 Nucleoside-diphos 100.0 0 0 351.8 14.7 230 9-276 3-234 (358) 18 TIGR02091 glgC glucose-1-phosp 100.0 0 0 356.0 9.7 252 9-283 2-302 (421) 19 TIGR01105 galF regulatory prot 100.0 0 0 354.7 9.6 290 4-296 1-296 (297) 20 cd04181 NTP_transferase NTP_tr 100.0 0 0 344.7 14.2 216 10-264 1-217 (217) 21 PRK03282 consensus 100.0 0 0 344.0 14.6 248 1-277 1-278 (406) 22 cd06426 NTP_transferase_like_2 100.0 0 0 342.1 13.6 218 10-273 1-220 (220) 23 cd02540 GT2_GlmU_N_bac N-termi 100.0 0 0 345.3 10.6 219 10-268 1-228 (229) 24 PRK00844 glgC glucose-1-phosph 100.0 0 0 335.8 14.9 245 3-277 2-276 (409) 25 PRK02862 glgC glucose-1-phosph 100.0 0 0 331.3 13.4 243 4-277 1-273 (429) 26 PRK04928 consensus 100.0 0 0 330.8 13.7 252 4-280 1-274 (405) 27 KOG1322 consensus 100.0 0 0 333.7 9.9 226 9-273 11-240 (371) 28 PRK00725 glgC glucose-1-phosph 100.0 0 0 320.8 12.7 253 3-276 15-290 (431) 29 PRK01884 consensus 100.0 0 0 318.0 12.4 250 7-277 17-290 (435) 30 PRK03701 consensus 100.0 0 0 313.9 13.9 251 6-277 18-291 (431) 31 PRK09451 glmU bifunctional N-a 100.0 0 0 307.1 9.8 217 9-268 7-229 (456) 32 cd02523 PC_cytidylyltransferas 100.0 1.4E-45 0 301.9 10.9 221 10-273 1-229 (229) 33 cd04183 GT2_BcE_like GT2_BcbE_ 100.0 5.6E-44 0 290.6 11.0 221 11-269 2-230 (231) 34 COG0448 GlgC ADP-glucose pyrop 100.0 1.7E-44 0 293.8 7.7 253 8-282 6-266 (393) 35 TIGR01173 glmU UDP-N-acetylglu 100.0 1.8E-42 0 281.3 10.6 235 9-280 2-247 (461) 36 cd02508 ADP_Glucose_PP ADP-glu 100.0 1.8E-40 4.2E-45 269.0 8.5 195 10-263 1-200 (200) 37 TIGR02623 G1P_cyt_trans glucos 100.0 4.8E-40 1.3E-44 266.4 10.2 232 9-266 2-236 (256) 38 COG1207 GlmU N-acetylglucosami 100.0 7.4E-37 1.9E-41 246.8 10.5 235 8-281 3-245 (460) 39 TIGR01207 rmlA glucose-1-phosp 100.0 8E-36 2E-40 240.4 8.8 232 9-274 1-234 (286) 40 KOG1460 consensus 100.0 7.9E-35 2E-39 234.3 4.9 237 7-275 2-262 (407) 41 cd04198 eIF-2B_gamma_N The N-t 100.0 9.5E-34 2.4E-38 227.6 8.9 136 9-169 2-139 (214) 42 cd04197 eIF-2B_epsilon_N The N 100.0 3.7E-33 9.4E-38 224.0 8.6 190 9-221 2-217 (217) 43 cd02507 eIF-2B_gamma_N_like Th 100.0 9.5E-33 2.4E-37 221.5 7.8 191 9-221 2-216 (216) 44 cd02509 GDP-M1P_Guanylyltransf 100.0 7.6E-31 1.9E-35 209.8 1.8 234 9-268 2-273 (274) 45 COG0836 {ManC} Mannose-1-phosp 99.9 2.6E-25 6.6E-30 175.7 4.3 241 9-273 3-282 (333) 46 COG1213 Predicted sugar nucleo 99.9 5.8E-24 1.5E-28 167.4 10.0 224 1-277 1-229 (239) 47 KOG1462 consensus 99.7 5.3E-18 1.4E-22 130.7 0.5 241 9-274 11-307 (433) 48 COG4750 LicC CTP:phosphocholin 99.7 2.5E-17 6.3E-22 126.6 3.0 214 9-272 2-221 (231) 49 TIGR03310 matur_ygfJ molybdenu 99.6 1.1E-15 2.9E-20 116.3 6.9 179 10-270 2-186 (188) 50 cd04182 GT_2_like_f GT_2_like_ 99.6 1.4E-14 3.5E-19 109.7 8.5 178 9-269 2-185 (186) 51 TIGR02092 glgD glucose-1-phosp 99.6 1E-14 2.6E-19 110.5 7.8 247 9-287 4-274 (383) 52 KOG1461 consensus 99.5 2.6E-13 6.7E-18 101.8 10.1 234 6-275 23-288 (673) 53 TIGR03202 pucB xanthine dehydr 99.5 1.1E-13 2.9E-18 104.0 7.8 182 9-269 2-190 (190) 54 PRK00317 mobA molybdopterin-gu 99.5 4.3E-13 1.1E-17 100.5 10.5 187 4-273 1-190 (193) 55 COG2068 Uncharacterized MobA-r 99.4 3.3E-12 8.4E-17 95.0 11.0 188 5-275 4-196 (199) 56 TIGR00454 TIGR00454 conserved 99.4 2.4E-13 6.2E-18 102.0 3.9 126 9-160 2-128 (204) 57 cd02503 MobA MobA catalyzes th 99.3 5.9E-12 1.5E-16 93.5 8.1 176 9-269 2-181 (181) 58 PRK00155 ispD 2-C-methyl-D-ery 99.3 5.6E-11 1.4E-15 87.5 11.3 209 4-273 2-220 (228) 59 PRK13368 3-deoxy-manno-octulos 99.3 4.2E-11 1.1E-15 88.2 10.3 224 4-273 1-235 (238) 60 pfam01128 IspD Uncharacterized 99.3 3E-11 7.7E-16 89.1 8.2 208 9-273 2-216 (221) 61 cd02517 CMP-KDO-Synthetase CMP 99.2 1.2E-10 3.1E-15 85.4 10.4 221 9-273 3-237 (239) 62 PRK02726 molybdopterin-guanine 99.2 6.9E-11 1.8E-15 86.9 9.1 189 4-273 4-195 (200) 63 cd02516 CDP-ME_synthetase CDP- 99.2 9.1E-11 2.3E-15 86.2 8.0 203 9-270 2-216 (218) 64 PRK05450 3-deoxy-manno-octulos 99.2 8E-10 2.1E-14 80.3 12.4 225 4-274 1-244 (248) 65 COG2266 GTP:adenosylcobinamide 99.1 7.8E-11 2E-15 86.6 3.0 98 9-141 2-100 (177) 66 TIGR01479 GMP_PMI mannose-1-ph 99.0 1.1E-10 2.8E-15 85.7 2.7 239 10-273 3-290 (478) 67 TIGR00453 ispD 2-C-methyl-D-er 99.0 4.3E-10 1.1E-14 82.0 5.1 209 10-273 2-223 (226) 68 PRK00560 molybdopterin-guanine 99.0 6.8E-09 1.7E-13 74.6 9.9 178 8-274 9-189 (196) 69 COG0746 MobA Molybdopterin-gua 98.9 3.6E-09 9.1E-14 76.4 5.6 180 8-273 5-189 (192) 70 PRK09382 ispDF bifunctional 2- 98.8 9.2E-09 2.3E-13 73.8 6.9 61 1-63 1-62 (382) 71 PRK13385 2-C-methyl-D-erythrit 98.8 1.4E-08 3.6E-13 72.7 6.3 53 8-63 3-56 (238) 72 pfam02348 CTP_transf_3 Cytidyl 98.7 3.1E-08 7.8E-13 70.6 6.4 182 10-230 2-194 (197) 73 COG1211 IspD 4-diphosphocytidy 98.7 4.5E-08 1.2E-12 69.6 6.5 216 6-276 3-226 (230) 74 TIGR02665 molyb_mobA molybdopt 98.5 2.9E-08 7.3E-13 70.8 2.1 122 9-162 2-127 (202) 75 cd02513 CMP-NeuAc_Synthase CMP 98.4 3.4E-06 8.8E-11 58.0 9.1 204 9-273 3-220 (223) 76 COG1212 KdsB CMP-2-keto-3-deox 98.2 1.6E-05 4.1E-10 53.9 9.9 224 8-274 4-240 (247) 77 cd02518 GT2_SpsF SpsF is a gly 98.1 3.4E-06 8.7E-11 58.0 3.9 117 10-163 2-121 (233) 78 PTZ00339 UDP-N-acetylglucosami 97.9 6.3E-05 1.6E-09 50.2 7.8 234 9-263 111-396 (499) 79 TIGR03584 PseF pseudaminic aci 97.8 6.3E-05 1.6E-09 50.2 6.1 202 10-274 2-218 (222) 80 cd04193 UDPGlcNAc_PPase UDPGlc 97.8 4E-05 1E-09 51.4 4.7 192 9-221 17-251 (323) 81 COG1083 NeuA CMP-N-acetylneura 97.7 0.0003 7.6E-09 46.1 8.6 205 8-276 4-222 (228) 82 COG4284 UDP-glucose pyrophosph 97.6 0.00032 8.1E-09 45.9 7.4 192 9-220 107-333 (472) 83 cd00761 Glyco_tranf_GTA_type G 97.5 0.00041 1E-08 45.2 7.0 83 38-142 9-93 (156) 84 cd04180 UGPase_euk_like Eukary 97.4 0.00018 4.6E-09 47.4 4.3 60 10-72 3-73 (266) 85 cd06424 UGGPase UGGPase cataly 97.4 0.00027 7E-09 46.3 4.5 61 10-73 3-76 (315) 86 cd00897 UGPase_euk Eukaryotic 96.6 0.003 7.5E-08 39.9 4.3 60 9-71 5-71 (300) 87 cd02526 GT2_RfbF_like RfbF is 96.1 0.063 1.6E-06 31.8 8.7 190 39-267 11-209 (237) 88 TIGR00466 kdsB 3-deoxy-D-manno 95.6 0.02 5E-07 34.9 4.3 48 9-62 4-51 (246) 89 COG1861 SpsF Spore coat polysa 95.0 0.041 1E-06 32.9 4.4 49 9-63 4-54 (241) 90 pfam01704 UDPGP UTP--glucose-1 93.6 0.12 3.2E-06 29.9 4.5 51 9-62 55-111 (416) 91 pfam00535 Glycos_transf_2 Glyc 91.9 0.26 6.5E-06 28.0 4.2 105 32-162 3-110 (168) 92 KOG2388 consensus 91.0 0.29 7.3E-06 27.7 3.6 36 9-47 99-137 (477) 93 cd04188 DPG_synthase DPG_synth 87.9 2.1 5.4E-05 22.3 6.8 145 102-263 58-207 (211) 94 PRK00576 molybdopterin-guanine 87.7 1 2.7E-05 24.2 4.5 35 28-63 3-39 (178) 95 pfam07959 Fucokinase L-fucokin 86.5 2.4 6.2E-05 22.0 5.8 137 131-274 57-253 (414) 96 cd04184 GT2_RfbC_Mx_like Myxoc 85.7 2.3 5.8E-05 22.2 5.4 101 32-158 6-111 (202) 97 cd04186 GT_2_like_c Subfamily 82.4 1.7 4.2E-05 23.0 3.5 96 32-156 2-100 (166) 98 PRK10018 predicted glycosyl tr 81.7 2.2 5.6E-05 22.3 3.9 104 29-158 3-113 (279) 99 cd06433 GT_2_WfgS_like WfgS an 79.9 4.8 0.00012 20.2 7.0 98 32-158 3-103 (202) 100 TIGR01584 citF citrate lyase, 79.7 0.38 9.8E-06 26.9 -0.5 16 49-64 236-251 (496) 101 cd04179 DPM_DPG-synthase_like 79.6 1.8 4.6E-05 22.8 2.9 61 102-166 55-116 (185) 102 cd04195 GT2_AmsE_like GT2_AmsE 78.3 1.1 2.8E-05 24.1 1.5 100 32-158 3-108 (201) 103 cd06434 GT2_HAS Hyaluronan syn 77.1 3.6 9.2E-05 20.9 3.8 100 32-159 5-106 (235) 104 cd06442 DPM1_like DPM1_like re 76.9 3.1 7.8E-05 21.4 3.4 146 102-262 54-201 (224) 105 COG1216 Predicted glycosyltran 75.8 6.3 0.00016 19.5 7.5 204 32-267 9-225 (305) 106 cd06913 beta3GnTL1_like Beta 1 75.2 4.4 0.00011 20.4 3.8 112 32-165 2-119 (219) 107 cd04185 GT_2_like_b Subfamily 75.0 5.6 0.00014 19.8 4.3 168 36-268 7-179 (202) 108 TIGR01320 mal_quin_oxido malat 75.0 2.4 6.1E-05 22.1 2.4 110 120-268 154-274 (487) 109 cd04187 DPM1_like_bac Bacteria 73.1 6.2 0.00016 19.5 4.1 110 102-229 56-167 (181) 110 TIGR03552 F420_cofC 2-phospho- 64.3 2.6 6.7E-05 21.8 0.7 112 9-157 1-118 (195) 111 COG1788 AtoD Acyl CoA:acetate/ 62.1 12 0.0003 17.8 3.8 34 11-62 20-53 (220) 112 TIGR01096 3A0103s03R lysine-ar 62.1 6.2 0.00016 19.5 2.3 43 257-299 255-320 (333) 113 cd06421 CESA_CelA_like CESA_Ce 61.1 5.9 0.00015 19.6 2.1 99 32-157 6-111 (234) 114 PRK13412 fkp bifunctional fuco 59.8 13 0.00034 17.4 4.9 136 130-276 155-311 (974) 115 cd06439 CESA_like_1 CESA_like_ 59.7 10 0.00026 18.2 3.1 100 32-159 34-138 (251) 116 cd06427 CESA_like_2 CESA_like_ 59.6 12 0.0003 17.7 3.4 103 32-159 6-113 (241) 117 cd03409 Chelatase_Class_II Cla 58.0 8.4 0.00021 18.7 2.4 49 12-60 2-66 (101) 118 cd04196 GT_2_like_d Subfamily 58.0 7.2 0.00018 19.1 2.1 102 32-158 3-107 (214) 119 PRK10073 predicted glycosyl tr 57.4 6.3 0.00016 19.4 1.7 105 33-164 12-119 (329) 120 cd06437 CESA_CaSu_A2 Cellulose 56.7 8.9 0.00023 18.5 2.4 105 32-159 6-116 (232) 121 PRK00923 sirohydrochlorin coba 51.8 18 0.00046 16.6 5.7 51 8-60 3-69 (130) 122 pfam02633 Creatininase Creatin 51.6 18 0.00047 16.6 3.5 26 40-65 87-112 (235) 123 TIGR03111 glyc2_xrt_Gpos1 puta 48.9 17 0.00043 16.8 2.7 100 32-157 54-158 (439) 124 cd04192 GT_2_like_e Subfamily 48.6 20 0.0005 16.4 3.1 210 32-277 2-227 (229) 125 TIGR00027 mthyl_TIGR00027 meth 47.8 7.6 0.00019 19.0 0.8 18 7-24 91-108 (281) 126 cd02525 Succinoglycan_BP_ExoA 45.5 23 0.00058 16.0 7.9 211 32-277 5-224 (249) 127 PRK11204 N-glycosyltransferase 44.3 7.9 0.0002 18.8 0.5 95 36-156 64-160 (421) 128 PRK09485 mmuM homocysteine met 37.3 15 0.00039 17.1 1.1 60 1-60 1-67 (308) 129 COG3315 O-Methyltransferase in 36.9 21 0.00053 16.3 1.7 43 7-52 93-138 (297) 130 pfam04858 TH1 TH1 protein. TH1 33.2 5.1 0.00013 20.0 -1.9 30 45-74 92-124 (582) 131 pfam04555 XhoI Restriction end 33.1 9.4 0.00024 18.4 -0.6 35 40-74 26-60 (196) 132 TIGR02017 hutG_amidohyd N-form 31.8 38 0.00096 14.7 5.1 64 235-298 175-263 (269) 133 cd03413 CbiK_C Anaerobic cobal 30.9 39 0.00099 14.6 3.5 13 129-141 63-75 (103) 134 PRK06263 sdhA succinate dehydr 30.5 32 0.00082 15.1 1.8 20 8-27 186-205 (539) 135 pfam05060 MGAT2 N-acetylglucos 29.4 41 0.0011 14.4 3.7 41 23-63 27-69 (357) 136 PRK09822 lipopolysaccharide co 29.2 42 0.0011 14.4 2.8 49 8-62 22-71 (269) 137 KOG4180 consensus 27.5 42 0.0011 14.4 2.0 31 9-39 107-139 (395) 138 cd03411 Ferrochelatase_N Ferro 27.2 45 0.0011 14.2 2.9 19 238-256 101-121 (159) 139 TIGR02351 thiH thiazole biosyn 26.8 24 0.00062 15.8 0.7 32 41-72 110-143 (378) 140 TIGR01815 TrpE-clade3 anthrani 26.6 6.1 0.00015 19.5 -2.5 156 102-259 135-321 (726) 141 COG1581 Ssh10b Archaeal DNA-bi 26.0 48 0.0012 14.0 4.2 31 29-59 3-35 (91) 142 TIGR00763 lon ATP-dependent pr 24.8 34 0.00086 15.0 1.1 17 260-276 236-252 (941) 143 COG1924 Activator of 2-hydroxy 24.8 50 0.0013 13.9 3.8 61 6-73 145-209 (396) 144 KOG3349 consensus 24.5 51 0.0013 13.9 2.7 23 42-64 25-47 (170) 145 TIGR00461 gcvP glycine dehydro 24.1 48 0.0012 14.0 1.8 14 236-249 362-375 (965) 146 KOG2638 consensus 23.6 53 0.0013 13.8 7.3 47 10-59 106-157 (498) 147 TIGR02468 sucrsPsyn_pln sucros 23.6 43 0.0011 14.3 1.4 81 214-297 272-355 (1072) 148 PRK09231 fumarate reductase fl 23.5 48 0.0012 14.0 1.7 20 8-27 185-204 (582) 149 COG0381 WecB UDP-N-acetylgluco 22.9 53 0.0013 13.8 1.8 21 44-64 22-43 (383) 150 TIGR02852 spore_dpaB dipicolin 22.8 55 0.0014 13.7 3.4 59 42-114 18-76 (188) 151 TIGR02429 pcaI_scoA_fam 3-oxoa 22.7 55 0.0014 13.7 3.2 35 10-62 20-54 (222) 152 cd06423 CESA_like CESA_like is 22.6 55 0.0014 13.6 5.8 101 32-157 2-105 (180) 153 TIGR02640 gas_vesic_GvpN gas v 22.2 34 0.00087 14.9 0.7 41 95-140 16-59 (265) 154 TIGR00423 TIGR00423 conserved 22.1 57 0.0014 13.6 3.3 28 41-68 41-68 (331) 155 COG0567 SucA 2-oxoglutarate de 22.0 57 0.0014 13.6 2.5 27 38-64 213-239 (906) 156 TIGR01514 NAPRTase nicotinate 21.3 20 0.00051 16.4 -0.7 40 236-275 274-316 (430) 157 TIGR01854 lipid_A_lpxH UDP-2,3 20.8 36 0.00091 14.8 0.6 18 127-144 106-123 (241) 158 COG3051 CitF Citrate lyase, al 20.4 7 0.00018 19.2 -3.1 17 252-268 268-284 (513) 159 KOG1447 consensus 20.4 51 0.0013 13.9 1.3 46 234-279 320-368 (412) 160 cd00419 Ferrochelatase_C Ferro 20.2 62 0.0016 13.3 3.6 24 239-263 80-105 (135) No 1 >TIGR01099 galU UTP-glucose-1-phosphate uridylyltransferase; InterPro: IPR005771 Uridine 5'-triphosphate:glucose-1-phosphate uridylyltransferase (uridine diphosphoglucose pyrophosphorylase, 2.7.7.9 from EC) is responsible for the synthesis of UDP-glucose, a key compound in the biosynthesis of polysaccharides. Glucose is fermented through the glycolysis step to pyruvate, which in turn is converted to lactate. The glycolytic intermediate glucose 6-phosphate is converted to glucose 1-phosphate by phosphoglucomutase activity, and this metabolite is subsequently converted to UDP-glucose by UDP-glucose pyrophosphorylase (GalU) activity. ; GO: 0003983 UTP:glucose-1-phosphate uridylyltransferase activity, 0006011 UDP-glucose metabolic process. Probab=100.00 E-value=0 Score=611.10 Aligned_cols=265 Identities=49% Similarity=0.831 Sum_probs=255.9 Q ss_pred CCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCC Q ss_conf 45899956888833432440888333568836279999999997899889998135544410100117454445654187 Q gi|254780365|r 7 VRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNK 86 (299) Q Consensus 7 i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~ 86 (299) ||||||||||+||||+|+|+.+||.||||.|||+|+|+|+++++|||++|+||||++|++|+||||.+++||..|++++| T Consensus 1 irKAViPaAGlGTRfLPATKA~PKEMLPiVdKP~IQY~VEEav~aGie~i~~VTGr~K~aIEDHFD~s~ELE~~L~~~~K 80 (270) T TIGR01099 1 IRKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPLIQYIVEEAVEAGIEDILIVTGRGKRAIEDHFDTSYELEHQLEKKNK 80 (270) T ss_pred CCEEEECCCCCCCCCCHHHHCCCCCCCCCCCCCEEHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 97788527675310023451688656764587421045798986489347999658850101425878889999875104 Q ss_pred CEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCEEE Q ss_conf 0000113441124857998433342334665421101122333311127864445443--32211222122343220011 Q gi|254780365|r 87 KAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEG--ENCMANMIKLYEKEGANILA 164 (299) Q Consensus 87 ~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~--~~~l~~li~~~~~~~~~vi~ 164 (299) ...|.+++++.+...+|.||||++++|+||||+||+++||||||.|++||+|+..... .++|+||++.|++.++++|| T Consensus 81 ~~lL~~v~~I~~~~~~i~yvRQke~kGLGhAvL~a~~~vGdEpFaV~LgDdi~~~~~~D~~~~Lkqm~~~Y~~~g~s~ia 160 (270) T TIGR01099 81 EELLKEVRKISNLNANIFYVRQKEQKGLGHAVLCARPLVGDEPFAVLLGDDIVVSEEPDLQEALKQMIDLYEKYGCSIIA 160 (270) T ss_pred HHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHCCCCCCCCCEEEEECCEEEECCCCCHHHHHHHHHHHHHHHCCEEEE T ss_conf 88999999862699728998138888551275435212588860676352656358875468999999999863881788 Q ss_pred EEECCHHHCCCCCCEECCC-CCCCCCCE--EEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHH Q ss_conf 2432854433133100133-33335300--00002577888673423302020027233110001226667675003789 Q gi|254780365|r 165 VSECDPQLSCKYGMVQVGK-AIDHQVFH--ISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDS 241 (299) Q Consensus 165 ~~~v~~~~~~~yGvi~~~~-~~~~~~~~--I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~ 241 (299) |++|+++++++|||+.... -.+.+.++ |.+|||||++++||||||++|+|||+|+||+.|+++++|.| |||||||| T Consensus 161 V~~Vp~e~V~~YGii~~~~dy~~~~~y~s~i~~mVEKP~~e~APSNLAi~GRYvL~P~IF~~L~~t~~G~G-gEIQLTDA 239 (270) T TIGR01099 161 VEEVPKEEVSKYGIIDGEGDYLEEGLYESVIKDMVEKPKPEEAPSNLAIVGRYVLTPDIFDLLEETPPGKG-GEIQLTDA 239 (270) T ss_pred EEECCHHHCCCCEEECCCCCHHHCCCEEEEEECCEECCCCCCCCCCEEEECHHCCCHHHHHHHHHCCCCCC-CCCHHHHH T ss_conf 86344551775515727420001232234221414076678887421022001168568888761799999-73118889 Q ss_pred HHHHHHCCCEEEEEEEEEEEECCCHHHHHHH Q ss_conf 9999831984899981378643998899999 Q gi|254780365|r 242 MRKLSERHDFLAYHFKGHTYDCGSKKGFVLA 272 (299) Q Consensus 242 i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A 272 (299) |+.|+++..|+||.++|++||||+..+|++| T Consensus 240 l~~l~~~e~v~A~~f~G~ryD~G~~~Gy~~A 270 (270) T TIGR01099 240 LRKLLKKERVLAYKFEGKRYDCGSKLGYLKA 270 (270) T ss_pred HHHHHHCCCEEEEEEECCEEEECCCCCCEEC T ss_conf 9998623977888750316654483331009 No 2 >PRK13389 UTP--glucose-1-phosphate uridylyltransferase subunit GalU; Provisional Probab=100.00 E-value=0 Score=573.93 Aligned_cols=293 Identities=37% Similarity=0.628 Sum_probs=271.7 Q ss_pred CCCC-CCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHH Q ss_conf 9788-854589995688883343244088833356883627999999999789988999813554441010011745444 Q gi|254780365|r 1 MGSL-KKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQ 79 (299) Q Consensus 1 ~~~m-~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~ 79 (299) |--| ||||||||||||+||||+|+|+.+|||||||.|||+|+|+++++.++||++|+||||+.|++|+|||+.+++++. T Consensus 1 ~~~m~kki~KaVI~AAG~GTRl~P~Tk~~PKemLPi~dkPii~~~vee~~~aGI~ei~iV~~~~K~~I~d~F~~~~ele~ 80 (302) T PRK13389 1 MAAINTKVKKAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQYVVNECIAAGITEIVLVTHSSKNSIENHFDTSFELEA 80 (302) T ss_pred CCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHCCCCHHHHH T ss_conf 96435775679998986753668650469812154898875999999999869978999927984135563477088888 Q ss_pred HHHHCCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCC Q ss_conf 565418700001134411248579984333423346654211011223333111278644454---43322112221223 Q gi|254780365|r 80 SLRKRNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPL---EGENCMANMIKLYE 156 (299) Q Consensus 80 ~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~---~~~~~l~~li~~~~ 156 (299) .+..+++...+..+....+.+.+++|++|++|+||||||+||+++++||||+|++||+++... ...+++++|++.|+ T Consensus 81 ~L~~~~k~~~l~~~~~i~~~~~~i~~v~Q~~~~GlGhAV~~A~~~v~depF~VllgD~li~~~~~~l~~~~l~~li~~~~ 160 (302) T PRK13389 81 MLEKRVKRQLLDEVQSICPPHVTIMQVRQGLAKGLGHAVLCAHPVVGDEPVAVILPDVILDEYESDLSQDNLAEMIRRFD 160 (302) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCHHHCCCCCCCCHHHHHHHHHHHH T ss_conf 88763257777776530358835999867898768999998898629965799936334135566520568999999986 Q ss_pred CCCCCEEEEEECCHHHCCCCCCEECCCC--CCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCC Q ss_conf 4322001124328544331331001333--33353000000257788867342330202002723311000122666767 Q gi|254780365|r 157 KEGANILAVSECDPQLSCKYGMVQVGKA--IDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKG 234 (299) Q Consensus 157 ~~~~~vi~~~~v~~~~~~~yGvi~~~~~--~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~g 234 (299) +++++++++.+| +++++|||+.+... ..++.++|+++||||+++.+|||++++|+|+|+|+||++|++++++. +| T Consensus 161 ~~g~~~I~v~~V--ed~s~YGVv~~~g~~~~~~~~~~I~~iVEKP~~e~aPSnlAi~GrYif~p~IF~~L~~~~p~~-~G 237 (302) T PRK13389 161 ETGHSQIMVEPV--ADVTAYGVVDCKGVELAPGESVPMVGVVEKPKADVAPSNLAIVGRYVLSADIWPLLAKTPPGA-GD 237 (302) T ss_pred HCCCCEEEEEEC--CCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCEEEEEEEEECHHHHHHHHCCCCCC-CC T ss_conf 149868999986--776324479635664456763322456743788789877578767887889999996079999-98 Q ss_pred CCCHHHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 50037899999831984899981378643998899999999863183466999999999988 Q gi|254780365|r 235 EIQLTDSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANIAFALARQDIRSDIETDLKTLVS 296 (299) Q Consensus 235 E~~ltD~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~~~~~~~~~~~l~~~~~ 296 (299) ||||||||+.|+++++|+||.++|+|||||++.+|++||+++||+|++++++|++||++.+- T Consensus 238 EiqLTDAi~~l~~~~~v~~~~~~G~r~D~G~~~gyl~A~i~~al~~~~~~~~~~~~l~~~~g 299 (302) T PRK13389 238 EIQLTDAIDMLIEKETVEAYHMKGKSHDCGNKLGYMQAFVEYGIRHNTLGTEFKAWLEEEMG 299 (302) T ss_pred CEEHHHHHHHHHHHCCEEEEEEEEEEECCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHC T ss_conf 08499999999842976999965579787680137999999982798788999999999738 No 3 >COG1210 GalU UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=574.06 Aligned_cols=290 Identities=46% Similarity=0.832 Sum_probs=279.9 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHH Q ss_conf 88545899956888833432440888333568836279999999997899889998135544410100117454445654 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRK 83 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~ 83 (299) |++|+||||+|||+|||+.|.|+.+||.||||.|||+|+|+++++++|||++|+||||++|+.|+||||.+++|++.|+. T Consensus 1 ~~~irKAViPaAGlGTRfLPATKaiPKEMLPIvdKP~IqYiVeEa~~aGIe~i~iVTgr~K~~IeDhFD~s~ELE~~L~~ 80 (291) T COG1210 1 MMKIRKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPLIQYIVEEAVAAGIEEILIVTGRGKRAIEDHFDTSYELENTLEK 80 (291) T ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHCCCCHHHHHHHHH T ss_conf 97531798874676455466003576644366672219999999998699779999669835688867686999999987 Q ss_pred CCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 18700001134411248579984333423346654211011223333111278644454433221122212234322001 Q gi|254780365|r 84 RNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANIL 163 (299) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi 163 (299) ++|...+..+++. +..++++|++|.+|+|+||||+||++++|||||+|++||+++.+. .+|++||++.|++.+++++ T Consensus 81 ~~K~~~L~~v~~i-~~~~~i~~vRQ~e~~GLGhAVl~A~~~vg~EpFaVlL~Ddl~~~~--~~~l~qmi~~y~~~g~svi 157 (291) T COG1210 81 RGKRELLEEVRSI-PPLVTISFVRQKEPLGLGHAVLCAKPFVGDEPFAVLLPDDLVDSE--KPCLKQMIELYEETGGSVI 157 (291) T ss_pred HCHHHHHHHHHHC-CCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCEEECCC--CHHHHHHHHHHHHHCCCEE T ss_conf 2779999999723-577539998658767601788853543289856998177263278--0699999999998599279 Q ss_pred EEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHH Q ss_conf 12432854433133100133333353000000257788867342330202002723311000122666767500378999 Q gi|254780365|r 164 AVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMR 243 (299) Q Consensus 164 ~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~ 243 (299) ++++|+++++++|||+..+...+.+.++|++|||||++++||||++..|+|+|+|+||+.|++.+++. .||+||||+|. T Consensus 158 ~v~ev~~e~v~kYGvi~~g~~~~~~~~~v~~~VEKP~~~~APSnlai~GRYil~p~IF~~L~~~~~G~-ggEiQLTDai~ 236 (291) T COG1210 158 GVEEVPPEDVSKYGVIDPGEPVEKGVYKVKGMVEKPKPEEAPSNLAIVGRYVLTPEIFDILEETKPGA-GGEIQLTDAIK 236 (291) T ss_pred EEEECCHHHCCCCCEEECCCCCCCCEEEEEEEEECCCCCCCCCCEEEEEEEECCHHHHHHHHHCCCCC-CCEEEHHHHHH T ss_conf 99988878776562672485035772899777766999999854157654656899999996279999-98766799999 Q ss_pred HHHHCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 998319848999813786439988999999998631834669999999999886 Q gi|254780365|r 244 KLSERHDFLAYHFKGHTYDCGSKKGFVLANIAFALARQDIRSDIETDLKTLVSA 297 (299) Q Consensus 244 ~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~~~~~~~~~~~l~~~~~~ 297 (299) .|+++..|+||.++|++||||++.+|++|+++++|+|++++.++++||+++++. T Consensus 237 ~L~~~~~v~a~~~~GkryD~G~k~Gyi~a~v~~~l~~~~~~~~~~~~l~~~~~~ 290 (291) T COG1210 237 KLLKKEPVLAYVFEGKRYDCGSKLGYIKANVEFALRRPKLGAEFKEYLKELLKE 290 (291) T ss_pred HHHHHCCEEEEEECCCEECCCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHC T ss_conf 998538579999456276168712489999999970866659999999997405 No 4 >PRK10122 UTP--glucose-1-phosphate uridylyltransferase subunit GalF; Provisional Probab=100.00 E-value=0 Score=566.80 Aligned_cols=291 Identities=34% Similarity=0.607 Sum_probs=269.1 Q ss_pred CCCCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHH Q ss_conf 97888545899956888833432440888333568836279999999997899889998135544410100117454445 Q gi|254780365|r 1 MGSLKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQS 80 (299) Q Consensus 1 ~~~m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~ 80 (299) |+.| ||||||||+||||+|+|+.+|||||||.|||+|+|++++++++||+||+|||++.|++|++||+.+++++.. T Consensus 1 ~~~M----KaVI~aAG~GTRl~P~Tk~~PKemLPv~dkPii~~~vee~~~aGI~eiiiV~~~~k~~ie~~F~~~~~le~~ 76 (297) T PRK10122 1 MTNL----KAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESL 76 (297) T ss_pred CCCC----EEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHCCCHHHHHHH T ss_conf 9975----499989867635685315789244628985369999999998799789999179711577746671888999 Q ss_pred HHHCCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCC Q ss_conf 654187000011344112485799843334233466542110112233331112786444544332---21122212234 Q gi|254780365|r 81 LRKRNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGEN---CMANMIKLYEK 157 (299) Q Consensus 81 l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~---~l~~li~~~~~ 157 (299) ++.+.+...+..+....+.+++|+|++|++|+||||||+||+++++|+||+|++||+++.....++ ++++|++.|++ T Consensus 77 l~~~~k~~~l~~~~~~~~~gv~i~yv~Q~eplGlGhAV~~A~~~v~depF~VllgD~l~~~~~~~~~~~~l~~~i~~~~~ 156 (297) T PRK10122 77 LEQRVKRQLLAEVQSICPPGVTIMNVRQGQPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNE 156 (297) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCEECCCCCCHHHHHHHHHHHHHHH T ss_conf 87611799999887524489649998779998678999988873499607997266142278641678889999999971 Q ss_pred CCCCEEEEEECCHHHCCCCCCEECCCC--CCCCCCEEEEECCCC-CCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCC Q ss_conf 322001124328544331331001333--333530000002577-88867342330202002723311000122666767 Q gi|254780365|r 158 EGANILAVSECDPQLSCKYGMVQVGKA--IDHQVFHISDMIEKP-DSSTFISNFFINGRYILHPDIFSILNDWKENEGKG 234 (299) Q Consensus 158 ~~~~vi~~~~v~~~~~~~yGvi~~~~~--~~~~~~~I~~~vEKP-~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~g 234 (299) ++++++++.+++ +++++|||+..+.. .+++..+|++++||| .++.+|||++++|+|+|+|+||++|++.+++. +| T Consensus 157 ~~~~~v~~~~v~-~d~~~yGVv~~~~~~d~~g~v~~I~~ivEKP~~P~~apSnlai~GrYi~~p~IF~~L~~~~p~~-~G 234 (297) T PRK10122 157 TGRSQVLAKRMP-GDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWPELERTQPGA-WG 234 (297) T ss_pred CCCCEEEEEEEC-CCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEECHHHHHHHHHCCCCC-CC T ss_conf 499599999957-8765157995565546677533356514677898667777337789972879999997089999-98 Q ss_pred CCCHHHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 500378999998319848999813786439988999999998631834669999999999886 Q gi|254780365|r 235 EIQLTDSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANIAFALARQDIRSDIETDLKTLVSA 297 (299) Q Consensus 235 E~~ltD~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~~~~~~~~~~~l~~~~~~ 297 (299) ||||||||+.|+++++|+|+.++|.|||||+|.+|++||+++||+|++++.+|++||+++++. T Consensus 235 EiqLTDAi~~l~~~~~v~~~~~~G~~~D~G~~~g~l~A~i~~al~~~~~~~~~~~~l~~~l~~ 297 (297) T PRK10122 235 RIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKGIEKLLSE 297 (297) T ss_pred EEEHHHHHHHHHCCCCEEEEEECCEEECCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHC T ss_conf 588999999997019879999551797579862699999999967987889999999999619 No 5 >cd02541 UGPase_prokaryotic Prokaryotic UGPase catalyses the synthesis of UDP-glucose. Prokaryotic UDP-Glucose Pyrophosphorylase (UGPase) catalyzes a reversible production of UDP-Glucose and pyrophosphate (PPi) from glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids , glycoproteins , and proteoglycans. UGPase is found in both prokaryotes and eukaryotes, although prokaryotic and eukaryotic forms of UGPase catalyze the same reaction, they share low sequence similarity. Probab=100.00 E-value=0 Score=488.10 Aligned_cols=267 Identities=49% Similarity=0.837 Sum_probs=244.3 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 58999568888334324408883335688362799999999978998899981355444101001174544456541870 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) |||||||||+||||||+|+.+||||+||+|||+|+|+++.|.++||++|+|+++|++++|++||+.+++++..+..+++. T Consensus 1 ~KAvILAgG~GTRL~PlT~~~PKpLlpi~gkPii~~~i~~l~~~Gi~~iii~~~~~~~~I~~~f~~~~~~~~~l~~~~k~ 80 (267) T cd02541 1 RKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPVIQYIVEEAVAAGIEDIIIVTGRGKRAIEDHFDRSYELEETLEKKGKT 80 (267) T ss_pred CEEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHCCCHHHHHHHHHCCCH T ss_conf 96999898774667842469984364899998699999999986998899991246666244506658899998526856 Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 00011344112485799843334233466542110112233331112786444544332211222122343220011243 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSE 167 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~ 167 (299) ..+.. ......++++.|+.|++|+||||||++|+++++|++|+|++||+++... ...+++|++.|++.+++++++.+ T Consensus 81 ~~~~~-~~~~~~~~~i~~~~e~~~lGtggai~~a~~~~~~~~flv~~gD~l~~~~--~~~l~~li~~~~~~~~~~~~~~~ 157 (267) T cd02541 81 DLLEE-VRIISDLANIHYVRQKEPLGLGHAVLCAKPFIGDEPFAVLLGDDLIDSK--EPCLKQLIEAYEKTGASVIAVEE 157 (267) T ss_pred HHHHH-HHCCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCEEEEECCEEEECC--CHHHHHHHHHHHHHCCEEEEEEE T ss_conf 55555-4224789559999778988767898721010499867999689888537--50668999999730872689999 Q ss_pred CCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 28544331331001333333530000002577888673423302020027233110001226667675003789999983 Q gi|254780365|r 168 CDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSE 247 (299) Q Consensus 168 v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~ 247 (299) ++++++++|||++++.. +++.++|++|+|||+++.++||++++|+|+|++++|++|++.+++. +||+++||+++.|++ T Consensus 158 v~~~~~~~yGvv~~~~~-~~~~~~i~~~~EKP~~~~~~s~l~~~GiYi~~~~i~~~i~~~~~~~-~~e~~l~d~i~~li~ 235 (267) T cd02541 158 VPPEDVSKYGIVKGEKI-DGDVFKVKGLVEKPKPEEAPSNLAIVGRYVLTPDIFDILENTKPGK-GGEIQLTDAIAKLLE 235 (267) T ss_pred ECCCCCCCCCEEEECCC-CCCEEEEEEEEECCCCCCCCCCEEEEEEEEECHHHHHHHHHCCCCC-CCCCCHHHHHHHHHH T ss_conf 68667873643896677-8706888876406887679853166344998899987897289999-983307799999985 Q ss_pred CCCEEEEEEEEEEEECCCHHHHHHHHHHHHHC Q ss_conf 19848999813786439988999999998631 Q gi|254780365|r 248 RHDFLAYHFKGHTYDCGSKKGFVLANIAFALA 279 (299) Q Consensus 248 ~~~v~a~~~~g~w~DiG~~~~yl~A~~~~al~ 279 (299) +++|++|.++|+|+|||||++|++||.++||+ T Consensus 236 ~~~v~~~~~~g~W~DiGtp~~y~~A~~~~~~~ 267 (267) T cd02541 236 EEPVYAYVFEGKRYDCGNKLGYLKATVEFALK 267 (267) T ss_pred CCCEEEEEECCEEEECCCHHHHHHHHHHHHCC T ss_conf 79969998598889579989999999998659 No 6 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=100.00 E-value=0 Score=466.99 Aligned_cols=232 Identities=28% Similarity=0.473 Sum_probs=208.7 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCC-CCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 899956888833432440888333568836279999999997899889998135-5444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGR-GKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~-~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) ||+|||||.||||||||-+.||+|+||+|||+|+|.|+.++.|||+||-||+|+ .+++|+.+.. T Consensus 1 KaliL~aG~GTRLRPLTFt~pK~LiPvAnKPi~~Yaie~~~~AGI~diGIvvg~~~~e~i~~~~g--------------- 65 (361) T TIGR01208 1 KALILAAGKGTRLRPLTFTRPKQLIPVANKPILQYAIEDLIEAGITDIGIVVGPETGEEIKEIVG--------------- 65 (361) T ss_pred CCEECCCCCCCCCCCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCEEEEEEECCCCCHHHEEEEC--------------- T ss_conf 92442587867677766578843066168732367688887469769999846988212202324--------------- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CEEEE Q ss_conf 0001134411248579984333423346654211011-223333111278644454433221122212234322-00112 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNI-IGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGA-NILAV 165 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~-i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~-~vi~~ 165 (299) .....+.+|+|+.|++|||.||||+.|++| ++|++|+|.+||||+... ++++++.|++.+. ..+++ T Consensus 66 -------~g~~fg~kityI~Q~~plGlAHAv~~A~~fGlgd~~FvvYLGDNl~~~g-----i~~fv~~F~~~~~da~ILL 133 (361) T TIGR01208 66 -------EGERFGAKITYIVQGEPLGLAHAVYVARDFGLGDEDFVVYLGDNLIQDG-----IKRFVKSFEEKDYDALILL 133 (361) T ss_pred -------CCCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCCEEEECCCCHHHHH-----HHHHHHHHCCCCCCCEEEC T ss_conf -------8830232898982278760467645788847899860797374210423-----7899885331480030110 Q ss_pred EECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECH--HHHHHHCCCCCCCCCCCCCHHHHHH Q ss_conf 43285443313310013333335300000025778886734233020200272--3311000122666767500378999 Q gi|254780365|r 166 SECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHP--DIFSILNDWKENEGKGEIQLTDSMR 243 (299) Q Consensus 166 ~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~--~i~~~L~~~~~~~~~gE~~ltD~i~ 243 (299) .+| +++++|||+.++.+.. +|.++||||+ .+|||||+.|+|.|.| .||+.++.++|+. |||+|+||+|+ T Consensus 134 ~~V--~dP~~FGVA~l~d~G~----~i~~LvEKPk--~PPSNLAvvGlY~F~pPe~if~~~~~~kPSw-RGElEITD~IQ 204 (361) T TIGR01208 134 KKV--EDPTAFGVAELEDDGK----RILKLVEKPK--EPPSNLAVVGLYMFRPPELIFEAIKNIKPSW-RGELEITDAIQ 204 (361) T ss_pred CCC--CCCCCCCEEEEECCCC----EEEEEEECCC--CCCCCCEEEEEEECCCHHHHHHHHHCCCCCC-CCCHHHHHHHH T ss_conf 438--8788455589924897----9999883386--5787701233453387779888985288877-66215775775 Q ss_pred HHHHCC-CEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 998319-848999813786439988999999998 Q gi|254780365|r 244 KLSERH-DFLAYHFKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 244 ~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~~~~ 276 (299) +|+++| +|..+.+.|+|.|+|+|+++|+||.-+ T Consensus 205 ~lIe~Gy~V~~~~v~GWWkDTGk~eDLL~AN~~i 238 (361) T TIGR01208 205 YLIEKGYKVGGSKVKGWWKDTGKPEDLLDANRLI 238 (361) T ss_pred HHHHCCCEEEEEEEEEEECCCCCCHHHHHHHHHH T ss_conf 6531583777799978870159802389999997 No 7 >cd02538 G1P_TT_short G1P_TT_short is the short form of glucose-1-phosphate thymidylyltransferase. This family is the short form of glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The homotetrameric, feedback inhibited short form is found in numerous bacterial species that produce dTDP-L-rhamnose. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. Probab=100.00 E-value=0 Score=399.07 Aligned_cols=232 Identities=22% Similarity=0.371 Sum_probs=194.9 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCC-CCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 8999568888334324408883335688362799999999978998899981355-444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRG-KGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~-k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) ||||||||+||||+|+|+.+||||+||+|||+|+|++++|..+||++|++|++++ .+++++||.. T Consensus 2 kavILAgG~GtRL~PlT~~~PKpLlpi~~kPli~~~l~~l~~~Gi~~i~iv~~~~~~~~~~~~~~~-------------- 67 (240) T cd02538 2 KGIILAGGSGTRLYPLTKVVSKQLLPVYDKPMIYYPLSTLMLAGIREILIISTPEDLPLFKELLGD-------------- 67 (240) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCEECCEEEHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHHHC-------------- T ss_conf 099988878667873237987432779999718999999998599623100234269999999744-------------- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEE Q ss_conf 00011344112485799843334233466542110112233331112786444544332211222122343220-01124 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAVS 166 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~~ 166 (299) ....+.++.|+.|++|+|||||+.+|++++++++|+|++||+++...+ +++|++.|.+.++. ++++. T Consensus 68 --------~~~~~~~i~y~~e~~~~Gt~~al~~a~~~~~~~~~~v~~gD~i~~~~~----~~~~l~~~~~~~~~~ti~~~ 135 (240) T cd02538 68 --------GSDLGIRITYAVQPKPGGLAQAFIIGEEFIGDDPVCLILGDNIFYGQG----LSPILQRAAAQKEGATVFGY 135 (240) T ss_pred --------CCCCCEEEEEEEEECCCCCCHHHHHHHHHCCCCCEEEECCCCEECCCC----HHHHHHHHHHCCCCCEEEEE T ss_conf --------763752799999862686313788888762898734443874010656----89999999966688478777 Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 32854433133100133333353000000257788867342330202002723311000122666767500378999998 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLS 246 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~ 246 (299) +++ ++++||++.++.. ++|.+|+|||+ .++||++++|+|+|++++|+++++.+++. +||+++||+++.++ T Consensus 136 ~v~--~~~~yGvv~~d~~-----~~v~~~~EKP~--~~~s~~~~~GiYi~~~~i~~~i~~~~~~~-~ge~~i~d~~~~~i 205 (240) T cd02538 136 EVN--DPERYGVVEFDEN-----GRVLSIEEKPK--KPKSNYAVTGLYFYDNDVFEIAKQLKPSA-RGELEITDVNNEYL 205 (240) T ss_pred EEC--CCCCCCCCCCCCC-----CCEEEEEECCC--CCCCCEEEEEEEEECHHHHHHHHHCCCCC-CCEEEHHHHHHHHH T ss_conf 723--6113672311478-----85789888999--99988799999998989999998589999-99075889999999 Q ss_pred HCCCEEEEE--EEEEEEECCCHHHHHHHHHHH Q ss_conf 319848999--813786439988999999998 Q gi|254780365|r 247 ERHDFLAYH--FKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 247 ~~~~v~a~~--~~g~w~DiG~~~~yl~A~~~~ 276 (299) +++++.+.. .+|+|+|||||++|++||-.+ T Consensus 206 ~~g~~~~~~~~~~~~W~DiGtpe~~~~An~~i 237 (240) T cd02538 206 EKGKLSVELLGRGFAWLDTGTHESLLEASNFV 237 (240) T ss_pred HCCCCEEEEECCCCEEEECCCHHHHHHHHHHH T ss_conf 85997399968998089799999999999889 No 8 >COG1209 RfbA dTDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=391.11 Aligned_cols=231 Identities=28% Similarity=0.503 Sum_probs=197.7 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCC-CCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 8999568888334324408883335688362799999999978998899981355-444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRG-KGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~-k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) ||||||||.||||+|+|...||+|+||.+||||+|.++.+..+||++|+||+++. ...++++ T Consensus 2 KgiILAgG~GTRL~PlT~~~~KqLlpV~~KPmi~y~l~~L~~aGI~dI~II~~~~~~~~~~~l----------------- 64 (286) T COG1209 2 KGVILAGGSGTRLRPLTRVVPKQLLPVYDKPMIYYPLETLMLAGIRDILIVVGPEDKPTFKEL----------------- 64 (286) T ss_pred CCEEECCCCCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHH----------------- T ss_conf 727963767544332313677533211686215857999997598569999668850566633----------------- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCEEEEE Q ss_conf 0001134411248579984333423346654211011223333111278644454433221122212234-322001124 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEK-EGANILAVS 166 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~-~~~~vi~~~ 166 (299) +.+....+.+|+|+.|++|+|.|||++.|++|++|++|+|++|||++.. . +++++..+.+ .....+.+. T Consensus 65 -----lGdgs~~gv~itY~~Q~~p~GlA~Av~~a~~fv~~~~f~l~LGDNi~~~-~----l~~~~~~~~~~~~ga~i~~~ 134 (286) T COG1209 65 -----LGDGSDFGVDITYAVQPEPDGLAHAVLIAEDFVGDDDFVLYLGDNIFQD-G----LSELLEHFAEEGSGATILLY 134 (286) T ss_pred -----HCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCCEECC-C----HHHHHHHHHCCCCCCEEEEE T ss_conf -----3671003863699965898768999998886458986599926844105-7----79999998524899489999 Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 32854433133100133333353000000257788867342330202002723311000122666767500378999998 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLS 246 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~ 246 (299) +|+ +++||||++++.+. +|.+++|||+.| +||||++|+|+|++++|+.++.++|+. +||++|||+++.++ T Consensus 135 ~V~--dP~rfGV~e~d~~~-----~v~~l~EKP~~P--~SNlAvtGlY~~d~~Vf~~~~~ikPS~-RGElEITd~i~~~i 204 (286) T COG1209 135 EVD--DPSRYGVVEFDEDG-----KVIGLEEKPKEP--KSNLAVTGLYFYDPSVFEAIKQIKPSA-RGELEITDAIDLYI 204 (286) T ss_pred ECC--CCCCCEEEEECCCC-----CEEEEEECCCCC--CCCEEEEEEEEECHHHHHHHHCCCCCC-CCCEEEHHHHHHHH T ss_conf 848--95304489986899-----687767889999--876268899970858999987189887-78567528899999 Q ss_pred HCC-CEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 319-848999813786439988999999998 Q gi|254780365|r 247 ERH-DFLAYHFKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 247 ~~~-~v~a~~~~g~w~DiG~~~~yl~A~~~~ 276 (299) ++| .+++....|.|.|+||+++|++|+..+ T Consensus 205 ~~G~~~~~~~~~G~WlDtGt~~slleA~~~i 235 (286) T COG1209 205 EKGYLVVAILIRGWWLDTGTPESLLEANNFV 235 (286) T ss_pred HCCCEEEEEECCCEEEECCCHHHHHHHHHHH T ss_conf 7696888987151688638701289999999 No 9 >cd06428 M1P_guanylylT_A_like_N N-terminal domain of M1P_guanylyl_A_ like proteins are likely to be a isoform of GDP-mannose pyrophosphorylase. N-terminal domain of the M1P-guanylyltransferase A-isoform like proteins: The proteins of this family are likely to be a isoform of GDP-mannose pyrophosphorylase. Their sequences are highly conserved with mannose-1-phosphate guanyltransferase, but generally about 40-60 bases longer. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repre Probab=100.00 E-value=0 Score=390.12 Aligned_cols=232 Identities=21% Similarity=0.350 Sum_probs=200.1 Q ss_pred EEEECCC--CCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHH-CCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCC Q ss_conf 9995688--8833432440888333568836279999999997-899889998135544410100117454445654187 Q gi|254780365|r 10 AVFPIAG--LGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALE-AGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNK 86 (299) Q Consensus 10 avIlAaG--~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~-~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~ 86 (299) ||||||| +||||+|+|+.+||||+||+|||+|+|++++|.+ +|+++|+++++|..+++.+|+.... T Consensus 1 AvILaGG~~~GTRL~plT~~~PKpllpv~gkPiI~~~i~~l~~~~gi~~i~i~~~~~~~~~~~~~~~~~----------- 69 (257) T cd06428 1 AVILVGGPQKGTRFRPLSLDVPKPLFPVAGKPMIHHHIEACAKVPDLKEVLLIGFYPESVFSDFISDAQ----------- 69 (257) T ss_pred CEEECCCCCCCCCCCHHHCCCCCCCCEECCEEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCC----------- T ss_conf 989788999973167433799844257999954199999999747984899980575799999874234----------- Q ss_pred CEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-E Q ss_conf 0000113441124857998433342334665421101122---33331112786444544332211222122343220-0 Q gi|254780365|r 87 KAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG---DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-I 162 (299) Q Consensus 87 ~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~---de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-v 162 (299) ...+.++.|+.|++|+||||||..+++++. ++.|+|++||+++..+ +++|+++|.++++. + T Consensus 70 ----------~~~~~~i~~~~e~~~lGtagal~~~~~~i~~~~~~~~lv~~gD~l~~~d-----l~~l~~~h~~~~~~~t 134 (257) T cd06428 70 ----------QEFNVPIRYLQEYKPLGTAGGLYHFRDQILAGNPSAFFVLNADVCCDFP-----LQELLEFHKKHGASGT 134 (257) T ss_pred ----------CCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCC-----HHHHHHHHHHCCCCEE T ss_conf ----------3369605998647666736789999999734899769998885003878-----9999999997799869 Q ss_pred EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCH---- Q ss_conf 1124328544331331001333333530000002577888673423302020027233110001226667675003---- Q gi|254780365|r 163 LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQL---- 238 (299) Q Consensus 163 i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~l---- 238 (299) +++.+++.+++++||++..+ .+.++|++|+|||+ .++||++|+|+|+|++++|++|++..++. +||+++ T Consensus 135 i~~~~v~~~~~s~fGvvv~~----~~~~~i~~f~EKP~--~~~s~lin~G~Yvf~~~if~~i~~~~~~~-~~e~~~~d~~ 207 (257) T cd06428 135 ILGTEASREQASNYGCIVED----PSTGEVLHYVEKPE--TFVSDLINCGVYLFSPEIFDTIKKAFQSR-QQEAQLGDDN 207 (257) T ss_pred EEEEECCCCCCCCEEEEEEE----CCCCCEEEEEECCC--CCCCCEEEEEEEEECHHHHHHHHHCCCCC-CCCCCCCCCC T ss_conf 99998782136540289997----89995999998899--98888089999999889999997518454-6741013433 Q ss_pred ------------HHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHH Q ss_conf ------------789999983198489998137864399889999999 Q gi|254780365|r 239 ------------TDSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANI 274 (299) Q Consensus 239 ------------tD~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~ 274 (299) +|+++.|+++++++||.++|+|.|||||++|++||. T Consensus 208 ~~~~~~~~~~~e~dvlp~l~~~g~l~~y~~~g~W~DiGTp~~~~~AN~ 255 (257) T cd06428 208 NREGRAEVIRLEQDVLTPLAGSGKLYVYKTDDFWSQIKTAGSAIYANR 255 (257) T ss_pred CCCCCCCCCEECHHHHHHHHHCCCEEEEECCCEEECCCCHHHHHHHHC T ss_conf 334666444000326799985798899938983875989899999874 No 10 >cd06425 M1P_guanylylT_B_like_N N-terminal domain of the M1P-guanylyltransferase B-isoform like proteins. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain and a C-terminal Lefthanded-beta-Helix fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repression of GDP-mannose pyrophosphorylase in yeast leads to phenotypes, such as cell lysis, defective cell wall, and failure of polarized growth and cell separation. Probab=100.00 E-value=0 Score=389.78 Aligned_cols=228 Identities=24% Similarity=0.429 Sum_probs=194.1 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||||||||+||||+|+|+.+||||+||+|||+|+|+++.|.++||++|+|+++|++++|++||..... T Consensus 2 kavIlagG~GtRl~plT~~~PKpLlpv~gkpli~~~i~~l~~~gi~~iii~~~y~~~~i~~~~~~~~~------------ 69 (233) T cd06425 2 KALILVGGYGTRLRPLTLTVPKPLVEFCNKPMIEHQIEALAKAGVKEIILAVNYRPEDMVPFLKEYEK------------ 69 (233) T ss_pred EEEEECCCCCCCCCCHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHH------------ T ss_conf 69998897765788013299830108999869999999999859973210112357799999987777------------ Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEE Q ss_conf 00113441124857998433342334665421101122--33331112786444544332211222122343220-0112 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG--DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAV 165 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~--de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~ 165 (299) ..+.+|.|+.|++|+||||||++|+++++ +++|+|++||+++..+ +++|+++|.++++. ++++ T Consensus 70 ---------~~~~~i~~~~e~~~lGtagal~~a~~~i~~~~~~flVlngD~~~~~~-----l~~~~~~h~~~~~~~ti~~ 135 (233) T cd06425 70 ---------KLGIKITFSIETEPLGTAGPLALARDLLGDDDEPFFVLNSDVICDFP-----LAELLDFHKKHGAEGTILV 135 (233) T ss_pred ---------HCCCEEEECCCCCCCCCHHHHHHHHHHHCCCCCCEEEECCCHHCCHH-----HHHHHHHHHHCCCCEEEEE T ss_conf ---------43960565225766672169999999846899868997783210203-----6999999985589849999 Q ss_pred EECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHH Q ss_conf 43285443313310013333335300000025778886734233020200272331100012266676750037899999 Q gi|254780365|r 166 SECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKL 245 (299) Q Consensus 166 ~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l 245 (299) .++ +++++||++..+.. .++|.+|+|||+ .++||++++|+|+|++++|+.++..+. +++ +|+++.+ T Consensus 136 ~~~--~~~~~yGv~~~d~~----~g~v~~f~EKP~--~~~~~~i~~Giyi~~~~i~~~i~~~~~-----~~~-~di~~~l 201 (233) T cd06425 136 TKV--EDPSKYGVVVHDEN----TGRIERFVEKPK--VFVGNKINAGIYILNPSVLDRIPLRPT-----SIE-KEIFPKM 201 (233) T ss_pred EEC--CCHHHCCEEEECCC----CCCCCCEEECCC--CCCCCCEEEEEEEECHHHHHHCCCCCC-----CCH-HHHHHHH T ss_conf 976--78455680886687----674222037788--888987036899998999965666888-----746-9999999 Q ss_pred HHCCCEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 8319848999813786439988999999998 Q gi|254780365|r 246 SERHDFLAYHFKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 246 ~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~ 276 (299) ++++++++|.++|+|+|||||++|++||..+ T Consensus 202 ~~~~~v~~~~~~g~W~DiGtp~~~l~A~~~~ 232 (233) T cd06425 202 ASEGQLYAYELPGFWMDIGQPKDFLKGMSLY 232 (233) T ss_pred HHCCCEEEEEECCEEEECCCHHHHHHHHHHH T ss_conf 8479979998599999497989999999986 No 11 >cd06915 NTP_transferase_WcbM_like WcbM_like is a subfamily of nucleotidyl transferases. WcbM protein of Burkholderia mallei is involved in the biosynthesis, export or translocation of capsule. It is a subfamily of nucleotidyl transferases that transfer nucleotides onto phosphosugars. Probab=100.00 E-value=0 Score=376.42 Aligned_cols=222 Identities=21% Similarity=0.337 Sum_probs=193.4 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEE Q ss_conf 99956888833432440888333568836279999999997899889998135544410100117454445654187000 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAE 89 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~ 89 (299) |||||||+||||+|+|+.+||||+||+|||+|+|+|+.|.++||++|++++||+.++|++||.... T Consensus 1 AvIlAaG~GtRl~p~T~~~PKpLl~i~gkpli~~~l~~l~~~gi~~iii~~~~~~~~i~~~~~~~~-------------- 66 (223) T cd06915 1 AVILAGGLGTRLRSVVKDLPKPLAPVAGRPFLEYLLEYLARQGISRIVLSVGYLAEQIEEYFGDGY-------------- 66 (223) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCC-------------- T ss_conf 989478788678811079983112899998999999999975997788730102343344331023-------------- Q ss_pred ECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEEEC Q ss_conf 011344112485799843334233466542110112233331112786444544332211222122343220-0112432 Q gi|254780365|r 90 LTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAVSEC 168 (299) Q Consensus 90 ~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~~~v 168 (299) ....++.|+.|++|+|||||+++|++++++++|+|++||+++..+ +++|+++|.+.++. ++++.++ T Consensus 67 --------~~~~~i~~~~e~~~lGt~gal~~a~~~i~~~~flv~~gD~~~~~~-----l~~~~~~~~~~~~~~~l~~~~~ 133 (223) T cd06915 67 --------RGGIRIYYVIEPEPLGTGGAIKNALPKLPEDQFLVLNGDTYFDVD-----LLALLAALRASGADATMALRRV 133 (223) T ss_pred --------CCCCCCEEEEEEEECCHHHHHHHHHHHCCCCCEEEECCCEEECCC-----HHHHHHHHHHCCCCEEEEEEEC T ss_conf --------457761478864236468899888863699988998277112677-----8999999985799789999989 Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 85443313310013333335300000025778886734233020200272331100012266676750037899999831 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSER 248 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~ 248 (299) +++++||++.++.. .+|.+++|||..+ +|+++++|+|+|++++|+.+++...+ ..+|+++.|+++ T Consensus 134 --~~~~~~g~v~~d~~-----~~i~~~~eK~~~~--~~~~i~~Giyi~~~~i~~~~~~~~~~------~~~d~~~~li~~ 198 (223) T cd06915 134 --PDASRYGNVTVDGD-----GRVIAFVEKGPGA--APGLINGGVYLLRKEILAEIPADAFS------LEADVLPALVKR 198 (223) T ss_pred --CCCCCCCCEEECCC-----CCEEEEEECCCCC--CCCEEEEEEEEECHHHHHHHHCCCCC------CHHHHHHHHHHC T ss_conf --98302785898797-----3142347548987--66824463899989999741425997------279999999834 Q ss_pred CCEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 9848999813786439988999999 Q gi|254780365|r 249 HDFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 249 ~~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) +++++|.++|+|+|||||++|++|+ T Consensus 199 ~~v~~~~~~g~w~DiGt~e~~~~A~ 223 (223) T cd06915 199 GRLYGFEVDGYFIDIGIPEDYARAQ 223 (223) T ss_pred CCEEEEEECCEEEECCCHHHHHHHC T ss_conf 9979999799199798989999739 No 12 >cd04189 G1P_TT_long G1P_TT_long represents the long form of glucose-1-phosphate thymidylyltransferase. This family is the long form of Glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced.The long from enzymes also have a left-handed parallel helix domain at the c-terminus, whereas, th eshort form enzymes do not have this domain. The homotetrameric, feedback inhibited short form is found in Probab=100.00 E-value=0 Score=370.61 Aligned_cols=230 Identities=28% Similarity=0.459 Sum_probs=200.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||||||||+||||+|+|..+||||+||+|||+|+|+++.+.++||++|++|+||..++|.+||.... T Consensus 2 kaiILAaG~GtRl~p~t~~~PK~Ll~i~gkpli~~~i~~l~~~gi~~Iiiv~~~~~~~i~~~~~~~~------------- 68 (236) T cd04189 2 KGLILAGGKGTRLRPLTYTRPKQLIPVAGKPIIQYAIEDLREAGIEDIGIVVGPTGEEIKEALGDGS------------- 68 (236) T ss_pred EEEEECCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCCEEEEEEECCHHHCCCCCCCCC------------- T ss_conf 0999888687688700079980203899860999999999986995899985414122012234666------------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEEE Q ss_conf 0011344112485799843334233466542110112233331112786444544332211222122343220-011243 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAVSE 167 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~~~ 167 (299) ..+.++.|+.|++|+|||+|+++|++++++++|+|++||+++..+ +.++++.|.+.++. ++++.+ T Consensus 69 ---------~~~~~i~~v~~~~~~Gt~~al~~a~~~l~~~~~iv~~gD~~~~~~-----~~~~~~~~~~~~~~~~i~~~~ 134 (236) T cd04189 69 ---------RFGVRITYILQEEPLGLAHAVLAARDFLGDEPFVVYLGDNLIQEG-----ISPLVRDFLEEDADASILLAE 134 (236) T ss_pred ---------CCCCCEEEEEECCCCCHHHHHHHHHHHCCCCCEEEEECCCEECCC-----HHHHHHHHHHCCCCCEEEEEE T ss_conf ---------478754898621553245787776651489975897279465468-----999999998627861147898 Q ss_pred CCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 28544331331001333333530000002577888673423302020027233110001226667675003789999983 Q gi|254780365|r 168 CDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSE 247 (299) Q Consensus 168 v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~ 247 (299) + +++++||++..+.. +|.+++|||+.+ .|+++++|+|+|++++|+.+++..++. +||+++||+++.|++ T Consensus 135 ~--~~~~~~g~~~~~~~------~v~~i~eK~~~~--~~~~~~~Giy~f~~~i~~~~~~~~~~~-~~e~~i~d~i~~li~ 203 (236) T cd04189 135 V--EDPRRFGVAVVDDG------RIVRLVEKPKEP--PSNLALVGVYAFTPAIFDAISRLKPSW-RGELEITDAIQWLID 203 (236) T ss_pred E--ECCCCCCEEEECCC------EEEECCCCCCCC--CCCEEEEEEEEECHHHHHHHHHCCCCC-CCCEEHHHHHHHHHH T ss_conf 6--24667628997362------354313258898--767599999999989999998479987-881338999999998 Q ss_pred CC-CEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 19-848999813786439988999999998 Q gi|254780365|r 248 RH-DFLAYHFKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 248 ~~-~v~a~~~~g~w~DiG~~~~yl~A~~~~ 276 (299) +| +|.++.++|.|+|||||++|.+|+..+ T Consensus 204 ~g~~v~~~~~~g~W~Digtpedl~~A~~~l 233 (236) T cd04189 204 RGRRVGYSIVTGWWKDTGTPEDLLEANRLL 233 (236) T ss_pred CCCCEEEEEECCEEEECCCHHHHHHHHHHH T ss_conf 799889997199799798999999998998 No 13 >pfam00483 NTP_transferase Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars. Probab=100.00 E-value=0 Score=375.03 Aligned_cols=235 Identities=23% Similarity=0.374 Sum_probs=192.9 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECC-CCCCCCHHHCCCCHHHHHHHHHCCC Q ss_conf 899956888833432440888333568836-27999999999789988999813-5544410100117454445654187 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTG-RGKGLIKDYFDIQFELEQSLRKRNK 86 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~-~~k~~I~~~f~~~~~~~~~l~~~~~ 86 (299) ||||||||+||||||+|+.+||||+||+|+ |+|+|+++++..+|++++++|++ +..++|++||... T Consensus 1 KavIlAgG~GtRl~PlT~~~pKpllpv~~~~pli~~~l~~l~~~g~~~~~~v~~~~~~~~i~~~~~~~------------ 68 (247) T pfam00483 1 KAIILAGGSGTRLWPLTRTLAKPLVPVLDKYPMIQYTLSRLMNAGIREPIVICTQEHRFLVAEQLGDG------------ 68 (247) T ss_pred CEEEECCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHCCCCHHEEECCHHHHHHHHHHHHHC------------ T ss_conf 97998897777788010688988518289978999999999865995203313367788899998633------------ Q ss_pred CEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE- Q ss_conf 000011344112485799843334233466542110112233---3311127864445443322112221223432200- Q gi|254780365|r 87 KAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDN---PFALLLPDMIMSPLEGENCMANMIKLYEKEGANI- 162 (299) Q Consensus 87 ~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de---~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v- 162 (299) ...+.++.|+.|++|+||||||.+|+++++++ +|+|++||+++..+ +++++++|.+.++.+ T Consensus 69 ----------~~~~~~i~~~~e~~~~Gta~ai~~a~~~l~~~~~~~~lv~~~D~~~~~~-----~~~~l~~h~~~~~~~~ 133 (247) T pfam00483 69 ----------SKFGLQVTYALQPEPRGTAPAVALAADFLGDDDPELVLVLGGDHIYRMD-----FEEAVQKARAKAADGT 133 (247) T ss_pred ----------CCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCHH-----HHHHHHHHHHHCCCCE T ss_conf ----------6677437774312777705689999986434787627997785334440-----9999999998479986 Q ss_pred EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHH Q ss_conf 11243285443313310013333335300000025778886734233020200272331100012266676750037899 Q gi|254780365|r 163 LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSM 242 (299) Q Consensus 163 i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i 242 (299) +++..++.+++++||++.++.. ++|.+|+|||+.+.+ ||++++|+|+|++++|+.+.+......++|+++||++ T Consensus 134 ~~~~~~~~~~~~~yGvv~~d~~-----~~V~~~~EKP~~~~~-~~~~~~G~Y~~~~~i~~~~~~~~~~~~~~e~~~~d~i 207 (247) T pfam00483 134 VTFGIVPVEDPTGYGVIEFDEN-----GRVIRFVEKPDLPKA-SNYASMGIYFFNSGVFLFLAKYLKELARGEDEITDAI 207 (247) T ss_pred EEEEEECCCCCCCCCEEEECCC-----CCCCCEEECCCCHHH-HHHEEEEEEEECHHHHHHHHHHCCCCCCCCCCHHHHH T ss_conf 8885565557634514898886-----650304677996244-3425611499889999999965777767994499999 Q ss_pred HHHHHCCC-EEEEEEEE--EEEECCCHHHHHHHHHHH Q ss_conf 99983198-48999813--786439988999999998 Q gi|254780365|r 243 RKLSERHD-FLAYHFKG--HTYDCGSKKGFVLANIAF 276 (299) Q Consensus 243 ~~l~~~~~-v~a~~~~g--~w~DiG~~~~yl~A~~~~ 276 (299) ++++++++ +++|.+.| +|+|||||++|++||+++ T Consensus 208 ~~~i~~g~~~~~~~~~~~~~W~DiGt~e~~~~an~~l 244 (247) T pfam00483 208 PAALEDGDLDLAFLFRGYAYWSDVGTWDSLWEANLKL 244 (247) T ss_pred HHHHHCCCCEEEEEECCCEEEEECCCHHHHHHHHHHH T ss_conf 9999879963999607988468698999999999998 No 14 >cd06422 NTP_transferase_like_1 NTP_transferase_like_1 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=100.00 E-value=0 Score=367.43 Aligned_cols=219 Identities=19% Similarity=0.263 Sum_probs=184.7 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||||||||+||||+|+|+.+||||+||+|||+|+|+|+++.++||++|+|+++|.+++|++||... T Consensus 1 KAvIlAgG~GtRl~plT~~~PKpllpv~~kpii~~~i~~l~~~gi~~iii~~~y~~~~i~~~~~~~-------------- 66 (221) T cd06422 1 KAMILAAGLGTRMRPLTDTRPKPLVPVAGKPLIDHALDRLAAAGIRRIVVNTHHLADQIEAHLGDS-------------- 66 (221) T ss_pred CEEEECCCCCCCCCHHHCCCCHHEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCC-------------- T ss_conf 989988847668874457998023799998999999999998299679983253278899987225-------------- Q ss_pred EECCCCCCCCCCCEEEECCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CEEEEE Q ss_conf 00113441124857998433-3423346654211011223333111278644454433221122212234322-001124 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQ-YERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGA-NILAVS 166 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q-~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~-~vi~~~ 166 (299) ..+.++.|+.| ++|+||||||.+|++++++++|+|++||+++..+ +.+|+++|..+.+ .++++. T Consensus 67 ---------~~~~~i~~~~e~~~~lGT~~ai~~a~~~~~~d~flv~~gD~l~~~~-----~~~~~~~~~~~~~~~~~~~~ 132 (221) T cd06422 67 ---------RFGLRITISDEPDELLETGGGIKKALPLLGDEPFLVVNGDILWDGD-----LAPLLLLHAWRMDALLLLLP 132 (221) T ss_pred ---------CCCCCEEEEEECCCCCCCCHHHHHHHHHCCCCCEEEEECCEEECCC-----HHHHHHHHHHHCCCCEEEEE T ss_conf ---------5787379974033236650588888863499868998067578189-----89999999970689779999 Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 32854433133100133333353000000257788867342330202002723311000122666767500378999998 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLS 246 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~ 246 (299) .++..+.++||++..+... ++. +||+. .+|+++++|+|+|++++|+.++ +||+++||+++.|+ T Consensus 133 ~~~~~~~~~~g~~~~~~~~-----~i~---~~~~~--~~~~~~~~G~yif~~~if~~i~-------~~~~~~~d~~~~li 195 (221) T cd06422 133 LVRNPGHNGVGDFSLDADG-----RLR---RGGGG--AVAPFTFTGIQILSPELFAGIP-------PGKFSLNPLWDRAI 195 (221) T ss_pred EECCCCCCCCCEEEECCCE-----EEE---ECCCC--CCCEEEEEEEEEEEHHHHHHCC-------CCCCCHHHHHHHHH T ss_conf 8678765575379876865-----899---77888--7742788779998189995478-------88775589999999 Q ss_pred HCCCEEEEEEEEEEEECCCHHHHHHH Q ss_conf 31984899981378643998899999 Q gi|254780365|r 247 ERHDFLAYHFKGHTYDCGSKKGFVLA 272 (299) Q Consensus 247 ~~~~v~a~~~~g~w~DiG~~~~yl~A 272 (299) ++++++++.++|+|+|||||++|++| T Consensus 196 ~~~~v~~~~~~g~w~DiGtpe~~~~A 221 (221) T cd06422 196 AAGRLFGLVYDGLWFDVGTPERLLAA 221 (221) T ss_pred HCCCEEEEECCCEEEECCCHHHHHHC T ss_conf 66984999729979949999999749 No 15 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=366.02 Aligned_cols=245 Identities=19% Similarity=0.239 Sum_probs=199.6 Q ss_pred CCCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHH Q ss_conf 7888545899956888833432440888333568836-279999999997899889998135544410100117454445 Q gi|254780365|r 2 GSLKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQS 80 (299) Q Consensus 2 ~~m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~ 80 (299) |.| ||||||||+||||||+|..+||||+||+|| |||+|++++|.++||++++++++|+.+.|.+||..+..+.- T Consensus 2 ~~M----kAvILAgG~GtRlrPLT~~rPK~llPv~gk~~lId~~L~~~~~~gI~~v~v~~~~~~~~i~~h~g~g~~w~l- 76 (381) T PRK05293 2 KEM----LAMILAGGQGTRLGKLTKNIAKPAVPFGGKYRIIDFTLSNCANSGIYTVGVLTQYQPLELNNHIGIGSPWDL- 76 (381) T ss_pred CCC----EEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCCCCCC- T ss_conf 850----999978888767254458974203377891512899999999869999999868997999998717650364- Q ss_pred HHHCCCCEEECCCCCCCCCCCEE--EECCCCCC---CCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 65418700001134411248579--98433342---3346654211011223---3331112786444544332211222 Q gi|254780365|r 81 LRKRNKKAELTLLAESIPSIGNA--VFTWQYER---KGLGHAVWCARNIIGD---NPFALLLPDMIMSPLEGENCMANMI 152 (299) Q Consensus 81 l~~~~~~~~~~~~~~~~~~~~~i--~~v~Q~~p---~Gtg~Ai~~a~~~i~d---e~flv~~gD~i~~~~~~~~~l~~li 152 (299) ..+ ..+..+ .|..|.++ +|||||++++++++.+ +.|+|++||+++..+ +++|+ T Consensus 77 ---~~~-----------~~g~~il~~~~~~~~~~~~~Gta~al~~~~~~i~~~~~e~vlv~~GD~i~~~d-----l~~ll 137 (381) T PRK05293 77 ---DRK-----------NGGVTILPPYSESSGGKWYKGTAHAIYQNIEYIDQYDPEYVLILSGDHIYKMD-----YDKML 137 (381) T ss_pred ---CCC-----------CCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCEEECCC-----HHHHH T ss_conf ---344-----------48779834300146776667428999999998644899989997699886789-----99999 Q ss_pred CCCCCCCCC-EEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHH-HHHCCCCCC Q ss_conf 122343220-011243285443313310013333335300000025778886734233020200272331-100012266 Q gi|254780365|r 153 KLYEKEGAN-ILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIF-SILNDWKEN 230 (299) Q Consensus 153 ~~~~~~~~~-vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~-~~L~~~~~~ 230 (299) ++|+++++. ++++.+++.+++++|||+..+.. .+|++|+|||+.+ .||++++|+|+|++++| ++|.+..+. T Consensus 138 ~~H~~~~ad~Ti~~~~v~~~~~~~yGvv~~d~~-----g~I~~~~EKp~~~--~s~l~n~GiYif~~~~L~~~l~~~~~~ 210 (381) T PRK05293 138 DYHKEKEADVTIAVIEVPWEEASRFGIMNTDEE-----MRIVEFEEKPKFP--KSNLASMGIYIFNWKRLKEYLEEDEKN 210 (381) T ss_pred HHHHHCCCCCCEEEEEECCCCCCCCEEEEECCC-----CCEEEEEECCCCC--CCCCCCCCEEEECHHHHHHHHHHHCCC T ss_conf 999864666503789805126441608998799-----9799999788998--766543316995699999999875138 Q ss_pred CCCCCCCHHHHHHHHHHCC-CEEEEEEEEEEEECCCHHHHHHHHHHHH Q ss_conf 6767500378999998319-8489998137864399889999999986 Q gi|254780365|r 231 EGKGEIQLTDSMRKLSERH-DFLAYHFKGHTYDCGSKKGFVLANIAFA 277 (299) Q Consensus 231 ~~~gE~~ltD~i~~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~~~~a 277 (299) ....+...+|+++.|++++ ++++|.++|+|.|+|+|++|++||+++. T Consensus 211 ~~~~~~~~~d~i~~ll~~~~~v~~y~~~g~w~dig~~~~~~~an~~lL 258 (381) T PRK05293 211 PNSSHDFGKDVIPLYLEEGEKLYAYPFEGYWKDVGTIESLWEANMELL 258 (381) T ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEEECCEEEECCCHHHHHHHHHHHH T ss_conf 863346689999999977996899985586888898899999999986 No 16 >cd02524 G1P_cytidylyltransferase G1P_cytidylyltransferase catalyzes the production of CDP-D-Glucose. Alpha-D-Glucose-1-phosphate Cytidylyltransferase catalyzes the production of CDP-D-Glucose from alpha-D-Glucose-1-phosphate and MgCTP as substrate. CDP-D-Glucose is the precursor for synthesizing four of the five naturally occurring 3,6-dideoxy sugars-abequose (3,6-dideoxy-D-Xylo-hexose), ascarylose (3,6-dideoxy-L-arabino-hexose), paratose (3,6-dideoxy-D-ribohexose), and tyvelose (3,6-dideoxy-D-arabino-hexose. Deoxysugars are ubiquitous in nature where they function in a variety of biological processes, including cell adhesion, immune response, determination of ABO blood groups, fertilization, antibiotic function, and microbial pathogenicity. Probab=100.00 E-value=0 Score=363.14 Aligned_cols=244 Identities=18% Similarity=0.279 Sum_probs=194.7 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEE Q ss_conf 99956888833432440888333568836279999999997899889998135544410100117454445654187000 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAE 89 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~ 89 (299) .||||||+||||+|+|..+||||+||+|||+|+|+++.+.++||++|++++||+.++|++||.........+........ T Consensus 1 VvILaGG~GtRL~pLT~~~PKpLvpi~gkPii~~ii~~l~~~gi~~iil~~gy~~~~i~~~f~~~~~~~~~~~~~~~~~~ 80 (253) T cd02524 1 VVILAGGLGTRLSEETELKPKPMVEIGGRPILWHIMKIYSHYGHNDFILCLGYKGHVIKEYFLNYFLHNSDVTIDLGTNR 80 (253) T ss_pred CEEECCCCCCCCCHHHCCCCCCCEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCCE T ss_conf 98934787554143127998003189999899999999998699879995322511379998777761585589933772 Q ss_pred ECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 01134411248579984333423346654211011223-33311127864445443322112221223432200112432 Q gi|254780365|r 90 LTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGD-NPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSEC 168 (299) Q Consensus 90 ~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~d-e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v 168 (299) +. +........+|.++.|.++.|||+|+..|++++++ |+|+|++||++++.+ +++|+++|.++++.+.. ..+ T Consensus 81 ~~-~~~~~~~~~~i~~~~~~~~~~tgg~l~~~~~~l~~~e~flv~nGD~l~d~d-----l~~l~~~h~~~~~~~tl-~~v 153 (253) T cd02524 81 IE-LHNSDIEDWKVTLVDTGLNTMTGGRLKRVRRYLGDDETFMLTYGDGVSDVN-----INALIEFHRSHGKLATV-TAV 153 (253) T ss_pred EE-EECCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC-----HHHHHHHHHHCCCEEEE-EEC T ss_conf 79-833667786058985366550338999999874899739999155234368-----79999999866985999-965 Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 85443313310013333335300000025778886734233020200272331100012266676750037899999831 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSER 248 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~ 248 (299) +.+++||++.++.. .+|.+|+|||..+ ++++|+|+|+|++++|+++.....+ + -+|+++.|+++ T Consensus 154 --~~~~~yG~v~~d~~-----~~v~~f~EKP~~~---~~~in~GiYv~~~~if~~i~~~~~~-----~-e~d~~p~li~~ 217 (253) T cd02524 154 --HPPGRFGELDLDDD-----GQVTSFTEKPQGD---GGWINGGFFVLEPEVFDYIDGDDTV-----F-EREPLERLAKD 217 (253) T ss_pred --CCCCCCCEEEECCC-----CEEEEEEECCCCC---CCCCEEEEEEECHHHHHHCCCCCCC-----H-HHHHHHHHHHC T ss_conf --88888867999899-----8599987378887---7742149999889999546678874-----2-67899999966 Q ss_pred CCEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 9848999813786439988999999998 Q gi|254780365|r 249 HDFLAYHFKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 249 ~~v~a~~~~g~w~DiG~~~~yl~A~~~~ 276 (299) +++++|.++|+|+|||||++|.+||-.+ T Consensus 218 g~l~~y~~~g~W~DigT~~d~~~l~~~~ 245 (253) T cd02524 218 GELMAYKHTGFWQCMDTLRDKQTLEELW 245 (253) T ss_pred CCEEEEECCCEEECCCCHHHHHHHHHHH T ss_conf 9979996698881899979999999999 No 17 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=351.85 Aligned_cols=230 Identities=24% Similarity=0.406 Sum_probs=195.1 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||||||||+||||||+|..+||||+||+|||+|+|+++.|.++|+++|+++++|.+++|++||... T Consensus 3 kavILagG~GtRLrPlT~~~PKPllpI~gkPii~~~l~~L~~~Gv~eivi~~~y~~~~i~~~~~d~-------------- 68 (358) T COG1208 3 KAVILAGGYGTRLRPLTDDRPKPLLPIAGKPLIEYVLEALAAAGVEEIVLVVGYLGEQIEEYFGDG-------------- 68 (358) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHC-------------- T ss_conf 599994665666355448997446579986459999999998799789998774689999998626-------------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEEE Q ss_conf 0011344112485799843334233466542110112233331112786444544332211222122343220-011243 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAVSE 167 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~~~ 167 (299) ...+..|.|+.|++|+|||+|++.|.+++++++|+|++||.++..+ ++.|+++|+++++. ++.+.. T Consensus 69 --------~~~~~~I~y~~e~~~lGTag~l~~a~~~l~~~~f~v~~GDv~~~~d-----l~~l~~~~~~~~~~~~~~~~~ 135 (358) T COG1208 69 --------EGLGVRITYVVEKEPLGTAGALKNALDLLGGDDFLVLNGDVLTDLD-----LSELLEFHKKKGALATIALTR 135 (358) T ss_pred --------CCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCC-----HHHHHHHHHHCCCCCEEEEEE T ss_conf --------3569865997168766648999999886079978999788622468-----899999987356642799875 Q ss_pred CCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 28544331331001333333530000002577888673423302020027233110001226667675003789999983 Q gi|254780365|r 168 CDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSE 247 (299) Q Consensus 168 v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~ 247 (299) ++ +.++||++..+... .+|.+|.|||..+.++|+++|+|+|+|++++|+++....+.. ++- |.++.|++ T Consensus 136 ~~--~~~~~Gvv~~~~~~----~~v~~f~ekp~~~~~~~~~in~Giyi~~~~v~~~i~~~~~~~----~~~-~~~~~l~~ 204 (358) T COG1208 136 VL--DPSEFGVVETDDGD----GRVVEFREKPGPEEPPSNLINAGIYIFDPEVFDYIEKGERFD----FEE-ELLPALAA 204 (358) T ss_pred CC--CCCCCCEEEECCCC----CEEEEEEECCCCCCCCCCEEEEEEEEECHHHHCCCCCCCCCC----CHH-HHHHHHHH T ss_conf 27--76558639975998----489889972667678886699999998978943013357432----147-78999987 Q ss_pred CCC-EEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 198-48999813786439988999999998 Q gi|254780365|r 248 RHD-FLAYHFKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 248 ~~~-v~a~~~~g~w~DiG~~~~yl~A~~~~ 276 (299) ++. +++|.++|+|+|+|+|++|++|+-.+ T Consensus 205 ~~~~v~~~~~~g~W~dig~p~d~~~a~~~~ 234 (358) T COG1208 205 KGEDVYGYVFEGYWLDIGTPEDLLEANELL 234 (358) T ss_pred CCCCEEEEEECCEEEECCCHHHHHHHHHHH T ss_conf 699569997578799699989999999999 No 18 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=100.00 E-value=0 Score=355.98 Aligned_cols=252 Identities=20% Similarity=0.308 Sum_probs=209.6 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCC--CCHHHHHHHHHCC Q ss_conf 899956888833432440888333568836-2799999999978998899981355444101001--1745444565418 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFD--IQFELEQSLRKRN 85 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~--~~~~~~~~l~~~~ 85 (299) =|+|||||+||||+|||+.+.||-||++|| -|||+.|.+|++|||.+|+|+|+|+...+..|.. +++.+...+ T Consensus 2 la~iLAGG~GsRL~pLT~~rAKPAVpFGGkYRiIDF~LSNc~NSGi~ri~VLTQY~~~sL~~Hi~iG~~W~~~~~~---- 77 (421) T TIGR02091 2 LAMILAGGQGSRLSPLTKRRAKPAVPFGGKYRIIDFALSNCINSGINRIGVLTQYKSHSLNRHIQIGRGWDFDGPE---- 77 (421) T ss_pred EEEEECCCCCCCCCHHHHHHCCCCCCCCCCEEEECCCHHHHHHCCCCEEEEEEECCHHHHHHHHHCCCCCCCCCCC---- T ss_conf 7889438887546133465257765556301452100135554154457888505167888884068765677442---- Q ss_pred CCEEECCCCCCCC--CCCEEEECCCCCCCCCCCCCCCCCCCCCC-C---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7000011344112--48579984333423346654211011223-3---3311127864445443322112221223432 Q gi|254780365|r 86 KKAELTLLAESIP--SIGNAVFTWQYERKGLGHAVWCARNIIGD-N---PFALLLPDMIMSPLEGENCMANMIKLYEKEG 159 (299) Q Consensus 86 ~~~~~~~~~~~~~--~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~d-e---~flv~~gD~i~~~~~~~~~l~~li~~~~~~~ 159 (299) .+..++.+-.... ...+-+| .||||||++-..++.+ . .+|++.|||||..+. ++|++.|.+++ T Consensus 78 ~~~fv~~Lpa~~~~~~~~~~WY------~GTADAvYQNl~~i~~~~~p~yvlILsGDHiYkMDy-----~~mL~~H~e~~ 146 (421) T TIGR02091 78 IGGFVELLPAQQRFEESGKDWY------QGTADAVYQNLDLIEETYDPEYVLILSGDHIYKMDY-----EKMLDYHIEKG 146 (421) T ss_pred CCCCEEEECCCEEECCCCCCCE------EECHHHHHHHHHHHHCCCCCCEEEEECCCCEECCCH-----HHHHHHHHHCC T ss_conf 4762676437511126788532------112378899999874016924899934661010487-----89999998567 Q ss_pred CC-EEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCC----------------------EEECHHHEE Q ss_conf 20-0112432854433133100133333353000000257788867342----------------------330202002 Q gi|254780365|r 160 AN-ILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISN----------------------FFINGRYIL 216 (299) Q Consensus 160 ~~-vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Sn----------------------l~~~GiYi~ 216 (299) |+ +|++.+|+.+++|+|||+.+|... +|++|+|||..|. |. ||++|+||| T Consensus 147 Ad~Tia~~~Vp~~eAs~fGvm~vD~~g-----~i~~F~EKP~~P~--~~~~~~~~~s~~~~~~~~~~~~~yLASMGiYiF 219 (421) T TIGR02091 147 ADLTIACIPVPRKEASRFGVMQVDEDG-----RIVDFEEKPANPP--SIPGDPDKSSDLGLSKLDAQKGPYLASMGIYIF 219 (421) T ss_pred CCEEEEEEECCHHHCCCCCEEEECCCC-----CEEEEECCCCCCC--CCCCCCCCHHHHCCCCCCCCCCCCEEECCEEEE T ss_conf 987998542785551528728887899-----6888882688886--677776774440455145334774475551364 Q ss_pred CHHHH-HHHCCCCCCC-CCCCCCHHHHHHHHHHCC-CEEEEEEE----------EEEEECCCHHHHHHHHHHHHHCC--- Q ss_conf 72331-1000122666-767500378999998319-84899981----------37864399889999999986318--- Q gi|254780365|r 217 HPDIF-SILNDWKENE-GKGEIQLTDSMRKLSERH-DFLAYHFK----------GHTYDCGSKKGFVLANIAFALAR--- 280 (299) Q Consensus 217 ~~~i~-~~L~~~~~~~-~~gE~~ltD~i~~l~~~~-~v~a~~~~----------g~w~DiG~~~~yl~A~~~~al~~--- 280 (299) +.+++ ++|++..... ....|. -|+||.++++| +|+||.|+ |||-||||.++|.+|||++..+. T Consensus 220 ~~~~L~~~L~~d~~~~~s~~DFG-kdiIP~~~~~g~~~~Ay~F~~s~v~g~~~e~YWrDVGTidsfweANmdL~~~~~Pq 298 (421) T TIGR02091 220 DKDVLKELLEEDAKDPESSHDFG-KDIIPKLLEEGKSVQAYDFSQSCVPGEEKEGYWRDVGTIDSFWEANMDLVSEVKPQ 298 (421) T ss_pred CHHHHHHHHHHHHCCCCCCCCCC-HHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCEEEECHHHHHHHHHHHHCCCCCCC T ss_conf 57899999999753876777544-35688985089718998046760568788876223004065667518762788888 Q ss_pred -HHH Q ss_conf -346 Q gi|254780365|r 281 -QDI 283 (299) Q Consensus 281 -~~~ 283 (299) ++| T Consensus 299 vP~f 302 (421) T TIGR02091 299 VPPF 302 (421) T ss_pred CCCC T ss_conf 8877 No 19 >TIGR01105 galF regulatory protein GalF; InterPro: IPR005774 This family of proteins, GalF, represents a non-catalytic subunit of the UTP-glucose pyrophosphorylase modulating the enzyme activity to increase the formation of UDP-glucose . ; GO: 0030234 enzyme regulator activity, 0006011 UDP-glucose metabolic process. Probab=100.00 E-value=0 Score=354.74 Aligned_cols=290 Identities=34% Similarity=0.624 Sum_probs=267.7 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHH Q ss_conf 88545899956888833432440888333568836279999999997899889998135544410100117454445654 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRK 83 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~ 83 (299) |.++ ||||+.||.|..|.|.|+.+||.|||+.+||+|+|++|+.+.+||+||++||+..|+.+++|||++|+++..++. T Consensus 1 m~nl-kaviPvaGlGmhmlPatkaiPkemlP~~dkPmiqyivdeivaaGikei~lvth~sknavenhfdtsyele~lleq 79 (297) T TIGR01105 1 MANL-KAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQ 79 (297) T ss_pred CCCC-CEECCCCCCHHHHHHHHHCCCHHHCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHCCCHHHHHHHHHH T ss_conf 9754-101221120123200111021121221143278899999886030103234201034454430203778999988 Q ss_pred CCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCC Q ss_conf 187000011344112485799843334233466542110112233331112786444544332---21122212234322 Q gi|254780365|r 84 RNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGEN---CMANMIKLYEKEGA 160 (299) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~---~l~~li~~~~~~~~ 160 (299) +.|...+.+++..+++++.|.-++|.+|+|+||+|+||++.+||+||+|+++|.+++....++ .|..|+..|+++|. T Consensus 80 rvkrqllaevq~iCPPGv~imnvrq~~PlGlGhsilCarP~~GdnPf~vvlPd~~~d~~~adPlrynlaam~arfnetGr 159 (297) T TIGR01105 80 RVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFVVVLPDIIIDDATADPLRYNLAAMIARFNETGR 159 (297) T ss_pred HHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHEECCCCCCCCCEEEEECCEEECCCCCCHHHHHHHHHHHHHHCCCH T ss_conf 88899988765228997179863047765444011011553268875898052356177766167889999987520330 Q ss_pred CEEEEEECCHHHCCCCCCEECCCCC--CCCCCEEEEECCCCCCC-CCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCC Q ss_conf 0011243285443313310013333--33530000002577888-67342330202002723311000122666767500 Q gi|254780365|r 161 NILAVSECDPQLSCKYGMVQVGKAI--DHQVFHISDMIEKPDSS-TFISNFFINGRYILHPDIFSILNDWKENEGKGEIQ 237 (299) Q Consensus 161 ~vi~~~~v~~~~~~~yGvi~~~~~~--~~~~~~I~~~vEKP~~~-~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ 237 (299) +.+...+.+ -+.|.|.|++..+.. +++.-||.+|+|||+.+ +..|++...|.|+++.++|..|++..|+. +|.+| T Consensus 160 sqvlakrm~-GdlseysviqtkePl~~eG~v~rivefiekPd~Pqtldsd~mavGryvlsadiW~elert~PGa-WGriq 237 (297) T TIGR01105 160 SQVLAKRMP-GDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGA-WGRIQ 237 (297) T ss_pred HHHHHHHCC-CCCCHHEEEECCCCHHHCCCEEEEEHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCC-CCCHH T ss_conf 233221036-6520000110234021247412210002088987521346776634654344455543168986-31014 Q ss_pred HHHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 37899999831984899981378643998899999999863183466999999999988 Q gi|254780365|r 238 LTDSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANIAFALARQDIRSDIETDLKTLVS 296 (299) Q Consensus 238 ltD~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~~~~~~~~~~~l~~~~~ 296 (299) |||+|..|+++..|.|..+.|..||||...+|.+|.+.++|++=+-+..|++.++.+++ T Consensus 238 ltdaiaelakkqsvdamlmtG~sydCGkkmGymqafv~yGlrnlkeG~kfr~~i~~ll~ 296 (297) T TIGR01105 238 LTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEKLLS 296 (297) T ss_pred HHHHHHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 77899988743102231011674100124468899998524531102578999999851 No 20 >cd04181 NTP_transferase NTP_transferases catalyze the transfer of nucleotides onto phosphosugars. Nucleotidyltransferases transfer nucleotides onto phosphosugars. The enzyme family includes Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. The products are activated sugars that are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Probab=100.00 E-value=0 Score=344.69 Aligned_cols=216 Identities=29% Similarity=0.485 Sum_probs=188.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEE Q ss_conf 99956888833432440888333568836279999999997899889998135544410100117454445654187000 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAE 89 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~ 89 (299) |||||||+||||+|+|..+||||+||+|||+|+|+++.+..+|+++|+||++|+.++|.+||.... T Consensus 1 AiIlaaG~GtRl~~~t~~~PK~Ll~v~gkplI~~~l~~l~~~g~~~iiiv~~~~~~~i~~~~~~~~-------------- 66 (217) T cd04181 1 AVILAAGKGTRLRPLTDTRPKPLLPIAGKPILEYIIERLARAGIDEIILVVGYLGEQIEEYFGDGS-------------- 66 (217) T ss_pred CEEECCCCCCCCCHHHCCCCCCCCEECCCHHHHHHHHHHHHCCCCEEEEEEECCCCCHHEEEECCC-------------- T ss_conf 989068886657821369984023899944999999999974997799986123310000110012-------------- Q ss_pred ECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEEEC Q ss_conf 011344112485799843334233466542110112233331112786444544332211222122343220-0112432 Q gi|254780365|r 90 LTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAVSEC 168 (299) Q Consensus 90 ~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~~~v 168 (299) ....++.|+.|++|+|||+|+++|++++++++|+|++||+++..+ +.++++.|.+.++. ++++.+. T Consensus 67 --------~~~~~i~~~~e~~~~Gt~~al~~a~~~i~~~~~lv~~~D~i~~~~-----~~~~~~~~~~~~~~~~l~~~~~ 133 (217) T cd04181 67 --------KFGVNIEYVVQEEPLGTAGAVRNAEDFLGDDDFLVVNGDVLTDLD-----LSELLRFHREKGADATIAVKEV 133 (217) T ss_pred --------CCCCEEEEEECCCCCCCHHHHHHHHCCCCCCCEEEECCCCCCCCC-----HHHHHHHHHHCCCCEEEEEEEE T ss_conf --------579659995348877634554321002699978997278222508-----9999999975799679998740 Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 85443313310013333335300000025778886734233020200272331100012266676750037899999831 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSER 248 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~ 248 (299) +++++||++.++... +|.+|+|||..+ +|+++++|+|+|++++|+++++... ++|++++|++++++++ T Consensus 134 --~~~~~yg~v~~d~~~-----~v~~i~EKp~~~--~~~~~~~G~y~~~~~~f~~i~~~~~---~~~~~l~d~~~~l~~~ 201 (217) T cd04181 134 --EDPSRYGVVELDDDG-----RVTRFVEKPTLP--ESNLANAGIYIFEPEILDYIPEILP---RGEDELTDAIPLLIEE 201 (217) T ss_pred --CCCCCCEEEEECCCC-----CEEEEEECCCCC--CCCEEEEEEEEECHHHHHHHHHCCC---CCEEEHHHHHHHHHHC T ss_conf --367763089988851-----043689889998--7884899899999999997651588---8751189999999857 Q ss_pred CCEEEEEEEEEEEECC Q ss_conf 9848999813786439 Q gi|254780365|r 249 HDFLAYHFKGHTYDCG 264 (299) Q Consensus 249 ~~v~a~~~~g~w~DiG 264 (299) ++|.++.++|+|+||| T Consensus 202 ~kv~~~~~~g~W~DiG 217 (217) T cd04181 202 GKVYGYPVDGYWLDIG 217 (217) T ss_pred CCEEEEEECCEEEECC T ss_conf 9969999799999179 No 21 >PRK03282 consensus Probab=100.00 E-value=0 Score=344.02 Aligned_cols=248 Identities=17% Similarity=0.238 Sum_probs=197.3 Q ss_pred CCCCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHH Q ss_conf 97888545899956888833432440888333568836-27999999999789988999813554441010011745444 Q gi|254780365|r 1 MGSLKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQ 79 (299) Q Consensus 1 ~~~m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~ 79 (299) |+.|+++ .|||||||+||||||||..+||||||++|| |+|+|++++|.++||++++++++|+.+++.+||...+.+.. T Consensus 1 ~~~~~~v-~avILagG~GtRLrPLT~~rpKp~vP~~g~yrlIdf~Lsnl~~~Gi~~v~v~~~y~~~sl~~h~g~~w~~~~ 79 (406) T PRK03282 1 MRERPKV-LGIVLAGGEGKRLMPLTADRAKPAVPFGGAYRLIDFVLSNLVNAGYLRICVLTQYKSHSLDRHISQTWRMSG 79 (406) T ss_pred CCCCCCE-EEEEECCCCCCCCCCCCCCCCHHHCEECCCCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCC T ss_conf 9978837-999978998772330227983540065896645799999888769988999817777999999862137556 Q ss_pred HHHHCCCCEEECCCCCCCCCCCEEEECCCC------CCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 565418700001134411248579984333------4233466542110112233---3311127864445443322112 Q gi|254780365|r 80 SLRKRNKKAELTLLAESIPSIGNAVFTWQY------ERKGLGHAVWCARNIIGDN---PFALLLPDMIMSPLEGENCMAN 150 (299) Q Consensus 80 ~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~------~p~Gtg~Ai~~a~~~i~de---~flv~~gD~i~~~~~~~~~l~~ 150 (299) . .+..+..+.++ ..+|||||++.+.+++.++ .|+++.||+++..+ +++ T Consensus 80 ~------------------~g~~v~~~~~~~~~g~~~~~Gta~ai~~~~~~l~~~~~~~vvv~~gD~v~~~D-----~~~ 136 (406) T PRK03282 80 L------------------LGNYITPVPAQQRLGPRWYTGSADAIYQSLNLIYDEDPDYVVVFGADHVYRMD-----PEQ 136 (406) T ss_pred C------------------CCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEEECC-----HHH T ss_conf 5------------------66379756521236742145778999999999862699879997897998079-----999 Q ss_pred CCCCCCCCCCCE-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCC-----CCCCCEEECHHHEECHHH-HHH Q ss_conf 221223432200-1124328544331331001333333530000002577888-----673423302020027233-110 Q gi|254780365|r 151 MIKLYEKEGANI-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSS-----TFISNFFINGRYILHPDI-FSI 223 (299) Q Consensus 151 li~~~~~~~~~v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~-----~~~Snl~~~GiYi~~~~i-~~~ 223 (299) |+++|.++++.+ +++.+++.+++++||++.++.+ ..+|++|+|||..+ .+.|+++++|+|+|++++ ++. T Consensus 137 ~l~~H~~~~a~~Ti~~~~v~~~d~s~~Gvv~~d~d----~~~V~~f~EKP~~~~~~~~~~~~~l~n~GiYvf~~~~l~~~ 212 (406) T PRK03282 137 MVQQHIESGAGVTVAGIRVPRSEATAFGVIDADDD----SGRIREFLEKPADPPGLPDDPDETFASMGNYVFTTKALIDA 212 (406) T ss_pred HHHHHHHCCCCEEEEEEECCHHHHHHCCEEEECCC----CCEEEEEEECCCCCCCCCCCCCCEECCCEEEEECHHHHHHH T ss_conf 99999976998599999827677623781888289----99799999557876578898010020210699849999999 Q ss_pred HCCCCCCCCCCCCCH-HHHHHHHHHCCCEEEEEEE------------EEEEECCCHHHHHHHHHHHH Q ss_conf 001226667675003-7899999831984899981------------37864399889999999986 Q gi|254780365|r 224 LNDWKENEGKGEIQL-TDSMRKLSERHDFLAYHFK------------GHTYDCGSKKGFVLANIAFA 277 (299) Q Consensus 224 L~~~~~~~~~gE~~l-tD~i~~l~~~~~v~a~~~~------------g~w~DiG~~~~yl~A~~~~a 277 (299) |+...... .+++.+ .|+++.|++++.+.+|.+. |+|.|||||++|++||+++. T Consensus 213 l~~~~~~~-~~~~d~~~di~p~l~~~g~~~~y~~~~~~v~g~~~~~~GYW~dIgt~~~y~~an~dll 278 (406) T PRK03282 213 LRADAEDE-DSDHDMGGDIIPRFVARGEAAVYDFSDNEVPGATDRDRGYWRDVGTLDAFYDAHMDLI 278 (406) T ss_pred HHHHCCCC-CCCCCCHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCEEEECCCHHHHHHHHHHHH T ss_conf 98650177-7644226778999997098279985035355754455517986798999999889862 No 22 >cd06426 NTP_transferase_like_2 NTP_trnasferase_like_2 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=100.00 E-value=0 Score=342.12 Aligned_cols=218 Identities=21% Similarity=0.364 Sum_probs=184.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEE Q ss_conf 99956888833432440888333568836279999999997899889998135544410100117454445654187000 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAE 89 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~ 89 (299) .||||||+||||+|+|+.+||||+||+|||+|+|+++.+.++|+++|+|++||++++|++||... T Consensus 1 viILAaG~GtRl~plT~~~PK~Ll~i~gkpli~~~i~~l~~~gi~~i~i~~~y~~e~i~~~~~~~--------------- 65 (220) T cd06426 1 VVIMAGGKGTRLRPLTENTPKPMLKVGGKPILETIIDRFIAQGFRNFYISVNYLAEMIEDYFGDG--------------- 65 (220) T ss_pred CEEECCCCCCCCCHHHCCCCCCCEEECCCHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHCCC--------------- T ss_conf 98937757557884446998103299995399999999998399689995014611323343156--------------- Q ss_pred ECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEEEC Q ss_conf 0113441124857998433342334665421101122333311127864445443322112221223432200-112432 Q gi|254780365|r 90 LTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI-LAVSEC 168 (299) Q Consensus 90 ~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v-i~~~~v 168 (299) ...+.++.|+.|++|+|||+|+.+|++++ +++|+|++||+++..+ +++|++.|.+.++.+ +++.+. T Consensus 66 -------~~~~~~i~~i~e~~~lGt~gal~~~~~~~-~~~~iv~~gD~i~~~~-----~~~~~~~~~~~~~~~~~~~~~~ 132 (220) T cd06426 66 -------SKFGVNISYVREDKPLGTAGALSLLPEKP-TDPFLVMNGDILTNLN-----YEHLLDFHKENNADATVCVREY 132 (220) T ss_pred -------CCCCCEEEEEECCCCCCHHHHHHHHHHCC-CCCEEEECCCEECCCC-----HHHHHHHHHHCCCCEEEEEEEC T ss_conf -------65784388721466555789998667618-9978997777304759-----9999999997499669999862 Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 85443313310013333335300000025778886734233020200272331100012266676750037899999831 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSER 248 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~ 248 (299) + ...+||++..+. .+|.+++|||+. ++++++|+|+|++++|+.+++ .+++++||+++.++++ T Consensus 133 ~--~~~~~gv~~~~~------~~v~~~~ekp~~----~~~~~~Giy~~~~~~~~~i~~------~~~~~~~d~~~~li~~ 194 (220) T cd06426 133 E--VQVPYGVVETEG------GRITSIEEKPTH----SFLVNAGIYVLEPEVLDLIPK------NEFFDMPDLIEKLIKE 194 (220) T ss_pred C--CCCCCCEEEECC------CCEEEEEECCCC----CCCCCEEEEEECHHHHHHCCC------CCCCCCCHHHHHHHHC T ss_conf 5--687776599348------837677766888----652331699989999974689------9944545899999987 Q ss_pred C-CEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 9-848999813786439988999999 Q gi|254780365|r 249 H-DFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 249 ~-~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) + ++.++.++|+|+|||||++|.+|| T Consensus 195 g~~v~~~~~~~~W~DiGt~e~l~~AN 220 (220) T cd06426 195 GKKVGVFPIHEYWLDIGRPEDYEKAN 220 (220) T ss_pred CCCEEEEEECCEEEECCCHHHHHHHC T ss_conf 99889997499999698989999749 No 23 >cd02540 GT2_GlmU_N_bac N-terminal domain of bacterial GlmU. The N-terminal domain of N-Acetylglucosamine-1-phosphate uridyltransferase (GlmU). GlmU is an essential bacterial enzyme with both an acetyltransferase and an uridyltransferase activity which have been mapped to the C-terminal and N-terminal domains, respectively. This family represents the N-terminal uridyltransferase. GlmU performs the last two steps in the synthesis of UDP-N-acetylglucosamine (UDP-GlcNAc), which is an essential precursor in both the peptidoglycan and the lipopolysaccharide metabolic pathways in Gram-positive and Gram-negative bacteria, respectively. Probab=100.00 E-value=0 Score=345.33 Aligned_cols=219 Identities=22% Similarity=0.297 Sum_probs=182.8 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEE Q ss_conf 99956888833432440888333568836279999999997899889998135544410100117454445654187000 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAE 89 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~ 89 (299) |||||||+||||+| .+||||+||+|||+|+|+++.+..+|+++|+||+||.+++|++||.. T Consensus 1 AvIlAaG~GtRl~~---~~PKpllpi~~kpii~~ii~~l~~~gi~~i~iv~~~~~e~i~~~~~~---------------- 61 (229) T cd02540 1 AVILAAGKGTRMKS---DLPKVLHPLAGKPMLEHVLDAARALGPDRIVVVVGHGAEQVKKALAN---------------- 61 (229) T ss_pred CEEECCCCCCCCCC---CCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCC---------------- T ss_conf 98946878757998---97841518998799999999999769975996357689999987543---------------- Q ss_pred ECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEE Q ss_conf 0113441124857998433342334665421101122--33331112786444544332211222122343220-01124 Q gi|254780365|r 90 LTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG--DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAVS 166 (299) Q Consensus 90 ~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~--de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~~ 166 (299) .++.|+.|++|+||||||.+|+++++ +++|+|++||+.+.. ...+++|++.|.+.++. ++++. T Consensus 62 -----------~~i~~v~Q~~~lGta~Av~~a~~~i~~~~~~~lVl~gD~pli~---~~~l~~l~~~~~~~~~~~ti~~~ 127 (229) T cd02540 62 -----------PNVEFVLQEEQLGTGHAVKQALPALKDFEGDVLVLYGDVPLIT---PETLQRLLEAHREAGADVTVLTA 127 (229) T ss_pred -----------CCEEEEEECCCCCCHHHHHHHHHHCCCCCCEEEEEECCCCCCC---HHHHHHHHHHHHHCCCCEEEEEE T ss_conf -----------8726998068897378999998744588872999958964536---78899999999965996699998 Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCC--CCCCCCCEEECHHHEECHH-HHHHHCCCCCCCCCCCCCHHHHHH Q ss_conf 3285443313310013333335300000025778--8867342330202002723-311000122666767500378999 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPD--SSTFISNFFINGRYILHPD-IFSILNDWKENEGKGEIQLTDSMR 243 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~--~~~~~Snl~~~GiYi~~~~-i~~~L~~~~~~~~~gE~~ltD~i~ 243 (299) ++ +++++||++..+.. ++|.++||||. +...+|+++++|+|+|+++ +|++|++++++..+|||++||+++ T Consensus 128 ~~--~~p~~YG~v~~d~~-----g~v~~ivEkk~~~~~~~~~~~~n~GiYif~~~~l~~~l~~l~~~~~~gE~~ltD~i~ 200 (229) T cd02540 128 EL--EDPTGYGRIIRDGN-----GKVLRIVEEKDATEEEKAIREVNAGIYAFDAEFLFEALPKLTNNNAQGEYYLTDIIA 200 (229) T ss_pred EE--CCCCCCCEEEECCC-----CCEEEEEECCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHCCCCCCCCEEEHHHHHH T ss_conf 60--48767867998899-----978999988787654345432222799985999999998479876688788999999 Q ss_pred HHHHCC-CEEEEEEEEEEEECC--CHHH Q ss_conf 998319-848999813786439--9889 Q gi|254780365|r 244 KLSERH-DFLAYHFKGHTYDCG--SKKG 268 (299) Q Consensus 244 ~l~~~~-~v~a~~~~g~w~DiG--~~~~ 268 (299) .|+++| +|.+|.++++|-|+| |+++ T Consensus 201 ~~i~~g~kv~~~~~~~~~~~~Gin~~~d 228 (229) T cd02540 201 LAVADGLKVAAVLADDEEEVLGVNDRVQ 228 (229) T ss_pred HHHHCCCCEEEEEECCHHHEECCCCHHH T ss_conf 9998899189999288766137889787 No 24 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=335.79 Aligned_cols=245 Identities=20% Similarity=0.288 Sum_probs=195.0 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHH Q ss_conf 888545899956888833432440888333568836-2799999999978998899981355444101001174544456 Q gi|254780365|r 3 SLKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSL 81 (299) Q Consensus 3 ~m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l 81 (299) .|+++ +|||||||+||||||||..+||||||++|| |+|+|++++|.++||++++++++|+.+++.+|+...+.+.. T Consensus 2 ~m~~~-~aVILAgG~GtRLrPLT~~rpKpllPvagkyplIdf~Ls~l~~aGi~~v~v~~~y~~~sl~~hi~~~w~~~~-- 78 (409) T PRK00844 2 EMPKV-LGIVLAGGEGKRLMPLTADRAKPAVPFGGSYRLIDFVLSNLVNAGLLRICVLTQYKSHSLDRHISQGWRLSG-- 78 (409) T ss_pred CCCCE-EEEEECCCCCCCCCCCCCCCCHHCCEECCCCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCC-- T ss_conf 97876-999988998774430006980440065883607899999998769988999907787999999862726567-- Q ss_pred HHCCCCEEECCCCCCCCCCCEEEECCCCC------CCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 54187000011344112485799843334------23346654211011223---3331112786444544332211222 Q gi|254780365|r 82 RKRNKKAELTLLAESIPSIGNAVFTWQYE------RKGLGHAVWCARNIIGD---NPFALLLPDMIMSPLEGENCMANMI 152 (299) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~------p~Gtg~Ai~~a~~~i~d---e~flv~~gD~i~~~~~~~~~l~~li 152 (299) . .+.-+.++.+++ .+|||||++++++++.+ +.|+|+.||+++..+ +++|+ T Consensus 79 ----~------------~~~~i~~~~~~~~~~~~~~~Gtadai~~~~~~i~~~~~d~vlv~~gD~i~~~d-----l~~~l 137 (409) T PRK00844 79 ----L------------LGEYITPVPAQQRLGKRWYEGSADAIYQSLNLIEDEDPDYVVVFGADHVYRMD-----PEQMV 137 (409) T ss_pred ----C------------CCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCC-----HHHHH T ss_conf ----6------------77358756721246765455778999999988741799889997899897079-----99999 Q ss_pred CCCCCCCCCE-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCC-----CCCCEEECHHHEECHHHH-HHHC Q ss_conf 1223432200-11243285443313310013333335300000025778886-----734233020200272331-1000 Q gi|254780365|r 153 KLYEKEGANI-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSST-----FISNFFINGRYILHPDIF-SILN 225 (299) Q Consensus 153 ~~~~~~~~~v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~-----~~Snl~~~GiYi~~~~i~-~~L~ 225 (299) ++|.++++.+ ++..+++.+++++||++.++.. .+|++|+|||+.+. ..++++++|+|+|++++| +.|. T Consensus 138 ~~H~~~~a~~Ti~~~~v~~~~~~~~Gvv~~d~~-----g~V~~f~EKP~~~~~~~~~~~~~l~n~GiYvf~~~~L~~~l~ 212 (409) T PRK00844 138 EFHIESGAGLTVAGIRVPREEASAFGVIEVDED-----GRIRGFLEKPADPPGLPDDPDETLASMGNYIFTTDVLIDALR 212 (409) T ss_pred HHHHHCCCCCEEEEEEECHHHCCCCCEEEECCC-----CCEEEEEECCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHH T ss_conf 999756886347899926687453678998899-----969999815788767789712011201359951999999998 Q ss_pred CCCCCCCCCCCCH-HHHHHHHHHCCCEEEEEE------------EEEEEECCCHHHHHHHHHHHH Q ss_conf 1226667675003-789999983198489998------------137864399889999999986 Q gi|254780365|r 226 DWKENEGKGEIQL-TDSMRKLSERHDFLAYHF------------KGHTYDCGSKKGFVLANIAFA 277 (299) Q Consensus 226 ~~~~~~~~gE~~l-tD~i~~l~~~~~v~a~~~------------~g~w~DiG~~~~yl~A~~~~a 277 (299) ...... .+...+ +|+++.|++++++++|.+ +|+|.|+|++++|++||+++. T Consensus 213 ~~~~~~-~~~~d~~~diip~l~~~g~~~~y~~~~~~v~g~~~~~~GYW~digt~~~y~~an~dlL 276 (409) T PRK00844 213 EDAEDE-DSSHDMGGDIIPKLVPRGEAAVYDFSDNEVPGATERDRGYWRDVGTIDSFYDAHMDLL 276 (409) T ss_pred HHCCCC-CCCCCCHHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHH T ss_conf 750167-7633206778999986388689984145445643456505897899899999889986 No 25 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=331.33 Aligned_cols=243 Identities=19% Similarity=0.315 Sum_probs=192.6 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHH Q ss_conf 88545899956888833432440888333568836-27999999999789988999813554441010011745444565 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLR 82 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~ 82 (299) |+++ -|||||||+||||+|||..+||||+|++|| ++|++.+.+|+++||++|.++++|+.+++.+|....+.+.. T Consensus 1 m~~v-~avILaGG~GtRL~PLT~~rpKp~vPf~GrYRlIDf~Lsn~~nsgI~~V~v~tqy~~~sl~~Hl~~~w~~~~--- 76 (429) T PRK02862 1 MKRV-LAIILGGGAGTRLYPLTKLRAKPAVPLAGKYRLIDIPISNCINSGINKIYVLTQFNSASLNRHISRTYNFSG--- 76 (429) T ss_pred CCCE-EEEEECCCCCCCCCHHHCCCCHHHEEECCCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCCCC--- T ss_conf 9843-999966888860425445886773258882420899999898869988999946588999998624215766--- Q ss_pred HCCCCEEECCCCCCCCCCCEEEECCC--CCC---CCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 41870000113441124857998433--342---3346654211011223---333111278644454433221122212 Q gi|254780365|r 83 KRNKKAELTLLAESIPSIGNAVFTWQ--YER---KGLGHAVWCARNIIGD---NPFALLLPDMIMSPLEGENCMANMIKL 154 (299) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~i~~v~Q--~~p---~Gtg~Ai~~a~~~i~d---e~flv~~gD~i~~~~~~~~~l~~li~~ 154 (299) +.... ..+-...| +.| +|||+|+..+..++.+ +.|+|+.||+++..+ +++|+++ T Consensus 77 ---~~~g~----------v~~l~~~~~~~~~~~~~Gtadai~~~~~~l~~~~~~~~lV~~GD~l~~~D-----~~~~l~~ 138 (429) T PRK02862 77 ---FSQGF----------VEVLAAQQTPDNPSWFQGTADAVRQYLWLFQEWDVDEYLILSGDQLYRMD-----YSLFVQH 138 (429) T ss_pred ---CCCCE----------EEECCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCC-----HHHHHHH T ss_conf ---67860----------68604355147876556648999999999973799739997488778269-----9999999 Q ss_pred CCCCCCCE-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCC-------------------CCCCEEECHHH Q ss_conf 23432200-11243285443313310013333335300000025778886-------------------73423302020 Q gi|254780365|r 155 YEKEGANI-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSST-------------------FISNFFINGRY 214 (299) Q Consensus 155 ~~~~~~~v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~-------------------~~Snl~~~GiY 214 (299) |.++++.+ +++.+++.+++++||++.++.+ ++|++|+|||+.+. ..++++++|+| T Consensus 139 H~~~~AdiTi~~~~v~~~~~~~fGvv~~D~~-----grV~~f~EKP~~~~~~~~~~~~~~~~~~~~~~~~~~~lasmGiY 213 (429) T PRK02862 139 HRETGADITLAVLPVDEKQASGFGLMKTDDD-----GRITEFSEKPKGEELKAMAVDTSRLGLSPEEAKEKPYLASMGIY 213 (429) T ss_pred HHHCCCCEEEEEEECCHHHCCCCCEEEECCC-----CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEE T ss_conf 9974998279987558576553628999999-----97999995788754442114652235673223567640122599 Q ss_pred EECHHHH-HHHCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHHHHH Q ss_conf 0272331-10001226667675003789999983198489998137864399889999999986 Q gi|254780365|r 215 ILHPDIF-SILNDWKENEGKGEIQLTDSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANIAFA 277 (299) Q Consensus 215 i~~~~i~-~~L~~~~~~~~~gE~~ltD~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~a 277 (299) +|++++| ++|++... ..+|. .|+|+.++++.++++|.++|+|.||||+++|++||+++. T Consensus 214 if~~~~L~~ll~~~~~---~~Dfg-~diip~~~~~~~v~~y~~~GYw~dIgti~sy~~AnmdLl 273 (429) T PRK02862 214 VFSRDVLFDLLNKNPE---HTDFG-KEIIPEAARGYNLQAYLFDDYWEDIGTIEAFYEANLALT 273 (429) T ss_pred EECHHHHHHHHHHCCC---CCHHH-HHHHHHHHCCCEEEEEEECCEEEECCCHHHHHHHHHHHH T ss_conf 9859999999975865---42238-889999847783899981472687899899999989986 No 26 >PRK04928 consensus Probab=100.00 E-value=0 Score=330.80 Aligned_cols=252 Identities=15% Similarity=0.235 Sum_probs=190.9 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHH Q ss_conf 88545899956888833432440888333568836-27999999999789988999813554441010011745444565 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLR 82 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~ 82 (299) |.++ -|||||||+||||||||..+||||+|++|+ |+|+|++++|+++||++|+++++++.+++.+||...+++... T Consensus 1 m~~~-~aiILagG~gtRLrPLT~~rpKp~lP~~g~yrlIdf~Ls~l~~sGi~~V~v~~~~~~~sl~~hl~~gw~l~~~-- 77 (405) T PRK04928 1 MAGV-LGMILAGGEGSRLRPLTESRTKPAVPFGGSYRLIDFALNNFVNADLMRIYVLTQFKSQSLYIHMKKGWNINGI-- 77 (405) T ss_pred CCCE-EEEEECCCCCCCCCHHHCCCCHHHEEECCCCCEEHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCC-- T ss_conf 9965-9999789987733232159856523878857131899999998799889999088879999997466376641-- Q ss_pred HCCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 41870000113441124857998433342334665421101122---333311127864445443322112221223432 Q gi|254780365|r 83 KRNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG---DNPFALLLPDMIMSPLEGENCMANMIKLYEKEG 159 (299) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~---de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~ 159 (299) ....+. ..+... -.-++...|||||++.+.+|+. ++.|+|+.||+++..+ +++|+++|++++ T Consensus 78 ---~~~~~~----~~p~~~---~~~~~~~~Gta~ai~~~~~~l~~~~~e~~lv~~gD~l~~~D-----l~~ll~~H~~~~ 142 (405) T PRK04928 78 ---TDRFID----PIPAQM---RTGKRWYEGTADAIYQNLRFIELSEPEHVCIFGSDHIYKMD-----IKQMLDFHKEKE 142 (405) T ss_pred ---CCCEEE----ECCHHH---CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCC-----HHHHHHHHHHCC T ss_conf ---351787----362241---25764456568999999999854699889997898896689-----999999998568 Q ss_pred CCE-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCC-----CCCEEECHHHEECHHHH-HHH-CCCCCCC Q ss_conf 200-112432854433133100133333353000000257788867-----34233020200272331-100-0122666 Q gi|254780365|r 160 ANI-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTF-----ISNFFINGRYILHPDIF-SIL-NDWKENE 231 (299) Q Consensus 160 ~~v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~-----~Snl~~~GiYi~~~~i~-~~L-~~~~~~~ 231 (299) +.+ +++.+++.+++++||++.+++.. +|++|+|||+.+.. .++++++|+|+|++++| ..| +...... T Consensus 143 a~iTi~~~~~~~e~~~~~Gvv~~D~~g-----~V~~f~EKP~~~~~~~g~~~~~li~~GiYvf~~~vl~~~l~~~~~~~~ 217 (405) T PRK04928 143 AALTVSALRMPLEEASQFGVIEVDAEG-----RMIGFEEKPANPKSIPGDPDHALVSMGNYIFEAEVLFKELIEDAANEN 217 (405) T ss_pred CCCEEEEEECCHHCCCCCCEEEECCCC-----CEEEEEECCCCCCCCCCCCCCEEEECCEEEECHHHHHHHHHHHHCCCC T ss_conf 873168998241125537768888999-----899999768987566798331175032267519999999986501478 Q ss_pred CCCCCCHHHHHHHHHHCCCEEEEEE----------EEEEEECCCHHHHHHHHHHHHHCC Q ss_conf 7675003789999983198489998----------137864399889999999986318 Q gi|254780365|r 232 GKGEIQLTDSMRKLSERHDFLAYHF----------KGHTYDCGSKKGFVLANIAFALAR 280 (299) Q Consensus 232 ~~gE~~ltD~i~~l~~~~~v~a~~~----------~g~w~DiG~~~~yl~A~~~~al~~ 280 (299) ...+| .+|+++.|++++++++|.+ +|+|.|+|+|++|++||+++ |++ T Consensus 218 ~~~d~-~~dvip~l~~~g~~~~y~~~~~~i~g~~~~gYW~digt~~~y~~an~dl-L~~ 274 (405) T PRK04928 218 SSHDF-GKDIIPKMFPRGKVFVYDFSTNRIPGEKEEVYWRDVGTIDSYWQAHMDL-LKK 274 (405) T ss_pred CCCCC-HHHHHHHHHHHCCEEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHHHH-HCC T ss_conf 76741-5788999987288689980244335756650589789878999999998-668 No 27 >KOG1322 consensus Probab=100.00 E-value=0 Score=333.73 Aligned_cols=226 Identities=24% Similarity=0.396 Sum_probs=195.4 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||+||.||.||||||||+.+|||++|++++|||+|.+++|+++||++|++.++|+.+++..|+.+.++ T Consensus 11 kaiILvGG~GTRLrPLT~t~pKPlVpfgn~pmI~hqieal~nsGi~~I~la~~y~s~sl~~~~~k~y~------------ 78 (371) T KOG1322 11 KAIILVGGYGTRLRPLTLTRPKPLVPFGNKPMILHQIEALINSGITKIVLATQYNSESLNRHLSKAYG------------ 78 (371) T ss_pred EEEEEECCCCCEEECEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHH------------ T ss_conf 58999547774460001368886654276305277899987579747999971382899999999761------------ Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEE Q ss_conf 00113441124857998433342334665421101122---33331112786444544332211222122343220-011 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG---DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILA 164 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~---de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~ 164 (299) ...++++.+..|.+|+||++.+..|++++- +.||+|+++|+++..+ +++|+++|..+++. +++ T Consensus 79 --------~~lgVei~~s~eteplgtaGpl~laR~~L~~~~~~~ffVLnsDvi~~~p-----~~~~vqfH~~~gae~TI~ 145 (371) T KOG1322 79 --------KELGVEILASTETEPLGTAGPLALARDFLWVFEDAPFFVLNSDVICRMP-----YKEMVQFHRAHGAEITIV 145 (371) T ss_pred --------HCCCEEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCEEECCC-----HHHHHHHHHHCCCCEEEE T ss_conf --------2455599998206887654669999988640379867995577264377-----999999998559965899 Q ss_pred EEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 24328544331331001333333530000002577888673423302020027233110001226667675003789999 Q gi|254780365|r 165 VSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRK 244 (299) Q Consensus 165 ~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~ 244 (299) +.+++ ++|+||++..+.. .++|.+|+|||+ ..-||-+++|+|+|++++++.|...+.+-++ .+++. T Consensus 146 ~t~vd--epSkyGvv~~d~~----~grV~~F~EKPk--d~vsnkinaGiYi~~~~vL~ri~~~ptSiek------EifP~ 211 (371) T KOG1322 146 VTKVD--EPSKYGVVVIDED----TGRVIRFVEKPK--DLVSNKINAGIYILNPEVLDRILLRPTSIEK------EIFPA 211 (371) T ss_pred EEECC--CCCCCCEEEEECC----CCCEEEEHHCCH--HHHHCCCCCEEEEECHHHHHHHHHCCCCHHH------HHHHH T ss_conf 87446--8532636999358----882558021756--5653312415899779998676506663011------11155 Q ss_pred HHHCCCEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 98319848999813786439988999999 Q gi|254780365|r 245 LSERHDFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 245 l~~~~~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) +++.++++||.++|+|+|||+|.+|++|. T Consensus 212 ~a~~~~l~a~~l~gfWmDIGqpkdf~~g~ 240 (371) T KOG1322 212 MAEEHQLYAFDLPGFWMDIGQPKDFLTGF 240 (371) T ss_pred HHHCCCEEEEECCCHHHHCCCHHHHHHHH T ss_conf 64117258996276132068889998889 No 28 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=320.77 Aligned_cols=253 Identities=17% Similarity=0.244 Sum_probs=201.6 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEE-HHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHH Q ss_conf 8885458999568888334324408883335688362-799999999978998899981355444101001174544456 Q gi|254780365|r 3 SLKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRP-VIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSL 81 (299) Q Consensus 3 ~m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kp-iI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l 81 (299) -|++=.=|||||||+||||+|||+.+|||++|++||. +|||.|.+|++|||++|.++++|+.+++.||+..++.+.. T Consensus 15 ~~~~~~~aviLAGG~GtRL~pLT~~RpkpavPfgGryRlIDF~LSn~vnsGI~~V~V~t~y~~~SL~dHlg~GW~~~~-- 92 (431) T PRK00725 15 QLPRKSVALILAGGRGSRLKELTDKRAKPAVYFGGKFRIIDFALSNCLNSGIRRIGVITQYQAHSLVRHIQRGWSFFR-- 92 (431) T ss_pred CCCCCEEEEEECCCCCCCCHHHHCCCCCCCCEECCCCEEEHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCCCC-- T ss_conf 156767999963888775515542784003153782354023676688759998999927787999998753424444-- Q ss_pred HHCCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 541870000113441124857998433342334665421101122---33331112786444544332211222122343 Q gi|254780365|r 82 RKRNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG---DNPFALLLPDMIMSPLEGENCMANMIKLYEKE 158 (299) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~---de~flv~~gD~i~~~~~~~~~l~~li~~~~~~ 158 (299) ......+.. ++ ....+..|..++||++|+..+++|+. .+.|+|+.||+++..+ +++|+++|.++ T Consensus 93 --~~~g~~~~~----lp--~~~~~~~~~~~~Gtadav~~n~~~l~~~~~eyvvI~~gD~I~~~D-----~~~~l~~H~~~ 159 (431) T PRK00725 93 --EERNEFVDL----LP--AQQRVSEENWYRGTADAVYQNLDIIRRYKAEYVVILAGDHIYKMD-----YSRMLIDHVEK 159 (431) T ss_pred --CCCCCCEEE----CC--CCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCC-----HHHHHHHHHHC T ss_conf --468960797----77--611258876546858999999999975699879994598898179-----99999999877 Q ss_pred CCCE-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCC-----CCCCCCEEECHHHEECHH-HHHHHCCCCCCC Q ss_conf 2200-112432854433133100133333353000000257788-----867342330202002723-311000122666 Q gi|254780365|r 159 GANI-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDS-----STFISNFFINGRYILHPD-IFSILNDWKENE 231 (299) Q Consensus 159 ~~~v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~-----~~~~Snl~~~GiYi~~~~-i~~~L~~~~~~~ 231 (299) ++.+ +++.+++.+++++|||+.++.+. +|++|+|||.. ....|+++++|+|+|+++ ++++|++..... T Consensus 160 gAdiTi~~~~v~~~~~~~fGv~~~D~~g-----rV~~f~EKP~~p~~~~g~~~~~l~smGiYvf~~~~L~~~L~~~~~~~ 234 (431) T PRK00725 160 GADCTVGCLEVPREEASAFGVMAVDEND-----KIIAFVEKPADPPAMPGDPDKSLASMGIYVFDADYLYELLERDAADP 234 (431) T ss_pred CCCEEEEEEECCHHHCCCCCEEEECCCC-----CEEEEEECCCCCCCCCCCCCCCEECCEEEEECHHHHHHHHHHHHCCC T ss_conf 9987999998487885547749999999-----89999933788434689755330001589985999999998652177 Q ss_pred -CCCCCCHHHHHHHHHHCCCEEEEEEE-----------EEEEECCCHHHHHHHHHHH Q ss_conf -76750037899999831984899981-----------3786439988999999998 Q gi|254780365|r 232 -GKGEIQLTDSMRKLSERHDFLAYHFK-----------GHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 232 -~~gE~~ltD~i~~l~~~~~v~a~~~~-----------g~w~DiG~~~~yl~A~~~~ 276 (299) ...+|. .|+++.+++++++++|.++ |+|.|+||+++|++||+++ T Consensus 235 ~~~~Dfg-~diip~ll~~~~~~ay~f~~~~~~~~~~~~GYw~dIgt~~sY~~AnmdL 290 (431) T PRK00725 235 NSSHDFG-KDIIPKIVEEGKAYAHPFPLSCVRSDPEAEPYWRDVGTLDAYWQANLDL 290 (431) T ss_pred CCCCCCH-HHHHHHHHHHCCEEEEECCCCCCCCCCCCCCEEEECCCHHHHHHHHHHH T ss_conf 6421026-7789999974987898625333456656664178879999999998777 No 29 >PRK01884 consensus Probab=100.00 E-value=0 Score=317.99 Aligned_cols=250 Identities=18% Similarity=0.245 Sum_probs=195.3 Q ss_pred CCEEEEECCCCCCCCCHHHCCCCCCEEEECC-EEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCC Q ss_conf 4589995688883343244088833356883-627999999999789988999813554441010011745444565418 Q gi|254780365|r 7 VRKAVFPIAGLGMRFFPISKVIPKEMLAIVD-RPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRN 85 (299) Q Consensus 7 i~KavIlAaG~GtRl~P~T~~~pKpllpi~~-kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~ 85 (299) =+=|||||||+||||+|||..+|||++|++| +|+|||++.+|.+|||++|.++++|+.+++.||+..++.+. ... T Consensus 17 ~~~avILAGG~GtRL~pLT~~RpkpavPfgGryRiIDF~LSNlvnsGI~~V~V~t~y~~~SL~dHl~~Gw~~~----~~~ 92 (435) T PRK01884 17 DTLVLILAGGRGSRLHELTDKRAKPALYFGGNRRIIDFALSNCINSGLNRIGVVTQYAAHSLLRHLQTGWSFL----PQE 92 (435) T ss_pred HEEEEEECCCCCCCCHHHHCCCCHHCCCCCCCCEEEEHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCC----CCC T ss_conf 0089996588876151655387151162488871210168678876998899981768799999986163655----557 Q ss_pred CCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 700001134411248579984333423346654211011223----3331112786444544332211222122343220 Q gi|254780365|r 86 KKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGD----NPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN 161 (299) Q Consensus 86 ~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~d----e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~ 161 (299) ....+..+ +... ....+...+|||+|++.+.+++.+ +.|+|+.||+++..+ +++|+++|.++++. T Consensus 93 ~g~~i~~l----p~~~--~~~~~~~~~Gtadai~~n~~~i~~~~~~~yVlVl~gD~I~~~D-----~~~~l~~H~~~~Ad 161 (435) T PRK01884 93 RGEFIDML----PARQ--QIDDSTWYRGTADAVYQNMAIIKNHYRPKYILILAGDHIYKQD-----YSQMLLDHVNSGAK 161 (435) T ss_pred CCCEEEEC----CCCC--CCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCEEEECC-----HHHHHHHHHHCCCC T ss_conf 99527867----8742--2588751037399999889999724688879995599898078-----99999999975998 Q ss_pred E-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCC-----CCCCCEEECHHHEECHHH-HHHHCCCCCCC-CC Q ss_conf 0-1124328544331331001333333530000002577888-----673423302020027233-11000122666-76 Q gi|254780365|r 162 I-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSS-----TFISNFFINGRYILHPDI-FSILNDWKENE-GK 233 (299) Q Consensus 162 v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~-----~~~Snl~~~GiYi~~~~i-~~~L~~~~~~~-~~ 233 (299) + +++.+++.+++++|||+.++... +|+.|+|||+.+ ...|+++++|+|+|++++ ++.|++..... .. T Consensus 162 iTva~~~v~~e~~~~~Gvv~~d~~~-----rV~~f~EKP~~~~~~~g~~~~~lismGiYvf~~~~L~e~L~~~~~~~~~~ 236 (435) T PRK01884 162 CTVGCIEVPREEAHEFGVMAVNENL-----KVKAFVEKPKDPPAMVGKPDSSLASMGIYVFDADYLYKMLEREVNTPQTS 236 (435) T ss_pred EEEEEEECCHHHHHHCCEEEECCCC-----CEEEEEECCCCCCCCCCCCCCCEECCEEEEECHHHHHHHHHHHHCCCCCH T ss_conf 5899998687883007749989999-----78899956888110379866422023259973999999999763177751 Q ss_pred CCCCHHHHHHHHHHCCCEEEEEEEE-----------EEEECCCHHHHHHHHHHHH Q ss_conf 7500378999998319848999813-----------7864399889999999986 Q gi|254780365|r 234 GEIQLTDSMRKLSERHDFLAYHFKG-----------HTYDCGSKKGFVLANIAFA 277 (299) Q Consensus 234 gE~~ltD~i~~l~~~~~v~a~~~~g-----------~w~DiG~~~~yl~A~~~~a 277 (299) .+| ..|+|+.+++++.++||++++ ||.||||+++|++||+++. T Consensus 237 ~Df-g~diip~~l~~g~v~ay~f~~~~~G~~~~~~~Yw~dIgt~~sY~~AnmdLL 290 (435) T PRK01884 237 HDF-GKDILPKALEEGVLYAHPFSRSCMGRNTEGEIYWRDVGTLDSFWQSNIDLV 290 (435) T ss_pred HHH-HHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHH T ss_conf 257-898999998519389997367644544567723545899899999899986 No 30 >PRK03701 consensus Probab=100.00 E-value=0 Score=313.92 Aligned_cols=251 Identities=18% Similarity=0.238 Sum_probs=198.0 Q ss_pred CCCEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHC Q ss_conf 545899956888833432440888333568836-2799999999978998899981355444101001174544456541 Q gi|254780365|r 6 KVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKR 84 (299) Q Consensus 6 ~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~ 84 (299) |=+=|||||||+||||+|||..+|||+||++|| |+|||.+.+|++|||++|.++++++.+++.||+..++.+ ... T Consensus 18 ~~~~aviLAGG~gtRL~pLT~~RpkpaVPf~GrYRlIDF~LSn~vnsGI~~V~V~tq~~~~SL~dHlg~Gw~~----~~~ 93 (431) T PRK03701 18 LKSVALILAGGRGTRLKDLTNKRAKPAVHFGGKFRIIDFALSNCLNSGIRRIGVITQYQSHTLVQHIQRGWSF----FNE 93 (431) T ss_pred CCEEEEEECCCCCCCCCHHHCCCCHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCH----HCC T ss_conf 1269999438987644055438844211007852347999999998699989999278889999998636210----022 Q ss_pred CCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 870000113441124857998433342334665421101122---33331112786444544332211222122343220 Q gi|254780365|r 85 NKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG---DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN 161 (299) Q Consensus 85 ~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~---de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~ 161 (299) ..+..+..+ +. ...+..|+.++||++|+....+|+. .+.|+++.||+++..+ +++|+++|.++++. T Consensus 94 ~~~~~~~~l----p~--~~~~~~~~~~~Gtadai~~n~~~l~~~~~eyVvI~~gD~I~~~D-----~~~~l~~H~~~gAd 162 (431) T PRK03701 94 EMNEFVDLL----PA--QQRMKGENWYRGTADAVTQNLDIIRRYKAEYVVILAGDHIYKQD-----YSRMLIDHVEKGAR 162 (431) T ss_pred CCCCEEEEC----CC--CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCC-----HHHHHHHHHHCCCC T ss_conf 578549977----77--41047886435748999999999974699859996598887589-----99999999975998 Q ss_pred E-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCC-----CCCCCEEECHHHEECHHH-HHHHCCCCCCC-CC Q ss_conf 0-1124328544331331001333333530000002577888-----673423302020027233-11000122666-76 Q gi|254780365|r 162 I-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSS-----TFISNFFINGRYILHPDI-FSILNDWKENE-GK 233 (299) Q Consensus 162 v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~-----~~~Snl~~~GiYi~~~~i-~~~L~~~~~~~-~~ 233 (299) + +++.+++.+++++|||+.++... +|++|+|||..+ ...++++++|+|+|++++ +++|++..... +. T Consensus 163 iTv~~~~v~~e~~~~~GVl~~D~~g-----rV~~f~EKP~~p~~~~~~~~~~~~smgiYVf~~~~L~~ll~~~~~~~~~~ 237 (431) T PRK03701 163 CTVACMPVPIEEASAFGVMAVDEND-----KIIEFVEKPANPPSMPGDPSKSLASMGIYVFDADYLYELLEEDDRDENSS 237 (431) T ss_pred CEEEEEECCHHHCCCCCEEEECCCC-----CEEEEEECCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHHCCCCCCC T ss_conf 2699987487883206859988999-----79999955888655689855332100589975999999986330156676 Q ss_pred CCCCHHHHHHHHHHCCCEEEEEEE-----------EEEEECCCHHHHHHHHHHHH Q ss_conf 750037899999831984899981-----------37864399889999999986 Q gi|254780365|r 234 GEIQLTDSMRKLSERHDFLAYHFK-----------GHTYDCGSKKGFVLANIAFA 277 (299) Q Consensus 234 gE~~ltD~i~~l~~~~~v~a~~~~-----------g~w~DiG~~~~yl~A~~~~a 277 (299) .+| ..|+++.+++++.+++|.++ |||.||||+++|.+||+++. T Consensus 238 ~Df-~~diip~l~~~~~v~~y~f~~~~v~~~~~~~gYw~dIgti~sY~~anmdlL 291 (431) T PRK03701 238 HDF-GKDIIPKITEAGLAYAHPFPLSCVQSDPDAEPYWRDVGTLEAYWKANLDLA 291 (431) T ss_pred CCC-HHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCEEECCCHHHHHHHHHHHH T ss_conf 543-788999998609689985044444467667772145799999999899873 No 31 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=100.00 E-value=0 Score=307.14 Aligned_cols=217 Identities=21% Similarity=0.297 Sum_probs=171.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) +|||||||+||||++ .+||||+||+|||||+|+++.+.++|+++|++|++|.++++++||.. T Consensus 7 ~aVILAAGkGTRM~s---~~PKvL~pi~gkPml~hvi~~l~~~g~~~ivvVvg~~~e~i~~~~~~--------------- 68 (456) T PRK09451 7 SVVILAAGKGTRMYS---DLPKVLHTLAGKPMVQHVIDAANELGAAHVHLVYGHGGDLLKQTLKD--------------- 68 (456) T ss_pred EEEEECCCCCCCCCC---CCCHHHHEECCHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC--------------- T ss_conf 399985778777799---97957504489869999999998769980999969987999987445--------------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 001134411248579984333423346654211011223-3331112786-44454433221122212234322001124 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGD-NPFALLLPDM-IMSPLEGENCMANMIKLYEKEGANILAVS 166 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~d-e~flv~~gD~-i~~~~~~~~~l~~li~~~~~~~~~vi~~~ 166 (299) .++.|+.|++|+||||||.+|.+++.+ ++|+|++||. ++.. ..+++|++.+.+.+.+++. . T Consensus 69 ------------~~i~~v~Q~eqlGTghAV~~A~~~l~~~~~vLVl~GD~PLi~~----~tl~~l~~~~~~~~~~llt-~ 131 (456) T PRK09451 69 ------------EPLNWVLQAEQLGTGHAMQQAAPFFADDEDILMLYGDVPLISV----ETLQRLRDAKPQGGIGLLT-V 131 (456) T ss_pred ------------CCCEEEEECCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCH----HHHHHHHHHCCCCCEEEEE-E T ss_conf ------------8844999588897299999988860668858999598015699----9999998625138859999-9 Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCC--CCCCCCEEECHHHEECHHHH-HHHCCCCCCCCCCCCCHHHHHH Q ss_conf 32854433133100133333353000000257788--86734233020200272331-1000122666767500378999 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDS--STFISNFFINGRYILHPDIF-SILNDWKENEGKGEIQLTDSMR 243 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~--~~~~Snl~~~GiYi~~~~i~-~~L~~~~~~~~~gE~~ltD~i~ 243 (299) .+ ++++.||++..+ ..++.++|||++. +...++.+|+|+|+|+++.| ++|.++.+...+|||+|||++. T Consensus 132 ~~--~dP~~YGrIi~~------~g~v~~IVE~kda~~~e~~i~eiNaGIy~f~~~~l~~~L~~i~~~n~~gEyyLTDii~ 203 (456) T PRK09451 132 KL--DNPTGYGRITRE------NGKVVGIVEHKDATDEQRQIQEINTGILVANGADLKRWLAKLTNNNAQGEYYITDIIA 203 (456) T ss_pred EC--CCCCCCEEEEEC------CCCEEEEEECCCCCHHHHCCCEECCEEEEECHHHHHHHHHHCCCCCCCCCEEECHHHH T ss_conf 76--896546489943------8957999982679845521112202489957999999887506755456256305678 Q ss_pred HHHHCC-CEEEEEEEEEEEECCCHHH Q ss_conf 998319-8489998137864399889 Q gi|254780365|r 244 KLSERH-DFLAYHFKGHTYDCGSKKG 268 (299) Q Consensus 244 ~l~~~~-~v~a~~~~g~w~DiG~~~~ 268 (299) .+.+++ ++.++.....|--.|-... T Consensus 204 ~~~~~g~~v~~v~~~~~~e~~GvN~~ 229 (456) T PRK09451 204 LAYQEGREIVAVHPTRLSEVEGVNNR 229 (456) T ss_pred HHHHCCCEEEEEECCCHHHHCCCCCH T ss_conf 88746957999982887994467989 No 32 >cd02523 PC_cytidylyltransferase Phosphocholine cytidylyltransferases catalyze the synthesis of CDP-choline. This family contains proteins similar to prokaryotic phosphocholine (P-cho) cytidylyltransferases. Phosphocholine (PC) cytidylyltransferases catalyze the transfer of a cytidine monophosphate from CTP to phosphocholine to form CDP-choline. PC is the most abundant phospholipid in eukaryotic membranes and it is also important in prokaryotic membranes. For pathogenic prokaryotes, the cell surface PC facilitates the interaction with host surface and induces attachment and invasion. In addition cell wall PC serves as scaffold for a group of choline-binding proteins that are secreted from the cells. Phosphocholine (PC) cytidylyltransferase is a key enzyme in the prokaryotic choline metabolism pathway. It has been hypothesized to consist of a choline transport system, a choline kinase, CTP:phosphocholine cytidylyltransferase, and a choline phosphotransferase that transfers P-Cho from CDP Probab=100.00 E-value=1.4e-45 Score=301.88 Aligned_cols=221 Identities=21% Similarity=0.263 Sum_probs=166.8 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEE Q ss_conf 99956888833432440888333568836279999999997899889998135544410100117454445654187000 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAE 89 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~ 89 (299) |||||||+||||+|+|+.+||||+||+|||+|+|+++.+.++||++|+||+||++++|++||.... T Consensus 1 AiILAaG~GtRl~plT~~~PK~L~~i~gkpli~~~i~~l~~~gi~~i~iv~gy~~e~i~~~~~~~~-------------- 66 (229) T cd02523 1 AIILAAGRGSRLRPLTEDRPKCLLEINGKPLLERQIETLKEAGIDDIVIVTGYKKEQIEELLKKYP-------------- 66 (229) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHCC-------------- T ss_conf 989368776568864579880321799879999999999984998536655548999999985313-------------- Q ss_pred ECCCCCCCCCCCEEEECCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CEEEEE Q ss_conf 01134411248579984333--423346654211011223333111278644454433221122212234322-001124 Q gi|254780365|r 90 LTLLAESIPSIGNAVFTWQY--ERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGA-NILAVS 166 (299) Q Consensus 90 ~~~~~~~~~~~~~i~~v~Q~--~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~-~vi~~~ 166 (299) ++.++.|+ ++.|||+|+++|+++++ ++|+|++||+++... .++. +.++.+ .++++. T Consensus 67 ------------~i~~i~n~~~~~~g~~~sl~~a~~~~~-~~~lv~~gD~i~~~~----~~~~----~~~~~~~~~~~~~ 125 (229) T cd02523 67 ------------NIKFVYNPDYAETNNIYSLYLARDFLD-EDFLLLEGDVVFDPS----ILER----LLSSPADNAILVD 125 (229) T ss_pred ------------CCCCEEEEECCCCCHHHHHHHHHHHCC-CCEEEEECCEECCHH----HHHH----HHHCCCCCEEEEE T ss_conf ------------654034000236782999998887558-985999577654667----9999----9847788879999 Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHH----HCCCCCCCCCCCCCHHHHH Q ss_conf 328544331331001333333530000002577888673423302020027233110----0012266676750037899 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSI----LNDWKENEGKGEIQLTDSM 242 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~----L~~~~~~~~~gE~~ltD~i 242 (299) ........+|++...+. ..+..++|||..+.. .+..++|+|+|++++++. +++..+.. ++++++||++ T Consensus 126 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~-~~~~~~Gi~~~~~~~~~~l~~~l~~~~~~~-~~~~~~~~~i 197 (229) T cd02523 126 KKTKEWEDEYVKDLDDA------GVLLGIISKAKNLEE-IQGEYVGISKFSPEDADRLAEALEELIEAG-RVNLYYEDAL 197 (229) T ss_pred ECCCCCCCCEEEEECCC------CCEEEEEECCCCCCC-CCCEEEEEEEECHHHHHHHHHHHHHHCCCC-CCCCCHHHHH T ss_conf 56457787447994268------708999868999887-462899999998999999999999728677-7872399999 Q ss_pred HHHHHCCCEEEEEE-EEEEEECCCHHHHHHHH Q ss_conf 99983198489998-13786439988999999 Q gi|254780365|r 243 RKLSERHDFLAYHF-KGHTYDCGSKKGFVLAN 273 (299) Q Consensus 243 ~~l~~~~~v~a~~~-~g~w~DiG~~~~yl~A~ 273 (299) +.|++++++.++.+ +|.|+|||||++|.+|+ T Consensus 198 ~~li~~~~~~~~~~~~~~W~dIgt~edl~~Ae 229 (229) T cd02523 198 QRLISEEGVKVKDISDGFWYEIDDLEDLERAE 229 (229) T ss_pred HHHHHCCCCEEEECCCCEEEECCCHHHHHHHC T ss_conf 99980799669984893199696999999649 No 33 >cd04183 GT2_BcE_like GT2_BcbE_like is likely involved in the biosynthesis of the polysaccharide capsule. GT2_BcbE_like: The bcbE gene is one of the genes in the capsule biosynthetic locus of Pasteurella multocida. Its deducted product is likely involved in the biosynthesis of the polysaccharide capsule, which is found on surface of a wide range of bacteria. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=100.00 E-value=5.6e-44 Score=290.57 Aligned_cols=221 Identities=18% Similarity=0.301 Sum_probs=163.6 Q ss_pred EEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEE Q ss_conf 99568888334324408883335688362799999999978998899981355444101001174544456541870000 Q gi|254780365|r 11 VFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAEL 90 (299) Q Consensus 11 vIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~ 90 (299) ||||||+||||+|+|..+||||+||+|||||+|+++.+..+|++++++|++.. .+.++ ++...+.. T Consensus 2 VIlaaG~GtRl~~~t~~~PK~Ll~v~gkp~i~~~i~~l~~~~~~~~i~v~~~~--~~~~~-----~~~~~~~~------- 67 (231) T cd04183 2 IIPMAGLGSRFKKAGYTYPKPLIEVDGKPMIEWVIESLAKIFDSRFIFICRDE--HNTKF-----HLDESLKL------- 67 (231) T ss_pred EEECCCCCCCCCCCCCCCCCEEEEECCEEHHHHHHHHHHHCCCCEEEEECCHH--HHHHH-----HHHHHHCC------- T ss_conf 89867887364756799887501799957899999999864992468614546--66678-----88777515------- Q ss_pred CCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEEEC Q ss_conf 113441124857998433342334665421101122-333311127864445443322112221223432200-112432 Q gi|254780365|r 91 TLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG-DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI-LAVSEC 168 (299) Q Consensus 91 ~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~-de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v-i~~~~v 168 (299) .....++ +..|++++|||||+++|.+++. |++|+|++||+++..+ +.++++.|.+.++.. +.+... T Consensus 68 ------~~~~~~i-~~~~~~~~G~a~av~~a~~~i~~~~~~lv~~gD~~~~~~-----~~~~~~~~~~~~~~~~v~~~~~ 135 (231) T cd04183 68 ------LAPNATV-VELDGETLGAACTVLLAADLIDNDDPLLIFNCDQIVESD-----LLAFLAAFRERDLDGGVLTFFS 135 (231) T ss_pred ------CCCCCEE-EEECCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCC-----HHHHHHHHHHCCCCEEEEEEEE T ss_conf ------5897189-995699996799999999864899809997076113638-----8999999985699879999991 Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECH--HHHHHHCCCCC--CCCCCCCCHHHHHHH Q ss_conf 85443313310013333335300000025778886734233020200272--33110001226--667675003789999 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHP--DIFSILNDWKE--NEGKGEIQLTDSMRK 244 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~--~i~~~L~~~~~--~~~~gE~~ltD~i~~ 244 (299) +. .+||.+..++. .+|.+++|||. +|+++++|+|+|++ .+++++++... ...+|||++||+++. T Consensus 136 ~~---~~~g~v~~~~~-----~~v~~i~EK~~----~s~~~~~GiY~F~~~~~f~~~~~~~~~~~~~~~gE~yltd~i~~ 203 (231) T cd04183 136 SH---PRWSYVKLDEN-----GRVIETAEKEP----ISDLATAGLYYFKSGSLFVEAAKKMIRKDDSVNGEFYISPLYNE 203 (231) T ss_pred CC---CCCCCEEECCC-----CCEEECCCCCC----CCHHHCCEEEEECCHHHHHHHHHHHHHCCCCCCCEEEHHHHHHH T ss_conf 25---77763578479-----84765023588----75002014699746799999999986448655999985899999 Q ss_pred HHHCC-CEEEEEEE-EEEEECCCHHHH Q ss_conf 98319-84899981-378643998899 Q gi|254780365|r 245 LSERH-DFLAYHFK-GHTYDCGSKKGF 269 (299) Q Consensus 245 l~~~~-~v~a~~~~-g~w~DiG~~~~y 269 (299) |+++| +|.++.++ ++|+|+|||++| T Consensus 204 li~~g~kv~~~~i~~~~~~d~GtPedl 230 (231) T cd04183 204 LILDGKKVGIYLIDKDDYHSFGTPEDL 230 (231) T ss_pred HHHCCCEEEEEEECCCEEEECCCHHHC T ss_conf 998889189999789969989696984 No 34 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=100.00 E-value=1.7e-44 Score=293.84 Aligned_cols=253 Identities=21% Similarity=0.332 Sum_probs=200.4 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCC Q ss_conf 5899956888833432440888333568836-279999999997899889998135544410100117454445654187 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNK 86 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~ 86 (299) .=|+|||||.|+||.|||+.++||-+|++|| -|||+.+.+|++|||.+|.++|+|+..++.+|.+.++.+ ....+ T Consensus 6 ~laiILaGg~G~rL~~LT~~RakpAVpFgGkYRiIDF~LSN~vNSGi~~I~VltQy~~~SL~~Hi~~G~~w----~l~~~ 81 (393) T COG0448 6 VLAIILAGGRGSRLSPLTKDRAKPAVPFGGKYRIIDFALSNCVNSGIRRIGVLTQYKSHSLNDHIGRGWPW----DLDRK 81 (393) T ss_pred EEEEEECCCCCCCCCHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCEEEEEECCCHHHHHHHHHCCCCC----CCCCC T ss_conf 69999758987755224547503444427646898677000010388769999435536799986289865----66555 Q ss_pred CEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE- Q ss_conf 00001134411248579984333423346654211011223---33311127864445443322112221223432200- Q gi|254780365|r 87 KAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGD---NPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI- 162 (299) Q Consensus 87 ~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~d---e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v- 162 (299) +..+..+..........+| .|||+|+++-..++.+ |.++++.||||+..+ .++|++.|.+.++.+ T Consensus 82 ~~~v~ilp~~~~~~~~~wy------~Gtadai~Qnl~~i~~~~~eyvlIlsgDhIYkmD-----y~~ml~~H~~~gadiT 150 (393) T COG0448 82 NGGVFILPAQQREGGERWY------EGTADAIYQNLLIIRRSDPEYVLILSGDHIYKMD-----YSDMLDFHIESGADVT 150 (393) T ss_pred CCCEEEECCHHCCCCCCCE------ECCHHHHHHHHHHHHHCCCCEEEEECCCEEEECC-----HHHHHHHHHHCCCCEE T ss_conf 5847995711125787312------0447999976899974398889995287789448-----9999999998199879 Q ss_pred EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCC-EEECHHHEECHHHH-HHHCCCCCC-CCCCCCCHH Q ss_conf 112432854433133100133333353000000257788867342-33020200272331-100012266-676750037 Q gi|254780365|r 163 LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISN-FFINGRYILHPDIF-SILNDWKEN-EGKGEIQLT 239 (299) Q Consensus 163 i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Sn-l~~~GiYi~~~~i~-~~L~~~~~~-~~~gE~~lt 239 (299) +++++|+.+++++||++.++... +|++|+|||.. ..+|| ++.+|+|||+.+.+ ++|++.... .+..+| .- T Consensus 151 v~~~~Vp~~eas~fGim~~D~~~-----~i~~F~eKp~~-~~~~~~laSMgiYIf~~~~L~~~L~~~~~~~~~~~Df-gk 223 (393) T COG0448 151 VAVKEVPREEASRFGVMNVDENG-----RIIEFVEKPAD-GPPSNSLASMGIYIFNTDLLKELLEEDAKDPNSSHDF-GK 223 (393) T ss_pred EEEEECCHHHHHHCCCEEECCCC-----CEEEEEECCCC-CCCCCCEEEEEEEEECHHHHHHHHHHHHCCCCCCCCH-HH T ss_conf 99998796760024815887999-----78873526676-8866531355669982999999999874166773310-67 Q ss_pred HHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCHH Q ss_conf 8999998319848999813786439988999999998631834 Q gi|254780365|r 240 DSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANIAFALARQD 282 (299) Q Consensus 240 D~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~~~ 282 (299) |+|+.+.++++++||+++|||.||||.++|.+||+++.-..++ T Consensus 224 diIp~~~~~~~v~AY~f~gYw~dVgTi~syy~aNmdLl~~~~~ 266 (393) T COG0448 224 DIIPKLLERGKVYAYEFSGYWRDVGTIDSYYEANMDLLSPQPE 266 (393) T ss_pred HHHHHHHHCCCEEEEECCCHHHHCCCHHHHHHHHHHHCCCCCC T ss_conf 7889887508777874256665266399999843876289975 No 35 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=100.00 E-value=1.8e-42 Score=281.28 Aligned_cols=235 Identities=21% Similarity=0.297 Sum_probs=181.1 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) .+||||||+||||+ +..||-|+||+|||||+|+|+.+.+-+.++|++|+||.++++++.|... T Consensus 2 ~~vILAAGkGTRMk---S~lPKVLH~laGkpMl~hVi~~A~~L~~~~i~vV~GH~~~~V~~~l~~~-------------- 64 (461) T TIGR01173 2 SVVILAAGKGTRMK---SKLPKVLHPLAGKPMLEHVIDAARKLSPEKIHVVLGHGAEQVRKALAEE-------------- 64 (461) T ss_pred EEEEEECCCCCCCC---CCCCCCCCHHHCCCHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHCC-------------- T ss_conf 38998068883111---3868601201032278999999853783415999816869999985037-------------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCC-CCCCE Q ss_conf 0011344112485799843334233-4665421101122---33331112786444544332211-22212234-32200 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKG-LGHAVWCARNIIG---DNPFALLLPDMIMSPLEGENCMA-NMIKLYEK-EGANI 162 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~G-tg~Ai~~a~~~i~---de~flv~~gD~i~~~~~~~~~l~-~li~~~~~-~~~~v 162 (299) ....+.|+.|.+|+| |||||.||.+|+. ++..||+|||. ++....+|+ +|++.|.+ .++.+ T Consensus 65 ----------~d~t~~~v~Q~~qlGGTGHAv~~a~~~l~~~~~~~vLvLyGDv---PLi~~eTL~m~Ll~~~~~~~~~~~ 131 (461) T TIGR01173 65 ----------NDKTVNWVLQAEQLGGTGHAVLQALPFLSEDDDGRVLVLYGDV---PLISAETLEMRLLESHRQLNGAKV 131 (461) T ss_pred ----------CCCEEEEEEECCCCCCHHHHHHHHCCCCCCCCCCEEEEEECCC---CCCCHHHHHHHHHHHHHCCCCCCE T ss_conf ----------9967999996889787289998711678886686089995887---877567787798886300010451 Q ss_pred -EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCC--CEEECHHHEECHH-HHHHHCCCCCCCCCCCCCH Q ss_conf -11243285443313310013333335300000025778886734--2330202002723-3110001226667675003 Q gi|254780365|r 163 -LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFIS--NFFINGRYILHPD-IFSILNDWKENEGKGEIQL 238 (299) Q Consensus 163 -i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~S--nl~~~GiYi~~~~-i~~~L~~~~~~~~~gE~~l 238 (299) +++.. .++|+.||-+-.++ .++|.++||--+...+.- +=+|+|+|+|+.+ +..+|.++.....+||||| T Consensus 132 tlLt~~--l~DP~GYGRI~r~~-----~g~V~~IVE~KDA~~eqk~I~eiNtG~y~~~~~~L~~~L~~l~n~NaqgEYYL 204 (461) T TIGR01173 132 TLLTAK--LEDPTGYGRIIREN-----DGAVQAIVEEKDANEEQKAIKEINTGVYVFDGAALKRWLPKLSNNNAQGEYYL 204 (461) T ss_pred EEEEEE--ECCCCCCCEEEECC-----CCCEEEEECCCCCCHHHHCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCCCH T ss_conf 799987--18889543589848-----99589997335988698035278887999832899988876287704443147 Q ss_pred HHHHHHHHHCC-CEEEEEEEEEEEECCCHHHHHHHHHHHHHCC Q ss_conf 78999998319-8489998137864399889999999986318 Q gi|254780365|r 239 TDSMRKLSERH-DFLAYHFKGHTYDCGSKKGFVLANIAFALAR 280 (299) Q Consensus 239 tD~i~~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~ 280 (299) ||+|..+.+++ .|.++.+...--=-|-=..+-.|++|=.+++ T Consensus 205 TD~ia~a~~~g~~v~~~~~~d~~E~~GvNdR~qLa~lE~~~q~ 247 (461) T TIGR01173 205 TDVIALAVADGEEVRAVQVDDSEEVLGVNDRLQLAQLERILQR 247 (461) T ss_pred HHHHHHHHCCCCEEEEEEECCHHHHCCCCCHHHHHHHHHHHHH T ss_conf 8999998508947899980875983366798899999999999 No 36 >cd02508 ADP_Glucose_PP ADP-glucose pyrophosphorylase is involved in the biosynthesis of glycogen or starch. ADP-glucose pyrophosphorylase (glucose-1-phosphate adenylyltransferase) catalyzes a very important step in the biosynthesis of alpha 1,4-glucans (glycogen or starch) in bacteria and plants: synthesis of the activated glucosyl donor, ADP-glucose, from glucose-1-phosphate and ATP. ADP-glucose pyrophosphorylase is a tetrameric allosterically regulated enzyme. While a homotetramer in bacteria, in plant chloroplasts and amyloplasts, it is a heterotetramer of two different, yet evolutionary related, subunits. There are a number of conserved regions in the sequence of bacterial and plant ADP-glucose pyrophosphorylase subunits. It is a subfamily of a very diverse glycosy transferase family 2. Probab=100.00 E-value=1.8e-40 Score=268.95 Aligned_cols=195 Identities=20% Similarity=0.253 Sum_probs=152.1 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 99956888833432440888333568836-27999999999789988999813554441010011745444565418700 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) |||||||+||||+|+|+.+||||+||+|| |+|+|++++++++||++|+++++|+.++|.+||..+.++.. ..+.. T Consensus 1 AvIlagG~GtRL~pLT~~~PKp~lpi~gk~~iId~~l~~l~~~Gi~~i~i~~~y~~~~i~~hl~~g~~~~~----~~~~~ 76 (200) T cd02508 1 AIILAGGEGTRLSPLTKKRAKPAVPFGGRYRLIDFPLSNMVNSGIRNVGVLTQYKSRSLNDHLGSGKEWDL----DRKNG 76 (200) T ss_pred CEEECCCCCCCCHHHHCCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCC----CCCCC T ss_conf 98957888863625447997413088991620799999887679767998345358999999856401257----65668 Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 001134411248579984333423346654211011223---33311127864445443322112221223432200112 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGD---NPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAV 165 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~d---e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~ 165 (299) .+... + ...+..+..++|||+|+..+.+++.+ +.|+|++||+++..+ +++|+++|.++++.+..+ T Consensus 77 gi~i~----~---~~~~~~~~~~~Gt~~~i~~~~~~l~~~~~~~~lv~ngD~l~~~D-----~~~~l~~H~~~~a~~Ti~ 144 (200) T cd02508 77 GLFIL----P---PQQRKGGDWYRGTADAIYQNLDYIERSDPEYVLILSGDHIYNMD-----YREMLDFHIESGADITVV 144 (200) T ss_pred CEEEE----E---CCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCEEEECC-----HHHHHHHHHHCCCCEEEE T ss_conf 58996----3---13336876553478999999998721788729997699787069-----999999998769998999 Q ss_pred EECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHH-HHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 43285443313310013333335300000025778886734233020200272331-10001226667675003789999 Q gi|254780365|r 166 SECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIF-SILNDWKENEGKGEIQLTDSMRK 244 (299) Q Consensus 166 ~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~-~~L~~~~~~~~~gE~~ltD~i~~ 244 (299) . .+++|+|+|++++| +++++..... . ..-++|+++. T Consensus 145 ~-----------------------------------------~in~GiYi~~~~~l~~~l~~~~~~~-~-~d~~~d~ip~ 181 (200) T cd02508 145 Y-----------------------------------------KASMGIYIFSKDLLIELLEEDAADG-S-HDFGKDIIPA 181 (200) T ss_pred E-----------------------------------------EEECEEEEEEHHHHHHHHHHHCCCC-C-CCCHHHHHHH T ss_conf 9-----------------------------------------8504899980899999998632467-6-7558999999 Q ss_pred HHHCCCEEEEEEEEEEEEC Q ss_conf 9831984899981378643 Q gi|254780365|r 245 LSERHDFLAYHFKGHTYDC 263 (299) Q Consensus 245 l~~~~~v~a~~~~g~w~Di 263 (299) ++++++|+||+++|+|+|| T Consensus 182 l~~~~~v~~y~~~gyw~DI 200 (200) T cd02508 182 MLKKLKIYAYEFNGYWADI 200 (200) T ss_pred HHHCCCEEEEECCCEEECC T ss_conf 9603988999668857494 No 37 >TIGR02623 G1P_cyt_trans glucose-1-phosphate cytidylyltransferase; InterPro: IPR013446 Glucose-1-phosphate cytidylyltransferases, also known as CDP-glucose pyrophosphorylase, are the product of the rfbF gene and produce CDP-D-glucose.. Probab=100.00 E-value=4.8e-40 Score=266.40 Aligned_cols=232 Identities=20% Similarity=0.315 Sum_probs=190.4 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||||||||.|||+-.-|..+||||++|++||||+|++.--...||+|++|+.||+...|++||- ||.+...--...-.. T Consensus 2 KaViLAGGlGTRisEEt~lrPKPM~EiGgkPIlWHI~k~Y~~~Gi~~FiiC~GYkGy~IKeyF~-NY~l~~SDvT~~l~~ 80 (256) T TIGR02623 2 KAVILAGGLGTRISEETHLRPKPMVEIGGKPILWHIMKIYSAHGIKEFIICLGYKGYVIKEYFA-NYFLHMSDVTIDLED 80 (256) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCEEEEECC-CCEEECCEEEEEECC T ss_conf 3799717864432243535897607767963778899999752833027885343322010010-632311316787515 Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CEEEE Q ss_conf 00113441124857998433342334665421101122-3333111278644454433221122212234322--00112 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG-DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGA--NILAV 165 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~-de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~--~vi~~ 165 (299) .-..+...-..+++++.|.-.+.--||+.+..-++|++ |++||+.|||-+-+.+ ++.++.+|++++. ++.|+ T Consensus 81 n~~~~H~~~~EPW~VTLvDTG~~t~TGGRl~Rv~~~~~~d~~Fc~TYGDGv~~~~-----I~~~~~~H~~~g~~AT~tAv 155 (256) T TIGR02623 81 NTIEVHEKRAEPWRVTLVDTGESTQTGGRLKRVREYLEDDEAFCLTYGDGVADID-----IKALIAFHRKHGKKATVTAV 155 (256) T ss_pred CEEEECCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHCCCCCEEEEECCCCCCCC-----HHHHHHHHHHCCCCEEEEEE T ss_conf 7045312138971688885687766783278899984378768987158503325-----89999998862980678860 Q ss_pred EECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHH Q ss_conf 43285443313310013333335300000025778886734233020200272331100012266676750037899999 Q gi|254780365|r 166 SECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKL 245 (299) Q Consensus 166 ~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l 245 (299) +| +.|||.+.+... +|+.|.|||.-.. .++|.|++||+|++++.++.+.....+ +.+..| T Consensus 156 ~P-----PGRfGaL~~~~~------~V~~F~EKP~gd~---g~iNGGFFVL~P~V~d~I~~D~~~wE~------~~L~~L 215 (256) T TIGR02623 156 QP-----PGRFGALELEGE------SVTSFQEKPLGDG---GLINGGFFVLNPSVLDLIDGDATVWES------EPLETL 215 (256) T ss_pred CC-----CCCEEEEEECCC------CEEEECCCCCCCC---EEEECCEEEECCCEEEEECCCCEEECC------HHHHHH T ss_conf 78-----883023677255------1111014748898---168065687483202022488502010------378999 Q ss_pred HHCCCEEEEEEEEEEEECCCH Q ss_conf 831984899981378643998 Q gi|254780365|r 246 SERHDFLAYHFKGHTYDCGSK 266 (299) Q Consensus 246 ~~~~~v~a~~~~g~w~DiG~~ 266 (299) ++++++.||.+.|.|.=.-|. T Consensus 216 ~~~g~L~aY~H~GFW~PMDTL 236 (256) T TIGR02623 216 AQRGELSAYEHSGFWQPMDTL 236 (256) T ss_pred HHCCCCEEEEECCCCCCCHHH T ss_conf 744881378517756852225 No 38 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=7.4e-37 Score=246.76 Aligned_cols=235 Identities=22% Similarity=0.304 Sum_probs=177.6 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 58999568888334324408883335688362799999999978998899981355444101001174544456541870 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) +.+||||||+||||+ +..||.|.||+||||++|+|+.+...+.+++++|+||..+++...+. T Consensus 3 ~~~vILAAGkGTRMk---S~lPKVLH~vaGkpMl~hVi~~a~~l~~~~i~vVvGh~ae~V~~~~~--------------- 64 (460) T COG1207 3 LSAVILAAGKGTRMK---SDLPKVLHPVAGKPMLEHVIDAARALGPDDIVVVVGHGAEQVREALA--------------- 64 (460) T ss_pred CEEEEEECCCCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHC--------------- T ss_conf 169999448875345---79841110016800899999987626866189998687789998756--------------- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCC-CCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-E Q ss_conf 00011344112485799843334233466542110112-233--3311127864445443322112221223432200-1 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNII-GDN--PFALLLPDMIMSPLEGENCMANMIKLYEKEGANI-L 163 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i-~de--~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v-i 163 (299) ...++.|+.|++|+||||||++|++++ .+. ..+|++||.=+ .....|++|++.|...++.+ + T Consensus 65 -----------~~~~v~~v~Q~eqlGTgHAV~~a~~~l~~~~~g~vLVl~GD~PL---it~~TL~~L~~~~~~~~~~~tv 130 (460) T COG1207 65 -----------ERDDVEFVLQEEQLGTGHAVLQALPALADDYDGDVLVLYGDVPL---ITAETLEELLAAHPAHGAAATV 130 (460) T ss_pred -----------CCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCC---CCHHHHHHHHHHHHHCCCCEEE T ss_conf -----------35473399962558738999966676404789738999699666---7899999999863403775599 Q ss_pred EEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCC-C-CCCCCEEECHHHEECHH-HHHHHCCCCCCCCCCCCCHHH Q ss_conf 12432854433133100133333353000000257788-8-67342330202002723-311000122666767500378 Q gi|254780365|r 164 AVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDS-S-TFISNFFINGRYILHPD-IFSILNDWKENEGKGEIQLTD 240 (299) Q Consensus 164 ~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~-~-~~~Snl~~~GiYi~~~~-i~~~L~~~~~~~~~gE~~ltD 240 (299) +.... +++..||-+..+.. ++|..+||--+. + ...=+.+|+|+|+|+.+ .+++|.++..+..+||||||| T Consensus 131 Lt~~~--~dP~GYGRIvr~~~-----g~V~~IVE~KDA~~eek~I~eiNtGiy~f~~~~L~~~L~~l~nnNaqgEYYLTD 203 (460) T COG1207 131 LTAEL--DDPTGYGRIVRDGN-----GEVTAIVEEKDASEEEKQIKEINTGIYAFDGAALLRALPKLSNNNAQGEYYLTD 203 (460) T ss_pred EEEEC--CCCCCCCEEEECCC-----CCEEEEEECCCCCHHHHCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCEEHHH T ss_conf 99873--89987546999399-----869999974789978950838820689986799999998725666557486899 Q ss_pred HHHHHHHCC-CEEEEEEEEEEEECCCHHHHHHHHHHHHHCCH Q ss_conf 999998319-84899981378643998899999999863183 Q gi|254780365|r 241 SMRKLSERH-DFLAYHFKGHTYDCGSKKGFVLANIAFALARQ 281 (299) Q Consensus 241 ~i~~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~~ 281 (299) ++..+..+| +|.++..+..+-=.|--..+--|+++-.++++ T Consensus 204 vI~i~~~~g~~V~a~~~~d~~E~~GVN~R~qLa~~e~~~q~r 245 (460) T COG1207 204 VIAIARNEGEKVRAVHVDDEEEVLGVNDRVQLAEAERIMQRR 245 (460) T ss_pred HHHHHHHCCCEEEEEECCCHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 999998579769999669668862767499999999999999 No 39 >TIGR01207 rmlA glucose-1-phosphate thymidylyltransferase; InterPro: IPR005907 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily (here designated as short form) of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme. The enzyme is active as a homotetramer.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005908 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. ; GO: 0008879 glucose-1-phosphate thymidylyltransferase activity, 0045226 extracellular polysaccharide biosynthetic process. Probab=100.00 E-value=8e-36 Score=240.40 Aligned_cols=232 Identities=24% Similarity=0.390 Sum_probs=192.1 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) |+||||||.||||+|.|..+.|.||||.+||+|.|.+..+..+||++|.+++...... .+ T Consensus 1 ~Gi~laGG~Gtrl~P~t~~~skqllP~ydkPmiyyPl~~lmlaG~r~i~~i~~~~~~~---------~f----------- 60 (286) T TIGR01207 1 KGIILAGGSGTRLYPITLAVSKQLLPVYDKPMIYYPLSVLMLAGIRDILIISTPEDTP---------RF----------- 60 (286) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHEEEEECCCHHH---------HH----------- T ss_conf 9268726887401243454432103101575012318999873022201352110005---------78----------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 00113441124857998433342334665421101122333311127864445443322112221223432200112432 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSEC 168 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v 168 (299) ...+.+....+..+.|..|++|.|.+.|...+.+|+++++.++++||++|........+++. .....|+++.+.. | T Consensus 61 -~~llGdG~~~G~~~~y~~q~~P~Gla~af~ig~~f~~~~~~~l~lGdn~fyG~~~~~~l~~~--~~~~~Ga~vf~y~-v 136 (286) T TIGR01207 61 -KRLLGDGSQFGVELSYAVQPSPDGLAQAFIIGEEFIGDDKSALVLGDNIFYGHDLSELLKRA--AAREEGATVFAYQ-V 136 (286) T ss_pred -HHHHCCCCHHCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHH--HHHHCCCEEEEEE-E T ss_conf -98713520104578871157863258888776887267761799605001021489999998--7530685799987-5 Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 85443313310013333335300000025778886734233020200272331100012266676750037899999831 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSER 248 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~ 248 (299) .++.+|||++++... +..++.|||..|+ |+++++|.|.++.++.+.-+.++++. +||+++||+...+.++ T Consensus 137 --~~P~~yGv~~f~~~~-----~~~~~eekP~~P~--s~y~v~Gly~yd~~~~~~a~~~~~s~-rGelei~~~n~~yl~~ 206 (286) T TIGR01207 137 --KDPERYGVVEFDEEG-----RAISLEEKPAKPK--SNYAVTGLYFYDNRVVEIARELKPSA-RGELEITDLNRVYLEE 206 (286) T ss_pred --CCCCCCCEEEECCCC-----CEEEEECCCCCCC--CCEEEEEHHHHHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHC T ss_conf --076412306663873-----1786413788876--34134011101068999997313245-6640288899999863 Q ss_pred CCEEEEEE-EEE-EEECCCHHHHHHHHH Q ss_conf 98489998-137-864399889999999 Q gi|254780365|r 249 HDFLAYHF-KGH-TYDCGSKKGFVLANI 274 (299) Q Consensus 249 ~~v~a~~~-~g~-w~DiG~~~~yl~A~~ 274 (299) |.+..-.+ .|. |+|.||.+++++|+. T Consensus 207 G~l~v~~~~rG~aWldtGt~~sl~~a~~ 234 (286) T TIGR01207 207 GKLSVELLGRGYAWLDTGTHDSLLEASS 234 (286) T ss_pred CCEEEHHHHCCCCEECCCHHHHHHHHHH T ss_conf 6502014526511000111688999999 No 40 >KOG1460 consensus Probab=100.00 E-value=7.9e-35 Score=234.27 Aligned_cols=237 Identities=21% Similarity=0.317 Sum_probs=185.0 Q ss_pred CCEEEEECCCC--CCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHH-CCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHH Q ss_conf 45899956888--833432440888333568836279999999997-899889998135544410100117454445654 Q gi|254780365|r 7 VRKAVFPIAGL--GMRFFPISKVIPKEMLAIVDRPVIQYVIEEALE-AGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRK 83 (299) Q Consensus 7 i~KavIlAaG~--GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~-~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~ 83 (299) .-|||||.||. |||+||++-.+||||+||+|+|||+|.|+.|.+ +|..+|++|-=|..+...+| ++. T Consensus 2 ~~~AVIlVGGP~kGTRFRPLSf~vPKPLfpiaG~pmI~Hhi~ac~qi~~l~eI~LvGFy~e~~f~~f----------is~ 71 (407) T KOG1460 2 KVKAVILVGGPQKGTRFRPLSFNVPKPLFPIAGVPMIHHHISACKQISGLAEILLVGFYEERVFTDF----------ISA 71 (407) T ss_pred CEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHEEEEECCCCHHHHHH----------HHH T ss_conf 3379999558888851034435899875431885140406898751656023367740450589999----------999 Q ss_pred CCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC-C--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 1870000113441124857998433342334665421101122-3--333111278644454433221122212234322 Q gi|254780365|r 84 RNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG-D--NPFALLLPDMIMSPLEGENCMANMIKLYEKEGA 160 (299) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~-d--e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~ 160 (299) +.+. ....+.|.++.+|+|||++++..++.|- + +.|+|+++|.-++.. |++|.+.|...+. T Consensus 72 ---------~~~e--~~~pvrYL~E~~plGtaGgLyhFrdqIl~g~ps~vFvlnaDVCcsfP-----l~~ml~ahr~~g~ 135 (407) T KOG1460 72 ---------IQQE--FKVPVRYLREDNPLGTAGGLYHFRDQILAGSPSAVFVLNADVCCSFP-----LQDMLEAHRRYGG 135 (407) T ss_pred ---------HHHH--CCCCHHHHCCCCCCCCCCCEEEHHHHHHCCCCCEEEEEECCEECCCC-----HHHHHHHHHHCCC T ss_conf ---------9864--46525543457887766532244457746998628997055015874-----7999999863388 Q ss_pred C-EEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCC----------- Q ss_conf 0-0112432854433133100133333353000000257788867342330202002723311000122----------- Q gi|254780365|r 161 N-ILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWK----------- 228 (299) Q Consensus 161 ~-vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~----------- 228 (299) . ++.+.++.++..++||.+.-+ ....++.+.+|||+ +..|+.+++|+|+|++++|+.+++.- T Consensus 136 ~~tll~tkvs~e~asnfG~lV~d----P~t~evlHYveKPs--TfvSd~InCGvYlF~~eif~~i~~v~~q~~~~~~~~~ 209 (407) T KOG1460 136 IGTLLVTKVSREQASNFGCLVED----PSTGEVLHYVEKPS--TFVSDIINCGVYLFTPEIFNAIAEVYRQRQDLLEVEK 209 (407) T ss_pred CEEEEEEEECHHHHHCCCEEEEC----CCCCCEEEEECCCC--HHHHCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHH T ss_conf 45999997557675105715406----87673578505862--3533350114899468997889999998776554542 Q ss_pred --CCCCCC---CCCH-HHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHHH Q ss_conf --666767---5003-7899999831984899981378643998899999999 Q gi|254780365|r 229 --ENEGKG---EIQL-TDSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANIA 275 (299) Q Consensus 229 --~~~~~g---E~~l-tD~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~ 275 (299) +..+.| =+.| +|++..|+.++++|+|...+.|--|-|+-.-+.||-- T Consensus 210 ~~~~l~~g~~d~irLeqDvlspLag~k~lY~y~t~~fW~QiKtagsal~as~l 262 (407) T KOG1460 210 DLPLLQPGPADFIRLEQDVLSPLAGSKQLYAYETTDFWSQIKTAGSALYASRL 262 (407) T ss_pred CCCCCCCCCCCEEEEECHHHHHHCCCCCEEEEECCCHHHHHCCCCCEEEHHHH T ss_conf 16344877666078430010011277724787061088873466213333266 No 41 >cd04198 eIF-2B_gamma_N The N-terminal domain of gamma subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of gamma subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit gamma shares sequence similarity with epsilon subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=100.00 E-value=9.5e-34 Score=227.63 Aligned_cols=136 Identities=22% Similarity=0.344 Sum_probs=107.7 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCC-CCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 89995688883343244088833356883627999999999789988999813554-44101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGK-GLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k-~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) ||||||||+||||+|+|..+||||+||+|||+|+|+++.|.++|+++|+|++++.. +.|..|+. +.. T Consensus 2 qAVILagG~GtRl~plT~~~PK~Llpi~n~P~i~y~l~~l~~~G~~~i~i~~~~~~~~~i~~~~~----------~~~-- 69 (214) T cd04198 2 QAVILAGGGGSRLYPLTDNIPKALLPVANKPMIWYPLDWLEKAGFEDVIVVVPEEEQAEISTYLR----------SFP-- 69 (214) T ss_pred EEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHH----------HHC-- T ss_conf 59998688876656435699864668999974999999999869988999957535999999999----------704-- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEE Q ss_conf 000113441124857998433342334665421101122333311127864445443322112221223432200-1124 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI-LAVS 166 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v-i~~~ 166 (299) ........+.++.+++|+|||+|+..+++.+. ++|+|++||.+...+ +++|+++|.++++.+ +++. T Consensus 70 -------~~~~~~~~~~~~~~~~~~gT~~aLr~~~~~i~-~dflvl~gD~i~~i~-----l~~ll~~Hr~~~a~~T~~l~ 136 (214) T cd04198 70 -------LNLKQKLDEVTIVLDEDMGTADSLRHIRKKIK-KDFLVLSCDLITDLP-----LIELVDLHRSHDASLTVLLY 136 (214) T ss_pred -------CCCCCCEEEEEEECCCCCCHHHHHHHHHHCCC-CCEEEECCCEEECCC-----HHHHHHHHHHHCCCEEEEEE T ss_conf -------56677628999657987871999999985189-998998599836688-----89999999970994899995 Q ss_pred ECC Q ss_conf 328 Q gi|254780365|r 167 ECD 169 (299) Q Consensus 167 ~v~ 169 (299) ++. T Consensus 137 ~~~ 139 (214) T cd04198 137 PPP 139 (214) T ss_pred ECC T ss_conf 357 No 42 >cd04197 eIF-2B_epsilon_N The N-terminal domain of epsilon subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of epsilon subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=100.00 E-value=3.7e-33 Score=224.01 Aligned_cols=190 Identities=19% Similarity=0.277 Sum_probs=134.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||||||||+||||+|+|..+||||+||+|+|+|+|+++.|..+|+++|++++++..++|++||...... . T Consensus 2 qAVIlAgg~gtRl~PlT~~~PK~LlPv~n~Pli~y~l~~L~~~G~~ei~v~~~~~~~~i~~~~~~~~~~-----~----- 71 (217) T cd04197 2 QAVVLADSFNRRFRPLTKEKPRCLLPLANVPLIDYTLEFLALNGVEEVFVFCCSHSDQIKEYIEKSKWS-----K----- 71 (217) T ss_pred EEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHC-----C----- T ss_conf 599987888772562035898653379999839999999998799779999468999999999843221-----5----- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-----CCC Q ss_conf 0011344112485799843334233466542--110112233331112786444544332211222122343-----220 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVW--CARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKE-----GAN 161 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~--~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~-----~~~ 161 (299) .......+.++..++..++|+|+. .++.++. ++|+|++||.+...+ +++|+++|+++ ++. T Consensus 72 -------~~~~~~~v~~~~~~~~~~~Gdalr~l~~~~~i~-~dFlv~~gD~it~~~-----l~~~l~~Hr~~r~~dk~a~ 138 (217) T cd04197 72 -------PKSSLMIVIIIMSEDCRSLGDALRDLDAKGLIR-GDFILVSGDVVSNID-----LKEILEEHKERRKKDKNAI 138 (217) T ss_pred -------CCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCC-CCEEEEECCCEECCC-----HHHHHHHHHHHHCCCCCCE T ss_conf -------667773489975688876516999887604478-999999799041378-----8999999986313375744 Q ss_pred E-EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCC------------------CCCCEEECHHHEECHHHH Q ss_conf 0-11243285443313310013333335300000025778886------------------734233020200272331 Q gi|254780365|r 162 I-LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSST------------------FISNFFINGRYILHPDIF 221 (299) Q Consensus 162 v-i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~------------------~~Snl~~~GiYi~~~~i~ 221 (299) + +.+.++.+...++..--...-..+.+..++..+.++|..+. -.|||..+|+||++|+++ T Consensus 139 ~T~~~~~~~~~~~~~~~~~~~~v~id~~~~rll~~~~~~~~~~~~~~~l~~~ll~~~~~~~i~~dL~D~hIyIcsp~VL 217 (217) T cd04197 139 MTMVLKEASPPHRTRRTGEEFVIAVDPKTSRLLHYEELPGSKYRSITDLPSELLGSNSEVEIRHDLLDCHIDICSPDVL 217 (217) T ss_pred EEEEECCCCCCCCCCCCCCCCEEEEECCCCEEEEEECCCCCCCCCCCCCCHHHHCCCCCEEEECCCEECEEEEEECCCC T ss_conf 8998704688742345776508999589981999842466554443345799970599589984885033899602549 No 43 >cd02507 eIF-2B_gamma_N_like The N-terminal of eIF-2B_gamma_like is predicted to have glycosyltransferase activity. N-terminal domain of eEIF-2B epsilon and gamma, subunits of eukaryotic translation initiators, is a subfamily of glycosyltranferase 2 and is predicted to have glycosyltranferase activity. eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=99.98 E-value=9.5e-33 Score=221.46 Aligned_cols=191 Identities=20% Similarity=0.259 Sum_probs=130.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) ||||||||+||||+|+|+..|||||||+|+|+|+|+++.|.++||++|+|++++..+++++||...+.. T Consensus 2 qAVIlagg~gtrl~Plt~~~PK~lLpi~n~P~i~y~l~~l~~~gi~~i~i~~~~~~~~~~~~~~~~~~~----------- 70 (216) T cd02507 2 QAVVLADGFGSRFLPLTSDIPKALLPVANVPLIDYTLEWLEKAGVEEVFVVCCEHSQAIIEHLLKSKWS----------- 70 (216) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCC----------- T ss_conf 399982789773660006998655679999979999999998799889999588889999999862434----------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC----CCCCCCEEE Q ss_conf 0011344112485799843334233466542110112233331112786444544332211222122----343220011 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLY----EKEGANILA 164 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~----~~~~~~vi~ 164 (299) .........+.+..+.+++|||+++..+++.+. ++|+|+.||.+...+ ++.+++.| ....++++. T Consensus 71 -----~~~~~~~~~~~~~~~~~~~Gta~~l~~~~~~i~-~dflvl~gD~i~~~~-----l~~~l~~~~~~~~~~~~tl~~ 139 (216) T cd02507 71 -----SLSSKMIVDVITSDLCESAGDALRLRDIRGLIR-SDFLLLSCDLVSNIP-----LSELLEERRKKDKNAIATLTV 139 (216) T ss_pred -----CCCCCCCEEEEECCCCCCCCCHHHHHHHHHCCC-CCEEEECCCEEECCC-----HHHHHHHHHHHCCCCCEEEEE T ss_conf -----567676447896556887655799998761579-998998599787689-----899999998748577449999 Q ss_pred EEECCHH---------HCCCCCCEECCC---------CCCC-CCCE-EEEECCCCCCCCCCCCEEECHHHEECHHHH Q ss_conf 2432854---------433133100133---------3333-5300-000025778886734233020200272331 Q gi|254780365|r 165 VSECDPQ---------LSCKYGMVQVGK---------AIDH-QVFH-ISDMIEKPDSSTFISNFFINGRYILHPDIF 221 (299) Q Consensus 165 ~~~v~~~---------~~~~yGvi~~~~---------~~~~-~~~~-I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~ 221 (299) ...-+.. +..-+|+...+. ..+. .... -..+.++|..-...||++++|+|||+|+++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~l~i~~~~~~~~~~~~~i~~~~l~~~~~~~i~snL~D~giYI~sp~VL 216 (216) T cd02507 140 LLASPPVSTEQSKKTEEEDVIAVDSKTQRLLLLHYEEDLDEDLELIIRKSLLSKHPNVTIRTDLLDCHIYICSPDVL 216 (216) T ss_pred EEECCCCCCCCCCCCCCCCEEEEECCCCEEEEEEECCCCCCCCCCCCCHHHHHCCCCEEEECCCEEEEEEEECCCCC T ss_conf 98237776543334676637999379987999961003454344213278861699749844884857998241009 No 44 >cd02509 GDP-M1P_Guanylyltransferase GDP-M1P_Guanylyltransferase catalyzes the formation of GDP-Mannose. GDP-mannose-1-phosphate guanylyltransferase, also called GDP-mannose pyrophosphorylase (GDP-MP), catalyzes the formation of GDP-Mannose from mannose-1-phosphate and GTP. Mannose is a key monosaccharide for glycosylation of proteins and lipids. GDP-Mannose is the activated donor for mannosylation of various biomolecules. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase and mannose-1-phosphate guanylyltransferase. This CD covers the N-terminal GDP-mannose-1-phosphate guanylyltransferase domain, whereas the isomerase function is located at the C-terminal half. GDP-MP is a member of the nucleotidyltransferase family of enzymes. Probab=99.96 E-value=7.6e-31 Score=209.77 Aligned_cols=234 Identities=18% Similarity=0.295 Sum_probs=159.2 Q ss_pred EEEEECCCCCCCCCHHHC-CCCCCEEEEC-CEEHHHHHHHHHHH-CCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCC Q ss_conf 899956888833432440-8883335688-36279999999997-89988999813554441010011745444565418 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISK-VIPKEMLAIV-DRPVIQYVIEEALE-AGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRN 85 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~-~~pKpllpi~-~kpiI~~~i~~l~~-~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~ 85 (299) .+||||||.||||||+|+ ..||+++++. ++++++++++.+.. .+.++++|||+..-. +..+.++.+. T Consensus 2 ~~VIlaGG~GtRLWPlSr~~~PKQf~~~~~~~sLlq~T~~R~~~~~~~~~i~IvTn~~~~---------~~v~~ql~~~- 71 (274) T cd02509 2 YPVILAGGSGTRLWPLSRESYPKQFLKLFGDKSLLQQTLDRLKGLVPPDRILVVTNEEYR---------FLVREQLPEG- 71 (274) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCHHH---------HHHHHHHHHC- T ss_conf 899947986044687558889977521799998999999987468986780999572179---------9999998843- Q ss_pred CCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCC Q ss_conf 70000113441124857998433342334665421101122----33331112786444544-33221122212234322 Q gi|254780365|r 86 KKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG----DNPFALLLPDMIMSPLE-GENCMANMIKLYEKEGA 160 (299) Q Consensus 86 ~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~----de~flv~~gD~i~~~~~-~~~~l~~li~~~~~~~~ 160 (299) ......+.|+.+++||.||..|..++. |+.++|+-+||++.+.. +...++..++..++..- T Consensus 72 --------------~~~~~ii~EP~~rnTApAI~laa~~~~~~~~d~~l~v~PsDH~I~d~~~F~~~i~~a~~~a~~~~i 137 (274) T cd02509 72 --------------LPEENIILEPEGRNTAPAIALAALYLAKRDPDAVLLVLPSDHLIEDVEAFLKAVKKAVEAAEEGYL 137 (274) T ss_pred --------------CCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCE T ss_conf --------------986648967688897999999999999759997799984740200889999999999999733988 Q ss_pred CEEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCC-----C-EEECHHHEECHHHH-HHHCCCCC---- Q ss_conf 0011243285443313310013333335300000025778886734-----2-33020200272331-10001226---- Q gi|254780365|r 161 NILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFIS-----N-FFINGRYILHPDIF-SILNDWKE---- 229 (299) Q Consensus 161 ~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~S-----n-l~~~GiYi~~~~i~-~~L~~~~~---- 229 (299) .++++.+..++ ..||.++.+....++.++|.+|+|||+.++|.. | +=|+|+++++.+.+ +.+++..| T Consensus 138 vt~GI~P~~P~--TgYGYI~~~~~~~~~~~~V~~F~EKP~~~~A~~~i~sg~y~WNsGiFv~~~~~~l~~~~~~~p~i~~ 215 (274) T cd02509 138 VTFGIKPTRPE--TGYGYIEAGEKLGGGVYRVKRFVEKPDLETAKEYLESGNYLWNSGIFLFRAKTFLEELKKHAPDIYE 215 (274) T ss_pred EEEEEEECCCC--CCCEEEEECCCCCCCCEEEEEEEECCCHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 99742137789--8862898575357774257887647988999999875990330023220099999999987999999 Q ss_pred -------CC-CCCCCCHH-H--------HHHHH-HHC-CCEEEEEEEEEEEECCCHHH Q ss_conf -------66-76750037-8--------99999-831-98489998137864399889 Q gi|254780365|r 230 -------NE-GKGEIQLT-D--------SMRKL-SER-HDFLAYHFKGHTYDCGSKKG 268 (299) Q Consensus 230 -------~~-~~gE~~lt-D--------~i~~l-~~~-~~v~a~~~~g~w~DiG~~~~ 268 (299) .. ...++++- + .|+.. +++ .++..++.+..|-|+|++++ T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~f~~~~~iSIDyavmEk~~~~~vv~~~~~WsDlGsW~s 273 (274) T cd02509 216 ALEKALAAAGTDDFLRLLEEAFAKIPSISIDYAVMEKTKKVAVVPADFGWSDLGSWDA 273 (274) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCHHHHEECCCCEEEEECCCCCCCCCCCC T ss_conf 9999997567873020568898528775675498540788459983898587678667 No 45 >COG0836 {ManC} Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane] Probab=99.91 E-value=2.6e-25 Score=175.72 Aligned_cols=241 Identities=20% Similarity=0.261 Sum_probs=157.6 Q ss_pred EEEEECCCCCCCCCHHHCC-CCCCEEEE-CCEEHHHHHHHHHHH-CCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCC Q ss_conf 8999568888334324408-88333568-836279999999997-89988999813554441010011745444565418 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKV-IPKEMLAI-VDRPVIQYVIEEALE-AGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRN 85 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~-~pKpllpi-~~kpiI~~~i~~l~~-~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~ 85 (299) ++||||||.||||||+|+. .||++|++ +++.+++.+++.+.. .+++++++|||.. |. +....++.+- T Consensus 3 ~pvIlaGG~GsRLWPLSR~~~PKQFl~L~~~~Sllq~T~~R~~~l~~~~~~~vVtne~------~~---f~v~eql~e~- 72 (333) T COG0836 3 IPVILAGGSGSRLWPLSRKDYPKQFLKLFGDLSLLQQTVKRLAFLGDIEEPLVVTNEK------YR---FIVKEQLPEI- 72 (333) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHCCCCCCEEEEECHH------HH---HHHHHHHHHH- T ss_conf 2599707876536876754377230341799718999999875147856609995779------99---9999986641- Q ss_pred CCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCC Q ss_conf 7000011344112485799843334233466542110112----233331112786444544-33221122212234322 Q gi|254780365|r 86 KKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNII----GDNPFALLLPDMIMSPLE-GENCMANMIKLYEKEGA 160 (299) Q Consensus 86 ~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i----~de~flv~~gD~i~~~~~-~~~~l~~li~~~~~~~~ 160 (299) +....-.++.+++-+-||-||..|.-.+ +|.-++|+-+||++.+.. +...+++..+..++..- T Consensus 73 ------------~~~~~~~illEP~gRnTApAIA~aa~~~~~~~~d~~~lVlpsDH~I~d~~af~~av~~A~~~A~~g~l 140 (333) T COG0836 73 ------------DIENAAGIILEPEGRNTAPAIALAALSATAEGGDALVLVLPSDHVIADEEAFLNAVKKAEKAAEEGGI 140 (333) T ss_pred ------------HHCCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCEECCHHHHHHHHHHHHHHHHCCCE T ss_conf ------------01256606854578774999999999998758983899944755655689999999999999971987 Q ss_pred CEEEEEECCHHHCCCCCCEECCCCCCC-CCCEEEEECCCCCCCCCCCC------EEECHHHEECHHH-HHHHCCCCCCC- Q ss_conf 001124328544331331001333333-53000000257788867342------3302020027233-11000122666- Q gi|254780365|r 161 NILAVSECDPQLSCKYGMVQVGKAIDH-QVFHISDMIEKPDSSTFISN------FFINGRYILHPDI-FSILNDWKENE- 231 (299) Q Consensus 161 ~vi~~~~v~~~~~~~yGvi~~~~~~~~-~~~~I~~~vEKP~~~~~~Sn------l~~~GiYi~~~~i-~~~L~~~~~~~- 231 (299) -++++.+..+++ .||.++.+..... +.++|.+|||||+.++|.+- +=|.|+++|.... ++.+++..|.- T Consensus 141 VTfGI~Pt~PeT--GYGYIe~G~~~~~~~~~~V~~FvEKPd~etA~~yv~sG~y~WNSGmF~Fra~~~l~e~~~~~P~i~ 218 (333) T COG0836 141 VTFGIPPTRPET--GYGYIETGESIAENGVYKVDRFVEKPDLETAKKYVESGEYLWNSGMFLFRASVFLEELKKHQPDIY 218 (333) T ss_pred EEEECCCCCCCC--CCCEEECCCCCCCCCCEEEEEEEECCCHHHHHHHHHCCCEEEECCCEEEEHHHHHHHHHHHCCHHH T ss_conf 998327898866--741463286245678257555313887899999997396576323168888999999986090999 Q ss_pred ---------C---------CCCCCHHH--HHHHH-HHC-CCEEEEEEEEEEEECCCHHHHHHHH Q ss_conf ---------7---------67500378--99999-831-9848999813786439988999999 Q gi|254780365|r 232 ---------G---------KGEIQLTD--SMRKL-SER-HDFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 232 ---------~---------~gE~~ltD--~i~~l-~~~-~~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) . ..+|+..+ .|+.- +++ .++..++.+-.|-|+|+|..+.+.. T Consensus 219 ~~~~~~~~~~~d~~~~~l~~e~f~~~p~iSIDYAiMEkt~~~aVVp~~f~WsDlGsW~Al~~~~ 282 (333) T COG0836 219 CAAEKAFEAAVDENSVRLDNEAYEEIPAISIDYAIMEKTSKAAVVPADFGWSDLGSWHALWEVL 282 (333) T ss_pred HHHHHHHHCCCCCCHHHCCHHHHHHCCCCCCHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHH T ss_conf 9999987440334221004888744831220378774116648995678856556789999875 No 46 >COG1213 Predicted sugar nucleotidyltransferases [Cell envelope biogenesis, outer membrane] Probab=99.91 E-value=5.8e-24 Score=167.37 Aligned_cols=224 Identities=22% Similarity=0.328 Sum_probs=135.0 Q ss_pred CCCCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEC-CCCCCCCHHHCCCCHHHHH Q ss_conf 978885458999568888334324408883335688362799999999978998899981-3554441010011745444 Q gi|254780365|r 1 MGSLKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVT-GRGKGLIKDYFDIQFELEQ 79 (299) Q Consensus 1 ~~~m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~-~~~k~~I~~~f~~~~~~~~ 79 (299) |..| ||||||||.|+||.| .+||+|+.|+|||+|++.|++|.++||+++++|+ |+..+.++.+. T Consensus 1 ~~~~----kavILAAG~GsRlg~---~~PK~Lvev~gr~ii~~~i~~L~~~gi~e~vvV~~g~~~~lve~~l-------- 65 (239) T COG1213 1 MHPM----KAVILAAGFGSRLGP---DIPKALVEVGGREIIYRTIENLAKAGITEFVVVTNGYRADLVEEFL-------- 65 (239) T ss_pred CCCE----EEEEEECCCCCCCCC---CCCCHHHHCCCEEEHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHH-------- T ss_conf 9740----478770344553479---9971443158867389999999876986399996230278999998-------- Q ss_pred HHHHCCCCEEECCCCCCCCCCCEEEECCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 565418700001134411248579984333--423346654211011223333111278644454433221122212234 Q gi|254780365|r 80 SLRKRNKKAELTLLAESIPSIGNAVFTWQY--ERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEK 157 (299) Q Consensus 80 ~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~--~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~ 157 (299) ... +. +..+++-+ +--+|++.+++|++++... |++++||++|.+. +++.++++ T Consensus 66 --~~~-------------~~--~~~iv~N~~y~ktN~~~Sl~~akd~~~~~-fii~~sD~vye~~----~~e~l~~a--- 120 (239) T COG1213 66 --KKY-------------PF--NAKIVINSDYEKTNTGYSLLLAKDYMDGR-FILVMSDHVYEPS----ILERLLEA--- 120 (239) T ss_pred --HCC-------------CC--CEEEEECCCCCCCCCEEEEEEEHHHHCCC-EEEEECCEEECHH----HHHHHHHC--- T ss_conf --518-------------96--16999689855577646774543463385-7999578750689----99999847--- Q ss_pred CCCCEEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCC Q ss_conf 32200112432854433133100133333353000000257788867342330202002723311000122666767500 Q gi|254780365|r 158 EGANILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQ 237 (299) Q Consensus 158 ~~~~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ 237 (299) .+..+++.... +|.-.+-........+++.++= |--.+. +-..+|+.+|++++|..+.+.-.. +-++. T Consensus 121 ~~~~li~d~~~------~~~~~~ea~kv~~e~G~i~~ig-K~l~e~---~~e~iGi~~l~~~i~~~~~~~~~e--~~~~~ 188 (239) T COG1213 121 PGEGLIVDRRP------RYVGVEEATKVKDEGGRIVEIG-KDLTEY---DGEDIGIFILSDSIFEDTYELLVE--RSEYD 188 (239) T ss_pred CCCCEEEECCC------CCCCCCCEEEEEECCCEEEHHC-CCCCCC---CCEEEEEEEECHHHHHHHHHHHHH--HHHHH T ss_conf 57767895233------3355675069984497876210-785513---621653588345787878988765--56677 Q ss_pred HHHHHHHHH-HCCCEEEEEEEE-EEEECCCHHHHHHHHHHHH Q ss_conf 378999998-319848999813-7864399889999999986 Q gi|254780365|r 238 LTDSMRKLS-ERHDFLAYHFKG-HTYDCGSKKGFVLANIAFA 277 (299) Q Consensus 238 ltD~i~~l~-~~~~v~a~~~~g-~w~DiG~~~~yl~A~~~~a 277 (299) ..++..... +...+... ++| .|+||-||+++.+|.-.+. T Consensus 189 ~~~~~~~~~~~~~~~di~-~~g~~w~EVDtpeDl~~ar~~~~ 229 (239) T COG1213 189 YREVEKEAGLPFTEVDIH-VDGLFWMEVDTPEDLERARKYLV 229 (239) T ss_pred HHHHHHHHCCCEEEEECC-CCCCEEEECCCHHHHHHHHHHHH T ss_conf 999999858761785102-36852486388899999999999 No 47 >KOG1462 consensus Probab=99.67 E-value=5.3e-18 Score=130.70 Aligned_cols=241 Identities=16% Similarity=0.200 Sum_probs=138.1 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCC-CCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 899956888833432440888333568836279999999997899889998135-5444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGR-GKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~-~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) .||++|||-||||--+|...||+||||+++|||+|.+..|.++|+++|++|+.. .+..|+ ..+.....- T Consensus 11 qavV~a~~ggt~~p~~~~~~pKaLLPIgn~PMi~YpL~~L~~~gfteiiVv~~e~e~~~i~----------~al~~~~~l 80 (433) T KOG1462 11 QAVVLAGGGGTRMPEVTSRLPKALLPIGNKPMILYPLNSLEQAGFTEIIVVVNEDEKLDIE----------SALGSNIDL 80 (433) T ss_pred HHHEEECCCCEECHHHHHHCCHHHCCCCCCCEEEEEHHHHHHCCCEEEEEEECHHHHHHHH----------HHHHCCCCC T ss_conf 5532525896045444523632112458964364126689756973899995378899999----------987507754 Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE- Q ss_conf 0001134411248579984333423346654211011223333111278644454433221122212234322001124- Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVS- 166 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~- 166 (299) . . .+.-+.+...- ++-.||++++..--+.++.++|+|+-+|-+.+.. +.-|++.|..++++..++. T Consensus 81 ~-~------~~~~v~ip~~~-~~d~gtadsLr~Iy~kikS~DflvlsCD~Vtdv~-----l~~lvd~FR~~d~slamli~ 147 (433) T KOG1462 81 K-K------RPDYVEIPTDD-NSDFGTADSLRYIYSKIKSEDFLVLSCDFVTDVP-----LQPLVDKFRATDASLAMLIG 147 (433) T ss_pred C-C------CCCEEEEECCC-CCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCC-----CHHHHHHHHCCCHHHHHHHC T ss_conf 3-2------13279830456-3344778888654554146877999653026888-----08889887524765867752 Q ss_pred ECCHH------------HCCCCCCEECCCC--------CCCCCCEEEEE-CCC-CCCCCCCCCEEECHHHEECHHHHHHH Q ss_conf 32854------------4331331001333--------33353000000-257-78886734233020200272331100 Q gi|254780365|r 167 ECDPQ------------LSCKYGMVQVGKA--------IDHQVFHISDM-IEK-PDSSTFISNFFINGRYILHPDIFSIL 224 (299) Q Consensus 167 ~v~~~------------~~~~yGvi~~~~~--------~~~~~~~I~~~-vEK-P~~~~~~Snl~~~GiYi~~~~i~~~L 224 (299) ....+ ...-+|+.+-.+. ..+..+.+..- .-| |.. +..++|..+-+|+|+..++++| T Consensus 148 ~~~s~~~~pgqk~k~k~~~d~igi~e~t~rl~y~~~~~d~~~~l~i~~slL~~~prl-tl~t~L~dahiY~~k~~v~d~l 226 (433) T KOG1462 148 NALSEVPIPGQKGKKKQARDVIGINEDTERLAYSSDSADEEEPLVIRKSLLWNHPRL-TLTTKLVDAHIYVFKHWVIDLL 226 (433) T ss_pred CCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCCCCEEHHHHHHHCCCCE-EEECCCCCEEEEEEHHHHHHHH T ss_conf 645445366734455440213444156633688405776777432113455218864-8861543146642389999997 Q ss_pred CCCC-CCCCCCCCCHHHHHHHHHHC-----------------------------CCEEEEEEE--EEEEECCCHHHHHHH Q ss_conf 0122-66676750037899999831-----------------------------984899981--378643998899999 Q gi|254780365|r 225 NDWK-ENEGKGEIQLTDSMRKLSER-----------------------------HDFLAYHFK--GHTYDCGSKKGFVLA 272 (299) Q Consensus 225 ~~~~-~~~~~gE~~ltD~i~~l~~~-----------------------------~~v~a~~~~--g~w~DiG~~~~yl~A 272 (299) .+-+ ....++|| +.+.+....++ -+.++|... +-..-+.|.-.|.++ T Consensus 227 ~~~~sisSfk~~f-~P~lvkkQ~q~~~~~~~~~~~~l~t~~~~~~d~~~~~~d~ik~y~~~~p~e~~~~raNtL~~y~ei 305 (433) T KOG1462 227 SEKESISSFKADF-LPYLVKKQFQKNPPLKKNETSILPTPNLNNPDGIHSPDDRIKCYAYILPTESLFVRANTLLSYMEI 305 (433) T ss_pred HCCCCCEEECCCC-CCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCHHHHHHH T ss_conf 3178620200145-536666565307775653001367755568532347445113568990476535875336777764 Q ss_pred HH Q ss_conf 99 Q gi|254780365|r 273 NI 274 (299) Q Consensus 273 ~~ 274 (299) |. T Consensus 306 N~ 307 (433) T KOG1462 306 NR 307 (433) T ss_pred HH T ss_conf 07 No 48 >COG4750 LicC CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes [Cell envelope biogenesis, outer membrane] Probab=99.67 E-value=2.5e-17 Score=126.58 Aligned_cols=214 Identities=18% Similarity=0.253 Sum_probs=119.6 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) +|||||||.|||+.|+|..+||+||.|.|+|+|++.|+.|.++||++|+|||||.|++.+ +|..+. T Consensus 2 nAIIlAAG~gsR~~plT~~tpK~LlkV~g~plIErqI~~L~e~gI~dI~IVvGYlkE~Fe-----------YLkdKy--- 67 (231) T COG4750 2 NAIILAAGLGSRFVPLTQSTPKSLLKVNGEPLIERQIEQLREAGIDDITIVVGYLKEQFE-----------YLKDKY--- 67 (231) T ss_pred CEEEEECCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHCCCCEEEEEEEEHHHHHH-----------HHHHHC--- T ss_conf 618994255665331310287678873585509999999997798618999631489999-----------998715--- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 00113441124857998433342334665421101122333311127864445443322112221223432200112432 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSEC 168 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v 168 (299) ++++.|-.--.------.+++|++++.|. .++-+|+.+....+. .++. .+.-.++.+- T Consensus 68 -----------~vtLvyN~kY~~yNn~ySlyla~d~l~nt--YiidsDnyl~kNif~--------~~~~-~S~Yfav~~~ 125 (231) T COG4750 68 -----------DVTLVYNPKYREYNNIYSLYLARDFLNNT--YIIDSDNYLTKNIFL--------TKES-HSKYFAVYRS 125 (231) T ss_pred -----------CEEEEECCHHHHHHHHHHHHHHHHHHCCC--EEECCCHHHHHHHHH--------CCCC-CCEEEEEEEC T ss_conf -----------74999574077663099999999985256--795050476665552--------5735-2127899963 Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHH----HHHHHCCCCC-CCCCCCCCHHHHHH Q ss_conf 854433133100133333353000000257788867342330202002723----3110001226-66767500378999 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPD----IFSILNDWKE-NEGKGEIQLTDSMR 243 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~----i~~~L~~~~~-~~~~gE~~ltD~i~ 243 (299) ... ..|- +.. +..++|++++ ..- .++++..|+-.|++. +..+|+..-. ...+. ..--++.. T Consensus 126 ~~t--nEw~-l~~-----~~~~ki~~v~----Igg-~~~~imsG~sff~~~~~~ki~~ll~~~yv~~e~~k-~yWd~v~~ 191 (231) T COG4750 126 GKT--NEWL-LIY-----NSDGKITRVD----IGG-LNGYIMSGISFFDAQFSNKIKKLLKEYYVRLENRK-LYWDTVPM 191 (231) T ss_pred CCC--CEEE-EEE-----CCCCCEEEEE----ECC-CCCCEEEEEEEECCHHHHHHHHHHHHHHHCCHHHH-HHHHHHHH T ss_conf 877--4257-997-----6998489999----668-55625751144234247999999999983724266-78887799 Q ss_pred HHHHCCCEEEEEE-EEEEEECCCHHHHHHH Q ss_conf 9983198489998-1378643998899999 Q gi|254780365|r 244 KLSERHDFLAYHF-KGHTYDCGSKKGFVLA 272 (299) Q Consensus 244 ~l~~~~~v~a~~~-~g~w~DiG~~~~yl~A 272 (299) ..++.-.++.-.+ ++.-|.+-+.++|.+- T Consensus 192 ~ni~~l~m~iek~~~n~IyE~DsLdelrk~ 221 (231) T COG4750 192 ENIKELDMYIEKLNDNDIYEFDSLDELRKF 221 (231) T ss_pred HHHHHHHHHHHHHCCCCEEEECCHHHHHHH T ss_conf 877887676886167845874369999864 No 49 >TIGR03310 matur_ygfJ molybdenum hydroxylase accessory protein, YgfJ family. Members of this protein family are probable accessory proteins for the biosynthesis of enzymes related to xanthine dehydrogenase. Comparative genomics suggests a role in the maturation of selenium-dependent molybdenum hydroxylases, although a tenuous alternative hypothesis is a role for this protein (with a requirement for SelD, the selenium donor protein in the selenocysteine and selenouridine biosynthesis pathways) metabolizing a selenium-containing substrate such as selenate. Probab=99.61 E-value=1.1e-15 Score=116.35 Aligned_cols=179 Identities=15% Similarity=0.242 Sum_probs=114.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEE Q ss_conf 99956888833432440888333568836279999999997899889998135544410100117454445654187000 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAE 89 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~ 89 (299) |||||||+++|| ..+|.|+|++|+|+|+|+++.+.+++++++++|+++..+.+..++.. T Consensus 2 AIILAaG~S~Rm-----G~~K~Ll~~~g~~ll~~~i~~~~~~~~~~vivv~~~~~~~~~~~~~~---------------- 60 (188) T TIGR03310 2 AIILAAGLSSRM-----GQNKLLLPYKGKTILEHVVDNALRLFFDEVILVLGHEADELVALLAN---------------- 60 (188) T ss_pred EEEECCCCCCCC-----CCCCCCCEECCCCHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHCC---------------- T ss_conf 699889876659-----88900688898329999999998569996699827837899998626---------------- Q ss_pred ECCCCCCCCCCCEEEECCCCC-CCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 011344112485799843334-2334665421101122-33331112786444544332211222122343220011243 Q gi|254780365|r 90 LTLLAESIPSIGNAVFTWQYE-RKGLGHAVWCARNIIG-DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSE 167 (299) Q Consensus 90 ~~~~~~~~~~~~~i~~v~Q~~-p~Gtg~Ai~~a~~~i~-de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~ 167 (299) ..+ +..+..+. ..|.+++|.++..... .+.++++.+|.=+. ....+++|++.+..++..++.. T Consensus 61 --------~~~--~~~v~n~~~~~G~~~si~~gl~~~~~~~~~li~~~D~P~l---~~~~i~~L~~~~~~~~~~iv~p-- 125 (188) T TIGR03310 61 --------HSN--ITLVHNPQYAEGQSSSIKLGLELPVQSDGYLFLLGDQPFV---TPDIIQLLLEAFALKNDEIVVP-- 125 (188) T ss_pred --------CCC--CCEECCCCCCCCHHHHHHHHHHCCCCCCEEEEEECCCCCC---CHHHHHHHHHHHHHCCCCCEEE-- T ss_conf --------689--6265486556780899999997065788599971887889---9999999999998579961734-- Q ss_pred CCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 28544331331001333333530000002577888673423302020027233110001226667675003789999983 Q gi|254780365|r 168 CDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSE 247 (299) Q Consensus 168 v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~ 247 (299) .|. .++-.|. +|+...|+.|.+..-+ .| ...+++ T Consensus 126 -------~~~-------------------g~~g~Pv-----------lf~~~~~~~L~~l~gd--~G-------~r~ll~ 159 (188) T TIGR03310 126 -------LYK-------------------GKRGHPV-----------LFPRKLFPELLALTGD--TG-------GRQILR 159 (188) T ss_pred -------CCC-------------------CCCCCCE-----------EECHHHHHHHHHCCCC--CC-------HHHHHH T ss_conf -------559-------------------9536766-----------8779999999844388--03-------899998 Q ss_pred CC--CEEEEEEE--EEEEECCCHHHHH Q ss_conf 19--84899981--3786439988999 Q gi|254780365|r 248 RH--DFLAYHFK--GHTYDCGSKKGFV 270 (299) Q Consensus 248 ~~--~v~a~~~~--g~w~DiG~~~~yl 270 (299) +. .+.-+.++ +..+||-||++|- T Consensus 160 ~~~~~~~~v~~~d~~~~~DiDTpeD~~ 186 (188) T TIGR03310 160 ELPHEVKYVEVKDPGILFDIDTPEDYQ 186 (188) T ss_pred HCCCCEEEEECCCCCCCCCCCCHHHHH T ss_conf 480676999728989041899999996 No 50 >cd04182 GT_2_like_f GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=99.56 E-value=1.4e-14 Score=109.66 Aligned_cols=178 Identities=20% Similarity=0.312 Sum_probs=111.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) .|||||||+|+||. .+|+|+++.|+|+|+|+++.+..+++.++++|+++..+.+..++.. T Consensus 2 ~~iILAgG~s~RmG-----~~K~l~~~~g~pli~~~i~~~~~~~~~~i~vv~~~~~~~~~~~~~~--------------- 61 (186) T cd04182 2 AAIILAAGRSSRMG-----GNKLLLPLDGKPLLRHALDAALAAGLSRVIVVLGAEADAVRAALAG--------------- 61 (186) T ss_pred EEEEECCCCCCCCC-----CCEECCEECCCCHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHCC--------------- T ss_conf 69998997653599-----8912558997319999999999749983252377407899998547--------------- Q ss_pred EECCCCCCCCCCCEEEECCCC-CCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 001134411248579984333-423346654211011223--33311127864445443322112221223432200112 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQY-ERKGLGHAVWCARNIIGD--NPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAV 165 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~-~p~Gtg~Ai~~a~~~i~d--e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~ 165 (299) . .+..+.-+ -..|.+++|..+...+.. +.++++.+|.=+. ....+++|++.+...++.++.. T Consensus 62 ----------~--~~~~v~~~~~~~G~~~si~~gl~~~~~~~~~~lv~~~D~P~i---~~~~i~~L~~~~~~~~~~iv~~ 126 (186) T cd04182 62 ----------L--PVVVVINPDWEEGMSSSLAAGLEALPADADAVLILLADQPLV---TAETLRALIDAFREDGAGIVAP 126 (186) T ss_pred ----------C--CCEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCC---CHHHHHHHHHHHHHCCCCEEEE T ss_conf ----------9--936751787567853789999997334887689843777878---9999999999997589958999 Q ss_pred EECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHH Q ss_conf 43285443313310013333335300000025778886734233020200272331100012266676750037899999 Q gi|254780365|r 166 SECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKL 245 (299) Q Consensus 166 ~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l 245 (299) . |. .++..|. +|+...|+.|.+..-+ +| ...+ T Consensus 127 ~---------~~-------------------g~~g~Pv-----------l~~~~~~~~l~~l~gd--~G-------~r~~ 158 (186) T cd04182 127 V---------YQ-------------------GRRGHPV-----------LFPRSLFPELLALSGD--KG-------ARSL 158 (186) T ss_pred E---------CC-------------------CCCCCCE-----------EECHHHHHHHHHCCCC--CC-------HHHH T ss_conf 6---------17-------------------9756864-----------7779999999840277--03-------7999 Q ss_pred HHCC-CEEEEEE--EEEEEECCCHHHH Q ss_conf 8319-8489998--1378643998899 Q gi|254780365|r 246 SERH-DFLAYHF--KGHTYDCGSKKGF 269 (299) Q Consensus 246 ~~~~-~v~a~~~--~g~w~DiG~~~~y 269 (299) +++. ....+.+ .+.-+||-||++| T Consensus 159 l~~~~~~~~~~~~~~~~~~didTpeDl 185 (186) T cd04182 159 LRAHPDRVVVEVDDPGVLIDIDTPEDL 185 (186) T ss_pred HHHCCCCEEEECCCCCCCCCCCCHHHC T ss_conf 986988689963998826689895880 No 51 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=99.56 E-value=1e-14 Score=110.46 Aligned_cols=247 Identities=18% Similarity=0.243 Sum_probs=165.1 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEECCCCCC-CCHHHCCCCHHHHHHHHHCCC Q ss_conf 899956888833432440888333568836-279999999997899889998135544-410100117454445654187 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDR-PVIQYVIEEALEAGLTDFVFVTGRGKG-LIKDYFDIQFELEQSLRKRNK 86 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~k-piI~~~i~~l~~~Gi~~i~iv~~~~k~-~I~~~f~~~~~~~~~l~~~~~ 86 (299) -|||=-.-.-..|.|||+.+|=.+||++|| -|||+.|.++++|||+.|.+..+.+.. ++.||...+-+ |.-..+ T Consensus 4 ~aIi~l~~~~~~l~~LT~~Rp~A~~PfgGrYRlIDF~LSn~~NAgI~~v~~~~~~~~~~Sl~DHLg~G~e----WdL~r~ 79 (383) T TIGR02092 4 SAIINLTESKKNLSPLTKVRPVASLPFGGRYRLIDFPLSNMVNAGIRNVAVFFKNKERRSLFDHLGSGRE----WDLNRK 79 (383) T ss_pred EEEEECCCCCHHCCCHHHHCCCEEECCCCEEEEEECCHHCCCCCCEEEEEEECCCCCCCHHHHHCCCCCC----CCCCCC T ss_conf 8997357882011314550870332358647788301002310360022465266665315221348688----361102 Q ss_pred CEEECCCCCCCCCCCEEEEC--CCCC-CCCCCC--CCCCC-----CCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 00001134411248579984--3334-233466--54211-----01122---333311127864445443322112221 Q gi|254780365|r 87 KAELTLLAESIPSIGNAVFT--WQYE-RKGLGH--AVWCA-----RNIIG---DNPFALLLPDMIMSPLEGENCMANMIK 153 (299) Q Consensus 87 ~~~~~~~~~~~~~~~~i~~v--~Q~~-p~Gtg~--Ai~~a-----~~~i~---de~flv~~gD~i~~~~~~~~~l~~li~ 153 (299) .+.+ +-|+ +++. +.-+++ |++.. .+|+. .+..+|+.++.++-.+ ++++++ T Consensus 80 ~~gl------------F~~pP~~~~~~~~~~~~kaall~~~~~~~l~fl~rS~~~y~V~~~s~~v~NiD-----l~~~l~ 142 (383) T TIGR02092 80 RDGL------------FLFPPKYNDRDDVSEGGKAALLKRYFSNNLEFLKRSTSEYVVVLNSHMVCNID-----LKAVLK 142 (383) T ss_pred CCCE------------EEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCEEEECC-----HHHHHH T ss_conf 5752------------77068768643234563377889999887678750899718985474578302-----899999 Q ss_pred CCCCCCCCEEEEE-ECCHHHCCC-C-CCEECCCCCCCCCCEEEEEC----CCCCCCCCCCCEEECHHHEECHHH-HHHHC Q ss_conf 2234322001124-328544331-3-31001333333530000002----577888673423302020027233-11000 Q gi|254780365|r 154 LYEKEGANILAVS-ECDPQLSCK-Y-GMVQVGKAIDHQVFHISDMI----EKPDSSTFISNFFINGRYILHPDI-FSILN 225 (299) Q Consensus 154 ~~~~~~~~vi~~~-~v~~~~~~~-y-Gvi~~~~~~~~~~~~I~~~v----EKP~~~~~~Snl~~~GiYi~~~~i-~~~L~ 225 (299) .|++.++.+..|+ +|++.+.+. + -++.+|+.. +|.++. -|-......+|+ ..-+||++.+. .++|. T Consensus 143 ~H~~~~~~iT~VYKkv~~~~~~~~~c~~L~~De~g-----~v~s~~Pms~~~~~~~~e~~n~-Sl~~yi~~t~lLiell~ 216 (383) T TIGR02092 143 RHEETGKDITVVYKKVKKADLSEEDCTILKFDEKG-----EVKSIGPMSVAKNLNKEEEENL-SLDIYILSTELLIELLY 216 (383) T ss_pred HHHHCCCCEEEEECCCCCCCCCCCCCCEEEECCCC-----CEEEEECEEEECCCCCHHHHHH-HHHHEEECHHHHHHHHH T ss_conf 98744896689973478467762357278863678-----5688632000026761344432-01002426689999999 Q ss_pred CCCCCCCCCCC-CHHHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 12266676750-037899999831984899981378643998899999999863183466999 Q gi|254780365|r 226 DWKENEGKGEI-QLTDSMRKLSERHDFLAYHFKGHTYDCGSKKGFVLANIAFALARQDIRSDI 287 (299) Q Consensus 226 ~~~~~~~~gE~-~ltD~i~~l~~~~~v~a~~~~g~w~DiG~~~~yl~A~~~~al~~~~~~~~~ 287 (299) .... +|+. .+.+.|...+.+..+.||++.||.--|-+...|.+|||++ |+ ++...++ T Consensus 217 ~~~~---~g~~~~l~~~~~~~l~~~~~~aYEY~GYl~~I~s~~sYY~aNM~l-L~-~~~f~~L 274 (383) T TIGR02092 217 ECIQ---KGKKTSLEELIRENLKELNINAYEYKGYLANINSVKSYYKANMDL-LE-PKIFQSL 274 (383) T ss_pred HHHH---CCCHHHHHHHHHHHHHHCCCCCCEEEEEEEEECCHHHHHHHHHHH-HH-HHHHHHH T ss_conf 8763---155679999999999864754330342211110167898872886-31-6876887 No 52 >KOG1461 consensus Probab=99.49 E-value=2.6e-13 Score=101.79 Aligned_cols=234 Identities=16% Similarity=0.221 Sum_probs=137.9 Q ss_pred CCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCC Q ss_conf 54589995688883343244088833356883627999999999789988999813554441010011745444565418 Q gi|254780365|r 6 KVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRN 85 (299) Q Consensus 6 ~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~ 85 (299) ..-.||++|--+-||++|+|...|+.|||++|.|||+|+++.|..+||.++++.++....||.+|...+. |. T Consensus 23 ~rLqAIllaDsf~trF~Plt~~~p~~LLPlaNVpmIdYtL~~L~~agV~eVfvfc~~~~~qi~e~i~~se-----w~--- 94 (673) T KOG1461 23 HRLQAILLADSFETRFRPLTLEKPRVLLPLANVPMIDYTLEWLERAGVEEVFVFCSAHAAQIIEYIEKSE-----WY--- 94 (673) T ss_pred CCEEEEEEECCCHHCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHCC-----CC--- T ss_conf 6417999721410022313357874475644716799999999865961899995145799999986352-----03--- Q ss_pred CCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC----CCC Q ss_conf 7000011344112485799843334233466542--1101122333311127864445443322112221223----432 Q gi|254780365|r 86 KKAELTLLAESIPSIGNAVFTWQYERKGLGHAVW--CARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYE----KEG 159 (299) Q Consensus 86 ~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~--~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~----~~~ 159 (299) .+....+..+.-.+.+-+|+|.. -.+..+. .+|++++||++.... ++++++.|. +.. T Consensus 95 -----------~~~~~~v~ti~s~~~~S~GDamR~id~k~lit-gDFiLVsgd~vsN~p-----l~~~l~eHr~r~k~Dk 157 (673) T KOG1461 95 -----------LPMSFIVVTICSGESRSVGDAMRDIDEKQLIT-GDFILVSGDTVSNMP-----LRNVLEEHRKRRKEDK 157 (673) T ss_pred -----------CCCCCEEEEECCCCCCCHHHHHHHHHHCCEEE-CCEEEEECCEEECCC-----HHHHHHHHHHHHHHCC T ss_conf -----------66560279980788672888988877426010-646998377342475-----6999999998765172 Q ss_pred CCE--EEEEECC-HHHCCCCCCEECCCCCCCCCCEEEEECC----CCCCCCC------------CCCEEECHHHEECHHH Q ss_conf 200--1124328-5443313310013333335300000025----7788867------------3423302020027233 Q gi|254780365|r 160 ANI--LAVSECD-PQLSCKYGMVQVGKAIDHQVFHISDMIE----KPDSSTF------------ISNFFINGRYILHPDI 220 (299) Q Consensus 160 ~~v--i~~~~v~-~~~~~~yGvi~~~~~~~~~~~~I~~~vE----KP~~~~~------------~Snl~~~GiYi~~~~i 220 (299) .++ +.+.+.. ...... =++.++.... ++..+-+ |-..... -+++.-++|-|.+|++ T Consensus 158 ~~iMTmv~k~~st~~~~~~-~~~avd~~T~----~ll~yq~~~~~~~~~~l~~sl~d~~~~v~vr~DL~dc~IdIcS~~V 232 (673) T KOG1461 158 DAIMTMVFKESSTRETTEQ-VVIAVDSRTS----RLLHYQKCVREKHDIQLDLSLFDSNDEVEVRNDLLDCQIDICSPEV 232 (673) T ss_pred CCEEEEEEECCCCCCCCCC-EEEEECCCCC----EEEEEHHHCCCCCCCCCCHHHHCCCCCEEEECCCCCCEEEEECHHH T ss_conf 4357899822445667760-5999867766----2786521225456533688895589828997367776225766767 Q ss_pred HHHHCCCCCCCCCCCCCH-HHHHHHHH-HC--C-CEEEEEEEE--EEEECCCHHHHHHHHHH Q ss_conf 110001226667675003-78999998-31--9-848999813--78643998899999999 Q gi|254780365|r 221 FSILNDWKENEGKGEIQL-TDSMRKLS-ER--H-DFLAYHFKG--HTYDCGSKKGFVLANIA 275 (299) Q Consensus 221 ~~~L~~~~~~~~~gE~~l-tD~i~~l~-~~--~-~v~a~~~~g--~w~DiG~~~~yl~A~~~ 275 (299) +.+..+.- .||. +|...-+. .+ | +++.+..+. +-.-+-++..|.--+.+ T Consensus 233 ~sLF~dNF------Dyq~r~DfV~GvL~~dilg~kI~~~~~~~~~yA~rv~n~~syd~vSkD 288 (673) T KOG1461 233 LSLFTDNF------DYQTRDDFVRGVLVDDILGYKIHVHVLSSIDYAARVENLRSYDLVSKD 288 (673) T ss_pred HHHHHHCC------CCEEHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHCCCHHHHHHHHH T ss_conf 77763145------624444466543366542776899975735444313431888888899 No 53 >TIGR03202 pucB xanthine dehydrogenase accessory protein pucB. In Bacillus subtilis the expression of this protein, located in an operon with the structural subunits of xanthine dehydrogenase, has been found to be essential for XDH activity. Some members of this family appear to have a distant relationship to the MobA protein involved in molybdopterin biosynthesis, although this may be coincidental. Probab=99.48 E-value=1.1e-13 Score=104.03 Aligned_cols=182 Identities=18% Similarity=0.252 Sum_probs=113.3 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) .|||||||.++||. .+|.|+|+.|+|++.|.++.+..+++.++++|+++....+.. ..... T Consensus 2 ~aiILAAG~S~RmG-----~~KlLl~~~g~~ll~~~~~~~~~~~~~~vvVV~g~~~~~~~~----------~~~~~---- 62 (190) T TIGR03202 2 VAIYLAAGQSRRMG-----ENKLALPLGETTLGSASLKTALSSRLSKVIVVIGEKYAHLSW----------LDPYL---- 62 (190) T ss_pred EEEEECCCCCCCCC-----CCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCHHHHHHH----------HHHHH---- T ss_conf 79997998766899-----884238509911999999999866998199981871888876----------67876---- Q ss_pred EECCCCCCCCCCCEEEECC-CCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 0011344112485799843-3342334665421101122---33331112786444544332211222122343220011 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTW-QYERKGLGHAVWCARNIIG---DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILA 164 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~-Q~~p~Gtg~Ai~~a~~~i~---de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~ 164 (299) .....+.++. .+-..|.++.|.++..... -+.++++++|.=+. ....+++|++.|.+++..++. T Consensus 63 ---------~~~~~~~~v~n~~~~~G~~sSi~~Gl~~~~~~~~d~~li~l~D~P~l---~~~~i~~L~~~~~~~~~~iv~ 130 (190) T TIGR03202 63 ---------LADERIMLVCCRDACEGQAHSLKCGLRKAEAMGADAVVILLADQPFL---TADVINALLALAKRRPDDYVA 130 (190) T ss_pred ---------HCCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCC---CHHHHHHHHHHHHHCCCCEEE T ss_conf ---------33487499968986546079999999974366889699964897788---999999999999858990899 Q ss_pred EEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 24328544331331001333333530000002577888673423302020027233110001226667675003789999 Q gi|254780365|r 165 VSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRK 244 (299) Q Consensus 165 ~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~ 244 (299) . .|. .+|..|. +|+..+|+.|...+-+. | ... T Consensus 131 ~---------~~~-------------------g~~G~Pv-----------lf~~~~f~~L~~l~GD~--G-------ar~ 162 (190) T TIGR03202 131 A---------SFK-------------------GKPRPPI-----------LFSKSLFPKLKALKGDE--G-------ARA 162 (190) T ss_pred E---------EEC-------------------CCCCCCE-----------EECHHHHHHHHHCCCCH--H-------HHH T ss_conf 8---------628-------------------9636988-----------97799999988378874--5-------999 Q ss_pred HHHCC-CEEEEEE--EEEEEECCCHHHH Q ss_conf 98319-8489998--1378643998899 Q gi|254780365|r 245 LSERH-DFLAYHF--KGHTYDCGSKKGF 269 (299) Q Consensus 245 l~~~~-~v~a~~~--~g~w~DiG~~~~y 269 (299) ++++. ....... .|.-+||-||++| T Consensus 163 ll~~~~~~~~~~~~d~~~~~DIDTpeDy 190 (190) T TIGR03202 163 LLRKDKSGLALPVADASAFFDIDTKEDY 190 (190) T ss_pred HHHCCCCCEEEEECCCCCCCCCCCHHHC T ss_conf 9841776569983898916079884549 No 54 >PRK00317 mobA molybdopterin-guanine dinucleotide biosynthesis protein A; Reviewed Probab=99.48 E-value=4.3e-13 Score=100.48 Aligned_cols=187 Identities=14% Similarity=0.159 Sum_probs=107.6 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHH Q ss_conf 88545899956888833432440888333568836279999999997899889998135544410100117454445654 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRK 83 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~ 83 (299) |++| .|||||||+++|| ...+|.++++.|+|+|+|+++.+.. .+.+++|+++...+....+ T Consensus 1 m~~i-~gvILAGG~SsRm----gG~dK~ll~~~G~~li~~~~~~l~~-~~~~v~i~~~~~~~~~~~~------------- 61 (193) T PRK00317 1 MPPI-TGVILAGGRARRM----GGVDKGLQELNGKPLIQHVIDRLAP-QVDEIVINANRNLARYAAF------------- 61 (193) T ss_pred CCCC-EEEEECCCCCCCC----CCCCCCCCEECCEEHHHHHHHHHHH-HCCEEEEECCCCHHHHHHC------------- T ss_conf 9985-6999887676789----9999750658997499999998775-4576899768788888644------------- Q ss_pred CCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 18700001134411248579984333423346654211011223333111278644454433221122212234322001 Q gi|254780365|r 84 RNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANIL 163 (299) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi 163 (299) +..+..-.+....|.-.+|..|......+.++|+.+|.=+.. ...++.|++.+.+.++.++ T Consensus 62 ----------------~~~vi~D~~~~~~GPL~Gi~saL~~~~~~~vlv~~~DmP~i~---~~~i~~L~~~~~~~~~~iv 122 (193) T PRK00317 62 ----------------GLPVIPDELADFPGPLAGILAGLQQAETEWVLFVPCDTPFLP---RDLVARLAQALIEEDADIA 122 (193) T ss_pred ----------------CCEEEECCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCC---HHHHHHHHHHHHHCCCCEE T ss_conf ----------------996996378776675899999986268775999626778898---9999999999874799838 Q ss_pred EEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHH Q ss_conf 12432854433133100133333353000000257788867342330202002723311000122666767500378999 Q gi|254780365|r 164 AVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMR 243 (299) Q Consensus 164 ~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~ 243 (299) ... . + ++.. + + .|+ ++.++++.|+..-. .|+.. +. T Consensus 123 ~~~-~-------------~--------------~~~~-P-----L--~al--y~~~~l~~l~~~l~---~g~~~----l~ 157 (193) T PRK00317 123 VAH-D-------------G--------------ERDH-P-----T--FAL--YSVALLPDLEAALA---AGERK----VM 157 (193) T ss_pred EEE-E-------------C--------------CCCC-C-----C--CCC--CCHHHHHHHHHHHH---CCCCC----HH T ss_conf 986-4-------------8--------------9546-5-----3--110--47889999999999---09965----99 Q ss_pred HHHHCCCEEEEEEEE---EEEECCCHHHHHHHH Q ss_conf 998319848999813---786439988999999 Q gi|254780365|r 244 KLSERHDFLAYHFKG---HTYDCGSKKGFVLAN 273 (299) Q Consensus 244 ~l~~~~~v~a~~~~g---~w~DiG~~~~yl~A~ 273 (299) .+.++.++..+.++. .++||-||++|-+|. T Consensus 158 ~~l~~~~~~~v~~~~~~~~f~NiNTpeDl~~~~ 190 (193) T PRK00317 158 AFYARHGTVAVDFSDPKDAFFNINTPEDLAQLE 190 (193) T ss_pred HHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHH T ss_conf 999878958998079866236899999999999 No 55 >COG2068 Uncharacterized MobA-related protein [General function prediction only] Probab=99.41 E-value=3.3e-12 Score=95.04 Aligned_cols=188 Identities=23% Similarity=0.291 Sum_probs=121.8 Q ss_pred CCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHC Q ss_conf 85458999568888334324408883335688362799999999978998899981355444101001174544456541 Q gi|254780365|r 5 KKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKR 84 (299) Q Consensus 5 ~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~ 84 (299) .+| -|||||||+||||. .+|-|+|+.|+|++.++++.+..++.+++++|++|.- ...+ +... T Consensus 4 ~~v-~~VvLAAGrssRmG-----~~KlLap~~g~plv~~~~~~a~~a~~~~vivV~g~~~--~~~~-------~a~~--- 65 (199) T COG2068 4 STV-AAVVLAAGRSSRMG-----QPKLLAPLDGKPLVRASAETALSAGLDRVIVVTGHRV--AEAV-------EALL--- 65 (199) T ss_pred CCE-EEEEECCCCCCCCC-----CCCEECCCCCCCHHHHHHHHHHHCCCCEEEEEECCCH--HHHH-------HHHH--- T ss_conf 646-78887166666678-----7516541499828999999997268870999927611--3678-------8653--- Q ss_pred CCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 87000011344112485799843334233466542110112233--3311127864445443322112221223432200 Q gi|254780365|r 85 NKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDN--PFALLLPDMIMSPLEGENCMANMIKLYEKEGANI 162 (299) Q Consensus 85 ~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de--~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v 162 (299) ...+..+.+..+ -..|.++.+..+......+ ..++++||.= ......+..+++.+...+ . T Consensus 66 ------------~~~~~~~v~npd-~~~Gls~Sl~ag~~a~~~~~~~v~~~lgDmP---~V~~~t~~rl~~~~~~~~-~- 127 (199) T COG2068 66 ------------AQLGVTVVVNPD-YAQGLSTSLKAGLRAADAEGDGVVLMLGDMP---QVTPATVRRLIAAFRARG-A- 127 (199) T ss_pred ------------CCCCEEEEECCC-HHHHHHHHHHHHHHHCCCCCCEEEEEECCCC---CCCHHHHHHHHHHCCCCC-C- T ss_conf ------------369808996955-2223769999998735567976999968888---787899999998443467-3- Q ss_pred EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHH Q ss_conf 11243285443313310013333335300000025778886734233020200272331100012266676750037899 Q gi|254780365|r 163 LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSM 242 (299) Q Consensus 163 i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i 242 (299) ++.+ .|+ .+...|. +|+++.|..+....-+ .| . T Consensus 128 -~v~p-------~~~-------------------g~rG~Pv-----------~~~~~~~~~l~~l~GD--~G-------~ 160 (199) T COG2068 128 -AVRP-------VYG-------------------GARGHPV-----------LLSKDLFPALARLSGD--VG-------A 160 (199) T ss_pred -EEEE-------ECC-------------------CCCCCCE-----------EECHHHHHHHHHCCCC--HH-------H T ss_conf -4422-------206-------------------8767964-----------5656678998616886--22-------8 Q ss_pred HHHHHCCC--EEEEEE-EEEEEECCCHHHHHHHHHH Q ss_conf 99983198--489998-1378643998899999999 Q gi|254780365|r 243 RKLSERHD--FLAYHF-KGHTYDCGSKKGFVLANIA 275 (299) Q Consensus 243 ~~l~~~~~--v~a~~~-~g~w~DiG~~~~yl~A~~~ 275 (299) .+++++++ +..++. .|.-.||-||++|.+++-. T Consensus 161 r~ll~~~~~~~~~V~~~~g~llDVDTped~~~a~~~ 196 (199) T COG2068 161 RQLLEEGGLPLVEVEVDAGVLLDVDTPEDLARAQDL 196 (199) T ss_pred HHHHHHCCCCEEEECCCCCEEECCCCHHHHHHHHHH T ss_conf 999975386347652688517547998999999876 No 56 >TIGR00454 TIGR00454 conserved hypothetical protein TIGR00454; InterPro: IPR005245 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=99.39 E-value=2.4e-13 Score=102.01 Aligned_cols=126 Identities=17% Similarity=0.276 Sum_probs=78.4 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 8999568888334324408883335688362799999999978-998899981355444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) -|+|||||+||||. ...-|||++++|+++|||++..+.++ |++.|+++|+++--.-++|. +.+.+. T Consensus 2 ~aL~MaGGkGTRlg---~d~EKPL~evcGr~lIDhvl~~l~~~~GV~~ii~~TSPhTP~Te~y~----------~~~~~~ 68 (204) T TIGR00454 2 LALVMAGGKGTRLG---RDVEKPLLEVCGRKLIDHVLEALKKAKGVDNIIVVTSPHTPKTEEYV----------AEKYKE 68 (204) T ss_pred CCEEECCCCCCCCC---CCCCCCHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHH----------HHCCCE T ss_conf 50240588731226---55565337565850457777866404787537998377797637887----------305850 Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0001134411248579984333423346654211011223333111278644454433221122212234322 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGA 160 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~ 160 (299) -....+.+.... .|++- .--++.+.+..+ +|||+|+.+|-++..+. .+...++.|-+..+ T Consensus 69 ~~~ivvidasGk----GYiED-----l~E~~~hlE~~~-~EP~lV~ssDl~~~r~~---~id~Ivd~y~~~~~ 128 (204) T TIGR00454 69 YKRIVVIDASGK----GYIED-----LREVLSHLELAF-SEPLLVVSSDLVLVRDK---IIDSIVDAYYKIKK 128 (204) T ss_pred EEEEEEEECCCC----CCHHH-----HHHHHHHHHHHH-CCCEEEEECCHHHHHHH---HHHHHHHHHHCCCC T ss_conf 457889976999----60252-----789999877663-39857874360243367---78777643210488 No 57 >cd02503 MobA MobA catalyzes the formation of molybdopterin guanine dinucleotide. The prokaryotic enzyme molybdopterin-guanine dinucleotide biosynthesis protein A (MobA). All mononuclear molybdoenzymes bind molybdenum in complex with an organic cofactor termed molybdopterin (MPT). In many bacteria, including Escherichia coli, molybdopterin can be further modified by attachment of a GMP group to the terminal phosphate of molybdopterin to form molybdopterin guanine dinucleotide (MGD). This GMP attachment step is catalyzed by MobA, by linking a guanosine 5'-phosphate to MPT forming molybdopterin guanine dinucleotide. This reaction requires GTP, MgCl2, and the MPT form of the cofactor. It is a reaction unique to prokaryotes, and therefore may represent a potential drug target. Probab=99.33 E-value=5.9e-12 Score=93.49 Aligned_cols=176 Identities=17% Similarity=0.243 Sum_probs=102.6 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) .|||||||.++||. .+|.|+++.|+|+|+|+++.+... ..+++||++...+. ++. T Consensus 2 ~aiILAgG~S~RmG-----~~K~ll~~~g~~li~~~~~~l~~~-~~~v~vv~~~~~~~---~~~---------------- 56 (181) T cd02503 2 TGVILAGGKSRRMG-----GDKALLELGGKPLLEHVLERLKPL-VDEVVISANRDQER---YAL---------------- 56 (181) T ss_pred EEEEECCCCCCCCC-----CCCCCCEECCCCHHHHHHHHHHHH-CCCEEEEECCCHHH---HHC---------------- T ss_conf 69997897734189-----985177989840999999999864-68079991772776---631---------------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 00113441124857998433342334665421101122333311127864445443322112221223432200112432 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSEC 168 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v 168 (299) .+..+..- +....|.-.+|..+......+.++|+.+|.-+.. ...+++|++.+++....++... T Consensus 57 ----------~~~~~i~d-~~~~~GPl~gi~~~l~~~~~~~~lv~~~D~P~i~---~~~i~~L~~~~~~~~~~v~~~~-- 120 (181) T cd02503 57 ----------LGVPVIPD-EPPGKGPLAGILAALRAAPADWVLVLACDMPFLP---PELLERLLAAAEEGADAVVPKS-- 120 (181) T ss_pred ----------CCCEEEEC-CCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCC---HHHHHHHHHHHHCCCCEEEEEE-- T ss_conf ----------79869947-9978784899999986278775999647989888---9999999997326997899988-- Q ss_pred CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 85443313310013333335300000025778886734233020200272331100012266676750037899999831 Q gi|254780365|r 169 DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSER 248 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~ 248 (299) + +++. + + .| +++.++++.|++.-. .|+.. +..+.++ T Consensus 121 -------------~--------------g~~~-P-----l--~~--~~~~~~~~~l~~~l~---~g~~~----~~~~l~~ 156 (181) T cd02503 121 -------------G--------------GRLQ-P-----L--HA--LYHKSLLPALEELLE---AGERR----LRRLLEK 156 (181) T ss_pred -------------C--------------CEEC-C-----E--EE--EECHHHHHHHHHHHH---CCCCC----HHHHHHH T ss_conf -------------9--------------9003-1-----3--35--645999999999998---29925----9999987 Q ss_pred CCEEEEEEE----EEEEECCCHHHH Q ss_conf 984899981----378643998899 Q gi|254780365|r 249 HDFLAYHFK----GHTYDCGSKKGF 269 (299) Q Consensus 249 ~~v~a~~~~----g~w~DiG~~~~y 269 (299) .++....++ +.++||-||+++ T Consensus 157 ~~~~~v~~~~~~~~~~~niNTpeDL 181 (181) T cd02503 157 LGVQYVEFEDERLDAFFNINTPEDL 181 (181) T ss_pred CCCEEEECCCCCCCCCCCCCCHHHC T ss_conf 8987997888885704278996669 No 58 >PRK00155 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Reviewed Probab=99.29 E-value=5.6e-11 Score=87.46 Aligned_cols=209 Identities=20% Similarity=0.252 Sum_probs=107.6 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHH Q ss_conf 885458999568888334324408883335688362799999999978-9988999813554441010011745444565 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLR 82 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~ 82 (299) |+|+ -|||||||.|+||. ..+||.+++++|+|+|+|.++.+.+. .|++|++|++.... ++ .+..+. T Consensus 2 m~~i-~aIIlAaG~G~R~g---~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~IvvV~~~~~~---~~------~~~~~~ 68 (228) T PRK00155 2 MPMV-YAIIPAAGIGSRMG---ADRPKQYLPLGGKPILEHTLEAFLAHPAIDEIIVVVPPDDT---EY------AKLLLA 68 (228) T ss_pred CCCE-EEEEECCCCCCCCC---CCCCCCCCEECCEEHHHHHHHHHHCCCCCCEEEEEECCHHH---HH------HHHHHH T ss_conf 8757-99993772051378---67996556689569999999999759998979999576269---99------999987 Q ss_pred HCCCCEEECCCCCCCCCCCEEEECCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 41870000113441124857998433342334--6654211011223333111278644454433221122212234322 Q gi|254780365|r 83 KRNKKAELTLLAESIPSIGNAVFTWQYERKGL--GHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGA 160 (299) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gt--g~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~ 160 (299) ... ..+.++. =|. ...|+.+...+.+++++++. |- -.+......+.++++..++.++ T Consensus 69 ~~~---------------~~i~~v~----GG~tR~~Sv~ngL~~l~~~~~VlIH-Da-ARP~is~~~i~~li~~~~~~~~ 127 (228) T PRK00155 69 KFD---------------KRIKVVA----GGAERQDSVLNGLQALPDDDWVLVH-DA-ARPFLTPDDIDRLIEAAEEYGA 127 (228) T ss_pred HCC---------------CCEEEEC----CCHHHHHHHHHHHHHHCCCCEEEEE-CC-CCCCCCHHHHHHHHHHHHHCCC T ss_conf 449---------------8579976----9703999999999984039979997-06-6768999999999999984698 Q ss_pred CEEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHH------HHHCCCCCCCCCC Q ss_conf 0011243285443313310013333335300000025778886734233020200272331------1000122666767 Q gi|254780365|r 161 NILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIF------SILNDWKENEGKG 234 (299) Q Consensus 161 ~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~------~~L~~~~~~~~~g 234 (299) .+.++ ++.+ ++ -..+.. ..|.+.+. . +++.. +-+|..| +.+++.. .. T Consensus 128 ~ip~~-p~~D-Ti-----k~~~~~-----~~i~~tl~---R----~~l~~----~QTPQ~F~~~~l~~a~~~~~----~~ 180 (228) T PRK00155 128 AILAV-PVRD-TI-----KRVDAG-----GFIVDTPD---R----SGLWA----AQTPQGFRVELLKEALARAL----AE 180 (228) T ss_pred CEEEE-ECCC-CE-----EEECCC-----CCEEEECC---C----HHEEE----EECCCCCCHHHHHHHHHHHH----HC T ss_conf 46987-6024-17-----896589-----81553035---1----44366----52885325999999999998----75 Q ss_pred CCCHHHHHHHHHHCC-CEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 500378999998319-848999813786439988999999 Q gi|254780365|r 235 EIQLTDSMRKLSERH-DFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 235 E~~ltD~i~~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) .++.||=...+...| ++..+.-+..=+=|=+|+++.-|. T Consensus 181 ~~~~TDd~sl~~~~g~~v~~v~G~~~N~KIT~~eDL~~ae 220 (228) T PRK00155 181 GFTITDDASAVEWLGKPVRLVEGRYDNIKVTTPEDLALAE 220 (228) T ss_pred CCCCCCHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHH T ss_conf 9987789999998699769996685437728999999999 No 59 >PRK13368 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional Probab=99.28 E-value=4.2e-11 Score=88.24 Aligned_cols=224 Identities=16% Similarity=0.148 Sum_probs=132.0 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHH Q ss_conf 885458999568888334324408883335688362799999999978-9988999813554441010011745444565 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLR 82 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~ 82 (299) |+ +-+||+|=+-.||+- -|||.+|+|+|||+|+.+.+.++ ++++++|.|... .|.++ .. T Consensus 1 Mk--~~~iIPAR~~S~R~p------~K~L~~i~g~pmi~~v~~~a~~s~~~d~v~VaTdd~--eI~~~----------~k 60 (238) T PRK13368 1 MK--VVVVIPARYGSSRLP------GKPLLDILGKPMIQHVYERAAQAAGVEEVYVATDDQ--RIEDA----------VE 60 (238) T ss_pred CC--EEEEECCCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEECCCH--HHHHH----------HH T ss_conf 97--899984687787899------870125289699999999998468988299956842--48777----------50 Q ss_pred HCCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-C Q ss_conf 418700001134411248579984333423346654211011223333111278644454433221122212234322-0 Q gi|254780365|r 83 KRNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGA-N 161 (299) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~-~ 161 (299) . .+.+......+...||- .+.-|...++.+-++-+.||.-+... ..+.++++.+.+... . T Consensus 61 ~---------------~g~~~i~ts~~~~~GTd-Ri~Ea~~~~~~d~ivnvQgDePli~p---~~I~~~i~~~~~~~~~~ 121 (238) T PRK13368 61 A---------------FGGKVVMTSDDHLSGTD-RLAEVMLKIEADIYINVQGDEPLIRP---RDIDTLIQPMLDDPSIN 121 (238) T ss_pred C---------------CCCEEEECCCCCCCCHH-HHHHHHHHCCCCEEEEECCCCCCCCH---HHHHHHHHHHHHCCCCE T ss_conf 0---------------46327851565577147-89999851598789995388677899---99999999998377652 Q ss_pred E-EEEEECC-HHHCCCCCCEECCCCCCCCCCEEEEECCCC--CCCC-CCCC-EEECHHHEECHHHHHHHCCCCCCCCCCC Q ss_conf 0-1124328-544331331001333333530000002577--8886-7342-3302020027233110001226667675 Q gi|254780365|r 162 I-LAVSECD-PQLSCKYGMVQVGKAIDHQVFHISDMIEKP--DSST-FISN-FFINGRYILHPDIFSILNDWKENEGKGE 235 (299) Q Consensus 162 v-i~~~~v~-~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP--~~~~-~~Sn-l~~~GiYi~~~~i~~~L~~~~~~~~~gE 235 (299) + .++.+.. .++...--++.+-- +...++.-|--.| .... .+.+ +-..|+|.|+++.+....+++++ + T Consensus 122 v~t~~~~~~~~~~~~n~n~vKvv~---~~~~~~lyfSR~~ip~~~~~~~~~~~khiGiy~f~~~~L~~f~~l~~s----~ 194 (238) T PRK13368 122 VATLCAPISTEEEFESPNVVKVVV---DKNGDALYFSRSPIPSRRDGESARYLKHVGIYAFRRDVLQQFSQLPET----P 194 (238) T ss_pred EEEEEECCCCHHHHCCCCCEEEEE---CCCCCCCCCCCCCCCCCCCCCCCHHHEEEEEEEECHHHHHHHHHCCCC----H T ss_conf 767874168888843898529998---997870114525675112454201220301122119999998708998----5 Q ss_pred CCHHHHHHHH--HHCC-CEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 0037899999--8319-848999813786439988999999 Q gi|254780365|r 236 IQLTDSMRKL--SERH-DFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 236 ~~ltD~i~~l--~~~~-~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) ++...-+++| +++| ++..+.++..-+.|-||+++.+|. T Consensus 195 lE~~E~lEqLR~leng~~I~~~~~~~~~~~VDt~eDl~~v~ 235 (238) T PRK13368 195 LEQIESLEQLRALEHGYKIRMVEVEATSIGVDTPEDLERVR 235 (238) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHH T ss_conf 67777089999998699267999599999987999999999 No 60 >pfam01128 IspD Uncharacterized protein family UPF0007. Probab=99.25 E-value=3e-11 Score=89.12 Aligned_cols=208 Identities=17% Similarity=0.217 Sum_probs=105.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 8999568888334324408883335688362799999999978-998899981355444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) -|||||||.||||. ..+||.+++++|||+|+|.++.+.+. .+++|++|++... .+++ ..+. T Consensus 2 ~aIIlAaG~G~R~~---~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvv~~~~~---~~~~-------~~~~----- 63 (221) T pfam01128 2 VAVIPAAGSGKRMG---AGVPKQFLQLLGQPLLEHTVDAFLASPVVDRIVVAVSPDD---TPEF-------RQLL----- 63 (221) T ss_pred EEEEECCEECCCCC---CCCCCCCCEECCEEHHHHHHHHHHCCCCCCEEEEEECHHH---HHHH-------HHHC----- T ss_conf 89993581061269---5898653668988899999999963888897999967788---9999-------9860----- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCEEEE Q ss_conf 00011344112485799843334233466542110112233-331112786444544332211222122343-2200112 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDN-PFALLLPDMIMSPLEGENCMANMIKLYEKE-GANILAV 165 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de-~flv~~gD~i~~~~~~~~~l~~li~~~~~~-~~~vi~~ 165 (299) ...++.++.-.. .-.++|+.|...+.++ +++++. |- -.+......+.++++...+. ++.+.+ T Consensus 64 -----------~~~~i~~v~GG~--tR~~SV~ngL~~l~~~~~~VlIH-Da-aRP~v~~~~i~~li~~~~~~~~~~i~~- 127 (221) T pfam01128 64 -----------GDPSIQLVAGGD--TRQDSVLNGLKALAGTAKFVLVH-DG-ARPCLPHADLARLLAALETGTQGAILA- 127 (221) T ss_pred -----------CCCCEEEECCCC--CHHHHHHHHHHHHCCCCCEEEEE-EC-CCCCCCHHHHHHHHHHHHHHCCCCEEE- T ss_conf -----------799879967996--38999999999835899989998-36-566799999999999986004861354- Q ss_pred EECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHH--H-CCCCCCCCCCCCCHHHHH Q ss_conf 4328544331331001333333530000002577888673423302020027233110--0-012266676750037899 Q gi|254780365|r 166 SECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSI--L-NDWKENEGKGEIQLTDSM 242 (299) Q Consensus 166 ~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~--L-~~~~~~~~~gE~~ltD~i 242 (299) .++.+ ++ + ..+. ...+.+.+ +. +++.. +-+|..|.+ | +...... ...++.||-. T Consensus 128 ~p~~D-Ti-k----~v~~-----~~~i~~t~---dR----~~l~~----~QTPQ~F~~~~L~~a~~~~~-~~~~~~TDd~ 184 (221) T pfam01128 128 LPVTD-TI-K----RVEA-----DGVVAGTP---DR----SGLWA----AQTPQGFRVDLLLAAHQRGD-QPGAEITDDA 184 (221) T ss_pred ECCCC-CE-E----EECC-----CCCEECCC---CC----HHHEE----CCCCCCCCHHHHHHHHHHHH-HCCCCCCCHH T ss_conf 12466-43-5----7537-----86044035---64----21011----00762006999999999888-6299976299 Q ss_pred HHHHHCC-CEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 9998319-848999813786439988999999 Q gi|254780365|r 243 RKLSERH-DFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 243 ~~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) ..+...| ++..+.-+..=+=|=+|+++.-|. T Consensus 185 sl~~~~g~~v~~v~G~~~N~KIT~~eDl~~ae 216 (221) T pfam01128 185 SLVEHAGGSVQVVPGRPDNLKITTPEDLALAE 216 (221) T ss_pred HHHHHCCCCEEEEECCCCCCCCCCHHHHHHHH T ss_conf 99997799779996584238878999999999 No 61 >cd02517 CMP-KDO-Synthetase CMP-KDO synthetase catalyzes the activation of KDO which is an essential component of the lipopolysaccharide. CMP-KDO Synthetase: 3-Deoxy-D-manno-octulosonate cytidylyltransferase (CMP-KDO synthetase) catalyzes the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate (CMP-KDO) and pyrophosphate. KDO is an essential component of the lipopolysaccharide found in the outer surface of gram-negative eubacteria. It is also a constituent of the capsular polysaccharides of some gram-negative eubacteria. Its presence in the cell wall polysaccharides of green algae and plant were also discovered. However, they have not been found in yeast and animals. The absence of the enzyme in mammalian cells makes it an attractive target molecule for drug design. Probab=99.23 E-value=1.2e-10 Score=85.38 Aligned_cols=221 Identities=16% Similarity=0.148 Sum_probs=127.4 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 8999568888334324408883335688362799999999978-998899981355444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) =+||+|=+..|||- -|||.+|+|+|||+|+.+.+.++ ++++++|.|.. +.|.++- .. T Consensus 3 i~iIPAR~~S~Rlp------~K~L~~I~G~pmI~~v~~~a~~s~~~d~V~VATDd--~eI~~~~----------~~---- 60 (239) T cd02517 3 IVVIPARYASSRLP------GKPLADIAGKPMIQHVYERAKKAKGLDEVVVATDD--ERIADAV----------ES---- 60 (239) T ss_pred EEEECCCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEECCC--HHHHHHH----------CC---- T ss_conf 99975887687899------97304539959899999999966998849996482--6555332----------14---- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCE-E Q ss_conf 0001134411248579984333423346654211011223--33311127864445443322112221223432-200-1 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGD--NPFALLLPDMIMSPLEGENCMANMIKLYEKEG-ANI-L 163 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~d--e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~-~~v-i 163 (299) .+++....-.+-+.||- .+.-|.+.++. +-++.+.||.-+... ..+..+++.+.+.. ..+ . T Consensus 61 -----------~g~~~imT~~~h~~GTd-Ri~ea~~~l~~~~D~iinvQGDePli~p---~~I~~~i~~~~~~~~~~v~t 125 (239) T cd02517 61 -----------FGGKVVMTSPDHPSGTD-RIAEVAEKLDADDDIVVNVQGDEPLIPP---EMIDQVVAALKDDPGVDMAT 125 (239) T ss_pred -----------CCCCCCCCCCCCCCCCH-HHHHHHHHCCCCCCEEEEEECCCCCCCH---HHHHHHHHHHHHCCCCEEEE T ss_conf -----------79740104643356215-9999999719988989995187567899---99999999985288755987 Q ss_pred EEEEC-CHHHCCCCCCEECCCCCCCCCCEEEEECCCC--CC--CCCCC-CEEECHHHEECHHHHHHHCCCCCCCCCCCCC Q ss_conf 12432-8544331331001333333530000002577--88--86734-2330202002723311000122666767500 Q gi|254780365|r 164 AVSEC-DPQLSCKYGMVQVGKAIDHQVFHISDMIEKP--DS--STFIS-NFFINGRYILHPDIFSILNDWKENEGKGEIQ 237 (299) Q Consensus 164 ~~~~v-~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP--~~--~~~~S-nl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ 237 (299) ++.+. +.++...-.++.+--..++ ++.-|-=-| -. ....+ -+-..|+|.|+++.+....+.+++. +| T Consensus 126 ~~~~i~~~~~~~~~n~VKvv~~~~~---~alyfSRs~IP~~~~~~~~~~~~khiGIy~f~~~~L~~f~~~~~s~----lE 198 (239) T cd02517 126 LATPISDEEELFNPNVVKVVLDKDG---YALYFSRSPIPYPRDSSEDFPYYKHIGIYAYRRDFLLRFAALPPSP----LE 198 (239) T ss_pred EEEECCCHHHHCCCCCEEEEECCCC---CCHHCCCCCCCCCCCCCCCCCEEEEEEEEEECHHHHHHHHCCCCCH----HH T ss_conf 4023699889418996399978988---5110334788643456555041037765563399999987379982----57 Q ss_pred HHHHHHHH--HHCC-CEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 37899999--8319-848999813786439988999999 Q gi|254780365|r 238 LTDSMRKL--SERH-DFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 238 ltD~i~~l--~~~~-~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) ...-+++| +++| ++..+..+..-+.|-||+++.+|. T Consensus 199 ~~E~lEqLR~leng~~I~~~~~~~~~~~VDt~eDl~~ve 237 (239) T cd02517 199 QIESLEQLRALENGYKIKVVETDHESIGVDTPEDLERVE 237 (239) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHH T ss_conf 776089999998398068999699998988999999997 No 62 >PRK02726 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=99.23 E-value=6.9e-11 Score=86.90 Aligned_cols=189 Identities=12% Similarity=0.123 Sum_probs=103.2 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHH Q ss_conf 88545899956888833432440888333568836279999999997899889998135544410100117454445654 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRK 83 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~ 83 (299) |+.==.|||||||+++|| ..+|.|+++.|+|+|+|+++.+...+ ++++|+++. .+....+. T Consensus 4 m~~~i~~vILAGG~S~RM-----G~dK~ll~~~G~~ll~~~~~~l~~~~-~~v~i~~~~-~~~~~~~~------------ 64 (200) T PRK02726 4 MKNNLVALILAGGKSSRM-----GQDKALITWQGVPLLQRVCRIAAACA-DEVYIITPW-PERYQSLL------------ 64 (200) T ss_pred CCCCCEEEEECCCCHHHC-----CCCCCCCEECCEEHHHHHHHHHHHHC-CEEEEECCC-HHHHHHHC------------ T ss_conf 557717999857654027-----99843387898759999999987418-869996799-78877525------------ Q ss_pred CCCCEEECCCCCCCCCCCEEEECCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 18700001134411248579984333-42334665421101122333311127864445443322112221223432200 Q gi|254780365|r 84 RNKKAELTLLAESIPSIGNAVFTWQY-ERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI 162 (299) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~i~~v~Q~-~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v 162 (299) ..+. .++... .-.|.-.+|+.|......+.++++.+|.=+. ....+++|++......... T Consensus 65 --------------~~~~--~~i~D~~~~~GPL~Gi~saL~~~~~~~~lv~~cD~P~l---~~~~i~~l~~~l~~~~~~~ 125 (200) T PRK02726 65 --------------PPGC--HWLREPPPSGGPLVAFAQGLPQIKTEWVLLLACDLPKL---RVGVLQEWLQQLDNVPEEA 125 (200) T ss_pred --------------CCCC--EEEECCCCCCCCHHHHHHHHHHCCCCCEEEECCCCCCC---CHHHHHHHHHHHHHCCCCC T ss_conf --------------8885--79016998898599999999856888389943887888---9999999999986088873 Q ss_pred EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHH Q ss_conf 11243285443313310013333335300000025778886734233020200272331100012266676750037899 Q gi|254780365|r 163 LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSM 242 (299) Q Consensus 163 i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i 242 (299) .++.... .. +. .+ ..|+| +++.+..|++.-. .|+..+.+ T Consensus 126 ~~~v~~~-------------~~------~~--------~P-------l~aly--~~~~~~~l~~~l~---~g~r~l~~-- 164 (200) T PRK02726 126 IAALPKQ-------------EK------GW--------EP-------LCGFY--RRRCLPSLEQFIQ---QGGRSFQG-- 164 (200) T ss_pred EEEEECC-------------CC------CC--------CC-------EEEEE--CHHHHHHHHHHHH---HCCCCHHH-- T ss_conf 7999715-------------99------64--------73-------12110--5889999999999---09954999-- Q ss_pred HHHHHCCCEEEEEEEE--EEEECCCHHHHHHHH Q ss_conf 9998319848999813--786439988999999 Q gi|254780365|r 243 RKLSERHDFLAYHFKG--HTYDCGSKKGFVLAN 273 (299) Q Consensus 243 ~~l~~~~~v~a~~~~g--~w~DiG~~~~yl~A~ 273 (299) +.++..+..+.++. ..+++-||++|-.++ T Consensus 165 --~l~~~~~~~v~~~~~~~~~NiNTPeDL~~~~ 195 (200) T PRK02726 165 --WLAQVPVQELALSDPDMLFNCNTPEDLAVIQ 195 (200) T ss_pred --HHHHCCCEEEECCCHHHHCCCCCHHHHHHHH T ss_conf --9987897797269845625899999999986 No 63 >cd02516 CDP-ME_synthetase CDP-ME synthetase is involved in mevalonate-independent isoprenoid production. 4-diphosphocytidyl-2-methyl-D-erythritol synthase (CDP-ME), also called 2C-methyl-d-erythritol 4-phosphate cytidylyltransferase catalyzes the third step in the alternative (non-mevalonate) pathway of Isopentenyl diphosphate (IPP) biosynthesis: the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate. This mevalonate independent pathway that utilizes pyruvate and glyceraldehydes 3-phosphate as starting materials for production of IPP occurs in a variety of bacteria, archaea and plant cells, but is absent in mammals. Thus, CDP-ME synthetase is an attractive targets for the structure-based design of selective antibacterial, herbicidal and antimalarial drugs. Probab=99.19 E-value=9.1e-11 Score=86.17 Aligned_cols=203 Identities=17% Similarity=0.223 Sum_probs=102.5 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 8999568888334324408883335688362799999999978-998899981355444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) -|||||||.|+||. ..+||.+++|+|+|+|+|.++.+... .|++|++|++.... +++ +...... T Consensus 2 ~aIIlAaG~G~R~~---~~~PKQf~~i~gk~ii~~sl~~f~~~~~i~~Iivv~~~~~~---~~~------~~~~~~~--- 66 (218) T cd02516 2 AAIILAAGSGSRMG---ADIPKQFLELGGKPVLEHTLEAFLAHPAIDEIVVVVPPDDI---DLA------KELAKYG--- 66 (218) T ss_pred EEEEECCCCCCCCC---CCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEEECHHHH---HHH------HHHHHHC--- T ss_conf 89992683471488---57985233589679999999999758998979999683789---999------9988644--- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCC--C-CCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 00011344112485799843334233--4-6654211011223-333111278644454433221122212234322001 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKG--L-GHAVWCARNIIGD-NPFALLLPDMIMSPLEGENCMANMIKLYEKEGANIL 163 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~G--t-g~Ai~~a~~~i~d-e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi 163 (299) ....+.++. | | .++|+.+...+.+ ++-.|+--|- -.+......+.++++...++++.+. T Consensus 67 -----------~~~~i~~v~-----GG~tR~~SV~ngl~~l~~~~~~~VlIHDa-aRP~i~~~~i~~li~~~~~~~~~i~ 129 (218) T cd02516 67 -----------LSKVVKIVE-----GGATRQDSVLNGLKALPDADPDIVLIHDA-ARPFVSPELIDRLIDALKEYGAAIP 129 (218) T ss_pred -----------CCCCEEEEC-----CCCCHHHHHHHHHHHHCCCCCCEEEEECC-CCCCCCHHHHHHHHHHHHHCCCEEE T ss_conf -----------798769989-----98409999998987401269998999257-6778999999999999973787189 Q ss_pred EEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHH------HHHCCCCCCCCCCCCC Q ss_conf 1243285443313310013333335300000025778886734233020200272331------1000122666767500 Q gi|254780365|r 164 AVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIF------SILNDWKENEGKGEIQ 237 (299) Q Consensus 164 ~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~------~~L~~~~~~~~~gE~~ 237 (299) ++ ++. ++ +...+.. ..+.+.+ +. +++.. +-+|..| +.+++.. +..+. T Consensus 130 ~~-p~~-DT-----ik~~~~~-----~~i~~t~---dR----~~l~~----~QTPQ~F~~~~l~~a~~~~~----~~~~~ 182 (218) T cd02516 130 AV-PVT-DT-----IKRVDDD-----GVVVETL---DR----EKLWA----AQTPQAFRLDLLLKAHRQAS----EEGEE 182 (218) T ss_pred EE-CCH-HC-----EEEECCC-----CEEEEEC---CC----CCEEE----EECCCCCCHHHHHHHHHHHH----HCCCC T ss_conf 84-251-01-----7996389-----7798623---76----13798----74875425999999999998----64999 Q ss_pred HHHHHHHHHHCC-CEEEEEEEEEEEECCCHHHHH Q ss_conf 378999998319-848999813786439988999 Q gi|254780365|r 238 LTDSMRKLSERH-DFLAYHFKGHTYDCGSKKGFV 270 (299) Q Consensus 238 ltD~i~~l~~~~-~v~a~~~~g~w~DiG~~~~yl 270 (299) .||-...+...| ++..++-+-.=+=|=+|+++. T Consensus 183 ~TDd~sl~~~~g~~v~~v~G~~~N~KIT~~eDl~ 216 (218) T cd02516 183 FTDDASLVEAAGGKVALVEGSEDNIKITTPEDLA 216 (218) T ss_pred CCCHHHHHHHCCCCEEEEECCCCCCCCCCHHHHH T ss_conf 7858999998699769997587558599899982 No 64 >PRK05450 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional Probab=99.18 E-value=8e-10 Score=80.35 Aligned_cols=225 Identities=16% Similarity=0.111 Sum_probs=125.2 Q ss_pred CCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHH Q ss_conf 885458999568888334324408883335688362799999999978-9988999813554441010011745444565 Q gi|254780365|r 4 LKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLR 82 (299) Q Consensus 4 m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~ 82 (299) |+ +-+||||=...|||- -|||.+|+|+|||+|+.+.+.++ ++++++|.|.. +.|.++- . T Consensus 1 Mk--~~~iIPAR~~SsR~P------gKpL~~I~GkpmI~~v~~~a~~~~~~~~V~VATdd--~~I~~~~----------~ 60 (248) T PRK05450 1 MK--FVVIIPARYASTRLP------GKPLADIGGKPMIVRVYERASKASGADRVVVATDD--ERIADAV----------E 60 (248) T ss_pred CC--EEEEEECCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEEECC--HHHHHHH----------H T ss_conf 97--799980797787899------86202418969899999999966898829997088--5244353----------1 Q ss_pred HCCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC--CCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 41870000113441124857998433342334665421101122--33331-1127864445443322112221223432 Q gi|254780365|r 83 KRNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG--DNPFA-LLLPDMIMSPLEGENCMANMIKLYEKEG 159 (299) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~--de~fl-v~~gD~i~~~~~~~~~l~~li~~~~~~~ 159 (299) . .+.+....-.+-+.||- .+.-|.+.++ +.+++ -+.||.-+... ..+..+++.+.+.. T Consensus 61 ~---------------~g~~~imTs~~h~~GTd-Ri~Ea~~~l~~~~~d~IInvQGDEPli~p---~~I~~li~~~~~~~ 121 (248) T PRK05450 61 A---------------FGGEVVMTSADHPSGTD-RIAEAAAKLGLSDDDIVVNVQGDEPLIPP---ELIDQVAEPLAAHP 121 (248) T ss_pred C---------------CCCCEEECCCCCCCCHH-HHHHHHHHCCCCCCCEEEEECCCCCCCCH---HHHHHHHHHHHHCC T ss_conf 5---------------68623404645477038-99999985087778689993598677899---99999999998588 Q ss_pred C-CE-EEEEEC-CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCC-------CCCCCC-EEECHHHEECHHHHHHHCCCC Q ss_conf 2-00-112432-854433133100133333353000000257788-------867342-330202002723311000122 Q gi|254780365|r 160 A-NI-LAVSEC-DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDS-------STFISN-FFINGRYILHPDIFSILNDWK 228 (299) Q Consensus 160 ~-~v-i~~~~v-~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~-------~~~~Sn-l~~~GiYi~~~~i~~~L~~~~ 228 (299) . .+ .++.+. +.++...--++.+--.. ..++.-|---|=| ....++ +-..|+|.|+++.+....+++ T Consensus 122 ~~~i~tl~~~~~~~~~~~d~n~VKvv~~~---~~~alyfSRs~IP~~~~~~~~~~~~~~~~hiGiy~f~~~~L~~f~~l~ 198 (248) T PRK05450 122 EADMATLAVPIDDEEELFNPNVVKVVLDK---NGYALYFSRAPIPWGRDAADPTAPTPVYRHIGIYAYRRGFLRRFVSLP 198 (248) T ss_pred CCEEEEEEEECCCHHHHHCCCCEEEEECC---CCCEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEHHHHHHHHHHCC T ss_conf 66399999862888996088834899899---998405542678754443212355551689998860099999987469 Q ss_pred CCCCCCCCCHHHHHHHH--HHCC-CEEEEEEEE-EEEECCCHHHHHHHHH Q ss_conf 66676750037899999--8319-848999813-7864399889999999 Q gi|254780365|r 229 ENEGKGEIQLTDSMRKL--SERH-DFLAYHFKG-HTYDCGSKKGFVLANI 274 (299) Q Consensus 229 ~~~~~gE~~ltD~i~~l--~~~~-~v~a~~~~g-~w~DiG~~~~yl~A~~ 274 (299) ++. +|...-+++| +++| ++..+.++. .-+-|-||+++.++.. T Consensus 199 ~s~----lE~~E~lEqLR~leng~~I~~~~~~~~~~~~VDT~eDl~~v~~ 244 (248) T PRK05450 199 PSP----LEQIESLEQLRALENGYRIHVAVVDEAPSPGVDTPEDLERVRA 244 (248) T ss_pred CCH----HHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHH T ss_conf 981----4656526999999859957999967899999779999999999 No 65 >COG2266 GTP:adenosylcobinamide-phosphate guanylyltransferase [Coenzyme metabolism] Probab=99.06 E-value=7.8e-11 Score=86.60 Aligned_cols=98 Identities=18% Similarity=0.273 Sum_probs=66.9 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) .|||||||+||||- ..-|||++++|||+|+|+++++.+ .+++|++.++.+--.++.| +.+.+ T Consensus 2 ~~iiMAGGrGtRmg----~~EKPlleV~GkpLI~~v~~al~~-~~d~i~v~isp~tp~t~~~----------~~~~g--- 63 (177) T COG2266 2 MAIIMAGGRGTRMG----RPEKPLLEVCGKPLIDRVLEALRK-IVDEIIVAISPHTPKTKEY----------LESVG--- 63 (177) T ss_pred CEEEECCCCCCCCC----CCCCCCHHHCCCCHHHHHHHHHHH-HCCCEEEEECCCCHHHHHH----------HHHCC--- T ss_conf 35896288544468----876752020781389999999972-2183899967998769999----------97359--- Q ss_pred EECCCCCCCCCCCEEEECCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 00113441124857998433342-334665421101122333311127864445 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYER-KGLGHAVWCARNIIGDNPFALLLPDMIMSP 141 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p-~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~ 141 (299) +++.. .| -|.-.-+..|.++++- ||+++.+|-.+.. T Consensus 64 ------------v~vi~----tpG~GYv~Dl~~al~~l~~-P~lvvsaDLp~l~ 100 (177) T COG2266 64 ------------VKVIE----TPGEGYVEDLRFALESLGT-PILVVSADLPFLN 100 (177) T ss_pred ------------CEEEE----CCCCCHHHHHHHHHHHCCC-CEEEEECCCCCCC T ss_conf ------------36997----5998708999999974499-4599865534178 No 66 >TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase; InterPro: IPR006375 This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (5.3.1.8 from EC) (PMI) and mannose-1-phosphate guanylyltransferase (2.7.7.22 from EC) in Pseudomonas aeruginosa, Xanthomonas campestris, and Acetobacter xylinus. The literature on the enzyme from Escherichia coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain lies at the C-terminal. ; GO: 0008928 mannose-1-phosphate guanylyltransferase (GDP) activity, 0009103 lipopolysaccharide biosynthetic process. Probab=99.03 E-value=1.1e-10 Score=85.70 Aligned_cols=239 Identities=17% Similarity=0.302 Sum_probs=139.1 Q ss_pred EEEECCCCCCCCCHHHCC-CCCCEEEECC-E-EHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCC Q ss_conf 999568888334324408-8833356883-6-2799999999978-9988999813554441010011745444565418 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKV-IPKEMLAIVD-R-PVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRN 85 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~-~pKpllpi~~-k-piI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~ 85 (299) .+|++||.|||+||++.. .||.++++.+ . ++++-.+..+... ....-+++++..-+ +-....++..+ T Consensus 3 p~~~~gg~g~~lwp~~~~~~p~~f~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~n~~~~---------f~~~~~~~~~~ 73 (478) T TIGR01479 3 PVILAGGSGTRLWPLSRELYPKQFLALVGDLYTLLQETLKRLAGLPDCSSPLVICNEEHR---------FLVAEQLRELG 73 (478) T ss_pred CEEEECCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCEEEECCHHH---------HHHHHHHHHHC T ss_conf 226407877501212112200344320452046789999862266422365255242034---------68888898606 Q ss_pred CCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCC-----CCCCCCCC-CCCCCCCCCCCCCC-CCCCCCCCCCCCCC Q ss_conf 7000011344112485799843334233466542110-----11223333-11127864445443-32211222122343 Q gi|254780365|r 86 KKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCAR-----NIIGDNPF-ALLLPDMIMSPLEG-ENCMANMIKLYEKE 158 (299) Q Consensus 86 ~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~-----~~i~de~f-lv~~gD~i~~~~~~-~~~l~~li~~~~~~ 158 (299) ... -..+.++..+.|+.|+..+. ..-+.++. +++..|+++..... ..+++......... T Consensus 74 ~~~--------------~~~~l~p~g~~~~p~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~g 139 (478) T TIGR01479 74 LLA--------------SNILLEPVGRNTAPAIALAALLALRRAPGEDPLLLVLAADHVITDEDAFRAAVKLALPAAAEG 139 (478) T ss_pred CCC--------------CCEEECCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 522--------------221430266641578999999876405787747898615511100367899987643243317 Q ss_pred CCCEEEEEECCHHHCCCCCCEECC--CCCCC-----CCCEEEEECCCCCCCCCCCC------EEECHHHEECHHHH-HHH Q ss_conf 220011243285443313310013--33333-----53000000257788867342------33020200272331-100 Q gi|254780365|r 159 GANILAVSECDPQLSCKYGMVQVG--KAIDH-----QVFHISDMIEKPDSSTFISN------FFINGRYILHPDIF-SIL 224 (299) Q Consensus 159 ~~~vi~~~~v~~~~~~~yGvi~~~--~~~~~-----~~~~I~~~vEKP~~~~~~Sn------l~~~GiYi~~~~i~-~~L 224 (299) ...+.++.+..++ ..||.+..+ ....+ +.+.+..|+|||....+.+- +-+.|.++|....+ ..| T Consensus 140 ~~~~fg~~p~~p~--~gygy~~~g~~~~~~~~~~~~~~~~~~~f~~~p~~~~~~~~~~~g~~~wn~g~f~~~~~~~~~~l 217 (478) T TIGR01479 140 KLVTFGIVPTTPE--TGYGYIRRGLDAPLAGGDDGDDVYAVERFVEKPDLATAQEYLESGDYYWNSGMFLFRASRYLAEL 217 (478) T ss_pred CEEEEECCCCCCC--CCCCHHHCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHCCCCEECCCCEEHHHHHHHHHH T ss_conf 5478502357765--45311101443102442112214666665405455678999751762001430000135789988 Q ss_pred CCCCC------------CCCCCC--------CCH--HHHHHHHH-HC-CCEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 01226------------667675--------003--78999998-31-9848999813786439988999999 Q gi|254780365|r 225 NDWKE------------NEGKGE--------IQL--TDSMRKLS-ER-HDFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 225 ~~~~~------------~~~~gE--------~~l--tD~i~~l~-~~-~~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) ++..+ ...... |.- .+.++..+ ++ ......+.+..|-|+|++..+.+.+ T Consensus 218 ~~~~p~~~~~~~~~~~~~~~d~~~~~~~~~~f~~~p~~~~d~~~~~~~~~~~~~p~~~~w~d~g~w~~~w~~~ 290 (478) T TIGR01479 218 KKHAPDIYDACEAALEASEPDLDFIRLDKEAFEACPSESIDYAVMEKTSDAVVVPLDAGWSDVGSWSALWEIS 290 (478) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHCCCHHHHHCCCCCCCCHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHC T ss_conf 7630588999999874102342010004578740621000023222145706872356654334313443200 No 67 >TIGR00453 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; InterPro: IPR001228 The bacterial ispD protein catalyzes the third step of the deoxyxylulose-5-phosphate pathway (DXP) of isoprenoid biosynthesis; the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate.; GO: 0008299 isoprenoid biosynthetic process. Probab=99.01 E-value=4.3e-10 Score=82.00 Aligned_cols=209 Identities=20% Similarity=0.323 Sum_probs=117.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 999568888334324408883335688362799999999978-9988999813554441010011745444565418700 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) |||||||.|+||. ...||..++|+|+|+|.|.++.+..+ .|++|+||+..... .||. ..+..+.. T Consensus 2 avi~AAG~G~R~~---~~~pKQy~~l~G~Pll~h~~~~~~~~~~~~~vvVV~~~~~~---~~~~------~~~~~~~~-- 67 (226) T TIGR00453 2 AVIPAAGRGTRMG---SGVPKQYLELAGRPLLEHTLDAFLAHPAIDEVVVVVSPDDT---EFFQ------KALAARAK-- 67 (226) T ss_pred EEEEECCCCEECC---CCCCEEEEEECCEEEHHHHHHHHHHHHCCCEEEEEECCCCH---HHHH------HHHCCCCC-- T ss_conf 6873167111336---79870266518802217789998632320646898367415---8888------86325657-- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCC-C-CCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCE Q ss_conf 0011344112485799843334233-4-66542110112---23333111278644454433221122212234-32200 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKG-L-GHAVWCARNII---GDNPFALLLPDMIMSPLEGENCMANMIKLYEK-EGANI 162 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~G-t-g~Ai~~a~~~i---~de~flv~~gD~i~~~~~~~~~l~~li~~~~~-~~~~v 162 (299) . ..+.++. =| | .+.|+.+...+ .+..|+++. |- =.+....+.+.++++.-.+ .++.+ T Consensus 68 ----------~-~~~~~v~----GG~~Rq~SV~~GL~a~~~~~~~~~VlvH-Da-ARPf~~~~~~~~l~~~~~~~~~a~~ 130 (226) T TIGR00453 68 ----------F-KVVKIVA----GGDTRQDSVRNGLKALPERADAEIVLVH-DA-ARPFVPKELIDRLLEALEKGAGAAI 130 (226) T ss_pred ----------C-CCCEEEC----CCCCHHHHHHHHHHHHHHCCCCCEEEEE-CC-CCCCCCHHHHHHHHHHHHHCCCCEE T ss_conf ----------8-6115846----9874689999999987635898828984-77-3458898799999999860798348 Q ss_pred EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHH-EECHHHHH--HHCCCCCC-CCCCCCCH Q ss_conf 1124328544331331001333333530000002577888673423302020-02723311--00012266-67675003 Q gi|254780365|r 163 LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRY-ILHPDIFS--ILNDWKEN-EGKGEIQL 238 (299) Q Consensus 163 i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiY-i~~~~i~~--~L~~~~~~-~~~gE~~l 238 (299) +++ ++.+ +. ...+ ..+.+.+-+ ..+ ++| +.+|-.|+ .|.+.-.. ....-++. T Consensus 131 ~A~-Pv~D-Tl-----K~~~-----~~G~~~~T~---dR~---------~Lw~~QTPQ~F~~~~L~~Ah~~~A~~~g~~~ 186 (226) T TIGR00453 131 LAL-PVAD-TL-----KRVD-----ADGFVVETV---DRE---------GLWAAQTPQAFRRELLLKAHARAAKAEGFEI 186 (226) T ss_pred EEE-ECCC-EE-----EEEC-----CCCCEECCC---CCC---------CCCEEECCCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 742-0543-15-----7744-----888620266---621---------4525576887688899999999998469851 Q ss_pred HHHHHHHHHC-C-CEEEEEEEEEEEECCCHHHHHHHH Q ss_conf 7899999831-9-848999813786439988999999 Q gi|254780365|r 239 TDSMRKLSER-H-DFLAYHFKGHTYDCGSKKGFVLAN 273 (299) Q Consensus 239 tD~i~~l~~~-~-~v~a~~~~g~w~DiG~~~~yl~A~ 273 (299) ||=-..+-.- | ++.-++=+-.=+=|=+|+|+.-|. T Consensus 187 TDDA~~~E~~~G~~v~LVeG~~~n~KiT~P~DL~~Ae 223 (226) T TIGR00453 187 TDDASAVERLFGGKVALVEGDADNFKITTPEDLALAE 223 (226) T ss_pred CCHHHHHHHHCCCCEEEEECCCCCEECCCHHHHHHHH T ss_conf 3589999996389728872761011036727899987 No 68 >PRK00560 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=98.97 E-value=6.8e-09 Score=74.65 Aligned_cols=178 Identities=15% Similarity=0.130 Sum_probs=100.6 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEE-HHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCC Q ss_conf 58999568888334324408883335688362-79999999997899889998135544410100117454445654187 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRP-VIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNK 86 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kp-iI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~ 86 (299) =.|||||||+++|| ..+|.++++.|+| +|+|+++.+... +++|+|+++.... + |. T Consensus 9 I~~vILAGGkSsRm-----G~dKall~~~g~~~Lie~~~~~l~~~-~~~i~is~~~~~~---~-~~-------------- 64 (196) T PRK00560 9 IPCVILAGGKSSRM-----GENKALLPFGSYSSLLEYQYTRLLKL-FKKVYISTKDKKF---E-FN-------------- 64 (196) T ss_pred CCEEEECCCCCCCC-----CCCCCCCEECCCCHHHHHHHHHHHHH-CCEEEEECCCHHH---H-CC-------------- T ss_conf 40899737872239-----99862527699555999999998863-6768996481366---4-59-------------- Q ss_pred CEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 00001134411248579984333423346654211011223333111278644454433221122212234322001124 Q gi|254780365|r 87 KAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVS 166 (299) Q Consensus 87 ~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~ 166 (299) .. .+ .-.+....|.-.+|+.|......+.++++-+|.=+... ..++.|++ ..+..++... T Consensus 65 ~~-------------~i-~d~~~~~~GPL~Gi~saL~~~~~~~~~v~~cD~Pfl~~---~~i~~L~~---~~~~~~v~~~ 124 (196) T PRK00560 65 AP-------------FL-LEKESDLFSPLFGIHNAFLTLQTPEIFFISVDTPFVSF---ESIKKLCG---KENFSVTYAK 124 (196) T ss_pred CC-------------EE-ECCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCH---HHHHHHHH---CCCCCEEEEE T ss_conf 98-------------88-65788988809999999985777868999578888899---99999972---2699989995 Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 32854433133100133333353000000257788867342330202002723311000122666767500378999998 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLS 246 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~ 246 (299) . -+++. + ..|+| +++.++.|++.-. .|++.+ ..++ T Consensus 125 -------------------------~---~~~~~-P-------L~aly--~~~~l~~l~~~l~---~g~~~l----~~~l 159 (196) T PRK00560 125 -------------------------S---PTKEH-Y-------LISLW--HRSLLNALNYALK---TQNYRL----SDLV 159 (196) T ss_pred -------------------------E---CCCCC-C-------EEEEE--CHHHHHHHHHHHH---CCCCHH----HHHH T ss_conf -------------------------4---89013-0-------46785--2889999999998---299519----9999 Q ss_pred HCCCEEEEEEE--EEEEECCCHHHHHHHHH Q ss_conf 31984899981--37864399889999999 Q gi|254780365|r 247 ERHDFLAYHFK--GHTYDCGSKKGFVLANI 274 (299) Q Consensus 247 ~~~~v~a~~~~--g~w~DiG~~~~yl~A~~ 274 (299) ++-+...+.++ ...+++-||++|-+|.- T Consensus 160 ~~~~~~~v~~~~~~~F~NiNTpeD~~~A~~ 189 (196) T PRK00560 160 KNSSSQAVHFEDEEEFLNLNTLKDYELALQ 189 (196) T ss_pred HHCCCEEEECCCCCCCCCCCCHHHHHHHHH T ss_conf 878968974599874246899999999999 No 69 >COG0746 MobA Molybdopterin-guanine dinucleotide biosynthesis protein A [Coenzyme metabolism] Probab=98.87 E-value=3.6e-09 Score=76.37 Aligned_cols=180 Identities=21% Similarity=0.280 Sum_probs=102.7 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 58999568888334324408883335688362799999999978998899981355444101001174544456541870 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) ..+||||||+|+|| .+|.|++++|+|+|+|+++.+....- .++|+.+.+.+. |. . T Consensus 5 ~~~vILAGG~srRm------~dK~l~~~~g~~lie~v~~~L~~~~~-~vvi~~~~~~~~---~~-----------~---- 59 (192) T COG0746 5 MTGVILAGGKSRRM------RDKALLPLNGRPLIEHVIDRLRPQVD-VVVISANRNQGR---YA-----------E---- 59 (192) T ss_pred CEEEEECCCCCCCC------CCCEEEEECCEEHHHHHHHHHHCCCC-EEEEECCCCHHH---HH-----------C---- T ss_conf 36999778753567------88745787982899999998740188-799976873424---43-----------1---- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 0001134411248579984333423-346654211011223333111278644454433221122212234322001124 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERK-GLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVS 166 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~-Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~ 166 (299) .+ +..+.-..+- |.=.+|+.|....+.+.++++-+|.=+... ..++.|.....+.++.++. T Consensus 60 -----------~g--~~vv~D~~~~~GPL~Gi~~al~~~~~~~~~v~~~D~P~i~~---~lv~~l~~~~~~~~~~~~~-- 121 (192) T COG0746 60 -----------FG--LPVVPDELPGFGPLAGILAALRHFGTEWVLVLPCDMPFIPP---ELVERLLSAFKQTGAAIVP-- 121 (192) T ss_pred -----------CC--CCEEECCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCH---HHHHHHHHHHCCCCCCEEE-- T ss_conf -----------69--85754788888878999999985798759998167787899---9999999862347884788-- Q ss_pred ECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 32854433133100133333353000000257788867342330202002723311000122666767500378999998 Q gi|254780365|r 167 ECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLS 246 (299) Q Consensus 167 ~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~ 246 (299) ..+ -| ++. | ..++| +..+.+.|+.... .||.. +..+. T Consensus 122 ~~~------~g-------------~~~-----P----------l~aly--~~~l~~~l~~~l~---~g~~~----~~~~l 158 (192) T COG0746 122 AHD------DG-------------RLE-----P----------LFALY--HRALLPALEEYLA---KGERR----LSALL 158 (192) T ss_pred ECC------CC-------------CEE-----E----------EEEEE--HHHHHHHHHHHHH---HCCCH----HHHHH T ss_conf 658------99-------------376-----6----------43222--1989999999999---48814----78899 Q ss_pred HCCCEEEEEEE--E--EEEECCCHHHHHHHH Q ss_conf 31984899981--3--786439988999999 Q gi|254780365|r 247 ERHDFLAYHFK--G--HTYDCGSKKGFVLAN 273 (299) Q Consensus 247 ~~~~v~a~~~~--g--~w~DiG~~~~yl~A~ 273 (299) +...+..+.++ . ..++|-||+++-++. T Consensus 159 ~~~~~~~v~~~~~~~~~F~NiNtpeDL~~~~ 189 (192) T COG0746 159 ERLGTEYVEFEDLEEDSFFNINTPEDLARAR 189 (192) T ss_pred HHCCCEEEECCCCCCCCHHCCCCHHHHHHHH T ss_conf 8779379976655640021269999999886 No 70 >PRK09382 ispDF bifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein; Provisional Probab=98.84 E-value=9.2e-09 Score=73.83 Aligned_cols=61 Identities=25% Similarity=0.285 Sum_probs=51.7 Q ss_pred CCCCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCC Q ss_conf 978885458999568888334324408883335688362799999999978-998899981355 Q gi|254780365|r 1 MGSLKKVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRG 63 (299) Q Consensus 1 ~~~m~~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~ 63 (299) |-.|+++ -|||||||.|+||.. ...+||++++++|+|+|+|.++.+.+. .|++|++|++.. T Consensus 1 ~~~M~~i-~aIIlAAG~G~Rfg~-~~~ipKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvvv~~~ 62 (382) T PRK09382 1 ETVMSDI-SLVIVAAGRSTRFGQ-SAEVKKQWLRIGGKPLWLHVLENFSSAPDFKEIVVVIHPD 62 (382) T ss_pred CCCCCCE-EEEEECCCCCCCCCC-CCCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEEECHH T ss_conf 9745763-799968722504899-8799974106797789999999996689989899993888 No 71 >PRK13385 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Provisional Probab=98.79 E-value=1.4e-08 Score=72.70 Aligned_cols=53 Identities=23% Similarity=0.358 Sum_probs=46.1 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHH-CCCCEEEEECCCC Q ss_conf 5899956888833432440888333568836279999999997-8998899981355 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALE-AGLTDFVFVTGRG 63 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~-~Gi~~i~iv~~~~ 63 (299) .-+||||||.|+||. ..+||++++|.|+|+|.|.++.+.. ..|++|++|++.. T Consensus 3 ~~~IilAaG~G~R~~---~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvv~~~~ 56 (238) T PRK13385 3 YELIFLAAGQGKMGN---VPLPKQFLDIDNKPILIHTIEKFILVSEFNEIIIATPAQ 56 (238) T ss_pred EEEEEECCCCCCCCC---CCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEECCHH T ss_conf 699997773461488---689964327997889999999985587678799967788 No 72 >pfam02348 CTP_transf_3 Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <= diphosphate + CMP-3-deoxy-D-manno-octulosonate, 2) acylneuraminate cytidylyltransferase EC:2.7.7.43,, catalysing the reaction:- CTP + N-acylneuraminate <= diphosphate + CMP-N-acylneuraminate. NeuAc cytydilyltransferase of Mannheimia haemolytica has been characterized describing kinetics and regulation by substrate charge, energetic charge and amino-sugar demand. Probab=98.72 E-value=3.1e-08 Score=70.62 Aligned_cols=182 Identities=16% Similarity=0.128 Sum_probs=98.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 9995688883343244088833356883627999999999789-988999813554441010011745444565418700 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAG-LTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~G-i~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) +||+|=+-.||+- -|||.+|+|+|||+|+++.+.+++ +++++|.|.. +.|.++- T Consensus 2 ~iIpAR~~S~Rlp------~K~L~~i~gkpmI~~v~~~a~~s~~~~~viVaTd~--~~I~~~~----------------- 56 (197) T pfam02348 2 AIIPARSGSKRLP------GKNLLKLGGKPLIARVIEAALQSKLFDKVVVATDS--EEIADIA----------------- 56 (197) T ss_pred EEECCCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEECCC--CEEEEEC----------------- T ss_conf 8984588887899------96203529906899999999975898728995483--0144201----------------- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EE-EE Q ss_conf 0011344112485799843334233466542110112233-331112786444544332211222122343220-01-12 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDN-PFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-IL-AV 165 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de-~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi-~~ 165 (299) . ..+.++.....+-+.||.-....++.+..+. -++-+.||.-+... ..+.++++.+.+.... +. .+ T Consensus 57 -----~---~~~~~~i~t~~~~~~gt~R~~ea~~~~~~~~d~ivnvqgD~Pli~p---~~i~~~i~~~~~~~~~~~~t~~ 125 (197) T pfam02348 57 -----E---KFGGGVVVRRGSLASGTDRFIEAVKAFLADEEIIVNLQGDEPLLQP---TSILRAIEHLREAGEDYVSTLV 125 (197) T ss_pred -----C---CCCCCEEEEEECCCCCCCHHHHHHHHHCCHHEEEEECCCCEEECCH---HHHHHHHHHHHHCCCCCEEEEE T ss_conf -----2---3233135764023565211999998643410099965896687588---9999999999858987289820 Q ss_pred EEC-CHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCC--C---CC-CEEECHHHEECHHHHHHHCCCCCC Q ss_conf 432-85443313310013333335300000025778886--7---34-233020200272331100012266 Q gi|254780365|r 166 SEC-DPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSST--F---IS-NFFINGRYILHPDIFSILNDWKEN 230 (299) Q Consensus 166 ~~v-~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~--~---~S-nl~~~GiYi~~~~i~~~L~~~~~~ 230 (299) .++ ..++...--.+.+-- +...+..-|...|-+.. . +. .+-..|+|.|+++.+.....++++ T Consensus 126 ~~~~~~~~~~~~n~vKvv~---~~~~~alyfsR~~iP~~~~~~~~~~~~~~h~gIY~f~~~~L~~f~~l~~s 194 (197) T pfam02348 126 KPVVSSEEILNPNPLKVVL---DDAGYALYFSRSPIPYIRDHPAPLQLYLRHKGIYAFRKEHLIRYVIDTPS 194 (197) T ss_pred EECCCHHHHCCCCCCEEEE---CCCCCEEECCCCCCCCCCCCCCCCCCCCEEEEEEEECHHHHHHHHCCCCC T ss_conf 4737888933998749998---88998824137889743123255787548999989889999998768999 No 73 >COG1211 IspD 4-diphosphocytidyl-2-methyl-D-erithritol synthase [Lipid metabolism] Probab=98.69 E-value=4.5e-08 Score=69.58 Aligned_cols=216 Identities=19% Similarity=0.292 Sum_probs=111.1 Q ss_pred CCCEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHC Q ss_conf 5458999568888334324408883335688362799999999978-998899981355444101001174544456541 Q gi|254780365|r 6 KVRKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKR 84 (299) Q Consensus 6 ~i~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~ 84 (299) +-.-|||||||.||||.. ..||..++++|+|+++|.++.+..+ .|++|+++++.... +||. .... T Consensus 3 ~~~~~vilAaG~G~R~~~---~~pKq~l~l~g~pll~~tl~~f~~~~~i~~ivvv~~~~~~---~~~~------~~~~-- 68 (230) T COG1211 3 MMVSAVILAAGFGSRMGN---PVPKQYLELGGRPLLEHTLEAFLESPAIDEIVVVVSPEDD---PYFE------KLPK-- 68 (230) T ss_pred CEEEEEEECCCCCCCCCC---CCCCEEEEECCEEEHHHHHHHHHHCCCCCEEEEEECHHHH---HHHH------HHHH-- T ss_conf 407999975763433579---9984588999988569999999747676769999784664---8999------8632-- Q ss_pred CCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 870000113441124857998433342334665421101122--333311127864445443322112221223432200 Q gi|254780365|r 85 NKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG--DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI 162 (299) Q Consensus 85 ~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~--de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v 162 (299) ......+.++.-.. .-...|+.+...+. +++++++ -|- -.+......+.+++....+.++.+ T Consensus 69 ------------~~~~~~v~~v~GG~--~R~~SV~~gL~~~~~~~~~~Vlv-HDa-aRPf~~~~~i~~li~~~~~~~aai 132 (230) T COG1211 69 ------------LSADKRVEVVKGGA--TRQESVYNGLQALSKYDSDWVLV-HDA-ARPFLTPKLIKRLIELADKYGAAI 132 (230) T ss_pred ------------HCCCCEEEEECCCC--CHHHHHHHHHHHHHCCCCCEEEE-ECC-CCCCCCHHHHHHHHHHHCCCCCEE T ss_conf ------------12487079934875--17999999999721158988999-646-667899999999998622388179 Q ss_pred EEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHH-EECHHHHH--HHCCC-CCCCCCCCCCH Q ss_conf 1124328544331331001333333530000002577888673423302020-02723311--00012-26667675003 Q gi|254780365|r 163 LAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRY-ILHPDIFS--ILNDW-KENEGKGEIQL 238 (299) Q Consensus 163 i~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiY-i~~~~i~~--~L~~~-~~~~~~gE~~l 238 (299) .++ ++.+ + +.... +...|.+. |.. | +.| +..|..|. .|.+. ......| ++. T Consensus 133 ~al-pv~D-T------ik~~~----~~~~i~~t---~~R----~-----~l~~~QTPQ~F~~~~L~~a~~~a~~~~-~~~ 187 (230) T COG1211 133 LAL-PVTD-T------LKRVD----ADGNIVET---VDR----S-----GLWAAQTPQAFRLELLKQALARAFAEG-REI 187 (230) T ss_pred EEE-ECCC-C------EEEEC----CCCCEEEC---CCH----H-----HHHHHHCCCCCCHHHHHHHHHHHHHCC-CCC T ss_conf 975-0467-3------78865----89872344---375----5-----423433876331999999999998628-776 Q ss_pred HHHHHHHHHCC-CEEEEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 78999998319-848999813786439988999999998 Q gi|254780365|r 239 TDSMRKLSERH-DFLAYHFKGHTYDCGSKKGFVLANIAF 276 (299) Q Consensus 239 tD~i~~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~~~~ 276 (299) ||=...+...| ++.-+.-+-+=+-+=+|+++.-|...+ T Consensus 188 tDdas~~e~~G~~v~lV~G~~~n~KiTtpeDL~~a~~il 226 (230) T COG1211 188 TDDASAIEKAGGPVSLVEGSADNFKITTPEDLEIAEAIL 226 (230) T ss_pred CCHHHHHHHCCCCEEEEECCCCEEEECCHHHHHHHHHHH T ss_conf 886999987699819982683305762888999999875 No 74 >TIGR02665 molyb_mobA molybdopterin-guanine dinucleotide biosynthesis protein A; InterPro: IPR013482 In many molybdopterin-containing enzymes, including nitrate reductase and dimethylsulphoxide reductase, the cofactor is molybdopterin-guanine dinucleotide. This entry describes proteobacterial MobA (molybdopterin-guanine dinucleotide biosynthesis protein A). MobA synthesises molybdopterin-guanine dinucleotide from molybdopterin and GTP. ; GO: 0003824 catalytic activity, 0006777 Mo-molybdopterin cofactor biosynthetic process. Probab=98.55 E-value=2.9e-08 Score=70.80 Aligned_cols=122 Identities=16% Similarity=0.193 Sum_probs=71.9 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 89995688883343244088833356883627999999999789988999813554441010011745444565418700 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) .+||||||+|+|| -...|.|+++.|+|||+|+++.+.-. +++|+|.-|.+.+... ++ +.. T Consensus 2 ~gviLAGG~arRM----GG~DKGL~~L~g~PL~~hv~~rL~PQ-v~~~~IsANRn~~~Y~-~~-------------g~G- 61 (202) T TIGR02665 2 SGVILAGGRARRM----GGRDKGLVELGGKPLIEHVLARLRPQ-VSDLAISANRNPERYA-QA-------------GFG- 61 (202) T ss_pred CEEEECCCCCCCC----CCCCCCCEECCCCCHHHHHHHHHCCC-HHHHHHHCCCCHHHHH-HH-------------CCC- T ss_conf 7367658740026----88886610128652899999984350-7666741388977898-86-------------089- Q ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 0011344112485799843334233466542110112----2333311127864445443322112221223432200 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNII----GDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI 162 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i----~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v 162 (299) +..+.+.. ...-.| .-|| | .|+.+.+.. .-+.++++-+|. +....+.+.+|.+.-..+.+.+ T Consensus 62 -l~V~~D~~--DA~~~F---~GPL--A-GilagL~~a~~~~~~~~vl~~PCD~---P~lP~dLv~RL~~a~~~~~a~i 127 (202) T TIGR02665 62 -LPVVPDDV--DALADF---PGPL--A-GILAGLRWAVAGTGTDWVLTVPCDT---PFLPEDLVARLAAALEAQDADI 127 (202) T ss_pred -CEECCCCC--CCCCCC---CCCH--H-HHHHHHHHHHHCCCCCEEEEECCCC---CCCCHHHHHHHHHHHHCCCCCE T ss_conf -73127853--435788---8876--8-9999999998446998288822888---9988789999999975289978 No 75 >cd02513 CMP-NeuAc_Synthase CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety. CMP-N-acetylneuraminic acid synthetase (CMP-NeuAc synthetase) or acylneuraminate cytidylyltransferase catalyzes the transfer the CMP moiety of CTP to the anomeric hydroxyl group of NeuAc in the presence of Mg++. It is the second to last step in the sialylation of the oligosaccharide component of glycoconjugates by providing the activated sugar-nucleotide cytidine 5'-monophosphate N-acetylneuraminic acid (CMP-Neu5Ac), the substrate for sialyltransferases. Eukaryotic CMP-NeuAc synthetases are predominantly located in the nucleus. The activated CMP-Neu5Ac diffuses from the nucleus into the cytoplasm. Probab=98.37 E-value=3.4e-06 Score=58.00 Aligned_cols=204 Identities=18% Similarity=0.192 Sum_probs=103.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 89995688883343244088833356883627999999999789-98899981355444101001174544456541870 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAG-LTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~G-i~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) =|+|+|=|-.+|+. -|.+.+++|+|+|+|+++.+.+++ +++|+|.|.. +.|.++- ... T Consensus 3 ~aiIpAR~gSkri~------~KN~~~i~gkpLi~~~i~~a~ks~~~d~IivSTDs--~~i~~~~----------~k~--- 61 (223) T cd02513 3 LAIIPARGGSKGIP------GKNIRPLGGKPLIAWTIEAALESKLFDRVVVSTDD--EEIAEVA----------RKY--- 61 (223) T ss_pred EEEECCCCCCCCCC------CHHHHHHCCCCHHHHHHHHHHHCCCCCEEEEECCH--HHHHHHH----------CCC--- T ss_conf 99986787787899------70044638959699999999966995507895587--8975421----------245--- Q ss_pred EEECCCCCCCCCCCEEEECCCCCC----CCCCCCCCCCCCCCC-----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 000113441124857998433342----334665421101122-----33331112786444544332211222122343 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYER----KGLGHAVWCARNIIG-----DNPFALLLPDMIMSPLEGENCMANMIKLYEKE 158 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p----~Gtg~Ai~~a~~~i~-----de~flv~~gD~i~~~~~~~~~l~~li~~~~~~ 158 (299) +..+.+.|-++- -.+-+.+..+..++. -+.++++.++.-+... ..+.++++.+.+. T Consensus 62 ------------~~~~~~~Rpk~ls~d~~~~~dvi~~~l~~~~~~~~~~d~iv~l~pTsP~r~~---~~I~~~i~~~~~~ 126 (223) T cd02513 62 ------------GAEVPFLRPAELATDTASSIDVILHALDQLEELGRDFDIVVLLQPTSPLRSA---EDIDEAIELLLSE 126 (223) T ss_pred ------------CCEEEECCCHHHCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCH---HHHHHHHHHHHHC T ss_conf ------------6417973846876886780677778998899868998999996279762789---9999999999857 Q ss_pred CC-CEEEEEECCHHHCCCCCCEECCCCCCCCCCEEE-EECCCCCCCCCCCCE-EECHHHEECHHHHHHHCCCCCCCCCCC Q ss_conf 22-001124328544331331001333333530000-002577888673423-302020027233110001226667675 Q gi|254780365|r 159 GA-NILAVSECDPQLSCKYGMVQVGKAIDHQVFHIS-DMIEKPDSSTFISNF-FINGRYILHPDIFSILNDWKENEGKGE 235 (299) Q Consensus 159 ~~-~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~-~~vEKP~~~~~~Snl-~~~GiYi~~~~i~~~L~~~~~~~~~gE 235 (299) +. +++++.+.... +.+.... .. ++..... ..-..+.....+..+ .++++|+++.+.|- +. . . T Consensus 127 k~dsl~sv~~~~~~-~~~~~~~--~~---~~~~~~~~~~~~~~~rQ~~~~~y~~~g~~yi~~~~~~~--k~----~-~-- 191 (223) T cd02513 127 GADSVFSVTEFHRF-PWRALGL--DD---NGLEPVNYPEDKRTRRQDLPPAYHENGAIYIAKREALL--ES----N-S-- 191 (223) T ss_pred CCCEEEEEEECCCC-HHHHEEE--CC---CCCCCCCCCHHHCCCCCCCCEEEEECCEEEEEEHHHHH--HC----C-C-- T ss_conf 99999999854768-1885676--26---87021368611067644676047860489999999998--54----9-8-- Q ss_pred CCHHHHHHHHHHCCCEEEEEEEE-EEEECCCHHHHHHHH Q ss_conf 00378999998319848999813-786439988999999 Q gi|254780365|r 236 IQLTDSMRKLSERHDFLAYHFKG-HTYDCGSKKGFVLAN 273 (299) Q Consensus 236 ~~ltD~i~~l~~~~~v~a~~~~g-~w~DiG~~~~yl~A~ 273 (299) + -.++...|.++. +.+||-++++|.-|. T Consensus 192 ~----------~~~k~~~~~~~~~~siDID~~~Dl~~Ae 220 (223) T cd02513 192 F----------FGGKTGPYEMPRERSIDIDTEEDFELAE 220 (223) T ss_pred C----------CCCCEEEEEECCCCEECCCCHHHHHHHH T ss_conf 0----------3897799994887168489999999999 No 76 >COG1212 KdsB CMP-2-keto-3-deoxyoctulosonic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=98.24 E-value=1.6e-05 Score=53.90 Aligned_cols=224 Identities=17% Similarity=0.132 Sum_probs=119.3 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 58999568888334324408883335688362799999999978998899981355444101001174544456541870 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) .-.||+|-=..|||- -|||--|+|+|||.++.+++.+||.+++++-|... .|.+. .. T Consensus 4 ~~viIPAR~~STRLp------gKPLadI~GkpmI~rV~e~a~~s~~~rvvVATDde--~I~~a----------v~----- 60 (247) T COG1212 4 FVVIIPARLASTRLP------GKPLADIGGKPMIVRVAERALKSGADRVVVATDDE--RIAEA----------VQ----- 60 (247) T ss_pred EEEEEECCHHCCCCC------CCCHHHHCCCHHHHHHHHHHHHCCCCEEEEECCCH--HHHHH----------HH----- T ss_conf 599985420026689------97357737960789999999873787289974988--99999----------99----- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE--E Q ss_conf 000113441124857998433342334665421101--122333311127864445443322112221223432200--1 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARN--IIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI--L 163 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~--~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v--i 163 (299) ..++......-+-+.||--.-..++. +-.++-.+=+.||.=+... ..+.++.+..+++.+.+ + T Consensus 61 ----------~~G~~avmT~~~h~SGTdR~~Ev~~~l~~~~~~iIVNvQGDeP~i~p---~~I~~~~~~L~~~~~~~aTl 127 (247) T COG1212 61 ----------AFGGEAVMTSKDHQSGTDRLAEVVEKLGLPDDEIIVNVQGDEPFIEP---EVIRAVAENLENSNADMATL 127 (247) T ss_pred ----------HHCCEEEECCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCCCCH---HHHHHHHHHHHHCCCCEEEE T ss_conf ----------70978996178889800899999996599964299986588777797---99999999987377660455 Q ss_pred EEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCC--CCCCCC---CCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCH Q ss_conf 12432854433133100133333353000000257--788867---3423302020027233110001226667675003 Q gi|254780365|r 164 AVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEK--PDSSTF---ISNFFINGRYILHPDIFSILNDWKENEGKGEIQL 238 (299) Q Consensus 164 ~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEK--P~~~~~---~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~l 238 (299) ++.-.+.++.-+=-++++-. +..++-.-|--- |-.-+. .+-|--+|+|-|..+++.-.-..+|+. ++. T Consensus 128 ~~~i~~~ee~~nPN~VKvV~---d~~g~ALYFSRs~iP~~rd~~~~~p~l~HIGIYayr~~~L~~f~~~~ps~----LE~ 200 (247) T COG1212 128 AVKITDEEEAFNPNVVKVVL---DKEGYALYFSRAPIPYGRDNFGGTPFLRHIGIYAYRAGFLERFVALKPSP----LEK 200 (247) T ss_pred EEECCCHHHHCCCCCEEEEE---CCCCCEEEEECCCCCCCCCCCCCCCHHHEEEHHHHHHHHHHHHHCCCCCH----HHH T ss_conf 34337888845998279997---59982899876878775434677643642205886799999997069960----567 Q ss_pred HHHHHHHH--HCC-CEEEEEEEEEE-EECCCHHHHHHHHH Q ss_conf 78999998--319-84899981378-64399889999999 Q gi|254780365|r 239 TDSMRKLS--ERH-DFLAYHFKGHT-YDCGSKKGFVLANI 274 (299) Q Consensus 239 tD~i~~l~--~~~-~v~a~~~~g~w-~DiG~~~~yl~A~~ 274 (299) +.-+++|. ..| ++.....+..- .-+-||+++-++-. T Consensus 201 ~E~LEQLR~Le~G~kI~v~i~~~~p~~gVDT~EDLe~v~~ 240 (247) T COG1212 201 IESLEQLRVLENGEKIHVEIVKEVPSIGVDTPEDLERVRK 240 (247) T ss_pred HHHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHHH T ss_conf 8778999998769757899841688889898899999999 No 77 >cd02518 GT2_SpsF SpsF is a glycosyltrnasferase implicated in the synthesis of the spore coat. Spore coat polysaccharide biosynthesis protein F (spsF) is a glycosyltransferase implicated in the synthesis of the spore coat in a variety of bacteria challenged by stress as starvation. The spsF gene is expressed in the late stage of coat development responsible for a terminal step in coat formation that involves the glycosylation of the coat. SpsF gene mutation resulted in spores that appeared normal. But, the spores tended to aggregate and had abnormal adsorption properties, indicating a surface alteration. Probab=98.08 E-value=3.4e-06 Score=58.03 Aligned_cols=117 Identities=15% Similarity=0.181 Sum_probs=73.4 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEECCCCC--CCCHHHCCCCHHHHHHHHHCCC Q ss_conf 999568888334324408883335688362799999999978-9988999813554--4410100117454445654187 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEA-GLTDFVFVTGRGK--GLIKDYFDIQFELEQSLRKRNK 86 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~-Gi~~i~iv~~~~k--~~I~~~f~~~~~~~~~l~~~~~ 86 (299) |||.|-.-.|||- -|+|++|+|+|+|+|+++.+..+ ++++|+++|+..+ +.++++ ... T Consensus 2 ~iI~aR~~S~RLp------~K~L~~i~~~~~i~~~i~r~k~~~~~~~IivaTs~~~~Dd~l~~~----------~~~--- 62 (233) T cd02518 2 AIIQARMGSTRLP------GKVLKPLGGKPLLEHLLDRLKRSKLIDEIVIATSTNEEDDPLEAL----------AKK--- 62 (233) T ss_pred EEEECCCCCCCCC------CCCHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHH----------HHH--- T ss_conf 8996077883789------860023099579999999999668988499981786522699999----------974--- Q ss_pred CEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 00001134411248579984333423346654211011223333111278644454433221122212234322001 Q gi|254780365|r 87 KAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANIL 163 (299) Q Consensus 87 ~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi 163 (299) .++.+.+-.+...+. .+..|....+-+.++-+.||+-+.+. ..+.++++.|.+.+...+ T Consensus 63 --------------~~i~~frGs~~dvl~-R~~~a~~~~~~d~ivri~gD~P~idp---~~id~~i~~~~~~~~Dyv 121 (233) T cd02518 63 --------------LGVKVFRGSEEDVLG-RYYQAAEEYNADVVVRITGDCPLIDP---EIIDAVIRLFLKSGADYT 121 (233) T ss_pred --------------CCEEEEECCCCHHHH-HHHHHHHHCCCCEEEEECCCCCCCCH---HHHHHHHHHHHCCCCCEE T ss_conf --------------848999578305889-99999871578889997687775898---999999999850799989 No 78 >PTZ00339 UDP-N-acetylglucosamine pyrophosphorylase; Provisional Probab=97.91 E-value=6.3e-05 Score=50.21 Aligned_cols=234 Identities=13% Similarity=0.170 Sum_probs=108.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHH------H--C--C---CCEEEEECC-CCCCCCHHHC Q ss_conf 8999568888334324408883335688---3627999999999------7--8--9---988999813-5544410100 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIV---DRPVIQYVIEEAL------E--A--G---LTDFVFVTG-RGKGLIKDYF 71 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~---~kpiI~~~i~~l~------~--~--G---i~~i~iv~~-~~k~~I~~~f 71 (299) -+|+||||.||||. ..-||-|++|+ +|++.+...+.+. . . | .=-++|.|+ ...+...+|| T Consensus 111 avlllAGGqGTRLG---~~~PKG~~~igl~s~ksLfql~aeri~~lq~la~~~~~~~~~~~ip~yIMTS~~t~~~T~~ff 187 (499) T PTZ00339 111 AVLILAGGLGTRLG---SDKPKGLLECTPLKKKSLFQFHCEKIRRLEEMAAAASGGGDDPTIYILVLTSSFNHDQTRQFL 187 (499) T ss_pred EEEEECCCCCCCCC---CCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHHHHH T ss_conf 99995078746679---999971111589998719999999999999998875067888860399977850469999999 Q ss_pred CCCHHHHHHHHHCCCCEEECCCCC-CC---C-CCCEEEEC----CCCCCCCCCCCCCCCCC-------CCCC------CC Q ss_conf 117454445654187000011344-11---2-48579984----33342334665421101-------1223------33 Q gi|254780365|r 72 DIQFELEQSLRKRNKKAELTLLAE-SI---P-SIGNAVFT----WQYERKGLGHAVWCARN-------IIGD------NP 129 (299) Q Consensus 72 ~~~~~~~~~l~~~~~~~~~~~~~~-~~---~-~~~~i~~v----~Q~~p~Gtg~Ai~~a~~-------~i~d------e~ 129 (299) ..+..+.- +...+....+ .+ . ..+.+-.- +-..|-|.|+-. .|.. .+.| +. T Consensus 188 ~~~~~FGl------~~~~V~fF~Q~~lP~~d~~~Gkille~~~~i~~aPdGNGG~y-~AL~~sg~~~~iL~dm~~rGI~y 260 (499) T PTZ00339 188 QENNFFGL------DKEQVIFFKQSSLPCYDENTGRVLMESRGSLCTAPGGNGDVF-KALAKCSEFMSILDKLEKLGIKY 260 (499) T ss_pred HHCCCCCC------CHHHEEEEECCCEEEEECCCCEEEECCCCCEEECCCCCHHHH-HHHHHCCCHHHHHHHHHHCCCEE T ss_conf 86777788------757689997387336854799687558880687789963899-99886243112799999749869 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE--EEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCC----C Q ss_conf 3111278644454433221122212234322001--124328544331331001333333530000002577888----6 Q gi|254780365|r 130 FALLLPDMIMSPLEGENCMANMIKLYEKEGANIL--AVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSS----T 203 (299) Q Consensus 130 flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi--~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~----~ 203 (299) .-+..-||++...... -|+-++...++.+. .+.+.++. .+-||+-. .++...|.+.-|-|..- . T Consensus 261 i~~~~VDN~L~kvaDP----~FiG~~~~~~~~~~~Kvv~K~~p~--E~VGvv~~----~~g~~~VvEYSEi~~~~ae~~~ 330 (499) T PTZ00339 261 VQIISIDNILAKVADP----EFIGLASSFPAHVVLNKCPKRSDD--ESVGVFCL----KDDEWQVVEYTEINERILENKD 330 (499) T ss_pred EEEEEECCCCCCCCCC----HHHHHHHHCCCCHHCEEEECCCCC--CCCCEEEE----ECCEEEEEEECCCCHHHHHCCC T ss_conf 9999865333246780----663487764861000278726898--76543898----8892689970366877762778 Q ss_pred CC-----CCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHH-HHHHH-HHCCCEEEEEEEEEEEEC Q ss_conf 73-----42330202002723311000122666767500378-99999-831984899981378643 Q gi|254780365|r 204 FI-----SNFFINGRYILHPDIFSILNDWKENEGKGEIQLTD-SMRKL-SERHDFLAYHFKGHTYDC 263 (299) Q Consensus 204 ~~-----Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD-~i~~l-~~~~~v~a~~~~g~w~Di 263 (299) +. -+.+|++..+|+-+++..+....... .=.+++.. =|+.. .+.++-.|+.++-.-||+ T Consensus 331 ~~~~l~~f~agnI~~H~fs~~FL~~v~~~~~~~-~l~~H~A~KKip~~~~~p~~pNgiKlE~FIFDv 396 (499) T PTZ00339 331 ATTNLKQFNCGNICSHIFSLDFLKKVAANRLYE-STPYHAARKKIPYIFLFNGPSPGYKLEAFIFDI 396 (499) T ss_pred CCCCEEECCCCCHHHEEECHHHHHHHHHHHHHH-CCCCCHHHCCCCEECCCCCCCCEEEEEEEEEEE T ss_conf 877601024344321363199999887765553-075502214777057999987658887764431 No 79 >TIGR03584 PseF pseudaminic acid CMP-transferase. The sequences in this family include the pfam02348 (cytidyltransferase) domain and are homologous to the NeuA protein responsible for the transfer of CMP to neuraminic acid. According to, this gene is responsible for the transfer of CMP to the structurally related sugar, pseudaminic acid which is observed as a component of sugar modifications of flagellin in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Probab=97.79 E-value=6.3e-05 Score=50.24 Aligned_cols=202 Identities=16% Similarity=0.196 Sum_probs=97.8 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCC-CEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCE Q ss_conf 99956888833432440888333568836279999999997899-88999813554441010011745444565418700 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGL-TDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKA 88 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi-~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~ 88 (299) |||||=|-.+|+. -|-+.+++|+|+++|.++.+.++++ ++|++-|... .|.+. ..+ T Consensus 2 aiIPaR~gSkri~------~KN~~~~~g~pLi~~~i~~~~~s~~~d~i~vSTD~~--~i~~i-----------a~~---- 58 (222) T TIGR03584 2 AIIPARGGSKRIP------RKNIKPFCGKPMIAYSIEAALNSGLFDEVVVSTDDE--EIAEV-----------AKS---- 58 (222) T ss_pred EEECCCCCCCCCC------CCCHHHHCCCCHHHHHHHHHHHCCCCCEEEECCCHH--HHHHH-----------HHC---- T ss_conf 8984377887678------601256589285999999998579965388727889--98755-----------521---- Q ss_pred EECCCCCCCCCCCEEEECCCCCC----CCCCCCCCCCCCCCC---C-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 00113441124857998433342----334665421101122---3-333111278644454433221122212234322 Q gi|254780365|r 89 ELTLLAESIPSIGNAVFTWQYER----KGLGHAVWCARNIIG---D-NPFALLLPDMIMSPLEGENCMANMIKLYEKEGA 160 (299) Q Consensus 89 ~~~~~~~~~~~~~~i~~v~Q~~p----~Gtg~Ai~~a~~~i~---d-e~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~ 160 (299) .+..+.|.+-++- -.+-+++..+..++. + +.++++.+-.-+. ....+...++.+.+.++ T Consensus 59 ----------~~~~~~~~Rp~~ls~d~~~~~~vi~~~l~~~~~~~~~d~i~~l~pTsP~r---~~~~I~~ai~~~~~~~~ 125 (222) T TIGR03584 59 ----------YGASVPFMRPAELSDDFTGTAPVVAHAIERLQLQKQPDHACCIYATAPLL---QAKILKEAFELLKQPNK 125 (222) T ss_pred ----------CCCCCEEECCHHHCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCC---CHHHHHHHHHHHHHCCC T ss_conf ----------69864251865555886214599999999998658999899954897615---89999999999972699 Q ss_pred -CEEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCC---CCCCCCCCCE-EECHHHEECHHHHHHHCCCCCCCCCCC Q ss_conf -00112432854433133100133333353000000257---7888673423-302020027233110001226667675 Q gi|254780365|r 161 -NILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEK---PDSSTFISNF-FINGRYILHPDIFSILNDWKENEGKGE 235 (299) Q Consensus 161 -~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEK---P~~~~~~Snl-~~~GiYi~~~~i~~~L~~~~~~~~~gE 235 (299) +++.+.+........| .+ +.. + .+.-+..+ |.....+..+ .+.++|+++.+.|- +. +. T Consensus 126 ds~~sv~~~~~~~~~~~-~~--~~~--~---~~~~~~~~~~~~r~Qdl~~~y~~~G~~y~~~~~~~~--~~-------~~ 188 (222) T TIGR03584 126 HFVFTVTEFAFPIQRAF-SL--KEN--G---GVTMFFPEHYNTRSQDLEEAYHDAGQFYWGKSQAWL--ES-------GP 188 (222) T ss_pred CCCEEEECCCCCHHHHE-EE--CCC--C---CEEECCHHHHHHHHHHCCCCEEECCEEEEEEHHHHH--HC-------CC T ss_conf 85506533776858937-98--679--9---463347455202444244337544789998899998--44-------97 Q ss_pred CCHHHHHHHHHHCCCEEEEEEEE-EEEECCCHHHHHHHHH Q ss_conf 00378999998319848999813-7864399889999999 Q gi|254780365|r 236 IQLTDSMRKLSERHDFLAYHFKG-HTYDCGSKKGFVLANI 274 (299) Q Consensus 236 ~~ltD~i~~l~~~~~v~a~~~~g-~w~DiG~~~~yl~A~~ 274 (299) + -.++.+.|.++. +.+||-++++|.-|.. T Consensus 189 ~----------~~~~~~~~~~~~~~siDID~~~D~~~Ae~ 218 (222) T TIGR03584 189 I----------FSPHSIPIILPRHLVQDIDTLEDWERAEL 218 (222) T ss_pred C----------CCCCEEEEEECCCCCCCCCCHHHHHHHHH T ss_conf 0----------28987999989986289999999999999 No 80 >cd04193 UDPGlcNAc_PPase UDPGlcNAc pyrophosphorylase catalayzes the synthesis of UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc) pyrophosphorylase (UAP) (also named GlcNAc1P uridyltransferase), catalyzes the reversible conversion of UTP and GlcNAc1 to PPi and UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc), the activated form of GlcNAc, is a key precursor of N- and O-linked glycosylations. It is essential for the synthesis of chitin (a major component of the fungal cell wall) and of the glycosylphosphatidylinositol (GPI) linker which anchors a variety of cell surface proteins to the plasma membrane. In bacteria, UDPGlcNAc represents an essential precursor for both peptidoglycan and lipopolysaccharide biosynthesis. Human UAP has two isoforms, resulting from alternative splicing of a single gene and differing by the presence or absence of 17 amino acids. UDPGlcNAc pyrophosphorylase shares significant sequence and structure conservation with UDPglucose pyrophosphorylase. Probab=97.76 E-value=4e-05 Score=51.44 Aligned_cols=192 Identities=17% Similarity=0.179 Sum_probs=90.4 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH----CC------CC-EEEEECCC-CCCCCHHHCCC Q ss_conf 8999568888334324408883335688---36279999999997----89------98-89998135-54441010011 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIV---DRPVIQYVIEEALE----AG------LT-DFVFVTGR-GKGLIKDYFDI 73 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~---~kpiI~~~i~~l~~----~G------i~-~i~iv~~~-~k~~I~~~f~~ 73 (299) -+|+||||.||||. ...||-|+||. ++++++...+.+.. +| +. -++|.++. ..+...+||.. T Consensus 17 avv~laGG~GTrlG---~~~pKg~~~i~~~~~~sl~~l~~e~i~~l~~~~~~~~~~~~~ip~~iMtS~~T~~~T~~~l~~ 93 (323) T cd04193 17 AVLLLAGGQGTRLG---FDGPKGMFPVGLPSKKSLFQLQAERILKLQELAGEASGKKVPIPWYIMTSEATHEETRKFFKE 93 (323) T ss_pred EEEEECCCCCCCCC---CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHH T ss_conf 99995788745568---999716257268899809999999999999998774388885169997885345899999986 Q ss_pred CH--HHHHHHHHCCCCEEECCC-CC---CCCCCCEEE----ECCCCCCCCCCCCCCCC------CCCC--CCCCCCCCCC Q ss_conf 74--544456541870000113-44---112485799----84333423346654211------0112--2333311127 Q gi|254780365|r 74 QF--ELEQSLRKRNKKAELTLL-AE---SIPSIGNAV----FTWQYERKGLGHAVWCA------RNII--GDNPFALLLP 135 (299) Q Consensus 74 ~~--~~~~~l~~~~~~~~~~~~-~~---~~~~~~~i~----~v~Q~~p~Gtg~Ai~~a------~~~i--~de~flv~~g 135 (299) +. .+.. ..+... +. .+.....+. .-....|-|.|+-.... .++. |-+...+.+. T Consensus 94 ~~~fGl~~--------~~i~~f~Q~~~P~~~~~g~~~l~~~~~~~~~P~GhG~i~~aL~~sG~l~~l~~~Gi~yi~v~~v 165 (323) T cd04193 94 NNYFGLDP--------EQVHFFQQGMLPCVDFDGKILLEEKGKIAMAPNGNGGLYKALQTAGILEDMKKRGIKYIHVYSV 165 (323) T ss_pred CCCCCCCH--------HHEEEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHCCCEEEEEEEC T ss_conf 76468885--------5647871687314868987156899751427999717999999878488999669889999965 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEE--EEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCC-C---CCC--- Q ss_conf 8644454433221122212234322001--124328544331331001333333530000002577888-6---734--- Q gi|254780365|r 136 DMIMSPLEGENCMANMIKLYEKEGANIL--AVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSS-T---FIS--- 206 (299) Q Consensus 136 D~i~~~~~~~~~l~~li~~~~~~~~~vi--~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~-~---~~S--- 206 (299) ||.+...... .++-++...++.+. .+.+.+++ .+-|++... ++...+.+.-|=|... . ..+ T Consensus 166 DN~L~~~~Dp----~~lG~~~~~~~~~~~kvv~K~~~~--ekvG~l~~~----~gk~~vvEYsel~~~~~~~~~~~g~l~ 235 (323) T cd04193 166 DNILVKVADP----VFIGFCISKGADVGAKVVRKRYPT--EKVGVVVLV----DGKPQVVEYSEISDELAEKRDADGELQ 235 (323) T ss_pred CCCCCCCCCH----HHHHHHHHCCCCEEEEEEECCCCC--CCEEEEEEE----CCEEEEEEECCCCHHHHHHHCCCCCEE T ss_conf 7443232488----999999874996378998888899--831089998----990699984258988984018766165 Q ss_pred -CEEECHHHEECHHHH Q ss_conf -233020200272331 Q gi|254780365|r 207 -NFFINGRYILHPDIF 221 (299) Q Consensus 207 -nl~~~GiYi~~~~i~ 221 (299) +..|++.++|+-+++ T Consensus 236 f~~gNi~~~~fsl~fl 251 (323) T cd04193 236 YNAGNIANHFFSLDFL 251 (323) T ss_pred ECCCCHHHHHEEHHHH T ss_conf 1455677753139999 No 81 >COG1083 NeuA CMP-N-acetylneuraminic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=97.72 E-value=0.0003 Score=46.07 Aligned_cols=205 Identities=17% Similarity=0.209 Sum_probs=106.7 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCC Q ss_conf 58999568888334324408883335688362799999999978998899981355444101001174544456541870 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKK 87 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~ 87 (299) .-|+|+|-|-..|.. -|-+-+++|||+|.|+|+.+.++++=+=++++.-+. .|.+. .. T Consensus 4 ~iAiIpAR~gSKgI~------~KNi~~~~gkpLi~~~I~aA~ns~~fd~VviSsDs~-~Il~~-------------A~-- 61 (228) T COG1083 4 NIAIIPARGGSKGIK------NKNIRKFGGKPLIGYTIEAALNSKLFDKVVISSDSE-EILEE-------------AK-- 61 (228) T ss_pred EEEEEECCCCCCCCC------CCCHHHHCCCCHHHHHHHHHHCCCCCCEEEECCCCH-HHHHH-------------HH-- T ss_conf 489976267887687------633577579615899999986588522699847958-99999-------------98-- Q ss_pred EEECCCCCCCCCCCEEEECCCCCCCCCC-----CCCCCCCCCCC-CCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0001134411248579984333423346-----65421101122-333-3111278644454433221122212234322 Q gi|254780365|r 88 AELTLLAESIPSIGNAVFTWQYERKGLG-----HAVWCARNIIG-DNP-FALLLPDMIMSPLEGENCMANMIKLYEKEGA 160 (299) Q Consensus 88 ~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg-----~Ai~~a~~~i~-de~-flv~~gD~i~~~~~~~~~l~~li~~~~~~~~ 160 (299) ..+....+.+ ++++-.. ++++++.+..+ ++. .+++.+-. +.-....+++.++.+.+.+. T Consensus 62 ----------~ygak~~~~R-p~~LA~D~ast~~~~lh~le~~~~~~~~~~lLq~Ts---PLl~~~~ik~A~e~f~~~~~ 127 (228) T COG1083 62 ----------KYGAKVFLKR-PKELASDRASTIDAALHALESFNIDEDTLILLQPTS---PLLTSLHIKEAFEKFLNNQY 127 (228) T ss_pred ----------HHCCCCCCCC-CHHHCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCC---CCCCHHHHHHHHHHHHCCCC T ss_conf ----------7276110248-745416721678999999987442457068840685---00014579999999965887 Q ss_pred -CEEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCC----CCCE-EECHHHEECHHHHHHHCCCCCCCCCC Q ss_conf -00112432854433133100133333353000000257788867----3423-30202002723311000122666767 Q gi|254780365|r 161 -NILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTF----ISNF-FINGRYILHPDIFSILNDWKENEGKG 234 (299) Q Consensus 161 -~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~----~Snl-~~~GiYi~~~~i~~~L~~~~~~~~~g 234 (299) +++++.++.......|- .+. +.+..+-|-|..... |.-+ .|..+|+++.+.|- ++.. T Consensus 128 ~sl~sa~e~e~~p~k~f~---~~~------~~~~~~~~~~~~~~rrQ~Lpk~Y~~NgaiYi~~~~~l~--e~~~------ 190 (228) T COG1083 128 DSLFSAVECEHHPYKAFS---LNN------GEVKPVNEDPDFETRRQDLPKAYRENGAIYINKKDALL--ENDC------ 190 (228) T ss_pred CCEEEEEECCCCHHHHHH---HCC------CCEEECCCCCCCCCCCCCCHHHHHHCCCEEEEHHHHHH--HCCC------ T ss_conf 635887532234688887---507------84410114776554211051656536767975279874--1575------ Q ss_pred CCCHHHHHHHHHHCCCEEEEEEEE-EEEECCCHHHHHHHHHHH Q ss_conf 500378999998319848999813-786439988999999998 Q gi|254780365|r 235 EIQLTDSMRKLSERHDFLAYHFKG-HTYDCGSKKGFVLANIAF 276 (299) Q Consensus 235 E~~ltD~i~~l~~~~~v~a~~~~g-~w~DiG~~~~yl~A~~~~ 276 (299) ... .....|.++. ..+||-+..+|-.|..-+ T Consensus 191 ---------~f~--~~~~~y~m~~~~~~DID~~~Dl~iae~l~ 222 (228) T COG1083 191 ---------FFI--PNTILYEMPEDESIDIDTELDLEIAENLI 222 (228) T ss_pred ---------EEC--CCCEEEECCCCCCCCCCCHHHHHHHHHHH T ss_conf ---------415--89668981712000566087699999986 No 82 >COG4284 UDP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=97.60 E-value=0.00032 Score=45.89 Aligned_cols=192 Identities=13% Similarity=0.142 Sum_probs=92.5 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHHHHH----HHCCCC-EEEEECCCCCCCCHHHCCCCHHHHHHHH Q ss_conf 8999568888334324408883335688-362799999999----978998-8999813554441010011745444565 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIV-DRPVIQYVIEEA----LEAGLT-DFVFVTGRGKGLIKDYFDIQFELEQSLR 82 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~-~kpiI~~~i~~l----~~~Gi~-~i~iv~~~~k~~I~~~f~~~~~~~~~l~ 82 (299) -+|.||||.||||. ..-||.+++|. |+++++..++.. .+++++ ..+|.++...++-..||.... +. T Consensus 107 Avl~LaGGqGtrlG---~~gPKgl~~V~~gks~~dl~~~qIk~ln~~~~~~vP~~iMtS~nt~~t~s~f~~~~-----Y~ 178 (472) T COG4284 107 AVLKLAGGQGTRLG---CDGPKGLFEVKDGKSLFDLQAEQIKYLNRQYNVDVPLYIMTSLNTEETDSYFKSND-----YF 178 (472) T ss_pred EEEEECCCCCCCCC---CCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHH-----HC T ss_conf 89994488665335---67875268945898499999999999999738997779882678277777776622-----03 Q ss_pred HCCCCEEECCCCCCCCCCCEEE---EC------CCCCCCCCCCCCCCCCC-------CC--CCCCCCCCCCCCCCCCCCC Q ss_conf 4187000011344112485799---84------33342334665421101-------12--2333311127864445443 Q gi|254780365|r 83 KRNKKAELTLLAESIPSIGNAV---FT------WQYERKGLGHAVWCARN-------II--GDNPFALLLPDMIMSPLEG 144 (299) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~i~---~v------~Q~~p~Gtg~Ai~~a~~-------~i--~de~flv~~gD~i~~~~~~ 144 (299) ...|.+.....+...+.....+ +. ...-|-|.|+ +..|.. ++ |-|...|.+-||+-... . T Consensus 179 ~~~k~~I~fF~Q~~~P~~~~~sg~~~~~~~~~~~~~~P~GnG~-lf~aL~~SG~le~l~~~G~e~lfV~nIDNL~~~v-D 256 (472) T COG4284 179 GLDKEDIFFFVQSLFPRLLSDSGLPFLESDDSNLAWYPPGNGD-LFKALKSSGILEKLIAQGIEYLFVSNIDNLGATV-D 256 (472) T ss_pred CCCHHHEEEEECCCCCEEECCCCCCCCCCCCCCCCCCCCCCCC-HHHHHHHCCHHHHHHHCCCEEEEEECCCCCCCCC-C T ss_conf 7777783777247744356566852214589631017899860-8999875136999985683699995143123445-8 Q ss_pred CCCCCCCCCCCCCCCCCEEEEEECCHHH-CCCCCCEE-CCCCCCCCCCEEEEECCCCCC--CCCCC------CEEE-CHH Q ss_conf 3221122212234322001124328544-33133100-133333353000000257788--86734------2330-202 Q gi|254780365|r 145 ENCMANMIKLYEKEGANILAVSECDPQL-SCKYGMVQ-VGKAIDHQVFHISDMIEKPDS--STFIS------NFFI-NGR 213 (299) Q Consensus 145 ~~~l~~li~~~~~~~~~vi~~~~v~~~~-~~~yGvi~-~~~~~~~~~~~I~~~vEKP~~--~~~~S------nl~~-~Gi 213 (299) .. ++.++..++..+ +...+++.. ...=|++. .++. .++.+.-|=|.. +...| ..+| .++ T Consensus 257 ~~----~lg~~~~~~~e~-~~e~t~Kt~a~ekvG~Lv~~~g~-----~rllEysev~~~~~~~~~s~~~~~~~n~Nni~l 326 (472) T COG4284 257 LK----FLGFMAETNYEY-LMETTDKTKADEKVGILVTYDGK-----LRLLEYSEVPNEHREEFTSDGKLKYFNTNNIWL 326 (472) T ss_pred HH----HHHHHHHCCCCE-EEEEEECCCCCCCCEEEEEECCC-----EEEEEEECCCHHHHHHHCCCCCEEEECCCCCEE T ss_conf 99----999998627532-57875224566641179986795-----678998407846743204622123542642220 Q ss_pred HEECHHH Q ss_conf 0027233 Q gi|254780365|r 214 YILHPDI 220 (299) Q Consensus 214 Yi~~~~i 220 (299) |++.-+. T Consensus 327 ~~~~~~~ 333 (472) T COG4284 327 HLFSVKF 333 (472) T ss_pred EHHHHHH T ss_conf 2667788 No 83 >cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als Probab=97.52 E-value=0.00041 Score=45.22 Aligned_cols=83 Identities=13% Similarity=0.102 Sum_probs=59.4 Q ss_pred EEHHHHHHHHHHHCC--CCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCCCCCCCC Q ss_conf 627999999999789--988999813554441010011745444565418700001134411248579984333423346 Q gi|254780365|r 38 RPVIQYVIEEALEAG--LTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQYERKGLG 115 (299) Q Consensus 38 kpiI~~~i~~l~~~G--i~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg 115 (299) .+.|.+.++.+.+++ ..+++|+.+...+...+.+... ......+.+..++...|++ T Consensus 9 ~~~l~~~l~si~~~~~~~~eiiiid~~s~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~g~~ 66 (156) T cd00761 9 EPYLERCLESLLAQTYPNFEVIVVDDGSTDGTLEILEEY----------------------AKKDPRVIRVINEENQGLA 66 (156) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHH----------------------CCCCCEEEEECCCCCCCHH T ss_conf 899999999998289989799999899980022100000----------------------1456638995056787846 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 654211011223333111278644454 Q gi|254780365|r 116 HAVWCARNIIGDNPFALLLPDMIMSPL 142 (299) Q Consensus 116 ~Ai~~a~~~i~de~flv~~gD~i~~~~ 142 (299) +|+..+.+..+.+.++++.+|+++... T Consensus 67 ~~~n~~~~~~~~~~v~~~d~D~~~~~~ 93 (156) T cd00761 67 AARNAGLKAARGEYILFLDADDLLLPD 93 (156) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCHH T ss_conf 887788874899999999999857874 No 84 >cd04180 UGPase_euk_like Eukaryotic UGPase-like includes UDPase and UDPGlcNAc pyrophosphorylase enzymes. This family includes UDP-Glucose Pyrophosphorylase (UDPase) and UDPGlcNAc pyrophosphorylase enzymes. The two enzymes share significant sequence and structure similarity. UDP-Glucose Pyrophosphorylase catalyzes a reversible production of UDP-Glucose and pyrophosphate (PPi) from Glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids , glycoproteins , and proteoglycans . UDP-N-acetylglucosamine (UDPGlcNAc) pyrophosphorylase (UAP) (also named GlcNAc1P uridyltransferase), catalyzes the reversible conversion of UTP and GlcNAc1P from PPi and UDPGlcNAc, which is a key precursor of N- and O-linked glycosylations and is essential for the synthesis of chitin (a major component of the fungal cell wall) and of the glycosylphosphatidylinositol (GPI) linker anchoring a variety o Probab=97.43 E-value=0.00018 Score=47.42 Aligned_cols=60 Identities=22% Similarity=0.251 Sum_probs=42.0 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH------CCCC-EEEEECCC-CCCCCHHHCC Q ss_conf 999568888334324408883335688---36279999999997------8998-89998135-5444101001 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIV---DRPVIQYVIEEALE------AGLT-DFVFVTGR-GKGLIKDYFD 72 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~---~kpiI~~~i~~l~~------~Gi~-~i~iv~~~-~k~~I~~~f~ 72 (299) +|+||||.||||. ..-||-|+||+ +++++++.++++.. .|.. -++|.++. ..+...+||. T Consensus 3 vllLaGGlGTrLG---~~~pKg~~~v~~~~~~t~~~l~~~~i~~l~~~~~~~~~iPl~iMtS~~T~~~T~~~~~ 73 (266) T cd04180 3 VVLLAGGLGTRLG---KDGPKSSTDVGLPSGQCFLQLIGEKILTLQEIDLYSCKIPEQLMNSKYTHEKTQCYFE 73 (266) T ss_pred EEEECCCCCCCCC---CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHH T ss_conf 9996687623469---9997370672589998599999999999999861799831999668004479999999 No 85 >cd06424 UGGPase UGGPase catalyzes the synthesis of UDP-Glucose/UDP-Galactose. UGGPase: UDP-Galactose/Glucose Pyrophosphorylase catalyzes the reversible production of UDP-Glucose/UDP-Galactose and pyrophosphate (PPi) from Glucose-1-phosphate/Galactose-1-phosphate and UTP. Its dual substrate specificity distinguishes it from the single substrate enzyme UDP-glucose pyrophosphorylase. It may play a key role in the galactose metabolism in raffinose oligosaccharide (RFO) metabolizing plants. RFO raffinose is a major photoassimilate and is a galactosylderivative of sucrose (Suc) containing a galactose (Gal) moiety. Upon arriving at the sink tissue, the Gal moieties of the RFOs are initially removed by alpha-galactosidase and then are phosphorylated to Gal-1-P. Gal-1-P is converted to UDP-Gal. The UDP-Gal is further metabolized to UDP-Glc via an epimerase reaction. The UDP-Glc can be directly utilized in cell wall metabolism or in Suc synthesis. However, for the Suc synthesis UDP-Glc must be f Probab=97.37 E-value=0.00027 Score=46.29 Aligned_cols=61 Identities=20% Similarity=0.198 Sum_probs=43.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH----CC----C-CEEEEECCC-CCCCCHHHCCC Q ss_conf 999568888334324408883335688---36279999999997----89----9-889998135-54441010011 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIV---DRPVIQYVIEEALE----AG----L-TDFVFVTGR-GKGLIKDYFDI 73 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~---~kpiI~~~i~~l~~----~G----i-~~i~iv~~~-~k~~I~~~f~~ 73 (299) .|+||||.||||. ..-||-|+||. +++++++.++.+.. ++ . --++|.|+. ..+...+||.. T Consensus 3 vvllaGG~GTRLG---~~~pK~~l~v~~~~~~tl~~l~~~~i~~~~~~~~~~~~~~iP~~iMtS~~t~~~T~~~~~~ 76 (315) T cd06424 3 FVLVAGGLGERLG---YSGIKIGLPVELTTNTTYLQYYLNYIRAFQEASKKGEKMEIPFVIMTSDDTHSKTLKLLEE 76 (315) T ss_pred EEEECCCCCCCCC---CCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHH T ss_conf 9997787843248---9988556012788998099999999999999860578988657997898736999999986 No 86 >cd00897 UGPase_euk Eukaryotic UGPase catalyses the synthesis of UDP-Glucose. UGPase (UDP-Glucose Pyrophosphorylase) catalyzes the reversible production of UDP-Glucose and pyrophosphate (PPi) from Glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids, glycoproteins, and proteoglycans. UGPase is found in both prokaryotes and eukaryotes. Interestingly, while the prokaryotic and eukaryotic forms of UGPase catalyze the same reaction, they share low sequence similarity. This family consists of mainly eukaryotic UTP-glucose-1-phosphate uridylyltransferases. Probab=96.62 E-value=0.003 Score=39.93 Aligned_cols=60 Identities=13% Similarity=0.203 Sum_probs=41.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHHHHH----HHCCCCE-EEEECCCC-CCCCHHHC Q ss_conf 8999568888334324408883335688-362799999999----9789988-99981355-44410100 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIV-DRPVIQYVIEEA----LEAGLTD-FVFVTGRG-KGLIKDYF 71 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~-~kpiI~~~i~~l----~~~Gi~~-i~iv~~~~-k~~I~~~f 71 (299) -+|.||||.||||. ..-||.|+||. +++++|.+++++ ..+|..= .+|-++.. .+...+|| T Consensus 5 avlkLnGGlGTrlG---~~~pK~~i~V~~~~tfldl~~~qi~~l~~~yg~~vPl~iMtS~~T~~~T~~~l 71 (300) T cd00897 5 VVLKLNGGLGTSMG---CTGPKSLIEVRDGKTFLDLTVQQIEHLNKTYGVDVPLVLMNSFNTDEDTKKIL 71 (300) T ss_pred EEEEECCCCCCCCC---CCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHH T ss_conf 99992698745568---89981778828998099999999999999719983189978864778999999 No 87 >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen. Probab=96.10 E-value=0.063 Score=31.78 Aligned_cols=190 Identities=12% Similarity=0.071 Sum_probs=83.2 Q ss_pred EHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCC Q ss_conf 27999999999789988999813554441010011745444565418700001134411248579984333423346654 Q gi|254780365|r 39 PVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQYERKGLGHAV 118 (299) Q Consensus 39 piI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai 118 (299) ..+.-.|+.+.. .+.+|+||=|...+...+ .++. ...++.++..++.+|.|.|. T Consensus 11 ~~l~~~L~sl~~-q~~eIiVVDN~S~d~~~~----------~~~~---------------~~~~v~~i~~~~N~G~a~g~ 64 (237) T cd02526 11 SKLKELLAALAE-QVDKVVVVDNSSGNDIEL----------RLRL---------------NSEKIELIHLGENLGIAKAL 64 (237) T ss_pred HHHHHHHHHHHC-CCCEEEEEECCCCHHHHH----------HHHH---------------CCCCEEEEECCCCCCHHHHC T ss_conf 999999997663-799899996988902899----------9986---------------19986999899977858872 Q ss_pred CCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CC-CCCEEEEEECCHHHCCCCCCEECCCCCCCCCCE Q ss_conf 211011223---33311127864445443322112221223---43-220011243285443313310013333335300 Q gi|254780365|r 119 WCARNIIGD---NPFALLLPDMIMSPLEGENCMANMIKLYE---KE-GANILAVSECDPQLSCKYGMVQVGKAIDHQVFH 191 (299) Q Consensus 119 ~~a~~~i~d---e~flv~~gD~i~~~~~~~~~l~~li~~~~---~~-~~~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~ 191 (299) -.+...... |-++++.+|.++.+. .++.|++... +. ...+++..-.+......+........ . T Consensus 65 N~Gi~~a~~~~~d~i~~ln~D~~~~~~----~l~~l~~~~~~~~~~~~vg~~~p~~~d~~~~~~~~~~~~~~~------~ 134 (237) T cd02526 65 NIGIKAALENGADYVLLFDQDSVPPPD----MVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGY------K 134 (237) T ss_pred CCCCHHHHCCCCCEEEEECCCCCCCHH----HHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCCCCCCC------C T ss_conf 743033411797789993575655905----999999999852227988999717986899800454211253------3 Q ss_pred EEEECCCCCCCCCCCCE-EECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCC-CEEEEEEEEEEEECCCHH Q ss_conf 00002577888673423-30202002723311000122666767500378999998319-848999813786439988 Q gi|254780365|r 192 ISDMIEKPDSSTFISNF-FINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSERH-DFLAYHFKGHTYDCGSKK 267 (299) Q Consensus 192 I~~~vEKP~~~~~~Snl-~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~~-~v~a~~~~g~w~DiG~~~ 267 (299) ...-...+.... +... ...| .+++.++|+.+.-..+...- .++=+|.--.+.+.| +++.++-.-.|+..|+.. T Consensus 135 ~~~~~~~~~~~~-~~~~~~~sg-~lir~~~~~~vG~fde~fF~-y~ED~Dl~~R~~~~G~~i~~~p~a~v~H~~G~~~ 209 (237) T cd02526 135 LRIQKEGEEGLK-EVDFLITSG-SLISLEALEKVGGFDEDLFI-DYVDTEWCLRARSKGYKIYVVPDAVLKHELGDKR 209 (237) T ss_pred CCCCCCCCCCCE-ECCEEEECC-EEEEHHHHHHHCCCCHHHCC-CCCHHHHHHHHHHCCCCEEEECCEEEEECCCCCC T ss_conf 223544247844-604303014-37789999985799689779-3719999999998599799989889997899845 No 88 >TIGR00466 kdsB 3-deoxy-D-manno-octulosonate cytidylyltransferase; InterPro: IPR004528 3-Deoxy-D-manno-octulosonate cytidylyltransferase (2.7.7.38 from EC) activates KDO, a required 8-carbon sugar, for incorporation into bacterial lipopolysaccharide in Gram negative bacteria. It acts as a homodimer and catalyses the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate and pyrophosphate.; GO: 0008690 3-deoxy-manno-octulosonate cytidylyltransferase activity, 0009103 lipopolysaccharide biosynthetic process, 0005737 cytoplasm. Probab=95.56 E-value=0.02 Score=34.89 Aligned_cols=48 Identities=25% Similarity=0.323 Sum_probs=42.3 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 899956888833432440888333568836279999999997899889998135 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGR 62 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~ 62 (299) -.||||==..|||- -|||.-|.|+|||.|.++++.++|++++++-+.. T Consensus 4 ~vIIPAR~~SsRl~------~K~L~DI~G~PMi~~v~~~A~~s~~~~~ivA~D~ 51 (246) T TIGR00466 4 VVIIPARLASSRLP------GKPLEDIKGKPMIVHVLEKANESGADRVIVATDD 51 (246) T ss_pred EEEECCCCCCCCCC------CCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCC T ss_conf 89853653113589------8720355789368899999752363568997487 No 89 >COG1861 SpsF Spore coat polysaccharide biosynthesis protein F, CMP-KDO synthetase homolog [Cell envelope biogenesis, outer membrane] Probab=94.96 E-value=0.041 Score=32.92 Aligned_cols=49 Identities=29% Similarity=0.447 Sum_probs=37.9 Q ss_pred EEEEECCCCC-CCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEECCCC Q ss_conf 8999568888-3343244088833356883627999999999789-98899981355 Q gi|254780365|r 9 KAVFPIAGLG-MRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAG-LTDFVFVTGRG 63 (299) Q Consensus 9 KavIlAaG~G-tRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~G-i~~i~iv~~~~ 63 (299) -++|+.|-.| ||| | -|.|+|+++.|+|++.++.+.++- ++++++.|+-. T Consensus 4 I~~IiQARmgStRL-p-----gKvLlpL~~~pmI~~~lervrks~~~d~ivvATS~~ 54 (241) T COG1861 4 ILVIIQARMGSTRL-P-----GKVLLPLGGEPMIEYQLERVRKSKDLDKIVVATSDK 54 (241) T ss_pred EEEEEEECCCCCCC-C-----CCHHHHCCCCCHHHHHHHHHHCCCCCCCEEEEECCC T ss_conf 79885300267657-8-----613333078741999999986043324358973477 No 90 >pfam01704 UDPGP UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uridylyltransferase catalyses the interconversion of MgUTP + glucose-1-phosphate and UDP-glucose + MgPPi. UDP-glucose is an important intermediate in mammalian carbohydrate interconversion involved in various metabolic roles depending on tissue type. In Dictyostelium (slime mold) mutants in this enzyme abort the development cycle. Also within the family is UDP-N-acetylglucosamine or AGX1 and two hypothetical proteins from Borrelia burgdorferi the lyme disease spirochaete. Probab=93.61 E-value=0.12 Score=29.95 Aligned_cols=51 Identities=12% Similarity=0.178 Sum_probs=36.4 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHHHHH----HHCCCCE-EEEECCC Q ss_conf 8999568888334324408883335688-362799999999----9789988-9998135 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIV-DRPVIQYVIEEA----LEAGLTD-FVFVTGR 62 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~-~kpiI~~~i~~l----~~~Gi~~-i~iv~~~ 62 (299) -.|-|+||.||+|. ..-||.|++|. |++.+|.++.++ ...|+.= .++..+. T Consensus 55 aVlKLNGGLGTsMG---~~~pKSli~Vr~g~TFLDl~v~Qi~~ln~~y~~~vPl~lMnSf 111 (416) T pfam01704 55 AVLKLNGGLGTSMG---CKGPKSMIEVRNGNTFLDLIVQQIEQLNKRYNCDVPLLLMNSF 111 (416) T ss_pred EEEECCCCCCCCCC---CCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 99982698777668---8888557797899818999999999998754998666996686 No 91 >pfam00535 Glycos_transf_2 Glycosyl transferase family 2. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids. Probab=91.94 E-value=0.26 Score=28.02 Aligned_cols=105 Identities=15% Similarity=0.172 Sum_probs=62.7 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCC Q ss_conf 3568836-279999999997899--8899981355444101001174544456541870000113441124857998433 Q gi|254780365|r 32 MLAIVDR-PVIQYVIEEALEAGL--TDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQ 108 (299) Q Consensus 32 llpi~~k-piI~~~i~~l~~~Gi--~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q 108 (299) .+|..|. ..|...++.+.++.. -||+||-+...+.-.+ .+.+.......+.++.. T Consensus 3 iip~yN~~~~l~~~l~sl~~q~~~~~eiiivDd~S~d~t~~----------------------~~~~~~~~~~~v~~~~~ 60 (168) T pfam00535 3 IIPTYNEEKYLEECLESLLNQTYPNFEIIVVDDGSTDGTVE----------------------IAEEYAKKDPRIRVIRL 60 (168) T ss_pred EEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHC----------------------EEEEECCCCCCCEEEEE T ss_conf 99961888999999999973879985999998999810100----------------------12100134676002442 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 342334665421101122333311127864445443322112221223432200 Q gi|254780365|r 109 YERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI 162 (299) Q Consensus 109 ~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v 162 (299) +++.|.|+|+..+......+-++++-+|+++... .++.|++..++.+..+ T Consensus 61 ~~~~g~~~a~n~g~~~a~~~~v~~lD~D~~~~~~----~l~~~~~~~~~~~~~~ 110 (168) T pfam00535 61 EENLGKAAARNAGLKLATGDYILFLDADDEVAPD----WLEKLVELLEKNGADI 110 (168) T ss_pred ECCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHHCCCCEE T ss_conf 0245746888899996799859998589986876----9999999998399769 No 92 >KOG2388 consensus Probab=91.01 E-value=0.29 Score=27.72 Aligned_cols=36 Identities=25% Similarity=0.236 Sum_probs=28.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECC---EEHHHHHHHH Q ss_conf 89995688883343244088833356883---6279999999 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKVIPKEMLAIVD---RPVIQYVIEE 47 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~~pKpllpi~~---kpiI~~~i~~ 47 (299) -++++|||.||||. ...||-+.|++- +.+.++..+. T Consensus 99 a~~llaGgqgtRLg---~~~pkg~~~~G~~~~~slf~~qae~ 137 (477) T KOG2388 99 AVVLLAGGQGTRLG---SSGPKGCYPIGLPSGKSLFQIQAER 137 (477) T ss_pred EEEEECCCCEEEEC---CCCCCCEEECCCCCCCCHHHHHHHH T ss_conf 38995167403113---5787533664775654065534999 No 93 >cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate. Probab=87.91 E-value=2.1 Score=22.35 Aligned_cols=145 Identities=8% Similarity=-0.047 Sum_probs=65.3 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEEECCHHH-CCCCCCE Q ss_conf 7998433342334665421101122333311127864445443322112221223432200-1124328544-3313310 Q gi|254780365|r 102 NAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI-LAVSECDPQL-SCKYGMV 179 (299) Q Consensus 102 ~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v-i~~~~v~~~~-~~~yGvi 179 (299) .+..+.+++..|.|+|+..+......+.++++-+|.-+... .+.+|++.-...+..+ ++........ ....... T Consensus 58 ~~~vi~~~~n~G~g~A~~~G~~~a~~d~i~~~DaD~~~~~~----~i~~l~~~~~~~~~d~V~GsR~~~~~~~~~~~~~~ 133 (211) T cd04188 58 LIRVLTLPKNRGKGGAVRAGMLAARGDYILFADADLATPFE----ELEKLEEALKTSGYDIAIGSRAHLASAAVVKRSWL 133 (211) T ss_pred EEEEEECCCCCCCHHHHHHHHHHHHCCEEEEECCCCCCHHH----HHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCHH T ss_conf 59999658888804899999996004855677689740788----99999999984898199997556799863356388 Q ss_pred E-CCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHH-CCCCCCCCCCCCCHHHHHHHHHHCC-CEEEEEE Q ss_conf 0-13333335300000025778886734233020200272331100-0122666767500378999998319-8489998 Q gi|254780365|r 180 Q-VGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSIL-NDWKENEGKGEIQLTDSMRKLSERH-DFLAYHF 256 (299) Q Consensus 180 ~-~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L-~~~~~~~~~gE~~ltD~i~~l~~~~-~v~a~~~ 256 (299) . +...... .+.++.=+....+ ..+|+-+|+.++++.+ ..... .+. +++ ++.+-.+.+.| ++.-+++ T Consensus 134 r~~~s~~~~---~~~~~l~~~~i~D-----~~~gfr~~~~~~l~~i~~~~~~-~~f-~~~-~El~~~~~~~g~ki~EvPi 202 (211) T cd04188 134 RNLLGRGFN---FLVRLLLGLGIKD-----TQCGFKLFTRDAARRLFPRLHL-ERW-AFD-VELLVLARRLGYPIEEVPV 202 (211) T ss_pred HHHHHHHHH---HHHHHHHCCCCCC-----CCCCEEEEEHHHHHHHHHHCCC-CCC-CCH-HHHHHHHHHCCCCEEEEEE T ss_conf 999999999---9999983898788-----8867465779999998755646-885-015-9999999986991999900 Q ss_pred EEEEEEC Q ss_conf 1378643 Q gi|254780365|r 257 KGHTYDC 263 (299) Q Consensus 257 ~g~w~Di 263 (299) .|.|- T Consensus 203 --~~~~r 207 (211) T cd04188 203 --RWVEI 207 (211) T ss_pred --EEEEC T ss_conf --89989 No 94 >PRK00576 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=87.72 E-value=1 Score=24.25 Aligned_cols=35 Identities=11% Similarity=0.155 Sum_probs=28.5 Q ss_pred CCCCEEEECC--EEHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 8833356883--62799999999978998899981355 Q gi|254780365|r 28 IPKEMLAIVD--RPVIQYVIEEALEAGLTDFVFVTGRG 63 (299) Q Consensus 28 ~pKpllpi~~--kpiI~~~i~~l~~~Gi~~i~iv~~~~ 63 (299) ..|.++++.| +|+|+|+++.+... +++++|+++.. T Consensus 3 ~DKAll~~~G~~~tLlerv~~~l~~~-~~~V~vv~~~~ 39 (178) T PRK00576 3 RDKATLPLPGGTTTLVEHVVGIVGQR-CAPVFVMAAPG 39 (178) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHH-CCEEEEECCCC T ss_conf 88774276799825999999999865-89899988998 No 95 >pfam07959 Fucokinase L-fucokinase. In the salvage pathway of GDP-L-fucose, free cytosolic fucose is phosphorylated by L-fucokinase to form L-fucose-L-phosphate, which is then further converted to GDP-L-fucose in the reaction catalysed by GDP-L-fucose pyrophosphorylase. Probab=86.54 E-value=2.4 Score=22.01 Aligned_cols=137 Identities=17% Similarity=0.189 Sum_probs=65.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHHCCCCCCEECCCCCCCC----CCEEEEECCCCCCCCCCC Q ss_conf 1112786444544332211222122343220011243285443313310013333335----300000025778886734 Q gi|254780365|r 131 ALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSECDPQLSCKYGMVQVGKAIDHQ----VFHISDMIEKPDSSTFIS 206 (299) Q Consensus 131 lv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v~~~~~~~yGvi~~~~~~~~~----~~~I~~~vEKP~~~~~~S 206 (299) +|..+|+++..... ..++ +.+.+..++++ +.+.+....-||...+...... ...+.+|.-||+.++-.. T Consensus 57 ~V~s~D~ll~~~~~-----~~~~-~~~~g~~~la~-p~~~~~a~~HGVfv~d~~~~~~~~~~~~~~~~~LqKPS~eem~~ 129 (414) T pfam07959 57 LVTSGDELLSVGDA-----PGIS-FDEPGATALAH-PSSLAIATNHGVFVTDSQGSLAHDLTYRLVDDFLQKPTIEELVQ 129 (414) T ss_pred EEEECCEEEECCCC-----CCCC-CCCCCEEEEEE-CCCHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCHHHHHH T ss_conf 99856755622764-----4566-57898289970-47866735653899688887443303556887632898999964 Q ss_pred ---------CEEECHHHEECHHHHHHHCCC------------CC-CCCCCCCCH-HHHHHHHH----------------- Q ss_conf ---------233020200272331100012------------26-667675003-78999998----------------- Q gi|254780365|r 207 ---------NFFINGRYILHPDIFSILNDW------------KE-NEGKGEIQL-TDSMRKLS----------------- 246 (299) Q Consensus 207 ---------nl~~~GiYi~~~~i~~~L~~~------------~~-~~~~gE~~l-tD~i~~l~----------------- 246 (299) -+..+|++.|+.+..+.|-.. .. +..+-|+.+ .|.+..|. T Consensus 130 ~~av~~~g~~~ldtG~~~~s~~a~~~L~~~~~~~~~~~~~~~~~~~~~~~eidlY~Dfl~aLg~~at~e~~~~~~~~~~~ 209 (414) T pfam07959 130 FNAVGRDGLFLLDTGILSLSGEAVESLFAMDHSSAKMLLTYYELIGALKCEISLYGDFLLALGPGATREHPRLTSRVMKE 209 (414) T ss_pred CCCCCCCCCEEEEEEEEEECHHHHHHHHHHCCCCHHHHHHHHCCCCCCCEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCH T ss_conf 46625788376541028853899999997655777888765313788642675488999984788840033344665540 Q ss_pred ---------------HCCCEEEEEE-EEEEEECCCHHHHHHHHH Q ss_conf ---------------3198489998-137864399889999999 Q gi|254780365|r 247 ---------------ERHDFLAYHF-KGHTYDCGSKKGFVLANI 274 (299) Q Consensus 247 ---------------~~~~v~a~~~-~g~w~DiG~~~~yl~A~~ 274 (299) +.-++..+.+ +|..+.+||-..|+.... T Consensus 210 ~~~l~~~R~~i~~~L~~~~l~v~~l~~~~F~H~GTs~E~L~~~t 253 (414) T pfam07959 210 ESYLKLMRQKIFHLLKGTSLNVVVLPDGGFYHFGTSAEYLDHLT 253 (414) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECCCEEEECCCCHHHHHHHC T ss_conf 14578899999998608833799908977877058899998633 No 96 >cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=85.72 E-value=2.3 Score=22.16 Aligned_cols=101 Identities=15% Similarity=0.205 Sum_probs=55.7 Q ss_pred EEEECCE--EHHHHHHHHHHHCCCC--EEEEECCCCCC-CCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEEC Q ss_conf 3568836--2799999999978998--89998135544-41010011745444565418700001134411248579984 Q gi|254780365|r 32 MLAIVDR--PVIQYVIEEALEAGLT--DFVFVTGRGKG-LIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFT 106 (299) Q Consensus 32 llpi~~k--piI~~~i~~l~~~Gi~--~i~iv~~~~k~-~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v 106 (299) .+|..|. ..|...|+.+.++-.. |++||-+-..+ ... ..++... .....+.++ T Consensus 6 iip~yN~~~~~l~~~l~Si~~Qt~~~~EiIvvDd~StD~~~~----------~~~~~~~------------~~~~~i~~~ 63 (202) T cd04184 6 VMPVYNTPEKYLREAIESVRAQTYPNWELCIADDASTDPEVK----------RVLKKYA------------AQDPRIKVV 63 (202) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHH----------HHHHHHH------------HHCCCCCCC T ss_conf 997488966999999999983789987999998998947899----------9999988------------518852001 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3334233466542110112233331112786444544332211222122343 Q gi|254780365|r 107 WQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKE 158 (299) Q Consensus 107 ~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~ 158 (299) .+++..|.+.|.-.+.....++-++.+-+|+++.+. .++.+++...+. T Consensus 64 ~~~~n~G~~~a~N~gi~~a~geyi~flD~DD~~~p~----~l~~~~~~~~~~ 111 (202) T cd04184 64 FREENGGISAATNSALELATGEFVALLDHDDELAPH----ALYEVVKALNEH 111 (202) T ss_pred CCCCCCCHHHHHHHHHHCCCCCEEEECCCCCEECHH----HHHHHHHHHHHC T ss_conf 267887899998864513687667732778552843----999999999868 No 97 >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=82.38 E-value=1.7 Score=23.02 Aligned_cols=96 Identities=14% Similarity=0.169 Sum_probs=52.8 Q ss_pred EEEECCEE-HHHHHHHHHHHCCCC--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCC Q ss_conf 35688362-799999999978998--899981355444101001174544456541870000113441124857998433 Q gi|254780365|r 32 MLAIVDRP-VIQYVIEEALEAGLT--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQ 108 (299) Q Consensus 32 llpi~~kp-iI~~~i~~l~~~Gi~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q 108 (299) .+|..|++ .|...|+.+.++-.. ||+||-+-..+.-. +.++.. ...+.++.. T Consensus 2 IIptyN~~~~l~~~l~Sl~~q~~~~~eiivVD~~S~d~t~----------~~~~~~---------------~~~i~~i~~ 56 (166) T cd04186 2 IIVNYNSLEYLKACLDSLLAQTYPDFEVIVVDNASTDGSV----------ELLREL---------------FPEVRLIRN 56 (166) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHH----------HHHHHC---------------CCCEEEEEC T ss_conf 9998898899999999998377998099999689983156----------777742---------------898899989 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 342334665421101122333311127864445443322112221223 Q gi|254780365|r 109 YERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYE 156 (299) Q Consensus 109 ~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~ 156 (299) +++.|.+.|.-.+......+-++.+..|+++... .++.+++.++ T Consensus 57 ~~n~G~~~a~N~g~~~a~g~~i~~lD~D~~~~~~----~l~~~~~~~~ 100 (166) T cd04186 57 GENLGFGAGNNQGIREAKGDYVLLLNPDTVVEPG----ALLELLDAAE 100 (166) T ss_pred CCCCCHHHHHHHHHHHHCCCCEEEECCCEEECCC----HHHHHHHHHH T ss_conf 8987767876168987165726998799388839----9999999998 No 98 >PRK10018 predicted glycosyl transferase; Provisional Probab=81.73 E-value=2.2 Score=22.25 Aligned_cols=104 Identities=11% Similarity=0.093 Sum_probs=59.6 Q ss_pred CCCEEEE----CCE-EHHHHHHHHHHHCCCCE--EEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCC Q ss_conf 8333568----836-27999999999789988--9998135544410100117454445654187000011344112485 Q gi|254780365|r 29 PKEMLAI----VDR-PVIQYVIEEALEAGLTD--FVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIG 101 (299) Q Consensus 29 pKpllpi----~~k-piI~~~i~~l~~~Gi~~--i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 101 (299) .+|++.| .+. ..|...|+.+.++...+ ++||-....+ . .......... ... T Consensus 3 ~~PlVSVIIP~yN~~~~l~~aI~SVl~Qty~n~EiIIVDD~Std----~----~~~~~~~~~~--------------~d~ 60 (279) T PRK10018 3 DNPLISIYMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTS----W----EQLQQYVTAL--------------NDP 60 (279) T ss_pred CCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCC----H----HHHHHHHHHC--------------CCC T ss_conf 99809999947997799999999999579989899999899998----7----9999999975--------------899 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 799843334233466542110112233331112786444544332211222122343 Q gi|254780365|r 102 NAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKE 158 (299) Q Consensus 102 ~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~ 158 (299) .|.++.+++..|.+.|.-.|.....++-++.+-+||++.+.- |+.+++..++. T Consensus 61 RI~~i~~~~N~G~~~aRN~gi~~A~GeyIafLDsDD~~~Pnk----LE~ql~~~~~~ 113 (279) T PRK10018 61 RITYIHNDINSGACAVRNQAIMLAQGEYITGIDDDDEWTPNR----LSVFLAHKQQL 113 (279) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCH----HHHHHHHHHHC T ss_conf 889998789788999999999995699899999876878566----99999614422 No 99 >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=79.90 E-value=4.8 Score=20.20 Aligned_cols=98 Identities=10% Similarity=0.201 Sum_probs=50.1 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCC Q ss_conf 3568836-2799999999978998--899981355444101001174544456541870000113441124857998433 Q gi|254780365|r 32 MLAIVDR-PVIQYVIEEALEAGLT--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQ 108 (299) Q Consensus 32 llpi~~k-piI~~~i~~l~~~Gi~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q 108 (299) .+|.-+. ..|...|+.+.++-.. ||+||=+...+.- .++ +++. . ...+.+ .. T Consensus 3 iip~yN~~~~l~~~l~Si~~Q~~~~~EiIvVDd~S~D~t-------~~~---~~~~-------------~-~~~~~~-~~ 57 (202) T cd06433 3 ITPTYNQAETLEETIDSVLSQTYPNIEYIVIDGGSTDGT-------VDI---IKKY-------------E-DKITYW-IS 57 (202) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCH-------HHH---HHHC-------------C-CCEEEE-EE T ss_conf 999769889999999999837899979999979988423-------445---3311-------------3-412599-98 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 34233466542110112233331112786444544332211222122343 Q gi|254780365|r 109 YERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKE 158 (299) Q Consensus 109 ~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~ 158 (299) ++..|.++|.-.+......+-++.+.+|+++.+.. +.+++..+.+. T Consensus 58 ~~~~G~~~a~N~g~~~a~g~yi~~ld~DD~~~~~~----~~~~~~~~~~~ 103 (202) T cd06433 58 EPDKGIYDAMNKGIALATGDIIGFLNSDDTLLPGA----LLAVVAAFAEH 103 (202) T ss_pred CCCCCHHHHHHHHHHHCCCCEEECCCCCCEECCHH----HHHHHHHHHHC T ss_conf 88888899987327754855532248886268449----99999999878 No 100 >TIGR01584 citF citrate lyase, alpha subunit; InterPro: IPR006472 These sequences, from both Gram-positive and Gram-negative bacteria, represent the alpha subunit of the holoenzyme citrate lyase composed of alpha (2.8.3.10 from EC), beta, and acyl carrier protein subunits in a stoichiometric relationship of 6:6:6. Citrate lyase is an enzyme which converts citrate to oxaloacetate. In bacteria, this reaction is involved in citrate fermentation. The alpha subunit catalyzes the reaction Acetyl-CoA + citrate = acetate + (3S)-citryl-CoA. The protein from Lactococcus lactis subsp. lactis has been experimentally characterised . ; GO: 0008814 citrate CoA-transferase activity, 0006084 acetyl-CoA metabolic process, 0005737 cytoplasm, 0009346 citrate lyase complex. Probab=79.74 E-value=0.38 Score=26.93 Aligned_cols=16 Identities=19% Similarity=0.274 Sum_probs=8.4 Q ss_pred HHCCCCEEEEECCCCC Q ss_conf 9789988999813554 Q gi|254780365|r 49 LEAGLTDFVFVTGRGK 64 (299) Q Consensus 49 ~~~Gi~~i~iv~~~~k 64 (299) ++.-+.++++.++|-| T Consensus 236 IA~~~~~VI~aSGYfK 251 (496) T TIGR01584 236 IAKKAADVIVASGYFK 251 (496) T ss_pred HHHHHHHEEEECCCCC T ss_conf 8642300022268313 No 101 >cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex Probab=79.64 E-value=1.8 Score=22.77 Aligned_cols=61 Identities=13% Similarity=0.150 Sum_probs=39.0 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEE Q ss_conf 7998433342334665421101122333311127864445443322112221223432200-1124 Q gi|254780365|r 102 NAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI-LAVS 166 (299) Q Consensus 102 ~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v-i~~~ 166 (299) ++..+..++..|-|+|+..+......+.++.+-+|.-+.+. .+.+|++...+.+..+ +|.+ T Consensus 55 ~~~~i~~~~n~G~g~ai~~G~~~a~~d~i~~~D~D~~~~~~----~i~~~i~~~~~~~~d~V~gsR 116 (185) T cd04179 55 RVRVIRLSRNFGKGAAVRAGFKAARGDIVVTMDADLQHPPE----DIPKLLEKLLEGGADVVIGSR 116 (185) T ss_pred EEEEEECCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHHHCCCEEEEEEE T ss_conf 49986324577705899988875258889992799870999----999999999978985999997 No 102 >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=78.32 E-value=1.1 Score=24.09 Aligned_cols=100 Identities=18% Similarity=0.189 Sum_probs=57.1 Q ss_pred EEEECC--EE-HHHHHHHHHHHCCCC--EEEEEC-CCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEE Q ss_conf 356883--62-799999999978998--899981-355444101001174544456541870000113441124857998 Q gi|254780365|r 32 MLAIVD--RP-VIQYVIEEALEAGLT--DFVFVT-GRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVF 105 (299) Q Consensus 32 llpi~~--kp-iI~~~i~~l~~~Gi~--~i~iv~-~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~ 105 (299) .+|+.| +| .|.-.|+.+.++-.. |++||- |...+.. ...+... .....+.+ T Consensus 3 iip~YN~e~~~~l~~~l~Svl~Qt~~~~EiIiVdDgSs~d~~----------~~i~~~~-------------~~~~~i~~ 59 (201) T cd04195 3 LMSVYIKEKPEFLREALESILKQTLPPDEVVLVKDGPVTQSL----------NEVLEEF-------------KRKLPLKV 59 (201) T ss_pred EEECCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHH----------HHHHHHH-------------HCCCCEEE T ss_conf 998884897899999999999579998189999899996543----------9999986-------------14799899 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 43334233466542110112233331112786444544332211222122343 Q gi|254780365|r 106 TWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKE 158 (299) Q Consensus 106 v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~ 158 (299) +..++..|.+.|.-.+.....++-++.+-+|+++.+.. ++.+++..+++ T Consensus 60 i~~~~N~G~~~a~N~gi~~a~g~yI~~lD~DD~~~p~~----l~~~~~~l~~~ 108 (201) T cd04195 60 VPLEKNRGLGKALNEGLKHCTYDWVARMDTDDISLPDR----FEKQLDFIEKN 108 (201) T ss_pred EECCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCHHH----HHHHHHHHHHC T ss_conf 98788789899977636426766999818898467169----99999999878 No 103 >cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate. Probab=77.13 E-value=3.6 Score=20.95 Aligned_cols=100 Identities=17% Similarity=0.206 Sum_probs=56.0 Q ss_pred EEEECC-EE-HHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCC Q ss_conf 356883-62-7999999999789988999813554441010011745444565418700001134411248579984333 Q gi|254780365|r 32 MLAIVD-RP-VIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQY 109 (299) Q Consensus 32 llpi~~-kp-iI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~ 109 (299) ++|..| .| +|...++.+....-.||++|.....+. ....+..... .. .+. +... T Consensus 5 iIPayNE~~~il~~~l~s~~~~~~~eiivV~D~s~d~----------~~~~~~~~~~------------~~-~~~-v~~~ 60 (235) T cd06434 5 IIPVYDEDPDVFRECLRSILRQKPLEIIVVTDGDDEP----------YLSILSQTVK------------YG-GIF-VITV 60 (235) T ss_pred EEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCC----------HHHHHHHHCC------------CC-CEE-EEEC T ss_conf 9952269768999999999808999899998899972----------8999998634------------89-789-9988 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 42334665421101122333311127864445443322112221223432 Q gi|254780365|r 110 ERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEG 159 (299) Q Consensus 110 ~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~ 159 (299) +..|-++|+..|.+....|-++++-+|.+...+ .|++|+..++... T Consensus 61 ~~~GK~~Aln~~l~~a~~d~v~~~DaD~~~~~~----~l~~l~~~f~d~~ 106 (235) T cd06434 61 PHPGKRRALAEGIRHVTTDIVVLLDSDTVWPPN----ALPEMLKPFEDPK 106 (235) T ss_pred CCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHCCCC T ss_conf 989989999999997577889998588535888----9999999648998 No 104 >cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, Probab=76.90 E-value=3.1 Score=21.36 Aligned_cols=146 Identities=11% Similarity=0.137 Sum_probs=66.8 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEEECCHHHCCCCCCEE Q ss_conf 799843334233466542110112233331112786444544332211222122343220-0112432854433133100 Q gi|254780365|r 102 NAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGAN-ILAVSECDPQLSCKYGMVQ 180 (299) Q Consensus 102 ~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~-vi~~~~v~~~~~~~yGvi~ 180 (299) ++..+.+++.+|-|+|+..+.+....+-++++-+|.-+.+. .+.+|++...+.+.. +++..-........++... T Consensus 54 ~~~~i~~~~n~G~g~Ai~~G~~~a~~~~i~~~DaD~q~~p~----~i~~li~~~~~~~~d~V~GsR~~~~~~~~~~~~~r 129 (224) T cd06442 54 RVRLIVRPGKRGLGSAYIEGFKAARGDVIVVMDADLSHPPE----YIPELLEAQLEGGADLVIGSRYVEGGGVEGWGLKR 129 (224) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHH----HHHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHH T ss_conf 24786336678727999999986017846886599976277----78999999871887247988762798635770666 Q ss_pred CCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCC-CEEEEEEEEE Q ss_conf 133333353000000257788867342330202002723311000122666767500378999998319-8489998137 Q gi|254780365|r 181 VGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSERH-DFLAYHFKGH 259 (299) Q Consensus 181 ~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~~-~v~a~~~~g~ 259 (299) .... ++-.++-+---....++ ..+|.-+|+.++++.+.....+.+. ++. ++.+-.+.+.+ ++.-+++ . T Consensus 130 ---~~~s---~~~~~i~~~l~~~~i~D-~~~gfr~~~~~~~~~i~~~~~~~~f-~~~-~ei~~~~~~~~~ki~evpv--~ 198 (224) T cd06442 130 ---KLIS---RGANLLARLLLGRKVSD-PTSGFRAYRREVLEKLIDSLVSKGY-KFQ-LELLVRARRLGYRIVEVPI--T 198 (224) T ss_pred ---HHHH---HHHHHHHHHCCCCEECC-CCCCEEEEEHHHHHHHHHHCCCCCC-CCH-HHHHHHHHHCCCEEEEEEE--E T ss_conf ---7888---88887632305842431-7996399889999988775456663-018-9999999988998999972--7 Q ss_pred EEE Q ss_conf 864 Q gi|254780365|r 260 TYD 262 (299) Q Consensus 260 w~D 262 (299) |.+ T Consensus 199 ~~~ 201 (224) T cd06442 199 FVD 201 (224) T ss_pred ECC T ss_conf 779 No 105 >COG1216 Predicted glycosyltransferases [General function prediction only] Probab=75.76 E-value=6.3 Score=19.46 Aligned_cols=204 Identities=12% Similarity=0.035 Sum_probs=90.1 Q ss_pred EEEECCEEHHHHHHHHHHHCCCCEEEE--ECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCC Q ss_conf 356883627999999999789988999--813554441010011745444565418700001134411248579984333 Q gi|254780365|r 32 MLAIVDRPVIQYVIEEALEAGLTDFVF--VTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQY 109 (299) Q Consensus 32 llpi~~kpiI~~~i~~l~~~Gi~~i~i--v~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~ 109 (299) .+..+...-+...+..+.++.-....+ |-+...+.-.+. +.... ..++.++.-+ T Consensus 9 iv~yn~~~~l~~~l~~l~~~~~~~~~iv~vDn~s~d~~~~~----------~~~~~--------------~~~v~~i~~~ 64 (305) T COG1216 9 IVTYNRGEDLVECLASLAAQTYPDDVIVVVDNGSTDGSLEA----------LKARF--------------FPNVRLIENG 64 (305) T ss_pred EEEECCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHH----------HHHHC--------------CCCEEEEECC T ss_conf 99968888999999999717687635998079997322888----------77605--------------8868999889 Q ss_pred CCCCCCCCCCCCCCCC-CC-C-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHH----CCCCCCEECC Q ss_conf 4233466542110112-23-3-33111278644454433221122212234322001124328544----3313310013 Q gi|254780365|r 110 ERKGLGHAVWCARNII-GD-N-PFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSECDPQL----SCKYGMVQVG 182 (299) Q Consensus 110 ~p~Gtg~Ai~~a~~~i-~d-e-~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v~~~~----~~~yGvi~~~ 182 (299) +-+|.|++........ ++ . .++++++|..... ..+++|++.+++.+...+....+...+ .+..| ... T Consensus 65 ~NlG~agg~n~g~~~a~~~~~~~~l~LN~D~~~~~----~~l~~ll~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~~~- 138 (305) T COG1216 65 ENLGFAGGFNRGIKYALAKGDDYVLLLNPDTVVEP----DLLEELLKAAEEDPAAGVVGPLIRNYDESLYIDRRG-GES- 138 (305) T ss_pred CCCEEHHHHHHHHHHHHCCCCCEEEEEECCCCCCH----HHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCC-CCC- T ss_conf 88670888889999986289848999838868790----699999999986898779634654887532344345-644- Q ss_pred CCCCCCCCEEEEECCCCCCCCC-CCC--EEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCC-CEEEEEEEE Q ss_conf 3333353000000257788867-342--330202002723311000122666767500378999998319-848999813 Q gi|254780365|r 183 KAIDHQVFHISDMIEKPDSSTF-ISN--FFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSERH-DFLAYHFKG 258 (299) Q Consensus 183 ~~~~~~~~~I~~~vEKP~~~~~-~Sn--l~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~~-~v~a~~~~g 258 (299) .............+.+..... +.. .++.+.-+++.+.|+.+.-..... .-=.+=+|.=-.+.+.| +++-++-.- T Consensus 139 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~li~~~~~~~vG~~de~~-F~y~eD~D~~~R~~~~G~~i~~~p~a~ 216 (305) T COG1216 139 -DGLTGGWRASPLLEIAPDLSSYLEVVASLSGACLLIRREAFEKVGGFDERF-FIYYEDVDLCLRARKAGYKIYYVPDAI 216 (305) T ss_pred -CCCCCCHHCCCCCCCCCCCCCCCHHHHHCCCCEEEEEHHHHHHHCCCCHHH-HHHHHHHHHHHHHHHCCCCEEEEECEE T ss_conf -433220002433344443334201333225617997499999847987687-210068799999997599699800459 Q ss_pred EEEECCCHH Q ss_conf 786439988 Q gi|254780365|r 259 HTYDCGSKK 267 (299) Q Consensus 259 ~w~DiG~~~ 267 (299) .|+.+|.-. T Consensus 217 i~H~~g~s~ 225 (305) T COG1216 217 IYHKIGSSK 225 (305) T ss_pred EEECCCCCC T ss_conf 997377777 No 106 >cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified. Probab=75.24 E-value=4.4 Score=20.39 Aligned_cols=112 Identities=14% Similarity=0.134 Sum_probs=57.7 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC---EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECC Q ss_conf 3568836-2799999999978998---89998135544410100117454445654187000011344112485799843 Q gi|254780365|r 32 MLAIVDR-PVIQYVIEEALEAGLT---DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTW 107 (299) Q Consensus 32 llpi~~k-piI~~~i~~l~~~Gi~---~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~ 107 (299) .+|+.|. ..|.-.|+.+..+..+ |++||=.-..+ ...+..+.+..+.... ...+.... T Consensus 2 IiP~YN~~~~l~~~l~Svl~Qty~~~~EiIiVDDgStD-------~t~~ii~~~~~~~~~~-----------~~~~~~~~ 63 (219) T cd06913 2 ILPVHNGEQWLDECLESVLQQDFEGTLELSVFNDASTD-------KSAEIIEKWRKKLEDS-----------GVIVLVGS 63 (219) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCC-------CHHHHHHHHHHHCCCC-----------CEEEEEEC T ss_conf 88678988999999999984889998999999899981-------5699999999866677-----------76999823 Q ss_pred --CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf --3342334665421101122333311127864445443322112221223432200112 Q gi|254780365|r 108 --QYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAV 165 (299) Q Consensus 108 --Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~ 165 (299) .+.|+|.|.|.-.+...-.++-++.+-+|+++.+.. ++.+++....+...+++. T Consensus 64 ~~~~~~~G~~~arN~gi~~A~GeyI~flDsDD~~~p~~----l~~~~~~~~~~~~~~v~~ 119 (219) T cd06913 64 HNSPSPKGVGYAKNQAIAQSSGRYLCFLDSDDVMMPQR----IRLQYEAALQHPNSIIGC 119 (219) T ss_pred CCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHH----HHHHHHHHHHCCCCEEEE T ss_conf 67766467899999999977888788618976427659----999999998599969998 No 107 >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=75.04 E-value=5.6 Score=19.75 Aligned_cols=168 Identities=13% Similarity=0.131 Sum_probs=77.2 Q ss_pred CCEEHHHHHHHHHHHCC--CCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCCCCCC Q ss_conf 83627999999999789--9889998135544410100117454445654187000011344112485799843334233 Q gi|254780365|r 36 VDRPVIQYVIEEALEAG--LTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQYERKG 113 (299) Q Consensus 36 ~~kpiI~~~i~~l~~~G--i~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~G 113 (299) +....|.-.|+.+.++- ..||+||=+-..+.-.+. + +......++.++..++.+| T Consensus 7 N~~~~l~~~L~Sl~~q~~~~~eIiVVDn~StD~t~~~----------l-------------~~~~~~~~v~~i~~~~N~G 63 (202) T cd04185 7 NRLDLLKECLDALLAQTRPPDHIIVIDNASTDGTAEW----------L-------------TSLGDLDNIVYLRLPENLG 63 (202) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHH----------H-------------HHHCCCCCEEEEECCCCCC T ss_conf 8889999999999827799988999979498761665----------3-------------0003566689985688873 Q ss_pred CCCCCCCCCCCC-C-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHHCCCCCCEECCCCCCCCCCE Q ss_conf 466542110112-2-33331112786444544332211222122343220011243285443313310013333335300 Q gi|254780365|r 114 LGHAVWCARNII-G-DNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSECDPQLSCKYGMVQVGKAIDHQVFH 191 (299) Q Consensus 114 tg~Ai~~a~~~i-~-de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~ 191 (299) .|+|...+.... + +.++++++.|+... ....++.|++..+.....+++-...+.+ | T Consensus 64 ~a~~~N~Gi~~a~~~~~d~v~~ld~D~~~---~~~~l~~l~~~~~~~~~~~~~~~~~~~~-----~-------------- 121 (202) T cd04185 64 GAGGFYEGVRRAYELGYDWIWLMDDDAIP---DPDALEKLLAYADKDNPQFLAPLVLDPD-----G-------------- 121 (202) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCCCC---CCCHHHHHHHHHHCCCCEEEEEEEECCC-----C-------------- T ss_conf 27899999998875598189998998754---9229999999865799779997898479-----9-------------- Q ss_pred EEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCC-CEEEEEEEEEEEECCCHHH Q ss_conf 0000257788867342330202002723311000122666767500378999998319-8489998137864399889 Q gi|254780365|r 192 ISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLSERH-DFLAYHFKGHTYDCGSKKG 268 (299) Q Consensus 192 I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~~-~v~a~~~~g~w~DiG~~~~ 268 (299) .+ .| .+++.++|+.+--..+..... ++=.|.--.+.+.| +++ ++-.-.|+++|...+ T Consensus 122 ---------------~F--~g-~l~~r~v~~~vG~fde~~f~~-~eD~D~~~R~~~~G~~i~-~p~a~v~H~~~~~~~ 179 (202) T cd04185 122 ---------------SF--VG-VLISRRVVEKIGLPDKEFFIW-GDDTEYTLRASKAGPGIY-VPDAVVVHKTAINKG 179 (202) T ss_pred ---------------CE--EE-EEEEHHHHHHHCCCCHHHCCC-CCHHHHHHHHHHCCCCEE-ECCEEEEECCCCCCC T ss_conf ---------------98--99-988899999758997686673-829999999997499699-778899987877788 No 108 >TIGR01320 mal_quin_oxido malate:quinone-oxidoreductase; InterPro: IPR006231 The membrane-associated enzyme, malate:quinone-oxidoreductase, is an alternative to the better-known NAD-dependent malate dehydrogenase as part of the TCA cycle. The reduction of a quinone rather than NAD+ makes the reaction essentially irreversible in the direction of malate oxidation to oxaloacetate. Both forms of malate dehydrogenase are active in E. coli; disruption of this form causes less phenotypic change. In some bacteria, this form is the only or the more important malate dehydrogenase . ; GO: 0008924 malate dehydrogenase (acceptor) activity, 0006099 tricarboxylic acid cycle. Probab=74.97 E-value=2.4 Score=22.06 Aligned_cols=110 Identities=17% Similarity=0.144 Sum_probs=57.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCEEEEEECCHHHCCCCC----CEECCCCCC-CCCCEEE Q ss_conf 110112233331112786444544332211222122343-2200112432854433133----100133333-3530000 Q gi|254780365|r 120 CARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKE-GANILAVSECDPQLSCKYG----MVQVGKAID-HQVFHIS 193 (299) Q Consensus 120 ~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~-~~~vi~~~~v~~~~~~~yG----vi~~~~~~~-~~~~~I~ 193 (299) +++.-=-+||+.+-+.|+=.+.+ +.-..+||+++.... |..+.-=++|. +.+|=+ ++.+....+ ++..+|. T Consensus 154 M~~gR~~~~pvA~~~~~~GTDvd-FGalt~Ql~~~~~~~PG~~~~yg~~V~--~l~r~sD~~W~vt~~d~r~PG~~~~l~ 230 (487) T TIGR01320 154 MAKGRDFSEPVAANWVAEGTDVD-FGALTKQLLKYLEQEPGTKIRYGHEVK--DLKRESDGAWVVTVKDTRTPGGKRTLK 230 (487) T ss_pred CCCCCCCCCHHHHHHHCCCCEEC-HHHHHHHHHHHHHCCCCCEEECCCCEE--HHHHCCCCCCEEEECCCCCCCCCEEEE T ss_conf 56876886343321204784002-778999999997028981676385000--110105885378841220888762676 Q ss_pred EECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHH----HCC-CEEEEEEEEEEEECCCHHH Q ss_conf 00257788867342330202002723311000122666767500378999998----319-8489998137864399889 Q gi|254780365|r 194 DMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKENEGKGEIQLTDSMRKLS----ERH-DFLAYHFKGHTYDCGSKKG 268 (299) Q Consensus 194 ~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~----~~~-~v~a~~~~g~w~DiG~~~~ 268 (299) .++++.| +| | ++++.|- +++ .+-|+++.|.|+=|++|+- T Consensus 231 ------------a~FvFlG------------------AG-G-----~AL~lLq~SGIpevKGFaGFPvSG~fLR~~Npe~ 274 (487) T TIGR01320 231 ------------ARFVFLG------------------AG-G-----GALPLLQKSGIPEVKGFAGFPVSGLFLRCSNPEL 274 (487) T ss_pred ------------EEEEEEE------------------CC-C-----CHHHHHHHCCCCCCCCCCCCCCCCEECCCCCHHH T ss_conf ------------6778971------------------57-4-----1688988667987665336575550001788799 No 109 >cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily. Probab=73.12 E-value=6.2 Score=19.51 Aligned_cols=110 Identities=11% Similarity=0.090 Sum_probs=54.7 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEEECCHHHC-CCCCCE Q ss_conf 7998433342334665421101122333311127864445443322112221223432200-11243285443-313310 Q gi|254780365|r 102 NAVFTWQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANI-LAVSECDPQLS-CKYGMV 179 (299) Q Consensus 102 ~i~~v~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~v-i~~~~v~~~~~-~~yGvi 179 (299) ++.++..++..|-|.|+..+-.....+-++.+-+|.-..+ ..+..|++..+ .+..+ ++......... .+++. T Consensus 56 ~v~~i~~~~N~G~g~Ai~~g~~~a~~d~i~~~D~D~q~~p----~~i~~l~~~~~-~~~D~V~gsR~~r~~~~~r~~~s- 129 (181) T cd04187 56 RVKVIRLSRNFGQQAALLAGLDHARGDAVITMDADLQDPP----ELIPEMLAKWE-EGYDVVYGVRKNRKESWLKRLTS- 129 (181) T ss_pred CEEEEECCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCH----HHHHHHHHHHH-CCCEEEEEEEECCCCCHHHHHHH- T ss_conf 2045314557644678898998556974775189998299----99999999997-59919999970788768999999- Q ss_pred ECCCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCC Q ss_conf 01333333530000002577888673423302020027233110001226 Q gi|254780365|r 180 QVGKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKE 229 (299) Q Consensus 180 ~~~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~ 229 (299) ... ..+.++.=+.... -..+|+.+|++++++.|..... T Consensus 130 ----~~~---~~l~~~l~~~~~~-----D~~~gfr~~~r~~~~~i~~~~~ 167 (181) T cd04187 130 ----KLF---YRLINKLSGVDIP-----DNGGDFRLMDRKVVDALLLLPE 167 (181) T ss_pred ----HHH---HHHHHHHHCCCCC-----CCCCCEEEEEHHHHHHHHHCCC T ss_conf ----999---9999998799900-----7892869988999999874899 No 110 >TIGR03552 F420_cofC 2-phospho-L-lactate guanylyltransferase CofC. Members of this protein family are the CofC enzyme of coenzyme F420 biosynthesis. Probab=64.29 E-value=2.6 Score=21.79 Aligned_cols=112 Identities=20% Similarity=0.252 Sum_probs=58.0 Q ss_pred EEEEECCCC---CCCCCHH-HCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHC Q ss_conf 899956888---8334324-408883335688362799999999978998899981355444101001174544456541 Q gi|254780365|r 9 KAVFPIAGL---GMRFFPI-SKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKR 84 (299) Q Consensus 9 KavIlAaG~---GtRl~P~-T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~ 84 (299) +|||+.=.. =|||.|. +..---. .-+-|+..++..+..+++.++++|+.-. .+.+ . .. T Consensus 1 ~aviPvK~~~~aKtRLs~~L~~~eR~~----L~~~ml~dtl~~l~~~~~~~v~vvt~d~--~~~~----------~-a~- 62 (195) T TIGR03552 1 RAVIPVKRLANAKSRLSPVLSPEEREE----LALAMLRDVITALRGAGAGAVLVVSPDP--ALLE----------A-AR- 62 (195) T ss_pred CEEEECCCCCCCCCCCCCCCCHHHHHH----HHHHHHHHHHHHHHHCCCCCEEEEECCH--HHHH----------H-HH- T ss_conf 917836888745200453079999999----9999999999999858997189990987--8999----------9-98- Q ss_pred CCCEEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 870000113441124857998433342334665421101122--3333111278644454433221122212234 Q gi|254780365|r 85 NKKAELTLLAESIPSIGNAVFTWQYERKGLGHAVWCARNIIG--DNPFALLLPDMIMSPLEGENCMANMIKLYEK 157 (299) Q Consensus 85 ~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gtg~Ai~~a~~~i~--de~flv~~gD~i~~~~~~~~~l~~li~~~~~ 157 (299) .. .+..+.|+.. |++.|+..|..... ..+.+++.+|- +......++++++...+ T Consensus 63 -------------~~--g~~~i~~~~~-gLn~al~~a~~~~~~~~~~vlii~aDl---P~l~~~~l~~~l~~~~~ 118 (195) T TIGR03552 63 -------------NL--GAPVLRDPGP-GLNNALNAALAEAREPGGAVLILMADL---PLLTPRELKRLLAAATE 118 (195) T ss_pred -------------HC--CCEEEECCCC-CHHHHHHHHHHHHHCCCCCEEEECCCC---CCCCHHHHHHHHHHCCC T ss_conf -------------71--9979826998-889999999999873899389965887---88999999999984444 No 111 >COG1788 AtoD Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit [Lipid metabolism] Probab=62.08 E-value=12 Score=17.76 Aligned_cols=34 Identities=21% Similarity=0.419 Sum_probs=27.1 Q ss_pred EEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 9956888833432440888333568836279999999997899889998135 Q gi|254780365|r 11 VFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGR 62 (299) Q Consensus 11 vIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~ 62 (299) -|+.||+|+..-|. .+|++++++|+++..+|.+- T Consensus 20 ti~~gGFg~~g~P~------------------alI~ali~~GvkdLt~I~n~ 53 (220) T COG1788 20 TIMIGGFGTCGIPE------------------ALIHALIRQGVKDLTVISNN 53 (220) T ss_pred EEEECCCCCCCCHH------------------HHHHHHHHCCCCCEEEEECC T ss_conf 99986545668969------------------99999997498635999468 No 112 >TIGR01096 3A0103s03R lysine-arginine-ornithine-binding periplasmic protein; InterPro: IPR005768 Bacterial high affinity transport systems are involved in active transport of solutes across the cytoplasmic membrane. The protein components of these traffic systems include one or two transmembrane protein components, one or two membrane-associated ATP-binding proteins (ABC transporters; see IPR003439 from INTERPRO) and a high affinity periplasmic solute-binding protein. The later are thought to bind the substrate in the vicinity of the inner membrane, and to transfer it to a complex of inner membrane proteins for concentration into the cytoplasm. In Gram-positive bacteria which are surrounded by a single membrane and have therefore no periplasmic region the equivalent proteins are bound to the membrane via an N-terminal lipid anchor. These homolog proteins do not play an integral role in the transport process per se, but probably serve as receptors to trigger or initiate translocation of the solute through the membrane by binding to external sites of the integral membrane proteins of the efflux system. In addition at least some solute-binding proteins function in the initiation of sensory transduction pathways. On the basis of sequence similarities, the vast majority of these solute-binding proteins can be grouped into eight families of clusters, which generally correlate with the nature of the solute bound. Family 3 groups together specific amino acids and opine-binding periplasmic proteins and a periplasmic homolog with catalytic activity. ; GO: 0005215 transporter activity, 0006810 transport, 0030288 outer membrane-bounded periplasmic space. Probab=62.06 E-value=6.2 Score=19.48 Aligned_cols=43 Identities=19% Similarity=0.401 Sum_probs=35.4 Q ss_pred EEEEEECCCHHHH--HHHH-HHHHHCC-------------------HH-HHHHHHHHHHHHHHHHC Q ss_conf 1378643998899--9999-9986318-------------------34-66999999999988609 Q gi|254780365|r 257 KGHTYDCGSKKGF--VLAN-IAFALAR-------------------QD-IRSDIETDLKTLVSALK 299 (299) Q Consensus 257 ~g~w~DiG~~~~y--l~A~-~~~al~~-------------------~~-~~~~~~~~l~~~~~~~~ 299 (299) +|+|+||=+|..| +.|+ +-+++++ ++ ++.|++..+..-+.++| T Consensus 255 ~g~W~DiV~W~~YAL~~A~E~Gig~rk~de~~~s~Np~i~r~lg~~~Dt~G~eL~~~~n~Al~~~~ 320 (333) T TIGR01096 255 DGYWFDIVRWSAYALVAAEEVGIGLRKVDEVKASTNPEIKRLLGSEGDTIGKELKAKFNKALKAIR 320 (333) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 883001214466777634432125345588851888302322045565121899999999999998 No 113 >cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end. Probab=61.11 E-value=5.9 Score=19.64 Aligned_cols=99 Identities=9% Similarity=0.076 Sum_probs=52.2 Q ss_pred EEEECCEE--HHHHHHHHHHHCCC--C--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEE Q ss_conf 35688362--79999999997899--8--899981355444101001174544456541870000113441124857998 Q gi|254780365|r 32 MLAIVDRP--VIQYVIEEALEAGL--T--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVF 105 (299) Q Consensus 32 llpi~~kp--iI~~~i~~l~~~Gi--~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~ 105 (299) ++|..|.+ +|.-.|+.+.+.-- + ||++|-+-..+. ..+..+.. .....+.+ T Consensus 6 iIPayNE~~~~i~~~l~sl~~q~YP~~~~eIiVvdD~std~----------t~~~~~~~-------------~~~~~~~~ 62 (234) T cd06421 6 FIPTYNEPLEIVRKTLRAALAIDYPHDKLRVYVLDDGRRPE----------LRALAAEL-------------GVEYGYRY 62 (234) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHH----------HHHHHHHH-------------CCCCEEEE T ss_conf 99838997899999999999679999828999998989878----------89888884-------------56643688 Q ss_pred CCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 43334233-46654211011223333111278644454433221122212234 Q gi|254780365|r 106 TWQYERKG-LGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEK 157 (299) Q Consensus 106 v~Q~~p~G-tg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~ 157 (299) +.++.+.| -++|+-.+......|-++++-+|++...+ .|+.++..+.. T Consensus 63 ~~~~~~~g~Ka~alN~g~~~a~gd~v~~~DaD~~~~~~----~L~~~~~~~~~ 111 (234) T cd06421 63 LTRPDNRHAKAGNLNNALAHTTGDFVAILDADHVPTPD----FLRRTLGYFLD 111 (234) T ss_pred EECCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHHH T ss_conf 87488999769999999997789999998898770876----99999999986 No 114 >PRK13412 fkp bifunctional fucokinase/L-fucose-1-P-guanylyltransferase; Provisional Probab=59.82 E-value=13 Score=17.43 Aligned_cols=136 Identities=15% Similarity=0.180 Sum_probs=77.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHHCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCC---- Q ss_conf 3111278644454433221122212234322001124328544331331001333333530000002577888673---- Q gi|254780365|r 130 FALLLPDMIMSPLEGENCMANMIKLYEKEGANILAVSECDPQLSCKYGMVQVGKAIDHQVFHISDMIEKPDSSTFI---- 205 (299) Q Consensus 130 flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~~~~v~~~~~~~yGvi~~~~~~~~~~~~I~~~vEKP~~~~~~---- 205 (299) .+++.||.+...+.. + .+. ...+..++++ .++++..+.-||+..++.. ...+..+.-||+.++-. T Consensus 155 ~lI~sGDV~l~~~~~---~---~~~-p~~dv~c~gl-~v~~~la~nHGVfvsdr~~---p~~ld~mLQKPS~~el~~l~~ 223 (974) T PRK13412 155 TLIASGDVYIRSEKP---L---QDI-PEADVVCYGL-WVDPSLATHHGVFVSDRKD---PERLDFMLQKPSLEELGGLSK 223 (974) T ss_pred EEEEECCEEEECCCC---C---CCC-CCCCCEEEEE-CCCHHHHCCCCEEEECCCC---HHHHHHHHHCCCHHHHHHHHC T ss_conf 599967789966755---4---776-5358479980-3686775357379967988---278888751988999998863 Q ss_pred --CCEEECHHHEECHHHHHHHCCCC---CCCCCCCCCH-HHHHHHH----------HHCCCEEEEEE-EEEEEECCCHHH Q ss_conf --42330202002723311000122---6667675003-7899999----------83198489998-137864399889 Q gi|254780365|r 206 --SNFFINGRYILHPDIFSILNDWK---ENEGKGEIQL-TDSMRKL----------SERHDFLAYHF-KGHTYDCGSKKG 268 (299) Q Consensus 206 --Snl~~~GiYi~~~~i~~~L~~~~---~~~~~gE~~l-tD~i~~l----------~~~~~v~a~~~-~g~w~DiG~~~~ 268 (299) .-+.-+|+++++.+.++.|-+.. ......+++| .|....| +..-++..+++ .|..+..||-.. T Consensus 224 ~~~~llD~Giw~ls~rAv~~L~~l~~~~~~~~~~~ydLY~Df~~aLg~~~~~~d~~~~~l~~~i~pLp~~eF~H~GTs~E 303 (974) T PRK13412 224 THLFLMDIGIWLLSDRAVERLMKRSGKEEADGLKYYDLYSDFGLALGGHPRIGDHEVNALSVAILPLPGGEFYHYGTSRE 303 (974) T ss_pred CCCEEEEEEEEEECCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCHHHHCCCEEEEEECCCCEEEEECCCHH T ss_conf 89640000599988489999999733566443210004989888507999831445225527999768863688068489 Q ss_pred HHHHHHHH Q ss_conf 99999998 Q gi|254780365|r 269 FVLANIAF 276 (299) Q Consensus 269 yl~A~~~~ 276 (299) ++...... T Consensus 304 ll~~t~~~ 311 (974) T PRK13412 304 LISSTLAV 311 (974) T ss_pred HHHHHHHH T ss_conf 98766655 No 115 >cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Probab=59.71 E-value=10 Score=18.19 Aligned_cols=100 Identities=12% Similarity=0.196 Sum_probs=55.1 Q ss_pred EEEEC-CEEHHHHHHHHHHHCCC--C--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEEC Q ss_conf 35688-36279999999997899--8--8999813554441010011745444565418700001134411248579984 Q gi|254780365|r 32 MLAIV-DRPVIQYVIEEALEAGL--T--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFT 106 (299) Q Consensus 32 llpi~-~kpiI~~~i~~l~~~Gi--~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v 106 (299) ++|.. ..+.|.-.|+.+.+..- + ||+||-+-..+ ...++-..+. .. .+..+ T Consensus 34 iIP~yNE~~~i~~~l~sl~~q~Yp~~~~eVIvvdD~StD-------~T~ei~~~~~---------------~~--~~~v~ 89 (251) T cd06439 34 IIPAYNEEAVIEAKLENLLALDYPRDRLEIIVVSDGSTD-------GTAEIAREYA---------------DK--GVKLL 89 (251) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCH-------HHHHHHHHHH---------------CC--CCEEE T ss_conf 997389889999999999966999887899999799981-------4999999974---------------55--85043 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 33342334665421101122333311127864445443322112221223432 Q gi|254780365|r 107 WQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEG 159 (299) Q Consensus 107 ~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~ 159 (299) .++...|-++|+..+......|-++++-+|++...+ .++.|+..+...+ T Consensus 90 ~~~~n~Gk~~AlN~gi~~a~gd~i~~lDaD~~~~~~----~l~~l~~~f~d~~ 138 (251) T cd06439 90 RFPERRGKAAALNRALALATGEIVVFTDANALLDPD----ALRLLVRHFADPS 138 (251) T ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCHH----HHHHHHHHHHCCC T ss_conf 167776799999999987678989833787651876----9999999860998 No 116 >cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot Probab=59.59 E-value=12 Score=17.75 Aligned_cols=103 Identities=12% Similarity=0.088 Sum_probs=53.8 Q ss_pred EEEECCEE-HHHHHHHHHHHCCC--C--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEEC Q ss_conf 35688362-79999999997899--8--8999813554441010011745444565418700001134411248579984 Q gi|254780365|r 32 MLAIVDRP-VIQYVIEEALEAGL--T--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFT 106 (299) Q Consensus 32 llpi~~kp-iI~~~i~~l~~~Gi--~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v 106 (299) ++|..|-+ .|...|+.+.+..- + +|++|.....+ .-....+.... .....+..+ T Consensus 6 iIPa~NE~~vI~~ti~sl~~~~YP~~~~evivv~d~s~~----------~t~~~~~~~~~-----------~~~~~~~~~ 64 (241) T cd06427 6 LVPLYKEAEVLPQLIASLSALDYPRSKLDVKLLLEEDDE----------ETIAAARALRL-----------PSIFRVVVV 64 (241) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCH----------HHHHHHHHHCC-----------CCCCEEEEE T ss_conf 996689889999999999967999761799999889996----------89999998578-----------767089996 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 33342334665421101122333311127864445443322112221223432 Q gi|254780365|r 107 WQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEG 159 (299) Q Consensus 107 ~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~ 159 (299) ....+.|-|+|+-.|..+...|-++++=+|++...+ .|+.++..+.+.+ T Consensus 65 ~~~~~~gK~~alN~al~~a~gd~v~~~DAD~~~~p~----~L~~~v~~f~~~~ 113 (241) T cd06427 65 PPSQPRTKPKACNYALAFARGEYVVIYDAEDAPDPD----QLKKAVAAFARLD 113 (241) T ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHHHCC T ss_conf 389887469999999996158859997865564977----9999999998579 No 117 >cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III). Probab=58.03 E-value=8.4 Score=18.68 Aligned_cols=49 Identities=24% Similarity=0.272 Sum_probs=29.9 Q ss_pred EECCCCCCCCC-HHHCCC---------CCCEEEE------CCEEHHHHHHHHHHHCCCCEEEEEC Q ss_conf 95688883343-244088---------8333568------8362799999999978998899981 Q gi|254780365|r 12 FPIAGLGMRFF-PISKVI---------PKEMLAI------VDRPVIQYVIEEALEAGLTDFVFVT 60 (299) Q Consensus 12 IlAaG~GtRl~-P~T~~~---------pKpllpi------~~kpiI~~~i~~l~~~Gi~~i~iv~ 60 (299) ||.=|.|||+- |+-+.+ -.|..|+ ...|-++-.+++|.+.|+++|+++- T Consensus 2 iL~v~hGs~~~dpy~~~ie~~a~~i~~~~~~~~v~~~f~e~~~P~i~eai~~l~~~G~~~ivvvP 66 (101) T cd03409 2 LLVVGHGSPYKDPYKKDIEAQAHNLAESLPDFPYYVGFQSGLGPDTEEAIRELAEEGYQRVVIVP 66 (101) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCEEHHHHHHCCCCHHHHHHHHHHCCCCEEEEEE T ss_conf 69996699976660899999999999887998241437977199999999999976986799986 No 118 >cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=57.97 E-value=7.2 Score=19.11 Aligned_cols=102 Identities=9% Similarity=0.111 Sum_probs=48.2 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCC Q ss_conf 3568836-2799999999978998--899981355444101001174544456541870000113441124857998433 Q gi|254780365|r 32 MLAIVDR-PVIQYVIEEALEAGLT--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQ 108 (299) Q Consensus 32 llpi~~k-piI~~~i~~l~~~Gi~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q 108 (299) .+|..+. +.|.-.|+.+.++-.. ||+||-+-..+. ..++...+.... ...+.+... T Consensus 3 iip~yN~~~~l~~~i~Sil~Qt~~~~EiivvDDgStD~-------t~~ii~~~~~~~--------------~~~~~~~~~ 61 (214) T cd04196 3 LMATYNGEKYLREQLDSILAQTYKNDELIISDDGSTDG-------TVEIIKEYIDKD--------------PFIIILIRN 61 (214) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCC-------HHHHHHHHHHHC--------------CCCCEEEEC T ss_conf 99917988899999999983799886999998999723-------799999999758--------------865136665 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 34233466542110112233331112786444544332211222122343 Q gi|254780365|r 109 YERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKE 158 (299) Q Consensus 109 ~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~ 158 (299) ++.+|.+.+..++.....+|-++.+-+|+++.+.. ++.+++...+. T Consensus 62 ~~n~G~~~n~n~gi~~a~geyi~~lD~DD~~~p~~----l~~~~~~~~~~ 107 (214) T cd04196 62 GKNLGVARNFESLLQAADGDYVFFCDQDDIWLPDK----LERLLKAFLKD 107 (214) T ss_pred CCCCHHHHHHHHHHHHCCCCEEECCCCCCCCCHHH----HHHHHHHHHHC T ss_conf 88805999999999864787574137765628659----99999999858 No 119 >PRK10073 predicted glycosyl transferase; Provisional Probab=57.45 E-value=6.3 Score=19.45 Aligned_cols=105 Identities=10% Similarity=0.138 Sum_probs=56.8 Q ss_pred EEECC-EEHHHHHHHHHHHCCCCE--EEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCC Q ss_conf 56883-627999999999789988--999813554441010011745444565418700001134411248579984333 Q gi|254780365|r 33 LAIVD-RPVIQYVIEEALEAGLTD--FVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQY 109 (299) Q Consensus 33 lpi~~-kpiI~~~i~~l~~~Gi~~--i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~ 109 (299) +|+.| ..-|+--++.+..+..++ |++|-.-.. |.+.++.+.+..+ ...+..+.| T Consensus 12 iP~YN~e~yl~~cl~Si~~Qt~~~~EiIiVdDgSt-------D~s~~i~~~~~~~---------------~~~i~vi~~- 68 (329) T PRK10073 12 VPLYNAGKDFRACMESLIAQTWTALEIIIVNDGST-------DNSVEIAKHYAEN---------------YPHVRLLHQ- 68 (329) T ss_pred EECCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCC-------CCHHHHHHHHHHH---------------CCCEEEEEC- T ss_conf 92889889999999999808999979999989998-------2589999999812---------------998999966- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 4233466542110112233331112786444544332211222122343220011 Q gi|254780365|r 110 ERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANILA 164 (299) Q Consensus 110 ~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi~ 164 (299) +-.|.+.|--.+.....++.+..+=+||.+... .++.|++..++.+++++. T Consensus 69 ~N~G~s~ARN~gl~~a~G~yi~f~DsDD~l~~~----~le~l~~~a~~~~~DIv~ 119 (329) T PRK10073 69 ANAGASVARNTGLAVATGKYVAFVDADDEVYPT----MYETLMTMALEDDLDVAQ 119 (329) T ss_pred CCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCC----HHHHHHHHHHHCCCCEEE T ss_conf 888618999999997488889997577275887----699999999747999999 No 120 >cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we Probab=56.67 E-value=8.9 Score=18.52 Aligned_cols=105 Identities=12% Similarity=0.142 Sum_probs=55.6 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--C--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEEC Q ss_conf 3568836-279999999997899--8--8999813554441010011745444565418700001134411248579984 Q gi|254780365|r 32 MLAIVDR-PVIQYVIEEALEAGL--T--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFT 106 (299) Q Consensus 32 llpi~~k-piI~~~i~~l~~~Gi--~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v 106 (299) ++|..|- ..|...|+.+.+.-- + +|+++-+...+..+ ...... .+....+.++.++ T Consensus 6 iIPa~NE~~~I~~~l~sl~~q~YP~~~~~I~VvDdstD~t~~-------~~~~~~------------~~~~~~~~~~~~~ 66 (232) T cd06437 6 QLPVFNEKYVVERLIEAACALDYPKDRLEIQVLDDSTDETVR-------LAREIV------------EEYAAQGVNIKHV 66 (232) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHH-------HHHHHH------------HHHHHCCCCEEEE T ss_conf 995589889999999999967999980899999799966999-------999999------------7666419968998 Q ss_pred CCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3334233-4665421101122333311127864445443322112221223432 Q gi|254780365|r 107 WQYERKG-LGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEG 159 (299) Q Consensus 107 ~Q~~p~G-tg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~ 159 (299) ..+++.| -++|+-.+.+....|-++++-+|++...+ .|++++..+.... T Consensus 67 ~~~~~~g~Ka~aln~gl~~a~gd~i~~~DaD~~~~~d----~L~~~~~~f~~~~ 116 (232) T cd06437 67 RRADRTGYKAGALAEGMKVAKGEYVAIFDADFVPPPD----FLQKTPPYFADPK 116 (232) T ss_pred ECCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHCCCC T ss_conf 3698888669999999997789889997764473847----9999999831998 No 121 >PRK00923 sirohydrochlorin cobaltochelatase; Reviewed Probab=51.82 E-value=18 Score=16.62 Aligned_cols=51 Identities=16% Similarity=0.377 Sum_probs=33.4 Q ss_pred CEEEEECCCCCCCCCHHHC------------CCCCCEEEE----CCEEHHHHHHHHHHHCCCCEEEEEC Q ss_conf 5899956888833432440------------888333568----8362799999999978998899981 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISK------------VIPKEMLAI----VDRPVIQYVIEEALEAGLTDFVFVT 60 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~------------~~pKpllpi----~~kpiI~~~i~~l~~~Gi~~i~iv~ 60 (299) ++||++.| .|||.- -.. ..+-..+.. ...|-|.-.++.|...|+++|++|- T Consensus 3 k~alllvg-HGSr~~-~~~~~~~~~a~~l~~~~~~~~V~~afle~~~P~i~~~~~~l~~~G~~~ivvvP 69 (130) T PRK00923 3 KIGLLLVG-HGSRLP-YNKEVVTEIAEKIKEKFPDYIVEVGFMEFNEPTIPEALKKFAGTGVDKIIVVP 69 (130) T ss_pred CEEEEEEE-CCCCCH-HHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEE T ss_conf 56999997-899976-78999999999999858998298877615789899999999966998899986 No 122 >pfam02633 Creatininase Creatinine amidohydrolase. Creatinine amidohydrolase (EC:3.5.2.10), or creatininase, catalyses the hydrolysis of creatinine to creatine. Probab=51.58 E-value=18 Score=16.59 Aligned_cols=26 Identities=19% Similarity=0.401 Sum_probs=18.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 79999999997899889998135544 Q gi|254780365|r 40 VIQYVIEEALEAGLTDFVFVTGRGKG 65 (299) Q Consensus 40 iI~~~i~~l~~~Gi~~i~iv~~~~k~ 65 (299) ++.-+++.+...|+++|+|+.+|..+ T Consensus 87 ~l~di~~sl~~~Gf~~ivilNgHgGN 112 (235) T pfam02633 87 LLRDIGRSLARHGFRRLVIVNGHGGN 112 (235) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 99999999997797589999168774 No 123 >TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602). Probab=48.85 E-value=17 Score=16.84 Aligned_cols=100 Identities=11% Similarity=0.105 Sum_probs=56.8 Q ss_pred EEEECC-EEHHHHHHHHHHHCCC--C--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEEC Q ss_conf 356883-6279999999997899--8--8999813554441010011745444565418700001134411248579984 Q gi|254780365|r 32 MLAIVD-RPVIQYVIEEALEAGL--T--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFT 106 (299) Q Consensus 32 llpi~~-kpiI~~~i~~l~~~Gi--~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v 106 (299) ++|..| ...|...++.+.+..- + ||++|-.-..+ ...+.-....+. .+...+ . T Consensus 54 iIPayNEe~~I~~~i~Sl~~~dYP~~~lEIIvVdDgStD-------~T~ei~~~~~~~-------------~p~~~v--~ 111 (439) T TIGR03111 54 IIPVYNSEDTLFNCIESIYNQTYPIELIDIILANNQSTD-------DSFQVFCRAQNE-------------FPGLSL--R 111 (439) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCH-------HHHHHHHHHHHH-------------CCCEEE--E T ss_conf 993587388999999999967999985599999789951-------799999998865-------------898399--9 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 333423346654211011223333111278644454433221122212234 Q gi|254780365|r 107 WQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEK 157 (299) Q Consensus 107 ~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~ 157 (299) ..+++.|-++|+-.|.....+|-++++=+|++...+ .++.|+..+.+ T Consensus 112 ~~~~n~GKa~ALN~gl~~A~Ge~iv~~DADt~le~d----aL~~lv~~F~~ 158 (439) T TIGR03111 112 YMNSDQGKAKALNAAIYNSIGKYIIHIDSDGKLHKD----AIKNMVTRFEN 158 (439) T ss_pred ECCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCHH----HHHHHHHHHHH T ss_conf 768989859999999997788989998089886856----99999998720 No 124 >cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=48.64 E-value=20 Score=16.41 Aligned_cols=210 Identities=10% Similarity=0.049 Sum_probs=87.9 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--C--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEEC Q ss_conf 3568836-279999999997899--8--8999813554441010011745444565418700001134411248579984 Q gi|254780365|r 32 MLAIVDR-PVIQYVIEEALEAGL--T--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFT 106 (299) Q Consensus 32 llpi~~k-piI~~~i~~l~~~Gi--~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v 106 (299) ++|..|. ..|...|+.+.++.- + ||+||-+-..+.- .++-..+... ....+..+ T Consensus 2 iIP~~Ne~~~i~~~l~sl~~q~yp~~~~EVivvdd~StD~T-------~~i~~~~~~~--------------~~~~~~~~ 60 (229) T cd04192 2 VIAARNEAENLPRLLQSLSALDYPKEKFEVILVDDHSTDGT-------VQILEFAAAK--------------PNFQLKIL 60 (229) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCH-------HHHHHHHHHC--------------CCCCEEEC T ss_conf 99988988999999999995789988689999989797167-------9999999705--------------69745530 Q ss_pred CC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE-EEEEC-CHH-HCCCCCCEEC Q ss_conf 33--3423346654211011223333111278644454433221122212234322001-12432-854-4331331001 Q gi|254780365|r 107 WQ--YERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANIL-AVSEC-DPQ-LSCKYGMVQV 181 (299) Q Consensus 107 ~Q--~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi-~~~~v-~~~-~~~~yGvi~~ 181 (299) .. ....|-++|+..+......|-++++-+|++...+ .++.++..+.......+ +-... +.+ -..++..++. T Consensus 61 ~~~~~~~~gk~~aln~g~~~a~ge~i~~lDaD~~~~~~----~l~~~~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~ 136 (229) T cd04192 61 NNSRVSISGKKNALTTAIKAAKGDWIVTTDADCVVPSN----WLLTFVAFIQKEQIGLVAGPVIYFKGKSLLAKFQRLDW 136 (229) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHHCCCCEEEECCEEEECCCCHHHHHHHHHH T ss_conf 24677772599999999986467769985676565876----99999999748994799887410079999999999999 Q ss_pred CCCCCCCCCEEEEECCCCCCCCCCCCEEECHHHEECHHHHHHHCCCCC--CCCCCCCCHHHHHHHHHHCCCEEEEEEE-E Q ss_conf 333333530000002577888673423302020027233110001226--6676750037899999831984899981-3 Q gi|254780365|r 182 GKAIDHQVFHISDMIEKPDSSTFISNFFINGRYILHPDIFSILNDWKE--NEGKGEIQLTDSMRKLSERHDFLAYHFK-G 258 (299) Q Consensus 182 ~~~~~~~~~~I~~~vEKP~~~~~~Snl~~~GiYi~~~~i~~~L~~~~~--~~~~gE~~ltD~i~~l~~~~~v~a~~~~-g 258 (299) ... ... ...... ...+ ...+.+...|..++|..+.-... ....+|.. +....+.+.|.-.+|..+ . T Consensus 137 ~~~--~~~--~~~~~~----~~~~-~~~~G~~~~~rr~~~~~vGGfd~~~~~~~~Dd~--~~~~~l~~~g~ki~y~~~p~ 205 (229) T cd04192 137 LSL--LGL--IAGSFG----LGKP-FMCNGANMAYRKEAFFEVGGFEGNDHIASGDDE--LLLAKVASKYPKVAYLKNPE 205 (229) T ss_pred HHH--HHH--HHHHHH----CCCC-EEEECCHHHHHHHHHHHCCCCCCCCCCCHHHHH--HHHHHHHHCCCEEEEEECCC T ss_conf 999--999--999850----6995-585062153239999980998899986369999--99999980899799983798 Q ss_pred --EEE-ECCCHHHHHHHHHHHH Q ss_conf --786-4399889999999986 Q gi|254780365|r 259 --HTY-DCGSKKGFVLANIAFA 277 (299) Q Consensus 259 --~w~-DiG~~~~yl~A~~~~a 277 (299) .|. --.|...|++-...|| T Consensus 206 a~v~~~~p~t~~~~~~Qr~RWa 227 (229) T cd04192 206 ALVTTQPVTSWKELLNQRKRWA 227 (229) T ss_pred EEEEEECCCCHHHHHHHHHHCC T ss_conf 6999878899999999997448 No 125 >TIGR00027 mthyl_TIGR00027 methyltransferase, putative, TIGR00027 family; InterPro: IPR011610 This entry represents a paralogous family with ~15 members in Mycobacterium tuberculosis and two members in Mycobacteriam leprae. No other homologs are found. . Probab=47.83 E-value=7.6 Score=18.96 Aligned_cols=18 Identities=33% Similarity=0.536 Sum_probs=14.3 Q ss_pred CCEEEEECCCCCCCCCHH Q ss_conf 458999568888334324 Q gi|254780365|r 7 VRKAVFPIAGLGMRFFPI 24 (299) Q Consensus 7 i~KavIlAaG~GtRl~P~ 24 (299) |+.+||||||+=||-+=+ T Consensus 91 ~rQVViLgAGLDtRAyRL 108 (281) T TIGR00027 91 IRQVVILGAGLDTRAYRL 108 (281) T ss_pred CCEEEEECCHHHHHHHHC T ss_conf 567543012123587635 No 126 >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus. Probab=45.54 E-value=23 Score=16.01 Aligned_cols=211 Identities=8% Similarity=0.060 Sum_probs=90.1 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC----CEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEEC Q ss_conf 3568836-279999999997899----88999813554441010011745444565418700001134411248579984 Q gi|254780365|r 32 MLAIVDR-PVIQYVIEEALEAGL----TDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFT 106 (299) Q Consensus 32 llpi~~k-piI~~~i~~l~~~Gi----~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v 106 (299) .+|..|. ..|...|+.+.++-. -||+||-+...+. ..+.-+.+..+ ...+..+ T Consensus 5 iIp~yN~~~~l~~~l~Sl~~q~yp~~~~EVIvVDd~S~D~-------t~~~~~~~~~~---------------~~~i~~~ 62 (249) T cd02525 5 IIPVRNEEKYIEELLESLLNQSYPKDLIEIIVVDGGSTDG-------TREIVQEYAAK---------------DPRIRLI 62 (249) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCC-------HHHHHHHHHHC---------------CCCEEEC T ss_conf 9935697899999999998468999988999998959615-------69999999740---------------7713211 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE-EEEECCHHHCCCCCCEECCCCC Q ss_conf 333423346654211011223333111278644454433221122212234322001-1243285443313310013333 Q gi|254780365|r 107 WQYERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEKEGANIL-AVSECDPQLSCKYGMVQVGKAI 185 (299) Q Consensus 107 ~Q~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~~~~~vi-~~~~v~~~~~~~yGvi~~~~~~ 185 (299) +.+..|.|.|.-.+......+-++.+-+|+++.++ .++.+++.+++.+...+ +..................... T Consensus 63 -~~~~~~~~~a~N~gi~~a~gd~i~~lD~D~~~~~~----~l~~~~~~~~~~~~~~v~g~~~~~~~~~~~~~~~~~~~~~ 137 (249) T cd02525 63 -DNPKRIQSAGLNIGIRNSRGDIIIRVDAHAVYPKD----YILELVEALKRTGADNVGGPMETIGESKFQKAIAVAQSSP 137 (249) T ss_pred -CCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHHHHCC T ss_conf -35556589999898996687768841477656956----9999999987789719998885068972777778876264 Q ss_pred CCCCCEEEEECCCCCCCCCCCCEEECHH-HEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCC-CEEEEE-EEEEEEE Q ss_conf 3353000000257788867342330202-002723311000122666767500378999998319-848999-8137864 Q gi|254780365|r 186 DHQVFHISDMIEKPDSSTFISNFFINGR-YILHPDIFSILNDWKENEGKGEIQLTDSMRKLSERH-DFLAYH-FKGHTYD 262 (299) Q Consensus 186 ~~~~~~I~~~vEKP~~~~~~Snl~~~Gi-Yi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~~-~v~a~~-~~g~w~D 262 (299) ..........+... ...+..|. -.+.+++|+.+.-..+....+| =+|.--.+.+.| ++.-.+ ..-++.. T Consensus 138 ---~~~~~~~~~~~~~~---~~~~~~~~~~~~rr~~~~~iGgfde~~~~~E--D~dl~~R~~~~G~ki~~~p~a~v~h~~ 209 (249) T cd02525 138 ---LGSGGSAYRGGAVK---IGYVDTVHHGAYRREVFEKVGGFDESLVRNE--DAELNYRLRKAGYKIWLSPDIRVYYYP 209 (249) T ss_pred ---CCCCCCCCCCCCCC---CCEECCEEEEEEEHHHHHHHCCCCCCCCCCH--HHHHHHHHHHCCCEEEEECCEEEEEEC T ss_conf ---32352000132245---8454213579888889997389987778238--999999999769979991887999989 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 399889999999986 Q gi|254780365|r 263 CGSKKGFVLANIAFA 277 (299) Q Consensus 263 iG~~~~yl~A~~~~a 277 (299) --|...+.+-...++ T Consensus 210 ~~s~~~~~~Q~~Rw~ 224 (249) T cd02525 210 RSTLKKLARQYFRYG 224 (249) T ss_pred CCCHHHHHHHHHHHH T ss_conf 999999999999998 No 127 >PRK11204 N-glycosyltransferase PgaC; Provisional Probab=44.29 E-value=7.9 Score=18.83 Aligned_cols=95 Identities=13% Similarity=0.083 Sum_probs=53.8 Q ss_pred CCEEHHHHHHHHHHHCCC--CEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCCCCCC Q ss_conf 836279999999997899--889998135544410100117454445654187000011344112485799843334233 Q gi|254780365|r 36 VDRPVIQYVIEEALEAGL--TDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQYERKG 113 (299) Q Consensus 36 ~~kpiI~~~i~~l~~~Gi--~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~G 113 (299) +.-..|..+|+.+.+..- -||++|..-..+. ..+.-..+.. ...++..+.+++..| T Consensus 64 NEe~~I~~tI~sll~~~YP~~eIiVVdDgStD~-------T~~i~~~~~~---------------~~p~~~vi~~~~n~G 121 (421) T PRK11204 64 NEGENVEETISAALALRYPNYEVIAINDGSSDN-------TGEILDRLAA---------------EDPRLRVIHLAENQG 121 (421) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEEEECCCCHH-------HHHHHHHHHH---------------HCCCEEEEECCCCCC T ss_conf 978999999999995899986699998999740-------8999999970---------------689689997699988 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4665421101122333311127864445443322112221223 Q gi|254780365|r 114 LGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYE 156 (299) Q Consensus 114 tg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~ 156 (299) -|+|+..|......|-++++=+|++...+ .++.|+..+. T Consensus 122 Ka~ALN~gl~~a~ge~Vv~~DAD~~~~~d----~L~~~v~~f~ 160 (421) T PRK11204 122 KAIALNTGAAAARSEYLVCIDGDALLDPD----AAAYMVEHFL 160 (421) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCHH----HHHHHHHHHH T ss_conf 99999999984378989998998714867----9999999997 No 128 >PRK09485 mmuM homocysteine methyltransferase; Provisional Probab=37.33 E-value=15 Score=17.08 Aligned_cols=60 Identities=13% Similarity=0.201 Sum_probs=36.1 Q ss_pred CCCCCCC---CEEEEECCCCCCCCCHHHCCCCCCEEE---ECCEE-HHHHHHHHHHHCCCCEEEEEC Q ss_conf 9788854---589995688883343244088833356---88362-799999999978998899981 Q gi|254780365|r 1 MGSLKKV---RKAVFPIAGLGMRFFPISKVIPKEMLA---IVDRP-VIQYVIEEALEAGLTDFVFVT 60 (299) Q Consensus 1 ~~~m~~i---~KavIlAaG~GtRl~P~T~~~pKpllp---i~~kp-iI~~~i~~l~~~Gi~~i~iv~ 60 (299) ||.++++ ++-+||=||.||-|.-.-...+.||-. +...| +|.-+=.+.++||-+=|.-.| T Consensus 1 ~~~~~~~ll~~~~lilDGgmGteL~~~G~~~~~~lWs~~~ll~~Pd~V~~iH~~yi~AGAdiI~TNT 67 (308) T PRK09485 1 MNPFKALLLAQSPLILDGALATELEARGCDLSDPLWSAKVLLENPELIYQVHLDYFRAGADIAITAS 67 (308) T ss_pred CCHHHHHHHCCCEEEEECHHHHHHHHCCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHCCCCEEEECC T ss_conf 9557899853990999897699999869999986545666611929999999999980878787245 No 129 >COG3315 O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=36.93 E-value=21 Score=16.26 Aligned_cols=43 Identities=35% Similarity=0.496 Sum_probs=22.3 Q ss_pred CCEEEEECCCCCCCCCHHHCCCCCCEEEE--CCEE-HHHHHHHHHHHCC Q ss_conf 45899956888833432440888333568--8362-7999999999789 Q gi|254780365|r 7 VRKAVFPIAGLGMRFFPISKVIPKEMLAI--VDRP-VIQYVIEEALEAG 52 (299) Q Consensus 7 i~KavIlAaG~GtRl~P~T~~~pKpllpi--~~kp-iI~~~i~~l~~~G 52 (299) ++..|||+||.=||.+-+.. |+ =+++ .|.| +|++=-..+...| T Consensus 93 ~~qvViLgaGLDTRayRl~~--~~-~~~vfEvD~Pevi~~K~~~l~e~~ 138 (297) T COG3315 93 IRQVVILGAGLDTRAYRLDW--PK-GTRVFEVDLPEVIEFKKKLLAERG 138 (297) T ss_pred CCEEEEECCCCCCCEEECCC--CC-CCEEEECCCCHHHHHHHHHHHHCC T ss_conf 65899975546532345488--88-874897887379999998730327 No 130 >pfam04858 TH1 TH1 protein. TH1 is a highly conserved but uncharacterized metazoan protein. No homologue has been identified in Caenorhabditis elegans. TH1 binds specifically to A-Raf kinase. Probab=33.23 E-value=5.1 Score=20.00 Aligned_cols=30 Identities=27% Similarity=0.301 Sum_probs=14.5 Q ss_pred HHHHHHCCCCEEE---EECCCCCCCCHHHCCCC Q ss_conf 9999978998899---98135544410100117 Q gi|254780365|r 45 IEEALEAGLTDFV---FVTGRGKGLIKDYFDIQ 74 (299) Q Consensus 45 i~~l~~~Gi~~i~---iv~~~~k~~I~~~f~~~ 74 (299) .++|+..|++..- .|.++.|+.|.++||+. T Consensus 92 ~~WL~~~g~~~~~Vq~~ve~hLk~li~k~FDpk 124 (582) T pfam04858 92 AEWLIVTGVSPTKVQAMVENHLKTLIRKRFDPN 124 (582) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 999997299889999999999999999848932 No 131 >pfam04555 XhoI Restriction endonuclease XhoI. This family consists of type II restriction enzymes (EC:3.1.21.4) that recognize the double-stranded sequence CTCGAG and cleave after C-1. Probab=33.10 E-value=9.4 Score=18.37 Aligned_cols=35 Identities=17% Similarity=0.204 Sum_probs=21.3 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCC Q ss_conf 79999999997899889998135544410100117 Q gi|254780365|r 40 VIQYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQ 74 (299) Q Consensus 40 iI~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~ 74 (299) ++..+.+.+.++|+.+--|..+...-.+--||.+. T Consensus 26 f~~li~~~~~~~Gl~~a~i~~~~~~l~lPGYfRpt 60 (196) T pfam04555 26 FAALVLDIIRANGLAHAEIFQQRTALTLPGYFRPT 60 (196) T ss_pred HHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCCCC T ss_conf 99999999998599777875247531056542445 No 132 >TIGR02017 hutG_amidohyd N-formylglutamate amidohydrolase; InterPro: IPR010247 In some species, histidine is converted to via urocanate and then formimino-L-glutamate to glutamate in four steps, where the fourth step is conversion of N-formimino-L-glutamate to L-glutamate and formamide. In others, that pathway from formimino-L-glutamate may differ, with the next enzyme being formiminoglutamate hydrolase (HutF) yielding N-formyl-L-glutamate. This entry represents the enzyme N-formylglutamate deformylase, also called N-formylglutamate amidohydrolase, which then produces glutamate.. Probab=31.84 E-value=38 Score=14.68 Aligned_cols=64 Identities=19% Similarity=0.223 Sum_probs=45.5 Q ss_pred CCCHHHHHHHHHHCCCEEEEEEEEE----EE--ECCCHHHHHHH-HHHHHHC------------------CHHHHHHHHH Q ss_conf 5003789999983198489998137----86--43998899999-9998631------------------8346699999 Q gi|254780365|r 235 EIQLTDSMRKLSERHDFLAYHFKGH----TY--DCGSKKGFVLA-NIAFALA------------------RQDIRSDIET 289 (299) Q Consensus 235 E~~ltD~i~~l~~~~~v~a~~~~g~----w~--DiG~~~~yl~A-~~~~al~------------------~~~~~~~~~~ 289 (299) .-.|+|++.....+-+=+..+++|+ |. .=|.|..=+.| |||+|.+ ++.....++. T Consensus 175 dp~L~~~~~~~c~k~~G~~~V~NGRFKGG~iTRhYG~P~~gvHA~QmELA~~~Ym~~PE~~tPe~~P~aY~~~~A~~lra 254 (269) T TIGR02017 175 DPALADAVEAVCAKATGYSHVLNGRFKGGWITRHYGQPQNGVHAVQMELAQRGYMEEPEKGTPETWPFAYDPARAAPLRA 254 (269) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 57899999999752155424651012387342168789874038888876412331468885688878986566689999 Q ss_pred HHHHHHHHH Q ss_conf 999998860 Q gi|254780365|r 290 DLKTLVSAL 298 (299) Q Consensus 290 ~l~~~~~~~ 298 (299) .||++|.++ T Consensus 255 ~L~~~L~~~ 263 (269) T TIGR02017 255 VLKQLLQAL 263 (269) T ss_pred HHHHHHHHH T ss_conf 999999999 No 133 >cd03413 CbiK_C Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), C-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases, and is a homomeric enzyme that does not require ATP for its enzymatic activity. Probab=30.95 E-value=39 Score=14.58 Aligned_cols=13 Identities=23% Similarity=0.345 Sum_probs=8.4 Q ss_pred CCCCCCCCCCCCC Q ss_conf 3311127864445 Q gi|254780365|r 129 PFALLLPDMIMSP 141 (299) Q Consensus 129 ~flv~~gD~i~~~ 141 (299) ||+++.|||...+ T Consensus 63 Plm~VAGdHa~nD 75 (103) T cd03413 63 PLMLVAGDHAHND 75 (103) T ss_pred CHHHHHHHHHHHH T ss_conf 1064401757763 No 134 >PRK06263 sdhA succinate dehydrogenase flavoprotein subunit; Reviewed Probab=30.51 E-value=32 Score=15.09 Aligned_cols=20 Identities=20% Similarity=0.351 Sum_probs=14.0 Q ss_pred CEEEEECCCCCCCCCHHHCC Q ss_conf 58999568888334324408 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKV 27 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~ 27 (299) .||||||.|=..+++|.|.. T Consensus 186 AkaVILATGG~~~ly~~~tn 205 (539) T PRK06263 186 AKSTILATGGAGQLYPITSN 205 (539) T ss_pred ECEEEECCCCCCCCCCCCCC T ss_conf 05699898987656877679 No 135 >pfam05060 MGAT2 N-acetylglucosaminyltransferase II (MGAT2). UDP-N-acetyl-D-glucosamine:alpha-6-D-mannoside beta-1,2-N- acetylglucosaminyltransferase II (EC 2.4.1.143) (GnT II/MGAT2) is a Golgi resident enzyme that catalyses an essential step in the biosynthetic pathway leading from high mannose to complex N-linked oligosaccharides. Mutations in the MGAT2 gene lead to congenital disorder of glycosylation (CDG IIa). CDG IIa patients have an increased bleeding tendency, unrelated to coagulation factors. Probab=29.45 E-value=41 Score=14.43 Aligned_cols=41 Identities=15% Similarity=0.331 Sum_probs=29.2 Q ss_pred HHHCCCCCCEEEECCEE-HHHHHHHHHHH-CCCCEEEEECCCC Q ss_conf 24408883335688362-79999999997-8998899981355 Q gi|254780365|r 23 PISKVIPKEMLAIVDRP-VIQYVIEEALE-AGLTDFVFVTGRG 63 (299) Q Consensus 23 P~T~~~pKpllpi~~kp-iI~~~i~~l~~-~Gi~~i~iv~~~~ 63 (299) |+.....--.+.|-+|| -+.++|+.|.+ .||++.++|.++. T Consensus 27 ~l~~d~~viVvqVHnR~~YL~~li~SL~~a~gI~~~LLIfSHD 69 (357) T pfam05060 27 DLPKDHVVIVVQVHNRPQYLRLLIESLRKAKGISETLLIFSHD 69 (357) T ss_pred CCCCCCEEEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEECC T ss_conf 8888977999998882799999999998754723158999666 No 136 >PRK09822 lipopolysaccharide core biosynthesis protein; Provisional Probab=29.22 E-value=42 Score=14.40 Aligned_cols=49 Identities=16% Similarity=0.274 Sum_probs=35.6 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCE-EEEECCC Q ss_conf 589995688883343244088833356883627999999999789988-9998135 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTD-FVFVTGR 62 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~-i~iv~~~ 62 (299) .-.+|+|-|..+|-.|++....+|++-++|- +......+|+- ++++|.. T Consensus 22 ~dvlI~aSGpS~k~~pls~l~~~pvI~vNGs------~s~~l~~~ikp~~YVitD~ 71 (269) T PRK09822 22 EDCIIFLSGPTSRKTPLSLLRMKDVIAVNGS------VQYLLNNNVKPFLYLLTDV 71 (269) T ss_pred CCEEEECCCCCCCCCCHHHCCCCCEEEECCH------HHEEECCCCCEEEEEEECC T ss_conf 6289980377425577122377866887661------1101236751689998131 No 137 >KOG4180 consensus Probab=27.54 E-value=42 Score=14.36 Aligned_cols=31 Identities=23% Similarity=0.408 Sum_probs=21.3 Q ss_pred EEEEECCCCCCCCCHHHCC--CCCCEEEECCEE Q ss_conf 8999568888334324408--883335688362 Q gi|254780365|r 9 KAVFPIAGLGMRFFPISKV--IPKEMLAIVDRP 39 (299) Q Consensus 9 KavIlAaG~GtRl~P~T~~--~pKpllpi~~kp 39 (299) -+||.|||-||=|...++. ..||-+-|+--| T Consensus 107 D~VisvGGDGTfL~Aasrv~~~~~PViGvNtDP 139 (395) T KOG4180 107 DMVISVGGDGTFLLAASRVIDDSKPVIGVNTDP 139 (395) T ss_pred HEEEEECCCCCEEEHHHHHHCCCCCEEEECCCC T ss_conf 179971686313210012221688445534898 No 138 >cd03411 Ferrochelatase_N Ferrochelatase, N-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus. Probab=27.25 E-value=45 Score=14.19 Aligned_cols=19 Identities=5% Similarity=0.296 Sum_probs=8.6 Q ss_pred HHHHHHHHHHCC--CEEEEEE Q ss_conf 378999998319--8489998 Q gi|254780365|r 238 LTDSMRKLSERH--DFLAYHF 256 (299) Q Consensus 238 ltD~i~~l~~~~--~v~a~~~ 256 (299) +.|++..+.++| ++.++++ T Consensus 101 i~~~l~~l~~~g~~~iv~lPl 121 (159) T cd03411 101 IEEALEELKADGVDRIVVLPL 121 (159) T ss_pred HHHHHHHHHHCCCCEEEEEEC T ss_conf 999999998548884999979 No 139 >TIGR02351 thiH thiazole biosynthesis protein ThiH; InterPro: IPR012726 Members of this protein family are the ThiH proteins involved in thiamine biosynthesis. They are homologues of the BioB protein of biotin biosynthesis. Genes encoding these proteins are generally found in operons with other thiamin biosynthesis genes. ; GO: 0005506 iron ion binding, 0051539 4 iron 4 sulfur cluster binding, 0009228 thiamin biosynthetic process. Probab=26.80 E-value=24 Score=15.84 Aligned_cols=32 Identities=25% Similarity=0.490 Sum_probs=19.8 Q ss_pred HHHHHHHHHHCC-CCEEEEECCCCC-CCCHHHCC Q ss_conf 999999999789-988999813554-44101001 Q gi|254780365|r 41 IQYVIEEALEAG-LTDFVFVTGRGK-GLIKDYFD 72 (299) Q Consensus 41 I~~~i~~l~~~G-i~~i~iv~~~~k-~~I~~~f~ 72 (299) |+-=++...+.| +++|++|||.+. -.=.+||. T Consensus 110 i~~E~~aI~k~gPf~~iLlvtGE~e~~~g~~Yi~ 143 (378) T TIGR02351 110 IEREIEAIKKSGPFKEILLVTGESEKAAGVEYIE 143 (378) T ss_pred HHHHHHHHHHCCCCHHEEEECCCCCCCCCHHHHH T ss_conf 9999999862077013001115777558837899 No 140 >TIGR01815 TrpE-clade3 anthranilate synthase; InterPro: IPR010112 This entry represents a small clade of anthranilate synthases from alpha proteobacteria and Nostoc (a cyanobacterium). This enzyme is the first step in the pathway for the biosynthesis of tryprophan from chorismate.; GO: 0004049 anthranilate synthase activity, 0000162 tryptophan biosynthetic process. Probab=26.60 E-value=6.1 Score=19.55 Aligned_cols=156 Identities=14% Similarity=0.139 Sum_probs=83.4 Q ss_pred EEEECCCCCCCCCCCCCC-----CC------CCCCCCC-CCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCEEEEEEC Q ss_conf 799843334233466542-----11------0112233-331112786444544332-2112221223432200112432 Q gi|254780365|r 102 NAVFTWQYERKGLGHAVW-----CA------RNIIGDN-PFALLLPDMIMSPLEGEN-CMANMIKLYEKEGANILAVSEC 168 (299) Q Consensus 102 ~i~~v~Q~~p~Gtg~Ai~-----~a------~~~i~de-~flv~~gD~i~~~~~~~~-~l~~li~~~~~~~~~vi~~~~v 168 (299) ++....+++-||+=||-- +. .+-=.|+ +.++.+||.|+..+.+.. ..-.-.++-.+++++ -++... T Consensus 135 ~lF~s~eD~~LGLYGAFgYDLaFQFe~I~~~LeRPdDQRDlvLyLPDel~vvD~ya~~A~~~~YdFita~gST-egl~~~ 213 (726) T TIGR01815 135 DLFASEEDELLGLYGAFGYDLAFQFEAIEQKLERPDDQRDLVLYLPDELVVVDYYAAKAFLVEYDFITAAGST-EGLEAD 213 (726) T ss_pred HHHCCCCCCCCCCCHHHHHHHHHHCCHHHHCCCCCCCHHHHHHHCCCEEEEECHHHHHHHHHHHHHHHHHCCC-CCCCCC T ss_conf 6316787766662002455454211125431477851347876388526886203421445555576530575-664568 Q ss_pred CHHHCCCCCCEECCCC-CCCCCCEEEEECCCCCCCCCCCCE--EECHHHEE------CHHHHHHHCCCCCCC-------C Q ss_conf 8544331331001333-333530000002577888673423--30202002------723311000122666-------7 Q gi|254780365|r 169 DPQLSCKYGMVQVGKA-IDHQVFHISDMIEKPDSSTFISNF--FINGRYIL------HPDIFSILNDWKENE-------G 232 (299) Q Consensus 169 ~~~~~~~yGvi~~~~~-~~~~~~~I~~~vEKP~~~~~~Snl--~~~GiYi~------~~~i~~~L~~~~~~~-------~ 232 (299) -.+.+.| |....... .|+..+.--++|||=+...-..+| ++.|.-.+ +.++|+-|+.++|++ | T Consensus 214 g~~~pyR-~~~~~pp~n~DH~pGeYa~lVe~Ak~~FRRGDLFEVVPgQ~F~e~Ce~~PS~~f~rLkaiNPsPY~ff~NLG 292 (726) T TIGR01815 214 GEDEPYR-SDTSIPPQNADHKPGEYAKLVEKAKEAFRRGDLFEVVPGQKFYEACEAAPSEVFERLKAINPSPYEFFVNLG 292 (726) T ss_pred CCCCCCC-CCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCCCCCHHHCCCCHHHHHHHHHHCCCCCHHHEECC T ss_conf 8888888-877888555788886248999999863178862565588750022247832789998752688304202457 Q ss_pred CCCCCHHHHHHHHHH--CCCEEEEEEEEE Q ss_conf 675003789999983--198489998137 Q gi|254780365|r 233 KGEIQLTDSMRKLSE--RHDFLAYHFKGH 259 (299) Q Consensus 233 ~gE~~ltD~i~~l~~--~~~v~a~~~~g~ 259 (299) +|||=+--.-++.+. .++|-.++++|+ T Consensus 293 ~gEylvgASPEMfVRV~GrRvETCPISGT 321 (726) T TIGR01815 293 RGEYLVGASPEMFVRVEGRRVETCPISGT 321 (726) T ss_pred CCCEEEEECCCCEEEECCCEEEECCCCCC T ss_conf 73078861774268863835772577654 No 141 >COG1581 Ssh10b Archaeal DNA-binding protein [Transcription] Probab=25.99 E-value=48 Score=14.05 Aligned_cols=31 Identities=16% Similarity=0.489 Sum_probs=24.4 Q ss_pred CCCEEEECCEEHHHHHHHHH--HHCCCCEEEEE Q ss_conf 83335688362799999999--97899889998 Q gi|254780365|r 29 PKEMLAIVDRPVIQYVIEEA--LEAGLTDFVFV 59 (299) Q Consensus 29 pKpllpi~~kpiI~~~i~~l--~~~Gi~~i~iv 59 (299) +++-+-++.||...|++.-+ ...|.+++++- T Consensus 3 ~envV~vG~KPvmNYVlAvlt~fn~g~~eViiK 35 (91) T COG1581 3 EENVVLVGKKPVMNYVLAVLTQFNEGADEVIIK 35 (91) T ss_pred CCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 753799857644999999999987689789998 No 142 >TIGR00763 lon ATP-dependent protease La; InterPro: IPR004815 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This signature defines the bacterial and eukaryotic lon proteases, which are ATP-dependent serine peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SF). This family of sequences does not include the archaeal lon homologs, IPR004663 from INTERPRO. In the eukaryotes the majority of the proteins are located in the mitochondrial matrix , . In yeast, Pim1, is located in the mitochondrial matrix, is required for mitochondrial function, is constitutively expressed but is increased after thermal stress, suggesting that Pim1 may play a role in the heat shock response .; GO: 0004176 ATP-dependent peptidase activity, 0005524 ATP binding, 0006510 ATP-dependent proteolysis. Probab=24.84 E-value=34 Score=14.96 Aligned_cols=17 Identities=18% Similarity=0.243 Sum_probs=11.4 Q ss_pred EEECCCHHHHHHHHHHH Q ss_conf 86439988999999998 Q gi|254780365|r 260 TYDCGSKKGFVLANIAF 276 (299) Q Consensus 260 w~DiG~~~~yl~A~~~~ 276 (299) .-++|...++.-|.+.+ T Consensus 236 ~~~P~~LaD~~Aa~~~~ 252 (941) T TIGR00763 236 IDEPGRLADLVAASLQL 252 (941) T ss_pred CCCCHHHHHHHHHHHHH T ss_conf 04622588899986320 No 143 >COG1924 Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) [Lipid metabolism] Probab=24.79 E-value=50 Score=13.91 Aligned_cols=61 Identities=25% Similarity=0.262 Sum_probs=40.8 Q ss_pred CCCEEEEECCCC--CCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEE--EEECCCCCCCCHHHCCC Q ss_conf 545899956888--833432440888333568836279999999997899889--99813554441010011 Q gi|254780365|r 6 KVRKAVFPIAGL--GMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDF--VFVTGRGKGLIKDYFDI 73 (299) Q Consensus 6 ~i~KavIlAaG~--GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i--~iv~~~~k~~I~~~f~~ 73 (299) .-||+|+|.-|. +++..+.|...|++ +-.+.-.++++-... .+| ..+|||+++.+..||.. T Consensus 145 TtTK~VLm~d~~~I~~~~~~~t~g~p~~------~~~l~~~le~l~~~~-~~I~~~~~TGYGR~~v~~~~~a 209 (396) T COG1924 145 TTTKAVLMEDGKEILYGFYVSTKGRPIA------EKALKEALEELGEKL-EEILGLGVTGYGRNLVGAALGA 209 (396) T ss_pred CCEEEEEEECCCEEEEEEEECCCCCHHH------HHHHHHHHHHCCCCH-HEEEEEEEECCCHHHHHHHHCC T ss_conf 4135999837975788899757887668------999999999806471-2122243221208776333057 No 144 >KOG3349 consensus Probab=24.50 E-value=51 Score=13.88 Aligned_cols=23 Identities=26% Similarity=0.580 Sum_probs=16.7 Q ss_pred HHHHHHHHHCCCCEEEEECCCCC Q ss_conf 99999999789988999813554 Q gi|254780365|r 42 QYVIEEALEAGLTDFVFVTGRGK 64 (299) Q Consensus 42 ~~~i~~l~~~Gi~~i~iv~~~~k 64 (299) +-..+++.+-|+++++|=.|.++ T Consensus 25 ~~~~~~L~k~G~~kLiiQ~Grg~ 47 (170) T KOG3349 25 EEFLQELQKRGFTKLIIQIGRGQ 47 (170) T ss_pred HHHHHHHHHCCCCEEEEEECCCC T ss_conf 99999999739638998735886 No 145 >TIGR00461 gcvP glycine dehydrogenase; InterPro: IPR003437 This family consists of glycine cleavage system P-proteins (1.4.4.2 from EC) from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex (2.1.2.10 from EC (GDC) also annotated as glycine cleavage system or glycine synthase. The P protein binds the alpha-amino group of glycine through its pyridoxal phosphate cofactor, carbon dioxide is released and the remaining methylamin moiety is then transferred to the lipoamide cofactor of the H protein. GDC consists of four proteins P, H, L and T . The reaction catalysed by this protein is: Glycine + lipoylprotein = S-aminomethyldihydrolipoylprotein + CO2 ; GO: 0004375 glycine dehydrogenase (decarboxylating) activity, 0006544 glycine metabolic process, 0005961 glycine dehydrogenase complex (decarboxylating). Probab=24.11 E-value=48 Score=13.99 Aligned_cols=14 Identities=21% Similarity=0.152 Sum_probs=6.7 Q ss_pred CCHHHHHHHHHHCC Q ss_conf 00378999998319 Q gi|254780365|r 236 IQLTDSMRKLSERH 249 (299) Q Consensus 236 ~~ltD~i~~l~~~~ 249 (299) ++||+++-.-+++. T Consensus 362 h~LT~iLA~gl~~~ 375 (965) T TIGR00461 362 HRLTSILAAGLKRK 375 (965) T ss_pred HHHHHHHHHHHHHC T ss_conf 88999999997530 No 146 >KOG2638 consensus Probab=23.60 E-value=53 Score=13.77 Aligned_cols=47 Identities=11% Similarity=0.108 Sum_probs=32.7 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHHHH---H-HHCCCCEEEEE Q ss_conf 999568888334324408883335688-36279999999---9-97899889998 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIV-DRPVIQYVIEE---A-LEAGLTDFVFV 59 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~-~kpiI~~~i~~---l-~~~Gi~~i~iv 59 (299) .+=|-||.||-|. ..-||.+++|. |...+|.++.+ | ...+++--.+. T Consensus 106 vlKLNGGlGttmG---c~gPKS~ieVR~g~tFLDL~V~QIe~LN~~Y~~dVPlvL 157 (498) T KOG2638 106 VLKLNGGLGTTMG---CKGPKSVIEVRDGLTFLDLTVRQIENLNKTYNVDVPLVL 157 (498) T ss_pred EEEECCCCCCCCC---CCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 9994588677545---678730699728972157899999987763289977798 No 147 >TIGR02468 sucrsPsyn_pln sucrose phosphate synthase; InterPro: IPR012819 Sucrose occupies a central position in the metabolic pathways of all plants and plays a variety of roles including transport sugar, storage reserve, compatible solute, and signal compound . This compound is produced by the combined action of two enzymes, sucrose-phosphate synthase (2.4.1.14 from EC) and sucrose-phosphate phosphatase (3.1.3.24 from EC), via the intermediate sucrose 6-phosphate. Several studies have shown a positive correlation between sucrose-phosphate synthase activity and plant growth rate and yield in agronomically important plants, though direct proof of a causal link is lacking. This entry represents sucrose-phosphate synthase from plants, which is known to exist in multigene families in several species of both monocots and dicots. The enzyme contains an N-terminal domain glucosyltransferase domain, a variable linker region, and a C-terminal domain similar to that of sucrose-phosphate phosphatase, the next and final enzyme of sucrose biosynthesis. The C-terminal domain is likely to serve a binding - not a catalytic - function, as sucrose-phosphate phosphatase is always encoded by a distinct protein. ; GO: 0046524 sucrose-phosphate synthase activity, 0005985 sucrose metabolic process. Probab=23.59 E-value=43 Score=14.34 Aligned_cols=81 Identities=12% Similarity=0.325 Sum_probs=42.0 Q ss_pred HEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCC-CEEEEEEEEEEEECCCHHHHHHHHH--HHHHCCHHHHHHHHHH Q ss_conf 002723311000122666767500378999998319-8489998137864399889999999--9863183466999999 Q gi|254780365|r 214 YILHPDIFSILNDWKENEGKGEIQLTDSMRKLSERH-DFLAYHFKGHTYDCGSKKGFVLANI--AFALARQDIRSDIETD 290 (299) Q Consensus 214 Yi~~~~i~~~L~~~~~~~~~gE~~ltD~i~~l~~~~-~v~a~~~~g~w~DiG~~~~yl~A~~--~~al~~~~~~~~~~~~ 290 (299) ||-+..+|.+|.+.-.++=..-++++-++--.+..| .|.-+.++|.+-|.|....+|-..+ =+.|....++ ++. T Consensus 272 Y~~KE~LWP~i~EFVDGAL~Hi~~mSk~LGEqiG~G~pVWP~~IHGHYADAGd~AAlLSGALNVPMvlTGHsLG---RdK 348 (1072) T TIGR02468 272 YIAKEELWPYIPEFVDGALSHIVNMSKVLGEQIGSGKPVWPYVIHGHYADAGDSAALLSGALNVPMVLTGHSLG---RDK 348 (1072) T ss_pred HCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHCCHHHHHHCCCCC---HHH T ss_conf 11400057865137767899999999987552078842156265063004689999874110212576154334---236 Q ss_pred HHHHHHH Q ss_conf 9999886 Q gi|254780365|r 291 LKTLVSA 297 (299) Q Consensus 291 l~~~~~~ 297 (299) |..||+- T Consensus 349 LEQLLkq 355 (1072) T TIGR02468 349 LEQLLKQ 355 (1072) T ss_pred HHHHHHH T ss_conf 9999971 No 148 >PRK09231 fumarate reductase flavoprotein subunit; Validated Probab=23.54 E-value=48 Score=14.01 Aligned_cols=20 Identities=20% Similarity=0.165 Sum_probs=16.2 Q ss_pred CEEEEECCCCCCCCCHHHCC Q ss_conf 58999568888334324408 Q gi|254780365|r 8 RKAVFPIAGLGMRFFPISKV 27 (299) Q Consensus 8 ~KavIlAaG~GtRl~P~T~~ 27 (299) .||||||.|=..|+++.|.+ T Consensus 185 AkaVILATGG~g~l~~~~tn 204 (582) T PRK09231 185 ANAVVMATGGAGRVYRYNTN 204 (582) T ss_pred ECEEEECCCCCCCCCCCCCC T ss_conf 37699867887655357788 No 149 >COG0381 WecB UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane] Probab=22.95 E-value=53 Score=13.78 Aligned_cols=21 Identities=14% Similarity=0.234 Sum_probs=14.9 Q ss_pred HHHHHHHCC-CCEEEEECCCCC Q ss_conf 999999789-988999813554 Q gi|254780365|r 44 VIEEALEAG-LTDFVFVTGRGK 64 (299) Q Consensus 44 ~i~~l~~~G-i~~i~iv~~~~k 64 (299) ++.++.+.+ ++.++++||-.. T Consensus 22 li~~~~~~~~~~~~vi~TGQH~ 43 (383) T COG0381 22 LVKALEKDPDFELIVIHTGQHR 43 (383) T ss_pred HHHHHHHCCCCCEEEEEECCCC T ss_conf 9999985899735999706654 No 150 >TIGR02852 spore_dpaB dipicolinic acid synthetase, B subunit; InterPro: IPR014214 Members of this entry represent the B subunit of dipicolinic acid synthetase, an enzyme that synthesizes a small molecule that appears to confer heat stability to bacterial endospores such as those of Bacillus subtilis. The A and B subunits are together in what was originally designated the spoVF locus for stage V of endospore formation.. Probab=22.83 E-value=55 Score=13.68 Aligned_cols=59 Identities=12% Similarity=-0.009 Sum_probs=32.8 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCCCCCCCC Q ss_conf 9999999978998899981355444101001174544456541870000113441124857998433342334 Q gi|254780365|r 42 QYVIEEALEAGLTDFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQYERKGL 114 (299) Q Consensus 42 ~~~i~~l~~~Gi~~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q~~p~Gt 114 (299) .=.|++|+.-|.+=+=|++ +.=+.-.--|+...++.+.+++- . ...-|..+.+=||.|- T Consensus 18 ~p~le~L~~~GAev~P~~s-~~V~~T~TrFG~~~dwi~~~E~i------------T-G~~~i~ti~~AEP~GP 76 (188) T TIGR02852 18 MPQLEKLVEEGAEVTPIVS-ETVQTTDTRFGDGADWIKKIEEI------------T-GKKAINTIVEAEPFGP 76 (188) T ss_pred HHHHHHHHHCCCEEEEEEE-ECCCCCCCCCCCHHHHHHHHHHH------------C-CCHHHHCCCCCCCCCC T ss_conf 8999999966984888863-00043565568618899998863------------0-6311300345545976 No 151 >TIGR02429 pcaI_scoA_fam 3-oxoacid CoA-transferase, A subunit; InterPro: IPR012792 CoA-transferases catalyse the reversible transfer of of coenzyme A from CoA-thioesters to free acids, and can be divided into three families . Family I includes transferases for 3-oxoacids (2.8.3.5 from EC, 2.8.3.6 from EC), short-chain fatty acids (2.8.3.8 from EC, 2.8.3.9 from EC) and glutaconate (2.8.3.12 from EC). Most of the family I enzymes use acetyl-CoA or succinyl-CoA as CoA donors, and are composed of two separate polypeptides, subunits A and B, which generally aggregate as heterodimers or heterotetramers. The eukaryotic enzymes, however, are generally composed of a single two-domain polypetide representing a fusion of the A and B subunits. The transfer of CoA from one substrate to another occurs via a ping pong mechanism which involves the formation of thioester bond between CoA and a conserved glutamate residue at the active site of the enzyme . This entry represents the CoA-binding A subunit of family I CoA-transferases. This domain forms a three-layer alpha-beta-alpha sandwich where the central layer is an all parallel beta-sheet, against which helices pack from both sides , . The active site is thought to be located at the interface of the A and B subunits and formed by loops from both subunits.; GO: 0008410 CoA-transferase activity. Probab=22.73 E-value=55 Score=13.66 Aligned_cols=35 Identities=29% Similarity=0.493 Sum_probs=26.9 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 99956888833432440888333568836279999999997899889998135 Q gi|254780365|r 10 AVFPIAGLGMRFFPISKVIPKEMLAIVDRPVIQYVIEEALEAGLTDFVFVTGR 62 (299) Q Consensus 10 avIlAaG~GtRl~P~T~~~pKpllpi~~kpiI~~~i~~l~~~Gi~~i~iv~~~ 62 (299) |=||.+|+||-=.|. -+|+.|++.|.++..||+|- T Consensus 20 aTi~igGFG~~G~P~------------------eLi~aLi~~g~k~LTivsNN 54 (222) T TIGR02429 20 ATILIGGFGTAGLPE------------------ELIDALIETGAKDLTIVSNN 54 (222) T ss_pred CEEEEECCCCCCCHH------------------HHHHHHHHHCCCCCEEEECC T ss_conf 669862779888758------------------99999997067777788637 No 152 >cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the Probab=22.56 E-value=55 Score=13.64 Aligned_cols=101 Identities=16% Similarity=0.201 Sum_probs=51.0 Q ss_pred EEEECCEE-HHHHHHHHHHHCCCC--EEEEECCCCCCCCHHHCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEEECCC Q ss_conf 35688362-799999999978998--899981355444101001174544456541870000113441124857998433 Q gi|254780365|r 32 MLAIVDRP-VIQYVIEEALEAGLT--DFVFVTGRGKGLIKDYFDIQFELEQSLRKRNKKAELTLLAESIPSIGNAVFTWQ 108 (299) Q Consensus 32 llpi~~kp-iI~~~i~~l~~~Gi~--~i~iv~~~~k~~I~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~i~~v~Q 108 (299) .+|..+.+ .|.-.|+.+.++-.. ||+||-+...+. ..+..+.+... .......+.. T Consensus 2 iIp~~N~~~~l~~~l~sl~~Q~~~~~eIivvdd~S~d~-------t~~~~~~~~~~--------------~~~~~~~~~~ 60 (180) T cd06423 2 IVPAYNEEAVIERTIESLLALDYPKLEVIVVDDGSTDD-------TLEILEELAAL--------------YIRRVLVVRD 60 (180) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHH-------HHHHHHHHHHC--------------CCCEEEEECC T ss_conf 99977978999999999981899981899998999757-------89999998621--------------7971898447 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3423346654211011223333111278644454433221122212234 Q gi|254780365|r 109 YERKGLGHAVWCARNIIGDNPFALLLPDMIMSPLEGENCMANMIKLYEK 157 (299) Q Consensus 109 ~~p~Gtg~Ai~~a~~~i~de~flv~~gD~i~~~~~~~~~l~~li~~~~~ 157 (299) .+..|.++|.-.+......+-++.+.+|+++..+ .++.|+....+ T Consensus 61 ~~~~g~~~a~N~g~~~a~~d~i~~lD~D~~~~~~----~l~~~~~~~~~ 105 (180) T cd06423 61 KENGGKAGALNAGLRHAKGDIVVVLDADTILEPD----ALKRLVVPFFA 105 (180) T ss_pred CCCCCHHHHHHHCCCCCCCCEEEEECCCCCCCHH----HHHHHHHHHHH T ss_conf 7767799997525102675579874387200856----99999999987 No 153 >TIGR02640 gas_vesic_GvpN gas vesicle protein GvpN; InterPro: IPR013462 The GvpN protein is associated with the production of gas vesicles produced in some prokaryotes to give cells buoyancy , . It belongs to a larger family of ATPases .; GO: 0000166 nucleotide binding, 0005524 ATP binding, 0031412 gas vesicle organization and biogenesis, 0031411 gas vesicle. Probab=22.16 E-value=34 Score=14.94 Aligned_cols=41 Identities=22% Similarity=0.267 Sum_probs=24.4 Q ss_pred CCCCCCCEEEECCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 41124857998433342334665---42110112233331112786444 Q gi|254780365|r 95 ESIPSIGNAVFTWQYERKGLGHA---VWCARNIIGDNPFALLLPDMIMS 140 (299) Q Consensus 95 ~~~~~~~~i~~v~Q~~p~Gtg~A---i~~a~~~i~de~flv~~gD~i~~ 140 (299) ..+..+-.+|+. .|-|||-- ...|+. -|-|+++++||+=+. T Consensus 16 ~yL~~G~PvHl~---GPaG~GKT~LA~hvA~~--r~RPV~l~~Gd~eL~ 59 (265) T TIGR02640 16 RYLKSGYPVHLR---GPAGTGKTTLAMHVARK--RDRPVVLINGDAELT 59 (265) T ss_pred HHHCCCCCEEEE---CCCCCCHHHHHHHHHHH--CCCCEEEEECCCCCC T ss_conf 632278866744---78885568999999973--689689986582326 No 154 >TIGR00423 TIGR00423 conserved hypothetical protein TIGR00423; InterPro: IPR005244 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=22.05 E-value=57 Score=13.58 Aligned_cols=28 Identities=25% Similarity=0.262 Sum_probs=22.2 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCH Q ss_conf 9999999997899889998135544410 Q gi|254780365|r 41 IQYVIEEALEAGLTDFVFVTGRGKGLIK 68 (299) Q Consensus 41 I~~~i~~l~~~Gi~~i~iv~~~~k~~I~ 68 (299) |..-+.++.+.|++||+|+-|.+.+.=. T Consensus 41 I~~Kv~ea~~~G~tE~~iQGGlnP~~~~ 68 (331) T TIGR00423 41 ILRKVKEAVAKGATEICIQGGLNPQLDI 68 (331) T ss_pred HHHHHHHHHHCCCEEEEEECCCCCCCCC T ss_conf 9999999997198278852342788764 No 155 >COG0567 SucA 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes [Energy production and conversion] Probab=21.96 E-value=57 Score=13.57 Aligned_cols=27 Identities=19% Similarity=0.329 Sum_probs=22.2 Q ss_pred EEHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 627999999999789988999813554 Q gi|254780365|r 38 RPVIQYVIEEALEAGLTDFVFVTGRGK 64 (299) Q Consensus 38 kpiI~~~i~~l~~~Gi~~i~iv~~~~k 64 (299) .|+++.+|+.+.+.|+++|+|=.-|.. T Consensus 213 ip~l~~~i~~~~~~G~~~vviGMaHRG 239 (906) T COG0567 213 IPMLDELIDRAGKQGVKEVVIGMAHRG 239 (906) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 899999999988668230784365542 No 156 >TIGR01514 NAPRTase nicotinate phosphoribosyltransferase; InterPro: IPR006406 This family represents nicotinate phosphoribosyltransferase, the first enzyme in the salvage pathway of NAD biosynthesis from nicontinate (niacin). Members are primarily proteobacterial but also include yeasts and Methanosarcina acetivorans. A related family, apparently non-overlapping in species distribution, is IPR006405 from INTERPRO. Members of that family differ in substantially in sequence and have a long C-terminal extension missing from this family, but are proposed also to act as nicotinate phosphoribosyltransferase. ; GO: 0004516 nicotinate phosphoribosyltransferase activity, 0019357 nicotinate nucleotide biosynthetic process. Probab=21.29 E-value=20 Score=16.37 Aligned_cols=40 Identities=25% Similarity=0.301 Sum_probs=31.7 Q ss_pred CCHHHHH--HHHHHC-CCEEEEEEEEEEEECCCHHHHHHHHHH Q ss_conf 0037899--999831-984899981378643998899999999 Q gi|254780365|r 236 IQLTDSM--RKLSER-HDFLAYHFKGHTYDCGSKKGFVLANIA 275 (299) Q Consensus 236 ~~ltD~i--~~l~~~-~~v~a~~~~g~w~DiG~~~~yl~A~~~ 275 (299) +-|||.+ +.-.++ ++-+|-.+.|-++|.|.|..|.+=-.+ T Consensus 274 ~ALTDt~t~DaFLrdF~~~~A~~y~GlRHDSGDP~~wg~K~~~ 316 (430) T TIGR01514 274 IALTDTLTTDAFLRDFSKKFAKLYDGLRHDSGDPVEWGEKAIA 316 (430) T ss_pred ECCCCCCCCCHHHHCCCHHHHHHCCCCCCCCCCCHHHHHHHHH T ss_conf 3034100001463116854676617762047885168779998 No 157 >TIGR01854 lipid_A_lpxH UDP-2,3-diacylglucosamine hydrolase; InterPro: IPR010138 This entry represents LpxH, or UDP-2,3-diacylglucosamine hydrolase (or pyrophosphatase), which is an essential enzyme in Escherichia coli that catalyses the fourth step in lipid A biosynthesis. This reaction involves pyrophosphate bond hydrolysis of the precursor UDP-2,3-diacylglucosamine to yield 2,3-diacylglucosamine 1-phosphate and UMP . Several Gram-negative bacteria have homologues of LpxH and/or LpxH2, while others have only distant orthologues of LpxH or LpxH2, or lack these enzymes altogether .; GO: 0016462 pyrophosphatase activity, 0009245 lipid A biosynthetic process, 0005737 cytoplasm. Probab=20.80 E-value=36 Score=14.81 Aligned_cols=18 Identities=17% Similarity=0.174 Sum_probs=14.8 Q ss_pred CCCCCCCCCCCCCCCCCC Q ss_conf 333311127864445443 Q gi|254780365|r 127 DNPFALLLPDMIMSPLEG 144 (299) Q Consensus 127 de~flv~~gD~i~~~~~~ 144 (299) +++.|+|.||.||+.+.. T Consensus 106 G~k~Ll~HGDtLCTdD~a 123 (241) T TIGR01854 106 GQKVLLMHGDTLCTDDEA 123 (241) T ss_pred CCCEEEEECCCCCHHHHH T ss_conf 882799537863020189 No 158 >COG3051 CitF Citrate lyase, alpha subunit [Energy production and conversion] Probab=20.40 E-value=7 Score=19.18 Aligned_cols=17 Identities=18% Similarity=0.352 Sum_probs=10.9 Q ss_pred EEEEEEEEEEECCCHHH Q ss_conf 89998137864399889 Q gi|254780365|r 252 LAYHFKGHTYDCGSKKG 268 (299) Q Consensus 252 ~a~~~~g~w~DiG~~~~ 268 (299) .+|--+|..+-.||--. T Consensus 268 SgYFkdGFs~QTGtGGA 284 (513) T COG3051 268 SGYFKDGFSMQTGTGGA 284 (513) T ss_pred CCCCCCCEEEECCCCCH T ss_conf 77534760565388835 No 159 >KOG1447 consensus Probab=20.36 E-value=51 Score=13.86 Aligned_cols=46 Identities=20% Similarity=0.231 Sum_probs=27.5 Q ss_pred CCCCHHHHHHHHHHCCCEEEEEEE--EEEEECCCHHH-HHHHHHHHHHC Q ss_conf 750037899999831984899981--37864399889-99999998631 Q gi|254780365|r 234 GEIQLTDSMRKLSERHDFLAYHFK--GHTYDCGSKKG-FVLANIAFALA 279 (299) Q Consensus 234 gE~~ltD~i~~l~~~~~v~a~~~~--g~w~DiG~~~~-yl~A~~~~al~ 279 (299) .|-+.-.++..+...-+|.+..++ |-...|-+... ..+|...+.|+ T Consensus 320 ~EdqV~~Af~ilTaDPkVk~iLvNiFGGIVNCAtIANGiv~A~~kl~Ln 368 (412) T KOG1447 320 KEDQVYQAFKILTADPKVKAILVNIFGGIVNCATIANGIVKACRKLELN 368 (412) T ss_pred CHHHHHHHHHHHCCCCCEEEEEEEHHCCEEHHHHHHHHHHHHHHHHCCC T ss_conf 3889977764312588403999830046311476765799998862578 No 160 >cd00419 Ferrochelatase_C Ferrochelatase, C-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus. Probab=20.24 E-value=62 Score=13.35 Aligned_cols=24 Identities=17% Similarity=0.326 Sum_probs=11.9 Q ss_pred HHHHHHHHHCC--CEEEEEEEEEEEEC Q ss_conf 78999998319--84899981378643 Q gi|254780365|r 239 TDSMRKLSERH--DFLAYHFKGHTYDC 263 (299) Q Consensus 239 tD~i~~l~~~~--~v~a~~~~g~w~Di 263 (299) .|++..|.++| .+..++ -|.-.|+ T Consensus 80 ~~~l~~l~~~G~~~v~v~p-~gF~sD~ 105 (135) T cd00419 80 DDALEELAKEGVKNVVVVP-IGFVSDH 105 (135) T ss_pred HHHHHHHHHCCCCCEEEEC-CEEECCC T ss_conf 9999999866997199989-8530365 Done!