Query gi|254780942|ref|YP_003065355.1| UDP-N-acetylglucosamine pyrophosphorylase protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 442 No_of_seqs 249 out of 10187 Neff 7.9 Searched_HMMs 39220 Date Mon May 30 01:55:56 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780942.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01173 glmU UDP-N-acetylglu 100.0 0 0 934.1 26.4 436 5-440 1-461 (461) 2 PRK09451 glmU bifunctional N-a 100.0 0 0 850.4 27.8 437 1-441 2-456 (456) 3 COG1207 GlmU N-acetylglucosami 100.0 0 0 826.8 27.1 438 4-442 2-460 (460) 4 PRK05293 glgC glucose-1-phosph 100.0 0 0 436.3 22.5 335 2-371 1-359 (381) 5 TIGR01208 rmlA_long glucose-1- 100.0 0 0 436.0 17.6 343 7-388 2-359 (361) 6 COG1208 GCD1 Nucleoside-diphos 100.0 0 0 421.4 22.2 349 4-413 1-358 (358) 7 TIGR02091 glgC glucose-1-phosp 100.0 0 0 385.9 15.0 316 5-343 1-375 (421) 8 PRK00844 glgC glucose-1-phosph 100.0 0 0 377.2 20.4 333 3-370 4-381 (409) 9 PRK04928 consensus 100.0 0 0 374.8 20.2 338 3-370 2-377 (405) 10 cd02540 GT2_GlmU_N_bac N-termi 100.0 0 0 378.2 17.0 226 7-234 1-229 (229) 11 PRK02862 glgC glucose-1-phosph 100.0 0 0 374.4 20.4 381 3-412 2-429 (429) 12 PRK03282 consensus 100.0 0 0 373.8 19.4 338 3-370 5-379 (406) 13 PRK00725 glgC glucose-1-phosph 100.0 0 0 361.5 17.2 345 1-370 16-397 (431) 14 PRK01884 consensus 100.0 0 0 361.2 16.8 334 1-369 14-394 (435) 15 PRK03701 consensus 100.0 0 0 347.2 17.2 322 3-342 18-386 (431) 16 KOG1322 consensus 100.0 0 0 346.8 15.0 223 3-240 8-242 (371) 17 COG0448 GlgC ADP-glucose pyrop 100.0 0 0 322.4 15.9 321 1-344 2-352 (393) 18 COG1209 RfbA dTDP-glucose pyro 100.0 0 0 319.5 15.1 228 5-243 1-237 (286) 19 cd02541 UGPase_prokaryotic Pro 100.0 0 0 311.6 16.6 228 5-241 1-264 (267) 20 cd02538 G1P_TT_short G1P_TT_sh 100.0 0 0 311.9 13.9 229 5-242 1-238 (240) 21 PRK13389 UTP--glucose-1-phosph 100.0 0 0 305.3 15.9 229 1-238 4-276 (302) 22 PRK10122 UTP--glucose-1-phosph 100.0 0 0 304.7 15.9 227 4-238 3-273 (297) 23 cd06425 M1P_guanylylT_B_like_N 100.0 0 0 305.7 13.5 222 5-241 1-232 (233) 24 cd04189 G1P_TT_long G1P_TT_lon 100.0 0 0 302.9 14.5 228 5-243 1-235 (236) 25 cd06428 M1P_guanylylT_A_like_N 100.0 0 0 296.7 15.1 225 7-241 1-257 (257) 26 cd03353 LbH_GlmU_C N-acetyl-gl 100.0 0 0 297.1 13.5 176 253-428 1-193 (193) 27 pfam00483 NTP_transferase Nucl 100.0 0 0 292.5 13.3 228 7-240 2-243 (247) 28 cd06915 NTP_transferase_WcbM_l 100.0 0 0 289.2 14.8 215 7-238 1-223 (223) 29 cd02524 G1P_cytidylyltransfera 100.0 0 0 285.4 15.9 223 7-248 1-252 (253) 30 cd04181 NTP_transferase NTP_tr 100.0 0 0 280.9 13.7 206 7-225 1-214 (217) 31 KOG1460 consensus 100.0 1.4E-45 0 279.2 13.8 314 5-332 3-352 (407) 32 cd06426 NTP_transferase_like_2 100.0 1.4E-45 0 277.5 15.2 212 8-238 2-220 (220) 33 cd04183 GT2_BcE_like GT2_BcbE_ 100.0 4.3E-44 0 269.3 16.1 218 8-235 2-231 (231) 34 cd06422 NTP_transferase_like_1 100.0 1.1E-43 0 267.2 15.0 212 6-237 1-221 (221) 35 KOG1461 consensus 100.0 4E-41 1.4E-45 252.5 19.8 333 3-348 23-395 (673) 36 cd02523 PC_cytidylyltransferas 100.0 1.8E-41 0 254.5 13.0 221 7-238 1-229 (229) 37 TIGR01099 galU UTP-glucose-1-p 100.0 9.8E-40 2.5E-44 244.6 11.8 222 5-234 2-267 (270) 38 KOG1462 consensus 100.0 9.1E-39 2.3E-43 239.1 10.7 335 4-365 9-413 (433) 39 COG1210 GalU UDP-glucose pyrop 100.0 1.2E-36 3E-41 227.1 14.5 227 1-236 1-264 (291) 40 cd03352 LbH_LpxD UDP-3-O-acyl- 100.0 2.3E-37 5.9E-42 231.1 10.5 170 257-433 10-204 (205) 41 cd02508 ADP_Glucose_PP ADP-glu 100.0 9.9E-36 2.5E-40 221.8 12.4 178 7-225 1-198 (200) 42 PRK12461 UDP-N-acetylglucosami 100.0 6.3E-36 1.6E-40 222.9 11.0 178 256-435 2-194 (256) 43 cd03351 LbH_UDP-GlcNAc_AT UDP- 100.0 6.3E-36 1.6E-40 222.9 10.9 177 257-435 2-194 (254) 44 cd04197 eIF-2B_epsilon_N The N 100.0 6.9E-35 1.8E-39 217.0 13.8 176 5-183 1-217 (217) 45 cd04198 eIF-2B_gamma_N The N-t 100.0 6.1E-35 1.6E-39 217.3 12.7 174 5-183 1-214 (214) 46 PRK05289 UDP-N-acetylglucosami 100.0 2.9E-35 7.4E-40 219.1 10.9 178 256-435 4-197 (261) 47 cd05636 LbH_G1P_TT_C_like Puta 100.0 3.4E-35 8.7E-40 218.7 10.6 139 275-413 7-163 (163) 48 PRK00892 lpxD UDP-3-O-[3-hydro 100.0 2.9E-34 7.5E-39 213.4 8.4 171 262-438 123-319 (343) 49 TIGR01852 lipid_A_lpxA acyl-[a 100.0 4.5E-34 1.2E-38 212.3 8.8 171 258-430 2-189 (257) 50 TIGR02623 G1P_cyt_trans glucos 100.0 4.8E-33 1.2E-37 206.5 10.9 226 5-248 1-253 (256) 51 cd02507 eIF-2B_gamma_N_like Th 100.0 1.7E-32 4.2E-37 203.4 12.8 174 5-183 1-216 (216) 52 TIGR01853 lipid_A_lpxD UDP-3-O 100.0 2.1E-33 5.4E-38 208.5 7.1 171 260-436 124-322 (336) 53 TIGR03308 phn_thr-fam phosphon 100.0 3.2E-29 8.1E-34 184.8 10.9 156 269-431 4-160 (204) 54 COG1043 LpxA Acyl-[acyl carrie 100.0 2.5E-28 6.3E-33 179.7 8.3 174 255-430 4-193 (260) 55 COG1044 LpxD UDP-3-O-[3-hydrox 99.9 7E-28 1.8E-32 177.1 8.7 169 261-435 123-316 (338) 56 TIGR01207 rmlA glucose-1-phosp 99.9 3.6E-27 9.1E-32 173.1 9.4 228 7-243 2-238 (286) 57 COG1213 Predicted sugar nucleo 99.9 2.6E-26 6.5E-31 168.2 11.8 219 3-241 2-228 (239) 58 cd02509 GDP-M1P_Guanylyltransf 99.9 6.9E-27 1.8E-31 171.4 8.4 186 5-191 1-210 (274) 59 TIGR03532 DapD_Ac 2,3,4,5-tetr 99.9 6.4E-26 1.6E-30 165.9 10.8 127 273-433 86-214 (231) 60 cd04645 LbH_gamma_CA_like Gamm 99.9 1.3E-24 3.4E-29 158.5 10.2 72 354-433 60-133 (153) 61 cd03350 LbH_THP_succinylT 2,3, 99.9 1.1E-24 2.7E-29 159.0 9.6 72 354-425 49-121 (139) 62 cd05635 LbH_unknown Uncharacte 99.9 2.6E-25 6.5E-30 162.5 4.8 87 284-372 10-97 (101) 63 cd04650 LbH_FBP Ferripyochelin 99.9 3.6E-24 9.2E-29 156.0 10.1 71 354-432 61-133 (154) 64 cd04745 LbH_paaY_like paaY-lik 99.9 4E-24 1E-28 155.7 10.1 72 354-433 61-134 (155) 65 TIGR02092 glgD glucose-1-phosp 99.9 2.2E-23 5.6E-28 151.5 12.0 344 3-385 1-380 (383) 66 TIGR01853 lipid_A_lpxD UDP-3-O 99.9 6.6E-24 1.7E-28 154.5 6.7 163 258-425 105-293 (336) 67 PRK00892 lpxD UDP-3-O-[3-hydro 99.9 6.4E-23 1.6E-27 148.9 9.2 165 257-425 101-289 (343) 68 COG1044 LpxD UDP-3-O-[3-hydrox 99.9 2.1E-22 5.2E-27 146.0 9.1 160 262-424 106-289 (338) 69 PRK13627 carnitine operon prot 99.9 5.8E-22 1.5E-26 143.4 10.7 72 354-433 71-144 (196) 70 cd03358 LbH_WxcM_N_like WcxM-l 99.9 4.8E-22 1.2E-26 143.9 9.5 53 378-430 66-118 (119) 71 PRK05450 3-deoxy-manno-octulos 99.9 1.1E-21 2.8E-26 141.8 11.2 228 4-242 2-247 (248) 72 pfam01128 IspD Uncharacterized 99.9 2.8E-21 7.1E-26 139.6 13.0 216 6-242 2-220 (221) 73 COG0836 {ManC} Mannose-1-phosp 99.9 2.2E-21 5.7E-26 140.1 12.3 186 5-191 2-214 (333) 74 cd03352 LbH_LpxD UDP-3-O-acyl- 99.9 5.5E-22 1.4E-26 143.5 9.0 156 268-423 2-178 (205) 75 TIGR03570 NeuD_NnaD sugar O-ac 99.9 1.5E-21 3.7E-26 141.1 10.0 66 354-426 135-200 (201) 76 cd02517 CMP-KDO-Synthetase CMP 99.9 6.4E-21 1.6E-25 137.5 11.2 225 4-240 1-239 (239) 77 cd03360 LbH_AT_putative Putati 99.9 2E-21 5E-26 140.4 8.0 66 354-426 132-197 (197) 78 PRK13368 3-deoxy-manno-octulos 99.8 3.4E-20 8.7E-25 133.4 11.8 224 4-241 2-238 (238) 79 PRK00155 ispD 2-C-methyl-D-ery 99.8 6.6E-20 1.7E-24 131.7 13.0 223 1-245 1-227 (228) 80 PRK09677 putative lipopolysacc 99.8 1.7E-20 4.4E-25 135.0 9.9 60 374-433 125-184 (192) 81 cd03359 LbH_Dynactin_5 Dynacti 99.8 3.4E-20 8.6E-25 133.4 11.0 134 268-433 8-145 (161) 82 cd03357 LbH_MAT_GAT Maltose O- 99.8 2.8E-20 7E-25 133.9 9.7 52 379-430 118-169 (169) 83 cd05636 LbH_G1P_TT_C_like Puta 99.8 1.7E-20 4.4E-25 135.1 8.3 124 263-394 13-162 (163) 84 PRK09527 lacA galactoside O-ac 99.8 3.6E-20 9.2E-25 133.2 9.9 54 379-432 131-184 (203) 85 COG4750 LicC CTP:phosphocholin 99.8 7.4E-21 1.9E-25 137.1 6.4 163 5-187 1-169 (231) 86 TIGR00453 ispD 2-C-methyl-D-er 99.8 1.3E-19 3.4E-24 130.0 12.6 211 6-241 1-226 (226) 87 cd04646 LbH_Dynactin_6 Dynacti 99.8 5.6E-20 1.4E-24 132.1 10.2 130 270-431 2-137 (164) 88 cd03351 LbH_UDP-GlcNAc_AT UDP- 99.8 3.6E-20 9.1E-25 133.3 8.9 148 264-422 2-165 (254) 89 PRK12461 UDP-N-acetylglucosami 99.8 3.2E-20 8E-25 133.5 8.5 147 264-422 3-165 (256) 90 PRK10502 putative colanic acid 99.8 6.8E-20 1.7E-24 131.7 9.9 54 379-432 125-178 (179) 91 PRK05289 UDP-N-acetylglucosami 99.8 5.2E-20 1.3E-24 132.3 9.0 150 263-423 4-169 (261) 92 COG0663 PaaY Carbonic anhydras 99.8 8.1E-20 2.1E-24 131.2 9.7 72 354-433 72-145 (176) 93 TIGR01105 galF regulatory prot 99.8 6.9E-20 1.8E-24 131.6 8.9 217 4-227 3-264 (297) 94 cd02516 CDP-ME_synthetase CDP- 99.8 5E-19 1.3E-23 126.7 13.2 210 6-236 2-217 (218) 95 cd03349 LbH_XAT Xenobiotic acy 99.8 1.4E-19 3.5E-24 129.9 9.4 54 378-431 72-125 (145) 96 PRK09451 glmU bifunctional N-a 99.8 2.2E-19 5.6E-24 128.8 9.4 101 286-394 318-427 (456) 97 TIGR01173 glmU UDP-N-acetylglu 99.8 2.2E-19 5.5E-24 128.8 8.1 106 284-397 322-436 (461) 98 cd00710 LbH_gamma_CA Gamma car 99.8 2.3E-19 6E-24 128.6 8.2 135 264-421 5-140 (167) 99 cd04182 GT_2_like_f GT_2_like_ 99.8 2.9E-19 7.5E-24 128.0 8.7 118 6-127 2-120 (186) 100 PRK10092 maltose O-acetyltrans 99.8 6.4E-19 1.6E-23 126.1 10.1 54 379-432 129-182 (183) 101 cd03353 LbH_GlmU_C N-acetyl-gl 99.8 7E-19 1.8E-23 125.9 10.2 142 276-423 12-172 (193) 102 PRK13385 2-C-methyl-D-erythrit 99.8 2.3E-18 6E-23 122.9 12.6 224 4-246 2-236 (238) 103 TIGR03310 matur_ygfJ molybdenu 99.8 1E-18 2.6E-23 124.9 9.3 118 7-128 2-120 (188) 104 COG1211 IspD 4-diphosphocytidy 99.8 9.2E-18 2.3E-22 119.5 13.8 217 1-243 1-228 (230) 105 TIGR01852 lipid_A_lpxA acyl-[a 99.8 6.4E-19 1.6E-23 126.1 7.3 53 355-414 102-154 (257) 106 PRK02726 molybdopterin-guanine 99.8 1.1E-17 2.8E-22 119.1 12.4 195 1-243 4-200 (200) 107 PRK00317 mobA molybdopterin-gu 99.7 2.5E-17 6.5E-22 117.0 12.2 185 4-240 3-192 (193) 108 COG2068 Uncharacterized MobA-r 99.7 5.4E-17 1.4E-21 115.2 13.2 192 3-241 4-197 (199) 109 COG1212 KdsB CMP-2-keto-3-deox 99.7 2.4E-17 6.1E-22 117.2 11.1 226 4-243 3-244 (247) 110 cd04647 LbH_MAT_like Maltose O 99.7 9.2E-18 2.4E-22 119.5 8.9 51 379-429 58-108 (109) 111 cd03354 LbH_SAT Serine acetylt 99.7 1.2E-17 3.2E-22 118.8 8.9 48 379-426 54-101 (101) 112 cd03350 LbH_THP_succinylT 2,3, 99.7 1.6E-17 4.2E-22 118.1 9.1 128 263-425 9-136 (139) 113 cd00710 LbH_gamma_CA Gamma car 99.7 6.3E-17 1.6E-21 114.8 11.3 120 269-422 4-125 (167) 114 TIGR03202 pucB xanthine dehydr 99.7 4.1E-17 1E-21 115.8 9.4 121 6-128 2-126 (190) 115 COG0110 WbbJ Acetyltransferase 99.7 5.4E-17 1.4E-21 115.1 10.0 55 379-433 124-178 (190) 116 TIGR00454 TIGR00454 conserved 99.7 1.3E-17 3.3E-22 118.7 6.8 121 5-125 1-125 (204) 117 cd05825 LbH_wcaF_like wcaF-lik 99.7 5.8E-17 1.5E-21 115.0 9.6 51 379-429 56-106 (107) 118 PRK10191 putative colanic acid 99.7 1.2E-16 3.1E-21 113.1 10.8 53 379-431 92-144 (146) 119 PRK09382 ispDF bifunctional 2- 99.7 6.7E-16 1.7E-20 108.9 12.0 206 4-241 6-216 (382) 120 PRK11830 dapD 2,3,4,5-tetrahyd 99.7 4.6E-16 1.2E-20 109.8 10.6 120 272-425 95-230 (265) 121 cd04650 LbH_FBP Ferripyochelin 99.7 2.2E-16 5.5E-21 111.7 8.2 131 270-420 3-138 (154) 122 cd04745 LbH_paaY_like paaY-lik 99.7 1.4E-16 3.6E-21 112.8 6.9 136 264-421 3-139 (155) 123 COG2266 GTP:adenosylcobinamide 99.7 5.2E-16 1.3E-20 109.5 9.1 111 5-124 1-112 (177) 124 TIGR01172 cysE serine O-acetyl 99.7 3.3E-17 8.4E-22 116.4 2.5 54 374-428 110-163 (163) 125 cd04645 LbH_gamma_CA_like Gamm 99.7 5.6E-16 1.4E-20 109.4 8.7 131 270-420 2-137 (153) 126 cd02503 MobA MobA catalyzes th 99.6 8.3E-16 2.1E-20 108.4 9.2 114 5-129 1-115 (181) 127 COG2171 DapD Tetrahydrodipicol 99.6 4.7E-16 1.2E-20 109.8 7.5 118 273-424 108-227 (271) 128 cd04646 LbH_Dynactin_6 Dynacti 99.6 2.9E-16 7.4E-21 111.0 6.4 122 264-416 14-138 (164) 129 TIGR02287 PaaY phenylacetic ac 99.6 4.8E-16 1.2E-20 109.8 7.4 72 354-433 69-142 (193) 130 PRK13627 carnitine operon prot 99.6 1E-15 2.6E-20 107.9 8.5 136 264-421 13-149 (196) 131 PRK00560 molybdopterin-guanine 99.6 3.4E-15 8.7E-20 104.9 11.1 185 1-244 5-194 (196) 132 COG1207 GlmU N-acetylglucosami 99.6 1.1E-15 2.8E-20 107.7 8.1 107 284-398 319-434 (460) 133 KOG4750 consensus 99.6 5.8E-16 1.5E-20 109.3 6.6 53 378-431 200-252 (269) 134 pfam02348 CTP_transf_3 Cytidyl 99.6 7.2E-16 1.8E-20 108.7 6.9 176 6-187 1-189 (197) 135 COG1043 LpxA Acyl-[acyl carrie 99.6 1.4E-15 3.5E-20 107.2 7.3 147 264-422 6-169 (260) 136 COG1045 CysE Serine acetyltran 99.6 4.8E-15 1.2E-19 104.1 9.9 56 378-433 118-173 (194) 137 COG0746 MobA Molybdopterin-gua 99.6 5.8E-15 1.5E-19 103.6 9.8 115 1-127 1-116 (192) 138 TIGR03570 NeuD_NnaD sugar O-ac 99.6 1.6E-15 4E-20 106.8 6.8 102 255-397 88-189 (201) 139 cd03360 LbH_AT_putative Putati 99.6 2E-15 5.1E-20 106.2 5.3 99 254-368 84-182 (197) 140 cd00208 LbetaH Left-handed par 99.5 2.3E-14 5.8E-19 100.2 8.4 59 355-413 19-78 (78) 141 PRK11132 cysE serine acetyltra 99.5 2.5E-14 6.4E-19 100.0 7.9 54 379-432 193-246 (273) 142 KOG1461 consensus 99.5 1.4E-14 3.6E-19 101.4 6.4 101 289-415 319-420 (673) 143 cd04652 LbH_eIF2B_gamma_C eIF- 99.5 1.2E-14 3E-19 101.8 4.2 79 288-367 2-80 (81) 144 COG0663 PaaY Carbonic anhydras 99.5 6.1E-14 1.6E-18 97.8 6.7 120 264-406 14-134 (176) 145 cd03359 LbH_Dynactin_5 Dynacti 99.5 2.3E-13 5.9E-18 94.5 9.3 119 267-420 21-149 (161) 146 TIGR03308 phn_thr-fam phosphon 99.5 1.7E-13 4.4E-18 95.2 8.2 130 267-407 19-152 (204) 147 cd03358 LbH_WxcM_N_like WcxM-l 99.4 2.9E-13 7.5E-18 93.9 6.7 109 270-406 1-110 (119) 148 TIGR00466 kdsB 3-deoxy-D-manno 99.4 3.4E-12 8.7E-17 87.8 11.9 218 7-235 5-246 (246) 149 PRK05293 glgC glucose-1-phosph 99.4 9.6E-13 2.4E-17 91.0 9.0 42 301-343 307-348 (381) 150 cd05787 LbH_eIF2B_epsilon eIF- 99.4 1.2E-13 3E-18 96.2 4.1 55 288-343 2-56 (79) 151 cd03356 LbH_G1P_AT_C_like Left 99.4 2.8E-13 7E-18 94.0 5.9 60 305-365 2-61 (79) 152 COG2171 DapD Tetrahydrodipicol 99.4 1E-12 2.7E-17 90.8 8.3 114 263-403 110-224 (271) 153 TIGR03532 DapD_Ac 2,3,4,5-tetr 99.4 1.2E-12 3E-17 90.5 8.3 111 262-406 93-203 (231) 154 cd05787 LbH_eIF2B_epsilon eIF- 99.4 8.7E-13 2.2E-17 91.2 7.3 78 304-407 1-78 (79) 155 cd05824 LbH_M1P_guanylylT_C Ma 99.4 5.3E-13 1.3E-17 92.4 5.4 47 284-331 16-62 (80) 156 cd02513 CMP-NeuAc_Synthase CMP 99.4 6E-12 1.5E-16 86.4 9.9 210 4-240 1-222 (223) 157 cd04652 LbH_eIF2B_gamma_C eIF- 99.3 1.6E-12 4E-17 89.8 6.0 80 304-410 1-80 (81) 158 cd05824 LbH_M1P_guanylylT_C Ma 99.3 5.1E-13 1.3E-17 92.5 3.1 61 322-389 19-79 (80) 159 COG1208 GCD1 Nucleoside-diphos 99.3 2.3E-12 5.9E-17 88.8 5.7 89 262-368 256-344 (358) 160 TIGR02665 molyb_mobA molybdopt 99.3 1.6E-12 4E-17 89.8 4.8 122 6-131 2-129 (202) 161 cd03356 LbH_G1P_AT_C_like Left 99.3 1.2E-12 2.9E-17 90.5 4.0 77 288-389 2-78 (79) 162 PRK02862 glgC glucose-1-phosph 99.3 4.6E-12 1.2E-16 87.1 6.1 59 285-343 324-383 (429) 163 COG1083 NeuA CMP-N-acetylneura 99.2 2.3E-11 5.8E-16 83.1 7.3 213 3-242 2-223 (228) 164 PRK04928 consensus 99.2 5.1E-12 1.3E-16 86.8 3.6 71 284-367 310-380 (405) 165 TIGR03584 PseF pseudaminic aci 99.2 9.8E-11 2.5E-15 79.5 9.6 209 6-241 1-220 (222) 166 PRK00844 glgC glucose-1-phosph 99.2 1.5E-11 3.7E-16 84.2 3.8 63 267-332 315-377 (409) 167 cd02518 GT2_SpsF SpsF is a gly 99.2 1.3E-10 3.4E-15 78.8 7.9 116 6-129 1-120 (233) 168 PRK03282 consensus 99.2 3E-11 7.8E-16 82.4 4.1 69 285-366 313-381 (406) 169 PRK10502 putative colanic acid 99.1 2.4E-10 6E-15 77.4 7.9 111 268-406 53-168 (179) 170 cd04651 LbH_G1P_AT_C Glucose-1 99.1 6.2E-11 1.6E-15 80.6 4.1 59 284-344 11-69 (104) 171 cd00208 LbetaH Left-handed par 99.1 2.7E-10 6.8E-15 77.1 6.6 34 354-394 44-77 (78) 172 cd04651 LbH_G1P_AT_C Glucose-1 99.1 1.2E-10 3E-15 79.1 4.7 72 293-366 3-74 (104) 173 PTZ00339 UDP-N-acetylglucosami 99.1 6.4E-09 1.6E-13 69.2 13.6 182 4-187 109-357 (499) 174 cd04193 UDPGlcNAc_PPase UDPGlc 99.1 1.4E-08 3.5E-13 67.3 14.9 183 4-188 15-256 (323) 175 KOG1462 consensus 99.1 7.9E-11 2E-15 80.1 3.4 92 302-426 334-426 (433) 176 PRK09677 putative lipopolysacc 99.1 1.2E-09 3E-14 73.4 9.1 126 260-406 42-173 (192) 177 cd03354 LbH_SAT Serine acetylt 99.1 3.3E-10 8.4E-15 76.5 6.0 93 282-406 5-97 (101) 178 TIGR00965 dapD 2,3,4,5-tetrahy 99.0 6E-10 1.5E-14 75.0 7.1 120 272-424 102-236 (275) 179 PRK09527 lacA galactoside O-ac 99.0 4.5E-10 1.2E-14 75.7 6.4 111 271-406 59-174 (203) 180 cd04649 LbH_THP_succinylT_puta 99.0 6.2E-10 1.6E-14 75.0 6.8 61 355-417 48-109 (147) 181 PRK01884 consensus 99.0 1.6E-10 4.1E-15 78.3 3.8 33 336-368 344-376 (435) 182 PRK00725 glgC glucose-1-phosph 99.0 3.9E-10 9.9E-15 76.1 3.9 62 285-348 331-392 (431) 183 cd03357 LbH_MAT_GAT Maltose O- 99.0 1.9E-09 4.9E-14 72.2 7.4 99 283-406 60-161 (169) 184 PRK10092 maltose O-acetyltrans 99.0 1.4E-09 3.6E-14 73.0 6.6 100 283-407 71-173 (183) 185 TIGR02091 glgC glucose-1-phosp 99.0 4.6E-10 1.2E-14 75.7 4.0 58 285-344 336-411 (421) 186 PRK03701 consensus 98.9 7.2E-10 1.8E-14 74.6 3.6 70 298-369 327-396 (431) 187 TIGR02287 PaaY phenylacetic ac 98.9 1.9E-09 4.8E-14 72.2 5.2 99 284-406 25-131 (193) 188 COG4284 UDP-glucose pyrophosph 98.9 2.6E-08 6.7E-13 65.7 10.5 183 3-186 103-337 (472) 189 COG1045 CysE Serine acetyltran 98.9 6.1E-09 1.5E-13 69.3 6.2 69 302-397 87-155 (194) 190 cd04647 LbH_MAT_like Maltose O 98.8 8.1E-09 2.1E-13 68.6 6.6 45 353-406 57-101 (109) 191 PRK10191 putative colanic acid 98.8 1.4E-08 3.4E-13 67.4 7.1 43 355-406 93-135 (146) 192 cd05635 LbH_unknown Uncharacte 98.8 3.5E-09 9E-14 70.7 3.7 73 270-345 14-87 (101) 193 KOG1460 consensus 98.8 4.7E-09 1.2E-13 69.9 4.0 39 304-343 308-346 (407) 194 TIGR00965 dapD 2,3,4,5-tetrahy 98.8 1.1E-08 2.9E-13 67.8 5.5 52 293-344 134-201 (275) 195 PRK11830 dapD 2,3,4,5-tetrahyd 98.8 2E-08 5E-13 66.4 6.5 57 252-317 95-152 (265) 196 COG0448 GlgC ADP-glucose pyrop 98.8 9.3E-09 2.4E-13 68.3 4.5 81 283-366 277-357 (393) 197 COG1861 SpsF Spore coat polysa 98.7 2.9E-08 7.4E-13 65.5 6.8 115 4-131 2-126 (241) 198 cd04649 LbH_THP_succinylT_puta 98.7 5.1E-08 1.3E-12 64.1 6.5 80 302-400 31-110 (147) 199 cd00897 UGPase_euk Eukaryotic 98.7 2.5E-06 6.4E-11 54.4 15.1 74 4-77 3-86 (300) 200 cd04180 UGPase_euk_like Eukary 98.6 4.2E-07 1.1E-11 58.8 9.8 144 6-151 2-190 (266) 201 KOG1322 consensus 98.6 3.3E-08 8.5E-13 65.1 3.4 73 305-396 273-345 (371) 202 TIGR02353 NRPS_term_dom non-ri 98.6 1.5E-07 3.9E-12 61.4 6.6 79 343-421 621-713 (719) 203 TIGR01479 GMP_PMI mannose-1-ph 98.6 1E-07 2.5E-12 62.4 5.5 184 6-189 2-219 (478) 204 cd05825 LbH_wcaF_like wcaF-lik 98.5 6.4E-08 1.6E-12 63.5 3.7 45 353-406 55-99 (107) 205 TIGR01172 cysE serine O-acetyl 98.5 1.4E-07 3.5E-12 61.6 4.8 93 270-397 58-150 (163) 206 TIGR02353 NRPS_term_dom non-ri 98.5 1E-07 2.7E-12 62.3 3.4 68 355-429 622-708 (719) 207 KOG3121 consensus 98.5 1.2E-07 3E-12 62.0 3.3 47 356-409 103-149 (184) 208 cd06424 UGGPase UGGPase cataly 98.4 6.5E-06 1.6E-10 52.1 12.0 72 7-78 3-93 (315) 209 cd03349 LbH_XAT Xenobiotic acy 98.4 1.1E-06 2.9E-11 56.4 7.2 70 353-435 72-143 (145) 210 TIGR02092 glgD glucose-1-phosp 98.4 2.6E-07 6.5E-12 60.1 3.6 49 32-80 106-159 (383) 211 KOG4042 consensus 98.3 1.3E-06 3.2E-11 56.2 5.4 130 268-426 9-144 (190) 212 TIGR01208 rmlA_long glucose-1- 98.2 5.9E-07 1.5E-11 58.0 2.5 199 174-406 143-359 (361) 213 PRK11132 cysE serine acetyltra 98.2 2.8E-06 7.1E-11 54.2 5.7 35 356-397 195-229 (273) 214 COG0110 WbbJ Acetyltransferase 98.2 3.3E-06 8.3E-11 53.8 6.0 99 284-406 66-167 (190) 215 PRK00576 molybdopterin-guanine 98.2 4.3E-06 1.1E-10 53.1 6.6 96 22-125 3-102 (178) 216 KOG4750 consensus 98.1 6.9E-06 1.8E-10 51.9 6.2 35 356-397 202-236 (269) 217 TIGR03536 DapD_gpp 2,3,4,5-tet 98.0 1.5E-05 3.9E-10 50.0 6.1 58 356-415 226-284 (341) 218 TIGR03535 DapD_actino 2,3,4,5- 97.9 2.8E-05 7.2E-10 48.4 6.3 58 356-415 201-259 (319) 219 KOG3121 consensus 97.8 3E-05 7.6E-10 48.3 5.1 68 355-430 85-154 (184) 220 TIGR03536 DapD_gpp 2,3,4,5-tet 97.7 0.00013 3.2E-09 44.7 5.9 36 356-400 252-287 (341) 221 TIGR03552 F420_cofC 2-phospho- 97.6 6E-05 1.5E-09 46.6 4.0 115 7-126 2-119 (195) 222 PRK13412 fkp bifunctional fuco 97.6 0.00071 1.8E-08 40.5 9.2 82 100-187 154-245 (974) 223 pfam01704 UDPGP UTP--glucose-1 97.5 0.005 1.3E-07 35.7 13.2 112 4-117 52-206 (416) 224 TIGR03535 DapD_actino 2,3,4,5- 97.5 0.00032 8E-09 42.5 6.0 36 356-400 227-262 (319) 225 cd00761 Glyco_tranf_GTA_type G 97.5 0.00048 1.2E-08 41.4 6.8 91 26-119 2-98 (156) 226 KOG2388 consensus 97.3 0.0031 8E-08 36.8 9.8 75 4-78 97-192 (477) 227 COG4801 Predicted acyltransfer 97.2 0.00057 1.5E-08 41.0 5.0 101 284-415 15-115 (277) 228 KOG4042 consensus 96.9 0.00039 1E-08 42.0 1.4 40 263-302 22-64 (190) 229 pfam07959 Fucokinase L-fucokin 96.8 0.0033 8.4E-08 36.7 5.8 78 100-183 55-153 (414) 230 cd06442 DPM1_like DPM1_like re 96.7 0.037 9.3E-07 30.7 10.5 187 26-222 2-199 (224) 231 KOG2638 consensus 96.7 0.039 9.9E-07 30.6 17.4 74 4-77 102-186 (498) 232 cd04188 DPG_synthase DPG_synth 96.6 0.022 5.5E-07 32.0 8.8 183 26-221 2-203 (211) 233 cd04184 GT2_RfbC_Mx_like Myxoc 96.6 0.045 1.1E-06 30.3 10.5 103 26-131 6-116 (202) 234 cd06434 GT2_HAS Hyaluronan syn 95.9 0.055 1.4E-06 29.7 7.7 94 26-125 5-104 (235) 235 cd06433 GT_2_WfgS_like WfgS an 95.9 0.1 2.7E-06 28.2 9.5 101 26-130 3-107 (202) 236 cd06439 CESA_like_1 CESA_like_ 95.8 0.056 1.4E-06 29.7 7.4 94 26-124 34-135 (251) 237 cd04186 GT_2_like_c Subfamily 95.8 0.064 1.6E-06 29.4 7.6 99 26-127 2-103 (166) 238 cd02526 GT2_RfbF_like RfbF is 95.7 0.13 3.3E-06 27.6 11.7 101 33-135 11-115 (237) 239 PRK11204 N-glycosyltransferase 95.5 0.083 2.1E-06 28.7 7.2 101 26-129 59-165 (421) 240 TIGR03111 glyc2_xrt_Gpos1 puta 95.4 0.15 3.8E-06 27.3 8.4 104 23-129 49-162 (439) 241 cd06435 CESA_NdvC_like NdvC_li 95.3 0.12 3E-06 27.9 7.6 99 26-125 3-111 (236) 242 pfam01983 CofC Guanylyl transf 95.0 0.061 1.6E-06 29.5 5.4 116 5-131 1-120 (217) 243 cd06421 CESA_CelA_like CESA_Ce 95.0 0.19 4.8E-06 26.7 7.7 103 26-131 6-117 (234) 244 COG1216 Predicted glycosyltran 94.7 0.26 6.6E-06 25.9 10.8 112 24-136 7-122 (305) 245 cd06438 EpsO_like EpsO protein 94.5 0.3 7.6E-06 25.6 7.9 101 26-127 2-110 (183) 246 COG1920 Predicted nucleotidylt 94.4 0.076 1.9E-06 28.9 4.6 172 7-238 3-181 (210) 247 pfam00535 Glycos_transf_2 Glyc 94.0 0.14 3.7E-06 27.4 5.3 104 26-132 3-113 (168) 248 cd06437 CESA_CaSu_A2 Cellulose 93.9 0.39 1E-05 24.9 7.9 97 26-125 6-114 (232) 249 cd06427 CESA_like_2 CESA_like_ 93.1 0.54 1.4E-05 24.1 8.2 98 26-126 6-112 (241) 250 cd04185 GT_2_like_b Subfamily 93.1 0.55 1.4E-05 24.1 7.7 98 27-127 3-108 (202) 251 cd04192 GT_2_like_e Subfamily 92.4 0.67 1.7E-05 23.6 7.5 100 26-128 2-112 (229) 252 cd06423 CESA_like CESA_like is 92.2 0.72 1.8E-05 23.4 7.8 102 26-130 2-110 (180) 253 cd02525 Succinoglycan_BP_ExoA 90.9 0.99 2.5E-05 22.6 10.1 102 26-131 5-114 (249) 254 cd04196 GT_2_like_d Subfamily 90.7 1 2.6E-05 22.5 9.4 104 26-132 3-113 (214) 255 cd04195 GT2_AmsE_like GT2_AmsE 90.0 1.2 3E-05 22.1 8.8 103 26-132 3-114 (201) 256 PRK10073 predicted glycosyl tr 87.8 1.7 4.4E-05 21.2 8.5 111 19-133 2-121 (329) 257 cd04187 DPM1_like_bac Bacteria 87.2 1.9 4.8E-05 21.0 6.4 104 26-132 2-114 (181) 258 cd02520 Glucosylceramide_synth 86.4 2.1 5.2E-05 20.8 7.2 95 26-124 6-112 (196) 259 COG1215 Glycosyltransferases, 86.1 2.1 5.4E-05 20.7 6.3 101 26-129 59-168 (439) 260 TIGR03472 HpnI hopanoid biosyn 84.5 2.5 6.5E-05 20.3 6.8 100 22-125 40-153 (373) 261 PRK10018 predicted glycosyl tr 84.4 2.6 6.6E-05 20.2 8.1 98 26-126 10-113 (279) 262 cd04179 DPM_DPG-synthase_like 84.2 2.6 6.7E-05 20.2 7.9 106 26-135 2-116 (185) 263 KOG4184 consensus 83.1 1.1 2.9E-05 22.3 2.8 14 104-119 180-193 (478) 264 KOG2978 consensus 77.9 4.4 0.00011 18.9 9.1 101 31-136 17-126 (238) 265 TIGR03469 HonB hopene-associat 76.3 4.9 0.00013 18.6 6.8 101 26-127 45-162 (384) 266 cd02522 GT_2_like_a GT_2_like_ 75.7 5.1 0.00013 18.5 5.8 93 26-128 4-102 (221) 267 PTZ00260 glycosyl transferase 71.0 6.7 0.00017 17.9 9.7 175 33-221 96-294 (336) 268 PRK10714 undecaprenyl phosphat 67.6 7.9 0.0002 17.5 7.5 106 26-135 11-126 (324) 269 PRK00923 sirohydrochlorin coba 64.5 9.1 0.00023 17.1 4.3 51 1-54 1-69 (130) 270 cd02511 Beta4Glucosyltransfera 63.7 9.5 0.00024 17.0 5.9 92 26-124 5-97 (229) 271 COG0381 WecB UDP-N-acetylgluco 60.1 11 0.00028 16.6 5.6 71 38-108 22-102 (383) 272 cd06436 GlcNAc-1-P_transferase 55.2 13 0.00034 16.2 7.3 96 26-124 2-115 (191) 273 cd04191 Glucan_BSP_ModH Glucan 52.3 15 0.00038 15.9 7.1 107 25-132 3-129 (254) 274 PRK00129 upp uracil phosphorib 52.1 13 0.00032 16.3 2.5 22 27-48 3-24 (208) 275 TIGR00461 gcvP glycine dehydro 50.4 16 0.00041 15.7 3.3 87 35-129 148-239 (965) 276 PRK08266 hypothetical protein; 49.4 17 0.00043 15.6 6.1 73 36-109 5-102 (531) 277 TIGR00420 trmU tRNA (5-methyla 44.2 18 0.00045 15.5 2.3 22 404-425 279-300 (394) 278 TIGR02661 MauD methylamine deh 42.2 22 0.00055 15.0 5.0 73 7-84 63-141 (189) 279 TIGR00027 mthyl_TIGR00027 meth 40.8 11 0.00028 16.7 0.7 13 5-17 92-104 (281) 280 cd03414 CbiX_SirB_C Sirohydroc 40.3 23 0.00059 14.8 6.7 48 6-54 1-66 (117) 281 COG4565 CitB Response regulato 39.5 7.9 0.0002 17.5 -0.1 63 81-145 29-98 (224) 282 cd03415 CbiX_CbiC Archaeal sir 37.6 26 0.00066 14.5 2.3 48 6-54 1-65 (125) 283 TIGR01668 YqeG_hyp_ppase HAD s 34.0 30 0.00075 14.2 4.6 100 33-141 55-167 (196) 284 PRK13337 putative lipid kinase 33.9 30 0.00076 14.2 5.9 46 60-108 24-69 (305) 285 PRK11914 diacylglycerol kinase 32.7 31 0.00079 14.1 5.1 61 45-108 4-73 (304) 286 PRK04015 chromatin protein; Pr 32.4 32 0.0008 14.0 3.0 30 26-55 6-37 (91) 287 PRK09946 hypothetical protein; 32.2 32 0.00081 14.0 4.4 12 202-213 32-43 (270) 288 TIGR01307 pgm_bpd_ind 2,3-bisp 31.2 24 0.0006 14.8 1.2 40 4-44 2-42 (529) 289 pfam07355 GRDB Glycine/sarcosi 30.8 21 0.00054 15.0 0.9 73 68-145 21-98 (349) 290 cd06420 GT2_Chondriotin_Pol_N 29.6 35 0.0009 13.8 6.8 92 26-121 2-102 (182) 291 PRK05782 precorrin-8X methylmu 28.7 32 0.00081 14.0 1.5 50 4-55 2-69 (332) 292 COG1105 FruK Fructose-1-phosph 28.3 37 0.00095 13.6 2.8 44 10-54 34-83 (310) 293 PRK10063 predicted glycosyl tr 27.7 38 0.00097 13.6 9.3 73 31-107 12-92 (248) 294 TIGR03568 NeuC_NnaA UDP-N-acet 27.4 39 0.00098 13.5 6.8 71 38-108 19-103 (365) 295 pfam07997 DUF1694 Protein of u 26.9 23 0.00058 14.9 0.5 90 57-150 21-111 (120) 296 TIGR00285 TIGR00285 DNA-bindin 25.8 20 0.00051 15.2 0.1 29 27-55 4-34 (88) 297 COG1099 Predicted metal-depend 25.1 43 0.0011 13.3 6.2 13 39-51 17-29 (254) 298 pfam02776 TPP_enzyme_N Thiamin 24.6 43 0.0011 13.2 7.0 97 35-136 4-100 (172) 299 COG3315 O-Methyltransferase in 24.5 42 0.0011 13.3 1.5 12 178-189 183-194 (297) 300 cd02510 pp-GalNAc-T pp-GalNAc- 24.0 45 0.0011 13.2 6.7 98 27-128 4-113 (299) 301 cd06913 beta3GnTL1_like Beta 1 23.5 46 0.0012 13.1 7.5 100 26-128 2-114 (219) 302 PRK10310 galactitol-specific P 21.8 50 0.0013 12.9 3.2 28 48-75 3-37 (94) 303 cd03413 CbiK_C Anaerobic cobal 20.8 52 0.0013 12.8 3.2 23 30-52 39-61 (103) 304 TIGR01980 sufB FeS assembly pr 20.6 53 0.0013 12.8 3.9 34 381-415 355-389 (469) 305 TIGR00575 dnlj DNA ligase, NAD 20.3 54 0.0014 12.7 1.8 22 261-282 328-349 (706) No 1 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=100.00 E-value=0 Score=934.06 Aligned_cols=436 Identities=41% Similarity=0.686 Sum_probs=413.7 Q ss_pred CEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC-CEEEEEECCCCCC- Q ss_conf 08999458877232788874412588485799999999977997799995798899999730269-7289998599886- Q gi|254780942|r 5 RLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPT-LSVEYYIQDCQQG- 82 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~-~~i~~v~q~~~~G- 82 (442) +.+||||||+||||+|.+||.|.||+|||||+|+|+.+.+...++|+||+||++|+|++.+.... .-+.|+.|.+|+| T Consensus 1 ~~~vILAAGkGTRMkS~lPKVLH~laGkpMl~hVi~~A~~L~~~~i~vV~GH~~~~V~~~l~~~~d~t~~~v~Q~~qlGG 80 (461) T TIGR01173 1 LSVVILAAGKGTRMKSKLPKVLHPLAGKPMLEHVIDAARKLSPEKIHVVLGHGAEQVRKALAEENDKTVNWVLQAEQLGG 80 (461) T ss_pred CEEEEEECCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHCCCCCEEEEEEECCCCCC T ss_conf 93899806888311138686012010322789999998537834159998168699999850379967999996889787 Q ss_pred HHHHHHHCCCCCC-CCCCEEEEEECCCCCCCCCCCC-CCCCCCC--CCCCCCCEEEECCCCCCCCEEEECC-CCCEECCC Q ss_conf 1446641000012-3446058985154111533220-0000013--5861001001012215744042023-33100100 Q gi|254780942|r 83 TAHAVLTAQDAIK-PGYDDVIIMYGDVPLVSSHTLK-KAMDKIA--QGYSIAVVGFNADNPKGYGRLLIKN-NEIIAIRE 157 (442) Q Consensus 83 Ta~Ai~~a~~~l~-~~~~~~lVl~GD~~li~~~~l~-~l~~~~~--~~~~~~i~~~~~~dp~~yGvV~~d~-~~v~~i~E 157 (442) ||||+.+++++|. +..+++||||||+|||+.+.|+ +|++.|. +++.++++++..+||++||||.+++ ++|.+|+| T Consensus 81 TGHAv~~a~~~l~~~~~~~vLvLyGDvPLi~~eTL~m~Ll~~~~~~~~~~~tlLt~~l~DP~GYGRI~r~~~g~V~~IVE 160 (461) T TIGR01173 81 TGHAVLQALPFLSEDDDGRVLVLYGDVPLISAETLEMRLLESHRQLNGAKVTLLTAKLEDPTGYGRIIRENDGAVQAIVE 160 (461) T ss_pred HHHHHHHHCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCEEEECCCCCEEEEEC T ss_conf 28999871167888668608999588787756778779888630001045179998718889543589848995899973 Q ss_pred CCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECH-HHEECCCCCCCCCC Q ss_conf 14667542100035560011125014677753288874001112114899997307717999172-14301112100000 Q gi|254780942|r 158 ENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKE-QEVCGCNNRYELSL 236 (442) Q Consensus 158 k~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~-~~~~gv~~~~~L~~ 236 (442) +||++++|+.+.+||+|+|+|+.++|++||++++++|.||||||||++..+..+|..+.++.+.+ .++.|+|+|++|+. T Consensus 161 ~KDA~~eqk~I~eiNtG~y~~~~~~L~~~L~~l~n~NaqgEYYLTD~ia~a~~~g~~v~~~~~~d~~E~~GvNdR~qLa~ 240 (461) T TIGR01173 161 EKDANEEQKAIKEINTGVYVFDGAALKRWLPKLSNNNAQGEYYLTDVIALAVADGEEVRAVQVDDSEEVLGVNDRLQLAQ 240 (461) T ss_pred CCCCCHHHHCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCCCEEEEEEECCHHHHCCCCCHHHHHH T ss_conf 35988698035278887999832899988876287704443147899999850894789998087598336679889999 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCC---------------- Q ss_conf 0122222112212223321021152799411233178410012232234420002111631343---------------- Q gi|254780942|r 237 IENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYL---------------- 300 (442) Q Consensus 237 ~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i---------------- 300 (442) +|+.||.++.+.||.+||++.+|+++||+.++.||.||+|+|+|+|.+++.||++|+|||+|+| T Consensus 241 lE~~~q~r~~~~l~~~GVt~~dP~~~~i~~~v~~G~Dv~I~~~v~leG~v~iG~~v~IGp~~~I~ns~I~~~~~I~~~s~ 320 (461) T TIGR01173 241 LERILQRRIAKKLLLQGVTLIDPARTDIRGTVEIGQDVVIDPNVILEGKVQIGDDVVIGPGCVIKNSVIGSNAVIKPYSV 320 (461) T ss_pred HHHHHHHHHHHHHHHCCCEEECCCEEEECCCEEECCEEEECCCCEECCEEEECCCEEECCCCEEEEEEECCCCEEEEEEE T ss_conf 99999999999898589299868337873547987716975583883407987870788985899728868855888884 Q ss_pred -CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCC Q ss_conf -3212111210021344320000025864432023102112478544874555663988998997980883445775456 Q gi|254780942|r 301 -EGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYK 379 (442) Q Consensus 301 -~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~ 379 (442) |+|.|+++|.|||||||||+|+|+++++||||||+|||.|++|||++||||||||.||++||||||||||||||.+||+ T Consensus 321 ~e~~~ig~~~~vGPFArLRP~~~L~~~~hiGNFVE~Kna~iG~gsKA~HLsYlGDAeiG~~vNiGAGtITcNYDG~nK~~ 400 (461) T TIGR01173 321 LEGSEIGEGCDVGPFARLRPGSVLGAGVHIGNFVEVKNARIGEGSKAGHLSYLGDAEIGSNVNIGAGTITCNYDGVNKHK 400 (461) T ss_pred CCCCEECCCCEECCCCCCCCCCHHHCCCEEEEEEEEECCEECCCCCCCCCCEEEEEEECCCCEECCEEEEEECCCCCCCC T ss_conf 14678615640277300186432117781602586524761886442314333202507960032316899327962203 Q ss_pred CEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECCCCCCCCC Q ss_conf 2987887988699893783988998997783576643698689813560770774221001 Q gi|254780942|r 380 THINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKEDGALSMRK 440 (442) Q Consensus 380 ~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~~~~~~~~ 440 (442) |+|||+||||+|+.+++||+||++|+|||||+||+|||+++|+++|++|.++|+|+.++++ T Consensus 401 T~IGd~VFiGSnt~LVAPV~iG~gA~iaAGstvt~DVp~g~La~~R~~Q~~iegW~~~~~~ 461 (461) T TIGR01173 401 TIIGDGVFIGSNTQLVAPVKIGDGATIAAGSTVTKDVPEGALAIARARQRNIEGWVRPKKK 461 (461) T ss_pred CEECCCCEECCCCEEECCEEECCCCEECCCCEEECCCCCCCCEEECCCCEECCCCCCCCCC T ss_conf 4864887877676045543943811871241480234888526624556552276326789 No 2 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=100.00 E-value=0 Score=850.43 Aligned_cols=437 Identities=37% Similarity=0.653 Sum_probs=404.6 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 98740899945887723278887441258848579999999997799779999579889999973026972899985998 Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQ 80 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~ 80 (442) |++.|.|||||||+||||||.+||||+||+|||||+|+|+.+++++++++++|++|+.|++++++.+. +++|+.|++| T Consensus 2 ~~~~m~aVILAAGkGTRM~s~~PKvL~pi~gkPml~hvi~~l~~~g~~~ivvVvg~~~e~i~~~~~~~--~i~~v~Q~eq 79 (456) T PRK09451 2 LNSAMSVVILAAGKGTRMYSDLPKVLHTLAGKPMVQHVIDAANELGAAHVHLVYGHGGDLLKQTLKDE--PLNWVLQAEQ 79 (456) T ss_pred CCCCCEEEEECCCCCCCCCCCCCHHHHEECCHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC--CCEEEEECCC T ss_conf 98875399985778777799979575044898699999999987699809999699879999874458--8449995888 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCCC Q ss_conf 86144664100001234460589851541115332200000013586100100101221574404202333100100146 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREEND 160 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~d 160 (442) +||||||++|.+++.+ +++|||+|||+||++..+|+.|++.+.+ +++++++++++||++||+|..+++++.+|+|||| T Consensus 80 lGTghAV~~A~~~l~~-~~~vLVl~GD~PLi~~~tl~~l~~~~~~-~~~~llt~~~~dP~~YGrIi~~~g~v~~IVE~kd 157 (456) T PRK09451 80 LGTGHAMQQAAPFFAD-DEDILMLYGDVPLISVETLQRLRDAKPQ-GGIGLLTVKLDNPTGYGRITRENGKVVGIVEHKD 157 (456) T ss_pred CCCHHHHHHHHHHCCC-CCCEEEEECCCCCCCHHHHHHHHHHCCC-CCEEEEEEECCCCCCCEEEEECCCCEEEEEECCC T ss_conf 9729999998886066-8858999598015699999999862513-8859999976896546489943895799998267 Q ss_pred CCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHH-HEECCCCCCCCCCCCC Q ss_conf 675421000355600111250146777532888740011121148999973077179991721-4301112100000012 Q gi|254780942|r 161 ATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQ-EVCGCNNRYELSLIEN 239 (442) Q Consensus 161 ~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~-~~~gv~~~~~L~~~~~ 239 (442) ++++++.++++|+|+|+|+.+.|+++|+++.+++.++||||||+++.+.++|..+.++...++ +..|+|++.+|+.+++ T Consensus 158 a~~~e~~i~eiNaGIy~f~~~~l~~~L~~i~~~n~~gEyyLTDii~~~~~~g~~v~~v~~~~~~e~~GvN~~~~La~~e~ 237 (456) T PRK09451 158 ATDEQRQIQEINTGILVANGADLKRWLAKLTNNNAQGEYYITDIIALAYQEGREIVAVHPTRLSEVEGVNNRLQLARLER 237 (456) T ss_pred CCHHHHCCCEECCEEEEECHHHHHHHHHHCCCCCCCCCEEECHHHHHHHHCCCEEEEEECCCHHHHCCCCCHHHHHHHHH T ss_conf 98455211122024899579999998875067554562563056788874695799998288799446798999999899 Q ss_pred CCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEE-----------------CCCCCCCCC Q ss_conf 222211221222332102115279941123317841001223223442000211-----------------163134332 Q gi|254780942|r 240 IWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQ-----------------IRAFSYLEG 302 (442) Q Consensus 240 ~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~-----------------I~~~~~i~~ 302 (442) +|+.+..+.||.+|+++.+|+++++++++.++++++|+++|+|.+++.||++|+ |+++|+|++ T Consensus 238 ~~~~~~~~~~m~~GVti~dP~~~~I~~~v~ig~dv~I~~nvvi~G~v~IG~~v~Ig~g~~I~ns~Ig~~~~I~~~S~Ie~ 317 (456) T PRK09451 238 VYQAEQAEKLLLAGVMLRDPARFDLRGTLTHGRDVEIDTNVIIEGNVTLGHRVKIGAGCVLKNCVIGDDCEISPYSVVED 317 (456) T ss_pred HHHHHHHHHHHHCCCEEECCCEEEEECCEEECCCEEECCCEEEECCEEECCCEEECCCEEEECCEECCCCEEECCEEECC T ss_conf 99998999998679888158807770328965750775770884424757975982407985769846878942256506 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEE Q ss_conf 12111210021344320000025864432023102112478544874555663988998997980883445775456298 Q gi|254780942|r 303 VHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHI 382 (442) Q Consensus 303 ~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~i 382 (442) |+||++|.|||||||||+|+|+++++||||||+|||.||+|+|++|||||||++||++||||||||||||||.+||+|+| T Consensus 318 s~Ig~~~~IGPfA~lRp~t~i~~~~~iGnfvEiK~s~i~~g~k~~HlsYiGDa~iG~~~NiGAGtit~NyDG~~K~~t~i 397 (456) T PRK09451 318 ANLGAACTIGPFARLRPGAELLEGAHVGNFVEMKKARLGKGSKAGHLTYLGDAEIGDNVNIGAGTITCNYDGANKFKTII 397 (456) T ss_pred CEECCCCEECCCCCCCCCCEECCCCEEEEEEEEECCEECCCCEECCEEEECCCEECCCCEECCCEEEEECCCCCCCCCEE T ss_conf 63436716888643488762367888822899824597589770423366144765886888876997224876656487 Q ss_pred CCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECCCCCCCCCC Q ss_conf 78879886998937839889989977835766436986898135607707742210015 Q gi|254780942|r 383 NENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKEDGALSMRKK 441 (442) Q Consensus 383 g~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~~~~~~~~~ 441 (442) ||+||||+|++++||++||++|+|||||+|++|||+++|+++|++|.++++|..+++|| T Consensus 398 gd~~fiGsn~~lvapv~iG~~a~i~aGs~it~dVp~~~l~i~r~~q~~~~~~~~~~~kk 456 (456) T PRK09451 398 GDDVFVGSDTQLVAPVTVGKGATIGAGTTVTRDVAENELVISRVPQRHIQGWQRPVKKK 456 (456) T ss_pred CCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEECCCCCCCCCCCCCCCCCC T ss_conf 89829987844835849889979998987885069997777046623355556744479 No 3 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=826.75 Aligned_cols=438 Identities=45% Similarity=0.733 Sum_probs=412.8 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCH Q ss_conf 40899945887723278887441258848579999999997799779999579889999973026972899985998861 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGT 83 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GT 83 (442) .+.|||||||+||||++.+||.|.|++||||++|+|+.+...+.+++++|++|++|++++.+.... +++|+.|++|+|| T Consensus 2 ~~~~vILAAGkGTRMkS~lPKVLH~vaGkpMl~hVi~~a~~l~~~~i~vVvGh~ae~V~~~~~~~~-~v~~v~Q~eqlGT 80 (460) T COG1207 2 SLSAVILAAGKGTRMKSDLPKVLHPVAGKPMLEHVIDAARALGPDDIVVVVGHGAEQVREALAERD-DVEFVLQEEQLGT 80 (460) T ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC-CCEEEEECCCCCH T ss_conf 616999944887534579841110016800899999987626866189998687789998756354-7339996255873 Q ss_pred HHHHHHCCCCC-CCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCCCCC Q ss_conf 44664100001-23446058985154111533220000001-3586100100101221574404202-333100100146 Q gi|254780942|r 84 AHAVLTAQDAI-KPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIREEND 160 (442) Q Consensus 84 a~Ai~~a~~~l-~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~Ek~d 160 (442) |||+.+++++| ++..+++||+|||+||++...|+.|++.| ..+++++++++..+||++||++.++ ++.|.+|+|+|| T Consensus 81 gHAV~~a~~~l~~~~~g~vLVl~GD~PLit~~TL~~L~~~~~~~~~~~tvLt~~~~dP~GYGRIvr~~~g~V~~IVE~KD 160 (460) T COG1207 81 GHAVLQALPALADDYDGDVLVLYGDVPLITAETLEELLAAHPAHGAAATVLTAELDDPTGYGRIVRDGNGEVTAIVEEKD 160 (460) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEECCCCCEEEEEECCC T ss_conf 89999666764047897389996996667899999999863403775599998738998754699939986999997478 Q ss_pred CCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEC-HHHEECCCCCCCCCCCCC Q ss_conf 6754210003556001112501467775328887400111211489999730771799917-214301112100000012 Q gi|254780942|r 161 ATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVK-EQEVCGCNNRYELSLIEN 239 (442) Q Consensus 161 ~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~-~~~~~gv~~~~~L~~~~~ 239 (442) ++++|+.+.+||+|+|+|+.+.|++||+++.++|.+|||||||++..+..+|.++.++..+ .+++.|+|++++|+++++ T Consensus 161 A~~eek~I~eiNtGiy~f~~~~L~~~L~~l~nnNaqgEYYLTDvI~i~~~~g~~V~a~~~~d~~E~~GVN~R~qLa~~e~ 240 (460) T COG1207 161 ASEEEKQIKEINTGIYAFDGAALLRALPKLSNNNAQGEYYLTDVIAIARNEGEKVRAVHVDDEEEVLGVNDRVQLAEAER 240 (460) T ss_pred CCHHHHCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCEEHHHHHHHHHHCCCEEEEEECCCHHHHCCCCCHHHHHHHHH T ss_conf 99789508388206899867999999987256665574868999999985797699996696688627674999999999 Q ss_pred CCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCC-----------------CCCCC Q ss_conf 2222112212223321021152799411233178410012232234420002111631-----------------34332 Q gi|254780942|r 240 IWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAF-----------------SYLEG 302 (442) Q Consensus 240 ~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~-----------------~~i~~ 302 (442) +||.++++.||.+|+++++|+++||+.++.|+++++|+|+++|.+++.||++|+|+++ |++++ T Consensus 241 ~~q~r~~~~~m~~GVtl~dP~t~~i~~dv~ig~DvvI~p~v~l~G~t~ig~~v~iGpg~~i~ds~I~~~a~I~~~S~ie~ 320 (460) T COG1207 241 IMQRRIAEKLMLAGVTLIDPATTYIRGDVEIGRDVVIEPNVILEGNTVIGDNVVIGPGSVIKDSVIGDNAVIKAYSVIEG 320 (460) T ss_pred HHHHHHHHHHHHCCCEEECCCEEEECCCEEECCCEEEECCCEEEEEEEECCCEEECCCCEEEEEEECCCCEEEECCEEEC T ss_conf 99999999999769599678848873717987814991484894158967964999996787538769988975104305 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEE Q ss_conf 12111210021344320000025864432023102112478544874555663988998997980883445775456298 Q gi|254780942|r 303 VHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHI 382 (442) Q Consensus 303 ~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~i 382 (442) |.||++|.||||+|+||+|.|+++++||||||+|||.|++|+|++||+|+|||.||++||||||||||||||.+||+|+| T Consensus 321 s~vg~~~~VGPfA~LRPg~~L~~~~hIGNFVEvK~a~ig~gsKa~HLtYlGDA~iG~~~NiGAGtItcNYDG~nK~~T~I 400 (460) T COG1207 321 STVGEGATVGPFARLRPGAVLGADVHIGNFVEVKKATIGKGSKAGHLTYLGDAEIGENVNIGAGTITCNYDGKNKFKTII 400 (460) T ss_pred CEECCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEECCCEECCCCEECCCEEEECCCCCCCCEEEE T ss_conf 58547963378310179685267976742599843560688633532342133206874214425998478863320064 Q ss_pred CCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECCCCCCCCCCC Q ss_conf 788798869989378398899899778357664369868981356077077422100159 Q gi|254780942|r 383 NENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKEDGALSMRKKK 442 (442) Q Consensus 383 g~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~~~~~~~~~~ 442 (442) |+++|||+|++++++|+||++++|||||+||+|||+++|+++|++|.+||+|+.+.+++| T Consensus 401 Gd~vFiGSns~LVAPV~IGd~a~iaAGStIT~DVp~~aLai~RarQ~~~egw~~~~~~~~ 460 (460) T COG1207 401 GDNVFIGSNSQLVAPVTIGDGATIAAGSTITKDVPEGALAISRARQTNKEGWVRKKKKKK 460 (460) T ss_pred CCCCEECCCCCEEEEEEECCCCEECCCCEECCCCCCCCEEEEECCEEECCCCCCCCCCCC T ss_conf 688577668718864896698488146368365799844674044143244214565699 No 4 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=436.26 Aligned_cols=335 Identities=18% Similarity=0.226 Sum_probs=260.2 Q ss_pred CCCCEEEEECCCCCCCCCC---CCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC---C----- Q ss_conf 8740899945887723278---887441258848-579999999997799779999579889999973026---9----- Q gi|254780942|r 2 KRKRLAIVLAAGRGHRMKS---SSSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPP---T----- 69 (442) Q Consensus 2 ~~~~~AiILAaG~GtRl~p---~~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~---~----- 69 (442) +++|.|||||||+|||||| .+||||+||+|| |||+|+|+++.++|+++++++++|+.+.+.+|+... + T Consensus 1 ~~~MkAvILAgG~GtRlrPLT~~rPK~llPv~gk~~lId~~L~~~~~~gI~~v~v~~~~~~~~i~~h~g~g~~w~l~~~~ 80 (381) T PRK05293 1 KKEMLAMILAGGQGTRLGKLTKNIAKPAVPFGGKYRIIDFTLSNCANSGIYTVGVLTQYQPLELNNHIGIGSPWDLDRKN 80 (381) T ss_pred CCCCEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCC T ss_conf 98509999788887672544589742033778915128999999998699999998689979999987176503643444 Q ss_pred --CEEE--EEECCCC---CCHHHHHHHCCCCCCCCC-CEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEEC--CC Q ss_conf --7289--9985998---861446641000012344-60589851541115332200000013-586100100101--22 Q gi|254780942|r 70 --LSVE--YYIQDCQ---QGTAHAVLTAQDAIKPGY-DDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNA--DN 138 (442) Q Consensus 70 --~~i~--~v~q~~~---~GTa~Ai~~a~~~l~~~~-~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~--~d 138 (442) ..+. |..|.++ +|||||++++++++.... +.|||++||+ +...++++|+++|+ +++++|+++.++ ++ T Consensus 81 ~g~~il~~~~~~~~~~~~~Gta~al~~~~~~i~~~~~e~vlv~~GD~--i~~~dl~~ll~~H~~~~ad~Ti~~~~v~~~~ 158 (381) T PRK05293 81 GGVTILPPYSESSGGKWYKGTAHAIYQNIEYIDQYDPEYVLILSGDH--IYKMDYDKMLDYHKEKEADVTIAVIEVPWEE 158 (381) T ss_pred CCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCE--EECCCHHHHHHHHHHCCCCCCEEEEEECCCC T ss_conf 87798343001467766674289999999986448999899976998--8678999999999864666503789805126 Q ss_pred CCCCCEEEEC-CCCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEE Q ss_conf 1574404202-333100100146675421000355600111250146777532888740011121148999973077179 Q gi|254780942|r 139 PKGYGRLLIK-NNEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIAS 217 (442) Q Consensus 139 p~~yGvV~~d-~~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~ 217 (442) |++||++..| +++|++|.||| +++.++++|+|+|+|+++.|+++|.+....+.+.++|++|+++.|+++|.++.+ T Consensus 159 ~~~yGvv~~d~~g~I~~~~EKp----~~~~s~l~n~GiYif~~~~L~~~l~~~~~~~~~~~~~~~d~i~~ll~~~~~v~~ 234 (381) T PRK05293 159 ASRFGIMNTDEEMRIVEFEEKP----KFPKSNLASMGIYIFNWKRLKEYLEEDEKNPNSSHDFGKDVIPLYLEEGEKLYA 234 (381) T ss_pred CCCCEEEEECCCCCEEEEEECC----CCCCCCCCCCCEEEECHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 4416089987999799999788----998766543316995699999999875138863346689999999977996899 Q ss_pred EEECHHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCC Q ss_conf 99172143011121000000122222112212223321021152799411233178410012232234420002111631 Q gi|254780942|r 218 IDVKEQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAF 297 (442) Q Consensus 218 ~~~~~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~ 297 (442) |..+++| .++++++++..++..+..... ......++..|.....+.|++.+++++.| T Consensus 235 y~~~g~w-~dig~~~~~~~an~~lL~~~~-------------~~~~~~~~~~i~~~~~~~~p~~i~~~~~I--------- 291 (381) T PRK05293 235 YPFEGYW-KDVGTIESLWEANMELLRPEN-------------PLNLRDRNWRIYSVNPILPPQYIAENAKV--------- 291 (381) T ss_pred EEECCEE-EECCCHHHHHHHHHHHHCCCC-------------CCCCCCCCCCEECCCCCCCCEEECCCCEE--------- T ss_conf 9855868-888988999999999866676-------------43345767614146877899298799789--------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECC Q ss_conf 34332121112100213443200000258644320231021124785448745556639889989979808834 Q gi|254780942|r 298 SYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCN 371 (442) Q Consensus 298 ~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n 371 (442) .++.|+++|.|.... ++|+|+++|.||.+++|+||+||++++|++.++|.+|+||++|.||+||++.| T Consensus 292 ---~~s~i~~gc~I~g~V---~nSvIg~~v~Ig~ga~I~nSiI~~~~~Ig~~~~I~nsIi~~~~~Ig~~~~~~~ 359 (381) T PRK05293 292 ---KNSLVVEGCEVYGTV---KHSVLFQGVQVGEGSIVKDSVIMPGAKIGENVVIERAIIGENAVIGDGVIGVI 359 (381) T ss_pred ---ECCEECCCCEEEEEE---ECCEECCCCEECCCCEEECCEECCCCEECCCCEEEEEEECCCCEECCCCEEEC T ss_conf ---688890798996267---54898899899999999778985969999899990129959799997968957 No 5 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=100.00 E-value=0 Score=436.04 Aligned_cols=343 Identities=23% Similarity=0.326 Sum_probs=264.4 Q ss_pred EEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECC-CHHHHHHHHCC---CCCEEEEEECCC Q ss_conf 999458877232788---87441258848579999999997799779999579-88999997302---697289998599 Q gi|254780942|r 7 AIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGY-GAEEITRINFP---PTLSVEYYIQDC 79 (442) Q Consensus 7 AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~-~~e~i~~~~~~---~~~~i~~v~q~~ 79 (442) |+|||||+||||||+ +||+|+||+|||+|+|.|+.+.++|++||-+|+++ ..|+|+++... .+.+++|+.|.+ T Consensus 2 aliL~aG~GTRLRPLTFt~pK~LiPvAnKPi~~Yaie~~~~AGI~diGIvvg~~~~e~i~~~~g~g~~fg~kityI~Q~~ 81 (361) T TIGR01208 2 ALILAAGKGTRLRPLTFTRPKQLIPVANKPILQYAIEDLIEAGITDIGIVVGPETGEEIKEIVGEGERFGAKITYIVQGE 81 (361) T ss_pred CEECCCCCCCCCCCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCEEEEEEECCCCCHHHEEEECCCCCCCEEEEEEECCC T ss_conf 24425878676777665788430661687323676888874697699998469882122023248830232898982278 Q ss_pred CCCHHHHHHHCCCC-CCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEECC-C-CCEEC Q ss_conf 88614466410000-123446058985154111533220000001-35861001001012215744042023-3-31001 Q gi|254780942|r 80 QQGTAHAVLTAQDA-IKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIKN-N-EIIAI 155 (442) Q Consensus 80 ~~GTa~Ai~~a~~~-l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d~-~-~v~~i 155 (442) |+|-||||+.|+++ +.+ ++|+|--||+ |+ .+.+.++++++ +++.++.|+..+|+||++||+..+++ | +|+++ T Consensus 82 plGlAHAv~~A~~fGlgd--~~FvvYLGDN-l~-~~gi~~fv~~F~~~~~da~ILL~~V~dP~~FGVA~l~d~G~~i~~L 157 (361) T TIGR01208 82 PLGLAHAVYVARDFGLGD--EDFVVYLGDN-LI-QDGIKRFVKSFEEKDYDALILLKKVEDPTAFGVAELEDDGKRILKL 157 (361) T ss_pred CCHHHHHHHHHHHHCCCC--CCEEEECCCC-HH-HHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCEEEEECCCCEEEEE T ss_conf 760467645788847899--8607973742-10-4237899885331480030110438878845558992489799998 Q ss_pred CCCCCCCCCCCCCCCCCCCHHEECC-HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCC Q ss_conf 0014667542100035560011125-014677753288874001112114899997307717999172143011121000 Q gi|254780942|r 156 REENDATDEERKIHYCNSGLMAIDG-LYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYEL 234 (442) Q Consensus 156 ~Ek~d~~~~~~~~~lin~GiY~f~~-~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L 234 (442) +||| .++.|||+-.|+|.|++ +.+++.+..++++ ++|||++||.+++|+++|++|.+..+.+|| .++..+.|| T Consensus 158 vEKP----k~PPSNLAvvGlY~F~pPe~if~~~~~~kPS-wRGElEITD~IQ~lIe~Gy~V~~~~v~GWW-kDTGk~eDL 231 (361) T TIGR01208 158 VEKP----KEPPSNLAVVGLYMFRPPELIFEAIKNIKPS-WRGELEITDAIQYLIEKGYKVGGSKVKGWW-KDTGKPEDL 231 (361) T ss_pred EECC----CCCCCCCEEEEEEECCCHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHHCCCEEEEEEEEEEE-CCCCCCHHH T ss_conf 8338----6578770123345338777988898528887-766215775775653158377779997887-015980238 Q ss_pred CCCCCCCCC-CCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCC-CCC Q ss_conf 000122222-112212223321021152799411233178410012232234420002111631343321211121-002 Q gi|254780942|r 235 SLIENIWQS-RYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKT-IIG 312 (442) Q Consensus 235 ~~~~~~~~~-~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~-~Ig 312 (442) .++++++.+ .+.+.. +|. .. ...-|...|.|++++.|.-+.+|.+.+.||++|.| +|| .|| T Consensus 232 L~AN~~iLd~~~~~~v--~G~-~~--~~s~i~GRV~v~~gakid~ns~i~GPa~IG~d~~I------------~nsGyiG 294 (361) T TIGR01208 232 LDANRLILDEEVEREV--KGE-VD--DESKIEGRVEVGEGAKIDVNSVIRGPAVIGEDCII------------ENSGYIG 294 (361) T ss_pred HHHHHHHHHHHCCHHC--CCC-EE--CCCEEEEEEEECCCCEECCCCEEECCEEECCCCEE------------ECCCCCC T ss_conf 9999997443223200--661-32--26578840790897884237778706087795278------------2686117 Q ss_pred CCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCC-CCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEE Q ss_conf 13443200000258644320231021124785448745-55663988998997980883445775456298788798 Q gi|254780942|r 313 PFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLS-YVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFI 388 (442) Q Consensus 313 ~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~-~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~i 388 (442) |+|+|+++++|- ++|+.+|+|+++|.|.... -|-||+||.++.|=.+- .-...++-+|||...+ T Consensus 295 ------PYtSig~~~~I~-d~~vE~S~vldes~I~~v~~Ri~dS~iG~~~~i~~~~-----~~P~~~rL~igd~S~v 359 (361) T TIGR01208 295 ------PYTSIGEGVVIR-DAEVEHSIVLDESVIEGVEKRIVDSVIGKKVRIKGNR-----RRPKALRLIIGDYSQV 359 (361) T ss_pred ------CEEEECCCEEEE-CCCCCCEEEECCCEEEEHHHHHHHEECCCCCEEECCC-----CCCCCCCCEEECCCEE T ss_conf ------707725753874-1630203564421143103202230117820784375-----7841022147054057 No 6 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=421.38 Aligned_cols=349 Identities=28% Similarity=0.409 Sum_probs=277.6 Q ss_pred CCEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC---CCEEEEEEC Q ss_conf 408999458877232788---87441258848579999999997799779999579889999973026---972899985 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPP---TLSVEYYIQ 77 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~---~~~i~~v~q 77 (442) +|.|||||||+||||||+ +||||+||+|||||+|+|+.|+++|+++++++++|+.+++++++.+. +.++.|+.| T Consensus 1 ~mkavILagG~GtRLrPlT~~~PKPllpI~gkPii~~~l~~L~~~Gv~eivi~~~y~~~~i~~~~~d~~~~~~~I~y~~e 80 (358) T COG1208 1 PMKAVILAGGYGTRLRPLTDDRPKPLLPIAGKPLIEYVLEALAAAGVEEIVLVVGYLGEQIEEYFGDGEGLGVRITYVVE 80 (358) T ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCCEEEEEC T ss_conf 93599994665666355448997446579986459999999998799789998774689999998626356986599716 Q ss_pred CCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC-CCCCCEEEECCCCCCCCEEEEC-CC-CCEE Q ss_conf 9988614466410000123446058985154111533220000001358-6100100101221574404202-33-3100 Q gi|254780942|r 78 DCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQG-YSIAVVGFNADNPKGYGRLLIK-NN-EIIA 154 (442) Q Consensus 78 ~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~-~~~~i~~~~~~dp~~yGvV~~d-~~-~v~~ 154 (442) ++++|||+||+++.+++.. ++|+++|||+ +++.++..+++.|+++ ...++...++++|+.||++..+ ++ ++.+ T Consensus 81 ~~~lGTag~l~~a~~~l~~--~~f~v~~GDv--~~~~dl~~l~~~~~~~~~~~~~~~~~~~~~~~~Gvv~~~~~~~~v~~ 156 (358) T COG1208 81 KEPLGTAGALKNALDLLGG--DDFLVLNGDV--LTDLDLSELLEFHKKKGALATIALTRVLDPSEFGVVETDDGDGRVVE 156 (358) T ss_pred CCCCCCHHHHHHHHHHHCC--CCEEEEECCC--CCCCCHHHHHHHHHHCCCCCEEEEEECCCCCCCCEEEECCCCCEEEE T ss_conf 8766648999999886079--9789997886--22468899999987356642799875277655863997599848988 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCC Q ss_conf 10014667542100035560011125014677753288874001112114899997307717999172143011121000 Q gi|254780942|r 155 IREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYEL 234 (442) Q Consensus 155 i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L 234 (442) |.|||.. ++..++++|+|+|+|+++.+ ++++........ .|+++.+.+++..+.++..++ .|.++++++++ T Consensus 157 f~ekp~~--~~~~~~~in~Giyi~~~~v~-~~i~~~~~~~~~-----~~~~~~l~~~~~~v~~~~~~g-~W~dig~p~d~ 227 (358) T COG1208 157 FREKPGP--EEPPSNLINAGIYIFDPEVF-DYIEKGERFDFE-----EELLPALAAKGEDVYGYVFEG-YWLDIGTPEDL 227 (358) T ss_pred EEECCCC--CCCCCCEEEEEEEEECHHHH-CCCCCCCCCCCH-----HHHHHHHHHCCCCEEEEEECC-EEEECCCHHHH T ss_conf 9972667--67888669999999897894-301335743214-----778999987699569997578-79969998999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 00012222211221222332102115279941123317841001223223442000211163134332121112100213 Q gi|254780942|r 235 SLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPF 314 (442) Q Consensus 235 ~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~ 314 (442) ..+++.+....... +...+ .....+ ++. +.|. ..++|+++|.||++ T Consensus 228 ~~a~~~~~~~~~~~----------~~~~~-------~~~~~~-----------~~~-~~i~-----gp~~ig~~~~i~~~ 273 (358) T COG1208 228 LEANELLLRGDGKS----------PLGPI-------EEPVVI-----------IRS-AYII-----GPVVIGPGAKIGPG 273 (358) T ss_pred HHHHHHHHHHCCCC----------CCCCC-------CCCCCC-----------CCC-EEEE-----CCEEECCCCEECCC T ss_conf 99999998512235----------65443-------334556-----------662-3885-----89498269789888 Q ss_pred CCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEE Q ss_conf 44320000025864432023102112478544874555663988998997980883445775456298788798869989 Q gi|254780942|r 315 ARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSL 394 (442) Q Consensus 315 ~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i 394 (442) +.|+++|+|+++|.||+++++++|+|++++.++|.+|++||+||+||+||+ + . .+|| +.+|.++.+ T Consensus 274 ~~i~~~~~ig~~~~I~~~~~i~~Sii~~~~~i~~~~~i~~sIi~~~~~ig~-----~-------~-~i~d-~~~g~~~~i 339 (358) T COG1208 274 ALIGPYTVIGEGVTIGNGVEIKNSIIMDNVVIGHGSYIGDSIIGENCKIGA-----S-------L-IIGD-VVIGINSEI 339 (358) T ss_pred CEECCCCEECCCCEECCCCEEEEEEEECCCEECCCCEECCEEECCCCEECC-----C-------C-EEEE-EEECCCEEE T ss_conf 486898789999888998778754887787988888882069818859998-----6-------1-5701-586366086 Q ss_pred ECCEEECCCCEECCCCEEC Q ss_conf 3783988998997783576 Q gi|254780942|r 395 IAPITIGQGTYVASGSIIT 413 (442) Q Consensus 395 ~~gv~ig~~~~i~ag~~v~ 413 (442) .+|+++|.++.+.+++++. T Consensus 340 ~~g~~~~~~~~~~~~~~~~ 358 (358) T COG1208 340 LPGVVVGPGSVVESGEIED 358 (358) T ss_pred CCCEEECCCCCCCCCCCCC T ss_conf 2761847860615763039 No 7 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=100.00 E-value=0 Score=385.93 Aligned_cols=316 Identities=20% Similarity=0.236 Sum_probs=225.4 Q ss_pred CEEEEECCCCCCCCCCCC---CCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHC--CCC-C------- Q ss_conf 089994588772327888---7441258848-5799999999977997799995798899999730--269-7------- Q gi|254780942|r 5 RLAIVLAAGRGHRMKSSS---SKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINF--PPT-L------- 70 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~~---pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~--~~~-~------- 70 (442) ++|||||||+||||+|+| +||.+|++|| .|||++|+|+.++|+.+|+|+|+|+++.+.+|+. ... . T Consensus 1 ~la~iLAGG~GsRL~pLT~~rAKPAVpFGGkYRiIDF~LSNc~NSGi~ri~VLTQY~~~sL~~Hi~iG~~W~~~~~~~~~ 80 (421) T TIGR02091 1 VLAMILAGGQGSRLSPLTKRRAKPAVPFGGKYRIIDFALSNCINSGINRIGVLTQYKSHSLNRHIQIGRGWDFDGPEIGG 80 (421) T ss_pred CEEEEECCCCCCCCCHHHHHHCCCCCCCCCCEEEECCCHHHHHHCCCCEEEEEEECCHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 97889438887546133465257765556301452100135554154457888505167888884068765677442476 Q ss_pred EEEEE--ECC--C--C---CCHHHHHHHCCCCCCC-C-CCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEEC-- Q ss_conf 28999--859--9--8---8614466410000123-4-46058985154111533220000001-3586100100101-- Q gi|254780942|r 71 SVEYY--IQD--C--Q---QGTAHAVLTAQDAIKP-G-YDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNA-- 136 (442) Q Consensus 71 ~i~~v--~q~--~--~---~GTa~Ai~~a~~~l~~-~-~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~-- 136 (442) -|+.+ .|. + + +|||||||+.+.+|+. . .+.+|||+||+ |..+|.++|+++| .+++++||.+.+| T Consensus 81 fv~~Lpa~~~~~~~~~~WY~GTADAvYQNl~~i~~~~~p~yvlILsGDH--iYkMDy~~mL~~H~e~~Ad~Tia~~~Vp~ 158 (421) T TIGR02091 81 FVELLPAQQRFEESGKDWYQGTADAVYQNLDLIEETYDPEYVLILSGDH--IYKMDYEKMLDYHIEKGADLTIACIPVPR 158 (421) T ss_pred CEEEECCCEEECCCCCCCEEECHHHHHHHHHHHHCCCCCCEEEEECCCC--EECCCHHHHHHHHHHCCCCEEEEEEECCH T ss_conf 2676437511126788532112378899999874016924899934661--01048789999998567987998542785 Q ss_pred CCCCCCCEEEEC-CCCCEECCCCCCCCCC---CCCCC---------------CCCCCHHEECCHHHHHHHHHCCCCCCCC Q ss_conf 221574404202-3331001001466754---21000---------------3556001112501467775328887400 Q gi|254780942|r 137 DNPKGYGRLLIK-NNEIIAIREENDATDE---ERKIH---------------YCNSGLMAIDGLYIMDWLLQIKKNKVSQ 197 (442) Q Consensus 137 ~dp~~yGvV~~d-~~~v~~i~Ek~d~~~~---~~~~~---------------lin~GiY~f~~~~L~~~l~~l~~~~~~g 197 (442) +++++||++..| +++|++|.|||..++. .+... |+++|||+|+.++|+++|.+-..++.+. T Consensus 159 ~eAs~fGvm~vD~~g~i~~F~EKP~~P~~~~~~~~~~s~~~~~~~~~~~~~yLASMGiYiF~~~~L~~~L~~d~~~~~s~ 238 (421) T TIGR02091 159 KEASRFGVMQVDEDGRIVDFEEKPANPPSIPGDPDKSSDLGLSKLDAQKGPYLASMGIYIFDKDVLKELLEEDAKDPESS 238 (421) T ss_pred HHCCCCCEEEECCCCCEEEEECCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCEEECCEEEECHHHHHHHHHHHHCCCCCC T ss_conf 55152872888789968888826888866777767744404551453347744755513645789999999975387677 Q ss_pred HHHHHHHHHHHHHHHCCEEEEEEC----------HHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEE Q ss_conf 111211489999730771799917----------2143011121000000122222112212223321021152799411 Q gi|254780942|r 198 EYYLTDIIEKARLDGKSIASIDVK----------EQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHD 267 (442) Q Consensus 198 E~yltDii~~l~~~g~~i~~~~~~----------~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~ 267 (442) +-|-.|++|.++.+|+++.+|.++ .+| -+|+|-..+.+++-.|-.... -.-|+.-..+++ T Consensus 239 ~DFGkdiIP~~~~~g~~~~Ay~F~~s~v~g~~~e~YW-rDVGTidsfweANmdL~~~~~---------PqvP~f~lYd~~ 308 (421) T TIGR02091 239 HDFGKDIIPKLLEEGKSVQAYDFSQSCVPGEEKEGYW-RDVGTIDSFWEANMDLVSEVK---------PQVPPFDLYDRK 308 (421) T ss_pred CCCCHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCE-EEECHHHHHHHHHHHHCCCCC---------CCCCCCCCCCCC T ss_conf 7544356889850897189980467605687888762-230040656675187627888---------888877763777 Q ss_pred EEECCCCCEECCCCCCCCCCCCEEEECCCCCC--CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 23317841001223223442000211163134--33212111210021344320000025864432023102112478 Q gi|254780942|r 268 TIIQPDTVIEPHVFFGCGVSIENYVQIRAFSY--LEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 268 ~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~--i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) +.|.......|++..-... ++.. +.+|.|+++|.|.- |.| .+|+|+.+++|+++++|++|+||.+ T Consensus 309 WpI~t~~~~~pPa~f~~~~---------~~~~G~v~~Slv~~Gc~i~g-~~v-~~SvL~~~v~i~~~~~v~dsVim~~ 375 (421) T TIGR02091 309 WPIYTYNEFLPPAKFVDSD---------EGRKGQVSDSLVAEGCIISG-ATV-SHSVLGSRVRIGSGSTVEDSVIMGD 375 (421) T ss_pred CCCCCCCCCCCCCEEECCC---------CCCCCEEEEEEEECCCEECC-CEE-EEEEEECCCEECCCCEEEEEEECCC T ss_conf 8755788888886045576---------77842074407756718807-678-7228526868546557988688798 No 8 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=377.22 Aligned_cols=333 Identities=17% Similarity=0.171 Sum_probs=217.9 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC----C---E Q ss_conf 7408999458877232788---87441258848-5799999999977997799995798899999730269----7---2 Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPT----L---S 71 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~----~---~ 71 (442) ++++|||||||+||||||+ +||||+||+|| |||+|+|++|.++|+++++++++|+.+.+.+++.... . - T Consensus 4 ~~~~aVILAgG~GtRLrPLT~~rpKpllPvagkyplIdf~Ls~l~~aGi~~v~v~~~y~~~sl~~hi~~~w~~~~~~~~~ 83 (409) T PRK00844 4 PKVLGIVLAGGEGKRLMPLTADRAKPAVPFGGSYRLIDFVLSNLVNAGLLRICVLTQYKSHSLDRHISQGWRLSGLLGEY 83 (409) T ss_pred CCEEEEEECCCCCCCCCCCCCCCCHHCCEECCCCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCE T ss_conf 87699998899877443000698044006588360789999999876998899990778799999986272656767735 Q ss_pred EEEEECCC------CCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEEC--CCCCC Q ss_conf 89998599------886144664100001234-460589851541115332200000013-586100100101--22157 Q gi|254780942|r 72 VEYYIQDC------QQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNA--DNPKG 141 (442) Q Consensus 72 i~~v~q~~------~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~--~dp~~ 141 (442) +.++.+++ .+|||||++++++++... .+.|+|++||+ +...++++++++|+ +++++|++..++ ++|++ T Consensus 84 i~~~~~~~~~~~~~~~Gtadai~~~~~~i~~~~~d~vlv~~gD~--i~~~dl~~~l~~H~~~~a~~Ti~~~~v~~~~~~~ 161 (409) T PRK00844 84 ITPVPAQQRLGKRWYEGSADAIYQSLNLIEDEDPDYVVVFGADH--VYRMDPEQMVEFHIESGAGLTVAGIRVPREEASA 161 (409) T ss_pred EEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCE--EECCCHHHHHHHHHHCCCCCEEEEEEECHHHCCC T ss_conf 87567212467654557789999999887417998899978998--9707999999999756886347899926687453 Q ss_pred CCEEEEC-CCCCEECCCCCCCCC---CCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEE Q ss_conf 4404202-333100100146675---421000355600111250146777532888740011121148999973077179 Q gi|254780942|r 142 YGRLLIK-NNEIIAIREENDATD---EERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIAS 217 (442) Q Consensus 142 yGvV~~d-~~~v~~i~Ek~d~~~---~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~ 217 (442) ||++..| +++|++|.|||...+ .++...++|+|+|+|+.+.|.+.|......+.+.+.+.+|+++.++++|. +.. T Consensus 162 ~Gvv~~d~~g~V~~f~EKP~~~~~~~~~~~~~l~n~GiYvf~~~~L~~~l~~~~~~~~~~~d~~~diip~l~~~g~-~~~ 240 (409) T PRK00844 162 FGVIEVDEDGRIRGFLEKPADPPGLPDDPDETLASMGNYIFTTDVLIDALREDAEDEDSSHDMGGDIIPKLVPRGE-AAV 240 (409) T ss_pred CCEEEECCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC-EEE T ss_conf 6789988999699998157887677897120112013599519999999987501677633206778999986388-689 Q ss_pred EEEC-----------HHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCC------ Q ss_conf 9917-----------21430111210000001222221122122233210211527994112331784100122------ Q gi|254780942|r 218 IDVK-----------EQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHV------ 280 (442) Q Consensus 218 ~~~~-----------~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~------ 280 (442) |.+. ..+|.+++++.++.+++..+.+..... .+ ...+..|.....-.|++ T Consensus 241 y~~~~~~v~g~~~~~~GYW~digt~~~y~~an~dlL~~~~~~------~l-------~~~~~~i~t~~~~~pp~~~~~~~ 307 (409) T PRK00844 241 YDFSDNEVPGATERDRGYWRDVGTIDSFYDAHMDLLSVHPVF------NL-------YNRDWPIRTYSPNLPPAKFVDGG 307 (409) T ss_pred EEECCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCHH------HC-------CCCCCCCCCCCCCCCCEEEECCC T ss_conf 984145445643456505897899899999889986788423------21-------37788411567888983898589 Q ss_pred ---CCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEE Q ss_conf ---32234420002111631343321211121002134432000002586443202310211247854487455566398 Q gi|254780942|r 281 ---FFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVV 357 (442) Q Consensus 281 ---~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~i 357 (442) .+-.++.|+++|.|. ++.+++|+||++|+||++| .|+||+|++++.|++.+.|.+|+| T Consensus 308 ~~~~~v~~s~I~~g~~I~-g~~V~nSvIg~~v~Ig~ga------------------~I~nSII~~~~~Ig~~a~I~~sIi 368 (409) T PRK00844 308 GNEGIAIDSIVSAGSIIS-GGSVRNSVLSPNVRVDSGA------------------IVEGSVLMDGVRIGRGAVVRRAIL 368 (409) T ss_pred CCEEEEECCEECCCEEEE-CCEEECCEECCCCEECCCC------------------EEECCEEECCCEECCCCEEEEEEE T ss_conf 835787258887970996-9988888977998999998------------------995189929499999999980198 Q ss_pred CCCCEECCCEEEC Q ss_conf 8998997980883 Q gi|254780942|r 358 GKNVNIGAGTITC 370 (442) Q Consensus 358 G~~~~ig~g~i~~ 370 (442) |+||.||+||++. T Consensus 369 dk~v~Ig~gt~I~ 381 (409) T PRK00844 369 DKNVVVPPGTQIG 381 (409) T ss_pred CCCCEECCCCEEC T ss_conf 9998989998989 No 9 >PRK04928 consensus Probab=100.00 E-value=0 Score=374.75 Aligned_cols=338 Identities=15% Similarity=0.181 Sum_probs=228.7 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC----CEEEE Q ss_conf 7408999458877232788---87441258848-5799999999977997799995798899999730269----72899 Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPT----LSVEY 74 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~----~~i~~ 74 (442) .+++|||||||+||||||+ +||||+||+|+ |||+|+|++|.++|+++++++++|+.+.+.+|+.... ....| T Consensus 2 ~~~~aiILagG~gtRLrPLT~~rpKp~lP~~g~yrlIdf~Ls~l~~sGi~~V~v~~~~~~~sl~~hl~~gw~l~~~~~~~ 81 (405) T PRK04928 2 AGVLGMILAGGEGSRLRPLTESRTKPAVPFGGSYRLIDFALNNFVNADLMRIYVLTQFKSQSLYIHMKKGWNINGITDRF 81 (405) T ss_pred CCEEEEEECCCCCCCCCHHHCCCCHHHEEECCCCCEEHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCE T ss_conf 96599997899877332321598565238788571318999999987998899990888799999974663766413517 Q ss_pred EEC---------CCCCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEEC--CCCCC Q ss_conf 985---------99886144664100001234-460589851541115332200000013-586100100101--22157 Q gi|254780942|r 75 YIQ---------DCQQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNA--DNPKG 141 (442) Q Consensus 75 v~q---------~~~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~--~dp~~ 141 (442) +.. +..+|||||+++++++++.. .+.|||++||+ +.+.+++++++.|+ +++++|+..+++ ++|++ T Consensus 82 ~~~~p~~~~~~~~~~~Gta~ai~~~~~~l~~~~~e~~lv~~gD~--l~~~Dl~~ll~~H~~~~a~iTi~~~~~~~e~~~~ 159 (405) T PRK04928 82 IDPIPAQMRTGKRWYEGTADAIYQNLRFIELSEPEHVCIFGSDH--IYKMDIKQMLDFHKEKEAALTVSALRMPLEEASQ 159 (405) T ss_pred EEECCHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCE--EECCCHHHHHHHHHHCCCCCEEEEEECCHHCCCC T ss_conf 87362241257644565689999999998546998899978988--9668999999999856887316899824112553 Q ss_pred CCEEEEC-CCCCEECCCCCCCC---CCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEE Q ss_conf 4404202-33310010014667---5421000355600111250146777532888740011121148999973077179 Q gi|254780942|r 142 YGRLLIK-NNEIIAIREENDAT---DEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIAS 217 (442) Q Consensus 142 yGvV~~d-~~~v~~i~Ek~d~~---~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~ 217 (442) ||++..| +++|++|.|||... +.+...+++|+|+|+|+++.|+..|.+-.....+.+.+.+|+++.++.+| ++.. T Consensus 160 ~Gvv~~D~~g~V~~f~EKP~~~~~~~g~~~~~li~~GiYvf~~~vl~~~l~~~~~~~~~~~d~~~dvip~l~~~g-~~~~ 238 (405) T PRK04928 160 FGVIEVDAEGRMIGFEEKPANPKSIPGDPDHALVSMGNYIFEAEVLFKELIEDAANENSSHDFGKDIIPKMFPRG-KVFV 238 (405) T ss_pred CCEEEECCCCCEEEEEECCCCCCCCCCCCCCEEEECCEEEECHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHC-CEEE T ss_conf 776888899989999976898756679833117503226751999999998650147876741578899998728-8689 Q ss_pred EEEC---------HHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCC----C Q ss_conf 9917---------21430111210000001222221122122233210211527994112331784100122322----3 Q gi|254780942|r 218 IDVK---------EQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFG----C 284 (442) Q Consensus 218 ~~~~---------~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig----~ 284 (442) |.+. +.+|.+++++.++.+++..+.+......+ ...+..+.....-.|++.+. . T Consensus 239 y~~~~~~i~g~~~~gYW~digt~~~y~~an~dlL~~~~~~~l-------------~~~~~~i~t~~~~~pp~~~~~~~~~ 305 (405) T PRK04928 239 YDFSTNRIPGEKEEVYWRDVGTIDSYWQAHMDLLKKDAPFSL-------------YNRKWPLHTYYPPLPPATFVDSDNG 305 (405) T ss_pred EEECCCCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCCHHHC-------------CCCCCCEEECCCCCCCEEEECCCCC T ss_conf 980244335756650589789878999999998668856652-------------4778732413788898389868896 Q ss_pred CCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEEC Q ss_conf 44200021116313433212111210021344320000025864432023102112478544874555663988998997 Q gi|254780942|r 285 GVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIG 364 (442) Q Consensus 285 ~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig 364 (442) .+.| .+|.|+++|.|. .+.| ++|+|+++|+||.+++|+||+|+.++.|++.++|.+|+|+++|.|| T Consensus 306 ~~~v------------~~s~I~~G~~I~-G~~I-~~SvIg~~~~Ig~ga~I~nSII~~~~~Ig~ga~i~~sIidk~v~Ig 371 (405) T PRK04928 306 KVQI------------IDSLISGGSYIR-GSRI-EKSVLGFRSNIASACDISESVLLGDVKIGEGCVLRRVIIDKDVEIA 371 (405) T ss_pred EEEE------------ECCEECCCCEEE-CCEE-ECCEECCCCEECCCCEEECCEEECCCEECCCCEEECCEECCCCEEC T ss_conf 2798------------566887887898-9998-7689769989999949955899692999999999654988999999 Q ss_pred CCEEEC Q ss_conf 980883 Q gi|254780942|r 365 AGTITC 370 (442) Q Consensus 365 ~g~i~~ 370 (442) +||++. T Consensus 372 ~g~~I~ 377 (405) T PRK04928 372 PGTQIG 377 (405) T ss_pred CCCEEC T ss_conf 999998 No 10 >cd02540 GT2_GlmU_N_bac N-terminal domain of bacterial GlmU. The N-terminal domain of N-Acetylglucosamine-1-phosphate uridyltransferase (GlmU). GlmU is an essential bacterial enzyme with both an acetyltransferase and an uridyltransferase activity which have been mapped to the C-terminal and N-terminal domains, respectively. This family represents the N-terminal uridyltransferase. GlmU performs the last two steps in the synthesis of UDP-N-acetylglucosamine (UDP-GlcNAc), which is an essential precursor in both the peptidoglycan and the lipopolysaccharide metabolic pathways in Gram-positive and Gram-negative bacteria, respectively. Probab=100.00 E-value=0 Score=378.24 Aligned_cols=226 Identities=40% Similarity=0.679 Sum_probs=208.9 Q ss_pred EEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHHHH Q ss_conf 99945887723278887441258848579999999997799779999579889999973026972899985998861446 Q gi|254780942|r 7 AIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTAHA 86 (442) Q Consensus 7 AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa~A 86 (442) |||||||+||||+|.+||||+||+|||||+|+++++.++|+++++++++|+.+++++++.. .++.|+.|++++||||| T Consensus 1 AvIlAaG~GtRl~~~~PKpllpi~~kpii~~ii~~l~~~gi~~i~iv~~~~~e~i~~~~~~--~~i~~v~Q~~~lGta~A 78 (229) T cd02540 1 AVILAAGKGTRMKSDLPKVLHPLAGKPMLEHVLDAARALGPDRIVVVVGHGAEQVKKALAN--PNVEFVLQEEQLGTGHA 78 (229) T ss_pred CEEECCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHCC--CCEEEEEECCCCCCHHH T ss_conf 9894687875799897841518998799999999999769975996357689999987543--87269980688973789 Q ss_pred HHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCCCCCCCCC Q ss_conf 64100001234460589851541115332200000013-586100100101221574404202-3331001001466754 Q gi|254780942|r 87 VLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIREENDATDE 164 (442) Q Consensus 87 i~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~Ek~d~~~~ 164 (442) +++++++++..+++|+|++||+||++..++++|++.|. .+++++++++++++|++||++..+ ++++.+++|||++.+. T Consensus 79 v~~a~~~i~~~~~~~lVl~gD~pli~~~~l~~l~~~~~~~~~~~ti~~~~~~~p~~YG~v~~d~~g~v~~ivEkk~~~~~ 158 (229) T cd02540 79 VKQALPALKDFEGDVLVLYGDVPLITPETLQRLLEAHREAGADVTVLTAELEDPTGYGRIIRDGNGKVLRIVEEKDATEE 158 (229) T ss_pred HHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCCEEEEEECCCCCCC T ss_conf 99998744588872999958964536788999999999659966999986048767867998899978999988787654 Q ss_pred CCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEC-HHHEECCCCCCCC Q ss_conf 210003556001112501467775328887400111211489999730771799917-2143011121000 Q gi|254780942|r 165 ERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVK-EQEVCGCNNRYEL 234 (442) Q Consensus 165 ~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~-~~~~~gv~~~~~L 234 (442) ++.++++|+|+|+|+++.|+++++++++++.+||||+||+++.|+++|.++.++... .|+++|+|++.|| T Consensus 159 ~~~~~~~n~GiYif~~~~l~~~l~~l~~~~~~gE~~ltD~i~~~i~~g~kv~~~~~~~~~~~~Gin~~~dl 229 (229) T cd02540 159 EKAIREVNAGIYAFDAEFLFEALPKLTNNNAQGEYYLTDIIALAVADGLKVAAVLADDEEEVLGVNDRVQL 229 (229) T ss_pred CCCCCEEEEEEEEECHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHCCCCEEEEEECCHHHEECCCCHHHC T ss_conf 34543222279998599999999847987668878899999999988991899992887661378897879 No 11 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=374.44 Aligned_cols=381 Identities=17% Similarity=0.178 Sum_probs=237.1 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC-C---E--- Q ss_conf 7408999458877232788---87441258848-5799999999977997799995798899999730269-7---2--- Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPT-L---S--- 71 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~-~---~--- 71 (442) ++++|||||||+||||||+ +||||+|++|| +||+++|+++.++|+++|+++++|+.+.+.+|+.... . . T Consensus 2 ~~v~avILaGG~GtRL~PLT~~rpKp~vPf~GrYRlIDf~Lsn~~nsgI~~V~v~tqy~~~sl~~Hl~~~w~~~~~~~g~ 81 (429) T PRK02862 2 KRVLAIILGGGAGTRLYPLTKLRAKPAVPLAGKYRLIDIPISNCINSGINKIYVLTQFNSASLNRHISRTYNFSGFSQGF 81 (429) T ss_pred CCEEEEEECCCCCCCCCHHHCCCCHHHEEECCCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCCE T ss_conf 84399996688886042544588677325888242089999989886998899994658899999862421576667860 Q ss_pred EE--EEECCC-----CCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEEC--CCCC Q ss_conf 89--998599-----886144664100001234-460589851541115332200000013-586100100101--2215 Q gi|254780942|r 72 VE--YYIQDC-----QQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNA--DNPK 140 (442) Q Consensus 72 i~--~v~q~~-----~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~--~dp~ 140 (442) +. ...|.. -+|||+|+++++.++... .+.|||++||+ +...++++++++|. +++++|+.+.++ ++|+ T Consensus 82 v~~l~~~~~~~~~~~~~Gtadai~~~~~~l~~~~~~~~lV~~GD~--l~~~D~~~~l~~H~~~~AdiTi~~~~v~~~~~~ 159 (429) T PRK02862 82 VEVLAAQQTPDNPSWFQGTADAVRQYLWLFQEWDVDEYLILSGDQ--LYRMDYSLFVQHHRETGADITLAVLPVDEKQAS 159 (429) T ss_pred EEECCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCE--EECCCHHHHHHHHHHCCCCEEEEEEECCHHHCC T ss_conf 686043551478765566489999999999737997399974887--782699999999997499827998755857655 Q ss_pred CCCEEEEC-CCCCEECCCCCCCC-----------------CCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 74404202-33310010014667-----------------5421000355600111250146777532888740011121 Q gi|254780942|r 141 GYGRLLIK-NNEIIAIREENDAT-----------------DEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLT 202 (442) Q Consensus 141 ~yGvV~~d-~~~v~~i~Ek~d~~-----------------~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~ylt 202 (442) +||++..| +++|++|.|||... +......++|+|+|+|+++.|.++++..... .+| -. T Consensus 160 ~fGvv~~D~~grV~~f~EKP~~~~~~~~~~~~~~~~~~~~~~~~~~~lasmGiYif~~~~L~~ll~~~~~~---~Df-g~ 235 (429) T PRK02862 160 GFGLMKTDDDGRITEFSEKPKGEELKAMAVDTSRLGLSPEEAKEKPYLASMGIYVFSRDVLFDLLNKNPEH---TDF-GK 235 (429) T ss_pred CCCEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHCCCC---CHH-HH T ss_conf 36289999999799999578875444211465223567322356764012259998599999999758654---223-88 Q ss_pred HHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCC Q ss_conf 14899997307717999172143011121000000122222112212223321021152799411233178410012232 Q gi|254780942|r 203 DIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFF 282 (442) Q Consensus 203 Dii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~I 282 (442) |+++.+++ +.++.+|..+++| ..++++.++.+++..+.... .|..-+.+++..|.......|++.+ T Consensus 236 diip~~~~-~~~v~~y~~~GYw-~dIgti~sy~~AnmdLl~~~------------~~~~~l~~~~~pI~Tk~~~~pP~~~ 301 (429) T PRK02862 236 EIIPEAAR-GYNLQAYLFDDYW-EDIGTIEAFYEANLALTQQP------------NPPFSFYDEKAPIYTRPRYLPPSKL 301 (429) T ss_pred HHHHHHHC-CCEEEEEEECCEE-EECCCHHHHHHHHHHHHCCC------------CCCCCCCCCCCCEEECCCCCCCEEE T ss_conf 89999847-7838999814726-87899899999989986488------------8642221689944626888898588 Q ss_pred C----CCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCC--CCCCCCCCCCCCCCCCE Q ss_conf 2----34420002111631343321211121002134432000002586443202310211--24785448745556639 Q gi|254780942|r 283 G----CGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKAT--IKEGSKINHLSYVGDSV 356 (442) Q Consensus 283 g----~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~--i~~g~~i~~~~~igd~~ 356 (442) . .++.|++||+| .++.+++|+||.+|.||++|.|+ +|+|+.+.......+..... -..++.|+..|.|.+|+ T Consensus 302 ~~~~V~~SlI~~GciI-~g~~V~nSVlg~~v~I~~ga~V~-nSIimg~d~~~~~~~~~~~~~~~~~~v~IG~~~~i~~aI 379 (429) T PRK02862 302 LDAQITESIISEGCIL-KNCSIHHSVLGVRSRIESDCVLE-DTLVMGADFYESSEEREALRKEGKIPVGIGEGTTIKRAI 379 (429) T ss_pred ECCEEECCEECCCEEE-ECCEEECCEECCCCEECCCCEEE-EEEEECCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCE T ss_conf 4678864597597698-38888767888985898998996-318987754443334322211479986799899997589 Q ss_pred ECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECC-EEECCCCEECCCCEE Q ss_conf 88998997980883445775456298788798869989378-398899899778357 Q gi|254780942|r 357 VGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAP-ITIGQGTYVASGSII 412 (442) Q Consensus 357 iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~g-v~ig~~~~i~ag~~v 412 (442) |++||.||+|+.+.| ...+++..+...+..+..| +.|++++.|..|++| T Consensus 380 idk~~~Ig~~~~I~~-------~~~~~~~~~~~~~~~i~~Givvv~k~~~ip~gt~i 429 (429) T PRK02862 380 IDKNARIGNNVTIVN-------KDNVEEADREDEGFYIRNGIVVVVKNATIPDGTVI 429 (429) T ss_pred ECCCCEECCCCEECC-------CCCCCCCCCCCCCEEEECCEEEECCCCCCCCCCCC T ss_conf 878989999998978-------98776442456988996998998999898999899 No 12 >PRK03282 consensus Probab=100.00 E-value=0 Score=373.77 Aligned_cols=338 Identities=18% Similarity=0.200 Sum_probs=229.1 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC----CCEEEE Q ss_conf 7408999458877232788---87441258848-579999999997799779999579889999973026----972899 Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPP----TLSVEY 74 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~----~~~i~~ 74 (442) .+++|||||||+||||||+ +||||+|++|+ |||+|+|++|.++|+++++++++|+.+.+.+|+... .....| T Consensus 5 ~~v~avILagG~GtRLrPLT~~rpKp~vP~~g~yrlIdf~Lsnl~~~Gi~~v~v~~~y~~~sl~~h~g~~w~~~~~~g~~ 84 (406) T PRK03282 5 PKVLGIVLAGGEGKRLMPLTADRAKPAVPFGGAYRLIDFVLSNLVNAGYLRICVLTQYKSHSLDRHISQTWRMSGLLGNY 84 (406) T ss_pred CCEEEEEECCCCCCCCCCCCCCCCHHHCEECCCCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCCCE T ss_conf 83799997899877233022798354006589664579999988876998899981777799999986213755656637 Q ss_pred EECCC---------CCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEEC--CCCCC Q ss_conf 98599---------886144664100001234-46058985154111533220000001-3586100100101--22157 Q gi|254780942|r 75 YIQDC---------QQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNA--DNPKG 141 (442) Q Consensus 75 v~q~~---------~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~--~dp~~ 141 (442) +.+.+ .+|||+|++++++++... .+.|+|++||+ +...++++++++| ++++++|+...++ ++|++ T Consensus 85 v~~~~~~~~~g~~~~~Gta~ai~~~~~~l~~~~~~~vvv~~gD~--v~~~D~~~~l~~H~~~~a~~Ti~~~~v~~~d~s~ 162 (406) T PRK03282 85 ITPVPAQQRLGPRWYTGSADAIYQSLNLIYDEDPDYVVVFGADH--VYRMDPEQMVQQHIESGAGVTVAGIRVPRSEATA 162 (406) T ss_pred EEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCE--EEECCHHHHHHHHHHCCCCEEEEEEECCHHHHHH T ss_conf 97565212367421457789999999998626998799978979--9807999999999976998599999827677623 Q ss_pred CCEEEEC--CCCCEECCCCCCCC---CCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEE Q ss_conf 4404202--33310010014667---542100035560011125014677753288874001112114899997307717 Q gi|254780942|r 142 YGRLLIK--NNEIIAIREENDAT---DEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIA 216 (442) Q Consensus 142 yGvV~~d--~~~v~~i~Ek~d~~---~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~ 216 (442) ||++..| ++++++|+|||... +..+.++++|+|+|+|+++.|++++............+..|+++.++++|. .. T Consensus 163 ~Gvv~~d~d~~~V~~f~EKP~~~~~~~~~~~~~l~n~GiYvf~~~~l~~~l~~~~~~~~~~~d~~~di~p~l~~~g~-~~ 241 (406) T PRK03282 163 FGVIDADDDSGRIREFLEKPADPPGLPDDPDETFASMGNYVFTTKALIDALRADAEDEDSDHDMGGDIIPRFVARGE-AA 241 (406) T ss_pred CCEEEECCCCCEEEEEEECCCCCCCCCCCCCCEECCCEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCC-EE T ss_conf 78188828999799999557876578898010020210699849999999986501777644226778999997098-27 Q ss_pred EEEEC-----------HHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCC Q ss_conf 99917-----------2143011121000000122222112212223321021152799411233178410012232234 Q gi|254780942|r 217 SIDVK-----------EQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCG 285 (442) Q Consensus 217 ~~~~~-----------~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~ 285 (442) .|.+. ..+|.+++++.++.+++..+.+... .......+..|.......|++.+..+ T Consensus 242 ~y~~~~~~v~g~~~~~~GYW~dIgt~~~y~~an~dll~~~~-------------~~~l~~~~~~i~t~~~~~~p~~~~~~ 308 (406) T PRK03282 242 VYDFSDNEVPGATDRDRGYWRDVGTLDAFYDAHMDLISVHP-------------VFNLYNKRWPIRTESENLPPAKFVNG 308 (406) T ss_pred EEEEECCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCC-------------CCCCCCCCCCCCCCCCCCCCEEEECC T ss_conf 99850353557544555179867989999998898627785-------------31345778864467767898088279 Q ss_pred CCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECC Q ss_conf 42000211163134332121112100213443200000258644320231021124785448745556639889989979 Q gi|254780942|r 286 VSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGA 365 (442) Q Consensus 286 ~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~ 365 (442) + .+.+|.|+++|.|.. +.+ .+|+|+++|+||.+++|+||+|+.++.|++.++|.+|+|++||.||+ T Consensus 309 ~------------~v~~S~i~~G~vI~g-~~V-~nSvIg~~v~Ig~ga~I~~SIIm~~~~Ig~~a~l~~~Iidk~v~Ig~ 374 (406) T PRK03282 309 G------------SAQESIVSAGSIISG-ASV-RNSVLSPNVVVDSGAIVEGSVLMPGVRIGRGAVVRHAILDKNVVVPP 374 (406) T ss_pred C------------EEECCEECCCCEEEC-CEE-ECCEECCCCEECCCCEEECCEEECCCEECCCCEEECCEECCCCEECC T ss_conf 6------------788778648859859-988-89997899899999899708890968999999997319899999998 Q ss_pred CEEEC Q ss_conf 80883 Q gi|254780942|r 366 GTITC 370 (442) Q Consensus 366 g~i~~ 370 (442) |+++. T Consensus 375 g~~Ig 379 (406) T PRK03282 375 GAMVG 379 (406) T ss_pred CCEEC T ss_conf 98999 No 13 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=361.53 Aligned_cols=345 Identities=19% Similarity=0.235 Sum_probs=232.5 Q ss_pred CCCCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC------- Q ss_conf 987408999458877232788---87441258848-5799999999977997799995798899999730269------- Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPT------- 69 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~------- 69 (442) |+++.+|||||||+||||+|+ +|||++||+|+ +||+|+|+++.++|+++|.++++|+.+.+.+|+...+ T Consensus 16 ~~~~~~aviLAGG~GtRL~pLT~~RpkpavPfgGryRlIDF~LSn~vnsGI~~V~V~t~y~~~SL~dHlg~GW~~~~~~~ 95 (431) T PRK00725 16 LPRKSVALILAGGRGSRLKELTDKRAKPAVYFGGKFRIIDFALSNCLNSGIRRIGVITQYQAHSLVRHIQRGWSFFREER 95 (431) T ss_pred CCCCEEEEEECCCCCCCCHHHHCCCCCCCCEECCCCEEEHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCCCCCCC T ss_conf 56767999963888775515542784003153782354023676688759998999927787999998753424444468 Q ss_pred -CEEEEE-------ECCCCCCHHHHHHHCCCCCCC-CCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEEC--C Q ss_conf -728999-------859988614466410000123-446058985154111533220000001-3586100100101--2 Q gi|254780942|r 70 -LSVEYY-------IQDCQQGTAHAVLTAQDAIKP-GYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNA--D 137 (442) Q Consensus 70 -~~i~~v-------~q~~~~GTa~Ai~~a~~~l~~-~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~--~ 137 (442) .-+.+. .+...+|||+|++++++++.. ..+.|+|++||+ +...+++++++.| ++++++|+...++ + T Consensus 96 g~~~~~lp~~~~~~~~~~~~Gtadav~~n~~~l~~~~~eyvvI~~gD~--I~~~D~~~~l~~H~~~gAdiTi~~~~v~~~ 173 (431) T PRK00725 96 NEFVDLLPAQQRVSEENWYRGTADAVYQNLDIIRRYKAEYVVILAGDH--IYKMDYSRMLIDHVEKGADCTVGCLEVPRE 173 (431) T ss_pred CCCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCE--EECCCHHHHHHHHHHCCCCEEEEEEECCHH T ss_conf 960797776112588765468589999999999756998799945988--981799999999987799879999984878 Q ss_pred CCCCCCEEEEC-CCCCEECCCCCCCC---CCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 21574404202-33310010014667---542100035560011125014677753288874001112114899997307 Q gi|254780942|r 138 NPKGYGRLLIK-NNEIIAIREENDAT---DEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGK 213 (442) Q Consensus 138 dp~~yGvV~~d-~~~v~~i~Ek~d~~---~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~ 213 (442) +|++||++..| +++|++|.|||.-+ +.+..++++|+|+|+|+++.|+++|.+....+.....+..|+++.+++++. T Consensus 174 ~~~~fGv~~~D~~grV~~f~EKP~~p~~~~g~~~~~l~smGiYvf~~~~L~~~L~~~~~~~~~~~Dfg~diip~ll~~~~ 253 (431) T PRK00725 174 EASAFGVMAVDENDKIIAFVEKPADPPAMPGDPDKSLASMGIYVFDADYLYELLERDAADPNSSHDFGKDIIPKIVEEGK 253 (431) T ss_pred HCCCCCEEEECCCCCEEEEEECCCCCCCCCCCCCCCEECCEEEEECHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCC T ss_conf 85547749999999899999337884346897553300015899859999999986521776421026778999997498 Q ss_pred CEEEEEEC----------HHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCC Q ss_conf 71799917----------21430111210000001222221122122233210211527994112331784100122322 Q gi|254780942|r 214 SIASIDVK----------EQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFG 283 (442) Q Consensus 214 ~i~~~~~~----------~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig 283 (442) +.+|.++ ..+|.+++++..+.+++..+.+.. |..-...++..|.......|++..- T Consensus 254 -~~ay~f~~~~~~~~~~~~GYw~dIgt~~sY~~AnmdLL~~~-------------~~~~lf~~~~pI~Tk~~~~pP~k~~ 319 (431) T PRK00725 254 -AYAHPFPLSCVRSDPEAEPYWRDVGTLDAYWQANLDLASVV-------------PELDMYDRNWPIRTYMESLPPAKFV 319 (431) T ss_pred -EEEEECCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCC-------------CHHHCCCCCCCCCCCCCCCCCCEEE T ss_conf -78986253334566566641788799999999987774578-------------4543258999633667768982786 Q ss_pred CCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEE Q ss_conf 34420002111631343321211121002134432000002586443202310211247854487455566398899899 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNI 363 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~i 363 (442) .+... . .+.+.+|.|+.+|.|.- +.+ ++|+|+.+|+||.+++|+||+||.++.|+..+.|.+|+|+++|.| T Consensus 320 ~~~~g-~------~g~v~nSli~~GcvI~G-~~V-enSVI~r~v~Ig~gA~V~nSIIm~~~~IG~ga~l~naIiDk~v~I 390 (431) T PRK00725 320 QDRSG-S------HGMTLNSLVSGGCIISG-SVV-VQSVLFTRVRVNSFSNIDSAVLLPEVWVGRSCRLRRCVIDRACVI 390 (431) T ss_pred CCCCC-C------CCCEECCEEECCEEEEC-CEE-EECCCCCCCEECCCCEEEEEEECCCCEECCCCEEEEEEECCCCEE T ss_conf 35676-6------54143679839989969-999-829825998999998996139889898899999971398999898 Q ss_pred CCCEEEC Q ss_conf 7980883 Q gi|254780942|r 364 GAGTITC 370 (442) Q Consensus 364 g~g~i~~ 370 (442) |+|+++. T Consensus 391 ~~G~~Ig 397 (431) T PRK00725 391 PEGMVIG 397 (431) T ss_pred CCCCEEC T ss_conf 9899989 No 14 >PRK01884 consensus Probab=100.00 E-value=0 Score=361.18 Aligned_cols=334 Identities=16% Similarity=0.194 Sum_probs=226.3 Q ss_pred CCCCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC-------- Q ss_conf 987408999458877232788---87441258848-579999999997799779999579889999973026-------- Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPP-------- 68 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~-------- 68 (442) |.++++|||||||+||||+|+ +|||++||+|+ |||+|+|+++.++|+++|.|+++|+.+.+.+|+... T Consensus 14 ~~~~~~avILAGG~GtRL~pLT~~RpkpavPfgGryRiIDF~LSNlvnsGI~~V~V~t~y~~~SL~dHl~~Gw~~~~~~~ 93 (435) T PRK01884 14 LVKDTLVLILAGGRGSRLHELTDKRAKPALYFGGNRRIIDFALSNCINSGLNRIGVVTQYAAHSLLRHLQTGWSFLPQER 93 (435) T ss_pred HHHHEEEEEECCCCCCCCHHHHCCCCHHCCCCCCCCEEEEHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCC T ss_conf 53500899965888761516553871511624888712101686788769988999817687999999861636555579 Q ss_pred CCEEEEEECC-------CCCCHHHHHHHCCCCCCCC--CCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEEC-- Q ss_conf 9728999859-------9886144664100001234--46058985154111533220000001-3586100100101-- Q gi|254780942|r 69 TLSVEYYIQD-------CQQGTAHAVLTAQDAIKPG--YDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNA-- 136 (442) Q Consensus 69 ~~~i~~v~q~-------~~~GTa~Ai~~a~~~l~~~--~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~-- 136 (442) +.-+.+..+. ..+|||+|++++++++... .+.++|++||+ +...++++++++| ++++++|+...++ T Consensus 94 g~~i~~lp~~~~~~~~~~~~Gtadai~~n~~~i~~~~~~~yVlVl~gD~--I~~~D~~~~l~~H~~~~AdiTva~~~v~~ 171 (435) T PRK01884 94 GEFIDMLPARQQIDDSTWYRGTADAVYQNMAIIKNHYRPKYILILAGDH--IYKQDYSQMLLDHVNSGAKCTVGCIEVPR 171 (435) T ss_pred CCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCE--EEECCHHHHHHHHHHCCCCEEEEEEECCH T ss_conf 9527867874225887510373999998899997246888799955998--98078999999999759985899998687 Q ss_pred CCCCCCCEEEEC-CCCCEECCCCCCCC---CCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 221574404202-33310010014667---54210003556001112501467775328887400111211489999730 Q gi|254780942|r 137 DNPKGYGRLLIK-NNEIIAIREENDAT---DEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDG 212 (442) Q Consensus 137 ~dp~~yGvV~~d-~~~v~~i~Ek~d~~---~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g 212 (442) ++|++||++..| +++++.|.|||.-. +.+..++++|+|+|+|+++.|.+.|++..........+..|+++.+++++ T Consensus 172 e~~~~~Gvv~~d~~~rV~~f~EKP~~~~~~~g~~~~~lismGiYvf~~~~L~e~L~~~~~~~~~~~Dfg~diip~~l~~g 251 (435) T PRK01884 172 EEAHEFGVMAVNENLKVKAFVEKPKDPPAMVGKPDSSLASMGIYVFDADYLYKMLEREVNTPQTSHDFGKDILPKALEEG 251 (435) T ss_pred HHHHHCCEEEECCCCCEEEEEECCCCCCCCCCCCCCCEECCEEEEECHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 88300774998999978899956888110379866422023259973999999999763177751257898999998519 Q ss_pred CCEEEEEECHH----------HEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCC Q ss_conf 77179991721----------43011121000000122222112212223321021152799411233178410012232 Q gi|254780942|r 213 KSIASIDVKEQ----------EVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFF 282 (442) Q Consensus 213 ~~i~~~~~~~~----------~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~I 282 (442) . +.+|.++.. +|.+++++.++.+++..+.+.... .....++..|.....-.+++.. T Consensus 252 ~-v~ay~f~~~~~G~~~~~~~Yw~dIgt~~sY~~AnmdLL~~~~~-------------~~lf~~~~pI~tk~~~~~P~~~ 317 (435) T PRK01884 252 V-LYAHPFSRSCMGRNTEGEIYWRDVGTLDSFWQSNIDLVSENPQ-------------LDIYDQSWPIRGNPVQAYPSKF 317 (435) T ss_pred C-EEEEECCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCH-------------HHCCCCCCCEECCCCCCCCCEE T ss_conf 3-8999736764454456772354589989999989998678844-------------4324778953467777899557 Q ss_pred ---------CCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCC Q ss_conf ---------23442000211163134332121112100213443200000258644320231021124785448745556 Q gi|254780942|r 283 ---------GCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVG 353 (442) Q Consensus 283 ---------g~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~ig 353 (442) -.++.|+++|+|..+ .+ ++|+|+++|+||.+++|+||+|+.+++|+..+.|. T Consensus 318 ~~~~~~~~~V~nSlIa~GcvI~~g------------------~V-~nSVL~r~v~I~~gA~V~nSIIm~~~~IG~ga~l~ 378 (435) T PRK01884 318 FYKKSNVHPVDNSLIAGGCVITDA------------------SI-SNSVLFDRIKVDEFSKIDHCVVLPQVKIGKNCVLK 378 (435) T ss_pred EECCCCCCEEECCEECCCEEEECC------------------EE-ECCEEECCCEECCCCEEEECEEECCCEECCCCEEE T ss_conf 625888655328998199799398------------------99-71982089699999999828880869899999996 Q ss_pred CCEECCCCEECCCEEE Q ss_conf 6398899899798088 Q gi|254780942|r 354 DSVVGKNVNIGAGTIT 369 (442) Q Consensus 354 d~~iG~~~~ig~g~i~ 369 (442) +|+|++||.|++|+++ T Consensus 379 ~~IiDk~v~I~~g~~I 394 (435) T PRK01884 379 RCIIDRHCVIPDGMQI 394 (435) T ss_pred EEEECCCCEECCCCEE T ss_conf 2698999999979999 No 15 >PRK03701 consensus Probab=100.00 E-value=0 Score=347.19 Aligned_cols=322 Identities=19% Similarity=0.238 Sum_probs=213.0 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC--------C Q ss_conf 7408999458877232788---87441258848-5799999999977997799995798899999730269--------7 Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPT--------L 70 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~--------~ 70 (442) .+.+|||||||+||||+|+ ||||++||+|+ |||+|+|+++.++|+++|.++++|+.+.+.+|+.... . T Consensus 18 ~~~~aviLAGG~gtRL~pLT~~RpkpaVPf~GrYRlIDF~LSn~vnsGI~~V~V~tq~~~~SL~dHlg~Gw~~~~~~~~~ 97 (431) T PRK03701 18 LKSVALILAGGRGTRLKDLTNKRAKPAVHFGGKFRIIDFALSNCLNSGIRRIGVITQYQSHTLVQHIQRGWSFFNEEMNE 97 (431) T ss_pred CCEEEEEECCCCCCCCCHHHCCCCHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCHHCCCCCC T ss_conf 12699994389876440554388442110078523479999999986999899992788899999986362100225785 Q ss_pred EEEEEE-------CCCCCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEEC--CCC Q ss_conf 289998-------599886144664100001234-46058985154111533220000001-3586100100101--221 Q gi|254780942|r 71 SVEYYI-------QDCQQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNA--DNP 139 (442) Q Consensus 71 ~i~~v~-------q~~~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~--~dp 139 (442) -+.+.. |...+|||+|+++.++++... .+.|+|++||+ +...++++++++| ++++++|+...++ ++| T Consensus 98 ~~~~lp~~~~~~~~~~~~Gtadai~~n~~~l~~~~~eyVvI~~gD~--I~~~D~~~~l~~H~~~gAdiTv~~~~v~~e~~ 175 (431) T PRK03701 98 FVDLLPAQQRMKGENWYRGTADAVTQNLDIIRRYKAEYVVILAGDH--IYKQDYSRMLIDHVEKGARCTVACMPVPIEEA 175 (431) T ss_pred EEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCE--EECCCHHHHHHHHHHCCCCCEEEEEECCHHHC T ss_conf 4997777410478864357489999999999746998599965988--87589999999999759982699987487883 Q ss_pred CCCCEEEEC-CCCCEECCCCCCCC---CCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCE Q ss_conf 574404202-33310010014667---54210003556001112501467775328887400111211489999730771 Q gi|254780942|r 140 KGYGRLLIK-NNEIIAIREENDAT---DEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSI 215 (442) Q Consensus 140 ~~yGvV~~d-~~~v~~i~Ek~d~~---~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i 215 (442) ++||++..| +++|++|.|||... +.++.++++|+|+|+|+++.|.+++++..........+..|+++.+.+.+. + T Consensus 176 ~~~GVl~~D~~grV~~f~EKP~~p~~~~~~~~~~~~smgiYVf~~~~L~~ll~~~~~~~~~~~Df~~diip~l~~~~~-v 254 (431) T PRK03701 176 SAFGVMAVDENDKIIEFVEKPANPPSMPGDPSKSLASMGIYVFDADYLYELLEEDDRDENSSHDFGKDIIPKITEAGL-A 254 (431) T ss_pred CCCCEEEECCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC-E T ss_conf 206859988999799999558886556898553321005899759999999863301566765437889999986096-8 Q ss_pred EEEEEC----------HHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCC-- Q ss_conf 799917----------21430111210000001222221122122233210211527994112331784100122322-- Q gi|254780942|r 216 ASIDVK----------EQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFG-- 283 (442) Q Consensus 216 ~~~~~~----------~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig-- 283 (442) .+|.++ ..+|..+++..++.+++..+.+.. |..-...++..|.....-.|++..- T Consensus 255 ~~y~f~~~~v~~~~~~~gYw~dIgti~sY~~anmdlL~~~-------------~~~~l~~~~~pI~Tk~~~~pP~~y~~~ 321 (431) T PRK03701 255 YAHPFPLSCVQSDPDAEPYWRDVGTLEAYWKANLDLASVT-------------PELDMYDRNWPIRTHMESLPPAKFVQD 321 (431) T ss_pred EEEEECCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHCCC-------------CCCCCCCCCCCEECCCCCCCCCEEECC T ss_conf 9985044444467667772145799999999899873777-------------655445789930536888998168756 Q ss_pred --------CCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCC Q ss_conf --------34420002111631343321211121002134432000002586443202310211247 Q gi|254780942|r 284 --------CGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKE 342 (442) Q Consensus 284 --------~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~ 342 (442) .++.|+++|.|. ++.+++|+|+.+|.||+.|.|+ +|+|+++|.||.++.++|++|++ T Consensus 322 ~~g~~~~v~nSlia~GciI~-G~~V~nSVl~r~v~I~~gA~V~-nSIIm~~~~Ig~ga~l~~~IiDk 386 (431) T PRK03701 322 RSGSHGMTLNSLVSGGCIIS-GSVVVQSVLFPRVRVNSFCNID-SAVLLPEVWVGRSCRLRRCVIDR 386 (431) T ss_pred CCCCCCEEECCEEECCEEEE-CCEEECCCCCCCCEECCCCEEE-CCEECCCCEECCCCEEEEEEECC T ss_conf 67665416478985998993-8488809811888999998997-26985969899999998379899 No 16 >KOG1322 consensus Probab=100.00 E-value=0 Score=346.84 Aligned_cols=223 Identities=20% Similarity=0.286 Sum_probs=183.8 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHH-----HHHCCCCCEEEE Q ss_conf 7408999458877232788---87441258848579999999997799779999579889999-----973026972899 Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEIT-----RINFPPTLSVEY 74 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~-----~~~~~~~~~i~~ 74 (442) +.+.|+||.||.||||||+ +||||+|++++|||+|++++|.++|+++|++.++|.++++. .|....+.++.+ T Consensus 8 ~~vkaiILvGG~GTRLrPLT~t~pKPlVpfgn~pmI~hqieal~nsGi~~I~la~~y~s~sl~~~~~k~y~~~lgVei~~ 87 (371) T KOG1322 8 QSVKAIILVGGYGTRLRPLTLTRPKPLVPFGNKPMILHQIEALINSGITKIVLATQYNSESLNRHLSKAYGKELGVEILA 87 (371) T ss_pred CCEEEEEEECCCCCEEECEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 35158999547774460001368886654276305277899987579747999971382899999999761245559999 Q ss_pred EECCCCCCHHHHHHHCCCCCCCCCC-EEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEEC--CC Q ss_conf 9859988614466410000123446-058985154111533220000001-3586100100101221574404202--33 Q gi|254780942|r 75 YIQDCQQGTAHAVLTAQDAIKPGYD-DVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIK--NN 150 (442) Q Consensus 75 v~q~~~~GTa~Ai~~a~~~l~~~~~-~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d--~~ 150 (442) ..|.+|+||+|.+..++++|-...+ +|+|||||+ +..+++++|+++| +++++.+++++++++|+.||++..| .+ T Consensus 88 s~eteplgtaGpl~laR~~L~~~~~~~ffVLnsDv--i~~~p~~~~vqfH~~~gae~TI~~t~vdepSkyGvv~~d~~~g 165 (371) T KOG1322 88 STETEPLGTAGPLALARDFLWVFEDAPFFVLNSDV--ICRMPYKEMVQFHRAHGAEITIVVTKVDEPSKYGVVVIDEDTG 165 (371) T ss_pred EECCCCCCCCCHHHHHHHHHHHCCCCCEEEECCCE--EECCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEEECCCC T ss_conf 82068876546699999886403798679955772--6437799999999855996589987446853263699935888 Q ss_pred CCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCC Q ss_conf 31001001466754210003556001112501467775328887400111211489999730771799917214301112 Q gi|254780942|r 151 EIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNN 230 (442) Q Consensus 151 ~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~ 230 (442) +|.+|+|||. +..++-+|+|+|+|+++.|.+.+ +.+..+.+| ++|.+.++ +.+.++..++.| +++.+ T Consensus 166 rV~~F~EKPk----d~vsnkinaGiYi~~~~vL~ri~--~~ptSiekE-----ifP~~a~~-~~l~a~~l~gfW-mDIGq 232 (371) T KOG1322 166 RVIRFVEKPK----DLVSNKINAGIYILNPEVLDRIL--LRPTSIEKE-----IFPAMAEE-HQLYAFDLPGFW-MDIGQ 232 (371) T ss_pred CEEEEHHCCH----HHHHCCCCCEEEEECHHHHHHHH--HCCCCHHHH-----HHHHHHHC-CCEEEEECCCHH-HHCCC T ss_conf 2558021756----56533124158997799986765--066630111-----11556411-725899627613-20688 Q ss_pred CCCCCCCCCC Q ss_conf 1000000122 Q gi|254780942|r 231 RYELSLIENI 240 (442) Q Consensus 231 ~~~L~~~~~~ 240 (442) +.+..+.-.+ T Consensus 233 pkdf~~g~~~ 242 (371) T KOG1322 233 PKDFLTGFSF 242 (371) T ss_pred HHHHHHHHHH T ss_conf 8999888988 No 17 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=322.44 Aligned_cols=321 Identities=20% Similarity=0.209 Sum_probs=229.3 Q ss_pred CCCCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCC------- Q ss_conf 987408999458877232788---87441258848-5799999999977997799995798899999730269------- Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPT------- 69 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~------- 69 (442) |.++++|+|||||+|+||.|+ |+||.+|++|| .||+++|+++.++|+.+|.|+++|+...+.+|+.... T Consensus 2 ~~~~~laiILaGg~G~rL~~LT~~RakpAVpFgGkYRiIDF~LSN~vNSGi~~I~VltQy~~~SL~~Hi~~G~~w~l~~~ 81 (393) T COG0448 2 MKKNVLAIILAGGRGSRLSPLTKDRAKPAVPFGGKYRIIDFALSNCVNSGIRRIGVLTQYKSHSLNDHIGRGWPWDLDRK 81 (393) T ss_pred CCCCEEEEEECCCCCCCCCHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCEEEEEECCCHHHHHHHHHCCCCCCCCCC T ss_conf 74466999975898775522454750344442764689867700001038876999943553679998628986566555 Q ss_pred -CEEEEEE-C---C---CCCCHHHHHHHCCCCCCC-CCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECC-- Q ss_conf -7289998-5---9---988614466410000123-446058985154111533220000001-35861001001012-- Q gi|254780942|r 70 -LSVEYYI-Q---D---CQQGTAHAVLTAQDAIKP-GYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNAD-- 137 (442) Q Consensus 70 -~~i~~v~-q---~---~~~GTa~Ai~~a~~~l~~-~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~-- 137 (442) -.+.... | . .-.|||+|+++.+..+.. ..+.+||++||+ +...+++++++.| ++++++|+...+++ T Consensus 82 ~~~v~ilp~~~~~~~~~wy~Gtadai~Qnl~~i~~~~~eyvlIlsgDh--IYkmDy~~ml~~H~~~gadiTv~~~~Vp~~ 159 (393) T COG0448 82 NGGVFILPAQQREGGERWYEGTADAIYQNLLIIRRSDPEYVLILSGDH--IYKMDYSDMLDFHIESGADVTVAVKEVPRE 159 (393) T ss_pred CCCEEEECCHHCCCCCCCEECCHHHHHHHHHHHHHCCCCEEEEECCCE--EEECCHHHHHHHHHHCCCCEEEEEEECCHH T ss_conf 584799571112578731204479999768999743988899952877--894489999999998199879999987967 Q ss_pred CCCCCCEEEEC-CCCCEECCCCCCCCCCCCCCC-CCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCE Q ss_conf 21574404202-333100100146675421000-3556001112501467775328887400111211489999730771 Q gi|254780942|r 138 NPKGYGRLLIK-NNEIIAIREENDATDEERKIH-YCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSI 215 (442) Q Consensus 138 dp~~yGvV~~d-~~~v~~i~Ek~d~~~~~~~~~-lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i 215 (442) ++++||++..| +++++.|.|||.. .+.++ ++++|+|+|+++.|+++|.+...++.+..-+..|+++.++++|. + T Consensus 160 eas~fGim~~D~~~~i~~F~eKp~~---~~~~~~laSMgiYIf~~~~L~~~L~~~~~~~~~~~DfgkdiIp~~~~~~~-v 235 (393) T COG0448 160 EASRFGVMNVDENGRIIEFVEKPAD---GPPSNSLASMGIYIFNTDLLKELLEEDAKDPNSSHDFGKDIIPKLLERGK-V 235 (393) T ss_pred HHHHCCCEEECCCCCEEEEEECCCC---CCCCCCEEEEEEEEECHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCC-E T ss_conf 6002481588799978873526676---88665313556699829999999998741667733106778898875087-7 Q ss_pred EEEEECHHHEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCC-----CCE Q ss_conf 799917214301112100000012222211221222332102115279941123317841001223223442-----000 Q gi|254780942|r 216 ASIDVKEQEVCGCNNRYELSLIENIWQSRYRRQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVS-----IEN 290 (442) Q Consensus 216 ~~~~~~~~~~~gv~~~~~L~~~~~~~~~~~~~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~-----Ig~ 290 (442) .+|.++++| ..+++-..+.+++..+.+ ..|+....++.+.|.......|++.+..+.. |++ T Consensus 236 ~AY~f~gYw-~dVgTi~syy~aNmdLl~-------------~~~~~~lyd~~w~IyT~~~~~pPak~~~~s~v~nSLv~~ 301 (393) T COG0448 236 YAYEFSGYW-RDVGTIDSYYEANMDLLS-------------PQPELNLYDRNWPIYTKNKNLPPAKFVNDSEVSNSLVAG 301 (393) T ss_pred EEEECCCHH-HHCCCHHHHHHHHHHHCC-------------CCCCCCCCCCCCCEEECCCCCCCCEEECCCEEEEEEEEC T ss_conf 787425666-526639999984387628-------------997553248998126137888981675585576226607 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCC Q ss_conf 211163134332121112100213443200000258644320231021124785 Q gi|254780942|r 291 YVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGS 344 (442) Q Consensus 291 ~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~ 344 (442) ||+|.. .+++|+++.++.|+.+|.| ++|+|+++|.||.+|.+++++|++++ T Consensus 302 GciI~G--~V~nSVL~~~v~I~~gs~i-~~svim~~~~IG~~~~l~~aIIDk~v 352 (393) T COG0448 302 GCIISG--TVENSVLFRGVRIGKGSVI-ENSVIMPDVEIGEGAVLRRAIIDKNV 352 (393) T ss_pred CEEEEE--EEEEEEEECCEEECCCCEE-EEEEEECCCEECCCCEEEEEEECCCC T ss_conf 718963--8986387567197789888-75387278688899889998967996 No 18 >COG1209 RfbA dTDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=319.52 Aligned_cols=228 Identities=25% Similarity=0.378 Sum_probs=200.9 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECC-CHHHHHHHHCC---CCCEEEEEEC Q ss_conf 08999458877232788---87441258848579999999997799779999579-88999997302---6972899985 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGY-GAEEITRINFP---PTLSVEYYIQ 77 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~-~~e~i~~~~~~---~~~~i~~v~q 77 (442) |.|||||||.||||||. .||+|+||++||||+|.|+.|..+|+++|.+++++ +..++++++.+ .+.+++|+.| T Consensus 1 mKgiILAgG~GTRL~PlT~~~~KqLlpV~~KPmi~y~l~~L~~aGI~dI~II~~~~~~~~~~~llGdgs~~gv~itY~~Q 80 (286) T COG1209 1 MKGVILAGGSGTRLRPLTRVVPKQLLPVYDKPMIYYPLETLMLAGIRDILIVVGPEDKPTFKELLGDGSDFGVDITYAVQ 80 (286) T ss_pred CCCEEECCCCCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHCCCCCCCCCEEEEEC T ss_conf 97279637675443323136775332116862158579999975985699996688505666333671003863699965 Q ss_pred CCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEEC-CCCCEEC Q ss_conf 9988614466410000123446058985154111533220000001-3586100100101221574404202-3331001 Q gi|254780942|r 78 DCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAI 155 (442) Q Consensus 78 ~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i 155 (442) ++|+|-|+|+..+++++.+ ++|+++-||+ ++.. .+.+++.++ ++..+++++..+|+||++||+++.| +++++++ T Consensus 81 ~~p~GlA~Av~~a~~fv~~--~~f~l~LGDN-i~~~-~l~~~~~~~~~~~~ga~i~~~~V~dP~rfGV~e~d~~~~v~~l 156 (286) T COG1209 81 PEPDGLAHAVLIAEDFVGD--DDFVLYLGDN-IFQD-GLSELLEHFAEEGSGATILLYEVDDPSRYGVVEFDEDGKVIGL 156 (286) T ss_pred CCCCCHHHHHHHHHHHCCC--CCEEEEECCC-EECC-CHHHHHHHHHCCCCCCEEEEEECCCCCCCEEEEECCCCCEEEE T ss_conf 8987689999988864589--8659992684-4105-7799999985248994899998489530448998689968776 Q ss_pred CCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCC Q ss_conf 00146675421000355600111250146777532888740011121148999973077179991721430111210000 Q gi|254780942|r 156 REENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELS 235 (442) Q Consensus 156 ~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~ 235 (442) +||| .+++|||+-+|+|.|+ +.++++++.++++ ++||+++||+++.++++|..+......+ ||.+++++.+|. T Consensus 157 ~EKP----~~P~SNlAvtGlY~~d-~~Vf~~~~~ikPS-~RGElEITd~i~~~i~~G~~~~~~~~~G-~WlDtGt~~sll 229 (286) T COG1209 157 EEKP----KEPKSNLAVTGLYFYD-PSVFEAIKQIKPS-ARGELEITDAIDLYIEKGYLVVAILIRG-WWLDTGTPESLL 229 (286) T ss_pred EECC----CCCCCCEEEEEEEEEC-HHHHHHHHCCCCC-CCCCEEEHHHHHHHHHCCCEEEEEECCC-EEEECCCHHHHH T ss_conf 7889----9998762688999708-5899998718988-7785675288999997696888987151-688638701289 Q ss_pred CCCCCCCC Q ss_conf 00122222 Q gi|254780942|r 236 LIENIWQS 243 (442) Q Consensus 236 ~~~~~~~~ 243 (442) +++.+... T Consensus 230 eA~~~i~~ 237 (286) T COG1209 230 EANNFVRT 237 (286) T ss_pred HHHHHHHH T ss_conf 99999988 No 19 >cd02541 UGPase_prokaryotic Prokaryotic UGPase catalyses the synthesis of UDP-glucose. Prokaryotic UDP-Glucose Pyrophosphorylase (UGPase) catalyzes a reversible production of UDP-Glucose and pyrophosphate (PPi) from glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids , glycoproteins , and proteoglycans. UGPase is found in both prokaryotes and eukaryotes, although prokaryotic and eukaryotic forms of UGPase catalyze the same reaction, they share low sequence similarity. Probab=100.00 E-value=0 Score=311.59 Aligned_cols=228 Identities=24% Similarity=0.296 Sum_probs=185.7 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC-------------- Q ss_conf 08999458877232788---8744125884857999999999779977999957988999997302-------------- Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFP-------------- 67 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~-------------- 67 (442) +.|||||||+||||||. +||||+||+|||||+|+|+.|.++|+++|+++++|++++|++|+.. T Consensus 1 ~KAvILAgG~GTRL~PlT~~~PKpLlpi~gkPii~~~i~~l~~~Gi~~iii~~~~~~~~I~~~f~~~~~~~~~l~~~~k~ 80 (267) T cd02541 1 RKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPVIQYIVEEAVAAGIEDIIIVTGRGKRAIEDHFDRSYELEETLEKKGKT 80 (267) T ss_pred CEEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHCCCHHHHHHHHHCCCH T ss_conf 96999898774667842469984364899998699999999986998899991246666244506658899998526856 Q ss_pred ----------CCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCC--CCCCCCCCCCCCCCCCCEEEE Q ss_conf ----------69728999859988614466410000123446058985154111533--220000001358610010010 Q gi|254780942|r 68 ----------PTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSH--TLKKAMDKIAQGYSIAVVGFN 135 (442) Q Consensus 68 ----------~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~--~l~~l~~~~~~~~~~~i~~~~ 135 (442) .+..+.|+.|++++|||+||+++++++.+ ++|+|++||+ +++.. .+.+|+++|.+....+++... T Consensus 81 ~~~~~~~~~~~~~~i~~~~e~~~lGtggai~~a~~~~~~--~~flv~~gD~-l~~~~~~~l~~li~~~~~~~~~~~~~~~ 157 (267) T cd02541 81 DLLEEVRIISDLANIHYVRQKEPLGLGHAVLCAKPFIGD--EPFAVLLGDD-LIDSKEPCLKQLIEAYEKTGASVIAVEE 157 (267) T ss_pred HHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHCCCCCCC--CCEEEEECCE-EEECCCHHHHHHHHHHHHHCCEEEEEEE T ss_conf 555554224789559999778988767898721010499--8679996898-8853750668999999730872689999 Q ss_pred C--CCCCCCCEEEECCC-----CCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 1--22157440420233-----3100100146675421000355600111250146777532888740011121148999 Q gi|254780942|r 136 A--DNPKGYGRLLIKNN-----EIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKA 208 (442) Q Consensus 136 ~--~dp~~yGvV~~d~~-----~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l 208 (442) + +++++||++..++. +++.|+|||. ++++.++++++|+|+|+++ +++++++.+.+ .++|++++|+++.+ T Consensus 158 v~~~~~~~yGvv~~~~~~~~~~~i~~~~EKP~--~~~~~s~l~~~GiYi~~~~-i~~~i~~~~~~-~~~e~~l~d~i~~l 233 (267) T cd02541 158 VPPEDVSKYGIVKGEKIDGDVFKVKGLVEKPK--PEEAPSNLAIVGRYVLTPD-IFDILENTKPG-KGGEIQLTDAIAKL 233 (267) T ss_pred ECCCCCCCCCEEEECCCCCCEEEEEEEEECCC--CCCCCCCEEEEEEEEECHH-HHHHHHHCCCC-CCCCCCHHHHHHHH T ss_conf 68667873643896677870688887640688--7679853166344998899-98789728999-99833077999999 Q ss_pred HHHHCCEEEEEECHHHEECCCCCCCCCCCCCCC Q ss_conf 973077179991721430111210000001222 Q gi|254780942|r 209 RLDGKSIASIDVKEQEVCGCNNRYELSLIENIW 241 (442) Q Consensus 209 ~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~~~~ 241 (442) ++++ ++.++..++. |.++++|.++.+++..+ T Consensus 234 i~~~-~v~~~~~~g~-W~DiGtp~~y~~A~~~~ 264 (267) T cd02541 234 LEEE-PVYAYVFEGK-RYDCGNKLGYLKATVEF 264 (267) T ss_pred HHCC-CEEEEEECCE-EEECCCHHHHHHHHHHH T ss_conf 8579-9699985988-89579989999999998 No 20 >cd02538 G1P_TT_short G1P_TT_short is the short form of glucose-1-phosphate thymidylyltransferase. This family is the short form of glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The homotetrameric, feedback inhibited short form is found in numerous bacterial species that produce dTDP-L-rhamnose. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. Probab=100.00 E-value=0 Score=311.86 Aligned_cols=229 Identities=23% Similarity=0.313 Sum_probs=190.9 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCC-HHHHHHHHCC---CCCEEEEEEC Q ss_conf 08999458877232788---874412588485799999999977997799995798-8999997302---6972899985 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYG-AEEITRINFP---PTLSVEYYIQ 77 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~-~e~i~~~~~~---~~~~i~~v~q 77 (442) |.|||||||+||||||. +||||+||+|+|||+|+|+.|.++|+++|+++++++ .+++++++.. .+.+++|+.| T Consensus 1 MkavILAgG~GtRL~PlT~~~PKpLlpi~~kPli~~~l~~l~~~Gi~~i~iv~~~~~~~~~~~~~~~~~~~~~~i~y~~e 80 (240) T cd02538 1 MKGIILAGGSGTRLYPLTKVVSKQLLPVYDKPMIYYPLSTLMLAGIREILIISTPEDLPLFKELLGDGSDLGIRITYAVQ 80 (240) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCEECCEEEHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHHHCCCCCCEEEEEEEE T ss_conf 90999888786678732379874327799997189999999985996231002342699999997447637527999998 Q ss_pred CCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEECCCCCCCCEEEEC-CCCCEEC Q ss_conf 99886144664100001234460589851541115332200000013-586100100101221574404202-3331001 Q gi|254780942|r 78 DCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNADNPKGYGRLLIK-NNEIIAI 155 (442) Q Consensus 78 ~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i 155 (442) ++++|||+|++.+++++.+ ++|+|++||. ++++.++.+++++|. ++.+.++++.++++|++||+++.| ++++++| T Consensus 81 ~~~~Gt~~al~~a~~~~~~--~~~~v~~gD~-i~~~~~~~~~l~~~~~~~~~~ti~~~~v~~~~~yGvv~~d~~~~v~~~ 157 (240) T cd02538 81 PKPGGLAQAFIIGEEFIGD--DPVCLILGDN-IFYGQGLSPILQRAAAQKEGATVFGYEVNDPERYGVVEFDENGRVLSI 157 (240) T ss_pred ECCCCCCHHHHHHHHHCCC--CCEEEECCCC-EECCCCHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEE T ss_conf 6268631378888876289--8734443874-010656899999999666884787777236113672311478857898 Q ss_pred CCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCC Q ss_conf 00146675421000355600111250146777532888740011121148999973077179991721430111210000 Q gi|254780942|r 156 REENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELS 235 (442) Q Consensus 156 ~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~ 235 (442) +|||. ++.++++++|+|+|+++ ++++++.+.+. .++|+++||+++.++++|.....+.....+|.++++|.+|. T Consensus 158 ~EKP~----~~~s~~~~~GiYi~~~~-i~~~i~~~~~~-~~ge~~i~d~~~~~i~~g~~~~~~~~~~~~W~DiGtpe~~~ 231 (240) T cd02538 158 EEKPK----KPKSNYAVTGLYFYDND-VFEIAKQLKPS-ARGELEITDVNNEYLEKGKLSVELLGRGFAWLDTGTHESLL 231 (240) T ss_pred EECCC----CCCCCEEEEEEEEECHH-HHHHHHHCCCC-CCCEEEHHHHHHHHHHCCCCEEEEECCCCEEEECCCHHHHH T ss_conf 88999----99988799999998989-99999858999-99907588999999985997399968998089799999999 Q ss_pred CCCCCCC Q ss_conf 0012222 Q gi|254780942|r 236 LIENIWQ 242 (442) Q Consensus 236 ~~~~~~~ 242 (442) +++.+.+ T Consensus 232 ~An~~i~ 238 (240) T cd02538 232 EASNFVQ 238 (240) T ss_pred HHHHHHH T ss_conf 9998896 No 21 >PRK13389 UTP--glucose-1-phosphate uridylyltransferase subunit GalU; Provisional Probab=100.00 E-value=0 Score=305.26 Aligned_cols=229 Identities=20% Similarity=0.273 Sum_probs=188.5 Q ss_pred CCCCC-EEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC--------- Q ss_conf 98740-8999458877232788---8744125884857999999999779977999957988999997302--------- Q gi|254780942|r 1 MKRKR-LAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFP--------- 67 (442) Q Consensus 1 m~~~~-~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~--------- 67 (442) |+++. -|||||||.||||+|. +||+||||.+||+|+|+++.+.++|+++|++|+++..+.|++|+.. T Consensus 4 m~kki~KaVI~AAG~GTRl~P~Tk~~PKemLPi~dkPii~~~vee~~~aGI~ei~iV~~~~K~~I~d~F~~~~ele~~L~ 83 (302) T PRK13389 4 INTKVKKAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQYVVNECIAAGITEIVLVTHSSKNSIENHFDTSFELEAMLE 83 (302) T ss_pred CCCCCCEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHCCCCHHHHHHHH T ss_conf 35775679998986753668650469812154898875999999999869978999927984135563477088888887 Q ss_pred ----------------CCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCC-------CCCCCCCCCCC Q ss_conf ----------------6972899985998861446641000012344605898515411153-------32200000013 Q gi|254780942|r 68 ----------------PTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSS-------HTLKKAMDKIA 124 (442) Q Consensus 68 ----------------~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~-------~~l~~l~~~~~ 124 (442) ...++.|+.|.+++|+||||++|++++.+ ++|+|+.||. +++. ..+.++++.+. T Consensus 84 ~~~k~~~l~~~~~i~~~~~~i~~v~Q~~~~GlGhAV~~A~~~v~d--epF~VllgD~-li~~~~~~l~~~~l~~li~~~~ 160 (302) T PRK13389 84 KRVKRQLLDEVQSICPPHVTIMQVRQGLAKGLGHAVLCAHPVVGD--EPVAVILPDV-ILDEYESDLSQDNLAEMIRRFD 160 (302) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCC--CCEEEEECCH-HHCCCCCCCCHHHHHHHHHHHH T ss_conf 632577777765303588359998678987689999988986299--6579993633-4135566520568999999986 Q ss_pred CCCCCCCEEEECCCCCCCCEEEECC--------CCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCC Q ss_conf 5861001001012215744042023--------33100100146675421000355600111250146777532888740 Q gi|254780942|r 125 QGYSIAVVGFNADNPKGYGRLLIKN--------NEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVS 196 (442) Q Consensus 125 ~~~~~~i~~~~~~dp~~yGvV~~d~--------~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~ 196 (442) +....++++.+++||++||++..++ .++..++|||+ +++..||++++|.|+|+ +.+|++|++++++ .+ T Consensus 161 ~~g~~~I~v~~Ved~s~YGVv~~~g~~~~~~~~~~I~~iVEKP~--~e~aPSnlAi~GrYif~-p~IF~~L~~~~p~-~~ 236 (302) T PRK13389 161 ETGHSQIMVEPVADVTAYGVVDCKGVELAPGESVPMVGVVEKPK--ADVAPSNLAIVGRYVLS-ADIWPLLAKTPPG-AG 236 (302) T ss_pred HCCCCEEEEEECCCCCCEEEEECCCCCCCCCCCCCEEEEEECCC--CCCCCCCEEEEEEEEEC-HHHHHHHHCCCCC-CC T ss_conf 14986899998677632447963566445676332245674378--87898775787678878-8999999607999-99 Q ss_pred CHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCC Q ss_conf 011121148999973077179991721430111210000001 Q gi|254780942|r 197 QEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIE 238 (442) Q Consensus 197 gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~ 238 (442) ||++|||++..+++++ .+.++.+++.| .+++++..|..+. T Consensus 237 GEiqLTDAi~~l~~~~-~v~~~~~~G~r-~D~G~~~gyl~A~ 276 (302) T PRK13389 237 DEIQLTDAIDMLIEKE-TVEAYHMKGKS-HDCGNKLGYMQAF 276 (302) T ss_pred CCEEHHHHHHHHHHHC-CEEEEEEEEEE-ECCCCCCHHHHHH T ss_conf 8084999999998429-76999965579-7876801379999 No 22 >PRK10122 UTP--glucose-1-phosphate uridylyltransferase subunit GalF; Provisional Probab=100.00 E-value=0 Score=304.66 Aligned_cols=227 Identities=19% Similarity=0.256 Sum_probs=183.1 Q ss_pred CCEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC------------- Q ss_conf 408999458877232788---8744125884857999999999779977999957988999997302------------- Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFP------------- 67 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~------------- 67 (442) +|-|||||||+||||+|. +||+||||.+||+|+|+++.+.++|+++|++|+++..+.+++|+.. T Consensus 3 ~MKaVI~aAG~GTRl~P~Tk~~PKemLPv~dkPii~~~vee~~~aGI~eiiiV~~~~k~~ie~~F~~~~~le~~l~~~~k 82 (297) T PRK10122 3 NLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVK 82 (297) T ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHCCCHHHHHHHHHHHHH T ss_conf 75499989867635685315789244628985369999999998799789999179711577746671888999876117 Q ss_pred ------------CCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCC-------CCCCCCCCCCCCCC Q ss_conf ------------69728999859988614466410000123446058985154111533-------22000000135861 Q gi|254780942|r 68 ------------PTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSH-------TLKKAMDKIAQGYS 128 (442) Q Consensus 68 ------------~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~-------~l~~l~~~~~~~~~ 128 (442) .+.++.|+.|++|+||||||++|++++.+ ++|+|+.||. +++.. .+.++++.+.+... T Consensus 83 ~~~l~~~~~~~~~gv~i~yv~Q~eplGlGhAV~~A~~~v~d--epF~VllgD~-l~~~~~~~~~~~~l~~~i~~~~~~~~ 159 (297) T PRK10122 83 RQLLAEVQSICPPGVTIMNVRQGQPLGLGHSILCARPAIGD--NPFVVVLPDV-VIDDASADPLRYNLAAMIARFNETGR 159 (297) T ss_pred HHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCCC--CEEEEECCCC-EECCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 99999887524489649998779998678999988873499--6079972661-42278641678889999999971499 Q ss_pred CCCEEEE-CCCCCCCCEEEECC--------CCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHH Q ss_conf 0010010-12215744042023--------33100100146675421000355600111250146777532888740011 Q gi|254780942|r 129 IAVVGFN-ADNPKGYGRLLIKN--------NEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEY 199 (442) Q Consensus 129 ~~i~~~~-~~dp~~yGvV~~d~--------~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~ 199 (442) .+++..+ .+||++||++..++ .++.+++|||+. |++..||++++|.|+|+ +.+|++|+++++. .+||+ T Consensus 160 ~~v~~~~v~~d~~~yGVv~~~~~~d~~g~v~~I~~ivEKP~~-P~~apSnlai~GrYi~~-p~IF~~L~~~~p~-~~GEi 236 (297) T PRK10122 160 SQVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQ-PQTLDSDLMAVGRYVLS-ADIWPELERTQPG-AWGRI 236 (297) T ss_pred CEEEEEEECCCCCCEEEEECCCCCCCCCCEEEEEECCCCCCC-CCCCCCCCEEEEEEEEC-HHHHHHHHHCCCC-CCCEE T ss_conf 599999957876515799556554667753335651467789-86677773377899728-7999999708999-99858 Q ss_pred HHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCC Q ss_conf 121148999973077179991721430111210000001 Q gi|254780942|r 200 YLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIE 238 (442) Q Consensus 200 yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~ 238 (442) +|||+++.+.+++ .+.++.+++.| .+++++..|..+. T Consensus 237 qLTDAi~~l~~~~-~v~~~~~~G~~-~D~G~~~g~l~A~ 273 (297) T PRK10122 237 QLTDAIAELAKKQ-SVDAMLMTGDS-YDCGKKMGYMQAF 273 (297) T ss_pred EHHHHHHHHHCCC-CEEEEEECCEE-ECCCCHHHHHHHH T ss_conf 8999999997019-87999955179-7579862699999 No 23 >cd06425 M1P_guanylylT_B_like_N N-terminal domain of the M1P-guanylyltransferase B-isoform like proteins. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain and a C-terminal Lefthanded-beta-Helix fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repression of GDP-mannose pyrophosphorylase in yeast leads to phenotypes, such as cell lysis, defective cell wall, and failure of polarized growth and cell separation. Probab=100.00 E-value=0 Score=305.67 Aligned_cols=222 Identities=19% Similarity=0.315 Sum_probs=184.9 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC----CCCEEEEEEC Q ss_conf 08999458877232788---8744125884857999999999779977999957988999997302----6972899985 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFP----PTLSVEYYIQ 77 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~----~~~~i~~v~q 77 (442) |.|||||||+||||+|. +||||+||+|+|+|+|+|+.|.++|+++|+++++|+++++.+++.. .+.++.|+.| T Consensus 1 MkavIlagG~GtRl~plT~~~PKpLlpv~gkpli~~~i~~l~~~gi~~iii~~~y~~~~i~~~~~~~~~~~~~~i~~~~e 80 (233) T cd06425 1 MKALILVGGYGTRLRPLTLTVPKPLVEFCNKPMIEHQIEALAKAGVKEIILAVNYRPEDMVPFLKEYEKKLGIKITFSIE 80 (233) T ss_pred CEEEEECCCCCCCCCCHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCC T ss_conf 96999889776578801329983010899986999999999985997321011235779999998777743960565225 Q ss_pred CCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEECCCCCCCCEEEECC--CCCEE Q ss_conf 99886144664100001234460589851541115332200000013-5861001001012215744042023--33100 Q gi|254780942|r 78 DCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNADNPKGYGRLLIKN--NEIIA 154 (442) Q Consensus 78 ~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~~dp~~yGvV~~d~--~~v~~ 154 (442) ++++||||||+.+++++....++|+|+|||+ +.+.++++++++|. ++.++++++.++++|++||++..++ +++.+ T Consensus 81 ~~~lGtagal~~a~~~i~~~~~~flVlngD~--~~~~~l~~~~~~h~~~~~~~ti~~~~~~~~~~yGv~~~d~~~g~v~~ 158 (233) T cd06425 81 TEPLGTAGPLALARDLLGDDDEPFFVLNSDV--ICDFPLAELLDFHKKHGAEGTILVTKVEDPSKYGVVVHDENTGRIER 158 (233) T ss_pred CCCCCCHHHHHHHHHHHCCCCCCEEEECCCH--HCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHCCEEEECCCCCCCCC T ss_conf 7666721699999998468998689977832--10203699999998558984999997678455680886687674222 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCC Q ss_conf 10014667542100035560011125014677753288874001112114899997307717999172143011121000 Q gi|254780942|r 155 IREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYEL 234 (442) Q Consensus 155 i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L 234 (442) |+|||. +..++++|+|+|+|+++ ++++++ ..+.+.| .|+++.+++++ ++.++..+++ |.++++|.++ T Consensus 159 f~EKP~----~~~~~~i~~Giyi~~~~-i~~~i~---~~~~~~~---~di~~~l~~~~-~v~~~~~~g~-W~DiGtp~~~ 225 (233) T cd06425 159 FVEKPK----VFVGNKINAGIYILNPS-VLDRIP---LRPTSIE---KEIFPKMASEG-QLYAYELPGF-WMDIGQPKDF 225 (233) T ss_pred EEECCC----CCCCCCEEEEEEEECHH-HHHHCC---CCCCCCH---HHHHHHHHHCC-CEEEEEECCE-EEECCCHHHH T ss_conf 037788----88898703689999899-996566---6888746---99999998479-9799985999-9949798999 Q ss_pred CCCCCCC Q ss_conf 0001222 Q gi|254780942|r 235 SLIENIW 241 (442) Q Consensus 235 ~~~~~~~ 241 (442) .++++++ T Consensus 226 l~A~~~~ 232 (233) T cd06425 226 LKGMSLY 232 (233) T ss_pred HHHHHHH T ss_conf 9999986 No 24 >cd04189 G1P_TT_long G1P_TT_long represents the long form of glucose-1-phosphate thymidylyltransferase. This family is the long form of Glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced.The long from enzymes also have a left-handed parallel helix domain at the c-terminus, whereas, th eshort form enzymes do not have this domain. The homotetrameric, feedback inhibited short form is found in Probab=100.00 E-value=0 Score=302.92 Aligned_cols=228 Identities=22% Similarity=0.347 Sum_probs=194.5 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC---CCCEEEEEECC Q ss_conf 08999458877232788---8744125884857999999999779977999957988999997302---69728999859 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFP---PTLSVEYYIQD 78 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~---~~~~i~~v~q~ 78 (442) |.|||||||+||||+|. +||||+||+|+|||+|+|+.|.++|+++|+++++|+++++++++.. .+.++.|+.|+ T Consensus 1 MkaiILAaG~GtRl~p~t~~~PK~Ll~i~gkpli~~~i~~l~~~gi~~Iiiv~~~~~~~i~~~~~~~~~~~~~i~~v~~~ 80 (236) T cd04189 1 MKGLILAGGKGTRLRPLTYTRPKQLIPVAGKPIIQYAIEDLREAGIEDIGIVVGPTGEEIKEALGDGSRFGVRITYILQE 80 (236) T ss_pred CEEEEECCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCCEEEEEEECCHHHCCCCCCCCCCCCCCEEEEEEC T ss_conf 90999888687688700079980203899860999999999986995899985414122012234666478754898621 Q ss_pred CCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEECCCCCEECCC Q ss_conf 988614466410000123446058985154111533220000001-3586100100101221574404202333100100 Q gi|254780942|r 79 CQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIRE 157 (442) Q Consensus 79 ~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~E 157 (442) +++|||+|++.+++++.+ ++|++++||+ ++ ..++.+++..| +.+.++++++.+.++|++||++..+++++.++.| T Consensus 81 ~~~Gt~~al~~a~~~l~~--~~~iv~~gD~-~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~g~~~~~~~~v~~i~e 156 (236) T cd04189 81 EPLGLAHAVLAARDFLGD--EPFVVYLGDN-LI-QEGISPLVRDFLEEDADASILLAEVEDPRRFGVAVVDDGRIVRLVE 156 (236) T ss_pred CCCCHHHHHHHHHHHCCC--CCEEEEECCC-EE-CCCHHHHHHHHHHCCCCCEEEEEEEECCCCCCEEEECCCEEEECCC T ss_conf 553245787776651489--9758972794-65-4689999999986278611478986246676289973623543132 Q ss_pred CCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCC Q ss_conf 14667542100035560011125014677753288874001112114899997307717999172143011121000000 Q gi|254780942|r 158 ENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLI 237 (442) Q Consensus 158 k~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~ 237 (442) ||.. +.++++++|+|+|+ +.++++++.+.+ +.++|++++|+++.++++|.++.++..++. |.++++|.+|.++ T Consensus 157 K~~~----~~~~~~~~Giy~f~-~~i~~~~~~~~~-~~~~e~~i~d~i~~li~~g~~v~~~~~~g~-W~Digtpedl~~A 229 (236) T cd04189 157 KPKE----PPSNLALVGVYAFT-PAIFDAISRLKP-SWRGELEITDAIQWLIDRGRRVGYSIVTGW-WKDTGTPEDLLEA 229 (236) T ss_pred CCCC----CCCCEEEEEEEEEC-HHHHHHHHHCCC-CCCCCEEHHHHHHHHHHCCCCEEEEEECCE-EEECCCHHHHHHH T ss_conf 5889----87675999999999-899999984799-878813389999999987998899971997-9979899999999 Q ss_pred CCCCCC Q ss_conf 122222 Q gi|254780942|r 238 ENIWQS 243 (442) Q Consensus 238 ~~~~~~ 243 (442) ++.+.+ T Consensus 230 ~~~l~~ 235 (236) T cd04189 230 NRLLLD 235 (236) T ss_pred HHHHHC T ss_conf 899871 No 25 >cd06428 M1P_guanylylT_A_like_N N-terminal domain of M1P_guanylyl_A_ like proteins are likely to be a isoform of GDP-mannose pyrophosphorylase. N-terminal domain of the M1P-guanylyltransferase A-isoform like proteins: The proteins of this family are likely to be a isoform of GDP-mannose pyrophosphorylase. Their sequences are highly conserved with mannose-1-phosphate guanyltransferase, but generally about 40-60 bases longer. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repre Probab=100.00 E-value=0 Score=296.67 Aligned_cols=225 Identities=20% Similarity=0.296 Sum_probs=183.0 Q ss_pred EEEECCC--CCCCCCC---CCCCCEEEECCEEHHHHHHHHHHH-CCCCEEEEEECCCHHHHHHHHCC----CCCEEEEEE Q ss_conf 9994588--7723278---887441258848579999999997-79977999957988999997302----697289998 Q gi|254780942|r 7 AIVLAAG--RGHRMKS---SSSKVLQKIAGKPMISHVMETIAA-AGIENVALVLGYGAEEITRINFP----PTLSVEYYI 76 (442) Q Consensus 7 AiILAaG--~GtRl~p---~~pKpLlpi~gkpli~~~i~~l~~-~g~~~ivvv~~~~~e~i~~~~~~----~~~~i~~v~ 76 (442) ||||||| +||||+| .+||||+||+|||||+|+++.|.+ .++++|+++++|+.+.+.+++.. .+.+++|+. T Consensus 1 AvILaGG~~~GTRL~plT~~~PKpllpv~gkPiI~~~i~~l~~~~gi~~i~i~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 80 (257) T cd06428 1 AVILVGGPQKGTRFRPLSLDVPKPLFPVAGKPMIHHHIEACAKVPDLKEVLLIGFYPESVFSDFISDAQQEFNVPIRYLQ 80 (257) T ss_pred CEEECCCCCCCCCCCHHHCCCCCCCCEECCEEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCEEEEE T ss_conf 98978899997316743379984425799995419999999974798489998057579999987423433696059986 Q ss_pred CCCCCCHHHHHHHCCCCCCC-CCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEEC--CCCCCCCEEEEC--CC Q ss_conf 59988614466410000123-44605898515411153322000000135-86100100101--221574404202--33 Q gi|254780942|r 77 QDCQQGTAHAVLTAQDAIKP-GYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNA--DNPKGYGRLLIK--NN 150 (442) Q Consensus 77 q~~~~GTa~Ai~~a~~~l~~-~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~--~dp~~yGvV~~d--~~ 150 (442) |++++||||||+++++.+.. ..++|+|+|||+ +...++.+|++.|++ ++++++++.++ ++|++||++..+ ++ T Consensus 81 e~~~lGtagal~~~~~~i~~~~~~~~lv~~gD~--l~~~dl~~l~~~h~~~~~~~ti~~~~v~~~~~s~fGvvv~~~~~~ 158 (257) T cd06428 81 EYKPLGTAGGLYHFRDQILAGNPSAFFVLNADV--CCDFPLQELLEFHKKHGASGTILGTEASREQASNYGCIVEDPSTG 158 (257) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCC--CCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEEEEECCCC T ss_conf 476667367899999997348997699988850--038789999999997799869999987821365402899978999 Q ss_pred CCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHH----------------HHHHHHHHHHHCC Q ss_conf 310010014667542100035560011125014677753288874001112----------------1148999973077 Q gi|254780942|r 151 EIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYL----------------TDIIEKARLDGKS 214 (442) Q Consensus 151 ~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yl----------------tDii~~l~~~g~~ 214 (442) +|++|+|||. ++.++++|+|+|+|+ +.++++++++.... ++|+++ +|+++.++++| + T Consensus 159 ~i~~f~EKP~----~~~s~lin~G~Yvf~-~~if~~i~~~~~~~-~~e~~~~d~~~~~~~~~~~~~e~dvlp~l~~~g-~ 231 (257) T cd06428 159 EVLHYVEKPE----TFVSDLINCGVYLFS-PEIFDTIKKAFQSR-QQEAQLGDDNNREGRAEVIRLEQDVLTPLAGSG-K 231 (257) T ss_pred CEEEEEECCC----CCCCCEEEEEEEEEC-HHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCEECHHHHHHHHHCC-C T ss_conf 5999998899----988880899999998-89999997518454-674101343333466644400032679998579-8 Q ss_pred EEEEEECHHHEECCCCCCCCCCCCCCC Q ss_conf 179991721430111210000001222 Q gi|254780942|r 215 IASIDVKEQEVCGCNNRYELSLIENIW 241 (442) Q Consensus 215 i~~~~~~~~~~~gv~~~~~L~~~~~~~ 241 (442) +.+|..+++| .++++|.++..++++| T Consensus 232 l~~y~~~g~W-~DiGTp~~~~~AN~~~ 257 (257) T cd06428 232 LYVYKTDDFW-SQIKTAGSAIYANRLY 257 (257) T ss_pred EEEEECCCEE-ECCCCHHHHHHHHCCC T ss_conf 8999389838-7598989999987439 No 26 >cd03353 LbH_GlmU_C N-acetyl-glucosamine-1-phosphate uridyltransferase (GlmU), C-terminal left-handed beta-helix (LbH) acetyltransferase domain: GlmU is also known as UDP-N-acetylglucosamine pyrophosphorylase. It is a bifunctional bacterial enzyme that catalyzes two consecutive steps in the formation of UDP-N-acetylglucosamine (UDP-GlcNAc), an important precursor in bacterial cell wall formation. The two enzymatic activities, uridyltransferase and acetyltransferase, are carried out by two independent domains. The C-terminal LbH domain possesses the acetyltransferase activity. It catalyzes the CoA-dependent acetylation of GlcN-1-phosphate to GlcNAc-1-phosphate. The LbH domain contains 10 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X. The acetyltransferase active site is located at the interface between two subunits of the active LbH trimer. Probab=100.00 E-value=0 Score=297.12 Aligned_cols=176 Identities=51% Similarity=0.821 Sum_probs=162.9 Q ss_pred CCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCC-----------------CCCCCCCCCCC Q ss_conf 3210211527994112331784100122322344200021116313433212-----------------11121002134 Q gi|254780942|r 253 GVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVH-----------------IGKKTIIGPFA 315 (442) Q Consensus 253 gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~-----------------Ig~~~~Ig~~~ 315 (442) |+++++|+++|+++++.|+++++|+|+++|+++++||++|.|++++++.++. +++++.||+|+ T Consensus 1 gv~~i~p~s~~I~~~v~IG~~v~I~p~~~I~~~v~IG~~~~Ig~~~~I~~~~IG~~~~I~~~~vi~~~~ig~~~~IG~~~ 80 (193) T cd03353 1 GVTLIDPETTYIDGDVEIGVDVVIDPGVILEGKTVIGEDCVIGPNCVIKDSTIGDGVVIKASSVIEGAVIGNGATVGPFA 80 (193) T ss_pred CCEEECCCEEEECCCCEECCCCEECCCEEEECCCEECCCCEECCCEEECCCEECCCCEECCCEECCCCEECCCCEECCCE T ss_conf 98795898069999829999999989829929859999989957345778688887400210011352517652666850 Q ss_pred CCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEE Q ss_conf 43200000258644320231021124785448745556639889989979808834457754562987887988699893 Q gi|254780942|r 316 RIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLI 395 (442) Q Consensus 316 ~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~ 395 (442) +||++++|+++++||++++++|+.|+++++++|++|+||++||++|.||++++++|++|..+|+++|||+|+||.++.+. T Consensus 81 ~i~~~~~I~~~~~iG~~v~i~~~~ig~~~~i~h~~~igd~~Ig~~v~Ig~~~i~~n~~g~~~~~t~IGd~v~IG~~~~i~ 160 (193) T cd03353 81 HLRPGTVLGEGVHIGNFVEIKKSTIGEGSKANHLSYLGDAEIGEGVNIGAGTITCNYDGVNKHRTVIGDNVFIGSNSQLV 160 (193) T ss_pred EECCCCEECCEEEECCCEEEECCCCCCCCEECCEEECCCCEECCCCEECCCEEEECCCCCCCCCCEECCCCEECCCCEEE T ss_conf 76587688751674253576444336884877244618856777888866758873434335623433561898898798 Q ss_pred CCEEECCCCEECCCCEECCCCCCCCEEEECCCE Q ss_conf 783988998997783576643698689813560 Q gi|254780942|r 396 APITIGQGTYVASGSIITQDTPENSLVFARSRQ 428 (442) Q Consensus 396 ~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~ 428 (442) ++++||++++|+|+|+|+|||||++||++|++| T Consensus 161 ~~v~IG~~a~Igags~V~kdVpp~~lvvg~~~~ 193 (193) T cd03353 161 APVTIGDGATIAAGSTITKDVPPGALAIARARQ 193 (193) T ss_pred CCEEECCCCEECCCCEECCCCCCCCEEEEECCC T ss_conf 983999998998698888423999789962489 No 27 >pfam00483 NTP_transferase Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars. Probab=100.00 E-value=0 Score=292.46 Aligned_cols=228 Identities=23% Similarity=0.243 Sum_probs=183.4 Q ss_pred EEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEEC-CCHHHHHHHHCC---CCCEEEEEECC Q ss_conf 999458877232788---87441258848-57999999999779977999957-988999997302---69728999859 Q gi|254780942|r 7 AIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLG-YGAEEITRINFP---PTLSVEYYIQD 78 (442) Q Consensus 7 AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~-~~~e~i~~~~~~---~~~~i~~v~q~ 78 (442) |||||||+||||||. +||||+|++|+ |||+|++++|.++|++++++|++ |+.+++++++.. .+.++.|+.|+ T Consensus 2 avIlAgG~GtRl~PlT~~~pKpllpv~~~~pli~~~l~~l~~~g~~~~~~v~~~~~~~~i~~~~~~~~~~~~~i~~~~e~ 81 (247) T pfam00483 2 AIILAGGSGTRLWPLTRTLAKPLVPVLDKYPMIQYTLSRLMNAGIREPIVICTQEHRFLVAEQLGDGSKFGLQVTYALQP 81 (247) T ss_pred EEEECCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHCCCCHHEEECCHHHHHHHHHHHHHCCCCCCEEEEEEEC T ss_conf 79988977777880106889885182899789999999998659952033133677888999986336677437774312 Q ss_pred CCCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCCCC-CCC--CCCEEEECCCCCCCCEEEEC-CCCCE Q ss_conf 9886144664100001234-4605898515411153322000000135-861--00100101221574404202-33310 Q gi|254780942|r 79 CQQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYS--IAVVGFNADNPKGYGRLLIK-NNEII 153 (442) Q Consensus 79 ~~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~--~~i~~~~~~dp~~yGvV~~d-~~~v~ 153 (442) +++|||+|++.+++++.+. .+.|+|++||+ +...++.+++++|.+ +++ +++...+.++|++||++..+ +++|. T Consensus 82 ~~~Gta~ai~~a~~~l~~~~~~~~lv~~~D~--~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~~yGvv~~d~~~~V~ 159 (247) T pfam00483 82 EPRGTAPAVALAADFLGDDDPELVLVLGGDH--IYRMDFEEAVQKARAKAADGTVTFGIVPVEDPTGYGVIEFDENGRVI 159 (247) T ss_pred CCCCCHHHHHHHHHHHCCCCCCEEEEECCCC--CCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEEECCCCCCC T ss_conf 7777056899999864347876279977853--34440999999999847998688855655576345148988866503 Q ss_pred ECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHH-HEECCCCCC Q ss_conf 0100146675421000355600111250146777532888740011121148999973077179991721-430111210 Q gi|254780942|r 154 AIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQ-EVCGCNNRY 232 (442) Q Consensus 154 ~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~-~~~gv~~~~ 232 (442) +|+|||+.. ..++++|+|+|+|+++.+ +++.+......++|++++|+++.++++|..+.++...+. +|.+++++. T Consensus 160 ~~~EKP~~~---~~~~~~~~G~Y~~~~~i~-~~~~~~~~~~~~~e~~~~d~i~~~i~~g~~~~~~~~~~~~~W~DiGt~e 235 (247) T pfam00483 160 RFVEKPDLP---KASNYASMGIYFFNSGVF-LFLAKYLKELARGEDEITDAIPAALEDGDLDLAFLFRGYAYWSDVGTWD 235 (247) T ss_pred CEEECCCCH---HHHHHEEEEEEEECHHHH-HHHHHHCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEECCCHH T ss_conf 046779962---443425611499889999-9999657777679944999999999879963999607988468698999 Q ss_pred CCCCCCCC Q ss_conf 00000122 Q gi|254780942|r 233 ELSLIENI 240 (442) Q Consensus 233 ~L~~~~~~ 240 (442) +|.+++.. T Consensus 236 ~~~~an~~ 243 (247) T pfam00483 236 SLWEANLK 243 (247) T ss_pred HHHHHHHH T ss_conf 99999999 No 28 >cd06915 NTP_transferase_WcbM_like WcbM_like is a subfamily of nucleotidyl transferases. WcbM protein of Burkholderia mallei is involved in the biosynthesis, export or translocation of capsule. It is a subfamily of nucleotidyl transferases that transfer nucleotides onto phosphosugars. Probab=100.00 E-value=0 Score=289.23 Aligned_cols=215 Identities=21% Similarity=0.303 Sum_probs=179.3 Q ss_pred EEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC---CCEEEEEECCCC Q ss_conf 999458877232788---87441258848579999999997799779999579889999973026---972899985998 Q gi|254780942|r 7 AIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPP---TLSVEYYIQDCQ 80 (442) Q Consensus 7 AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~---~~~i~~v~q~~~ 80 (442) |||||||+||||+|. +||||+||+|+|||+|+|+.|.++|+++|+++++|+.+++.+++... ..++.++.|+++ T Consensus 1 AvIlAaG~GtRl~p~T~~~PKpLl~i~gkpli~~~l~~l~~~gi~~iii~~~~~~~~i~~~~~~~~~~~~~i~~~~e~~~ 80 (223) T cd06915 1 AVILAGGLGTRLRSVVKDLPKPLAPVAGRPFLEYLLEYLARQGISRIVLSVGYLAEQIEEYFGDGYRGGIRIYYVIEPEP 80 (223) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCCCCCCCCEEEEEEEE T ss_conf 98947878867881107998311289999899999999997599778873010234334433102345776147886423 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCCCCCCEEEEC-CCCCEECCCC Q ss_conf 861446641000012344605898515411153322000000135-86100100101221574404202-3331001001 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADNPKGYGRLLIK-NNEIIAIREE 158 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~Ek 158 (442) +|||+|++.+++++.+ ++|+|++||+ +...++++++++|.+ +.++++++.++++|++||.+..+ ++++..+.|| T Consensus 81 lGt~gal~~a~~~i~~--~~flv~~gD~--~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~g~v~~d~~~~i~~~~eK 156 (223) T cd06915 81 LGTGGAIKNALPKLPE--DQFLVLNGDT--YFDVDLLALLAALRASGADATMALRRVPDASRYGNVTVDGDGRVIAFVEK 156 (223) T ss_pred CCHHHHHHHHHHHCCC--CCEEEECCCE--EECCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEECCCCCEEEEEEC T ss_conf 6468899888863699--9889982771--12677899999998579978999998998302785898797314234754 Q ss_pred CCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCC Q ss_conf 46675421000355600111250146777532888740011121148999973077179991721430111210000001 Q gi|254780942|r 159 NDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIE 238 (442) Q Consensus 159 ~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~ 238 (442) |. ...++++++|+|+|+.+ +++++++.. .+ ..+|+++.+++++ ++.++..++.| .++++|.+|.+|+ T Consensus 157 ~~----~~~~~~i~~Giyi~~~~-i~~~~~~~~---~~---~~~d~~~~li~~~-~v~~~~~~g~w-~DiGt~e~~~~A~ 223 (223) T cd06915 157 GP----GAAPGLINGGVYLLRKE-ILAEIPADA---FS---LEADVLPALVKRG-RLYGFEVDGYF-IDIGIPEDYARAQ 223 (223) T ss_pred CC----CCCCCEEEEEEEEECHH-HHHHHHCCC---CC---CHHHHHHHHHHCC-CEEEEEECCEE-EECCCHHHHHHHC T ss_conf 89----87668244638999899-997414259---97---2799999998349-97999979919-9798989999739 No 29 >cd02524 G1P_cytidylyltransferase G1P_cytidylyltransferase catalyzes the production of CDP-D-Glucose. Alpha-D-Glucose-1-phosphate Cytidylyltransferase catalyzes the production of CDP-D-Glucose from alpha-D-Glucose-1-phosphate and MgCTP as substrate. CDP-D-Glucose is the precursor for synthesizing four of the five naturally occurring 3,6-dideoxy sugars-abequose (3,6-dideoxy-D-Xylo-hexose), ascarylose (3,6-dideoxy-L-arabino-hexose), paratose (3,6-dideoxy-D-ribohexose), and tyvelose (3,6-dideoxy-D-arabino-hexose. Deoxysugars are ubiquitous in nature where they function in a variety of biological processes, including cell adhesion, immune response, determination of ABO blood groups, fertilization, antibiotic function, and microbial pathogenicity. Probab=100.00 E-value=0 Score=285.40 Aligned_cols=223 Identities=17% Similarity=0.289 Sum_probs=177.4 Q ss_pred EEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC--------------- Q ss_conf 999458877232788---87441258848579999999997799779999579889999973026--------------- Q gi|254780942|r 7 AIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPP--------------- 68 (442) Q Consensus 7 AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~--------------- 68 (442) .||||||+||||||. +||||+||+|||+|+|+++.|.++|+++++++++|+++++++++... T Consensus 1 VvILaGG~GtRL~pLT~~~PKpLvpi~gkPii~~ii~~l~~~gi~~iil~~gy~~~~i~~~f~~~~~~~~~~~~~~~~~~ 80 (253) T cd02524 1 VVILAGGLGTRLSEETELKPKPMVEIGGRPILWHIMKIYSHYGHNDFILCLGYKGHVIKEYFLNYFLHNSDVTIDLGTNR 80 (253) T ss_pred CEEECCCCCCCCCHHHCCCCCCCEEECCEEHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCCE T ss_conf 98934787554143127998003189999899999999998699879995322511379998777761585589933772 Q ss_pred ---------CCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC-CCCCEEEECCC Q ss_conf ---------97289998599886144664100001234460589851541115332200000013586-10010010122 Q gi|254780942|r 69 ---------TLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGY-SIAVVGFNADN 138 (442) Q Consensus 69 ---------~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~-~~~i~~~~~~d 138 (442) ..+|.++.+.++.|||+||+.+++++.+ .++|+|+|||+ +.+.++++|+++|++.. .+++. .+++ T Consensus 81 ~~~~~~~~~~~~i~~~~~~~~~~tgg~l~~~~~~l~~-~e~flv~nGD~--l~d~dl~~l~~~h~~~~~~~tl~--~v~~ 155 (253) T cd02524 81 IELHNSDIEDWKVTLVDTGLNTMTGGRLKRVRRYLGD-DETFMLTYGDG--VSDVNINALIEFHRSHGKLATVT--AVHP 155 (253) T ss_pred EEEECCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCC-CCEEEEEECCC--CCCCCHHHHHHHHHHCCCEEEEE--ECCC T ss_conf 7983366778605898536655033899999987489-97399991552--34368799999998669859999--6588 Q ss_pred CCCCCEEEEC-CCCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEE Q ss_conf 1574404202-333100100146675421000355600111250146777532888740011121148999973077179 Q gi|254780942|r 139 PKGYGRLLIK-NNEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIAS 217 (442) Q Consensus 139 p~~yGvV~~d-~~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~ 217 (442) |++||++..+ +++|.+|.|||... ..++|+|+|+|+++ ++++++ .+.... .+|+++.++++| ++.+ T Consensus 156 ~~~yG~v~~d~~~~v~~f~EKP~~~-----~~~in~GiYv~~~~-if~~i~---~~~~~~---e~d~~p~li~~g-~l~~ 222 (253) T cd02524 156 PGRFGELDLDDDGQVTSFTEKPQGD-----GGWINGGFFVLEPE-VFDYID---GDDTVF---EREPLERLAKDG-ELMA 222 (253) T ss_pred CCCCCEEEECCCCEEEEEEECCCCC-----CCCCEEEEEEECHH-HHHHCC---CCCCCH---HHHHHHHHHHCC-CEEE T ss_conf 8888679998998599987378887-----77421499998899-995466---788742---678999999669-9799 Q ss_pred EEECHHHEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9917214301112100000012222211221 Q gi|254780942|r 218 IDVKEQEVCGCNNRYELSLIENIWQSRYRRQ 248 (442) Q Consensus 218 ~~~~~~~~~gv~~~~~L~~~~~~~~~~~~~~ 248 (442) |..++. |.+++++.||.++++.|......| T Consensus 223 y~~~g~-W~DigT~~d~~~l~~~~~~~~~~~ 252 (253) T cd02524 223 YKHTGF-WQCMDTLRDKQTLEELWNSGKAPW 252 (253) T ss_pred EECCCE-EECCCCHHHHHHHHHHHHCCCCCC T ss_conf 966988-818999799999999997399999 No 30 >cd04181 NTP_transferase NTP_transferases catalyze the transfer of nucleotides onto phosphosugars. Nucleotidyltransferases transfer nucleotides onto phosphosugars. The enzyme family includes Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. The products are activated sugars that are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Probab=100.00 E-value=0 Score=280.91 Aligned_cols=206 Identities=28% Similarity=0.443 Sum_probs=175.5 Q ss_pred EEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC---CCCEEEEEECCCC Q ss_conf 999458877232788---8744125884857999999999779977999957988999997302---6972899985998 Q gi|254780942|r 7 AIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFP---PTLSVEYYIQDCQ 80 (442) Q Consensus 7 AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~---~~~~i~~v~q~~~ 80 (442) |||||||+||||+|. +||||+||+|+|||+|+++.|..+|+++++++++|+.+++.+++.. ...+++|+.|+++ T Consensus 1 AiIlaaG~GtRl~~~t~~~PK~Ll~v~gkplI~~~l~~l~~~g~~~iiiv~~~~~~~i~~~~~~~~~~~~~i~~~~e~~~ 80 (217) T cd04181 1 AVILAAGKGTRLRPLTDTRPKPLLPIAGKPILEYIIERLARAGIDEIILVVGYLGEQIEEYFGDGSKFGVNIEYVVQEEP 80 (217) T ss_pred CEEECCCCCCCCCHHHCCCCCCCCEECCCHHHHHHHHHHHHCCCCEEEEEEECCCCCHHEEEECCCCCCCEEEEEECCCC T ss_conf 98906888665782136998402389994499999999997499779998612331000011001257965999534887 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCCC Q ss_conf 8614466410000123446058985154111533220000001-3586100100101221574404202-3331001001 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIREE 158 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~Ek 158 (442) +|||+|++++++++.+ ++|+|++||+ +...++..+++.| +++.++++++.+.++|++||++..+ +++|..|.|| T Consensus 81 ~Gt~~al~~a~~~i~~--~~~lv~~~D~--i~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~yg~v~~d~~~~v~~i~EK 156 (217) T cd04181 81 LGTAGAVRNAEDFLGD--DDFLVVNGDV--LTDLDLSELLRFHREKGADATIAVKEVEDPSRYGVVELDDDGRVTRFVEK 156 (217) T ss_pred CCCHHHHHHHHCCCCC--CCEEEECCCC--CCCCCHHHHHHHHHHCCCCEEEEEEEECCCCCCEEEEECCCCCEEEEEEC T ss_conf 7634554321002699--9789972782--22508999999997579967999874036776308998885104368988 Q ss_pred CCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHE Q ss_conf 4667542100035560011125014677753288874001112114899997307717999172143 Q gi|254780942|r 159 NDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEV 225 (442) Q Consensus 159 ~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~ 225 (442) |+.. .++++++|+|+|+.+ +++++++.. .++|++++|+++.+++++ ++.++..++.|. T Consensus 157 p~~~----~~~~~~~G~y~~~~~-~f~~i~~~~---~~~~~~l~d~~~~l~~~~-kv~~~~~~g~W~ 214 (217) T cd04181 157 PTLP----ESNLANAGIYIFEPE-ILDYIPEIL---PRGEDELTDAIPLLIEEG-KVYGYPVDGYWL 214 (217) T ss_pred CCCC----CCCEEEEEEEEECHH-HHHHHHHCC---CCCEEEHHHHHHHHHHCC-CEEEEEECCEEE T ss_conf 9998----788489989999999-999765158---887511899999998579-969999799999 No 31 >KOG1460 consensus Probab=100.00 E-value=1.4e-45 Score=279.16 Aligned_cols=314 Identities=19% Similarity=0.242 Sum_probs=209.1 Q ss_pred CEEEEECCC--CCCCCCC---CCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCHHHHHHHHCC----CCCEEEE Q ss_conf 089994588--7723278---8874412588485799999999977-9977999957988999997302----6972899 Q gi|254780942|r 5 RLAIVLAAG--RGHRMKS---SSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGAEEITRINFP----PTLSVEY 74 (442) Q Consensus 5 ~~AiILAaG--~GtRl~p---~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~e~i~~~~~~----~~~~i~~ 74 (442) ..||||.|| +|||||| +.||||+||+|.|||.|.|+.+++. +..+|+++-=|..+.+..+... ....++| T Consensus 3 ~~AVIlVGGP~kGTRFRPLSf~vPKPLfpiaG~pmI~Hhi~ac~qi~~l~eI~LvGFy~e~~f~~fis~~~~e~~~pvrY 82 (407) T KOG1460 3 VKAVILVGGPQKGTRFRPLSFNVPKPLFPIAGVPMIHHHISACKQISGLAEILLVGFYEERVFTDFISAIQQEFKVPVRY 82 (407) T ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHEEEEECCCCHHHHHHHHHHHHHCCCCHHH T ss_conf 37999955888885103443589987543188514040689875165602336774045058999999998644652554 Q ss_pred EECCCCCCHHHHHHHCCCCC-CCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEEC--CCCCCCCEEEEC-- Q ss_conf 98599886144664100001-23446058985154111533220000001-3586100100101--221574404202-- Q gi|254780942|r 75 YIQDCQQGTAHAVLTAQDAI-KPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNA--DNPKGYGRLLIK-- 148 (442) Q Consensus 75 v~q~~~~GTa~Ai~~a~~~l-~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~--~dp~~yGvV~~d-- 148 (442) ..|+.++||||+++..++++ .+..+.++|+|+|+ -.+..|.+|++.| +.+...++++.++ ++.++||.++.| T Consensus 83 L~E~~plGtaGgLyhFrdqIl~g~ps~vFvlnaDV--CcsfPl~~ml~ahr~~g~~~tll~tkvs~e~asnfG~lV~dP~ 160 (407) T KOG1460 83 LREDNPLGTAGGLYHFRDQILAGSPSAVFVLNADV--CCSFPLQDMLEAHRRYGGIGTLLVTKVSREQASNFGCLVEDPS 160 (407) T ss_pred HCCCCCCCCCCCEEEHHHHHHCCCCCEEEEEECCE--ECCCCHHHHHHHHHHCCCCEEEEEEEECHHHHHCCCEEEECCC T ss_conf 34578877665322444577469986289970550--1587479999998633884599999755767510571540687 Q ss_pred CCCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHC----CC------C---CCCC-HHH---HHHHHHHHHHH Q ss_conf 333100100146675421000355600111250146777532----88------8---7400-111---21148999973 Q gi|254780942|r 149 NNEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQI----KK------N---KVSQ-EYY---LTDIIEKARLD 211 (442) Q Consensus 149 ~~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l----~~------~---~~~g-E~y---ltDii~~l~~~ 211 (442) .+++++.+|||. ...|+.||+|+|+|..+.+ +.+.+. +. + ...+ +-| -+|+++.+... T Consensus 161 t~evlHYveKPs----TfvSd~InCGvYlF~~eif-~~i~~v~~q~~~~~~~~~~~~~l~~g~~d~irLeqDvlspLag~ 235 (407) T KOG1460 161 TGEVLHYVEKPS----TFVSDIINCGVYLFTPEIF-NAIAEVYRQRQDLLEVEKDLPLLQPGPADFIRLEQDVLSPLAGS 235 (407) T ss_pred CCCEEEEECCCC----HHHHCCCCEEEEEECHHHH-HHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEECHHHHHHCCC T ss_conf 673578505862----3533350114899468997-88999999877655454216344877666078430010011277 Q ss_pred HCCEEEEEECHHHEECCCCCCCCCCCCCCCCCCCC---CCCCCCCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCC Q ss_conf 07717999172143011121000000122222112---212223321021152799411233178410012232234420 Q gi|254780942|r 212 GKSIASIDVKEQEVCGCNNRYELSLIENIWQSRYR---RQMMISGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSI 288 (442) Q Consensus 212 g~~i~~~~~~~~~~~gv~~~~~L~~~~~~~~~~~~---~~~l~~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~I 288 (442) + .+.+|..+++|- -+.++..-..+.+.+..+.. ...+-.+ +....+|-.++.|++++.++|.+.||+|+.| T Consensus 236 k-~lY~y~t~~fW~-QiKtagsal~as~lYLs~yk~t~p~~Lak~----pgt~a~IigdVyIhPsakvhptAkiGPNVSI 309 (407) T KOG1460 236 K-QLYAYETTDFWS-QIKTAGSALYASRLYLSQYKRTHPARLAKG----PGTQAEIIGDVYIHPSAKVHPTAKIGPNVSI 309 (407) T ss_pred C-CEEEEECCCHHH-HHCCCCCEEEHHHHHHHHHHHCCCHHHCCC----CCCCCEEEEEEEECCCCEECCCCCCCCCCEE T ss_conf 7-247870610888-734662133332668888763080420479----9998548742477675404776620897144 Q ss_pred CEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCE Q ss_conf 00211163134332121112100213443200000258644320 Q gi|254780942|r 289 ENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNF 332 (442) Q Consensus 289 g~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~ 332 (442) |.+++|++++.|..|+|-++|.|..|+.+- +|+||..+.||-- T Consensus 310 ga~vrvg~GvRl~~sIIl~d~ei~enavVl-~sIigw~s~iGrW 352 (407) T KOG1460 310 GANVRVGPGVRLRESIILDDAEIEENAVVL-HSIIGWKSSIGRW 352 (407) T ss_pred CCCCEECCCCEEEEEEECCCCEEECCCEEE-EEEECCCCCCCCE T ss_conf 688254688266545602673762163677-6541144322542 No 32 >cd06426 NTP_transferase_like_2 NTP_trnasferase_like_2 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=100.00 E-value=1.4e-45 Score=277.51 Aligned_cols=212 Identities=21% Similarity=0.331 Sum_probs=177.8 Q ss_pred EEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCC---CCCEEEEEECCCCC Q ss_conf 99458877232788---8744125884857999999999779977999957988999997302---69728999859988 Q gi|254780942|r 8 IVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFP---PTLSVEYYIQDCQQ 81 (442) Q Consensus 8 iILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~---~~~~i~~v~q~~~~ 81 (442) ||||||+||||+|. +||||+||+|+|||+|+|+.|.++|+++|+++++|+++++++++.. .+.+++|+.|++++ T Consensus 2 iILAaG~GtRl~plT~~~PK~Ll~i~gkpli~~~i~~l~~~gi~~i~i~~~y~~e~i~~~~~~~~~~~~~i~~i~e~~~l 81 (220) T cd06426 2 VIMAGGKGTRLRPLTENTPKPMLKVGGKPILETIIDRFIAQGFRNFYISVNYLAEMIEDYFGDGSKFGVNISYVREDKPL 81 (220) T ss_pred EEECCCCCCCCCHHHCCCCCCCEEECCCHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHCCCCCCCCEEEEEECCCCC T ss_conf 89377575578844469981032999953999999999983996899950146113233431566578438872146655 Q ss_pred CHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCCC Q ss_conf 614466410000123446058985154111533220000001-3586100100101221574404202333100100146 Q gi|254780942|r 82 GTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREEND 160 (442) Q Consensus 82 GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~d 160 (442) |||+||..+.+.++ ++|+|++||+ +...++.++++.| +++++.++++...+.+..||++..++++|.++.|||. T Consensus 82 Gt~gal~~~~~~~~---~~~iv~~gD~--i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~v~~~~ekp~ 156 (220) T cd06426 82 GTAGALSLLPEKPT---DPFLVMNGDI--LTNLNYEHLLDFHKENNADATVCVREYEVQVPYGVVETEGGRITSIEEKPT 156 (220) T ss_pred CHHHHHHHHHHCCC---CCEEEECCCE--ECCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEECCCCEEEEEECCC T ss_conf 57899986676189---9789977773--047599999999997499669999862568777659934883767776688 Q ss_pred CCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCC Q ss_conf 675421000355600111250146777532888740011121148999973077179991721430111210000001 Q gi|254780942|r 161 ATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIE 238 (442) Q Consensus 161 ~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~ 238 (442) . ++++++|+|+|+++. ++.++. .+++.++|+++.++++|.++.++..++. |.+++++.+|.+++ T Consensus 157 ~------~~~~~~Giy~~~~~~----~~~i~~---~~~~~~~d~~~~li~~g~~v~~~~~~~~-W~DiGt~e~l~~AN 220 (220) T cd06426 157 H------SFLVNAGIYVLEPEV----LDLIPK---NEFFDMPDLIEKLIKEGKKVGVFPIHEY-WLDIGRPEDYEKAN 220 (220) T ss_pred C------CCCCCEEEEEECHHH----HHHCCC---CCCCCCCHHHHHHHHCCCCEEEEEECCE-EEECCCHHHHHHHC T ss_conf 8------652331699989999----974689---9944545899999987998899974999-99698989999749 No 33 >cd04183 GT2_BcE_like GT2_BcbE_like is likely involved in the biosynthesis of the polysaccharide capsule. GT2_BcbE_like: The bcbE gene is one of the genes in the capsule biosynthetic locus of Pasteurella multocida. Its deducted product is likely involved in the biosynthesis of the polysaccharide capsule, which is found on surface of a wide range of bacteria. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=100.00 E-value=4.3e-44 Score=269.31 Aligned_cols=218 Identities=20% Similarity=0.277 Sum_probs=171.1 Q ss_pred EEECCCCCCCCCC---CCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHH---HHHHHHCCCCCEEE-EEECCCC Q ss_conf 9945887723278---887441258848579999999997799779999579889---99997302697289-9985998 Q gi|254780942|r 8 IVLAAGRGHRMKS---SSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAE---EITRINFPPTLSVE-YYIQDCQ 80 (442) Q Consensus 8 iILAaG~GtRl~p---~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e---~i~~~~~~~~~~i~-~v~q~~~ 80 (442) ||||||+||||+| .+||||+||+|||||+|+|+.|+.+++++++++++.... .+.........+.. +..|+++ T Consensus 2 VIlaaG~GtRl~~~t~~~PK~Ll~v~gkp~i~~~i~~l~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 81 (231) T cd04183 2 IIPMAGLGSRFKKAGYTYPKPLIEVDGKPMIEWVIESLAKIFDSRFIFICRDEHNTKFHLDESLKLLAPNATVVELDGET 81 (231) T ss_pred EEECCCCCCCCCCCCCCCCCEEEEECCEEHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCCCCCCEEEEECCCC T ss_conf 89867887364756799887501799957899999999864992468614546666788877751558971899956999 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCC-CCCCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCCC Q ss_conf 861446641000012344605898515411153322000000-13586100100101221574404202-3331001001 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDK-IAQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIREE 158 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~-~~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~Ek 158 (442) +|||+|++.+.+.++. +++|||++||+ +...++..++.. ..++.+++++++..++| +||.+..+ ++++.+++|| T Consensus 82 ~G~a~av~~a~~~i~~-~~~~lv~~gD~--~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~g~v~~~~~~~v~~i~EK 157 (231) T cd04183 82 LGAACTVLLAADLIDN-DDPLLIFNCDQ--IVESDLLAFLAAFRERDLDGGVLTFFSSHP-RWSYVKLDENGRVIETAEK 157 (231) T ss_pred CCHHHHHHHHHHHCCC-CCEEEEEECCC--CCCCCHHHHHHHHHHCCCCEEEEEEEECCC-CCCCEEECCCCCEEECCCC T ss_conf 9679999999986489-98099970761--136388999999985699879999991257-7763578479847650235 Q ss_pred CCCCCCCCCCCCCCCCHHEECCH-HHHHHHHHCC--CCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCC Q ss_conf 46675421000355600111250-1467775328--88740011121148999973077179991721430111210000 Q gi|254780942|r 159 NDATDEERKIHYCNSGLMAIDGL-YIMDWLLQIK--KNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELS 235 (442) Q Consensus 159 ~d~~~~~~~~~lin~GiY~f~~~-~L~~~l~~l~--~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~ 235 (442) ++ .++++++|+|+|+.. .+.+++.++. ..+.+||||+||+++.|+++|.++.++..+..+|.++++|+||. T Consensus 158 ~~------~s~~~~~GiY~F~~~~~f~~~~~~~~~~~~~~~gE~yltd~i~~li~~g~kv~~~~i~~~~~~d~GtPedle 231 (231) T cd04183 158 EP------ISDLATAGLYYFKSGSLFVEAAKKMIRKDDSVNGEFYISPLYNELILDGKKVGIYLIDKDDYHSFGTPEDLE 231 (231) T ss_pred CC------CCHHHCCEEEEECCHHHHHHHHHHHHHCCCCCCCEEEHHHHHHHHHHCCCEEEEEEECCCEEEECCCHHHCC T ss_conf 88------750020146997467999999999864486559999858999999988891899997899699896969849 No 34 >cd06422 NTP_transferase_like_1 NTP_transferase_like_1 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=100.00 E-value=1.1e-43 Score=267.16 Aligned_cols=212 Identities=19% Similarity=0.223 Sum_probs=164.9 Q ss_pred EEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC--CCEEEEEEC-CC Q ss_conf 8999458877232788---87441258848579999999997799779999579889999973026--972899985-99 Q gi|254780942|r 6 LAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPP--TLSVEYYIQ-DC 79 (442) Q Consensus 6 ~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~--~~~i~~v~q-~~ 79 (442) .|||||||+||||+|. +||||+||+|+|||+|+|+.|.++|+++|+++++|+++++++++... +.++.|+.| ++ T Consensus 1 KAvIlAgG~GtRl~plT~~~PKpllpv~~kpii~~~i~~l~~~gi~~iii~~~y~~~~i~~~~~~~~~~~~i~~~~e~~~ 80 (221) T cd06422 1 KAMILAAGLGTRMRPLTDTRPKPLVPVAGKPLIDHALDRLAAAGIRRIVVNTHHLADQIEAHLGDSRFGLRITISDEPDE 80 (221) T ss_pred CEEEECCCCCCCCCHHHCCCCHHEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCEEEEEECCC T ss_conf 98998884766887445799802379999899999999999829967998325327889998722557873799740332 Q ss_pred CCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCC--CCCCEEEECCCCCEECC Q ss_conf 8861446641000012344605898515411153322000000135-86100100101221--57440420233310010 Q gi|254780942|r 80 QQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADNP--KGYGRLLIKNNEIIAIR 156 (442) Q Consensus 80 ~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~dp--~~yGvV~~d~~~v~~i~ 156 (442) ++|||+||+++++++.+ ++|+|++||+ +...++.+++++|.. .++.+++...+++| ++||++..+++..+... T Consensus 81 ~lGT~~ai~~a~~~~~~--d~flv~~gD~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~i~~~ 156 (221) T cd06422 81 LLETGGGIKKALPLLGD--EPFLVVNGDI--LWDGDLAPLLLLHAWRMDALLLLLPLVRNPGHNGVGDFSLDADGRLRRG 156 (221) T ss_pred CCCCCHHHHHHHHHCCC--CCEEEEECCE--EECCCHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEEECCCEEEEEC T ss_conf 36650588888863499--8689980675--7818989999999970689779999867876557537987686589977 Q ss_pred CCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCC Q ss_conf 01466754210003556001112501467775328887400111211489999730771799917214301112100000 Q gi|254780942|r 157 EENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSL 236 (442) Q Consensus 157 Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~ 236 (442) ++ ...++++++|+|+|+++ ++++++ ++++.++|+++.++++| ++.++..+++ |.++++|.+|.+ T Consensus 157 ~~------~~~~~~~~~G~yif~~~-if~~i~-------~~~~~~~d~~~~li~~~-~v~~~~~~g~-w~DiGtpe~~~~ 220 (221) T cd06422 157 GG------GAVAPFTFTGIQILSPE-LFAGIP-------PGKFSLNPLWDRAIAAG-RLFGLVYDGL-WFDVGTPERLLA 220 (221) T ss_pred CC------CCCCEEEEEEEEEEEHH-HHHHCC-------CCCCCHHHHHHHHHHCC-CEEEEECCCE-EEECCCHHHHHH T ss_conf 88------87742788779998189-995478-------88775589999999669-8499972997-994999999974 Q ss_pred C Q ss_conf 0 Q gi|254780942|r 237 I 237 (442) Q Consensus 237 ~ 237 (442) | T Consensus 221 A 221 (221) T cd06422 221 A 221 (221) T ss_pred C T ss_conf 9 No 35 >KOG1461 consensus Probab=100.00 E-value=4e-41 Score=252.49 Aligned_cols=333 Identities=19% Similarity=0.224 Sum_probs=215.0 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC------CCEEE Q ss_conf 7408999458877232788---87441258848579999999997799779999579889999973026------97289 Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPP------TLSVE 73 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~------~~~i~ 73 (442) .+.+||+||--+-|||+|. +|+.|||++|.|||+|+|+.|.++|++++++.|+.++.||++|+.+. ...+. T Consensus 23 ~rLqAIllaDsf~trF~Plt~~~p~~LLPlaNVpmIdYtL~~L~~agV~eVfvfc~~~~~qi~e~i~~sew~~~~~~~v~ 102 (673) T KOG1461 23 HRLQAILLADSFETRFRPLTLEKPRVLLPLANVPMIDYTLEWLERAGVEEVFVFCSAHAAQIIEYIEKSEWYLPMSFIVV 102 (673) T ss_pred CCEEEEEEECCCHHCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHCCCCCCCCCEEE T ss_conf 64179997214100223133578744756447167999999998659618999951457999999863520366560279 Q ss_pred EEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC---CCCCCCCEEEECCCC------CCCC- Q ss_conf 998599886144664100001234460589851541115332200000013---586100100101221------5744- Q gi|254780942|r 74 YYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIA---QGYSIAVVGFNADNP------KGYG- 143 (442) Q Consensus 74 ~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~---~~~~~~i~~~~~~dp------~~yG- 143 (442) ++.-.+.+-.|||+|..-+. .-.++||++++||+ +++.+|..+++.|+ +....+++|+..... .+-- T Consensus 103 ti~s~~~~S~GDamR~id~k-~litgDFiLVsgd~--vsN~pl~~~l~eHr~r~k~Dk~~iMTmv~k~~st~~~~~~~~~ 179 (673) T KOG1461 103 TICSGESRSVGDAMRDIDEK-QLITGDFILVSGDT--VSNMPLRNVLEEHRKRRKEDKDAIMTMVFKESSTRETTEQVVI 179 (673) T ss_pred EECCCCCCCHHHHHHHHHHC-CEEECCEEEEECCE--EECCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCEEE T ss_conf 98078867288898887742-60106469983773--4247569999999987651724357899822445667760599 Q ss_pred EEEECCCCCEECCC----CCC---------C-CCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 04202333100100----146---------6-754210003556001112501467775328887400111211489999 Q gi|254780942|r 144 RLLIKNNEIIAIRE----END---------A-TDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKAR 209 (442) Q Consensus 144 vV~~d~~~v~~i~E----k~d---------~-~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~ 209 (442) .|...+++++.+-+ +.+ . ..-+.+.++..++|-++++..+.-+-++..-. . ..|.+.-++ T Consensus 180 avd~~T~~ll~yq~~~~~~~~~~l~~sl~d~~~~v~vr~DL~dc~IdIcS~~V~sLF~dNFDyq-~-----r~DfV~GvL 253 (673) T KOG1461 180 AVDSRTSRLLHYQKCVREKHDIQLDLSLFDSNDEVEVRNDLLDCQIDICSPEVLSLFTDNFDYQ-T-----RDDFVRGVL 253 (673) T ss_pred EECCCCCEEEEEHHHCCCCCCCCCCHHHHCCCCCEEEECCCCCCEEEEECHHHHHHHHHCCCCE-E-----HHHHHHHHH T ss_conf 9867766278652122545653368889558982899736777622576676777763145624-4-----444665433 Q ss_pred H---HHCCEEEEEECHH-HEECCCCCCCCCCCCCCCCCCCCCCCC--CCCCCEEECCC-EEEEEEEEECCCCCEECCCCC Q ss_conf 7---3077179991721-430111210000001222221122122--23321021152-799411233178410012232 Q gi|254780942|r 210 L---DGKSIASIDVKEQ-EVCGCNNRYELSLIENIWQSRYRRQMM--ISGVTMIAPET-VFLSHDTIIQPDTVIEPHVFF 282 (442) Q Consensus 210 ~---~g~~i~~~~~~~~-~~~gv~~~~~L~~~~~~~~~~~~~~~l--~~gv~~~~p~~-~~i~~~~~I~~~~~I~~~~~I 282 (442) . -|.+|.+...... +-.++.+ |..+..+-++.+.+|.. ..-..+..-.+ .+-..+..-++++++...+.+ T Consensus 254 ~~dilg~kI~~~~~~~~~yA~rv~n---~~syd~vSkDiI~RW~YP~Vpd~~~~~~q~~~~~r~~IYk~~dv~~~~~~~v 330 (673) T KOG1461 254 VDDILGYKIHVHVLSSIDYAARVEN---LRSYDLVSKDIIQRWTYPLVPDINFSGNQTFSLERRNIYKSPDVVLSHSVIV 330 (673) T ss_pred HHHHCCCEEEEEECCHHHHHHHHCC---CHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCEEHHHCCC T ss_conf 6654277689997573544431343---1888888899997514565566667787424311244014740100012012 Q ss_pred CCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCC Q ss_conf 234420002111631343321211121002134432000002586443202310211247854487 Q gi|254780942|r 283 GCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINH 348 (442) Q Consensus 283 g~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~ 348 (442) +.++.||.++.|+.++.|.|++||.||.||.+++| .+|.|+++|+||++|+|.+++|+++++|+. T Consensus 331 ~~~~~ig~gT~Ig~g~~I~NSVIG~~c~IgsN~~I-~~S~iw~~v~Igdnc~I~~aii~d~v~i~~ 395 (673) T KOG1461 331 GANVVIGAGTKIGSGSKISNSVIGANCRIGSNVRI-KNSFIWNNVTIGDNCRIDHAIICDDVKIGE 395 (673) T ss_pred CCEEEECCCCCCCCCCEEECCEECCCCEECCCEEE-EEEEEECCCEECCCCEEEEEEEECCCEECC T ss_conf 44079514541267985552266589774574588-513662583887886573468605837678 No 36 >cd02523 PC_cytidylyltransferase Phosphocholine cytidylyltransferases catalyze the synthesis of CDP-choline. This family contains proteins similar to prokaryotic phosphocholine (P-cho) cytidylyltransferases. Phosphocholine (PC) cytidylyltransferases catalyze the transfer of a cytidine monophosphate from CTP to phosphocholine to form CDP-choline. PC is the most abundant phospholipid in eukaryotic membranes and it is also important in prokaryotic membranes. For pathogenic prokaryotes, the cell surface PC facilitates the interaction with host surface and induces attachment and invasion. In addition cell wall PC serves as scaffold for a group of choline-binding proteins that are secreted from the cells. Phosphocholine (PC) cytidylyltransferase is a key enzyme in the prokaryotic choline metabolism pathway. It has been hypothesized to consist of a choline transport system, a choline kinase, CTP:phosphocholine cytidylyltransferase, and a choline phosphotransferase that transfers P-Cho from CDP Probab=100.00 E-value=1.8e-41 Score=254.47 Aligned_cols=221 Identities=20% Similarity=0.328 Sum_probs=163.7 Q ss_pred EEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECC--CCC Q ss_conf 999458877232788---874412588485799999999977997799995798899999730269728999859--988 Q gi|254780942|r 7 AIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQD--CQQ 81 (442) Q Consensus 7 AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~--~~~ 81 (442) |||||||+||||+|. +||||+||+|+|||+|+|+.|.++|+++|+++++|+++++++++.. ..+++++.++ ++. T Consensus 1 AiILAaG~GtRl~plT~~~PK~L~~i~gkpli~~~i~~l~~~gi~~i~iv~gy~~e~i~~~~~~-~~~i~~i~n~~~~~~ 79 (229) T cd02523 1 AIILAAGRGSRLRPLTEDRPKCLLEINGKPLLERQIETLKEAGIDDIVIVTGYKKEQIEELLKK-YPNIKFVYNPDYAET 79 (229) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHH-CCCCCCEEEEECCCC T ss_conf 9893687765688645798803217998799999999999849985366555489999999853-136540340002367 Q ss_pred CHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCCCC Q ss_conf 61446641000012344605898515411153322000000135861001001012215744042023331001001466 Q gi|254780942|r 82 GTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREENDA 161 (442) Q Consensus 82 GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~d~ 161 (442) ||++|+++++++++ ++|+|+|||+ +++...++.+. .++.....++.....+++..|++...+.+.+..++|++.. T Consensus 80 g~~~sl~~a~~~~~---~~~lv~~gD~-i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (229) T cd02523 80 NNIYSLYLARDFLD---EDFLLLEGDV-VFDPSILERLL-SSPADNAILVDKKTKEWEDEYVKDLDDAGVLLGIISKAKN 154 (229) T ss_pred CHHHHHHHHHHHCC---CCEEEEECCE-ECCHHHHHHHH-HCCCCCEEEEEECCCCCCCCEEEEECCCCCEEEEEECCCC T ss_conf 82999998887558---9859995776-54667999998-4778887999956457787447994268708999868999 Q ss_pred CCCCCCCCCCCCCHHEECCHHH---HHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCC Q ss_conf 7542100035560011125014---6777532888740011121148999973077179991721430111210000001 Q gi|254780942|r 162 TDEERKIHYCNSGLMAIDGLYI---MDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIE 238 (442) Q Consensus 162 ~~~~~~~~lin~GiY~f~~~~L---~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~ 238 (442) .. ..+.+++|+|.|+...+ ++.++.+...+ ++++|++|+++.+++++. +.++...+..|.+++++.||..++ T Consensus 155 ~~---~~~~~~~Gi~~~~~~~~~~l~~~l~~~~~~~-~~~~~~~~~i~~li~~~~-~~~~~~~~~~W~dIgt~edl~~Ae 229 (229) T cd02523 155 LE---EIQGEYVGISKFSPEDADRLAEALEELIEAG-RVNLYYEDALQRLISEEG-VKVKDISDGFWYEIDDLEDLERAE 229 (229) T ss_pred CC---CCCCEEEEEEEECHHHHHHHHHHHHHHCCCC-CCCCCHHHHHHHHHHCCC-CEEEECCCCEEEECCCHHHHHHHC T ss_conf 88---7462899999998999999999999728677-787239999999980799-669984893199696999999649 No 37 >TIGR01099 galU UTP-glucose-1-phosphate uridylyltransferase; InterPro: IPR005771 Uridine 5'-triphosphate:glucose-1-phosphate uridylyltransferase (uridine diphosphoglucose pyrophosphorylase, 2.7.7.9 from EC) is responsible for the synthesis of UDP-glucose, a key compound in the biosynthesis of polysaccharides. Glucose is fermented through the glycolysis step to pyruvate, which in turn is converted to lactate. The glycolytic intermediate glucose 6-phosphate is converted to glucose 1-phosphate by phosphoglucomutase activity, and this metabolite is subsequently converted to UDP-glucose by UDP-glucose pyrophosphorylase (GalU) activity. ; GO: 0003983 UTP:glucose-1-phosphate uridylyltransferase activity, 0006011 UDP-glucose metabolic process. Probab=100.00 E-value=9.8e-40 Score=244.58 Aligned_cols=222 Identities=24% Similarity=0.309 Sum_probs=179.4 Q ss_pred CEEEEECCCCCCCCCCCC---CCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHC--------------- Q ss_conf 089994588772327888---74412588485799999999977997799995798899999730--------------- Q gi|254780942|r 5 RLAIVLAAGRGHRMKSSS---SKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINF--------------- 66 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~~---pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~--------------- 66 (442) |-|||.|||.||||-|.| ||-||||-+||+|+|+++-+.++|+++|++|||-++..|++|+. T Consensus 2 rKAViPaAGlGTRfLPATKA~PKEMLPiVdKP~IQY~VEEav~aGie~i~~VTGr~K~aIEDHFD~s~ELE~~L~~~~K~ 81 (270) T TIGR01099 2 RKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPLIQYIVEEAVEAGIEDILIVTGRGKRAIEDHFDTSYELEHQLEKKNKE 81 (270) T ss_pred CEEEECCCCCCCCCCHHHHCCCCCCCCCCCCCEEHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 77885276753100234516886567645874210457989864893479996588501014258788899998751048 Q ss_pred ---------CCC-CEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCC-----CCCCCCCCCCCCCCCCC Q ss_conf ---------269-728999859988614466410000123446058985154111533-----22000000135861001 Q gi|254780942|r 67 ---------PPT-LSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSH-----TLKKAMDKIAQGYSIAV 131 (442) Q Consensus 67 ---------~~~-~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~-----~l~~l~~~~~~~~~~~i 131 (442) ... .++.|+.|.+++|-||||++|++++.+ ++|.|+-||..+.... .|++|++.|.+....+| T Consensus 82 ~lL~~v~~I~~~~~~i~yvRQke~kGLGhAvL~a~~~vGd--EpFaV~LgDdi~~~~~~D~~~~Lkqm~~~Y~~~g~s~i 159 (270) T TIGR01099 82 ELLKEVRKISNLNANIFYVRQKEQKGLGHAVLCARPLVGD--EPFAVLLGDDIVVSEEPDLQEALKQMIDLYEKYGCSII 159 (270) T ss_pred HHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHCCCCCCC--CCEEEEECCEEEECCCCCHHHHHHHHHHHHHHHCCEEE T ss_conf 8999999862699728998138888551275435212588--86067635265635887546899999999986388178 Q ss_pred EEEEC--CCCCCCCEEEECC-------CC--CEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHH Q ss_conf 00101--2215744042023-------33--1001001466754210003556001112501467775328887400111 Q gi|254780942|r 132 VGFNA--DNPKGYGRLLIKN-------NE--IIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYY 200 (442) Q Consensus 132 ~~~~~--~dp~~yGvV~~d~-------~~--v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~y 200 (442) ...+| ++.++||++..+. .. |..++|||. +++.+|||+=.|=|+|+ +.+|+.|++.++. ..+|.. T Consensus 160 aV~~Vp~e~V~~YGii~~~~dy~~~~~y~s~i~~mVEKP~--~e~APSNLAi~GRYvL~-P~IF~~L~~t~~G-~GgEIQ 235 (270) T TIGR01099 160 AVEEVPKEEVSKYGIIDGEGDYLEEGLYESVIKDMVEKPK--PEEAPSNLAIVGRYVLT-PDIFDLLEETPPG-KGGEIQ 235 (270) T ss_pred EEEECCHHHCCCCEEECCCCCHHHCCCEEEEEECCEECCC--CCCCCCCEEEECHHCCC-HHHHHHHHHCCCC-CCCCCH T ss_conf 8863445517755157274200012322342214140766--78887421022001168-5688887617999-997311 Q ss_pred HHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCC Q ss_conf 2114899997307717999172143011121000 Q gi|254780942|r 201 LTDIIEKARLDGKSIASIDVKEQEVCGCNNRYEL 234 (442) Q Consensus 201 ltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L 234 (442) |||.+..++++. .|.++.+.+..+ +++++..+ T Consensus 236 LTDAl~~l~~~e-~v~A~~f~G~ry-D~G~~~Gy 267 (270) T TIGR01099 236 LTDALRKLLKKE-RVLAYKFEGKRY-DCGSKLGY 267 (270) T ss_pred HHHHHHHHHHCC-CEEEEEEECCEE-EECCCCCC T ss_conf 888999986239-778887503166-54483331 No 38 >KOG1462 consensus Probab=100.00 E-value=9.1e-39 Score=239.07 Aligned_cols=335 Identities=19% Similarity=0.257 Sum_probs=185.1 Q ss_pred CCEEEEECCCCCCCCCC---CCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECC-CHHHHHHHHCCC-CCEEE--EEE Q ss_conf 40899945887723278---887441258848579999999997799779999579-889999973026-97289--998 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKS---SSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGY-GAEEITRINFPP-TLSVE--YYI 76 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p---~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~-~~e~i~~~~~~~-~~~i~--~v~ 76 (442) +.+|||||||-||||-- ..||+||||+|+|||+|+|++|.++||++++|++.. ..+.++..+... .++.+ ++. T Consensus 9 efqavV~a~~ggt~~p~~~~~~pKaLLPIgn~PMi~YpL~~L~~~gfteiiVv~~e~e~~~i~~al~~~~~l~~~~~~v~ 88 (433) T KOG1462 9 EFQAVVLAGGGGTRMPEVTSRLPKALLPIGNKPMILYPLNSLEQAGFTEIIVVVNEDEKLDIESALGSNIDLKKRPDYVE 88 (433) T ss_pred HHHHHEEECCCCEECHHHHHHCCHHHCCCCCCCEEEEEHHHHHHCCCEEEEEEECHHHHHHHHHHHHCCCCCCCCCCEEE T ss_conf 76553252589604544452363211245896436412668975697389999537889999998750775432132798 Q ss_pred ----CCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC-CCCEEEE----CCCCCC------ Q ss_conf ----5998861446641000012344605898515411153322000000135861-0010010----122157------ Q gi|254780942|r 77 ----QDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYS-IAVVGFN----ADNPKG------ 141 (442) Q Consensus 77 ----q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~-~~i~~~~----~~dp~~------ 141 (442) .++..|||++|+..-..++. +||||++|| |+++..+..+++.++...+ .+++... +..|.+ T Consensus 89 ip~~~~~d~gtadsLr~Iy~kikS--~DflvlsCD--~Vtdv~l~~lvd~FR~~d~slamli~~~~s~~~~pgqk~k~k~ 164 (433) T KOG1462 89 IPTDDNSDFGTADSLRYIYSKIKS--EDFLVLSCD--FVTDVPLQPLVDKFRATDASLAMLIGNALSEVPIPGQKGKKKQ 164 (433) T ss_pred EECCCCCCCCCHHHHHHHHHHHCC--CCEEEEECC--CCCCCCCHHHHHHHHCCCHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 304563344778888654554146--877999653--0268880888988752476586775264544536673445544 Q ss_pred ----CCEEEECCCCCE------------ECCCC--CCCCCCCCCCCCCCCCHHEECCHHHHHHHHHC-CCCCCCCHHHHH Q ss_conf ----440420233310------------01001--46675421000355600111250146777532-888740011121 Q gi|254780942|r 142 ----YGRLLIKNNEII------------AIREE--NDATDEERKIHYCNSGLMAIDGLYIMDWLLQI-KKNKVSQEYYLT 202 (442) Q Consensus 142 ----yGvV~~d~~~v~------------~i~Ek--~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l-~~~~~~gE~ylt 202 (442) +|+. .+..|+. .+.-+ +..+.-...++|.++.+|+|+.+. .++|.+- ....+++|| ++ T Consensus 165 ~~d~igi~-e~t~rl~y~~~~~d~~~~l~i~~slL~~~prltl~t~L~dahiY~~k~~v-~d~l~~~~sisSfk~~f-~P 241 (433) T KOG1462 165 ARDVIGIN-EDTERLAYSSDSADEEEPLVIRKSLLWNHPRLTLTTKLVDAHIYVFKHWV-IDLLSEKESISSFKADF-LP 241 (433) T ss_pred CCCEEEEC-CCCCEEEEEECCCCCCCCEEHHHHHHHCCCCEEEECCCCCEEEEEEHHHH-HHHHHCCCCCEEECCCC-CC T ss_conf 02134441-56633688405776777432113455218864886154314664238999-99973178620200145-53 Q ss_pred HHHHHHHHHH----------------------------CCEEEEEECHH-HEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 1489999730----------------------------77179991721-430111210000001222221122122233 Q gi|254780942|r 203 DIIEKARLDG----------------------------KSIASIDVKEQ-EVCGCNNRYELSLIENIWQSRYRRQMMISG 253 (442) Q Consensus 203 Dii~~l~~~g----------------------------~~i~~~~~~~~-~~~gv~~~~~L~~~~~~~~~~~~~~~l~~g 253 (442) +++..+.... .++.+|..... -...+|+...+.++++ ++.... T Consensus 242 ~lvkkQ~q~~~~~~~~~~~~l~t~~~~~~d~~~~~~d~ik~y~~~~p~e~~~~raNtL~~y~eiN~---~k~~~~----- 313 (433) T KOG1462 242 YLVKKQFQKNPPLKKNETSILPTPNLNNPDGIHSPDDRIKCYAYILPTESLFVRANTLLSYMEINR---DKKLKK----- 313 (433) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCHHHHHHHHH---HHHHHH----- T ss_conf 666656530777565300136775556853234744511356899047653587533677776407---889987----- Q ss_pred CCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEE Q ss_conf 21021152799411233178410012232234420002111631343321211121002134432000002586443202 Q gi|254780942|r 254 VTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFC 333 (442) Q Consensus 254 v~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ 333 (442) ..|+.-++. +... ....+|..+.++++|.|+.++.|..|+||+||.||+.+.+. +|.|++++.||+.+ T Consensus 314 ---l~~e~~~~k-------~~~~-~~~l~g~d~iv~~~t~i~~~s~ik~SviG~nC~Ig~~~~v~-nSilm~nV~vg~G~ 381 (433) T KOG1462 314 ---LCSEAKFVK-------NYVK-KVALVGADSIVGDNTQIGENSNIKRSVIGSNCDIGERVKVA-NSILMDNVVVGDGV 381 (433) T ss_pred ---HCCCCCCCC-------CHHH-HEECCCHHHCCCCCCEECCCCEEEEEEECCCCCCCCCCEEE-EEEEECCCEECCCC T ss_conf ---364456641-------4456-21102323101788552555402200125776116872777-11766583755884 Q ss_pred EEECCCCCCCCCCCCCCCCCCCEECCCCEECC Q ss_conf 31021124785448745556639889989979 Q gi|254780942|r 334 EVKKATIKEGSKINHLSYVGDSVVGKNVNIGA 365 (442) Q Consensus 334 ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~ 365 (442) .++||+|+.|++|+..|.+-||.||.+-.+.| T Consensus 382 ~IensIIg~gA~Ig~gs~L~nC~Ig~~yvVea 413 (433) T KOG1462 382 NIENSIIGMGAQIGSGSKLKNCIIGPGYVVEA 413 (433) T ss_pred CEECCEECCCCEECCCCEEEEEEECCCCEECC T ss_conf 02042662665654797033348559827766 No 39 >COG1210 GalU UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=1.2e-36 Score=227.08 Aligned_cols=227 Identities=24% Similarity=0.330 Sum_probs=181.5 Q ss_pred CCCCCEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHC----------- Q ss_conf 987408999458877232788---874412588485799999999977997799995798899999730----------- Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINF----------- 66 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~----------- 66 (442) |.+-+.|||.|||.||||-|. .||-||||-+||+|+|+++.+.++|+++|++||+.++..|++|+. T Consensus 1 ~~~irKAViPaAGlGTRfLPATKaiPKEMLPIvdKP~IqYiVeEa~~aGIe~i~iVTgr~K~~IeDhFD~s~ELE~~L~~ 80 (291) T COG1210 1 MMKIRKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPLIQYIVEEAVAAGIEEILIVTGRGKRAIEDHFDTSYELENTLEK 80 (291) T ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHCCCCHHHHHHHHH T ss_conf 97531798874676455466003576644366672219999999998699779999669835688867686999999987 Q ss_pred -------------CCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCC--CCCCCCCCCCCCCCCCCC Q ss_conf -------------26972899985998861446641000012344605898515411153--322000000135861001 Q gi|254780942|r 67 -------------PPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSS--HTLKKAMDKIAQGYSIAV 131 (442) Q Consensus 67 -------------~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~--~~l~~l~~~~~~~~~~~i 131 (442) ....++.|+.|.+++|.|||+++|++++.+ ++|.|+-+|. ++.. ..+++|++.+++....++ T Consensus 81 ~~K~~~L~~v~~i~~~~~i~~vRQ~e~~GLGhAVl~A~~~vg~--EpFaVlL~Dd-l~~~~~~~l~qmi~~y~~~g~svi 157 (291) T COG1210 81 RGKRELLEEVRSIPPLVTISFVRQKEPLGLGHAVLCAKPFVGD--EPFAVLLPDD-LVDSEKPCLKQMIELYEETGGSVI 157 (291) T ss_pred HCHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCCC--CCEEEEECCE-EECCCCHHHHHHHHHHHHHCCCEE T ss_conf 2779999999723577539998658767601788853543289--8569981772-632780699999999998599279 Q ss_pred EEEEC--CCCCCCCEEE----ECCC--CCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHH Q ss_conf 00101--2215744042----0233--31001001466754210003556001112501467775328887400111211 Q gi|254780942|r 132 VGFNA--DNPKGYGRLL----IKNN--EIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTD 203 (442) Q Consensus 132 ~~~~~--~dp~~yGvV~----~d~~--~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltD 203 (442) ...++ ++.++||++. .+.+ ++..++|||. +++..||+.-.|-|+|. +.+|+.|++.+.. ..+|..||| T Consensus 158 ~v~ev~~e~v~kYGvi~~g~~~~~~~~~v~~~VEKP~--~~~APSnlai~GRYil~-p~IF~~L~~~~~G-~ggEiQLTD 233 (291) T COG1210 158 GVEEVPPEDVSKYGVIDPGEPVEKGVYKVKGMVEKPK--PEEAPSNLAIVGRYVLT-PEIFDILEETKPG-AGGEIQLTD 233 (291) T ss_pred EEEECCHHHCCCCCEEECCCCCCCCEEEEEEEEECCC--CCCCCCCEEEEEEEECC-HHHHHHHHHCCCC-CCCEEEHHH T ss_conf 9998887877656267248503577289977776699--99998541576546568-9999999627999-998766799 Q ss_pred HHHHHHHHHCCEEEEEECHHHEECCCCCCCCCC Q ss_conf 489999730771799917214301112100000 Q gi|254780942|r 204 IIEKARLDGKSIASIDVKEQEVCGCNNRYELSL 236 (442) Q Consensus 204 ii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~ 236 (442) .+..+.++ ..+.++.+.+..+ +++++..+.+ T Consensus 234 ai~~L~~~-~~v~a~~~~Gkry-D~G~k~Gyi~ 264 (291) T COG1210 234 AIKKLLKK-EPVLAYVFEGKRY-DCGSKLGYIK 264 (291) T ss_pred HHHHHHHH-CCEEEEEECCCEE-CCCCCCCHHH T ss_conf 99999853-8579999456276-1687124899 No 40 >cd03352 LbH_LpxD UDP-3-O-acyl-glucosamine N-acyltransferase (LpxD): The enzyme catalyzes the transfer of 3-hydroxymyristic acid or 3-hydroxy-arachidic acid, depending on the organism, from the acyl carrier protein (ACP) to UDP-3-O-acyl-glucosamine to produce UDP-2,3-diacyl-GlcNAc. This constitutes the third step in the lipid A biosynthetic pathway in Gram-negative bacteria. LpxD is a homotrimer, with each subunit consisting of a novel combination of an N-terminal uridine-binding domain, a core lipid-binding left-handed parallel beta helix (LbH) domain, and a C-terminal alpha-helical extension. The LbH domain contains 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=100.00 E-value=2.3e-37 Score=231.05 Aligned_cols=170 Identities=29% Similarity=0.430 Sum_probs=140.6 Q ss_pred EECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCC-CCCCCCCCCCCCCCCCCCC---------------- Q ss_conf 21152799411233178410012232234420002111631343-3212111210021344320---------------- Q gi|254780942|r 257 IAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYL-EGVHIGKKTIIGPFARIRQ---------------- 319 (442) Q Consensus 257 ~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i-~~~~Ig~~~~Ig~~~~i~~---------------- 319 (442) ++| +++|++++.|++++.|+|+++|+++++||++|.|++++.| +++.||++|.|++++.|.. T Consensus 10 I~~-~avI~~~v~Ig~n~~I~~~~~I~~~v~IG~~~~I~~~~~I~~~~~IG~~~~I~~~~vIg~~~f~~~~~~~~~~~~~ 88 (205) T cd03352 10 IGP-NAVIGEGVVIGDGVVIGPGVVIGDGVVIGDDCVIHPNVTIYEGCIIGDRVIIHSGAVIGSDGFGFAPDGGGWVKIP 88 (205) T ss_pred ECC-CCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCEEEECCCEEECCCCCCCCCCCCCCEECC T ss_conf 999-8799999999999999898899999688988366003447278687544777153486213532314798404514 Q ss_pred ---CEEECCCCCCCCEEEEE-----CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCC Q ss_conf ---00002586443202310-----2112478544874555663988998997980883445775456298788798869 Q gi|254780942|r 320 ---ETTIEKNVRIGNFCEVK-----KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSN 391 (442) Q Consensus 320 ---~s~i~~~~~Ig~~~ei~-----~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~ 391 (442) +..|+++|.||.++.+. ++.|++++++.+++ +||++|.||+++++++++...+ ++.|||+|+||.+ T Consensus 89 ~~g~v~Ig~~v~Ig~~~~I~~g~~~~T~IG~~~~i~~~v-----~igh~~~iG~~~~i~~~~~~~g-~~~Igd~~~iG~~ 162 (205) T cd03352 89 QLGGVIIGDDVEIGANTTIDRGALGDTVIGDGTKIDNLV-----QIAHNVRIGENCLIAAQVGIAG-STTIGDNVIIGGQ 162 (205) T ss_pred CCCCEEECCCEEECCCCEECCCCCCCCCCCCCCEECCCC-----CCCCCCCCCCCEEECCCCCCCC-CEEECCCCEECCC T ss_conf 213189848849888627636642677478776864545-----2556404158707503200026-5249743539896 Q ss_pred CEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECC Q ss_conf 989378398899899778357664369868981356077077 Q gi|254780942|r 392 SSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKED 433 (442) Q Consensus 392 ~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~ 433 (442) ++++||++||++|+|||||+|+|||||++++.|.|.+..||| T Consensus 163 s~i~~gv~IG~~a~VgagSvVtkdVp~~~iv~G~PAr~ir~~ 204 (205) T cd03352 163 VGIAGHLTIGDGVVIGAGSGVTSIVPPGEYVSGTPAQPHREW 204 (205) T ss_pred CEECCCCEECCCCEECCCCEECCCCCCCCEEEEEECCHHHHH T ss_conf 488299399999899868879648699969997628435641 No 41 >cd02508 ADP_Glucose_PP ADP-glucose pyrophosphorylase is involved in the biosynthesis of glycogen or starch. ADP-glucose pyrophosphorylase (glucose-1-phosphate adenylyltransferase) catalyzes a very important step in the biosynthesis of alpha 1,4-glucans (glycogen or starch) in bacteria and plants: synthesis of the activated glucosyl donor, ADP-glucose, from glucose-1-phosphate and ATP. ADP-glucose pyrophosphorylase is a tetrameric allosterically regulated enzyme. While a homotetramer in bacteria, in plant chloroplasts and amyloplasts, it is a heterotetramer of two different, yet evolutionary related, subunits. There are a number of conserved regions in the sequence of bacterial and plant ADP-glucose pyrophosphorylase subunits. It is a subfamily of a very diverse glycosy transferase family 2. Probab=100.00 E-value=9.9e-36 Score=221.77 Aligned_cols=178 Identities=21% Similarity=0.274 Sum_probs=142.5 Q ss_pred EEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC---CC-----EEEE Q ss_conf 999458877232788---87441258848-579999999997799779999579889999973026---97-----2899 Q gi|254780942|r 7 AIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPP---TL-----SVEY 74 (442) Q Consensus 7 AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~---~~-----~i~~ 74 (442) |||||||+||||||. +||||+||+|| |||+|+|++|.++|+++|+++++|+.+++.+|+... .. .+.+ T Consensus 1 AvIlagG~GtRL~pLT~~~PKp~lpi~gk~~iId~~l~~l~~~Gi~~i~i~~~y~~~~i~~hl~~g~~~~~~~~~~gi~i 80 (200) T cd02508 1 AIILAGGEGTRLSPLTKKRAKPAVPFGGRYRLIDFPLSNMVNSGIRNVGVLTQYKSRSLNDHLGSGKEWDLDRKNGGLFI 80 (200) T ss_pred CEEECCCCCCCCHHHHCCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCEEE T ss_conf 98957888863625447997413088991620799999887679767998345358999999856401257656685899 Q ss_pred E------ECCCCCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCCCCCCEEE Q ss_conf 9------8599886144664100001234-4605898515411153322000000135-861001001012215744042 Q gi|254780942|r 75 Y------IQDCQQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADNPKGYGRLL 146 (442) Q Consensus 75 v------~q~~~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~dp~~yGvV~ 146 (442) . .+...+|||+|++.+++++... .+.|+|+|||+ +++.++++++++|.+ ++++|+.. T Consensus 81 ~~~~~~~~~~~~~Gt~~~i~~~~~~l~~~~~~~~lv~ngD~--l~~~D~~~~l~~H~~~~a~~Ti~~------------- 145 (200) T cd02508 81 LPPQQRKGGDWYRGTADAIYQNLDYIERSDPEYVLILSGDH--IYNMDYREMLDFHIESGADITVVY------------- 145 (200) T ss_pred EECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCE--EEECCHHHHHHHHHHCCCCEEEEE------------- T ss_conf 63133368765534789999999987217887299976997--870699999999987699989999------------- Q ss_pred ECCCCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHE Q ss_conf 0233310010014667542100035560011125014677753288874001112114899997307717999172143 Q gi|254780942|r 147 IKNNEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEV 225 (442) Q Consensus 147 ~d~~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~ 225 (442) .+|+|+|+|+++.|.+++++....+ ...+++|+++.+++++ ++.+|..++.|. T Consensus 146 -----------------------~in~GiYi~~~~~l~~~l~~~~~~~--~~d~~~d~ip~l~~~~-~v~~y~~~gyw~ 198 (200) T cd02508 146 -----------------------KASMGIYIFSKDLLIELLEEDAADG--SHDFGKDIIPAMLKKL-KIYAYEFNGYWA 198 (200) T ss_pred -----------------------EEECEEEEEEHHHHHHHHHHHCCCC--CCCCHHHHHHHHHHCC-CEEEEECCCEEE T ss_conf -----------------------8504899980899999998632467--6755899999996039-889996688574 No 42 >PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=100.00 E-value=6.3e-36 Score=222.91 Aligned_cols=178 Identities=20% Similarity=0.240 Sum_probs=147.9 Q ss_pred EEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCC------------CCEE Q ss_conf 0211527994112331784100122322344200021116313433-21211121002134432------------0000 Q gi|254780942|r 256 MIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIR------------QETT 322 (442) Q Consensus 256 ~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~------------~~s~ 322 (442) +++|. .+|++++.|++++.|+|+|+|++++.||++|.|+++++|. ++.||++|.|.+++.|. .... T Consensus 2 ~IHpt-AiI~~~A~Ig~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~ 80 (256) T PRK12461 2 MIHPT-AVIDPSAKLGEGVEIGPFAVIGADVEIGDGTWIMPHAVILGWTRIGKNNKIHQGAVVGDEPQDFGYKGEESHLI 80 (256) T ss_pred CCCCC-CEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEEEECCCCCCCCCCCCCCCCCCEEE T ss_conf 29998-79899899999999999999999999999999949818919989997439975431176333444158620699 Q ss_pred ECCCCCCCCEEEEECCCCCCC-CCCCCCCCC-CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEE Q ss_conf 025864432023102112478-544874555-663988998997980883445775456298788798869989378398 Q gi|254780942|r 323 IEKNVRIGNFCEVKKATIKEG-SKINHLSYV-GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITI 400 (442) Q Consensus 323 i~~~~~Ig~~~ei~~s~i~~g-~~i~~~~~i-gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~i 400 (442) |++++.|+.+|.+.+.+.+.+ ++|++.+++ ..++|++||.||.+++++|..+..+| +.|||+|+||.++.++++++| T Consensus 81 IGd~~~Ire~~tI~rgt~~~~~T~IG~~~~im~~vhIaHd~~IG~~~iian~~~laG~-v~Igd~v~iGg~~~v~q~v~I 159 (256) T PRK12461 81 IGDNNVIREGVTIHRGTKGGTVTRIGNDNLLMANSHIAHDCQIGNNVILVNGALLAGH-VTVGDRAIISGNCLVHQFCRI 159 (256) T ss_pred ECCCCEECCCCCCCCCCCCCCCEEECCCCEEECCCCCCCCCCCCCCEEEECCEEECCC-EEECCCCEECCCCEECCCCEE T ss_conf 8998787987556786148986698878587436433464313677067222023242-599997299573477179799 Q ss_pred CCCCEECCCCEECCCCCCCCEEEECCCEEEECCCC Q ss_conf 89989977835766436986898135607707742 Q gi|254780942|r 401 GQGTYVASGSIITQDTPENSLVFARSRQIVKEDGA 435 (442) Q Consensus 401 g~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~~~ 435 (442) |++|+||++|.|++||||+++|.|.|..+.--|.. T Consensus 160 G~~a~Ig~gS~V~kDVPp~~~v~G~pA~~~glN~v 194 (256) T PRK12461 160 GRLAMMAGGSRISKDVPPYCMMAGNTTNVHGLNAV 194 (256) T ss_pred CCCCEECCCCEEEEECCCCEEEECCCCEEEECHHH T ss_conf 99969987857775379983882687765401044 No 43 >cd03351 LbH_UDP-GlcNAc_AT UDP-N-acetylglucosamine O-acyltransferase (UDP-GlcNAc acyltransferase): Proteins in this family catalyze the transfer of (R)-3-hydroxymyristic acid from its acyl carrier protein thioester to UDP-GlcNAc. It is the first enzyme in the lipid A biosynthetic pathway and is also referred to as LpxA. Lipid A is essential for the growth of Escherichia coli and related bacteria. It is also essential for maintaining the integrity of the outer membrane. UDP-GlcNAc acyltransferase is a homotrimer of left-handed parallel beta helix (LbH) subunits. Each subunit contains an N-terminal LbH region with 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal alpha-helical region. Probab=100.00 E-value=6.3e-36 Score=222.91 Aligned_cols=177 Identities=24% Similarity=0.320 Sum_probs=149.8 Q ss_pred EECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCC------------CCEEE Q ss_conf 211527994112331784100122322344200021116313433-21211121002134432------------00000 Q gi|254780942|r 257 IAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIR------------QETTI 323 (442) Q Consensus 257 ~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~------------~~s~i 323 (442) ++| +..|++++.|++++.|+|.|+|+++++||++|.|+++++|. ++.||++|.|++++.|. .+..| T Consensus 2 IHp-tAiI~~~A~IG~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~I 80 (254) T cd03351 2 IHP-TAIVDPGAKIGENVEIGPFCVIGPNVEIGDGTVIGSHVVIDGPTTIGKNNRIFPFASIGEAPQDLKYKGEPTRLEI 80 (254) T ss_pred CCC-CCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEECCCCCCCCCCCCCCCCCCCCEEEE T ss_conf 189-7796998999997999999899999999999999799689199797765256057534778767673798676997 Q ss_pred CCCCCCCCEEEEECCCCCCC--CCCCCCCCC-CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEE Q ss_conf 25864432023102112478--544874555-663988998997980883445775456298788798869989378398 Q gi|254780942|r 324 EKNVRIGNFCEVKKATIKEG--SKINHLSYV-GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITI 400 (442) Q Consensus 324 ~~~~~Ig~~~ei~~s~i~~g--~~i~~~~~i-gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~i 400 (442) |+++.|+.+|.+.+.+...+ ++|++.+++ ..++|++||.||.+++++|..+..+| +.|||+|+||.++.++++++| T Consensus 81 G~~~~Ire~vtI~rgt~~~~~~T~IG~~~~im~~~HIaHdc~IG~~~iian~~~laGh-v~Igd~a~IGg~~~v~q~v~I 159 (254) T cd03351 81 GDNNTIREFVTIHRGTAQGGGVTRIGNNNLLMAYVHVAHDCVIGNNVILANNATLAGH-VEIGDYAIIGGLSAVHQFCRI 159 (254) T ss_pred CCCCEECCCCCCCCCCCCCCCCEEECCCCEEEECCCCCCCCCCCCCEEEECCCCCCCC-EEECCCCEECCCCEECCCCEE T ss_conf 9977899533023674378875597888578202544567733797599334455685-698998489883677598199 Q ss_pred CCCCEECCCCEECCCCCCCCEEEECCCEEEECCCC Q ss_conf 89989977835766436986898135607707742 Q gi|254780942|r 401 GQGTYVASGSIITQDTPENSLVFARSRQIVKEDGA 435 (442) Q Consensus 401 g~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~~~ 435 (442) |++|+||+||+|++||||+++|.|.|..+.-=|.. T Consensus 160 G~~a~Igags~V~kDVpp~~~v~G~PA~~~glN~v 194 (254) T cd03351 160 GRHAMVGGGSGVVQDVPPYVIAAGNRARLRGLNLV 194 (254) T ss_pred CCCEEECCCCEEECCCCCCCEECCCCCEEEEEEHH T ss_conf 96619976745611369872651796578721489 No 44 >cd04197 eIF-2B_epsilon_N The N-terminal domain of epsilon subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of epsilon subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=100.00 E-value=6.9e-35 Score=216.96 Aligned_cols=176 Identities=18% Similarity=0.188 Sum_probs=132.1 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC--------CCEEE Q ss_conf 08999458877232788---87441258848579999999997799779999579889999973026--------97289 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPP--------TLSVE 73 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~--------~~~i~ 73 (442) ++|||||||+||||+|. +||||+||+|+|||+|+|++|.++|+++|+++++|+.+++++|+... ...+. T Consensus 1 ~qAVIlAgg~gtRl~PlT~~~PK~LlPv~n~Pli~y~l~~L~~~G~~ei~v~~~~~~~~i~~~~~~~~~~~~~~~~~~v~ 80 (217) T cd04197 1 LQAVVLADSFNRRFRPLTKEKPRCLLPLANVPLIDYTLEFLALNGVEEVFVFCCSHSDQIKEYIEKSKWSKPKSSLMIVI 80 (217) T ss_pred CEEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCCCEEE T ss_conf 95999878887725620358986533799998399999999987997799994689999999998432215667773489 Q ss_pred EEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC------CCCCCEEEECCCCC------- Q ss_conf 99859988614466410000123446058985154111533220000001358------61001001012215------- Q gi|254780942|r 74 YYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQG------YSIAVVGFNADNPK------- 140 (442) Q Consensus 74 ~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~------~~~~i~~~~~~dp~------- 140 (442) +...++..++|+|++...+. ....+||+|++||+ +++.+++++++.|++. +.+|++..+...+. T Consensus 81 ~~~~~~~~~~Gdalr~l~~~-~~i~~dFlv~~gD~--it~~~l~~~l~~Hr~~r~~dk~a~~T~~~~~~~~~~~~~~~~~ 157 (217) T cd04197 81 IIMSEDCRSLGDALRDLDAK-GLIRGDFILVSGDV--VSNIDLKEILEEHKERRKKDKNAIMTMVLKEASPPHRTRRTGE 157 (217) T ss_pred EEECCCCCCHHHHHHHHHHH-CCCCCCEEEEECCC--EECCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCC T ss_conf 97568887651699988760-44789999997990--4137889999999863133757448998704688742345776 Q ss_pred CCCEEEE-CCCCCEECCCCCCCCC----------------CCCCCCCCCCCHHEECCHHH Q ss_conf 7440420-2333100100146675----------------42100035560011125014 Q gi|254780942|r 141 GYGRLLI-KNNEIIAIREENDATD----------------EERKIHYCNSGLMAIDGLYI 183 (442) Q Consensus 141 ~yGvV~~-d~~~v~~i~Ek~d~~~----------------~~~~~~lin~GiY~f~~~~L 183 (442) .+.++.. ++++++.+.++++... .+..++|+++|+|+|+++.| T Consensus 158 ~~~v~id~~~~rll~~~~~~~~~~~~~~~l~~~ll~~~~~~~i~~dL~D~hIyIcsp~VL 217 (217) T cd04197 158 EFVIAVDPKTSRLLHYEELPGSKYRSITDLPSELLGSNSEVEIRHDLLDCHIDICSPDVL 217 (217) T ss_pred CCEEEEECCCCEEEEEECCCCCCCCCCCCCCHHHHCCCCCEEEECCCEECEEEEEECCCC T ss_conf 508999589981999842466554443345799970599589984885033899602549 No 45 >cd04198 eIF-2B_gamma_N The N-terminal domain of gamma subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of gamma subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit gamma shares sequence similarity with epsilon subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=100.00 E-value=6.1e-35 Score=217.28 Aligned_cols=174 Identities=21% Similarity=0.261 Sum_probs=124.2 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCH-HHHHHHHCC------CCCEEEE Q ss_conf 08999458877232788---8744125884857999999999779977999957988-999997302------6972899 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGA-EEITRINFP------PTLSVEY 74 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~-e~i~~~~~~------~~~~i~~ 74 (442) ++|||||||+||||+|. +||||+||+|+|||+|+|++|.++|+++|++++++.. +.++.+... ....+.+ T Consensus 1 ~qAVILagG~GtRl~plT~~~PK~Llpi~n~P~i~y~l~~l~~~G~~~i~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 80 (214) T cd04198 1 FQAVILAGGGGSRLYPLTDNIPKALLPVANKPMIWYPLDWLEKAGFEDVIVVVPEEEQAEISTYLRSFPLNLKQKLDEVT 80 (214) T ss_pred CEEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCEEEEE T ss_conf 95999868887665643569986466899997499999999986998899995753599999999970456677628999 Q ss_pred EECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCCC------------- Q ss_conf 985998861446641000012344605898515411153322000000135-861001001012215------------- Q gi|254780942|r 75 YIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADNPK------------- 140 (442) Q Consensus 75 v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~dp~------------- 140 (442) ..+++++||||||+.+.+.++ +||+|+|||. +++.++++++++|++ ++.+|++..+...++ T Consensus 81 ~~~~~~~gT~~aLr~~~~~i~---~dflvl~gD~--i~~i~l~~ll~~Hr~~~a~~T~~l~~~~~~~~~~~~~~~~~~~~ 155 (214) T cd04198 81 IVLDEDMGTADSLRHIRKKIK---KDFLVLSCDL--ITDLPLIELVDLHRSHDASLTVLLYPPPVSSEQKGGKGKSKKAD 155 (214) T ss_pred EECCCCCCHHHHHHHHHHCCC---CCEEEECCCE--EECCCHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCCC T ss_conf 657987871999999985189---9989985998--36688899999999709948999953576432456777445667 Q ss_pred CCCEEE--ECCCCCEECCCCCCC--------------CCCCCCCCCCCCCHHEECCHHH Q ss_conf 744042--023331001001466--------------7542100035560011125014 Q gi|254780942|r 141 GYGRLL--IKNNEIIAIREENDA--------------TDEERKIHYCNSGLMAIDGLYI 183 (442) Q Consensus 141 ~yGvV~--~d~~~v~~i~Ek~d~--------------~~~~~~~~lin~GiY~f~~~~L 183 (442) .+-++. ..+++++.+..+.|. +..+.+++|+++|+|+|+.+.| T Consensus 156 ~~~~~~~d~~~~~ll~~~~~~d~~~~~~i~~~ll~~~p~~~i~tdL~D~hiYI~s~~VL 214 (214) T cd04198 156 ERDVIGLDEKTQRLLFITSEEDLDEDLELRKSLLKRHPRVTITTKLLDAHVYIFKRWVL 214 (214) T ss_pred CCCEEEECCCCCEEEEECCCCCCCCCCCCCHHHHHCCCCEEEECCCEEEEEEEEECCCC T ss_conf 67608985899989996677762214324499983099879971773517989503429 No 46 >PRK05289 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=100.00 E-value=2.9e-35 Score=219.13 Aligned_cols=178 Identities=22% Similarity=0.310 Sum_probs=148.3 Q ss_pred EEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCC------------CEE Q ss_conf 0211527994112331784100122322344200021116313433-212111210021344320------------000 Q gi|254780942|r 256 MIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQ------------ETT 322 (442) Q Consensus 256 ~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~------------~s~ 322 (442) -++|. ..|++++.|++++.|+|+|+|++++.||++|.|+++++|. ++.||++|.|+|++.|.. ... T Consensus 4 ~IHpt-AiI~~~A~IG~~v~Igp~~vIg~~v~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~ 82 (261) T PRK05289 4 KIHPT-AIVEPGAEIGENVEIGPFCVIGPDVVIGDGTEIGSHVVIDGHTTIGKNNRIFPFASIGEEPQDLKYKGEPTRLV 82 (261) T ss_pred CCCCC-CEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCCCCCCCCCCCCCCCCCCCEEE T ss_conf 40898-89899899999899999999999999999999979968918988898834524753377876666069853699 Q ss_pred ECCCCCCCCEEEEECCCCCCC--CCCCCCCCC-CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEE Q ss_conf 025864432023102112478--544874555-66398899899798088344577545629878879886998937839 Q gi|254780942|r 323 IEKNVRIGNFCEVKKATIKEG--SKINHLSYV-GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPIT 399 (442) Q Consensus 323 i~~~~~Ig~~~ei~~s~i~~g--~~i~~~~~i-gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ 399 (442) ||+++.|+.+|.|.+.+...+ ++|++.+++ ..++|++||.||.+++++|..+..+| ++|||+|+||.++.++++++ T Consensus 83 IGd~~~Ire~vtI~rgT~~~~g~T~IG~~~~im~~vhIaHd~~IG~~~iian~~~laG~-v~Igd~v~iGg~~~v~q~v~ 161 (261) T PRK05289 83 IGDNNTIREFVTINRGTVQGGGVTRIGDNNLLMAYVHVAHDCVVGNNVILANNATLAGH-VEVGDYAIIGGLSGVHQFVR 161 (261) T ss_pred ECCCCEECCCCEECCCCCCCCCCEEECCCCEEECCCCCCCCCCCCCCCEECCCEEECCC-EEECCCEEECCCCEECCCCE T ss_conf 89977898541443661058864498888544003643445512787165430566460-79889709976156648839 Q ss_pred ECCCCEECCCCEECCCCCCCCEEEECCCEEEECCCC Q ss_conf 889989977835766436986898135607707742 Q gi|254780942|r 400 IGQGTYVASGSIITQDTPENSLVFARSRQIVKEDGA 435 (442) Q Consensus 400 ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~~~ 435 (442) ||++|+||++|+|++||||+++|.|.|..+.--|.. T Consensus 162 IG~~a~vgagS~V~kDVpp~~~v~G~PAr~~glN~v 197 (261) T PRK05289 162 IGAHAMVGGMSAVVQDVPPYVLAEGNPARLRGLNIV 197 (261) T ss_pred ECCCEEECCCCEECCCCCCCCEEECCCCEEEEEEHH T ss_conf 898719997636542479986784683568861288 No 47 >cd05636 LbH_G1P_TT_C_like Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains. Members in this family are predicted to be glucose-1-phosphate thymidylyltransferases, which are involved in the dTDP-L-rhamnose biosynthetic pathway. Glucose-1-phosphate thymidylyltransferase catalyzes the synthesis of deoxy-thymidine di-phosphate (dTDP)-L-rhamnose, an important component of the cell wall of many microorganisms. The C-terminal LbH domain contains multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=100.00 E-value=3.4e-35 Score=218.73 Aligned_cols=139 Identities=42% Similarity=0.602 Sum_probs=120.5 Q ss_pred CEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCC Q ss_conf 100122322344200021116313433-2121112100213443200000258644320231021124785448745556 Q gi|254780942|r 275 VIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVG 353 (442) Q Consensus 275 ~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~ig 353 (442) .|+|+++|++++.|+++++|+++++|+ +++||++|.||++|+||++++|+++|+||+++++++++|++++++.|.+|+| T Consensus 7 ~I~p~a~I~~~v~Ig~~a~I~~~a~I~~~v~IG~~~~Ig~~~~I~~~~~IG~~~~IG~~~~I~~~vi~~~~~i~~~~~ig 86 (163) T cd05636 7 TVEEGVTIKGPVWIGEGAIVRSGAYIEGPVIIGKGCEIGPNAYIRGYTVLGDGCVVGNSVEVKNSIIMDGTKVPHLNYVG 86 (163) T ss_pred EECCCCEECCCEEECCCCEECCCCEEECCEEECCCCEEEECCCCCCCEEECCEEEECCCEEECCCCCCCCCCCCCCCEEC T ss_conf 99999999898199999999999999797599899899364122698586130120551242124014784327756877 Q ss_pred CCEECCCCEECCCEEECCC--C---------------CCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEEC Q ss_conf 6398899899798088344--5---------------7754562987887988699893783988998997783576 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNY--D---------------GTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIIT 413 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~--~---------------g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~ 413 (442) |++||++|+||++++++|+ + +..|++++|||+|+||+|++|+||++||++|+|+|||+|. T Consensus 87 ~siIG~~v~IGagtvi~n~~~~~~~~~v~~~~~~~~~~~~~~G~vIGd~~~IG~ns~I~~G~~IG~~a~IgaGsvVr 163 (163) T cd05636 87 DSVLGENVNLGAGTITANLRFDDKPVKVRLKGERVDTGRRKLGAIIGDGVKTGINVSLNPGVKIGPGSWVYPGCVVR 163 (163) T ss_pred CCEECCCCEECCCEEEECCCCCCCCEEEEECCEEECCCCEECCCEECCCCEECCCCEECCCEEECCCCEECCCCEEC T ss_conf 72992996988983781776788634784077062367704077977997999898986995999998999997969 No 48 >PRK00892 lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional Probab=100.00 E-value=2.9e-34 Score=213.41 Aligned_cols=171 Identities=24% Similarity=0.308 Sum_probs=123.9 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCC-CCCCCCCCCCCCCCCCCCC-------------------CE Q ss_conf 799411233178410012232234420002111631343-3212111210021344320-------------------00 Q gi|254780942|r 262 VFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYL-EGVHIGKKTIIGPFARIRQ-------------------ET 321 (442) Q Consensus 262 ~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i-~~~~Ig~~~~Ig~~~~i~~-------------------~s 321 (442) ++|++++.||++++|+++|+|+++|.||++|.|+++++| ++|.||++|.|++++.|.. +. T Consensus 123 ~vIg~~v~IG~~~~I~~~~~Ig~~v~IG~~~~I~~nv~I~~~~~IG~~~~I~~g~vIG~dGFGf~~~~~~~~kip~~G~V 202 (343) T PRK00892 123 AVIGAGVEIGDGVIIGAGAVIGAGVKIGDDCRLHANVTIYHRVQIGNRVLIHSGAVIGQDGFGFAPDRGGWVKIPQLGRV 202 (343) T ss_pred CEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCCCEECCCCEECCCCCCCCCCCCCEEECCCCCEE T ss_conf 89999989999999929819958818889958811856625558888756468862236675413679940464643249 Q ss_pred EECCCCCCCCEEEEE-----CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEEC Q ss_conf 002586443202310-----211247854487455566398899899798088344577545629878879886998937 Q gi|254780942|r 322 TIEKNVRIGNFCEVK-----KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIA 396 (442) Q Consensus 322 ~i~~~~~Ig~~~ei~-----~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~ 396 (442) +|+++|.||+++.|. +++|++|+++.++ ++||+||.||.+++++|+.|..+| +.|||+|.+|.++.+.+ T Consensus 203 ~I~d~veIgan~tIdrg~~~~T~Ig~~~kidn~-----vhIaHn~~IG~~~iia~~~giaGs-~~igd~v~igG~~gi~~ 276 (343) T PRK00892 203 IIGDDVEIGANTTIDRGALDDTVIGEGVKIDNL-----VQIAHNVRIGRHCAIAAQVGIAGS-TKIGRYCMIGGQVGIAG 276 (343) T ss_pred EECCCCEECCCCEECCCCCCCCEECCCCEEEEE-----EEECCCCEECCCCEEEECCEEEEE-EEECCCEEECCCCCCCC T ss_conf 988986888873272366779788989659211-----176167478788388521147005-79999149944132348 Q ss_pred CEEECCCCEECCCCEECCCCCC-CCEEEECCCEEEECCCCCCC Q ss_conf 8398899899778357664369-86898135607707742210 Q gi|254780942|r 397 PITIGQGTYVASGSIITQDTPE-NSLVFARSRQIVKEDGALSM 438 (442) Q Consensus 397 gv~ig~~~~i~ag~~v~~dv~~-~~~v~~~~~~~~~~~~~~~~ 438 (442) .++||++++|++.|.|++|+|+ ++.+.|-|-+..++|+.... T Consensus 277 h~~Ig~~~~i~~~sgv~~di~~~g~~~~G~Pa~~~~~~~r~~a 319 (343) T PRK00892 277 HLEIGDGVQITAMSGVMKSITEPGEYSSGIPAQPNKEWRKTAA 319 (343) T ss_pred CCEECCCCEEEECCEECCCCCCCCEEEEEECCCHHHHHHHHHH T ss_conf 8698999799006514364699980897534600899999998 No 49 >TIGR01852 lipid_A_lpxA acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase; InterPro: IPR010137 This entry describes LpxA, an enzyme for the biosynthesis of lipid A, a component of lipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat.; GO: 0008780 acyl-[acyl-carrier-protein]-UDP-N-acetylglucosamine O-acyltransferase activity, 0008610 lipid biosynthetic process. Probab=100.00 E-value=4.5e-34 Score=212.33 Aligned_cols=171 Identities=25% Similarity=0.335 Sum_probs=143.9 Q ss_pred ECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCC-CCCCCCCCCCCCCCCC------------CCEEEC Q ss_conf 115279941123317841001223223442000211163134332-1211121002134432------------000002 Q gi|254780942|r 258 APETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEG-VHIGKKTIIGPFARIR------------QETTIE 324 (442) Q Consensus 258 ~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~-~~Ig~~~~Ig~~~~i~------------~~s~i~ 324 (442) +| +++|.+.+.|++++.|+|.|+|+++++|+++|+|.++++|++ +.||+||+|-|+|.|. ....|| T Consensus 2 HP-TA~ie~gA~iG~~V~iGPf~~v~~~V~ig~g~~l~shV~i~G~T~iG~~~~i~p~A~iG~~PQdlky~GE~t~l~IG 80 (257) T TIGR01852 2 HP-TAIIEPGAEIGEDVEIGPFCIVGPGVKIGDGVELKSHVVIEGHTTIGEGTKIFPGAVIGGEPQDLKYKGERTELIIG 80 (257) T ss_pred CC-CEEECCCCEECCCCEEEEEEEECCCCEECCCCEECCCEEEECCEEECCCCEEECCEEECCCCCCCEECCCCCEEEEC T ss_conf 96-21878866828985780188878975886885875736896770878998880760767988565014870179988 Q ss_pred CCCCCCCEEEEECCCCCC-C-CCCCCCCCC-CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEEC Q ss_conf 586443202310211247-8-544874555-6639889989979808834457754562987887988699893783988 Q gi|254780942|r 325 KNVRIGNFCEVKKATIKE-G-SKINHLSYV-GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIG 401 (442) Q Consensus 325 ~~~~Ig~~~ei~~s~i~~-g-~~i~~~~~i-gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig 401 (442) ++|.|..||.|.+.+-.. + |.|++.+++ -.|+|++||.||..+|++|..-..+| +.|||+|.||..+.+++.+||| T Consensus 81 ~~n~IRE~VTi~~GT~~g~g~T~iG~~nllMAysHvAHDC~vGn~vv~aN~a~LAGH-V~vgD~a~iGG~~avHQFvRIG 159 (257) T TIGR01852 81 DNNTIREFVTINRGTKSGGGVTSIGNNNLLMAYSHVAHDCVVGNHVVLANNATLAGH-VEVGDYAIIGGLVAVHQFVRIG 159 (257) T ss_pred CCCEEEEEEEECCCCCCCCCEEEECCCCEECCCCEEEECCEECCEEEEECCCEECEE-EEECCEEEECCCCCEEEEEEEH T ss_conf 965697557850662588888998887657205356124368881698136312227-9988899977897211013300 Q ss_pred CCCEECCCCEECCCCCCCCEEEEC-CCEEE Q ss_conf 998997783576643698689813-56077 Q gi|254780942|r 402 QGTYVASGSIITQDTPENSLVFAR-SRQIV 430 (442) Q Consensus 402 ~~~~i~ag~~v~~dv~~~~~v~~~-~~~~~ 430 (442) .+||||+.|.|.+|||||.|+-|. +-.+. T Consensus 160 ~~aMigG~s~v~~DvpPY~~~~G~~~a~l~ 189 (257) T TIGR01852 160 RYAMIGGLSAVSKDVPPYGLVEGNSRAVLR 189 (257) T ss_pred HHHHHHHHCCCCCCCCCCEEECCCCCCEEE T ss_conf 023221200246787761786378753378 No 50 >TIGR02623 G1P_cyt_trans glucose-1-phosphate cytidylyltransferase; InterPro: IPR013446 Glucose-1-phosphate cytidylyltransferases, also known as CDP-glucose pyrophosphorylase, are the product of the rfbF gene and produce CDP-D-glucose.. Probab=100.00 E-value=4.8e-33 Score=206.52 Aligned_cols=226 Identities=19% Similarity=0.269 Sum_probs=174.5 Q ss_pred CEEEEECCCCCCCCCC---CCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCC---E------- Q ss_conf 0899945887723278---88744125884857999999999779977999957988999997302697---2------- Q gi|254780942|r 5 RLAIVLAAGRGHRMKS---SSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTL---S------- 71 (442) Q Consensus 5 ~~AiILAaG~GtRl~p---~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~---~------- 71 (442) |-|||||||.|||+-. .+||||+.|+|||||+|+++.....|+++++|.+||+...|++|+....+ + T Consensus 1 MKaViLAGGlGTRisEEt~lrPKPM~EiGgkPIlWHI~k~Y~~~Gi~~FiiC~GYkGy~IKeyF~NY~l~~SDvT~~l~~ 80 (256) T TIGR02623 1 MKAVILAGGLGTRISEETHLRPKPMVEIGGKPILWHIMKIYSAHGIKEFIICLGYKGYVIKEYFANYFLHMSDVTIDLED 80 (256) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCEEEEECCCCEEECCEEEEEECC T ss_conf 93799717864432243535897607767963778899999752833027885343322010010632311316787515 Q ss_pred --------------EEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECC Q ss_conf --------------899985998861446641000012344605898515411153322000000135861001001012 Q gi|254780942|r 72 --------------VEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNAD 137 (442) Q Consensus 72 --------------i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~ 137 (442) +..|.--+..-|||.|+...+++.+ +++|++.|||- +.+.++.+++.+|++....+++++.-+ T Consensus 81 n~~~~H~~~~EPW~VTLvDTG~~t~TGGRl~Rv~~~~~~-d~~Fc~TYGDG--v~~~~I~~~~~~H~~~g~~AT~tAv~P 157 (256) T TIGR02623 81 NTIEVHEKRAEPWRVTLVDTGESTQTGGRLKRVREYLED-DEAFCLTYGDG--VADIDIKALIAFHRKHGKKATVTAVQP 157 (256) T ss_pred CEEEECCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHCC-CCCEEEEECCC--CCCCCHHHHHHHHHHCCCCEEEEEECC T ss_conf 704531213897168888568776678327889998437-87689871585--033258999999886298067886078 Q ss_pred CCCCCCEEEECCCCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEE Q ss_conf 21574404202333100100146675421000355600111250146777532888740011121148999973077179 Q gi|254780942|r 138 NPKGYGRLLIKNNEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIAS 217 (442) Q Consensus 138 dp~~yGvV~~d~~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~ 217 (442) |-|||.+..++..|..|.|||.... .+||.|.|++++.. |+.+..|...-| .+.+..|.++|. ..+ T Consensus 158 -PGRfGaL~~~~~~V~~F~EKP~gd~-----g~iNGGFFVL~P~V----~d~I~~D~~~wE---~~~L~~L~~~g~-L~a 223 (256) T TIGR02623 158 -PGRFGALELEGESVTSFQEKPLGDG-----GLINGGFFVLNPSV----LDLIDGDATVWE---SEPLETLAQRGE-LSA 223 (256) T ss_pred -CCCEEEEEECCCCEEEECCCCCCCC-----EEEECCEEEECCCE----EEEECCCCEEEC---CHHHHHHHHCCC-CEE T ss_conf -8830236772551111014748898-----16806568748320----202248850201---037899974488-137 Q ss_pred EEECHHHEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9917214301112100000012222211221 Q gi|254780942|r 218 IDVKEQEVCGCNNRYELSLIENIWQSRYRRQ 248 (442) Q Consensus 218 ~~~~~~~~~gv~~~~~L~~~~~~~~~~~~~~ 248 (442) |...+.| ...++..|-..++.+|+...++| T Consensus 224 Y~H~GFW-~PMDTLRDk~~L~~LW~~~~APW 253 (256) T TIGR02623 224 YEHSGFW-QPMDTLRDKNKLEELWESNRAPW 253 (256) T ss_pred EEECCCC-CCCHHHHHHHHHHHHHHCCCCCC T ss_conf 8517756-85222566799998875378884 No 51 >cd02507 eIF-2B_gamma_N_like The N-terminal of eIF-2B_gamma_like is predicted to have glycosyltransferase activity. N-terminal domain of eEIF-2B epsilon and gamma, subunits of eukaryotic translation initiators, is a subfamily of glycosyltranferase 2 and is predicted to have glycosyltranferase activity. eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=100.00 E-value=1.7e-32 Score=203.42 Aligned_cols=174 Identities=20% Similarity=0.200 Sum_probs=121.9 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCC---------CCEE Q ss_conf 08999458877232788---87441258848579999999997799779999579889999973026---------9728 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPP---------TLSV 72 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~---------~~~i 72 (442) ++|||||||+||||+|. .||||+||+|+|||+|+|++|.++|+++|+++++++.+++++++... ...+ T Consensus 1 ~qAVIlagg~gtrl~Plt~~~PK~lLpi~n~P~i~y~l~~l~~~gi~~i~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (216) T cd02507 1 FQAVVLADGFGSRFLPLTSDIPKALLPVANVPLIDYTLEWLEKAGVEEVFVVCCEHSQAIIEHLLKSKWSSLSSKMIVDV 80 (216) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCEEE T ss_conf 93999827897736600069986556799999799999999987998899995888899999998624345676764478 Q ss_pred EEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC--CCCCCEEEE-CC--CCC------- Q ss_conf 999859988614466410000123446058985154111533220000001358--610010010-12--215------- Q gi|254780942|r 73 EYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQG--YSIAVVGFN-AD--NPK------- 140 (442) Q Consensus 73 ~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~--~~~~i~~~~-~~--dp~------- 140 (442) .+..+.+++|||++++...+.+. +||+|++||. +++.++..+++.|++. ...++++.. .+ +++ T Consensus 81 ~~~~~~~~~Gta~~l~~~~~~i~---~dflvl~gD~--i~~~~l~~~l~~~~~~~~~~~~tl~~~~~~~~~~~~~~~~~~ 155 (216) T cd02507 81 ITSDLCESAGDALRLRDIRGLIR---SDFLLLSCDL--VSNIPLSELLEERRKKDKNAIATLTVLLASPPVSTEQSKKTE 155 (216) T ss_pred EECCCCCCCCCHHHHHHHHHCCC---CCEEEECCCE--EECCCHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCC T ss_conf 96556887655799998761579---9989985997--876898999999987485774499999823777654333467 Q ss_pred --CCCEEEECCCCCEE--CCCCCC--------------CCCCCCCCCCCCCCHHEECCHHH Q ss_conf --74404202333100--100146--------------67542100035560011125014 Q gi|254780942|r 141 --GYGRLLIKNNEIIA--IREEND--------------ATDEERKIHYCNSGLMAIDGLYI 183 (442) Q Consensus 141 --~yGvV~~d~~~v~~--i~Ek~d--------------~~~~~~~~~lin~GiY~f~~~~L 183 (442) .+..++..++++.- ..++.+ .+..+..++++++|+|+|+++.| T Consensus 156 ~~~~~~~d~~~~~~l~i~~~~~~~~~~~~~i~~~~l~~~~~~~i~snL~D~giYI~sp~VL 216 (216) T cd02507 156 EEDVIAVDSKTQRLLLLHYEEDLDEDLELIIRKSLLSKHPNVTIRTDLLDCHIYICSPDVL 216 (216) T ss_pred CCCEEEEECCCCEEEEEEECCCCCCCCCCCCCHHHHHCCCCEEEECCCEEEEEEEECCCCC T ss_conf 6637999379987999961003454344213278861699749844884857998241009 No 52 >TIGR01853 lipid_A_lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; InterPro: IPR007691 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase (2.3.1 from EC) catalyses an early step in lipid A biosynthesis :UDP-3-O-(3-hydroxytetradecanoyl)glucosamine + (R)-3-hydroxytetradecanoyl- [acyl carrier protein] = UDP-2,3-bis(3-hydroxytetradecanoyl)glucosamine + [acyl carrier protein] Members of this family also contain a hexapeptide repeat (IPR001451 from INTERPRO). This entry represents the non-repeating region of LPXD proteins.; GO: 0016747 transferase activity transferring groups other than amino-acyl groups, 0009245 lipid A biosynthetic process. Probab=100.00 E-value=2.1e-33 Score=208.50 Aligned_cols=171 Identities=26% Similarity=0.313 Sum_probs=140.6 Q ss_pred CCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCC-------------------- Q ss_conf 527994112331784100122322344200021116313433-21211121002134432-------------------- Q gi|254780942|r 260 ETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIR-------------------- 318 (442) Q Consensus 260 ~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~-------------------- 318 (442) ..++|+.++.||++|.|+|+|+||++|.||++|+|.|+++|- ++.||+||.|.|.|.|. T Consensus 124 pnvvIG~~v~IG~~~~I~pg~~iG~~v~IG~~~~i~~~v~i~~~~~lG~nV~I~~GaVIG~DGFGya~~~~g~~~KI~q~ 203 (336) T TIGR01853 124 PNVVIGAGVEIGENVIIGPGVVIGDDVVIGDGSVIYPNVVIYERVQLGKNVIIHAGAVIGSDGFGYAHTANGGHVKIPQI 203 (336) T ss_pred CEEEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEEEECCEECCEEEECCCCEECCCCCCEEECCCCCEEEEEEE T ss_conf 54178888687786363577687688687996778067089724156471587688788678732035788971544130 Q ss_pred CCEEECCCCCCCCEEEEE-----CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCE Q ss_conf 000002586443202310-----211247854487455566398899899798088344577545629878879886998 Q gi|254780942|r 319 QETTIEKNVRIGNFCEVK-----KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSS 393 (442) Q Consensus 319 ~~s~i~~~~~Ig~~~ei~-----~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~ 393 (442) ...+|+|+|.||+++.|. +++|++|+||.+| ++|||||.||.+++++-+.|..+. |.||+||.+|..+- T Consensus 204 G~ViI~D~VEIGANTtIDRg~~~dT~i~~gtKIDNL-----VQIAHNc~IG~n~~ivaqvGiaGS-t~iG~~V~~GGqvG 277 (336) T TIGR01853 204 GRVIIEDDVEIGANTTIDRGAFDDTVIGEGTKIDNL-----VQIAHNCRIGENCIIVAQVGIAGS-TKIGRNVILGGQVG 277 (336) T ss_pred CEEEEECEEEECCCCHHHCCCCCCCEEECCCEECCE-----EEEEEEEEECCCCEEEEEEECCEE-EEECCEEEECCEEE T ss_conf 248960233874687021102147187238466231-----574133462664278964201134-67558069626026 Q ss_pred EECCEEECCCCEECCCCEECCCCCCCCEEEE--CCCEEEECCCCC Q ss_conf 9378398899899778357664369868981--356077077422 Q gi|254780942|r 394 LIAPITIGQGTYVASGSIITQDTPENSLVFA--RSRQIVKEDGAL 436 (442) Q Consensus 394 i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~--~~~~~~~~~~~~ 436 (442) +.-.++|||+++|+|.|.|++|||++..--| -|-+..|+|... T Consensus 278 v~GHl~IgD~v~i~A~sgV~~~~~~~~~ygggi~Pa~~~~~w~k~ 322 (336) T TIGR01853 278 VAGHLEIGDNVTIGAKSGVTKSVPPNGRYGGGIIPAMPQKEWLKI 322 (336) T ss_pred EECEEEECCCCEEEEECCCCCCCCCCCCCCEEECCCCHHHHHHHH T ss_conf 511089438348845122067779886010011141202478999 No 53 >TIGR03308 phn_thr-fam phosphonate metabolim protein, transferase hexapeptide repeat family. This family of proteins contains copies of the Bacterial transferase hexapeptide repeat family (pfam00132) and is only found in operons encoding the phosphonate C-P lyase system (GenProp0232). Many C-P lyase operons, however, lack a homolog of this protein. Probab=99.96 E-value=3.2e-29 Score=184.75 Aligned_cols=156 Identities=19% Similarity=0.346 Sum_probs=118.9 Q ss_pred EECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECC-CCCCCCCCC Q ss_conf 3317841001223223442000211163134332121112100213443200000258644320231021-124785448 Q gi|254780942|r 269 IIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKA-TIKEGSKIN 347 (442) Q Consensus 269 ~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s-~i~~g~~i~ 347 (442) .+++...|+|.|.| .+|.+|++|.|+++|.|.+|.||+.+.|+++|.|. ++.||+.|.|+++|.|... --+..+... T Consensus 4 ~l~~~p~Ihp~a~i-~~~~lG~~v~Ig~~~~i~ns~IGDySyI~~~~~I~-n~~IGkfcsIa~~v~I~~~nHp~~~~s~~ 81 (204) T TIGR03308 4 LLSPEPTLHPTAEL-TESKLGRYTEIGERTRLREVALGDYSYVMRDCDII-YTTIGKFCSIAAMVRINATNHPMERPTLH 81 (204) T ss_pred CCCCCCEECCCCEE-CCCCCCCCEEECCCCEEECCEECCCCEECCCCEEE-CCEECCCCEECCCCEECCCCCCCCCCEEC T ss_conf 34799768999868-32783783399999588337999985888997598-88699984888987989877876672677 Q ss_pred CCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCC Q ss_conf 74555663988998997980883445775456298788798869989378398899899778357664369868981356 Q gi|254780942|r 348 HLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSR 427 (442) Q Consensus 348 ~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~ 427 (442) |.+|.++...+ +.+...++++++++++++||++||||.|++|+||++||++|+|||||+|++||||+++|.|.|. T Consensus 82 ~f~y~~~~~~~-----~~~~~~~~~~~~~~~~~~Ig~dvwiG~~~~i~~gv~IG~gavigagsvVtkdv~~~~iv~G~PA 156 (204) T TIGR03308 82 HFTYRAAMYFD-----DASDDADFFAWRRAKRVTIGHDVWIGHGAVILPGVTIGNGAVIAAGAVVTKDVAPYTIVAGVPA 156 (204) T ss_pred CEEEECCCCCC-----CCCCCHHHHCCCCCCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCCCCEEEEECCC T ss_conf 77553753346-----6666122321334787098997688799899199799999899779989951799849984188 Q ss_pred EEEE Q ss_conf 0770 Q gi|254780942|r 428 QIVK 431 (442) Q Consensus 428 ~~~~ 431 (442) +++| T Consensus 157 k~ir 160 (204) T TIGR03308 157 KLIR 160 (204) T ss_pred EEEE T ss_conf 5663 No 54 >COG1043 LpxA Acyl-[acyl carrier protein] Probab=99.95 E-value=2.5e-28 Score=179.66 Aligned_cols=174 Identities=23% Similarity=0.357 Sum_probs=140.6 Q ss_pred CEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCC-CCCCCCCCCCCCCCCC------------CCE Q ss_conf 102115279941123317841001223223442000211163134332-1211121002134432------------000 Q gi|254780942|r 255 TMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEG-VHIGKKTIIGPFARIR------------QET 321 (442) Q Consensus 255 ~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~-~~Ig~~~~Ig~~~~i~------------~~s 321 (442) +.++|. ..|.+.+.|++++.|+|.|+|+++++|+++++|++++++++ ..||+++.|.|++.|. ... T Consensus 4 ~~IHPT-AiIe~gA~ig~~V~IGpf~iIg~~V~ig~~t~l~shvvv~G~T~IG~~n~I~~~A~iG~~pQdlKykge~T~l 82 (260) T COG1043 4 AKIHPT-AIIEPGAEIGEDVKIGPFCIIGPNVEIGDGTVLKSHVVVEGHTTIGRNNRIFPFASIGEDPQDLKYKGEPTRL 82 (260) T ss_pred CCCCCC-EEECCCCCCCCCCEECCEEEECCCCEECCCCEECCCEEEECCCEECCCCEEECCCCCCCCCCCCCCCCCCEEE T ss_conf 645762-1527878828998888337988971988995883437996770777898791110428987545027984279 Q ss_pred EECCCCCCCCEEEEECCCCCC-C-CCCCCCCCC-CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCE Q ss_conf 002586443202310211247-8-544874555-6639889989979808834457754562987887988699893783 Q gi|254780942|r 322 TIEKNVRIGNFCEVKKATIKE-G-SKINHLSYV-GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPI 398 (442) Q Consensus 322 ~i~~~~~Ig~~~ei~~s~i~~-g-~~i~~~~~i-gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv 398 (442) +||+++.|..+|.+...+... + +.|++.+.+ -.++|.++|.||..+|++|..-..+| +.|||++.+|..+.+++.+ T Consensus 83 ~IG~~n~IRE~vTi~~GT~~g~g~T~IGdnnl~May~HVAHDC~iGn~~ilaNnatLAGH-V~igD~aiiGG~saVHQF~ 161 (260) T COG1043 83 IIGDNNTIREFVTIHRGTVQGGGVTRIGDNNLIMAYAHVAHDCVIGNNCILANNATLAGH-VEVGDYAIIGGLSAVHQFV 161 (260) T ss_pred EECCCCEEEEEEEEECCCCCCCEEEEECCCCEEEEEEEEECCCEECCCEEEECCCEEECE-EEECCEEEECCCCEEEEEE T ss_conf 988997586688986154478615997788789873103134445671799668668540-7887789974752377788 Q ss_pred EECCCCEECCCCEECCCCCCCCEEEECCCEEE Q ss_conf 98899899778357664369868981356077 Q gi|254780942|r 399 TIGQGTYVASGSIITQDTPENSLVFARSRQIV 430 (442) Q Consensus 399 ~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~ 430 (442) +||++||+|.+|.|.+||||+.++.|.+-... T Consensus 162 rIG~~amiGg~S~v~~DVpPy~~~~Gn~a~l~ 193 (260) T COG1043 162 RIGAHAMIGGLSAVSQDVPPYVIASGNHARLR 193 (260) T ss_pred EECCHHEECCCHHHCCCCCCEEEECCCCCCCC T ss_conf 97401123441010367798388528704421 No 55 >COG1044 LpxD UDP-3-O-[3-hydroxymyristoyl] Probab=99.95 E-value=7e-28 Score=177.10 Aligned_cols=169 Identities=25% Similarity=0.308 Sum_probs=126.7 Q ss_pred CEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCC-------------------C Q ss_conf 27994112331784100122322344200021116313433-212111210021344320-------------------0 Q gi|254780942|r 261 TVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQ-------------------E 320 (442) Q Consensus 261 ~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~-------------------~ 320 (442) ...|.+++.||+++.|+++++|++++.||++|.|.+++.|. ++.||++|.|++.+.|+. . T Consensus 123 ~vvI~~~v~IG~~~~I~~~avIg~~~~IG~~~~i~~~v~I~~~~~IG~~v~I~~GavIG~dgFg~a~~~~g~~Ki~q~g~ 202 (338) T COG1044 123 NVVIGAGVVIGENVVIGAGAVIGENVKIGDGTVIHPNVTIYHNVVIGNNVIIHSGAVIGADGFGYAGTAIGWVKIPQIGR 202 (338) T ss_pred CEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCCCEECCCCCCCCCCCCCCEECCEECE T ss_conf 74988897878985898897888896888885884787895275787822888998875685212246688567126065 Q ss_pred EEECCCCCCCCEEEEE-----CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEE Q ss_conf 0002586443202310-----21124785448745556639889989979808834457754562987887988699893 Q gi|254780942|r 321 TTIEKNVRIGNFCEVK-----KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLI 395 (442) Q Consensus 321 s~i~~~~~Ig~~~ei~-----~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~ 395 (442) .+|+++|.||+.+.|. +++|+++++|.++ ++||+||.||.+++++...|..++ +.||++|.||..+.+. T Consensus 203 V~Igd~VeIGanT~Idrga~~dTvIg~~~kIdN~-----vqIaHnv~IG~~~~I~~~vgIaGs-~~IG~~v~igg~vgI~ 276 (338) T COG1044 203 VIIGDDVEIGANTTIDRGALDDTVIGEGVKIDNL-----VQIGHNVRIGEHCIIAGQVGIAGS-VKIGKYVIIGGQVGIA 276 (338) T ss_pred EEECCCEEECCCCEECCCCCCCCEECCCCEECCE-----EEECCCCEECCCCEEECCCEEECC-CEECCEEEECCCEEEC T ss_conf 9979824873552425655567265378487643-----078155278888377026316135-3577807987613464 Q ss_pred CCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECCCC Q ss_conf 7839889989977835766436986898135607707742 Q gi|254780942|r 396 APITIGQGTYVASGSIITQDTPENSLVFARSRQIVKEDGA 435 (442) Q Consensus 396 ~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~~~ 435 (442) -..+|||++.|++.+.|.++||++...-+-|.+..|+|-. T Consensus 277 gh~~IgD~~~I~~~~~v~~~i~~~~~~gg~P~~p~k~w~k 316 (338) T COG1044 277 GHLEIGDGVTIGARSGVMASITEPGYSGGIPAQPIKEWLK 316 (338) T ss_pred CCEEECCCCEEECCCCCCCCCCCCCEECCCCCCHHHHHHH T ss_conf 8418757878804524312357886633677753999999 No 56 >TIGR01207 rmlA glucose-1-phosphate thymidylyltransferase; InterPro: IPR005907 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily (here designated as short form) of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme. The enzyme is active as a homotetramer.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005908 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. ; GO: 0008879 glucose-1-phosphate thymidylyltransferase activity, 0045226 extracellular polysaccharide biosynthetic process. Probab=99.94 E-value=3.6e-27 Score=173.08 Aligned_cols=228 Identities=21% Similarity=0.315 Sum_probs=178.6 Q ss_pred EEEECCCCCCCCCCCC---CCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECC-CHHHHHHHHCCC---CCEEEEEECCC Q ss_conf 9994588772327888---7441258848579999999997799779999579-889999973026---97289998599 Q gi|254780942|r 7 AIVLAAGRGHRMKSSS---SKVLQKIAGKPMISHVMETIAAAGIENVALVLGY-GAEEITRINFPP---TLSVEYYIQDC 79 (442) Q Consensus 7 AiILAaG~GtRl~p~~---pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~-~~e~i~~~~~~~---~~~i~~v~q~~ 79 (442) .||||||.||||.|.| +|.|+||..||||.|+|..|.-+|+.+|.+++.. +....++.+.+. ++.+.|..|++ T Consensus 2 Gi~laGG~Gtrl~P~t~~~skqllP~ydkPmiyyPl~~lmlaG~r~i~~i~~~~~~~~f~~llGdG~~~G~~~~y~~q~~ 81 (286) T TIGR01207 2 GIILAGGSGTRLYPITLAVSKQLLPVYDKPMIYYPLSVLMLAGIRDILIISTPEDTPRFKRLLGDGSQFGVELSYAVQPS 81 (286) T ss_pred CEEEECCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHEEEEECCCHHHHHHHHHCCCCHHCEEEEEECCCC T ss_conf 26872688740124345443210310157501231899987302220135211000578987135201045788711578 Q ss_pred CCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCC Q ss_conf 88614466410000123446058985154111533220000001-3586100100101221574404202-333100100 Q gi|254780942|r 80 QQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIRE 157 (442) Q Consensus 80 ~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~E 157 (442) |.|.|.|...+.++++. +...++-||+ ++....+...+.+. .+....+++++.+.||.+||++..+ +++.+.+-| T Consensus 82 P~Gla~af~ig~~f~~~--~~~~l~lGdn-~fyG~~~~~~l~~~~~~~~Ga~vf~y~v~~P~~yGv~~f~~~~~~~~~ee 158 (286) T TIGR01207 82 PDGLAQAFIIGEEFIGD--DKSALVLGDN-IFYGHDLSELLKRAAAREEGATVFAYQVKDPERYGVVEFDEEGRAISLEE 158 (286) T ss_pred CCHHHHHHHHHHHHHCC--CCEEEEECCC-CCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCEEEECCCCCEEEEEC T ss_conf 63258888776887267--7617996050-01021489999998753068579998750764123066638731786413 Q ss_pred CCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCC Q ss_conf 14667542100035560011125014677753288874001112114899997307717999172143011121000000 Q gi|254780942|r 158 ENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLI 237 (442) Q Consensus 158 k~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~ 237 (442) || .+++++++-+|+|.++... .++-..+++. .+||+++||+-...+++|..-......+.-|++..+...|.++ T Consensus 159 kP----~~P~s~y~v~Gly~yd~~~-~~~a~~~~~s-~rGelei~~~n~~yl~~G~l~v~~~~rG~aWldtGt~~sl~~a 232 (286) T TIGR01207 159 KP----AKPKSNYAVTGLYFYDNRV-VEIARELKPS-ARGELEITDLNRVYLEEGKLSVELLGRGYAWLDTGTHDSLLEA 232 (286) T ss_pred CC----CCCCCCEEEEEHHHHHHHH-HHHHHHCCCC-CCCCCHHHHHHHHHHHCCCEEEHHHHCCCCEECCCHHHHHHHH T ss_conf 78----8876341340111010689-9999731324-5664028889999986365020145265110001116889999 Q ss_pred CCCCCC Q ss_conf 122222 Q gi|254780942|r 238 ENIWQS 243 (442) Q Consensus 238 ~~~~~~ 243 (442) ..+.+. T Consensus 233 ~~f~~t 238 (286) T TIGR01207 233 SSFIET 238 (286) T ss_pred HHHHHH T ss_conf 999999 No 57 >COG1213 Predicted sugar nucleotidyltransferases [Cell envelope biogenesis, outer membrane] Probab=99.94 E-value=2.6e-26 Score=168.21 Aligned_cols=219 Identities=20% Similarity=0.303 Sum_probs=149.5 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEE-CCCHHHHHHHHCCCCCEEEEEECCCC- Q ss_conf 7408999458877232788874412588485799999999977997799995-79889999973026972899985998- Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVL-GYGAEEITRINFPPTLSVEYYIQDCQ- 80 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~-~~~~e~i~~~~~~~~~~i~~v~q~~~- 80 (442) ..+.|||||||+||||.|..||+|+.++|+|||+|+|++|+++|++++++|+ +|..+.+++++.....+.+++..+.+ T Consensus 2 ~~~kavILAAG~GsRlg~~~PK~Lvev~gr~ii~~~i~~L~~~gi~e~vvV~~g~~~~lve~~l~~~~~~~~iv~N~~y~ 81 (239) T COG1213 2 HPMKAVILAAGFGSRLGPDIPKALVEVGGREIIYRTIENLAKAGITEFVVVTNGYRADLVEEFLKKYPFNAKIVINSDYE 81 (239) T ss_pred CCEEEEEEECCCCCCCCCCCCCHHHHCCCEEEHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHCCCCCEEEEECCCCC T ss_conf 74047877034455347999714431588673899999998769863999962302789999985189616999689855 Q ss_pred -CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEE-E--ECCCCCCCCEEEECCCCCEECC Q ss_conf -86144664100001234460589851541115332200000013586100100-1--0122157440420233310010 Q gi|254780942|r 81 -QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVG-F--NADNPKGYGRLLIKNNEIIAIR 156 (442) Q Consensus 81 -~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~-~--~~~dp~~yGvV~~d~~~v~~i~ 156 (442) -.|+.+++.++++++ +.|++++||+ ++.+..++.+++.-..+. ++.-. . ..++.+ .+..+++++.++- T Consensus 82 ktN~~~Sl~~akd~~~---~~fii~~sD~-vye~~~~e~l~~a~~~~l-i~d~~~~~~~~~ea~---kv~~e~G~i~~ig 153 (239) T COG1213 82 KTNTGYSLLLAKDYMD---GRFILVMSDH-VYEPSILERLLEAPGEGL-IVDRRPRYVGVEEAT---KVKDEGGRIVEIG 153 (239) T ss_pred CCCCEEEEEEEHHHHC---CCEEEEECCE-EECHHHHHHHHHCCCCCE-EEECCCCCCCCCCEE---EEEECCCEEEHHC T ss_conf 5776467745434633---8579995787-506899999984757767-895233335567506---9984497876210 Q ss_pred CCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEC--HHHEECCCCCCCC Q ss_conf 01466754210003556001112501467775328887400111211489999730771799917--2143011121000 Q gi|254780942|r 157 EENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVK--EQEVCGCNNRYEL 234 (442) Q Consensus 157 Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~--~~~~~gv~~~~~L 234 (442) |+... .+-..+|++.|+.. ++..+.++..+. .++ +........+.++.-+... +..|+.+++|+|| T Consensus 154 --K~l~e----~~~e~iGi~~l~~~-i~~~~~~~~~e~--~~~---~~~~~~~~~~~~~~~~di~~~g~~w~EVDtpeDl 221 (239) T COG1213 154 --KDLTE----YDGEDIGIFILSDS-IFEDTYELLVER--SEY---DYREVEKEAGLPFTEVDIHVDGLFWMEVDTPEDL 221 (239) T ss_pred --CCCCC----CCCEEEEEEEECHH-HHHHHHHHHHHH--HHH---HHHHHHHHHCCCEEEEECCCCCCEEEECCCHHHH T ss_conf --78551----36216535883457-878789887655--667---7999999858761785102368524863888999 Q ss_pred CCCCCCC Q ss_conf 0001222 Q gi|254780942|r 235 SLIENIW 241 (442) Q Consensus 235 ~~~~~~~ 241 (442) ..+.+.+ T Consensus 222 ~~ar~~~ 228 (239) T COG1213 222 ERARKYL 228 (239) T ss_pred HHHHHHH T ss_conf 9999999 No 58 >cd02509 GDP-M1P_Guanylyltransferase GDP-M1P_Guanylyltransferase catalyzes the formation of GDP-Mannose. GDP-mannose-1-phosphate guanylyltransferase, also called GDP-mannose pyrophosphorylase (GDP-MP), catalyzes the formation of GDP-Mannose from mannose-1-phosphate and GTP. Mannose is a key monosaccharide for glycosylation of proteins and lipids. GDP-Mannose is the activated donor for mannosylation of various biomolecules. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase and mannose-1-phosphate guanylyltransferase. This CD covers the N-terminal GDP-mannose-1-phosphate guanylyltransferase domain, whereas the isomerase function is located at the C-terminal half. GDP-MP is a member of the nucleotidyltransferase family of enzymes. Probab=99.94 E-value=6.9e-27 Score=171.45 Aligned_cols=186 Identities=16% Similarity=0.204 Sum_probs=134.9 Q ss_pred CEEEEECCCCCCCCCCC----CCCCEEEEC-CEEHHHHHHHHHHHC-CCCEEEEEECCC-HHHHHHHHCCCCCEEEEEEC Q ss_conf 08999458877232788----874412588-485799999999977-997799995798-89999973026972899985 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS----SSKVLQKIA-GKPMISHVMETIAAA-GIENVALVLGYG-AEEITRINFPPTLSVEYYIQ 77 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~----~pKpLlpi~-gkpli~~~i~~l~~~-g~~~ivvv~~~~-~e~i~~~~~~~~~~i~~v~q 77 (442) +.+||||||.||||||. +||+++++. +++|++++++++.+. ++++++|+|+.. .+.+++.+.....+..++.| T Consensus 1 i~~VIlaGG~GtRLWPlSr~~~PKQf~~~~~~~sLlq~T~~R~~~~~~~~~i~IvTn~~~~~~v~~ql~~~~~~~~ii~E 80 (274) T cd02509 1 IYPVILAGGSGTRLWPLSRESYPKQFLKLFGDKSLLQQTLDRLKGLVPPDRILVVTNEEYRFLVREQLPEGLPEENIILE 80 (274) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHCCCCCCEEEC T ss_conf 98999479860446875588899775217999989999999874689867809995721799999998843986648967 Q ss_pred CCCCCHHHHHHHCCCCCCC--CCCEEEEEECCCCCCCC-CCCCCCCC-C--CCCCCCCCCEEEECCCCC-CCCEEEECCC Q ss_conf 9988614466410000123--44605898515411153-32200000-0--135861001001012215-7440420233 Q gi|254780942|r 78 DCQQGTAHAVLTAQDAIKP--GYDDVIIMYGDVPLVSS-HTLKKAMD-K--IAQGYSIAVVGFNADNPK-GYGRLLIKNN 150 (442) Q Consensus 78 ~~~~GTa~Ai~~a~~~l~~--~~~~~lVl~GD~~li~~-~~l~~l~~-~--~~~~~~~~i~~~~~~dp~-~yGvV~~d~~ 150 (442) +..++||.|+..+..++.. .++-++|+.+|+ +|.+ ..+.+.+. . +.+...++++..+...|+ +||.+..++. T Consensus 81 P~~rnTApAI~laa~~~~~~~~d~~l~v~PsDH-~I~d~~~F~~~i~~a~~~a~~~~ivt~GI~P~~P~TgYGYI~~~~~ 159 (274) T cd02509 81 PEGRNTAPAIALAALYLAKRDPDAVLLVLPSDH-LIEDVEAFLKAVKKAVEAAEEGYLVTFGIKPTRPETGYGYIEAGEK 159 (274) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCC-CCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCEEEEECCC T ss_conf 688897999999999999759997799984740-2008899999999999997339889974213778988628985753 Q ss_pred ------CCEECCCCCCCCCCCCCC----CCCCCCHHEECCHHHHHHHHHCC Q ss_conf ------310010014667542100----03556001112501467775328 Q gi|254780942|r 151 ------EIIAIREENDATDEERKI----HYCNSGLMAIDGLYIMDWLLQIK 191 (442) Q Consensus 151 ------~v~~i~Ek~d~~~~~~~~----~lin~GiY~f~~~~L~~~l~~l~ 191 (442) +|.+|+|||+....+... -+=|+|+|+|+.+.+.+.+.+.. T Consensus 160 ~~~~~~~V~~F~EKP~~~~A~~~i~sg~y~WNsGiFv~~~~~~l~~~~~~~ 210 (274) T cd02509 160 LGGGVYRVKRFVEKPDLETAKEYLESGNYLWNSGIFLFRAKTFLEELKKHA 210 (274) T ss_pred CCCCCEEEEEEEECCCHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHC T ss_conf 577742578876479889999998759903300232200999999999879 No 59 >TIGR03532 DapD_Ac 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase. Alternate name: tetrahydrodipicolinate N-acetyltransferase. Note that IUBMB lists this alternate name as the accepted name. Unfortunately, the related succinyl transferase acting on the same substrate (EC:2.3.1.117, TIGR00695) uses the opposite standard. We have decided to give these two enzymes names which more clearly indicated that they act on the same substrate. Probab=99.93 E-value=6.4e-26 Score=165.94 Aligned_cols=127 Identities=29% Similarity=0.498 Sum_probs=93.9 Q ss_pred CCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 84100122322344200021116313433-21211121002134432000002586443202310211247854487455 Q gi|254780942|r 273 DTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSY 351 (442) Q Consensus 273 ~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~ 351 (442) ++.|+|++.|.+.+.||++++|.+++++. +++||++++|.. ++.++| T Consensus 86 ~aRIepgaiir~~v~IG~~aVImmga~INiGA~IGegtmId~-----------------------------~Avvg~--- 133 (231) T TIGR03532 86 NARIEPGAIIRDQVIIGDNAVIMMGAVINIGAEIGEGTMIDM-----------------------------NAVLGG--- 133 (231) T ss_pred CCEECCCCEEECCEEECCCEEEECCCEEECCCEECCCCEECC-----------------------------CCEECC--- T ss_conf 835778858842529879839913888704859878945805-----------------------------856445--- Q ss_pred CCCCEECCCCEECCCEEECC-CCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEE Q ss_conf 56639889989979808834-45775456298788798869989378398899899778357664369868981356077 Q gi|254780942|r 352 VGDSVVGKNVNIGAGTITCN-YDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIV 430 (442) Q Consensus 352 igd~~iG~~~~ig~g~i~~n-~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~ 430 (442) +|+||+||.||+|+.++. +......+++|||+||||+|+++++|++||++|+|+|||+|++|+|++++|.|.|.+++ T Consensus 134 --~a~IGk~~HisaGa~laGVleP~~a~PViIeDnV~IGAnAvIl~GV~IG~gsVIgAGsVVtkdVP~~sVVaG~PAKII 211 (231) T TIGR03532 134 --RATVGKNVHIGAGAVLAGVIEPPSAKPVVIEDNVLIGANAVILEGVRVGKGAVVAAGAIVTEDVPPNTVVAGVPAKVI 211 (231) T ss_pred --CCEECCCCEECCCCEEECCCCCCCCCCEEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCCCCCEEEECCEEEE T ss_conf --468879705889864300225766799798788598799799199899999999849889565799979985481999 Q ss_pred ECC Q ss_conf 077 Q gi|254780942|r 431 KED 433 (442) Q Consensus 431 ~~~ 433 (442) |+- T Consensus 212 K~v 214 (231) T TIGR03532 212 KQV 214 (231) T ss_pred EEC T ss_conf 855 No 60 >cd04645 LbH_gamma_CA_like Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E. coli paaY protein, and similar proteins. CAs are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Gamma CAs are trimeric enzymes with left-handed parallel beta helix (LbH) structural domain. Probab=99.92 E-value=1.3e-24 Score=158.47 Aligned_cols=72 Identities=31% Similarity=0.580 Sum_probs=64.7 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEEC--CCCCCCCEEEECCCEEEE Q ss_conf 639889989979808834457754562987887988699893783988998997783576--643698689813560770 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIIT--QDTPENSLVFARSRQIVK 431 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~--~dv~~~~~v~~~~~~~~~ 431 (442) ++.||++|.||+++++ +++.||++|+||.|+++++|++||++|+|||||+|+ +|||+++++.|.|-+++| T Consensus 60 ~~~IG~~v~Ig~~~~i--------~g~~Ig~~~~IG~~a~I~~gv~IG~~~vIgagsvV~~~~~vp~~~v~~G~PAr~ir 131 (153) T cd04645 60 PTIIGDNVTVGHGAVL--------HGCTIGDNCLIGMGAIILDGAVIGKGSIVAAGSLVPPGKVIPPGSLVAGSPAKVVR 131 (153) T ss_pred CCEECCCCEECCCCEE--------ECEEEEEEEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEECCCEEEC T ss_conf 9798999899999599--------24187300477054398569998899799449897489796998599962837971 Q ss_pred CC Q ss_conf 77 Q gi|254780942|r 432 ED 433 (442) Q Consensus 432 ~~ 433 (442) += T Consensus 132 ~~ 133 (153) T cd04645 132 EL 133 (153) T ss_pred CC T ss_conf 59 No 61 >cd03350 LbH_THP_succinylT 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (also called THP succinyltransferase): THDP N-succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is homotrimeric and each subunit contains an N-terminal region with alpha helices and hairpin loops, as well as a C-terminal region with a left-handed parallel alpha-helix (LbH) structural motif encoded by hexapeptide repeat motifs. Probab=99.92 E-value=1.1e-24 Score=158.99 Aligned_cols=72 Identities=26% Similarity=0.495 Sum_probs=63.0 Q ss_pred CCEECCCCEECCCEEECC-CCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEEC Q ss_conf 639889989979808834-457754562987887988699893783988998997783576643698689813 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCN-YDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFAR 425 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n-~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~ 425 (442) +|.||+||.|++++.+++ +.+..+.+++|+|+||||+|++++||++||++|+|||||+|+||+|++.++-++ T Consensus 49 ~~~IG~~v~I~~g~~i~g~~~p~~~~~v~IeD~~~IGa~a~V~~Gv~IG~~avIGAGsVVtkdvp~y~~~~g~ 121 (139) T cd03350 49 CAQIGKNVHLSAGAVIGGVLEPLQATPVIIEDDVFIGANCEVVEGVIVGKGAVLAAGVVLTQSTPIYDRETGE 121 (139) T ss_pred CCEECCCCEECCCCEECCCCCCCCCCCCEECCCEEEEECCEEECCCEECCCCEECCCCEEECCCCCEEEECCC T ss_conf 6699993289888678444257646886887995993099894687999998999798891898846823780 No 62 >cd05635 LbH_unknown Uncharacterized proteins, Left-handed parallel beta-Helix (LbH) domain: Members in this group are uncharacterized bacterial proteins containing a LbH domain with multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.91 E-value=2.6e-25 Score=162.52 Aligned_cols=87 Identities=33% Similarity=0.441 Sum_probs=81.1 Q ss_pred CCCCCCEEEECCCCCCCCC-CCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCE Q ss_conf 3442000211163134332-121112100213443200000258644320231021124785448745556639889989 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLEG-VHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVN 362 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~~-~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ 362 (442) +.+.|+++++|.++++|++ ++||++|.|+|+|+||+++.|+++|+||+ |+|+|+|+++++++|++|||||+||++|+ T Consensus 10 GpV~I~~~~~I~~~~~i~Gp~~IG~~~~I~p~a~Ir~~t~Ig~~~~IG~--Eik~S~i~~~s~~~H~~YlGDS~iG~~vN 87 (101) T cd05635 10 GPIYIGKDAVIEPFAVIEGPVYIGPGSRVKMGARIYGNTTIGPTCKIGG--EVEDSIIEGYSNKQHDGFLGHSYLGSWCN 87 (101) T ss_pred CCEEECCCCEECCCCEEECCEEECCCCEECCCCEEECCCEECCCCEECC--EEEEEEEECCCCCCCCCEEEEEEECCCCE T ss_conf 9999879999999999976669999999999978638879999977614--08761885787568775876117838757 Q ss_pred ECCCEEECCC Q ss_conf 9798088344 Q gi|254780942|r 363 IGAGTITCNY 372 (442) Q Consensus 363 ig~g~i~~n~ 372 (442) |||||+++|+ T Consensus 88 iGAGT~t~Nl 97 (101) T cd05635 88 LGAGTNNSDL 97 (101) T ss_pred ECCCCEECCC T ss_conf 8778578055 No 63 >cd04650 LbH_FBP Ferripyochelin Binding Protein (FBP): FBP is an outer membrane protein which plays a role in iron acquisition. It binds iron when it is complexed with pyochelin. It adopts the left-handed parallel beta-helix (LbH) structure, and contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Acyltransferase activity has not been observed in this group. Probab=99.91 E-value=3.6e-24 Score=155.99 Aligned_cols=71 Identities=21% Similarity=0.492 Sum_probs=64.0 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEEC--CCCCCCCEEEECCCEEEE Q ss_conf 639889989979808834457754562987887988699893783988998997783576--643698689813560770 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIIT--QDTPENSLVFARSRQIVK 431 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~--~dv~~~~~v~~~~~~~~~ 431 (442) ++.||++|.||.++++ +++.||++|+||.|++|++|++||++|+|+|||+|+ +|||+++++.|.|.+++| T Consensus 61 ~~~ig~~v~ig~~~~i--------~g~~Ig~~v~IG~~a~I~~Gv~IG~~svIgaGsVV~~~~~Vp~~~lv~G~PAr~ir 132 (154) T cd04650 61 PTEIGDYVTIGHNAVV--------HGAKVGNYVIVGMGAILLNGAKIGDHVIIGAGAVVTPGKEIPDYSLVLGVPAKVVR 132 (154) T ss_pred CCEECCCCEECCCEEE--------ECCEECCEEEECCCEEEECCCEECCCCEECCCCEECCCCEECCCCEEEECCEEEEE T ss_conf 8687997688995288--------23022013598872299628798999799889888799791998499961818977 Q ss_pred C Q ss_conf 7 Q gi|254780942|r 432 E 432 (442) Q Consensus 432 ~ 432 (442) + T Consensus 133 ~ 133 (154) T cd04650 133 K 133 (154) T ss_pred E T ss_conf 2 No 64 >cd04745 LbH_paaY_like paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E. coli paaY gene, which is part of the paa gene cluster responsible for phenylacetic acid degradation. Proteins in this group are expected to adopt the left-handed parallel beta-helix (LbH) structure. They contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Similarity to gamma carbonic anhydrase and Ferripyochelin Binding Protein (FBP) may suggest metal binding capacity. Probab=99.91 E-value=4e-24 Score=155.73 Aligned_cols=72 Identities=26% Similarity=0.445 Sum_probs=64.4 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECC--CCCCCCEEEECCCEEEE Q ss_conf 6398899899798088344577545629878879886998937839889989977835766--43698689813560770 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQ--DTPENSLVFARSRQIVK 431 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~--dv~~~~~v~~~~~~~~~ 431 (442) |+.||+++.||.++++ +++.||++|+||.|++|++|++||++|+|||||+|++ |||+++++.|.|-+++| T Consensus 61 ~~~Ig~~~~ig~~~~~--------~g~~Ig~~~~IG~~a~I~~G~~IG~~siIgagSvV~~~~~VP~~~l~~GnPAk~ir 132 (155) T cd04745 61 DTVLEENGHIGHGAIL--------HGCTIGRNALVGMNAVVMDGAVIGEESIVGAMAFVKAGTVIPPRSLIAGSPAKVIR 132 (155) T ss_pred CCEECCCCEECCCCEE--------CCCEEEEEEEECCCCEEECCCEECCCCEEEECCEECCCCCCCCCEEEEEECCEEEE T ss_conf 8698999789987487--------15177432497687599597798999799418787699683799899972867974 Q ss_pred CC Q ss_conf 77 Q gi|254780942|r 432 ED 433 (442) Q Consensus 432 ~~ 433 (442) += T Consensus 133 ~l 134 (155) T cd04745 133 EL 134 (155) T ss_pred CC T ss_conf 39 No 65 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=99.90 E-value=2.2e-23 Score=151.53 Aligned_cols=344 Identities=15% Similarity=0.198 Sum_probs=194.4 Q ss_pred CCCEEEEECCCCCCCCCCC---CCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHH-HHHHHHCCC---CCE--- Q ss_conf 7408999458877232788---87441258848-579999999997799779999579889-999973026---972--- Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSS---SSKVLQKIAGK-PMISHVMETIAAAGIENVALVLGYGAE-EITRINFPP---TLS--- 71 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~---~pKpLlpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e-~i~~~~~~~---~~~--- 71 (442) +++.|||=-.-.-..|.|+ ||=..+||+|+ .|||++|+++.+||+++|.+....+.- .+.+|++.+ +++ T Consensus 1 ~~~~aIi~l~~~~~~l~~LT~~Rp~A~~PfgGrYRlIDF~LSn~~NAgI~~v~~~~~~~~~~Sl~DHLg~G~eWdL~r~~ 80 (383) T TIGR02092 1 NKMSAIINLTESKKNLSPLTKVRPVASLPFGGRYRLIDFPLSNMVNAGIRNVAVFFKNKERRSLFDHLGSGREWDLNRKR 80 (383) T ss_pred CCEEEEEECCCCCHHCCCHHHHCCCEEECCCCEEEEEECCHHCCCCCCEEEEEEECCCCCCCHHHHHCCCCCCCCCCCCC T ss_conf 93589973578820113145508703323586477883010023103600224652666653152213486883611025 Q ss_pred -EEEEE----CCC---CCCH-----HHHHHHCCCCCCC-CCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEEC Q ss_conf -89998----599---8861-----4466410000123-44605898515411153322000000135-86100100101 Q gi|254780942|r 72 -VEYYI----QDC---QQGT-----AHAVLTAQDAIKP-GYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNA 136 (442) Q Consensus 72 -i~~v~----q~~---~~GT-----a~Ai~~a~~~l~~-~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~ 136 (442) --|+. +++ ..+- -+-..+.+++|.. ..+.++|+++++ +.+.||+++++.|++ +.++|++=.++ T Consensus 81 ~glF~~pP~~~~~~~~~~~~kaall~~~~~~~l~fl~rS~~~y~V~~~s~~--v~NiDl~~~l~~H~~~~~~iT~VYKkv 158 (383) T TIGR02092 81 DGLFLFPPKYNDRDDVSEGGKAALLKRYFSNNLEFLKRSTSEYVVVLNSHM--VCNIDLKAVLKRHEETGKDITVVYKKV 158 (383) T ss_pred CCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCE--EEECCHHHHHHHHHHCCCCEEEEECCC T ss_conf 752770687686432345633778899998876787508997189854745--783028999999874489668997347 Q ss_pred CC--CCC-C-CEEEEC-CCCCEECC--CCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHH-HHHHHHHH Q ss_conf 22--157-4-404202-33310010--01466754210003556001112501467775328887400111-21148999 Q gi|254780942|r 137 DN--PKG-Y-GRLLIK-NNEIIAIR--EENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYY-LTDIIEKA 208 (442) Q Consensus 137 ~d--p~~-y-GvV~~d-~~~v~~i~--Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~y-ltDii~~l 208 (442) ++ ++. + -++..| .|++..+. .+.+..+.+...+ +++-+|+++.+.|.++|...... |+.. +.+++... T Consensus 159 ~~~~~~~~~c~~L~~De~g~v~s~~Pms~~~~~~~~e~~n-~Sl~~yi~~t~lLiell~~~~~~---g~~~~l~~~~~~~ 234 (383) T TIGR02092 159 KKADLSEEDCTILKFDEKGEVKSIGPMSVAKNLNKEEEEN-LSLDIYILSTELLIELLYECIQK---GKKTSLEELIREN 234 (383) T ss_pred CCCCCCCCCCCEEEECCCCCEEEEECEEEECCCCCHHHHH-HHHHHEEECHHHHHHHHHHHHHC---CCHHHHHHHHHHH T ss_conf 8467762357278863678568863200002676134443-20100242668999999987631---5567999999999 Q ss_pred HHHHCCEEEEEECHHHEECCCCCCCCCCCC-CCCCCCCCCCCCCCCCCEEECCCEE-EEEEEEECCCCCEECCCCCCCCC Q ss_conf 973077179991721430111210000001-2222211221222332102115279-94112331784100122322344 Q gi|254780942|r 209 RLDGKSIASIDVKEQEVCGCNNRYELSLIE-NIWQSRYRRQMMISGVTMIAPETVF-LSHDTIIQPDTVIEPHVFFGCGV 286 (442) Q Consensus 209 ~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~-~~~~~~~~~~~l~~gv~~~~p~~~~-i~~~~~I~~~~~I~~~~~Ig~~~ 286 (442) +.+ +.+.+|...++.-. +++-..+-.++ .++....... .| +...-.|..-+.=++|+.-..++ T Consensus 235 l~~-~~~~aYEY~GYl~~-I~s~~sYY~aNM~lL~~~~f~~-------------LF~~~~~rpIYTKvKdE~PT~y~~~S 299 (383) T TIGR02092 235 LKE-LNINAYEYKGYLAN-INSVKSYYKANMDLLEPKIFQS-------------LFYISQSRPIYTKVKDEPPTYYAETS 299 (383) T ss_pred HHH-CCCCCCEEEEEEEE-ECCHHHHHHHHHHHHHHHHHHH-------------HHCCCCCCEEEEEECCCCCCCCCCCC T ss_conf 986-47543303422111-1016789887288631687688-------------70778887147873588887446787 Q ss_pred CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCC Q ss_conf 20002111631343321211121002134432000002586443202310211247854487455566398899899798 Q gi|254780942|r 287 SIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAG 366 (442) Q Consensus 287 ~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g 366 (442) .+. +..+++||+|||-+ ++|+|..+|.|+..+.||||+||.++.|+..+.+..+++.+++.|+.+ T Consensus 300 ~V~-nSl~anGc~IEG~V--------------~NSil~R~V~v~kd~~~kn~iimq~~~Ig~ga~L~~vI~DK~~~I~~n 364 (383) T TIGR02092 300 KVE-NSLVANGCIIEGKV--------------ENSILSRGVYVGKDALIKNSIIMQRTVIGEGAHLENVIIDKDVVIEPN 364 (383) T ss_pred EEE-EEEEECCCEECCEE--------------EEEEEECCEEECCCCEEEEEEECCCCEECCCCEEEEEEECCCCEECCC T ss_conf 676-66676263561488--------------557985541872786787547758877787865615886576287887 Q ss_pred EEECCCCCCCCCCCEECCC Q ss_conf 0883445775456298788 Q gi|254780942|r 367 TITCNYDGTHKYKTHINEN 385 (442) Q Consensus 367 ~i~~n~~g~~~~~~~ig~~ 385 (442) +.+ .|...+|.+|..+ T Consensus 365 ~~~---~G~~e~PlVi~Kg 380 (383) T TIGR02092 365 EKI---AGTSEQPLVISKG 380 (383) T ss_pred EEE---EECCCCCEEECCC T ss_conf 068---5247886264487 No 66 >TIGR01853 lipid_A_lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; InterPro: IPR007691 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase (2.3.1 from EC) catalyses an early step in lipid A biosynthesis :UDP-3-O-(3-hydroxytetradecanoyl)glucosamine + (R)-3-hydroxytetradecanoyl- [acyl carrier protein] = UDP-2,3-bis(3-hydroxytetradecanoyl)glucosamine + [acyl carrier protein] Members of this family also contain a hexapeptide repeat (IPR001451 from INTERPRO). This entry represents the non-repeating region of LPXD proteins.; GO: 0016747 transferase activity transferring groups other than amino-acyl groups, 0009245 lipid A biosynthetic process. Probab=99.90 E-value=6.6e-24 Score=154.49 Aligned_cols=163 Identities=25% Similarity=0.329 Sum_probs=118.9 Q ss_pred ECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCC-CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEE Q ss_conf 1152799411233178410012232234420002111631343-321211121002134432000002586443202310 Q gi|254780942|r 258 APETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYL-EGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVK 336 (442) Q Consensus 258 ~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i-~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~ 336 (442) +|. ..+++++.|++++.|+|+|+||.++.||++|+|+|+|+| ++|.||++|.|.|++.|++.+.||++|.|-+.+.|- T Consensus 105 ~p~-a~~~~~a~~~~~v~I~pnvvIG~~v~IG~~~~I~pg~~iG~~v~IG~~~~i~~~v~i~~~~~lG~nV~I~~GaVIG 183 (336) T TIGR01853 105 HPT-AVVEPSAKIGEGVTIGPNVVIGAGVEIGENVIIGPGVVIGDDVVIGDGSVIYPNVVIYERVQLGKNVIIHAGAVIG 183 (336) T ss_pred CCC-CCCCCCCCCCCCCEECCEEEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEEEECCEECCEEEECCCCEEC T ss_conf 768-7305653107843687541788886877863635776876886879967780670897241564715876887886 Q ss_pred -------CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEEC--CCC--------CCCCCCCEECCCCEECCCCEEEC--- Q ss_conf -------2112478544874555663988998997980883--445--------77545629878879886998937--- Q gi|254780942|r 337 -------KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITC--NYD--------GTHKYKTHINENAFIGSNSSLIA--- 396 (442) Q Consensus 337 -------~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~--n~~--------g~~~~~~~ig~~~~iG~~~~i~~--- 396 (442) .+--+...||+| +|..+|+++|.|||+|.+= -++ -.. .-+.|++||+||.|+.|.+ T Consensus 184 ~DGFGya~~~~g~~~KI~q---~G~ViI~D~VEIGANTtIDRg~~~dT~i~~gtKID-NLVQIAHNc~IG~n~~ivaqvG 259 (336) T TIGR01853 184 SDGFGYAHTANGGHVKIPQ---IGRVIIEDDVEIGANTTIDRGAFDDTVIGEGTKID-NLVQIAHNCRIGENCIIVAQVG 259 (336) T ss_pred CCCCCEEECCCCCEEEEEE---ECEEEEECEEEECCCCHHHCCCCCCCEEECCCEEC-CEEEEEEEEEECCCCEEEEEEE T ss_conf 7873203578897154413---02489602338746870211021471872384662-3157413346266427896420 Q ss_pred --C-EEECCCCEECCCCEECCC--CCCCCEEEEC Q ss_conf --8-398899899778357664--3698689813 Q gi|254780942|r 397 --P-ITIGQGTYVASGSIITQD--TPENSLVFAR 425 (442) Q Consensus 397 --g-v~ig~~~~i~ag~~v~~d--v~~~~~v~~~ 425 (442) | ++||+||++|..+.|..= +-+++.+-++ T Consensus 260 iaGSt~iG~~V~~GGqvGv~GHl~IgD~v~i~A~ 293 (336) T TIGR01853 260 IAGSTKIGRNVILGGQVGVAGHLEIGDNVTIGAK 293 (336) T ss_pred CCEEEEECCEEEECCEEEEECEEEECCCCEEEEE T ss_conf 1134675580696260265110894383488451 No 67 >PRK00892 lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional Probab=99.89 E-value=6.4e-23 Score=148.87 Aligned_cols=165 Identities=25% Similarity=0.313 Sum_probs=111.5 Q ss_pred EECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEE Q ss_conf 211527994112331784100122322344200021116313433-2121112100213443200000258644320231 Q gi|254780942|r 257 IAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEV 335 (442) Q Consensus 257 ~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei 335 (442) ++| +.++++++.|++++.|+|+++|++++.||++|+|+++++|. ++.||+||.|+|++.|+++|.||++|.|.+++.| T Consensus 101 Ih~-sA~I~~~a~Ig~~v~Igp~~vIg~~v~IG~~~~I~~~~~Ig~~v~IG~~~~I~~nv~I~~~~~IG~~~~I~~g~vI 179 (343) T PRK00892 101 IHP-SAVIDPTAKIGKSVSIGPNAVIGAGVEIGDGVIIGAGAVIGAGVKIGDDCRLHANVTIYHRVQIGNRVLIHSGAVI 179 (343) T ss_pred CCC-CCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCCCEECCCCEE T ss_conf 797-5361887787999889999899999899999999298199588188899588118566255588887564688622 Q ss_pred ECC------CCCCCCCCCCCCCCCCCEECCCCEECCCEEECC--CC----CC-C--CCCCEECCCCEECCCCEEE----- Q ss_conf 021------124785448745556639889989979808834--45----77-5--4562987887988699893----- Q gi|254780942|r 336 KKA------TIKEGSKINHLSYVGDSVVGKNVNIGAGTITCN--YD----GT-H--KYKTHINENAFIGSNSSLI----- 395 (442) Q Consensus 336 ~~s------~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n--~~----g~-~--~~~~~ig~~~~iG~~~~i~----- 395 (442) -.- .-+...+++|. |..+||++|.||+++++-. ++ |. . ...+.|++||.||.|+.+. T Consensus 180 G~dGFGf~~~~~~~~kip~~---G~V~I~d~veIgan~tIdrg~~~~T~Ig~~~kidn~vhIaHn~~IG~~~iia~~~gi 256 (343) T PRK00892 180 GQDGFGFAPDRGGWVKIPQL---GRVIIGDDVEIGANTTIDRGALDDTVIGEGVKIDNLVQIAHNVRIGRHCAIAAQVGI 256 (343) T ss_pred CCCCCCCCCCCCCEEECCCC---CEEEECCCCEECCCCEECCCCCCCCEECCCCEEEEEEEECCCCEECCCCEEEECCEE T ss_conf 36675413679940464643---249988986888873272366779788989659211176167478788388521147 Q ss_pred C-CEEECCCCEECCCCEECCCC--CCCCEEEEC Q ss_conf 7-83988998997783576643--698689813 Q gi|254780942|r 396 A-PITIGQGTYVASGSIITQDT--PENSLVFAR 425 (442) Q Consensus 396 ~-gv~ig~~~~i~ag~~v~~dv--~~~~~v~~~ 425 (442) + .++||+++++|..+.|..-+ -+++++-++ T Consensus 257 aGs~~igd~v~igG~~gi~~h~~Ig~~~~i~~~ 289 (343) T PRK00892 257 AGSTKIGRYCMIGGQVGIAGHLEIGDGVQITAM 289 (343) T ss_pred EEEEEECCCEEECCCCCCCCCCEECCCCEEEEC T ss_conf 005799991499441323488698999799006 No 68 >COG1044 LpxD UDP-3-O-[3-hydroxymyristoyl] Probab=99.88 E-value=2.1e-22 Score=145.99 Aligned_cols=160 Identities=26% Similarity=0.332 Sum_probs=87.0 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEEC-CC Q ss_conf 7994112331784100122322344200021116313433-212111210021344320000025864432023102-11 Q gi|254780942|r 262 VFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKK-AT 339 (442) Q Consensus 262 ~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~-s~ 339 (442) ..+++.+.+++++.|+++++|+.++.||++|.|+++++|. ++.||++|.|++++.|++++.||++|.|++.+.|.. .. T Consensus 106 A~i~p~A~i~~~~~ig~~vvI~~~v~IG~~~~I~~~avIg~~~~IG~~~~i~~~v~I~~~~~IG~~v~I~~GavIG~dgF 185 (338) T COG1044 106 AVIDPTATIGKNVSIGPNVVIGAGVVIGENVVIGAGAVIGENVKIGDGTVIHPNVTIYHNVVIGNNVIIHSGAVIGADGF 185 (338) T ss_pred CCCCCCCCCCCCCCCCCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCCCEECCCCC T ss_conf 53177670377873288749888978789858988978888968888858847878952757878228889988756852 Q ss_pred CCCC-----CCCCCCCCCCCCEECCCCEECCCEEECC--CCC------C-CCCCCEECCCCEECCCCEEEC------CEE Q ss_conf 2478-----5448745556639889989979808834--457------7-545629878879886998937------839 Q gi|254780942|r 340 IKEG-----SKINHLSYVGDSVVGKNVNIGAGTITCN--YDG------T-HKYKTHINENAFIGSNSSLIA------PIT 399 (442) Q Consensus 340 i~~g-----~~i~~~~~igd~~iG~~~~ig~g~i~~n--~~g------~-~~~~~~ig~~~~iG~~~~i~~------gv~ 399 (442) ...+ .|+.| +|.++||++|.||+.|.+.. ++- . -...+.|+.||+||.++.+.. .++ T Consensus 186 g~a~~~~g~~Ki~q---~g~V~Igd~VeIGanT~Idrga~~dTvIg~~~kIdN~vqIaHnv~IG~~~~I~~~vgIaGs~~ 262 (338) T COG1044 186 GYAGTAIGWVKIPQ---IGRVIIGDDVEIGANTTIDRGALDDTVIGEGVKIDNLVQIGHNVRIGEHCIIAGQVGIAGSVK 262 (338) T ss_pred CCCCCCCCCEECCE---ECEEEECCCEEECCCCEECCCCCCCCEECCCCEECCEEEECCCCEECCCCEEECCCEEECCCE T ss_conf 12246688567126---065997982487355242565556726537848764307815527888837702631613535 Q ss_pred ECCCCEECCCCEECCCC--CCCCEEEE Q ss_conf 88998997783576643--69868981 Q gi|254780942|r 400 IGQGTYVASGSIITQDT--PENSLVFA 424 (442) Q Consensus 400 ig~~~~i~ag~~v~~dv--~~~~~v~~ 424 (442) ||++|++|..+.+...+ -+++.+-+ T Consensus 263 IG~~v~igg~vgI~gh~~IgD~~~I~~ 289 (338) T COG1044 263 IGKYVIIGGQVGIAGHLEIGDGVTIGA 289 (338) T ss_pred ECCEEEECCCEEECCCEEECCCCEEEC T ss_conf 778079876134648418757878804 No 69 >PRK13627 carnitine operon protein CaiE; Provisional Probab=99.87 E-value=5.8e-22 Score=143.44 Aligned_cols=72 Identities=28% Similarity=0.451 Sum_probs=64.8 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC--CCCCCEEEECCCEEEE Q ss_conf 63988998997980883445775456298788798869989378398899899778357664--3698689813560770 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD--TPENSLVFARSRQIVK 431 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d--v~~~~~v~~~~~~~~~ 431 (442) +..||++|+||.++++ |+|+|||+|+||.|++|+.|++||++|+|+|||+|+.+ +|+++|+.|.|-++++ T Consensus 71 ~~~IG~~vtIGH~ail--------hgc~Igd~~lIGmgavVldga~Ig~~~iI~AgslV~~g~~ip~~~L~~G~PAk~vR 142 (196) T PRK13627 71 DTIVGENGHIGHGAIL--------HGCVIGRDALVGMNSVIMDGAVIGEESIVAAMSFVKAGFQGEKRQLLMGTPARAVR 142 (196) T ss_pred CCEECCCEEECCCEEE--------EEEEEECCCEECCCCEEECCCEECCCCEECCCCEECCCCCCCCCCEEEECCEEEEE T ss_conf 8588898058787299--------63499489788469799288899999899768645799493898199877816811 Q ss_pred CC Q ss_conf 77 Q gi|254780942|r 432 ED 433 (442) Q Consensus 432 ~~ 433 (442) += T Consensus 143 ~l 144 (196) T PRK13627 143 SV 144 (196) T ss_pred CC T ss_conf 49 No 70 >cd03358 LbH_WxcM_N_like WcxM-like, Left-handed parallel beta-Helix (LbH) N-terminal domain: This group is composed of Xanthomonas campestris WcxM and proteins with similarity to the WcxM N-terminal domain. WcxM is thought to be bifunctional, catalyzing both the isomerization and transacetylation reactions of keto-hexoses. It contains an N-terminal LbH domain responsible for the transacetylation function and a C-terminal isomerase domain. The LbH domain contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), typical of enzymes with acyltransferase activity. Probab=99.87 E-value=4.8e-22 Score=143.88 Aligned_cols=53 Identities=30% Similarity=0.575 Sum_probs=48.9 Q ss_pred CCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEE Q ss_conf 56298788798869989378398899899778357664369868981356077 Q gi|254780942|r 378 YKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIV 430 (442) Q Consensus 378 ~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~ 430 (442) .+++||++|+||.+++++||++||++|+|+|||+|++|+|+++++.|.|.+++ T Consensus 66 ~~v~Ig~~v~IG~~~~I~~gv~IG~~~~IgagSvV~~dv~~~~i~~G~PAk~I 118 (119) T cd03358 66 KGTTVKRGASIGANATILPGVTIGEYALVGAGAVVTKDVPPYALVVGNPARII 118 (119) T ss_pred CCCEECCCEEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEEECCEEE T ss_conf 48398788384887477899498999899779989036599979997497991 No 71 >PRK05450 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional Probab=99.87 E-value=1.1e-21 Score=141.84 Aligned_cols=228 Identities=22% Similarity=0.268 Sum_probs=152.1 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 408999458877232788874412588485799999999977-9977999957988999997302697289998599886 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQG 82 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~G 82 (442) +.++||.|=..+||| |. |||.+|+|+|||+|+.+++.++ .+++++|.| ..++|.++....+.++-....+.+.| T Consensus 2 k~~~iIPAR~~SsR~-Pg--KpL~~I~GkpmI~~v~~~a~~~~~~~~V~VAT--dd~~I~~~~~~~g~~~imTs~~h~~G 76 (248) T PRK05450 2 KFVVIIPARYASTRL-PG--KPLADIGGKPMIVRVYERASKASGADRVVVAT--DDERIADAVEAFGGEVVMTSADHPSG 76 (248) T ss_pred CEEEEEECCCCCCCC-CC--CCCHHHCCCCHHHHHHHHHHHCCCCCEEEEEE--CCHHHHHHHHCCCCCEEECCCCCCCC T ss_conf 779998079778789-98--62024189698999999999668988299970--88524435315686234046454770 Q ss_pred HHHHHHHCCCCCCCCCCEE-EEEECCCCCCCCCCCCCCCCCCCCCC--CCCCEEEECC------CCCCCCEEEECCCCCE Q ss_conf 1446641000012344605-89851541115332200000013586--1001001012------2157440420233310 Q gi|254780942|r 83 TAHAVLTAQDAIKPGYDDV-IIMYGDVPLVSSHTLKKAMDKIAQGY--SIAVVGFNAD------NPKGYGRLLIKNNEII 153 (442) Q Consensus 83 Ta~Ai~~a~~~l~~~~~~~-lVl~GD~~li~~~~l~~l~~~~~~~~--~~~i~~~~~~------dp~~yGvV~~d~~~v~ 153 (442) | +.+..+.+.++....|+ +-+.||.||+++..++.+++....+. +++.+..+.. ||+.--++...+++++ T Consensus 77 T-dRi~Ea~~~l~~~~~d~IInvQGDEPli~p~~I~~li~~~~~~~~~~i~tl~~~~~~~~~~~d~n~VKvv~~~~~~al 155 (248) T PRK05450 77 T-DRIAEAAAKLGLSDDDIVVNVQGDEPLIPPELIDQVAEPLAAHPEADMATLAVPIDDEEELFNPNVVKVVLDKNGYAL 155 (248) T ss_pred H-HHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCHHHHHCCCCEEEEECCCCCEE T ss_conf 3-899999985087778689993598677899999999999985886639999986288899608883489989999840 Q ss_pred ECCCCCCC------CCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHH--HHHHHCCEEEEEECHHHE Q ss_conf 01001466------7542100035560011125014677753288874001112114899--997307717999172143 Q gi|254780942|r 154 AIREENDA------TDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEK--ARLDGKSIASIDVKEQEV 225 (442) Q Consensus 154 ~i~Ek~d~------~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~--l~~~g~~i~~~~~~~~~~ 225 (442) -|-..|=+ .......-+-..|+|.|+.++|..|.. ++.... | ...-+++ ++++|.++.++..+.... T Consensus 156 yfSRs~IP~~~~~~~~~~~~~~~~hiGiy~f~~~~L~~f~~-l~~s~l--E--~~E~lEqLR~leng~~I~~~~~~~~~~ 230 (248) T PRK05450 156 YFSRAPIPWGRDAADPTAPTPVYRHIGIYAYRRGFLRRFVS-LPPSPL--E--QIESLEQLRALENGYRIHVAVVDEAPS 230 (248) T ss_pred EEECCCCCCCCCCCCCCCCCCEEEEEEEEEEHHHHHHHHHH-CCCCHH--H--HHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 55426787544432123555516899988600999999874-699814--6--565269999998599579999678999 Q ss_pred ECCCCCCCCCCCCCCCC Q ss_conf 01112100000012222 Q gi|254780942|r 226 CGCNNRYELSLIENIWQ 242 (442) Q Consensus 226 ~gv~~~~~L~~~~~~~~ 242 (442) .+++++.||..++.++. T Consensus 231 ~~VDT~eDl~~v~~~lk 247 (248) T PRK05450 231 PGVDTPEDLERVRALLA 247 (248) T ss_pred CCCCCHHHHHHHHHHHC T ss_conf 99779999999999866 No 72 >pfam01128 IspD Uncharacterized protein family UPF0007. Probab=99.87 E-value=2.8e-21 Score=139.56 Aligned_cols=216 Identities=14% Similarity=0.212 Sum_probs=148.9 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCH-HHHHHHHCCCCCEEEEEECCCCCCH Q ss_conf 89994588772327888744125884857999999999779-977999957988-9999973026972899985998861 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAG-IENVALVLGYGA-EEITRINFPPTLSVEYYIQDCQQGT 83 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g-~~~ivvv~~~~~-e~i~~~~~~~~~~i~~v~q~~~~GT 83 (442) .|||||||.|+||....||++++++|+||++|.++.+.+.. +++|+++++.+. +.+++... ..++.++.--.. - T Consensus 2 ~aIIlAaG~G~R~~~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvv~~~~~~~~~~~~~~--~~~i~~v~GG~t--R 77 (221) T pfam01128 2 VAVIPAAGSGKRMGAGVPKQFLQLLGQPLLEHTVDAFLASPVVDRIVVAVSPDDTPEFRQLLG--DPSIQLVAGGDT--R 77 (221) T ss_pred EEEEECCEECCCCCCCCCCCCCEECCEEHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHCC--CCCEEEECCCCC--H T ss_conf 899935810612695898653668988899999999963888897999967788999998607--998799679963--8 Q ss_pred HHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCCCCCCC Q ss_conf 44664100001234460589851541115332200000013586100100101221574404202-33310010014667 Q gi|254780942|r 84 AHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIREENDAT 162 (442) Q Consensus 84 a~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~Ek~d~~ 162 (442) .++++++++.+....+.++|.-|.-||++...+.++++...++.++++.+.++.|.- ...+ ++.+.+..+..... T Consensus 78 ~~SV~ngL~~l~~~~~~VlIHDaaRP~v~~~~i~~li~~~~~~~~~~i~~~p~~DTi----k~v~~~~~i~~t~dR~~l~ 153 (221) T pfam01128 78 QDSVLNGLKALAGTAKFVLVHDGARPCLPHADLARLLAALETGTQGAILALPVTDTI----KRVEADGVVAGTPDRSGLW 153 (221) T ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCE----EEECCCCCEECCCCCHHHE T ss_conf 999999999835899989998365667999999999999860048613541246643----5753786044035642101 Q ss_pred CCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCCCCCC Q ss_conf 54210003556001112501467775328887400111211489999730771799917214301112100000012222 Q gi|254780942|r 163 DEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIENIWQ 242 (442) Q Consensus 163 ~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~~~~~ 242 (442) ...+. -+|+.+.|.+++++... ..++.||-.+.+...|.++..+..+... ..+++|.||..++.+++ T Consensus 154 --~~QTP------Q~F~~~~L~~a~~~~~~----~~~~~TDd~sl~~~~g~~v~~v~G~~~N-~KIT~~eDl~~ae~lL~ 220 (221) T pfam01128 154 --AAQTP------QGFRVDLLLAAHQRGDQ----PGAEITDDASLVEHAGGSVQVVPGRPDN-LKITTPEDLALAEAILR 220 (221) T ss_pred --ECCCC------CCCCHHHHHHHHHHHHH----CCCCCCCHHHHHHHCCCCEEEEECCCCC-CCCCCHHHHHHHHHHHC T ss_conf --10076------20069999999998886----2999762999999779977999658423-88789999999999967 No 73 >COG0836 {ManC} Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane] Probab=99.87 E-value=2.2e-21 Score=140.11 Aligned_cols=186 Identities=18% Similarity=0.217 Sum_probs=123.5 Q ss_pred CEEEEECCCCCCCCCCC----CCCCEEEEC-CEEHHHHHHHHHHH-CCCCEEEEEECCC-HHHHHHHHCC--CCCEEEEE Q ss_conf 08999458877232788----874412588-48579999999997-7997799995798-8999997302--69728999 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS----SSKVLQKIA-GKPMISHVMETIAA-AGIENVALVLGYG-AEEITRINFP--PTLSVEYY 75 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~----~pKpLlpi~-gkpli~~~i~~l~~-~g~~~ivvv~~~~-~e~i~~~~~~--~~~~i~~v 75 (442) +.++|||||.||||||+ .||+++++. +++|++.++++++. .+.+++++||+.. ...+++-+.. ....-.++ T Consensus 2 ~~pvIlaGG~GsRLWPLSR~~~PKQFl~L~~~~Sllq~T~~R~~~l~~~~~~~vVtne~~~f~v~eql~e~~~~~~~~il 81 (333) T COG0836 2 MIPVILAGGSGSRLWPLSRKDYPKQFLKLFGDLSLLQQTVKRLAFLGDIEEPLVVTNEKYRFIVKEQLPEIDIENAAGII 81 (333) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 12599707876536876754377230341799718999999875147856609995779999999986641012566068 Q ss_pred ECCCCCCHHHHHHHCCCCCCCCCCE--EEEEECCCCCCCCC-CCCCCCCC---CCCCCCCCCEEEECCCC-CCCCEEEEC Q ss_conf 8599886144664100001234460--58985154111533-22000000---13586100100101221-574404202 Q gi|254780942|r 76 IQDCQQGTAHAVLTAQDAIKPGYDD--VIIMYGDVPLVSSH-TLKKAMDK---IAQGYSIAVVGFNADNP-KGYGRLLIK 148 (442) Q Consensus 76 ~q~~~~GTa~Ai~~a~~~l~~~~~~--~lVl~GD~~li~~~-~l~~l~~~---~~~~~~~~i~~~~~~dp-~~yGvV~~d 148 (442) .|++.+-||-|+..+.-.+....+| ++|+.+|+ +|.+. .+.+-+.. ..+...++++....+.| ++||.+... T Consensus 82 lEP~gRnTApAIA~aa~~~~~~~~d~~~lVlpsDH-~I~d~~af~~av~~A~~~A~~g~lVTfGI~Pt~PeTGYGYIe~G 160 (333) T COG0836 82 LEPEGRNTAPAIALAALSATAEGGDALVLVLPSDH-VIADEEAFLNAVKKAEKAAEEGGIVTFGIPPTRPETGYGYIETG 160 (333) T ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCC-EECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEECC T ss_conf 54578774999999999998758983899944755-65568999999999999997198799832789886674146328 Q ss_pred -------CCCCEECCCCCCCCCCCCC----CCCCCCCHHEECCHHHHHHHHHCC Q ss_conf -------3331001001466754210----003556001112501467775328 Q gi|254780942|r 149 -------NNEIIAIREENDATDEERK----IHYCNSGLMAIDGLYIMDWLLQIK 191 (442) Q Consensus 149 -------~~~v~~i~Ek~d~~~~~~~----~~lin~GiY~f~~~~L~~~l~~l~ 191 (442) --.|.+|+||||....+.. ..+=|+|+|+|+...+.+.+++.. T Consensus 161 ~~~~~~~~~~V~~FvEKPd~etA~~yv~sG~y~WNSGmF~Fra~~~l~e~~~~~ 214 (333) T COG0836 161 ESIAENGVYKVDRFVEKPDLETAKKYVESGEYLWNSGMFLFRASVFLEELKKHQ 214 (333) T ss_pred CCCCCCCCEEEEEEEECCCHHHHHHHHHCCCEEEECCCEEEEHHHHHHHHHHHC T ss_conf 624567825755531388789999999739657632316888899999998609 No 74 >cd03352 LbH_LpxD UDP-3-O-acyl-glucosamine N-acyltransferase (LpxD): The enzyme catalyzes the transfer of 3-hydroxymyristic acid or 3-hydroxy-arachidic acid, depending on the organism, from the acyl carrier protein (ACP) to UDP-3-O-acyl-glucosamine to produce UDP-2,3-diacyl-GlcNAc. This constitutes the third step in the lipid A biosynthetic pathway in Gram-negative bacteria. LpxD is a homotrimer, with each subunit consisting of a novel combination of an N-terminal uridine-binding domain, a core lipid-binding left-handed parallel beta helix (LbH) domain, and a C-terminal alpha-helical extension. The LbH domain contains 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=99.87 E-value=5.5e-22 Score=143.54 Aligned_cols=156 Identities=23% Similarity=0.310 Sum_probs=91.7 Q ss_pred EEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCC---CCC Q ss_conf 2331784100122322344200021116313433-212111210021344320000025864432023102112---478 Q gi|254780942|r 268 TIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATI---KEG 343 (442) Q Consensus 268 ~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i---~~g 343 (442) +.|++++.|+|+++|++++.||++++|+++++|. ++.||++|.|++++.|++++.|+++|.|++++.+...-. ... T Consensus 2 a~ig~~v~I~~~avI~~~v~Ig~n~~I~~~~~I~~~v~IG~~~~I~~~~~I~~~~~IG~~~~I~~~~vIg~~~f~~~~~~ 81 (205) T cd03352 2 AKIGENVSIGPNAVIGEGVVIGDGVVIGPGVVIGDGVVIGDDCVIHPNVTIYEGCIIGDRVIIHSGAVIGSDGFGFAPDG 81 (205) T ss_pred CEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCEEEECCCEEECCCCCCCCCCC T ss_conf 69199999999879999999999999989889999968898836600344727868754477715348621353231479 Q ss_pred CCCCCCCCCCCCEECCCCEECCCEEECCCC------CC---------CCCCCEECCCCEECCCCEEECCEEECCCCEECC Q ss_conf 544874555663988998997980883445------77---------545629878879886998937839889989977 Q gi|254780942|r 344 SKINHLSYVGDSVVGKNVNIGAGTITCNYD------GT---------HKYKTHINENAFIGSNSSLIAPITIGQGTYVAS 408 (442) Q Consensus 344 ~~i~~~~~igd~~iG~~~~ig~g~i~~n~~------g~---------~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~a 408 (442) ....+..++|...||++|.||+++.+..-. |. -.|.+.||+++.+..++.+.-.++||++|++|+ T Consensus 82 ~~~~~~~~~g~v~Ig~~v~Ig~~~~I~~g~~~~T~IG~~~~i~~~v~igh~~~iG~~~~i~~~~~~~g~~~Igd~~~iG~ 161 (205) T cd03352 82 GGWVKIPQLGGVIIGDDVEIGANTTIDRGALGDTVIGDGTKIDNLVQIAHNVRIGENCLIAAQVGIAGSTTIGDNVIIGG 161 (205) T ss_pred CCCEECCCCCCEEECCCEEECCCCEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCEEECCCCCCCCCEEECCCCEECC T ss_conf 84045142131898488498886276366426774787768645452556404158707503200026524974353989 Q ss_pred CCEECCCC--CCCCEEE Q ss_conf 83576643--6986898 Q gi|254780942|r 409 GSIITQDT--PENSLVF 423 (442) Q Consensus 409 g~~v~~dv--~~~~~v~ 423 (442) ++++...+ -+++++- T Consensus 162 ~s~i~~gv~IG~~a~Vg 178 (205) T cd03352 162 QVGIAGHLTIGDGVVIG 178 (205) T ss_pred CCEECCCCEECCCCEEC T ss_conf 64882993999998998 No 75 >TIGR03570 NeuD_NnaD sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family. These proteins contain repeats of the bacterial transferase hexapeptide (pfam00132), although often these do not register above the trusted cutoff. Probab=99.86 E-value=1.5e-21 Score=141.15 Aligned_cols=66 Identities=33% Similarity=0.649 Sum_probs=57.9 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECC Q ss_conf 6398899899798088344577545629878879886998937839889989977835766436986898135 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARS 426 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~ 426 (442) ||.||++|.|++++.++. .+.||++|+||.|+++.|+++||++++|||||+|++|+|++++++|.| T Consensus 135 d~~ig~~~~i~~~~~i~G-------~v~Ig~~~~iG~~~~i~~~i~Ig~~~~Igags~V~~~v~~~~~~~G~P 200 (201) T TIGR03570 135 DCVIGDYVHIAPGVTLSG-------GVVIGEGVFIGAGATIIQGVTIGAGAIVGAGAVVTKDIPDGGVVVGVP 200 (201) T ss_pred CCEECCCCEECCCCEECC-------CCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCCCCCEEEEEC T ss_conf 254778768999859989-------887977735989889889989999999997988904379997999758 No 76 >cd02517 CMP-KDO-Synthetase CMP-KDO synthetase catalyzes the activation of KDO which is an essential component of the lipopolysaccharide. CMP-KDO Synthetase: 3-Deoxy-D-manno-octulosonate cytidylyltransferase (CMP-KDO synthetase) catalyzes the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate (CMP-KDO) and pyrophosphate. KDO is an essential component of the lipopolysaccharide found in the outer surface of gram-negative eubacteria. It is also a constituent of the capsular polysaccharides of some gram-negative eubacteria. Its presence in the cell wall polysaccharides of green algae and plant were also discovered. However, they have not been found in yeast and animals. The absence of the enzyme in mammalian cells makes it an attractive target molecule for drug design. Probab=99.85 E-value=6.4e-21 Score=137.49 Aligned_cols=225 Identities=21% Similarity=0.297 Sum_probs=150.9 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 408999458877232788874412588485799999999977-9977999957988999997302697289998599886 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQG 82 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~G 82 (442) +.++||.|=+.+||| |. |||.+|+|+|||+|+.+++.++ .+++++|.| ..++|.++....+.++-...++.+.| T Consensus 1 k~i~iIPAR~~S~Rl-p~--K~L~~I~G~pmI~~v~~~a~~s~~~d~V~VAT--Dd~eI~~~~~~~g~~~imT~~~h~~G 75 (239) T cd02517 1 KVIVVIPARYASSRL-PG--KPLADIAGKPMIQHVYERAKKAKGLDEVVVAT--DDERIADAVESFGGKVVMTSPDHPSG 75 (239) T ss_pred CEEEEECCCCCCCCC-CC--CCCHHHCCCCHHHHHHHHHHHCCCCCEEEEEC--CCHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 989997588768789-99--73045399598999999999669988499964--82655533214797401046433562 Q ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC--CCCCCEEEECC------CCCCCCEEEECCCCCEE Q ss_conf 14466410000123446058985154111533220000001358--61001001012------21574404202333100 Q gi|254780942|r 83 TAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQG--YSIAVVGFNAD------NPKGYGRLLIKNNEIIA 154 (442) Q Consensus 83 Ta~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~--~~~~i~~~~~~------dp~~yGvV~~d~~~v~~ 154 (442) | +.+..+.+.++...|-++.+.||.||+++..++.+++.+... ..++.+..+.. ||+..-++...+++++- T Consensus 76 T-dRi~ea~~~l~~~~D~iinvQGDePli~p~~I~~~i~~~~~~~~~~v~t~~~~i~~~~~~~~~n~VKvv~~~~~~aly 154 (239) T cd02517 76 T-DRIAEVAEKLDADDDIVVNVQGDEPLIPPEMIDQVVAALKDDPGVDMATLATPISDEEELFNPNVVKVVLDKDGYALY 154 (239) T ss_pred C-HHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCHHHHCCCCCEEEEECCCCCCHH T ss_conf 1-599999997199889899951875678999999999998528875598740236998894189963999789885110 Q ss_pred CCCCCCCCC---CCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHH--HHHHHCCEEEEEECHHHEECCC Q ss_conf 100146675---42100035560011125014677753288874001112114899--9973077179991721430111 Q gi|254780942|r 155 IREENDATD---EERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEK--ARLDGKSIASIDVKEQEVCGCN 229 (442) Q Consensus 155 i~Ek~d~~~---~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~--l~~~g~~i~~~~~~~~~~~gv~ 229 (442) |--.+=+.. .....-+-..|+|.|+.++|..|. .++.... | ...-+++ ++++|.++.++..+.. ..+++ T Consensus 155 fSRs~IP~~~~~~~~~~~~khiGIy~f~~~~L~~f~-~~~~s~l--E--~~E~lEqLR~leng~~I~~~~~~~~-~~~VD 228 (239) T cd02517 155 FSRSPIPYPRDSSEDFPYYKHIGIYAYRRDFLLRFA-ALPPSPL--E--QIESLEQLRALENGYKIKVVETDHE-SIGVD 228 (239) T ss_pred CCCCCCCCCCCCCCCCCEEEEEEEEEECHHHHHHHH-CCCCCHH--H--HHHHHHHHHHHHCCCEEEEEEECCC-CCCCC T ss_conf 334788643456555041037765563399999987-3799825--7--7760899999983980689996999-98988 Q ss_pred CCCCCCCCCCC Q ss_conf 21000000122 Q gi|254780942|r 230 NRYELSLIENI 240 (442) Q Consensus 230 ~~~~L~~~~~~ 240 (442) ++.||..++++ T Consensus 229 t~eDl~~veki 239 (239) T cd02517 229 TPEDLERVEAL 239 (239) T ss_pred CHHHHHHHHCC T ss_conf 99999999709 No 77 >cd03360 LbH_AT_putative Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain. The alignment contains 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. A few members are identified as NeuD, a sialic acid (Sia) O-acetyltransferase that is required for Sia synthesis and surface polysaccharide sialylation. Probab=99.85 E-value=2e-21 Score=140.40 Aligned_cols=66 Identities=35% Similarity=0.702 Sum_probs=55.2 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECC Q ss_conf 6398899899798088344577545629878879886998937839889989977835766436986898135 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARS 426 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~ 426 (442) ||.||++|.|++++.++. .+.||++|+||.|+++.++++||++++|||||+|++|+|++++++|.| T Consensus 132 d~~ig~~~~i~~~~~i~g-------~v~Ig~~~~iG~~~~i~~~i~Ig~~~~igags~v~~~i~~~~~~~G~P 197 (197) T cd03360 132 DCVIGDFVHIAPGVVLSG-------GVTIGEGAFIGAGATIIQGVTIGAGAIIGAGAVVTKDVPDGSVVVGNP 197 (197) T ss_pred CCCCCCCCEECCCEEECC-------CEEEEEECCCCCCCEECCCCEECCCCEECCCCEEEEECCCCEEEEECC T ss_conf 150131010667236778-------308912162989989879989999999997998912569990999309 No 78 >PRK13368 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional Probab=99.84 E-value=3.4e-20 Score=133.36 Aligned_cols=224 Identities=21% Similarity=0.246 Sum_probs=152.7 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 408999458877232788874412588485799999999977-9977999957988999997302697289998599886 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQG 82 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~G 82 (442) +.++||.|=+.+||| |. |||.+|+|+|||+|+.+++.++ .+++++|.| ..++|.++....+.++-...++...| T Consensus 2 k~~~iIPAR~~S~R~-p~--K~L~~i~g~pmi~~v~~~a~~s~~~d~v~VaT--dd~eI~~~~k~~g~~~i~ts~~~~~G 76 (238) T PRK13368 2 KVVVVIPARYGSSRL-PG--KPLLDILGKPMIQHVYERAAQAAGVEEVYVAT--DDQRIEDAVEAFGGKVVMTSDDHLSG 76 (238) T ss_pred CEEEEECCCCCCCCC-CC--CCCHHHCCCCHHHHHHHHHHHCCCCCEEEEEC--CCHHHHHHHHCCCCEEEECCCCCCCC T ss_conf 789998468778789-98--70125289699999999998468988299956--84248777500463278515655771 Q ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC--CCCEEEEC------CCCCCCCEEEECCCCCEE Q ss_conf 1446641000012344605898515411153322000000135861--00100101------221574404202333100 Q gi|254780942|r 83 TAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYS--IAVVGFNA------DNPKGYGRLLIKNNEIIA 154 (442) Q Consensus 83 Ta~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~--~~i~~~~~------~dp~~yGvV~~d~~~v~~ 154 (442) | +.+..+.+.++ .|-++.+.||.||+++..++.+++.+.++.. ++.++... .+|+.--++...+++++- T Consensus 77 T-dRi~Ea~~~~~--~d~ivnvQgDePli~p~~I~~~i~~~~~~~~~~v~t~~~~~~~~~~~~n~n~vKvv~~~~~~~ly 153 (238) T PRK13368 77 T-DRLAEVMLKIE--ADIYINVQGDEPLIRPRDIDTLIQPMLDDPSINVATLCAPISTEEEFESPNVVKVVVDKNGDALY 153 (238) T ss_pred H-HHHHHHHHHCC--CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHCCCCCEEEEECCCCCCCC T ss_conf 4-78999985159--87899953886778999999999999837765276787416888884389852999899787011 Q ss_pred CCCCCCCCCC--CCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHH--HHHHHCCEEEEEECHHHEECCCC Q ss_conf 1001466754--2100035560011125014677753288874001112114899--99730771799917214301112 Q gi|254780942|r 155 IREENDATDE--ERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEK--ARLDGKSIASIDVKEQEVCGCNN 230 (442) Q Consensus 155 i~Ek~d~~~~--~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~--l~~~g~~i~~~~~~~~~~~gv~~ 230 (442) |-..+-+... +...-+-..|+|.|+.++|..|.. ++.... +...-+++ ++++|.++.++..+.. ..++++ T Consensus 154 fSR~~ip~~~~~~~~~~~khiGiy~f~~~~L~~f~~-l~~s~l----E~~E~lEqLR~leng~~I~~~~~~~~-~~~VDt 227 (238) T PRK13368 154 FSRSPIPSRRDGESARYLKHVGIYAFRRDVLQQFSQ-LPETPL----EQIESLEQLRALEHGYKIRMVEVEAT-SIGVDT 227 (238) T ss_pred CCCCCCCCCCCCCCCHHHEEEEEEEECHHHHHHHHH-CCCCHH----HHHHHHHHHHHHHCCCCEEEEEECCC-CCCCCC T ss_conf 452567511245420122030112211999999870-899856----77770899999986992679995999-999879 Q ss_pred CCCCCCCCCCC Q ss_conf 10000001222 Q gi|254780942|r 231 RYELSLIENIW 241 (442) Q Consensus 231 ~~~L~~~~~~~ 241 (442) +.||..++.++ T Consensus 228 ~eDl~~v~kim 238 (238) T PRK13368 228 PEDLERVRAIM 238 (238) T ss_pred HHHHHHHHHHC T ss_conf 99999999859 No 79 >PRK00155 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Reviewed Probab=99.84 E-value=6.6e-20 Score=131.73 Aligned_cols=223 Identities=17% Similarity=0.239 Sum_probs=148.3 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCH-HHHHHHHCCCCCEEEEEECC Q ss_conf 987408999458877232788874412588485799999999977-9977999957988-99999730269728999859 Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGA-EEITRINFPPTLSVEYYIQD 78 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~-e~i~~~~~~~~~~i~~v~q~ 78 (442) |-.+..|||||||.|+||....||++++++|+|+++|.++.+.+. .+++|+++++.+. +..++.+......+.++.-- T Consensus 1 ~m~~i~aIIlAaG~G~R~g~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~IvvV~~~~~~~~~~~~~~~~~~~i~~v~GG 80 (228) T PRK00155 1 MMPMVYAIIPAAGIGSRMGADRPKQYLPLGGKPILEHTLEAFLAHPAIDEIIVVVPPDDTEYAKLLLAKFDKRIKVVAGG 80 (228) T ss_pred CCCCEEEEEECCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHCCCCEEEECCC T ss_conf 98757999937720513786799655668956999999999975999897999957626999999987449857997697 Q ss_pred CCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCC Q ss_conf 9886144664100001234460589851541115332200000013586100100101221574404202-333100100 Q gi|254780942|r 79 CQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIRE 157 (442) Q Consensus 79 ~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~E 157 (442) +.+ .++++++++++.. .+.++|.-|--||++...++.+++..+. .++++.+.++.|. +...+ ++.+.+.+. T Consensus 81 ~tR--~~Sv~ngL~~l~~-~~~VlIHDaARP~is~~~i~~li~~~~~-~~~~ip~~p~~DT----ik~~~~~~~i~~tl~ 152 (228) T PRK00155 81 AER--QDSVLNGLQALPD-DDWVLVHDAARPFLTPDDIDRLIEAAEE-YGAAILAVPVRDT----IKRVDAGGFIVDTPD 152 (228) T ss_pred HHH--HHHHHHHHHHHCC-CCEEEEECCCCCCCCHHHHHHHHHHHHH-CCCCEEEEECCCC----EEEECCCCCEEEECC T ss_conf 039--9999999998403-9979997066768999999999999984-6984698760241----789658981553035 Q ss_pred CCCCCCCCCCCCCCCCC-HHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCC Q ss_conf 14667542100035560-01112501467775328887400111211489999730771799917214301112100000 Q gi|254780942|r 158 ENDATDEERKIHYCNSG-LMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSL 236 (442) Q Consensus 158 k~d~~~~~~~~~lin~G-iY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~ 236 (442) . .++..+= =-+|+.+.|.+...+...+ ++..||-.+.+...|.++..+..+... .-+.+|.||.. T Consensus 153 R---------~~l~~~QTPQ~F~~~~l~~a~~~~~~~----~~~~TDd~sl~~~~g~~v~~v~G~~~N-~KIT~~eDL~~ 218 (228) T PRK00155 153 R---------SGLWAAQTPQGFRVELLKEALARALAE----GFTITDDASAVEWLGKPVRLVEGRYDN-IKVTTPEDLAL 218 (228) T ss_pred C---------HHEEEEECCCCCCHHHHHHHHHHHHHC----CCCCCCHHHHHHHCCCCEEEEECCCCC-CCCCCHHHHHH T ss_conf 1---------443665288532599999999999875----998778999999869976999668543-77289999999 Q ss_pred CCCCCCCCC Q ss_conf 012222211 Q gi|254780942|r 237 IENIWQSRY 245 (442) Q Consensus 237 ~~~~~~~~~ 245 (442) ++.++++++ T Consensus 219 ae~ll~~r~ 227 (228) T PRK00155 219 AEAILKRRI 227 (228) T ss_pred HHHHHHHHC T ss_conf 999998531 No 80 >PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional Probab=99.84 E-value=1.7e-20 Score=135.02 Aligned_cols=60 Identities=25% Similarity=0.513 Sum_probs=53.8 Q ss_pred CCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECC Q ss_conf 775456298788798869989378398899899778357664369868981356077077 Q gi|254780942|r 374 GTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKED 433 (442) Q Consensus 374 g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~ 433 (442) +....+++||++||||++++|+||++||++|+|||||+|++|+|+++++.|.|..++|+. T Consensus 125 ~~~~~pv~Ig~~~wIg~~~~Il~Gv~Ig~~~vvgA~SvV~~dvp~~~i~~G~PAk~Ik~~ 184 (192) T PRK09677 125 TLESSAVVIGQRVWIGENVTILPGVSIGNGCIVGANSVVTKSIPENTVIAGNPAKIIKKY 184 (192) T ss_pred CCCCCCEEECCCCEECCCCEEECCCEECCCCEECCCCEECCCCCCCEEEEEECCEEEEEC T ss_conf 677599898999899999999199699999899169999858599919997083889712 No 81 >cd03359 LbH_Dynactin_5 Dynactin 5 (or subunit p25); Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p25 is part of the pointed-end subcomplex in dynactin that also includes p26, p27, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.83 E-value=3.4e-20 Score=133.38 Aligned_cols=134 Identities=21% Similarity=0.227 Sum_probs=89.4 Q ss_pred EEECCCCCEEC--CCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCC Q ss_conf 23317841001--2232234420002111631343321211121002134432000002586443202310211247854 Q gi|254780942|r 268 TIIQPDTVIEP--HVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSK 345 (442) Q Consensus 268 ~~I~~~~~I~~--~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~ 345 (442) .+|++++.|.+ ++.||+++.|.++|+|.... ....||++|.|++++.|++.+..... . T Consensus 8 n~I~k~a~I~G~~nI~IG~~ssI~~~avirGD~--~~I~IG~~~~I~d~~~I~~~~~~~~~-----------------~- 67 (161) T cd03359 8 NKVSRKSVICGSQNIVLNGKTIIQSDVIIRGDL--ATVSIGRYCILSEGCVIRPPFKKFSK-----------------G- 67 (161) T ss_pred CEECCCCEEECCCCEEECCCCEECCCCEEECCC--CCEEECCCCEECCCCEEECCCCCCCC-----------------C- T ss_conf 958776299788775999987899996896788--65499998588899899246443577-----------------8- Q ss_pred CCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC--CCCCCEEE Q ss_conf 4874555663988998997980883445775456298788798869989378398899899778357664--36986898 Q gi|254780942|r 346 INHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD--TPENSLVF 423 (442) Q Consensus 346 i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d--v~~~~~v~ 423 (442) ..-.+..||++|.||.++++ +.+.||++|+||.+++|++|+.||++|+|++||+|..+ +||++++. T Consensus 68 ----~~~~p~~IGd~v~Ig~~~~i--------~~~~Ig~~v~IG~~~~I~~g~~Ig~~~~I~~gsvV~~~~~ip~~~~~~ 135 (161) T cd03359 68 ----VAFFPLHIGDYVFIGENCVV--------NAAQIGSYVHIGKNCVIGRRCIIKDCVKILDGTVVPPDTVIPPYSVVS 135 (161) T ss_pred ----CCCCCCEECCCEEECCCEEE--------ECCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEE T ss_conf ----88877284898798998288--------488966771557981994997999998998898988998989981998 Q ss_pred ECCCEEEECC Q ss_conf 1356077077 Q gi|254780942|r 424 ARSRQIVKED 433 (442) Q Consensus 424 ~~~~~~~~~~ 433 (442) |.|.+++|+= T Consensus 136 G~Pak~i~~l 145 (161) T cd03359 136 GRPARFIGEL 145 (161) T ss_pred ECCEEEEEEC T ss_conf 5180895769 No 82 >cd03357 LbH_MAT_GAT Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively at the C6 position of the nonreducing end glucosyl moiety. GAT specifically acetylates galactopyranosides. Furthermore, MAT shows higher affinity toward artificial substrates containing an alkyl or hydrophobic chain as well as a glucosyl unit. Active MAT and GAT are homotrimers, with each subunit consisting of an N-terminal alpha-helical region and a C-terminal left-handed parallel alpha-helix (LbH) subdomain with 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=99.83 E-value=2.8e-20 Score=133.89 Aligned_cols=52 Identities=25% Similarity=0.578 Sum_probs=48.9 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEE Q ss_conf 6298788798869989378398899899778357664369868981356077 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIV 430 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~ 430 (442) +.+|||+||||+|++|+|||+||++|+|||||+|++|+|+++++.|.|.+++ T Consensus 118 PI~Igd~vWIG~~~~IlpGVtIG~~~vIgAGsVVtkdip~~~v~~GnPArvi 169 (169) T cd03357 118 PITIGDNVWIGGGVIILPGVTIGDNSVIGAGSVVTKDIPANVVAAGNPARVI 169 (169) T ss_pred CEEECCCCEECCCCEEECCCEECCCCEECCCCEECCCCCCCEEEEEECCEEC T ss_conf 5698974362899789099799999999889879746599919997586989 No 83 >cd05636 LbH_G1P_TT_C_like Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains. Members in this family are predicted to be glucose-1-phosphate thymidylyltransferases, which are involved in the dTDP-L-rhamnose biosynthetic pathway. Glucose-1-phosphate thymidylyltransferase catalyzes the synthesis of deoxy-thymidine di-phosphate (dTDP)-L-rhamnose, an important component of the cell wall of many microorganisms. The C-terminal LbH domain contains multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.83 E-value=1.7e-20 Score=135.06 Aligned_cols=124 Identities=11% Similarity=0.231 Sum_probs=72.6 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCC Q ss_conf 994112331784100122322344200021116313433-2121112100213443200000258644320231021124 Q gi|254780942|r 263 FLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIK 341 (442) Q Consensus 263 ~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~ 341 (442) .|++.+.|++++.|+++++|.+++.||++|+|+++++|. ++.||++|+||+++.|. +++|+++++|++++.+..|+|+ T Consensus 13 ~I~~~v~Ig~~a~I~~~a~I~~~v~IG~~~~Ig~~~~I~~~~~IG~~~~IG~~~~I~-~~vi~~~~~i~~~~~ig~siIG 91 (163) T cd05636 13 TIKGPVWIGEGAIVRSGAYIEGPVIIGKGCEIGPNAYIRGYTVLGDGCVVGNSVEVK-NSIIMDGTKVPHLNYVGDSVLG 91 (163) T ss_pred EECCCEEECCCCEECCCCEEECCEEECCCCEEEECCCCCCCEEECCEEEECCCEEEC-CCCCCCCCCCCCCCEECCCEEC T ss_conf 998981999999999999997975998998993641226985861301205512421-2401478432775687772992 Q ss_pred CCCCCCCCCCCC-------------------------CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEE Q ss_conf 785448745556-------------------------63988998997980883445775456298788798869989 Q gi|254780942|r 342 EGSKINHLSYVG-------------------------DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSL 394 (442) Q Consensus 342 ~g~~i~~~~~ig-------------------------d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i 394 (442) +++.++..+.++ -++||++|.||+++++.- ++.||++|+||+++++ T Consensus 92 ~~v~IGagtvi~n~~~~~~~~~v~~~~~~~~~~~~~~G~vIGd~~~IG~ns~I~~-------G~~IG~~a~IgaGsvV 162 (163) T cd05636 92 ENVNLGAGTITANLRFDDKPVKVRLKGERVDTGRRKLGAIIGDGVKTGINVSLNP-------GVKIGPGSWVYPGCVV 162 (163) T ss_pred CCCEECCCEEEECCCCCCCCEEEEECCEEECCCCEECCCEECCCCEECCCCEECC-------CEEECCCCEECCCCEE T ss_conf 9969889837817767886347840770623677040779779979998989869-------9599999899999796 No 84 >PRK09527 lacA galactoside O-acetyltransferase; Reviewed Probab=99.83 E-value=3.6e-20 Score=133.21 Aligned_cols=54 Identities=28% Similarity=0.543 Sum_probs=51.3 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEEC Q ss_conf 629878879886998937839889989977835766436986898135607707 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKE 432 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~ 432 (442) +.+||++||||++++|+|||+||++|+|||||+|++|+|+++++.|.|.+++|+ T Consensus 131 pi~Ig~~vwIG~~~~I~pGv~IG~~~vigAgsvVtkdip~~~v~~G~Parvir~ 184 (203) T PRK09527 131 PITIGNNVWIGSHVVINPGVTIGDNSVIGAGSVVTKDIPPNVVAAGVPCRVIRE 184 (203) T ss_pred CEEECCEEEECCCCEECCCCEECCCCEECCCCEECCCCCCCCEEEEECCEEEEE T ss_conf 859998689899999909929999979914988853069982999628288986 No 85 >COG4750 LicC CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes [Cell envelope biogenesis, outer membrane] Probab=99.83 E-value=7.4e-21 Score=137.13 Aligned_cols=163 Identities=21% Similarity=0.277 Sum_probs=107.3 Q ss_pred CEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCC- Q ss_conf 08999458877232788---87441258848579999999997799779999579889999973026972899985998- Q gi|254780942|r 5 RLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQ- 80 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~- 80 (442) |-|||||||.||||+|. |||+|+.|.|+|||+++|+.|.++|+.+|++||||.+|+..-...+. ++..+..+.- T Consensus 1 ~nAIIlAAG~gsR~~plT~~tpK~LlkV~g~plIErqI~~L~e~gI~dI~IVvGYlkE~FeYLkdKy--~vtLvyN~kY~ 78 (231) T COG4750 1 MNAIILAAGLGSRFVPLTQSTPKSLLKVNGEPLIERQIEQLREAGIDDITIVVGYLKEQFEYLKDKY--DVTLVYNPKYR 78 (231) T ss_pred CCEEEEECCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHCCCCEEEEEEEEHHHHHHHHHHHC--CEEEEECCHHH T ss_conf 9618994255665331310287678873585509999999997798618999631489999998715--74999574077 Q ss_pred -CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCC-CCCCCEEEECCCCCEECCCC Q ss_conf -8614466410000123446058985154111533220000001358610010010122-15744042023331001001 Q gi|254780942|r 81 -QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADN-PKGYGRLLIKNNEIIAIREE 158 (442) Q Consensus 81 -~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~d-p~~yGvV~~d~~~v~~i~Ek 158 (442) .-.--+++.+++.+. +..|+.+|++|..+.-+.. ..+.. -.+...+. ...+=.+..++++++++.= T Consensus 79 ~yNn~ySlyla~d~l~----ntYiidsDnyl~kNif~~~----~~~S~---Yfav~~~~~tnEw~l~~~~~~ki~~v~I- 146 (231) T COG4750 79 EYNNIYSLYLARDFLN----NTYIIDSDNYLTKNIFLTK----ESHSK---YFAVYRSGKTNEWLLIYNSDGKITRVDI- 146 (231) T ss_pred HHHHHHHHHHHHHHHC----CCEEECCCHHHHHHHHHCC----CCCCE---EEEEEECCCCCEEEEEECCCCCEEEEEE- T ss_conf 6630999999999852----5679505047666555257----35212---7899963877425799769984899996- Q ss_pred CCCCCCCCCCCCCCCCHHEECCHHHHHHH Q ss_conf 46675421000355600111250146777 Q gi|254780942|r 159 NDATDEERKIHYCNSGLMAIDGLYIMDWL 187 (442) Q Consensus 159 ~d~~~~~~~~~lin~GiY~f~~~~L~~~l 187 (442) .. ...++-+|+-.|++++....+ T Consensus 147 ---gg---~~~~imsG~sff~~~~~~ki~ 169 (231) T COG4750 147 ---GG---LNGYIMSGISFFDAQFSNKIK 169 (231) T ss_pred ---CC---CCCCEEEEEEEECCHHHHHHH T ss_conf ---68---556257511442342479999 No 86 >TIGR00453 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; InterPro: IPR001228 The bacterial ispD protein catalyzes the third step of the deoxyxylulose-5-phosphate pathway (DXP) of isoprenoid biosynthesis; the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate.; GO: 0008299 isoprenoid biosynthetic process. Probab=99.83 E-value=1.3e-19 Score=130.01 Aligned_cols=211 Identities=18% Similarity=0.283 Sum_probs=152.5 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCH-HHHHHHHCCCC--CEEEEEECCCCC Q ss_conf 8999458877232788874412588485799999999977-9977999957988-99999730269--728999859988 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGA-EEITRINFPPT--LSVEYYIQDCQQ 81 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~-e~i~~~~~~~~--~~i~~v~q~~~~ 81 (442) .|||||||+|+||.+..||.+++++|+|+|.|.|+.+.++ .+++|+||+.... +.+++...... ..++++.=-+.+ T Consensus 1 ~avi~AAG~G~R~~~~~pKQy~~l~G~Pll~h~~~~~~~~~~~~~vvVV~~~~~~~~~~~~~~~~~~~~~~~~v~GG~~R 80 (226) T TIGR00453 1 SAVIPAAGRGTRMGSGVPKQYLELAGRPLLEHTLDAFLAHPAIDEVVVVVSPDDTEFFQKALAARAKFKVVKIVAGGDTR 80 (226) T ss_pred CEEEEECCCCEECCCCCCEEEEEECCEEEHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHCCCCCCCCCEEECCCCCH T ss_conf 96873167111336798702665188022177899986323206468983674158888863256578611584698746 Q ss_pred CHHHHHHHCCCCCCC--CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEEC-CCCCEECCCC Q ss_conf 614466410000123--4460589851541115332200000013586100100101221574404202-3331001001 Q gi|254780942|r 82 GTAHAVLTAQDAIKP--GYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAIREE 158 (442) Q Consensus 82 GTa~Ai~~a~~~l~~--~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i~Ek 158 (442) +++|+++++.+.. ..+.+||.=|==||++...|.++++..+++..+++++.++.|. ....+ +|.+.+-++ T Consensus 81 --q~SV~~GL~a~~~~~~~~~VlvHDaARPf~~~~~~~~l~~~~~~~~~a~~~A~Pv~DT----lK~~~~~G~~~~T~d- 153 (226) T TIGR00453 81 --QDSVRNGLKALPERADAEIVLVHDAARPFVPKELIDRLLEALEKGAGAAILALPVADT----LKRVDADGFVVETVD- 153 (226) T ss_pred --HHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCE----EEEECCCCCEECCCC- T ss_conf --8999999998763589882898477345889879999999986079834874205431----577448886202666- Q ss_pred CCCCCCCCCCCCCCCCHH------EECCHHHHHHHHH-CCCCCCCCHHHHHHHHHHHHHH-HCCEEEEEECHHHEECCCC Q ss_conf 466754210003556001------1125014677753-2888740011121148999973-0771799917214301112 Q gi|254780942|r 159 NDATDEERKIHYCNSGLM------AIDGLYIMDWLLQ-IKKNKVSQEYYLTDIIEKARLD-GKSIASIDVKEQEVCGCNN 230 (442) Q Consensus 159 ~d~~~~~~~~~lin~GiY------~f~~~~L~~~l~~-l~~~~~~gE~yltDii~~l~~~-g~~i~~~~~~~~~~~gv~~ 230 (442) +. ++| +|+.+.|.+.+.+ -.. ..+.+||=-+.+..- |.++..+.-+.+. .-+.+ T Consensus 154 --------R~-----~Lw~~QTPQ~F~~~~L~~Ah~~~A~~----~g~~~TDDA~~~E~~~G~~v~LVeG~~~n-~KiT~ 215 (226) T TIGR00453 154 --------RE-----GLWAAQTPQAFRRELLLKAHARAAKA----EGFEITDDASAVERLFGGKVALVEGDADN-FKITT 215 (226) T ss_pred --------CC-----CCCEEECCCCCCHHHHHHHHHHHHHH----CCCCCCCHHHHHHHHCCCCEEEEECCCCC-EECCC T ss_conf --------21-----45255768876888999999999984----69851358999999638972887276101-10367 Q ss_pred CCCCCCCCCCC Q ss_conf 10000001222 Q gi|254780942|r 231 RYELSLIENIW 241 (442) Q Consensus 231 ~~~L~~~~~~~ 241 (442) |.||..++.++ T Consensus 216 P~DL~~Ae~~l 226 (226) T TIGR00453 216 PEDLALAEALL 226 (226) T ss_pred HHHHHHHHHCC T ss_conf 27899987509 No 87 >cd04646 LbH_Dynactin_6 Dynactin 6 (or subunit p27): Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p27 is part of the pointed-end subcomplex in dynactin that also includes p25, p26, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain the imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.82 E-value=5.6e-20 Score=132.14 Aligned_cols=130 Identities=18% Similarity=0.233 Sum_probs=84.6 Q ss_pred ECCCCCEECCCCCCCCCCCCEEEECCCCCCCCC----CCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCC Q ss_conf 317841001223223442000211163134332----1211121002134432000002586443202310211247854 Q gi|254780942|r 270 IQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEG----VHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSK 345 (442) Q Consensus 270 I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~----~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~ 345 (442) |++.+.|.|.+.|-++++||++|.|+++++|.+ ..||++|.|+++ ++|.....- ....+. T Consensus 2 I~~~a~I~~~A~iiGdV~IG~~~sI~p~avIrgd~g~I~IG~~~nIqd~------~~I~~~~~~-------~~~~g~--- 65 (164) T cd04646 2 IAPGAVVCQESEIRGDVTIGPGTVVHPRATIIAEAGPIIIGENNIIEEQ------VTIVNKKPK-------DPAEPK--- 65 (164) T ss_pred CCCCCEECCCCEEEEEEEECCCCEECCCCEEEECCCCEEECCCCEECCC------EEEECCCCC-------CCCCCC--- T ss_conf 4898799999999821799999989957599814378898999666698------599417877-------778998--- Q ss_pred CCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC--CCCCCEEE Q ss_conf 4874555663988998997980883445775456298788798869989378398899899778357664--36986898 Q gi|254780942|r 346 INHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD--TPENSLVF 423 (442) Q Consensus 346 i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d--v~~~~~v~ 423 (442) ..+||+|+.|+.|+++ |.+.||+++.||.++.++.|++||++|+|||||+|+.+ +||++|++ T Consensus 66 --------~~~IG~~~~i~~g~~i--------h~~~IG~~~lIg~~a~V~~ga~Ig~~ciIgags~V~~~~~ip~~slv~ 129 (164) T cd04646 66 --------PMIIGSNNVFEVGCKC--------EALKIGNNNVFESKSFVGKNVIITDGCIIGAGCKLPSSEILPENTVIY 129 (164) T ss_pred --------CEEECCCCEECCCEEE--------ECCEECCCCEECCCCEECCCCEECCCCEECCCCCCCCCCEECCCEEEE T ss_conf --------8798899888678199--------555976788896898999998999998999898928997929985997 Q ss_pred ECCCEEEE Q ss_conf 13560770 Q gi|254780942|r 424 ARSRQIVK 431 (442) Q Consensus 424 ~~~~~~~~ 431 (442) |.|.+..+ T Consensus 130 G~pa~~~~ 137 (164) T cd04646 130 GADCLRRT 137 (164) T ss_pred CCCEEEEE T ss_conf 88747812 No 88 >cd03351 LbH_UDP-GlcNAc_AT UDP-N-acetylglucosamine O-acyltransferase (UDP-GlcNAc acyltransferase): Proteins in this family catalyze the transfer of (R)-3-hydroxymyristic acid from its acyl carrier protein thioester to UDP-GlcNAc. It is the first enzyme in the lipid A biosynthetic pathway and is also referred to as LpxA. Lipid A is essential for the growth of Escherichia coli and related bacteria. It is also essential for maintaining the integrity of the outer membrane. UDP-GlcNAc acyltransferase is a homotrimer of left-handed parallel beta helix (LbH) subunits. Each subunit contains an N-terminal LbH region with 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal alpha-helical region. Probab=99.82 E-value=3.6e-20 Score=133.26 Aligned_cols=148 Identities=23% Similarity=0.259 Sum_probs=65.7 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCC------------CCCC Q ss_conf 94112331784100122322344200021116313433-2121112100213443200000258------------6443 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKN------------VRIG 330 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~------------~~Ig 330 (442) |++.+.|.+++.|+++|.|++.|.|+++++|+.+|.|. +++|..+++||+++.|.++++|+.. +.|| T Consensus 2 IHptAiI~~~A~IG~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~IG 81 (254) T cd03351 2 IHPTAIVDPGAKIGENVEIGPFCVIGPNVEIGDGTVIGSHVVIDGPTTIGKNNRIFPFASIGEAPQDLKYKGEPTRLEIG 81 (254) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEECCCCCCCCCCCCCCCCCCCCEEEEC T ss_conf 18977969989999979999998999999999999997996891997977652560575347787676737986769979 Q ss_pred CEEEEE-CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC Q ss_conf 202310-2112478544874555663988998997980883445775456298788798869989378398899899778 Q gi|254780942|r 331 NFCEVK-KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG 409 (442) Q Consensus 331 ~~~ei~-~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag 409 (442) ++++|+ +++|.+++..+. +...||++|.|.+++-++ |.+.||++|.+..++.+-..|+||++|++|++ T Consensus 82 ~~~~Ire~vtI~rgt~~~~----~~T~IG~~~~im~~~HIa-------Hdc~IG~~~iian~~~laGhv~Igd~a~IGg~ 150 (254) T cd03351 82 DNNTIREFVTIHRGTAQGG----GVTRIGNNNLLMAYVHVA-------HDCVIGNNVILANNATLAGHVEIGDYAIIGGL 150 (254) T ss_pred CCCEECCCCCCCCCCCCCC----CCEEECCCCEEEECCCCC-------CCCCCCCCEEEECCCCCCCCEEECCCCEECCC T ss_conf 9778995330236743788----755978885782025445-------67733797599334455685698998489883 Q ss_pred CEECCCC--CCCCEE Q ss_conf 3576643--698689 Q gi|254780942|r 410 SIITQDT--PENSLV 422 (442) Q Consensus 410 ~~v~~dv--~~~~~v 422 (442) +.|.+-+ -+++++ T Consensus 151 ~~v~q~v~IG~~a~I 165 (254) T cd03351 151 SAVHQFCRIGRHAMV 165 (254) T ss_pred CEECCCCEECCCEEE T ss_conf 677598199966199 No 89 >PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.82 E-value=3.2e-20 Score=133.55 Aligned_cols=147 Identities=18% Similarity=0.234 Sum_probs=72.5 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCC------------CCCC Q ss_conf 94112331784100122322344200021116313433-2121112100213443200000258------------6443 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKN------------VRIG 330 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~------------~~Ig 330 (442) |++.+.|++++.|+++|.|++.|.|+++++|+++|.|. +++|+.+++||++++|.++++||.. +.|| T Consensus 3 IHptAiI~~~A~Ig~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~IG 82 (256) T PRK12461 3 IHPTAVIDPSAKLGEGVEIGPFAVIGADVEIGDGTWIMPHAVILGWTRIGKNNKIHQGAVVGDEPQDFGYKGEESHLIIG 82 (256) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEEEECCCCCCCCCCCCCCCCCCEEEEC T ss_conf 99987989989999999999999999999999999994981891998999743997543117633344415862069989 Q ss_pred CEEEEE-CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC Q ss_conf 202310-2112478544874555663988998997980883445775456298788798869989378398899899778 Q gi|254780942|r 331 NFCEVK-KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG 409 (442) Q Consensus 331 ~~~ei~-~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag 409 (442) ++++|+ +++|.+|+.-++ ...||++|.+.+.+-++ |.+.||++|.+..++.+.-.++||+++++|+. T Consensus 83 d~~~Ire~~tI~rgt~~~~-----~T~IG~~~~im~~vhIa-------Hd~~IG~~~iian~~~laG~v~Igd~v~iGg~ 150 (256) T PRK12461 83 DNNVIREGVTIHRGTKGGT-----VTRIGNDNLLMANSHIA-------HDCQIGNNVILVNGALLAGHVTVGDRAIISGN 150 (256) T ss_pred CCCEECCCCCCCCCCCCCC-----CEEECCCCEEECCCCCC-------CCCCCCCCEEEECCEEECCCEEECCCCEECCC T ss_conf 9878798755678614898-----66988785874364334-------64313677067222023242599997299573 Q ss_pred CEECCCC--CCCCEE Q ss_conf 3576643--698689 Q gi|254780942|r 410 SIITQDT--PENSLV 422 (442) Q Consensus 410 ~~v~~dv--~~~~~v 422 (442) +.|.+-+ -+++++ T Consensus 151 ~~v~q~v~IG~~a~I 165 (256) T PRK12461 151 CLVHQFCRIGRLAMM 165 (256) T ss_pred CEECCCCEECCCCEE T ss_conf 477179799999699 No 90 >PRK10502 putative colanic acid biosynthesis acetyltransferase WcaF; Provisional Probab=99.82 E-value=6.8e-20 Score=131.65 Aligned_cols=54 Identities=19% Similarity=0.388 Sum_probs=51.2 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEEC Q ss_conf 629878879886998937839889989977835766436986898135607707 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKE 432 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~ 432 (442) +.+|||+|+||+++++.||++||++|+|||||+|++|+|+++++.|.|.+.+|+ T Consensus 125 pI~Igd~~wIga~a~I~pGv~Ig~gavigA~SvV~kdv~~~~i~~G~PA~~ik~ 178 (179) T PRK10502 125 PIVIGEGCWLATDVFVAPGVTIGDGAVVGARSSVFKSLPANTVCRGNPAVVIRQ 178 (179) T ss_pred CCEECCCEEECCCCEEECCCEECCCCEECCCCEEEEECCCCCEEEECCCEEEEC T ss_conf 858589969999969908899999979954998921779996999318478713 No 91 >PRK05289 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.82 E-value=5.2e-20 Score=132.30 Aligned_cols=150 Identities=21% Similarity=0.210 Sum_probs=72.3 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCC------------CCC Q ss_conf 994112331784100122322344200021116313433-2121112100213443200000258------------644 Q gi|254780942|r 263 FLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKN------------VRI 329 (442) Q Consensus 263 ~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~------------~~I 329 (442) .|++.+.|++++.|+++|.|++.|.|+++++|+++|.|. +++|..+++||+++.|.++++|+.. +.| T Consensus 4 ~IHptAiI~~~A~IG~~v~Igp~~vIg~~v~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~I 83 (261) T PRK05289 4 KIHPTAIVEPGAEIGENVEIGPFCVIGPDVVIGDGTEIGSHVVIDGHTTIGKNNRIFPFASIGEEPQDLKYKGEPTRLVI 83 (261) T ss_pred CCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCCCCCCCCCCCCCCCCCCCEEEE T ss_conf 40898898998999998999999999999999999999799689189888988345247533778766660698536998 Q ss_pred CCEEEEE-CCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECC Q ss_conf 3202310-211247854487455566398899899798088344577545629878879886998937839889989977 Q gi|254780942|r 330 GNFCEVK-KATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVAS 408 (442) Q Consensus 330 g~~~ei~-~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~a 408 (442) |++++|+ +++|.+|+.... +...||++|.|.+++-++ |.+.||++|.+..++.+.-.++||+++++|+ T Consensus 84 Gd~~~Ire~vtI~rgT~~~~----g~T~IG~~~~im~~vhIa-------Hd~~IG~~~iian~~~laG~v~Igd~v~iGg 152 (261) T PRK05289 84 GDNNTIREFVTINRGTVQGG----GVTRIGDNNLLMAYVHVA-------HDCVVGNNVILANNATLAGHVEVGDYAIIGG 152 (261) T ss_pred CCCCEECCCCEECCCCCCCC----CCEEECCCCEEECCCCCC-------CCCCCCCCCEECCCEEECCCEEECCCEEECC T ss_conf 99778985414436610588----644988885440036434-------4551278716543056646079889709976 Q ss_pred CCEECCCC--CCCCEEE Q ss_conf 83576643--6986898 Q gi|254780942|r 409 GSIITQDT--PENSLVF 423 (442) Q Consensus 409 g~~v~~dv--~~~~~v~ 423 (442) ++.|.+-+ -+++++- T Consensus 153 ~~~v~q~v~IG~~a~vg 169 (261) T PRK05289 153 LSGVHQFVRIGAHAMVG 169 (261) T ss_pred CCEECCCCEECCCEEEC T ss_conf 15664883989871999 No 92 >COG0663 PaaY Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only] Probab=99.82 E-value=8.1e-20 Score=131.22 Aligned_cols=72 Identities=28% Similarity=0.569 Sum_probs=64.0 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECC--CCCCCCEEEECCCEEEE Q ss_conf 6398899899798088344577545629878879886998937839889989977835766--43698689813560770 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQ--DTPENSLVFARSRQIVK 431 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~--dv~~~~~v~~~~~~~~~ 431 (442) .+.||++|+||-++++ |+|.||++|+||.|++++.|.+||++|+|||||+|+. -+|+++|++|.|.++.+ T Consensus 72 p~~IG~~vtIGH~aiv--------HGc~Ig~~~lIGmgA~vldga~IG~~~iVgAgalV~~~k~~p~~~L~~G~Pak~~r 143 (176) T COG0663 72 PVTIGDDVTIGHGAVV--------HGCTIGDNVLIGMGATVLDGAVIGDGSIVGAGALVTPGKEIPGGSLVVGSPAKVVR 143 (176) T ss_pred CEEECCCCEECCCCEE--------EEEEECCCCEEECCCEEECCCEECCCCEECCCCCCCCCCCCCCCEEEECCCCEEEE T ss_conf 9297799588576589--------88598788589138667578698898698668700588377997596237401440 Q ss_pred CC Q ss_conf 77 Q gi|254780942|r 432 ED 433 (442) Q Consensus 432 ~~ 433 (442) +- T Consensus 144 ~l 145 (176) T COG0663 144 PL 145 (176) T ss_pred CC T ss_conf 28 No 93 >TIGR01105 galF regulatory protein GalF; InterPro: IPR005774 This family of proteins, GalF, represents a non-catalytic subunit of the UTP-glucose pyrophosphorylase modulating the enzyme activity to increase the formation of UDP-glucose . ; GO: 0030234 enzyme regulator activity, 0006011 UDP-glucose metabolic process. Probab=99.81 E-value=6.9e-20 Score=131.61 Aligned_cols=217 Identities=19% Similarity=0.282 Sum_probs=160.0 Q ss_pred CCEEEEECCCCCCCCCCC---CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHH--------------- Q ss_conf 408999458877232788---87441258848579999999997799779999579889999973--------------- Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSS---SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRIN--------------- 65 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~---~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~--------------- 65 (442) ...|||..||.|-.|-|. .||-|+|+..||||.|+++....+|+++|+++++.-+..++.++ T Consensus 3 nlkaviPvaGlGmhmlPatkaiPkemlP~~dkPmiqyivdeivaaGikei~lvth~sknavenhfdtsyele~lleqrvk 82 (297) T TIGR01105 3 NLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVK 82 (297) T ss_pred CCCEECCCCCCHHHHHHHHHCCCHHHCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 54101221120123200111021121221143278899999886030103234201034454430203778999988888 Q ss_pred ----------CCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCC-------CCCCCCCCCCCCCCC Q ss_conf ----------026972899985998861446641000012344605898515411153-------322000000135861 Q gi|254780942|r 66 ----------FPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSS-------HTLKKAMDKIAQGYS 128 (442) Q Consensus 66 ----------~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~-------~~l~~l~~~~~~~~~ 128 (442) -.++..+--+.|.+|+|.||++.+++..+.+ .+|+|+--|+ .++. .+|..++..+.+... T Consensus 83 rqllaevq~iCPPGv~imnvrq~~PlGlGhsilCarP~~Gd--nPf~vvlPd~-~~d~~~adPlrynlaam~arfnetGr 159 (297) T TIGR01105 83 RQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGD--NPFVVVLPDI-IIDDATADPLRYNLAAMIARFNETGR 159 (297) T ss_pred HHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHEECCCCCCC--CCEEEEECCE-EECCCCCCHHHHHHHHHHHHHHCCCH T ss_conf 99988765228997179863047765444011011553268--8758980523-56177766167889999987520330 Q ss_pred CCCEEEECC-CCCCCCEEEECC--------CCCEECCCCCCCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHH Q ss_conf 001001012-215744042023--------33100100146675421000355600111250146777532888740011 Q gi|254780942|r 129 IAVVGFNAD-NPKGYGRLLIKN--------NEIIAIREENDATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEY 199 (442) Q Consensus 129 ~~i~~~~~~-dp~~yGvV~~d~--------~~v~~i~Ek~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~ 199 (442) ..+++.+.. |-+.|.++..+. .++++|+||||. |....+++...|-|+++.+.- .-|++..+..+ |.. T Consensus 160 sqvlakrm~GdlseysviqtkePl~~eG~v~rivefiekPd~-Pqtldsd~mavGryvlsadiW-~elert~PGaW-Gri 236 (297) T TIGR01105 160 SQVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQ-PQTLDSDLMAVGRYVLSADIW-AELERTEPGAW-GRI 236 (297) T ss_pred HHHHHHHCCCCCCHHEEEECCCCHHHCCCEEEEEHHHCCCCC-CCCHHHHHHHHHHHHHHHHHH-HHHHCCCCCCC-CCH T ss_conf 233221036652000011023402124741221000208898-752134677663465434445-55431689863-101 Q ss_pred HHHHHHHHHHHHHCCEEEEEEC-HHHEEC Q ss_conf 1211489999730771799917-214301 Q gi|254780942|r 200 YLTDIIEKARLDGKSIASIDVK-EQEVCG 227 (442) Q Consensus 200 yltDii~~l~~~g~~i~~~~~~-~~~~~g 227 (442) .|||.+..+.++. .+.+.... +.+++| T Consensus 237 qltdaiaelakkq-svdamlmtG~sydCG 264 (297) T TIGR01105 237 QLTDAIAELAKKQ-SVDAMLMTGDSYDCG 264 (297) T ss_pred HHHHHHHHHHHHH-HHHHHEECCCCCHHH T ss_conf 4778999887431-022310116741001 No 94 >cd02516 CDP-ME_synthetase CDP-ME synthetase is involved in mevalonate-independent isoprenoid production. 4-diphosphocytidyl-2-methyl-D-erythritol synthase (CDP-ME), also called 2C-methyl-d-erythritol 4-phosphate cytidylyltransferase catalyzes the third step in the alternative (non-mevalonate) pathway of Isopentenyl diphosphate (IPP) biosynthesis: the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate. This mevalonate independent pathway that utilizes pyruvate and glyceraldehydes 3-phosphate as starting materials for production of IPP occurs in a variety of bacteria, archaea and plant cells, but is absent in mammals. Thus, CDP-ME synthetase is an attractive targets for the structure-based design of selective antibacterial, herbicidal and antimalarial drugs. Probab=99.81 E-value=5e-19 Score=126.74 Aligned_cols=210 Identities=19% Similarity=0.246 Sum_probs=135.4 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCHH-HHHHHHCC-CCCEEEEEECCCCCC Q ss_conf 8999458877232788874412588485799999999977-99779999579889-99997302-697289998599886 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGAE-EITRINFP-PTLSVEYYIQDCQQG 82 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~e-~i~~~~~~-~~~~i~~v~q~~~~G 82 (442) .|||||||.|+||.+..||++++++|+||++|.++.+.+. .+++|+++++.+.. .+++.... ....+.++.--..+ T Consensus 2 ~aIIlAaG~G~R~~~~~PKQf~~i~gk~ii~~sl~~f~~~~~i~~Iivv~~~~~~~~~~~~~~~~~~~~i~~v~GG~tR- 80 (218) T cd02516 2 AAIILAAGSGSRMGADIPKQFLELGGKPVLEHTLEAFLAHPAIDEIVVVVPPDDIDLAKELAKYGLSKVVKIVEGGATR- 80 (218) T ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHHHCCCCCEEEECCCCCH- T ss_conf 8999268347148857985233589679999999999758998979999683789999998864479876998998409- Q ss_pred HHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECC-CCCEECCCCCC Q ss_conf 144664100001234-4605898515411153322000000135861001001012215744042023-33100100146 Q gi|254780942|r 83 TAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKN-NEIIAIREEND 160 (442) Q Consensus 83 Ta~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~-~~v~~i~Ek~d 160 (442) .++++++++.+... .+.++|.-+.-||++...++.+++..++ .++++.+.++.|. +...++ +.+.+.+.. T Consensus 81 -~~SV~ngl~~l~~~~~~~VlIHDaaRP~i~~~~i~~li~~~~~-~~~~i~~~p~~DT----ik~~~~~~~i~~t~dR-- 152 (218) T cd02516 81 -QDSVLNGLKALPDADPDIVLIHDAARPFVSPELIDRLIDALKE-YGAAIPAVPVTDT----IKRVDDDGVVVETLDR-- 152 (218) T ss_pred -HHHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHH-CCCEEEEECCHHC----EEEECCCCEEEEECCC-- T ss_conf -9999989874012699989992576778999999999999973-7871898425101----7996389779862376-- Q ss_pred CCCCCCCCCCCCCC-HHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCC Q ss_conf 67542100035560-01112501467775328887400111211489999730771799917214301112100000 Q gi|254780942|r 161 ATDEERKIHYCNSG-LMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSL 236 (442) Q Consensus 161 ~~~~~~~~~lin~G-iY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~ 236 (442) .++..+- =-+|+.+.|++++++...+ +++.||-.+.+...|.++..+..+..- .-+.++.||.. T Consensus 153 -------~~l~~~QTPQ~F~~~~l~~a~~~~~~~----~~~~TDd~sl~~~~g~~v~~v~G~~~N-~KIT~~eDl~~ 217 (218) T cd02516 153 -------EKLWAAQTPQAFRLDLLLKAHRQASEE----GEEFTDDASLVEAAGGKVALVEGSEDN-IKITTPEDLAL 217 (218) T ss_pred -------CCEEEEECCCCCCHHHHHHHHHHHHHC----CCCCCCHHHHHHHCCCCEEEEECCCCC-CCCCCHHHHHH T ss_conf -------137987487542599999999999864----999785899999869976999758755-85998999824 No 95 >cd03349 LbH_XAT Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. Members of this class of enzymes include Enterococcus faecium streptogramin A acetyltransferase and Pseudomonas aeruginosa chloramphenicol acetyltransferase. They contain repeated copies of a six-residue hexapeptide repeat sequence motif (X-[STAV]-X-[LIV]-[GAED]-X) and adopt a left-handed parallel beta helix (LbH) structure. The active enzyme is a trimer with CoA and substrate binding sites at the interface of two separate LbH subunits. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Probab=99.81 E-value=1.4e-19 Score=129.88 Aligned_cols=54 Identities=24% Similarity=0.583 Sum_probs=50.9 Q ss_pred CCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEE Q ss_conf 562987887988699893783988998997783576643698689813560770 Q gi|254780942|r 378 YKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVK 431 (442) Q Consensus 378 ~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~ 431 (442) .+++||++||||+|++|+||++||++|+|||||+|++||||++++.|.|.+++| T Consensus 72 ~~i~Ig~~vwIG~n~~Il~GV~IG~gavIgAgSvVtkdvp~~~i~~G~PAk~ik 125 (145) T cd03349 72 GDVIIGNDVWIGHGATILPGVTIGDGAVIAAGAVVTKDVPPYAIVGGNPAKVIR 125 (145) T ss_pred CCEEECCCEEECCCCEEECCCEECCCCEEECCCEECCCCCCCCEEEEECCEEEE T ss_conf 997988996999988895880967886892897897046999499833887881 No 96 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=99.80 E-value=2.2e-19 Score=128.76 Aligned_cols=101 Identities=20% Similarity=0.414 Sum_probs=47.5 Q ss_pred CCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCC----CCC----CCE Q ss_conf 4200021116313433-2121112100213443200000258644320231021124785448745----556----639 Q gi|254780942|r 286 VSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLS----YVG----DSV 356 (442) Q Consensus 286 ~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~----~ig----d~~ 356 (442) +.||++|.|||+++|. ++.|++++.||+|+-+. +|.|++++++++..-+..|.||+++.++..+ |=| -.+ T Consensus 318 s~Ig~~~~IGPfA~lRp~t~i~~~~~iGnfvEiK-~s~i~~g~k~~HlsYiGDa~iG~~~NiGAGtit~NyDG~~K~~t~ 396 (456) T PRK09451 318 ANLGAACTIGPFARLRPGAELLEGAHVGNFVEMK-KARLGKGSKAGHLTYLGDAEIGDNVNIGAGTITCNYDGANKFKTI 396 (456) T ss_pred CEECCCCEECCCCCCCCCCEECCCCEEEEEEEEE-CCEECCCCEECCEEEECCCEECCCCEECCCEEEEECCCCCCCCCE T ss_conf 6343671688864348876236788882289982-459758977042336614476588688887699722487665648 Q ss_pred ECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEE Q ss_conf 88998997980883445775456298788798869989 Q gi|254780942|r 357 VGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSL 394 (442) Q Consensus 357 iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i 394 (442) ||++|+||.+|.+.- ++.||++++||++++| T Consensus 397 igd~~fiGsn~~lva-------pv~iG~~a~i~aGs~i 427 (456) T PRK09451 397 IGDDVFVGSDTQLVA-------PVTVGKGATIGAGTTV 427 (456) T ss_pred ECCCCEECCCCEEEC-------CEEECCCCEECCCCEE T ss_conf 789829987844835-------8498899799989878 No 97 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=99.79 E-value=2.2e-19 Score=128.80 Aligned_cols=106 Identities=24% Similarity=0.397 Sum_probs=84.4 Q ss_pred CCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCC----CCC----C Q ss_conf 344200021116313433-2121112100213443200000258644320231021124785448745----556----6 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLS----YVG----D 354 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~----~ig----d 354 (442) +++.|+++|.|||+++|. +|.+++++.||+|+=+ -+|.||++++.++-+-+-+|-||+++.|+-.+ |=| - T Consensus 322 e~~~ig~~~~vGPFArLRP~~~L~~~~hiGNFVE~-Kna~iG~gsKA~HLsYlGDAeiG~~vNiGAGtITcNYDG~nK~~ 400 (461) T TIGR01173 322 EGSEIGEGCDVGPFARLRPGSVLGAGVHIGNFVEV-KNARIGEGSKAGHLSYLGDAEIGSNVNIGAGTITCNYDGVNKHK 400 (461) T ss_pred CCCEECCCCEECCCCCCCCCCHHHCCCEEEEEEEE-ECCEECCCCCCCCCCEEEEEEECCCCEECCEEEEEECCCCCCCC T ss_conf 46786156402773001864321177816025865-24761886442314333202507960032316899327962203 Q ss_pred CEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECC Q ss_conf 3988998997980883445775456298788798869989378 Q gi|254780942|r 355 SVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAP 397 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~g 397 (442) ++||++++||+.|.+.- ++.||++++||++++|--. T Consensus 401 T~IGd~VFiGSnt~LVA-------PV~iG~gA~iaAGstvt~D 436 (461) T TIGR01173 401 TIIGDGVFIGSNTQLVA-------PVKIGDGATIAAGSTVTKD 436 (461) T ss_pred CEECCCCEECCCCEEEC-------CEEECCCCEECCCCEEECC T ss_conf 48648878776760455-------4394381187124148023 No 98 >cd00710 LbH_gamma_CA Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three distinct groups of carbonic anhydrases - alpha, beta and gamma - which show no significant sequence identity or structural similarity. Gamma CAs are homotrimeric enzymes, with each subunit containing a left-handed parallel beta helix (LbH) structural domain. Probab=99.79 E-value=2.3e-19 Score=128.59 Aligned_cols=135 Identities=27% Similarity=0.384 Sum_probs=76.2 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 94112331784100122322344200021116313433212111210021344320000025864432023102112478 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) +++++.|.+++++-+.+.|++++.|+.+|+|+... .....||++|.|+++ ++|.- .-+ ..+.|+++ T Consensus 5 i~~~afIap~A~viGdV~ig~~~sIw~~aviRgD~-~~~I~IG~~~nIqD~------~viH~--~~~-----~~~~IG~~ 70 (167) T cd00710 5 IDPSAYVHPTAVVIGDVIIGDNVFVGPGASIRADE-GTPIIIGANVNIQDG------VVIHA--LEG-----YSVWIGKN 70 (167) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCCEEECCC-CCCEEECCCCCCCCC------EEEEC--CCC-----CCEEECCC T ss_conf 69993989999899708999999998887894788-887698887434777------07822--576-----67497798 Q ss_pred CCCCCCCCC-CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCE Q ss_conf 544874555-663988998997980883445775456298788798869989378398899899778357664369868 Q gi|254780942|r 344 SKINHLSYV-GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSL 421 (442) Q Consensus 344 ~~i~~~~~i-gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~ 421 (442) +.|+|.+++ |.|.||++|.||.|+++.| ++||++|.||+++.+ +|++|.++.++.+|+.|++..|+..+ T Consensus 71 vtIgh~a~ihG~c~Igd~~~IG~gaiv~~--------a~Ig~~~vIgags~V-~g~~i~~g~~v~~G~~v~~~~pa~~~ 140 (167) T cd00710 71 VSIAHGAIVHGPAYIGDNCFIGFRSVVFN--------AKVGDNCVIGHNAVV-DGVEIPPGRYVPAGAVITSQTQADAL 140 (167) T ss_pred CEECCCCEECCEEEECCCCEECCCCEEEC--------CEECCCCEECCCCEE-ECCCCCCCCEECCCCEEECCCCHHHC T ss_conf 28989848812049839988988989982--------597699899889888-47783999895588288369966973 No 99 >cd04182 GT_2_like_f GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=99.79 E-value=2.9e-19 Score=128.04 Aligned_cols=118 Identities=31% Similarity=0.516 Sum_probs=98.0 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEEC-CCCCCHH Q ss_conf 899945887723278887441258848579999999997799779999579889999973026972899985-9988614 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQ-DCQQGTA 84 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q-~~~~GTa 84 (442) .|||||||+|+||. .+|+|+++.|+||++|+++.+..+++.++++++++..+.+...+..... .++.. ....|.+ T Consensus 2 ~~iILAgG~s~RmG--~~K~l~~~~g~pli~~~i~~~~~~~~~~i~vv~~~~~~~~~~~~~~~~~--~~v~~~~~~~G~~ 77 (186) T cd04182 2 AAIILAAGRSSRMG--GNKLLLPLDGKPLLRHALDAALAAGLSRVIVVLGAEADAVRAALAGLPV--VVVINPDWEEGMS 77 (186) T ss_pred EEEEECCCCCCCCC--CCEECCEECCCCHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHCCCCC--EEEECCCCCCCCC T ss_conf 69998997653599--8912558997319999999999749983252377407899998547993--6751787567853 Q ss_pred HHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4664100001234460589851541115332200000013586 Q gi|254780942|r 85 HAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGY 127 (442) Q Consensus 85 ~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~ 127 (442) .+++.++.++....+.++|+.||+|+++...+..|++.+.... T Consensus 78 ~si~~gl~~~~~~~~~~lv~~~D~P~i~~~~i~~L~~~~~~~~ 120 (186) T cd04182 78 SSLAAGLEALPADADAVLILLADQPLVTAETLRALIDAFREDG 120 (186) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 7899999973348876898437778789999999999997589 No 100 >PRK10092 maltose O-acetyltransferase; Provisional Probab=99.79 E-value=6.4e-19 Score=126.12 Aligned_cols=54 Identities=33% Similarity=0.591 Sum_probs=51.6 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEEC Q ss_conf 629878879886998937839889989977835766436986898135607707 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKE 432 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~ 432 (442) +.+||++||||.+++|+|||+||++|+|||||+|++|+|+++++.|.|.+++|+ T Consensus 129 pi~Ig~~vwig~~~~i~pGv~IG~~~vigagsvV~~dip~~~v~~G~Par~ir~ 182 (183) T PRK10092 129 PVTIGNNVWIGGRAVINPGVTIGDNVVVASGAVVTKDVPDNVVVGGNPARIIKK 182 (183) T ss_pred CEEECCCCEECCCCEEECCCEECCCCEECCCCEEEEECCCCCEEEEECCEEEEC T ss_conf 779899868889989907989999989975988922779992999747578861 No 101 >cd03353 LbH_GlmU_C N-acetyl-glucosamine-1-phosphate uridyltransferase (GlmU), C-terminal left-handed beta-helix (LbH) acetyltransferase domain: GlmU is also known as UDP-N-acetylglucosamine pyrophosphorylase. It is a bifunctional bacterial enzyme that catalyzes two consecutive steps in the formation of UDP-N-acetylglucosamine (UDP-GlcNAc), an important precursor in bacterial cell wall formation. The two enzymatic activities, uridyltransferase and acetyltransferase, are carried out by two independent domains. The C-terminal LbH domain possesses the acetyltransferase activity. It catalyzes the CoA-dependent acetylation of GlcN-1-phosphate to GlcNAc-1-phosphate. The LbH domain contains 10 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X. The acetyltransferase active site is located at the interface between two subunits of the active LbH trimer. Probab=99.79 E-value=7e-19 Score=125.89 Aligned_cols=142 Identities=23% Similarity=0.361 Sum_probs=84.4 Q ss_pred EECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCC-CC Q ss_conf 00122322344200021116313433212111210021344320000025864432023102112478544874555-66 Q gi|254780942|r 276 IEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYV-GD 354 (442) Q Consensus 276 I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~i-gd 354 (442) |+|++.||+++.|+++|+|++ ++.||++|.|||++.|+ ++.|+++++|.+++.+.++.+++++.++..+++ .. T Consensus 12 I~~~v~IG~~v~I~p~~~I~~-----~v~IG~~~~Ig~~~~I~-~~~IG~~~~I~~~~vi~~~~ig~~~~IG~~~~i~~~ 85 (193) T cd03353 12 IDGDVEIGVDVVIDPGVILEG-----KTVIGEDCVIGPNCVIK-DSTIGDGVVIKASSVIEGAVIGNGATVGPFAHLRPG 85 (193) T ss_pred ECCCCEECCCCEECCCEEEEC-----CCEECCCCEECCCEEEC-CCEECCCCEECCCEECCCCEECCCCEECCCEEECCC T ss_conf 999829999999989829929-----85999998995734577-868888740021001135251765266685076587 Q ss_pred CEECCCCEECCCEEECCC---CCC-CCC-----CCEECCCCEECCCCEEEC-------CEEECCCCEECCCCEECCCC-- Q ss_conf 398899899798088344---577-545-----629878879886998937-------83988998997783576643-- Q gi|254780942|r 355 SVVGKNVNIGAGTITCNY---DGT-HKY-----KTHINENAFIGSNSSLIA-------PITIGQGTYVASGSIITQDT-- 416 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~---~g~-~~~-----~~~ig~~~~iG~~~~i~~-------gv~ig~~~~i~ag~~v~~dv-- 416 (442) ++|++++.||..+.+-|- ++. -.| .+.||++|.||+++.+.- .+.||++|++|+++.+...+ T Consensus 86 ~~I~~~~~iG~~v~i~~~~ig~~~~i~h~~~igd~~Ig~~v~Ig~~~i~~n~~g~~~~~t~IGd~v~IG~~~~i~~~v~I 165 (193) T cd03353 86 TVLGEGVHIGNFVEIKKSTIGEGSKANHLSYLGDAEIGEGVNIGAGTITCNYDGVNKHRTVIGDNVFIGSNSQLVAPVTI 165 (193) T ss_pred CEECCEEEECCCEEEECCCCCCCCEECCEEECCCCEECCCCEECCCEEEECCCCCCCCCCEECCCCEECCCCEEECCEEE T ss_conf 68875167425357644433688487724461885677788886675887343433562343356189889879898399 Q ss_pred CCCCEEE Q ss_conf 6986898 Q gi|254780942|r 417 PENSLVF 423 (442) Q Consensus 417 ~~~~~v~ 423 (442) -+++++- T Consensus 166 G~~a~Ig 172 (193) T cd03353 166 GDGATIA 172 (193) T ss_pred CCCCEEC T ss_conf 9998998 No 102 >PRK13385 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Provisional Probab=99.79 E-value=2.3e-18 Score=122.90 Aligned_cols=224 Identities=14% Similarity=0.157 Sum_probs=138.2 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCH-HHHHHHHCCC---CCEEEEEECC Q ss_conf 408999458877232788874412588485799999999977-9977999957988-9999973026---9728999859 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGA-EEITRINFPP---TLSVEYYIQD 78 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~-e~i~~~~~~~---~~~i~~v~q~ 78 (442) +..+||||||.|+||...+||+++++.|+|+++|.++.+.+. .+++|++++.... +.+++..... ...+.++.-- T Consensus 2 ~~~~IilAaG~G~R~~~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvv~~~~~~~~~~~~~~~~~~~~~~v~iv~GG 81 (238) T PRK13385 2 NYELIFLAAGQGKMGNVPLPKQFLDIDNKPILIHTIEKFILVSEFNEIIIATPAQWISHTQDILKKYNITDQRVKVVAGG 81 (238) T ss_pred CEEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCCCCCCEEEECCC T ss_conf 56999977734614886899643279978899999999855876787999677888999999998615444768993289 Q ss_pred CCCCHHHHHHHCCCCCCC----CCCE-EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCE Q ss_conf 988614466410000123----4460-58985154111533220000001358610010010122157440420233310 Q gi|254780942|r 79 CQQGTAHAVLTAQDAIKP----GYDD-VIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEII 153 (442) Q Consensus 79 ~~~GTa~Ai~~a~~~l~~----~~~~-~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~ 153 (442) +.+ .++++++++++.. ..++ ++|.-|--||++...+.+.++...+...+++...+++|. +...+++.+. T Consensus 82 ~tR--~~SV~ngl~~l~~~~~~~~~~~VlIHDaARP~vs~~~i~~~i~~~~~~~~~~~~~~~~~dt----i~~~~~~~i~ 155 (238) T PRK13385 82 TDR--NETIMNIIDHIRNVNGINNDDVIVTHDAVRPFLTQRIIKENIEVAVKYGAVDTVIEAIDTI----VMSKDKQNIH 155 (238) T ss_pred CCH--HHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCE----EEEECCCCEE T ss_conf 864--8999999999874114787688999546677899999999999987219737999613445----9992699524 Q ss_pred ECCCCCCCCCCCCCCCCCCCC-HHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCC Q ss_conf 010014667542100035560-0111250146777532888740011121148999973077179991721430111210 Q gi|254780942|r 154 AIREENDATDEERKIHYCNSG-LMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRY 232 (442) Q Consensus 154 ~i~Ek~d~~~~~~~~~lin~G-iY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~ 232 (442) +.+. +.++..+= =-.|+.+.|.+....+...+ .+..||-.+.+...|.++..+..+..- .-+.+|. T Consensus 156 ~t~d---------R~~l~~~QTPQ~F~~~~L~~a~~~~~~~~---~~~~TDdasl~e~~g~~v~~v~G~~~N-iKIT~~e 222 (238) T PRK13385 156 SIPV---------RNEMYQGQTPQSFNIKLLQDSYRALSSEQ---KEILSDACKIIVESGHPVKLVRGELYN-IKVTTPY 222 (238) T ss_pred ECCC---------HHHHHHHCCCCCCCHHHHHHHHHHHHHCC---CCCCCCHHHHHHHCCCCEEEEECCCCC-CCCCCHH T ss_conf 1268---------78820002996135999999999998628---998888999998779966999669634-7799899 Q ss_pred CCCCCCCCCCCCCC Q ss_conf 00000122222112 Q gi|254780942|r 233 ELSLIENIWQSRYR 246 (442) Q Consensus 233 ~L~~~~~~~~~~~~ 246 (442) ||..++.+++..++ T Consensus 223 DL~~a~~ll~~~~~ 236 (238) T PRK13385 223 DLKVANAIIQGDIA 236 (238) T ss_pred HHHHHHHHHCCCCC T ss_conf 99999999754656 No 103 >TIGR03310 matur_ygfJ molybdenum hydroxylase accessory protein, YgfJ family. Members of this protein family are probable accessory proteins for the biosynthesis of enzymes related to xanthine dehydrogenase. Comparative genomics suggests a role in the maturation of selenium-dependent molybdenum hydroxylases, although a tenuous alternative hypothesis is a role for this protein (with a requirement for SelD, the selenium donor protein in the selenocysteine and selenouridine biosynthesis pathways) metabolizing a selenium-containing substrate such as selenate. Probab=99.78 E-value=1e-18 Score=124.93 Aligned_cols=118 Identities=24% Similarity=0.369 Sum_probs=96.1 Q ss_pred EEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECC-CCCCHHH Q ss_conf 999458877232788874412588485799999999977997799995798899999730269728999859-9886144 Q gi|254780942|r 7 AIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQD-CQQGTAH 85 (442) Q Consensus 7 AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~-~~~GTa~ 85 (442) |||||||+++||. .+|+|+|+.|+||++|+++.+.+++++++++|+++..+++.+.... ...++++..+ ...|++. T Consensus 2 AIILAaG~S~RmG--~~K~Ll~~~g~~ll~~~i~~~~~~~~~~vivv~~~~~~~~~~~~~~-~~~~~~v~n~~~~~G~~~ 78 (188) T TIGR03310 2 AIILAAGLSSRMG--QNKLLLPYKGKTILEHVVDNALRLFFDEVILVLGHEADELVALLAN-HSNITLVHNPQYAEGQSS 78 (188) T ss_pred EEEECCCCCCCCC--CCCCCCEECCCCHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHCC-CCCCCEECCCCCCCCHHH T ss_conf 6998898766598--8900688898329999999998569996699827837899998626-689626548655678089 Q ss_pred HHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 6641000012344605898515411153322000000135861 Q gi|254780942|r 86 AVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYS 128 (442) Q Consensus 86 Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~ 128 (442) +|+.+...... .+.++|+.||+|+++...++.|++.+..+.+ T Consensus 79 si~~gl~~~~~-~~~~li~~~D~P~l~~~~i~~L~~~~~~~~~ 120 (188) T TIGR03310 79 SIKLGLELPVQ-SDGYLFLLGDQPFVTPDIIQLLLEAFALKND 120 (188) T ss_pred HHHHHHHCCCC-CCEEEEEECCCCCCCHHHHHHHHHHHHHCCC T ss_conf 99999970657-8859997188788999999999999985799 No 104 >COG1211 IspD 4-diphosphocytidyl-2-methyl-D-erithritol synthase [Lipid metabolism] Probab=99.78 E-value=9.2e-18 Score=119.52 Aligned_cols=217 Identities=18% Similarity=0.291 Sum_probs=148.1 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECC-CHHHHHHHH-CCCCCEEEEEEC Q ss_conf 987408999458877232788874412588485799999999977-99779999579-889999973-026972899985 Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGY-GAEEITRIN-FPPTLSVEYYIQ 77 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~-~~e~i~~~~-~~~~~~i~~v~q 77 (442) |...+.|||||||+||||....||.++++.|+||++|.|+.+..+ .+++|+++++. ..+.+.++. ......+.++.. T Consensus 1 ~~~~~~~vilAaG~G~R~~~~~pKq~l~l~g~pll~~tl~~f~~~~~i~~ivvv~~~~~~~~~~~~~~~~~~~~v~~v~G 80 (230) T COG1211 1 MRMMVSAVILAAGFGSRMGNPVPKQYLELGGRPLLEHTLEAFLESPAIDEIVVVVSPEDDPYFEKLPKLSADKRVEVVKG 80 (230) T ss_pred CCCEEEEEEECCCCCCCCCCCCCCEEEEECCEEEHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 97407999975763433579998458899998856999999974767676999978466489998632124870799348 Q ss_pred CCCCCHHHHHHHCCCCCCCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEEC-CCCCEEC Q ss_conf 99886144664100001234-460589851541115332200000013586100100101221574404202-3331001 Q gi|254780942|r 78 DCQQGTAHAVLTAQDAIKPG-YDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIK-NNEIIAI 155 (442) Q Consensus 78 ~~~~GTa~Ai~~a~~~l~~~-~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d-~~~v~~i 155 (442) -+.+ .++++++++.++.. .+.|||.-+--||++...++++++ ......+++++.++.|.- ...+ ++.+.+. T Consensus 81 G~~R--~~SV~~gL~~~~~~~~~~VlvHDaaRPf~~~~~i~~li~-~~~~~~aai~alpv~DTi----k~~~~~~~i~~t 153 (230) T COG1211 81 GATR--QESVYNGLQALSKYDSDWVLVHDAARPFLTPKLIKRLIE-LADKYGAAILALPVTDTL----KRVDADGNIVET 153 (230) T ss_pred CCCH--HHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHH-HHCCCCCEEEEEECCCCE----EEECCCCCEEEC T ss_conf 7517--999999999721158988999646667899999999998-622388179975046737----886589872344 Q ss_pred CCCCCCCCCCCCCCCCCCCHH------EECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCC Q ss_conf 001466754210003556001------11250146777532888740011121148999973077179991721430111 Q gi|254780942|r 156 REENDATDEERKIHYCNSGLM------AIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCN 229 (442) Q Consensus 156 ~Ek~d~~~~~~~~~lin~GiY------~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~ 229 (442) .. ++ ++| .|+.+.|.+.+.+...+. +.+||--..+...|.++..+..+... .-+. T Consensus 154 ~~---------R~-----~l~~~QTPQ~F~~~~L~~a~~~a~~~~----~~~tDdas~~e~~G~~v~lV~G~~~n-~KiT 214 (230) T COG1211 154 VD---------RS-----GLWAAQTPQAFRLELLKQALARAFAEG----REITDDASAIEKAGGPVSLVEGSADN-FKIT 214 (230) T ss_pred CC---------HH-----HHHHHHCCCCCCHHHHHHHHHHHHHCC----CCCCCHHHHHHHCCCCEEEEECCCCE-EEEC T ss_conf 37---------55-----423433876331999999999998628----77688699998769981998268330-5762 Q ss_pred CCCCCCCCCCCCCC Q ss_conf 21000000122222 Q gi|254780942|r 230 NRYELSLIENIWQS 243 (442) Q Consensus 230 ~~~~L~~~~~~~~~ 243 (442) +|.||..++.++.. T Consensus 215 tpeDL~~a~~il~~ 228 (230) T COG1211 215 TPEDLEIAEAILRR 228 (230) T ss_pred CHHHHHHHHHHHCC T ss_conf 88899999987526 No 105 >TIGR01852 lipid_A_lpxA acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase; InterPro: IPR010137 This entry describes LpxA, an enzyme for the biosynthesis of lipid A, a component of lipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat.; GO: 0008780 acyl-[acyl-carrier-protein]-UDP-N-acetylglucosamine O-acyltransferase activity, 0008610 lipid biosynthetic process. Probab=99.77 E-value=6.4e-19 Score=126.11 Aligned_cols=53 Identities=13% Similarity=0.220 Sum_probs=23.0 Q ss_pred CEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECC Q ss_conf 398899899798088344577545629878879886998937839889989977835766 Q gi|254780942|r 355 SVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQ 414 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~ 414 (442) ..||+||-+.|.+=++ |.|+|||+|.+..|++|---|.|||+|+||.+|.|-+ T Consensus 102 T~iG~~nllMAysHvA-------HDC~vGn~vv~aN~a~LAGHV~vgD~a~iGG~~avHQ 154 (257) T TIGR01852 102 TSIGNNNLLMAYSHVA-------HDCVVGNHVVLANNATLAGHVEVGDYAIIGGLVAVHQ 154 (257) T ss_pred EEECCCCEECCCCEEE-------ECCEECCEEEEECCCEECEEEEECCEEEECCCCCEEE T ss_conf 9988876572053561-------2436888169813631222799888999778972110 No 106 >PRK02726 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=99.76 E-value=1.1e-17 Score=119.10 Aligned_cols=195 Identities=17% Similarity=0.256 Sum_probs=120.7 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECC-C Q ss_conf 987408999458877232788874412588485799999999977997799995798899999730269728999859-9 Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQD-C 79 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~-~ 79 (442) |+.+..|||||||+++||. .+|+|+++.|+|+++|+++.+... .+++++++. ..++...+.. ..++++... + T Consensus 4 m~~~i~~vILAGG~S~RMG--~dK~ll~~~G~~ll~~~~~~l~~~-~~~v~i~~~-~~~~~~~~~~---~~~~~i~D~~~ 76 (200) T PRK02726 4 MKNNLVALILAGGKSSRMG--QDKALITWQGVPLLQRVCRIAAAC-ADEVYIITP-WPERYQSLLP---PGCHWLREPPP 76 (200) T ss_pred CCCCCEEEEECCCCHHHCC--CCCCCCEECCEEHHHHHHHHHHHH-CCEEEEECC-CHHHHHHHCC---CCCEEEECCCC T ss_conf 5577179998576540279--984338789875999999998741-886999679-9788775258---88579016998 Q ss_pred CCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCC Q ss_conf 88614466410000123446058985154111533220000001358610010010122157440420233310010014 Q gi|254780942|r 80 QQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREEN 159 (442) Q Consensus 80 ~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~ 159 (442) ..|.-.+|+.++.+++. +.++++.+|+|+++...++.+++...+..+....... .++++. T Consensus 77 ~~GPL~Gi~saL~~~~~--~~~lv~~cD~P~l~~~~i~~l~~~l~~~~~~~~~~v~-----------~~~~~~------- 136 (200) T PRK02726 77 SGGPLVAFAQGLPQIKT--EWVLLLACDLPKLRVGVLQEWLQQLDNVPEEAIAALP-----------KQEKGW------- 136 (200) T ss_pred CCCCHHHHHHHHHHCCC--CCEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEE-----------CCCCCC------- T ss_conf 89859999999985688--8389943887888999999999998608887379997-----------159964------- Q ss_pred CCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECH-HHEECCCCCCCCCCCC Q ss_conf 667542100035560011125014677753288874001112114899997307717999172-1430111210000001 Q gi|254780942|r 160 DATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKE-QEVCGCNNRYELSLIE 238 (442) Q Consensus 160 d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~-~~~~gv~~~~~L~~~~ 238 (442) + .+ .++| +...+..+-..+.. |++-+.+.+ .+ ..+..+.+++ .++.-+|+|.||+.++ T Consensus 137 -----~---Pl--~aly--~~~~~~~l~~~l~~----g~r~l~~~l----~~-~~~~~v~~~~~~~~~NiNTPeDL~~~~ 195 (200) T PRK02726 137 -----E---PL--CGFY--RRRCLPSLEQFIQQ----GGRSFQGWL----AQ-VPVQELALSDPDMLFNCNTPEDLAVIQ 195 (200) T ss_pred -----C---CE--EEEE--CHHHHHHHHHHHHH----CCCCHHHHH----HH-CCCEEEECCCHHHHCCCCCHHHHHHHH T ss_conf -----7---31--2110--58899999999990----995499999----87-897797269845625899999999986 Q ss_pred CCCCC Q ss_conf 22222 Q gi|254780942|r 239 NIWQS 243 (442) Q Consensus 239 ~~~~~ 243 (442) .+.++ T Consensus 196 ~i~~~ 200 (200) T PRK02726 196 GIGRD 200 (200) T ss_pred HHHCC T ss_conf 24319 No 107 >PRK00317 mobA molybdopterin-guanine dinucleotide biosynthesis protein A; Reviewed Probab=99.75 E-value=2.5e-17 Score=117.01 Aligned_cols=185 Identities=16% Similarity=0.236 Sum_probs=113.9 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECC--CCC Q ss_conf 408999458877232788874412588485799999999977997799995798899999730269728999859--988 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQD--CQQ 81 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~--~~~ 81 (442) +..|||||||++|||. ..+|+|+++.|+|+++|+++.|... +.+++++++.+.+.... .+.. ++... ... T Consensus 3 ~i~gvILAGG~SsRmg-G~dK~ll~~~G~~li~~~~~~l~~~-~~~v~i~~~~~~~~~~~----~~~~--vi~D~~~~~~ 74 (193) T PRK00317 3 PITGVILAGGRARRMG-GVDKGLQELNGKPLIQHVIDRLAPQ-VDEIVINANRNLARYAA----FGLP--VIPDELADFP 74 (193) T ss_pred CCEEEEECCCCCCCCC-CCCCCCCEECCEEHHHHHHHHHHHH-CCEEEEECCCCHHHHHH----CCCE--EEECCCCCCC T ss_conf 8569998876767899-9997506589974999999987754-57689976878888864----4996--9963787766 Q ss_pred CHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCCCC Q ss_conf 61446641000012344605898515411153322000000135861001001012215744042023331001001466 Q gi|254780942|r 82 GTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREENDA 161 (442) Q Consensus 82 GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~d~ 161 (442) |.-.+++.++...+ .+.++|+.+|+|+++...+..|++...+....++... ++++. + T Consensus 75 GPL~Gi~saL~~~~--~~~vlv~~~DmP~i~~~~i~~L~~~~~~~~~~iv~~~-------------~~~~~-----~--- 131 (193) T PRK00317 75 GPLAGILAGLQQAE--TEWVLFVPCDTPFLPRDLVARLAQALIEEDADIAVAH-------------DGERD-----H--- 131 (193) T ss_pred CCHHHHHHHHHHCC--CCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEE-------------ECCCC-----C--- T ss_conf 75899999986268--7759996267788989999999999874799838986-------------48954-----6--- Q ss_pred CCCCCCCCCCCCCHHEECCHHHHHHHHH-CCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECH--HHEECCCCCCCCCCCC Q ss_conf 7542100035560011125014677753-288874001112114899997307717999172--1430111210000001 Q gi|254780942|r 162 TDEERKIHYCNSGLMAIDGLYIMDWLLQ-IKKNKVSQEYYLTDIIEKARLDGKSIASIDVKE--QEVCGCNNRYELSLIE 238 (442) Q Consensus 162 ~~~~~~~~lin~GiY~f~~~~L~~~l~~-l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~--~~~~gv~~~~~L~~~~ 238 (442) | + .++| +...+ +.++. +.. |++- +..++++ ..+..+.+++ .++..+|+|.||..++ T Consensus 132 -P------L--~aly--~~~~l-~~l~~~l~~----g~~~----l~~~l~~-~~~~~v~~~~~~~~f~NiNTpeDl~~~~ 190 (193) T PRK00317 132 -P------T--FALY--SVALL-PDLEAALAA----GERK----VMAFYAR-HGTVAVDFSDPKDAFFNINTPEDLAQLE 190 (193) T ss_pred -C------C--CCCC--CHHHH-HHHHHHHHC----CCCC----HHHHHHH-CCCEEEEECCCCCCCCCCCCHHHHHHHH T ss_conf -5------3--1104--78899-999999990----9965----9999987-8958998079866236899999999999 Q ss_pred CC Q ss_conf 22 Q gi|254780942|r 239 NI 240 (442) Q Consensus 239 ~~ 240 (442) ++ T Consensus 191 ~~ 192 (193) T PRK00317 191 QL 192 (193) T ss_pred HH T ss_conf 61 No 108 >COG2068 Uncharacterized MobA-related protein [General function prediction only] Probab=99.74 E-value=5.4e-17 Score=115.16 Aligned_cols=192 Identities=22% Similarity=0.353 Sum_probs=133.6 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCH-HHHHHHHCCCCCEEEEEECC-CC Q ss_conf 74089994588772327888744125884857999999999779977999957988-99999730269728999859-98 Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGA-EEITRINFPPTLSVEYYIQD-CQ 80 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~-e~i~~~~~~~~~~i~~v~q~-~~ 80 (442) -+..|||||||+||||. .+|.|+|+.|+|++.|+++.+.++++++++++++|.. +...+.. ....+.++..+ -. T Consensus 4 ~~v~~VvLAAGrssRmG--~~KlLap~~g~plv~~~~~~a~~a~~~~vivV~g~~~~~~~~a~~--~~~~~~~v~npd~~ 79 (199) T COG2068 4 STVAAVVLAAGRSSRMG--QPKLLAPLDGKPLVRASAETALSAGLDRVIVVTGHRVAEAVEALL--AQLGVTVVVNPDYA 79 (199) T ss_pred CCEEEEEECCCCCCCCC--CCCEECCCCCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHH--CCCCEEEEECCCHH T ss_conf 64678887166666678--751654149982899999999726887099992761136788653--36980899695522 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCCC Q ss_conf 86144664100001234460589851541115332200000013586100100101221574404202333100100146 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREEND 160 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~d 160 (442) +|.+.+++.+..+.+...+-++++.||+|+++..++..+++...... .+. . |. | ++. . T Consensus 80 ~Gls~Sl~ag~~a~~~~~~~v~~~lgDmP~V~~~t~~rl~~~~~~~~--~~v-~----p~-~------~g~--------r 137 (199) T COG2068 80 QGLSTSLKAGLRAADAEGDGVVLMLGDMPQVTPATVRRLIAAFRARG--AAV-R----PV-Y------GGA--------R 137 (199) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCC--CEE-E----EE-C------CCC--------C T ss_conf 23769999998735567976999968888787899999998443467--344-2----22-0------687--------6 Q ss_pred CCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCCCC Q ss_conf 67542100035560011125014677753288874001112114899997307717999172143011121000000122 Q gi|254780942|r 161 ATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIENI 240 (442) Q Consensus 161 ~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~~~ 240 (442) ..| .+|+ ..++.++.+++-+. | .-..+.+-+.....+..+.....++++|.||..++.. T Consensus 138 G~P------------v~~~-~~~~~~l~~l~GD~--G------~r~ll~~~~~~~~~V~~~~g~llDVDTped~~~a~~~ 196 (199) T COG2068 138 GHP------------VLLS-KDLFPALARLSGDV--G------ARQLLEEGGLPLVEVEVDAGVLLDVDTPEDLARAQDL 196 (199) T ss_pred CCC------------EEEC-HHHHHHHHHCCCCH--H------HHHHHHHCCCCEEEECCCCCEEECCCCHHHHHHHHHH T ss_conf 796------------4565-66789986168862--2------8999975386347652688517547998999999876 Q ss_pred C Q ss_conf 2 Q gi|254780942|r 241 W 241 (442) Q Consensus 241 ~ 241 (442) + T Consensus 197 ~ 197 (199) T COG2068 197 L 197 (199) T ss_pred H T ss_conf 2 No 109 >COG1212 KdsB CMP-2-keto-3-deoxyoctulosonic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=99.74 E-value=2.4e-17 Score=117.17 Aligned_cols=226 Identities=19% Similarity=0.258 Sum_probs=151.3 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCH Q ss_conf 40899945887723278887441258848579999999997799779999579889999973026972899985998861 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGT 83 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GT 83 (442) +.+.||.|==.+||| | -|||-.|+|+|||.|+.++..+++.++++|.| +.++|.+.....+.++-....+.+.|| T Consensus 3 ~~~viIPAR~~STRL-p--gKPLadI~GkpmI~rV~e~a~~s~~~rvvVAT--Dde~I~~av~~~G~~avmT~~~h~SGT 77 (247) T COG1212 3 KFVVIIPARLASTRL-P--GKPLADIGGKPMIVRVAERALKSGADRVVVAT--DDERIAEAVQAFGGEAVMTSKDHQSGT 77 (247) T ss_pred CEEEEEECCHHCCCC-C--CCCHHHHCCCHHHHHHHHHHHHCCCCEEEEEC--CCHHHHHHHHHHCCEEEECCCCCCCCC T ss_conf 559998542002668-9--97357737960789999999873787289974--988999999970978996178889800 Q ss_pred HHHHHHCCCCCCCCCCEE-EEEECCCCCCCCCCCCCCCCCCCCC-CCCCCEEEECCC------CCCCCEEEEC-CCCCEE Q ss_conf 446641000012344605-8985154111533220000001358-610010010122------1574404202-333100 Q gi|254780942|r 84 AHAVLTAQDAIKPGYDDV-IIMYGDVPLVSSHTLKKAMDKIAQG-YSIAVVGFNADN------PKGYGRLLIK-NNEIIA 154 (442) Q Consensus 84 a~Ai~~a~~~l~~~~~~~-lVl~GD~~li~~~~l~~l~~~~~~~-~~~~i~~~~~~d------p~~yGvV~~d-~~~v~~ 154 (442) +-+..+...++...++. +-+.||.||+.+..+.++.+..+.. .+.+.++.+..+ |.-.-+| .| +++.+= T Consensus 78 -dR~~Ev~~~l~~~~~~iIVNvQGDeP~i~p~~I~~~~~~L~~~~~~~aTl~~~i~~~ee~~nPN~VKvV-~d~~g~ALY 155 (247) T COG1212 78 -DRLAEVVEKLGLPDDEIIVNVQGDEPFIEPEVIRAVAENLENSNADMATLAVKITDEEEAFNPNVVKVV-LDKEGYALY 155 (247) T ss_pred -HHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHCCCCCEEEE-ECCCCCEEE T ss_conf -899999996599964299986588777797999999999873776604553433788884599827999-759982899 Q ss_pred CCCCCCCCCCCC-----CCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHH--HHHHCCEEEEEECHHHEEC Q ss_conf 100146675421-----000355600111250146777532888740011121148999--9730771799917214301 Q gi|254780942|r 155 IREENDATDEER-----KIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKA--RLDGKSIASIDVKEQEVCG 227 (442) Q Consensus 155 i~Ek~d~~~~~~-----~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l--~~~g~~i~~~~~~~~~~~g 227 (442) |---+ -|..+ ..-+-..|+|.|+..+|.+|... +++.. | .+.-+++| +..|.+|.+...++..-.| T Consensus 156 FSRs~--iP~~rd~~~~~p~l~HIGIYayr~~~L~~f~~~-~ps~L--E--~~E~LEQLR~Le~G~kI~v~i~~~~p~~g 228 (247) T COG1212 156 FSRAP--IPYGRDNFGGTPFLRHIGIYAYRAGFLERFVAL-KPSPL--E--KIESLEQLRVLENGEKIHVEIVKEVPSIG 228 (247) T ss_pred EECCC--CCCCCCCCCCCCHHHEEEHHHHHHHHHHHHHCC-CCCHH--H--HHHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 87687--877543467764364220588679999999706-99605--6--78778999998769757899841688889 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 1121000000122222 Q gi|254780942|r 228 CNNRYELSLIENIWQS 243 (442) Q Consensus 228 v~~~~~L~~~~~~~~~ 243 (442) ++++.||..+++++.. T Consensus 229 VDT~EDLe~v~~~~~~ 244 (247) T COG1212 229 VDTPEDLERVRKILSN 244 (247) T ss_pred CCCHHHHHHHHHHHHH T ss_conf 8988999999999875 No 110 >cd04647 LbH_MAT_like Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins. MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively while GAT specifically acetylates galactopyranosides. XAT catalyzes the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Members of this family contain a a left-handed parallel beta-helix (LbH) domain with at least 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). They are trimeric in their active form. Probab=99.74 E-value=9.2e-18 Score=119.51 Aligned_cols=51 Identities=33% Similarity=0.665 Sum_probs=48.0 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEE Q ss_conf 629878879886998937839889989977835766436986898135607 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQI 429 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~ 429 (442) ++.||++||||.+++++||++||++|+|+|||+|++|+|+++++.|.|.++ T Consensus 58 ~v~Ig~~v~IG~~~~I~~gv~Ig~~~iIga~SvV~k~i~~~~i~~G~PAk~ 108 (109) T cd04647 58 PIVIGDDVWIGANVVILPGVTIGDGAVVGAGSVVTKDVPPNSIVAGNPAKV 108 (109) T ss_pred CEEEECCCEECCCEEEEECCEECCCCEECCCCEECCCCCCCCEEEEECCEE T ss_conf 979942357889839928513089869937988914839998999758699 No 111 >cd03354 LbH_SAT Serine acetyltransferase (SAT): SAT catalyzes the CoA-dependent acetylation of the side chain hydroxyl group of L-serine to form O-acetylserine, as the first step of a two-step biosynthetic pathway in bacteria and plants leading to the formation of L-cysteine. This reaction represents a key metabolic point of regulation for the cysteine biosynthetic pathway due to its feedback inhibition by cysteine. The enzyme is a 175 kDa homohexamer, composed of a dimer of homotrimers. Each subunit contains an N-terminal alpha helical region and a C-terminal left-handed beta-helix (LbH) subdomain with 5 turns, each containing a hexapeptide repeat motif characteristic of the acyltransferase superfamily of enzymes. The trimer interface mainly involves the C-terminal LbH subdomain while the dimer (of trimers) interface is mediated by the N-terminal alpha helical subdomain. Probab=99.73 E-value=1.2e-17 Score=118.78 Aligned_cols=48 Identities=29% Similarity=0.571 Sum_probs=45.7 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECC Q ss_conf 629878879886998937839889989977835766436986898135 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARS 426 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~ 426 (442) ++.||++||||.|+++.++++||++|+|||||+|++|+|++++++|.| T Consensus 54 ~~~Ig~~v~IG~~~~I~~gv~IG~~~vIgagsvV~~dv~~~si~~G~P 101 (101) T cd03354 54 HPTIGDNVVIGAGAKILGNITIGDNVKIGANAVVTKDVPANSTVVGVP 101 (101) T ss_pred CCCCCCCEEECCCCEEECCCEECCCCEECCCCEECCCCCCCCEEEECC T ss_conf 761487586358988929879999879987988996859998998039 No 112 >cd03350 LbH_THP_succinylT 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (also called THP succinyltransferase): THDP N-succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is homotrimeric and each subunit contains an N-terminal region with alpha helices and hairpin loops, as well as a C-terminal region with a left-handed parallel alpha-helix (LbH) structural motif encoded by hexapeptide repeat motifs. Probab=99.73 E-value=1.6e-17 Score=118.09 Aligned_cols=128 Identities=16% Similarity=0.255 Sum_probs=72.2 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCC Q ss_conf 99411233178410012232234420002111631343321211121002134432000002586443202310211247 Q gi|254780942|r 263 FLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKE 342 (442) Q Consensus 263 ~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~ 342 (442) ++++.+.|++++.|+++++|..++.||++|.|+.+ ++|+++|.||++++|.+++.|..... . T Consensus 9 ~I~~ga~IG~g~vI~~~a~I~~g~~IG~~~~I~~~-----~~Ig~~~~IG~~v~I~~g~~i~g~~~-------------p 70 (139) T cd03350 9 IIRDGAFIGPGAVLMMPSYVNIGAYVDEGTMVDSW-----ATVGSCAQIGKNVHLSAGAVIGGVLE-------------P 70 (139) T ss_pred EECCCCEECCCCEECCCCEECCCCEECCCCEECCC-----CEECCCCEECCCCEECCCCEECCCCC-------------C T ss_conf 98899999999899889998439898987278666-----60268669999328988867844425-------------7 Q ss_pred CCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEE Q ss_conf 85448745556639889989979808834457754562987887988699893783988998997783576643698689 Q gi|254780942|r 343 GSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLV 422 (442) Q Consensus 343 g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v 422 (442) -.+ -..+|+++|.||+++++.- +++||+++.||+++++...+.+ +.. .+|.+...++||+++| T Consensus 71 ~~~-------~~v~IeD~~~IGa~a~V~~-------Gv~IG~~avIGAGsVVtkdvp~--y~~-~~g~~~~~~~p~~~~~ 133 (139) T cd03350 71 LQA-------TPVIIEDDVFIGANCEVVE-------GVIVGKGAVLAAGVVLTQSTPI--YDR-ETGEIYYGRVPPGSVV 133 (139) T ss_pred CCC-------CCCEECCCEEEEECCEEEC-------CCEECCCCEECCCCEEECCCCC--EEE-ECCCEEECCCCCCCEE T ss_conf 646-------8868879959930998946-------8799999899979889189884--682-3780970116998787 Q ss_pred EEC Q ss_conf 813 Q gi|254780942|r 423 FAR 425 (442) Q Consensus 423 ~~~ 425 (442) ..+ T Consensus 134 ~~~ 136 (139) T cd03350 134 VAG 136 (139) T ss_pred CCC T ss_conf 458 No 113 >cd00710 LbH_gamma_CA Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three distinct groups of carbonic anhydrases - alpha, beta and gamma - which show no significant sequence identity or structural similarity. Gamma CAs are homotrimeric enzymes, with each subunit containing a left-handed parallel beta helix (LbH) structural domain. Probab=99.72 E-value=6.3e-17 Score=114.78 Aligned_cols=120 Identities=23% Similarity=0.337 Sum_probs=76.0 Q ss_pred EECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCC Q ss_conf 33178410012232234420002111631343321211121002134432000002586443202310211247854487 Q gi|254780942|r 269 IIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINH 348 (442) Q Consensus 269 ~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~ 348 (442) .|++++.|.|++++-+++.||++|.|+++++|.+-. + ....||+++.|++++ +|. +.-+ T Consensus 4 ~i~~~afIap~A~viGdV~ig~~~sIw~~aviRgD~-~------------~~I~IG~~~nIqD~~-----viH--~~~~- 62 (167) T cd00710 4 VIDPSAYVHPTAVVIGDVIIGDNVFVGPGASIRADE-G------------TPIIIGANVNIQDGV-----VIH--ALEG- 62 (167) T ss_pred CCCCCEEECCCCEEEEEEEECCCCEECCCCEEECCC-C------------CCEEECCCCCCCCCE-----EEE--CCCC- T ss_conf 069993989999899708999999998887894788-8------------876988874347770-----782--2576- Q ss_pred CCCCCCCEECCCCEECCCEEECCCCCCCCCC-CEECCCCEECCCCEEECCEEECCCCEECCCCEECCC-CCCCCEE Q ss_conf 4555663988998997980883445775456-298788798869989378398899899778357664-3698689 Q gi|254780942|r 349 LSYVGDSVVGKNVNIGAGTITCNYDGTHKYK-THINENAFIGSNSSLIAPITIGQGTYVASGSIITQD-TPENSLV 422 (442) Q Consensus 349 ~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~-~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d-v~~~~~v 422 (442) .++.||++|+||.++++ |+ ++||++|+||.+++++. .+||++|+|+|||+|... +||++++ T Consensus 63 ----~~~~IG~~vtIgh~a~i--------hG~c~Igd~~~IG~gaiv~~-a~Ig~~~vIgags~V~g~~i~~g~~v 125 (167) T cd00710 63 ----YSVWIGKNVSIAHGAIV--------HGPAYIGDNCFIGFRSVVFN-AKVGDNCVIGHNAVVDGVEIPPGRYV 125 (167) T ss_pred ----CCEEECCCCEECCCCEE--------CCEEEECCCCEECCCCEEEC-CEECCCCEECCCCEEECCCCCCCCEE T ss_conf ----67497798289898488--------12049839988988989982-59769989988988847783999895 No 114 >TIGR03202 pucB xanthine dehydrogenase accessory protein pucB. In Bacillus subtilis the expression of this protein, located in an operon with the structural subunits of xanthine dehydrogenase, has been found to be essential for XDH activity. Some members of this family appear to have a distant relationship to the MobA protein involved in molybdopterin biosynthesis, although this may be coincidental. Probab=99.72 E-value=4.1e-17 Score=115.84 Aligned_cols=121 Identities=20% Similarity=0.301 Sum_probs=96.9 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHC--CCCCEEEEEEC-CCCCC Q ss_conf 8999458877232788874412588485799999999977997799995798899999730--26972899985-99886 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINF--PPTLSVEYYIQ-DCQQG 82 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~--~~~~~i~~v~q-~~~~G 82 (442) .|||||||.++||. .+|.|+|+.|+||+.|.++.+..+++.++++|+++..+++..... .....++++.. +...| T Consensus 2 ~aiILAAG~S~RmG--~~KlLl~~~g~~ll~~~~~~~~~~~~~~vvVV~g~~~~~~~~~~~~~~~~~~~~~v~n~~~~~G 79 (190) T TIGR03202 2 VAIYLAAGQSRRMG--ENKLALPLGETTLGSASLKTALSSRLSKVIVVIGEKYAHLSWLDPYLLADERIMLVCCRDACEG 79 (190) T ss_pred EEEEECCCCCCCCC--CCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 79997998766899--8842385099119999999998669981999818718888766787633487499968986546 Q ss_pred HHHHHHHCCCCCC-CCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 1446641000012-344605898515411153322000000135861 Q gi|254780942|r 83 TAHAVLTAQDAIK-PGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYS 128 (442) Q Consensus 83 Ta~Ai~~a~~~l~-~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~ 128 (442) .+.+++.+.+... ...+-++|+-||+|+++...++.+++.+.+... T Consensus 80 ~~sSi~~Gl~~~~~~~~d~~li~l~D~P~l~~~~i~~L~~~~~~~~~ 126 (190) T TIGR03202 80 QAHSLKCGLRKAEAMGADAVVILLADQPFLTADVINALLALAKRRPD 126 (190) T ss_pred CHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCC T ss_conf 07999999997436688969996489778899999999999985899 No 115 >COG0110 WbbJ Acetyltransferase (isoleucine patch superfamily) [General function prediction only] Probab=99.72 E-value=5.4e-17 Score=115.14 Aligned_cols=55 Identities=25% Similarity=0.630 Sum_probs=51.5 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECC Q ss_conf 6298788798869989378398899899778357664369868981356077077 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKED 433 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~ 433 (442) +++||++||||.+++++|||+||++++|||||+|++|+|+++++.|.|.+++|+- T Consensus 124 ~v~IG~~vwIG~~a~IlpGV~IG~gavigagsVVtkdvp~~~iv~G~Pa~vir~~ 178 (190) T COG0110 124 PVTIGEDVWIGAGAVILPGVTIGEGAVIGAGSVVTKDVPPYGIVAGNPARVIRKR 178 (190) T ss_pred CCEECCCCEECCCCEECCCEEECCCEEEECCEEEECCCCCCEEEECCCCEEEEEC T ss_conf 9788998587789789798598998199788589577799828979998898741 No 116 >TIGR00454 TIGR00454 conserved hypothetical protein TIGR00454; InterPro: IPR005245 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=99.72 E-value=1.3e-17 Score=118.69 Aligned_cols=121 Identities=19% Similarity=0.323 Sum_probs=93.8 Q ss_pred CEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCHHHHHHHHCCCCCEEE-E-EECCCCC Q ss_conf 089994588772327888744125884857999999999779-977999957988999997302697289-9-9859988 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAG-IENVALVLGYGAEEITRINFPPTLSVE-Y-YIQDCQQ 81 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g-~~~ivvv~~~~~e~i~~~~~~~~~~i~-~-v~q~~~~ 81 (442) |.|.|||||+||||--+--|||+.+||++||+|+|+.|.++. ++.|+++|+++.-.-++|......... + +....-. T Consensus 1 M~aL~MaGGkGTRlg~d~EKPL~evcGr~lIDhvl~~l~~~~GV~~ii~~TSPhTP~Te~y~~~~~~~~~~ivvidasGk 80 (204) T TIGR00454 1 MLALVMAGGKGTRLGRDVEKPLLEVCGRKLIDHVLEALKKAKGVDNIIVVTSPHTPKTEEYVAEKYKEYKRIVVIDASGK 80 (204) T ss_pred CCCEEECCCCCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHCCCEEEEEEEEECCCC T ss_conf 95024058873122655565337565850457777866404787537998377797637887305850457889976999 Q ss_pred CHHHHHHHCCCCCC-CCCCEEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 61446641000012-344605898515411153322000000135 Q gi|254780942|r 82 GTAHAVLTAQDAIK-PGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ 125 (442) Q Consensus 82 GTa~Ai~~a~~~l~-~~~~~~lVl~GD~~li~~~~l~~l~~~~~~ 125 (442) |--.-+..+..++. ...++|||+++|.+++.+..++..++.+.+ T Consensus 81 GYiEDl~E~~~hlE~~~~EP~lV~ssDl~~~r~~~id~Ivd~y~~ 125 (204) T TIGR00454 81 GYIEDLREVLSHLELAFSEPLLVVSSDLVLVRDKIIDSIVDAYYK 125 (204) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHHC T ss_conf 602527899998776633985787436024336778777643210 No 117 >cd05825 LbH_wcaF_like wcaF-like: This group is composed of the protein product of the E. coli wcaF gene and similar proteins. WcaF is part of the gene cluster responsible for the biosynthesis of the extracellular polysaccharide colanic acid. The wcaF protein is predicted to contain a left-handed parallel beta-helix (LbH) domain encoded by imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Many are trimeric in their active forms. Probab=99.71 E-value=5.8e-17 Score=114.98 Aligned_cols=51 Identities=20% Similarity=0.341 Sum_probs=48.1 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEE Q ss_conf 629878879886998937839889989977835766436986898135607 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQI 429 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~ 429 (442) ++.||++||||.|++++||++||++|+|+|||+|++|+|+++++.|.|.++ T Consensus 56 ~v~Ig~~v~IG~~~~I~~gv~Ig~~svVgagSvV~kdvp~~~i~~G~PAk~ 106 (107) T cd05825 56 PIVIGDGAWVAAEAFVGPGVTIGEGAVVGARSVVVRDLPAWTVYAGNPAVP 106 (107) T ss_pred CEEECCCCEECCCCEEECCCEECCCCEEECCCEECCCCCCCCEEEEECCEE T ss_conf 979938858749878969919189979948988903859997998147589 No 118 >PRK10191 putative colanic acid biosynthesis acetyltransferase WcaB; Provisional Probab=99.71 E-value=1.2e-16 Score=113.13 Aligned_cols=53 Identities=26% Similarity=0.514 Sum_probs=48.8 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEE Q ss_conf 62987887988699893783988998997783576643698689813560770 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVK 431 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~ 431 (442) ..+|||+|+||+|+.++.+++||+++.|||||+|++|||++++|+|.|.++.. T Consensus 92 ~P~Ig~~V~Igaga~IlG~i~IG~~~~IGA~svV~~dvp~~~~v~G~Pari~~ 144 (146) T PRK10191 92 CPHIGNGVELGANVIILGDITLGNNVTVGAGSVVLDSVPDNALVVGEKARVKV 144 (146) T ss_pred CCEECCCEEEECCCEEECCCEECCCCEECCCCEECCCCCCCCEEEEECCEEEE T ss_conf 89879995992899996485999999999895897035999999976809995 No 119 >PRK09382 ispDF bifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein; Provisional Probab=99.68 E-value=6.7e-16 Score=108.91 Aligned_cols=206 Identities=15% Similarity=0.209 Sum_probs=132.7 Q ss_pred CCEEEEECCCCCCCCCC--CCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 40899945887723278--8874412588485799999999977-99779999579889999973026972899985998 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKS--SSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQ 80 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p--~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~ 80 (442) +..|||||||.|+||.. ..||++++++|+|+++|.++.+.+. .+++|+++++.+... ++.....++.++.--+. T Consensus 6 ~i~aIIlAAG~G~Rfg~~~~ipKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvvv~~~~~~---~~~~~~~~i~~V~GG~t 82 (382) T PRK09382 6 DISLVIVAAGRSTRFGQSAEVKKQWLRIGGKPLWLHVLENFSSAPDFKEIVVVIHPDDIA---YMKRAVPEIKFVTGGAT 82 (382) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEEECHHHHH---HHHHHCCCCEEECCCHH T ss_conf 637999687225048998799974106797789999999996689989899993888999---99974579579378505 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCCC Q ss_conf 86144664100001234460589851541115332200000013586100100101221574404202333100100146 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREEND 160 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~d 160 (442) + .+++++++++++. +.+||.-+--||++...++.+++..++ .++++.+.++.|.-.+ .++.+ | T Consensus 83 R--q~SV~ngL~~i~~--~~VlIHDaARP~vs~~lI~~li~~~~~-~~a~ip~lpv~DTik~-----~~~tv-------d 145 (382) T PRK09382 83 R--QESVRNALEALDS--EYVLISDAARPFVPKELIDRLIEALDK-ADCVLPALPVADTLKR-----ANETV-------D 145 (382) T ss_pred H--HHHHHHHHHHCCC--CEEEEECCCCCCCCHHHHHHHHHHHCC-CCCCEEEEECCCCEEE-----CCCCC-------C T ss_conf 9--9999999975678--889995542578999999999986514-7761587524335010-----46646-------8 Q ss_pred CCCCCCCCCC--CCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCC Q ss_conf 6754210003--55600111250146777532888740011121148999973077179991721430111210000001 Q gi|254780942|r 161 ATDEERKIHY--CNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIE 238 (442) Q Consensus 161 ~~~~~~~~~l--in~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~ 238 (442) +.++ +.|= -.|+.+.|.++.++.... ....||--+.+...|.++..+.-+... .-+.+|.||..++ T Consensus 146 ------R~~l~~~QTP-Q~F~~~~L~~a~~~a~~~----~~~~TDDasl~e~~G~~V~~V~G~~~N-~KITtpeDL~~~~ 213 (382) T PRK09382 146 ------REGLKLIQTP-QLSRTKTLLAAHRKALSG----RTDFTDDSSIAEAAGGKVALVEGSEDL-HKLTFKEDLKALD 213 (382) T ss_pred ------HHHCEEECCC-CCCCHHHHHHHHHHHHHC----CCCCCCHHHHHHHCCCCEEEEECCCCC-CCCCCHHHHHHHH T ss_conf ------6784562277-535599999999999863----998674099999869927999567548-5389999999998 Q ss_pred CCC Q ss_conf 222 Q gi|254780942|r 239 NIW 241 (442) Q Consensus 239 ~~~ 241 (442) .+. T Consensus 214 ~~~ 216 (382) T PRK09382 214 RLS 216 (382) T ss_pred HHH T ss_conf 644 No 120 >PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional Probab=99.68 E-value=4.6e-16 Score=109.83 Aligned_cols=120 Identities=20% Similarity=0.339 Sum_probs=81.6 Q ss_pred CCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 78410012232234420002111631343321211121002134432000002586443202310211247854487455 Q gi|254780942|r 272 PDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSY 351 (442) Q Consensus 272 ~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~ 351 (442) .+..+-|++.+..++.|++++++.|.-+--|++|+++++|..|+.+.+.+ T Consensus 95 ~gvRvVP~a~vR~Gayi~~~vVlMPs~VNiGAyv~~gtmiDtwatvGScA------------------------------ 144 (265) T PRK11830 95 AGVRVVPGAVVRRGAYIAKNVVLMPSYVNIGAYVDEGTMVDTWATVGSCA------------------------------ 144 (265) T ss_pred CCEEECCCCEEEECEEECCCCEEEEEEEEEEEEECCCCEEECCCCEECHH------------------------------ T ss_conf 79088788537603054599588641777603984895480476241153------------------------------ Q ss_pred CCCCEECCCCEECCCEEECC-CCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECC---------------C Q ss_conf 56639889989979808834-4577545629878879886998937839889989977835766---------------4 Q gi|254780942|r 352 VGDSVVGKNVNIGAGTITCN-YDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQ---------------D 415 (442) Q Consensus 352 igd~~iG~~~~ig~g~i~~n-~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~---------------d 415 (442) +||+||.|++|+-+.. +......+++|+|+|+||+++.+..|+.|+++|++++|..+++ + T Consensus 145 ----qiGknvhls~g~gigGvleP~~a~p~iied~~fiGa~~~v~eGv~v~~~avl~~gv~l~~st~I~D~~~~ev~~g~ 220 (265) T PRK11830 145 ----QIGKNVHLSGGVGIGGVLEPLQANPVIIEDNCFIGARSEVVEGVIVEEGAVIGMGVFLGQSTKIYDRETGEVHYGR 220 (265) T ss_pred ----EECCCEEECCCCEEEEEECCCCCCCEEEECCCEECCCCEEEEEEEEECCCEECCEEEECCCCEEEECCCCCEEECC T ss_conf ----2378336878741426524766798798368287357479613999458697210476067167872478287434 Q ss_pred CCCCCEEEEC Q ss_conf 3698689813 Q gi|254780942|r 416 TPENSLVFAR 425 (442) Q Consensus 416 v~~~~~v~~~ 425 (442) ||++++|+.. T Consensus 221 vP~~svvv~G 230 (265) T PRK11830 221 VPAGSVVVPG 230 (265) T ss_pred CCCCCEEECC T ss_conf 7999889557 No 121 >cd04650 LbH_FBP Ferripyochelin Binding Protein (FBP): FBP is an outer membrane protein which plays a role in iron acquisition. It binds iron when it is complexed with pyochelin. It adopts the left-handed parallel beta-helix (LbH) structure, and contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Acyltransferase activity has not been observed in this group. Probab=99.67 E-value=2.2e-16 Score=111.71 Aligned_cols=131 Identities=21% Similarity=0.314 Sum_probs=79.6 Q ss_pred ECCCCCEECCCCCCCCCCCCEEEECCCCCCCC----CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCC Q ss_conf 31784100122322344200021116313433----21211121002134432000002586443202310211247854 Q gi|254780942|r 270 IQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE----GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSK 345 (442) Q Consensus 270 I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~----~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~ 345 (442) |++++.|+|.+.|.+++.||++|.|+++++|. ...||++|.|++++.|... -+..+.||+ .+. T Consensus 3 I~~~~~I~p~a~i~G~v~IG~~~~I~~~~~Ir~d~~~i~IG~~~~I~~n~~I~~~--~~~~~~ig~-----------~v~ 69 (154) T cd04650 3 ISPKAYVHPTSYVIGDVVIGELTSVWHYAVIRGDNDSIYIGKYSNVQENVSIHTD--HGYPTEIGD-----------YVT 69 (154) T ss_pred CCCCEEECCCCEEECEEEECCCCEECCCCEEECCCCCEEECCCEEECCCCEEEEC--CCCCCEECC-----------CCE T ss_conf 6998799999999760599999999875389558876599993398999889846--999868799-----------768 Q ss_pred CCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC-CEECCCCCCCC Q ss_conf 4874555663988998997980883445775456298788798869989378398899899778-35766436986 Q gi|254780942|r 346 INHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG-SIITQDTPENS 420 (442) Q Consensus 346 i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag-~~v~~dv~~~~ 420 (442) ++|.+.+..+.||++|.||+++++.. +++||+++.||+++++.++..|.++++++.. +-+-|++++.. T Consensus 70 ig~~~~i~g~~Ig~~v~IG~~a~I~~-------Gv~IG~~svIgaGsVV~~~~~Vp~~~lv~G~PAr~ir~l~~~~ 138 (154) T cd04650 70 IGHNAVVHGAKVGNYVIVGMGAILLN-------GAKIGDHVIIGAGAVVTPGKEIPDYSLVLGVPAKVVRKLTEEE 138 (154) T ss_pred ECCCEEEECCEECCEEEECCCEEEEC-------CCEECCCCEECCCCEECCCCEECCCCEEEECCEEEEEECCHHH T ss_conf 89952882302201359887229962-------8798999799889888799791998499961818977299999 No 122 >cd04745 LbH_paaY_like paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E. coli paaY gene, which is part of the paa gene cluster responsible for phenylacetic acid degradation. Proteins in this group are expected to adopt the left-handed parallel beta-helix (LbH) structure. They contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Similarity to gamma carbonic anhydrase and Ferripyochelin Binding Protein (FBP) may suggest metal binding capacity. Probab=99.67 E-value=1.4e-16 Score=112.77 Aligned_cols=136 Identities=15% Similarity=0.269 Sum_probs=83.8 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 94112331784100122322344200021116313433212111210021344320000025864432023102112478 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) |++++.|++++.|.+++.||++|.|+++|.|... .....||++|.|+++|.|. +..+.++ .|+++ T Consensus 3 i~~~a~I~p~A~I~G~V~IG~~~~I~~~~~Ir~d--~g~i~IG~~~~I~~n~~Ih--~~~~~~~-----------~Ig~~ 67 (155) T cd04745 3 VDPSSFVHPTAVLIGDVIIGKNCYIGPHASLRGD--FGRIVIRDGANVQDNCVIH--GFPGQDT-----------VLEEN 67 (155) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCEEEECC--CCCEEECCCCEECCCCEEC--CCCCCCC-----------EECCC T ss_conf 7998599999999844799899799846079638--7778999997978784401--3599986-----------98999 Q ss_pred CCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC-CEECCCCCCCCE Q ss_conf 544874555663988998997980883445775456298788798869989378398899899778-357664369868 Q gi|254780942|r 344 SKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG-SIITQDTPENSL 421 (442) Q Consensus 344 ~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag-~~v~~dv~~~~~ 421 (442) +.++|.+.+..|.||+||.||+++++.. ++.||++|.||+++++.++..|.++++++.. +-+-|++.+..+ T Consensus 68 ~~ig~~~~~~g~~Ig~~~~IG~~a~I~~-------G~~IG~~siIgagSvV~~~~~VP~~~l~~GnPAk~ir~l~~~e~ 139 (155) T cd04745 68 GHIGHGAILHGCTIGRNALVGMNAVVMD-------GAVIGEESIVGAMAFVKAGTVIPPRSLIAGSPAKVIRELSDEEV 139 (155) T ss_pred CEECCCCEECCCEEEEEEEECCCCEEEC-------CCEECCCCEEEECCEECCCCCCCCCEEEEEECCEEEECCCHHHH T ss_conf 7899874871517743249768759959-------77989997994187876996837998999728679743999999 No 123 >COG2266 GTP:adenosylcobinamide-phosphate guanylyltransferase [Coenzyme metabolism] Probab=99.66 E-value=5.2e-16 Score=109.54 Aligned_cols=111 Identities=21% Similarity=0.396 Sum_probs=89.9 Q ss_pred CEEEEECCCCCCCCC-CCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCH Q ss_conf 089994588772327-8887441258848579999999997799779999579889999973026972899985998861 Q gi|254780942|r 5 RLAIVLAAGRGHRMK-SSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGT 83 (442) Q Consensus 5 ~~AiILAaG~GtRl~-p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GT 83 (442) |.|+|||||+||||. |. |||++++|||||+|+++.|.+ .++++++.++.+....+++....+.++ +. ..-.|- T Consensus 1 m~~iiMAGGrGtRmg~~E--KPlleV~GkpLI~~v~~al~~-~~d~i~v~isp~tp~t~~~~~~~gv~v--i~-tpG~GY 74 (177) T COG2266 1 MMAIIMAGGRGTRMGRPE--KPLLEVCGKPLIDRVLEALRK-IVDEIIVAISPHTPKTKEYLESVGVKV--IE-TPGEGY 74 (177) T ss_pred CCEEEECCCCCCCCCCCC--CCCHHHCCCCHHHHHHHHHHH-HCCCEEEEECCCCHHHHHHHHHCCCEE--EE-CCCCCH T ss_conf 935896288544468876--752020781389999999972-218389996799876999997359369--97-599870 Q ss_pred HHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 44664100001234460589851541115332200000013 Q gi|254780942|r 84 AHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIA 124 (442) Q Consensus 84 a~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~ 124 (442) -.-++.+.+.+. .++|++++|.||+.+..++.+++... T Consensus 75 v~Dl~~al~~l~---~P~lvvsaDLp~l~~~~i~~vi~~~~ 112 (177) T COG2266 75 VEDLRFALESLG---TPILVVSADLPFLNPSIIDSVIDAAA 112 (177) T ss_pred HHHHHHHHHHCC---CCEEEEECCCCCCCHHHHHHHHHHHH T ss_conf 899999997449---94599865534178899999999985 No 124 >TIGR01172 cysE serine O-acetyltransferase; InterPro: IPR005881 The biosynthesis of L-cysteine is the predominant way by which inorganic sulphur is incorporated into organic compounds. In this process, the most abundant utilizable source of sulphur, inorganic sulphate, is taken up and reduced to sulphide. Sulphide is used to produce L-cysteine, which serves for protein synthesis or the production of other sulphur-containing organic compounds. Two routes for cysteine biosynthesis in nature have been documented. Serine transacetylase (2.3.1.30 from EC) catalyzes steps in pathway I, the activation of L-serine by acetyl-coenzyme A, yielding O-acetyl-L-serine.; GO: 0009001 serine O-acetyltransferase activity, 0006535 cysteine biosynthetic process from serine, 0005737 cytoplasm. Probab=99.65 E-value=3.3e-17 Score=116.38 Aligned_cols=54 Identities=20% Similarity=0.478 Sum_probs=48.3 Q ss_pred CCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCE Q ss_conf 7754562987887988699893783988998997783576643698689813560 Q gi|254780942|r 374 GTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQ 428 (442) Q Consensus 374 g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~ 428 (442) |..+| .+|+++|.||+|+.|+-.++||+||-|||||+|.+|||+++-|+|.|-| T Consensus 110 G~kRH-Pt~~~gV~iGAGAKvLG~I~vG~~AkiGAnsVVl~dVP~~~TVVGvPar 163 (163) T TIGR01172 110 GVKRH-PTIGEGVVIGAGAKVLGNIEVGENAKIGANSVVLKDVPAGATVVGVPAR 163 (163) T ss_pred CCCCC-CCCCCCEEEEECCEEECCEEECCCCEECCCEEECCCCCCCCEEEEECCC T ss_conf 88988-8256871784078240376677997772332673537898658743589 No 125 >cd04645 LbH_gamma_CA_like Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E. coli paaY protein, and similar proteins. CAs are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Gamma CAs are trimeric enzymes with left-handed parallel beta helix (LbH) structural domain. Probab=99.65 E-value=5.6e-16 Score=109.37 Aligned_cols=131 Identities=19% Similarity=0.279 Sum_probs=81.8 Q ss_pred ECCCCCEECCCCCCCCCCCCEEEECCCCCCCC----CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCC Q ss_conf 31784100122322344200021116313433----21211121002134432000002586443202310211247854 Q gi|254780942|r 270 IQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE----GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSK 345 (442) Q Consensus 270 I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~----~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~ 345 (442) |++++.|+|++.|.+++.||++|.|+++++|. ...||++|.|++++.|. +....++.||+ ++. T Consensus 2 I~~~a~I~p~A~I~G~V~IG~~~~I~~~~~I~~d~~~i~IG~~~~I~~~~~Ih--~~~~~~~~IG~-----------~v~ 68 (153) T cd04645 2 IDPSAFIAPNATVIGDVTLGEGSSVWFGAVLRGDVNPIRIGERTNIQDGSVLH--VDPGYPTIIGD-----------NVT 68 (153) T ss_pred CCCCEEECCCCEEECCEEECCCCEECCCCEEECCCCCEEECCCCEECCCCCCC--CCCCCCCEECC-----------CCE T ss_conf 18981999999997726998999996583842466635999998989685435--25999979899-----------989 Q ss_pred CCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC-CEECCCCCCCC Q ss_conf 4874555663988998997980883445775456298788798869989378398899899778-35766436986 Q gi|254780942|r 346 INHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG-SIITQDTPENS 420 (442) Q Consensus 346 i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag-~~v~~dv~~~~ 420 (442) +.|.+.+.+|.||++|.||+++++.. ++.||++|.||+++++.++..|.++++++.. +-+-|++++.. T Consensus 69 Ig~~~~i~g~~Ig~~~~IG~~a~I~~-------gv~IG~~~vIgagsvV~~~~~vp~~~v~~G~PAr~ir~~~~~e 137 (153) T cd04645 69 VGHGAVLHGCTIGDNCLIGMGAIILD-------GAVIGKGSIVAAGSLVPPGKVIPPGSLVAGSPAKVVRELTDEE 137 (153) T ss_pred ECCCCEEECEEEEEEEEECCCCEECC-------CCEECCCCEECCCCEECCCCEECCCCEEEECCCEEECCCCHHH T ss_conf 99995992418730047705439856-------9998899799449897489796998599962837971599999 No 126 >cd02503 MobA MobA catalyzes the formation of molybdopterin guanine dinucleotide. The prokaryotic enzyme molybdopterin-guanine dinucleotide biosynthesis protein A (MobA). All mononuclear molybdoenzymes bind molybdenum in complex with an organic cofactor termed molybdopterin (MPT). In many bacteria, including Escherichia coli, molybdopterin can be further modified by attachment of a GMP group to the terminal phosphate of molybdopterin to form molybdopterin guanine dinucleotide (MGD). This GMP attachment step is catalyzed by MobA, by linking a guanosine 5'-phosphate to MPT forming molybdopterin guanine dinucleotide. This reaction requires GTP, MgCl2, and the MPT form of the cofactor. It is a reaction unique to prokaryotes, and therefore may represent a potential drug target. Probab=99.65 E-value=8.3e-16 Score=108.39 Aligned_cols=114 Identities=24% Similarity=0.420 Sum_probs=87.5 Q ss_pred CEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEE-CCCCCCH Q ss_conf 089994588772327888744125884857999999999779977999957988999997302697289998-5998861 Q gi|254780942|r 5 RLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYI-QDCQQGT 83 (442) Q Consensus 5 ~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~-q~~~~GT 83 (442) ..|||||||+++||. .+|+|+++.|+|+++|+++.+... +.++++++++..+.. . ..++.++. +.+..|. T Consensus 1 i~aiILAgG~S~RmG--~~K~ll~~~g~~li~~~~~~l~~~-~~~v~vv~~~~~~~~----~--~~~~~~i~d~~~~~GP 71 (181) T cd02503 1 ITGVILAGGKSRRMG--GDKALLELGGKPLLEHVLERLKPL-VDEVVISANRDQERY----A--LLGVPVIPDEPPGKGP 71 (181) T ss_pred CEEEEECCCCCCCCC--CCCCCCEECCCCHHHHHHHHHHHH-CCCEEEEECCCHHHH----H--CCCCEEEECCCCCCCC T ss_conf 969997897734189--985177989840999999999864-680799917727766----3--1798699479978784 Q ss_pred HHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4466410000123446058985154111533220000001358610 Q gi|254780942|r 84 AHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSI 129 (442) Q Consensus 84 a~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~ 129 (442) ..+++.+++.+.. +.++|+.+|+|+++...+..|+++++++.++ T Consensus 72 l~gi~~~l~~~~~--~~~lv~~~D~P~i~~~~i~~L~~~~~~~~~~ 115 (181) T cd02503 72 LAGILAALRAAPA--DWVLVLACDMPFLPPELLERLLAAAEEGADA 115 (181) T ss_pred HHHHHHHHHHCCC--CEEEEEECCCCCCCHHHHHHHHHHHHCCCCE T ss_conf 8999999862787--7599964798988899999999973269978 No 127 >COG2171 DapD Tetrahydrodipicolinate N-succinyltransferase [Amino acid transport and metabolism] Probab=99.64 E-value=4.7e-16 Score=109.77 Aligned_cols=118 Identities=28% Similarity=0.393 Sum_probs=81.4 Q ss_pred CCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 84100122322344200021116313433-21211121002134432000002586443202310211247854487455 Q gi|254780942|r 273 DTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSY 351 (442) Q Consensus 273 ~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~ 351 (442) ++.|.|++.+..++.|++|+++.+.+++. |+.++.+|+|.-|+.+... T Consensus 108 g~RI~p~a~VR~ga~i~~gtvvM~~sfVNigA~~~~gtMVd~~as~G~~------------------------------- 156 (271) T COG2171 108 GVRIVPGAIVRLGAYIAKGTVVMPESFVNIGAGTGEGTMVDGRASVGSC------------------------------- 156 (271) T ss_pred CEEECCCCEEEECCEECCCCEECCCCEEEECCCCCCCEEEEEEEEEECC------------------------------- T ss_conf 6363676477605287799387555668777602752589615655131------------------------------- Q ss_pred CCCCEECCCCEECCCEEECC-CCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEE Q ss_conf 56639889989979808834-45775456298788798869989378398899899778357664369868981 Q gi|254780942|r 352 VGDSVVGKNVNIGAGTITCN-YDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFA 424 (442) Q Consensus 352 igd~~iG~~~~ig~g~i~~n-~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~ 424 (442) ++||+||.||+|+.+.. +.+...+++.|||||+||+|+.+.-|+.+|++|+|+||..|++|+|.+..+-+ T Consensus 157 ---a~VGkn~higgGa~I~GVLep~~a~Pv~IgdncliGAns~~veGV~vGdg~VV~aGv~I~~~tki~~~~~g 227 (271) T COG2171 157 ---AQVGKNSHIGGGASIGGVLEPLQANPVIIGDNCLIGANSEVVEGVIVGDGCVVAAGVFITQDTKIYDRVAG 227 (271) T ss_pred ---EEECCCCCCCCCCEEEEEECCCCCCCEEECCCCEECCCCCEEEEEEECCCCEEECCEEEECCCCEEEEECC T ss_conf ---79799845278636767734888788698776186245334861696789688545089379514774346 No 128 >cd04646 LbH_Dynactin_6 Dynactin 6 (or subunit p27): Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p27 is part of the pointed-end subcomplex in dynactin that also includes p25, p26, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain the imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.64 E-value=2.9e-16 Score=110.97 Aligned_cols=122 Identities=17% Similarity=0.224 Sum_probs=75.5 Q ss_pred EEEEEEECCCCCEECCCCCCCCC---CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCC Q ss_conf 94112331784100122322344---200021116313433212111210021344320000025864432023102112 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGV---SIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATI 340 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~---~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i 340 (442) +-.++.|++++.|+|+++|.+.. +||++ +.|.++|+|..... .....++.+.||++ .+| T Consensus 14 iiGdV~IG~~~sI~p~avIrgd~g~I~IG~~-----------~nIqd~~~I~~~~~--~~~~~g~~~~IG~~-----~~i 75 (164) T cd04646 14 IRGDVTIGPGTVVHPRATIIAEAGPIIIGEN-----------NIIEEQVTIVNKKP--KDPAEPKPMIIGSN-----NVF 75 (164) T ss_pred EEEEEEECCCCEECCCCEEEECCCCEEECCC-----------CEECCCEEEECCCC--CCCCCCCCEEECCC-----CEE T ss_conf 9821799999989957599814378898999-----------66669859941787--77789988798899-----888 Q ss_pred CCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCC Q ss_conf 4785448745556639889989979808834457754562987887988699893783988998997783576643 Q gi|254780942|r 341 KEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDT 416 (442) Q Consensus 341 ~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv 416 (442) +.|+.+ -.|.||+||.||.++++.+ +++||++|.||+++.+.++..|.+++++.......+.. T Consensus 76 ~~g~~i------h~~~IG~~~lIg~~a~V~~-------ga~Ig~~ciIgags~V~~~~~ip~~slv~G~pa~~~~~ 138 (164) T cd04646 76 EVGCKC------EALKIGNNNVFESKSFVGK-------NVIITDGCIIGAGCKLPSSEILPENTVIYGADCLRRTQ 138 (164) T ss_pred CCCEEE------ECCEECCCCEECCCCEECC-------CCEECCCCEECCCCCCCCCCEECCCEEEECCCEEEEEC T ss_conf 678199------5559767888968989999-------98999998999898928997929985997887478121 No 129 >TIGR02287 PaaY phenylacetic acid degradation protein PaaY; InterPro: IPR011974 Members of this family are located next to other genes organised into apparent operons for phenylacetic acid degradation . PaaY is located near the end of these gene clusters and often next to PaaX, a transcriptional regulator.. Probab=99.64 E-value=4.8e-16 Score=109.76 Aligned_cols=72 Identities=28% Similarity=0.491 Sum_probs=63.2 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC--CCCCCEEEECCCEEEE Q ss_conf 63988998997980883445775456298788798869989378398899899778357664--3698689813560770 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD--TPENSLVFARSRQIVK 431 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d--v~~~~~v~~~~~~~~~ 431 (442) |.++++|=.||-|.|+ |+|.||.|+.||.|+++|.|..||++|+|+|-|-|.-. +|+.+||+|.|-|+++ T Consensus 69 DTvVeenGHvGHgAiL--------HgC~vgrnaLvGMNAVVMDgAvige~sIVaA~aFVKAg~E~paq~Lv~GsPAkv~R 140 (193) T TIGR02287 69 DTVVEENGHVGHGAIL--------HGCRVGRNALVGMNAVVMDGAVIGERSIVAASAFVKAGAEIPAQVLVVGSPAKVVR 140 (193) T ss_pred CEEEECCCCCCCCEEE--------CCCEECCCCEECCCEEEECCCEECCCCCEEHHHHHHCCCCCCCCCEEECCCCEEEE T ss_conf 8676068823321235--------15463252311165066166466774120020334347347846126628730553 Q ss_pred CC Q ss_conf 77 Q gi|254780942|r 432 ED 433 (442) Q Consensus 432 ~~ 433 (442) += T Consensus 141 ~L 142 (193) T TIGR02287 141 EL 142 (193) T ss_pred CC T ss_conf 31 No 130 >PRK13627 carnitine operon protein CaiE; Provisional Probab=99.64 E-value=1e-15 Score=107.88 Aligned_cols=136 Identities=14% Similarity=0.233 Sum_probs=85.5 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 94112331784100122322344200021116313433212111210021344320000025864432023102112478 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) +++++.|.+++++-+.|.||+++.|+.+|+|.... ....||++|.|+++| +|.... + ..++|+++ T Consensus 13 i~~~afIap~A~viGdV~ig~~~sIw~~aVlRGD~--~~I~IG~~tNIQD~~------viH~~~--~-----~~~~IG~~ 77 (196) T PRK13627 13 VHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDY--GRLIVQAGANLQDGC------IMHGYC--D-----TDTIVGEN 77 (196) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCEEEECCC--CCEEECCCCEECCCC------EEECCC--C-----CCCEECCC T ss_conf 29983889999899849999998897770895688--717989986767983------781489--9-----98588898 Q ss_pred CCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEEC-CCCEECCCCCCCCE Q ss_conf 5448745556639889989979808834457754562987887988699893783988998997-78357664369868 Q gi|254780942|r 344 SKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVA-SGSIITQDTPENSL 421 (442) Q Consensus 344 ~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~-ag~~v~~dv~~~~~ 421 (442) +.|+|.+.+-.|.||+||.||.|+++.+ +++||++|.||+++.+.+|.+|.++.++. .=+-+.+++.+..+ T Consensus 78 vtIGH~ailhgc~Igd~~lIGmgavVld-------ga~Ig~~~iI~AgslV~~g~~ip~~~L~~G~PAk~vR~lt~~ei 149 (196) T PRK13627 78 GHIGHGAILHGCVIGRDALVGMNSVIMD-------GAVIGEESIVAAMSFVKAGFQGEKRQLLMGTPARAVRSVSDQEL 149 (196) T ss_pred EEECCCEEEEEEEEECCCEECCCCEEEC-------CCEECCCCEECCCCEECCCCCCCCCCEEEECCEEEEECCCHHHH T ss_conf 0587872996349948978846979928-------88999998997686457994938981998778168114999999 No 131 >PRK00560 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=99.64 E-value=3.4e-15 Score=104.90 Aligned_cols=185 Identities=18% Similarity=0.251 Sum_probs=114.4 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCEEEECCEE-HHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEEC-- Q ss_conf 987408999458877232788874412588485-79999999997799779999579889999973026972899985-- Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSSSSKVLQKIAGKP-MISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQ-- 77 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~~pKpLlpi~gkp-li~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q-- 77 (442) |+.+-.|||||||+++||. ++|+++++.|+| |++|+++.+... ++++++++..... . .+..++.. T Consensus 5 m~~~I~~vILAGGkSsRmG--~dKall~~~g~~~Lie~~~~~l~~~-~~~i~is~~~~~~---~------~~~~~i~d~~ 72 (196) T PRK00560 5 MIDNIPCVILAGGKSSRMG--ENKALLPFGSYSSLLEYQYTRLLKL-FKKVYISTKDKKF---E------FNAPFLLEKE 72 (196) T ss_pred CCCCCCEEEECCCCCCCCC--CCCCCCEECCCCHHHHHHHHHHHHH-CCEEEEECCCHHH---H------CCCCEEECCC T ss_conf 5368408997378722399--9862527699555999999998863-6768996481366---4------5998886578 Q ss_pred CCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCC Q ss_conf 99886144664100001234460589851541115332200000013586100100101221574404202333100100 Q gi|254780942|r 78 DCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIRE 157 (442) Q Consensus 78 ~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~E 157 (442) ....|.-.+|..++..+.. +.++++.+|+||++...+..|.+. .+.++++ ...+++. T Consensus 73 ~~~~GPL~Gi~saL~~~~~--~~~~v~~cD~Pfl~~~~i~~L~~~--~~~~~v~--------------~~~~~~~----- 129 (196) T PRK00560 73 SDLFSPLFGIHNAFLTLQT--PEIFFISVDTPFVSFESIKKLCGK--ENFSVTY--------------AKSPTKE----- 129 (196) T ss_pred CCCCCHHHHHHHHHHHCCC--CCEEEEECCCCCCCHHHHHHHHHC--CCCCEEE--------------EEECCCC----- T ss_conf 8988809999999985777--868999578888899999999722--6999899--------------9548901----- Q ss_pred CCCCCCCCCCCCCCCCCHHEECCHHHHHHHHH-CCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEC-HHHEECCCCCCCCC Q ss_conf 14667542100035560011125014677753-28887400111211489999730771799917-21430111210000 Q gi|254780942|r 158 ENDATDEERKIHYCNSGLMAIDGLYIMDWLLQ-IKKNKVSQEYYLTDIIEKARLDGKSIASIDVK-EQEVCGCNNRYELS 235 (442) Q Consensus 158 k~d~~~~~~~~~lin~GiY~f~~~~L~~~l~~-l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~-~~~~~gv~~~~~L~ 235 (442) + | + .++| +...+ +.+++ +.. |++-+.+.+ ++ .....+.+. ..++.-+|+|.||. T Consensus 130 ~----P------L--~aly--~~~~l-~~l~~~l~~----g~~~l~~~l----~~-~~~~~v~~~~~~~F~NiNTpeD~~ 185 (196) T PRK00560 130 H----Y------L--ISLW--HRSLL-NALNYALKT----QNYRLSDLV----KN-SSSQAVHFEDEEEFLNLNTLKDYE 185 (196) T ss_pred C----C------E--EEEE--CHHHH-HHHHHHHHC----CCCHHHHHH----HH-CCCEEEECCCCCCCCCCCCHHHHH T ss_conf 3----0------4--6785--28899-999999982----995199999----87-896897459987424689999999 Q ss_pred CCCCCCCCC Q ss_conf 001222221 Q gi|254780942|r 236 LIENIWQSR 244 (442) Q Consensus 236 ~~~~~~~~~ 244 (442) .+..++..+ T Consensus 186 ~A~~~lk~~ 194 (196) T PRK00560 186 LALQILKSR 194 (196) T ss_pred HHHHHHHHC T ss_conf 999998723 No 132 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=99.63 E-value=1.1e-15 Score=107.68 Aligned_cols=107 Identities=25% Similarity=0.446 Sum_probs=87.5 Q ss_pred CCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCC----CCC----C Q ss_conf 344200021116313433-2121112100213443200000258644320231021124785448745----556----6 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLS----YVG----D 354 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~----~ig----d 354 (442) .++.|+++|.|+|+++|. ++.+++++.||+|+-+. ++.|+++++.++.+-+.++.||+++.|+-.+ |=| - T Consensus 319 e~s~vg~~~~VGPfA~LRPg~~L~~~~hIGNFVEvK-~a~ig~gsKa~HLtYlGDA~iG~~~NiGAGtItcNYDG~nK~~ 397 (460) T COG1207 319 EGSTVGEGATVGPFARLRPGAVLGADVHIGNFVEVK-KATIGKGSKAGHLTYLGDAEIGENVNIGAGTITCNYDGKNKFK 397 (460) T ss_pred ECCEECCCCCCCCCCCCCCCCCCCCCCEEEEEEEEE-CCCCCCCCCCCCEEEECCCEECCCCEECCCEEEECCCCCCCCE T ss_conf 055854796337831017968526797674259984-3560688633532342133206874214425998478863320 Q ss_pred CEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCE Q ss_conf 39889989979808834457754562987887988699893783 Q gi|254780942|r 355 SVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPI 398 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv 398 (442) .+||++++||+.+.+.- ++.|||++.||++++|-..| T Consensus 398 T~IGd~vFiGSns~LVA-------PV~IGd~a~iaAGStIT~DV 434 (460) T COG1207 398 TIIGDNVFIGSNSQLVA-------PVTIGDGATIAAGSTITKDV 434 (460) T ss_pred EEECCCCEECCCCCEEE-------EEEECCCCEECCCCEECCCC T ss_conf 06468857766871886-------48966984881463683657 No 133 >KOG4750 consensus Probab=99.63 E-value=5.8e-16 Score=109.30 Aligned_cols=53 Identities=28% Similarity=0.646 Sum_probs=48.3 Q ss_pred CCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEE Q ss_conf 562987887988699893783988998997783576643698689813560770 Q gi|254780942|r 378 YKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVK 431 (442) Q Consensus 378 ~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~ 431 (442) |+ .|||||+||++++|+++|+||++++|+|||+|++||||+.+++|.|...+. T Consensus 200 hP-~Igd~vliGaGvtILgnV~IGegavIaAGsvV~kDVP~~~~AvGnPAklIg 252 (269) T KOG4750 200 HP-KIGDNVLIGAGVTILGNVTIGEGAVIAAGSVVLKDVPPNTLAVGNPAKLIG 252 (269) T ss_pred CC-CCCCCEEECCCCEEECCEEECCCCEEECCCEEEECCCCCCEECCCCHHHCC T ss_conf 88-556770875541785781677886773050697216987530488233234 No 134 >pfam02348 CTP_transf_3 Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <= diphosphate + CMP-3-deoxy-D-manno-octulosonate, 2) acylneuraminate cytidylyltransferase EC:2.7.7.43,, catalysing the reaction:- CTP + N-acylneuraminate <= diphosphate + CMP-N-acylneuraminate. NeuAc cytydilyltransferase of Mannheimia haemolytica has been characterized describing kinetics and regulation by substrate charge, energetic charge and amino-sugar demand. Probab=99.63 E-value=7.2e-16 Score=108.73 Aligned_cols=176 Identities=19% Similarity=0.249 Sum_probs=112.9 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHH Q ss_conf 89994588772327888744125884857999999999779-97799995798899999730269728999859988614 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAG-IENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTA 84 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g-~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa 84 (442) ++||.|=+.+||| |. |||.+|+|+|||+|+++.+.+++ +++++|.| ..+++.+.....+.++.....+...||. T Consensus 1 i~iIpAR~~S~Rl-p~--K~L~~i~gkpmI~~v~~~a~~s~~~~~viVaT--d~~~I~~~~~~~~~~~i~t~~~~~~gt~ 75 (197) T pfam02348 1 IAIIPARSGSKRL-PG--KNLLKLGGKPLIARVIEAALQSKLFDKVVVAT--DSEEIADIAEKFGGGVVVRRGSLASGTD 75 (197) T ss_pred CEEECCCCCCCCC-CC--CCCHHHCCCCHHHHHHHHHHHCCCCCEEEEEC--CCCEEEEECCCCCCCEEEEEECCCCCCC T ss_conf 9898458888789-99--62035299068999999999758987289954--8301442012323313576402356521 Q ss_pred HHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC--CCCEEEE------CCCCCCCCEEEECCCCCEECC Q ss_conf 46641000012344605898515411153322000000135861--0010010------122157440420233310010 Q gi|254780942|r 85 HAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYS--IAVVGFN------ADNPKGYGRLLIKNNEIIAIR 156 (442) Q Consensus 85 ~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~--~~i~~~~------~~dp~~yGvV~~d~~~v~~i~ 156 (442) . +..+.+.+....+-++-+.||.||+++..+..+++.+.++.. .+.+... ..+|+..-++..++++..-|. T Consensus 76 R-~~ea~~~~~~~~d~ivnvqgD~Pli~p~~i~~~i~~~~~~~~~~~~t~~~~~~~~~~~~~~n~vKvv~~~~~~alyfs 154 (197) T pfam02348 76 R-FIEAVKAFLADEEIIVNLQGDEPLLQPTSILRAIEHLREAGEDYVSTLVKPVVSSEEILNPNPLKVVLDDAGYALYFS 154 (197) T ss_pred H-HHHHHHHHCCHHEEEEECCCCEEECCHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHCCCCCCEEEECCCCCEEECC T ss_conf 1-999998643410099965896687588999999999985898728982047378889339987499988899882413 Q ss_pred CCCCCCC----CCCCCCCCCCCHHEECCHHHHHHH Q ss_conf 0146675----421000355600111250146777 Q gi|254780942|r 157 EENDATD----EERKIHYCNSGLMAIDGLYIMDWL 187 (442) Q Consensus 157 Ek~d~~~----~~~~~~lin~GiY~f~~~~L~~~l 187 (442) ..+-+.. ......+-+.|+|.|+.+.|..|. T Consensus 155 R~~iP~~~~~~~~~~~~~~h~gIY~f~~~~L~~f~ 189 (197) T pfam02348 155 RSPIPYIRDHPAPLQLYLRHKGIYAFRKEHLIRYV 189 (197) T ss_pred CCCCCCCCCCCCCCCCCCEEEEEEEECHHHHHHHH T ss_conf 78897431232557875489999898899999987 No 135 >COG1043 LpxA Acyl-[acyl carrier protein] Probab=99.61 E-value=1.4e-15 Score=107.17 Aligned_cols=147 Identities=22% Similarity=0.242 Sum_probs=60.0 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCC------------CCCC Q ss_conf 94112331784100122322344200021116313433-2121112100213443200000258------------6443 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKN------------VRIG 330 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~------------~~Ig 330 (442) |+|.+.|.+++.|+++|.||+-|.||++++|+.++.|. .++|.-.++||.+++|.+.++||+. +.|| T Consensus 6 IHPTAiIe~gA~ig~~V~IGpf~iIg~~V~ig~~t~l~shvvv~G~T~IG~~n~I~~~A~iG~~pQdlKykge~T~l~IG 85 (260) T COG1043 6 IHPTAIIEPGAEIGEDVKIGPFCIIGPNVEIGDGTVLKSHVVVEGHTTIGRNNRIFPFASIGEDPQDLKYKGEPTRLIIG 85 (260) T ss_pred CCCCEEECCCCCCCCCCEECCEEEECCCCEECCCCEECCCEEEECCCEECCCCEEECCCCCCCCCCCCCCCCCCEEEEEC T ss_conf 57621527878828998888337988971988995883437996770777898791110428987545027984279988 Q ss_pred CEEEEECCCCCCCCCCCCCCCC--CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECC Q ss_conf 2023102112478544874555--66398899899798088344577545629878879886998937839889989977 Q gi|254780942|r 331 NFCEVKKATIKEGSKINHLSYV--GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVAS 408 (442) Q Consensus 331 ~~~ei~~s~i~~g~~i~~~~~i--gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~a 408 (442) +.++| .+++.++..+.- |-..||+|+.|.+.+=++ |.|+||++|..-.|+++.-.|.|||++++|. T Consensus 86 ~~n~I-----RE~vTi~~GT~~g~g~T~IGdnnl~May~HVA-------HDC~iGn~~ilaNnatLAGHV~igD~aiiGG 153 (260) T COG1043 86 DNNTI-----REFVTIHRGTVQGGGVTRIGDNNLIMAYAHVA-------HDCVIGNNCILANNATLAGHVEVGDYAIIGG 153 (260) T ss_pred CCCEE-----EEEEEEECCCCCCCEEEEECCCCEEEEEEEEE-------CCCEECCCEEEECCCEEECEEEECCEEEECC T ss_conf 99758-----66889861544786159977887898731031-------3444567179966866854078877899747 Q ss_pred CCEECCC--CCCCCEE Q ss_conf 8357664--3698689 Q gi|254780942|r 409 GSIITQD--TPENSLV 422 (442) Q Consensus 409 g~~v~~d--v~~~~~v 422 (442) .|.|.+= +-+++|+ T Consensus 154 ~saVHQF~rIG~~ami 169 (260) T COG1043 154 LSAVHQFVRIGAHAMI 169 (260) T ss_pred CCEEEEEEEECCHHEE T ss_conf 5237778897401123 No 136 >COG1045 CysE Serine acetyltransferase [Amino acid transport and metabolism] Probab=99.61 E-value=4.8e-15 Score=104.07 Aligned_cols=56 Identities=21% Similarity=0.511 Sum_probs=51.1 Q ss_pred CCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEECC Q ss_conf 56298788798869989378398899899778357664369868981356077077 Q gi|254780942|r 378 YKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKED 433 (442) Q Consensus 378 ~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~~ 433 (442) .+.+||++|.||+++.++-.++||+++.|||||+|.+|||+++.|+|.|-+++... T Consensus 118 RhPtIg~~V~IGagAkILG~I~IGd~a~IGA~sVVlkdVP~~~tvvGvPArii~~~ 173 (194) T COG1045 118 RHPTIGNGVYIGAGAKILGNIEIGDNAKIGAGSVVLKDVPPNATVVGVPARVIGRP 173 (194) T ss_pred CCCCCCCCEEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEECCCCEEECCC T ss_conf 79831788598999889716688789788878668157899966866764685357 No 137 >COG0746 MobA Molybdopterin-guanine dinucleotide biosynthesis protein A [Coenzyme metabolism] Probab=99.61 E-value=5.8e-15 Score=103.58 Aligned_cols=115 Identities=18% Similarity=0.304 Sum_probs=86.3 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 98740899945887723278887441258848579999999997799779999579889999973026972899985998 Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQ 80 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~ 80 (442) |...+.+||||||+++|| .+|+|+++.|+||++|+++.|....- .+++....+.++ +. ..+ +.++....+ T Consensus 1 ~~~~~~~vILAGG~srRm---~dK~l~~~~g~~lie~v~~~L~~~~~-~vvi~~~~~~~~---~~-~~g--~~vv~D~~~ 70 (192) T COG0746 1 MMTPMTGVILAGGKSRRM---RDKALLPLNGRPLIEHVIDRLRPQVD-VVVISANRNQGR---YA-EFG--LPVVPDELP 70 (192) T ss_pred CCCCCEEEEECCCCCCCC---CCCEEEEECCEEHHHHHHHHHHCCCC-EEEEECCCCHHH---HH-CCC--CCEEECCCC T ss_conf 998736999778753567---88745787982899999998740188-799976873424---43-169--857547888 Q ss_pred C-CHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8-6144664100001234460589851541115332200000013586 Q gi|254780942|r 81 Q-GTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGY 127 (442) Q Consensus 81 ~-GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~ 127 (442) . |.-.+++.++++.. .+.++++.||+|+++.+.++.+....++.. T Consensus 71 ~~GPL~Gi~~al~~~~--~~~~~v~~~D~P~i~~~lv~~l~~~~~~~~ 116 (192) T COG0746 71 GFGPLAGILAALRHFG--TEWVLVLPCDMPFIPPELVERLLSAFKQTG 116 (192) T ss_pred CCCCHHHHHHHHHHCC--CCEEEEEECCCCCCCHHHHHHHHHHHCCCC T ss_conf 8887899999998579--875999816778789999999998623478 No 138 >TIGR03570 NeuD_NnaD sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family. These proteins contain repeats of the bacterial transferase hexapeptide (pfam00132), although often these do not register above the trusted cutoff. Probab=99.60 E-value=1.6e-15 Score=106.82 Aligned_cols=102 Identities=26% Similarity=0.376 Sum_probs=60.8 Q ss_pred CEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEE Q ss_conf 10211527994112331784100122322344200021116313433212111210021344320000025864432023 Q gi|254780942|r 255 TMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCE 334 (442) Q Consensus 255 ~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~e 334 (442) ++++|. +++++++.|++++.|.|++.|+.++.||++|.|.++| .|+++|.||+|++|.+++.++.+|+|| T Consensus 88 ~lIhp~-a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~-----~I~Hd~~ig~~~~i~~~~~i~G~v~Ig---- 157 (201) T TIGR03570 88 TLIHPS-AIVSPSASIGEGTVIMAGAVINPDVRIGDNVIINTGA-----IVEHDCVIGDYVHIAPGVTLSGGVVIG---- 157 (201) T ss_pred EEECCC-CEECCCCEECCCCEEECCCEECCCCEEEEEEEECCCC-----CCCCCCEECCCCEECCCCEECCCCEEC---- T ss_conf 999998-0988986787998996286987664780267980423-----214425477876899985998988797---- Q ss_pred EECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECC Q ss_conf 102112478544874555663988998997980883445775456298788798869989378 Q gi|254780942|r 335 VKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAP 397 (442) Q Consensus 335 i~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~g 397 (442) ++|.||+++++.. +..||+++.||+++++... T Consensus 158 ------------------------~~~~iG~~~~i~~-------~i~Ig~~~~Igags~V~~~ 189 (201) T TIGR03570 158 ------------------------EGVFIGAGATIIQ-------GVTIGAGAIVGAGAVVTKD 189 (201) T ss_pred ------------------------CCCEECCCCEECC-------CCEECCCCEECCCCEEECC T ss_conf ------------------------7735989889889-------9899999999979889043 No 139 >cd03360 LbH_AT_putative Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain. The alignment contains 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. A few members are identified as NeuD, a sialic acid (Sia) O-acetyltransferase that is required for Sia synthesis and surface polysaccharide sialylation. Probab=99.58 E-value=2e-15 Score=106.21 Aligned_cols=99 Identities=28% Similarity=0.411 Sum_probs=61.5 Q ss_pred CCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEE Q ss_conf 21021152799411233178410012232234420002111631343321211121002134432000002586443202 Q gi|254780942|r 254 VTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFC 333 (442) Q Consensus 254 v~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ 333 (442) .++++|. .++++++.|++++.|.|++.|+.++.||++|.|.+++ .|+++|.||+||+|.++++|..+|+||++ T Consensus 84 ~niIhp~-a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~-----~i~Hd~~ig~~~~i~~~~~i~g~v~Ig~~- 156 (197) T cd03360 84 ATLIHPS-AVVSPSAVIGEGCVIMAGAVINPDARIGDNVIINTGA-----VIGHDCVIGDFVHIAPGVVLSGGVTIGEG- 156 (197) T ss_pred EEEECCC-CEECCCCEECCCCEEECCCEECCCCCCCCEEEECCCH-----HCCCCCCCCCCCEECCCEEECCCEEEEEE- T ss_conf 8999999-5987774799942993175986565304368987700-----10531501310106672367783089121- Q ss_pred EEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEE Q ss_conf 31021124785448745556639889989979808 Q gi|254780942|r 334 EVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTI 368 (442) Q Consensus 334 ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i 368 (442) +.|+.++.+.+ +..||++|.||||++ T Consensus 157 ----~~iG~~~~i~~-----~i~Ig~~~~igags~ 182 (197) T cd03360 157 ----AFIGAGATIIQ-----GVTIGAGAIIGAGAV 182 (197) T ss_pred ----CCCCCCCEECC-----CCEECCCCEECCCCE T ss_conf ----62989989879-----989999999997998 No 140 >cd00208 LbetaH Left-handed parallel beta-Helix (LbetaH or LbH) domain: The alignment contains 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity, however, some subfamilies in this hierarchy also show activities related to ion transport or translation initiation. Many are trimeric in their active forms. Probab=99.55 E-value=2.3e-14 Score=100.22 Aligned_cols=59 Identities=32% Similarity=0.557 Sum_probs=43.6 Q ss_pred CEECCCCEECCCEEECCCC-CCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEEC Q ss_conf 3988998997980883445-7754562987887988699893783988998997783576 Q gi|254780942|r 355 SVVGKNVNIGAGTITCNYD-GTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIIT 413 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~~-g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~ 413 (442) +.||++|+|++++++.+.. .....++.||++|+||+|++|+|+++||++|+|+|||+|| T Consensus 19 v~IG~~~~I~~~~~I~~~~~~~~~~~~~Ig~~v~Ig~~a~I~~gv~IGdn~~IgagavVt 78 (78) T cd00208 19 VVIGDNVNIGPGAVIGAATGPNEKNPTIIGDNVEIGANAVIHGGVKIGDNAVIGAGAVVT 78 (78) T ss_pred EEECCCCEECCCCEEEECCCCCCCCCCEECCCEEECCCCEEECCEEECCCCEECCCCEEC T ss_conf 699999989999899637988667898999996998689897981999999999885699 No 141 >PRK11132 cysE serine acetyltransferase; Provisional Probab=99.54 E-value=2.5e-14 Score=99.98 Aligned_cols=54 Identities=24% Similarity=0.519 Sum_probs=49.4 Q ss_pred CCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEEECCCEEEEC Q ss_conf 629878879886998937839889989977835766436986898135607707 Q gi|254780942|r 379 KTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTPENSLVFARSRQIVKE 432 (442) Q Consensus 379 ~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~~~~~v~~~~~~~~~~ 432 (442) +..||++|.||+++.|+-+++||+++.|||||+|.+|||+++.|+|.|.+++.+ T Consensus 193 HP~i~~~v~igaga~iLG~i~iG~~a~igA~svVl~dvp~~~tv~GvPa~~vg~ 246 (273) T PRK11132 193 HPKIREGVMIGAGAKILGNIEVGRGAKIGAGSVVLQPVPPHTTAAGVPARIVGK 246 (273) T ss_pred CCCCCCCEEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEECCCCEEECC T ss_conf 980488848754888971858899988867866864379998796667203088 No 142 >KOG1461 consensus Probab=99.53 E-value=1.4e-14 Score=101.40 Aligned_cols=101 Identities=25% Similarity=0.347 Sum_probs=64.5 Q ss_pred CEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCE Q ss_conf 0021116313433-212111210021344320000025864432023102112478544874555663988998997980 Q gi|254780942|r 289 ENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGT 367 (442) Q Consensus 289 g~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~ 367 (442) ++.+++.+.+.+. +..||.++.||..+.|+ +|+||.+|+||..|.|++|.|+++ |.||+||.|. T Consensus 319 ~~dv~~~~~~~v~~~~~ig~gT~Ig~g~~I~-NSVIG~~c~IgsN~~I~~S~iw~~-----------v~Igdnc~I~--- 383 (673) T KOG1461 319 SPDVVLSHSVIVGANVVIGAGTKIGSGSKIS-NSVIGANCRIGSNVRIKNSFIWNN-----------VTIGDNCRID--- 383 (673) T ss_pred CCCCEEHHHCCCCCEEEECCCCCCCCCCEEE-CCEECCCCEECCCEEEEEEEEECC-----------CEECCCCEEE--- T ss_conf 7401000120124407951454126798555-226658977457458851366258-----------3887886573--- Q ss_pred EECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC Q ss_conf 883445775456298788798869989378398899899778357664 Q gi|254780942|r 368 ITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD 415 (442) Q Consensus 368 i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d 415 (442) +++|+|+|.|+.++.+.||+.+|.+.+++.+.++.++ T Consensus 384 -----------~aii~d~v~i~~~~~l~~g~vl~~~VVv~~~~~l~~n 420 (673) T KOG1461 384 -----------HAIICDDVKIGEGAILKPGSVLGFGVVVGRNFVLPKN 420 (673) T ss_pred -----------EEEEECCCEECCCCCCCCCCEEEEEEEECCCCCCCCC T ss_conf -----------4686058376788440788577521686798312454 No 143 >cd04652 LbH_eIF2B_gamma_C eIF-2B gamma subunit, C-terminal Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B gamma subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH domain with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=99.51 E-value=1.2e-14 Score=101.84 Aligned_cols=79 Identities=25% Similarity=0.386 Sum_probs=51.7 Q ss_pred CCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCE Q ss_conf 00021116313433212111210021344320000025864432023102112478544874555663988998997980 Q gi|254780942|r 288 IENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGT 367 (442) Q Consensus 288 Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~ 367 (442) ||+++.|++++.+++++||+||+||++++|+ +|+|+++++||+++.+++|+|++++.|+..+.+.||+||+++.+++|| T Consensus 2 IG~~~~Ig~~~~I~~SvIG~nc~Ig~~~~i~-~sii~~~v~I~~~~~i~~sIIg~~~~Ig~~~~l~~~vig~~~~V~~gt 80 (81) T cd04652 2 VGENTQVGEKTSIKRSVIGANCKIGKRVKIT-NCVIMDNVTIEDGCTLENCIIGNGAVIGEKCKLKDCLVGSGYRVEAGT 80 (81) T ss_pred CCCCCEECCCCEEECCEECCCCEECCCCEEE-CCEEECCCEECCCCEEECCEEECCCEECCCCEEECCEECCCCEECCCC T ss_conf 8798899999989679997999999996896-899969879899989974999097999999999373988998989898 No 144 >COG0663 PaaY Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only] Probab=99.49 E-value=6.1e-14 Score=97.77 Aligned_cols=120 Identities=26% Similarity=0.379 Sum_probs=78.8 Q ss_pred EEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 94112331784100122322344200021116313433212111210021344320000025864432023102112478 Q gi|254780942|r 264 LSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 264 i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) |++.+.|.+++.|-+.+.|+.++.|+.+++++... +.-.||++|.|+ ++++|.-... .-++|+++ T Consensus 14 i~~~a~Va~~A~viGdV~Ig~~vsIw~~aVlRgD~--~~I~IG~~tNIQ------Dg~ViH~~~~-------~p~~IG~~ 78 (176) T COG0663 14 IDPTAFVAPSATVIGDVRIGAGVSIWPGAVLRGDV--EPIRIGARTNIQ------DGVVIHADPG-------YPVTIGDD 78 (176) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCEEEECCC--CCEEECCCCEEC------CCEEEECCCC-------CCEEECCC T ss_conf 89862877999799859999997887763897467--756987885603------7838952799-------99297799 Q ss_pred CCCCCCCCC-CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 544874555-663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 344 SKINHLSYV-GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 344 ~~i~~~~~i-gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) +.|+|.+.+ | |.||++|.||-|.++.| ++.||++|.||+++.+.++.++.++.++ T Consensus 79 vtIGH~aivHG-c~Ig~~~lIGmgA~vld-------ga~IG~~~iVgAgalV~~~k~~p~~~L~ 134 (176) T COG0663 79 VTIGHGAVVHG-CTIGDNVLIGMGATVLD-------GAVIGDGSIVGAGALVTPGKEIPGGSLV 134 (176) T ss_pred CEECCCCEEEE-EEECCCCEEECCCEEEC-------CCEECCCCEECCCCCCCCCCCCCCCEEE T ss_conf 58857658988-59878858913866757-------8698898698668700588377997596 No 145 >cd03359 LbH_Dynactin_5 Dynactin 5 (or subunit p25); Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p25 is part of the pointed-end subcomplex in dynactin that also includes p26, p27, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.48 E-value=2.3e-13 Score=94.47 Aligned_cols=119 Identities=20% Similarity=0.266 Sum_probs=66.6 Q ss_pred EEEECCCCCEECCCCCCCC---CCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCC------CEEECCCCCCCCEEEEEC Q ss_conf 1233178410012232234---4200021116313433212111210021344320------000025864432023102 Q gi|254780942|r 267 DTIIQPDTVIEPHVFFGCG---VSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQ------ETTIEKNVRIGNFCEVKK 337 (442) Q Consensus 267 ~~~I~~~~~I~~~~~Ig~~---~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~------~s~i~~~~~Ig~~~ei~~ 337 (442) ++.|+.++.|.++|+|.++ .+||++|.|+++++| .|...... ...||++|.||+.+.+. T Consensus 21 nI~IG~~ssI~~~avirGD~~~I~IG~~~~I~d~~~I-----------~~~~~~~~~~~~~~p~~IGd~v~Ig~~~~i~- 88 (161) T cd03359 21 NIVLNGKTIIQSDVIIRGDLATVSIGRYCILSEGCVI-----------RPPFKKFSKGVAFFPLHIGDYVFIGENCVVN- 88 (161) T ss_pred CEEECCCCEECCCCEEECCCCCEEECCCCEECCCCEE-----------ECCCCCCCCCCCCCCCEECCCEEECCCEEEE- T ss_conf 7599998789999689678865499998588899899-----------2464435778888772848987989982884- Q ss_pred CCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC-CEECCCC Q ss_conf 112478544874555663988998997980883445775456298788798869989378398899899778-3576643 Q gi|254780942|r 338 ATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG-SIITQDT 416 (442) Q Consensus 338 s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag-~~v~~dv 416 (442) .|.||++|.||.++++.. +++||++|+||+++++.++..|.++++++.. +.+.+++ T Consensus 89 ----------------~~~Ig~~v~IG~~~~I~~-------g~~Ig~~~~I~~gsvV~~~~~ip~~~~~~G~Pak~i~~l 145 (161) T cd03359 89 ----------------AAQIGSYVHIGKNCVIGR-------RCIIKDCVKILDGTVVPPDTVIPPYSVVSGRPARFIGEL 145 (161) T ss_pred ----------------CCEECCCEEECCCCEECC-------CCEECCCCEECCCCEECCCCEECCCCEEEECCEEEEEEC T ss_conf ----------------889667715579819949-------979999989988989889989899819985180895769 Q ss_pred CCCC Q ss_conf 6986 Q gi|254780942|r 417 PENS 420 (442) Q Consensus 417 ~~~~ 420 (442) ++.. T Consensus 146 ~e~~ 149 (161) T cd03359 146 PECT 149 (161) T ss_pred CHHH T ss_conf 9999 No 146 >TIGR03308 phn_thr-fam phosphonate metabolim protein, transferase hexapeptide repeat family. This family of proteins contains copies of the Bacterial transferase hexapeptide repeat family (pfam00132) and is only found in operons encoding the phosphonate C-P lyase system (GenProp0232). Many C-P lyase operons, however, lack a homolog of this protein. Probab=99.48 E-value=1.7e-13 Score=95.23 Aligned_cols=130 Identities=17% Similarity=0.175 Sum_probs=81.8 Q ss_pred EEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCC Q ss_conf 12331784100122322344200021116313433212111210021344320000025864432023102112478544 Q gi|254780942|r 267 DTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKI 346 (442) Q Consensus 267 ~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i 346 (442) ++.+|.++.|++++.|. +|.||++++|+++|.+.++.||+.|.|+++++|.++..-.+.+....++-..+...+ +... T Consensus 19 ~~~lG~~v~Ig~~~~i~-ns~IGDySyI~~~~~I~n~~IGkfcsIa~~v~I~~~nHp~~~~s~~~f~y~~~~~~~-~~~~ 96 (204) T TIGR03308 19 ESKLGRYTEIGERTRLR-EVALGDYSYVMRDCDIIYTTIGKFCSIAAMVRINATNHPMERPTLHHFTYRAAMYFD-DASD 96 (204) T ss_pred CCCCCCCEEECCCCEEE-CCEECCCCEECCCCEEECCEECCCCEECCCCEECCCCCCCCCCEECCEEEECCCCCC-CCCC T ss_conf 27837833999995883-379999858889975988869998488898798987787667267777553753346-6666 Q ss_pred CCC--C--CCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEEC Q ss_conf 874--5--556639889989979808834457754562987887988699893783988998997 Q gi|254780942|r 347 NHL--S--YVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVA 407 (442) Q Consensus 347 ~~~--~--~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ 407 (442) ... . .-....||++|.||+++++.. ++.||+++.||+++++.. .|.++++++ T Consensus 97 ~~~~~~~~~~~~~~Ig~dvwiG~~~~i~~-------gv~IG~gavigagsvVtk--dv~~~~iv~ 152 (204) T TIGR03308 97 DADFFAWRRAKRVTIGHDVWIGHGAVILP-------GVTIGNGAVIAAGAVVTK--DVAPYTIVA 152 (204) T ss_pred CHHHHCCCCCCCEEECCCCEECCCCEECC-------CCEECCCCEECCCCEECC--CCCCCEEEE T ss_conf 12232133478709899768879989919-------979999989977998995--179984998 No 147 >cd03358 LbH_WxcM_N_like WcxM-like, Left-handed parallel beta-Helix (LbH) N-terminal domain: This group is composed of Xanthomonas campestris WcxM and proteins with similarity to the WcxM N-terminal domain. WcxM is thought to be bifunctional, catalyzing both the isomerization and transacetylation reactions of keto-hexoses. It contains an N-terminal LbH domain responsible for the transacetylation function and a C-terminal isomerase domain. The LbH domain contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), typical of enzymes with acyltransferase activity. Probab=99.43 E-value=2.9e-13 Score=93.88 Aligned_cols=109 Identities=22% Similarity=0.306 Sum_probs=62.4 Q ss_pred ECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCC Q ss_conf 31784100122322344200021116313433-21211121002134432000002586443202310211247854487 Q gi|254780942|r 270 IQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINH 348 (442) Q Consensus 270 I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~ 348 (442) ||+++.|++++.|++++.||++|.|++++.|. ++.||++|.||+++.+ ..+...+. . ..+ T Consensus 1 IG~~~~Ig~~~~I~~~~~IG~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~~------~~~~~~~~-------~------~~~ 61 (119) T cd03358 1 IGDNCIIGTNVFIENDVKIGDNVKIQSNVSIYEGVTIEDDVFIGPNVVF------TNDLYPRS-------K------IYR 61 (119) T ss_pred CCCCCEECCCCEECCCCEECCCEEECCCCCCCCCCEEEEEEEECCCCCC------CCCCCCCC-------C------CCC T ss_conf 9899899999899999999999699998636789998437797788633------04446676-------4------346 Q ss_pred CCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 4555663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 349 LSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 349 ~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) ....+...||++|.||+++++.. ++.||+++.||+++++... +.+++++ T Consensus 62 ~~~~~~v~Ig~~v~IG~~~~I~~-------gv~IG~~~~IgagSvV~~d--v~~~~i~ 110 (119) T cd03358 62 KWELKGTTVKRGASIGANATILP-------GVTIGEYALVGAGAVVTKD--VPPYALV 110 (119) T ss_pred CCCCCCCEECCCEEECCCCEEEC-------CEEECCCCEECCCCEECCC--CCCCCEE T ss_conf 76324839878838488747789-------9498999899779989036--5999799 No 148 >TIGR00466 kdsB 3-deoxy-D-manno-octulosonate cytidylyltransferase; InterPro: IPR004528 3-Deoxy-D-manno-octulosonate cytidylyltransferase (2.7.7.38 from EC) activates KDO, a required 8-carbon sugar, for incorporation into bacterial lipopolysaccharide in Gram negative bacteria. It acts as a homodimer and catalyses the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate and pyrophosphate.; GO: 0008690 3-deoxy-manno-octulosonate cytidylyltransferase activity, 0009103 lipopolysaccharide biosynthetic process, 0005737 cytoplasm. Probab=99.43 E-value=3.4e-12 Score=87.83 Aligned_cols=218 Identities=20% Similarity=0.308 Sum_probs=139.4 Q ss_pred EEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHHHH Q ss_conf 99945887723278887441258848579999999997799779999579889999973026972899985998861446 Q gi|254780942|r 7 AIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTAHA 86 (442) Q Consensus 7 AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa~A 86 (442) .||.|==.+||| --|||..|.|+|||.|.+++..+++.+++++.| +.|.|.+.....+.++-....+...||-.- T Consensus 5 vIIPAR~~SsRl---~~K~L~DI~G~PMi~~v~~~A~~s~~~~~ivA~--D~E~v~~~~~~~~~~~~~T~~~H~SGTERl 79 (246) T TIGR00466 5 VIIPARLASSRL---PGKPLEDIKGKPMIVHVLEKANESGADRVIVAT--DDESVAQTVKKFGIEVCLTSKDHNSGTERL 79 (246) T ss_pred EEECCCCCCCCC---CCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEC--CCHHHHHHHHHCCCEEEEECCCCCCCHHHH T ss_conf 985365311358---987203557893688999997523635689974--872699999972971898516678504689 Q ss_pred HHHCCCCCCCCCCE-EEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECC------CCCCCCEEEECCCCCEECC-- Q ss_conf 64100001234460-5898515411153322000000135-861001001012------2157440420233310010-- Q gi|254780942|r 87 VLTAQDAIKPGYDD-VIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNAD------NPKGYGRLLIKNNEIIAIR-- 156 (442) Q Consensus 87 i~~a~~~l~~~~~~-~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~------dp~~yGvV~~d~~~v~~i~-- 156 (442) +- +.+.+.-.++. ++=+.||.||+....+.++++...+ +...+.++.++- +|...-+|...+|+++=|- T Consensus 80 ~E-~~~~la~~~d~~i~N~QGDEP~~~~~ii~~~~~nLA~~~~~~~~~a~ki~d~E~~~~PN~VKVvlD~~~~~LYFSRs 158 (246) T TIGR00466 80 AE-VVEKLALKDDEVIVNLQGDEPLIPKEIIRQVADNLAKKNVPMAALAVKIHDAEEAFNPNAVKVVLDSEGYALYFSRS 158 (246) T ss_pred HH-HHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCEEEEECC T ss_conf 99-99984789885799820898765389999999988630401121122222121205997479997078867886402 Q ss_pred --C-CCCCC--CCCC-----CC-CCCCCCHHEEC-CHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH--HHCCEEEEEECH Q ss_conf --0-14667--5421-----00-03556001112-5014677753288874001112114899997--307717999172 Q gi|254780942|r 157 --E-ENDAT--DEER-----KI-HYCNSGLMAID-GLYIMDWLLQIKKNKVSQEYYLTDIIEKARL--DGKSIASIDVKE 222 (442) Q Consensus 157 --E-k~d~~--~~~~-----~~-~lin~GiY~f~-~~~L~~~l~~l~~~~~~gE~yltDii~~l~~--~g~~i~~~~~~~ 222 (442) . .+|.. ..+. .- -|-+.|||.|+ ..+|.+|...-+.. .. +. .-+++|+- .|.+|+.-.... T Consensus 159 ~IP~~RD~~~~~~~~~~~~~~D~~L~HiGIYgy~n~~~~~~~v~~~P~~-LE-~i---E~LEQLR~Lyy~~kI~V~~~~~ 233 (246) T TIGR00466 159 LIPFDRDRFAKRQEVQKVQLSDALLRHIGIYGYRNAGFIKEYVAWKPSV-LE-EI---EKLEQLRVLYYGEKIHVKIAQE 233 (246) T ss_pred CCCCCHHHHHHHCCCCEEEHHHHHHHHHCHHHCCHHHHHHHHHHCCCCC-HH-HH---HHHHHHHHHHCCCEEEEEEEEC T ss_conf 1542001244305510020113445442211100247999997228874-02-45---6678888876056368987503 Q ss_pred HHEECCCCCCCCC Q ss_conf 1430111210000 Q gi|254780942|r 223 QEVCGCNNRYELS 235 (442) Q Consensus 223 ~~~~gv~~~~~L~ 235 (442) ..-.|++++.||. T Consensus 234 ~P~vGvDT~EDL~ 246 (246) T TIGR00466 234 VPSVGVDTQEDLE 246 (246) T ss_pred CCCCCCCCHHHCC T ss_conf 7886778623149 No 149 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=99.42 E-value=9.6e-13 Score=90.96 Aligned_cols=42 Identities=17% Similarity=0.259 Sum_probs=19.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 3212111210021344320000025864432023102112478 Q gi|254780942|r 301 EGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 301 ~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) ++|+|+++|.||++|.|+ +|+|+++++||+++.|+||+|+++ T Consensus 307 ~nSvIg~~v~Ig~ga~I~-nSiI~~~~~Ig~~~~I~nsIi~~~ 348 (381) T PRK05293 307 KHSVLFQGVQVGEGSIVK-DSVIMPGAKIGENVVIERAIIGEN 348 (381) T ss_pred ECCEECCCCEECCCCEEE-CCEECCCCEECCCCEEEEEEECCC T ss_conf 548988998999999997-789859699998999901299597 No 150 >cd05787 LbH_eIF2B_epsilon eIF-2B epsilon subunit, central Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B epsilon subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold, a central LbH domain containing 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal domain of unknown function that is present in eIF-4 gamma, eIF-5, and eIF-2B epsilon. The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=99.42 E-value=1.2e-13 Score=96.15 Aligned_cols=55 Identities=22% Similarity=0.324 Sum_probs=25.7 Q ss_pred CCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 00021116313433212111210021344320000025864432023102112478 Q gi|254780942|r 288 IENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 288 Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) ||+++.|++++.+++++||+||+||+++.|. +|+|+++++||+++++++|+|+++ T Consensus 2 IG~~t~Ig~~~~I~~svIG~nc~Ig~~~~I~-~svi~d~~~Ig~~~~i~~siI~~~ 56 (79) T cd05787 2 IGRGTSIGEGTTIKNSVIGRNCKIGKNVVID-NSYIWDDVTIEDGCTIHHSIVADG 56 (79) T ss_pred CCCCCEECCCCEEECCEECCCCEECCCCEEE-CCEEECCCEECCCCEECCCEECCC T ss_conf 9998999999999599989999999998894-899919789999999908999798 No 151 >cd03356 LbH_G1P_AT_C_like Left-handed parallel beta-Helix (LbH) domain of a group of proteins with similarity to glucose-1-phosphate adenylyltransferase: Included in this family are glucose-1-phosphate adenylyltransferase, mannose-1-phosphate guanylyltransferase, and the eukaryotic translation initiation factor eIF-2B subunits, epsilon and gamma. Most members of this family contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold, followed by a LbH fold domain with at least 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). eIF-2B epsilon contains an additional domain of unknown function at the C-terminus. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.42 E-value=2.8e-13 Score=94.05 Aligned_cols=60 Identities=35% Similarity=0.550 Sum_probs=29.7 Q ss_pred CCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECC Q ss_conf 1112100213443200000258644320231021124785448745556639889989979 Q gi|254780942|r 305 IGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGA 365 (442) Q Consensus 305 Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~ 365 (442) ||++|.||+++.|+ .|+|+++|+||+.+++++|+|++++.|++.+++.+|+||+++.||+ T Consensus 2 IG~~~~Ig~~~~I~-~svIG~~c~Ig~~~~I~~Sii~~~~~Ig~~~~i~~sII~~~~~Ig~ 61 (79) T cd03356 2 IGESTVIGENAIIK-NSVIGDNVRIGDGVTITNSILMDNVTIGANSVIVDSIIGDNAVIGE 61 (79) T ss_pred CCCCCEECCCCEEE-CCEECCCCEECCCCEEEEEEEECCCEECCCCEEECCEECCCCEECC T ss_conf 98979999999995-9999999999999699610997979999999996999949899999 No 152 >COG2171 DapD Tetrahydrodipicolinate N-succinyltransferase [Amino acid transport and metabolism] Probab=99.41 E-value=1e-12 Score=90.75 Aligned_cols=114 Identities=11% Similarity=0.128 Sum_probs=64.3 Q ss_pred EEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCC-CCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCC Q ss_conf 9941123317841001223223442000211163134332-121112100213443200000258644320231021124 Q gi|254780942|r 263 FLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEG-VHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIK 341 (442) Q Consensus 263 ~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~-~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~ 341 (442) -|.+.+.+-.++.|++++++....+|-=|+.++.++.+|. .++|.-+.||.++||..|+.|+. ++. T Consensus 110 RI~p~a~VR~ga~i~~gtvvM~~sfVNigA~~~~gtMVd~~as~G~~a~VGkn~higgGa~I~G-------------VLe 176 (271) T COG2171 110 RIVPGAIVRLGAYIAKGTVVMPESFVNIGAGTGEGTMVDGRASVGSCAQVGKNSHIGGGASIGG-------------VLE 176 (271) T ss_pred EECCCCEEEECCEECCCCEECCCCEEEECCCCCCCEEEEEEEEEECCEEECCCCCCCCCCEEEE-------------EEC T ss_conf 6367647760528779938755566877760275258961565513179799845278636767-------------734 Q ss_pred CCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCC Q ss_conf 78544874555663988998997980883445775456298788798869989378398899 Q gi|254780942|r 342 EGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQG 403 (442) Q Consensus 342 ~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~ 403 (442) ....- ..+||+||.|||.+...- ++.+|++|.|++++.|.++++|..- T Consensus 177 p~~a~-------Pv~IgdncliGAns~~ve-------GV~vGdg~VV~aGv~I~~~tki~~~ 224 (271) T COG2171 177 PLQAN-------PVIIGDNCLIGANSEVVE-------GVIVGDGCVVAAGVFITQDTKIYDR 224 (271) T ss_pred CCCCC-------CEEECCCCEECCCCCEEE-------EEEECCCCEEECCEEEECCCCEEEE T ss_conf 88878-------869877618624533486-------1696789688545089379514774 No 153 >TIGR03532 DapD_Ac 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase. Alternate name: tetrahydrodipicolinate N-acetyltransferase. Note that IUBMB lists this alternate name as the accepted name. Unfortunately, the related succinyl transferase acting on the same substrate (EC:2.3.1.117, TIGR00695) uses the opposite standard. We have decided to give these two enzymes names which more clearly indicated that they act on the same substrate. Probab=99.40 E-value=1.2e-12 Score=90.49 Aligned_cols=111 Identities=19% Similarity=0.274 Sum_probs=70.1 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCC Q ss_conf 79941123317841001223223442000211163134332121112100213443200000258644320231021124 Q gi|254780942|r 262 VFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIK 341 (442) Q Consensus 262 ~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~ 341 (442) ..|...+.||++++|.++++|.-++.||+++.|..++ ++|..|.||.||||.+|++|+. ++. T Consensus 93 aiir~~v~IG~~aVImmga~INiGA~IGegtmId~~A-----vvg~~a~IGk~~HisaGa~laG-------------Vle 154 (231) T TIGR03532 93 AIIRDQVIIGDNAVIMMGAVINIGAEIGEGTMIDMNA-----VLGGRATVGKNVHIGAGAVLAG-------------VIE 154 (231) T ss_pred CEEECCEEECCCEEEECCCEEECCCEECCCCEECCCC-----EECCCCEECCCCEECCCCEEEC-------------CCC T ss_conf 5884252987983991388870485987894580585-----6445468879705889864300-------------225 Q ss_pred CCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 78544874555663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 342 EGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 342 ~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) +-+ -..++||+||.||+++++.. ++.||+++.||+++++... +.+++++ T Consensus 155 P~~-------a~PViIeDnV~IGAnAvIl~-------GV~IG~gsVIgAGsVVtkd--VP~~sVV 203 (231) T TIGR03532 155 PPS-------AKPVVIEDNVLIGANAVILE-------GVRVGKGAVVAAGAIVTED--VPPNTVV 203 (231) T ss_pred CCC-------CCCEEECCCEEECCCCEECC-------CCEECCCCEECCCCEECCC--CCCCCEE T ss_conf 766-------79979878859879979919-------9899999999849889565--7999799 No 154 >cd05787 LbH_eIF2B_epsilon eIF-2B epsilon subunit, central Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B epsilon subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold, a central LbH domain containing 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal domain of unknown function that is present in eIF-4 gamma, eIF-5, and eIF-2B epsilon. The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=99.40 E-value=8.7e-13 Score=91.21 Aligned_cols=78 Identities=22% Similarity=0.355 Sum_probs=53.4 Q ss_pred CCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEEC Q ss_conf 21112100213443200000258644320231021124785448745556639889989979808834457754562987 Q gi|254780942|r 304 HIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHIN 383 (442) Q Consensus 304 ~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig 383 (442) +||+++.||+++.| ++|+|+++|+||+.+.+++|+|++++ .||++|.|- +++|+ T Consensus 1 vIG~~t~Ig~~~~I-~~svIG~nc~Ig~~~~I~~svi~d~~-----------~Ig~~~~i~--------------~siI~ 54 (79) T cd05787 1 VIGRGTSIGEGTTI-KNSVIGRNCKIGKNVVIDNSYIWDDV-----------TIEDGCTIH--------------HSIVA 54 (79) T ss_pred CCCCCCEECCCCEE-ECCEECCCCEECCCCEEECCEEECCC-----------EECCCCEEC--------------CCEEC T ss_conf 99998999999999-59998999999999889489991978-----------999999990--------------89997 Q ss_pred CCCEECCCCEEECCEEECCCCEEC Q ss_conf 887988699893783988998997 Q gi|254780942|r 384 ENAFIGSNSSLIAPITIGQGTYVA 407 (442) Q Consensus 384 ~~~~iG~~~~i~~gv~ig~~~~i~ 407 (442) ++|.||.++.+.+|..||+++.|| T Consensus 55 ~~~~Ig~~~~i~~g~vIg~~~~Ig 78 (79) T cd05787 55 DGAVIGKGCTIPPGSLISFGVVIG 78 (79) T ss_pred CCCEECCCCEECCCCEECCCCEEC T ss_conf 989999999999999997997969 No 155 >cd05824 LbH_M1P_guanylylT_C Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.38 E-value=5.3e-13 Score=92.43 Aligned_cols=47 Identities=17% Similarity=0.157 Sum_probs=18.1 Q ss_pred CCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 344200021116313433212111210021344320000025864432 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGN 331 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~ 331 (442) +++.||++|+|++++.|++|+|.++++|+++++|. +|+||+++.||. T Consensus 16 p~vvIG~nv~IG~gv~I~~~vI~~~~~I~~~s~i~-~siIG~~~~iG~ 62 (80) T cd05824 16 PNVVIGPNVTIGDGVRLQRCVILSNSTVRDHSWVK-SSIVGWNSTVGR 62 (80) T ss_pred CCCEECCCCEECCCCEEEEEEEECCCEECCCCEEE-CCEECCCCCCCC T ss_conf 98589999999999799730895789998986997-899857970489 No 156 >cd02513 CMP-NeuAc_Synthase CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety. CMP-N-acetylneuraminic acid synthetase (CMP-NeuAc synthetase) or acylneuraminate cytidylyltransferase catalyzes the transfer the CMP moiety of CTP to the anomeric hydroxyl group of NeuAc in the presence of Mg++. It is the second to last step in the sialylation of the oligosaccharide component of glycoconjugates by providing the activated sugar-nucleotide cytidine 5'-monophosphate N-acetylneuraminic acid (CMP-Neu5Ac), the substrate for sialyltransferases. Eukaryotic CMP-NeuAc synthetases are predominantly located in the nucleus. The activated CMP-Neu5Ac diffuses from the nucleus into the cytoplasm. Probab=99.36 E-value=6e-12 Score=86.44 Aligned_cols=210 Identities=16% Similarity=0.225 Sum_probs=126.7 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCHHHHHHHHCCCCCEEEEEECC---- Q ss_conf 4089994588772327888744125884857999999999779-97799995798899999730269728999859---- Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAG-IENVALVLGYGAEEITRINFPPTLSVEYYIQD---- 78 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g-~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~---- 78 (442) +.+|+|+|=|-.+|+ |. |.+.+++|+||++|+++.+.+++ +++|+|.| +.+++.+.....+..+-+-..+ T Consensus 1 Ki~aiIpAR~gSkri-~~--KN~~~i~gkpLi~~~i~~a~ks~~~d~IivST--Ds~~i~~~~~k~~~~~~~~Rpk~ls~ 75 (223) T cd02513 1 KILAIIPARGGSKGI-PG--KNIRPLGGKPLIAWTIEAALESKLFDRVVVST--DDEEIAEVARKYGAEVPFLRPAELAT 75 (223) T ss_pred CEEEEECCCCCCCCC-CC--HHHHHHCCCCHHHHHHHHHHHCCCCCEEEEEC--CHHHHHHHHCCCCCEEEECCCHHHCC T ss_conf 989998678778789-97--00446389596999999999669955078955--87897542124564179738468768 Q ss_pred CCCCHHHHHHHCCCCCCC---CCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCCCCCCEEEECCCCCEE Q ss_conf 988614466410000123---44605898515411153322000000135-86100100101221574404202333100 Q gi|254780942|r 79 CQQGTAHAVLTAQDAIKP---GYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADNPKGYGRLLIKNNEIIA 154 (442) Q Consensus 79 ~~~GTa~Ai~~a~~~l~~---~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~dp~~yGvV~~d~~~v~~ 154 (442) ..-.+-+.+..+.+.+.. ..|.++.+..+.||.+..++..+++.+.+ +.+..+.+.+..++-.+-.. ..++.... T Consensus 76 d~~~~~dvi~~~l~~~~~~~~~~d~iv~l~pTsP~r~~~~I~~~i~~~~~~k~dsl~sv~~~~~~~~~~~~-~~~~~~~~ 154 (223) T cd02513 76 DTASSIDVILHALDQLEELGRDFDIVVLLQPTSPLRSAEDIDEAIELLLSEGADSVFSVTEFHRFPWRALG-LDDNGLEP 154 (223) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHEE-ECCCCCCC T ss_conf 86780677778998899868998999996279762789999999999985799999999854768188567-62687021 Q ss_pred CCCCC-CCCCCC--CCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCC Q ss_conf 10014-667542--100035560011125014677753288874001112114899997307717999172143011121 Q gi|254780942|r 155 IREEN-DATDEE--RKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNR 231 (442) Q Consensus 155 i~Ek~-d~~~~~--~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~ 231 (442) +..++ .....| ...-..+.++|+++.+.+... .. . -|.++..|..+..+..++++. T Consensus 155 ~~~~~~~~~~rQ~~~~~y~~~g~~yi~~~~~~~k~----~~-----------~------~~~k~~~~~~~~~~siDID~~ 213 (223) T cd02513 155 VNYPEDKRTRRQDLPPAYHENGAIYIAKREALLES----NS-----------F------FGGKTGPYEMPRERSIDIDTE 213 (223) T ss_pred CCCCHHHCCCCCCCCEEEEECCEEEEEEHHHHHHC----CC-----------C------CCCCEEEEEECCCCEECCCCH T ss_conf 36861106764467604786048999999999854----98-----------0------389779999488716848999 Q ss_pred CCCCCCCCC Q ss_conf 000000122 Q gi|254780942|r 232 YELSLIENI 240 (442) Q Consensus 232 ~~L~~~~~~ 240 (442) .||..++.+ T Consensus 214 ~Dl~~Ae~l 222 (223) T cd02513 214 EDFELAEAL 222 (223) T ss_pred HHHHHHHHH T ss_conf 999999986 No 157 >cd04652 LbH_eIF2B_gamma_C eIF-2B gamma subunit, C-terminal Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B gamma subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH domain with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=99.35 E-value=1.6e-12 Score=89.77 Aligned_cols=80 Identities=24% Similarity=0.408 Sum_probs=54.2 Q ss_pred CCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEEC Q ss_conf 21112100213443200000258644320231021124785448745556639889989979808834457754562987 Q gi|254780942|r 304 HIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHIN 383 (442) Q Consensus 304 ~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig 383 (442) .||+++.||+++.| ++|+|+++|+||..+++++|+|+++++|...+.+.+ ++|+ T Consensus 1 vIG~~~~Ig~~~~I-~~SvIG~nc~Ig~~~~i~~sii~~~v~I~~~~~i~~-------------------------sIIg 54 (81) T cd04652 1 LVGENTQVGEKTSI-KRSVIGANCKIGKRVKITNCVIMDNVTIEDGCTLEN-------------------------CIIG 54 (81) T ss_pred CCCCCCEECCCCEE-ECCEECCCCEECCCCEEECCEEECCCEECCCCEEEC-------------------------CEEE T ss_conf 98798899999989-679997999999996896899969879899989974-------------------------9990 Q ss_pred CCCEECCCCEEECCEEECCCCEECCCC Q ss_conf 887988699893783988998997783 Q gi|254780942|r 384 ENAFIGSNSSLIAPITIGQGTYVASGS 410 (442) Q Consensus 384 ~~~~iG~~~~i~~gv~ig~~~~i~ag~ 410 (442) +++.||.++.| .++.||+++.|.+|+ T Consensus 55 ~~~~Ig~~~~l-~~~vig~~~~V~~gt 80 (81) T cd04652 55 NGAVIGEKCKL-KDCLVGSGYRVEAGT 80 (81) T ss_pred CCCEECCCCEE-ECCEECCCCEECCCC T ss_conf 97999999999-373988998989898 No 158 >cd05824 LbH_M1P_guanylylT_C Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.34 E-value=5.1e-13 Score=92.52 Aligned_cols=61 Identities=30% Similarity=0.370 Sum_probs=26.8 Q ss_pred EECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEEC Q ss_conf 00258644320231021124785448745556639889989979808834457754562987887988 Q gi|254780942|r 322 TIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIG 389 (442) Q Consensus 322 ~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG 389 (442) +||++|+||+.|.+++|+|++++.+.+.+|+.+|+||.+|.||+.+-+-+ .++|||++.|+ T Consensus 19 vIG~nv~IG~gv~I~~~vI~~~~~I~~~s~i~~siIG~~~~iG~~~ri~~-------~~viGd~v~I~ 79 (80) T cd05824 19 VIGPNVTIGDGVRLQRCVILSNSTVRDHSWVKSSIVGWNSTVGRWTRLEN-------VTVLGDDVTIK 79 (80) T ss_pred EECCCCEECCCCEEEEEEEECCCEECCCCEEECCEECCCCCCCCCEEECC-------CCEECCCCEEC T ss_conf 89999999999799730895789998986997899857970489403658-------62988986988 No 159 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=99.32 E-value=2.3e-12 Score=88.80 Aligned_cols=89 Identities=27% Similarity=0.377 Sum_probs=62.7 Q ss_pred EEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCC Q ss_conf 79941123317841001223223442000211163134332121112100213443200000258644320231021124 Q gi|254780942|r 262 VFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIK 341 (442) Q Consensus 262 ~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~ 341 (442) +++...+.|++++.|++++.|++++.||++|.|++++.+.+|+|.++|.||+.++++ +|+|+++|+||+. . . T Consensus 256 ~~i~gp~~ig~~~~i~~~~~i~~~~~ig~~~~I~~~~~i~~Sii~~~~~i~~~~~i~-~sIi~~~~~ig~~-----~-~- 327 (358) T COG1208 256 AYIIGPVVIGPGAKIGPGALIGPYTVIGEGVTIGNGVEIKNSIIMDNVVIGHGSYIG-DSIIGENCKIGAS-----L-I- 327 (358) T ss_pred EEEECCEEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEEECCCEECCCCEEC-CEEECCCCEECCC-----C-E- T ss_conf 388589498269789888486898789999888998778754887787988888882-0698188599986-----1-5- Q ss_pred CCCCCCCCCCCCCCEECCCCEECCCEE Q ss_conf 785448745556639889989979808 Q gi|254780942|r 342 EGSKINHLSYVGDSVVGKNVNIGAGTI 368 (442) Q Consensus 342 ~g~~i~~~~~igd~~iG~~~~ig~g~i 368 (442) +||+.+|.+|.+.+|++ T Consensus 328 ----------i~d~~~g~~~~i~~g~~ 344 (358) T COG1208 328 ----------IGDVVIGINSEILPGVV 344 (358) T ss_pred ----------EEEEEECCCEEECCCEE T ss_conf ----------70158636608627618 No 160 >TIGR02665 molyb_mobA molybdopterin-guanine dinucleotide biosynthesis protein A; InterPro: IPR013482 In many molybdopterin-containing enzymes, including nitrate reductase and dimethylsulphoxide reductase, the cofactor is molybdopterin-guanine dinucleotide. This entry describes proteobacterial MobA (molybdopterin-guanine dinucleotide biosynthesis protein A). MobA synthesises molybdopterin-guanine dinucleotide from molybdopterin and GTP. ; GO: 0003824 catalytic activity, 0006777 Mo-molybdopterin cofactor biosynthetic process. Probab=99.32 E-value=1.6e-12 Score=89.75 Aligned_cols=122 Identities=23% Similarity=0.322 Sum_probs=85.7 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEE---CCCCCC Q ss_conf 89994588772327888744125884857999999999779977999957988999997302697289998---599886 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYI---QDCQQG 82 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~---q~~~~G 82 (442) .+||||||+|+||-- ..|.|+++.|+||++|+++.|.-. +.+|+|..+=+.+..+.+. .+..+.-=. ...-.| T Consensus 2 ~gviLAGG~arRMGG-~DKGL~~L~g~PL~~hv~~rL~PQ-v~~~~IsANRn~~~Y~~~g--~Gl~V~~D~~DA~~~F~G 77 (202) T TIGR02665 2 SGVILAGGRARRMGG-RDKGLVELGGKPLIEHVLARLRPQ-VSDLAISANRNPERYAQAG--FGLPVVPDDVDALADFPG 77 (202) T ss_pred CEEEECCCCCCCCCC-CCCCCEECCCCCHHHHHHHHHCCC-HHHHHHHCCCCHHHHHHHC--CCCEECCCCCCCCCCCCC T ss_conf 736765874002688-886610128652899999984350-7666741388977898860--897312785343578888 Q ss_pred HHHHHHHCCCCC-C-CCCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCC Q ss_conf 144664100001-2-344605898515411153322000000135-861001 Q gi|254780942|r 83 TAHAVLTAQDAI-K-PGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAV 131 (442) Q Consensus 83 Ta~Ai~~a~~~l-~-~~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i 131 (442) -=-.|..+++.. . ...+.+|++.||+||++.+.+..|.++... ..++++ T Consensus 78 PLAGilagL~~a~~~~~~~~vl~~PCD~P~lP~dLv~RL~~a~~~~~a~iav 129 (202) T TIGR02665 78 PLAGILAGLRWAVAGTGTDWVLTVPCDTPFLPEDLVARLAAALEAQDADIAV 129 (202) T ss_pred CHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEE T ss_conf 7689999999998446998288822888998878999999997528997899 No 161 >cd03356 LbH_G1P_AT_C_like Left-handed parallel beta-Helix (LbH) domain of a group of proteins with similarity to glucose-1-phosphate adenylyltransferase: Included in this family are glucose-1-phosphate adenylyltransferase, mannose-1-phosphate guanylyltransferase, and the eukaryotic translation initiation factor eIF-2B subunits, epsilon and gamma. Most members of this family contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold, followed by a LbH fold domain with at least 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). eIF-2B epsilon contains an additional domain of unknown function at the C-terminus. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.32 E-value=1.2e-12 Score=90.51 Aligned_cols=77 Identities=22% Similarity=0.310 Sum_probs=52.7 Q ss_pred CCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCE Q ss_conf 00021116313433212111210021344320000025864432023102112478544874555663988998997980 Q gi|254780942|r 288 IENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGT 367 (442) Q Consensus 288 Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~ 367 (442) ||++|.|++++.+.+++||++|+||++++|. +|+|++++.||+++++++|+|++++ .||++|.+..+ T Consensus 2 IG~~~~Ig~~~~I~~svIG~~c~Ig~~~~I~-~Sii~~~~~Ig~~~~i~~sII~~~~-----------~Ig~~~~i~~~- 68 (79) T cd03356 2 IGESTVIGENAIIKNSVIGDNVRIGDGVTIT-NSILMDNVTIGANSVIVDSIIGDNA-----------VIGENVRVVNL- 68 (79) T ss_pred CCCCCEECCCCEEECCEECCCCEECCCCEEE-EEEEECCCEECCCCEEECCEECCCC-----------EECCCCEEECC- T ss_conf 9897999999999599999999999996996-1099797999999999699994989-----------99999999288- Q ss_pred EECCCCCCCCCCCEECCCCEEC Q ss_conf 8834457754562987887988 Q gi|254780942|r 368 ITCNYDGTHKYKTHINENAFIG 389 (442) Q Consensus 368 i~~n~~g~~~~~~~ig~~~~iG 389 (442) +++|+++.|+ T Consensus 69 ------------~viG~~~~I~ 78 (79) T cd03356 69 ------------CIIGDDVVVE 78 (79) T ss_pred ------------CEECCCCEEC T ss_conf ------------3997995988 No 162 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=99.30 E-value=4.6e-12 Score=87.11 Aligned_cols=59 Identities=15% Similarity=0.070 Sum_probs=22.5 Q ss_pred CCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEE-CCCCCCCCEEEEECCCCCCC Q ss_conf 442000211163134332121112100213443200000-25864432023102112478 Q gi|254780942|r 285 GVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTI-EKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 285 ~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i-~~~~~Ig~~~ei~~s~i~~g 343 (442) +++|+.+|+|+++|++++|+|..+.........+..... ..++.||.+|.|++++|+++ T Consensus 324 nSVlg~~v~I~~ga~V~nSIimg~d~~~~~~~~~~~~~~~~~~v~IG~~~~i~~aIidk~ 383 (429) T PRK02862 324 HSVLGVRSRIESDCVLEDTLVMGADFYESSEEREALRKEGKIPVGIGEGTTIKRAIIDKN 383 (429) T ss_pred CCEECCCCEECCCCEEEEEEEECCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCEECCC T ss_conf 678889858989989963189877544433343222114799867998999975898789 No 163 >COG1083 NeuA CMP-N-acetylneuraminic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=99.24 E-value=2.3e-11 Score=83.13 Aligned_cols=213 Identities=20% Similarity=0.273 Sum_probs=136.9 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCC-CEEEEEECCCHHHHHHHHCCCCCEEEEEECCCC- Q ss_conf 740899945887723278887441258848579999999997799-779999579889999973026972899985998- Q gi|254780942|r 3 RKRLAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAGI-ENVALVLGYGAEEITRINFPPTLSVEYYIQDCQ- 80 (442) Q Consensus 3 ~~~~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~-~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~- 80 (442) .+.+|+|+|-|-++|. |+ |-+.+++|+|||.|.|+.+.+++. ++|++-+ +++.|.+.....+.++.+...++- T Consensus 2 ~~~iAiIpAR~gSKgI-~~--KNi~~~~gkpLi~~~I~aA~ns~~fd~VviSs--Ds~~Il~~A~~ygak~~~~Rp~~LA 76 (228) T COG1083 2 MKNIAIIPARGGSKGI-KN--KNIRKFGGKPLIGYTIEAALNSKLFDKVVISS--DSEEILEEAKKYGAKVFLKRPKELA 76 (228) T ss_pred CCEEEEEECCCCCCCC-CC--CCHHHHCCCCHHHHHHHHHHCCCCCCEEEECC--CCHHHHHHHHHHCCCCCCCCCHHHC T ss_conf 6148997626788768-76--33577579615899999986588522699847--9589999998727611024874541 Q ss_pred CC---HHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCCCCCCEEEECCCCCEECC Q ss_conf 86---1446641000012344605898515411153322000000135-8610010010122157440420233310010 Q gi|254780942|r 81 QG---TAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADNPKGYGRLLIKNNEIIAIR 156 (442) Q Consensus 81 ~G---Ta~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~ 156 (442) .+ |-+++..+++.+....+.++.+++-.||.+..++++..+.+.. ..+..+.+.+.+.. -|-....+++.+..+. T Consensus 77 ~D~ast~~~~lh~le~~~~~~~~~~lLq~TsPLl~~~~ik~A~e~f~~~~~~sl~sa~e~e~~-p~k~f~~~~~~~~~~~ 155 (228) T COG1083 77 SDRASTIDAALHALESFNIDEDTLILLQPTSPLLTSLHIKEAFEKFLNNQYDSLFSAVECEHH-PYKAFSLNNGEVKPVN 155 (228) T ss_pred CCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCC-HHHHHHHCCCCEEECC T ss_conf 672167899999998744245706884068500014579999999965887635887532234-6888875078441011 Q ss_pred CCCCCCC-CC--CCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCC Q ss_conf 0146675-42--10003556001112501467775328887400111211489999730771799917214301112100 Q gi|254780942|r 157 EENDATD-EE--RKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYE 233 (442) Q Consensus 157 Ek~d~~~-~~--~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~ 233 (442) |.+|... .| +..-.-|.-+|+++.+.+.+- +.. + . .+...|..+.....++++..| T Consensus 156 ~~~~~~~rrQ~Lpk~Y~~NgaiYi~~~~~l~e~------~~~---------f----~--~~~~~y~m~~~~~~DID~~~D 214 (228) T COG1083 156 EDPDFETRRQDLPKAYRENGAIYINKKDALLEN------DCF---------F----I--PNTILYEMPEDESIDIDTELD 214 (228) T ss_pred CCCCCCCCCCCCHHHHHHCCCEEEEHHHHHHHC------CCE---------E----C--CCCEEEECCCCCCCCCCCHHH T ss_conf 477655421105165653676797527987415------754---------1----5--896689817120005660876 Q ss_pred CCCCCCCCC Q ss_conf 000012222 Q gi|254780942|r 234 LSLIENIWQ 242 (442) Q Consensus 234 L~~~~~~~~ 242 (442) |..++.++. T Consensus 215 l~iae~l~~ 223 (228) T COG1083 215 LEIAENLIF 223 (228) T ss_pred HHHHHHHHH T ss_conf 999999865 No 164 >PRK04928 consensus Probab=99.24 E-value=5.1e-12 Score=86.84 Aligned_cols=71 Identities=23% Similarity=0.310 Sum_probs=55.8 Q ss_pred CCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEE Q ss_conf 34420002111631343321211121002134432000002586443202310211247854487455566398899899 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNI 363 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~i 363 (442) .++.|+++|.|. ++.+++|+||.+|.||++|.|+ +|+|+++|.||+++.+++|+|++++ .||+|+.| T Consensus 310 ~~s~I~~G~~I~-G~~I~~SvIg~~~~Ig~ga~I~-nSII~~~~~Ig~ga~i~~sIidk~v-----------~Ig~g~~I 376 (405) T PRK04928 310 IDSLISGGSYIR-GSRIEKSVLGFRSNIASACDIS-ESVLLGDVKIGEGCVLRRVIIDKDV-----------EIAPGTQI 376 (405) T ss_pred ECCEECCCCEEE-CCEEECCEECCCCEECCCCEEE-CCEEECCCEECCCCEEECCEECCCC-----------EECCCCEE T ss_conf 566887887898-9998768976998999994995-5899692999999999654988999-----------99999999 Q ss_pred CCCE Q ss_conf 7980 Q gi|254780942|r 364 GAGT 367 (442) Q Consensus 364 g~g~ 367 (442) |... T Consensus 377 ~~~~ 380 (405) T PRK04928 377 GENL 380 (405) T ss_pred CCCH T ss_conf 8371 No 165 >TIGR03584 PseF pseudaminic acid CMP-transferase. The sequences in this family include the pfam02348 (cytidyltransferase) domain and are homologous to the NeuA protein responsible for the transfer of CMP to neuraminic acid. According to, this gene is responsible for the transfer of CMP to the structurally related sugar, pseudaminic acid which is observed as a component of sugar modifications of flagellin in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Probab=99.22 E-value=9.8e-11 Score=79.53 Aligned_cols=209 Identities=19% Similarity=0.195 Sum_probs=127.1 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCHHHHHHHHCCCCCEEEEEECCC----C Q ss_conf 89994588772327888744125884857999999999779-977999957988999997302697289998599----8 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAAG-IENVALVLGYGAEEITRINFPPTLSVEYYIQDC----Q 80 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g-~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~----~ 80 (442) +|||+|=|-++|+ |. |-+.+++|+||++|.++.+.+++ +++|+|.| +.+++.+.....+..+.+...++ . T Consensus 1 laiIPaR~gSkri-~~--KN~~~~~g~pLi~~~i~~~~~s~~~d~i~vST--D~~~i~~ia~~~~~~~~~~Rp~~ls~d~ 75 (222) T TIGR03584 1 IAIIPARGGSKRI-PR--KNIKPFCGKPMIAYSIEAALNSGLFDEVVVST--DDEEIAEVAKSYGASVPFMRPAELSDDF 75 (222) T ss_pred CEEECCCCCCCCC-CC--CCHHHHCCCCHHHHHHHHHHHCCCCCEEEECC--CHHHHHHHHHCCCCCCEEECCHHHCCCC T ss_conf 9898437788767-86--01256589285999999998579965388727--8899875552169864251865555886 Q ss_pred CCHHHHHHHCCCCCCC--CCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCC-CCCCCEEEECCCCCEECC Q ss_conf 8614466410000123--44605898515411153322000000135-8610010010122-157440420233310010 Q gi|254780942|r 81 QGTAHAVLTAQDAIKP--GYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADN-PKGYGRLLIKNNEIIAIR 156 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~--~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~d-p~~yGvV~~d~~~v~~i~ 156 (442) -.+.+.+..+.+.+.. ..+.+++++...||.+..++...++.+.+ +.+..+.+.+... |.+ .....+++.+.-+. T Consensus 76 ~~~~~vi~~~l~~~~~~~~~d~i~~l~pTsP~r~~~~I~~ai~~~~~~~~ds~~sv~~~~~~~~~-~~~~~~~~~~~~~~ 154 (222) T TIGR03584 76 TGTAPVVAHAIERLQLQKQPDHACCIYATAPLLQAKILKEAFELLKQPNKHFVFTVTEFAFPIQR-AFSLKENGGVTMFF 154 (222) T ss_pred CHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCCHHH-HEEECCCCCEEECC T ss_conf 21459999999999865899989995489761589999999999972699855065337768589-37986799463347 Q ss_pred CCCCCCCCC-C-CCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCC Q ss_conf 014667542-1-00035560011125014677753288874001112114899997307717999172143011121000 Q gi|254780942|r 157 EENDATDEE-R-KIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYEL 234 (442) Q Consensus 157 Ek~d~~~~~-~-~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L 234 (442) .+......| . ..-..+..+|+++.+.+.+ . +. . -|.+...|..+..+..+++++.|| T Consensus 155 ~~~~~~r~Qdl~~~y~~~G~~y~~~~~~~~~---~-------~~-----~------~~~~~~~~~~~~~~siDID~~~D~ 213 (222) T TIGR03584 155 PEHYNTRSQDLEEAYHDAGQFYWGKSQAWLE---S-------GP-----I------FSPHSIPIILPRHLVQDIDTLEDW 213 (222) T ss_pred HHHHHHHHHHCCCCEEECCEEEEEEHHHHHH---C-------CC-----C------CCCCEEEEEECCCCCCCCCCHHHH T ss_conf 4552024442443375447899988999984---4-------97-----0------289879999899862899999999 Q ss_pred CCCCCCC Q ss_conf 0001222 Q gi|254780942|r 235 SLIENIW 241 (442) Q Consensus 235 ~~~~~~~ 241 (442) ..++.++ T Consensus 214 ~~Ae~l~ 220 (222) T TIGR03584 214 ERAELLY 220 (222) T ss_pred HHHHHHH T ss_conf 9999986 No 166 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=99.18 E-value=1.5e-11 Score=84.24 Aligned_cols=63 Identities=21% Similarity=0.233 Sum_probs=42.5 Q ss_pred EEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCE Q ss_conf 123317841001223223442000211163134332121112100213443200000258644320 Q gi|254780942|r 267 DTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNF 332 (442) Q Consensus 267 ~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~ 332 (442) ++.|+++|.|.+ +.+ .+++||++|+|++++.|++|+|.++|+||.+|+|+ +|+|+++|.||.+ T Consensus 315 ~s~I~~g~~I~g-~~V-~nSvIg~~v~Ig~ga~I~nSII~~~~~Ig~~a~I~-~sIidk~v~Ig~g 377 (409) T PRK00844 315 DSIVSAGSIISG-GSV-RNSVLSPNVRVDSGAIVEGSVLMDGVRIGRGAVVR-RAILDKNVVVPPG 377 (409) T ss_pred CCEECCCEEEEC-CEE-ECCEECCCCEECCCCEEECCEEECCCEECCCCEEE-EEEECCCCEECCC T ss_conf 588879709969-988-88897799899999899518992949999999998-0198999898999 No 167 >cd02518 GT2_SpsF SpsF is a glycosyltrnasferase implicated in the synthesis of the spore coat. Spore coat polysaccharide biosynthesis protein F (spsF) is a glycosyltransferase implicated in the synthesis of the spore coat in a variety of bacteria challenged by stress as starvation. The spsF gene is expressed in the late stage of coat development responsible for a terminal step in coat formation that involves the glycosylation of the coat. SpsF gene mutation resulted in spores that appeared normal. But, the spores tended to aggregate and had abnormal adsorption properties, indicating a surface alteration. Probab=99.16 E-value=1.3e-10 Score=78.80 Aligned_cols=116 Identities=18% Similarity=0.340 Sum_probs=82.9 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCC--HHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 8999458877232788874412588485799999999977-997799995798--8999997302697289998599886 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIAGKPMISHVMETIAAA-GIENVALVLGYG--AEEITRINFPPTLSVEYYIQDCQQG 82 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~-g~~~ivvv~~~~--~e~i~~~~~~~~~~i~~v~q~~~~G 82 (442) +|||.|--.+||| |. |+|++|+|+|||+|++++++++ .+++|++.|... .+.+.++....+ +.+..-.+... T Consensus 1 i~iI~aR~~S~RL-p~--K~L~~i~~~~~i~~~i~r~k~~~~~~~IivaTs~~~~Dd~l~~~~~~~~--i~~frGs~~dv 75 (233) T cd02518 1 VAIIQARMGSTRL-PG--KVLKPLGGKPLLEHLLDRLKRSKLIDEIVIATSTNEEDDPLEALAKKLG--VKVFRGSEEDV 75 (233) T ss_pred CEEEECCCCCCCC-CC--CCHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHCC--EEEEECCCCHH T ss_conf 9899607788378-98--6002309957999999999966898849998178652269999997484--89995783058 Q ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC-CCCC Q ss_conf 1446641000012344605898515411153322000000135-8610 Q gi|254780942|r 83 TAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSI 129 (442) Q Consensus 83 Ta~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~ 129 (442) + +.+..|.+..+ .+.++-+.||.||+++..++.+++.+.+ +.|. T Consensus 76 l-~R~~~a~~~~~--~d~ivri~gD~P~idp~~id~~i~~~~~~~~Dy 120 (233) T cd02518 76 L-GRYYQAAEEYN--ADVVVRITGDCPLIDPEIIDAVIRLFLKSGADY 120 (233) T ss_pred H-HHHHHHHHHCC--CCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCE T ss_conf 8-99999987157--888999768777589899999999985079998 No 168 >PRK03282 consensus Probab=99.15 E-value=3e-11 Score=82.42 Aligned_cols=69 Identities=16% Similarity=0.182 Sum_probs=51.5 Q ss_pred CCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEEC Q ss_conf 44200021116313433212111210021344320000025864432023102112478544874555663988998997 Q gi|254780942|r 285 GVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIG 364 (442) Q Consensus 285 ~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig 364 (442) ++.|+++|+|. ++.+++|+||.+|.||++|.|+ +|+|+++++||.++.+++|+|++++ +||+++.|| T Consensus 313 ~S~i~~G~vI~-g~~V~nSvIg~~v~Ig~ga~I~-~SIIm~~~~Ig~~a~l~~~Iidk~v-----------~Ig~g~~Ig 379 (406) T PRK03282 313 ESIVSAGSIIS-GASVRNSVLSPNVVVDSGAIVE-GSVLMPGVRIGRGAVVRHAILDKNV-----------VVPPGAMVG 379 (406) T ss_pred CCEECCCCEEE-CCEEECCEECCCCEECCCCEEE-CCEEECCCEECCCCEEECCEECCCC-----------EECCCCEEC T ss_conf 77864885985-9988899978998999998997-0889096899999999731989999-----------999898999 Q ss_pred CC Q ss_conf 98 Q gi|254780942|r 365 AG 366 (442) Q Consensus 365 ~g 366 (442) .. T Consensus 380 ~~ 381 (406) T PRK03282 380 VD 381 (406) T ss_pred CC T ss_conf 88 No 169 >PRK10502 putative colanic acid biosynthesis acetyltransferase WcaF; Provisional Probab=99.13 E-value=2.4e-10 Score=77.36 Aligned_cols=111 Identities=19% Similarity=0.254 Sum_probs=63.5 Q ss_pred EEECCCCCEECCCCC--CCCCCCCEEEECCCCCCCC---CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCC Q ss_conf 233178410012232--2344200021116313433---21211121002134432000002586443202310211247 Q gi|254780942|r 268 TIIQPDTVIEPHVFF--GCGVSIENYVQIRAFSYLE---GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKE 342 (442) Q Consensus 268 ~~I~~~~~I~~~~~I--g~~~~Ig~~~~I~~~~~i~---~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~ 342 (442) ++||+++.|.|++.| .-+..||++|.|++++.|. +..||+||.|+++++|..++.--++- ++ -+ T Consensus 53 AkIG~~v~I~p~v~I~~PwnL~IGd~~~Ig~~v~i~~~~~I~IG~~v~Isq~v~l~t~sHd~~~~---~~-----~l--- 121 (179) T PRK10502 53 AKIGKGVVIRPSVRITYPWKLTIGDYAWIGDDVVLYTLGEITIGAHCVISQKSYLCTGSHDYSSP---HF-----DL--- 121 (179) T ss_pred CEECCCCEECCCCEEEECCCEEECCCCEECCCCEEEECCCCEECCCEEECCCCEEECCCCCCCCC---CC-----CC--- T ss_conf 86799988999657950430798896578898189848961798841277971896388997676---66-----73--- Q ss_pred CCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 8544874555663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 343 GSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 343 g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) +....+||++|.||+++++.. ++.||+++.||+++++.. .+.+++++ T Consensus 122 --------~~~pI~Igd~~wIga~a~I~p-------Gv~Ig~gavigA~SvV~k--dv~~~~i~ 168 (179) T PRK10502 122 --------NAAPIVIGEGCWLATDVFVAP-------GVTIGDGAVVGARSSVFK--SLPANTVC 168 (179) T ss_pred --------CCCCCEECCCEEECCCCEEEC-------CCEECCCCEECCCCEEEE--ECCCCCEE T ss_conf --------248858589969999969908-------899999979954998921--77999699 No 170 >cd04651 LbH_G1P_AT_C Glucose-1-phosphate adenylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Glucose-1-phosphate adenylyltransferase is also known as ADP-glucose synthase or ADP-glucose pyrophosphorylase. It catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide accumulation in plants and bacteria. The enzyme is a homotetramer, with each subunit containing an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain with at 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The LbH domain is involved in cooperative allosteric regulation and oligomerization. Probab=99.11 E-value=6.2e-11 Score=80.64 Aligned_cols=59 Identities=17% Similarity=0.182 Sum_probs=40.0 Q ss_pred CCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCC Q ss_conf 3442000211163134332121112100213443200000258644320231021124785 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGS 344 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~ 344 (442) .++.|+++|.|. ++.+++|+||++|.||++|.| .+|+|++++.||.+++++|++|++++ T Consensus 11 ~~S~Ig~Gc~I~-~~~V~~SVIg~~~~Ig~~a~I-~~svi~~~~~Ig~~~~l~~~Ii~~~~ 69 (104) T cd04651 11 KNSLVSEGCIIS-GGTVENSVLFRGVRVGSGSVV-EDSVIMPNVGIGRNAVIRRAIIDKNV 69 (104) T ss_pred EECEECCCCEEE-CCEEECCEECCCCEECCCCEE-ECCEEECCCEECCCCEEEEEEECCCC T ss_conf 849988997995-999980897299799999899-60099655689999999861999999 No 171 >cd00208 LbetaH Left-handed parallel beta-Helix (LbetaH or LbH) domain: The alignment contains 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity, however, some subfamilies in this hierarchy also show activities related to ion transport or translation initiation. Many are trimeric in their active forms. Probab=99.09 E-value=2.7e-10 Score=77.06 Aligned_cols=34 Identities=32% Similarity=0.635 Sum_probs=24.8 Q ss_pred CCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEE Q ss_conf 63988998997980883445775456298788798869989 Q gi|254780942|r 354 DSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSL 394 (442) Q Consensus 354 d~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i 394 (442) +.+||++|.||+++++.- ++.|||||+||+++++ T Consensus 44 ~~~Ig~~v~Ig~~a~I~~-------gv~IGdn~~IgagavV 77 (78) T cd00208 44 PTIIGDNVEIGANAVIHG-------GVKIGDNAVIGAGAVV 77 (78) T ss_pred CCEECCCEEECCCCEEEC-------CEEECCCCEECCCCEE T ss_conf 989999969986898979-------8199999999988569 No 172 >cd04651 LbH_G1P_AT_C Glucose-1-phosphate adenylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Glucose-1-phosphate adenylyltransferase is also known as ADP-glucose synthase or ADP-glucose pyrophosphorylase. It catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide accumulation in plants and bacteria. The enzyme is a homotetramer, with each subunit containing an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain with at 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The LbH domain is involved in cooperative allosteric regulation and oligomerization. Probab=99.08 E-value=1.2e-10 Score=79.07 Aligned_cols=72 Identities=19% Similarity=0.270 Sum_probs=44.2 Q ss_pred ECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCC Q ss_conf 11631343321211121002134432000002586443202310211247854487455566398899899798 Q gi|254780942|r 293 QIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAG 366 (442) Q Consensus 293 ~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g 366 (442) .+++.+.+++|.|+++|.|. .+.+ .+|+|+.+|+||.++.+++|+|++++.|+..+.+-+|+|+++|.|++| T Consensus 3 ~~~~~~~v~~S~Ig~Gc~I~-~~~V-~~SVIg~~~~Ig~~a~I~~svi~~~~~Ig~~~~l~~~Ii~~~~~I~~g 74 (104) T cd04651 3 YIGRRGEVKNSLVSEGCIIS-GGTV-ENSVLFRGVRVGSGSVVEDSVIMPNVGIGRNAVIRRAIIDKNVVIPDG 74 (104) T ss_pred EECCCCEEEECEECCCCEEE-CCEE-ECCEECCCCEECCCCEEECCEEECCCEECCCCEEEEEEECCCCEECCC T ss_conf 88899489849988997995-9999-808972997999998996009965568999999986199999999989 No 173 >PTZ00339 UDP-N-acetylglucosamine pyrophosphorylase; Provisional Probab=99.08 E-value=6.4e-09 Score=69.21 Aligned_cols=182 Identities=15% Similarity=0.168 Sum_probs=99.1 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEEC---CEEHHHHHHHHHHHC-------------CCCEEEEEECC-CHHHHHHHHC Q ss_conf 408999458877232788874412588---485799999999977-------------99779999579-8899999730 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIA---GKPMISHVMETIAAA-------------GIENVALVLGY-GAEEITRINF 66 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~---gkpli~~~i~~l~~~-------------g~~~ivvv~~~-~~e~i~~~~~ 66 (442) +..+++||||+||||...-||-|+||+ +|+|.+...+.+.+. ..-.++|.|+. ..+..++++. T Consensus 109 kVavlllAGGqGTRLG~~~PKG~~~igl~s~ksLfql~aeri~~lq~la~~~~~~~~~~~ip~yIMTS~~t~~~T~~ff~ 188 (499) T PTZ00339 109 EVAVLILAGGLGTRLGSDKPKGLLECTPLKKKSLFQFHCEKIRRLEEMAAAASGGGDDPTIYILVLTSSFNHDQTRQFLQ 188 (499) T ss_pred CEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHHHHHH T ss_conf 88999950787466799999711115899987199999999999999988750678888603999778504699999998 Q ss_pred CCC------CEEEEEECC----------------------CCCCHHHHHHHCCCC-------CCC----CCCEEEEEECC Q ss_conf 269------728999859----------------------988614466410000-------123----44605898515 Q gi|254780942|r 67 PPT------LSVEYYIQD----------------------CQQGTAHAVLTAQDA-------IKP----GYDDVIIMYGD 107 (442) Q Consensus 67 ~~~------~~i~~v~q~----------------------~~~GTa~Ai~~a~~~-------l~~----~~~~~lVl~GD 107 (442) ... .+|.|..|. .|-|.||- +.++.. |++ +-..+-+.+=| T Consensus 189 ~~~~FGl~~~~V~fF~Q~~lP~~d~~~Gkille~~~~i~~aPdGNGG~-y~AL~~sg~~~~iL~dm~~rGI~yi~~~~VD 267 (499) T PTZ00339 189 ENNFFGLDKEQVIFFKQSSLPCYDENTGRVLMESRGSLCTAPGGNGDV-FKALAKCSEFMSILDKLEKLGIKYVQIISID 267 (499) T ss_pred HCCCCCCCHHHEEEEECCCEEEEECCCCEEEECCCCCEEECCCCCHHH-HHHHHHCCCHHHHHHHHHHCCCEEEEEEEEC T ss_conf 677778875768999738733685479968755888068778996389-9998862431127999997498699999865 Q ss_pred CCCCCCCCCCCCCC-CCCCCCCCCCEEEECCCC-CCCCEEEECCC--CCEECCC-------CCCCCCCCCCCCCCCCCHH Q ss_conf 41115332200000-013586100100101221-57440420233--3100100-------1466754210003556001 Q gi|254780942|r 108 VPLVSSHTLKKAMD-KIAQGYSIAVVGFNADNP-KGYGRLLIKNN--EIIAIRE-------ENDATDEERKIHYCNSGLM 176 (442) Q Consensus 108 ~~li~~~~l~~l~~-~~~~~~~~~i~~~~~~dp-~~yGvV~~d~~--~v~~i~E-------k~d~~~~~~~~~lin~GiY 176 (442) ++|....|- .|+- +..++.+++.=..+..+| ...|++-..++ .|++.-| .+|.......-+.-|.... T Consensus 268 N~L~kvaDP-~FiG~~~~~~~~~~~Kvv~K~~p~E~VGvv~~~~g~~~VvEYSEi~~~~ae~~~~~~~l~~f~agnI~~H 346 (499) T PTZ00339 268 NILAKVADP-EFIGLASSFPAHVVLNKCPKRSDDESVGVFCLKDDEWQVVEYTEINERILENKDATTNLKQFNCGNICSH 346 (499) T ss_pred CCCCCCCCC-HHHHHHHHCCCCHHCEEEECCCCCCCCCEEEEECCEEEEEEECCCCHHHHHCCCCCCCEEECCCCCHHHE T ss_conf 333246780-6634877648610002787268987654389888926899703668777627788776010243443213 Q ss_pred EECCHHHHHHH Q ss_conf 11250146777 Q gi|254780942|r 177 AIDGLYIMDWL 187 (442) Q Consensus 177 ~f~~~~L~~~l 187 (442) +|+.++|.+.. T Consensus 347 ~fs~~FL~~v~ 357 (499) T PTZ00339 347 IFSLDFLKKVA 357 (499) T ss_pred EECHHHHHHHH T ss_conf 63199999887 No 174 >cd04193 UDPGlcNAc_PPase UDPGlcNAc pyrophosphorylase catalayzes the synthesis of UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc) pyrophosphorylase (UAP) (also named GlcNAc1P uridyltransferase), catalyzes the reversible conversion of UTP and GlcNAc1 to PPi and UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc), the activated form of GlcNAc, is a key precursor of N- and O-linked glycosylations. It is essential for the synthesis of chitin (a major component of the fungal cell wall) and of the glycosylphosphatidylinositol (GPI) linker which anchors a variety of cell surface proteins to the plasma membrane. In bacteria, UDPGlcNAc represents an essential precursor for both peptidoglycan and lipopolysaccharide biosynthesis. Human UAP has two isoforms, resulting from alternative splicing of a single gene and differing by the presence or absence of 17 amino acids. UDPGlcNAc pyrophosphorylase shares significant sequence and structure conservation with UDPglucose pyrophosphorylase. Probab=99.07 E-value=1.4e-08 Score=67.31 Aligned_cols=183 Identities=17% Similarity=0.173 Sum_probs=101.5 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEEC---CEEHHHHHHHHHHHC----C-------CCEEEEEECCC-HHHHHHHHCCC Q ss_conf 408999458877232788874412588---485799999999977----9-------97799995798-89999973026 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIA---GKPMISHVMETIAAA----G-------IENVALVLGYG-AEEITRINFPP 68 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~---gkpli~~~i~~l~~~----g-------~~~ivvv~~~~-~e~i~~~~~~~ 68 (442) +..+++||||.||||.-..||-|+||. ++++++...+.+... + .-.+++.++.. .++.++++... T Consensus 15 kvavv~laGG~GTrlG~~~pKg~~~i~~~~~~sl~~l~~e~i~~l~~~~~~~~~~~~~ip~~iMtS~~T~~~T~~~l~~~ 94 (323) T cd04193 15 KVAVLLLAGGQGTRLGFDGPKGMFPVGLPSKKSLFQLQAERILKLQELAGEASGKKVPIPWYIMTSEATHEETRKFFKEN 94 (323) T ss_pred CEEEEEECCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHC T ss_conf 98999957887455689997162572688998099999999999999987743888851699978853458999999867 Q ss_pred ------CCEEEEEECC---------------------CCCCHHHHH---HHC--CCCC-CCCCCEEEEEECCCCCCCCCC Q ss_conf ------9728999859---------------------988614466---410--0001-234460589851541115332 Q gi|254780942|r 69 ------TLSVEYYIQD---------------------CQQGTAHAV---LTA--QDAI-KPGYDDVIIMYGDVPLVSSHT 115 (442) Q Consensus 69 ------~~~i~~v~q~---------------------~~~GTa~Ai---~~a--~~~l-~~~~~~~lVl~GD~~li~~~~ 115 (442) ..+|.+..|. .|-|.||-. +.. ++.+ ..+...+.+.+.|++|....| T Consensus 95 ~~fGl~~~~i~~f~Q~~~P~~~~~g~~~l~~~~~~~~~P~GhG~i~~aL~~sG~l~~l~~~Gi~yi~v~~vDN~L~~~~D 174 (323) T cd04193 95 NYFGLDPEQVHFFQQGMLPCVDFDGKILLEEKGKIAMAPNGNGGLYKALQTAGILEDMKKRGIKYIHVYSVDNILVKVAD 174 (323) T ss_pred CCCCCCHHHEEEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHCCCEEEEEEECCCCCCCCCC T ss_conf 64688855647871687314868987156899751427999717999999878488999669889999965744323248 Q ss_pred CCCCCC-CCCCCCCCCCEEEECCCC-CCCCEEEECCCCCEECCCCCCCCCC---------CCCCCCCCCCHHEECCHHHH Q ss_conf 200000-013586100100101221-5744042023331001001466754---------21000355600111250146 Q gi|254780942|r 116 LKKAMD-KIAQGYSIAVVGFNADNP-KGYGRLLIKNNEIIAIREENDATDE---------ERKIHYCNSGLMAIDGLYIM 184 (442) Q Consensus 116 l~~l~~-~~~~~~~~~i~~~~~~dp-~~yGvV~~d~~~v~~i~Ek~d~~~~---------~~~~~lin~GiY~f~~~~L~ 184 (442) .. ++- +..++.+.+.-+.+..+| ...|++...+++. .++|--+.+++ ...-+.-|..++.|+-++|. T Consensus 175 p~-~lG~~~~~~~~~~~kvv~K~~~~ekvG~l~~~~gk~-~vvEYsel~~~~~~~~~~~g~l~f~~gNi~~~~fsl~fl~ 252 (323) T cd04193 175 PV-FIGFCISKGADVGAKVVRKRYPTEKVGVVVLVDGKP-QVVEYSEISDELAEKRDADGELQYNAGNIANHFFSLDFLE 252 (323) T ss_pred HH-HHHHHHHCCCCEEEEEEECCCCCCCEEEEEEECCEE-EEEEECCCCHHHHHHHCCCCCEEECCCCHHHHHEEHHHHH T ss_conf 89-999998749963789988888998310899989906-9998425898898401876616514556777531399999 Q ss_pred HHHH Q ss_conf 7775 Q gi|254780942|r 185 DWLL 188 (442) Q Consensus 185 ~~l~ 188 (442) +.++ T Consensus 253 ~~~~ 256 (323) T cd04193 253 KAAE 256 (323) T ss_pred HHHH T ss_conf 9987 No 175 >KOG1462 consensus Probab=99.07 E-value=7.9e-11 Score=80.06 Aligned_cols=92 Identities=25% Similarity=0.409 Sum_probs=61.0 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCE Q ss_conf 21211121002134432000002586443202310211247854487455566398899899798088344577545629 Q gi|254780942|r 302 GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTH 381 (442) Q Consensus 302 ~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ 381 (442) ++.|+++|.||.++.|. .|+||.+|.||..+.+.||++|+|+. ||++|+|. .++ T Consensus 334 d~iv~~~t~i~~~s~ik-~SviG~nC~Ig~~~~v~nSilm~nV~-----------vg~G~~Ie--------------nsI 387 (433) T KOG1462 334 DSIVGDNTQIGENSNIK-RSVIGSNCDIGERVKVANSILMDNVV-----------VGDGVNIE--------------NSI 387 (433) T ss_pred HHCCCCCCEECCCCEEE-EEEECCCCCCCCCCEEEEEEEECCCE-----------ECCCCCEE--------------CCE T ss_conf 31017885525554022-00125776116872777117665837-----------55884020--------------426 Q ss_pred ECCCCEECCCCEEECCEEECCCCEECCCCEEC-CCCCCCCEEEECC Q ss_conf 87887988699893783988998997783576-6436986898135 Q gi|254780942|r 382 INENAFIGSNSSLIAPITIGQGTYVASGSIIT-QDTPENSLVFARS 426 (442) Q Consensus 382 ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~-~dv~~~~~v~~~~ 426 (442) ||+++.||+++.+ .+|+||+|-+|. +..--++..++.. T Consensus 388 Ig~gA~Ig~gs~L-------~nC~Ig~~yvVeak~~~~~ev~~~~~ 426 (433) T KOG1462 388 IGMGAQIGSGSKL-------KNCIIGPGYVVEAKGKHGGEVLVSNE 426 (433) T ss_pred ECCCCEECCCCEE-------EEEEECCCCEECCCCCCCCCEEECCC T ss_conf 6266565479703-------33485598277665522563761241 No 176 >PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional Probab=99.06 E-value=1.2e-09 Score=73.38 Aligned_cols=126 Identities=15% Similarity=0.208 Sum_probs=66.1 Q ss_pred CCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCC---CCCCCCCCCCCCCCCCCCCCEEE--CCCCCCCCEEE Q ss_conf 52799411233178410012232234420002111631343---32121112100213443200000--25864432023 Q gi|254780942|r 260 ETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYL---EGVHIGKKTIIGPFARIRQETTI--EKNVRIGNFCE 334 (442) Q Consensus 260 ~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i---~~~~Ig~~~~Ig~~~~i~~~s~i--~~~~~Ig~~~e 334 (442) ..+.++.+..+++++.+...- .+...||++|.|+.++.| ++..||+||.|+++++|.+...- .......+ T Consensus 42 ~~I~~g~~~~~g~~~rl~~~~--~gkl~IGdNv~Ig~~~~I~~~~~I~IG~nVlIa~~V~I~d~nH~~~~~~~~~~~--- 116 (192) T PRK09677 42 GSINFGEGFTSGVGLRLDAFG--RGKLFFGDNVQVNDYVHIACIESITIGRDTLIASKVFITDHNHGSFKHSDDFSS--- 116 (192) T ss_pred CCEEECCCEEECCCEEEEEEC--CCEEEECCCCEECCCCEEEECCCEEECCCCEECCCEEEECCCCCCCCCCCCCCC--- T ss_conf 656878961966877999706--750899998489998589867767989986887988996788876555454345--- Q ss_pred EEC-CCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 102-112478544874555663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 335 VKK-ATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 335 i~~-s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) .+ .....+.+ .+..+||++|.||+++++.. ++.||++|.+|+++++... +.+++++ T Consensus 117 -~~~~~~~~~~~------~~pv~Ig~~~wIg~~~~Il~-------Gv~Ig~~~vvgA~SvV~~d--vp~~~i~ 173 (192) T PRK09677 117 -PNLPPDMRTLE------SSAVVIGQRVWIGENVTILP-------GVSIGNGCIVGANSVVTKS--IPENTVI 173 (192) T ss_pred -CCCCCCCCCCC------CCCEEECCCCEECCCCEEEC-------CCEECCCCEECCCCEECCC--CCCCEEE T ss_conf -67682126677------59989899989999999919-------9699999899169999858--5999199 No 177 >cd03354 LbH_SAT Serine acetyltransferase (SAT): SAT catalyzes the CoA-dependent acetylation of the side chain hydroxyl group of L-serine to form O-acetylserine, as the first step of a two-step biosynthetic pathway in bacteria and plants leading to the formation of L-cysteine. This reaction represents a key metabolic point of regulation for the cysteine biosynthetic pathway due to its feedback inhibition by cysteine. The enzyme is a 175 kDa homohexamer, composed of a dimer of homotrimers. Each subunit contains an N-terminal alpha helical region and a C-terminal left-handed beta-helix (LbH) subdomain with 5 turns, each containing a hexapeptide repeat motif characteristic of the acyltransferase superfamily of enzymes. The trimer interface mainly involves the C-terminal LbH subdomain while the dimer (of trimers) interface is mediated by the N-terminal alpha helical subdomain. Probab=99.06 E-value=3.3e-10 Score=76.54 Aligned_cols=93 Identities=27% Similarity=0.352 Sum_probs=50.8 Q ss_pred CCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCC Q ss_conf 22344200021116313433212111210021344320000025864432023102112478544874555663988998 Q gi|254780942|r 282 FGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNV 361 (442) Q Consensus 282 Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~ 361 (442) |++++.||++++|..++ +++|++++.||+++.|.+++.|+.+..+. .+ +...||++| T Consensus 5 I~~~~~IG~~~~I~~~~---~v~Ig~~~~Ig~~~~I~~~~~ig~~~~~~-----------~~---------~~~~Ig~~v 61 (101) T cd03354 5 IHPGAKIGPGLFIDHGT---GIVIGETAVIGDNCTIYQGVTLGGKGKGG-----------GK---------RHPTIGDNV 61 (101) T ss_pred ECCCCEECCCEEECCCC---CEEECCCCEECCCCEECCCCEECCCCCCC-----------CC---------CCCCCCCCE T ss_conf 89999999492991998---77999998999998998887886631146-----------87---------776148758 Q ss_pred EECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 997980883445775456298788798869989378398899899 Q gi|254780942|r 362 NIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 362 ~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) .||+++++.. ++.||+++.||+++++... +.+++++ T Consensus 62 ~IG~~~~I~~-------gv~IG~~~vIgagsvV~~d--v~~~si~ 97 (101) T cd03354 62 VIGAGAKILG-------NITIGDNVKIGANAVVTKD--VPANSTV 97 (101) T ss_pred EECCCCEEEC-------CCEECCCCEECCCCEECCC--CCCCCEE T ss_conf 6358988929-------8799998799879889968--5999899 No 178 >TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase; InterPro: IPR005664 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . 2,3,4,5-tetrahydropyridine-2-carboxylate N-succinyltransferase (also known as tetrahydrodipicolinate N-succinyltransferase or DapD) is part of the succinyl route of of lysine/DAP biosynthesis. The DapD protein is a homotrimer is a trimeric enzyme with each monomer composed of three domain: an N-terminal helical domain, a distinctive left-handed parallel beta-helix (LBH) domain, and a predominantly beta C-terminal domain , . The LBH structure is encoded by an imperfect tandem-repeated hexapeptide sequence. Each trimer contains three independent active sites, always occuring at the boundary of two subunits, and formed by residues from one N-terminal domain, one C-terminal domain and two adjacent LBH domains.; GO: 0016757 transferase activity transferring glycosyl groups, 0009089 lysine biosynthetic process via diaminopimelate. Probab=99.05 E-value=6e-10 Score=75.04 Aligned_cols=120 Identities=23% Similarity=0.356 Sum_probs=85.3 Q ss_pred CCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 78410012232234420002111631343321211121002134432000002586443202310211247854487455 Q gi|254780942|r 272 PDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSY 351 (442) Q Consensus 272 ~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~ 351 (442) ..-..-|++.+..++.|..|+++.|.-+=-|++|.++++|..||.+.+.+.||+|||+.-.+ T Consensus 102 ~g~R~VPgA~vR~G~fiAkNvVlMPSyvNIGAYvDEGtMvDTWATvGSCAQIGKNvHlSGG~------------------ 163 (275) T TIGR00965 102 AGFRKVPGAIVRRGTFIAKNVVLMPSYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSGGV------------------ 163 (275) T ss_pred CCCCCCCHHHEECCCEEEEEEEECCCEEECCCEEECCCEECCHHHHCCHHHHCCCEEEECCC------------------ T ss_conf 66654874360057357752377555210136751686010012230032206625751783------------------ Q ss_pred CCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC---------------CEECCCC Q ss_conf 5663988998997980883445775456298788798869989378398899899778---------------3576643 Q gi|254780942|r 352 VGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG---------------SIITQDT 416 (442) Q Consensus 352 igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag---------------~~v~~dv 416 (442) -|| |+ +....-.+++|+||||||+.+-|.-||-+.++++|..| -+....| T Consensus 164 ----------GIG-GV----LEPLQA~P~IIEDNCFiGARSEi~EGVIVEEGSVISMGVfIG~STkI~~R~TGEI~~GRv 228 (275) T TIGR00965 164 ----------GIG-GV----LEPLQAKPTIIEDNCFIGARSEIVEGVIVEEGSVISMGVFIGQSTKIVDRDTGEIIYGRV 228 (275) T ss_pred ----------CCC-CC----CCCCCCCCEEEECCCEECCCEEEEEEEEECCCCEEEEEEEECCCCEEEECCCCEEEECCC T ss_conf ----------316-52----463357875882476004611356207971685888406855601688536671765513 Q ss_pred CCCCEEEE Q ss_conf 69868981 Q gi|254780942|r 417 PENSLVFA 424 (442) Q Consensus 417 ~~~~~v~~ 424 (442) |++|+|+. T Consensus 229 PA~SVVVs 236 (275) T TIGR00965 229 PAGSVVVS 236 (275) T ss_pred CCEEEEEC T ss_conf 56027971 No 179 >PRK09527 lacA galactoside O-acetyltransferase; Reviewed Probab=99.05 E-value=4.5e-10 Score=75.75 Aligned_cols=111 Identities=22% Similarity=0.297 Sum_probs=63.9 Q ss_pred CCCCCEECC--CCCCCCCCCCEEEECCCCCCCCC---CCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCC Q ss_conf 178410012--23223442000211163134332---1211121002134432000002586443202310211247854 Q gi|254780942|r 271 QPDTVIEPH--VFFGCGVSIENYVQIRAFSYLEG---VHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSK 345 (442) Q Consensus 271 ~~~~~I~~~--~~Ig~~~~Ig~~~~I~~~~~i~~---~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~ 345 (442) ++++.|+|+ |--|.|..||+++.|+.+|++-+ ..||++|.|||++.|. +.-.+ +. .+.++. T Consensus 59 g~~~~IepPF~cdYG~NI~iG~~~fiN~n~~ilD~~~I~IGd~v~iGPnV~i~--t~~Hp---~~--~~~R~~------- 124 (203) T PRK09527 59 GENAWVEPPVYFSYGSNIHIGRNFYANFNLTIVDDYTVTIGDNVLIAPNVTLS--VTGHP---VH--HELRKN------- 124 (203) T ss_pred CCCCEEECCEEEECCCCEEECCCEEECCCCEEEECCCEEECCCEEECCCCEEE--CCCCC---CC--HHHHHC------- T ss_conf 99978969978725479697798798688389957637998972886997893--69898---89--889733------- Q ss_pred CCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 4874555663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 346 INHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 346 i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) ...|-....||++|.||+++++.. ++.||+++.||+++++.. .|.+++++ T Consensus 125 --~~~~~~pi~Ig~~vwIG~~~~I~p-------Gv~IG~~~vigAgsvVtk--dip~~~v~ 174 (203) T PRK09527 125 --GEMYSFPITIGNNVWIGSHVVINP-------GVTIGDNSVIGAGSVVTK--DIPPNVVA 174 (203) T ss_pred --CCCCCCCEEECCEEEECCCCEECC-------CCEECCCCEECCCCEECC--CCCCCCEE T ss_conf --876688859998689899999909-------929999979914988853--06998299 No 180 >cd04649 LbH_THP_succinylT_putative Putative 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (THP succinyltransferase), C-terminal left-handed parallel alpha-helix (LbH) domain: This group is composed of mostly uncharacterized proteins containing an N-terminal domain of unknown function and a C-terminal LbH domain with similarity to THP succinyltransferase LbH. THP succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is trimeric and displays the left-handed parallel alpha-helix (LbH) structural motif encoded by the hexapeptide repeat motif. Probab=99.04 E-value=6.2e-10 Score=74.96 Aligned_cols=61 Identities=26% Similarity=0.494 Sum_probs=47.4 Q ss_pred CEECCCCEECCCEEEC-CCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCCCC Q ss_conf 3988998997980883-44577545629878879886998937839889989977835766436 Q gi|254780942|r 355 SVVGKNVNIGAGTITC-NYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQDTP 417 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~-n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~dv~ 417 (442) ++||+||.||.|..+. -+.+-.+.++.||++|+||+|+.+ |+.+||+|+|.+|-.++.+.+ T Consensus 48 v~VG~~~diggGasi~G~Ls~g~~~pi~Ig~ncliGAnsgi--Gi~lGd~cvv~aGlyi~~gTk 109 (147) T cd04649 48 VIVGKGSDVGGGASIMGTLSGGGNNVISIGKRCLLGANSGI--GISLGDNCIVEAGLYVTAGTK 109 (147) T ss_pred EEECCCCCCCCCCEEEEECCCCCCCCEEECCCEEECCCCCC--CEEECCCCEEECCEEECCCCE T ss_conf 69899833278617878716788235698875076146545--848689868824459839978 No 181 >PRK01884 consensus Probab=99.04 E-value=1.6e-10 Score=78.28 Aligned_cols=33 Identities=21% Similarity=0.365 Sum_probs=13.7 Q ss_pred ECCCCCCCCCCCCCCCCCCCEECCCCEECCCEE Q ss_conf 021124785448745556639889989979808 Q gi|254780942|r 336 KKATIKEGSKINHLSYVGDSVVGKNVNIGAGTI 368 (442) Q Consensus 336 ~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i 368 (442) +||+|+++++|+..+.|-||+|.++|.||+|+. T Consensus 344 ~nSVL~r~v~I~~gA~V~nSIIm~~~~IG~ga~ 376 (435) T PRK01884 344 SNSVLFDRIKVDEFSKIDHCVVLPQVKIGKNCV 376 (435) T ss_pred ECCEEECCCEECCCCEEEECEEECCCEECCCCE T ss_conf 719820896999999998288808698999999 No 182 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.98 E-value=3.9e-10 Score=76.14 Aligned_cols=62 Identities=15% Similarity=0.226 Sum_probs=45.2 Q ss_pred CCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCC Q ss_conf 4420002111631343321211121002134432000002586443202310211247854487 Q gi|254780942|r 285 GVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINH 348 (442) Q Consensus 285 ~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~ 348 (442) ++.|+++|+|. ++.+++|+|+.+|+||+.|.|+ +|+|+++|.||.++.++|++|+++++|+. T Consensus 331 nSli~~GcvI~-G~~VenSVI~r~v~Ig~gA~V~-nSIIm~~~~IG~ga~l~naIiDk~v~I~~ 392 (431) T PRK00725 331 NSLVSGGCIIS-GSVVVQSVLFTRVRVNSFSNID-SAVLLPEVWVGRSCRLRRCVIDRACVIPE 392 (431) T ss_pred CCEEECCEEEE-CCEEEECCCCCCCEECCCCEEE-EEEECCCCEECCCCEEEEEEECCCCEECC T ss_conf 67983998996-9999829825998999998996-13988989889999997139899989898 No 183 >cd03357 LbH_MAT_GAT Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively at the C6 position of the nonreducing end glucosyl moiety. GAT specifically acetylates galactopyranosides. Furthermore, MAT shows higher affinity toward artificial substrates containing an alkyl or hydrophobic chain as well as a glucosyl unit. Active MAT and GAT are homotrimers, with each subunit consisting of an N-terminal alpha-helical region and a C-terminal left-handed parallel alpha-helix (LbH) subdomain with 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=98.98 E-value=1.9e-09 Score=72.17 Aligned_cols=99 Identities=26% Similarity=0.391 Sum_probs=58.2 Q ss_pred CCCCCCCEEEECCCCCCCC---CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECC Q ss_conf 2344200021116313433---2121112100213443200000258644320231021124785448745556639889 Q gi|254780942|r 283 GCGVSIENYVQIRAFSYLE---GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGK 359 (442) Q Consensus 283 g~~~~Ig~~~~I~~~~~i~---~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~ 359 (442) |.+..||+++.|+.+|++. ...||++|.|||++.|...+. + +. .+-++ .+ ..+-....||+ T Consensus 60 G~ni~iG~~~~in~n~~ild~~~I~IG~~v~igp~v~i~t~~H--~---~d--~~~r~----~~-----~~~~~PI~Igd 123 (169) T cd03357 60 GYNIHIGDNFYANFNCTILDVAPVTIGDNVLIGPNVQIYTAGH--P---LD--PEERN----RG-----LEYAKPITIGD 123 (169) T ss_pred CCCEEECCEEEECCCCEEEECCCEEECCCEEECCCCEEECCCC--C---CC--HHHHH----CC-----CCCCCCEEECC T ss_conf 3677988844887884897068879999807879858976899--9---99--78963----46-----65245569897 Q ss_pred CCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 98997980883445775456298788798869989378398899899 Q gi|254780942|r 360 NVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 360 ~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) +|.||++++++- +++||+++.||+++++... |.+++++ T Consensus 124 ~vWIG~~~~Ilp-------GVtIG~~~vIgAGsVVtkd--ip~~~v~ 161 (169) T cd03357 124 NVWIGGGVIILP-------GVTIGDNSVIGAGSVVTKD--IPANVVA 161 (169) T ss_pred CCEECCCCEEEC-------CCEECCCCEECCCCEECCC--CCCCEEE T ss_conf 436289978909-------9799999999889879746--5999199 No 184 >PRK10092 maltose O-acetyltransferase; Provisional Probab=98.97 E-value=1.4e-09 Score=72.96 Aligned_cols=100 Identities=18% Similarity=0.275 Sum_probs=59.1 Q ss_pred CCCCCCCEEEECCCCCCCC---CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECC Q ss_conf 2344200021116313433---2121112100213443200000258644320231021124785448745556639889 Q gi|254780942|r 283 GCGVSIENYVQIRAFSYLE---GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGK 359 (442) Q Consensus 283 g~~~~Ig~~~~I~~~~~i~---~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~ 359 (442) |.|..||+++.|+.+|++. ...||++|.|||++.|.... ++ +. .+-+++ + ..|-....||+ T Consensus 71 G~ni~iG~~~~iN~~~~ilD~~~I~IG~~v~iGpnv~i~t~~--Hp---~d--~~~R~~----~-----~~~~~pi~Ig~ 134 (183) T PRK10092 71 GYNIFLGNNFYANFDCVMLDVCPIHIGDNCMLAPGVHIYTAT--HP---LD--PVARNS----G-----AELGKPVTIGN 134 (183) T ss_pred CCCEEECCCEEEECCEEEEECCCEEECCCEEECCCCEEECCC--CC---CC--HHHHHC----C-----CEECCCEEECC T ss_conf 778786787699278499947873999980584997898599--98---89--889525----8-----63168779899 Q ss_pred CCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEEC Q ss_conf 989979808834457754562987887988699893783988998997 Q gi|254780942|r 360 NVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVA 407 (442) Q Consensus 360 ~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ 407 (442) +|.||++++++- ++.||+++.||+++++... |.++++++ T Consensus 135 ~vwig~~~~i~p-------Gv~IG~~~vigagsvV~~d--ip~~~v~~ 173 (183) T PRK10092 135 NVWIGGRAVINP-------GVTIGDNVVVASGAVVTKD--VPDNVVVG 173 (183) T ss_pred CCEECCCCEEEC-------CCEECCCCEECCCCEEEEE--CCCCCEEE T ss_conf 868889989907-------9899999899759889227--79992999 No 185 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=98.97 E-value=4.6e-10 Score=75.71 Aligned_cols=58 Identities=17% Similarity=0.260 Sum_probs=43.6 Q ss_pred CCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCC------------------CCCCCEEEEECCCCCCCC Q ss_conf 442000211163134332121112100213443200000258------------------644320231021124785 Q gi|254780942|r 285 GVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKN------------------VRIGNFCEVKKATIKEGS 344 (442) Q Consensus 285 ~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~------------------~~Ig~~~ei~~s~i~~g~ 344 (442) ++.|++||+|.. |.+++|++|.++.|+.+|.| ++|||+++ +.||.+|+|+|+||++++ T Consensus 336 ~Slv~~Gc~i~g-~~v~~SvL~~~v~i~~~~~v-~dsVim~~Dfy~~~~E~~~l~~G~~a~~IG~~~~i~~~IiD~~~ 411 (421) T TIGR02091 336 DSLVAEGCIISG-ATVSHSVLGSRVRIGSGSTV-EDSVIMGDDFYENESEIDALKDGKVAVGIGRGAVIRNAIIDKNV 411 (421) T ss_pred EEEEECCCEECC-CEEEEEEEECCCEECCCCEE-EEEEECCCCCCCCHHHHHHHHCCCCCEEECCCCEEEEEECCCCC T ss_conf 407756718807-67872285268685465579-88688798654785688888649752231788888663805877 No 186 >PRK03701 consensus Probab=98.93 E-value=7.2e-10 Score=74.61 Aligned_cols=70 Identities=14% Similarity=0.247 Sum_probs=42.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEE Q ss_conf 343321211121002134432000002586443202310211247854487455566398899899798088 Q gi|254780942|r 298 SYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTIT 369 (442) Q Consensus 298 ~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~ 369 (442) +.+.+|.|+.+|.|. .+.+ .+|+|+.+|+||.+++|+||+|+.+++|+..+.+.+|+|+++|.|++|+++ T Consensus 327 ~~v~nSlia~GciI~-G~~V-~nSVl~r~v~I~~gA~V~nSIIm~~~~Ig~ga~l~~~IiDk~v~I~~g~vI 396 (431) T PRK03701 327 GMTLNSLVSGGCIIS-GSVV-VQSVLFPRVRVNSFCNIDSAVLLPEVWVGRSCRLRRCVIDRACVIPEGMVI 396 (431) T ss_pred CEEECCEEECCEEEE-CCEE-ECCCCCCCCEECCCCEEECCEECCCCEECCCCEEEEEEECCCCEECCCCEE T ss_conf 416478985998993-8488-809811888999998997269859698999999983798999999979998 No 187 >TIGR02287 PaaY phenylacetic acid degradation protein PaaY; InterPro: IPR011974 Members of this family are located next to other genes organised into apparent operons for phenylacetic acid degradation . PaaY is located near the end of these gene clusters and often next to PaaX, a transcriptional regulator.. Probab=98.91 E-value=1.9e-09 Score=72.23 Aligned_cols=99 Identities=19% Similarity=0.299 Sum_probs=55.0 Q ss_pred CCCCCCEEEECCCCCCCCC----CCCCCCCCCCCCCCCC----CCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 3442000211163134332----1211121002134432----0000025864432023102112478544874555663 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLEG----VHIGKKTIIGPFARIR----QETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDS 355 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~~----~~Ig~~~~Ig~~~~i~----~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~ 355 (442) +++.||++|+|+|.+.|.| -++.+++.|+++|... ..|+++++-+||+.+.+ | -| T Consensus 25 GDVi~G~rcYvgPlASLRGDFGrIvl~eGAN~QD~CVMHGFPg~DTvVeenGHvGHgAiL------------H-----gC 87 (193) T TIGR02287 25 GDVIVGKRCYVGPLASLRGDFGRIVLKEGANVQDTCVMHGFPGQDTVVEENGHVGHGAIL------------H-----GC 87 (193) T ss_pred EEEEECCCCEECCCCCCCCCCCCEEEECCCCCCCCCEECCCCCCCEEEECCCCCCCCEEE------------C-----CC T ss_conf 217887952467663135775725753488614642436729988676068823321235------------1-----54 Q ss_pred EECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 988998997980883445775456298788798869989378398899899 Q gi|254780942|r 356 VVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 356 ~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) .||+|.-||-+.+.- | +++||.++.+++-+-+-+|..+..+..| T Consensus 88 ~vgrnaLvGMNAVVM--D-----gAvige~sIVaA~aFVKAg~E~paq~Lv 131 (193) T TIGR02287 88 RVGRNALVGMNAVVM--D-----GAVIGERSIVAASAFVKAGAEIPAQVLV 131 (193) T ss_pred EECCCCEECCCEEEE--C-----CCEECCCCCEEHHHHHHCCCCCCCCCEE T ss_conf 632523111650661--6-----6466774120020334347347846126 No 188 >COG4284 UDP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=98.89 E-value=2.6e-08 Score=65.70 Aligned_cols=183 Identities=17% Similarity=0.196 Sum_probs=102.8 Q ss_pred CCCEE-EEECCCCCCCCCCCCCCCEEEEC-CEEHHHHHHHHHHHC----CCC-EEEEEECCCHHHHHHH------HCCCC Q ss_conf 74089-99458877232788874412588-485799999999977----997-7999957988999997------30269 Q gi|254780942|r 3 RKRLA-IVLAAGRGHRMKSSSSKVLQKIA-GKPMISHVMETIAAA----GIE-NVALVLGYGAEEITRI------NFPPT 69 (442) Q Consensus 3 ~~~~A-iILAaG~GtRl~p~~pKpLlpi~-gkpli~~~i~~l~~~----g~~-~ivvv~~~~~e~i~~~------~~~~~ 69 (442) ..++| ++||||+||||...-||-|++|. |+++++.+.+.++.+ +++ ..++.++...++-..+ ..... T Consensus 103 ~~klAvl~LaGGqGtrlG~~gPKgl~~V~~gks~~dl~~~qIk~ln~~~~~~vP~~iMtS~nt~~t~s~f~~~~Y~~~~k 182 (472) T COG4284 103 LGKLAVLKLAGGQGTRLGCDGPKGLFEVKDGKSLFDLQAEQIKYLNRQYNVDVPLYIMTSLNTEETDSYFKSNDYFGLDK 182 (472) T ss_pred CCCEEEEEECCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHCCCCH T ss_conf 37548999448866533567875268945898499999999999999738997779882678277777776622037777 Q ss_pred CEEEEEECCC-----------------------CCCHHH---HHHH--CC-CCCCCCCCEEEEEECCCCCCCCCCCCCCC Q ss_conf 7289998599-----------------------886144---6641--00-00123446058985154111533220000 Q gi|254780942|r 70 LSVEYYIQDC-----------------------QQGTAH---AVLT--AQ-DAIKPGYDDVIIMYGDVPLVSSHTLKKAM 120 (442) Q Consensus 70 ~~i~~v~q~~-----------------------~~GTa~---Ai~~--a~-~~l~~~~~~~lVl~GD~~li~~~~l~~l~ 120 (442) .+|.++.|.. |-|.|+ |+.. .. +.+..+.+.+.|.|-|+ |--..|+.-+- T Consensus 183 ~~I~fF~Q~~~P~~~~~sg~~~~~~~~~~~~~~P~GnG~lf~aL~~SG~le~l~~~G~e~lfV~nIDN-L~~~vD~~~lg 261 (472) T COG4284 183 EDIFFFVQSLFPRLLSDSGLPFLESDDSNLAWYPPGNGDLFKALKSSGILEKLIAQGIEYLFVSNIDN-LGATVDLKFLG 261 (472) T ss_pred HHEEEEECCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHCCCEEEEEECCCC-CCCCCCHHHHH T ss_conf 78377724774435656685221458963101789986089998751369999856836999951431-23445899999 Q ss_pred CCCCCCCCCCCEEEECCCC-CCCCEEEECCC--CCEECCCCCCC----CCCC--CCCCCCC-CCHHEECCHHHHHH Q ss_conf 0013586100100101221-57440420233--31001001466----7542--1000355-60011125014677 Q gi|254780942|r 121 DKIAQGYSIAVVGFNADNP-KGYGRLLIKNN--EIIAIREENDA----TDEE--RKIHYCN-SGLMAIDGLYIMDW 186 (442) Q Consensus 121 ~~~~~~~~~~i~~~~~~dp-~~yGvV~~d~~--~v~~i~Ek~d~----~~~~--~~~~lin-~GiY~f~~~~L~~~ 186 (442) .+...+.+.++=+....+| ..-|++..-++ +++++-|=++. ..+. .+....| .++|++..+.|++. T Consensus 262 ~~~~~~~e~~~e~t~Kt~a~ekvG~Lv~~~g~~rllEysev~~~~~~~~~s~~~~~~~n~Nni~l~~~~~~~l~~~ 337 (472) T COG4284 262 FMAETNYEYLMETTDKTKADEKVGILVTYDGKLRLLEYSEVPNEHREEFTSDGKLKYFNTNNIWLHLFSVKFLKEA 337 (472) T ss_pred HHHHCCCCEEEEEEECCCCCCCCEEEEEECCCEEEEEEECCCHHHHHHHCCCCCEEEECCCCCEEEHHHHHHHHHH T ss_conf 9986275325787522456664117998679567899840784674320462212354264222026677888862 No 189 >COG1045 CysE Serine acetyltransferase [Amino acid transport and metabolism] Probab=98.85 E-value=6.1e-09 Score=69.34 Aligned_cols=69 Identities=32% Similarity=0.507 Sum_probs=41.5 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCE Q ss_conf 21211121002134432000002586443202310211247854487455566398899899798088344577545629 Q gi|254780942|r 302 GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTH 381 (442) Q Consensus 302 ~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ 381 (442) +.+||+.+.||+++.|+.+++||..-+= .|-. | -.||++|.||||+.+.. .-. T Consensus 87 GvVIgeta~IGddv~I~~gVTLGgtg~~------------~g~R--h------PtIg~~V~IGagAkILG-------~I~ 139 (194) T COG1045 87 GVVIGETAVIGDDVTIYHGVTLGGTGKE------------SGKR--H------PTIGNGVYIGAGAKILG-------NIE 139 (194) T ss_pred EEEECCEEEECCCEEEEECEEECCCCCC------------CCCC--C------CCCCCCEEECCCCEEEC-------CEE T ss_conf 5897642498797289706176688787------------7877--9------83178859899988971-------668 Q ss_pred ECCCCEECCCCEEECC Q ss_conf 8788798869989378 Q gi|254780942|r 382 INENAFIGSNSSLIAP 397 (442) Q Consensus 382 ig~~~~iG~~~~i~~g 397 (442) ||||++||+|++++.. T Consensus 140 IGd~a~IGA~sVVlkd 155 (194) T COG1045 140 IGDNAKIGAGSVVLKD 155 (194) T ss_pred ECCCCEECCCCEECCC T ss_conf 8789788878668157 No 190 >cd04647 LbH_MAT_like Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins. MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively while GAT specifically acetylates galactopyranosides. XAT catalyzes the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Members of this family contain a a left-handed parallel beta-helix (LbH) domain with at least 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). They are trimeric in their active form. Probab=98.85 E-value=8.1e-09 Score=68.62 Aligned_cols=45 Identities=24% Similarity=0.402 Sum_probs=33.6 Q ss_pred CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 353 GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 353 gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) ++..||++|.||+++++.. ++.||++|.||+++.+... +.+++++ T Consensus 57 ~~v~Ig~~v~IG~~~~I~~-------gv~Ig~~~iIga~SvV~k~--i~~~~i~ 101 (109) T cd04647 57 APIVIGDDVWIGANVVILP-------GVTIGDGAVVGAGSVVTKD--VPPNSIV 101 (109) T ss_pred CCEEEECCCEECCCEEEEE-------CCEECCCCEECCCCEECCC--CCCCCEE T ss_conf 8979942357889839928-------5130898699379889148--3999899 No 191 >PRK10191 putative colanic acid biosynthesis acetyltransferase WcaB; Provisional Probab=98.82 E-value=1.4e-08 Score=67.35 Aligned_cols=43 Identities=21% Similarity=0.446 Sum_probs=30.0 Q ss_pred CEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 3988998997980883445775456298788798869989378398899899 Q gi|254780942|r 355 SVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) -.||+||.||+|+.+.. +.+|||||+||+|+++... +.+++.+ T Consensus 93 P~Ig~~V~Igaga~IlG-------~i~IG~~~~IGA~svV~~d--vp~~~~v 135 (146) T PRK10191 93 PHIGNGVELGANVIILG-------DITLGNNVTVGAGSVVLDS--VPDNALV 135 (146) T ss_pred CEECCCEEEECCCEEEC-------CCEECCCCEECCCCEECCC--CCCCCEE T ss_conf 98799959928999964-------8599999999989589703--5999999 No 192 >cd05635 LbH_unknown Uncharacterized proteins, Left-handed parallel beta-Helix (LbH) domain: Members in this group are uncharacterized bacterial proteins containing a LbH domain with multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=98.81 E-value=3.5e-09 Score=70.68 Aligned_cols=73 Identities=19% Similarity=0.182 Sum_probs=30.7 Q ss_pred ECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCC Q ss_conf 31784100122322344200021116313433-21211121002134432000002586443202310211247854 Q gi|254780942|r 270 IQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSK 345 (442) Q Consensus 270 I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~ 345 (442) |++++.|+|+++|.+++.||++|.|++++.|. ++.||++|.||. -+ .+|+|++++++++..-+.+|+|++.+. T Consensus 14 I~~~~~I~~~~~i~Gp~~IG~~~~I~p~a~Ir~~t~Ig~~~~IG~--Ei-k~S~i~~~s~~~H~~YlGDS~iG~~vN 87 (101) T cd05635 14 IGKDAVIEPFAVIEGPVYIGPGSRVKMGARIYGNTTIGPTCKIGG--EV-EDSIIEGYSNKQHDGFLGHSYLGSWCN 87 (101) T ss_pred ECCCCEECCCCEEECCEEECCCCEECCCCEEECCCEECCCCEECC--EE-EEEEEECCCCCCCCCEEEEEEECCCCE T ss_conf 879999999999976669999999999978638879999977614--08-761885787568775876117838757 No 193 >KOG1460 consensus Probab=98.79 E-value=4.7e-09 Score=69.95 Aligned_cols=39 Identities=28% Similarity=0.354 Sum_probs=19.4 Q ss_pred CCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 2111210021344320000025864432023102112478 Q gi|254780942|r 304 HIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 304 ~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) .||.+++||+.++|| .|+|-++|.|-.++.+-+|+|+-+ T Consensus 308 SIga~vrvg~GvRl~-~sIIl~d~ei~enavVl~sIigw~ 346 (407) T KOG1460 308 SIGANVRVGPGVRLR-ESIILDDAEIEENAVVLHSIIGWK 346 (407) T ss_pred EECCCCEECCCCEEE-EEEECCCCEEECCCEEEEEEECCC T ss_conf 446882546882665-456026737621636776541144 No 194 >TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase; InterPro: IPR005664 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . 2,3,4,5-tetrahydropyridine-2-carboxylate N-succinyltransferase (also known as tetrahydrodipicolinate N-succinyltransferase or DapD) is part of the succinyl route of of lysine/DAP biosynthesis. The DapD protein is a homotrimer is a trimeric enzyme with each monomer composed of three domain: an N-terminal helical domain, a distinctive left-handed parallel beta-helix (LBH) domain, and a predominantly beta C-terminal domain , . The LBH structure is encoded by an imperfect tandem-repeated hexapeptide sequence. Each trimer contains three independent active sites, always occuring at the boundary of two subunits, and formed by residues from one N-terminal domain, one C-terminal domain and two adjacent LBH domains.; GO: 0016757 transferase activity transferring glycosyl groups, 0009089 lysine biosynthetic process via diaminopimelate. Probab=98.77 E-value=1.1e-08 Score=67.79 Aligned_cols=52 Identities=25% Similarity=0.337 Sum_probs=24.2 Q ss_pred ECCCCCCCCC-CCCCCCCCCCCCCCCCCC--------------EEECCCCCCCCEEEEE-CCCCCCCC Q ss_conf 1163134332-121112100213443200--------------0002586443202310-21124785 Q gi|254780942|r 293 QIRAFSYLEG-VHIGKKTIIGPFARIRQE--------------TTIEKNVRIGNFCEVK-KATIKEGS 344 (442) Q Consensus 293 ~I~~~~~i~~-~~Ig~~~~Ig~~~~i~~~--------------s~i~~~~~Ig~~~ei~-~s~i~~g~ 344 (442) +|..++.++- ++||+=+.||.|+||..| ++|+|||.||+..||- ..++.+|| T Consensus 134 YvDEGtMvDTWATvGSCAQIGKNvHlSGG~GIGGVLEPLQA~P~IIEDNCFiGARSEi~EGVIVEEGS 201 (275) T TIGR00965 134 YVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLEPLQAKPTIIEDNCFIGARSEIVEGVIVEEGS 201 (275) T ss_pred EEECCCEECCHHHHCCHHHHCCCEEEECCCCCCCCCCCCCCCCEEEECCCEECCCEEEEEEEEECCCC T ss_conf 75168601001223003220662575178331652463357875882476004611356207971685 No 195 >PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional Probab=98.77 E-value=2e-08 Score=66.44 Aligned_cols=57 Identities=14% Similarity=0.267 Sum_probs=24.4 Q ss_pred CCCCEEECCCEEEEEEEEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCCCCC Q ss_conf 33210211527994112331784100122322344200021116313433-2121112100213443 Q gi|254780942|r 252 SGVTMIAPETVFLSHDTIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE-GVHIGKKTIIGPFARI 317 (442) Q Consensus 252 ~gv~~~~p~~~~i~~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~-~~~Ig~~~~Ig~~~~i 317 (442) .|+++..+ ..+...+.|++++++.| +++. =+++|++++.++ .+.+|.-+.||.++|| T Consensus 95 ~gvRvVP~--a~vR~Gayi~~~vVlMP-s~VN------iGAyv~~gtmiDtwatvGScAqiGknvhl 152 (265) T PRK11830 95 AGVRVVPG--AVVRRGAYIAKNVVLMP-SYVN------IGAYVDEGTMVDTWATVGSCAQIGKNVHL 152 (265) T ss_pred CCEEECCC--CEEEECEEECCCCEEEE-EEEE------EEEEECCCCEEECCCCEECHHEECCCEEE T ss_conf 79088788--53760305459958864-1777------60398489548047624115323783368 No 196 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=98.75 E-value=9.3e-09 Score=68.26 Aligned_cols=81 Identities=19% Similarity=0.204 Sum_probs=42.7 Q ss_pred CCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCE Q ss_conf 23442000211163134332121112100213443200000258644320231021124785448745556639889989 Q gi|254780942|r 283 GCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVN 362 (442) Q Consensus 283 g~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ 362 (442) -......+.+++...+.+++|.|+.+|.|-- .+ .+|+|..+++|+.++.|++|+||.++.|+..+.+-.|+|.+||. T Consensus 277 yT~~~~~pPak~~~~s~v~nSLv~~GciI~G--~V-~nSVL~~~v~I~~gs~i~~svim~~~~IG~~~~l~~aIIDk~v~ 353 (393) T COG0448 277 YTKNKNLPPAKFVNDSEVSNSLVAGGCIISG--TV-ENSVLFRGVRIGKGSVIENSVIMPDVEIGEGAVLRRAIIDKNVV 353 (393) T ss_pred EECCCCCCCCEEECCCEEEEEEEECCEEEEE--EE-EEEEEECCEEECCCCEEEEEEEECCCEECCCCEEEEEEECCCCE T ss_conf 6137888981675585576226607718963--89-86387567197789888753872786888998899989679968 Q ss_pred ECCC Q ss_conf 9798 Q gi|254780942|r 363 IGAG 366 (442) Q Consensus 363 ig~g 366 (442) |+.| T Consensus 354 I~~g 357 (393) T COG0448 354 IGEG 357 (393) T ss_pred ECCC T ss_conf 6898 No 197 >COG1861 SpsF Spore coat polysaccharide biosynthesis protein F, CMP-KDO synthetase homolog [Cell envelope biogenesis, outer membrane] Probab=98.74 E-value=2.9e-08 Score=65.47 Aligned_cols=115 Identities=23% Similarity=0.388 Sum_probs=79.4 Q ss_pred CCEEEEECCCCCCCCCCCC--CCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCH--HHHHHHHCCCCCEEEEEECC Q ss_conf 4089994588772327888--744125884857999999999779-977999957988--99999730269728999859 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSS--SKVLQKIAGKPMISHVMETIAAAG-IENVALVLGYGA--EEITRINFPPTLSVEYYIQD 78 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~--pKpLlpi~gkpli~~~i~~l~~~g-~~~ivvv~~~~~--e~i~~~~~~~~~~i~~v~q~ 78 (442) -++++|+.| ||.+++ -|.|+|++++|||+++|++++++. .+++++.|+-+. +.++++....+..+. T Consensus 2 ~~I~~IiQA----RmgStRLpgKvLlpL~~~pmI~~~lervrks~~~d~ivvATS~~~~d~~l~~~~~~~G~~vf----- 72 (241) T COG1861 2 SMILVIIQA----RMGSTRLPGKVLLPLGGEPMIEYQLERVRKSKDLDKIVVATSDKEEDDALEEVCRSHGFYVF----- 72 (241) T ss_pred CCEEEEEEE----CCCCCCCCCCHHHHCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHCCEEEE----- T ss_conf 847988530----02676578613333078741999999986043324358973477204579999997082476----- Q ss_pred CCCCHHHHHH----HCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCC-CCCCCCCCCC Q ss_conf 9886144664----100001234460589851541115332200000-0135861001 Q gi|254780942|r 79 CQQGTAHAVL----TAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMD-KIAQGYSIAV 131 (442) Q Consensus 79 ~~~GTa~Ai~----~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~-~~~~~~~~~i 131 (442) +|.+.-+. .+.+... .+.++=+-||.||+++..+...++ +.+++.+-+- T Consensus 73 --rGs~~dVL~Rf~~a~~a~~--~~~VVRvTGD~P~~dp~l~d~~v~~~l~~gaDY~s 126 (241) T COG1861 73 --RGSEEDVLQRFIIAIKAYS--ADVVVRVTGDNPFLDPELVDAAVDRHLEKGADYVS 126 (241) T ss_pred --CCCHHHHHHHHHHHHHHCC--CCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCC T ss_conf --3888999999999997358--87699960899877889999999999865986013 No 198 >cd04649 LbH_THP_succinylT_putative Putative 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (THP succinyltransferase), C-terminal left-handed parallel alpha-helix (LbH) domain: This group is composed of mostly uncharacterized proteins containing an N-terminal domain of unknown function and a C-terminal LbH domain with similarity to THP succinyltransferase LbH. THP succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is trimeric and displays the left-handed parallel alpha-helix (LbH) structural motif encoded by the hexapeptide repeat motif. Probab=98.68 E-value=5.1e-08 Score=64.08 Aligned_cols=80 Identities=15% Similarity=0.223 Sum_probs=44.9 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCE Q ss_conf 21211121002134432000002586443202310211247854487455566398899899798088344577545629 Q gi|254780942|r 302 GVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTH 381 (442) Q Consensus 302 ~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ 381 (442) |+.+...++|. .+|..++++++++.||..+.+-. ++..+.+. ...||+||.|||..-+ +.. T Consensus 31 nAgt~g~~Mie--grissgv~VG~~~diggGasi~G-~Ls~g~~~-------pi~Ig~ncliGAnsgi---------Gi~ 91 (147) T cd04649 31 NAGTLGNCMVE--GRISSGVIVGKGSDVGGGASIMG-TLSGGGNN-------VISIGKRCLLGANSGI---------GIS 91 (147) T ss_pred CCCCCCCCEEE--EEEEECEEECCCCCCCCCCEEEE-ECCCCCCC-------CEEECCCEEECCCCCC---------CEE T ss_conf 66306661687--67851569899833278617878-71678823-------5698875076146545---------848 Q ss_pred ECCCCEECCCCEEECCEEE Q ss_conf 8788798869989378398 Q gi|254780942|r 382 INENAFIGSNSSLIAPITI 400 (442) Q Consensus 382 ig~~~~iG~~~~i~~gv~i 400 (442) +||+|.|.++..|-++.+| T Consensus 92 lGd~cvv~aGlyi~~gTki 110 (147) T cd04649 92 LGDNCIVEAGLYVTAGTKV 110 (147) T ss_pred ECCCCEEECCEEECCCCEE T ss_conf 6898688244598399789 No 199 >cd00897 UGPase_euk Eukaryotic UGPase catalyses the synthesis of UDP-Glucose. UGPase (UDP-Glucose Pyrophosphorylase) catalyzes the reversible production of UDP-Glucose and pyrophosphate (PPi) from Glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids, glycoproteins, and proteoglycans. UGPase is found in both prokaryotes and eukaryotes. Interestingly, while the prokaryotic and eukaryotic forms of UGPase catalyze the same reaction, they share low sequence similarity. This family consists of mainly eukaryotic UTP-glucose-1-phosphate uridylyltransferases. Probab=98.68 E-value=2.5e-06 Score=54.43 Aligned_cols=74 Identities=20% Similarity=0.385 Sum_probs=51.2 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEEC-CEEHHHHHHHHHHH----CCCC-EEEEEECCCH-HHHHHHHCC---CCCEEE Q ss_conf 408999458877232788874412588-48579999999997----7997-7999957988-999997302---697289 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIA-GKPMISHVMETIAA----AGIE-NVALVLGYGA-EEITRINFP---PTLSVE 73 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~-gkpli~~~i~~l~~----~g~~-~ivvv~~~~~-e~i~~~~~~---~~~~i~ 73 (442) |...|.||||.||||.-..||.|+|+. |+++++.+++++.. .|.. ..++.+++.. ++..+++.+ ...++. T Consensus 3 k~avlkLnGGlGTrlG~~~pK~~i~V~~~~tfldl~~~qi~~l~~~yg~~vPl~iMtS~~T~~~T~~~l~ky~~~~~~i~ 82 (300) T cd00897 3 KLVVLKLNGGLGTSMGCTGPKSLIEVRDGKTFLDLTVQQIEHLNKTYGVDVPLVLMNSFNTDEDTKKILKKYAGVNVDIH 82 (300) T ss_pred CEEEEEECCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHCCCCCCCEE T ss_conf 37999926987455688998177882899809999999999999971998318997886477899999997488788769 Q ss_pred EEEC Q ss_conf 9985 Q gi|254780942|r 74 YYIQ 77 (442) Q Consensus 74 ~v~q 77 (442) +..| T Consensus 83 ~F~Q 86 (300) T cd00897 83 TFNQ 86 (300) T ss_pred EEEC T ss_conf 9843 No 200 >cd04180 UGPase_euk_like Eukaryotic UGPase-like includes UDPase and UDPGlcNAc pyrophosphorylase enzymes. This family includes UDP-Glucose Pyrophosphorylase (UDPase) and UDPGlcNAc pyrophosphorylase enzymes. The two enzymes share significant sequence and structure similarity. UDP-Glucose Pyrophosphorylase catalyzes a reversible production of UDP-Glucose and pyrophosphate (PPi) from Glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids , glycoproteins , and proteoglycans . UDP-N-acetylglucosamine (UDPGlcNAc) pyrophosphorylase (UAP) (also named GlcNAc1P uridyltransferase), catalyzes the reversible conversion of UTP and GlcNAc1P from PPi and UDPGlcNAc, which is a key precursor of N- and O-linked glycosylations and is essential for the synthesis of chitin (a major component of the fungal cell wall) and of the glycosylphosphatidylinositol (GPI) linker anchoring a variety o Probab=98.63 E-value=4.2e-07 Score=58.84 Aligned_cols=144 Identities=17% Similarity=0.134 Sum_probs=85.8 Q ss_pred EEEEECCCCCCCCCCCCCCCEEEEC---CEEHHHHHHHHHHHC------CC-CEEEEEECCC-HHHHHHHHCCC---CCE Q ss_conf 8999458877232788874412588---485799999999977------99-7799995798-89999973026---972 Q gi|254780942|r 6 LAIVLAAGRGHRMKSSSSKVLQKIA---GKPMISHVMETIAAA------GI-ENVALVLGYG-AEEITRINFPP---TLS 71 (442) Q Consensus 6 ~AiILAaG~GtRl~p~~pKpLlpi~---gkpli~~~i~~l~~~------g~-~~ivvv~~~~-~e~i~~~~~~~---~~~ 71 (442) .+++||||.||||.-.-||-|+||. |+++++..++++.+. +. --+++.++.. .++..+++... ..+ T Consensus 2 avllLaGGlGTrLG~~~pKg~~~v~~~~~~t~~~l~~~~i~~l~~~~~~~~~iPl~iMtS~~T~~~T~~~~~~~~~~~~~ 81 (266) T cd04180 2 AVVLLAGGLGTRLGKDGPKSSTDVGLPSGQCFLQLIGEKILTLQEIDLYSCKIPEQLMNSKYTHEKTQCYFEKINQKNSY 81 (266) T ss_pred EEEEECCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCCCCCC T ss_conf 89996687623469999737067258999859999999999999986179983199966800447999999983999664 Q ss_pred EEEEECC---------------------CCCCHHHHHHHCC------CCC-CCCCCEEEEEECCCCCCCCCCCCCCCC-C Q ss_conf 8999859---------------------9886144664100------001-234460589851541115332200000-0 Q gi|254780942|r 72 VEYYIQD---------------------CQQGTAHAVLTAQ------DAI-KPGYDDVIIMYGDVPLVSSHTLKKAMD-K 122 (442) Q Consensus 72 i~~v~q~---------------------~~~GTa~Ai~~a~------~~l-~~~~~~~lVl~GD~~li~~~~l~~l~~-~ 122 (442) +.+..|. .|-|.|| ++.++ +.+ +.+...+.+.+-|+.+....|. .++- + T Consensus 82 v~~f~Q~~~P~~~~~g~~~~~~~~~~~~~P~GhGd-i~~aL~~sG~Ld~l~~~Gieyi~v~~vDN~la~v~DP-~flG~~ 159 (266) T cd04180 82 VITFMQGKLPLKNDDDARDPHNKTKCHLFPCGHGD-VVLALIHSGHLNKLLEKGYRYIHFIGVDNLLVKVADP-LFIGIA 159 (266) T ss_pred EEEEECCCCCEECCCCCCCCCCCCCEEECCCCCCH-HHHHHHHCCCHHHHHHCCCEEEEEEECCCCCCHHHHH-HHHHHH T ss_conf 49995288614857887167999942477799627-9999997780999996698899999668666002479-999999 Q ss_pred CCCCCCCCCEEEECCCCC-CCCEEE-ECCCC Q ss_conf 135861001001012215-744042-02333 Q gi|254780942|r 123 IAQGYSIAVVGFNADNPK-GYGRLL-IKNNE 151 (442) Q Consensus 123 ~~~~~~~~i~~~~~~dp~-~yGvV~-~d~~~ 151 (442) ..++.+.+.-+.+..+|. .-|++. .++++ T Consensus 160 ~~~~~~~~~kvv~Kt~~dek~G~l~~~~~g~ 190 (266) T cd04180 160 IQNRKAINQKVVPKTRNEESGGYRIANINGR 190 (266) T ss_pred HHHCCCEEEEEEECCCCCCEEEEEEEEECCC T ss_conf 9719975899978998986045999980892 No 201 >KOG1322 consensus Probab=98.59 E-value=3.3e-08 Score=65.13 Aligned_cols=73 Identities=33% Similarity=0.457 Sum_probs=43.7 Q ss_pred CCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECC Q ss_conf 11121002134432000002586443202310211247854487455566398899899798088344577545629878 Q gi|254780942|r 305 IGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINE 384 (442) Q Consensus 305 Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~ 384 (442) +|++|.||| +++||.+|+|++.|++++|+++.++.+...+.+..+.+|-+|.||.|. +|.. T Consensus 273 iG~~C~Ig~------~vvIG~r~~i~~gV~l~~s~il~~~~~~~~s~i~s~ivg~~~~IG~~~-------------~id~ 333 (371) T KOG1322 273 IGENCSIGP------NVVIGPRVRIEDGVRLQDSTILGADYYETHSEISSSIVGWNVPIGIWA-------------RIDK 333 (371) T ss_pred CCCCCEECC------CCEECCCCEECCCEEEEEEEEECCCEECHHHHHHHHHCCCCCCCCCCE-------------EEEC T ss_conf 388637888------736778728627628874279725121016888764325652005746-------------9712 Q ss_pred CCEECCCCEEEC Q ss_conf 879886998937 Q gi|254780942|r 385 NAFIGSNSSLIA 396 (442) Q Consensus 385 ~~~iG~~~~i~~ 396 (442) ++++|.|+.+.. T Consensus 334 ~a~lG~nV~V~d 345 (371) T KOG1322 334 NAVLGKNVIVAD 345 (371) T ss_pred CCEECCCEEEEC T ss_conf 537536259942 No 202 >TIGR02353 NRPS_term_dom non-ribosomal peptide synthetase terminal domain of unknown function; InterPro: IPR012728 This domain is found exclusively in non-ribosomal peptide synthetases and always as the final domain in the polypeptide. This domain is roughly 700 amino acids in size and is found in polypeptides roughly twice that size.. Probab=98.58 E-value=1.5e-07 Score=61.36 Aligned_cols=79 Identities=19% Similarity=0.406 Sum_probs=62.7 Q ss_pred CCCCCCCCCCC-------C-CEECCCCEECCCEEECC---CCCCCCC-CCEECCCCEECCCCEEECCEEECCCCEECCCC Q ss_conf 85448745556-------6-39889989979808834---4577545-62987887988699893783988998997783 Q gi|254780942|r 343 GSKINHLSYVG-------D-SVVGKNVNIGAGTITCN---YDGTHKY-KTHINENAFIGSNSSLIAPITIGQGTYVASGS 410 (442) Q Consensus 343 g~~i~~~~~ig-------d-~~iG~~~~ig~g~i~~n---~~g~~~~-~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~ 410 (442) |+||+..+|++ | +.||++++++.|+++=- .|++=|. +++|||+|.+|.++.++++++||++|.+|+.| T Consensus 621 G~kiGr~v~~d~~~ltE~dLv~IG~d~tlN~G~~~QtHlfeDRv~k~D~v~igdgatlG~~aiVlY~~~~G~~a~lgp~S 700 (719) T TIGR02353 621 GVKIGRGVYIDTTDLTERDLVTIGDDSTLNEGSVIQTHLFEDRVMKSDTVTIGDGATLGPGAIVLYGAVVGEDAVLGPDS 700 (719) T ss_pred CCEECCCCEECCCCCCCHHHEEECCCCEECCCCCEECCCCCCCEEEECEEEEECEEEECCCCEECCCEEECCCCEECCCC T ss_conf 82455532226756752220365686124687511114000675674528972403888863585731556864346843 Q ss_pred EECCC--CCCCCE Q ss_conf 57664--369868 Q gi|254780942|r 411 IITQD--TPENSL 421 (442) Q Consensus 411 ~v~~d--v~~~~~ 421 (442) .|-|- ||+.+- T Consensus 701 lVMkGE~vP~~tr 713 (719) T TIGR02353 701 LVMKGEEVPAHTR 713 (719) T ss_pred EEECCCCCCCCCC T ss_conf 2503552888785 No 203 >TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase; InterPro: IPR006375 This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (5.3.1.8 from EC) (PMI) and mannose-1-phosphate guanylyltransferase (2.7.7.22 from EC) in Pseudomonas aeruginosa, Xanthomonas campestris, and Acetobacter xylinus. The literature on the enzyme from Escherichia coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain lies at the C-terminal. ; GO: 0008928 mannose-1-phosphate guanylyltransferase (GDP) activity, 0009103 lipopolysaccharide biosynthetic process. Probab=98.58 E-value=1e-07 Score=62.41 Aligned_cols=184 Identities=16% Similarity=0.231 Sum_probs=109.8 Q ss_pred EEEEECCCCCCCCCCC----CCCCEEEEC-CE-EHHHHHHHHHHHC-CCCEEEEEECCCHH-HHHHHHCCCCC-EEEEEE Q ss_conf 8999458877232788----874412588-48-5799999999977-99779999579889-99997302697-289998 Q gi|254780942|r 6 LAIVLAAGRGHRMKSS----SSKVLQKIA-GK-PMISHVMETIAAA-GIENVALVLGYGAE-EITRINFPPTL-SVEYYI 76 (442) Q Consensus 6 ~AiILAaG~GtRl~p~----~pKpLlpi~-gk-pli~~~i~~l~~~-g~~~ivvv~~~~~e-~i~~~~~~~~~-~i~~v~ 76 (442) +.+||+||.|+||||. .||.++++. .. +++.-.+..+... ....-++++..... .+.+.....+. .-.++. T Consensus 2 ~p~~~~gg~g~~lwp~~~~~~p~~f~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~n~~~~f~~~~~~~~~~~~~~~~~l 81 (478) T TIGR01479 2 LPVILAGGSGTRLWPLSRELYPKQFLALVGDLYTLLQETLKRLAGLPDCSSPLVICNEEHRFLVAEQLRELGLLASNILL 81 (478) T ss_pred CCEEEECCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCEEEECCHHHHHHHHHHHHHCCCCCCEEE T ss_conf 52264078775012121122003443204520467899998622664223652552420346888889860652222143 Q ss_pred CCCCCCHHHHHHHCCCCC-CC--CCCE-EEEEECCCCCCCCCCCCCCCCCC---CCCCCCCCEEEECCCC-CCCCEEEEC Q ss_conf 599886144664100001-23--4460-58985154111533220000001---3586100100101221-574404202 Q gi|254780942|r 77 QDCQQGTAHAVLTAQDAI-KP--GYDD-VIIMYGDVPLVSSHTLKKAMDKI---AQGYSIAVVGFNADNP-KGYGRLLIK 148 (442) Q Consensus 77 q~~~~GTa~Ai~~a~~~l-~~--~~~~-~lVl~GD~~li~~~~l~~l~~~~---~~~~~~~i~~~~~~dp-~~yGvV~~d 148 (442) ++....|+-|+..+.-.. .. ..+. ++++..|+.+.+...+...+... ........+......| ++||.+... T Consensus 82 ~p~g~~~~p~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~g~~~~fg~~p~~p~~gygy~~~g 161 (478) T TIGR01479 82 EPVGRNTAPAIALAALLALRRAPGEDPLLLVLAADHVITDEDAFRAAVKLALPAAAEGKLVTFGIVPTTPETGYGYIRRG 161 (478) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHCC T ss_conf 02666415789999998764057877478986155111003678999876432433175478502357765453111014 Q ss_pred --C-----------CCCEECCCCCCCCCCCCC----CCCCCCCHHEECCHHHHHHHHH Q ss_conf --3-----------331001001466754210----0035560011125014677753 Q gi|254780942|r 149 --N-----------NEIIAIREENDATDEERK----IHYCNSGLMAIDGLYIMDWLLQ 189 (442) Q Consensus 149 --~-----------~~v~~i~Ek~d~~~~~~~----~~lin~GiY~f~~~~L~~~l~~ 189 (442) . ..+..|+|||+....+.. ..+-|+|+|.|+.....+-+.+ T Consensus 162 ~~~~~~~~~~~~~~~~~~~f~~~p~~~~~~~~~~~g~~~wn~g~f~~~~~~~~~~l~~ 219 (478) T TIGR01479 162 LDAPLAGGDDGDDVYAVERFVEKPDLATAQEYLESGDYYWNSGMFLFRASRYLAELKK 219 (478) T ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHCCCCEECCCCEEHHHHHHHHHHHH T ss_conf 4310244211221466666540545567899975176200143000013578998876 No 204 >cd05825 LbH_wcaF_like wcaF-like: This group is composed of the protein product of the E. coli wcaF gene and similar proteins. WcaF is part of the gene cluster responsible for the biosynthesis of the extracellular polysaccharide colanic acid. The wcaF protein is predicted to contain a left-handed parallel beta-helix (LbH) domain encoded by imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Many are trimeric in their active forms. Probab=98.54 E-value=6.4e-08 Score=63.49 Aligned_cols=45 Identities=20% Similarity=0.262 Sum_probs=32.5 Q ss_pred CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 663988998997980883445775456298788798869989378398899899 Q gi|254780942|r 353 GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 353 gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) +...||++|.||+++++.. ++.||+++.+|+++++... +.+++++ T Consensus 55 ~~v~Ig~~v~IG~~~~I~~-------gv~Ig~~svVgagSvV~kd--vp~~~i~ 99 (107) T cd05825 55 APIVIGDGAWVAAEAFVGP-------GVTIGEGAVVGARSVVVRD--LPAWTVY 99 (107) T ss_pred CCEEECCCCEECCCCEEEC-------CCEECCCCEEECCCEECCC--CCCCCEE T ss_conf 9979938858749878969-------9191899799489889038--5999799 No 205 >TIGR01172 cysE serine O-acetyltransferase; InterPro: IPR005881 The biosynthesis of L-cysteine is the predominant way by which inorganic sulphur is incorporated into organic compounds. In this process, the most abundant utilizable source of sulphur, inorganic sulphate, is taken up and reduced to sulphide. Sulphide is used to produce L-cysteine, which serves for protein synthesis or the production of other sulphur-containing organic compounds. Two routes for cysteine biosynthesis in nature have been documented. Serine transacetylase (2.3.1.30 from EC) catalyzes steps in pathway I, the activation of L-serine by acetyl-coenzyme A, yielding O-acetyl-L-serine.; GO: 0009001 serine O-acetyltransferase activity, 0006535 cysteine biosynthetic process from serine, 0005737 cytoplasm. Probab=98.51 E-value=1.4e-07 Score=61.60 Aligned_cols=93 Identities=32% Similarity=0.437 Sum_probs=50.9 Q ss_pred ECCCCCEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCC Q ss_conf 31784100122322344200021116313433212111210021344320000025864432023102112478544874 Q gi|254780942|r 270 IQPDTVIEPHVFFGCGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHL 349 (442) Q Consensus 270 I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~ 349 (442) +-.+..|+|.+.||.++.|.- + .|.+||+-|.||+++.|+.+..||-.-+- .|.|= | T Consensus 58 ~~tgidIHPgA~IG~g~fIDH------g---~GvVIGETa~iGddv~iyhGVTLGGtgk~------------~G~kR-H- 114 (163) T TIGR01172 58 VLTGIDIHPGARIGRGVFIDH------G---TGVVIGETAEIGDDVTIYHGVTLGGTGKE------------KGVKR-H- 114 (163) T ss_pred HHCCCCCCCCCCCCCEEEEEC------C---CCEEEEEEEEECCCEEEEECEEECCCCCC------------CCCCC-C- T ss_conf 530232176430077307775------6---03685003687388378632120678888------------78898-8- Q ss_pred CCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECC Q ss_conf 555663988998997980883445775456298788798869989378 Q gi|254780942|r 350 SYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAP 397 (442) Q Consensus 350 ~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~g 397 (442) =.||++|-||||..+.. .-.||+|++||+|++++.. T Consensus 115 -----Pt~~~gV~iGAGAKvLG-------~I~vG~~AkiGAnsVVl~d 150 (163) T TIGR01172 115 -----PTIGEGVVIGAGAKVLG-------NIEVGENAKIGANSVVLKD 150 (163) T ss_pred -----CCCCCCEEEEECCEEEC-------CEEECCCCEECCCEEECCC T ss_conf -----82568717840782403-------7667799777233267353 No 206 >TIGR02353 NRPS_term_dom non-ribosomal peptide synthetase terminal domain of unknown function; InterPro: IPR012728 This domain is found exclusively in non-ribosomal peptide synthetases and always as the final domain in the polypeptide. This domain is roughly 700 amino acids in size and is found in polypeptides roughly twice that size.. Probab=98.47 E-value=1e-07 Score=62.30 Aligned_cols=68 Identities=24% Similarity=0.456 Sum_probs=36.4 Q ss_pred CEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEE-----------ECCEEECCCCEECCC------CEECCC-- Q ss_conf 3988998997980883445775456298788798869989-----------378398899899778------357664-- Q gi|254780942|r 355 SVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSL-----------IAPITIGQGTYVASG------SIITQD-- 415 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i-----------~~gv~ig~~~~i~ag------~~v~~d-- 415 (442) |.||++|.++.--+|- .| =+.|||.+.++.++.+ ...|+|||+|++|++ ++|-.| T Consensus 622 ~kiGr~v~~d~~~ltE-~d-----Lv~IG~d~tlN~G~~~QtHlfeDRv~k~D~v~igdgatlG~~aiVlY~~~~G~~a~ 695 (719) T TIGR02353 622 VKIGRGVYIDTTDLTE-RD-----LVTIGDDSTLNEGSVIQTHLFEDRVMKSDTVTIGDGATLGPGAIVLYGAVVGEDAV 695 (719) T ss_pred CEECCCCEECCCCCCC-HH-----HEEECCCCEECCCCCEECCCCCCCEEEECEEEEECEEEECCCCEECCCEEECCCCE T ss_conf 2455532226756752-22-----03656861246875111140006756745289724038888635857315568643 Q ss_pred CCCCCEEEECCCEE Q ss_conf 36986898135607 Q gi|254780942|r 416 TPENSLVFARSRQI 429 (442) Q Consensus 416 v~~~~~v~~~~~~~ 429 (442) +.|.|+| +|++++ T Consensus 696 lgp~SlV-MkGE~v 708 (719) T TIGR02353 696 LGPDSLV-MKGEEV 708 (719) T ss_pred ECCCCEE-ECCCCC T ss_conf 4684325-035528 No 207 >KOG3121 consensus Probab=98.46 E-value=1.2e-07 Score=62.03 Aligned_cols=47 Identities=11% Similarity=0.198 Sum_probs=28.3 Q ss_pred EECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCC Q ss_conf 988998997980883445775456298788798869989378398899899778 Q gi|254780942|r 356 VVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASG 409 (442) Q Consensus 356 ~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag 409 (442) +||+.+.+|+..++.| ++++.|-|+|-.++++.|...+..++.+++. T Consensus 103 qIgsyVh~GknaviGr-------rCVlkdCc~ild~tVlPpet~vppy~~~~g~ 149 (184) T KOG3121 103 QIGSYVHLGKNAVIGR-------RCVLKDCCRILDDTVLPPETLVPPYSTIGGN 149 (184) T ss_pred HHEEEEEECCCEEECC-------CEEHHHHEECCCCCCCCCCCCCCCCEEECCC T ss_conf 6120368546406737-------5083341041589525820136971677388 No 208 >cd06424 UGGPase UGGPase catalyzes the synthesis of UDP-Glucose/UDP-Galactose. UGGPase: UDP-Galactose/Glucose Pyrophosphorylase catalyzes the reversible production of UDP-Glucose/UDP-Galactose and pyrophosphate (PPi) from Glucose-1-phosphate/Galactose-1-phosphate and UTP. Its dual substrate specificity distinguishes it from the single substrate enzyme UDP-glucose pyrophosphorylase. It may play a key role in the galactose metabolism in raffinose oligosaccharide (RFO) metabolizing plants. RFO raffinose is a major photoassimilate and is a galactosylderivative of sucrose (Suc) containing a galactose (Gal) moiety. Upon arriving at the sink tissue, the Gal moieties of the RFOs are initially removed by alpha-galactosidase and then are phosphorylated to Gal-1-P. Gal-1-P is converted to UDP-Gal. The UDP-Gal is further metabolized to UDP-Glc via an epimerase reaction. The UDP-Glc can be directly utilized in cell wall metabolism or in Suc synthesis. However, for the Suc synthesis UDP-Glc must be f Probab=98.45 E-value=6.5e-06 Score=52.10 Aligned_cols=72 Identities=14% Similarity=0.155 Sum_probs=50.9 Q ss_pred EEEECCCCCCCCCCCCCCCEEEEC---CEEHHHHHHHHHHH----CC----C-CEEEEEECCC-HHHHHHHHCCC----- Q ss_conf 999458877232788874412588---48579999999997----79----9-7799995798-89999973026----- Q gi|254780942|r 7 AIVLAAGRGHRMKSSSSKVLQKIA---GKPMISHVMETIAA----AG----I-ENVALVLGYG-AEEITRINFPP----- 68 (442) Q Consensus 7 AiILAaG~GtRl~p~~pKpLlpi~---gkpli~~~i~~l~~----~g----~-~~ivvv~~~~-~e~i~~~~~~~----- 68 (442) -|+||||.||||.-..||-++|+. ++++++..++.+.+ ++ . -.+++.++.. .++..+++.+. T Consensus 3 vvllaGG~GTRLG~~~pK~~l~v~~~~~~tl~~l~~~~i~~~~~~~~~~~~~~iP~~iMtS~~t~~~T~~~~~~~~~FGl 82 (315) T cd06424 3 FVLVAGGLGERLGYSGIKIGLPVELTTNTTYLQYYLNYIRAFQEASKKGEKMEIPFVIMTSDDTHSKTLKLLEENNYFGL 82 (315) T ss_pred EEEECCCCCCCCCCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCC T ss_conf 99977878432489988556012788998099999999999999860578988657997898736999999986630368 Q ss_pred -CCEEEEEECC Q ss_conf -9728999859 Q gi|254780942|r 69 -TLSVEYYIQD 78 (442) Q Consensus 69 -~~~i~~v~q~ 78 (442) ..++.+..|. T Consensus 83 ~~~~v~~f~Q~ 93 (315) T cd06424 83 EKDQVHILKQE 93 (315) T ss_pred CCCCEEEEECC T ss_conf 85455998438 No 209 >cd03349 LbH_XAT Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. Members of this class of enzymes include Enterococcus faecium streptogramin A acetyltransferase and Pseudomonas aeruginosa chloramphenicol acetyltransferase. They contain repeated copies of a six-residue hexapeptide repeat sequence motif (X-[STAV]-X-[LIV]-[GAED]-X) and adopt a left-handed parallel beta helix (LbH) structure. The active enzyme is a trimer with CoA and substrate binding sites at the interface of two separate LbH subunits. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Probab=98.40 E-value=1.1e-06 Score=56.43 Aligned_cols=70 Identities=20% Similarity=0.276 Sum_probs=44.5 Q ss_pred CCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECC--CCEECCCCCCCCEEEECCCEEE Q ss_conf 66398899899798088344577545629878879886998937839889989977--8357664369868981356077 Q gi|254780942|r 353 GDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVAS--GSIITQDTPENSLVFARSRQIV 430 (442) Q Consensus 353 gd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~a--g~~v~~dv~~~~~v~~~~~~~~ 430 (442) ++.+||++|.||+++++.. +++||+++.||+++++.-. +.++++++. ...+.+..++.. ..++.. T Consensus 72 ~~i~Ig~~vwIG~n~~Il~-------GV~IG~gavIgAgSvVtkd--vp~~~i~~G~PAk~ik~Rf~~~~----~~~l~~ 138 (145) T cd03349 72 GDVIIGNDVWIGHGATILP-------GVTIGDGAVIAAGAVVTKD--VPPYAIVGGNPAKVIRYRFDEET----IERLLA 138 (145) T ss_pred CCEEECCCEEECCCCEEEC-------CCEECCCCEEECCCEECCC--CCCCCEEEEECCEEEECCCCHHH----HHHHHH T ss_conf 9979889969999888958-------8096788689289789704--69994998338878815599999----999987 Q ss_pred ECCCC Q ss_conf 07742 Q gi|254780942|r 431 KEDGA 435 (442) Q Consensus 431 ~~~~~ 435 (442) .+||. T Consensus 139 ~~ww~ 143 (145) T cd03349 139 LKWWD 143 (145) T ss_pred CCCCC T ss_conf 63646 No 210 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=98.39 E-value=2.6e-07 Score=60.09 Aligned_cols=49 Identities=10% Similarity=0.091 Sum_probs=31.6 Q ss_pred EEHHHHHHHHHHHCCCCEEEEEECC-----CHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 8579999999997799779999579-----889999973026972899985998 Q gi|254780942|r 32 KPMISHVMETIAAAGIENVALVLGY-----GAEEITRINFPPTLSVEYYIQDCQ 80 (442) Q Consensus 32 kpli~~~i~~l~~~g~~~ivvv~~~-----~~e~i~~~~~~~~~~i~~v~q~~~ 80 (442) +.=+.-.|+-|.++.-+.+++.-++ +-+++.+++...+.+|+.|.++-. T Consensus 106 ~~~~~~~l~fl~rS~~~y~V~~~s~~v~NiDl~~~l~~H~~~~~~iT~VYKkv~ 159 (383) T TIGR02092 106 KRYFSNNLEFLKRSTSEYVVVLNSHMVCNIDLKAVLKRHEETGKDITVVYKKVK 159 (383) T ss_pred HHHHHHHHHHHHHCCCCCEEEECCCEEEECCHHHHHHHHHHCCCCEEEEECCCC T ss_conf 999988767875089971898547457830289999998744896689973478 No 211 >KOG4042 consensus Probab=98.30 E-value=1.3e-06 Score=56.15 Aligned_cols=130 Identities=19% Similarity=0.303 Sum_probs=79.7 Q ss_pred EEECCCCCEECCCCCCCCCCCCEEEECCCCCCCC---C-CCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCC Q ss_conf 2331784100122322344200021116313433---2-12111210021344320000025864432023102112478 Q gi|254780942|r 268 TIIQPDTVIEPHVFFGCGVSIENYVQIRAFSYLE---G-VHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEG 343 (442) Q Consensus 268 ~~I~~~~~I~~~~~Ig~~~~Ig~~~~I~~~~~i~---~-~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g 343 (442) ++|.+++++---..|.+.++|+++|++.|++.+. | -+||+|+.|..++.|+. -+.++. T Consensus 9 vkIap~AvVCvEs~irGdvti~~gcVvHP~a~~iA~aGPI~iGEnniiEEyA~i~n--~~~~~~---------------- 70 (190) T KOG4042 9 VKIAPSAVVCVESDIRGDVTIKEGCVVHPFAVFIATAGPIYIGENNIIEEYAVIRN--RLEPGA---------------- 70 (190) T ss_pred EEECCCEEEEEECCCCCCEEECCCCEECCEEEEECCCCCEEECCCCHHHHHHHHHH--HCCCCC---------------- T ss_conf 46457518998602133148658838533079970569889866712566788770--068887---------------- Q ss_pred CCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECC--CCCCCCE Q ss_conf 54487455566398899899798088344577545629878879886998937839889989977835766--4369868 Q gi|254780942|r 344 SKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQ--DTPENSL 421 (442) Q Consensus 344 ~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~--dv~~~~~ 421 (442) ..+.-.. .-+||.|-.+.-|+- . ....+||+-.|+..+.+-+|+.+-++|.|||+..+.. .+|++.. T Consensus 71 -~~d~~~~--pmiIGt~NvFeVgc~----s----~A~kvGd~NVieskayvg~gv~vssgC~vGA~c~v~~~q~lpent~ 139 (190) T KOG4042 71 -VWDSDGQ--PMIIGTWNVFEVGCK----S----SAKKVGDRNVIESKAYVGDGVSVSSGCSVGAKCTVFSHQNLPENTS 139 (190) T ss_pred -CCCCCCC--EEEEECCCEEEEECH----H----HHHHHCCCCEEEEEEEECCCCEECCCCEECCCEEEECCCCCCCCCE T ss_conf -5478997--678723635886123----3----4555167654766668649948727965414057711034697516 Q ss_pred EEECC Q ss_conf 98135 Q gi|254780942|r 422 VFARS 426 (442) Q Consensus 422 v~~~~ 426 (442) |+|.+ T Consensus 140 vYga~ 144 (190) T KOG4042 140 VYGAT 144 (190) T ss_pred EECCC T ss_conf 87213 No 212 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=98.23 E-value=5.9e-07 Score=58.00 Aligned_cols=199 Identities=17% Similarity=0.185 Sum_probs=97.1 Q ss_pred CHHEEC--CHHHHHHHHHCC--CCCCC--CHHHHHH--HHHHHHHHHCCEEEEEECHHHEECCCCCCCCC-C-CCCCCCC Q ss_conf 001112--501467775328--88740--0111211--48999973077179991721430111210000-0-0122222 Q gi|254780942|r 174 GLMAID--GLYIMDWLLQIK--KNKVS--QEYYLTD--IIEKARLDGKSIASIDVKEQEVCGCNNRYELS-L-IENIWQS 243 (442) Q Consensus 174 GiY~f~--~~~L~~~l~~l~--~~~~~--gE~yltD--ii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~-~-~~~~~~~ 243 (442) |+-.|+ .+.+..++++-+ ++|.. |=|+.++ .+..++++=++- + -.+ +.+.|.-|+. + =.+.--. T Consensus 143 GVA~l~d~G~~i~~LvEKPk~PPSNLAvvGlY~F~pPe~if~~~~~~kPS-w--RGE---lEITD~IQ~lIe~Gy~V~~~ 216 (361) T TIGR01208 143 GVAELEDDGKRILKLVEKPKEPPSNLAVVGLYMFRPPELIFEAIKNIKPS-W--RGE---LEITDAIQYLIEKGYKVGGS 216 (361) T ss_pred CEEEEECCCCEEEEEEECCCCCCCCCEEEEEEECCCHHHHHHHHHCCCCC-C--CCC---HHHHHHHHHHHHCCCEEEEE T ss_conf 55899248979999883386578770123345338777988898528887-7--662---15775775653158377779 Q ss_pred CCCCCCCCCCCC--EEECCCEEEEEEEEECCCCC--EECCCCCCCCCCCCEEEECCCCCCCCC-CCCCCCCCCCCCCCCC Q ss_conf 112212223321--02115279941123317841--001223223442000211163134332-1211121002134432 Q gi|254780942|r 244 RYRRQMMISGVT--MIAPETVFLSHDTIIQPDTV--IEPHVFFGCGVSIENYVQIRAFSYLEG-VHIGKKTIIGPFARIR 318 (442) Q Consensus 244 ~~~~~~l~~gv~--~~~p~~~~i~~~~~I~~~~~--I~~~~~Ig~~~~Ig~~~~I~~~~~i~~-~~Ig~~~~Ig~~~~i~ 318 (442) +..-||.+.|-. +.+......+.. +.+++. ......|.++|.|+++++|.-.+++.| ++||+||.| T Consensus 217 ~v~GWWkDTGk~eDLL~AN~~iLd~~--~~~~v~G~~~~~s~i~GRV~v~~gakid~ns~i~GPa~IG~d~~I------- 287 (361) T TIGR01208 217 KVKGWWKDTGKPEDLLDANRLILDEE--VEREVKGEVDDESKIEGRVEVGEGAKIDVNSVIRGPAVIGEDCII------- 287 (361) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHH--CCHHCCCCEECCCEEEEEEEECCCCEECCCCEEECCEEECCCCEE------- T ss_conf 99788701598023899999974432--232006613226578840790897884237778706087795278------- Q ss_pred CCE-EECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECC Q ss_conf 000-0025864432023102112478544874555663988998997980883445775456298788798869989378 Q gi|254780942|r 319 QET-TIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAP 397 (442) Q Consensus 319 ~~s-~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~g 397 (442) .+| .||+.+.||+.|.|+||.+. +|+|=+.|.|- .+..... -++||.+++|-.|-..... T Consensus 288 ~nsGyiGPYtSig~~~~I~d~~vE------------~S~vldes~I~-----~v~~Ri~--dS~iG~~~~i~~~~~~P~~ 348 (361) T TIGR01208 288 ENSGYIGPYTSIGEGVVIRDAEVE------------HSIVLDESVIE-----GVEKRIV--DSVIGKKVRIKGNRRRPKA 348 (361) T ss_pred ECCCCCCCEEEECCCEEEECCCCC------------CEEEECCCEEE-----EHHHHHH--HEECCCCCEEECCCCCCCC T ss_conf 268611770772575387416302------------03564421143-----1032022--3011782078437578410 Q ss_pred --EEECCCCEE Q ss_conf --398899899 Q gi|254780942|r 398 --ITIGQGTYV 406 (442) Q Consensus 398 --v~ig~~~~i 406 (442) .+|||.+.| T Consensus 349 ~rL~igd~S~v 359 (361) T TIGR01208 349 LRLIIGDYSQV 359 (361) T ss_pred CCCEEECCCEE T ss_conf 22147054057 No 213 >PRK11132 cysE serine acetyltransferase; Provisional Probab=98.22 E-value=2.8e-06 Score=54.17 Aligned_cols=35 Identities=29% Similarity=0.467 Sum_probs=21.6 Q ss_pred EECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECC Q ss_conf 988998997980883445775456298788798869989378 Q gi|254780942|r 356 VVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAP 397 (442) Q Consensus 356 ~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~g 397 (442) .||++|.||||..+.. ...||++++||+|++++.. T Consensus 195 ~i~~~v~igaga~iLG-------~i~iG~~a~igA~svVl~d 229 (273) T PRK11132 195 KIREGVMIGAGAKILG-------NIEVGRGAKIGAGSVVLQP 229 (273) T ss_pred CCCCCEEECCCCEEEC-------CEEECCCCEECCCCEECCC T ss_conf 0488848754888971-------8588999888678668643 No 214 >COG0110 WbbJ Acetyltransferase (isoleucine patch superfamily) [General function prediction only] Probab=98.22 E-value=3.3e-06 Score=53.80 Aligned_cols=99 Identities=27% Similarity=0.416 Sum_probs=56.6 Q ss_pred CCCCCCEEEECCCCCCC---CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCC Q ss_conf 34420002111631343---321211121002134432000002586443202310211247854487455566398899 Q gi|254780942|r 284 CGVSIENYVQIRAFSYL---EGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKN 360 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i---~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~ 360 (442) .+..+|++|.++.++.+ .+..||+++.++++++|. ...+.+.+.+. +... ..+.++++||++ T Consensus 66 ~~~~iG~~~~i~~~~~~~~~~~i~ig~~~~i~~~v~i~------~~~h~~~~~~~-~~~~--------~~~~~~v~IG~~ 130 (190) T COG0110 66 KNLTIGDLCFIGVNVVILVGEGITIGDNVVVGPNVTIY------TNSHPGDFVTA-NIGA--------LVGAGPVTIGED 130 (190) T ss_pred CCEEECCEEEEECCCEEEECCCCEECCCCEECCCEEEC------CCCCCCCCCCC-CCCC--------CCCCCCCEECCC T ss_conf 21064634677025449834885889985895994982------68765652200-0143--------235789788998 Q ss_pred CEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEE Q ss_conf 8997980883445775456298788798869989378398899899 Q gi|254780942|r 361 VNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYV 406 (442) Q Consensus 361 ~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i 406 (442) |.||+++++.. +++||+++.||+++++.. -+.+++++ T Consensus 131 vwIG~~a~Ilp-------GV~IG~gavigagsVVtk--dvp~~~iv 167 (190) T COG0110 131 VWIGAGAVILP-------GVTIGEGAVIGAGSVVTK--DVPPYGIV 167 (190) T ss_pred CEECCCCEECC-------CEEECCCEEEECCEEEEC--CCCCCEEE T ss_conf 58778978979-------859899819978858957--77998289 No 215 >PRK00576 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=98.21 E-value=4.3e-06 Score=53.09 Aligned_cols=96 Identities=14% Similarity=0.140 Sum_probs=62.5 Q ss_pred CCCCEEEECC--EEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEE-ECCCCCCHHHHHHHCCCCC-CCC Q ss_conf 8744125884--85799999999977997799995798899999730269728999-8599886144664100001-234 Q gi|254780942|r 22 SSKVLQKIAG--KPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYY-IQDCQQGTAHAVLTAQDAI-KPG 97 (442) Q Consensus 22 ~pKpLlpi~g--kpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v-~q~~~~GTa~Ai~~a~~~l-~~~ 97 (442) +.|.|+|+.| +|||+|+++.|... +.+++|+++.... +. .....++ .+.+..|--.++..++... ... T Consensus 3 ~DKAll~~~G~~~tLlerv~~~l~~~-~~~V~vv~~~~~~-----~~--~~~~~vi~D~~pg~GPL~Gi~~gL~~a~~~~ 74 (178) T PRK00576 3 RDKATLPLPGGTTTLVEHVVGIVGQR-CAPVFVMAAPGQP-----LP--ELPAPVLRDEVRGLGPLPATGRGLRAAAEAG 74 (178) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHH-CCEEEEECCCCCC-----CC--CCCCCEECCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 88774276799825999999999865-8989998899876-----65--6899886058999785899999999998658 Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 4605898515411153322000000135 Q gi|254780942|r 98 YDDVIIMYGDVPLVSSHTLKKAMDKIAQ 125 (442) Q Consensus 98 ~~~~lVl~GD~~li~~~~l~~l~~~~~~ 125 (442) .+.++++.+|+||++...+..|.+.... T Consensus 75 ~~~v~v~acDmP~l~~~lv~~L~~~~~~ 102 (178) T PRK00576 75 ARLAFVCAVDMPYLTPELIDDLARPAVR 102 (178) T ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 7818997457888999999999987544 No 216 >KOG4750 consensus Probab=98.13 E-value=6.9e-06 Score=51.92 Aligned_cols=35 Identities=34% Similarity=0.497 Sum_probs=25.3 Q ss_pred EECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECC Q ss_conf 988998997980883445775456298788798869989378 Q gi|254780942|r 356 VVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAP 397 (442) Q Consensus 356 ~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~g 397 (442) .||+||-||+|+++.. ++.||.++.||+++++.-. T Consensus 202 ~Igd~vliGaGvtILg-------nV~IGegavIaAGsvV~kD 236 (269) T KOG4750 202 KIGDNVLIGAGVTILG-------NVTIGEGAVIAAGSVVLKD 236 (269) T ss_pred CCCCCEEECCCCEEEC-------CEEECCCCEEECCCEEEEC T ss_conf 5567708755417857-------8167788677305069721 No 217 >TIGR03536 DapD_gpp 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase (DapD) is involved in the succinylated branch of the "lysine biosynthesis via diaminopimelate (DAP)" pathway (GenProp0125). This model represents a clade of DapD sequences most closely related to the actinobacterial DapD family represented by the TIGR03535 model. All of the genes evaluated for the seed of this model are found in genomes where the downstream desuccinylase is present, but known DapD genes are absent. Additionally, many of the genes identified by this model are found proximal to genes involved in this lysine biosynthesis pathway. Probab=98.01 E-value=1.5e-05 Score=49.99 Aligned_cols=58 Identities=28% Similarity=0.477 Sum_probs=27.3 Q ss_pred EECCCCEECCCEEE-CCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC Q ss_conf 98899899798088-3445775456298788798869989378398899899778357664 Q gi|254780942|r 356 VVGKNVNIGAGTIT-CNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD 415 (442) Q Consensus 356 ~iG~~~~ig~g~i~-~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d 415 (442) +||+++.||.|.-| ..+.|-.+....||.+|.+|+|+- -|+-+||+|+|.||--|+.. T Consensus 226 vVg~gsDiGGgaSimGtLSGGg~~~isiG~~cLlGANsg--~GI~LGD~C~veaGlyvtag 284 (341) T TIGR03536 226 MVGKGSDLGGGCSTMGTLSGGGNIVISVGEGCLLGANAG--IGIPLGDRCTVEAGLYITAG 284 (341) T ss_pred EECCCCCCCCCEEEEEEECCCCCEEEEECCCCCCCCCCC--CCCCCCCCCEEECCEEEECC T ss_conf 966787667750247763589823688873220144455--57424898478124499279 No 218 >TIGR03535 DapD_actino 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. Alternate name: tetrahydrodipicolinate N-succinyltransferase. Probab=97.93 E-value=2.8e-05 Score=48.44 Aligned_cols=58 Identities=31% Similarity=0.495 Sum_probs=25.8 Q ss_pred EECCCCEECCCEE-ECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC Q ss_conf 9889989979808-83445775456298788798869989378398899899778357664 Q gi|254780942|r 356 VVGKNVNIGAGTI-TCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD 415 (442) Q Consensus 356 ~iG~~~~ig~g~i-~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d 415 (442) +||++..||.|.- ..-+.|-.+....||.+|.+|+|+- -|+-+||+|+|.||--||.. T Consensus 201 vvg~gsDiGGgaS~mGtLSGGg~~~isiG~~cLlGANsG--iGI~LGD~C~VeaGlyitaG 259 (319) T TIGR03535 201 VVGDGSDIGGGASIMGTLSGGGKEVISIGERCLLGANSG--IGISLGDDCVVEAGLYVTAG 259 (319) T ss_pred EECCCCCCCCCCEEEEEECCCCCEEEEECCCCEECCCCC--CCCCCCCCCEEECCEEECCC T ss_conf 966787667750114530489822478865320045455--57435897178112288179 No 219 >KOG3121 consensus Probab=97.84 E-value=3e-05 Score=48.31 Aligned_cols=68 Identities=16% Similarity=0.278 Sum_probs=38.0 Q ss_pred CEECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC--CCCCCEEEECCCEEE Q ss_conf 3988998997980883445775456298788798869989378398899899778357664--369868981356077 Q gi|254780942|r 355 SVVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD--TPENSLVFARSRQIV 430 (442) Q Consensus 355 ~~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d--v~~~~~v~~~~~~~~ 430 (442) ..||+++.|+.+.+. | .+.||.++++|.|++|-.++.+-+-|.|-.++++.-+ +||++..-+++.|+. T Consensus 85 ~hiGdhVFieE~cVV-n-------AAqIgsyVh~GknaviGrrCVlkdCc~ild~tVlPpet~vppy~~~~g~p~~~~ 154 (184) T KOG3121 85 VHIGDHVFIEEECVV-N-------AAQIGSYVHLGKNAVIGRRCVLKDCCRILDDTVLPPETLVPPYSTIGGNPAQVV 154 (184) T ss_pred EEECCEEEEECCEEE-E-------HHHHEEEEEECCCEEECCCEEHHHHEECCCCCCCCCCCCCCCCEEECCCCCEEE T ss_conf 652456887044186-2-------666120368546406737508334104158952582013697167738872353 No 220 >TIGR03536 DapD_gpp 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase (DapD) is involved in the succinylated branch of the "lysine biosynthesis via diaminopimelate (DAP)" pathway (GenProp0125). This model represents a clade of DapD sequences most closely related to the actinobacterial DapD family represented by the TIGR03535 model. All of the genes evaluated for the seed of this model are found in genomes where the downstream desuccinylase is present, but known DapD genes are absent. Additionally, many of the genes identified by this model are found proximal to genes involved in this lysine biosynthesis pathway. Probab=97.65 E-value=0.00013 Score=44.73 Aligned_cols=36 Identities=14% Similarity=0.209 Sum_probs=25.3 Q ss_pred EECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEE Q ss_conf 988998997980883445775456298788798869989378398 Q gi|254780942|r 356 VVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITI 400 (442) Q Consensus 356 ~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~i 400 (442) .||++|-+||+.-+ +.-+||+|.|.++..+-+|.+| T Consensus 252 siG~~cLlGANsg~---------GI~LGD~C~veaGlyvtagtkv 287 (341) T TIGR03536 252 SVGEGCLLGANAGI---------GIPLGDRCTVEAGLYITAGTKV 287 (341) T ss_pred EECCCCCCCCCCCC---------CCCCCCCCEEECCEEEECCCEE T ss_conf 88732201444555---------7424898478124499279679 No 221 >TIGR03552 F420_cofC 2-phospho-L-lactate guanylyltransferase CofC. Members of this protein family are the CofC enzyme of coenzyme F420 biosynthesis. Probab=97.63 E-value=6e-05 Score=46.61 Aligned_cols=115 Identities=17% Similarity=0.203 Sum_probs=75.0 Q ss_pred EEEECCCC---CCCCCCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCH Q ss_conf 99945887---723278887441258848579999999997799779999579889999973026972899985998861 Q gi|254780942|r 7 AIVLAAGR---GHRMKSSSSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGT 83 (442) Q Consensus 7 AiILAaG~---GtRl~p~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GT 83 (442) |||.-=.. =|||.|...+.=-.=.-+-|+..++..+.+++..++++++.- +.+.......+. .++.|+.. |- T Consensus 2 aviPvK~~~~aKtRLs~~L~~~eR~~L~~~ml~dtl~~l~~~~~~~v~vvt~d--~~~~~~a~~~g~--~~i~~~~~-gL 76 (195) T TIGR03552 2 AVIPVKRLANAKSRLSPVLSPEEREELALAMLRDVITALRGAGAGAVLVVSPD--PALLEAARNLGA--PVLRDPGP-GL 76 (195) T ss_pred EEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECC--HHHHHHHHHCCC--EEEECCCC-CH T ss_conf 17836888745200453079999999999999999999985899718999098--789999987199--79826998-88 Q ss_pred HHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 4466410000123446058985154111533220000001358 Q gi|254780942|r 84 AHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQG 126 (442) Q Consensus 84 a~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~ 126 (442) ..++..+.+......+.++|+.+|.|+++..++..++....+. T Consensus 77 n~al~~a~~~~~~~~~~vlii~aDlP~l~~~~l~~~l~~~~~~ 119 (195) T TIGR03552 77 NNALNAALAEAREPGGAVLILMADLPLLTPRELKRLLAAATEG 119 (195) T ss_pred HHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCC T ss_conf 9999999999873899389965887889999999999844448 No 222 >PRK13412 fkp bifunctional fucokinase/L-fucose-1-P-guanylyltransferase; Provisional Probab=97.61 E-value=0.00071 Score=40.47 Aligned_cols=82 Identities=16% Similarity=0.241 Sum_probs=40.3 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEC--CCCCCCCEEEECCC--CCEECCCCCCCCCCCCC------CC Q ss_conf 0589851541115332200000013586100100101--22157440420233--31001001466754210------00 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNA--DNPKGYGRLLIKNN--EIIAIREENDATDEERK------IH 169 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~--~dp~~yGvV~~d~~--~v~~i~Ek~d~~~~~~~------~~ 169 (442) .++|.+||++ ++. +........ +++++.+..+ +-.+.-|++..+.+ ..+.+.=+|+ +.+|.. .- T Consensus 154 g~lI~sGDV~-l~~---~~~~~~~p~-~dv~c~gl~v~~~la~nHGVfvsdr~~p~~ld~mLQKP-S~~el~~l~~~~~~ 227 (974) T PRK13412 154 HTLIASGDVY-IRS---EKPLQDIPE-ADVVCYGLWVDPSLATHHGVFVSDRKDPERLDFMLQKP-SLEELGGLSKTHLF 227 (974) T ss_pred CEEEEECCEE-EEC---CCCCCCCCC-CCCEEEEECCCHHHHCCCCEEEECCCCHHHHHHHHHCC-CHHHHHHHHCCCCE T ss_conf 6599967789-966---755477653-58479980368677535737996798827888875198-89999988638964 Q ss_pred CCCCCHHEECCHHHHHHH Q ss_conf 355600111250146777 Q gi|254780942|r 170 YCNSGLMAIDGLYIMDWL 187 (442) Q Consensus 170 lin~GiY~f~~~~L~~~l 187 (442) +..+|+..++.+....+. T Consensus 228 llD~Giw~ls~rAv~~L~ 245 (974) T PRK13412 228 LMDIGIWLLSDRAVERLM 245 (974) T ss_pred EEEEEEEEECCHHHHHHH T ss_conf 000059998848999999 No 223 >pfam01704 UDPGP UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uridylyltransferase catalyses the interconversion of MgUTP + glucose-1-phosphate and UDP-glucose + MgPPi. UDP-glucose is an important intermediate in mammalian carbohydrate interconversion involved in various metabolic roles depending on tissue type. In Dictyostelium (slime mold) mutants in this enzyme abort the development cycle. Also within the family is UDP-N-acetylglucosamine or AGX1 and two hypothetical proteins from Borrelia burgdorferi the lyme disease spirochaete. Probab=97.55 E-value=0.005 Score=35.67 Aligned_cols=112 Identities=16% Similarity=0.312 Sum_probs=63.8 Q ss_pred CCEEE-EECCCCCCCCCCCCCCCEEEEC-CEEHHHHHHHHHHHC----CCC-EEEEEECCCH-HHHHHHH---CCCCCEE Q ss_conf 40899-9458877232788874412588-485799999999977----997-7999957988-9999973---0269728 Q gi|254780942|r 4 KRLAI-VLAAGRGHRMKSSSSKVLQKIA-GKPMISHVMETIAAA----GIE-NVALVLGYGA-EEITRIN---FPPTLSV 72 (442) Q Consensus 4 ~~~Ai-ILAaG~GtRl~p~~pKpLlpi~-gkpli~~~i~~l~~~----g~~-~ivvv~~~~~-e~i~~~~---~~~~~~i 72 (442) .++|| -|+||.||+|.=..||.|++|. |++.++-++.++... |.. ..++..++.. ++..+++ .....++ T Consensus 52 ~KlaVlKLNGGLGTsMG~~~pKSli~Vr~g~TFLDl~v~Qi~~ln~~y~~~vPl~lMnSf~T~~dT~~~l~ky~~~~v~i 131 (416) T pfam01704 52 NKLAVLKLNGGLGTSMGCKGPKSMIEVRNGNTFLDLIVQQIEQLNKRYNCDVPLLLMNSFNTDEETKKFLEKYSGSKVEI 131 (416) T ss_pred HHEEEEECCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCE T ss_conf 22699982698777668888855779789981899999999999875499866699668663588999999805888774 Q ss_pred EEEECCC-------------------------CCCHHHHHHHCC-------CCCCCCCCEEEEEECCCCCCCCCCCC Q ss_conf 9998599-------------------------886144664100-------00123446058985154111533220 Q gi|254780942|r 73 EYYIQDC-------------------------QQGTAHAVLTAQ-------DAIKPGYDDVIIMYGDVPLVSSHTLK 117 (442) Q Consensus 73 ~~v~q~~-------------------------~~GTa~Ai~~a~-------~~l~~~~~~~lVl~GD~~li~~~~l~ 117 (442) ....|.. |-|-|| ++.++ ..+..+.+.++|-|.|+ |-...++. T Consensus 132 ~tF~Q~~~Pri~~d~l~pv~~~~~~~~~~~WyPPGHGD-ly~aL~~SG~LD~Ll~~G~eYifVSN~DN-LGA~vD~~ 206 (416) T pfam01704 132 KTFQQSRYPRIYKDSLLPVPKKKDSMAPDEWYPPGHGD-LFRSLYNSGLLDTLLAQGKEYIFVSNVDN-LGATVDLN 206 (416) T ss_pred EEECCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCH-HHHHHHHCCHHHHHHHCCCEEEEEEECCC-CCCCCCHH T ss_conf 88625776446138876445768889733558999730-88987726679999962958999950454-31125799 No 224 >TIGR03535 DapD_actino 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. Alternate name: tetrahydrodipicolinate N-succinyltransferase. Probab=97.48 E-value=0.00032 Score=42.49 Aligned_cols=36 Identities=11% Similarity=0.252 Sum_probs=23.6 Q ss_pred EECCCCEECCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEE Q ss_conf 988998997980883445775456298788798869989378398 Q gi|254780942|r 356 VVGKNVNIGAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITI 400 (442) Q Consensus 356 ~iG~~~~ig~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~i 400 (442) .||++|-+||+.-+ +.-+||+|.+-++-.+-+|.+| T Consensus 227 siG~~cLlGANsGi---------GI~LGD~C~VeaGlyitaGTkV 262 (319) T TIGR03535 227 SIGERCLLGANSGI---------GISLGDDCVVEAGLYVTAGTKV 262 (319) T ss_pred EECCCCEECCCCCC---------CCCCCCCCEEECCEEECCCCEE T ss_conf 88653200454555---------7435897178112288179779 No 225 >cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als Probab=97.46 E-value=0.00048 Score=41.45 Aligned_cols=91 Identities=16% Similarity=0.112 Sum_probs=64.3 Q ss_pred EEEECCE-EHHHHHHHHHHHCC--CCEEEEEECCCHHHHHHHHCCC---CCEEEEEECCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 1258848-57999999999779--9779999579889999973026---9728999859988614466410000123446 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAG--IENVALVLGYGAEEITRINFPP---TLSVEYYIQDCQQGTAHAVLTAQDAIKPGYD 99 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g--~~~ivvv~~~~~e~i~~~~~~~---~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~ 99 (442) ++|..|. ++|.+.|+.+.+.. ..+++++.....+...+.+... ...+.+..+.+..|+++++..+.+..+. + T Consensus 2 ii~~~n~~~~l~~~l~si~~~~~~~~eiiiid~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~n~~~~~~~~--~ 79 (156) T cd00761 2 IIPAYNEEPYLERCLESLLAQTYPNFEVIVVDDGSTDGTLEILEEYAKKDPRVIRVINEENQGLAAARNAGLKAARG--E 79 (156) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHCCC--C T ss_conf 99988988999999999982899897999998999800221000001456638995056787846887788874899--9 Q ss_pred EEEEEECCCCCCCCCCCCCC Q ss_conf 05898515411153322000 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKA 119 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l 119 (442) .++++.+|.+ +.+..+..+ T Consensus 80 ~v~~~d~D~~-~~~~~~~~~ 98 (156) T cd00761 80 YILFLDADDL-LLPDWLERL 98 (156) T ss_pred EEEEECCCCC-CCHHHHHHH T ss_conf 9999999985-787499999 No 226 >KOG2388 consensus Probab=97.35 E-value=0.0031 Score=36.81 Aligned_cols=75 Identities=24% Similarity=0.249 Sum_probs=46.5 Q ss_pred CCEEEEECCCCCCCCCCCCCCCEEEEC---CEEHHHHHHHHHHHC----------CC-CEEEEEECCC-HHHHHHHHCC- Q ss_conf 408999458877232788874412588---485799999999977----------99-7799995798-8999997302- Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSSSSKVLQKIA---GKPMISHVMETIAAA----------GI-ENVALVLGYG-AEEITRINFP- 67 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~~pKpLlpi~---gkpli~~~i~~l~~~----------g~-~~ivvv~~~~-~e~i~~~~~~- 67 (442) +..+++||||+||||-..-||-+.|++ |+.++++..+.+... +. -..++.|+.. .+...+++.. T Consensus 97 ~~a~~llaGgqgtRLg~~~pkg~~~~G~~~~~slf~~qae~il~lq~~a~~~~~~~~~I~w~ImtS~~T~e~T~~~f~~~ 176 (477) T KOG2388 97 KVAVVLLAGGQGTRLGSSGPKGCYPIGLPSGKSLFQIQAERILKLQELASMAVSDGVDIPWYIMTSAFTHEATLEYFESH 176 (477) T ss_pred CEEEEEECCCCEEEECCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHC T ss_conf 60389951674031135787533664775654065534999999999766540468864899953688617777677535 Q ss_pred -----CCCEEEEEECC Q ss_conf -----69728999859 Q gi|254780942|r 68 -----PTLSVEYYIQD 78 (442) Q Consensus 68 -----~~~~i~~v~q~ 78 (442) ...++.+..|. T Consensus 177 ~~FGl~~~qv~~f~Q~ 192 (477) T KOG2388 177 KYFGLKPEQVTFFQQG 192 (477) T ss_pred CCCCCCHHHEEEEECC T ss_conf 8877774672421026 No 227 >COG4801 Predicted acyltransferase [General function prediction only] Probab=97.24 E-value=0.00057 Score=41.01 Aligned_cols=101 Identities=27% Similarity=0.424 Sum_probs=0.0 Q ss_pred CCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEECCCCEE Q ss_conf 34420002111631343321211121002134432000002586443202310211247854487455566398899899 Q gi|254780942|r 284 CGVSIENYVQIRAFSYLEGVHIGKKTIIGPFARIRQETTIEKNVRIGNFCEVKKATIKEGSKINHLSYVGDSVVGKNVNI 363 (442) Q Consensus 284 ~~~~Ig~~~~I~~~~~i~~~~Ig~~~~Ig~~~~i~~~s~i~~~~~Ig~~~ei~~s~i~~g~~i~~~~~igd~~iG~~~~i 363 (442) .+......+.|++++.+.-.+++++..+|.+++| +|.++..+++|+..|++..-++.++ |+.||+.++| T Consensus 15 ~~ivv~gdViIG~nS~l~~~V~g~~iivge~v~i-~Gdiva~diridmw~kv~gNV~ve~----------dayiGE~~sI 83 (277) T COG4801 15 AIIVVKGDVIIGKNSMLKYGVVGEEIIVGERVRI-YGDIVAKDIRIDMWCKVTGNVIVEN----------DAYIGEFSSI 83 (277) T ss_pred EEEEEECCEEECCCCEEEEEEEEEEEEECCCCEE-EEEEEECCEEEEEEEEEECCEEECC----------CEEEECCCEE T ss_conf 6689754689856642644322015884367078-6068731136621567523189847----------4077033024 Q ss_pred CCCEEECCCCCCCCCCCEECCCCEECCCCEEECCEEECCCCEECCCCEECCC Q ss_conf 7980883445775456298788798869989378398899899778357664 Q gi|254780942|r 364 GAGTITCNYDGTHKYKTHINENAFIGSNSSLIAPITIGQGTYVASGSIITQD 415 (442) Q Consensus 364 g~g~i~~n~~g~~~~~~~ig~~~~iG~~~~i~~gv~ig~~~~i~ag~~v~~d 415 (442) +. +.+++..-.||.++.| ++.+++.|-.+-+| T Consensus 84 ~g-------------kl~v~gdLdig~dV~I-------eggfva~g~Ivirn 115 (277) T COG4801 84 KG-------------KLTVIGDLDIGADVII-------EGGFVAKGWIVIRN 115 (277) T ss_pred EE-------------EEEEECCCCCCCCEEE-------ECCEEECCEEEECC T ss_conf 01-------------6999533235651687-------56766256189707 No 228 >KOG4042 consensus Probab=96.87 E-value=0.00039 Score=41.96 Aligned_cols=40 Identities=18% Similarity=0.267 Sum_probs=23.8 Q ss_pred EEEEEEEECCCCCEECCCCCC---CCCCCCEEEECCCCCCCCC Q ss_conf 994112331784100122322---3442000211163134332 Q gi|254780942|r 263 FLSHDTIIQPDTVIEPHVFFG---CGVSIENYVQIRAFSYLEG 302 (442) Q Consensus 263 ~i~~~~~I~~~~~I~~~~~Ig---~~~~Ig~~~~I~~~~~i~~ 302 (442) .|..++.|+++++++|.+.+- +...||+|+.|..++++.+ T Consensus 22 ~irGdvti~~gcVvHP~a~~iA~aGPI~iGEnniiEEyA~i~n 64 (190) T KOG4042 22 DIRGDVTIKEGCVVHPFAVFIATAGPIYIGENNIIEEYAVIRN 64 (190) T ss_pred CCCCCEEECCCCEECCEEEEECCCCCEEECCCCHHHHHHHHHH T ss_conf 2133148658838533079970569889866712566788770 No 229 >pfam07959 Fucokinase L-fucokinase. In the salvage pathway of GDP-L-fucose, free cytosolic fucose is phosphorylated by L-fucokinase to form L-fucose-L-phosphate, which is then further converted to GDP-L-fucose in the reaction catalysed by GDP-L-fucose pyrophosphorylase. Probab=96.82 E-value=0.0033 Score=36.68 Aligned_cols=78 Identities=13% Similarity=0.192 Sum_probs=34.6 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEC--CCCCCCCEEEECCCC----------CEECCCCCCCCCCCCC Q ss_conf 0589851541115332200000013586100100101--221574404202333----------1001001466754210 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNA--DNPKGYGRLLIKNNE----------IIAIREENDATDEERK 167 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~--~dp~~yGvV~~d~~~----------v~~i~Ek~d~~~~~~~ 167 (442) -++|.+||+ ++.-.+ ...++.- ..+.+.++.+. +-.+.-|+...|... +..+.-||.. ++.. T Consensus 55 Gv~V~s~D~-ll~~~~-~~~~~~~--~~g~~~la~p~~~~~a~~HGVfv~d~~~~~~~~~~~~~~~~~LqKPS~--eem~ 128 (414) T pfam07959 55 GVLVTSGDE-LLSVGD-APGISFD--EPGATALAHPSSLAIATNHGVFVTDSQGSLAHDLTYRLVDDFLQKPTI--EELV 128 (414) T ss_pred CEEEEECCE-EEECCC-CCCCCCC--CCCEEEEEECCCHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCH--HHHH T ss_conf 649985675-562276-4456657--898289970478667356538996888874433035568876328989--9996 Q ss_pred C---------CCCCCCHHEECCHHH Q ss_conf 0---------035560011125014 Q gi|254780942|r 168 I---------HYCNSGLMAIDGLYI 183 (442) Q Consensus 168 ~---------~lin~GiY~f~~~~L 183 (442) . -+..+|++.|+.++. T Consensus 129 ~~~av~~~g~~~ldtG~~~~s~~a~ 153 (414) T pfam07959 129 QFNAVGRDGLFLLDTGILSLSGEAV 153 (414) T ss_pred HCCCCCCCCCEEEEEEEEEECHHHH T ss_conf 4466257883765410288538999 No 230 >cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, Probab=96.70 E-value=0.037 Score=30.74 Aligned_cols=187 Identities=16% Similarity=0.194 Sum_probs=99.6 Q ss_pred EEEECCE-EHHHHHHHHHHHC--CCC-EEEEEEC--C-CHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCC Q ss_conf 1258848-5799999999977--997-7999957--9-889999973026972899985998861446641000012344 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAA--GIE-NVALVLG--Y-GAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGY 98 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~--g~~-~ivvv~~--~-~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~ 98 (442) .+|..|. --|.-+++.+.+. +.. +|++|=. . +..++.+.+.....++.++..+..+|-++|++.+.+.... T Consensus 2 IIP~yNEe~~i~~~l~~i~~~~~~~~~eIIvVdDgStD~T~~i~~~l~~~~~~~~~i~~~~n~G~g~Ai~~G~~~a~~-- 79 (224) T cd06442 2 IIPTYNERENIPELIERLDAALKGIDYEIIVVDDNSPDGTAEIVRELAKEYPRVRLIVRPGKRGLGSAYIEGFKAARG-- 79 (224) T ss_pred EEEECCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHCC-- T ss_conf 999548788899999999986068994999998969737178886542247724786336678727999999986017-- Q ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCEEEECCCC---CCCCEEEECCCCCEECCCCCCCCCCCCCCCCCCCC Q ss_conf 605898515411153322000000135-86100100101221---57440420233310010014667542100035560 Q gi|254780942|r 99 DDVIIMYGDVPLVSSHTLKKAMDKIAQ-GYSIAVVGFNADNP---KGYGRLLIKNNEIIAIREENDATDEERKIHYCNSG 174 (442) Q Consensus 99 ~~~lVl~GD~~li~~~~l~~l~~~~~~-~~~~~i~~~~~~dp---~~yGvV~~d~~~v~~i~Ek~d~~~~~~~~~lin~G 174 (442) +-++++.||. =.++.++..|++...+ +.+.++ +.+.... ..++....-.+++..++-+.- .. . ...=..+| T Consensus 80 ~~i~~~DaD~-q~~p~~i~~li~~~~~~~~d~V~-GsR~~~~~~~~~~~~~r~~~s~~~~~i~~~l-~~-~-~i~D~~~g 154 (224) T cd06442 80 DVIVVMDADL-SHPPEYIPELLEAQLEGGADLVI-GSRYVEGGGVEGWGLKRKLISRGANLLARLL-LG-R-KVSDPTSG 154 (224) T ss_pred CCEEEECCCC-CCCHHHHHHHHHHHHCCCCCEEE-EEEECCCCCCCCCHHHHHHHHHHHHHHHHHC-CC-C-EECCCCCC T ss_conf 8468865999-76277789999998718872479-8876279863577066678888888763230-58-4-24317996 Q ss_pred HHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECH Q ss_conf 011125014677753288874001112114899997307717999172 Q gi|254780942|r 175 LMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKE 222 (442) Q Consensus 175 iY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~ 222 (442) .-+|+.+.+...........+. +.++++-.+...|.++.-+++.. T Consensus 155 fr~~~~~~~~~i~~~~~~~~f~---~~~ei~~~~~~~~~ki~evpv~~ 199 (224) T cd06442 155 FRAYRREVLEKLIDSLVSKGYK---FQLELLVRARRLGYRIVEVPITF 199 (224) T ss_pred EEEEEHHHHHHHHHHCCCCCCC---CHHHHHHHHHHCCCEEEEEEEEE T ss_conf 3998899999887754566630---18999999998899899997277 No 231 >KOG2638 consensus Probab=96.67 E-value=0.039 Score=30.61 Aligned_cols=74 Identities=16% Similarity=0.403 Sum_probs=45.3 Q ss_pred CCEEEE-ECCCCCCCCCCCCCCCEEEEC-CEEHHHHH---HHHHHHC-CCC-EEEEEECCCHH-HHHHH---HCCCCCEE Q ss_conf 408999-458877232788874412588-48579999---9999977-997-79999579889-99997---30269728 Q gi|254780942|r 4 KRLAIV-LAAGRGHRMKSSSSKVLQKIA-GKPMISHV---METIAAA-GIE-NVALVLGYGAE-EITRI---NFPPTLSV 72 (442) Q Consensus 4 ~~~AiI-LAaG~GtRl~p~~pKpLlpi~-gkpli~~~---i~~l~~~-g~~-~ivvv~~~~~e-~i~~~---~~~~~~~i 72 (442) .++||+ |-||.||-|.-.-||.++++- |.+.++-+ +++|.+. .++ -.++..++..+ +.+++ +.....++ T Consensus 102 ~KLavlKLNGGlGttmGc~gPKS~ieVR~g~tFLDL~V~QIe~LN~~Y~~dVPlvLMNSfnTdedT~kil~ky~~~kv~i 181 (498) T KOG2638 102 NKLAVLKLNGGLGTTMGCKGPKSVIEVRDGLTFLDLTVRQIENLNKTYNVDVPLVLMNSFNTDEDTQKILKKYAGSKVDI 181 (498) T ss_pred HHEEEEEECCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEE T ss_conf 31489994588677545678730699728972157899999987763289977798505566177999999832674038 Q ss_pred EEEEC Q ss_conf 99985 Q gi|254780942|r 73 EYYIQ 77 (442) Q Consensus 73 ~~v~q 77 (442) +.+.| T Consensus 182 ~TF~Q 186 (498) T KOG2638 182 KTFNQ 186 (498) T ss_pred EEECC T ss_conf 98603 No 232 >cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate. Probab=96.62 E-value=0.022 Score=32.04 Aligned_cols=183 Identities=14% Similarity=0.130 Sum_probs=96.3 Q ss_pred EEEECCE-EHHHHHHHHH----HHCC-CC-EEEEEEC--C-CH-HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCC Q ss_conf 1258848-5799999999----9779-97-7999957--9-88-999997302697289998599886144664100001 Q gi|254780942|r 26 LQKIAGK-PMISHVMETI----AAAG-IE-NVALVLG--Y-GA-EEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAI 94 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l----~~~g-~~-~ivvv~~--~-~~-e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l 94 (442) .+|..|. .-|..+++.+ .+.. .+ ||++|-. . +. +.++++.......+.++..+...|-|.|++.+..+. T Consensus 2 iIP~yNE~~~I~~~i~~i~~~~~~~~~~~~eIivVddgS~D~T~~i~~~~~~~~~~~~~vi~~~~n~G~g~A~~~G~~~a 81 (211) T cd04188 2 VIPAYNEEKRLPPTLEEAVEYLEERPSFSYEIIVVDDGSKDGTAEVARKLARKNPALIRVLTLPKNRGKGGAVRAGMLAA 81 (211) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHH T ss_conf 99924788889999999999986336998389999899971879999999741786599996588888048999999960 Q ss_pred CCCCCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCEEEECCCCC---CCCEEE--EC--CCCCEECCCCCCCCCCCC Q ss_conf 234460589851541115332200000013-5861001001012215---744042--02--333100100146675421 Q gi|254780942|r 95 KPGYDDVIIMYGDVPLVSSHTLKKAMDKIA-QGYSIAVVGFNADNPK---GYGRLL--IK--NNEIIAIREENDATDEER 166 (442) Q Consensus 95 ~~~~~~~lVl~GD~~li~~~~l~~l~~~~~-~~~~~~i~~~~~~dp~---~yGvV~--~d--~~~v~~i~Ek~d~~~~~~ 166 (442) .. +-++++.+|. -.++.++..|++... .+.+.++.+-...+.. ...... .. -+.+..+.-..+ T Consensus 82 ~~--d~i~~~DaD~-~~~~~~i~~l~~~~~~~~~d~V~GsR~~~~~~~~~~~~~~r~~~s~~~~~~~~~l~~~~------ 152 (211) T cd04188 82 RG--DYILFADADL-ATPFEELEKLEEALKTSGYDIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVRLLLGLG------ 152 (211) T ss_pred HC--CEEEEECCCC-CCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC------ T ss_conf 04--8556776897-40788999999999848981999975567998633563889999999999999983898------ Q ss_pred CCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEC Q ss_conf 0003556001112501467775328887400111211489999730771799917 Q gi|254780942|r 167 KIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVK 221 (442) Q Consensus 167 ~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~ 221 (442) ..-..+|.-+|+.+.+.+.++.+....+.-+ ++++-.+...|.++.-+++. T Consensus 153 -i~D~~~gfr~~~~~~l~~i~~~~~~~~f~~~---~El~~~~~~~g~ki~EvPi~ 203 (211) T cd04188 153 -IKDTQCGFKLFTRDAARRLFPRLHLERWAFD---VELLVLARRLGYPIEEVPVR 203 (211) T ss_pred -CCCCCCCEEEEEHHHHHHHHHHCCCCCCCCH---HHHHHHHHHCCCCEEEEEEE T ss_conf -7888867465779999998755646885015---99999999869919999008 No 233 >cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=96.58 E-value=0.045 Score=30.26 Aligned_cols=103 Identities=9% Similarity=0.134 Sum_probs=64.2 Q ss_pred EEEECCE--EHHHHHHHHHHHCCCC--EEEEEE--CCCH--HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCC Q ss_conf 1258848--5799999999977997--799995--7988--999997302697289998599886144664100001234 Q gi|254780942|r 26 LQKIAGK--PMISHVMETIAAAGIE--NVALVL--GYGA--EEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPG 97 (442) Q Consensus 26 Llpi~gk--pli~~~i~~l~~~g~~--~ivvv~--~~~~--e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~ 97 (442) .+|..|. ..|...|+.+.+.-.. |++|+- +.+. +.+.+.+......+.++..++..|.+.|.-.+.+.-.+ T Consensus 6 iip~yN~~~~~l~~~l~Si~~Qt~~~~EiIvvDd~StD~~~~~~~~~~~~~~~~i~~~~~~~n~G~~~a~N~gi~~a~g- 84 (202) T cd04184 6 VMPVYNTPEKYLREAIESVRAQTYPNWELCIADDASTDPEVKRVLKKYAAQDPRIKVVFREENGGISAATNSALELATG- 84 (202) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCC- T ss_conf 9974889669999999999837899879999989989478999999988518852001267887899998864513687- Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4605898515411153322000000135861001 Q gi|254780942|r 98 YDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAV 131 (442) Q Consensus 98 ~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i 131 (442) +-++.+.+|- ++.+.-+..++..+.+..+..+ T Consensus 85 -eyi~flD~DD-~~~p~~l~~~~~~~~~~~~~~~ 116 (202) T cd04184 85 -EFVALLDHDD-ELAPHALYEVVKALNEHPDADL 116 (202) T ss_pred -CEEEECCCCC-EECHHHHHHHHHHHHHCCCCCE T ss_conf -6677327785-5284399999999986899359 No 234 >cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate. Probab=95.90 E-value=0.055 Score=29.75 Aligned_cols=94 Identities=11% Similarity=0.141 Sum_probs=62.1 Q ss_pred EEEECCE-E-HHHHHHHHHHHCCCCEEEEEECCCH----HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 1258848-5-7999999999779977999957988----99999730269728999859988614466410000123446 Q gi|254780942|r 26 LQKIAGK-P-MISHVMETIAAAGIENVALVLGYGA----EEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYD 99 (442) Q Consensus 26 Llpi~gk-p-li~~~i~~l~~~g~~~ivvv~~~~~----e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~ 99 (442) ++|..|. | ++...|+.+......+|++|..-.. +.+++.... ..+. +...+..|-++|+..+++..+. + T Consensus 5 iIPayNE~~~il~~~l~s~~~~~~~eiivV~D~s~d~~~~~~~~~~~~--~~~~-v~~~~~~GK~~Aln~~l~~a~~--d 79 (235) T cd06434 5 IIPVYDEDPDVFRECLRSILRQKPLEIIVVTDGDDEPYLSILSQTVKY--GGIF-VITVPHPGKRRALAEGIRHVTT--D 79 (235) T ss_pred EEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHCCC--CCEE-EEECCCCCHHHHHHHHHHHCCC--C T ss_conf 995226976899999999980899989999889997289999986348--9789-9988989989999999997577--8 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 05898515411153322000000135 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQ 125 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~ 125 (442) -++++-+|. .+++..|+.++.++.. T Consensus 80 ~v~~~DaD~-~~~~~~l~~l~~~f~d 104 (235) T cd06434 80 IVVLLDSDT-VWPPNALPEMLKPFED 104 (235) T ss_pred EEEEECCCC-CCCHHHHHHHHHHHCC T ss_conf 899985885-3588899999996489 No 235 >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=95.90 E-value=0.1 Score=28.16 Aligned_cols=101 Identities=8% Similarity=0.048 Sum_probs=54.5 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEEECCCHHHHHHHHCCC-CCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEE Q ss_conf 1258848-5799999999977997--79999579889999973026-972899985998861446641000012344605 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGIE--NVALVLGYGAEEITRINFPP-TLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDV 101 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~~--~ivvv~~~~~e~i~~~~~~~-~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~ 101 (442) ++|..|. ..|...|+.+.+.... |+++|=.-=.|.-.+.+... ...+.++.+ +..|.+.|.-.+.+...+ +-+ T Consensus 3 iip~yN~~~~l~~~l~Si~~Q~~~~~EiIvVDd~S~D~t~~~~~~~~~~~~~~~~~-~~~G~~~a~N~g~~~a~g--~yi 79 (202) T cd06433 3 ITPTYNQAETLEETIDSVLSQTYPNIEYIVIDGGSTDGTVDIIKKYEDKITYWISE-PDKGIYDAMNKGIALATG--DII 79 (202) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCEEEEEEC-CCCCHHHHHHHHHHHCCC--CEE T ss_conf 99976988999999999983789997999997998842344533113412599988-888889998732775485--553 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 89851541115332200000013586100 Q gi|254780942|r 102 IIMYGDVPLVSSHTLKKAMDKIAQGYSIA 130 (442) Q Consensus 102 lVl~GD~~li~~~~l~~l~~~~~~~~~~~ 130 (442) +++.+|- ++.+..+..++....+..... T Consensus 80 ~~ld~DD-~~~~~~~~~~~~~~~~~~~~~ 107 (202) T cd06433 80 GFLNSDD-TLLPGALLAVVAAFAEHPEVD 107 (202) T ss_pred ECCCCCC-EECCHHHHHHHHHHHHCCCCC T ss_conf 2248886-268449999999998789940 No 236 >cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Probab=95.81 E-value=0.056 Score=29.70 Aligned_cols=94 Identities=18% Similarity=0.198 Sum_probs=61.7 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--C--EEEEEECC---CHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCC Q ss_conf 1258848-579999999997799--7--79999579---88999997302697289998599886144664100001234 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--E--NVALVLGY---GAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPG 97 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~--~ivvv~~~---~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~ 97 (442) ++|..|. +.|.-.|+.+.+... + ||+|+-.- ...++.+.+.. ..++++.++...|-++|+..+.+..++ T Consensus 34 iIP~yNE~~~i~~~l~sl~~q~Yp~~~~eVIvvdD~StD~T~ei~~~~~~--~~~~v~~~~~n~Gk~~AlN~gi~~a~g- 110 (251) T cd06439 34 IIPAYNEEAVIEAKLENLLALDYPRDRLEIIVVSDGSTDGTAEIAREYAD--KGVKLLRFPERRGKAAALNRALALATG- 110 (251) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHC--CCCEEECCCCCCCHHHHHHHHHHHCCC- T ss_conf 99738988999999999996699988789999979998149999999745--585043167776799999999987678- Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 460589851541115332200000013 Q gi|254780942|r 98 YDDVIIMYGDVPLVSSHTLKKAMDKIA 124 (442) Q Consensus 98 ~~~~lVl~GD~~li~~~~l~~l~~~~~ 124 (442) +-++++.+|. .+++.-+..++..+. T Consensus 111 -d~i~~lDaD~-~~~~~~l~~l~~~f~ 135 (251) T cd06439 111 -EIVVFTDANA-LLDPDALRLLVRHFA 135 (251) T ss_pred -CEEEECCCCC-CCCHHHHHHHHHHHH T ss_conf -9898337876-518769999999860 No 237 >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=95.80 E-value=0.064 Score=29.36 Aligned_cols=99 Identities=13% Similarity=0.068 Sum_probs=64.1 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEE Q ss_conf 1258848-579999999997799--7799995798899999730269728999859988614466410000123446058 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--ENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVI 102 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~l 102 (442) .+|..|. ..|...|+.+.+.-. -+|+|+=.--.+...+.......++.++..++..|-+.|.-.+.+..+. +-++ T Consensus 2 IIptyN~~~~l~~~l~Sl~~q~~~~~eiivVD~~S~d~t~~~~~~~~~~i~~i~~~~n~G~~~a~N~g~~~a~g--~~i~ 79 (166) T cd04186 2 IIVNYNSLEYLKACLDSLLAQTYPDFEVIVVDNASTDGSVELLRELFPEVRLIRNGENLGFGAGNNQGIREAKG--DYVL 79 (166) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHCC--CCEE T ss_conf 99988988999999999983779980999996899831567777428988999898987767876168987165--7269 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9851541115332200000013586 Q gi|254780942|r 103 IMYGDVPLVSSHTLKKAMDKIAQGY 127 (442) Q Consensus 103 Vl~GD~~li~~~~l~~l~~~~~~~~ 127 (442) .+..|. ++.+.-++.++..+.+.. T Consensus 80 ~lD~D~-~~~~~~l~~~~~~~~~~~ 103 (166) T cd04186 80 LLNPDT-VVEPGALLELLDAAEQDP 103 (166) T ss_pred EECCCE-EECCCHHHHHHHHHHHCC T ss_conf 987993-888399999999998697 No 238 >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen. Probab=95.69 E-value=0.13 Score=27.64 Aligned_cols=101 Identities=17% Similarity=0.180 Sum_probs=62.8 Q ss_pred EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCC-CCCEEEEEECCCCCC Q ss_conf 5799999999977997799995798899999730269728999859988614466410000123-446058985154111 Q gi|254780942|r 33 PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKP-GYDDVIIMYGDVPLV 111 (442) Q Consensus 33 pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~-~~~~~lVl~GD~~li 111 (442) ..+.-.|+.+.+. ..+|+||=.--.+...+.......+++++..++-+|-|+|.-.+.++... ..+.+++++-|. .+ T Consensus 11 ~~l~~~L~sl~~q-~~eIiVVDN~S~d~~~~~~~~~~~~v~~i~~~~N~G~a~g~N~Gi~~a~~~~~d~i~~ln~D~-~~ 88 (237) T cd02526 11 SKLKELLAALAEQ-VDKVVVVDNSSGNDIELRLRLNSEKIELIHLGENLGIAKALNIGIKAALENGADYVLLFDQDS-VP 88 (237) T ss_pred HHHHHHHHHHHCC-CCEEEEEECCCCHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCHHHHCCCCCEEEEECCCC-CC T ss_conf 9999999976637-998999969889028999986199869998999778588727430334117977899935756-55 Q ss_pred CCCCCCCCCCCC---CCCCCCCCEEEE Q ss_conf 533220000001---358610010010 Q gi|254780942|r 112 SSHTLKKAMDKI---AQGYSIAVVGFN 135 (442) Q Consensus 112 ~~~~l~~l~~~~---~~~~~~~i~~~~ 135 (442) ++..++.|++.. .+...+.+++-. T Consensus 89 ~~~~l~~l~~~~~~~~~~~~vg~~~p~ 115 (237) T cd02526 89 PPDMVEKLLAYKILSDKNSNIGAVGPR 115 (237) T ss_pred CHHHHHHHHHHHHHCCCCCCEEEEEEE T ss_conf 905999999999852227988999717 No 239 >PRK11204 N-glycosyltransferase PgaC; Provisional Probab=95.45 E-value=0.083 Score=28.73 Aligned_cols=101 Identities=14% Similarity=0.101 Sum_probs=68.2 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCC---HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 1258848-579999999997799--7799995798---899999730269728999859988614466410000123446 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--ENVALVLGYG---AEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYD 99 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~~ivvv~~~~---~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~ 99 (442) ++|-.|. ..|..+|+++.+... -||+++-.-- ..++.+.+.....+++++..++..|=|+|+..+++..+. | T Consensus 59 lIPayNEe~~I~~tI~sll~~~YP~~eIiVVdDgStD~T~~i~~~~~~~~p~~~vi~~~~n~GKa~ALN~gl~~a~g--e 136 (421) T PRK11204 59 LVPCYNEGENVEETISAALALRYPNYEVIAINDGSSDNTGEILDRLAAEDPRLRVIHLAENQGKAIALNTGAAAARS--E 136 (421) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCCC--C T ss_conf 98479978999999999995899986699998999740899999997068968999769998899999999984378--9 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 058985154111533220000001358610 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQGYSI 129 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~~~~ 129 (442) -++++-+|. .++++.+..++.+..++..+ T Consensus 137 ~Vv~~DAD~-~~~~d~L~~~v~~f~~dp~V 165 (421) T PRK11204 137 YLVCIDGDA-LLDPDAAAYMVEHFLHNPRV 165 (421) T ss_pred EEEEECCCC-CCCHHHHHHHHHHHHHCCCC T ss_conf 899989987-14867999999999719864 No 240 >TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602). Probab=95.41 E-value=0.15 Score=27.25 Aligned_cols=104 Identities=13% Similarity=0.106 Sum_probs=68.5 Q ss_pred CCC--EEEECCE-EHHHHHHHHHHHCCC--C--EEEEEEC-CC--HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCC Q ss_conf 744--1258848-579999999997799--7--7999957-98--89999973026972899985998861446641000 Q gi|254780942|r 23 SKV--LQKIAGK-PMISHVMETIAAAGI--E--NVALVLG-YG--AEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQD 92 (442) Q Consensus 23 pKp--Llpi~gk-pli~~~i~~l~~~g~--~--~ivvv~~-~~--~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~ 92 (442) |+. ++|..|. ..|...++.+.+... + ||+++-. .. ..++.+...+...++.....++..|=++|+-.+.+ T Consensus 49 P~VSViIPayNEe~~I~~~i~Sl~~~dYP~~~lEIIvVdDgStD~T~ei~~~~~~~~p~~~v~~~~~n~GKa~ALN~gl~ 128 (439) T TIGR03111 49 PDITIIIPVYNSEDTLFNCIESIYNQTYPIELIDIILANNQSTDDSFQVFCRAQNEFPGLSLRYMNSDQGKAKALNAAIY 128 (439) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHH T ss_conf 97699993587388999999999967999985599999789951799999998865898399976898985999999999 Q ss_pred CCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0123446058985154111533220000001358610 Q gi|254780942|r 93 AIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSI 129 (442) Q Consensus 93 ~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~ 129 (442) .-.+ +-++++-+|. .++++.++.++.++.++.++ T Consensus 129 ~A~G--e~iv~~DADt-~le~daL~~lv~~F~~dp~V 162 (439) T TIGR03111 129 NSIG--KYIIHIDSDG-KLHKDAIKNMVTRFENNPDI 162 (439) T ss_pred HCCC--CEEEEEECCC-CCCHHHHHHHHHHHHHCCCE T ss_conf 7788--9899980898-86856999999987209966 No 241 >cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response. Probab=95.32 E-value=0.12 Score=27.87 Aligned_cols=99 Identities=14% Similarity=0.160 Sum_probs=65.0 Q ss_pred EEEECCEE--HHHHHHHHHHHCCC--CEEEEEECC--CH---HHHHHHHCCCCCEEEEEECCCCCC-HHHHHHHCCCCCC Q ss_conf 12588485--79999999997799--779999579--88---999997302697289998599886-1446641000012 Q gi|254780942|r 26 LQKIAGKP--MISHVMETIAAAGI--ENVALVLGY--GA---EEITRINFPPTLSVEYYIQDCQQG-TAHAVLTAQDAIK 95 (442) Q Consensus 26 Llpi~gkp--li~~~i~~l~~~g~--~~ivvv~~~--~~---e~i~~~~~~~~~~i~~v~q~~~~G-Ta~Ai~~a~~~l~ 95 (442) ++|..|.+ +|.-.|+.|.+... .+|+|+-.- +. +.++++....+..+.++..+...| -++|+-.+++... T Consensus 3 iIPayNE~~~vi~~~l~sl~~~~Yp~~eIiVvdd~stD~t~~~~v~~~~~~~~~~~~~~~~~~~~g~K~~alN~~l~~~~ 82 (236) T cd06435 3 HVPCYEEPPEMVKETLDSLAALDYPNFEVIVIDNNTKDEALWKPVEAHCAQLGERFRFFHVEPLPGAKAGALNYALERTA 82 (236) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCC T ss_conf 98358992899999999999579999879999892998158999999998729967999948889970999999999647 Q ss_pred CCCCEEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 344605898515411153322000000135 Q gi|254780942|r 96 PGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ 125 (442) Q Consensus 96 ~~~~~~lVl~GD~~li~~~~l~~l~~~~~~ 125 (442) ...+-++++-+|. +++++-|+.++.++.. T Consensus 83 ~~~e~i~~~DaD~-~~~pd~L~~~v~~f~~ 111 (236) T cd06435 83 PDAEIIAVIDADY-QVEPDWLKRLVPIFDD 111 (236) T ss_pred CCCEEEEEECCCC-CCCHHHHHHHHHHHCC T ss_conf 9854999976887-7786899999986049 No 242 >pfam01983 CofC Guanylyl transferase CofC like. Coenzyme F420 is a hydride carrier cofactor that functions during methanogenesis. This family of proteins represents CofC, a nucleotidyl transferase that is involved in coenzyme F420 biosynthesis. CofC has been shown to catalyse the formation of lactyl-2-diphospho-5'-guanosine from 2-phospho-L-lactate and GTP. Probab=95.01 E-value=0.061 Score=29.47 Aligned_cols=116 Identities=21% Similarity=0.251 Sum_probs=65.7 Q ss_pred CEEEEECCCC---CCCCCCC-CCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 0899945887---7232788-87441258848579999999997799779999579889999973026972899985998 Q gi|254780942|r 5 RLAIVLAAGR---GHRMKSS-SSKVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQ 80 (442) Q Consensus 5 ~~AiILAaG~---GtRl~p~-~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~ 80 (442) |-+||.---. =|||.+. .|-.=- =.=+-|+..++..+... ++++++ ..+.+.++.. ....++++.++ . T Consensus 1 M~vvIPvK~f~~aKtRLs~~Ls~eeR~-~La~~Ml~dVl~al~~~---~v~vvs--~d~~v~~~a~-~~~g~~v~~~~-~ 72 (217) T pfam01983 1 MRIIIPVSANNHPKTRLSSILSEEERK-ELLRLMLLDVIDALKPV---DVLVFS--EDEVVLPSAL-DVLGVEVVVET-E 72 (217) T ss_pred CEEEEECCCCCCCCHHCCCCCCHHHHH-HHHHHHHHHHHHHHHCC---CEEEEE--CCHHHHHHHH-HCCCCEEEECC-C T ss_conf 979997168875400023107999999-99999999999986338---669993--7588778888-61795487348-7 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 861446641000012344605898515411153322000000135861001 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAV 131 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i 131 (442) .|-..|+..+.... ..+.++|+.+|.||++..+++.+++.... .++++ T Consensus 73 ~gLN~Av~~a~~~~--~~~~v~Iv~aDLPli~~~~l~~ll~~~~~-~~vvI 120 (217) T pfam01983 73 SDLNTAVNQAFMAP--EEAPVIIIPSDIPLISKEVLKRFLETEGR-ADVVI 120 (217) T ss_pred CCHHHHHHHHHHHC--CCCCEEEEECCCCCCCHHHHHHHHHHCCC-CCEEE T ss_conf 46789999999728--99868997076678998999999983778-88897 No 243 >cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end. Probab=94.96 E-value=0.19 Score=26.70 Aligned_cols=103 Identities=13% Similarity=0.146 Sum_probs=63.2 Q ss_pred EEEECCEE--HHHHHHHHHHHCCC--C--EEEEEECCCHHHHHHHHCCC--CCEEEEEECCCCCC-HHHHHHHCCCCCCC Q ss_conf 12588485--79999999997799--7--79999579889999973026--97289998599886-14466410000123 Q gi|254780942|r 26 LQKIAGKP--MISHVMETIAAAGI--E--NVALVLGYGAEEITRINFPP--TLSVEYYIQDCQQG-TAHAVLTAQDAIKP 96 (442) Q Consensus 26 Llpi~gkp--li~~~i~~l~~~g~--~--~ivvv~~~~~e~i~~~~~~~--~~~i~~v~q~~~~G-Ta~Ai~~a~~~l~~ 96 (442) ++|..|.+ +|+..|+.+.+... . +|+|+-.--.+...+..... ...+.++.++...| -|+|+..+.+..+. T Consensus 6 iIPayNE~~~~i~~~l~sl~~q~YP~~~~eIiVvdD~std~t~~~~~~~~~~~~~~~~~~~~~~g~Ka~alN~g~~~a~g 85 (234) T cd06421 6 FIPTYNEPLEIVRKTLRAALAIDYPHDKLRVYVLDDGRRPELRALAAELGVEYGYRYLTRPDNRHAKAGNLNNALAHTTG 85 (234) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCC T ss_conf 99838997899999999999679999828999998989878898888845664368887488999769999999997789 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 44605898515411153322000000135861001 Q gi|254780942|r 97 GYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAV 131 (442) Q Consensus 97 ~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i 131 (442) |-++++.+|. ++++.-|..++.++..+..+.. T Consensus 86 --d~v~~~DaD~-~~~~~~L~~~~~~~~~~~~v~~ 117 (234) T cd06421 86 --DFVAILDADH-VPTPDFLRRTLGYFLDDPKVAL 117 (234) T ss_pred --CEEEEECCCC-CCCHHHHHHHHHHHHHCCCEEE T ss_conf --9999988987-7087699999999986997589 No 244 >COG1216 Predicted glycosyltransferases [General function prediction only] Probab=94.71 E-value=0.26 Score=25.90 Aligned_cols=112 Identities=13% Similarity=0.069 Sum_probs=72.3 Q ss_pred CCEEEECCEEHHHHHHHHHHHCCCCEEEEE-E-CCCHHHHHHHHCCC-CCEEEEEECCCCCCHHHHHHHCCCCCCCCCC- Q ss_conf 441258848579999999997799779999-5-79889999973026-9728999859988614466410000123446- Q gi|254780942|r 24 KVLQKIAGKPMISHVMETIAAAGIENVALV-L-GYGAEEITRINFPP-TLSVEYYIQDCQQGTAHAVLTAQDAIKPGYD- 99 (442) Q Consensus 24 KpLlpi~gkpli~~~i~~l~~~g~~~ivvv-~-~~~~e~i~~~~~~~-~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~- 99 (442) .-.+..+...-+...++.+.+.......++ + .--.+...+..... ..++.++...+-+|=|++...+........+ T Consensus 7 ~iiv~yn~~~~l~~~l~~l~~~~~~~~~iv~vDn~s~d~~~~~~~~~~~~~v~~i~~~~NlG~agg~n~g~~~a~~~~~~ 86 (305) T COG1216 7 IIIVTYNRGEDLVECLASLAAQTYPDDVIVVVDNGSTDGSLEALKARFFPNVRLIENGENLGFAGGFNRGIKYALAKGDD 86 (305) T ss_pred EEEEEECCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECCCCCEEHHHHHHHHHHHHCCCCC T ss_conf 99999688889999999997176876359980799973228887760588689998898867088888999998628984 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEC Q ss_conf 0589851541115332200000013586100100101 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNA 136 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~ 136 (442) -++++|=|+ .+.+..++++++.+++.....+....+ T Consensus 87 ~~l~LN~D~-~~~~~~l~~ll~~~~~~~~~~~~~~~i 122 (305) T COG1216 87 YVLLLNPDT-VVEPDLLEELLKAAEEDPAAGVVGPLI 122 (305) T ss_pred EEEEEECCC-CCCHHHHHHHHHHHHHCCCCEEEEEEE T ss_conf 899983886-879069999999998689877963465 No 245 >cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier. Probab=94.46 E-value=0.3 Score=25.55 Aligned_cols=101 Identities=15% Similarity=0.174 Sum_probs=60.1 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC----CEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCC---C Q ss_conf 1258848-579999999997799----7799995798899999730269728999859988614466410000123---4 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI----ENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKP---G 97 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~----~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~---~ 97 (442) ++|..|. ..|.-+|+++.+... -+|+++...-.+.-.+.....+..+.........|-+.|+..+++++.. . T Consensus 2 lIPa~NEe~vI~~ti~~l~~~~YP~~~~eIivvdD~stD~T~~~a~~~~~~v~~~~~~~~~gK~~aln~~~~~~~~~~~~ 81 (183) T cd06438 2 LIPAHNEEAVIGNTVRSLKAQDYPRELYRIFVVADNCTDDTAQVARAAGATVLERHDPERRGKGYALDFGFRHLLNLADD 81 (183) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 88207809999999999983689978569999848986208999998399699973777776288999889998424788 Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 460589851541115332200000013586 Q gi|254780942|r 98 YDDVIIMYGDVPLVSSHTLKKAMDKIAQGY 127 (442) Q Consensus 98 ~~~~lVl~GD~~li~~~~l~~l~~~~~~~~ 127 (442) .+-++++-+|. .+.++.|..+..++.++. T Consensus 82 ~d~v~i~DAD~-~~~~d~l~~~~~~f~~~~ 110 (183) T cd06438 82 PDAVVVFDADN-LVDPNALEELNARFAAGA 110 (183) T ss_pred CCEEEEECCCC-CCCHHHHHHHHHHHHCCC T ss_conf 66899955666-779789999999983899 No 246 >COG1920 Predicted nucleotidyltransferase, CobY/MobA/RfbA family [General function prediction only] Probab=94.36 E-value=0.076 Score=28.94 Aligned_cols=172 Identities=19% Similarity=0.220 Sum_probs=93.5 Q ss_pred EEEE---CCCCCCCCCCCCC----CCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCC Q ss_conf 9994---5887723278887----44125884857999999999779977999957988999997302697289998599 Q gi|254780942|r 7 AIVL---AAGRGHRMKSSSS----KVLQKIAGKPMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDC 79 (442) Q Consensus 7 AiIL---AaG~GtRl~p~~p----KpLlpi~gkpli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~ 79 (442) +||. .++.-|||-|..+ |-++ .-|+-.++..++.. +.+|.|++.- +.+.++..+. +...++ T Consensus 3 ~iIPvk~~~~aKTRLs~~lS~eeRe~~~----laML~dvi~Al~~~-~~~i~Vvtpd--e~~~~~a~~~----~vl~d~- 70 (210) T COG1920 3 AIIPVKRLADAKTRLSPVLSAEERENFA----LAMLVDVLGALAGV-LGEITVVTPD--EEVLVPATKL----EVLADP- 70 (210) T ss_pred EEEECCCCCCCHHCCCCCCCHHHHHHHH----HHHHHHHHHHHHHH-CCCCEEECCC--HHHHHHCCCC----EEEECC- T ss_conf 5886266676200133113988899999----99999999986400-4775287687--6762100034----045141- Q ss_pred CCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEECCCCCEECCCCC Q ss_conf 88614466410000123446058985154111533220000001358610010010122157440420233310010014 Q gi|254780942|r 80 QQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKGYGRLLIKNNEIIAIREEN 159 (442) Q Consensus 80 ~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~yGvV~~d~~~v~~i~Ek~ 159 (442) .+ -.|+.++++.+... +.++|+.+|.||+.+..++.++++.+. .+..+. |.+ ..|- T Consensus 71 dL--N~Ai~aa~~~~~~p-~~v~vvmaDLPLl~~~~i~~~~~~~~d-~dvvia------P~~------gGGT-------- 126 (210) T COG1920 71 DL--NTAINAALDEIPLP-SEVIVVMADLPLLSPEHIERALSAAKD-ADVVIA------PGR------GGGT-------- 126 (210) T ss_pred CH--HHHHHHHHHHCCCC-CCEEEEECCCCCCCHHHHHHHHHHCCC-CCEEEE------CCC------CCCE-------- T ss_conf 06--78999998627887-616998445543799999999973477-768982------079------9966-------- Q ss_pred CCCCCCCCCCCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECHHHEECCCCCCCCCCCC Q ss_conf 6675421000355600111250146777532888740011121148999973077179991721430111210000001 Q gi|254780942|r 160 DATDEERKIHYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVKEQEVCGCNNRYELSLIE 238 (442) Q Consensus 160 d~~~~~~~~~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~~~~~~gv~~~~~L~~~~ 238 (442) | .+|+++ ..+ + .+.....+ + +=+..+.+.|..+..|..- ....++++|.||.++. T Consensus 127 ---------n----~L~~r~-~~~-~--~~y~g~SF-----~-~Hl~~Ark~G~~~~~~dSf-~l~~DVDtpeDL~e~~ 181 (210) T COG1920 127 ---------N----VLFARK-SAF-R--PRYGGVSF-----L-RHLEEARKRGLVVLTYDSF-GLSADVDTPEDLVEAF 181 (210) T ss_pred ---------E----EEEEEC-CCC-C--CCCCCCCH-----H-HHHHHHHHCCCEEEEECCC-CEECCCCCHHHHHHHH T ss_conf ---------7----899844-544-4--23468528-----8-8999998769889972354-1113779888999999 No 247 >pfam00535 Glycos_transf_2 Glycosyl transferase family 2. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids. Probab=93.96 E-value=0.14 Score=27.36 Aligned_cols=104 Identities=12% Similarity=0.093 Sum_probs=67.8 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCCHHH---HHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 1258848-579999999997799--7799995798899---999730269728999859988614466410000123446 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--ENVALVLGYGAEE---ITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYD 99 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~~ivvv~~~~~e~---i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~ 99 (442) .+|..|. ..|...|+.+.+... -+|+++=.--.+. +.+.+......+.++..++..|-++|+..+.++.+. + T Consensus 3 iip~yN~~~~l~~~l~sl~~q~~~~~eiiivDd~S~d~t~~~~~~~~~~~~~v~~~~~~~~~g~~~a~n~g~~~a~~--~ 80 (168) T pfam00535 3 IIPTYNEEKYLEECLESLLNQTYPNFEIIVVDDGSTDGTVEIAEEYAKKDPRIRVIRLEENLGKAAARNAGLKLATG--D 80 (168) T ss_pred EEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHCEEEEECCCCCCCEEEEEECCCCCHHHHHHHHHHCCC--C T ss_conf 99961888999999999973879985999998999810100121001346760024420245746888899996799--8 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCC-CCCCCE Q ss_conf 058985154111533220000001358-610010 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQG-YSIAVV 132 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~-~~~~i~ 132 (442) -++++.+|. ...+..++.+++...+. .+.+.. T Consensus 81 ~v~~lD~D~-~~~~~~l~~~~~~~~~~~~~~~~~ 113 (168) T pfam00535 81 YILFLDADD-EVAPDWLEKLVELLEKNGADIVIG 113 (168) T ss_pred EEEEECCCC-CCCHHHHHHHHHHHHCCCCEEEEE T ss_conf 599985899-868769999999998399769999 No 248 >cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we Probab=93.89 E-value=0.39 Score=24.86 Aligned_cols=97 Identities=13% Similarity=0.137 Sum_probs=61.9 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--C--EEEEEECCCH---HHHHHH---HCCCCCEEEEEECCCCCC-HHHHHHHCCCC Q ss_conf 1258848-579999999997799--7--7999957988---999997---302697289998599886-14466410000 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--E--NVALVLGYGA---EEITRI---NFPPTLSVEYYIQDCQQG-TAHAVLTAQDA 93 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~--~ivvv~~~~~---e~i~~~---~~~~~~~i~~v~q~~~~G-Ta~Ai~~a~~~ 93 (442) ++|..|. ..|...|+++.+... + +|+|+-+... +.+.++ +...+.++.++..+++.| -++|+..+++. T Consensus 6 iIPa~NE~~~I~~~l~sl~~q~YP~~~~~I~VvDdstD~t~~~~~~~~~~~~~~~~~~~~~~~~~~~g~Ka~aln~gl~~ 85 (232) T cd06437 6 QLPVFNEKYVVERLIEAACALDYPKDRLEIQVLDDSTDETVRLAREIVEEYAAQGVNIKHVRRADRTGYKAGALAEGMKV 85 (232) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHH T ss_conf 99558988999999999996799998089999979996699999999976664199689983698888669999999997 Q ss_pred CCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 12344605898515411153322000000135 Q gi|254780942|r 94 IKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQ 125 (442) Q Consensus 94 l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~ 125 (442) .+. +-++++.+|. .++++-|..++.+... T Consensus 86 a~g--d~i~~~DaD~-~~~~d~L~~~~~~f~~ 114 (232) T cd06437 86 AKG--EYVAIFDADF-VPPPDFLQKTPPYFAD 114 (232) T ss_pred CCC--CEEEEECCCC-CCCHHHHHHHHHHHCC T ss_conf 789--8899977644-7384799999998319 No 249 >cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot Probab=93.08 E-value=0.54 Score=24.09 Aligned_cols=98 Identities=17% Similarity=0.200 Sum_probs=62.4 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--C--EEEEEECCCHH----HHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCC Q ss_conf 1258848-579999999997799--7--79999579889----9999730269728999859988614466410000123 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--E--NVALVLGYGAE----EITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKP 96 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~--~ivvv~~~~~e----~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~ 96 (442) ++|..|. ..|...|+++.+... . +|+++..-..+ .++++.......+..+.+..+.|-++|+-.++.+.+. T Consensus 6 iIPa~NE~~vI~~ti~sl~~~~YP~~~~evivv~d~s~~~t~~~~~~~~~~~~~~~~~~~~~~~~gK~~alN~al~~a~g 85 (241) T cd06427 6 LVPLYKEAEVLPQLIASLSALDYPRSKLDVKLLLEEDDEETIAAARALRLPSIFRVVVVPPSQPRTKPKACNYALAFARG 85 (241) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCC T ss_conf 99668988999999999996799976179999988999689999998578767089996389887469999999996158 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 446058985154111533220000001358 Q gi|254780942|r 97 GYDDVIIMYGDVPLVSSHTLKKAMDKIAQG 126 (442) Q Consensus 97 ~~~~~lVl~GD~~li~~~~l~~l~~~~~~~ 126 (442) |-++++-+|. .++++.|..++.++.+. T Consensus 86 --d~v~~~DAD~-~~~p~~L~~~v~~f~~~ 112 (241) T cd06427 86 --EYVVIYDAED-APDPDQLKKAVAAFARL 112 (241) T ss_pred --CEEEEECCCC-CCCHHHHHHHHHHHHHC T ss_conf --8599978655-64977999999999857 No 250 >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=93.05 E-value=0.55 Score=24.06 Aligned_cols=98 Identities=15% Similarity=0.212 Sum_probs=61.6 Q ss_pred EEECCE-EHHHHHHHHHHHCC--CCEEEEEEC--C-C-HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCC-CC Q ss_conf 258848-57999999999779--977999957--9-8-899999730269728999859988614466410000123-44 Q gi|254780942|r 27 QKIAGK-PMISHVMETIAAAG--IENVALVLG--Y-G-AEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKP-GY 98 (442) Q Consensus 27 lpi~gk-pli~~~i~~l~~~g--~~~ivvv~~--~-~-~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~-~~ 98 (442) +|-.|. ..|...|+.|.+.- ..+|+||=. . + .+.+++.. ...++.++..++.+|.|+|.-.+.+.... .. T Consensus 3 I~t~N~~~~l~~~L~Sl~~q~~~~~eIiVVDn~StD~t~~~l~~~~--~~~~v~~i~~~~N~G~a~~~N~Gi~~a~~~~~ 80 (202) T cd04185 3 VVTYNRLDLLKECLDALLAQTRPPDHIIVIDNASTDGTAEWLTSLG--DLDNIVYLRLPENLGGAGGFYEGVRRAYELGY 80 (202) T ss_pred EEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHC--CCCCEEEEECCCCCCHHHHHHHHHHHHHHCCC T ss_conf 9002888999999999982779998899997949876166530003--56668998568887327899999998875598 Q ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 60589851541115332200000013586 Q gi|254780942|r 99 DDVIIMYGDVPLVSSHTLKKAMDKIAQGY 127 (442) Q Consensus 99 ~~~lVl~GD~~li~~~~l~~l~~~~~~~~ 127 (442) +.+++++.|. .+.+..|+.++++.+... T Consensus 81 d~v~~ld~D~-~~~~~~l~~l~~~~~~~~ 108 (202) T cd04185 81 DWIWLMDDDA-IPDPDALEKLLAYADKDN 108 (202) T ss_pred EEEEEECCCC-CCCCCHHHHHHHHHHCCC T ss_conf 1899989987-549229999999865799 No 251 >cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=92.41 E-value=0.67 Score=23.55 Aligned_cols=100 Identities=20% Similarity=0.204 Sum_probs=61.5 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--C--EEEEEECC---CHHHH-HHHHCCCCCEEEEEECCC--CCCHHHHHHHCCCCC Q ss_conf 1258848-579999999997799--7--79999579---88999-997302697289998599--886144664100001 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--E--NVALVLGY---GAEEI-TRINFPPTLSVEYYIQDC--QQGTAHAVLTAQDAI 94 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~--~ivvv~~~---~~e~i-~~~~~~~~~~i~~v~q~~--~~GTa~Ai~~a~~~l 94 (442) ++|..|. ..|...|+.+.+... + ||+|+-.- +..++ +.+.......+..+.... ..|-++|+..+.+.. T Consensus 2 iIP~~Ne~~~i~~~l~sl~~q~yp~~~~EVivvdd~StD~T~~i~~~~~~~~~~~~~~~~~~~~~~~gk~~aln~g~~~a 81 (229) T cd04192 2 VIAARNEAENLPRLLQSLSALDYPKEKFEVILVDDHSTDGTVQILEFAAAKPNFQLKILNNSRVSISGKKNALTTAIKAA 81 (229) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHHHHHHC T ss_conf 99988988999999999995789988689999989797167999999970569745530246777725999999999864 Q ss_pred CCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2344605898515411153322000000135861 Q gi|254780942|r 95 KPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYS 128 (442) Q Consensus 95 ~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~ 128 (442) +. +-++++.+|. .+.+.-|+.++..+..... T Consensus 82 ~g--e~i~~lDaD~-~~~~~~l~~~~~~~~~~~~ 112 (229) T cd04192 82 KG--DWIVTTDADC-VVPSNWLLTFVAFIQKEQI 112 (229) T ss_pred CC--CEEEEECCCC-CCCHHHHHHHHHHHHCCCC T ss_conf 67--7699856765-6587699999999748994 No 252 >cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the Probab=92.17 E-value=0.72 Score=23.38 Aligned_cols=102 Identities=16% Similarity=0.129 Sum_probs=63.1 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCC----HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCC Q ss_conf 1258848-579999999997799--7799995798----89999973026972899985998861446641000012344 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI--ENVALVLGYG----AEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGY 98 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~--~~ivvv~~~~----~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~ 98 (442) .+|..|. -.|...|+.+.+.-. -+|+|+-..- .+.++++.........+....+..|.++|+-.+....+. T Consensus 2 iIp~~N~~~~l~~~l~sl~~Q~~~~~eIivvdd~S~d~t~~~~~~~~~~~~~~~~~~~~~~~~g~~~a~N~g~~~a~~-- 79 (180) T cd06423 2 IVPAYNEEAVIERTIESLLALDYPKLEVIVVDDGSTDDTLEILEELAALYIRRVLVVRDKENGGKAGALNAGLRHAKG-- 79 (180) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHCCCCCCC-- T ss_conf 999779789999999999818999818999989997578999999862179718984477767799997525102675-- Q ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 60589851541115332200000013586100 Q gi|254780942|r 99 DDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIA 130 (442) Q Consensus 99 ~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~ 130 (442) +-++.+.+|. .+.+.-|..++.+..++.... T Consensus 80 d~i~~lD~D~-~~~~~~l~~~~~~~~~~~~~~ 110 (180) T cd06423 80 DIVVVLDADT-ILEPDALKRLVVPFFADPKVG 110 (180) T ss_pred CEEEEECCCC-CCCHHHHHHHHHHHHHCCCEE T ss_conf 5798743872-008569999999998795989 No 253 >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus. Probab=90.87 E-value=0.99 Score=22.58 Aligned_cols=102 Identities=9% Similarity=0.087 Sum_probs=57.8 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC----CEEEEEECC--C-HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCC Q ss_conf 1258848-579999999997799----779999579--8-8999997302697289998599886144664100001234 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI----ENVALVLGY--G-AEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPG 97 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~----~~ivvv~~~--~-~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~ 97 (442) ++|..|. ..|...|+.|.+... -||+|+-.- + ..++.+.+......+.++. .+..|-+.|.-.+.+..+. T Consensus 5 iIp~yN~~~~l~~~l~Sl~~q~yp~~~~EVIvVDd~S~D~t~~~~~~~~~~~~~i~~~~-~~~~~~~~a~N~gi~~a~g- 82 (249) T cd02525 5 IIPVRNEEKYIEELLESLLNQSYPKDLIEIIVVDGGSTDGTREIVQEYAAKDPRIRLID-NPKRIQSAGLNIGIRNSRG- 82 (249) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHHCCCCEEECC-CCCCCHHHHHHHHHHHCCC- T ss_conf 99356978999999999984689999889999989596156999999974077132113-5556589999898996687- Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4605898515411153322000000135861001 Q gi|254780942|r 98 YDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAV 131 (442) Q Consensus 98 ~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i 131 (442) +-++++.+|. .+++.-++.++....+.....+ T Consensus 83 -d~i~~lD~D~-~~~~~~l~~~~~~~~~~~~~~v 114 (249) T cd02525 83 -DIIIRVDAHA-VYPKDYILELVEALKRTGADNV 114 (249) T ss_pred -CEEEEECCCC-CCCHHHHHHHHHHHHCCCCEEE T ss_conf -7688414776-5695699999999877897199 No 254 >cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=90.71 E-value=1 Score=22.50 Aligned_cols=104 Identities=14% Similarity=0.109 Sum_probs=58.3 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEEE-CC--CH-HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCC Q ss_conf 1258848-5799999999977997--799995-79--88-9999973026972899985998861446641000012344 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGIE--NVALVL-GY--GA-EEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGY 98 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~~--~ivvv~-~~--~~-e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~ 98 (442) ++|..|. +.|...|+.+.+.-.. ||+|+= +. +. +.++++.......+.+...+..+|.+.+...+.....+ T Consensus 3 iip~yN~~~~l~~~i~Sil~Qt~~~~EiivvDDgStD~t~~ii~~~~~~~~~~~~~~~~~~n~G~~~n~n~gi~~a~g-- 80 (214) T cd04196 3 LMATYNGEKYLREQLDSILAQTYKNDELIISDDGSTDGTVEIIKEYIDKDPFIIILIRNGKNLGVARNFESLLQAADG-- 80 (214) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHCCC-- T ss_conf 999179888999999999837998869999989997237999999997588651366658880599999999986478-- Q ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 6058985154111533220000001358610010 Q gi|254780942|r 99 DDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVV 132 (442) Q Consensus 99 ~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~ 132 (442) +-+.++.+|- +..+..++.+++...+..++.+. T Consensus 81 eyi~~lD~DD-~~~p~~l~~~~~~~~~~~~~~~v 113 (214) T cd04196 81 DYVFFCDQDD-IWLPDKLERLLKAFLKDDKPLLV 113 (214) T ss_pred CEEECCCCCC-CCCHHHHHHHHHHHHHCCCEEEE T ss_conf 7574137765-62865999999999858990899 No 255 >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=89.99 E-value=1.2 Score=22.14 Aligned_cols=103 Identities=21% Similarity=0.221 Sum_probs=63.6 Q ss_pred EEEECCE--E-HHHHHHHHHHHCCC--CEEEEEE-CCCH---HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCC Q ss_conf 1258848--5-79999999997799--7799995-7988---99999730269728999859988614466410000123 Q gi|254780942|r 26 LQKIAGK--P-MISHVMETIAAAGI--ENVALVL-GYGA---EEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKP 96 (442) Q Consensus 26 Llpi~gk--p-li~~~i~~l~~~g~--~~ivvv~-~~~~---e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~ 96 (442) ++|..|+ | .|...|+.+.+.-. .|+++|- +... +++.+.+. ...++.++..++..|.+.|.-.+...-++ T Consensus 3 iip~YN~e~~~~l~~~l~Svl~Qt~~~~EiIiVdDgSs~d~~~~i~~~~~-~~~~i~~i~~~~N~G~~~a~N~gi~~a~g 81 (201) T cd04195 3 LMSVYIKEKPEFLREALESILKQTLPPDEVVLVKDGPVTQSLNEVLEEFK-RKLPLKVVPLEKNRGLGKALNEGLKHCTY 81 (201) T ss_pred EEECCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHH-CCCCEEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 99888489789999999999957999818999989999654399999861-47998999878878989997763642676 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 446058985154111533220000001358610010 Q gi|254780942|r 97 GYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVV 132 (442) Q Consensus 97 ~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~ 132 (442) +-+..+.+|- +..+.-++.+++...++..+.+. T Consensus 82 --~yI~~lD~DD-~~~p~~l~~~~~~l~~~~~~~~v 114 (201) T cd04195 82 --DWVARMDTDD-ISLPDRFEKQLDFIEKNPEIDIV 114 (201) T ss_pred --CEEEECCCCC-CCCHHHHHHHHHHHHHCCCEEEE T ss_conf --6999818898-46716999999999878996999 No 256 >PRK10073 predicted glycosyl transferase; Provisional Probab=87.79 E-value=1.7 Score=21.24 Aligned_cols=111 Identities=16% Similarity=0.114 Sum_probs=69.5 Q ss_pred CCCCCCC--EEEECCE-EHHHHHHHHHHHCCCC--EEEEEE--CC-CHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHC Q ss_conf 7888744--1258848-5799999999977997--799995--79-8899999730269728999859988614466410 Q gi|254780942|r 19 KSSSSKV--LQKIAGK-PMISHVMETIAAAGIE--NVALVL--GY-GAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTA 90 (442) Q Consensus 19 ~p~~pKp--Llpi~gk-pli~~~i~~l~~~g~~--~ivvv~--~~-~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a 90 (442) ....||- .+|+.|. .-|+..|+.+.+.-.+ ||++|= +. ++.++.+.+.....++.++.| +-.|.+.|-..+ T Consensus 2 m~~~P~vSiiiP~YN~e~yl~~cl~Si~~Qt~~~~EiIiVdDgStD~s~~i~~~~~~~~~~i~vi~~-~N~G~s~ARN~g 80 (329) T PRK10073 2 MNSTPKLSIIVPLYNAGKDFRACMESLIAQTWTALEIIIVNDGSTDNSVEIAKHYAENYPHVRLLHQ-ANAGASVARNTG 80 (329) T ss_pred CCCCCCEEEEEECCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCEEEEEC-CCCCHHHHHHHH T ss_conf 9999978999928898899999999998089999799999899982589999999812998999966-888618999999 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEE Q ss_conf 000123446058985154111533220000001-3586100100 Q gi|254780942|r 91 QDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVG 133 (442) Q Consensus 91 ~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~ 133 (442) ++.-.+ +-+..+-+|- ++.+..++.+++.. +.+.|+++.. T Consensus 81 l~~a~G--~yi~f~DsDD-~l~~~~le~l~~~a~~~~~DIv~~~ 121 (329) T PRK10073 81 LAVATG--KYVAFVDADD-EVYPTMYETLMTMALEDDLDVAQCN 121 (329) T ss_pred HHHCCC--CEEEEECCCC-CCCCCHHHHHHHHHHHCCCCEEEEC T ss_conf 997488--8899975772-7588769999999974799999987 No 257 >cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily. Probab=87.15 E-value=1.9 Score=21.02 Aligned_cols=104 Identities=18% Similarity=0.222 Sum_probs=61.5 Q ss_pred EEEECCE-EHHHHHHHHH----HHCCCC-EEEEEEC--C-CHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCC Q ss_conf 1258848-5799999999----977997-7999957--9-8899999730269728999859988614466410000123 Q gi|254780942|r 26 LQKIAGK-PMISHVMETI----AAAGIE-NVALVLG--Y-GAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKP 96 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l----~~~g~~-~ivvv~~--~-~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~ 96 (442) .+|..|. .-|.-+++.+ .+...+ +|+++=. . +..++.+.+.....+++++..++..|-+.|++.+.+.... T Consensus 2 IIP~yNEe~~I~~~i~~i~~~~~~~~~~~eIivVDD~StD~T~~i~~~l~~~~~~v~~i~~~~N~G~g~Ai~~g~~~a~~ 81 (181) T cd04187 2 VVPVYNEEENLPELYERLKAVLESLGYDYEIIFVDDGSTDRTLEILRELAARDPRVKVIRLSRNFGQQAALLAGLDHARG 81 (181) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 99616788879999999999986559987999998999756245420121233320453145576446788989985569 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 446058985154111533220000001358610010 Q gi|254780942|r 97 GYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVV 132 (442) Q Consensus 97 ~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~ 132 (442) +-++.+-||- =.++.++..|++...++.+.++. T Consensus 82 --d~i~~~D~D~-q~~p~~i~~l~~~~~~~~D~V~g 114 (181) T cd04187 82 --DAVITMDADL-QDPPELIPEMLAKWEEGYDVVYG 114 (181) T ss_pred --CEEEECCCCC-CCCHHHHHHHHHHHHCCCEEEEE T ss_conf --7477518999-82999999999999759919999 No 258 >cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment. Probab=86.41 E-value=2.1 Score=20.79 Aligned_cols=95 Identities=14% Similarity=0.287 Sum_probs=58.3 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEEECCC---H-HHHHHHHCC-CCCEEEEEECCCCCCH---HHHHHHCCCCC Q ss_conf 1258848-5799999999977997--799995798---8-999997302-6972899985998861---44664100001 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGIE--NVALVLGYG---A-EEITRINFP-PTLSVEYYIQDCQQGT---AHAVLTAQDAI 94 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~~--~ivvv~~~~---~-e~i~~~~~~-~~~~i~~v~q~~~~GT---a~Ai~~a~~~l 94 (442) ++|..|. +.|...|+.+.+.... +|+++..-. . +.++++... ....+.++.+..+.|. ++++..+.+.- T Consensus 6 ivP~~nee~~i~~~l~sll~qdYp~~Eiivv~d~s~D~t~~i~~~~~~~~p~~~~~~~~~~~~~g~~~K~~~l~~g~~~a 85 (196) T cd02520 6 LKPLCGVDPNLYENLESFFQQDYPKYEILFCVQDEDDPAIPVVRKLIAKYPNVDARLLIGGEKVGINPKVNNLIKGYEEA 85 (196) T ss_pred EEECCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHC T ss_conf 99689992779999999996689980899997899997899999987428776659997688878448999999999964 Q ss_pred CCCCCEEE-EEECCCCCCCCCCCCCCCCCCC Q ss_conf 23446058-9851541115332200000013 Q gi|254780942|r 95 KPGYDDVI-IMYGDVPLVSSHTLKKAMDKIA 124 (442) Q Consensus 95 ~~~~~~~l-Vl~GD~~li~~~~l~~l~~~~~ 124 (442) + ++++ ++-+|. .+.++-|+.++.++. T Consensus 86 ~---gdii~~~DaD~-~~~~~~L~~lv~~f~ 112 (196) T cd02520 86 R---YDILVISDSDI-SVPPDYLRRMVAPLM 112 (196) T ss_pred C---CCEEEEECCCC-CCCHHHHHHHHHHHC T ss_conf 5---89999988997-729559999999856 No 259 >COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane] Probab=86.11 E-value=2.1 Score=20.70 Aligned_cols=101 Identities=14% Similarity=0.167 Sum_probs=63.0 Q ss_pred EEEECCE-E-HHHHHHHHHHHCCCC--EEEEEECCCHHH----HHHHHCCCCCEEEEEEC-CCCCCHHHHHHHCCCCCCC Q ss_conf 1258848-5-799999999977997--799995798899----99973026972899985-9988614466410000123 Q gi|254780942|r 26 LQKIAGK-P-MISHVMETIAAAGIE--NVALVLGYGAEE----ITRINFPPTLSVEYYIQ-DCQQGTAHAVLTAQDAIKP 96 (442) Q Consensus 26 Llpi~gk-p-li~~~i~~l~~~g~~--~ivvv~~~~~e~----i~~~~~~~~~~i~~v~q-~~~~GTa~Ai~~a~~~l~~ 96 (442) ++|..|. | .++..++++.+.... +++++..-..|. ++++......++....+ .+..|-++|+..+....+. T Consensus 59 iiP~ynE~~~~~~~~l~s~~~~dyp~~evivv~d~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~gK~~al~~~l~~~~~ 138 (439) T COG1215 59 IIPAYNEEPEVLEETLESLLSQDYPRYEVIVVDDGSTDETYEILEELGAEYGPNFRVIYPEKKNGGKAGALNNGLKRAKG 138 (439) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHCCC T ss_conf 98369998678999999998278987159999689983099999999744378579996244555248999988750578 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 446058985154111533220000001358610 Q gi|254780942|r 97 GYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSI 129 (442) Q Consensus 97 ~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~ 129 (442) |-++++-+|. ...++.+..++......... T Consensus 139 --d~V~~~DaD~-~~~~d~l~~~~~~f~~~~~~ 168 (439) T COG1215 139 --DVVVILDADT-VPEPDALRELVSPFEDPPVG 168 (439) T ss_pred --CEEEEECCCC-CCCCHHHHHHHHHHCCCCEE T ss_conf --8899983887-78711999999973258804 No 260 >TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0974) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnI) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopano Probab=84.51 E-value=2.5 Score=20.26 Aligned_cols=100 Identities=13% Similarity=0.268 Sum_probs=62.0 Q ss_pred CCCC--EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCCH----HHHHHHHCC-CCCEEEEEECCCCCCHH---HHHH Q ss_conf 8744--1258848-579999999997799--77999957988----999997302-69728999859988614---4664 Q gi|254780942|r 22 SSKV--LQKIAGK-PMISHVMETIAAAGI--ENVALVLGYGA----EEITRINFP-PTLSVEYYIQDCQQGTA---HAVL 88 (442) Q Consensus 22 ~pKp--Llpi~gk-pli~~~i~~l~~~g~--~~ivvv~~~~~----e~i~~~~~~-~~~~i~~v~q~~~~GTa---~Ai~ 88 (442) +|+. ++|+.|. |-|...|+.+.+... .++++++.... +.+++...+ ...+++.+..+++.|.. +++. T Consensus 40 ~P~VSil~P~~~e~~~l~~~L~sl~~qdYP~~evi~~vd~~~D~a~~i~~~l~~~~p~~~~~~v~~~~~~g~n~K~~~L~ 119 (373) T TIGR03472 40 WPPVSVLKPLHGDEPELYENLASFCRQDYPGFQMLFGVQDPDDPALAVVRRLRADFPDADIDLVIDARRHGPNRKVSNLI 119 (373) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHH T ss_conf 99807998079983049999999986789982899997899997899999999868999879985887678878999999 Q ss_pred HCCCCCCCCCCEEEEE-ECCCCCCCCCCCCCCCCCCCC Q ss_conf 1000012344605898-515411153322000000135 Q gi|254780942|r 89 TAQDAIKPGYDDVIIM-YGDVPLVSSHTLKKAMDKIAQ 125 (442) Q Consensus 89 ~a~~~l~~~~~~~lVl-~GD~~li~~~~l~~l~~~~~~ 125 (442) ++...- ..|++++ -+|+ .+.++.+..++....+ T Consensus 120 ~~~~~a---~~dil~~~DaD~-~~~p~~L~~lVa~l~~ 153 (373) T TIGR03472 120 NMLPHA---RHDILVIADSDI-SVGPDYLRQVVAPLAD 153 (373) T ss_pred HHHHHC---CCCEEEEECCCC-CCCHHHHHHHHHHHHC T ss_conf 999862---788899987997-5284899999999748 No 261 >PRK10018 predicted glycosyl transferase; Provisional Probab=84.40 E-value=2.6 Score=20.23 Aligned_cols=98 Identities=10% Similarity=0.050 Sum_probs=63.1 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCCE--EEEEE--CCCHHHHHHHHCC-CCCEEEEEECCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 1258848-57999999999779977--99995--7988999997302-69728999859988614466410000123446 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGIEN--VALVL--GYGAEEITRINFP-PTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYD 99 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~~~--ivvv~--~~~~e~i~~~~~~-~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~ 99 (442) .+|..|. ..|...|+.+.+....+ +++|= +...+.++++... ....+.++.++...|.+.|.-.+...-.+ + T Consensus 10 IIP~yN~~~~l~~aI~SVl~Qty~n~EiIIVDD~Std~~~~~~~~~~~~d~RI~~i~~~~N~G~~~aRN~gi~~A~G--e 87 (279) T PRK10018 10 YMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTALNDPRITYIHNDINSGACAVRNQAIMLAQG--E 87 (279) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHCC--C T ss_conf 99479977999999999995799898999998999987999999997589988999878978899999999999569--9 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 058985154111533220000001358 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQG 126 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~ 126 (442) -+..+-+|- ++.+.-|+.++++..+. T Consensus 88 yIafLDsDD-~~~PnkLE~ql~~~~~~ 113 (279) T PRK10018 88 YITGIDDDD-EWTPNRLSVFLAHKQQL 113 (279) T ss_pred EEEEECCCC-CCCCCHHHHHHHHHHHC T ss_conf 899999876-87856699999614422 No 262 >cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex Probab=84.16 E-value=2.6 Score=20.17 Aligned_cols=106 Identities=21% Similarity=0.237 Sum_probs=64.2 Q ss_pred EEEECCE-EHHHHHHHHHHHC---CC-CEEEEEEC--C-CHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCC Q ss_conf 1258848-5799999999977---99-77999957--9-88999997302697289998599886144664100001234 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAA---GI-ENVALVLG--Y-GAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPG 97 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~---g~-~~ivvv~~--~-~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~ 97 (442) .+|+.|. .-|.-+++.+.+. .. -++++|=. . +..++.+.+......+.++..++..|-+.|+..+.++... T Consensus 2 iIP~yNE~~~i~~~l~~i~~~~~~~~~~eiiiVddgS~D~T~~i~~~~~~~~~~~~~i~~~~n~G~g~ai~~G~~~a~~- 80 (185) T cd04179 2 VIPAYNEEENIPELVERLLAVLEEGYDYEIIVVDDGSTDGTAEIARELAARVPRVRVIRLSRNFGKGAAVRAGFKAARG- 80 (185) T ss_pred EEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHCCEEEEEEECCCCCCCHHHHHHHHHHCCC- T ss_conf 9984487889999999999875357987999998999620788888764233049986324577705899988875258- Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEE Q ss_conf 46058985154111533220000001-358610010010 Q gi|254780942|r 98 YDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFN 135 (442) Q Consensus 98 ~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~ 135 (442) +-++++.||. -.++.++..+++.. +.+.++.+ +.+ T Consensus 81 -d~i~~~D~D~-~~~~~~i~~~i~~~~~~~~d~V~-gsR 116 (185) T cd04179 81 -DIVVTMDADL-QHPPEDIPKLLEKLLEGGADVVI-GSR 116 (185) T ss_pred -CEEEEECCCC-CCCHHHHHHHHHHHHHCCCEEEE-EEE T ss_conf -8899927998-70999999999999978985999-997 No 263 >KOG4184 consensus Probab=83.10 E-value=1.1 Score=22.29 Aligned_cols=14 Identities=14% Similarity=0.366 Sum_probs=5.5 Q ss_pred EECCCCCCCCCCCCCC Q ss_conf 8515411153322000 Q gi|254780942|r 104 MYGDVPLVSSHTLKKA 119 (442) Q Consensus 104 l~GD~~li~~~~l~~l 119 (442) ++||. |..+++.-. T Consensus 180 ~~~~~--I~~DdiHlI 193 (478) T KOG4184 180 LAGDE--IPNDDIHLI 193 (478) T ss_pred CCCCC--CCCCCEEEE T ss_conf 04675--767733688 No 264 >KOG2978 consensus Probab=77.93 E-value=4.4 Score=18.89 Aligned_cols=101 Identities=15% Similarity=0.187 Sum_probs=56.4 Q ss_pred CEEHHHHHHHHHH-HCCCC-EEEEEE--CCC-HHHHHHHHC--CCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEE- Q ss_conf 4857999999999-77997-799995--798-899999730--269728999859988614466410000123446058- Q gi|254780942|r 31 GKPMISHVMETIA-AAGIE-NVALVL--GYG-AEEITRINF--PPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVI- 102 (442) Q Consensus 31 gkpli~~~i~~l~-~~g~~-~ivvv~--~~~-~e~i~~~~~--~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~l- 102 (442) |.|++-|.+.... +.+.+ +++++- +.+ ..++.+.+. ....+|....-...+|-+-|...++.+-. ++|+ T Consensus 17 Nlpi~~~li~~~~~e~~~~~eiIivDD~SpDGt~~~a~~L~k~yg~d~i~l~pR~~klGLgtAy~hgl~~a~---g~fiv 93 (238) T KOG2978 17 NLPIITRLIAKYMSEEGKKYEIIIVDDASPDGTQEVAKALQKIYGEDNILLKPRTKKLGLGTAYIHGLKHAT---GDFIV 93 (238) T ss_pred CCEEEHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHCC---CCEEE T ss_conf 871659998754355368647999838999760999999999858981899863676650178775665436---87699 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCC-CCCCEEEEC Q ss_conf 9851541115332200000013586-100100101 Q gi|254780942|r 103 IMYGDVPLVSSHTLKKAMDKIAQGY-SIAVVGFNA 136 (442) Q Consensus 103 Vl~GD~~li~~~~l~~l~~~~~~~~-~~~i~~~~~ 136 (442) ++-+|. --.+..+.+|++..++++ |.+. ..+. T Consensus 94 iMDaDl-sHhPk~ipe~i~lq~~~~~div~-GTRY 126 (238) T KOG2978 94 IMDADL-SHHPKFIPEFIRLQKEGNYDIVL-GTRY 126 (238) T ss_pred EEECCC-CCCCHHHHHHHHHHHCCCCCEEE-EEEE T ss_conf 980766-78915679999986505752566-3037 No 265 >TIGR03469 HonB hopene-associated glycosyltransferase HpnB. This family of genes include a glycosyl transferase, group 2 domain (pfam00535) which are responsible, generally for the transfer of nucleotide-diphosphate sugars to substrates such as polysaccharides and lipids. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. Indeed, the members of this family appear to never be found in a genome lacking squalene-hopene cyclase (SHC), although not all genomes encoding SHC have this glycosyl transferase. In the organism Zymomonas mobilis the linkage of this gene to hopanoid biosynthesis has been noted and the gene named HpnB. Hopanoids are known to feature polar glycosyl head groups in many organisms. Probab=76.26 E-value=4.9 Score=18.62 Aligned_cols=101 Identities=13% Similarity=0.110 Sum_probs=57.8 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC---CEEEEEECCCH---HHHHH-HHCCC--CCEEEEEECC-CC---CCHHHHHHHCC Q ss_conf 1258848-579999999997799---77999957988---99999-73026--9728999859-98---86144664100 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI---ENVALVLGYGA---EEITR-INFPP--TLSVEYYIQD-CQ---QGTAHAVLTAQ 91 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~---~~ivvv~~~~~---e~i~~-~~~~~--~~~i~~v~q~-~~---~GTa~Ai~~a~ 91 (442) ++|..|. +-|...|+.|.+... -||+++-..-. .++.+ +.... ...++.+..+ .| .|-..|+.++. T Consensus 45 iIPARNEe~~I~~~L~Sl~~Q~yp~~~EvivvDD~StD~T~~i~~~~~~~~~~~~rl~vi~~~~lP~gW~GK~~A~~qg~ 124 (384) T TIGR03469 45 VVPARNEADVIGECVTSLLEQDYPGKLHVILVDDHSTDGTADIARAAARAYGRGDRLTVVSGQPLPPGWSGKLWAVSQGI 124 (384) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHH T ss_conf 97689878889999999983789997599999569986399999999744688863898258989987651188999999 Q ss_pred CCCCC--CCCE-EEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 00123--4460-589851541115332200000013586 Q gi|254780942|r 92 DAIKP--GYDD-VIIMYGDVPLVSSHTLKKAMDKIAQGY 127 (442) Q Consensus 92 ~~l~~--~~~~-~lVl~GD~~li~~~~l~~l~~~~~~~~ 127 (442) +.-.. ..++ ++.+-+|+ .+.++.++.++.+.++.. T Consensus 125 ~~A~~~~~~ge~llF~DADv-~~~p~~l~~~v~~~~~~~ 162 (384) T TIGR03469 125 AAARTLAPPADYLLLTDADI-AHGPDNLARLVARARAEG 162 (384) T ss_pred HHHHCCCCCCCEEEEECCCC-CCCCCHHHHHHHHHHHCC T ss_conf 99852578775799945750-008769999999999759 No 266 >cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=75.66 E-value=5.1 Score=18.53 Aligned_cols=93 Identities=12% Similarity=0.148 Sum_probs=55.2 Q ss_pred EEEECCE-EHHHHHHHHHHHCC--CCEEEEEEC--C-CHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 1258848-57999999999779--977999957--9-8899999730269728999859988614466410000123446 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAG--IENVALVLG--Y-GAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYD 99 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g--~~~ivvv~~--~-~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~ 99 (442) .+|..|. ..|...|+.|.+.. --||+||-+ . +..++.+.. . +.++. ++.|-|.|.-.+...-.+ + T Consensus 4 IIP~yNe~~~l~~~L~Sl~~q~~~~~EvIVVDdgStD~T~~i~~~~---~--~~~~~--~~~G~a~a~N~G~~~A~G--e 74 (221) T cd02522 4 IIPTLNEAENLPRLLASLRRLNPLPLEIIVVDGGSTDGTVAIARSA---G--VVVIS--SPKGRARQMNAGAAAARG--D 74 (221) T ss_pred EEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHH---C--CEEEE--CCCCHHHHHHHHHHHCCC--C T ss_conf 9960788889999999997578998389999896987649999972---5--22552--697879999999986788--8 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 05898515411153322000000135861 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQGYS 128 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~~~ 128 (442) -++.+++|. .+.+.-++.+++....... T Consensus 75 ~i~flDaD~-~~~~~~l~~l~~~~~~~~~ 102 (221) T cd02522 75 WLLFLHADT-RLPPDWDAAIIETLRADGA 102 (221) T ss_pred EEEEECCCC-CCCCCHHHHHHHHHHCCCC T ss_conf 799877523-6890699999999973998 No 267 >PTZ00260 glycosyl transferase group 2; Provisional Probab=71.04 E-value=6.7 Score=17.88 Aligned_cols=175 Identities=13% Similarity=0.096 Sum_probs=89.9 Q ss_pred EHHHHHHHHHHHCC-----CC-EEEEEE-CC--CHHHH-HHHHCC-CCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEE Q ss_conf 57999999999779-----97-799995-79--88999-997302-6972899985998861446641000012344605 Q gi|254780942|r 33 PMISHVMETIAAAG-----IE-NVALVL-GY--GAEEI-TRINFP-PTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDV 101 (442) Q Consensus 33 pli~~~i~~l~~~g-----~~-~ivvv~-~~--~~e~i-~~~~~~-~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~ 101 (442) +|++-.++.|.+.. ++ ||+||= |- +..++ .++... +..+++.+..+.-+|-|+|++.++.+-.+ +-+ T Consensus 96 ~mL~e~~~yL~~~~~~~~~~~yEIIVVDDGStD~T~eVa~~~~~~~~~~~IRvl~l~kNrGKG~AVr~Gml~ArG--~~I 173 (336) T PTZ00260 96 RTLEATFKHFESRNREDPKFIYEIIIINDGSKDKTLKVAKKYWKSINNKNFRLLTYNRNRGKGGAVKLGMLASAG--KYQ 173 (336) T ss_pred HHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCC--CEE T ss_conf 899999999986104588986799998169998869999999997699809999716777846999999998179--889 Q ss_pred EEEECCCCCCCCCCCCCCCCCC----CCCCCCCCEEEEC----CCCC---CCCE-EEEC-CCCCEECCCCCCCCCCCCCC Q ss_conf 8985154111533220000001----3586100100101----2215---7440-4202-33310010014667542100 Q gi|254780942|r 102 IIMYGDVPLVSSHTLKKAMDKI----AQGYSIAVVGFNA----DNPK---GYGR-LLIK-NNEIIAIREENDATDEERKI 168 (442) Q Consensus 102 lVl~GD~~li~~~~l~~l~~~~----~~~~~~~i~~~~~----~dp~---~yGv-V~~d-~~~v~~i~Ek~d~~~~~~~~ 168 (442) |.+-+|. =....++..|.... ..+.++++. .+. ++.. .+-+ +... =+.++.++--+.... T Consensus 174 LfaDADg-At~i~dl~kL~~~l~~~~~~~~~vviG-SRahl~~~~~v~~rs~~R~~l~~gFh~lV~~l~~~~I~D----- 246 (336) T PTZ00260 174 LMMDADG-ATDLNEFEKLEDILIKITQNGLGIVAG-SRNHLVKEDVVVQRKWYRNILMHCFHFIINTICGVRLKD----- 246 (336) T ss_pred EEEECCC-CCCHHHHHHHHHHHHHHCCCCCEEEEE-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC----- T ss_conf 9994788-989789999999997432379449983-463435677403551999999999999999980688530----- Q ss_pred CCCCCCHHEECCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEC Q ss_conf 03556001112501467775328887400111211489999730771799917 Q gi|254780942|r 169 HYCNSGLMAIDGLYIMDWLLQIKKNKVSQEYYLTDIIEKARLDGKSIASIDVK 221 (442) Q Consensus 169 ~lin~GiY~f~~~~L~~~l~~l~~~~~~gE~yltDii~~l~~~g~~i~~~~~~ 221 (442) -.||.=.|+.+..+..++.+..+.+.-+. .++-.+.+.|.++.-+++. T Consensus 247 --TQCGFKlF~r~~a~~if~~~~~~gwaFDV---ElL~lA~~~g~~I~EvPV~ 294 (336) T PTZ00260 247 --TQCGFKLFTRNTARIIFPVLHIERWAFDV---EIFMIAQKLNVPISEVPVK 294 (336) T ss_pred --CCCCEEEECHHHHHHHHHHHEECCEEEHH---HHHHHHHHCCCCEEEEEEE T ss_conf --65650457389999998763237567609---9999999879947996248 No 268 >PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional Probab=67.64 E-value=7.9 Score=17.45 Aligned_cols=106 Identities=13% Similarity=0.223 Sum_probs=65.8 Q ss_pred EEEECCE-----EHHHHHHHHHHHCCCC-EEEEEE--CCC-H-HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCC Q ss_conf 1258848-----5799999999977997-799995--798-8-9999973026972899985998861446641000012 Q gi|254780942|r 26 LQKIAGK-----PMISHVMETIAAAGIE-NVALVL--GYG-A-EEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIK 95 (442) Q Consensus 26 Llpi~gk-----pli~~~i~~l~~~g~~-~ivvv~--~~~-~-e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~ 95 (442) .+|+.|. ++++.+...+.+.+.+ |+++|= +.+ . +.+++.....+..+..+.-....|-..|+.+++++-. T Consensus 11 ViP~yNEe~~i~~~~~~~~~~l~~~~~~~EiI~VDDgS~D~T~~~l~~l~~~~~~~v~~i~lsRNfG~~~Ai~aGl~~a~ 90 (324) T PRK10714 11 VIPVYNEQESLPELIRRTTAACESLGKEYEILLIDDGSSDNSAEMLVEASQAEDSHIISILLNRNYGQHSAIMAGFSHVT 90 (324) T ss_pred EEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCC T ss_conf 98017778779999999999998679998999998999867799999986305996899989889885689999987347 Q ss_pred CCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 3446058985154111533220000001358610010010 Q gi|254780942|r 96 PGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFN 135 (442) Q Consensus 96 ~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~ 135 (442) + |-++++-+|. =-++..+.+|++..+++.++ +.+.+ T Consensus 91 G--d~vi~mD~DL-QdpPe~Ip~li~~~~~G~Dv-V~~~r 126 (324) T PRK10714 91 G--DLIITLDADL-QNPPEEIPRLVAKADEGYDV-VGTVR 126 (324) T ss_pred C--CEEEEECCCC-CCCHHHHHHHHHHHHCCCCE-EEEEE T ss_conf 9--9899986888-75988999999987048868-99988 No 269 >PRK00923 sirohydrochlorin cobaltochelatase; Reviewed Probab=64.50 E-value=9.1 Score=17.10 Aligned_cols=51 Identities=20% Similarity=0.522 Sum_probs=32.2 Q ss_pred CCCCCEEEEECCCCCCCCCC--------------CCCCCEEEE----CCEEHHHHHHHHHHHCCCCEEEEEE Q ss_conf 98740899945887723278--------------887441258----8485799999999977997799995 Q gi|254780942|r 1 MKRKRLAIVLAAGRGHRMKS--------------SSSKVLQKI----AGKPMISHVMETIAAAGIENVALVL 54 (442) Q Consensus 1 m~~~~~AiILAaG~GtRl~p--------------~~pKpLlpi----~gkpli~~~i~~l~~~g~~~ivvv~ 54 (442) |.| +|++|. |-|||... ..+-..+.. ...|-+.-.++.|.+.|+++|+++- T Consensus 1 m~k--~alllv-gHGSr~~~~~~~~~~~a~~l~~~~~~~~V~~afle~~~P~i~~~~~~l~~~G~~~ivvvP 69 (130) T PRK00923 1 MEK--IGLLLV-GHGSRLPYNKEVVTEIAEKIKEKFPDYIVEVGFMEFNEPTIPEALKKFAGTGVDKIIVVP 69 (130) T ss_pred CCC--EEEEEE-ECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEE T ss_conf 995--699999-789997678999999999999858998298877615789899999999966998899986 No 270 >cd02511 Beta4Glucosyltransferase UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide. UDP-glucose: lipooligosaccharide (LOS) beta-1-4-glucosyltransferase catalyzes the addition of the first residue, glucose, of the lacto-N-neotetrase structure to HepI of the LOS inner core. LOS is the major constituent of the outer leaflet of the outer membrane of gram-positive bacteria. It consists of a short oligosaccharide chain of variable composition (alpha chain) attached to a branched inner core which is lined in turn to lipid A. Beta 1,4 glucosyltransferase is required to attach the alpha chain to the inner core. Probab=63.74 E-value=9.5 Score=17.01 Aligned_cols=92 Identities=13% Similarity=0.104 Sum_probs=54.0 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEE Q ss_conf 1258848-579999999997799779999579889999973026972899985998861446641000012344605898 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIM 104 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl 104 (442) +++..|. .-|...|+.+... .+||+||=+.=.|.-.+.....+.. ++.+ +-.|-+.+-..+.+..+ .+.+|.+ T Consensus 5 ~i~t~Nee~~l~~~l~sl~~~-~deiivvDs~StD~T~~ia~~~~~~--v~~~-~~~~~~~qrn~a~~~a~--~dWil~l 78 (229) T cd02511 5 VIITKNEERNIERCLESVKWA-VDEIIVVDSGSTDRTVEIAKEYGAK--VYQR-WWDGFGAQRNFALELAT--NDWVLSL 78 (229) T ss_pred EEEECCCHHHHHHHHHHHHCC-CCEEEEEECCCCCCHHHHHHHCCCE--EEEC-CCCCHHHHHHHHHHHCC--CCEEEEE T ss_conf 998258488899999987643-9889999597987529999984990--8954-77887999999998389--9889997 Q ss_pred ECCCCCCCCCCCCCCCCCCC Q ss_conf 51541115332200000013 Q gi|254780942|r 105 YGDVPLVSSHTLKKAMDKIA 124 (442) Q Consensus 105 ~GD~~li~~~~l~~l~~~~~ 124 (442) -+|. .+++...+++..... T Consensus 79 DADE-~~~~~l~~ei~~~~~ 97 (229) T cd02511 79 DADE-RLTPELADEILALLA 97 (229) T ss_pred CCCC-CCCHHHHHHHHHHHH T ss_conf 1101-279999999999972 No 271 >COG0381 WecB UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane] Probab=60.14 E-value=11 Score=16.64 Aligned_cols=71 Identities=11% Similarity=0.175 Sum_probs=39.4 Q ss_pred HHHHHHHCC-CCEEEEEECCCH--HHHHHHHCCC-----CCEEEEEECCCCCC--HHHHHHHCCCCCCCCCCEEEEEECC Q ss_conf 999999779-977999957988--9999973026-----97289998599886--1446641000012344605898515 Q gi|254780942|r 38 VMETIAAAG-IENVALVLGYGA--EEITRINFPP-----TLSVEYYIQDCQQG--TAHAVLTAQDAIKPGYDDVIIMYGD 107 (442) Q Consensus 38 ~i~~l~~~g-~~~ivvv~~~~~--e~i~~~~~~~-----~~~i~~v~q~~~~G--Ta~Ai~~a~~~l~~~~~~~lVl~GD 107 (442) ++..+.+.+ ++.++++|+-+. +....++... ..+.....+...++ |+..+....+.+.....|.++++|| T Consensus 22 li~~~~~~~~~~~~vi~TGQH~d~em~~~~le~~~i~~pdy~L~i~~~~~tl~~~t~~~i~~~~~vl~~~kPD~VlVhGD 101 (383) T COG0381 22 LVKALEKDPDFELIVIHTGQHRDYEMLDQVLELFGIRKPDYDLNIMKPGQTLGEITGNIIEGLSKVLEEEKPDLVLVHGD 101 (383) T ss_pred HHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHCCCCCCCCHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 99999858997359997066542778999999828988883132166688889999999999999998629998999178 Q ss_pred C Q ss_conf 4 Q gi|254780942|r 108 V 108 (442) Q Consensus 108 ~ 108 (442) + T Consensus 102 T 102 (383) T COG0381 102 T 102 (383) T ss_pred C T ss_conf 5 No 272 >cd06436 GlcNAc-1-P_transferase N-acetyl-glucosamine transferase is involved in the synthesis of Poly-beta-1,6-N-acetyl-D-glucosamine. N-acetyl-glucosamine transferase is responsible for the synthesis of bacteria Poly-beta-1,6-N-acetyl-D-glucosamine (PGA). Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer that serves as an adhesion for the maintenance of biofilm structural stability in diverse eubacteria. N-acetyl-glucosamine transferase is the product of gene pgaC. Genetic analysis indicated that all four genes of the pgaABCD locus were required for the PGA production, pgaC being a glycosyltransferase. Probab=55.20 E-value=13 Score=16.15 Aligned_cols=96 Identities=13% Similarity=0.102 Sum_probs=54.6 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC-CEEEEEEC-CC---HHHHHHHHCCCCCEEEEEEC---CCCCCHHHHHHHCCCCCCC Q ss_conf 1258848-579999999997799-77999957-98---89999973026972899985---9988614466410000123 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGI-ENVALVLG-YG---AEEITRINFPPTLSVEYYIQ---DCQQGTAHAVLTAQDAIKP 96 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~-~~ivvv~~-~~---~e~i~~~~~~~~~~i~~v~q---~~~~GTa~Ai~~a~~~l~~ 96 (442) |+|-.|. ..|..+|+.|.+... .+|+++-. .. .+.++.+.. ...++.+.+ ....|-++|+-.++.++.. T Consensus 2 lIPa~NEe~~I~~ti~sl~~~~~~~eIivvdDgS~D~T~~~~~~~~~--~~~~~vi~~~~~~~~~GK~~ALN~al~~~~~ 79 (191) T cd06436 2 LVPCLNEEAVIQRTLASLLRNKPNFLVLVIDDASDDDTAGIVRLAIT--DSRVHLLRRHLPNARTGKGDALNAAYDQIRQ 79 (191) T ss_pred EEEECCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHH--CCCEEEEEECCCCCCCCCHHHHHHHHHHHHH T ss_conf 87307878999999999973899968999979999209999999830--8987999954776678515788889998664 Q ss_pred --------CCCEE-EEEECCCCCCCCCCCCCCCCCCC Q ss_conf --------44605-89851541115332200000013 Q gi|254780942|r 97 --------GYDDV-IIMYGDVPLVSSHTLKKAMDKIA 124 (442) Q Consensus 97 --------~~~~~-lVl~GD~~li~~~~l~~l~~~~~ 124 (442) ..+++ +++-+|. .+++..|..++.+++ T Consensus 80 ~~~~~~~~~~~eii~v~DAD~-~~~~d~L~~~~~~f~ 115 (191) T cd06436 80 ILIEEGADPERVIIAVIDADG-RLDPNALEAVAPYFS 115 (191) T ss_pred HHHHCCCCCCCEEEEEECCCC-CCCHHHHHHHHHHHC T ss_conf 320001356641899953787-328899999999725 No 273 >cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan. Periplasmic Glucan Biosynthesis protein ModH is a glucosyltransferase that catalyzes the elongation of beta-1,2 polyglucose chains of glucan, requiring a beta-glucoside as a primer and UDP-glucose as a substrate. Glucans are composed of 5 to 10 units of glucose forming a highly branched structure, where beta-1,2-linked glucose constitutes a linear backbone to which branches are attached by beta-1,6 linkages. In Escherichia coli, glucans are located in the periplasmic space, functioning as regulator of osmolarity. It is synthesized at a maximum when cells are grown in a medium with low osmolarity. It has been shown to span the cytoplasmic membrane. Probab=52.32 E-value=15 Score=15.88 Aligned_cols=107 Identities=11% Similarity=0.183 Sum_probs=60.4 Q ss_pred CEEEECCEE--HHHH----HHHHHHHCCC---CEEEEEEC-CCHHH-------HHHHHCC--CCCEEEEEECCCCCC-HH Q ss_conf 412588485--7999----9999997799---77999957-98899-------9997302--697289998599886-14 Q gi|254780942|r 25 VLQKIAGKP--MISH----VMETIAAAGI---ENVALVLG-YGAEE-------ITRINFP--PTLSVEYYIQDCQQG-TA 84 (442) Q Consensus 25 pLlpi~gkp--li~~----~i~~l~~~g~---~~ivvv~~-~~~e~-------i~~~~~~--~~~~i~~v~q~~~~G-Ta 84 (442) -|+|+.|.+ .+.- +.+.|...+. -+++|+-- ...+. +.+.... ....+.|..-++..| -| T Consensus 3 vlvP~yNEd~~~v~~~l~a~~~sl~~~~~~~~f~v~vLsDs~~p~~~~~E~~a~~~l~~~~~~~~~v~Yr~R~~n~g~KA 82 (254) T cd04191 3 IVMPVYNEDPARVFAGLRAMYESLAKTGLADHFDFFILSDTRDPDIWLAEEAAWLDLCEELGAQGRIYYRRRRENTGRKA 82 (254) T ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCH T ss_conf 99956889989999999999999986388565379998089975789999999999999847887459885686679873 Q ss_pred HHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 466410000123446058985154111533220000001358610010 Q gi|254780942|r 85 HAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVV 132 (442) Q Consensus 85 ~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~ 132 (442) |.|..++.......+-++|+-+|. +.+.+.|..++.....+..+.++ T Consensus 83 Gni~~~l~~~g~~yd~~~vlDAD~-~~~~d~l~~lv~~~e~dp~~glV 129 (254) T cd04191 83 GNIADFCRRWGSRYDYMVVLDADS-LMSGDTIVRLVRRMEANPRAGII 129 (254) T ss_pred HHHHHHHHHCCCCCCEEEEECCCC-CCCHHHHHHHHHHHHHCCCEEEE T ss_conf 779999995399846799975888-99869999999999769886897 No 274 >PRK00129 upp uracil phosphoribosyltransferase; Reviewed Probab=52.13 E-value=13 Score=16.31 Aligned_cols=22 Identities=9% Similarity=0.318 Sum_probs=16.8 Q ss_pred EEECCEEHHHHHHHHHHHCCCC Q ss_conf 2588485799999999977997 Q gi|254780942|r 27 QKIAGKPMISHVMETIAAAGIE 48 (442) Q Consensus 27 lpi~gkpli~~~i~~l~~~g~~ 48 (442) +-+...|++.|.|..|...... T Consensus 3 v~v~~hPl~~~~Lt~LRd~~T~ 24 (208) T PRK00129 3 VYVVDHPLVQHKLTLLRDKDTS 24 (208) T ss_pred EEECCCHHHHHHHHHHHCCCCC T ss_conf 6885967999899998789998 No 275 >TIGR00461 gcvP glycine dehydrogenase; InterPro: IPR003437 This family consists of glycine cleavage system P-proteins (1.4.4.2 from EC) from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex (2.1.2.10 from EC (GDC) also annotated as glycine cleavage system or glycine synthase. The P protein binds the alpha-amino group of glycine through its pyridoxal phosphate cofactor, carbon dioxide is released and the remaining methylamin moiety is then transferred to the lipoamide cofactor of the H protein. GDC consists of four proteins P, H, L and T . The reaction catalysed by this protein is: Glycine + lipoylprotein = S-aminomethyldihydrolipoylprotein + CO2 ; GO: 0004375 glycine dehydrogenase (decarboxylating) activity, 0006544 glycine metabolic process, 0005961 glycine dehydrogenase complex (decarboxylating). Probab=50.41 E-value=16 Score=15.70 Aligned_cols=87 Identities=17% Similarity=0.217 Sum_probs=40.9 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHHHHHH--CCCCCEEEEEECCCCCCHHHHHHHCCCCCCCC-CCEEEEEE-C-CCC Q ss_conf 9999999997799779999579889999973--02697289998599886144664100001234-46058985-1-541 Q gi|254780942|r 35 ISHVMETIAAAGIENVALVLGYGAEEITRIN--FPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPG-YDDVIIMY-G-DVP 109 (442) Q Consensus 35 i~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~--~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~-~~~~lVl~-G-D~~ 109 (442) +.|-++..+++ .+ ++|-..-..|-.+.+ +..-.+++.++.|. ..+....+-++++ -.-+|+-| | |-- T Consensus 148 ls~~vSk~Kna--n~-ffva~d~HpQT~~V~~TRA~pfgiEvIv~D~-----~~~~~~~dvfknPdvsG~LlQypaTdG~ 219 (965) T TIGR00461 148 LSFRVSKKKNA--NK-FFVAKDVHPQTLEVVKTRAKPFGIEVIVDDA-----SDVKKAVDVFKNPDVSGVLLQYPATDGE 219 (965) T ss_pred HHHHHHHHHHH--CC-CHHHHCCCCCCHHHHHHCCCCCCEEEEECCH-----HHHHHHHHHHCCCCCCEEEEECCCCCCC T ss_conf 98764222211--01-1021037734368875240314516886344-----5556544332177744157623577772 Q ss_pred CCCCCCCCCCCCCCCCCCCC Q ss_conf 11533220000001358610 Q gi|254780942|r 110 LVSSHTLKKAMDKIAQGYSI 129 (442) Q Consensus 110 li~~~~l~~l~~~~~~~~~~ 129 (442) ...++++++|.+..++...+ T Consensus 220 i~PPddy~~lid~lhshk~l 239 (965) T TIGR00461 220 IYPPDDYKALIDKLHSHKAL 239 (965) T ss_pred CCCCHHHHHHHHHHHCCHHH T ss_conf 36862289999987413012 No 276 >PRK08266 hypothetical protein; Provisional Probab=49.37 E-value=17 Score=15.61 Aligned_cols=73 Identities=15% Similarity=0.151 Sum_probs=39.9 Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEEC-CCC------------------------CCHHHHHHHC Q ss_conf 999999997799779999579889999973026972899985-998------------------------8614466410 Q gi|254780942|r 36 SHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQ-DCQ------------------------QGTAHAVLTA 90 (442) Q Consensus 36 ~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q-~~~------------------------~GTa~Ai~~a 90 (442) +++++.|++.|++.++-+.|-....+.+.+......++++.- .|. -|.++++-.. T Consensus 5 ~~l~~~L~~~Gv~~vFg~pG~~~~~l~dal~~~~~~i~~v~~~hE~~A~~~A~gyar~tg~~gv~~~t~GpG~~N~~~~v 84 (531) T PRK08266 5 EAIVRGLVAHGVDTVFGIPGAQLYGLFDALAKAADQIRFIGTRHEQAAGYMAFGYARSTGRPGVFSVVPGPGVLNAGAAL 84 (531) T ss_pred HHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHH T ss_conf 99999999879989998489555999999996589966991287899999999999997997899982582699899999 Q ss_pred CCCCCCCCCEEEEEECCCC Q ss_conf 0001234460589851541 Q gi|254780942|r 91 QDAIKPGYDDVIIMYGDVP 109 (442) Q Consensus 91 ~~~l~~~~~~~lVl~GD~~ 109 (442) .+...+ .-++|++.||.+ T Consensus 85 ~~A~~~-~~Pvl~isG~~~ 102 (531) T PRK08266 85 LTAYGC-NAPVLCLTGQIP 102 (531) T ss_pred HHHHHC-CCCEEEEECCCC T ss_conf 999862-998799958896 No 277 >TIGR00420 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; InterPro: IPR004506 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase (2.1.1.61 from EC) catalyses the addition of 5-methylaminomethyl-2-thiouridylate to tRNAs using S-adenosyl-L-methionine as a substrate and releasing S-adenosyl-L-homocysteine. The enzyme is cytoplasmic and is involved in tRNA processing.; GO: 0004808 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=44.17 E-value=18 Score=15.48 Aligned_cols=22 Identities=18% Similarity=0.228 Sum_probs=10.8 Q ss_pred CEECCCCEECCCCCCCCEEEEC Q ss_conf 8997783576643698689813 Q gi|254780942|r 404 TYVASGSIITQDTPENSLVFAR 425 (442) Q Consensus 404 ~~i~ag~~v~~dv~~~~~v~~~ 425 (442) ..-++--||.+|+.-+.|++++ T Consensus 279 g~~~~~FVv~kD~~~N~l~v~~ 300 (394) T TIGR00420 279 GAAEPWFVVEKDLETNELVVSQ 300 (394) T ss_pred CCCCCCCEEEEECCCCEEEEEC T ss_conf 3357632354203148799971 No 278 >TIGR02661 MauD methylamine dehydrogenase accessory protein MauD; InterPro: IPR013478 MauD appears critical to proper formation of the small subunit of methylamine dehydrogenase, which has both an unusual tryptophan tryptophylquinone cofactor and multiple disulphide bonds. MauD shares sequence similarity, including a CPxC motif, with a number of thiol:disulphide interchange proteins. In MauD mutants, the small subunit apparently does not form properly and is rapidly degraded .; GO: 0030416 methylamine metabolic process. Probab=42.21 E-value=22 Score=14.96 Aligned_cols=73 Identities=23% Similarity=0.186 Sum_probs=43.6 Q ss_pred EEEECCCCCCCCCCCC---CCCEEEECCE--EHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCC-EEEEEECCCC Q ss_conf 9994588772327888---7441258848--57999999999779977999957988999997302697-2899985998 Q gi|254780942|r 7 AIVLAAGRGHRMKSSS---SKVLQKIAGK--PMISHVMETIAAAGIENVALVLGYGAEEITRINFPPTL-SVEYYIQDCQ 80 (442) Q Consensus 7 AiILAaG~GtRl~p~~---pKpLlpi~gk--pli~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~-~i~~v~q~~~ 80 (442) +.|+.||.-+.-||+. --|--|||.| ||| +..+++.-.+++++-.=..++-++++.+... +..||+..|- T Consensus 63 ~~V~iG~~~a~~R~~LLMFTaPsCPvC~KL~PiI----kSia~~E~~~Vv~ISDG~~~EHr~FL~~H~L~~~~YVVSAE~ 138 (189) T TIGR02661 63 EPVRIGGSIALGRPTLLMFTAPSCPVCDKLLPII----KSIARAEEIDVVLISDGTPEEHRRFLKDHELGEERYVVSAEI 138 (189) T ss_pred CEEEECCCCCCCCCEEEEECCCCCCHHHHHHHHH----HHHHHHHCCCEEEEECCCCHHHHHHHHCCCCCCEEEEEEECC T ss_conf 6456117556777503544388885322432789----888867253258972799279998864288887137886202 Q ss_pred CCHH Q ss_conf 8614 Q gi|254780942|r 81 QGTA 84 (442) Q Consensus 81 ~GTa 84 (442) |++ T Consensus 139 -GM~ 141 (189) T TIGR02661 139 -GMA 141 (189) T ss_pred -CCC T ss_conf -210 No 279 >TIGR00027 mthyl_TIGR00027 methyltransferase, putative, TIGR00027 family; InterPro: IPR011610 This entry represents a paralogous family with ~15 members in Mycobacterium tuberculosis and two members in Mycobacteriam leprae. No other homologs are found. . Probab=40.75 E-value=11 Score=16.68 Aligned_cols=13 Identities=38% Similarity=0.647 Sum_probs=8.5 Q ss_pred CEEEEECCCCCCC Q ss_conf 0899945887723 Q gi|254780942|r 5 RLAIVLAAGRGHR 17 (442) Q Consensus 5 ~~AiILAaG~GtR 17 (442) ++.||||||.-|| T Consensus 92 rQVViLgAGLDtR 104 (281) T TIGR00027 92 RQVVILGAGLDTR 104 (281) T ss_pred CEEEEECCHHHHH T ss_conf 6754301212358 No 280 >cd03414 CbiX_SirB_C Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), C-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both CbiX and SirB are found in a wide range of bacteria. Probab=40.31 E-value=23 Score=14.78 Aligned_cols=48 Identities=25% Similarity=0.396 Sum_probs=30.5 Q ss_pred EEEEECCCCCCCCCC--------------CCCCCEEEE----CCEEHHHHHHHHHHHCCCCEEEEEE Q ss_conf 899945887723278--------------887441258----8485799999999977997799995 Q gi|254780942|r 6 LAIVLAAGRGHRMKS--------------SSSKVLQKI----AGKPMISHVMETIAAAGIENVALVL 54 (442) Q Consensus 6 ~AiILAaG~GtRl~p--------------~~pKpLlpi----~gkpli~~~i~~l~~~g~~~ivvv~ 54 (442) +|++|+ |.|||--. ..|-..+.. ...|-+.-.++.+.+.|.++|+++- T Consensus 1 Tavllv-gHGSr~p~a~~~~~~la~~l~~~~~~~~v~~~fle~~~P~l~e~l~~l~~~G~~~ivvvP 66 (117) T cd03414 1 TAVVLV-GRGSSDPDANADVAKIARLLEEGTGFARVETAFAAATRPSLPEALERLRALGARRVVVLP 66 (117) T ss_pred CEEEEE-ECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHCCCCEEEEEE T ss_conf 989999-589998789999999999999768998188877536799899999999976997599976 No 281 >COG4565 CitB Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms] Probab=39.52 E-value=7.9 Score=17.46 Aligned_cols=63 Identities=19% Similarity=0.230 Sum_probs=34.9 Q ss_pred CCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC--CCCCCEEEE-----CCCCCCCCEE Q ss_conf 8614466410000123446058985154111533220000001358--610010010-----1221574404 Q gi|254780942|r 81 QGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQG--YSIAVVGFN-----ADNPKGYGRL 145 (442) Q Consensus 81 ~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~--~~~~i~~~~-----~~dp~~yGvV 145 (442) -|||+.+..++..+.....|++++ |+|+=+..-++-+-+.+++. .++.+.++- +.+.-+||++ T Consensus 29 vg~A~~~~ea~~~i~~~~pDLILL--DiYmPd~~Gi~lL~~ir~~~~~~DVI~iTAA~d~~tI~~alr~Gv~ 98 (224) T COG4565 29 VGTAGTLEEAKMIIEEFKPDLILL--DIYMPDGNGIELLPELRSQHYPVDVIVITAASDMETIKEALRYGVV 98 (224) T ss_pred EEEECCHHHHHHHHHHHCCCEEEE--EECCCCCCCHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHCCCH T ss_conf 986064999999998408997999--6026798507799999846899788999534437899999965823 No 282 >cd03415 CbiX_CbiC Archaeal sirohydrochlorin cobalt chelatase (CbiX) single domain. Proteins in this subgroup contain a single CbiX domain N-terminal to a precorrin-8X methylmutase (CbiC) domain. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, while CbiC catalyzes the conversion of cobalt-precorrin 8 to cobyrinic acid by methyl rearrangement. Both CbiX and CbiC are involved in vitamin B12 biosynthesis. Probab=37.61 E-value=26 Score=14.54 Aligned_cols=48 Identities=17% Similarity=0.333 Sum_probs=29.7 Q ss_pred EEEEECCCCCCCCCC-----------CCCCCEEEE------CCEEHHHHHHHHHHHCCCCEEEEEE Q ss_conf 899945887723278-----------887441258------8485799999999977997799995 Q gi|254780942|r 6 LAIVLAAGRGHRMKS-----------SSSKVLQKI------AGKPMISHVMETIAAAGIENVALVL 54 (442) Q Consensus 6 ~AiILAaG~GtRl~p-----------~~pKpLlpi------~gkpli~~~i~~l~~~g~~~ivvv~ 54 (442) +||+|. |-|||... .+.|.=.|+ .++|-|.-.++.+.+.|+++|+++- T Consensus 1 TAilLv-~HGSR~~~~~e~~~~la~~v~e~~~~~v~~~fmElaePsi~e~l~~~v~~G~~~IiVvP 65 (125) T cd03415 1 TAIIII-THGSRRNTFNEDMEEWAAYLERKLGVPVYLTYNEYAEPNWRDLLNELLSEGYGHIIIAL 65 (125) T ss_pred CEEEEE-ECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHCCCCCEEEEE T ss_conf 969999-46789577899999999999986199558755110699889999999984998199995 No 283 >TIGR01668 YqeG_hyp_ppase HAD superfamily (subfamily IIIA) phosphatase, TIGR01668; InterPro: IPR010021 This group of hypothetical proteins is a part of the IIIA subfamily of the haloacid dehalogenase (HAD) superfamily of hydrolases. All characterised members of this subfamily and most characterised members of the HAD superfamily are phosphatases. HAD superfamily phosphatases contain active site residues in several conserved catalytic motifs , all of which are found conserved here. This family consists of sequences from fungi, plants, cyanobacteria, Gram-positive bacteria and Deinococcus. There is presently no characterisation of any sequence in this family. . Probab=33.97 E-value=30 Score=14.19 Aligned_cols=100 Identities=16% Similarity=0.226 Sum_probs=48.8 Q ss_pred EHHHHHHHHHHHC--CCCEEEEEE------CCCHHHHHHHHCCCCCEEEEEEC-CCCCCHHHHH---HHCCCCCCCC-CC Q ss_conf 5799999999977--997799995------79889999973026972899985-9988614466---4100001234-46 Q gi|254780942|r 33 PMISHVMETIAAA--GIENVALVL------GYGAEEITRINFPPTLSVEYYIQ-DCQQGTAHAV---LTAQDAIKPG-YD 99 (442) Q Consensus 33 pli~~~i~~l~~~--g~~~ivvv~------~~~~e~i~~~~~~~~~~i~~v~q-~~~~GTa~Ai---~~a~~~l~~~-~~ 99 (442) |-+.=.++.++.+ |. ++.|+- .|+.|+-+--..+...+|.+... .+|.|.|+-+ ..|...+.-. .. T Consensus 55 P~l~~w~E~~k~~yPG~-~l~ivSN~~~~~~yD~D~~r~k~~~~~lgI~~~~~a~Kp~~~Gdeim~~~~A~~~~~l~s~~ 133 (196) T TIGR01668 55 PELREWIEELKAAYPGL-KLLIVSNNAGLLEYDKDESRVKALEKELGIPVLLHAVKPPGKGDEIMAFRRALKEMGLTSSE 133 (196) T ss_pred HHHHHHHHHHHHHCCCC-EEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCC T ss_conf 36678899998537995-79998768765431201478888887518851147788788833789999998751798732 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCC Q ss_conf 058985154111533220000001358610010010122157 Q gi|254780942|r 100 DVIIMYGDVPLVSSHTLKKAMDKIAQGYSIAVVGFNADNPKG 141 (442) Q Consensus 100 ~~lVl~GD~~li~~~~l~~l~~~~~~~~~~~i~~~~~~dp~~ 141 (442) +++|+ ||- |+|+.....+ ..-.+|++.++..|.+ T Consensus 134 ~~~vv-GDr-L~TDv~~gNr------~G~~tIlv~p~~~p~~ 167 (196) T TIGR01668 134 QVVVV-GDR-LFTDVLGGNR------NGFYTILVEPLVHPDE 167 (196) T ss_pred CEEEE-CCC-CHHHHHHHHH------CCCEEEEECCCCCCHH T ss_conf 37887-895-1467886532------4723478512555015 No 284 >PRK13337 putative lipid kinase; Reviewed Probab=33.90 E-value=30 Score=14.19 Aligned_cols=46 Identities=22% Similarity=0.209 Sum_probs=24.5 Q ss_pred HHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCC Q ss_conf 9999730269728999859988614466410000123446058985154 Q gi|254780942|r 60 EITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDV 108 (442) Q Consensus 60 ~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~ 108 (442) ++.+.+...+.++++...+ +.+||...+++......|-++++-||- T Consensus 24 ~i~~~l~~~g~~~~~~~T~---~~g~a~~~a~~~~~~~~d~vv~~GGDG 69 (305) T PRK13337 24 DVLQKLEQAGYETSAHATT---GPGDATLAARQAAERNFDLVIAAGGDG 69 (305) T ss_pred HHHHHHHHCCCEEEEEEEC---CHHHHHHHHHHHHHCCCCEEEEEECCH T ss_conf 9999999879969999827---802899999998777999899995762 No 285 >PRK11914 diacylglycerol kinase; Reviewed Probab=32.75 E-value=31 Score=14.08 Aligned_cols=61 Identities=18% Similarity=0.156 Sum_probs=30.5 Q ss_pred CCCCEEEEEECCCH---------HHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCC Q ss_conf 79977999957988---------99999730269728999859988614466410000123446058985154 Q gi|254780942|r 45 AGIENVALVLGYGA---------EEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDV 108 (442) Q Consensus 45 ~g~~~ivvv~~~~~---------e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~ 108 (442) -..+++.++++..+ +++.+.+...+.+++...-+ ..++|...+.+......|-++++-||- T Consensus 4 ~~m~kv~vIvNP~aG~g~~~~~~~~i~~~l~~~g~~~~~~~t~---~~~~a~~la~~a~~~g~d~vv~~GGDG 73 (304) T PRK11914 4 HEIGKVTVLTNPLSGHGAAPHAAERAIARLHHRGVDVVEIVGT---DAHHARHLVAAALAKGTDALVVVGGDG 73 (304) T ss_pred CCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC---CHHHHHHHHHHHHHCCCCEEEEEECCH T ss_conf 4586699999977799856889999999999879909999327---878999999988864996999995625 No 286 >PRK04015 chromatin protein; Provisional Probab=32.36 E-value=32 Score=14.04 Aligned_cols=30 Identities=17% Similarity=0.323 Sum_probs=23.1 Q ss_pred EEEECCEEHHHHHHHHHHHC--CCCEEEEEEC Q ss_conf 12588485799999999977--9977999957 Q gi|254780942|r 26 LQKIAGKPMISHVMETIAAA--GIENVALVLG 55 (442) Q Consensus 26 Llpi~gkpli~~~i~~l~~~--g~~~ivvv~~ 55 (442) .+=|+.||++.|++..+..+ |.+++.+-.. T Consensus 6 ~I~IG~KP~m~YV~avvtqf~~g~~eV~iKAR 37 (91) T PRK04015 6 TVLVGKKPVMNYVLAVVTQFNEGAKEVVIKAR 37 (91) T ss_pred EEEECCCCHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 89991885799999999998779977999961 No 287 >PRK09946 hypothetical protein; Provisional Probab=32.19 E-value=32 Score=14.02 Aligned_cols=12 Identities=25% Similarity=0.221 Sum_probs=7.3 Q ss_pred HHHHHHHHHHHC Q ss_conf 114899997307 Q gi|254780942|r 202 TDIIEKARLDGK 213 (442) Q Consensus 202 tDii~~l~~~g~ 213 (442) .|+-..|+++|. T Consensus 32 ~~vh~aL~~~g~ 43 (270) T PRK09946 32 ADIHHALIEEGY 43 (270) T ss_pred HHHHHHHHHCCC T ss_conf 899999997431 No 288 >TIGR01307 pgm_bpd_ind 2,3-bisphosphoglycerate-independent phosphoglycerate mutase; InterPro: IPR005995 This 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (iPGAM) is a metalloenzyme found particularly in eubacteria and higher plants. It is distantly related to archaeal iPGAM (IPR004456 from INTERPRO) and distinct from the unrelated cofactor-dependent PGAM (PIRSF001490 from PIRSF). Activity has been demonstrated for proteins from a variety of organisms, including Pseudomonas syringae pv. tomato , Bacillus subtilis , Bacillus stearothermophilus , maize , castor bean , and Trypanosoma brucei . The structure of the B. stearothermophilus enzyme (PDB:1EJJ) has two domains . Residues 1-76 and 311-511 form the phosphatase domain, containing the active site residue and two metal-binding sites. This domain is similar to alkaline phosphatase (PDB:1ALK) and arylsulphatase (PDB:1AUK), which are members of the SCOP alkaline phosphatase-like superfamily, but there is meagre sequence similarity outside of the metal-binding segments. Residues 77-310 form the phosphotransferase domain, which is poorly conserved (or perhaps unrelated) in the archaeal enzymes.; GO: 0004619 phosphoglycerate mutase activity, 0006007 glucose catabolic process. Probab=31.20 E-value=24 Score=14.75 Aligned_cols=40 Identities=20% Similarity=0.295 Sum_probs=21.6 Q ss_pred CCEEEEECCCCCCCCCCC-CCCCEEEECCEEHHHHHHHHHHH Q ss_conf 408999458877232788-87441258848579999999997 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSS-SSKVLQKIAGKPMISHVMETIAA 44 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~-~pKpLlpi~gkpli~~~i~~l~~ 44 (442) |++-||| -|.|-|=.+. .-.=.+-=+.+|+++-.++...+ T Consensus 2 k~vLvIl-DG~G~~~~~srk~~nAi~~A~tP~~D~l~~~~pk 42 (529) T TIGR01307 2 KVVLVIL-DGWGYREDDSRKDGNAIFAAKTPVMDELIAAYPK 42 (529) T ss_pred CEEEEEE-CCCCCCCCCCCCHHHHHHHCCCCHHHHHHHCCCC T ss_conf 4689997-6724788633101479871787757899860775 No 289 >pfam07355 GRDB Glycine/sarcosine/betaine reductase selenoprotein B (GRDB). This family represents a conserved region approximately 350 residues long within the selenoprotein B component of the bacterial glycine, sarcosine and betaine reductase complexes. Probab=30.77 E-value=21 Score=15.03 Aligned_cols=73 Identities=12% Similarity=0.147 Sum_probs=37.4 Q ss_pred CCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEE--EEEECCCCCCCCC--CCCCCCCCCCC-CCCCCCEEEECCCCCCC Q ss_conf 6972899985998861446641000012344605--8985154111533--22000000135-86100100101221574 Q gi|254780942|r 68 PTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDV--IIMYGDVPLVSSH--TLKKAMDKIAQ-GYSIAVVGFNADNPKGY 142 (442) Q Consensus 68 ~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~--lVl~GD~~li~~~--~l~~l~~~~~~-~~~~~i~~~~~~dp~~y 142 (442) ...++....++.+-|.+-++..++ .+. ..+ -|++||.||-... ..+.+++..++ +.|+. ++-+.=++-|| T Consensus 21 ekAd~~pe~~eg~vGpg~~l~~~l---~~~-~eIv~TiICGDnYf~en~dea~~~il~mv~~~~pDlf-iAGPAFnAGRY 95 (349) T pfam07355 21 EKADIPPEVREGLVGPGLQLNGLL---KDE-AEVVHTVICGDSYFNENIEEAVAEILEMLKEEKPDLF-IAGPAFNAGRY 95 (349) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHH---CCC-CEEEEEEEECCHHHHCCHHHHHHHHHHHHHHCCCCEE-EECCCCCCCCH T ss_conf 017898703448888699999984---678-7899999968624440999999999999984299989-87663256425 Q ss_pred CEE Q ss_conf 404 Q gi|254780942|r 143 GRL 145 (442) Q Consensus 143 GvV 145 (442) |+. T Consensus 96 GvA 98 (349) T pfam07355 96 GVA 98 (349) T ss_pred HHH T ss_conf 888 No 290 >cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm Probab=29.61 E-value=35 Score=13.77 Aligned_cols=92 Identities=15% Similarity=0.157 Sum_probs=48.7 Q ss_pred EEEECCE-EHHHHHHHHHHHCC--CCEEEEEECCCH----HHHHHHHCCCCCEEEEEECCCCCC--HHHHHHHCCCCCCC Q ss_conf 1258848-57999999999779--977999957988----999997302697289998599886--14466410000123 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAG--IENVALVLGYGA----EEITRINFPPTLSVEYYIQDCQQG--TAHAVLTAQDAIKP 96 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g--~~~ivvv~~~~~----e~i~~~~~~~~~~i~~v~q~~~~G--Ta~Ai~~a~~~l~~ 96 (442) .+|..|. ..|...|+.+.+.. ..||+|+=.--. +.++++.......+.++.++. .| -+.|.-.+...-+ T Consensus 2 IIptyN~~~~L~~~L~Sl~~Qt~~~~EIIvvDd~StD~t~~~~~~~~~~~~~~i~~~~~~~-~g~~~~~arN~gi~~a~- 79 (182) T cd06420 2 IITTYNRPEALELVLKSVLNQSILPFEVIIADDGSTEETKELIEEFKSQFPIPIKHVWQED-EGFRKAKIRNKAIAAAK- 79 (182) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEECC-CCHHHHHHHHHHHHHCC- T ss_conf 9987798899999999998379999589999799961489999999851887410001005-67268999999998558- Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCC Q ss_conf 4460589851541115332200000 Q gi|254780942|r 97 GYDDVIIMYGDVPLVSSHTLKKAMD 121 (442) Q Consensus 97 ~~~~~lVl~GD~~li~~~~l~~l~~ 121 (442) .+-++.+.+|. ++.+.-++..++ T Consensus 80 -g~~i~flD~D~-~~~~~~i~~~~~ 102 (182) T cd06420 80 -GDYLIFIDGDC-IPHPDFIADHIE 102 (182) T ss_pred -CCEEEEECCCC-CCCCHHHHHHHH T ss_conf -87288878983-279109999999 No 291 >PRK05782 precorrin-8X methylmutase; Validated Probab=28.66 E-value=32 Score=14.02 Aligned_cols=50 Identities=18% Similarity=0.346 Sum_probs=33.8 Q ss_pred CCEEEEECCCCCCCCCCC------------CCC---CEEEE---CCEEHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 408999458877232788------------874---41258---84857999999999779977999957 Q gi|254780942|r 4 KRLAIVLAAGRGHRMKSS------------SSK---VLQKI---AGKPMISHVMETIAAAGIENVALVLG 55 (442) Q Consensus 4 ~~~AiILAaG~GtRl~p~------------~pK---pLlpi---~gkpli~~~i~~l~~~g~~~ivvv~~ 55 (442) ..++|||- |.|||- |. .-| |..+- ..+|.+.-.|+.+.+.|.++|+++.- T Consensus 2 ~~t~illv-~HGSr~-~~an~~l~~la~~~~e~lG~~~~~~y~e~a~P~l~~gl~~l~~~g~~~ivv~p~ 69 (332) T PRK05782 2 SNTAVILV-NHGSRR-NTFNEDMENLAAYLEEKLGVPAYLTYNEYAEPNWRSLLNDLLREGYDDVVIALA 69 (332) T ss_pred CCEEEEEE-ECCCCC-CCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEEE T ss_conf 84089998-368787-516689999999999744984686330026997799999999816786799985 No 292 >COG1105 FruK Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) [Carbohydrate transport and metabolism] Probab=28.28 E-value=37 Score=13.63 Aligned_cols=44 Identities=18% Similarity=0.238 Sum_probs=20.1 Q ss_pred ECCCCCC---CCCC---CCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEE Q ss_conf 4588772---3278---8874412588485799999999977997799995 Q gi|254780942|r 10 LAAGRGH---RMKS---SSSKVLQKIAGKPMISHVMETIAAAGIENVALVL 54 (442) Q Consensus 10 LAaG~Gt---Rl~p---~~pKpLlpi~gkpli~~~i~~l~~~g~~~ivvv~ 54 (442) -|||+|- |.-. ..++.+- +.|...=++..+.|.+.|+..-++-+ T Consensus 34 ~aGGKGINVa~vL~~lG~~~~a~G-flGg~tg~~~~~~l~~~gi~~~fv~v 83 (310) T COG1105 34 TAGGKGINVARVLKDLGIPVTALG-FLGGFTGEFFVALLKDEGIPDAFVEV 83 (310) T ss_pred CCCCCCEEHHHHHHHCCCCCEEEE-ECCCCCHHHHHHHHHHCCCCCEEEEC T ss_conf 478873559999997299716999-66886578999998763898307990 No 293 >PRK10063 predicted glycosyl transferase; Provisional Probab=27.68 E-value=38 Score=13.57 Aligned_cols=73 Identities=11% Similarity=0.069 Sum_probs=37.3 Q ss_pred CEEHHHHHHHHHHHC----CC-CEEEEEECCC---HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEE Q ss_conf 485799999999977----99-7799995798---899999730269728999859988614466410000123446058 Q gi|254780942|r 31 GKPMISHVMETIAAA----GI-ENVALVLGYG---AEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVI 102 (442) Q Consensus 31 gkpli~~~i~~l~~~----g~-~~ivvv~~~~---~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~l 102 (442) +.+-|..+++.+.+. .. -|.+|+=|-= ..++.+.. ....++.++.|+ ..|.-+|+-.+...-.+ +-+. T Consensus 12 ~~~~l~~Tl~Sv~~~~~~~~~~~E~IIIDGgStDgT~~ii~~~-~~~~~~~~iSEp-D~GIYdAmNKGI~~A~G--d~I~ 87 (248) T PRK10063 12 NLEGIVKTHASLRHLAQDPGISFEWIVVDGGSNDGTREYLENL-NGIFNLRFVSEP-DNGIYDAMNKGIAMAQG--KFAL 87 (248) T ss_pred CHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHC-CCCCCEEEEEEC-CCCCHHHHHHHHHHCCC--CEEE T ss_conf 9899999999999986187898699999793961179999970-666667999938-98808898851987788--8899 Q ss_pred EEECC Q ss_conf 98515 Q gi|254780942|r 103 IMYGD 107 (442) Q Consensus 103 Vl~GD 107 (442) .+|+| T Consensus 88 FLNsd 92 (248) T PRK10063 88 FLNSG 92 (248) T ss_pred EECCC T ss_conf 98388 No 294 >TIGR03568 NeuC_NnaA UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing. This family of enzymes catalyzes the combined epimerization and UDP-hydrolysis of UDP-N-acetylglucosamine to N-acetylmannosamine. This is in contrast to the related enzyme WecB (TIGR00236) which retains the UDP moiety. NeuC acts in concert with NeuA and NeuB to synthesize CMP-N5-acetyl-neuraminate. Probab=27.41 E-value=39 Score=13.54 Aligned_cols=71 Identities=20% Similarity=0.351 Sum_probs=35.1 Q ss_pred HHHHHHHC-CCCEEEEEECCC-HH-------HHHHHHCCCCCEEEEEE-CCCCCC----HHHHHHHCCCCCCCCCCEEEE Q ss_conf 99999977-997799995798-89-------99997302697289998-599886----144664100001234460589 Q gi|254780942|r 38 VMETIAAA-GIENVALVLGYG-AE-------EITRINFPPTLSVEYYI-QDCQQG----TAHAVLTAQDAIKPGYDDVII 103 (442) Q Consensus 38 ~i~~l~~~-g~~~ivvv~~~~-~e-------~i~~~~~~~~~~i~~v~-q~~~~G----Ta~Ai~~a~~~l~~~~~~~lV 103 (442) ++..|++. .++-.++++|.+ .+ ++..........+.+.. .+...+ ||..+....+.+.....|+++ T Consensus 19 li~~l~~~~~~~~~li~TGqH~~~~~g~t~~~i~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kPD~Vl 98 (365) T TIGR03568 19 LLKALQDDPDLELQLIVTGMHLSPEYGNTVNEIEKDGFDIDEKIEILLDSDSNAGMAKSMGLTIIGFSDAFERLKPDLVV 98 (365) T ss_pred HHHHHHHCCCCCEEEEEECCCCCHHHCCHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 99999728998889999077784110708999997579876557654568985339999999999999998543998999 Q ss_pred EECCC Q ss_conf 85154 Q gi|254780942|r 104 MYGDV 108 (442) Q Consensus 104 l~GD~ 108 (442) ++||+ T Consensus 99 V~GDt 103 (365) T TIGR03568 99 VLGDR 103 (365) T ss_pred EECCC T ss_conf 94898 No 295 >pfam07997 DUF1694 Protein of unknown function (DUF1694). This family contains many hypothetical proteins. Probab=26.87 E-value=23 Score=14.85 Aligned_cols=90 Identities=9% Similarity=0.038 Sum_probs=34.1 Q ss_pred CHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCCEEEE Q ss_conf 8899999730269728999859988614466410000123446058985154111533220000001-358610010010 Q gi|254780942|r 57 GAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSHTLKKAMDKI-AQGYSIAVVGFN 135 (442) Q Consensus 57 ~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~-~~~~~~~i~~~~ 135 (442) +.|+=++|++.....|....-.++....+......+.++...+..|.+||.. +...+...+... +.+...++..-. T Consensus 21 kPDEqr~yLGtfrERV~lalt~~qv~~~~~~~~~~~~l~~~~~~~l~ing~l---~~~~~~~YiklA~~~~i~fTiV~~~ 97 (120) T pfam07997 21 KPDEQRKYLGTFRERVVLALTIKQVLDPKVYKEFEQELKDYPNYTLLLNGNL---DYDIQSQYIKLAKKANVPFTIVNDP 97 (120) T ss_pred CHHHHHHHCCCHHHHHHHEEEHHHHHCHHHHHHHHHHHHHCCCCEEEECCCC---CHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 9889988455177686522409998173179999999864888689985899---8789999999999859986995788 Q ss_pred CCCCCCCCEEEECCC Q ss_conf 122157440420233 Q gi|254780942|r 136 ADNPKGYGRLLIKNN 150 (442) Q Consensus 136 ~~dp~~yGvV~~d~~ 150 (442) .. .+.||+|...+. T Consensus 98 ~~-~~~~glVv~s~~ 111 (120) T pfam07997 98 EA-ETPFGLVLAADH 111 (120) T ss_pred CC-CCCEEEEEECCC T ss_conf 66-798489997486 No 296 >TIGR00285 TIGR00285 DNA-binding protein Alba; InterPro: IPR013795 The DNA/RNA-binding protein Alba binds double-stranded DNA tightly but without sequence specificity. It binds rRNA and mRNA in vivo, and may play a role in maintaining the structural and functional stability of RNA, and, perhaps, ribosomes. It is distributed uniformly and abundantly on the chromosome. Alba has been shown to bind DNA and affect DNA supercoiling in a temperature dependent manner . It is regulated by acetylation (alba = acetylation lowers binding affinity) by the Sir2 protein. Alba is proposed to play a role in establishment or maintenance of chromatin architecture and thereby in transcription repression. For further information see .. Probab=25.76 E-value=20 Score=15.16 Aligned_cols=29 Identities=21% Similarity=0.392 Sum_probs=22.6 Q ss_pred EEECCEEHHHHHHHHHHHC--CCCEEEEEEC Q ss_conf 2588485799999999977--9977999957 Q gi|254780942|r 27 QKIAGKPMISHVMETIAAA--GIENVALVLG 55 (442) Q Consensus 27 lpi~gkpli~~~i~~l~~~--g~~~ivvv~~ 55 (442) +=+++||.+.|+|.-|-+. |-+|+++=.. T Consensus 4 V~~G~KPVMNYVlAvlTQl~~G~~EV~iKAR 34 (88) T TIGR00285 4 VYIGNKPVMNYVLAVLTQLNSGADEVIIKAR 34 (88) T ss_pred EEECCCCCHHHEEEEEECCCCCCCEEEEEEC T ss_conf 8853877213102310004699857999535 No 297 >COG1099 Predicted metal-dependent hydrolases with the TIM-barrel fold [General function prediction only] Probab=25.13 E-value=43 Score=13.30 Aligned_cols=13 Identities=38% Similarity=0.662 Sum_probs=5.2 Q ss_pred HHHHHHCCCCEEE Q ss_conf 9999977997799 Q gi|254780942|r 39 METIAAAGIENVA 51 (442) Q Consensus 39 i~~l~~~g~~~iv 51 (442) ++.++.+|+..++ T Consensus 17 lekMa~sGI~~Vi 29 (254) T COG1099 17 LEKMALSGIREVI 29 (254) T ss_pred HHHHHHHCHHHHH T ss_conf 9999983816664 No 298 >pfam02776 TPP_enzyme_N Thiamine pyrophosphate enzyme, N-terminal TPP binding domain. Probab=24.64 E-value=43 Score=13.24 Aligned_cols=97 Identities=13% Similarity=0.151 Sum_probs=48.8 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCC Q ss_conf 99999999977997799995798899999730269728999859988614466410000123446058985154111533 Q gi|254780942|r 35 ISHVMETIAAAGIENVALVLGYGAEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIKPGYDDVIIMYGDVPLVSSH 114 (442) Q Consensus 35 i~~~i~~l~~~g~~~ivvv~~~~~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~~~~~~~lVl~GD~~li~~~ 114 (442) =+++++.|++.|++.++-+.|.....+.+.+.+.. .++++.-....+.+. ...+.-.+.+...-+++.+|=- .. . T Consensus 4 ~~~i~~~L~~~Gv~~vFg~pG~~~~~l~~al~~~~-~i~~i~~~hE~~A~~-~A~gyar~tg~~gv~~~t~GpG--~~-n 78 (172) T pfam02776 4 ADALAEALKALGVDHVFGVPGGSILPLLDALAKSP-GIRYVLVRHEQGAGF-AADGYARATGKPGVVLVTSGPG--AT-N 78 (172) T ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCC-CCEEEEECCHHHHHH-HHHHHHHHHCCCEEEEEECCCH--HH-H T ss_conf 99999999987999999968977799999998679-976998687799999-9999999869987999835806--88-7 Q ss_pred CCCCCCCCCCCCCCCCCEEEEC Q ss_conf 2200000013586100100101 Q gi|254780942|r 115 TLKKAMDKIAQGYSIAVVGFNA 136 (442) Q Consensus 115 ~l~~l~~~~~~~~~~~i~~~~~ 136 (442) .+..+.+.+..+..+.+++-.. T Consensus 79 ~~~gl~~A~~~~~Pvl~i~g~~ 100 (172) T pfam02776 79 ALTGLANAYVDSIPVLVISGQV 100 (172) T ss_pred HHHHHHHHHHCCCCEEEEECCC T ss_conf 9999999987599849993688 No 299 >COG3315 O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=24.48 E-value=42 Score=13.31 Aligned_cols=12 Identities=8% Similarity=-0.031 Sum_probs=4.1 Q ss_pred ECCHHHHHHHHH Q ss_conf 125014677753 Q gi|254780942|r 178 IDGLYIMDWLLQ 189 (442) Q Consensus 178 f~~~~L~~~l~~ 189 (442) |+.+...++|.. T Consensus 183 L~~~~v~~ll~~ 194 (297) T COG3315 183 LPEEAVDRLLSR 194 (297) T ss_pred CCHHHHHHHHHH T ss_conf 998999999999 No 300 >cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans. UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins. Probab=24.01 E-value=45 Score=13.17 Aligned_cols=98 Identities=12% Similarity=0.127 Sum_probs=56.1 Q ss_pred EEECCEE--HHHHHHHHHHHCC----CCEEEEEECC--C---HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCCCCC Q ss_conf 2588485--7999999999779----9779999579--8---89999973026972899985998861446641000012 Q gi|254780942|r 27 QKIAGKP--MISHVMETIAAAG----IENVALVLGY--G---AEEITRINFPPTLSVEYYIQDCQQGTAHAVLTAQDAIK 95 (442) Q Consensus 27 lpi~gkp--li~~~i~~l~~~g----~~~ivvv~~~--~---~e~i~~~~~~~~~~i~~v~q~~~~GTa~Ai~~a~~~l~ 95 (442) +++.|.. .|-+++....+-- ++||++|=.. . .++..++.......++++..++.+|-..|=..+.+.-. T Consensus 4 I~f~NE~~s~LlRtv~Svl~rTP~~LL~EIILVDD~S~~~~l~~~l~~~~~~~~~~Vkvir~~~r~GlirAR~~Ga~~A~ 83 (299) T cd02510 4 IIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSDKPELKLLLEEYYKKYLPKVKVLRLKKREGLIRARIAGARAAT 83 (299) T ss_pred EEEECCHHHHHHHHHHHHHHCCCHHHCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHC T ss_conf 99947808889998999986399889368999967999524312479999861894599987765324999877688734 Q ss_pred CCCCEEE-EEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3446058-98515411153322000000135861 Q gi|254780942|r 96 PGYDDVI-IMYGDVPLVSSHTLKKAMDKIAQGYS 128 (442) Q Consensus 96 ~~~~~~l-Vl~GD~~li~~~~l~~l~~~~~~~~~ 128 (442) +++| .+.++. -.+..=|+.|++...++.. T Consensus 84 ---G~vlvFLDsH~-e~~~gWLePLL~~I~en~~ 113 (299) T cd02510 84 ---GDVLVFLDSHC-EVNVGWLEPLLARIAENRK 113 (299) T ss_pred ---CCEEEEECCCE-EECCCHHHHHHHHHHHCCC T ss_conf ---86699936872-3146778999999872998 No 301 >cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified. Probab=23.49 E-value=46 Score=13.11 Aligned_cols=100 Identities=10% Similarity=0.105 Sum_probs=55.5 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC---EEEEEE--CC-CHHHH-HHHHCC-CCCEEEEEEC----CCCCCHHHHHHHCCC Q ss_conf 1258848-5799999999977997---799995--79-88999-997302-6972899985----998861446641000 Q gi|254780942|r 26 LQKIAGK-PMISHVMETIAAAGIE---NVALVL--GY-GAEEI-TRINFP-PTLSVEYYIQ----DCQQGTAHAVLTAQD 92 (442) Q Consensus 26 Llpi~gk-pli~~~i~~l~~~g~~---~ivvv~--~~-~~e~i-~~~~~~-~~~~i~~v~q----~~~~GTa~Ai~~a~~ 92 (442) .+|+.|. .-|...|+.+.+.... |++||= +. ...++ +++... ....+.+..+ +.++|.+.|.-.+.+ T Consensus 2 IiP~YN~~~~l~~~l~Svl~Qty~~~~EiIiVDDgStD~t~~ii~~~~~~~~~~~~~~~~~~~~~~~~~G~~~arN~gi~ 81 (219) T cd06913 2 ILPVHNGEQWLDECLESVLQQDFEGTLELSVFNDASTDKSAEIIEKWRKKLEDSGVIVLVGSHNSPSPKGVGYAKNQAIA 81 (219) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHH T ss_conf 88678988999999999984889998999999899981569999999986667776999823677664678999999999 Q ss_pred CCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 012344605898515411153322000000135861 Q gi|254780942|r 93 AIKPGYDDVIIMYGDVPLVSSHTLKKAMDKIAQGYS 128 (442) Q Consensus 93 ~l~~~~~~~lVl~GD~~li~~~~l~~l~~~~~~~~~ 128 (442) .-.+ +-+..+.+|- ++.+.-++..++...+... T Consensus 82 ~A~G--eyI~flDsDD-~~~p~~l~~~~~~~~~~~~ 114 (219) T cd06913 82 QSSG--RYLCFLDSDD-VMMPQRIRLQYEAALQHPN 114 (219) T ss_pred HCCC--CEEEEECCCC-CCCHHHHHHHHHHHHHCCC T ss_conf 7788--8788618976-4276599999999985999 No 302 >PRK10310 galactitol-specific PTS system component IIB; Provisional Probab=21.78 E-value=50 Score=12.91 Aligned_cols=28 Identities=25% Similarity=0.344 Sum_probs=18.1 Q ss_pred CEEEEEECCC-------HHHHHHHHCCCCCEEEEE Q ss_conf 7799995798-------899999730269728999 Q gi|254780942|r 48 ENVALVLGYG-------AEEITRINFPPTLSVEYY 75 (442) Q Consensus 48 ~~ivvv~~~~-------~e~i~~~~~~~~~~i~~v 75 (442) ++|+|.||-. .+.+++++.+.+.++++. T Consensus 3 KkIlVACGsGIATSTvva~kv~~~~~e~gi~v~i~ 37 (94) T PRK10310 3 RKIIVACGGAVATSTMAAEEIKELCQSHNIPVELI 37 (94) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 55999858837599999999999999859806899 No 303 >cd03413 CbiK_C Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), C-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases, and is a homomeric enzyme that does not require ATP for its enzymatic activity. Probab=20.79 E-value=52 Score=12.79 Aligned_cols=23 Identities=35% Similarity=0.632 Sum_probs=10.5 Q ss_pred CCEEHHHHHHHHHHHCCCCEEEE Q ss_conf 84857999999999779977999 Q gi|254780942|r 30 AGKPMISHVMETIAAAGIENVAL 52 (442) Q Consensus 30 ~gkpli~~~i~~l~~~g~~~ivv 52 (442) -|.|-++.+++.|++.|++++.+ T Consensus 39 Eg~P~~e~vl~~L~~~g~k~V~L 61 (103) T cd03413 39 EGYPGLDDVLAKLKKAGIKKVTL 61 (103) T ss_pred CCCCCHHHHHHHHHHCCCCEEEE T ss_conf 78589999999999769986999 No 304 >TIGR01980 sufB FeS assembly protein SufB; InterPro: IPR010231 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufB, which is part of the SUF system and forms a complex with SufBCD.; GO: 0005515 protein binding, 0016226 iron-sulfur cluster assembly. Probab=20.63 E-value=53 Score=12.77 Aligned_cols=34 Identities=24% Similarity=0.297 Sum_probs=21.2 Q ss_pred EECCCCEECCCCEEECC-EEECCCCEECCCCEECCC Q ss_conf 98788798869989378-398899899778357664 Q gi|254780942|r 381 HINENAFIGSNSSLIAP-ITIGQGTYVASGSIITQD 415 (442) Q Consensus 381 ~ig~~~~iG~~~~i~~g-v~ig~~~~i~ag~~v~~d 415 (442) +|-.-.--|.+.+..-| |+|+++|. +|-+-|..| T Consensus 355 I~sKSIS~ggG~~~YRGLVk~~~~A~-~~k~~v~CD 389 (469) T TIGR01980 355 IISKSISKGGGKSSYRGLVKIGKGAK-GAKSHVQCD 389 (469) T ss_pred EEEEEEECCCCEEEEEEEEEECCCCC-CCCCEEEEE T ss_conf 89878875887443025788657655-771112566 No 305 >TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679 DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC). This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair. Probab=20.26 E-value=54 Score=12.73 Aligned_cols=22 Identities=18% Similarity=0.138 Sum_probs=12.2 Q ss_pred CEEEEEEEEECCCCCEECCCCC Q ss_conf 2799411233178410012232 Q gi|254780942|r 261 TVFLSHDTIIQPDTVIEPHVFF 282 (442) Q Consensus 261 ~~~i~~~~~I~~~~~I~~~~~I 282 (442) |...+-.+.||....|-|=+.+ T Consensus 328 T~L~dv~~qVGRTG~iTPvA~L 349 (706) T TIGR00575 328 TKLLDVVVQVGRTGAITPVAKL 349 (706) T ss_pred EEEEEEEEEECCEEEECCEEEE T ss_conf 0898789984656776003675 Done!